Hands-onTroubleshooting

Troubleshooting

Systematic troubleshooting patterns for instance reachability, IAM permission failures, networking misconfiguration, and scaling issues.

Case 1: Instance Not Reachable

Case 2: Permission Issues

Case 3: Networking Issues

Case 4: Scaling Issues

Hands-on Runbook

# Quick diagnostics
aws ec2 describe-instance-status --instance-ids i-1234567890abcdef0
aws autoscaling describe-scaling-activities --auto-scaling-group-name web-asg --max-items 10
aws cloudtrail lookup-events --lookup-attributes AttributeKey=EventName,AttributeValue=AuthorizeSecurityGroupIngress --max-results 5

Interview Questions

Beginner: First check when EC2 is unreachable?
Security group and network path (subnet route + IGW/NAT) plus instance health.
Intermediate: Why can allow policy still fail?
Because explicit deny elsewhere overrides allow.
Scenario: ASG adds instances but traffic still fails. What next?
Check target group health checks, app startup completion, and listener rules.

Summary