Pod troubleshooting
Decision trees for the most common pod failures: Pending, CrashLoopBackOff, ImagePullBackOff, OOMKilled, and stuck terminating.
Workload Identity troubleshooting
Decision tree for diagnosing Azure Workload Identity authentication failures in AKS pods.
Network troubleshooting
Decision trees for AKS networking failures: service not reachable, ingress broken, DNS failures, egress blocked, and private cluster access issues.
Cluster troubleshooting
Decision trees for cluster-level AKS issues: node NotReady, upgrade failures, API server unreachable, certificate expiry, and quota exhaustion.