GPU Node Pools
Provisioning and managing GPU node pools in AKS for AI/ML workloads
KAITO: AI Model Inference
Deploy LLMs on AKS with one custom resource using the Kubernetes AI Toolchain Operator
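As a sketch of what that single custom resource looks like, here is a minimal KAITO `Workspace` manifest; the instance type and preset name are illustrative examples, not recommendations:

```yaml
# Minimal KAITO Workspace: requests a GPU node and serves a preset model.
apiVersion: kaito.sh/v1alpha1
kind: Workspace
metadata:
  name: workspace-falcon-7b
resource:
  instanceType: "Standard_NC12s_v3"   # example GPU SKU; pick one your subscription has quota for
  labelSelector:
    matchLabels:
      apps: falcon-7b
inference:
  preset:
    name: "falcon-7b-instruct"        # example preset; see KAITO docs for supported models
```

Applying this with `kubectl apply -f` lets the operator provision the GPU node and stand up the inference endpoint, rather than you managing node pools and deployments by hand.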
Model Inference Serving
Production LLM serving on AKS: KAITO, vLLM, TGI, and autoscaling strategies
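For the non-operator path, a model server can be run as an ordinary Deployment. A minimal sketch using vLLM's OpenAI-compatible server image follows; the model name and replica count are placeholders, and resource requests should be tuned to your GPU SKU:

```yaml
# Sketch: vLLM serving a Hugging Face model on one GPU.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: vllm-server
spec:
  replicas: 1
  selector:
    matchLabels:
      app: vllm-server
  template:
    metadata:
      labels:
        app: vllm-server
    spec:
      containers:
        - name: vllm
          image: vllm/vllm-openai:latest
          args: ["--model", "mistralai/Mistral-7B-Instruct-v0.2"]  # example model
          ports:
            - containerPort: 8000     # vLLM's default OpenAI-compatible API port
          resources:
            limits:
              nvidia.com/gpu: 1       # schedule onto a GPU node
```

Fronting this with a Service and scaling on request-level metrics (rather than CPU) is the usual next step for production serving.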
AI/ML Production Guide
Run AI inference workloads on AKS at production scale: GPU node pool strategy, model caching, autoscaling, cost controls, and multi-model serving.