keda

Name: Entuit Enterprise Solutions Inc.
Address: Toronto, ON, CA

1 post tagged “keda”

June 7, 2026

Building a Hybrid LLM Platform on EKS, Part 5: Serving Local Models with vLLM and KEDA

Part 5 of our hands-on EKS series. We deploy vLLM model servers on the GPU pool from Part 4, load Qwen2.5-7B model weights from Amazon S3 via an init container, and wire KEDA autoscaling that scales replicas with live queue depth and drives GPU nodes to zero overnight.

eks kubernetes aws-cdk vllm keda gpu autoscaling llm ai-infrastructure typescript