mirror of https://github.com/vllm-project/vllm.git synced 2026-06-06 00:16:14 +00:00

Files

T

Paco Xu 7493c51c55 [Docs] add Dynamo/aibrix integration and kubeai/aks link (#32767 )

Signed-off-by: Paco Xu <paco.xu@daocloud.io>

2026-03-05 17:39:50 +08:00

621 B

Raw Permalink Blame History

NVIDIA Dynamo

NVIDIA Dynamo is an open-source framework for distributed LLM inference that can run vLLM on Kubernetes with flexible serving architectures (e.g. aggregated/disaggregated, optional router/planner).

For Kubernetes deployment instructions and examples (including vLLM), see the Deploying Dynamo on Kubernetes guide.

Background reading: InfoQ news coverage — NVIDIA Dynamo simplifies Kubernetes deployment for LLM inference.

621 B Raw Permalink Blame History

NVIDIA Dynamo

621 B

Raw Permalink Blame History