mirror of https://github.com/vllm-project/vllm.git synced 2026-06-06 00:16:14 +00:00

Files

T

Ray Wang 87e31455b0 [Doc] Sync CLI guide with actual help modes and launch subcommand (#40326 )

Signed-off-by: Rui Wang <raygorous@gmail.com>
Co-authored-by: Rui Wang <raygorous@gmail.com>

2026-05-20 02:32:03 -07:00

vllm launch render

Overview

vllm launch render starts a GPU-less rendering server for preprocessing and postprocessing only.

vllm launch render meta-llama/Llama-3.2-1B-Instruct --port 8100

This command reuses the standard serving parser, so model, frontend, networking, and related CLI options follow the same conventions as vllm serve.

--8<-- "docs/cli/json_tip.inc.md"

--8<-- "docs/generated/argparse/launch_render.inc.md"