mirror of
https://github.com/vllm-project/vllm.git
synced 2026-06-06 00:16:14 +00:00
87e31455b0
Signed-off-by: Rui Wang <raygorous@gmail.com> Co-authored-by: Rui Wang <raygorous@gmail.com>
504 B
504 B
vllm launch render
Overview
vllm launch render starts a GPU-less rendering server for preprocessing and
postprocessing only.
vllm launch render meta-llama/Llama-3.2-1B-Instruct --port 8100
This command reuses the standard serving parser, so model, frontend,
networking, and related CLI options follow the same conventions as
vllm serve.
JSON CLI Arguments
--8<-- "docs/cli/json_tip.inc.md"
Arguments
--8<-- "docs/generated/argparse/launch_render.inc.md"