mirror of https://github.com/vllm-project/vllm.git synced 2026-06-06 00:16:14 +00:00

Files

T

wang.yuqi 257af77bc2 [Docs] Reorganize online serving docs. (#41907 )

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <noooop@126.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

2026-05-19 14:43:18 +08:00

908 B

Raw Blame History

toc_depth

toc_depth
3

Engine Arguments

Engine arguments control the behavior of the vLLM engine.

For offline inference, they are part of the arguments to [LLM][vllm.LLM] class.
For online serving, they are part of the arguments to vllm serve.

The engine argument classes, [EngineArgs][vllm.engine.arg_utils.EngineArgs] and [AsyncEngineArgs][vllm.engine.arg_utils.AsyncEngineArgs], are a combination of the configuration classes defined in [vllm.config][]. Therefore, if you are interested in developer documentation, we recommend looking at these configuration classes as they are the source of truth for types, defaults and docstrings.

--8<-- "docs/cli/json_tip.inc.md"

`EngineArgs`

--8<-- "docs/generated/argparse/engine_args.inc.md"

`AsyncEngineArgs`

--8<-- "docs/generated/argparse/async_engine_args.inc.md"

908 B Raw Blame History

Engine Arguments

EngineArgs

AsyncEngineArgs

908 B

Raw Blame History

`EngineArgs`

`AsyncEngineArgs`