mirror of
https://github.com/vllm-project/vllm.git
synced 2026-06-06 00:16:14 +00:00
257af77bc2
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by: wang.yuqi <noooop@126.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
---
toc_depth: 3
---
# Engine Arguments
Engine arguments control the behavior of the vLLM engine.
- For offline inference, they are part of the arguments to the [LLM][vllm.LLM] class.
- For online serving, they are part of the arguments to `vllm serve`.
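As a minimal sketch of the two entry points above, the same engine arguments can be written once as Python keyword arguments and once as `vllm serve` flags. The model name and the values are only illustrative, and `to_cli_flags`, `build_serve_command`, and `build_offline_llm` are hypothetical helpers written for this example, not part of vLLM. The sketch assumes the usual mapping from an argument such as `max_model_len` to the flag `--max-model-len`, and it covers only simple string and numeric values.

```python
import shlex

# Engine arguments expressed as Python keyword arguments.
# The values are illustrative, not recommendations.
engine_kwargs = {
    "dtype": "bfloat16",
    "max_model_len": 4096,
    "gpu_memory_utilization": 0.85,
}


def to_cli_flags(kwargs):
    """Hypothetical helper: spell each keyword argument as a dashed CLI flag."""
    flags = []
    for name, value in kwargs.items():
        flags += ["--" + name.replace("_", "-"), str(value)]
    return flags


def build_serve_command(model, kwargs):
    """Hypothetical helper: assemble a `vllm serve` command line (online serving)."""
    return shlex.join(["vllm", "serve", model, *to_cli_flags(kwargs)])


def build_offline_llm(model, kwargs):
    """Hypothetical helper: pass the same arguments to the LLM class (offline inference)."""
    # Imported lazily so the rest of the sketch runs without vLLM installed.
    from vllm import LLM

    return LLM(model=model, **kwargs)


serve_command = build_serve_command("facebook/opt-125m", engine_kwargs)
print(serve_command)
```

Calling `build_offline_llm("facebook/opt-125m", engine_kwargs)` would construct the offline engine with the same settings, but it requires vLLM and suitable hardware, so it is left uncalled here.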
The engine argument classes, [EngineArgs][vllm.engine.arg_utils.EngineArgs] and [AsyncEngineArgs][vllm.engine.arg_utils.AsyncEngineArgs], combine the configuration classes defined in [vllm.config][]. For developer documentation, refer to those configuration classes: they are the source of truth for types, defaults, and docstrings.
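One way to look at the fields and defaults of an argument class is to introspect it with the standard `dataclasses` module. The sketch below assumes the class is a dataclass and demonstrates the idea on `ToyEngineArgs`, a hypothetical stand-in defined here so the example runs without vLLM. `describe_fields` is likewise a helper invented for this example.

```python
from dataclasses import MISSING, dataclass, fields


def describe_fields(cls):
    """Hypothetical helper: return {field name: (type, default)} for a dataclass."""
    described = {}
    for f in fields(cls):
        if f.default is not MISSING:
            default = f.default
        elif f.default_factory is not MISSING:
            default = f.default_factory()
        else:
            default = None  # required field with no default
        described[f.name] = (f.type, default)
    return described


@dataclass
class ToyEngineArgs:
    """Hypothetical stand-in for an engine argument class."""

    model: str = "toy-model"
    max_model_len: int = 2048


summary = describe_fields(ToyEngineArgs)
for name, (field_type, default) in summary.items():
    print(name, field_type, default)

# With vLLM installed, and assuming EngineArgs is a dataclass, the same
# helper could be applied to the real class:
#   from vllm.engine.arg_utils import EngineArgs
#   describe_fields(EngineArgs)
```

Introspection shows names, types, and defaults only. The docstrings for each option live on the configuration classes in `vllm.config`.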
--8<-- "docs/cli/json_tip.inc.md"
## EngineArgs
--8<-- "docs/generated/argparse/engine_args.inc.md"
## AsyncEngineArgs
--8<-- "docs/generated/argparse/async_engine_args.inc.md"