TensorRT-LLMs/examples/llm-api/extra-llm-api-config.yml
Guoming Zhang b941d7acbb
[https://nvbugs/5634220][fix] Add developer guide back and fix some i… (#8911)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
2025-11-05 10:17:01 +08:00


cuda_graph_config:
    enable_padding: True
    max_batch_size: 16
moe_config:
    backend: trtllm