TensorRT-LLMs/examples/disaggregated/slurm/simple_example/gen_extra-llm-api-config.yaml
Shi Xiaowei fe7dda834d
[TRTLLM-7030][fix] Refactor the example doc of dist-serving (#6766)
Signed-off-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
2025-08-13 17:39:27 +08:00

4 lines
70 B
YAML

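# KV-cache transfer settings for the generation server in disaggregated serving.
cache_transceiver_config:
  # Transport used to move KV cache between context and generation servers.
  backend: UCX
  # Capacity of the transfer buffer, in tokens.
  max_tokens_in_buffer: 2048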