[https://nvbugs/5569754][chore] Adjust max batch size to prevent OOM (#8876)

Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>
2026-01-14 06:27:45 +08:00 · 2025-11-05 01:34:26 +08:00 · 2025-11-05 01:34:26 +08:00 · c329f5f78b
commit c329f5f78b
parent cacb8a84f2
1 changed files with 2 additions and 1 deletions
--- a/examples/llm-api/llm_mgmn_llm_distributed.sh
+++ b/examples/llm-api/llm_mgmn_llm_distributed.sh
@ -51,5 +51,6 @@ srun -l \
        trtllm-llmapi-launch python3 $script \
            --model_dir $LOCAL_MODEL \
            --prompt 'Hello, how are you?' \
-            --tp_size 2
+            --tp_size 2 \
+            --max_batch_size 256
    "