[https://nvbugs/5569754][chore] Adjust max batch size to prevent OOM (#8876)

Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>
This commit is contained in:
JunyiXu-nv 2025-11-05 01:34:26 +08:00 committed by GitHub
parent cacb8a84f2
commit c329f5f78b
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

View File

@ -51,5 +51,6 @@ srun -l \
trtllm-llmapi-launch python3 $script \
--model_dir $LOCAL_MODEL \
--prompt 'Hello, how are you?' \
--tp_size 2
--tp_size 2 \
--max_batch_size 256
"