mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

History

Zero Zeng 16bb76c31d [None][chore] Update benchmark script (#7860 ) Signed-off-by: Zero Zeng <38289304+zerollzeng@users.noreply.github.com> Co-authored-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>		2025-09-23 03:15:42 -07:00
..
process_gen_iterlog.py	[None] [chore] Update wide-ep genonly scripts (#6995 )	2025-08-19 07:44:07 -04:00
README.md	[None] [chore] Update wide-ep genonly scripts (#6995 )	2025-08-19 07:44:07 -04:00
submit_e2e.sh	[None][chore] Update benchmark script (#7860 )	2025-09-23 03:15:42 -07:00
submit_gen_only.sh	[None][chore] Update benchmark script (#7860 )	2025-09-23 03:15:42 -07:00

README.md

TensorRT-LLM Wide-EP Benchmark Scripts

This directory contains scripts for benchmarking TensorRT-LLM wide-ep performance using SLURM job scheduler.

⚠️ DISCLAIMER

These scripts are currently not QA'ed and are provided for demonstration purposes only.

Please note that:

These scripts have not undergone formal quality assurance testing
They are intended for demonstration and educational purposes
Use at your own risk in production environments
Always review and test scripts thoroughly before running in your specific environment

Scripts Overview

Core Scripts

Note that, core implementation of the slurm scripts are included in examples/disaggregated/slurm/benchmark.

submit_e2e.sh - Main entry point for submitting E2E benchmark jobs
submit_gen_only.sh - Main entry point for submitting gen-only benchmark jobs
process_gen_iterlog.py - Processes benchmark results and generates reports

Usage

Prerequisites

Before running the scripts, ensure you have:

Access to a SLURM cluster
Container image with TensorRT-LLM installed
Model files accessible on the cluster
Required environment variables set

Run E2E Benchmarks

# Refer to `examples/disaggregated/slurm/benchmark/`
# Please find the `disaggr_torch.slurm` script in the `examples/disaggregated/slurm/benchmark/` directory.
# Make sure that SLURM parameters are correctly set in `disaggr_torch.slurm` before executing this script.
./submit_e2e.sh

Run gen-only Benchmarks and post-processes the results using `process_gen_iterlog.py`

./submit_gen_only.sh

python3 process_gen_iterlog.py --dir_prefix <path>

process_gen_iterlog.py will be responsible for:

Parses iteration logs from workers
Calculates throughput metrics
Generates CSV reports
Supports MTP (Multi-Token Prediction) analysis