mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

History

Zero Zeng 4545700fcf [None][chore] Move submit.sh to python and use yaml configuration (#8003 ) Signed-off-by: Zero Zeng <38289304+zerollzeng@users.noreply.github.com>		2025-10-20 22:36:50 -04:00
..
process_gen_iterlog.py	[None] [chore] Update wide-ep genonly scripts (#6995 )	2025-08-19 07:44:07 -04:00
README.md	[None][chore] Move submit.sh to python and use yaml configuration (#8003 )	2025-10-20 22:36:50 -04:00

README.md

TensorRT LLM Wide-EP Benchmark Scripts

This directory contains scripts for benchmarking TensorRT LLM wide-ep performance using SLURM job scheduler.

⚠️ DISCLAIMER

These scripts are currently not QA'ed and are provided for demonstration purposes only.

Please note that:

These scripts have not undergone formal quality assurance testing
They are intended for demonstration and educational purposes
Use at your own risk in production environments
Always review and test scripts thoroughly before running in your specific environment

Scripts Overview

Core Scripts

Note that, core implementation of the slurm scripts are included in examples/disaggregated/slurm/benchmark.

process_gen_iterlog.py - Processes benchmark results and generates reports

Usage

Prerequisites

Before running the scripts, ensure you have:

Access to a SLURM cluster
Container image with TensorRT LLM installed
Model files accessible on the cluster
Required environment variables set

Run Benchmarks

# Please find the `submit.py` script and an example `config.yaml` in the `examples/disaggregated/slurm/benchmark/` directory.
python3 submit.py -c your_config.yaml