TensorRT-LLMs/benchmarks
Shi Xiaowei d010b2043a
[TRTLLM-7030][fix] BREAKING CHANGE: Mismatch between docs and actual commands (#7191)
Signed-off-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
2025-08-25 20:21:43 +08:00
..
cpp [TRTLLM-7030][fix] BREAKING CHANGE: Mismatch between docs and actual commands (#7191) 2025-08-25 20:21:43 +08:00
README.md chore: Remove deprecated Python runtime benchmark (#4171) 2025-05-14 18:41:05 +08:00

TensorRT-LLM Benchmarks

Overview

There are currently two workflows to benchmark TensorRT-LLM:

  • trtllm-bench
    • trtllm-bench is native to TensorRT-LLM and is a Python benchmarker for reproducing and testing the performance of TensorRT-LLM.
    • NOTE: This benchmarking suite is a current work in progress and is prone to large changes.
  • C++ benchmarks
    • The recommended workflow that uses TensorRT-LLM C++ API and can take advantage of the latest features of TensorRT-LLM.