TensorRT-LLMs/tensorrt_llm/bench/benchmark
nv-guomingz 6e48ac25a6
chore: remove cuda_graph_ prefix from cuda_graph_config field members. (#5585)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
2025-06-30 12:23:14 -04:00
utils chore: remove cuda_graph_ prefix from cuda_graph_config field members. (#5585) 2025-06-30 12:23:14 -04:00
__init__.py Update TensorRT-LLM (#2389) 2024-10-29 22:24:38 +08:00
low_latency.py [TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (#5312) 2025-06-20 03:01:10 +08:00
throughput.py Update trtllm-bench to support new Pytorch default. (#5491) 2025-06-26 17:05:43 -07:00