TensorRT-LLMs/tensorrt_llm/bench/benchmark
nv-guomingz 578430e64c
[TRTLLM-5530][BREAKING CHANGE]: enhance the llm args pytorch config part 1(cuda_graph_config) (#5014)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
2025-06-30 11:05:40 +08:00
..
utils [TRTLLM-5530][BREAKING CHANGE]: enhance the llm args pytorch config part 1(cuda_graph_config) (#5014) 2025-06-30 11:05:40 +08:00
__init__.py Update TensorRT-LLM (#2389) 2024-10-29 22:24:38 +08:00
low_latency.py [TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (#5312) 2025-06-20 03:01:10 +08:00
throughput.py Update trtllm-bench to support new Pytorch default. (#5491) 2025-06-26 17:05:43 -07:00