TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

History

Zheng Duan e666a704f5 [None][doc] add visualization of perf metrics in time breakdown tool doc (#8530 ) Signed-off-by: zhengd-nv <200704041+zhengd-nv@users.noreply.github.com>		2025-10-23 22:09:21 -04:00
..
time_breakdown	[None][doc] add visualization of perf metrics in time breakdown tool doc (#8530 )	2025-10-23 22:09:21 -04:00
__init__.py	feat: Make benchmark_serving part of the library (#5428 )	2025-06-25 23:13:56 +08:00
backend_request_func.py	[https://nvbugs/5369366 ] [fix] Report failing requests (#7060 )	2025-09-04 12:56:23 -07:00
benchmark_dataset.py	[None][chroe] Rename TensorRT-LLM to TensorRT LLM for source code. (#7851 )	2025-09-25 21:02:35 +08:00
benchmark_serving.py	[None][feat] Add request timing breakdown option in benchmark_serving (#8128 )	2025-10-10 09:24:54 -07:00
benchmark_utils.py	[feat] support sharegpt downloading in benchmark_serving (#4578 )	2025-05-30 17:27:53 +08:00