TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-22 03:35:00 +08:00

JunyiXu-nv b210f22c7e [https://nvbugs/5703953][fix] Preserving ip:port for trtllm-serve before initializing llm (#9646) Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>	2025-12-06 20:13:48 -08:00
__init__.py	Update TensorRT-LLM (#613)	2023-12-08 17:49:24 +08:00
bench.py	[#9463][feat] Add revision option to trtllm commands (#9498)	2025-11-27 09:30:01 +08:00
build.py	[TRTLLM-8684][chore] Migrate BuildConfig to Pydantic, add a Python wrapper for KVCacheType enum (#8330)	2025-10-28 09:17:26 -07:00
eval.py	[None][chore] Fix trtllm-eval and move GroupedGemmInputsHelper (#9612)	2025-12-03 07:55:03 +08:00
prune.py	Update TensorRT-LLM (#2008)	2024-07-23 23:05:09 +08:00
refit.py	[TRTLLM-8684][chore] Migrate BuildConfig to Pydantic, add a Python wrapper for KVCacheType enum (#8330)	2025-10-28 09:17:26 -07:00
serve.py	[https://nvbugs/5703953][fix] Preserving ip:port for trtllm-serve before initializing llm (#9646)	2025-12-06 20:13:48 -08:00