TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-11 13:33:40 +08:00

History

yuxianq 7b03350527 Add thread leak check and fix thread/memory leak issues. (#3270 ) Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>		2025-04-08 19:03:18 +08:00
..
compile	perf: [AutoDeploy] Enable AutoDeploy as a backend in trtllm-bench (#3041 )	2025-03-26 14:33:14 -07:00
custom_ops	feat: Apply the new torch-flow compatible AutoTuner to both Fused MoE and NVFP4 Linear operators. (#3151 )	2025-04-08 14:28:36 +08:00
distributed	Add thread leak check and fix thread/memory leak issues. (#3270 )	2025-04-08 19:03:18 +08:00
models	Update TensorRT-LLM (#2873 )	2025-03-11 21:13:42 +08:00
shim	Add thread leak check and fix thread/memory leak issues. (#3270 )	2025-04-08 19:03:18 +08:00
transformations	chore: remove usernames from comments (#3291 )	2025-04-05 13:44:28 +08:00
utils	Refactor imports inside tensorrt_llm._torch. (#3015 )	2025-03-26 11:01:07 +08:00
__init__.py	Update TensorRT-LLM (#2820 )	2025-02-25 21:21:49 +08:00