TensorRT-LLMs/tensorrt_llm/_torch/auto_deploy
yuxianq 7b03350527
Add thread leak check and fix thread/memory leak issues. (#3270)
Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
2025-04-08 19:03:18 +08:00
..
compile perf: [AutoDeploy] Enable AutoDeploy as a backend in trtllm-bench (#3041) 2025-03-26 14:33:14 -07:00
custom_ops feat: Apply the new torch-flow compatible AutoTuner to both Fused MoE and NVFP4 Linear operators. (#3151) 2025-04-08 14:28:36 +08:00
distributed Add thread leak check and fix thread/memory leak issues. (#3270) 2025-04-08 19:03:18 +08:00
models Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
shim Add thread leak check and fix thread/memory leak issues. (#3270) 2025-04-08 19:03:18 +08:00
transformations chore: remove usernames from comments (#3291) 2025-04-05 13:44:28 +08:00
utils Refactor imports inside tensorrt_llm._torch. (#3015) 2025-03-26 11:01:07 +08:00
__init__.py Update TensorRT-LLM (#2820) 2025-02-25 21:21:49 +08:00