TensorRT-LLMs/tensorrt_llm/_torch/auto_deploy
2025-04-03 11:08:12 +08:00
..
compile perf: [AutoDeploy] Enable AutoDeploy as a backend in trtllm-bench (#3041) 2025-03-26 14:33:14 -07:00
custom_ops perf: [AutoDeploy] Enable AutoDeploy as a backend in trtllm-bench (#3041) 2025-03-26 14:33:14 -07:00
distributed Refactor imports inside tensorrt_llm._torch. (#3015) 2025-03-26 11:01:07 +08:00
models Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
shim fix: Fix an error related to dummy request when MTP is used (#3146) 2025-04-03 11:08:12 +08:00
transformations perf: [AutoDeploy] Enable AutoDeploy as a backend in trtllm-bench (#3041) 2025-03-26 14:33:14 -07:00
utils Refactor imports inside tensorrt_llm._torch. (#3015) 2025-03-26 11:01:07 +08:00
__init__.py Update TensorRT-LLM (#2820) 2025-02-25 21:21:49 +08:00