This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-02-14 06:53:50 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
fdd5bd49fc
TensorRT-LLMs
/
tests
/
unittest
/
_torch
/
auto_deploy
/
unit
History
Eran Geva
5f2a42b3df
[TRTLLM-6142][feat] AutoDeploy: set torch recompile_limit based on cuda_graph_batch_sizes and refactored (
#7219
)
...
Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>
2025-09-08 08:45:58 -04:00
..
multigpu
[TRTLLM-6342][fix] Fixed triggering BMM sharding (
#7389
)
2025-09-04 02:01:27 -04:00
singlegpu
[TRTLLM-6142][feat] AutoDeploy: set torch recompile_limit based on cuda_graph_batch_sizes and refactored (
#7219
)
2025-09-08 08:45:58 -04:00