TensorRT-LLMs/tests/unittest/_torch/auto_deploy/unit
Eran Geva 5f2a42b3df
[TRTLLM-6142][feat] AutoDeploy: set torch recompile_limit based on cuda_graph_batch_sizes and refactored (#7219)
Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>
2025-09-08 08:45:58 -04:00
multigpu [TRTLLM-6342][fix] Fixed triggering BMM sharding (#7389) 2025-09-04 02:01:27 -04:00
singlegpu [TRTLLM-6142][feat] AutoDeploy: set torch recompile_limit based on cuda_graph_batch_sizes and refactored (#7219) 2025-09-08 08:45:58 -04:00