TensorRT-LLMs/tests/unittest/_torch/auto_deploy/unit
Chenghao Zhang ddf2d010e2
[TRTLLM-8814][feat] AutoDeploy: Use TRTLLM kernels for FP8 linear (#8820)
Signed-off-by: Chenghao Zhang <211069071+nvchenghaoz@users.noreply.github.com>
Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>
Signed-off-by: nvchenghaoz <211069071+nvchenghaoz@users.noreply.github.com>
Co-authored-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>
2025-11-06 11:00:10 -08:00
..
multigpu [TRTLLM-8201][feat] Nemotron H MoE Sharding (#8744) 2025-11-05 12:35:29 -08:00
singlegpu [TRTLLM-8814][feat] AutoDeploy: Use TRTLLM kernels for FP8 linear (#8820) 2025-11-06 11:00:10 -08:00