TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

History

Perkz Zheng 064b67e40c [https://nvbugs/5727952 ][fix] a pdl bug in trtllm-gen fmha kernels (#9913 ) Signed-off-by: Perkz Zheng <67892460+PerkzZheng@users.noreply.github.com>		2025-12-16 00:34:37 -08:00
..
batchedGemm	[https://nvbugs/5661741 ][fix] Fix accuracy issue in TRTLLM MoE introduced in #9377 (#9999 )	2025-12-15 03:31:56 -08:00
blockScaleMoe	[None][feat] Add routing support for the new model for both cutlass and trtllm moe backend (#9792 )	2025-12-15 19:59:08 -08:00
fmha	[https://nvbugs/5727952 ][fix] a pdl bug in trtllm-gen fmha kernels (#9913 )	2025-12-16 00:34:37 -08:00
gemm	[None][fix] Introduce inline namespace to avoid symbol collision (#9541 )	2025-12-12 23:32:15 +08:00
gemmGatedAct	[None][fix] Introduce inline namespace to avoid symbol collision (#9541 )	2025-12-12 23:32:15 +08:00
CMakeLists.txt	feat: update DeepSeek FP8 TRT-LLM Gen cubins (#4643 )	2025-06-03 14:07:54 -07:00