TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

History

Anthony Chang ad12b795c9 [https://nvbugs/5661741 ][fix] Fix accuracy issue in TRTLLM MoE introduced in #9377 (#9999 ) Signed-off-by: Anthony Chang <27950904+rosenrodt@users.noreply.github.com>		2025-12-15 03:31:56 -08:00
..
batchedGemm	[https://nvbugs/5661741 ][fix] Fix accuracy issue in TRTLLM MoE introduced in #9377 (#9999 )	2025-12-15 03:31:56 -08:00
blockScaleMoe	[None][fix] Introduce inline namespace to avoid symbol collision (#9541 )	2025-12-12 23:32:15 +08:00
fmha	[None][fix] Introduce inline namespace to avoid symbol collision (#9541 )	2025-12-12 23:32:15 +08:00
gemm	[None][fix] Introduce inline namespace to avoid symbol collision (#9541 )	2025-12-12 23:32:15 +08:00
gemmGatedAct	[None][fix] Introduce inline namespace to avoid symbol collision (#9541 )	2025-12-12 23:32:15 +08:00
CMakeLists.txt	feat: update DeepSeek FP8 TRT-LLM Gen cubins (#4643 )	2025-06-03 14:07:54 -07:00