TensorRT-LLMs/cpp/tensorrt_llm/kernels/trtllmGenKernels
Nikita Korobov 3b4f26e4d1
[None][feat] update TRT-LLM Gen MoE for NvFp4 + bias with tileN=256 (#9734)
Signed-off-by: Nikita Korobov <14355239+nekorobov@users.noreply.github.com>
2025-12-18 11:58:23 +01:00
..
batchedGemm [None][feat] update TRT-LLM Gen MoE for NvFp4 + bias with tileN=256 (#9734) 2025-12-18 11:58:23 +01:00
blockScaleMoe [None][feat] Add routing support for the new model for both cutlass and trtllm moe backend (#9792) 2025-12-15 19:59:08 -08:00
fmha [https://nvbugs/5727952][fix] a pdl bug in trtllm-gen fmha kernels (#9913) 2025-12-16 00:34:37 -08:00
gemm [None][fix] Introduce inline namespace to avoid symbol collision (#9541) 2025-12-12 23:32:15 +08:00
gemmGatedAct [None][fix] Introduce inline namespace to avoid symbol collision (#9541) 2025-12-12 23:32:15 +08:00
CMakeLists.txt feat: update DeepSeek FP8 TRT-LLM Gen cubins (#4643) 2025-06-03 14:07:54 -07:00