TensorRT-LLMs/cpp/tensorrt_llm/kernels/cutlass_kernels
benzh-2025 6df2c8a074
[None][feat] add fp4 gemm + allreduce (#9729)
Signed-off-by: benzh 
Signed-off-by: benzh-2025
2026-01-13 21:11:13 +08:00
..
allreduce_gemm [None][feat] add fp4 gemm + allreduce (#9729) 2026-01-13 21:11:13 +08:00
fp4_gemm [None][fix] Introduce inline namespace to avoid symbol collision (#9541) 2025-12-12 23:32:15 +08:00
fp8_blockscale_gemm [None][feat] Drop non-deepgemm fp8 block scale gemm (#10256) 2025-12-25 14:52:52 +08:00
fp8_rowwise_gemm [None][fix] Introduce inline namespace to avoid symbol collision (#9541) 2025-12-12 23:32:15 +08:00
fpA_intB_gemm [None][feat] sm100 weight-only kernel (#10190) 2026-01-05 09:44:36 +08:00
fused_gated_gemm [None][fix] Introduce inline namespace to avoid symbol collision (#9541) 2025-12-12 23:32:15 +08:00
include [https://nvbugs/5726962][feat] Apply fusion for W4AFP8_AWQ MoE (#9838) 2026-01-06 10:16:41 +08:00
int8_gemm [None][fix] Introduce inline namespace to avoid symbol collision (#9541) 2025-12-12 23:32:15 +08:00
low_latency_gemm [None][fix] Introduce inline namespace to avoid symbol collision (#9541) 2025-12-12 23:32:15 +08:00
moe_gemm [https://nvbugs/5726962][feat] Apply fusion for W4AFP8_AWQ MoE (#9838) 2026-01-06 10:16:41 +08:00
python [https://nvbugs/5726962][feat] Apply fusion for W4AFP8_AWQ MoE (#9838) 2026-01-06 10:16:41 +08:00
CMakeLists.txt [https://nvbugs/5690172][fix] Fix Qwen3-235B ATP accuracy issue with PDL (#9530) 2025-12-01 09:10:21 +08:00
cutlass_heuristic.cpp [None][feat] sm100 weight-only kernel (#10190) 2026-01-05 09:44:36 +08:00
cutlass_heuristic.h [None][fix] Introduce inline namespace to avoid symbol collision (#9541) 2025-12-12 23:32:15 +08:00
cutlass_preprocessors.cpp [None][feat] sm100 weight-only kernel (#10190) 2026-01-05 09:44:36 +08:00
cutlass_preprocessors.h [None][fix] Introduce inline namespace to avoid symbol collision (#9541) 2025-12-12 23:32:15 +08:00
cutlass_type_conversion.h [None][fix] Introduce inline namespace to avoid symbol collision (#9541) 2025-12-12 23:32:15 +08:00