allreduce_gemm
[None][fix] Introduce inline namespace to avoid symbol collision (#9541)
2025-12-12 23:32:15 +08:00
fp4_gemm
[None][fix] Introduce inline namespace to avoid symbol collision (#9541)
2025-12-12 23:32:15 +08:00
fp8_blockscale_gemm
[None][feat] Drop non-deepgemm fp8 block scale gemm (#10256)
2025-12-25 14:52:52 +08:00
fp8_rowwise_gemm
[None][fix] Introduce inline namespace to avoid symbol collision (#9541)
2025-12-12 23:32:15 +08:00
fpA_intB_gemm
[None][feat] sm100 weight-only kernel (#10190)
2026-01-05 09:44:36 +08:00
fused_gated_gemm
[None][fix] Introduce inline namespace to avoid symbol collision (#9541)
2025-12-12 23:32:15 +08:00
include
[https://nvbugs/5726962][feat] Apply fusion for W4AFP8_AWQ MoE (#9838)
2026-01-06 10:16:41 +08:00
int8_gemm
[None][fix] Introduce inline namespace to avoid symbol collision (#9541)
2025-12-12 23:32:15 +08:00
low_latency_gemm
[None][fix] Introduce inline namespace to avoid symbol collision (#9541)
2025-12-12 23:32:15 +08:00
moe_gemm
[https://nvbugs/5726962][feat] Apply fusion for W4AFP8_AWQ MoE (#9838)
2026-01-06 10:16:41 +08:00
python
[https://nvbugs/5726962][feat] Apply fusion for W4AFP8_AWQ MoE (#9838)
2026-01-06 10:16:41 +08:00
CMakeLists.txt
[https://nvbugs/5690172][fix] Fix Qwen3-235B ATP accuracy issue with PDL (#9530)
2025-12-01 09:10:21 +08:00
cutlass_heuristic.cpp
[None][feat] sm100 weight-only kernel (#10190)
2026-01-05 09:44:36 +08:00
cutlass_heuristic.h
[None][fix] Introduce inline namespace to avoid symbol collision (#9541)
2025-12-12 23:32:15 +08:00
cutlass_preprocessors.cpp
[None][feat] sm100 weight-only kernel (#10190)
2026-01-05 09:44:36 +08:00
cutlass_preprocessors.h
[None][fix] Introduce inline namespace to avoid symbol collision (#9541)
2025-12-12 23:32:15 +08:00
cutlass_type_conversion.h
[None][fix] Introduce inline namespace to avoid symbol collision (#9541)
2025-12-12 23:32:15 +08:00