TensorRT-LLMs/cpp/include/tensorrt_llm/deep_gemm
yunruis 30c5b4183a
refactoring: port customized kernels with public cutlass version (#5027)
Signed-off-by: yunruis 

Merge this to unblock others since the full CI has been run through
2025-06-13 16:19:31 +08:00
..
compiler.cuh Feat: add deep_gemm swapab Kernel (#4430) 2025-05-21 10:48:43 +08:00
fp8_gemm_impl.cuh Feat: add deep_gemm swapab Kernel (#4430) 2025-05-21 10:48:43 +08:00
fp8_gemm.cuh refactoring: port customized kernels with public cutlass version (#5027) 2025-06-13 16:19:31 +08:00
jit_utils.cuh Feat: add deep_gemm swapab Kernel (#4430) 2025-05-21 10:48:43 +08:00
mma_utils.cuh refactoring: port customized kernels with public cutlass version (#5027) 2025-06-13 16:19:31 +08:00
nvrtc_cutlass.cuh feat: use NVRTC for DeepGEMM JIT compilation (#3239) 2025-04-07 20:29:23 +08:00
nvrtc_std.cuh feat: use NVRTC for DeepGEMM JIT compilation (#3239) 2025-04-07 20:29:23 +08:00
runtime.cuh feat: use NVRTC for DeepGEMM JIT compilation (#3239) 2025-04-07 20:29:23 +08:00
scheduler.cuh refactoring: port customized kernels with public cutlass version (#5027) 2025-06-13 16:19:31 +08:00
tma_utils.cuh refactoring: port customized kernels with public cutlass version (#5027) 2025-06-13 16:19:31 +08:00
utils.cuh refactoring: port customized kernels with public cutlass version (#5027) 2025-06-13 16:19:31 +08:00