TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

History

Gabriel Wu 2e0cd7922e fix: add SM90 guard for FP8 Blockscale GEMM (#3575 ) * fix: add SM90 guard for FP8 Blockscale GEMM Signed-off-by: Zihua Wu <13583761+lucifer1004@users.noreply.github.com> * fix: add SM90 guard for FP8 Blockscale GEMM Signed-off-by: Zihua Wu <13583761+lucifer1004@users.noreply.github.com> --------- Signed-off-by: Zihua Wu <13583761+lucifer1004@users.noreply.github.com> Co-authored-by: Tao Li @ NVIDIA <tali@nvidia.com>		2025-04-16 14:44:37 +08:00
..
compiler.cuh	fix: add SM90 guard for FP8 Blockscale GEMM (#3575 )	2025-04-16 14:44:37 +08:00
fp8_gemm_impl.cuh	feat: use NVRTC for DeepGEMM JIT compilation (#3239 )	2025-04-07 20:29:23 +08:00
fp8_gemm.cuh	feat: use NVRTC for DeepGEMM JIT compilation (#3239 )	2025-04-07 20:29:23 +08:00
jit_utils.cuh	feat: use NVRTC for DeepGEMM JIT compilation (#3239 )	2025-04-07 20:29:23 +08:00
mma_utils.cuh	feat: use NVRTC for DeepGEMM JIT compilation (#3239 )	2025-04-07 20:29:23 +08:00
nvrtc_cutlass.cuh	feat: use NVRTC for DeepGEMM JIT compilation (#3239 )	2025-04-07 20:29:23 +08:00
nvrtc_std.cuh	feat: use NVRTC for DeepGEMM JIT compilation (#3239 )	2025-04-07 20:29:23 +08:00
runtime.cuh	feat: use NVRTC for DeepGEMM JIT compilation (#3239 )	2025-04-07 20:29:23 +08:00
scheduler.cuh	feat: use NVRTC for DeepGEMM JIT compilation (#3239 )	2025-04-07 20:29:23 +08:00
tma_utils.cuh	feat: use NVRTC for DeepGEMM JIT compilation (#3239 )	2025-04-07 20:29:23 +08:00
utils.cuh	feat: use NVRTC for DeepGEMM JIT compilation (#3239 )	2025-04-07 20:29:23 +08:00