TensorRT-LLMs/cpp/kernels/fmha_v2/src/fmha/hopper
Zhou Yuxin f01101f687
[None][feat] Hopper Fp8 context mla (#7116)
Signed-off-by: Yuxin <yuxinz@nvidia.com>
2025-08-26 17:10:20 +08:00
..
arrive_wait.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
compute_tile.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
fragment.h [None][feat] Add support for Hopper MLA chunked prefill (#6655) 2025-08-14 10:39:26 +08:00
gmem_tile_o_packed.h [None][feat] Hopper Fp8 context mla (#7116) 2025-08-26 17:10:20 +08:00
gmem_tile_qkv_packed.h hopper-style context MLA (#5713) 2025-07-23 14:37:20 +08:00
gmma_descriptor.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
kernel_traits.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
smem_tile_o.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
smem_tile.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
tma_descriptor.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
tma_types.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
utils_gmma.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
utils_hgmma_bf16.h hopper-style context MLA (#5713) 2025-07-23 14:37:20 +08:00
utils_hgmma.h hopper-style context MLA (#5713) 2025-07-23 14:37:20 +08:00
utils_igmma.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
utils_qgmma.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
utils_tma.h hopper-style context MLA (#5713) 2025-07-23 14:37:20 +08:00
utils_warpgroup.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00