TensorRT-LLMs/cpp/kernels/fmha_v2/src/fmha
Faraz 27a5091fcb
[None][feat] GPT-OSS Sm120/Sm121 Support (#7937)
Signed-off-by: Perkz Zheng <67892460+PerkzZheng@users.noreply.github.com>
Signed-off-by: list <58580514+farazkh80@users.noreply.github.com>
Signed-off-by: Vincent Huang <vincenth@nvidia.com>
Co-authored-by: Perkz Zheng <67892460+PerkzZheng@users.noreply.github.com>
Co-authored-by: Vincent Huang <vincenth@nvidia.com>
2025-10-06 16:59:06 -04:00
..
hopper [None][feat] Hopper Fp8 context mla (#7116) 2025-08-26 17:10:20 +08:00
warpspec [TRTLLM-7192][feat] optimize MLA chunked prefill && support fp8 mla chunked prefill (#7477) 2025-09-15 21:43:49 +08:00
alibi_params.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
fragment.h [None][feat] GPT-OSS Sm120/Sm121 Support (#7937) 2025-10-06 16:59:06 -04:00
gemm.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
gmem_tile_o_packed.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
gmem_tile_o.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
gmem_tile_ps.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
gmem_tile_qkv_packed.h [None][feat] Use Separate QKV Input Layout for Context MLA (#6538) 2025-08-19 22:04:48 +08:00
gmem_tile_qkv.h hopper-style context MLA (#5713) 2025-07-23 14:37:20 +08:00
kernel_traits.h [None][feat] Use Separate QKV Input Layout for Context MLA (#6538) 2025-08-19 22:04:48 +08:00
mask.h [TRTLLM-6674][feat] (Breaking Change) Hopper SWA non-cyclic kernels + KV reuse + Spec Dec (#6379) 2025-08-05 07:47:41 +00:00
numeric_types.h fix fmha v2 tests (#4661) 2025-05-27 09:47:01 +08:00
paged_kv_cache.h chore: Improve documentation of Kv_block_array (#5765) 2025-07-05 22:25:27 +02:00
smem_tile_o.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
smem_tile_qkv.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
smem_tile_v.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
smem_tile.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
softmax.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
traits.h infra: open source fmha v2 kernels (#4185) 2025-05-15 10:56:34 +08:00
utils.h Clean: fmha codes (#4496) 2025-05-21 11:45:47 +08:00