TensorRT-LLMs/cpp/kernels
qsang-nv e9cd810071
keep sm90 headsize 128 cubins (#5320)
Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>
2025-06-26 12:14:01 +08:00
..
fmha_v2 keep sm90 headsize 128 cubins (#5320) 2025-06-26 12:14:01 +08:00
xqa [feat]: improve performance of XQA-MLA for sm120 (#5087) 2025-06-18 14:19:22 +08:00