TensorRT-LLMs/cpp/kernels
Zhou Yuxin f01101f687
[None][feat] Hopper Fp8 context mla (#7116)
Signed-off-by: Yuxin <yuxinz@nvidia.com>
2025-08-26 17:10:20 +08:00
fmha_v2 [None][feat] Hopper Fp8 context mla (#7116) 2025-08-26 17:10:20 +08:00
xqa [fix]: use safeInitRowMax instead of fp32_lowest to avoid NaN (#7087) 2025-08-20 22:12:21 -07:00