TensorRT-LLMs/cpp/kernels
qsang-nv 8ef8e73002
update spec_dec (#6079)
Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>
2025-07-16 17:50:43 +08:00
..
fmha_v2 [https://nvbugspro.nvidia.com/bug/5355054] fallback to cubins for fp8 fmha kernels on Ada. (#5779) 2025-07-14 17:17:30 +08:00
xqa update spec_dec (#6079) 2025-07-16 17:50:43 +08:00