TensorRT-LLMs/cpp/kernels
Bruce-Lee-LY 8c82ee2803
[fix] xqa precision for fp16/bf16 kv cache (#6573)
Signed-off-by: Bruce-Lee-LY <yong-li14@tsinghua.org.cn>
Co-authored-by: Bruce-Lee-LY <yong-li14@tsinghua.org.cn>
2025-08-04 14:34:20 +08:00
fmha_v2    hopper-style context MLA (#5713)                      2025-07-23 14:37:20 +08:00
xqa        [fix] xqa precision for fp16/bf16 kv cache (#6573)    2025-08-04 14:34:20 +08:00