TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-06 03:01:50 +08:00

History

Tian Zheng 5efee01da1 [None][feat] Add Skip Softmax MLA kernels for Blackwell and Fix an accuracy bug of NVFP4 KV (#10813 ) Signed-off-by: Tian Zheng <29906817+Tom-Zheng@users.noreply.github.com>		2026-01-26 16:46:33 +08:00
..
dev	Update (#2978 )	2025-03-23 16:39:35 +08:00
qa	[https://nvbugs/5661741 ][feat] Add 250K-token NVFP4 MoE + PDL regression tests (#10911 )	2026-01-26 01:48:29 -05:00
test-db	[None][feat] Add Skip Softmax MLA kernels for Blackwell and Fix an accuracy bug of NVFP4 KV (#10813 )	2026-01-26 16:46:33 +08:00
waives.txt	[None][feat] Add Skip Softmax MLA kernels for Blackwell and Fix an accuracy bug of NVFP4 KV (#10813 )	2026-01-26 16:46:33 +08:00