TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

History

bhsueh_NV 2e14c8f443 [Fix][Chore][Qwen3] fix bug of using fp4 on sm120 (#6065 ) Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com>		2025-07-20 10:25:25 +08:00
..
dev	Update (#2978 )	2025-03-23 16:39:35 +08:00
qa	[refactor] Unify name of NGram speculative decoding (#5937 )	2025-07-19 12:59:57 +08:00
test-db	[TRTLLM-6452][feat]: Two-model engine KV cache reuse support (#6133 )	2025-07-19 13:17:15 +08:00
waives.txt	[Fix][Chore][Qwen3] fix bug of using fp4 on sm120 (#6065 )	2025-07-20 10:25:25 +08:00