TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

History

Enwei Zhu c31ca1688c [https://nvbugs/5214229 ] [fix] Unwaive lm_head quantization case (#4222 ) unwaive Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>		2025-05-12 20:23:06 +08:00
..
dev	Update (#2978 )	2025-03-23 16:39:35 +08:00
qa	test: add llama_3.2_1B model and fix for test lora script issue (#4139 )	2025-05-12 14:51:59 +08:00
test-db	feat: add kv cache aware router (#3831 )	2025-05-12 07:23:57 -04:00
waives.txt	[https://nvbugs/5214229 ] [fix] Unwaive lm_head quantization case (#4222 )	2025-05-12 20:23:06 +08:00