TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-25 05:02:59 +08:00

Yuxian Qiu 87b50a5736 fix: [nvbugs/5289912][nvbugs/5232406] use thread pool for multi-thread weight loading in fused moe. (#4699) Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>	2025-05-28 08:13:06 +08:00
dev	Update (#2978)	2025-03-23 16:39:35 +08:00
qa	[TRTLLM-4932] Add QA accuracy tests for NIM-prioritized models (#4242)	2025-05-24 19:17:21 +08:00
test-db	[fix] Fix Llama4 allgather error due to None tensor (#4511)	2025-05-24 19:12:12 +08:00
waives.txt	fix: [nvbugs/5289912][nvbugs/5232406] use thread pool for multi-thread weight loading in fused moe. (#4699)	2025-05-28 08:13:06 +08:00