kanshan/TensorRT-LLMs
Mirror of https://github.com/NVIDIA/TensorRT-LLM.git, synced 2026-01-25 05:02:59 +08:00
1bfc7d4c29
TensorRT-LLMs/tests/integration/test_lists
Yuxian Qiu 87b50a5736
fix: [nvbugs/5289912][nvbugs/5232406] use thread pool for multi-thread weight loading in fused moe. (#4699)
Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
2025-05-28 08:13:06 +08:00
dev         Update (#2978)    2025-03-23 16:39:35 +08:00
qa          [TRTLLM-4932] Add QA accuracy tests for NIM-prioritized models (#4242)    2025-05-24 19:17:21 +08:00
test-db     [fix] Fix Llama4 allgather error due to None tensor (#4511)    2025-05-24 19:12:12 +08:00
waives.txt  fix: [nvbugs/5289912][nvbugs/5232406] use thread pool for multi-thread weight loading in fused moe. (#4699)    2025-05-28 08:13:06 +08:00