This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-02-17 08:15:10 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
af899d2fe7
TensorRT-LLMs
/
tests
/
unittest
/
_torch
/
thop
History
nvxuanyuc
a5a37227d6
[None][feat] Fused kernels (qknormrope + moe routing) and two-model MTP support for glm4moe (
#9852
)
...
Signed-off-by: Xuanyu Chen <xuanyuc@nvidia.com>
2025-12-14 10:47:24 +08:00
..
parallel
[None][feat] Fused kernels (qknormrope + moe routing) and two-model MTP support for glm4moe (
#9852
)
2025-12-14 10:47:24 +08:00
serial
[
https://nvbugs/5575841
] [fix] Nvbug 5575841: Remove additional test waivers for TestMoEFP4 (
#9788
)
2025-12-09 13:37:55 +00:00