This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-02-15 23:44:02 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
3f9dbc76c0
TensorRT-LLMs
/
tests
/
unittest
/
_torch
/
thop
History
ChristinaZ
c8b9998acb
[TRTLLM-8637][feat] Optimize the routing kernel for DeepseekV3 (MoE CUTLASS backend); Add support for KimiK2 and Qwen-next (MoE TRTLLM backend) (
#7761
)
...
Signed-off-by: Christina Zhang <83400082+ChristinaZ@users.noreply.github.com>
2025-10-20 10:08:31 +08:00
..
parallel
[TRTLLM-8637][feat] Optimize the routing kernel for DeepseekV3 (MoE CUTLASS backend); Add support for KimiK2 and Qwen-next (MoE TRTLLM backend) (
#7761
)
2025-10-20 10:08:31 +08:00
serial
[TRTLLM-7457][ci] Update unittest parallel config (
#7297
)
2025-08-29 09:28:04 +08:00