kanshan/TensorRT-LLMs
Mirror of https://github.com/NVIDIA/TensorRT-LLM.git (synced 2026-02-06 11:11:36 +08:00)
TensorRT-LLMs/tests/unittest/_torch/auto_deploy/unit (at cc4c980e03)
Latest commit: cc4c980e03 by Tailing Yuan, 2025-11-14 10:03:00 +08:00
[None][feat] Add Qwen3-Next to layer-wise benchmarks (#9065)
Signed-off-by: Tailing Yuan <yuantailing@gmail.com>
..
multigpu   [TRTLLM-8201][feat] Nemotron H MoE Sharding (#8744)   2025-11-05 12:35:29 -08:00
singlegpu  [None][feat] Add Qwen3-Next to layer-wise benchmarks (#9065)   2025-11-14 10:03:00 +08:00