TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-06 11:11:36 +08:00

History

Tailing Yuan cc4c980e03 [None][feat] Add Qwen3-Next to layer-wise benchmarks (#9065 ) Signed-off-by: Tailing Yuan <yuantailing@gmail.com>		2025-11-14 10:03:00 +08:00
..
multigpu	[TRTLLM-8201][feat] Nemotron H MoE Sharding (#8744 )	2025-11-05 12:35:29 -08:00
singlegpu	[None][feat] Add Qwen3-Next to layer-wise benchmarks (#9065 )	2025-11-14 10:03:00 +08:00