Frank
|
8bb3eea285
|
perf: Readd iteration logging for trtllm-bench. (#3039)
Signed-off-by: Frank Di Natale <3429989+FrankD412@users.noreply.github.com>
|
2025-04-01 08:13:09 +08:00 |
|
Mike Iovine
|
5416966ddb
|
Add initial EAGLE-3 implementation (#3035)
Signed-off-by: Mike Iovine <miovine@nvidia.com>
|
2025-03-29 22:31:24 +08:00 |
|
Yan Chunwei
|
82edd90350
|
fix gpus_per_node in trtllm-bench when world_size < device_count (#3007)
Signed-off-by: Superjomn <328693+Superjomn@users.noreply.github.com>
|
2025-03-27 09:31:40 +08:00 |
|
Anurag Mukkara
|
7361c7d401
|
Add second possible output (#3043)
Signed-off-by: Anurag Mukkara <amukkara@nvidia.com>
|
2025-03-25 12:59:27 -07:00 |
|
Yan Chunwei
|
c29cebf79d
|
Deprecate model_api examples (#2999)
Signed-off-by: Superjomn <328693+Superjomn@users.noreply.github.com>
|
2025-03-25 09:37:20 +08:00 |
|
Kaiyu Xie
|
2631f21089
|
Update (#2978)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-03-23 16:39:35 +08:00 |
|