Kaiyu Xie
|
810249c304
|
[https://nvbugs/5769926] [fix] Add no container mount home WAR (#10431)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2026-01-06 13:09:25 +08:00 |
|
fredricz-20070104
|
621156ad44
|
[None][chore] Fix GB300 support issues (#10196)
Signed-off-by: FredricZ-2007 <226039983+fredricz-20070104@users.noreply.github.com>
Signed-off-by: fredricz-20070104 <226039983+fredricz-20070104@users.noreply.github.com>
|
2025-12-23 10:42:41 +08:00 |
|
Kaiyu Xie
|
5a611cb8f5
|
[None] [feat] Enhancements to slurm scripts (#10112)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-12-21 10:24:56 -05:00 |
|
Venky
|
dfa11d810e
|
[TRTC-102][docs] --extra_llm_api_options->--config in docs/examples/tests (#10005)
|
2025-12-19 13:48:43 -05:00 |
|
Kaiyu Xie
|
02fd13448b
|
[None] [feat] Enhancements to slurm scripts (#10031)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-12-16 19:31:27 -08:00 |
|
Kaiyu Xie
|
ef4ea955b2
|
[None] [fix] Fix slrum scripts (#10007)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-12-15 04:20:53 -08:00 |
|
Kaiyu Xie
|
504ede707e
|
[None] [fix] Fix nsys_on argument for slurm scripts (#9995)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-12-14 22:41:30 -08:00 |
|
Kaiyu Xie
|
0788635d6c
|
[TRTLLM-9762] [doc] Update documents for GB300 NVL72 (#9987)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-12-14 19:30:28 -08:00 |
|
Balaram Buddharaju
|
6a6e41f802
|
[TRTLLM-9468][chore] Update disagg benchmarking scripts to support context parallelism (#9720)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
|
2025-12-12 22:29:41 -08:00 |
|
Kaiyu Xie
|
110820bb15
|
[TRTLLM-9792] [feat] Support multiple instances on single node for slurm scripts (#9900)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-12-12 12:12:08 +08:00 |
|
fredricz-20070104
|
341cb1a12c
|
[None][chore] Add GB300 support since it does not support segment (#9731)
Signed-off-by: FredricZ-2007 <226039983+fredricz-20070104@users.noreply.github.com>
|
2025-12-10 18:36:55 -08:00 |
|
Kaiyu Xie
|
069b05cf3d
|
[TRTLLM-9706] [doc] Update wide EP documents (#9724)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-12-08 11:21:11 +08:00 |
|
Zhenhuan Chen
|
24004535fe
|
[None][chore] refactor disaggregated scripts to use named arguments (#9581)
Signed-off-by: Zhenhuan Chen <zhenhuanc@nvidia.com>
|
2025-12-01 17:33:47 +08:00 |
|
Enwei Zhu
|
34e2fa5c96
|
[https://nvbugs/5690172][fix] Fix Qwen3-235B ATP accuracy issue with PDL (#9530)
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
|
2025-12-01 09:10:21 +08:00 |
|
Kaiyu Xie
|
0d3c0c2156
|
[None] [chore] Enhancements and clean up to slurm scripts (#9493)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-11-28 16:41:41 +08:00 |
|
Enwei Zhu
|
c2562fc800
|
[https://nvbugs/5687820][fix] Remove self.abort() in DetokenizedGenerationResult (#9449)
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
|
2025-11-27 22:54:40 +08:00 |
|
Zero Zeng
|
43896af1b1
|
[None][chore] benchmark refactor (#9207)
Signed-off-by: Zero Zeng <38289304+zerollzeng@users.noreply.github.com>
|
2025-11-17 23:29:28 -08:00 |
|
Zero Zeng
|
c6cce398f5
|
[TRTLLM-9053][feat] Support accuracy test and install from wheel (#9038)
Signed-off-by: Zero Zeng <38289304+zerollzeng@users.noreply.github.com>
|
2025-11-13 23:34:47 -08:00 |
|
fredricz-20070104
|
fdd9e4fe00
|
[TRTLLM-7251][test] Get submit eplb slots empty key work (#8945)
Signed-off-by: FredricZ-2007 <226039983+fredricz-20070104@users.noreply.github.com>
|
2025-11-05 05:21:02 -08:00 |
|
Kaiyu Xie
|
db2a42f641
|
[None][chore] Add sample yaml for wide-ep example and minor fixes (#8825)
Signed-off-by: Zero Zeng <38289304+zerollzeng@users.noreply.github.com>
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
Co-authored-by: Zero Zeng <38289304+zerollzeng@users.noreply.github.com>
|
2025-11-03 07:48:34 -08:00 |
|
Zero Zeng
|
4545700fcf
|
[None][chore] Move submit.sh to python and use yaml configuration (#8003)
Signed-off-by: Zero Zeng <38289304+zerollzeng@users.noreply.github.com>
|
2025-10-20 22:36:50 -04:00 |
|
Xianjie Qiao
|
d145e87f6f
|
[None][chore] Update disagg benchmark configs (#8289)
Signed-off-by: Xianjie <5410381+qiaoxj07@users.noreply.github.com>
Signed-off-by: Xianjie Qiao <5410381+qiaoxj07@users.noreply.github.com>
|
2025-10-13 18:15:46 +08:00 |
|
Zero Zeng
|
16bb76c31d
|
[None][chore] Update benchmark script (#7860)
Signed-off-by: Zero Zeng <38289304+zerollzeng@users.noreply.github.com>
Co-authored-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-09-23 03:15:42 -07:00 |
|
Kaiyu Xie
|
6eef19297f
|
[None] [chore] cherry pick changes on slurm scripts from release/1.1.0rc2 (#7750)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-09-16 16:07:13 +08:00 |
|
Kaiyu Xie
|
23f72c8bbd
|
[None] [feat] Use numa to bind CPU (#7304)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-08-28 06:27:11 -04:00 |
|
Kaiyu Xie
|
8a619be828
|
[None] [chore] Make disagg example compatible with recommended usage (#7121)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-08-27 23:57:46 +08:00 |
|
Raayan Dhar
|
82bd1871ea
|
[None][chore] update disagg readme and scripts for pipeline parallelism (#6875)
Signed-off-by: raayandhar <rdhar@nvidia.com>
|
2025-08-27 00:53:57 -04:00 |
|
Xianjie Qiao
|
19667304b5
|
[None] [chore] Update wide-ep genonly scripts (#6995)
Signed-off-by: Xianjie <5410381+qiaoxj07@users.noreply.github.com>
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
Co-authored-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-08-19 07:44:07 -04:00 |
|
Shi Xiaowei
|
1095dfd03c
|
[None][fix] BREAKING CHANGE: Mismatch between docs and actual commands (#6323)
|
2025-08-14 03:48:57 -04:00 |
|
Shi Xiaowei
|
fe7dda834d
|
[TRTLLM-7030][fix] Refactor the example doc of dist-serving (#6766)
Signed-off-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2025-08-13 17:39:27 +08:00 |
|