Venky
|
639c939a4f
|
[TRTC-1943][feat] Env vars override support in LLM API (#9104)
Signed-off-by: Venky Ganesh <23023424+venkywonka@users.noreply.github.com>
|
2025-12-01 10:04:49 -08:00 |
|
Yanchao Lu
|
7127c4407a
|
[None][test] [None][test] Waive main branch test failures 12/1 (#9566)
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
|
2025-12-01 21:54:53 +08:00 |
|
Shi Xiaowei
|
48b1d31895
|
[https://nvbugs/5651854][infra] Enable perf metrics during accuracy testing (#9140)
|
2025-12-01 20:15:32 +08:00 |
|
JadoTu
|
a92af27411
|
[None][chore] remove qwen3-next accuracy tests (#9534)
Signed-off-by: jiant <107457950+JadoTu@users.noreply.github.com>
|
2025-12-01 11:49:37 +08:00 |
|
Pengbo Wang
|
aa3310f64f
|
[https://nvbugs/5503479][fix] Temporarily lower reference accuracy to stabilize CI (#9398)
Signed-off-by: Pengbo Wang <221450789+pengbowang-nv@users.noreply.github.com>
|
2025-12-01 11:49:14 +08:00 |
|
Enwei Zhu
|
2e3ac3c48f
|
[https://nvbugs/5684703][fix] Unwaive disagg guided decoding test (#9466)
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
|
2025-12-01 11:39:40 +08:00 |
|
JunyiXu-nv
|
3f588198dc
|
[None][fix] Fix port conflict in disagg tests (#9474)
Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>
|
2025-11-30 17:33:22 +08:00 |
|
Emma Qiao
|
c927ccf510
|
[None][infra] Wiave failed tests for main branch on 11/30 (#9555)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-30 16:13:20 +08:00 |
|
brb-nv
|
b77f4ffe54
|
[TRTLLM-5971][feat] Integrate helix parallelism (#9342)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
|
2025-11-29 15:17:30 -08:00 |
|
dominicshanshan
|
6345074686
|
[None][chore] Weekly mass integration of release/1.1 -- rebase (#9522)
Signed-off-by: yunruis <205571022+yunruis@users.noreply.github.com>
Signed-off-by: Mike Iovine <6158008+mikeiovine@users.noreply.github.com>
Signed-off-by: Mike Iovine <miovine@nvidia.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
Signed-off-by: qgai <qgai@nvidia.com>
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>
Signed-off-by: Simeng Liu <simengl@nvidia.com>
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Jin Li <59594262+liji-nv@users.noreply.github.com>
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
Signed-off-by: Vincent Zhang <vinczhang@nvidia.com>
Signed-off-by: peaceh <103117813+peaceh-nv@users.noreply.github.com>
Signed-off-by: Michal Guzek <mguzek@nvidia.com>
Signed-off-by: Michal Guzek <moraxu@users.noreply.github.com>
Signed-off-by: Chang Liu (Enterprise Products) <9713593+chang-l@users.noreply.github.com>
Signed-off-by: leslie-fang25 <leslief@nvidia.com>
Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
Co-authored-by: yunruis <205571022+yunruis@users.noreply.github.com>
Co-authored-by: sunnyqgg <159101675+sunnyqgg@users.noreply.github.com>
Co-authored-by: brb-nv <169953907+brb-nv@users.noreply.github.com>
Co-authored-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
Co-authored-by: JunyiXu-nv <219237550+JunyiXu-nv@users.noreply.github.com>
Co-authored-by: Simeng Liu <109828133+SimengLiu-nv@users.noreply.github.com>
Co-authored-by: Guoming Zhang <137257613+nv-guomingz@users.noreply.github.com>
Co-authored-by: Jin Li <59594262+liji-nv@users.noreply.github.com>
Co-authored-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
Co-authored-by: Vincent Zhang <vcheungyi@163.com>
Co-authored-by: peaceh-nv <103117813+peaceh-nv@users.noreply.github.com>
Co-authored-by: Michal Guzek <moraxu@users.noreply.github.com>
Co-authored-by: Chang Liu <9713593+chang-l@users.noreply.github.com>
Co-authored-by: Leslie Fang <leslief@nvidia.com>
Co-authored-by: Shunkangz <182541032+Shunkangz@users.noreply.github.com>
Co-authored-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
Co-authored-by: QI JUN <22017000+QiJune@users.noreply.github.com>
|
2025-11-29 21:48:48 +08:00 |
|
dominicshanshan
|
70efa3ac43
|
[None][infra] Waive failed case in pre-merge on 11/28 (#9537)
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
|
2025-11-28 20:53:45 +08:00 |
|
Emma Qiao
|
2d7421b314
|
[None][infra] Waive failed cases for main branch on 11/28 (#9539)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-28 17:19:55 +08:00 |
|
yufeiwu-nv
|
08755a809d
|
[https://nvbugs/5689658][test] Fix gpu lock issue running on cluster (#9441)
Signed-off-by: yufeiwu <230315618+yufeiwu-nv@users.noreply.github.com>
|
2025-11-28 13:59:22 +08:00 |
|
JunyiXu-nv
|
c87e81c1d8
|
[https://nvbugs/5685015][fix] Update invalid max_token test (#9435)
Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>
|
2025-11-28 11:41:16 +08:00 |
|
Bo Li
|
19f3f4e520
|
[https://nvbugs/5637037][chore] Update waive lists. (#9386)
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
Co-authored-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
|
2025-11-28 10:45:22 +08:00 |
|
Yueh-Ting (eop) Chen
|
4cbfc10b28
|
[https://nvbugs/5674665][chore] Add test coverage for https://nvbugspro.nvidia.com/bug/5674665 (#9518)
Signed-off-by: eopXD <yuehtingc@nvidia.com>
|
2025-11-27 21:40:34 +08:00 |
|
Fanrong Li
|
2d5eadf65f
|
[None][fix] fix TP support for DeepSeek-V3.2 on hopper (#9484)
Signed-off-by: Fanrong Li <23290157+lfr-0531@users.noreply.github.com>
|
2025-11-27 21:02:25 +08:00 |
|
JadoTu
|
51bf7164d3
|
[None][feat] add qwen3-next CI test of accuracy on BF16 and NVFP4 (#9330)
Signed-off-by: jiant <107457950+JadoTu@users.noreply.github.com>
|
2025-11-27 18:05:00 +08:00 |
|
Lizhi Zhou
|
8104a78931
|
[None][chore] revert batch_size=1 to prevent timeout and lower accuracy reference by 0.12% as a WAR (#9447)
Signed-off-by: Lizhi Zhou <1432185+reasonsolo@users.noreply.github.com>
Co-authored-by: Shi Xiaowei <39303645+Shixiaowei02@users.noreply.github.com>
|
2025-11-27 14:25:44 +08:00 |
|
Emma Qiao
|
0442510304
|
[None][infra] Waive failed case in pre-merge on 11/27 (#9507)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-27 13:53:33 +08:00 |
|
HuiGao-NV
|
03331bc43d
|
[https://nvbugs/5547414][fix] enable case after using local cache model (#9473)
Signed-off-by: Hui Gao <huig@nvidia.com>
|
2025-11-27 12:18:20 +08:00 |
|
Patrice Castonguay
|
1b2da426cd
|
[https://nvbugs/5680310][fix] Fix ctx only timed out test (#9410)
Signed-off-by: Patrice Castonguay <55748270+pcastonguay@users.noreply.github.com>
|
2025-11-27 11:21:21 +08:00 |
|
Shi Xiaowei
|
e76e149861
|
[https://nvbugs/5608930][fix] Fix a typo (#9487)
Signed-off-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2025-11-27 09:05:17 +08:00 |
|
Chang Liu
|
b10137fdd5
|
[None][feat] Support MLA chunked prefill for DeepSeek V3.2 model (#9376)
Signed-off-by: Chang Liu (Enterprise Products) <9713593+chang-l@users.noreply.github.com>
|
2025-11-26 16:38:25 +08:00 |
|
JunyiXu-nv
|
b7308a4000
|
[https://nvbugs/5580099][fix] Cherry pick IMA issue fix from release/1.1 (#9032)
Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>
|
2025-11-26 13:09:06 +08:00 |
|
Wanli Jiang
|
d100599ea7
|
[TRTLLM-9264][fix] Add accuracy/unit tests/doc for phi4mm (#9246)
Signed-off-by: Wanli Jiang <35160485+Wanli-Jiang@users.noreply.github.com>
|
2025-11-26 11:12:35 +08:00 |
|
QI JUN
|
5972119e1c
|
[None][ci] move some slow test cases of DGX-B200 to post merge (#9467)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
|
2025-11-26 10:48:53 +08:00 |
|
fredricz-20070104
|
6a64cb4c71
|
[TRTLLM-8936][test] Add disagg and wideep multi-node multi-gpu test cases (#9356)
Signed-off-by: FredricZ-2007 <226039983+fredricz-20070104@users.noreply.github.com>
|
2025-11-26 10:34:49 +08:00 |
|
Chuang Zhu
|
0e9c7f8c07
|
[https://nvbugs/5685143][fix] avoid cudaFree overlap with cuda graph (#9438)
Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>
|
2025-11-25 16:20:29 -08:00 |
|
Suyog Gupta
|
e484bec82f
|
[None][chore] AutoDeploy add multi stream moe pass to default.yaml (#9430)
Signed-off-by: Suyog Gupta <41447211+suyoggupta@users.noreply.github.com>
|
2025-11-25 14:16:13 -08:00 |
|
Fanrong Li
|
8da59103d6
|
[https://nvbugs/5680905][fix] Relax the MMLU accuracy requirement for DS-v3.2 (#9439)
Signed-off-by: Fanrong Li <23290157+lfr-0531@users.noreply.github.com>
|
2025-11-26 00:32:20 +08:00 |
|
Yan Chunwei
|
1f43dc8174
|
[None][ci] waive a test (#9458)
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
|
2025-11-25 07:04:20 -08:00 |
|
YueWeng
|
cc336c4abd
|
[TRTLLM-8160][feat] Add draft token tree runtime on CDL (#8586)
Signed-off-by: Yue Weng <25103990+yweng0828@users.noreply.github.com>
|
2025-11-25 09:40:55 -05:00 |
|
Shi Xiaowei
|
60786574db
|
[None][fix] Mitigate test timeout issues (#9445)
Signed-off-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2025-11-25 20:17:54 +08:00 |
|
Chao Ni
|
a2d9e6250a
|
[https://nvbugs/5667922][fix] Update long context evaluation config (#9426)
Signed-off-by: mni <125171826+baize97@users.noreply.github.com>
|
2025-11-25 19:33:38 +08:00 |
|
Yanchao Lu
|
ff02e0f05c
|
[None][ci] Move more test stages to use OCI machines (#9395)
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Co-authored-by: Matt Lefebvre <matthewelefebvre@gmail.com>
|
2025-11-25 15:59:13 +08:00 |
|
Eran Geva
|
6af01dc664
|
[#8391][chore] test_perf.py to lock clocks read from gpu_configs.yml instead of max freq (#9409)
Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>
|
2025-11-25 09:20:33 +02:00 |
|
Emma Qiao
|
15616e3ee5
|
[None][infra] Waive failed cases for main branch on 11/25 (#9429)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-24 23:18:15 -08:00 |
|
Suyog Gupta
|
efd503751f
|
[#9271][perf] Enable multi-stream MOE optimization in AutoDeploy (#9322)
Signed-off-by: Suyog Gupta <41447211+suyoggupta@users.noreply.github.com>
|
2025-11-24 19:50:10 -08:00 |
|
kris1025
|
d1c724958d
|
[None][chore] unwaive ampere kernels test (#9389)
Signed-off-by: linquanh <linquanh@nvidia.com>
|
2025-11-25 11:28:43 +08:00 |
|
xinhe-nv
|
0a9ae2e3e6
|
[None][chore] Remove closed bugs (#9381)
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
|
2025-11-24 18:49:57 -08:00 |
|
QI JUN
|
786d308b88
|
[https://nvbugs/5685428][fix] fix test_openai_chat_multimodal.py (#9406)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
|
2025-11-24 16:56:33 -08:00 |
|
Yibin Li
|
1ce483c999
|
[TRTLLM-7967][feat] Adding Starcoder2 PyTorch Backend Support (#8923)
Signed-off-by: Yibin Li <109242046+yibinl-nvidia@users.noreply.github.com>
|
2025-11-24 11:23:22 -08:00 |
|
Emma Qiao
|
2c869f2bda
|
[None][infra] Waive failed cases for main (#9400)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-24 17:42:19 +08:00 |
|
Emma Qiao
|
af72d93fa9
|
[None][infra] Waive failed cases on main branch (#9384)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-23 22:53:02 -08:00 |
|
brb-nv
|
c045e359a7
|
[https://nvbugs/5637012][fix] Fix helix unit tests (#9369)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
|
2025-11-23 19:34:22 -08:00 |
|
QI JUN
|
34a6d2d28f
|
[TRTLLM-9302][chore] Move build config from BaseLlmArgs to TrtLlmArgs (#9249)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
|
2025-11-24 10:54:41 +08:00 |
|
Chenghao Zhang
|
e1c9aa7d6a
|
[None][chore] AutoDeploy: Add the Nemotron MOE to CI (#9328)
Signed-off-by: Chenghao Zhang <211069071+nvchenghaoz@users.noreply.github.com>
Co-authored-by: Suyog Gupta <41447211+suyoggupta@users.noreply.github.com>
|
2025-11-23 12:12:12 -08:00 |
|
Yan Chunwei
|
1ef69ecbb1
|
[None][ci] waive two ray tests (#9375)
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
|
2025-11-23 15:39:01 +08:00 |
|
dongfengy
|
268ea9bb8a
|
[None][test] Add one-model and overlap-scheduling to eagle tests for GPTOSS (#9312)
Signed-off-by: Dongfeng Yu <dongfengy@nvidia.com>
|
2025-11-21 22:52:53 -08:00 |
|