Bo Li
6b251cc7fa
[TRTLLM-9390][chore] Add Fake OPs for One-Sided AlltoAll. ( #11002 )
...
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
2026-01-27 15:55:07 +08:00
Lizhi Zhou
93ae8a14ab
[ #10889 ][fix] fix pydantic deepcopy bug ( #11004 )
...
Signed-off-by: Lizhi Zhou <1432185+reasonsolo@users.noreply.github.com>
2026-01-27 02:40:13 -05:00
xinhe-nv
069ad30bdb
[None][chore] Remove closed bugs ( #10982 )
...
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
2026-01-27 15:35:44 +08:00
Emma Qiao
c761b68481
[None][infra] Waive failed cases for main on 01/27 ( #11017 )
...
Signed-off-by: qqiao <qqiao@nvidia.com>
2026-01-27 15:24:54 +08:00
zhhuang-nv
ca9f70f78c
[ https://nvbugs/5612438 ][fix] Add timeout for SeedOSS test ( #8683 )
...
Signed-off-by: Zhen Huang <145532724+zhhuang-nv@users.noreply.github.com>
2026-01-27 15:22:21 +08:00
sunnyqgg
ff0dd6076e
[TRTLLM-10062][feat] Enable MTP for Nemotron Super ( #10754 )
...
Signed-off-by: qgai <qgai@nvidia.com>
2026-01-26 11:23:26 -05:00
Lucas Liebenwein
00f341be49
[ #8982 ][feat] AutoDeploy attention dp support ( #10728 )
...
Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>
2026-01-26 09:43:33 -05:00
Linda
ce556290c9
[None][chore] Removing pybind11 bindings and references ( #10550 )
...
Signed-off-by: Linda-Stadter <57756729+Linda-Stadter@users.noreply.github.com>
2026-01-26 08:19:12 -05:00
Pengyun Lin
ce37e27066
[ #10614 ][fix] gpt_oss first iteration streaming in trtllm-serve ( #10808 )
...
Signed-off-by: Pengyun Lin <81065165+LinPoly@users.noreply.github.com>
2026-01-26 20:53:11 +08:00
Pengbo Wang
5d7a5e6800
[ https://nvbugs/5779536 ][fix] Cherry-pick #10855 : Unwaive Llama 3.3 related multi GPU tests ( #10942 )
...
Signed-off-by: Pengbo Wang <221450789+pengbowang-nv@users.noreply.github.com>
2026-01-26 05:40:29 -05:00
Bo Li
e405468230
[TRTLLM-10048][feat] Fuse the AllGather for expert statistics required by the EPLB. ( #10885 )
...
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
2026-01-26 17:59:03 +08:00
Tian Zheng
5efee01da1
[None][feat] Add Skip Softmax MLA kernels for Blackwell and Fix an accuracy bug of NVFP4 KV ( #10813 )
...
Signed-off-by: Tian Zheng <29906817+Tom-Zheng@users.noreply.github.com>
2026-01-26 16:46:33 +08:00
Emma Qiao
a3a3ceb17f
[None][infra] Waive failed case for main branch on 01/26 ( #10994 )
...
Signed-off-by: qqiao <qqiao@nvidia.com>
2026-01-26 03:20:53 -05:00
xinhe-nv
d3406cb515
[None][chore] Add failed cases into waives.txt ( #10976 )
...
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
2026-01-26 02:23:05 -05:00
yingguo-trt
c8f1745a6e
[ https://nvbugs/5661741 ][feat] Add 250K-token NVFP4 MoE + PDL regression tests ( #10911 )
...
Signed-off-by: yingguo-trt <244492186+yingguo-trt@users.noreply.github.com>
2026-01-26 01:48:29 -05:00
xinhe-nv
2d8245d125
[None][chore] Add failed cases into waives.txt ( #10974 )
...
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
2026-01-26 00:33:50 -05:00
Enwei Zhu
ffab217974
[None][fix] Fix CuteDSL MoE unittest ( #10983 )
...
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
2026-01-26 08:34:17 +08:00
Yanchao Lu
45d7022cc3
[None][test] Waive failed tests on main 1/25 ( #10984 )
...
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
2026-01-26 00:32:02 +08:00
Enwei Zhu
72ef732bcf
[TRTLLM-10147][perf] Balanced random MoE workload generator for CuteDSL kernel UT, autotuner and layerwise benchmark ( #10279 )
...
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
2026-01-25 21:02:30 +08:00
Pengyun Lin
fd7fd8c39d
[ https://nvbugs/5747938 ][infra] Unwaive trtllm serve example test ( #10820 )
...
Signed-off-by: Pengyun Lin <81065165+LinPoly@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2026-01-25 18:12:21 +08:00
dominicshanshan
c98c286c0f
[ https://nvbugs/5814203 ][fix] Fix port 8000 being used issue in stress test. ( #10756 )
...
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2026-01-25 18:12:21 +08:00
Ivy Zhang
bcd2dc490c
[None][test] Update case for release ( #10811 )
...
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2026-01-25 18:12:21 +08:00
Emma Qiao
44aa6c3b8e
[None][infra] Waive failed cases for release branch on 01/20 ( #10828 )
...
Signed-off-by: qqiao <qqiao@nvidia.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2026-01-25 18:12:21 +08:00
Patrice Castonguay
8959c41d8b
[ https://nvbugs/5748664 ][fix] Increasing disagg acc test timeout ( #10764 )
...
Signed-off-by: Patrice Castonguay <55748270+pcastonguay@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2026-01-25 18:12:21 +08:00
Ivy Zhang
4ebc1b1596
[None][test] Update test case for release ( #10763 )
...
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2026-01-25 18:12:21 +08:00
ruodil
4df0ca8bd1
[None][test] modify ctx config in 128k8k disagg cases ( #10779 )
...
Signed-off-by: Ruodi Lu <ruodil@users.noreply.github.com>
Co-authored-by: Ruodi Lu <ruodil@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2026-01-25 18:12:21 +08:00
Emma Qiao
af49fbdf65
[None][infra] Waive failed case for release branch on 01/19 ( #10795 )
...
Signed-off-by: qqiao <qqiao@nvidia.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2026-01-25 18:12:21 +08:00
Emma Qiao
4b833492fb
[None][infra] Waive failed cases for release on 10/18 ( #10781 )
...
Signed-off-by: qqiao <qqiao@nvidia.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2026-01-25 18:12:21 +08:00
Yao Yao
6f07fa81d7
[TRTLLM-7738][feat] Adding implementation of KVCacheManagerV2 ( #10736 )
...
Signed-off-by: Yao Yao <lowsfer@users.noreply.github.com>
KVCacheManagerV2 is a new python-based implementation of the KV cache manager, featuring cleaner API, better abstraction and better code quality without the accumulated legacy.
2026-01-24 04:48:39 -05:00
Yanchao Lu
78a008d61a
[None][ci] Remove long-running sanity check tests on GH200 ( #10924 ) ( #10969 )
...
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
2026-01-24 13:06:28 +08:00
Kaiyu Xie
da967d0bd7
[TRTLLM-10334] [feat] Support overlap scheduler for disagg ctx instances ( #10755 )
...
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
2026-01-23 22:29:37 -05:00
Taylor Yeonbok Lee
1fbbb1f3cd
[None][feat] AutoDeploy: Enhance memory consumption for MoE fusion transform ( #10772 )
...
Signed-off-by: Taylor Yeonbok Lee <249374542+taylor-yb-lee@users.noreply.github.com>
2026-01-23 15:22:54 -08:00
Jin Li
b560598c79
[ https://nvbugs/5707359 ][fix] Unwaive the test that due to flashinfer… ( #10570 )
...
Signed-off-by: Jin Li <59594262+liji-nv@users.noreply.github.com>
2026-01-23 13:09:04 -05:00
yuanjingx87
f4b52d3b78
[None][infra] Regenerate out dated lock file ( #10940 )
...
Signed-off-by: Yuanjing Xue <197832395+yuanjingx87@users.noreply.github.com>
2026-01-23 09:21:03 -08:00
Yihan Wang
1d68fab49c
[ https://nvbugs/5814215 ][fix] Unwaive test_trtllm_flashinfer_symbol_collision.py::test_flashinfer_fused_moe_matches_torch_moe ( #10930 )
...
Signed-off-by: Yihan Wang <yihwang@nvidia.com>
2026-01-24 01:09:18 +08:00
Yihan Wang
43f2b51e94
[ https://nvbugs/5833795 ][chore] Waive test test_e2e.py::test_ptp_quickstart_advanced[GPT-OSS-120B-gpt_oss/gpt-oss-120b] ( #10953 )
...
Signed-off-by: Yihan Wang <yihwang@nvidia.com>
2026-01-23 06:04:57 -05:00
Emma Qiao
ae114ec7cf
[None][infra] Waive a failed case in pre-merge stage ( #10948 )
...
Signed-off-by: qqiao <qqiao@nvidia.com>
2026-01-23 04:40:17 -05:00
Stanley Sun
0f7192c7fe
[None][test] Remove unused test list ( #10916 )
...
Signed-off-by: Stanley Sun <stsun@nvidia.com>
2026-01-23 10:24:06 +08:00
Leslie Fang
31d04dfa12
[TRTLLM-9108][feat] Add test configurable moe module multi gpu ( #10699 )
...
Signed-off-by: leslie-fang25 <leslief@nvidia.com>
2026-01-23 10:16:58 +08:00
Lucas Liebenwein
d793bd973d
[ https://nvbugs/5688721 ][fix] unwaive NemotronH accuracy test ( #10852 )
...
Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>
2026-01-22 16:23:28 -05:00
William Zhang
2146c23786
[ #9306 ][refactor] Refactor AutoDeployConfig into LlmArgs ( #10613 )
...
Signed-off-by: William Zhang <133824995+2ez4bz@users.noreply.github.com>
2026-01-22 16:02:49 -05:00
Grzegorz Kwasniewski
d8e6e22060
[ https://nvbugs/5819002 ][fix] fix sharding tests ( #10775 )
...
Signed-off-by: greg-kwasniewski1 <213329731+greg-kwasniewski1@users.noreply.github.com>
2026-01-22 20:02:48 +01:00
Shi Xiaowei
944c304bbb
[TRTLLM-9527][feat] Python transceiver components (step 2) ( #10494 )
...
Signed-off-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
2026-01-22 10:14:50 -08:00
Venky
b3146d095d
[TRTC-122][feat] Eagle3 Specdec UX improvements ( #10124 )
...
Signed-off-by: Venky Ganesh <23023424+venkywonka@users.noreply.github.com>
2026-01-22 07:24:11 -08:00
Yan Chunwei
30ffa58b54
[ https://nvbugs/5783876 ][fix] fix hmac launch ( #10434 )
...
Signed-off-by: Superjomn <328693+Superjomn@users.noreply.github.com>
2026-01-22 23:20:53 +08:00
Bo Deng
a218cf02fd
[ https://nvbugs/5768068 ][chore] improve disagg acc tests ( #10833 )
...
Signed-off-by: Bo Deng <deemod@nvidia.com>
2026-01-22 09:45:35 -05:00
Pengyun Lin
5e34112b27
[TRTLLM-10388][feat] Support logprobs for Completions API ( #10809 )
...
Signed-off-by: Pengyun Lin <81065165+LinPoly@users.noreply.github.com>
2026-01-22 21:25:24 +08:00
Jiayu Chang
1dc49b266e
[ https://nvbugs/5322131 ][feat] Multi-LoRA serving with CUDA Graph ( #8279 )
...
Signed-off-by: Jiayu Chang <jiayuc@nvidia.com>
2026-01-22 14:01:18 +01:00
Yihan Wang
cdb9ffd0ab
[ https://nvbugs/5741304 ][chore] Update flashinfer-python to 0.6.1 ( #10872 )
...
Signed-off-by: Yihan Wang
2026-01-22 19:29:16 +08:00
tcherckez-nvidia
128d4ac5be
[None][chore] NVFP4 MoE - Move weights transformation to fusion phase… ( #10803 )
...
Signed-off-by: Tal Cherckez <tcherckez@nvl72070-T11.cm.cluster>
Signed-off-by: Tal Cherckez <tcherckez@nvl72039-T03.cm.cluster>
Signed-off-by: Tal Cherckez <tcherckez@nvl72098-T11.cm.cluster>
Signed-off-by: tcherckez-nvidia <127761168+tcherckez-nvidia@users.noreply.github.com>
Co-authored-by: Tal Cherckez <tcherckez@nvl72070-T11.cm.cluster>
Co-authored-by: Tal Cherckez <tcherckez@nvl72039-T03.cm.cluster>
Co-authored-by: Tal Cherckez <tcherckez@nvl72098-T11.cm.cluster>
2026-01-22 13:08:05 +02:00