Lizhi Zhou
|
fdf29ab8fa
|
[TRTLLM-7846][feat] Http disagg-cluster management implemention (#7869)
Signed-off-by: Lizhi Zhou <1432185+reasonsolo@users.noreply.github.com>
|
2025-10-09 09:44:01 +08:00 |
|
QI JUN
|
6884d06aed
|
[None][ci] move some llama4 test cases to pre merge (#8189)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
|
2025-10-08 18:34:08 -07:00 |
|
Liao Lanyu
|
ed8e00ad4a
|
[https://nvbugs/5522746][fix] unwaive tests caused by node issues after rebooting (#8193)
Signed-off-by: Lanyu Liao <lancelly@users.noreply.github.com>
Co-authored-by: Lanyu Liao <lancelly@users.noreply.github.com>
|
2025-10-09 08:45:56 +08:00 |
|
Mike Iovine
|
c88913dc03
|
[https://nvbugs/5541545][fix] Remove test_llama4 (#8031)
Signed-off-by: Mike Iovine <6158008+mikeiovine@users.noreply.github.com>
|
2025-10-08 15:20:15 -07:00 |
|
brb-nv
|
80517b7812
|
[None][chore] Waive some tests failing on main post merge (#8186)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
|
2025-10-08 06:52:30 -07:00 |
|
mpikulski
|
8298e93bd8
|
[TRTLLM-8414][chore] BREAKING CHANGE: refine sampling strategy selection (#8132)
Signed-off-by: ixlmar <206748156+ixlmar@users.noreply.github.com>
|
2025-10-08 15:46:50 +02:00 |
|
Liao Lanyu
|
d57b8f0951
|
[https://nvbugs/5455140][fix] unwaive tests related to GB200 OOM (#8159)
Signed-off-by: Lanyu Liao <lancelly@users.noreply.github.com>
Co-authored-by: Lanyu Liao <lancelly@users.noreply.github.com>
|
2025-10-08 13:14:12 +08:00 |
|
Emma Qiao
|
ca9da1f1c2
|
[None][infra] Skip failed cases for main (#8176)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-10-07 06:37:51 -07:00 |
|
xiweny
|
9298f1bdcc
|
[None] [test] Add B300 cases to CI (#8056)
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
|
2025-10-06 19:23:31 -07:00 |
|
Faraz
|
27a5091fcb
|
[None][feat] GPT-OSS Sm120/Sm121 Support (#7937)
Signed-off-by: Perkz Zheng <67892460+PerkzZheng@users.noreply.github.com>
Signed-off-by: list <58580514+farazkh80@users.noreply.github.com>
Signed-off-by: Vincent Huang <vincenth@nvidia.com>
Co-authored-by: Perkz Zheng <67892460+PerkzZheng@users.noreply.github.com>
Co-authored-by: Vincent Huang <vincenth@nvidia.com>
|
2025-10-06 16:59:06 -04:00 |
|
Lucas Liebenwein
|
3492391feb
|
[None][chore] AutoDeploy: clean up accuracy test configs (#8134)
Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>
|
2025-10-06 12:51:01 -07:00 |
|
Yan Chunwei
|
fb51de6c2e
|
[TRTLLM-8189][chore] enhance GenerationExecutor with RPC (part1) (#5543)
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
Signed-off-by: chunweiy <chunweiy@nvidia.com>
Signed-off-by: Superjomn <328693+Superjomn@users.noreply.github.com>
Signed-off-by: chunweiy <328693+Superjomn@users.noreply.github.com>
|
2025-10-05 17:28:20 +08:00 |
|
Jonas Yang CN
|
88ea2c4ee9
|
[TRTLLM-7349][feat] Adding new orchestrator type -- ray (#7520)
Signed-off-by: Erin Ho <14718778+hchings@users.noreply.github.com>
Co-authored-by: Yuan Tong <13075180+tongyuantongyu@users.noreply.github.com>
Co-authored-by: Erin Ho <14718778+hchings@users.noreply.github.com>
|
2025-10-04 08:12:24 +08:00 |
|
Lucas Liebenwein
|
2c454e8003
|
[None][feat] AutoDeploy: Nemotron-H accuracy test (#8133)
Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>
|
2025-10-03 15:39:03 -07:00 |
|
Michal Guzek
|
38da871db3
|
[TRTLLM-6496][feat] Add LoRa Torch tests for the latest NIM model list (#6806)
Signed-off-by: Michal Guzek <mguzek@nvidia.com>
|
2025-10-03 12:10:48 -07:00 |
|
Mike Iovine
|
ca8291133a
|
[None][fix] Fix MTP 2-model (#8115)
Signed-off-by: Mike Iovine <6158008+mikeiovine@users.noreply.github.com>
Signed-off-by: Mike Iovine <miovine@nvidia.com>
|
2025-10-03 10:13:50 -07:00 |
|
Patrice Castonguay
|
b77f19f4ff
|
[https://nvbugs/5434320][fix] fix: Unwaiving disagg pp tests (#8069)
Signed-off-by: Patrice Castonguay <55748270+pcastonguay@users.noreply.github.com>
|
2025-10-01 00:33:59 -04:00 |
|
Emma Qiao
|
b1e3fef8aa
|
[None][infra] Skip failed tests in post-merge for main (#8102)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-10-01 10:12:10 +08:00 |
|
brb-nv
|
84aa3c981e
|
[None][chore] Waive failing MNNVL alltoall multi-gpu test (#8106)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
|
2025-09-30 20:05:42 -04:00 |
|
xinhe-nv
|
1dba9fa89e
|
[TRTLLM-6239][feat] add test cases into QA test list (#8081)
Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
|
2025-09-30 00:23:45 -04:00 |
|
Kaiyu Xie
|
b0cb9ca50e
|
[None] [test] Add MNNVL AlltoAll tests to pre-merge (#7466)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
Signed-off-by: Zongfei Jing <20381269+zongfeijing@users.noreply.github.com>
Co-authored-by: Zongfei Jing <20381269+zongfeijing@users.noreply.github.com>
|
2025-09-29 23:12:24 -04:00 |
|
Cheng Hang
|
cdce68c3e0
|
[TRTLLM-6741][fix] Add heuristics for lm head tp size when enable_lm_head_tp_in_adp=True (#7891)
Signed-off-by: Cheng Hang <chang@nvidia.com>
Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>
|
2025-09-30 09:24:35 +08:00 |
|
xiweny
|
48e779ae8c
|
[https://nvbugs/5541494] [fix] add back missing sm100f bmm kernels (#8051)
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
|
2025-09-29 05:35:44 -04:00 |
|
xinhe-nv
|
20e6cd39f1
|
[None][chore] Add failed cases into waives.txt (#8043)
Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
|
2025-09-29 03:37:39 -04:00 |
|
Emma Qiao
|
ce381d6813
|
[None][infra] Waive failed cases for main on 0929 (#8053)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-09-29 02:46:02 -04:00 |
|
HuiGao-NV
|
7ac932d45e
|
[https://nvbugs/5532087][CI] Enable test case (#8029)
Signed-off-by: Hui Gao <huig@nvidia.com>
|
2025-09-29 01:46:28 -04:00 |
|
Eran Geva
|
9cea6bfb30
|
[#7288][feat] Added AutoDeploy backend support to test_perf.py (#7588)
Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>
|
2025-09-28 21:21:27 -07:00 |
|
Emma Qiao
|
2be05cbd6e
|
[None][infra] Skip failed test for main branch on 9/28 (#8040)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-09-28 07:00:55 -04:00 |
|
ChristinaZ
|
95eac2cda7
|
[https://nvbugs/5537738][fix] Add fp8 post-quant allgather support (#8008)
Signed-off-by: Christina Zhang <83400082+ChristinaZ@users.noreply.github.com>
|
2025-09-28 15:32:45 +08:00 |
|
Iman Tabrizian
|
33282351a2
|
[TRTLLM-6106][feat] Add support for KVCache transfer from KVCache reuse path (#6348)
Signed-off-by: Iman Tabrizian <10105175+tabrizian@users.noreply.github.com>
|
2025-09-27 19:29:30 -04:00 |
|
Emma Qiao
|
c8bef27ebb
|
[None][infra] Waive failed cases in post-merge 2305 (#8019)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-09-26 10:20:12 -07:00 |
|
xinhe-nv
|
ba6ab62bd1
|
[None][chore] Add failed cases into waives.txt (#8004)
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
|
2025-09-26 00:41:02 -07:00 |
|
xinhe-nv
|
f32f5730b2
|
[None][chore] Add failed cases into waives.txt (#7986)
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
|
2025-09-25 23:50:09 -07:00 |
|
Lucas Liebenwein
|
3a96d75a3c
|
[https://nvbugs/5527956][fix] AutoDeploy: fix IMA due to outdated metadata (#8002)
Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>
|
2025-09-25 22:05:55 -07:00 |
|
Yiqing Yan
|
108248ece1
|
[TRTLLM-7999][infra] Add B300/GB300 single gpu test (#7951)
Signed-off-by: Yiqing Yan <yiqingy@nvidia.com>
|
2025-09-26 09:59:11 +08:00 |
|
Emma Qiao
|
2dc93c6371
|
[None][infra] Waive failed tests on main (#8001)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-09-25 08:13:39 -07:00 |
|
Yan Chunwei
|
5342c607cd
|
[https://nvbugs/5516710][fix] fix Llama 3.3 TP PP case (#7717)
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
|
2025-09-25 21:02:35 +08:00 |
|
xinhe-nv
|
e30d9aced9
|
[https://nvbugs/4955671][fix] update test list (#7980)
Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
|
2025-09-25 02:58:09 -07:00 |
|
Emma Qiao
|
cb53261aaf
|
[None][infra] Unwaive some tests since dev already have a PR to collect more info (#7984)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-09-25 01:03:13 -07:00 |
|
fredricz-20070104
|
0945403174
|
[TRTLLM-6541][test] Add NIM perf test cases (#7924)
Signed-off-by: FredricZ-2007 <226039983+fredricz-20070104@users.noreply.github.com>
|
2025-09-25 13:15:26 +08:00 |
|
Iman Tabrizian
|
be7e51727e
|
[https://nvbugs/5456485][bug] unwaive triton test (#7966)
Signed-off-by: Iman Tabrizian <10105175+tabrizian@users.noreply.github.com>
|
2025-09-24 17:02:55 -07:00 |
|
Pamela Peng
|
b1dc84b4a3
|
[TRTLLM-7399][test] Add DS-R1/Qwen3 test cases for RTX 6000 (#7662)
Signed-off-by: Pamela <179191831+pamelap-nvidia@users.noreply.github.com>
Signed-off-by: Pamela Peng <179191831+pamelap-nvidia@users.noreply.github.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
|
2025-09-24 11:40:26 -04:00 |
|
HuiGao-NV
|
c8bda4b3a9
|
[None][ci] Waive some intermittent failures (#7955)
Signed-off-by: Hui Gao <huig@nvidia.com>
|
2025-09-24 19:00:38 +08:00 |
|
Enwei Zhu
|
a1a57e83b8
|
[TRTLLM-5235][feat] Enable regex and EBNF grammar in trtllm-serve (#7925)
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
|
2025-09-24 18:30:23 +08:00 |
|
xinhe-nv
|
b8bfa63197
|
[None][chore] add test_w4_1gpu[True-True-cutlass-fp8] & TestKimiK2::test_fp8_blocks… (#7944)
Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
|
2025-09-24 03:25:17 -07:00 |
|
QI JUN
|
18ff1e31b8
|
[None][ci] remove duplicate test cases (#7956)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
|
2025-09-24 17:47:22 +08:00 |
|
yufeiwu-nv
|
f323b74d42
|
[None][test] Update llm_models_root to improve path handling on BareMetal environment (#7876)
Signed-off-by: yufeiwu <230315618+yufeiwu-nv@users.noreply.github.com>
Signed-off-by: yufeiwu-nv <230315618+yufeiwu-nv@users.noreply.github.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
Co-authored-by: ruodil <200874449+ruodil@users.noreply.github.com>
|
2025-09-24 17:35:57 +08:00 |
|
HuiGao-NV
|
29e63d3bc2
|
[https://nvbugs/5532248][fix] Fix fused_moe OOM (#7931)
Signed-off-by: Hui Gao <huig@nvidia.com>
|
2025-09-24 02:22:38 -07:00 |
|
QI JUN
|
946ffcd2eb
|
[None][ci] optimize test cases of dgx b200 (#7948)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
|
2025-09-24 00:39:45 -07:00 |
|
Pengbo Wang
|
b890d7fea4
|
[None][infra] Skip failed test for nvbugs 5537738 (#7946)
Signed-off-by: Pengbo Wang <221450789+pengbowang-nv@users.noreply.github.com>
|
2025-09-23 23:48:50 -07:00 |
|