Commit Graph

1112 Commits

Author SHA1 Message Date
sunnyqgg
c2fe686e3e
[https://nvbugs/5608930][fix] Waive TestQwen3_8B::test_chunked_prefill for bug 5608930 (#8940)
Signed-off-by: qgai <qgai@nvidia.com>
2025-11-05 01:52:09 -08:00
Bo Deng
43843778a7
[https://nvbugs/5601682][fix] unwaive test_disaggregated_deepseek_v3_… (#8888)
Signed-off-by: Bo Deng <deemod@nvidia.com>
2025-11-05 09:33:57 +08:00
Simeng Liu
0206d8d0fc
[https://nvbugs/5606136][fix] Fix torch.onnx.export with pytorch upgrade to fallback to dynamo=False. (#8917)
Signed-off-by: Simeng Liu <simengl@nvidia.com>
2025-11-04 14:11:48 -08:00
Yan Chunwei
cacb8a84f2
[https://nvbugs/5606266][test] move qwen3 multi-node test to the qa list (#8908)
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
2025-11-04 02:12:02 -08:00
Shi Xiaowei
324f63f26a
[https://nvbugs/5451272][fix] unwaive the test (#8608)
Signed-off-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
2025-11-04 01:31:41 -08:00
xiweny
7d8a913406
[https://nvbugs/5596343] [test] Update accuracy baseline for GPT-OSS-20B (#8842)
Signed-off-by: Dongfeng Yu <dongfengy@nvidia.com>
Signed-off-by: dongfengy <99041270+dongfengy@users.noreply.github.com>
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
2025-11-04 16:04:11 +08:00
Ivy Zhang
baa6ba0d69
[None][chore] Update test list (#8835)
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
2025-11-03 21:42:01 -08:00
brb-nv
095b7a3ad5
[https://nvbugs/5521253][fix] Enable Gemma3 12B & 27B on SM100 (#8666)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
2025-11-03 14:49:36 -08:00
Emma Qiao
9f1d274a26
[None][infra] Waive failed tests for release branch on 11/03 (#8879)
Signed-off-by: qqiao <qqiao@nvidia.com>
2025-11-03 02:59:33 -08:00
sunnyqgg
ccadb66efe
[https://nvbugs/5461796][fix] Unwaive test test_llmapi_speculative_decoding_mtp (#8832)
Signed-off-by: qgai <qgai@nvidia.com>
2025-11-03 16:53:24 +08:00
sunnyqgg
d82197846d
[https://nvbugs/5608930][fix] Unwaive test 5608930 (#8831)
Signed-off-by: qgai <qgai@nvidia.com>
2025-11-03 15:09:58 +08:00
yunruis
07077fb070
[https://nvbugs/5606268][fix] Fix segmentation fault at program exit triggered by CublasMMWarpper destructor (#8834)
Signed-off-by: yunruis <205571022+yunruis@users.noreply.github.com>
2025-11-03 14:46:01 +08:00
Yan Chunwei
0d105448b1
[https://nvbugs/5606266][fix] unwaive test (#8867)
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
2025-11-02 21:43:58 -08:00
Zhanrui Sun
776bb25bfd
[TRTLLM-8658][infra] upgrade to DLFW 25.10 and pytorch 2.9.0 / triton 3.5.0 (#8621)
Signed-off-by: ZhanruiSunCh <184402041+ZhanruiSunCh@users.noreply.github.com>
Signed-off-by: Zhanrui Sun <184402041+ZhanruiSunCh@users.noreply.github.com>
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>
2025-11-03 09:24:58 +08:00
dongxuy04
d81ebb5b4d
[https://nvbugs/5444687][fix] Cherrypick online EPLB CI fix from main to release 1.1 (#8854)
Signed-off-by: Dongxu Yang <78518666+dongxuy04@users.noreply.github.com>
2025-11-03 09:17:51 +08:00
Barry Kang
f22a87f296
[https://nvbugs/5325296][fix] Enable relaxed acceptance test on Blackwell (#8709)
Signed-off-by: Barry Kang <43644113+Barry-Delaney@users.noreply.github.com>
2025-10-31 15:02:06 -07:00
Zhanrui Sun
d2071d7ed7
[None][infra] Remove invalid waived tests which are not in release branch (#8841)
Signed-off-by: ZhanruiSunCh <184402041+ZhanruiSunCh@users.noreply.github.com>
2025-10-31 03:02:34 -07:00
Emma Qiao
421d48f402
[None][infra] Skip failed tests for release branch (#8833)
Signed-off-by: qqiao <qqiao@nvidia.com>
2025-10-31 15:04:54 +08:00
Jin Li
28673f3e9c
[https://nvbugs/5488118][fix] Unwaive passed tests (#8758)
Signed-off-by: Jin Li <59594262+liji-nv@users.noreply.github.com>
2025-10-31 10:46:44 +08:00
Emma Qiao
9ee0075921
[TRTLLM-8971][infra] Cherry-pick for Update gpu key for B300/GB300 (#8724) (#8796)
Signed-off-by: qqiao <qqiao@nvidia.com>
2025-10-30 06:12:16 -07:00
Emma Qiao
ec510ad72a
[None][infra] Waive failed tests for release branch (#8760)
Signed-off-by: qqiao <qqiao@nvidia.com>
2025-10-29 06:32:30 -07:00
xiweny
f49f42db59
[https://nvbugs/5601203][fix] Restrict fp8 blockscale moe case (#8583)
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
2025-10-29 10:47:32 +08:00
Eran Geva
db3c373d3a
[https://nvbugs/5572320][fix] Ported test_ad_trtllm_bench.py from main (#8671)
Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>
2025-10-28 09:41:32 +02:00
Yukun He
e04354bc09
[https://nvbugs/5608489][fix] Fix output unpack issues for Llama3/4 NVFP4 models. (#8679)
Signed-off-by: Yukun He <23156053+hyukn@users.noreply.github.com>
2025-10-28 14:21:47 +08:00
Emma Qiao
b05555faeb
[None][infra] Waive failed tests for release 10/24 (#8656)
Signed-off-by: qqiao <qqiao@nvidia.com>
Signed-off-by: Emma Qiao <qqiao@nvidia.com>
2025-10-24 21:53:35 +08:00
Ivy Zhang
1859b55d22
[None][test] Clean cache for certain cases that hang easily (#8619)
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
Co-authored-by: Larry Xu <197874197+LarryXFly@users.noreply.github.com>
2025-10-24 08:17:32 -04:00
Jie Li
7749ec406b
[https://nvbugs/5587456][fix] Remove multimodal test cases using TRT backend (#8611)
Signed-off-by: Jie Li <lijie@nvidia.com>
2025-10-24 18:04:43 +08:00
Jie Li
4b52054bdd
[https://nvbugs/5541145][fix] Remove DeepSeekR1 test case from H20 to prevent OOM (#8610)
Signed-off-by: Jie Li <lijie@nvidia.com>
2025-10-24 05:20:40 -04:00
Lizhi Zhou
686298d2d5
[https://nvbugs/5575902][fix] set max_batch_size=1 to stabilize accuracy test result (#8609)
Signed-off-by: Lizhi Zhou <1432185+reasonsolo@users.noreply.github.com>
2025-10-23 07:28:29 -07:00
Ivy Zhang
5d27034295
[TRTLLM-8785][fix] create output_dir before test begin (cherry-pick #8518) (#8575)
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
2025-10-23 04:41:54 -04:00
Chang Liu
e5b6d335eb
[https://nvbugs/5568961][fix] Fix a merge conflict (cherrypick from PR 8365) (#8553)
Signed-off-by: Jin Li <59594262+liji-nv@users.noreply.github.com>
Co-authored-by: Jin Li <59594262+liji-nv@users.noreply.github.com>
2025-10-23 14:05:16 +08:00
Lizhi Zhou
3f82cdbdad
[https://nvbugs/5582277][fix] rework DisaggPPTerminationHandler to fix hang issue (#8519)
Signed-off-by: Lizhi Zhou <1432185+reasonsolo@users.noreply.github.com>
2025-10-23 09:43:59 +08:00
Pengyun Lin
e86d6db9ec
[https://nvbugs/5575829][fix] Unwaive gpt-oss test (#8576)
Signed-off-by: Pengyun Lin <81065165+LinPoly@users.noreply.github.com>
2025-10-22 07:31:56 -04:00
Emma Qiao
09349ccbfe
[None][infra] Waive failed tests for release 10/22 (#8574)
Signed-off-by: qqiao <qqiao@nvidia.com>
2025-10-22 04:41:00 -04:00
Bo Deng
9e30f14da8
[https://nvbugs/5565549][fix] unwaive test_disaggregated_spec_dec_bat… (#8500)
Signed-off-by: Bo Deng <deemod@nvidia.com>
2025-10-22 14:59:59 +08:00
JunyiXu-nv
0acdecb2c3
[https://nvbugs/5569713][fix] Disable fp8 deep gemm for EXAONE-4.0-32B-FP8 (#8429)
Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>
2025-10-21 12:37:56 -04:00
mpikulski
f256eb9063
[TRTLLM-8650][fix] beam search request validation (#8433)
Signed-off-by: ixlmar <206748156+ixlmar@users.noreply.github.com>
2025-10-21 10:50:27 +02:00
Emma Qiao
2b0a10e4d5
[None][infra] Waive tests for release 1021 (#8522)
Signed-off-by: qqiao <qqiao@nvidia.com>
2025-10-21 03:21:00 -04:00
bhsueh_NV
14d0f5d683
[https://nvbugs/5516666][fix] cherry-pick PR 8130 to unwaive the Qwen3 CI (#8444)
Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com>
2025-10-19 23:14:10 -04:00
Ivy Zhang
f904348cd6
[TRTLLM-8580][test] save runtime report periodically (#8312) (#8455)
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
2025-10-20 10:54:24 +08:00
Yukun He
437a3fc642
[None][chore] Remove duplicate log outputs in test_perf.py (#8418)
Signed-off-by: Yukun He <23156053+hyukn@users.noreply.github.com>
2025-10-17 14:11:32 +08:00
Yan Chunwei
995b93bc38
[https://nvbugs/5437384][test] fix trtllm-llmapi-launch multi tests with single launch (#8397)
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
Signed-off-by: Superjomn <328693+Superjomn@users.noreply.github.com>
2025-10-16 21:14:43 -07:00
ruodil
20c2de4924
[None][test] cherry-pick: add test-model-suites in integration conftest.py (#8388)
Signed-off-by: Ruodi Lu <ruodil@users.noreply.github.com>
Co-authored-by: Ruodi Lu <ruodil@users.noreply.github.com>
Co-authored-by: Larry <197874197+LarryXFly@users.noreply.github.com>
2025-10-15 23:26:32 -07:00
Patrice Castonguay
7862372ee2
[https://nvbugs/5552889][fix] fix: Prevent empty batch when using attention DP with disagg (#8372)
Signed-off-by: Patrice Castonguay <55748270+pcastonguay@users.noreply.github.com>
2025-10-16 09:11:04 +08:00
Ivy Zhang
4751bdbcb6
[None][chore] Update nim test list (#8356)
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
Co-authored-by: Larry <197874197+LarryXFly@users.noreply.github.com>
2025-10-15 02:04:20 -07:00
Emma Qiao
988f93790f
[None][infra] Waive failed tests in release post-merge 10/15 (#8386)
Signed-off-by: qqiao <qqiao@nvidia.com>
2025-10-15 16:06:08 +08:00
Stanley Sun
cce97e6e15
[TRTLLM-8113][test] Add pytorch workflow e2e tests with pp enabled (#8357)
Signed-off-by: Stanley Sun <stsun@nvidia.com>
2025-10-15 15:09:21 +08:00
xiweny
d5b79268e7
[https://nvbugs/5565565] [fix] fp8 wideep support sm103 (#8228)
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
2025-10-15 10:17:08 +08:00
Yiqing Yan
7b5ba7ca66
[https://nvbugs/5565541][fix] Add timeout threshold for H100 FHMA test (#8354)
Signed-off-by: Yiqing Yan <yiqingy@nvidia.com>
2025-10-14 01:23:08 -07:00
bhsueh_NV
66aa88739b
[https://nvbugs/5574556][fix] fix bug of Qwen3_235B_A22B::test_fp8 CI (#8351)
Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com>
2025-10-14 15:26:15 +08:00