Emma Qiao
421d48f402
[None][infra] Skip failed tests for release branch ( #8833 )
...
Signed-off-by: qqiao <qqiao@nvidia.com>
2025-10-31 15:04:54 +08:00
Yukun He
a1d912688c
[ https://nvbugs/5623960 ][fix] Compress the warning log of AutoTuner when encountering tactic failures. ( #8795 )
...
Signed-off-by: Yukun He <23156053+hyukn@users.noreply.github.com>
2025-10-31 12:55:56 +08:00
Jin Li
28673f3e9c
[ https://nvbugs/5488118 ][fix] Unwaive passed tests ( #8758 )
...
Signed-off-by: Jin Li <59594262+liji-nv@users.noreply.github.com>
2025-10-31 10:46:44 +08:00
Emma Qiao
9ee0075921
[TRTLLM-8971][infra] Cherry-pick for Update gpu key for B300/GB300 ( #8724 ) ( #8796 )
...
Signed-off-by: qqiao <qqiao@nvidia.com>
2025-10-30 06:12:16 -07:00
Dom Brown
9410ce3bea
[ https://nvbugs/5575841 ] [test] Move test_moe.py to serial tests to improve stability + unwaive FP4 MoE torch unit tests ( #8422 )
...
Signed-off-by: Dom Brown <3886319+DomBrown@users.noreply.github.com>
2025-10-30 13:57:56 +01:00
Jin Li
0dac57f2bc
[ https://nvbugs/5569534 ][fix] Warm up with different sizes for more s… ( #8515 )
...
Signed-off-by: Jin Li <59594262+liji-nv@users.noreply.github.com>
2025-10-29 22:29:06 -07:00
Enwei Zhu
c1bac95382
[ https://nvbugs/5422621 ][fix] fix EPLB init hang (cherry-pick #8649 ) ( #8727 )
...
Signed-off-by: Dongxu Yang <78518666+dongxuy04@users.noreply.github.com>
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
Co-authored-by: dongxuy04 <78518666+dongxuy04@users.noreply.github.com>
2025-10-30 10:31:34 +08:00
Emma Qiao
ec510ad72a
[None][infra] Waive failed tests for release branch ( #8760 )
...
Signed-off-by: qqiao <qqiao@nvidia.com>
2025-10-29 06:32:30 -07:00
JunyiXu-nv
6adccd758d
[ https://nvbugs/5606268 ][fix] Separate cuda graph workspace to prevent IMA ( #8685 )
...
Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>
2025-10-29 09:43:30 +01:00
sunnyqgg
e9aa8b222f
[ https://nvbugs/5556020 ][fix] cherry-pick fix test_disaggregated_serving.py::TestLlama3_1_8BInstruct::test_eagle3 dimension mismatch ( #8644 )
...
Signed-off-by: qgai <qgai@nvidia.com>
2025-10-29 15:44:25 +08:00
Zhanrui Sun
beafc39764
[None][fix] add readme copy to wheel stage to avoid setup.py failure (cherry-pick #8736 ) ( #8754 )
...
Signed-off-by: Faraz Khoubsirat <58580514+farazkh80@users.noreply.github.com>
Co-authored-by: Faraz <58580514+farazkh80@users.noreply.github.com>
2025-10-29 00:27:37 -07:00
xiweny
f49f42db59
[ https://nvbugs/5601203 ] [fix]Restrict fp8 blockscale moe case ( #8583 )
...
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
2025-10-29 10:47:32 +08:00
Chuang Zhu
b326be25e7
[ https://nvbugs/5578175 ][fix] Fix block range index ( #8470 )
...
Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>
2025-10-28 11:42:23 -07:00
Pengyun Lin
b334102544
[ https://nvbugs/5564465 ][fix] Overwrite only if default_max_tokens is legal ( #8538 )
...
Signed-off-by: Pengyun Lin <81065165+LinPoly@users.noreply.github.com>
2025-10-28 10:15:26 +01:00
Eran Geva
db3c373d3a
[ https://nvbugs/5572320 ][fix] Ported test_ad_trtllm_bench.py from main ( #8671 )
...
Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>
2025-10-28 09:41:32 +02:00
Yukun He
e04354bc09
[ https://nvbugs/5608489 ][fix] Fix output unpack issues for Llama3/4 NVFP4 models. ( #8679 )
...
Signed-off-by: Yukun He <23156053+hyukn@users.noreply.github.com>
2025-10-28 14:21:47 +08:00
Shiyu Li
28c9a51c06
[ https://nvbugs/5597647 ][fix] Fix MNNVL Allreduce accuracy issue on Hopper ( #8612 )
...
Signed-off-by: Shiyu Li <shili@nvidia.com>
2025-10-26 23:06:45 -07:00
Yanchao Lu
389cbd7611
[None][docs] Update Python wheel's short-/long-descriptions ( #8485 )
...
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
2025-10-27 08:36:29 +08:00
Emma Qiao
b05555faeb
[None][infra] Waive failed tests for release 10/24 ( #8656 )
...
Signed-off-by: qqiao <qqiao@nvidia.com>
Signed-off-by: Emma Qiao <qqiao@nvidia.com>
2025-10-24 21:53:35 +08:00
Ivy Zhang
1859b55d22
[None][test] Clean cache for certain easily hang cases ( #8619 )
...
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
Co-authored-by: Larry Xu <197874197+LarryXFly@users.noreply.github.com>
2025-10-24 08:17:32 -04:00
Jie Li
7749ec406b
[ https://nvbugs/5587456 ][fix] Remove multimodal test cases using TRT backend ( #8611 )
...
Signed-off-by: Jie Li <lijie@nvidia.com>
2025-10-24 18:04:43 +08:00
Yiqing Yan
25ec125726
[None][chore] Disable GB300 stages in release branch due to nodes will be offline temporarily ( #8645 )
...
Signed-off-by: Yiqing Yan <yiqingy@nvidia.com>
2025-10-24 05:21:14 -04:00
Jie Li
4b52054bdd
[ https://nvbugs/5541145 ][fix] Remove DeepSeekR1 test case from H20 to prevent OOM ( #8610 )
...
Signed-off-by: Jie Li <lijie@nvidia.com>
2025-10-24 05:20:40 -04:00
Leslie Fang
d9d898e8b7
[ https://nvbugs/5608461 ][fix] exclude InductorSubproc from thread leak check ( #8624 )
...
Signed-off-by: leslie-fang25 <leslief@nvidia.com>
2025-10-24 13:08:42 +08:00
Zheyu Fu
d2c976ceac
[ https://nvbugs/5576192 ][fix] Unwaive the test for test_weight_only_quant_gemm. ( #8546 )
...
Signed-off-by: Zheyu Fu <zheyuf@NVIDIA.com>
2025-10-23 15:46:09 -07:00
Lizhi Zhou
686298d2d5
[ https://nvbugs/5575902 ][fix] set max_batch_size=1 to stabilize accuracy test result ( #8609 )
...
Signed-off-by: Lizhi Zhou <1432185+reasonsolo@users.noreply.github.com>
2025-10-23 07:28:29 -07:00
Emma Qiao
4e11e0bd20
[None][infra] Disable rtxpro6000 stages due to nodes will be offline temporarily ( #8616 )
...
Signed-off-by: qqiao <qqiao@nvidia.com>
2025-10-23 10:21:21 -04:00
Ivy Zhang
5d27034295
[TRTLLM-8785][fix] create output_dir before test begin (cherry-pick #8518 ) ( #8575 )
...
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
2025-10-23 04:41:54 -04:00
Chang Liu
f4e1cc7b39
[ https://nvbugs/5549081 ][fix] Fix device id assignment for some visio… ( #8552 )
...
Signed-off-by: Chang Liu (Enterprise Products) <9713593+chang-l@users.noreply.github.com>
Signed-off-by: Chang Liu <9713593+chang-l@users.noreply.github.com>
2025-10-23 14:06:13 +08:00
Chang Liu
e5b6d335eb
[ https://nvbugs/5568961 ][fix] Fix a merge conflict (cherrypick from PR 8365) ( #8553 )
...
Signed-off-by: Jin Li <59594262+liji-nv@users.noreply.github.com>
Co-authored-by: Jin Li <59594262+liji-nv@users.noreply.github.com>
2025-10-23 14:05:16 +08:00
Lizhi Zhou
3f82cdbdad
[ https://nvbugs/5582277 ][fix] rework DisaggPPTerminationHandler to fix hang issue ( #8519 )
...
Signed-off-by: Lizhi Zhou <1432185+reasonsolo@users.noreply.github.com>
2025-10-23 09:43:59 +08:00
Yan Chunwei
0d929f8dc7
[ https://nvbugs/5569754 ][fix] trtllm-llmapi-launch port conflict ( #8582 )
...
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
2025-10-23 09:14:35 +08:00
Kaiyu Xie
c7b06b1b0a
[ https://nvbugs/5488576 ][fix] Propagate disable_finalize_fusion config flag in WIDEEP MoE backend (cherry-pick #8141 ) ( #8566 )
...
Signed-off-by: Sergey Klevtsov <sklevtsov@nvidia.com>
Co-authored-by: Sergey Klevtsov <141879860+sklevtsov-nvidia@users.noreply.github.com>
2025-10-22 21:46:59 +08:00
Pengyun Lin
e86d6db9ec
[ https://nvbugs/5575829 ][fix] Unwaive gpt-oss test ( #8576 )
...
Signed-off-by: Pengyun Lin <81065165+LinPoly@users.noreply.github.com>
2025-10-22 07:31:56 -04:00
Emma Qiao
09349ccbfe
[None][infra] Waive failed tests for release 10/22 ( #8574 )
...
Signed-off-by: qqiao <qqiao@nvidia.com>
2025-10-22 04:41:00 -04:00
Bo Deng
9e30f14da8
[ https://nvbugs/5565549 ][fix] unwaive test_disaggregated_spec_dec_bat… ( #8500 )
...
Signed-off-by: Bo Deng <deemod@nvidia.com>
2025-10-22 14:59:59 +08:00
Jin Li
6631791c60
[ https://nvbugs/5546510 ][fix] Move torch.cuda.Stream out of torch com… ( #8494 )
...
Signed-off-by: Jin Li <59594262+liji-nv@users.noreply.github.com>
2025-10-22 11:21:58 +08:00
Guoming Zhang
a519c2c43c
[ https://nvbugs/5504095 ][fix] Unwaive test_user_specify_workspace case. ( #8316 )
...
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
2025-10-22 09:31:24 +08:00
Simeng Liu
1375b9f074
[ https://nvbugs/5515753 ][ci] Add NCCL_DEBUG=INFO flag to collect more info with CI failure. ( #8440 )
...
Signed-off-by: Simeng Liu <simengl@nvidia.com>
2025-10-21 18:12:05 -07:00
JunyiXu-nv
0acdecb2c3
[ https://nvbugs/5569713 ][fix] Disable fp8 deep gemm for EXAONE-4.0-32B-FP8 ( #8429 )
...
Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>
2025-10-21 12:37:56 -04:00
mpikulski
f256eb9063
[TRTLLM-8650][fix] beam search request validation ( #8433 )
...
Signed-off-by: ixlmar <206748156+ixlmar@users.noreply.github.com>
2025-10-21 10:50:27 +02:00
Emma Qiao
2b0a10e4d5
[None][infra] Waive tests for release 1021 ( #8522 )
...
Signed-off-by: qqiao <qqiao@nvidia.com>
2025-10-21 03:21:00 -04:00
Yuxian Qiu
4faa5150ab
[ https://nvbugs/5569081 ][fix] Upgrade fmha_v2. (cherry-pick from https://github.com/NVIDIA/TensorRT-LLM/pull/8364 ) ( #8499 )
...
Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
2025-10-21 12:32:13 +08:00
Pengbo Wang
8ce2dc5cb7
[ https://nvbugs/5501820 ][fix] Add requirements for numba-cuda version to WAR mem corruption ( #7992 ) ( #8414 )
...
Signed-off-by: Pengbo Wang <221450789+pengbowang-nv@users.noreply.github.com>
2025-10-20 09:01:08 +02:00
bhsueh_NV
14d0f5d683
[ https://nvbugs/5516666 ][fix] cherry-pick PR 8130 to unwaive the Qwen3 CI ( #8444 )
...
Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com>
2025-10-19 23:14:10 -04:00
Ivy Zhang
f904348cd6
[TRTLLM-8580][test] save runtime report periodically ( #8312 ) ( #8455 )
...
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
2025-10-20 10:54:24 +08:00
danielafrimi
a0b7fe9e36
[ https://nvbugs/5524714 ][fix] Fix TP sharding of fused-QKV weight scales in W4A16 AWQ ( #8432 )
...
Signed-off-by: Daniel Afrimi <dafrimi@nvidia.com>
2025-10-19 15:27:23 +03:00
xiweny
af2450c266
[ https://nvbugs/5565565 ] [fix] Remove waiver ( #8450 )
...
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
2025-10-17 01:13:01 -07:00
Yukun He
437a3fc642
[None][chore] Remove duplicate log outputs in test_perf.py ( #8418 )
...
Signed-off-by: Yukun He <23156053+hyukn@users.noreply.github.com>
2025-10-17 14:11:32 +08:00
Yan Chunwei
995b93bc38
[ https://nvbugs/5437384 ][test] fix trtllm-llmapi-launch multi tests with single launch ( #8397 )
...
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
Signed-off-by: Superjomn <328693+Superjomn@users.noreply.github.com>
2025-10-16 21:14:43 -07:00