Emma Qiao
|
35c24424f6
|
[None][infra] Waive failed cases in post-merge on 01/14 (#10668)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2026-01-14 21:39:32 +08:00 |
|
HuiGao-NV
|
b10704428d
|
[https://nvbugs/5787566][fix] Only keep a limited number of performance statistic data (#10569)
Signed-off-by: Hui Gao <huig@nvidia.com>
|
2026-01-14 07:53:01 -05:00 |
|
Bo Li
|
582dec5bb5
|
[https://nvbugs/5774869][infra] Use 2 GPUs to test skip softmax attention on H100. (#10420)
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
|
2026-01-14 07:03:01 -05:00 |
|
shuyixiong
|
babd5ecacc
|
[https://nvbugs/5760740][fix] Enable ray tests (#10272)
Signed-off-by: shuyix <219646547+shuyixiong@users.noreply.github.com>
|
2026-01-14 19:25:46 +08:00 |
|
xinhe-nv
|
272688c663
|
[None][fix] fix L0 issues (#10670)
Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
|
2026-01-14 18:09:40 +08:00 |
|
jmydurant
|
e7882d5c74
|
[None][feat] MiniMax M2 support (#10532)
Signed-off-by: Mingyang Jiang <13463932+jmydurant@users.noreply.github.com>
|
2026-01-14 17:38:58 +08:00 |
|
mpikulski
|
052c36ddd2
|
[TRTLLM-9522][feat] support image_embeds in OpenAI API (#9715)
Signed-off-by: ixlmar <206748156+ixlmar@users.noreply.github.com>
|
2026-01-14 10:31:03 +01:00 |
|
Bo Li
|
487287a412
|
[None][chore] Update test name MNNVL->NVLinkTwoSided. (#9672)
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
|
2026-01-14 04:29:57 -05:00 |
|
QI JUN
|
c4da4fd462
|
[https://nvbugs/5637220][ci] unwaive TestQwen3_235B_A22B::test_nvfp4[latency_moe_trtllm_attention_dp] (#9870)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
Signed-off-by: QI JUN <22017000+QiJune@users.noreply.github.com>
|
2026-01-14 15:41:14 +08:00 |
|
Yuxian Qiu
|
39cefd6125
|
[None][refactor] Unify the usage of MPIDist and TorchDist. (#10380)
Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
|
2026-01-14 14:05:47 +08:00 |
|
xxi
|
f841b43cde
|
[None][chore] waive the CI failure (#10655)
Signed-off-by: xxi <xxi@nvidia.com>
|
2026-01-14 13:59:15 +08:00 |
|
JennyLiu
|
92ae490410
|
[None][test] Spark - Change testlist name and perf yml format (#10626)
Signed-off-by: Jenny Liu <JennyLiu-nv+JennyLiu@users.noreply.github.com>
Co-authored-by: Jenny Liu <JennyLiu-nv+JennyLiu@users.noreply.github.com>
|
2026-01-13 23:07:11 -05:00 |
|
xinhe-nv
|
07d9390e9b
|
[None][test] add test into qa test list (#10627)
Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
|
2026-01-13 22:43:00 -05:00 |
|
xinhe-nv
|
7305c61fc9
|
[TRTLLM-8638][fix] Add failed cases into waives.txt (#10589)
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
|
2026-01-13 22:00:20 -05:00 |
|
Leslie Fang
|
bc119f5644
|
[None][chore] Add test configurable moe module (#10575)
Signed-off-by: leslie-fang25 <leslief@nvidia.com>
|
2026-01-14 07:25:57 +08:00 |
|
Balaram Buddharaju
|
ccdfa43a6e
|
[https://nvbugs/5791900][fix] Fix HelixCpMnnvlMemory init with PP (#10533)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
|
2026-01-13 15:48:42 -05:00 |
|
Frida Hou
|
bf16fbd86c
|
[#9283][feat] AutoDeploy: separate rms pattern detection from fusion (#9969)
Signed-off-by: Fridah-nv <201670829+Fridah-nv@users.noreply.github.com>
|
2026-01-13 14:57:27 -05:00 |
|
dongfengy
|
6ee8dbfe0b
|
[https://nvbugs/5772396][fix] WAR: Disable TinyGEMM PDL due to accuracy issues (#10619)
Signed-off-by: Dongfeng Yu <dongfengy@nvidia.com>
|
2026-01-13 12:40:11 -05:00 |
|
benzh-2025
|
6df2c8a074
|
[None][feat] add fp4 gemm + allreduce (#9729)
Signed-off-by: benzh
Signed-off-by: benzh-2025
|
2026-01-13 21:11:13 +08:00 |
|
Guoming Zhang
|
c1b0b7350f
|
[None][test] Unwaive qwen3 next test case. (#9877)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
|
2026-01-13 20:42:31 +08:00 |
|
Tailing Yuan
|
38296a472b
|
[None][feat] Layer-wise benchmarks: make model init more general and support weights loading (#10562)
Signed-off-by: Tailing Yuan <yuantailing@gmail.com>
|
2026-01-13 19:17:03 +08:00 |
|
Erin
|
55580f8ec1
|
[NVBUG-5670458][chore] Unwaive lp tests (#10524)
Signed-off-by: Erin Ho <14718778+hchings@users.noreply.github.com>
Signed-off-by: Erin <14718778+hchings@users.noreply.github.com>
|
2026-01-13 04:31:27 -05:00 |
|
Guoming Zhang
|
bdaee87895
|
[TRTLLM-10060][feat] Enable attention dp for Nemotron Super v3. (#10347)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
|
2026-01-13 17:13:55 +08:00 |
|
JunyiXu-nv
|
e291a834db
|
[TRTLLM-8462][feat] Support GET/DELETE v1/responses/{response_id} (#9937)
Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>
|
2026-01-13 03:57:14 -05:00 |
|
JennyLiu
|
2967d299fb
|
[TRTLLM-10271][test] Add Spark QA functional and performance cases (#10564)
Signed-off-by: Jenny Liu <JennyLiu-nv+JennyLiu@users.noreply.github.com>
Co-authored-by: Jenny Liu <JennyLiu-nv+JennyLiu@users.noreply.github.com>
|
2026-01-13 13:20:15 +08:00 |
|
fredricz-20070104
|
bbe535fddf
|
[None][chore] Fix disagg assert (#10596)
Signed-off-by: FredricZ-2007 <226039983+fredricz-20070104@users.noreply.github.com>
|
2026-01-12 21:39:57 -05:00 |
|
Iman Tabrizian
|
48b09e5a25
|
[https://nvbugs/5689235][fix] Fix cancellation+chunked prefill+disagg (#10111)
Signed-off-by: Iman Tabrizian <10105175+tabrizian@users.noreply.github.com>
|
2026-01-12 18:23:26 -05:00 |
|
Anish Shanbhag
|
dacc881993
|
[https://nvbugs/5761391][fix] Use correct model names for config database regression tests (#10192)
Signed-off-by: Anish Shanbhag <ashanbhag@nvidia.com>
|
2026-01-12 10:55:07 -08:00 |
|
Suyog Gupta
|
a1385243e1
|
[#10580][fix] re-enable NemotronH MOE MMLU test (#10594)
Signed-off-by: Suyog Gupta <41447211+suyoggupta@users.noreply.github.com>
|
2026-01-12 09:26:07 -08:00 |
|
Emma Qiao
|
9f044b9dd9
|
[None][infra] Waive failed tests for main 01/12 (#10604)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2026-01-12 10:24:54 -05:00 |
|
mpikulski
|
bf7998f1b8
|
[TRTLLM-9522][test] cover LLM API multi_modal_embeddings (#9963)
Signed-off-by: ixlmar <206748156+ixlmar@users.noreply.github.com>
|
2026-01-12 11:38:22 +01:00 |
|
Wanli Jiang
|
11da7e3605
|
[None][fix] Solve pillow version conflict (#10537)
Signed-off-by: Wanli Jiang <35160485+Wanli-Jiang@users.noreply.github.com>
|
2026-01-12 04:05:54 -05:00 |
|
Zhenhuan Chen
|
3bd319dc8e
|
[https://nvbugs/5794796][chore] waive test blocking premerge (#10593)
Signed-off-by: Zhenhuan Chen <zhenhuanc@nvidia.com>
|
2026-01-12 15:39:07 +08:00 |
|
yufeiwu-nv
|
8e806abac3
|
[None][test] Remove most TRT-backend test cases in llm_perf_nim.yml (#10572)
Signed-off-by: Fanrong Li <23290157+lfr-0531@users.noreply.github.com>
Signed-off-by: yufeiwu-nv <230315618+yufeiwu-nv@users.noreply.github.com>
Co-authored-by: Fanrong Li <23290157+lfr-0531@users.noreply.github.com>
|
2026-01-12 15:34:55 +08:00 |
|
yingguo-trt
|
c5914f9085
|
[None][chore] update deepseekv3.2 test parameter (#10595)
Signed-off-by: yingguo-trt <244492186+yingguo-trt@users.noreply.github.com>
|
2026-01-12 01:43:22 -05:00 |
|
chenfeiz0326
|
54459377d2
|
[TRTLLM-10248][feat] Support Bot to Send Perf Regression Msg to Slack Channel (#10489)
Signed-off-by: Chenfei Zhang <chenfeiz@nvidia.com>
|
2026-01-12 14:23:23 +08:00 |
|
Jie Li
|
5e0dbba0c9
|
[None][chore]: update waive list (#10577)
Signed-off-by: Jie Li <lijie@nvidia.com>
|
2026-01-11 22:18:04 -05:00 |
|
Eran Geva
|
c5d5af9e7f
|
[#8391][chore] removed llama and added deepseek to AutoDeploy's L0 perf test (#10585)
Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>
|
2026-01-11 16:31:24 -05:00 |
|
Ivy Zhang
|
7f018c89e9
|
[None][test] update core test list (#10538)
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
|
2026-01-11 14:08:20 -05:00 |
|
Yechan Kim
|
8e0d20d901
|
[TRTLLM-10195][feat] K-EXAONE support (#10355)
Signed-off-by: Jaedeok Kim <jaedeokk@nvidia.com>
Signed-off-by: yechank <161688079+yechank-nvidia@users.noreply.github.com>
Co-authored-by: Jaedeok Kim <jaedeokk@nvidia.com>
|
2026-01-12 00:29:51 +09:00 |
|
HuiGao-NV
|
3c65ec3c55
|
[None][chore] waive test case (#10581)
Signed-off-by: Hui Gao <huig@nvidia.com>
|
2026-01-10 18:53:36 -05:00 |
|
fredricz-20070104
|
f6045fac09
|
[None][chore] Fix Gitlab CI termination issues (#10576)
Signed-off-by: FredricZ-2007 <226039983+fredricz-20070104@users.noreply.github.com>
Signed-off-by: yufeiwu-nv <230315618+yufeiwu-nv@users.noreply.github.com>
Co-authored-by: yufeiwu-nv <230315618+yufeiwu-nv@users.noreply.github.com>
|
2026-01-10 07:51:18 -05:00 |
|
William Zhang
|
ff7eb93f31
|
[https://nvbugs/5669097][tests] Add MMMU test for mistral small (#10530)
Signed-off-by: William Zhang <133824995+2ez4bz@users.noreply.github.com>
|
2026-01-09 16:09:28 -08:00 |
|
Chenghao Zhang
|
38f249b479
|
[https://nvbugs/5548861][fix] AutoDeploy: Fix the test (#10521)
Signed-off-by: Chenghao Zhang <211069071+nvchenghaoz@users.noreply.github.com>
|
2026-01-09 13:30:24 -08:00 |
|
yingguo-trt
|
d80f01d205
|
[None][feat] Add support for DeepSeek v3.2 tests (#10561)
Signed-off-by: yingguo-trt <244492186+yingguo-trt@users.noreply.github.com>
|
2026-01-09 10:20:29 -05:00 |
|
Yechan Kim
|
7295af68ba
|
[None][fix] Enable AttentionDP on Qwen3-VL and fix test (#10435)
Signed-off-by: yechank <161688079+yechank-nvidia@users.noreply.github.com>
|
2026-01-10 00:13:26 +09:00 |
|
Iman Tabrizian
|
ced88424ef
|
[https://nvbugs/5756008][fix] unwaive test (#10523)
Signed-off-by: Iman Tabrizian <10105175+tabrizian@users.noreply.github.com>
|
2026-01-09 09:40:07 -05:00 |
|
Jie Li
|
627d306df9
|
[None][chore] remove some model support; add device constraint (#10563)
Signed-off-by: Jie Li <lijie@nvidia.com>
|
2026-01-09 09:36:23 -05:00 |
|
ruodil
|
2b72d33fdc
|
[TRTLLM-9932][test] add kimi_k2 single node perf test (#10436)
Signed-off-by: Ruodi Lu <ruodil@users.noreply.github.com>
Co-authored-by: Ruodi Lu <ruodil@users.noreply.github.com>
|
2026-01-09 05:36:50 -05:00 |
|
bhsueh_NV
|
4a09acd012
|
[https://nvbugs/5785206][infra] unwaive the accuracy/test_llm_api_pytorch.py::TestQwen3_30B_A3B (#10560)
Signed-off-by: bhsueh <11360707+byshiue@users.noreply.github.com>
|
2026-01-09 03:13:29 -05:00 |
|