Emma Qiao
|
35c24424f6
|
[None][infra] Waive failed cases in post-merge on 01/14 (#10668)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2026-01-14 21:39:32 +08:00 |
|
HuiGao-NV
|
b10704428d
|
[https://nvbugs/5787566][fix] Only keep a limited number of performance statistic data (#10569)
Signed-off-by: Hui Gao <huig@nvidia.com>
|
2026-01-14 07:53:01 -05:00 |
|
Bo Li
|
582dec5bb5
|
[https://nvbugs/5774869][infra] Use 2 GPUs to test skip softmax attention on H100. (#10420)
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
|
2026-01-14 07:03:01 -05:00 |
|
shuyixiong
|
babd5ecacc
|
[https://nvbugs/5760740][fix] Enable ray tests (#10272)
Signed-off-by: shuyix <219646547+shuyixiong@users.noreply.github.com>
|
2026-01-14 19:25:46 +08:00 |
|
Kyungmin Lee
|
25148d3fee
|
[None][feat] Support new Transformers RoPE configuration format (#10636)
Signed-off-by: lkm2835 <lkm2835@gmail.com>
|
2026-01-14 19:41:27 +09:00 |
|
xxi
|
e9817461ba
|
[None][chore] improve the readability of log for cutlass can only sup… (#10630)
Signed-off-by: xxi <xxi@nvidia.com>
|
2026-01-14 05:33:45 -05:00 |
|
xxi
|
d8862505b9
|
[None][chore] enable EPLB for DEEPGEMM (#10617)
Signed-off-by: xxi <xxi@nvidia.com>
|
2026-01-14 05:28:08 -05:00 |
|
xinhe-nv
|
272688c663
|
[None][fix] fix L0 issues (#10670)
Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
|
2026-01-14 18:09:40 +08:00 |
|
jmydurant
|
e7882d5c74
|
[None][feat] MiniMax M2 support (#10532)
Signed-off-by: Mingyang Jiang <13463932+jmydurant@users.noreply.github.com>
|
2026-01-14 17:38:58 +08:00 |
|
mpikulski
|
052c36ddd2
|
[TRTLLM-9522][feat] support image_embeds in OpenAI API (#9715)
Signed-off-by: ixlmar <206748156+ixlmar@users.noreply.github.com>
|
2026-01-14 10:31:03 +01:00 |
|
Bo Li
|
487287a412
|
[None][chore] Update test name MNNVL->NVLinkTwoSided. (#9672)
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
|
2026-01-14 04:29:57 -05:00 |
|
Zhenhuan Chen
|
287f6c2e0f
|
[None][test] add log_samples and output_path for trtllm_eval (#10629)
Signed-off-by: Zhenhuan Chen <zhenhuanc@nvidia.com>
|
2026-01-14 16:01:38 +08:00 |
|
QI JUN
|
c4da4fd462
|
[https://nvbugs/5637220][ci] unwaive TestQwen3_235B_A22B::test_nvfp4[latency_moe_trtllm_attention_dp] (#9870)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
Signed-off-by: QI JUN <22017000+QiJune@users.noreply.github.com>
|
2026-01-14 15:41:14 +08:00 |
|
Yukun He
|
15281de799
|
[None][fix] Reduce host overhead for unified nvfp4 gemm tuning path. (#10503)
Signed-off-by: Yukun He <23156053+hyukn@users.noreply.github.com>
|
2026-01-14 14:26:18 +08:00 |
|
Yuxian Qiu
|
39cefd6125
|
[None][refactor] Unify the usage of MPIDist and TorchDist. (#10380)
Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
|
2026-01-14 14:05:47 +08:00 |
|
xxi
|
f841b43cde
|
[None][chore] waive the CI failure (#10655)
Signed-off-by: xxi <xxi@nvidia.com>
|
2026-01-14 13:59:15 +08:00 |
|
JennyLiu
|
92ae490410
|
[None][test] Spark - Change testlist name and perf yml format (#10626)
Signed-off-by: Jenny Liu <JennyLiu-nv+JennyLiu@users.noreply.github.com>
Co-authored-by: Jenny Liu <JennyLiu-nv+JennyLiu@users.noreply.github.com>
|
2026-01-13 23:07:11 -05:00 |
|
xinhe-nv
|
07d9390e9b
|
[None][test] add test into qa test list (#10627)
Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
|
2026-01-13 22:43:00 -05:00 |
|
tburt-nv
|
b65c515314
|
[None][chore] update allowlist 2026-01-13 (#10645)
Signed-off-by: Tyler Burt <195370667+tburt-nv@users.noreply.github.com>
|
2026-01-13 22:23:03 -05:00 |
|
TensorRT LLM
|
dd22324675
|
[None][infra] Check in most recent lock file from nightly pipeline
Signed-off-by: TensorRT LLM <90828364+tensorrt-cicd@users.noreply.github.com>
|
2026-01-14 03:07:57 +00:00 |
|
xinhe-nv
|
7305c61fc9
|
[TRTLLM-8638][fix] Add failed cases into waives.txt (#10589)
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
|
2026-01-13 22:00:20 -05:00 |
|
Leslie Fang
|
795e690bca
|
[https://nvbugs/5753788][chore] Padding empty chunk for configurable moe (#10451)
Signed-off-by: leslie-fang25 <leslief@nvidia.com>
|
2026-01-14 10:42:17 +08:00 |
|
Yuxian Qiu
|
d3f4fbb742
|
[None][fix] Avoid write-write race for async pp send. (#10488)
Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
|
2026-01-14 09:39:36 +08:00 |
|
Yuxian Qiu
|
2acd03030a
|
[https://nvbugs/5781589][fix] Implement pp skip forward for all spec workers. (#10578)
Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
|
2026-01-14 09:36:35 +08:00 |
|
Leslie Fang
|
bc119f5644
|
[None][chore] Add test configurable moe module (#10575)
Signed-off-by: leslie-fang25 <leslief@nvidia.com>
|
2026-01-14 07:25:57 +08:00 |
|
Balaram Buddharaju
|
ccdfa43a6e
|
[https://nvbugs/5791900][fix] Fix HelixCpMnnvlMemory init with PP (#10533)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
|
2026-01-13 15:48:42 -05:00 |
|
Frida Hou
|
bf16fbd86c
|
[#9283][feat] AutoDeploy: separate rms pattern detection from fusion (#9969)
Signed-off-by: Fridah-nv <201670829+Fridah-nv@users.noreply.github.com>
|
2026-01-13 14:57:27 -05:00 |
|
Neta Zmora
|
7b7f1e2ba1
|
[None][feat] AutoDeploy: refactor memory usage logging (#8505)
Signed-off-by: Neta Zmora <96238833+nzmora-nvidia@users.noreply.github.com>
Signed-off-by: Gal Hubara-Agam <96368689+galagam@users.noreply.github.com>
Co-authored-by: Gal Hubara-Agam <96368689+galagam@users.noreply.github.com>
|
2026-01-13 21:03:09 +02:00 |
|
dongfengy
|
6ee8dbfe0b
|
[https://nvbugs/5772396][fix] WAR: Disable TinyGEMM PDL due to accuracy issues (#10619)
Signed-off-by: Dongfeng Yu <dongfengy@nvidia.com>
|
2026-01-13 12:40:11 -05:00 |
|
Yiteng Niu
|
7a47e29dcb
|
[None][infra] support overriding nspect version (#10402)
Signed-off-by: Yiteng Niu <6831097+niukuo@users.noreply.github.com>
|
2026-01-13 23:39:45 +08:00 |
|
benzh-2025
|
6df2c8a074
|
[None][feat] add fp4 gemm + allreduce (#9729)
Signed-off-by: benzh
Signed-off-by: benzh-2025
|
2026-01-13 21:11:13 +08:00 |
|
Guoming Zhang
|
c1b0b7350f
|
[None][test] Unwaive qwen3 next test case. (#9877)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
|
2026-01-13 20:42:31 +08:00 |
|
Tailing Yuan
|
38296a472b
|
[None][feat] Layer-wise benchmarks: make model init more general and support weights loading (#10562)
Signed-off-by: Tailing Yuan <yuantailing@gmail.com>
|
2026-01-13 19:17:03 +08:00 |
|
mpikulski
|
50c78179dd
|
[TRTLLM-8425][doc] document Torch Sampler details (#10606)
Signed-off-by: ixlmar <206748156+ixlmar@users.noreply.github.com>
|
2026-01-13 12:01:20 +01:00 |
|
Erin
|
55580f8ec1
|
[NVBUG-5670458][chore] Unwaive lp tests (#10524)
Signed-off-by: Erin Ho <14718778+hchings@users.noreply.github.com>
Signed-off-by: Erin <14718778+hchings@users.noreply.github.com>
|
2026-01-13 04:31:27 -05:00 |
|
Void
|
7d16f3a28b
|
[https://nvbugs/5788127][fix] Use uint64_t as the dtype of lamport_buffer_size to avoid overflow (#10499)
Signed-off-by: Yilin Zhang <18275976+yilin-void@users.noreply.github.com>
|
2026-01-13 17:16:22 +08:00 |
|
Guoming Zhang
|
bdaee87895
|
[TRTLLM-10060][feat] Enable attention dp for Nemotron Super v3. (#10347)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
|
2026-01-13 17:13:55 +08:00 |
|
JunyiXu-nv
|
e291a834db
|
[TRTLLM-8462][feat] Support GET/DELETE v1/responses/{response_id} (#9937)
Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>
|
2026-01-13 03:57:14 -05:00 |
|
Yuxian Qiu
|
04b112651b
|
[None][feat] Hang detection for executor loop and worker. (#10480)
Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
|
2026-01-13 02:34:32 -05:00 |
|
Yiteng Niu
|
50c22b80d7
|
[None][infra] Update allowlist 2026.01.08 (#10535)
Signed-off-by: Yiteng Niu <6831097+niukuo@users.noreply.github.com>
|
2026-01-13 15:28:53 +08:00 |
|
tburt-nv
|
7d41475954
|
[None][infra] try removing shared cache dir mount (#10609)
Signed-off-by: Tyler Burt <195370667+tburt-nv@users.noreply.github.com>
|
2026-01-13 15:07:12 +08:00 |
|
JennyLiu
|
2967d299fb
|
[TRTLLM-10271][test] Add Spark QA functional and performance cases (#10564)
Signed-off-by: Jenny Liu <JennyLiu-nv+JennyLiu@users.noreply.github.com>
Co-authored-by: Jenny Liu <JennyLiu-nv+JennyLiu@users.noreply.github.com>
|
2026-01-13 13:20:15 +08:00 |
|
TensorRT LLM
|
ba1cb6831d
|
[None][infra] Check in most recent lock file from nightly pipeline
Signed-off-by: TensorRT LLM <90828364+tensorrt-cicd@users.noreply.github.com>
|
2026-01-13 03:08:08 +00:00 |
|
fredricz-20070104
|
bbe535fddf
|
[None][chore] Fix disagg assert (#10596)
Signed-off-by: FredricZ-2007 <226039983+fredricz-20070104@users.noreply.github.com>
|
2026-01-12 21:39:57 -05:00 |
|
xxi
|
ba1037ca4a
|
[https://nvbugs/5762336][fix] support to parse the keyword modules_to_not_convert of the HF model config" (#10527)
Signed-off-by: xxi <xxi@nvidia.com>
|
2026-01-12 20:21:01 -05:00 |
|
Iman Tabrizian
|
48b09e5a25
|
[https://nvbugs/5689235][fix] Fix cancellation+chunked prefill+disagg (#10111)
Signed-off-by: Iman Tabrizian <10105175+tabrizian@users.noreply.github.com>
|
2026-01-12 18:23:26 -05:00 |
|
Gal Hubara-Agam
|
18a33764b5
|
[None][chore] Print correct backend name in benchmark report (#10597)
Signed-off-by: Gal Hubara Agam <96368689+galagam@users.noreply.github.com>
|
2026-01-12 14:46:00 -05:00 |
|
Anish Shanbhag
|
dacc881993
|
[https://nvbugs/5761391][fix] Use correct model names for config database regression tests (#10192)
Signed-off-by: Anish Shanbhag <ashanbhag@nvidia.com>
|
2026-01-12 10:55:07 -08:00 |
|
Suyog Gupta
|
a1385243e1
|
[#10580][fix] re-enable NemotronH MOE MMLU test (#10594)
Signed-off-by: Suyog Gupta <41447211+suyoggupta@users.noreply.github.com>
|
2026-01-12 09:26:07 -08:00 |
|
Emma Qiao
|
9f044b9dd9
|
[None][infra] Waive failed tests for main 01/12 (#10604)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2026-01-12 10:24:54 -05:00 |
|