mpikulski
|
46dd9886bb
|
[https://nvbugs/5661877][fix] fix test regression in TestBatchedSampling::test_samples (#9215)
Signed-off-by: ixlmar <206748156+ixlmar@users.noreply.github.com>
|
2025-11-19 01:44:44 -08:00 |
|
xinhe-nv
|
0f77fec932
|
[None][chore] Add failed cases into waives.txt (#9289)
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
|
2025-11-19 17:03:43 +08:00 |
|
nvxuanyuc
|
a79c0dfb43
|
[None][fix] Update GLM model accuracy test (#9286)
Signed-off-by: Xuanyu Chen <xuanyuc@nvidia.com>
|
2025-11-18 21:59:01 -08:00 |
|
Emma Qiao
|
67d3eb26af
|
[None][infra] Waive failed cases for main branch on 11/17 (#9266)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-18 20:07:03 -08:00 |
|
xinhe-nv
|
286ace22ed
|
[None][chore] Add failed cases into waives.txt (#9242)
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
|
2025-11-18 19:27:55 -08:00 |
|
Ivy Zhang
|
782dfca7e8
|
[TRTLLM-9050][test] add llama4 disagg case to cover kv cache overflow error (#9172)
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
|
2025-11-18 18:26:32 -08:00 |
|
xinhe-nv
|
35658eab55
|
[None][chore] Add failed cases into waives.txt (#9193)
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
|
2025-11-18 17:47:55 -08:00 |
|
Enwei Zhu
|
7c4777a571
|
[TRTLLM-9286][feat] Integration of CuteDSL NVFP4 grouped GEMM (#8880)
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
|
2025-11-18 17:40:12 -08:00 |
|
Lizhi Zhou
|
c789000a62
|
[https://nvbugs/5649010][fix] increase status-checking interval to avoid instability (#9203)
Signed-off-by: Lizhi Zhou <1432185+reasonsolo@users.noreply.github.com>
|
2025-11-19 08:55:42 +08:00 |
|
Bo Deng
|
34f845bf69
|
[TRTLLM-9287][infra] Use NIXL backend for accuracy tests (#9247)
Signed-off-by: Bo Deng <deemod@nvidia.com>
|
2025-11-18 14:46:20 -08:00 |
|
Ajinkya Rasane
|
8d7cda2318
|
[None][chore] Update the Flux autodeploy example (#8434)
Signed-off-by: ajrasane <131806219+ajrasane@users.noreply.github.com>
Co-authored-by: Frida Hou <201670829+Fridah-nv@users.noreply.github.com>
|
2025-11-18 14:16:04 -08:00 |
|
Kaiyu Xie
|
d076aa44d3
|
[None] [tests] Unwaive wide ep related tests (#9204)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-11-18 08:54:46 -08:00 |
|
Ivy Zhang
|
160b361588
|
[TRTLLM-8949][test] Add rcca test case for eagle3 consistency check (#9088)
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
|
2025-11-18 05:55:00 -08:00 |
|
Ivy Zhang
|
ca41a71f92
|
[TRTLLM-8948][test] Add long bench case (#9165)
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
|
2025-11-18 04:41:48 -08:00 |
|
Tri Dao
|
fc088e642c
|
[None][feat] Support Glm4MoeForCausalLM (#8256)
Signed-off-by: Tri Dao <daominhtri0503@gmail.com>
Co-authored-by: Xuanyu Chen <xuanyuc@nvidia.com>
|
2025-11-18 09:43:21 +08:00 |
|
QI JUN
|
c3376fa114
|
[None][ci] split speculative test case into several small cases (#9209)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
|
2025-11-17 17:02:25 -08:00 |
|
Robin Kobus
|
df41f220a2
|
[TRTLLM-8831][feat] Enable early exit with overlap scheduler (#8587)
Signed-off-by: Robin Kobus <19427718+Funatiq@users.noreply.github.com>
|
2025-11-17 18:07:13 +01:00 |
|
Emma Qiao
|
d16b1a84c5
|
[None][infra] Waive a failed case in pre-merge stage 11/16 (#9192)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-17 09:36:56 +08:00 |
|
Emma Qiao
|
2854f0cf3d
|
[None][infra] Waive failed tests for main branch 11/15 (#9187)
Signed-off-by: qqiao <qqiao@nvidia.com>
Signed-off-by: Emma Qiao <qqiao@nvidia.com>
|
2025-11-16 01:48:25 -08:00 |
|
brb-nv
|
63237494db
|
[None][chore] Waive failing tests blocking pre-merge (#9189)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
|
2025-11-16 01:06:03 -08:00 |
|
Chang Liu
|
bed4e95e9f
|
[https://nvbugs/5629887][fix] Add missing device count guard for DSv32 multiGPU tests (#9159)
|
2025-11-14 07:52:23 -08:00 |
|
xinhe-nv
|
49b7e6301a
|
[None][chore] Add failed cases into waives.txt (#9156)
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
|
2025-11-14 06:28:22 -08:00 |
|
yuanjingx87
|
d72321a32e
|
[None][ci] Waive unittest/_torch/sampler/test_torch_sampler.py::TestBatchedSampling (#9161)
Signed-off-by: Yuanjing Xue <197832395+yuanjingx87@users.noreply.github.com>
|
2025-11-14 01:49:26 -08:00 |
|
QI JUN
|
3c950910a0
|
[None][ci] waive test_disaggregated.py::test_disaggregated_mixed[TinyLlama-1.1B-Chat-v1.0] (#9162)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>
|
2025-11-13 18:56:37 -08:00 |
|
Tailing Yuan
|
cc4c980e03
|
[None][feat] Add Qwen3-Next to layer-wise benchmarks (#9065)
Signed-off-by: Tailing Yuan <yuantailing@gmail.com>
|
2025-11-14 10:03:00 +08:00 |
|
Erin
|
44d1c75701
|
[TRTLLM-8988][feat] Unify MPI & Ray's req/response handling with RPC Client/Server (#8765)
Signed-off-by: Erin Ho <14718778+hchings@users.noreply.github.com>
|
2025-11-13 17:21:24 -08:00 |
|
William Zhang
|
121140cfec
|
[None][fixes] Add tool call parsing fixes and Qwen3 coder parser (#8817)
Signed-off-by: William Zhang <133824995+2ez4bz@users.noreply.github.com>
|
2025-11-13 04:34:38 -08:00 |
|
Lizhi Zhou
|
48a27c7bef
|
[https://nvbugs/5633340][chore] unwaive test_auto_scaling.py::test_disagg_server_restart (#9131)
Signed-off-by: Lizhi Zhou <1432185+reasonsolo@users.noreply.github.com>
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>
Co-authored-by: Zhanrui Sun <184402041+ZhanruiSunCh@users.noreply.github.com>
|
2025-11-13 01:45:36 -08:00 |
|
Emma Qiao
|
d0ea417ec8
|
[None][infra] Waive failed tests for main 11/13 (#9132)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-13 01:00:40 -08:00 |
|
xinhe-nv
|
548f5ce4bc
|
[None][fix] waive failed tests (#9090)
Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
|
2025-11-12 23:40:00 -08:00 |
|
xinhe-nv
|
8fa3c55c76
|
[None][chore] Remove closed bugs (#9114)
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
|
2025-11-12 22:49:37 -08:00 |
|
ruodil
|
c86e36fe38
|
[None][test] add deepseek and qwen cases for rtx series (#8839)
Signed-off-by: Ruodi Lu <ruodil@users.noreply.github.com>
Co-authored-by: Ruodi Lu <ruodil@users.noreply.github.com>
|
2025-11-12 22:28:02 -08:00 |
|
HuiGao-NV
|
cde18c12da
|
[https://nvbugs/5640873][fix] Move thop tests to pre-merge (#9094)
Signed-off-by: Hui Gao <huig@nvidia.com>
|
2025-11-13 13:08:13 +08:00 |
|
Yan Chunwei
|
4fd93bdc2c
|
[None][ci] Waive test_llm_rpc and test_llm_rpc_streaming (#9118)
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
|
2025-11-12 19:55:09 -08:00 |
|
Zhenhuan Chen
|
943b05e2d3
|
[TRTLLM-9179][feat] add pp_partition to customize each rank's layer number (#9003)
Signed-off-by: Zhenhuan Chen <zhenhuanc@nvidia.com>
|
2025-11-13 10:34:17 +08:00 |
|
QI JUN
|
3416efbc29
|
[None][ci] waive test_disaggregated_serving.py::TestQwen3_8B::test_chunked_prefill (#9111)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
|
2025-11-13 10:06:32 +08:00 |
|
dongxuy04
|
9241ccaf27
|
[None][feat] Enable EPLB for trtllm-gen and cutlass backend (#8886)
Signed-off-by: Dongxu Yang <78518666+dongxuy04@users.noreply.github.com>
|
2025-11-12 12:30:27 -08:00 |
|
Chenghao Zhang
|
5f26c31954
|
[https://nvbugs/5636912][fix] AutoDeploy: Unwaive the test (#9018)
Signed-off-by: Chenghao Zhang <211069071+nvchenghaoz@users.noreply.github.com>
|
2025-11-12 12:26:38 -08:00 |
|
Fanrong Li
|
780d4f9dc5
|
[None][feat] Add MTP>1 support for DS-v3.2 (#9045)
Signed-off-by: Fanrong Li <23290157+lfr-0531@users.noreply.github.com>
|
2025-11-12 09:56:12 -08:00 |
|
Iman Tabrizian
|
cdde15b275
|
[TRTLLM-8540][feat] Add support for disagg in DSv3.2 (#8735)
Signed-off-by: Iman Tabrizian <10105175+tabrizian@users.noreply.github.com>
|
2025-11-12 08:21:11 -08:00 |
|
yufeiwu-nv
|
b7a2574c60
|
[https://nvbugs/5568991][test] Remove Phi-3 models (#9066)
Signed-off-by: yufeiwu-nv <230315618+yufeiwu-nv@users.noreply.github.com>
|
2025-11-12 03:16:36 -08:00 |
|
QI JUN
|
4003dc7574
|
[None][ci] waive some test cases of disaggregated serving (#9085)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>
|
2025-11-12 15:06:21 +08:00 |
|
Emma Qiao
|
bb6eb9510d
|
[None][infra] Waive a failed case of disaggregated/test_disaggregated.py (#9074)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-11 19:38:32 -08:00 |
|
QI JUN
|
fd703fbb7b
|
[None][ci] run speculative unit tests serially (#9080)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
|
2025-11-11 19:06:44 -08:00 |
|
Lucas Liebenwein
|
aca56097cb
|
[None][fix] AutoDeploy: update nano3 accuracy test (#9061)
Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>
|
2025-11-11 12:26:31 -08:00 |
|
Wanli Jiang
|
ebdd1cc8e0
|
[TRTLLM-8119][feat] Update doc/tests/chat_template for nano-v2-vlm (#8840)
Signed-off-by: Wanli Jiang <35160485+Wanli-Jiang@users.noreply.github.com>
|
2025-11-11 07:48:23 -08:00 |
|
QI JUN
|
0ce22ce928
|
[None][ci] waive test_disaggregated_serving.py::TestQwen3_8B::test_auto_dtype[False] (#9069)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
|
2025-11-11 02:11:15 -08:00 |
|
Yiqing Yan
|
b7d51c5549
|
[None][chore] Remove duplicated waive test (#9067)
Signed-off-by: Yiqing Yan <yiqingy@nvidia.com>
|
2025-11-11 16:49:49 +08:00 |
|
Emma Qiao
|
da1f0e2465
|
[None][infra] Waive failed tests on main 11/11 (#9058)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-11 13:19:30 +08:00 |
|
xinhe-nv
|
fac522056c
|
[None][chore] Add failed cases into waives.txt (#8998)
Signed-off-by: Jie Li <lijie@nvidia.com>
Co-authored-by: Jie Li <lijie@nvidia.com>
|
2025-11-11 12:40:59 +08:00 |
|