Ivy Zhang
|
782dfca7e8
|
[TRTLLM-9050][test] add llama4 disagg case to cover kv cache overflow error (#9172)
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
|
2025-11-18 18:26:32 -08:00 |
|
xinhe-nv
|
35658eab55
|
[None][chore] Add failed cases into waives.txt (#9193)
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
|
2025-11-18 17:47:55 -08:00 |
|
Enwei Zhu
|
7c4777a571
|
[TRTLLM-9286][feat] Integration of CuteDSL NVFP4 grouped GEMM (#8880)
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
|
2025-11-18 17:40:12 -08:00 |
|
Lizhi Zhou
|
c789000a62
|
[https://nvbugs/5649010][fix] increase status-checking interval to avoid instability (#9203)
Signed-off-by: Lizhi Zhou <1432185+reasonsolo@users.noreply.github.com>
|
2025-11-19 08:55:42 +08:00 |
|
Kaiyu Xie
|
d076aa44d3
|
[None] [tests] Unwaive wide ep related tests (#9204)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-11-18 08:54:46 -08:00 |
|
Ivy Zhang
|
160b361588
|
[TRTLLM-8949][test] Add rcca test case for eagle3 consistency check (#9088)
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
|
2025-11-18 05:55:00 -08:00 |
|
Ivy Zhang
|
ca41a71f92
|
[TRTLLM-8948][test] Add long bench case (#9165)
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
|
2025-11-18 04:41:48 -08:00 |
|
Tri Dao
|
fc088e642c
|
[None][feat] Support Glm4MoeForCausalLM (#8256)
Signed-off-by: Tri Dao <daominhtri0503@gmail.com>
Co-authored-by: Xuanyu Chen <xuanyuc@nvidia.com>
|
2025-11-18 09:43:21 +08:00 |
|
QI JUN
|
c3376fa114
|
[None][ci] split speculative test case into several small cases (#9209)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
|
2025-11-17 17:02:25 -08:00 |
|
Emma Qiao
|
d16b1a84c5
|
[None][infra] Waive a failed case in pre-merge stage 11/16 (#9192)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-17 09:36:56 +08:00 |
|
Emma Qiao
|
2854f0cf3d
|
[None][infra] Waive failed tests for main branch 11/15 (#9187)
Signed-off-by: qqiao <qqiao@nvidia.com>
Signed-off-by: Emma Qiao <qqiao@nvidia.com>
|
2025-11-16 01:48:25 -08:00 |
|
brb-nv
|
63237494db
|
[None][chore] Waive failing tests blocking pre-merge (#9189)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
|
2025-11-16 01:06:03 -08:00 |
|
Chang Liu
|
bed4e95e9f
|
[https://nvbugs/5629887][fix] Add missing device count guard for DSv32 multiGPU tests (#9159)
|
2025-11-14 07:52:23 -08:00 |
|
xinhe-nv
|
49b7e6301a
|
[None][chore] Add failed cases into waives.txt (#9156)
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
|
2025-11-14 06:28:22 -08:00 |
|
yuanjingx87
|
d72321a32e
|
[None][ci] Waive unittest/_torch/sampler/test_torch_sampler.py::TestBatchedSampling (#9161)
Signed-off-by: Yuanjing Xue <197832395+yuanjingx87@users.noreply.github.com>
|
2025-11-14 01:49:26 -08:00 |
|
QI JUN
|
3c950910a0
|
[None][ci] waive test_disaggregated.py::test_disaggregated_mixed[TinyLlama-1.1B-Chat-v1.0] (#9162)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>
|
2025-11-13 18:56:37 -08:00 |
|
Tailing Yuan
|
cc4c980e03
|
[None][feat] Add Qwen3-Next to layer-wise benchmarks (#9065)
Signed-off-by: Tailing Yuan <yuantailing@gmail.com>
|
2025-11-14 10:03:00 +08:00 |
|
Erin
|
44d1c75701
|
[TRTLLM-8988][feat] Unify MPI & Ray's req/response handling with RPC Client/Server (#8765)
Signed-off-by: Erin Ho <14718778+hchings@users.noreply.github.com>
|
2025-11-13 17:21:24 -08:00 |
|
William Zhang
|
121140cfec
|
[None][fixes] Add tool call parsing fixes and Qwen3 coder parser (#8817)
Signed-off-by: William Zhang <133824995+2ez4bz@users.noreply.github.com>
|
2025-11-13 04:34:38 -08:00 |
|
Lizhi Zhou
|
48a27c7bef
|
[https://nvbugs/5633340][chore] unwaive test_auto_scaling.py::test_disagg_server_restart (#9131)
Signed-off-by: Lizhi Zhou <1432185+reasonsolo@users.noreply.github.com>
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>
Co-authored-by: Zhanrui Sun <184402041+ZhanruiSunCh@users.noreply.github.com>
|
2025-11-13 01:45:36 -08:00 |
|
Emma Qiao
|
d0ea417ec8
|
[None][infra] Waive failed tests for main 11/13 (#9132)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-13 01:00:40 -08:00 |
|
xinhe-nv
|
548f5ce4bc
|
[None][fix] waive failed tests (#9090)
Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
|
2025-11-12 23:40:00 -08:00 |
|
xinhe-nv
|
8fa3c55c76
|
[None][chore] Remove closed bugs (#9114)
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
|
2025-11-12 22:49:37 -08:00 |
|
ruodil
|
c86e36fe38
|
[None][test] add deepseek and qwen cases for rtx series (#8839)
Signed-off-by: Ruodi Lu <ruodil@users.noreply.github.com>
Co-authored-by: Ruodi Lu <ruodil@users.noreply.github.com>
|
2025-11-12 22:28:02 -08:00 |
|
HuiGao-NV
|
cde18c12da
|
[https://nvbugs/5640873][fix] Move thop tests to pre-merge (#9094)
Signed-off-by: Hui Gao <huig@nvidia.com>
|
2025-11-13 13:08:13 +08:00 |
|
Yan Chunwei
|
4fd93bdc2c
|
[None][ci] Waive test_llm_rpc and test_llm_rpc_streaming (#9118)
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
|
2025-11-12 19:55:09 -08:00 |
|
QI JUN
|
3416efbc29
|
[None][ci] waive test_disaggregated_serving.py::TestQwen3_8B::test_chunked_prefill (#9111)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
|
2025-11-13 10:06:32 +08:00 |
|
dongxuy04
|
9241ccaf27
|
[None][feat] Enable EPLB for trtllm-gen and cutlass backend (#8886)
Signed-off-by: Dongxu Yang <78518666+dongxuy04@users.noreply.github.com>
|
2025-11-12 12:30:27 -08:00 |
|
Chenghao Zhang
|
5f26c31954
|
[https://nvbugs/5636912][fix] AutoDeploy: Unwaive the test (#9018)
Signed-off-by: Chenghao Zhang <211069071+nvchenghaoz@users.noreply.github.com>
|
2025-11-12 12:26:38 -08:00 |
|
Iman Tabrizian
|
cdde15b275
|
[TRTLLM-8540][feat] Add support for disagg in DSv3.2 (#8735)
Signed-off-by: Iman Tabrizian <10105175+tabrizian@users.noreply.github.com>
|
2025-11-12 08:21:11 -08:00 |
|
yufeiwu-nv
|
b7a2574c60
|
[https://nvbugs/5568991][test] Remove Phi-3 models (#9066)
Signed-off-by: yufeiwu-nv <230315618+yufeiwu-nv@users.noreply.github.com>
|
2025-11-12 03:16:36 -08:00 |
|
QI JUN
|
4003dc7574
|
[None][ci] waive some test cases of disaggregated serving (#9085)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>
|
2025-11-12 15:06:21 +08:00 |
|
Emma Qiao
|
bb6eb9510d
|
[None][infra] Waive a failed case of disaggregated/test_disaggregated.py (#9074)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-11 19:38:32 -08:00 |
|
Wanli Jiang
|
ebdd1cc8e0
|
[TRTLLM-8119][feat] Update doc/tests/chat_template for nano-v2-vlm (#8840)
Signed-off-by: Wanli Jiang <35160485+Wanli-Jiang@users.noreply.github.com>
|
2025-11-11 07:48:23 -08:00 |
|
QI JUN
|
0ce22ce928
|
[None][ci] waive test_disaggregated_serving.py::TestQwen3_8B::test_auto_dtype[False] (#9069)
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
|
2025-11-11 02:11:15 -08:00 |
|
Yiqing Yan
|
b7d51c5549
|
[None][chore] Remove duplicated waive test (#9067)
Signed-off-by: Yiqing Yan <yiqingy@nvidia.com>
|
2025-11-11 16:49:49 +08:00 |
|
Emma Qiao
|
da1f0e2465
|
[None][infra] Waive failed tests on main 11/11 (#9058)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-11 13:19:30 +08:00 |
|
xinhe-nv
|
fac522056c
|
[None][chore] Add failed cases into waives.txt (#8998)
Signed-off-by: Jie Li <lijie@nvidia.com>
Co-authored-by: Jie Li <lijie@nvidia.com>
|
2025-11-11 12:40:59 +08:00 |
|
xiweny
|
50c486367a
|
[https://nvbugs/5619396][fix] Add sm103 to CutlassFP8RowwiseGemm (#9042)
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
|
2025-11-10 08:12:14 -08:00 |
|
xinhe-nv
|
f848d844d9
|
[None][chore] Add failed cases into waives.txt (#9030)
Signed-off-by: xinhe-nv <200704525+xinhe-nv@users.noreply.github.com>
Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
|
2025-11-09 23:36:05 -08:00 |
|
Fanrong Li
|
a7033a9193
|
[TRTLLM-9001][feat] add TP support for DeepSeek-V3.2 (#8943)
Signed-off-by: Fanrong Li <23290157+lfr-0531@users.noreply.github.com>
|
2025-11-10 12:16:01 +08:00 |
|
Bo Li
|
67af7c15a5
|
[https://nvbugs/5637037][fix] Update unwaive list. (#9001)
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
|
2025-11-10 08:53:07 +08:00 |
|
Emma Qiao
|
183778d58a
|
[None][infra] Waive failed tests for main 11/07 (#9008)
Signed-off-by: qqiao <qqiao@nvidia.com>
|
2025-11-08 08:51:35 -08:00 |
|
Emma Qiao
|
2af6a537ad
|
[TRTLLM-8999][infra] Reduce gb200 multi-node test stages (#8778)
Signed-off-by: qqiao <qqiao@nvidia.com>
Signed-off-by: Emma Qiao <qqiao@nvidia.com>
|
2025-11-08 06:34:24 -08:00 |
|
Yuxian Qiu
|
7b82ba90da
|
[https://nvbugs/5629790][chore] unwaive test. (#8967)
Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
|
2025-11-07 18:41:32 +08:00 |
|
Lizhi Zhou
|
b26e1617f2
|
[https://nvbugs/5633340][fix] kill processes properly after test (#8970)
Signed-off-by: Lizhi Zhou <1432185+reasonsolo@users.noreply.github.com>
|
2025-11-06 21:45:38 -08:00 |
|
xiweny
|
ee20e679a9
|
[https://nvbugs/5636986][fix] Fix DeepGemmMoe get_buffer calls (#8939)
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
Signed-off-by: xiweny <13230610+VALLIS-NERIA@users.noreply.github.com>
|
2025-11-06 19:57:19 -08:00 |
|
Simeng Liu
|
9f8d93f89a
|
[https://nvbugs/5606136][ci] Remove tests for deprecating triton multimodal models. (#8926)
Signed-off-by: Simeng Liu <simengl@nvidia.com>
|
2025-11-06 17:58:42 -08:00 |
|
Lucas Liebenwein
|
7a552c450a
|
[https://nvbugs/5606166][fix] AutoDeploy: unwaive test for use tuples for cudagraph shape lookup (#8957)
also updated test waive for another nvbug
Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>
|
2025-11-05 16:27:00 -08:00 |
|
Fanrong Li
|
c2feed798a
|
[https://nvbugs/5630345][chore] unwaive DS-v32 nvfp4 and fp8 tests (#8887)
Signed-off-by: Fanrong Li <23290157+lfr-0531@users.noreply.github.com>
|
2025-11-05 03:49:23 -08:00 |
|