TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

Author	SHA1	Message	Date
sunnyqgg	35b176ae78	[https://nvbugs/5461796 ][fix] Unwaive and extend time for test_llmapi_speculative_decoding_mtp (#9092 ) Signed-off-by: qgai <qgai@nvidia.com>	2025-11-18 19:20:07 +08:00
Chuang Zhu	1c4c737206	[https://nvbugs/5582133 ][fix] unwaive nixl test (#9244 ) Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>	2025-11-18 13:07:30 +08:00
Wanli Jiang	6640aed0c2	[None][fix] Bypass key-word matching for multimodal tests (#9170 ) Signed-off-by: Wanli Jiang <35160485+Wanli-Jiang@users.noreply.github.com>	2025-11-18 10:33:07 +08:00
sunnyqgg	55a9771ff0	[https://nvbugs/5649826 ][fix] Unwaive test test_llm_commandr_plus_4gpus_summary (#9201 ) Signed-off-by: qgai <qgai@nvidia.com>	2025-11-16 23:11:44 -08:00
brb-nv	6d28e6c3a6	[https://nvbugs/5568836 ][fix] Skip keyword matching for Gemma3 e2e test (#9158 ) Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>	2025-11-14 02:18:24 -08:00
Michal Guzek	8e9409ce04	[https://nvbugs/5628204 ][fix] Stop token IDs - fast path optimization for single stop token IDs only (#9014 ) Signed-off-by: Michal Guzek <mguzek@nvidia.com> Signed-off-by: Michal Guzek <moraxu@users.noreply.github.com>	2025-11-13 14:17:20 +01:00
Chuang Zhu	12fa81c679	[https://nvbugs/5628952 ][fix] avoid cudaFree overlap with cuda graph (#8903 ) Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>	2025-11-12 09:08:05 +01:00
peaceh-nv	f1d02b5664	[https://nvbugs/5570575 ][fix] : Use less kv cache memory on SM120 (#9054 ) Signed-off-by: peaceh <103117813+peaceh-nv@users.noreply.github.com>	2025-11-11 15:42:08 +08:00
Lizhi Zhou	0649b77d16	[https://nvbugs/5608743 ][chore] unwaive test (#8994 ) Signed-off-by: Lizhi Zhou <1432185+reasonsolo@users.noreply.github.com>	2025-11-10 05:59:29 -08:00
Yiqing Yan	572f9be06f	[None][chore] Lock onnx version <1.20.0 and remove WAR for TRT 10.13 (#9007 ) Signed-off-by: Yiqing Yan <yiqingy@nvidia.com> Signed-off-by: Yanchao Lu <yanchaol@nvidia.com> Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>	2025-11-10 12:50:37 +08:00
Emma Qiao	a74ce266d3	[None][infra] Waive failed tests for release branch 11/07 (#9026 ) Signed-off-by: qqiao <qqiao@nvidia.com>	2025-11-09 18:18:49 +08:00
dominicshanshan	def2ad5107	[https://nvbugs/5575920 ][fix] Fix cublas/cublasLt handle creation memory not sufficient error (#8900 ) Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>	2025-11-07 10:14:00 -08:00
Ivy Zhang	5cf3f0c981	[https://nvbugs/5636946 ][fix] Update test model (#8993 ) Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>	2025-11-07 15:13:29 +08:00
Emma Qiao	ede230cb3a	[None][infra] Waive failed tests for release branch 11/06 (#8966 ) Signed-off-by: qqiao <qqiao@nvidia.com>	2025-11-07 09:01:26 +08:00
Shiyu Li	519eda29bd	[https://nvbugs/5597647 ][fix] Fix MNNVL unit test failed due to accuracy issue on Hopper (#8891 ) Signed-off-by: Shiyu Li <shili@nvidia.com> Signed-off-by: Shiyu Li <timlee0212@outlook.com>	2025-11-06 18:28:06 +01:00
shuyixiong	69dec201bd	[https://nvbugs/5630700 ][chore] Unwaive Qwen3_235B_A22B test (#8901 ) Signed-off-by: Shuyi Xiong <219646547+shuyixiong@users.noreply.github.com>	2025-11-06 15:32:39 +08:00
Jin Li	f040ef9ffd	[https://nvbugs/5467531 ][fix] Fix moe test and wide ep fake impl (#8883 ) Signed-off-by: Jin Li <59594262+liji-nv@users.noreply.github.com>	2025-11-06 11:40:50 +08:00
sunnyqgg	c2fe686e3e	[https://nvbugs/5608930 ][fix] Wavie TestQwen3_8B::test_chunked_prefill for bug 5608930 (#8940 ) Signed-off-by: qgai <qgai@nvidia.com>	2025-11-05 01:52:09 -08:00
Bo Deng	43843778a7	[https://nvbugs/5601682 ][fix] unwaive test_disaggregated_deepseek_v3_… (#8888 ) Signed-off-by: Bo Deng <deemod@nvidia.com>	2025-11-05 09:33:57 +08:00
Simeng Liu	0206d8d0fc	[https://nvbugs/5606136 ][fix] Fix torch.onnx.export with pytorch upgrade to fallback to dynamo=False. (#8917 ) Signed-off-by: Simeng Liu <simengl@nvidia.com>	2025-11-04 14:11:48 -08:00
Yan Chunwei	cacb8a84f2	[https://nvbugs/5606266 ][test] move qwen3 multi-node test to the qa list (#8908 ) Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>	2025-11-04 02:12:02 -08:00
Shi Xiaowei	324f63f26a	[https://nvbugs/5451272 ][fix] unwaive the test (#8608 ) Signed-off-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>	2025-11-04 01:31:41 -08:00
xiweny	7d8a913406	[https://nvbugs/5596343 ] [test] Update accuracy baseline for GPT-OSS-20B (#8842 ) Signed-off-by: Dongfeng Yu <dongfengy@nvidia.com> Signed-off-by: dongfengy <99041270+dongfengy@users.noreply.github.com> Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>	2025-11-04 16:04:11 +08:00
Ivy Zhang	baa6ba0d69	[None][chore] Update test list (#8835 ) Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>	2025-11-03 21:42:01 -08:00
brb-nv	095b7a3ad5	[https://nvbugs/5521253 ][fix] Enable Gemma3 12B & 27B on SM100 (#8666 ) Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>	2025-11-03 14:49:36 -08:00
Emma Qiao	9f1d274a26	[None][infra] Waive failed tests for release branch on 11/03 (#8879 ) Signed-off-by: qqiao <qqiao@nvidia.com>	2025-11-03 02:59:33 -08:00
sunnyqgg	ccadb66efe	[https://nvbugs/5461796 ][fix] Unwaive test test_llmapi_speculative_decoding_mtp (#8832 ) Signed-off-by: qgai <qgai@nvidia.com>	2025-11-03 16:53:24 +08:00
sunnyqgg	d82197846d	[https://nvbugs/5608930 ][fix] Unwaive test 5608930 (#8831 ) Signed-off-by: qgai <qgai@nvidia.com>	2025-11-03 15:09:58 +08:00
yunruis	07077fb070	[https://nvbugs/5606268 ][fix] Fix program exit segment fault triggered CublasMMWarpper deconstructor (#8834 ) Signed-off-by: yunruis <205571022+yunruis@users.noreply.github.com>	2025-11-03 14:46:01 +08:00
Yan Chunwei	0d105448b1	[https://nvbugs/5606266 ][fix] unwaive test (#8867 ) Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>	2025-11-02 21:43:58 -08:00
Zhanrui Sun	776bb25bfd	[TRTLLM-8658][infra] upgrade to DLFW 25.10 and pytorch 2.9.0 / triton 3.5.0 (#8621 ) Signed-off-by: ZhanruiSunCh <184402041+ZhanruiSunCh@users.noreply.github.com> Signed-off-by: Zhanrui Sun <184402041+ZhanruiSunCh@users.noreply.github.com> Signed-off-by: Yanchao Lu <yanchaol@nvidia.com> Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>	2025-11-03 09:24:58 +08:00
dongxuy04	d81ebb5b4d	[https://nvbugs/5444687 ][fix] Cherrypick online EPLB CI fix from main to release 1.1 (#8854 ) Signed-off-by: Dongxu Yang <78518666+dongxuy04@users.noreply.github.com>	2025-11-03 09:17:51 +08:00
dongfengy	f5575a9146	[https://nvbugs/5474119 ][fix] Cherry-pick https://github.com/NVIDIA/TensorRT-LLM/pull/8809 (#8847 ) Signed-off-by: Dongfeng Yu <dongfengy@nvidia.com>	2025-11-02 16:44:06 -08:00
Barry Kang	f22a87f296	[https://nvbugs/5325296 ][fix] Enable relaxed acceptance test on Blackwell (#8709 ) Signed-off-by: Barry Kang <43644113+Barry-Delaney@users.noreply.github.com>	2025-10-31 15:02:06 -07:00
Lucas Liebenwein	752cc3a8cb	[https://nvbugs/5606166 ][fix] AutoDeploy: use tuples for cudagraph shape lookup (#8772 ) Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>	2025-10-31 13:59:48 +01:00
Zhanrui Sun	d2071d7ed7	[None][infra] Remove invaild waived tests which not in release branch (#8841 ) Signed-off-by: ZhanruiSunCh <184402041+ZhanruiSunCh@users.noreply.github.com>	2025-10-31 03:02:34 -07:00
Emma Qiao	421d48f402	[None][infra] Skip failed tests for release branch (#8833 ) Signed-off-by: qqiao <qqiao@nvidia.com>	2025-10-31 15:04:54 +08:00
Jin Li	28673f3e9c	[https://nvbugs/5488118 ][fix] Unwaive passed tests (#8758 ) Signed-off-by: Jin Li <59594262+liji-nv@users.noreply.github.com>	2025-10-31 10:46:44 +08:00
Emma Qiao	9ee0075921	[TRTLLM-8971][infra] Cherry-pick for Update gpu key for B300/GB300 (#8724 ) (#8796 ) Signed-off-by: qqiao <qqiao@nvidia.com>	2025-10-30 06:12:16 -07:00
Dom Brown	9410ce3bea	[https://nvbugs/5575841 ] [test] Move test_moe.py to serial tests to improve stability + unwaive FP4 MoE torch unit tests (#8422 ) Signed-off-by: Dom Brown <3886319+DomBrown@users.noreply.github.com>	2025-10-30 13:57:56 +01:00
Emma Qiao	ec510ad72a	[None][infra] Waive failed tests for release branch (#8760 ) Signed-off-by: qqiao <qqiao@nvidia.com>	2025-10-29 06:32:30 -07:00
xiweny	f49f42db59	[https://nvbugs/5601203 ] [fix]Restrict fp8 blockscale moe case (#8583 ) Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>	2025-10-29 10:47:32 +08:00
Eran Geva	db3c373d3a	[https://nvbugs/5572320 ][fix] Ported test_ad_trtllm_bench.py from main (#8671 ) Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>	2025-10-28 09:41:32 +02:00
Yukun He	e04354bc09	[https://nvbugs/5608489 ][fix] Fix output unpack issues for Llama3/4 NVFP4 models. (#8679 ) Signed-off-by: Yukun He <23156053+hyukn@users.noreply.github.com>	2025-10-28 14:21:47 +08:00
Shiyu Li	28c9a51c06	[https://nvbugs/5597647 ][fix] Fix MNNVL Allreduce accuracy issue on Hopper (#8612 ) Signed-off-by: Shiyu Li <shili@nvidia.com>	2025-10-26 23:06:45 -07:00
Emma Qiao	b05555faeb	[None][infra] Waive failed tests for release 10/24 (#8656 ) Signed-off-by: qqiao <qqiao@nvidia.com> Signed-off-by: Emma Qiao <qqiao@nvidia.com>	2025-10-24 21:53:35 +08:00
Ivy Zhang	1859b55d22	[None][test] Clean cache for certain easily hang cases (#8619 ) Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> Co-authored-by: Larry Xu <197874197+LarryXFly@users.noreply.github.com>	2025-10-24 08:17:32 -04:00
Jie Li	7749ec406b	[https://nvbugs/5587456 ][fix] Remove multimodal test cases using TRT backend (#8611 ) Signed-off-by: Jie Li <lijie@nvidia.com>	2025-10-24 18:04:43 +08:00
Jie Li	4b52054bdd	[https://nvbugs/5541145 ][fix] Remove DeepSeekR1 test case from H20 to prevent OOM (#8610 ) Signed-off-by: Jie Li <lijie@nvidia.com>	2025-10-24 05:20:40 -04:00
Leslie Fang	d9d898e8b7	[https://nvbugs/5608461 ][fix] exclude InductorSubproc from thread leak check (#8624 ) Signed-off-by: leslie-fang25 <leslief@nvidia.com>	2025-10-24 13:08:42 +08:00

1688 Commits