TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-17 08:15:10 +08:00

History

William Zhang abb8106c01 [https://nvbugs/5835925 ][fix] Add EPD disagg support for Qwen3 VL MoE (#10962 ) * Why? Trying to instantiate a `MultimodalEncoder` for a Qwen3 VL MoE model would fail during weight loading. * What? This commit fixes the bug, alongside: - explicit, intentional support for EPD for Qwen3 VL MoE. - extends EPD unit tests for Qwen3 VL MoE, albeit with dummy weights. - unit tests for the weight mapper fixes. Signed-off-by: William Zhang <133824995+2ez4bz@users.noreply.github.com> Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>		2026-02-09 23:53:40 +08:00
..
_torch	[https://nvbugs/5835925 ][fix] Add EPD disagg support for Qwen3 VL MoE (#10962 )	2026-02-09 23:53:40 +08:00
api_stability	[TRTLLM-9457][feat] Add cute dsl fp8 gemm for Blackwell (#10130 )	2026-02-06 09:49:30 +08:00
bindings	[TRTLLM-9527][feat] change context params and disagg params (step3) (#10495 )	2026-01-27 16:34:17 +08:00
disaggregated	[https://nvbugs/5826689 ][fix] replace etcd3 with etcd-sdk-python (#10886 )	2026-02-09 23:53:40 +08:00
executor	[https://nvbugs/5720482 ][fix] Fix test rpc streaming (#9902 )	2025-12-13 01:14:43 -08:00
kv_cache_manager_v2_tests	[None][feat] Enhance support for complex models (#11254 )	2026-02-05 17:28:26 +08:00
llmapi	[https://nvbugs/5804146 ][fix] Enable responses tests and remove ds to… (#10925 )	2026-02-09 23:53:40 +08:00
others	[https://nvbugs/5761391 ][fix] Include triton-kernels as a packaged dependency (#10471 )	2026-01-28 19:56:32 -08:00
scaffolding	[None][feat] Refactor scaffolding streaming feature and fix openai wo… (#8622 )	2025-10-30 16:02:40 +08:00
tools	[None][feat] Add performance alignment to layer-wise benchmarks (#11018 )	2026-01-29 14:01:51 +08:00
trt	[TRTLLM-8682][chore] Remove auto_parallel module (#8329 )	2025-10-22 20:53:08 -04:00
utils	[https://nvbugs/5761391 ][fix] Include triton-kernels as a packaged dependency (#10471 )	2026-01-28 19:56:32 -08:00
conftest.py	[TRTLLM-10415][feat] Dump thread stacks for hanging tests before time… (#10708 )	2026-01-29 20:43:34 +08:00
dump_checkpoint_stats.py
gc_utils.py
profile_utils.py
pytest.ini	[TRTLLM-9181][feat] improve disagg-server prometheus metrics; synchronize workers' clocks when workers are dynamic (#9726 )	2025-12-16 05:16:32 -08:00
test_model_runner_cpp.py
test_pip_install.py	[TRTLLM-10561][infra] Fix jaraco-context and wheel vulnerability (#10901 )	2026-02-03 09:54:11 +08:00