TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

History

William Zhang c0ae6bbdbe [None][feat] EPD for Qwen3 VL (#10470 ) * Why? We would like to support EPD disaggregated serving for Qwen3 VL. * What? This commit adds such support, and extends existing unit tests for correctness checks. Some minor (protected) interface changes had to be made to the weight mapper as a side-effect. Signed-off-by: William Zhang <133824995+2ez4bz@users.noreply.github.com>		2026-01-08 06:45:54 -05:00
..
hf	[None][feat] EPD for Qwen3 VL (#10470 )	2026-01-08 06:45:54 -05:00
mistral	[None][fix] Mistral large 3 few code refine (#10405 )	2026-01-08 06:38:49 -05:00
__init__.py	[None][feat] support Qwen3-VL dense model in pytorch backend (#9060 )	2025-12-31 17:54:26 +09:00
auto_mapper.py	[TRTLLM-5493] Add core infrastructure to enable loading of custom checkpoint formats (#5372 )	2025-07-17 00:50:30 +08:00
base_checkpoint_loader.py	[TRTLLM-7136][feat] Update load_weights method to include mapping parameter in checkpoint loaders (#9583 )	2025-12-05 16:07:20 +01:00
base_config_loader.py	[TRTLLM-5493] Add core infrastructure to enable loading of custom checkpoint formats (#5372 )	2025-07-17 00:50:30 +08:00
base_weight_loader.py	[TRTLLM-7136][feat] Update load_weights method to include mapping parameter in checkpoint loaders (#9583 )	2025-12-05 16:07:20 +01:00
base_weight_mapper.py	[None][feat] EPD for Qwen3 VL (#10470 )	2026-01-08 06:45:54 -05:00