TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-22 02:35:21 +08:00

William Zhang ca9537e17c [TRTLLM-10858][feat] Multi-image support for EPD disagg (#11264)	2026-02-11 20:50:00 -08:00

* Why? Prior to this commit, we only supported a single multimodal input for E/P/D disaggregated serving.
* What? This commit does a minor refactor of the multimodal embedding handles that cross process boundaries to enable this. Existing unit tests are updated accordingly to test this. The `RequestOutput` has its `mm_embedding_handle` replaced in favor of `disaggregated_params`, addressing a previous TODO.

Signed-off-by: William Zhang <133824995+2ez4bz@users.noreply.github.com>
test_external_embedding.py	[None][fix] InputProcessor config naming convention fix (#8705)	2025-11-03 22:29:21 -08:00
test_find_num_image_tokens.py	[TRTLLM-10858][feat] Multi-image support for EPD disagg (#11264)	2026-02-11 20:50:00 -08:00
test_fuse_input_embeds.py	[TRTLLM-7440][fix] Split `fused_input_embed` to separate out host sync (#7280)	2025-09-06 23:11:39 -04:00
test_mm_encoder_standalone.py	[TRTLLM-10858][feat] Multi-image support for EPD disagg (#11264)	2026-02-11 20:50:00 -08:00
test_multimodal_runtime.py	[TRTLLM-6903][feat] Support chunked prefill for multimodal models (#6843)	2025-09-14 20:10:10 -07:00
test_share_multiparams.py	[TRTLLM-7385][feat] Optimize Qwen2/2.5-VL performance (#7250)	2025-09-22 03:40:02 -07:00