TensorRT-LLMs/docs/source/reference/multimodal-feature-support-matrix.md
Chang Liu faa2f46554
[TRTLLM-5059][feat] Enable KV-cache reuse and add E2E tests for llava-next (#7349)
Signed-off-by: Chang Liu (Enterprise Products) <9713593+chang-l@users.noreply.github.com>
2025-09-09 14:51:36 -04:00

1.1 KiB

Multimodal Feature Support Matrix (PyTorch Backend)

Model CUDA Graph Encoder IFB KV Cache Reuse Chunked Prefill
Gemma 3 Yes Yes N/A N/A
HyperCLOVA Yes Yes No No
VILA Yes No No No
LLaVA-NeXT Yes Yes Yes No
Llama 4 Yes Yes No No
Mistral-Small-3.1 Yes Yes No No
Phi-4-multimodal Yes Yes No No
Qwen2-VL Yes Yes Yes No
Qwen2.5-VL Yes Yes Yes No