TensorRT-LLMs/docs/source/reference/multimodal-feature-support-matrix.md
dongfengy 367ff88a5e
[None][feat] Refactor llama4 for multimodal encoder IFB (#6844)
Signed-off-by: Dongfeng Yu <dongfengy@nvidia.com>
2025-08-28 13:22:19 -07:00

1.1 KiB

Multimodal Feature Support Matrix (PyTorch Backend)

Model CUDA Graph Encoder IFB KV Cache Reuse Chunked Prefill
Gemma 3 Yes Yes No No
HyperCLOVA Yes Yes No No
VILA Yes No No No
LLaVA-NeXT Yes Yes No No
Llama 4 Yes Yes No No
Mistral-Small-3.1 Yes Yes No No
Phi-4-multimodal Yes Yes No No
Qwen2-VL Yes Yes Yes No
Qwen2.5-VL Yes Yes Yes No