TensorRT-LLMs/docs/source/reference/multimodal-feature-support-matrix.md
dongfengy 367ff88a5e
[None][feat] Refactor llama4 for multimodal encoder IFB (#6844)
Signed-off-by: Dongfeng Yu <dongfengy@nvidia.com>
2025-08-28 13:22:19 -07:00

14 lines
1.1 KiB
Markdown

# Multimodal Feature Support Matrix (PyTorch Backend)
| Model | CUDA Graph | Encoder IFB | KV Cache Reuse | Chunked Prefill |
| :----------------- | :--------- | :------------------ | :------------- | :-------------- |
| Gemma 3 | Yes | Yes | No | No |
| HyperCLOVA | Yes | Yes | No | No |
| VILA | Yes | No | No | No |
| LLaVA-NeXT | Yes | Yes | No | No |
| Llama 4 | Yes | Yes | No | No |
| Mistral-Small-3.1 | Yes | Yes | No | No |
| Phi-4-multimodal | Yes | Yes | No | No |
| Qwen2-VL | Yes | Yes | Yes | No |
| Qwen2.5-VL | Yes | Yes | Yes | No |