TensorRT-LLMs/docs/source/reference/multimodal-feature-support-matrix.md
Chang Liu 47e37755a3
[TRTLLM-6903][feat] Support chunked prefill for multimodal models (#6843)
Signed-off-by: Chang Liu (Enterprise Products) <9713593+chang-l@users.noreply.github.com>
2025-09-14 20:10:10 -07:00

14 lines
1.1 KiB
Markdown

# Multimodal Feature Support Matrix (PyTorch Backend)
| Model | CUDA Graph | Encoder IFB | KV Cache Reuse | Chunked Prefill |
| :----------------- | :--------- | :------------------ | :------------- | :-------------- |
| Gemma 3 | Yes | Yes | N/A | N/A |
| HyperCLOVA | Yes | Yes | No | No |
| VILA | Yes | No | No | No |
| LLaVA-NeXT | Yes | Yes | Yes | Yes |
| Llama 4 | Yes | Yes | No | No |
| Mistral-Small-3.1 | Yes | Yes | No | No |
| Phi-4-multimodal | Yes | Yes | No | No |
| Qwen2-VL | Yes | Yes | Yes | Yes |
| Qwen2.5-VL | Yes | Yes | Yes | Yes |