TensorRT-LLMs/docs/source/reference/multimodal-feature-support-matrix.md
Chang Liu faa2f46554
[TRTLLM-5059][feat] Enable KV-cache reuse and add E2E tests for llava-next (#7349)
Signed-off-by: Chang Liu (Enterprise Products) <9713593+chang-l@users.noreply.github.com>
2025-09-09 14:51:36 -04:00

14 lines
1.1 KiB
Markdown

# Multimodal Feature Support Matrix (PyTorch Backend)
| Model | CUDA Graph | Encoder IFB | KV Cache Reuse | Chunked Prefill |
| :----------------- | :--------- | :------------------ | :------------- | :-------------- |
| Gemma 3 | Yes | Yes | N/A | N/A |
| HyperCLOVA | Yes | Yes | No | No |
| VILA | Yes | No | No | No |
| LLaVA-NeXT | Yes | Yes | Yes | No |
| Llama 4 | Yes | Yes | No | No |
| Mistral-Small-3.1 | Yes | Yes | No | No |
| Phi-4-multimodal | Yes | Yes | No | No |
| Qwen2-VL | Yes | Yes | Yes | No |
| Qwen2.5-VL | Yes | Yes | Yes | No |