llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-06-28 15:20:20 +00:00

Files

T

Xuan-Son Nguyen 72e60f500d mtmd: add chunks and fix preproc for qwen3a (#23073 )

* mtmd: add chunks and fix preproc for qwen3a

* add attn_mask

* limit mtmd_chunk size (avoid blow up memory)

* correct audio tokens

* re-order the set_input case

* remove attn_mask

2026-05-15 19:32:47 +02:00

cogvlm.cpp

mtmd: add clip_graph::build_mm() (#20751 )

2026-03-19 13:11:39 +01:00

conformer.cpp

mtmd: add clip_graph::build_mm() (#20751 )

2026-03-19 13:11:39 +01:00

deepseekocr.cpp

mtmd: Add DeepSeekOCR Support (#17400 )

2026-03-25 19:57:40 +01:00

dotsocr.cpp

mtmd: support dots.ocr (#17575 )

2026-04-09 12:16:38 +02:00

gemma4a.cpp

mtmd: add Gemma 4 audio conformer encoder support (#21421 )

2026-04-12 14:15:26 +02:00

gemma4v.cpp

model, mtmd: fix gguf conversion for audio/vision mmproj (#21309 )

2026-04-02 17:10:32 +02:00

glm4v.cpp

mtmd: Add DeepSeekOCR Support (#17400 )

2026-03-25 19:57:40 +01:00

granite-speech.cpp

mtmd: add granite-speech support (ibm-granite/granite-4.0-1b-speech) (#22101 )

2026-05-06 14:40:59 +02:00

hunyuanocr.cpp

mtmd, llama : Update HunyuanVL vision-language model support (#22037 )

2026-04-22 11:58:43 +02:00

internvl.cpp

clip: move model cgraphs into their own files (#17965 )

2025-12-12 21:14:48 +01:00

kimik25.cpp

model: Add Kimi-K2.5 support (#19170 )

2026-02-11 16:47:30 +01:00

kimivl.cpp

clip: move model cgraphs into their own files (#17965 )

2025-12-12 21:14:48 +01:00

llama4.cpp

mtmd: add clip_graph::build_mm() (#20751 )

2026-03-19 13:11:39 +01:00

llava.cpp

mtmd: add clip_graph::build_mm() (#20751 )

2026-03-19 13:11:39 +01:00

mimovl.cpp

mtmd: add MiMo v2.5 vision (#22883 )

2026-05-12 11:11:14 +02:00

minicpmv.cpp

mtmd : support MiniCPM-V 4.6 (#22529 )

2026-05-06 21:54:09 +02:00

mobilenetv5.cpp

mtmd: add clip_graph::build_mm() (#20751 )

2026-03-19 13:11:39 +01:00

models.h

mtmd: add MiMo v2.5 vision (#22883 )

2026-05-12 11:11:14 +02:00

nemotron-v2-vl.cpp

mtmd : Add Nemotron Nano 12B v2 VL support (#19547 )

2026-02-14 14:07:00 +01:00

paddleocr.cpp

model: Add PaddleOCR-VL model support (#18825 )

2026-02-19 17:05:25 +01:00

pixtral.cpp

mtmd: add clip_graph::build_mm() (#20751 )

2026-03-19 13:11:39 +01:00

qwen2vl.cpp

mtmd: add clip_graph::build_mm() (#20751 )

2026-03-19 13:11:39 +01:00

qwen3a.cpp

mtmd: add chunks and fix preproc for qwen3a (#23073 )

2026-05-15 19:32:47 +02:00

qwen3vl.cpp

mtmd: add clip_graph::build_mm() (#20751 )

2026-03-19 13:11:39 +01:00

siglip.cpp

mtmd: Add DeepSeekOCR Support (#17400 )

2026-03-25 19:57:40 +01:00

step3vl.cpp

model : support step3-vl-10b (#21287 )

2026-04-08 09:51:31 +02:00

whisper-enc.cpp

mtmd : add MERaLiON-2 multimodal audio support (#21756 )

2026-04-11 14:15:48 +02:00

yasa2.cpp

mtmd: Add support for Reka Edge 2603 (#21616 )

2026-04-21 20:02:49 +02:00

youtuvl.cpp

mtmd: add clip_graph::build_mm() (#20751 )

2026-03-19 13:11:39 +01:00