llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-06-28 15:20:20 +00:00

Files

T

Aman Gupta de6f727aae llama: limit max outputs of llama_context (#23861 )

* llama: save more VRAM by reserving n_outputs == n_seqs when possible

* add n_outputs_per_seq

* move n_outputs_max to server-context

* change ubatch to batch everywhere

2026-06-01 18:01:38 +03:00

batched-bench

cmake : add install() for impl libraries + fix apple builds (#23511 )

2026-05-22 11:46:26 +03:00

cli

app: add llama update self updater (#23865 )

2026-05-29 23:02:40 +02:00

completion

app: add llama update self updater (#23865 )

2026-05-29 23:02:40 +02:00

cvector-generator

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

export-lora

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

fit-params

cmake : add install() for impl libraries + fix apple builds (#23511 )

2026-05-22 11:46:26 +03:00

gguf-split

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

imatrix

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

llama-bench

Support -fa auto in llama-bench (#23714 )

2026-05-31 02:03:57 +05:30

mtmd

model: Add EXAONE 4.5 implementations (#21733 )

2026-06-01 11:48:53 +02:00

parser

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

perplexity

perplexity : fix format specifier in LOG_ERR (#23788 )

2026-05-28 10:34:58 +03:00

quantize

cmake : add install() for impl libraries + fix apple builds (#23511 )

2026-05-22 11:46:26 +03:00

results

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

rpc

fix: rpc-server cache may not work in Windows environments (#22394 )

2026-04-27 17:25:09 +03:00

server

llama: limit max outputs of llama_context (#23861 )

2026-06-01 18:01:38 +03:00

tokenize

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

tts

logs : reduce (#23021 )

2026-05-14 13:05:52 +03:00

ui: fix ETag truncation with MSVC compiler (#23917 )

2026-05-31 11:21:23 +02:00

CMakeLists.txt

ui: Restructure repo to use tools/ui folder and ui / UI / llama-ui / LLAMA_UI naming (#23064 )

2026-05-16 02:02:40 +02:00