llama.cpp/tools at b9616

kanshan/llama.cpp

Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2026-06-28 15:20:20 +00:00

Georgi Gerganov ebc10770ac server : fix reasoning budget WebUI precedence over model.ini (#24517)

When reasoning-budget is set in model.ini, the per-request
thinking_budget_tokens from the WebUI was ignored because the
model.ini value took unconditional precedence.

Swap the precedence so the WebUI per-request value is checked
first, with the model.ini value serving as a fallback default.
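
The precedence described above can be sketched as follows. This is a minimal illustration only, not the actual server code: the function name `resolve_reasoning_budget`, its parameters, and the built-in default of `-1` are hypothetical and chosen for the example. Only the ordering (per-request value first, model.ini value as fallback) comes from the commit message.

```cpp
#include <optional>

// Hypothetical helper, not taken from the llama.cpp sources.
// request_budget:  thinking_budget_tokens sent by the WebUI for this request, if any
// ini_budget:      reasoning-budget configured in model.ini, if any
// builtin_default: value used when neither source is set (assumed for this sketch)
inline int resolve_reasoning_budget(std::optional<int> request_budget,
                                    std::optional<int> ini_budget,
                                    int builtin_default = -1) {
    if (request_budget.has_value()) {
        return *request_budget;  // the per-request value is checked first
    }
    if (ini_budget.has_value()) {
        return *ini_budget;      // model.ini serves as the fallback default
    }
    return builtin_default;      // neither source provided a value
}
```

With this ordering, a request that carries its own budget overrides the configured one, while a request that omits it still picks up the model.ini value. Before the fix, the two checks were effectively in the opposite order, so the model.ini value always won.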

Assisted-by: pi:llama.cpp/Qwen3.6-27B

2026-06-12 17:59:56 +03:00

cmake : add install() for impl libraries + fix apple builds (#23511)

2026-05-22 11:46:26 +03:00

mtmd : add video input support (#24269)

2026-06-08 14:40:12 +03:00

completion : remove useless statics (#24226)

2026-06-06 12:16:16 +02:00

cvector-generator

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

cmake : add install() for impl libraries + fix apple builds (#23511)

2026-05-22 11:46:26 +03:00

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

Move duplicated imatrix code into single common imatrix-loader.cpp (#22445)

2026-06-04 17:45:40 +02:00

Support -fa auto in llama-bench (#23714)

2026-05-31 02:03:57 +05:30

graph: Fix granite speech model inference by applying embedding scale when deepstack is not used (#24357)

2026-06-09 19:46:27 +02:00

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

perplexity : fix format specifier in LOG_ERR (#23788)

2026-05-28 10:34:58 +03:00

docs: Update quantization readme (#24133)

2026-06-05 12:21:26 +02:00

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

fix: rpc-server cache may not work in Windows environments (#22394)

2026-04-27 17:25:09 +03:00

server : fix reasoning budget WebUI precedence over model.ini (#24517)

2026-06-12 17:59:56 +03:00

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

logs : reduce (#23021)

2026-05-14 13:05:52 +03:00

ui: PWA support (#23871)

2026-06-12 15:53:26 +02:00

CMakeLists.txt

cmake: skip cvector-generator and export-lora when CPU backend is disabled (#24053)

2026-06-04 13:13:19 +03:00