llama.cpp/tools at b9196

kanshan/llama.cpp

Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2026-06-28 15:20:20 +00:00

Rares Vernica 1a68ec9378 server : honor --embd-normalize CLI arg (#23125)

The --embd-normalize flag was registered only for the embedding and debug
examples, so llama-server rejected it and the /embedding handler used a
hard-coded default of 2 (L2). Add LLAMA_EXAMPLE_SERVER to the flag's
example set and read params.embd_normalize as the handler's default. The
per-request "embd_normalize" body field continues to override.
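
The precedence described in this commit message can be sketched as follows. This is a minimal illustration in Python, not the server's code: `resolve_embd_normalize` and `normalize` are hypothetical helper names, `cli_default` stands in for `params.embd_normalize`, and the meaning of values other than 2 (here -1 for "no normalization" and larger values for a p-norm) is an assumption based on the flag's help text as commonly documented.

```python
def resolve_embd_normalize(body, cli_default=2):
    """Pick the normalization mode for one request.

    The per-request "embd_normalize" field wins; otherwise the value
    given on the command line (cli_default, hypothetical name) is used.
    """
    return int(body.get("embd_normalize", cli_default))


def normalize(vec, mode):
    """Normalize an embedding vector.

    mode == 2 is L2 (Euclidean), as stated in the commit message.
    Assumptions for this sketch: -1 means "leave as-is", and any other
    mode >= 1 is treated as a p-norm with p = mode.
    """
    if mode == -1:
        return list(vec)
    if mode < 1:
        raise ValueError("mode not modeled in this sketch")
    norm = sum(abs(x) ** mode for x in vec) ** (1.0 / mode)
    if norm == 0:
        return [0.0] * len(vec)
    return [x / norm for x in vec]


if __name__ == "__main__":
    embedding = [3.0, 4.0]
    # No field in the body: the CLI value is the default.
    print(normalize(embedding, resolve_embd_normalize({}, cli_default=2)))
    # The body field overrides the CLI value.
    print(normalize(embedding, resolve_embd_normalize({"embd_normalize": -1}, cli_default=2)))
```

Before the change, the `cli_default` in this picture was effectively fixed at 2 regardless of what was passed on the command line; after it, the command-line value supplies the default and the request body can still override it.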

2026-05-17 09:39:04 +03:00

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

llama + spec: MTP Support (#22673)

2026-05-16 20:06:23 +08:00

llama + spec: MTP Support (#22673)

2026-05-16 20:06:23 +08:00

cvector-generator

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

fit-params : refactor + add option to output estimated memory per device (#22171)

2026-04-21 09:54:36 +03:00

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

spec : refactor params (#22397)

2026-04-28 09:07:33 +03:00

mtmd: add chunks and fix preproc for qwen3a (#23073)

2026-05-15 19:32:47 +02:00

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

fit-params : refactor + add option to output estimated memory per device (#22171)

2026-04-21 09:54:36 +03:00

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

fix: rpc-server cache may not work in Windows environments (#22394)

2026-04-27 17:25:09 +03:00

server : honor --embd-normalize CLI arg (#23125)

2026-05-17 09:39:04 +03:00

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

logs : reduce (#23021)

2026-05-14 13:05:52 +03:00

webui: support video files as input (#22830)

2026-05-17 02:13:44 +02:00

CMakeLists.txt

ui: Restructure repo to use tools/ui folder and ui / UI / llama-ui / LLAMA_UI naming (#23064)

2026-05-16 02:02:40 +02:00