llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-07-01 08:40:19 +00:00

Files

T

Georgi Gerganov 1f63e75f3b metal : use less stack memory in FA kernel (#14088 )

* metal : use less stack memory in FA kernel

ggml-ci

* cont : fix BF16 variant

2025-06-09 23:05:02 +03:00

2025-05-29 12:50:25 +02:00

2025-06-01 13:43:57 +03:00

2025-06-09 23:05:02 +03:00

.gitignore

2024-07-13 18:12:39 +02:00

CMakeLists.txt

2025-06-09 16:47:13 +02:00