llama.cpp

Mirror of https://github.com/ggml-org/llama.cpp.git, synced 2026-06-28 07:10:21 +00:00

Files

Reese Levine 35266573b9 ggml webgpu: actually add softmax, fix rms_norm offset (#16400)

* implement soft_max

* Fix soft_max data race

* Temporary fix, wait on each submit

2025-10-04 20:59:31 -07:00

2025-08-07 13:45:41 +02:00

2025-10-04 12:49:16 +03:00

2025-10-04 20:59:31 -07:00

.gitignore

2024-07-13 18:12:39 +02:00

CMakeLists.txt

2025-10-01 23:09:25 +02:00