mirror of https://github.com/ggml-org/llama.cpp.git
synced 2026-06-25 22:00:21 +00:00
a976ff081b
* tests: add end-to-end tests per model architecture
* fixup for rebase
* fix use-after-free in llama-model-loader.cpp
* fix CI
* fix WebGPU
* fix CI
* disable CI for macOS-latest-cmake-arm64
* use expert_weights_scale only if != 0.0f
* comments