mirror of https://github.com/ggml-org/llama.cpp.git
synced 2026-06-30 16:20:20 +00:00
1d6092fc72
* tests : add -INF blocks to the KQ mask in the FA tests

* cont : bump -INF block size to 64

Co-authored-by: Jeff Bolz <jbolz@nvidia.com>

* ggml : prevent division by zero in FA CPU op

---------

Co-authored-by: Jeff Bolz <jbolz@nvidia.com>