mirror of
https://github.com/ggml-org/llama.cpp.git
synced 2026-07-01 08:40:19 +00:00
34c961b181
When fattn-wmma was ported over to warp64 various bits that also touch fattn-vec where converted to selectable warp size, however the fattn-vec kernels dont work with 64 wide warps for now, so we need to avoid launching them with parameters for warp64