mirror of
https://github.com/ggml-org/llama.cpp.git
synced 2026-07-02 01:00:20 +00:00
d1b34251bc
* spec: add DFlash v2 support * dflash: support sliding window attention per layer_types * docs: add dflash section --------- Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>