Files
llama.cpp/src
Sigbjørn Skjæret 6ab397e12b graph : support non-contiguous Q in build_attn_mha (#15908)
* support non-contiguous Q in build_attn_mha

* Update src/llama-graph.cpp

ggml-ci

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2025-09-10 19:08:59 +02:00
..
2025-09-05 17:32:39 -06:00
2025-09-05 17:32:39 -06:00
2025-09-05 17:32:39 -06:00
2025-09-05 17:32:39 -06:00
2025-07-15 21:54:22 +02:00