llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-07-03 01:30:23 +00:00

Files

T

AidanBeltonS fadde67135 Dequant improvements rebase (#8255 )

* Single load for half2

* Store scales in local mem

* Vec load quantized values

2024-07-03 09:55:34 +08:00

2024-06-26 18:33:02 +03:00

2024-07-02 12:18:10 -04:00

2024-07-03 09:55:34 +08:00

CMakeLists.txt

2024-06-26 21:34:14 +02:00

ggml_vk_generate_shaders.py

2024-06-26 18:33:02 +03:00