llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-07-01 08:40:19 +00:00

Files

T

uvos 34c961b181 CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (#12315 )

When fattn-wmma was ported over to warp64 various bits that also touch fattn-vec where converted to
selectable warp size, however the fattn-vec kernels dont work with 64 wide warps for now, so we need
to avoid launching them with parameters for warp64

2025-03-12 10:14:11 +01:00

cmake

cmake: Fix ggml backend dependencies and installation (#11818 )

2025-02-27 09:42:48 +02:00

include

ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions (#12154 )

2025-03-06 02:26:10 +01:00

src

CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (#12315 )

2025-03-12 10:14:11 +01:00

.gitignore

vulkan : cmake integration (#8119 )

2024-07-13 18:12:39 +02:00

CMakeLists.txt

opencl: use OpenCL C standard supported by the device (#12221 )

2025-03-10 09:57:00 -07:00