llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-06-26 06:10:19 +00:00

Files

T

Alexey Kopytko 581d020b12 SYCL: implement ggml_sycl_pool_vmm (#22862 )

* SYCL: implement ggml_sycl_pool_vmm

* Add an option to bypass VMM with GGML_SYCL_DISABLE_VMM

* Clean up debugging logging

* document GGML_SYCL_DISABLE_VMM

* Multi-stream MoE optimization

* Revert "Multi-stream MoE optimization"

This reverts commit 938929c3f1.

* Update common.hpp

Co-authored-by: Neo Zhang <zhang.jianyu@outlook.com>

* Flip GGML_SYCL_DISABLE_VMM to GGML_SYCL_ENABLE_VMM

* add logging for GGML_SYCL_ENABLE_VMM when extension is not available (SYCL_EXT_ONEAPI_VIRTUAL_MEM macro)

* Apply suggestions from code review

Co-authored-by: Alexey Kopytko <alexey@kopytko.com>

* Apply suggestion from @sanmai

* Apply suggestion from @sanmai

---------

Co-authored-by: Neo Zhang <zhang.jianyu@outlook.com>

2026-05-26 07:59:00 +03:00

snapdragon

snapdragon: bump toolchain docker to v0.7 to fix ui build issues (#23680 )

2026-05-25 10:57:43 -07:00

VirtGPU

ggml-virtgpu: Fix some build commands (#20341 )

2026-03-12 15:47:45 +08:00

BLIS.md

make : deprecate (#10514 )

2024-12-02 21:22:53 +02:00

CANN.md

CANN: update docker images to 8.5.0 and improve CANN.md (#20801 )