Default Branch

a4107133a6 · llama : add guard for K/V rotation input when buffer is unallocated (#25215) · Updated 2026-07-04 20:37:38 +00:00

Branches

d93ff58322 · models : fix LFM2 tensors · Updated 2025-11-27 12:54:51 +00:00    kanshan

2644
1

05429433a1 · examples: add model-backend-compare tool to compare intermediate device tensors with CPU reference · Updated 2025-11-25 17:05:56 +00:00    kanshan

2666
1

72f80499ee · server : headers cleanup · Updated 2025-11-24 10:50:50 +00:00    kanshan

2726
5

722f9defe9 · vulkan: intel mmv fix attempt · Updated 2025-11-23 09:13:19 +00:00    kanshan

2688
1

6cdda87baf · ci : disable op offload in some tests · Updated 2025-11-20 15:16:50 +00:00    kanshan

2751
3

dba1cbceb3 · tune for RDNA3 · Updated 2025-11-16 19:21:22 +00:00    kanshan

2759
4

e6dbc81569 · metal : cap threadgroups size of set_rows · Updated 2025-11-10 14:17:09 +00:00    kanshan

2828
1

3ad533689c · ggml : remove KQ mask padding · Updated 2025-11-10 12:35:25 +00:00    kanshan

2830
1

2ef41855cf · convert : for FP8, use scale type to decide auto type · Updated 2025-11-07 03:55:53 +00:00    kanshan

2868
16

e996f3aef8 · convert : fix no-lazy dtypes from direct safetensors · Updated 2025-11-07 03:33:09 +00:00    kanshan

2868
3

128118fdbe · convert : use F32 for dequant of pack-quantized tensors · Updated 2025-11-07 02:59:32 +00:00    kanshan

2868
6

23b70f4f70 · Initial plan · Updated 2025-11-04 11:00:12 +00:00    kanshan

2896
1

d441c31b19 · metal : remove stray return · Updated 2025-11-02 16:24:00 +00:00    kanshan

2920
9

d7f794eadb · convert : avoid dequantizing mxfp4 for GPT-OSS · Updated 2025-10-24 11:56:26 +00:00    kanshan

3007
1

93fbd407f3 · Merge branch 'master' into compilade/convert-prequant · Updated 2025-10-23 18:23:12 +00:00    kanshan

3010
6

f0076dc5a0 · metal : adjust .get_alloc_size to be alloc friendly · Updated 2025-10-19 14:20:54 +00:00    kanshan

3040
1

96f9f391c7 · ggml : fix unaligned access in AMX code · Updated 2025-09-29 07:37:15 +00:00    kanshan

3220
1

a8b0089a5b · ggml : remove SVE paths · Updated 2025-09-28 17:26:03 +00:00    kanshan

3220
1

837b1b4563 · ggml : remove KQ mask padding · Updated 2025-09-28 15:10:17 +00:00    kanshan

3223
6

17ca6ed540 · Implement llama-pull tool · Updated 2025-09-20 16:25:21 +00:00    kanshan

3311
1