Adrien Gallouët
|
e3a74b2990
|
bench : add --offline (#24511)
* bench : add --offline
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
* Add default
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
|
2026-06-16 08:26:05 +02:00 |
|
Ruixiang Wang
|
689a9a470e
|
server-bench : add speed-bench for speculative decoding benchmarking (#23869)
* spec: add speed-bench support for benchmarking
* speed-bench : add trailing newline to requirements.txt
* speed-bench : bump datasets to 4.8.0 to fix ty check
* server-bench : remove now-unused type: ignore after datasets bump
|
2026-05-29 23:09:47 +02:00 |
|
Sigbjørn Skjæret
|
29b28a9824
|
ci : switch from pyright to ty (#20826)
* type fixes
* switch to ty
* tweak rules
* tweak more rules
* more tweaks
* final tweak
* use common import-not-found rule
|
2026-03-21 08:54:34 +01:00 |
|
Georgi Gerganov
|
9ebebef62f
|
llama : remove KV cache defragmentation logic (#15473)
ggml-ci
|
2025-08-22 12:22:13 +03:00 |
|
Diego Devesa
|
1d36b3670b
|
llama : move end-user examples to tools directory (#13249)
* llama : move end-user examples to tools directory
---------
Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
|
2025-05-02 20:27:13 +02:00 |
|