mirror of https://github.com/ggml-org/llama.cpp.git
synced 2026-07-04 10:10:19 +00:00
721354fbdf
* server: real-time model load progress tracking via /models/sse
* update docs
* server: move model download to child process
* rm unused
* fix most problems
* clean up
* nit fixes
* fix test case
* do not detach() thread
* shorter MODEL_DOWNLOAD_TIMEOUT in test
* throttle