mirror of
https://github.com/ggml-org/llama.cpp.git
synced 2026-06-28 15:20:20 +00:00
0821c5fcfd
* server: in SSE mode, send HTTP headers when slot starts * ref to pr * stream should be false by default