mirror of https://github.com/ggml-org/llama.cpp.git
synced 2026-07-01 16:50:20 +00:00
36680f6e40
1. guess n_layers;
2. relax warnings on context size;
3. add a note that its derivations are also supported.

Co-authored-by: Judd <foldl@boxvest.com>
embedding
TODO