llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-07-02 09:10:21 +00:00

Files

T

Georgi Gerganov d197545530 llama : bump max layers from 256 to 512 (#8530 )

* llama : bump max layers from 256 to 512

* llama : replace asserts with exceptions

2024-07-19 16:50:47 +03:00

llama.h

2024-07-19 16:50:47 +03:00