Commit Graph

3 Commits (7810b757c9120a93533fbcf56d169272b881d6bb)

Author  SHA1  Message  Date

Jared Van Bortel  061d1969f8  expose n_gpu_layers parameter of llama.cpp (#1890)  7 months ago
    Also dynamically limit the GPU layers and context length fields to the maximum supported by the model.

    Signed-off-by: Jared Van Bortel <jared@nomic.ai>

Jared Van Bortel  d1c56b8b28  Implement configurable context length (#1749)  9 months ago

Adam Treat  0efdbfcffe  Bert  1 year ago