Commit Graph

16 Commits (6da62a62f062704cc96f32d75d411c4428f23a4a)

Author SHA1 Message Date
Jared Van Bortel bf493bb048
Mixtral crash fix and python bindings v2.2.0 (#1931)
Signed-off-by: Jared Van Bortel <jared@nomic.ai>
8 months ago
Jared Van Bortel 061d1969f8
expose n_gpu_layers parameter of llama.cpp (#1890)
Also dynamically limit the GPU layers and context length fields to the maximum supported by the model.

Signed-off-by: Jared Van Bortel <jared@nomic.ai>
8 months ago
Jared Van Bortel 38c61493d2 backend: update to latest commit of llama.cpp Vulkan PR
Signed-off-by: Jared Van Bortel <jared@nomic.ai>
8 months ago
Jared Van Bortel d1c56b8b28
Implement configurable context length (#1749) 9 months ago
Jared Van Bortel dfd8ef0186
backend: use ggml_new_graph for GGML backend v2 (#1719) 10 months ago
Adam Treat cce5fe2045 Fix macos build. 10 months ago
Adam Treat 371e2a5cbc LocalDocs version 2 with text embeddings. 10 months ago
cebtenzzre fd0c501d68
backend: support GGUFv3 (#1582) 11 months ago
Cebtenzzre 050e7f076e backend: port GPT-J to GGUF 12 months ago
Cebtenzzre 42bcb814b3 backend: port BERT to GGUF 12 months ago
Adam Treat d90d003a1d Latest rebase on llama.cpp with gguf support. 12 months ago
Aaron Miller 0bc2274869 bump llama.cpp version + needed fixes for that 1 year ago
Aaron Miller 1c4a244291 bump mem allocation a bit 1 year ago
Adam Treat ee4186d579 Fixup bert python bindings. 1 year ago
Adam Treat 0efdbfcffe Bert 1 year ago
Adam Treat ae8eb297ac Add sbert backend. 1 year ago