18 Commits (0cc5a806563a9264b7c7751f4ca50e2f700c0847)

Author SHA1 Message Date
Jared Van Bortel 007d469034 bert: fix layer norm epsilon value (#1946) 4 months ago
  Signed-off-by: Jared Van Bortel <jared@nomic.ai>
Jared Van Bortel 4fc4d94be4 fix chat-style prompt templates (#1970) 4 months ago
  Also use a new version of Mistral OpenOrca.
  Signed-off-by: Jared Van Bortel <jared@nomic.ai>
Jared Van Bortel bf493bb048 Mixtral crash fix and python bindings v2.2.0 (#1931) 5 months ago
  Signed-off-by: Jared Van Bortel <jared@nomic.ai>
Jared Van Bortel 061d1969f8 expose n_gpu_layers parameter of llama.cpp (#1890) 5 months ago
  Also dynamically limit the GPU layers and context length fields to the maximum supported by the model.
  Signed-off-by: Jared Van Bortel <jared@nomic.ai>
Jared Van Bortel 38c61493d2 backend: update to latest commit of llama.cpp Vulkan PR 5 months ago
  Signed-off-by: Jared Van Bortel <jared@nomic.ai>
Jared Van Bortel d1c56b8b28 Implement configurable context length (#1749) 7 months ago
Jared Van Bortel dfd8ef0186 backend: use ggml_new_graph for GGML backend v2 (#1719) 7 months ago
Adam Treat cce5fe2045 Fix macos build. 8 months ago
Adam Treat 371e2a5cbc LocalDocs version 2 with text embeddings. 8 months ago
cebtenzzre fd0c501d68 backend: support GGUFv3 (#1582) 8 months ago
Cebtenzzre 050e7f076e backend: port GPT-J to GGUF 9 months ago
Cebtenzzre 42bcb814b3 backend: port BERT to GGUF 9 months ago
Adam Treat d90d003a1d Latest rebase on llama.cpp with gguf support. 9 months ago
Aaron Miller 0bc2274869 bump llama.cpp version + needed fixes for that 10 months ago
Aaron Miller 1c4a244291 bump mem allocation a bit 12 months ago
Adam Treat ee4186d579 Fixup bert python bindings. 12 months ago
Adam Treat 0efdbfcffe Bert 12 months ago
Adam Treat ae8eb297ac Add sbert backend. 12 months ago