gpt4all

Commit Graph

Author	SHA1	Message	Date
Adam Treat	ea66669cef	Switch to new models2.json for new gguf release and bump our version to 2.5.0.	12 months ago
Andriy Mulyar	a9668eb2e4	Added optional top_p and top_k	1 year ago
Elin Angelov	4d855afe97	Update README.md (#1260 ) * Update README.md Signed-off-by: Elin Angelov <me@zetxx.eu> * Update README.md Signed-off-by: Elin Angelov <me@zetxx.eu> * Update README.md Signed-off-by: Elin Angelov <me@zetxx.eu> * Changed wording a tiny bit again Signed-off-by: niansa/tuxifan <tuxifan@posteo.de> * Added missing space Signed-off-by: niansa/tuxifan <tuxifan@posteo.de> --------- Signed-off-by: Elin Angelov <me@zetxx.eu> Signed-off-by: niansa/tuxifan <tuxifan@posteo.de> Co-authored-by: niansa/tuxifan <tuxifan@posteo.de>	1 year ago
David Okpare	889c8d1758	Add embeddings endpoint for gpt4all-api (#1314 ) * Add embeddings endpoint * Add test for embedding endpoint	1 year ago
Zach Nussbaum	91a32c0e84	ci: pin (#1292 )	1 year ago
Andriy Mulyar	14f4b522d5	Allow you to monitor GPT4All-API with Sentry (#1271 )	1 year ago
Zach Nussbaum	b3f84c56e7	fix: don't pass around the same dict object (#1264 )	1 year ago
Andriy Mulyar	2befff83d6	top_p error in gpt4all-api	1 year ago
Andriy Mulyar	3d10110314	Moved model check into cpu only paths	1 year ago
Zach Nussbaum	8aba2c9009	GPU Inference Server (#1112 ) * feat: local inference server * fix: source to use bash + vars * chore: isort and black * fix: make file + inference mode * chore: logging * refactor: remove old links * fix: add new env vars * feat: hf inference server * refactor: remove old links * test: batch and single response * chore: black + isort * separate gpu and cpu dockerfiles * moved gpu to separate dockerfile * Fixed test endpoints * Edits to API. server won't start due to failed instantiation error * Method signature * fix: gpu_infer * tests: fix tests --------- Co-authored-by: Andriy Mulyar <andriy.mulyar@gmail.com>	1 year ago
Andriy Mulyar	58f0fcab57	Added health endpoint Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>	1 year ago
Brandon Beiler	fb576fbd7e	Update to gpt4all version 1.0.1. Implement the Streaming version of the completions endpoint. Implemented an openai python client test for the new streaming functionality. (#1129 ) Co-authored-by: Brandon <bbeiler@ridgelineintl.com>	1 year ago
Andriy Mulyar	13f0f546ed	Update makefile gpt4all-api Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>	1 year ago
Andriy Mulyar	390994ea5e	Update README.md to include inference example Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>	1 year ago
Andriy Mulyar	a67f8132e1	Update README.md Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>	1 year ago
Andriy Mulyar	633e2a2137	GPT4All API Scaffolding. Matches OpenAI OpenAPI spec for chats and completions (#839 ) * GPT4All API Scaffolding. Matches OpenAI OpenAI spec for engines, chats and completions * Edits for docker building * FastAPI app builds and pydantic models are accurate * Added groovy download into dockerfile * improved dockerfile * Chat completions endpoint edits * API uni test sketch * Working example of groovy inference with open ai api * Added lines to test * Set default to mpt	1 year ago
Richard Guo	02d1bdb0be	mono repo structure	1 year ago

17 Commits (a10f3aea5e2a26ce35195093a712b1addced7d6f)