Commit Graph

17 Commits (a10f3aea5e2a26ce35195093a712b1addced7d6f)

Author SHA1 Message Date
Adam Treat ea66669cef Switch to new models2.json for new gguf release and bump our version to
2.5.0.
12 months ago
Andriy Mulyar a9668eb2e4 Added optional top_p and top_k 1 year ago
Elin Angelov 4d855afe97
Update README.md (#1260)
* Update README.md

Signed-off-by: Elin Angelov <me@zetxx.eu>

* Update README.md

Signed-off-by: Elin Angelov <me@zetxx.eu>

* Update README.md

Signed-off-by: Elin Angelov <me@zetxx.eu>

* Changed wording a tiny bit again

Signed-off-by: niansa/tuxifan <tuxifan@posteo.de>

* Added missing space

Signed-off-by: niansa/tuxifan <tuxifan@posteo.de>

---------

Signed-off-by: Elin Angelov <me@zetxx.eu>
Signed-off-by: niansa/tuxifan <tuxifan@posteo.de>
Co-authored-by: niansa/tuxifan <tuxifan@posteo.de>
1 year ago
David Okpare 889c8d1758
Add embeddings endpoint for gpt4all-api (#1314)
* Add embeddings endpoint

* Add test for embedding endpoint
1 year ago
Zach Nussbaum 91a32c0e84
ci: pin (#1292) 1 year ago
Andriy Mulyar 14f4b522d5
Allow you to monitor GPT4All-API with Sentry (#1271) 1 year ago
Zach Nussbaum b3f84c56e7
fix: don't pass around the same dict object (#1264) 1 year ago
Andriy Mulyar 2befff83d6 top_p error in gpt4all-api 1 year ago
Andriy Mulyar 3d10110314 Moved model check into cpu only paths 1 year ago
Zach Nussbaum 8aba2c9009
GPU Inference Server (#1112)
* feat: local inference server

* fix: source to use bash + vars

* chore: isort and black

* fix: make file + inference mode

* chore: logging

* refactor: remove old links

* fix: add new env vars

* feat: hf inference server

* refactor: remove old links

* test: batch and single response

* chore: black + isort

* separate gpu and cpu dockerfiles

* moved gpu to separate dockerfile

* Fixed test endpoints

* Edits to API. server won't start due to failed instantiation error

* Method signature

* fix: gpu_infer

* tests: fix tests

---------

Co-authored-by: Andriy Mulyar <andriy.mulyar@gmail.com>
1 year ago
Andriy Mulyar 58f0fcab57
Added health endpoint
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
1 year ago
Brandon Beiler fb576fbd7e
Update to gpt4all version 1.0.1. Implement the Streaming version of the completions endpoint. Implemented an openai python client test for the new streaming functionality. (#1129)
Co-authored-by: Brandon <bbeiler@ridgelineintl.com>
1 year ago
Andriy Mulyar 13f0f546ed
Update makefile gpt4all-api
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
1 year ago
Andriy Mulyar 390994ea5e
Update README.md to include inference example
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
1 year ago
Andriy Mulyar a67f8132e1
Update README.md
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
1 year ago
Andriy Mulyar 633e2a2137
GPT4All API Scaffolding. Matches OpenAI OpenAPI spec for chats and completions (#839)
* GPT4All API Scaffolding. Matches OpenAI OpenAI spec for engines, chats and completions

* Edits for docker building

* FastAPI app builds and pydantic models are accurate

* Added groovy download into dockerfile

* improved dockerfile

* Chat completions endpoint edits

* API uni test sketch

* Working example of groovy inference with open ai api

* Added lines to test

* Set default to mpt
1 year ago
Richard Guo 02d1bdb0be mono repo structure 1 year ago