gpt4all

mirror of https://github.com/nomic-ai/gpt4all synced 2024-11-06 09:20:33 +00:00

Author	SHA1	Message	Date
Jared Van Bortel	061d1969f8	expose n_gpu_layers parameter of llama.cpp (#1890 ) Also dynamically limit the GPU layers and context length fields to the maximum supported by the model. Signed-off-by: Jared Van Bortel <jared@nomic.ai>	2024-01-31 14:17:44 -05:00
Jared Van Bortel	b881598166	py: improve README (#1860 ) Signed-off-by: Jared Van Bortel <jared@nomic.ai>	2024-01-21 19:53:55 -05:00
Jared Van Bortel	03a9f0bedf	csharp: update C# bindings to work with GGUF (#1651 )	2024-01-16 14:33:41 -05:00
Jared Van Bortel	f8564398fc	minor change to trigger CircleCI	2024-01-12 16:13:46 -05:00
Jared Van Bortel	eef604fd64	python: release bindings version 2.1.0 The backend has a breaking change for Falcon and MPT models, so we need to make a new release.	2024-01-12 09:38:16 -05:00
Daniel Salvatierra	c72c73a94f	app.py: add --device option for GPU support (#1769 ) Signed-off-by: Daniel Salvatierra <dsalvat1@gmail.com> Co-authored-by: Jared Van Bortel <jared@nomic.ai>	2023-12-20 16:01:03 -05:00
Jared Van Bortel	d1c56b8b28	Implement configurable context length (#1749 )	2023-12-16 17:58:15 -05:00
Jacob Nguyen	7aa0f779de	Update mkdocs.yml (#1759 ) update doc routing	2023-12-15 13:37:29 -06:00
Jacob Nguyen	a1f27072c2	fix/macm1ts (#1746 ) * make runtime library backend universal searchable * corepack enable * fix * pass tests * simpler * add more jsdoc * fix testS * fix up circle ci * bump version * remove false positive warning * add disclaimer * update readme * revert * update ts docs --------- Co-authored-by: Matthew Nguyen <matthewpnguyen@Matthews-MacBook-Pro-7.local>	2023-12-15 12:44:39 -06:00
Jared Van Bortel	778264fbab	python: don't use importlib as_file for a directory The only reason to use as_file is to support copying a file from a frozen package. We don't currently support this anyway, and as_file isn't supported until Python 3.9, so get rid of it. Fixes #1605	2023-12-11 13:35:56 -05:00
aj-gameon	7facb8207b	docs: golang --recurse-submodules (#1720 ) Co-authored-by: aj-gameon <aj@gameontechnology.com>	2023-12-11 12:58:58 -05:00
AT	84749a4ced	Update gpt4all_chat.md Signed-off-by: AT <manyoso@users.noreply.github.com>	2023-11-21 12:21:43 -05:00
AT	f1c58d0e2c	Update gpt4all_chat.md Signed-off-by: AT <manyoso@users.noreply.github.com>	2023-11-21 11:55:14 -05:00
Jared Van Bortel	d4ce9f4a7c	llmodel_c: improve quality of error messages (#1625 )	2023-11-07 11:20:14 -05:00
aj-gameon	8fabf0be4a	Updated readme for correct install instructions (#1607 ) Co-authored-by: aj-gameon <aj@gameontechnology.com>	2023-11-03 11:21:44 -04:00
Jacob Nguyen	45d76d6234	ts/tooling (#1602 )	2023-11-02 16:25:33 -05:00
Jacob Nguyen	da95bcfb4b	vulkan support for typescript bindings, gguf support (#1390 ) * adding some native methods to cpp wrapper * gpu seems to work * typings and add availibleGpus method * fix spelling * fix syntax * more * normalize methods to conform to py * remove extra dynamic linker deps when building with vulkan * bump python version (library linking fix) * Don't link against libvulkan. * vulkan python bindings on windows fixes * Bring the vulkan backend to the GUI. * When device is Auto (the default) then we will only consider discrete GPU's otherwise fallback to CPU. * Show the device we're currently using. * Fix up the name and formatting. * init at most one vulkan device, submodule update fixes issues w/ multiple of the same gpu * Update the submodule. * Add version 2.4.15 and bump the version number. * Fix a bug where we're not properly falling back to CPU. * Sync to a newer version of llama.cpp with bugfix for vulkan. * Report the actual device we're using. * Only show GPU when we're actually using it. * Bump to new llama with new bugfix. * Release notes for v2.4.16 and bump the version. * Fallback to CPU more robustly. * Release notes for v2.4.17 and bump the version. * Bump the Python version to python-v1.0.12 to restrict the quants that vulkan recognizes. * Link against ggml in bin so we can get the available devices without loading a model. * Send actual and requested device info for those who have opt-in. * Actually bump the version. * Release notes for v2.4.18 and bump the version. * Fix for crashes on systems where vulkan is not installed properly. * Release notes for v2.4.19 and bump the version. * fix typings and vulkan build works on win * Add flatpak manifest * Remove unnecessary stuffs from manifest * Update to 2.4.19 * appdata: update software description * Latest rebase on llama.cpp with gguf support. * macos build fixes * llamamodel: metal supports all quantization types now * gpt4all.py: GGUF * pyllmodel: print specific error message * backend: port BERT to GGUF * backend: port MPT to GGUF * backend: port Replit to GGUF * backend: use gguf branch of llama.cpp-mainline * backend: use llamamodel.cpp for StarCoder * conversion scripts: cleanup * convert scripts: load model as late as possible * convert_mpt_hf_to_gguf.py: better tokenizer decoding * backend: use llamamodel.cpp for Falcon * convert scripts: make them directly executable * fix references to removed model types * modellist: fix the system prompt * backend: port GPT-J to GGUF * gpt-j: update inference to match latest llama.cpp insights - Use F16 KV cache - Store transposed V in the cache - Avoid unnecessary Q copy Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> ggml upstream commit 0265f0813492602fec0e1159fe61de1bf0ccaf78 * chatllm: grammar fix * convert scripts: use bytes_to_unicode from transformers * convert scripts: make gptj script executable * convert scripts: add feed-forward length for better compatiblilty This GGUF key is used by all llama.cpp models with upstream support. * gptj: remove unused variables * Refactor for subgroups on mat * vec kernel. * Add q6_k kernels for vulkan. * python binding: print debug message to stderr * Fix regenerate button to be deterministic and bump the llama version to latest we have for gguf. * Bump to the latest fixes for vulkan in llama. * llamamodel: fix static vector in LLamaModel::endTokens * Switch to new models2.json for new gguf release and bump our version to 2.5.0. * Bump to latest llama/gguf branch. * chat: report reason for fallback to CPU * chat: make sure to clear fallback reason on success * more accurate fallback descriptions * differentiate between init failure and unsupported models * backend: do not use Vulkan with non-LLaMA models * Add q8_0 kernels to kompute shaders and bump to latest llama/gguf. * backend: fix build with Visual Studio generator Use the $<CONFIG> generator expression instead of CMAKE_BUILD_TYPE. This is needed because Visual Studio is a multi-configuration generator, so we do not know what the build type will be until `cmake --build` is called. Fixes #1470 * remove old llama.cpp submodules * Reorder and refresh our models2.json. * rebase on newer llama.cpp * python/embed4all: use gguf model, allow passing kwargs/overriding model * Add starcoder, rift and sbert to our models2.json. * Push a new version number for llmodel backend now that it is based on gguf. * fix stray comma in models2.json Signed-off-by: Aaron Miller <apage43@ninjawhale.com> * Speculative fix for build on mac. * chat: clearer CPU fallback messages * Fix crasher with an empty string for prompt template. * Update the language here to avoid misunderstanding. * added EM German Mistral Model * make codespell happy * issue template: remove "Related Components" section * cmake: install the GPT-J plugin (#1487) * Do not delete saved chats if we fail to serialize properly. * Restore state from text if necessary. * Another codespell attempted fix. * llmodel: do not call magic_match unless build variant is correct (#1488) * chatllm: do not write uninitialized data to stream (#1486) * matmat for q4_0, q8_0 do not process prompts on gpu yet * python: support Path in GPT4All.__init__ (#1462) * llmodel: print an error if the CPU does not support AVX (#1499) * python bindings should be quiet by default * disable llama.cpp logging unless GPT4ALL_VERBOSE_LLAMACPP envvar is nonempty * make verbose flag for retrieve_model default false (but also be overridable via gpt4all constructor) should be able to run a basic test: ```python import gpt4all model = gpt4all.GPT4All('/Users/aaron/Downloads/rift-coder-v0-7b-q4_0.gguf') print(model.generate('def fib(n):')) ``` and see no non-model output when successful * python: always check status code of HTTP responses (#1502) * Always save chats to disk, but save them as text by default. This also changes the UI behavior to always open a 'New Chat' and setting it as current instead of setting a restored chat as current. This improves usability by not requiring the user to wait if they want to immediately start chatting. * Update README.md Signed-off-by: umarmnaq <102142660+umarmnaq@users.noreply.github.com> * fix embed4all filename https://discordapp.com/channels/1076964370942267462/1093558720690143283/1161778216462192692 Signed-off-by: Aaron Miller <apage43@ninjawhale.com> * Improves Java API signatures maintaining back compatibility * python: replace deprecated pkg_resources with importlib (#1505) * Updated chat wishlist (#1351) * q6k, q4_1 matmat update mini-orca 3b to gguf2, license Signed-off-by: Aaron Miller <apage43@ninjawhale.com> * convert scripts: fix AutoConfig typo (#1512) * publish config https://docs.npmjs.com/cli/v9/configuring-npm/package-json#publishconfig (#1375) merge into my branch * fix appendBin * fix gpu not initializing first * sync up * progress, still wip on destructor * some detection work * untested dispose method * add js side of dispose * Update gpt4all-bindings/typescript/index.cc Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/index.cc Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/index.cc Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/src/gpt4all.d.ts Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/src/gpt4all.js Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * Update gpt4all-bindings/typescript/src/util.js Co-authored-by: cebtenzzre <cebtenzzre@gmail.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> * fix tests * fix circleci for nodejs * bump version --------- Signed-off-by: Aaron Miller <apage43@ninjawhale.com> Signed-off-by: umarmnaq <102142660+umarmnaq@users.noreply.github.com> Signed-off-by: Jacob Nguyen <76754747+jacoobes@users.noreply.github.com> Co-authored-by: Aaron Miller <apage43@ninjawhale.com> Co-authored-by: Adam Treat <treat.adam@gmail.com> Co-authored-by: Akarshan Biswas <akarshan.biswas@gmail.com> Co-authored-by: Cebtenzzre <cebtenzzre@gmail.com> Co-authored-by: Jan Philipp Harries <jpdus@users.noreply.github.com> Co-authored-by: umarmnaq <102142660+umarmnaq@users.noreply.github.com> Co-authored-by: Alex Soto <asotobu@gmail.com> Co-authored-by: niansa/tuxifan <tuxifan@posteo.de>	2023-11-01 14:38:58 -05:00
cebtenzzre	3c561bcdf2	python: bump bindings version for AMD fixes	2023-10-30 17:00:05 -04:00
cebtenzzre	79a5522931	fix references to old backend implementations	2023-10-30 10:37:05 -04:00
cebtenzzre	78d930516d	app.py: change default model to Mistral Instruct (#1564 )	2023-10-24 12:43:30 -04:00
cebtenzzre	e90263c23f	make scripts executable (#1555 )	2023-10-24 09:28:21 -04:00
cebtenzzre	7e5e84fbb7	python: change default extension to .gguf (#1559 )	2023-10-23 22:18:50 -04:00
cebtenzzre	37b007603a	bindings: replace references to GGMLv3 models with GGUF (#1547 )	2023-10-22 11:58:28 -04:00
Andriy Mulyar	d50803ff8e	GGUF Python Release (#1539 )	2023-10-19 19:11:03 -04:00
cebtenzzre	245c5ce5ea	update default model URLs (#1538 )	2023-10-19 15:25:37 -04:00
cebtenzzre	0fe2e19691	llamamodel: re-enable error messages by default (#1537 )	2023-10-19 13:46:33 -04:00
cebtenzzre	5fbeeb1cb4	python: connection resume and MSVC support (#1535 )	2023-10-19 12:06:38 -04:00
cebtenzzre	017c3a9649	python: prepare version 2.0.0rc1 (#1529 )	2023-10-18 20:24:54 -04:00
cebtenzzre	fd3014016b	docs: clarify Vulkan dep in build instructions for bindings (#1525 )	2023-10-18 12:09:52 -04:00
cebtenzzre	4d4275d1b8	python: replace deprecated pkg_resources with importlib (#1505 )	2023-10-12 13:35:27 -04:00
Alex Soto	3c45a555e9	Improves Java API signatures maintaining back compatibility	2023-10-12 07:53:12 -04:00
Aaron Miller	f39df0906e	fix embed4all filename https://discordapp.com/channels/1076964370942267462/1093558720690143283/1161778216462192692 Signed-off-by: Aaron Miller <apage43@ninjawhale.com>	2023-10-12 07:52:56 -04:00
cebtenzzre	aed2068342	python: always check status code of HTTP responses (#1502 )	2023-10-11 18:11:28 -04:00
Aaron Miller	afaa291eab	python bindings should be quiet by default * disable llama.cpp logging unless GPT4ALL_VERBOSE_LLAMACPP envvar is nonempty * make verbose flag for retrieve_model default false (but also be overridable via gpt4all constructor) should be able to run a basic test: ```python import gpt4all model = gpt4all.GPT4All('/Users/aaron/Downloads/rift-coder-v0-7b-q4_0.gguf') print(model.generate('def fib(n):')) ``` and see no non-model output when successful	2023-10-11 14:14:36 -07:00
cebtenzzre	f81b4b45bf	python: support Path in GPT4All.__init__ (#1462 )	2023-10-11 14:12:40 -04:00
Aaron Miller	a10f3aea5e	python/embed4all: use gguf model, allow passing kwargs/overriding model	2023-10-05 18:16:19 -04:00
Adam Treat	ea66669cef	Switch to new models2.json for new gguf release and bump our version to 2.5.0.	2023-10-05 18:16:19 -04:00
Cebtenzzre	40c78d2f78	python binding: print debug message to stderr	2023-10-05 18:16:19 -04:00
Cebtenzzre	4392bf26e0	pyllmodel: print specific error message	2023-10-05 18:16:19 -04:00
Cebtenzzre	34f2ec2b33	gpt4all.py: GGUF	2023-10-05 18:16:19 -04:00
kevinbazira	17cb4a86d1	Replace git clone SSH URI with HTTPS URL Running `git clone --recurse-submodules git@github.com:nomic-ai/gpt4all.git` returns `Permission denied (publickey)` as shown below: ``` git clone --recurse-submodules git@github.com:nomic-ai/gpt4all.git Cloning into gpt4all... git@github.com: Permission denied (publickey). fatal: Could not read from remote repository. ``` This change replaces `git@github.com:nomic-ai/gpt4all.git` with `https://github.com/nomic-ai/gpt4all.git` which runs without permission issues. resolves nomic-ai/gpt4all#8, resolves nomic-ai/gpt4all#49	2023-09-20 09:48:47 -04:00
Adam Treat	0f046cf905	Bump the Python version to python-v1.0.12 to restrict the quants that vulkan recognizes.	2023-09-15 09:12:20 -04:00
Aaron Miller	f0735efa7d	vulkan python bindings on windows fixes	2023-09-12 14:16:02 -07:00
Aaron Miller	0ad1472b62	bump python version (library linking fix)	2023-09-11 09:42:06 -07:00
Andriy Mulyar	b6e38d69ed	Python version bump Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>	2023-09-01 13:21:41 -04:00
Andriy Mulyar	707b91a24f	Update Python bindings README.md (#1389 ) Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>	2023-09-01 13:01:40 -04:00
Adam Treat	987546c63b	Nomic vulkan backend licensed under the Software for Open Models License (SOM), version 1.0.	2023-08-31 15:29:54 -04:00
Jacob Nguyen	b43eec0e2c	fix ts tests on windows (#1342 ) * fix ts tests on windows * fix cleanup * fix tests * hold on c sharp workflows * fix: downloadModel doesnt not mkdirp	2023-08-17 10:32:08 -04:00
Jacob Nguyen	4e55940edf	feat(typescript)/dynamic template (#1287 ) (#1326 ) * feat(typescript)/dynamic template (#1287) * remove packaged yarn * prompt templates update wip * prompt template update * system prompt template, update types, remove embed promises, cleanup * support both snakecased and camelcased prompt context * fix #1277 libbert, libfalcon and libreplit libs not being moved into the right folder after build * added support for modelConfigFile param, allowing the user to specify a local file instead of downloading the remote models.json. added a warning message if code fails to load a model config. included prompt context docs by amogus. * snakecase warning, put logic for loading local models.json into listModels, added constant for the default remote model list url, test improvements, simpler hasOwnProperty call * add DEFAULT_PROMPT_CONTEXT, export new constants * add md5sum testcase and fix constants export * update types * throw if attempting to list models without a source * rebuild docs * fix download logging undefined url, toFixed typo, pass config filesize in for future progress report * added overload with union types * bump to 2.2.0, remove alpha * code speling --------- Co-authored-by: Andreas Obersteiner <8959303+iimez@users.noreply.github.com>	2023-08-14 12:45:45 -04:00
cosmic-snow	af6fe5fbb5	Update gpt4all_faq.md - minor oversight: there are now six supported architectures - LLAMA -> LLaMA (for v1) - note about Llama 2 and link to license - limit some of the paragraphs to 150 chars Signed-off-by: cosmic-snow <134004613+cosmic-snow@users.noreply.github.com>	2023-08-10 23:56:54 +02:00

1 2 3 4 5

232 Commits