Commit Graph

22 Commits (8d77d9ad895019b95011abc821305f08ff966560)

Author SHA1 Message Date
Aaron Miller 198b5e4832 add Falcon 7B model
Tested with https://huggingface.co/TheBloke/falcon-7b-instruct-GGML/blob/main/falcon7b-instruct.ggmlv3.q4_0.bin
1 year ago
Aaron Miller db34a2f670 llmodel: skip attempting Metal if model+kvcache > 53% of system ram 1 year ago
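For illustration, a minimal C++ sketch of a guard in that spirit; the function name and parameters are hypothetical, and only the 53% threshold comes from the commit message:

```cpp
#include <cstddef>

// Threshold from the commit message: model + kvcache must fit in 53% of RAM.
static constexpr double kMetalMaxRamFraction = 0.53;

// Hypothetical helper: decide whether attempting Metal is worthwhile.
bool shouldTryMetal(std::size_t modelBytes, std::size_t kvCacheBytes,
                    std::size_t systemRamBytes) {
    return static_cast<double>(modelBytes + kvCacheBytes)
           <= kMetalMaxRamFraction * static_cast<double>(systemRamBytes);
}
```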
Aaron Miller b19a3e5b2c add requiredMem method to llmodel impls
Most of these can just shortcut out of the model loading logic. llama is a bit worse to deal with because we submodule it, so I have to at least parse the hparams; then I just use the size on disk as an estimate for the mem size (which seems reasonable, since we mmap() the llama files anyway).
1 year ago
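A minimal sketch of the estimation strategy the commit describes, assuming a path-based helper (the name is hypothetical; only the size-on-disk-because-mmap idea comes from the commit):

```cpp
#include <cstddef>
#include <filesystem>
#include <string>
#include <system_error>

// Estimate required memory for a mmap()ed llama model by its on-disk size.
std::size_t requiredMemEstimate(const std::string &modelPath) {
    std::error_code ec;
    auto bytes = std::filesystem::file_size(modelPath, ec);
    return ec ? 0 : static_cast<std::size_t>(bytes); // 0 = size unknown
}
```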
Aaron Miller 88616fde7f
llmodel: change tokenToString to not use string_view (#968)
Fixes a definite use-after-free and likely avoids some other potential ones. A std::string will convert to a std::string_view automatically, but as soon as the std::string in question goes out of scope it is freed, and the string_view is left pointing at freed memory. This is *mostly* fine if it's returning a reference to the tokenizer's internal vocab table, but it's, imo, too easy to return a reference to a dynamically constructed string this way, as replit is doing (and unfortunately needs to do, to convert the internal whitespace replacement symbol back to a space).
1 year ago
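For clarity, a self-contained illustration of the bug class described here (not the repository's actual tokenToString code):

```cpp
#include <string>
#include <string_view>

// Stand-in detokenizer that builds a fresh string, e.g. mapping an internal
// whitespace-replacement symbol back to a real space.
std::string detokenize(int token) {
    return token == 0 ? std::string(" ") : "tok" + std::to_string(token);
}

std::string_view tokenToStringBad(int token) {
    return detokenize(token); // BUG: the temporary std::string converts to a
                              // string_view, then dies; the view dangles.
}

std::string tokenToStringGood(int token) {
    return detokenize(token); // returning by value keeps the data alive
}
```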
Adam Treat b906fb4057 When recalculating context we can't erase the BOS. 1 year ago
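A minimal sketch of that rule with hypothetical names: when trimming tokens to rebuild context, keep the BOS at index 0 and only drop tokens after it:

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// Keep the BOS plus the last keepLast tokens when shrinking the context.
void shrinkContext(std::vector<int> &tokens, std::size_t keepLast, int bosToken) {
    if (tokens.size() <= keepLast + 1)
        return;                                   // nothing to trim
    std::vector<int> kept;
    kept.push_back(bosToken);                     // never erase the BOS
    kept.insert(kept.end(), tokens.end() - keepLast, tokens.end());
    tokens = std::move(kept);
}
```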
Aaron Miller d3ba1295a7
Metal+LLama take two (#929)
Support latest llama with Metal
---------

Co-authored-by: Adam Treat <adam@nomic.ai>
Co-authored-by: niansa/tuxifan <tuxifan@posteo.de>
1 year ago
Adam Treat b162b5c64e Revert "llama on Metal (#885)"
This reverts commit c55f81b860.
1 year ago
Aaron Miller c55f81b860
llama on Metal (#885)
Support latest llama with Metal

---------

Co-authored-by: Adam Treat <adam@nomic.ai>
Co-authored-by: niansa/tuxifan <tuxifan@posteo.de>
1 year ago
Adam Treat 301d2fdbea Fix up for newer models on reset context. This prevents the model from totally failing after a context reset. 1 year ago
AT bbe195ee02
Backend prompt dedup (#822)
* Deduplicated prompt() function code
1 year ago
Peter Gagarinov 23391d44e0 Only default mlock on macOS where swap seems to be a problem
Repeating the change that was once made in https://github.com/nomic-ai/gpt4all/pull/663 but was then overridden by 48275d0dcc

Signed-off-by: Peter Gagarinov <pgagarinov@users.noreply.github.com>
1 year ago
niansa/tuxifan f3564ac6b9
Fixed tons of warnings and clazy findings (#811) 1 year ago
Adam Treat a41bd6ac0a Trying to shrink the copy+paste code and do more code sharing between backend model implementations. 1 year ago
niansa a3d08cdcd5 Dlopen better implementation management (Version 2) 1 year ago
AT 48275d0dcc
Dlopen backend 5 (#779)
Major change to the backend that allows for pluggable versions of llama.cpp/ggml. This was squash-merged from dlopen_backend_5, where the history is preserved.
1 year ago
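A minimal POSIX sketch of the dlopen() pattern a pluggable backend like this relies on; the create_model symbol and LLModelAPI type are hypothetical:

```cpp
#include <cstdio>
#include <dlfcn.h>

struct LLModelAPI;                       // opaque backend interface
using CreateFn = LLModelAPI *(*)();

// Load one backend implementation (e.g. a specific llama.cpp build) at runtime.
LLModelAPI *loadBackend(const char *libPath) {
    void *handle = dlopen(libPath, RTLD_NOW | RTLD_LOCAL);
    if (!handle) {
        std::fprintf(stderr, "dlopen failed: %s\n", dlerror());
        return nullptr;
    }
    auto create = reinterpret_cast<CreateFn>(dlsym(handle, "create_model"));
    if (!create) {
        std::fprintf(stderr, "dlsym failed: %s\n", dlerror());
        return nullptr;
    }
    return create();                     // caller owns the returned instance
}
```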
Adam Treat 9bfff8bfcb Add new reverse prompt for new localdocs context feature. 1 year ago
Juuso Alasuutari 81fdc28e58 llmodel: constify LLModel::threadCount() 1 year ago
Adam Treat 8204c2eb80 Only default mlock on macOS where swap seems to be a problem. 1 year ago
Adam Treat aba1147a22 Always default mlock to true. 1 year ago
aaron miller e6fd0a240d backend: fix buffer overrun in repeat penalty code
Caught with AddressSanitizer running a basic prompt test against llmodel standalone. This fix allows ASan builds to complete a simple prompt without illegal accesses, though notably several leaks remain.
1 year ago
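A sketch of the bug class and the usual fix, with hypothetical names (not the actual patch): clamp the repeat-penalty window so it never reads before the start of the recent-token buffer:

```cpp
#include <algorithm>
#include <vector>

// Penalize logits of recently generated tokens; reading repeatLastN entries
// without clamping to lastTokens.size() is the classic overrun.
void applyRepeatPenalty(std::vector<float> &logits,
                        const std::vector<int> &lastTokens,
                        int repeatLastN, float penalty) {
    const int count = static_cast<int>(lastTokens.size());
    const int n = std::min(repeatLastN, count);  // clamp the window
    for (int i = count - n; i < count; ++i) {
        const int tok = lastTokens[i];
        if (tok >= 0 && tok < static_cast<int>(logits.size()))
            logits[tok] /= penalty;  // simplified; real code also handles
                                     // negative logits by multiplying
    }
}
```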
kuvaus 507e913faf
gpt4all-backend: Add MSVC support to backend (#595)
* Add MSVC compatibility

* Add _MSC_VER macro

---------

Co-authored-by: kuvaus <kuvaus@users.noreply.github.com>
1 year ago
Adam Treat d918b02c29 Move the llmodel C API to new top-level directory and version it. 1 year ago