Commit Graph

52 Commits (5c5e10c1f5ac03f9dbab4cc4d8c5bb02d286b46f)

Author SHA1 Message Date
niansa/tuxifan 5c5e10c1f5 Synced llama.cpp.cmake with upstream (#887) 1 year ago
Richard Guo 7a472bea88 Replit Model (#713)
* porting over replit code model to gpt4all

* replaced memory with kv_self struct

* continuing debug

* welp it built but lot of sus things

* working model loading and somewhat working generate.. need to format response?

* revert back to semi working version

* finally got rid of weird formatting

* figured out problem is with python bindings - this is good to go for testing

* addressing PR feedback

* output refactor

* fixed prompt reponse collection

* cleanup

* addressing PR comments

* building replit backend with new ggmlver code

* chatllm replit and clean python files

* cleanup

* updated replit to match new llmodel api

* match llmodel api and change size_t to Token

* resolve PR comments

* replit model commit comment
1 year ago
Adam Treat fff1194b38 Fix llama models on linux and windows. 1 year ago
Adam Treat 31b1f966e0 Fix symbol resolution on windows. 1 year ago
Adam Treat 3cd7d2f3c7 Make installers work with mac/windows for big backend change. 1 year ago
Adam Treat 1a8548b876 Update to latest llama.cpp 1 year ago
Adam Treat b36ea3dde5 Fix up for newer models on reset context. This fixes the model from totally failing after a reset context. 1 year ago
AT 964e2ffc1b We no longer have an avx_only repository and better error handling for minimum hardware requirements. (#833) 1 year ago
AT b5971b0d41 Backend prompt dedup (#822)
* Deduplicated prompt() function code
1 year ago
Ikko Eltociear Ashimine e53195a002 Update README.md
huggingface -> Hugging Face

Signed-off-by: Ikko Eltociear Ashimine <eltociear@gmail.com>
1 year ago
Peter Gagarinov 21df8a771e Only default mlock on macOS where swap seems to be a problem
Repeating the change that once was done in https://github.com/nomic-ai/gpt4all/pull/663 but then was overriden by 9c6c09cbd2

Signed-off-by: Peter Gagarinov <pgagarinov@users.noreply.github.com>
1 year ago
niansa/tuxifan 8203d65445 Fixed tons of warnings and clazy findings (#811) 1 year ago
niansa/tuxifan 1832a887b5 Fixed model type for GPT-J (#815)
Signed-off-by: niansa/tuxifan <tuxifan@posteo.de>
1 year ago
Richard Guo d051ac889c more cleanup 1 year ago
Richard Guo fb09f412ff cleanup 1 year ago
Richard Guo 67b7641390 fixed finding model libs 1 year ago
Adam Treat 15c6bf09e9 Fix mac build again. 1 year ago
Adam Treat 1b755b6cba Try and fix build on mac. 1 year ago
Adam Treat 7ee32d605f Trying to shrink the copy+paste code and do more code sharing between backend model impl. 1 year ago
Tim Miller 455e6aa7ce Fix MSVC Build, Update C# Binding Scripts 1 year ago
niansa/tuxifan c4f9535fd0 Allow user to specify custom search path via $GPT4ALL_IMPLEMENTATIONS_PATH (#789) 1 year ago
niansa ab56119470 Fixed double-free in LLModel::Implementation destructor 1 year ago
niansa/tuxifan 8aa707fdb4 Cleaned up implementation management (#787)
* Cleaned up implementation management

* Initialize LLModel::m_implementation to nullptr

* llmodel.h: Moved dlhandle fwd declare above LLModel class
1 year ago
Adam Treat 8be42683ac Add fixme's and clean up a bit. 1 year ago
niansa b68d359b4f Dlopen better implementation management (Version 2) 1 year ago
niansa/tuxifan 991a0e4bd8 Advanced avxonly autodetection (#744)
* Advanced avxonly requirement detection
1 year ago
AT 9c6c09cbd2 Dlopen backend 5 (#779)
Major change to the backend that allows for pluggable versions of llama.cpp/ggml. This was squashed merged from dlopen_backend_5 where the history is preserved.
1 year ago
Adam Treat 4a317eeb33 Revert "New tokenizer implementation for MPT and GPT-J"
This reverts commit ee3469ba6c.
1 year ago
Adam Treat 06434f0042 Revert "buf_ref.into() can be const now"
This reverts commit 840e011b75.
1 year ago
Adam Treat 92bc92d232 Revert "add tokenizer readme w/ instructions for convert script"
This reverts commit 9c15d1f83e.
1 year ago
aaron miller 9c15d1f83e add tokenizer readme w/ instructions for convert script 1 year ago
Aaron Miller 840e011b75 buf_ref.into() can be const now 1 year ago
Aaron Miller ee3469ba6c New tokenizer implementation for MPT and GPT-J
Improves output quality by making these tokenizers more closely
match the behavior of the huggingface `tokenizers` based BPE
tokenizers these models were trained with.

Featuring:
 * Fixed unicode handling (via ICU)
 * Fixed BPE token merge handling
 * Complete added vocabulary handling
1 year ago
Adam Treat d40735a2d2 Get the backend as well as the client building/working with msvc. 1 year ago
Adam Treat 80024a029c Add new reverse prompt for new localdocs context feature. 1 year ago
Juuso Alasuutari 8d822f9898 llmodel: constify some casts in LLModelWrapper 1 year ago
Juuso Alasuutari f2528e6f62 llmodel: constify LLModel::threadCount() 1 year ago
Juuso Alasuutari f0b942d323 llmodel: fix wrong and/or missing prompt callback type
Fix occurrences of the prompt callback being incorrectly specified, or
the response callback's prototype being incorrectly used in its place.

Signed-off-by: Juuso Alasuutari <juuso.alasuutari@gmail.com>
1 year ago
Adam Treat 9e13f813d5 Only default mlock on macOS where swap seems to be a problem. 1 year ago
Adam Treat d5dd4e87de Always default mlock to true. 1 year ago
aaron miller 08f3bd2a82 backend: fix buffer overrun in repeat penalty code
Caught with AddressSanitizer running a basic prompt test against llmodel
standalone. This fix allows ASan builds to complete a simple prompt
without illegal accesses but there are still notably several leaks.
1 year ago
kuvaus 4f2b7f7be4 Bugfix on llmodel_model_create function
Fixes the bug where llmodel_model_create prints "Invalid model file" even though the model is loaded correctly. Credits and thanks to @serendipity for the fix.
1 year ago
kuvaus a0b98dc55d gpt4all-backend: Add llmodel create and destroy functions (#554)
* Add llmodel create and destroy functions

* Fix capitalization

* Fix capitalization

* Fix capitalization

* Update CMakeLists.txt

---------

Co-authored-by: kuvaus <kuvaus@users.noreply.github.com>
1 year ago
kuvaus 4f021ebcbb gpt4all-backend: Add MSVC support to backend (#595)
* Add MSVC compatibility

* Add _MSC_VER macro

---------

Co-authored-by: kuvaus <kuvaus@users.noreply.github.com>
1 year ago
Aaron Miller 9aaa355d41 backend: dedupe tokenizing code in mpt/gptj 1 year ago
Aaron Miller fc2869f0b7 backend: dedupe tokenizing code in gptj/mpt 1 year ago
Aaron Miller 16b7bf01a8 backend: make initial buf_size const in model impls
more unifying mpt and gptj code - this one's never written so also
changing the name to be clearer
1 year ago
Aaron Miller 0c9b7a6ae8 mpt: use buf in model struct (thread safety) 1 year ago
AT 48ca4a047c Update README.md
Signed-off-by: AT <manyoso@users.noreply.github.com>
1 year ago
Zach Nussbaum 53730c5f7f fix: use right conversion script 1 year ago