Commit Graph

12 Commits (5cbf4f28e843a366c0180e619a25d8f3591a67ae)

Author SHA1 Message Date
aaron miller 08f3bd2a82 backend: fix buffer overrun in repeat penalty code
Caught with AddressSanitizer running a basic prompt test against llmodel
standalone. This fix allows ASan builds to complete a simple prompt
without illegal accesses but there are still notably several leaks.
1 year ago
kuvaus 4f2b7f7be4 Bugfix on llmodel_model_create function
Fixes the bug where llmodel_model_create prints "Invalid model file" even though the model is loaded correctly. Credits and thanks to @serendipity for the fix.
1 year ago
kuvaus a0b98dc55d gpt4all-backend: Add llmodel create and destroy functions (#554)
* Add llmodel create and destroy functions

* Fix capitalization

* Fix capitalization

* Fix capitalization

* Update CMakeLists.txt

---------

Co-authored-by: kuvaus <kuvaus@users.noreply.github.com>
1 year ago
kuvaus 4f021ebcbb gpt4all-backend: Add MSVC support to backend (#595)
* Add MSVC compatibility

* Add _MSC_VER macro

---------

Co-authored-by: kuvaus <kuvaus@users.noreply.github.com>
1 year ago
Aaron Miller 9aaa355d41 backend: dedupe tokenizing code in mpt/gptj 1 year ago
Aaron Miller fc2869f0b7 backend: dedupe tokenizing code in gptj/mpt 1 year ago
Aaron Miller 16b7bf01a8 backend: make initial buf_size const in model impls
more unifying mpt and gptj code - this one's never written so also
changing the name to be clearer
1 year ago
Aaron Miller 0c9b7a6ae8 mpt: use buf in model struct (thread safety) 1 year ago
AT 48ca4a047c Update README.md
Signed-off-by: AT <manyoso@users.noreply.github.com>
1 year ago
Zach Nussbaum 53730c5f7f fix: use right conversion script 1 year ago
Adam Treat 8e7b96bd92 Move the llmodel C API to new top-level directory and version it. 1 year ago
Richard Guo 6304f6d322 mono repo structure 1 year ago