Commit Graph

1521 Commits (ffef60912f6e6f5446bd87aa3144f7040bdabeea)
 

Author SHA1 Message Date
Adam Treat ffef60912f Update to llama.cpp 11 months ago
Adam Treat bc88271520 Bump version to v2.5.3 and release notes. 11 months ago
cebtenzzre 5508e43466 build_and_run: clarify which additional Qt libs are needed
Signed-off-by: cebtenzzre <cebtenzzre@gmail.com>
11 months ago
cebtenzzre 79a5522931 fix references to old backend implementations 11 months ago
Adam Treat f529d55380 Move this logic to QML. 11 months ago
Adam Treat f5f22fdbd0 Update llama.cpp for latest bugfixes. 11 months ago
Adam Treat 5c0d077f74 Remove leading whitespace in responses. 11 months ago
Adam Treat 131cfcdeae Don't regenerate the name for deserialized chats. 11 months ago
Adam Treat dc2e7d6e9b Don't start recalculating context immediately upon switching to a new chat
but rather wait until the first prompt. This allows users to switch between
chats fast and to delete chats more easily.

Fixes issue #1545
11 months ago
cebtenzzre 7bcd9e8089 update llama.cpp-mainline 11 months ago
cebtenzzre fd0c501d68
backend: support GGUFv3 (#1582) 11 months ago
Adam Treat 14b410a12a Update to latest version of llama.cpp which fixes issue 1507. 11 months ago
Adam Treat ab96035bec Update to llama.cpp submodule for some vulkan fixes. 11 months ago
Aaron Miller 9193a9517a
make codespell happy again (#1574)
* make codespell happy again

* no belong

Signed-off-by: Aaron Miller <apage43@ninjawhale.com>

---------

Signed-off-by: Aaron Miller <apage43@ninjawhale.com>
11 months ago
cebtenzzre 8d7a3f26d3 gpt4all-training: delete old chat executables
Signed-off-by: cebtenzzre <cebtenzzre@gmail.com>
11 months ago
Andriy Mulyar 3444a47cad
Update README.md
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
11 months ago
Adam Treat 89a59e7f99 Bump version and add release notes for 2.5.1 11 months ago
cebtenzzre f5dd74bcf0
models2.json: add tokenizer merges to mpt-7b-chat model (#1563) 11 months ago
cebtenzzre 78d930516d
app.py: change default model to Mistral Instruct (#1564) 11 months ago
cebtenzzre 83b8eea611 README: add clear note about new GGUF format
Signed-off-by: cebtenzzre <cebtenzzre@gmail.com>
11 months ago
Andriy Mulyar 1bebe78c56
Update README.md
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
11 months ago
Andriy Mulyar b75a209374
Update README.md
Signed-off-by: Andriy Mulyar <andriy.mulyar@gmail.com>
11 months ago
cebtenzzre e90263c23f
make scripts executable (#1555) 11 months ago
Aaron Miller f414c28589 llmodel: whitelist library name patterns
this fixes some issues that were being seen on installed windows builds of 2.5.0

only load dlls that actually might be model impl dlls, otherwise we pull all sorts of random junk into the process before it might expect to be

Signed-off-by: Aaron Miller <apage43@ninjawhale.com>
11 months ago
cebtenzzre 7e5e84fbb7
python: change default extension to .gguf (#1559) 11 months ago
cebtenzzre 37b007603a
bindings: replace references to GGMLv3 models with GGUF (#1547) 11 months ago
cebtenzzre c25dc51935 chat: fix syntax error in main.qml 11 months ago
Thomas 34daf240f9
Update Dockerfile.buildkit (#1542)
corrected model download directory

Signed-off-by: Thomas <tvhdev@vonhaugwitz-softwaresolutions.de>
11 months ago
Victor Tsaran 721d854095
chat: improve accessibility fields (#1532)
Co-authored-by: cebtenzzre <cebtenzzre@gmail.com>
11 months ago
Andriy Mulyar d50803ff8e
GGUF Python Release (#1539) 11 months ago
Adam Treat 9e99cf937a Add release notes for 2.5.0 and bump the version. 11 months ago
cebtenzzre 245c5ce5ea
update default model URLs (#1538) 11 months ago
cebtenzzre 4338e72a51
MPT: use upstream llama.cpp implementation (#1515) 11 months ago
cebtenzzre 0fe2e19691
llamamodel: re-enable error messages by default (#1537) 11 months ago
cebtenzzre f505619c84
README: remove star history (#1536) 11 months ago
cebtenzzre 5fbeeb1cb4
python: connection resume and MSVC support (#1535) 11 months ago
cebtenzzre 017c3a9649
python: prepare version 2.0.0rc1 (#1529) 11 months ago
cebtenzzre bcbcad98d0
CI: increase minimum macOS version of Python bindings to 10.15 (#1511) 11 months ago
cebtenzzre fd3014016b
docs: clarify Vulkan dep in build instructions for bindings (#1525) 11 months ago
cebtenzzre ac33bafb91
docs: improve build_and_run.md (#1524) 11 months ago
cebtenzzre 9a19c740ee
kompute: fix library loading issues with kp_logger (#1517) 11 months ago
Aaron Miller f79557d2aa speedup: just use mat*vec shaders for mat*mat
so far my from-scratch mat*mats are still slower than just running more
invocations of the existing Metal ported mat*vec shaders - it should be
theoretically possible to make a mat*mat that's faster (for actual
mat*mat cases) than an optimal mat*vec, but it will need to be at
*least* as fast as the mat*vec op and then take special care to be
cache-friendly and save memory bandwidth, as the # of compute ops is the
same
11 months ago
cebtenzzre 22de3c56bd
convert scripts: fix AutoConfig typo (#1512) 11 months ago
Aaron Miller 10f9b49313 update mini-orca 3b to gguf2, license
Signed-off-by: Aaron Miller <apage43@ninjawhale.com>
11 months ago
Aaron Miller 2490977f89 q6k, q4_1 mat*mat 11 months ago
niansa/tuxifan a35f1ab784
Updated chat wishlist (#1351) 11 months ago
cebtenzzre 4d4275d1b8
python: replace deprecated pkg_resources with importlib (#1505) 11 months ago
Alex Soto 3c45a555e9 Improves Java API signatures maintaining back compatibility 11 months ago
Aaron Miller f39df0906e fix embed4all filename
https://discordapp.com/channels/1076964370942267462/1093558720690143283/1161778216462192692

Signed-off-by: Aaron Miller <apage43@ninjawhale.com>
11 months ago
umarmnaq 005c092943 Update README.md
Signed-off-by: umarmnaq <102142660+umarmnaq@users.noreply.github.com>
11 months ago