Commit Graph

374 Commits

Author SHA1 Message Date
AT
41fdd24664 Update README.md
Signed-off-by: AT <manyoso@users.noreply.github.com>
2023-05-10 09:09:29 -04:00
AT
707a8e1a0a Update README.md
Signed-off-by: AT <manyoso@users.noreply.github.com>
2023-05-10 09:09:04 -04:00
Adam Treat
0978c260c4 Bump the version to 2.4.2 2023-05-10 09:05:39 -04:00
AT
6ae1e8a842 Update issue templates 2023-05-09 23:57:06 -04:00
Adam Treat
27599d4a0a Fix some usage events. 2023-05-09 23:43:16 -04:00
Adam Treat
c8e9259bd1 Default to true for compat hardware. 2023-05-09 23:17:36 -04:00
AT
dd71513309 Update README.md 2023-05-09 23:10:53 -04:00
AT
148225372d Update README.md 2023-05-09 23:10:06 -04:00
AT
ec81db43df Update README.md 2023-05-09 23:04:54 -04:00
Adam Treat
dcea2f3491 Rename to build_and_run.md 2023-05-09 23:02:41 -04:00
AT
ac665f3739 Update dev_setup.md 2023-05-09 23:00:50 -04:00
AT
bb2ac26459 Update dev_setup.md 2023-05-09 22:36:02 -04:00
AT
37ea0f6c29 Update dev_setup.md 2023-05-09 22:00:42 -04:00
AT
f14458db44 Update dev_setup.md 2023-05-09 21:59:11 -04:00
Adam Treat
3b802fb0f5 Add a page to fill in for setting up a dev environment. 2023-05-09 21:38:24 -04:00
Adam Treat
dfe641222b Shorten text. 2023-05-09 20:54:16 -04:00
Adam Treat
09b5f87b8d Couple of bugfixes. 2023-05-09 19:15:18 -04:00
Adam Treat
f3c81c42a7 Provide a user default model setting and honor it. 2023-05-09 17:10:47 -04:00
Adam Treat
ff257eb52c Add MPT info to the download list and fix it so that isDefault will work even if the required version isn't there. 2023-05-09 12:09:49 -04:00
Adam Treat
5d95085cbe Move this script and rename. 2023-05-09 11:48:32 -04:00
Adam Treat
8eeca20fd7 Simplify. 2023-05-09 11:46:33 -04:00
Adam Treat
8d295550eb Don't keep this in memory when it is not needed. 2023-05-08 21:05:50 -04:00
Adam Treat
7094fd0788 Gracefully handle when we have a previous chat where the model that it used has gone away. 2023-05-08 20:51:03 -04:00
Adam Treat
ad82aaebb1 Copy pasta. 2023-05-08 19:10:22 -04:00
Adam Treat
9c66308922 Fix for special im_end token in mpt-7b-chat model. 2023-05-08 18:57:40 -04:00
Adam Treat
a4bec78ec6 Allow these to load for gptj too. 2023-05-08 18:31:20 -04:00
Aaron Miller
821b28a4fa mpt: allow q4_2 quantized models to load 2023-05-08 18:23:36 -04:00
Aaron Miller
49fc7b315a mpt tokenizer: better special token handling
closer to the behavior of huggingface `tokenizers`,
do not attempt to handle additional tokens as if they were part
of the original vocabulary as this cannot prevent them from being
split into smaller chunks - handle added tokens *before*
the regular tokenizing pass

note this is still necessary even with a "proper" tokenizer implementation
2023-05-08 18:23:36 -04:00
Adam Treat
9da4fac023 Fix gptj to have lower memory requirements for kv cache and add versioning to the internal state to smoothly handle such a fix in the future. 2023-05-08 17:23:02 -04:00
Adam Treat
c7f5280f9f Fix the version. 2023-05-08 16:50:21 -04:00
Adam Treat
be9e748abe Remove as upstream has removed. 2023-05-08 15:09:23 -04:00
Adam Treat
126dd97b0a This shouldn't have snuck in. 2023-05-08 15:09:23 -04:00
Adam Treat
3a8ad1f700 Update to the alibi version that Zach made. 2023-05-08 12:27:01 -04:00
Adam Treat
90b2bcfebe Match Helly's impl of kv cache. 2023-05-08 12:21:30 -04:00
Adam Treat
368886015d Use F16 for kv cache on mpt. 2023-05-08 12:21:30 -04:00
Adam Treat
00804c4e3e Fix for special tokens. 2023-05-08 12:21:30 -04:00
Adam Treat
98e19ebc25 Fix up mpt. 2023-05-08 12:21:30 -04:00
Zach Nussbaum
712aeb8866 fix: helly changes 2023-05-08 12:21:30 -04:00
Zach Nussbaum
d14b93222f fix: model loading 2023-05-08 12:21:30 -04:00
Zach Nussbaum
28f0f76b9f fix: convert script working 2023-05-08 12:21:30 -04:00
Zach Nussbaum
d928540a08 feat: load model 2023-05-08 12:21:30 -04:00
Zach Nussbaum
f8f248c18a chore: import for mpt 2023-05-08 12:21:30 -04:00
Zach Nussbaum
e3f17c8e82 feat: mpt convert from hf to ggml 2023-05-08 12:21:30 -04:00
Zach Nussbaum
285e57ca68 feat: build works + tokenizer 2023-05-08 12:21:30 -04:00
Zach Nussbaum
199a585ad1 feat: add ln 2, rename vars 2023-05-08 12:21:30 -04:00
Zach Nussbaum
21f2aa4911 feat: mpt wip 2023-05-08 12:21:30 -04:00
Adam Treat
a066cba17d Scaffolding for the mpt <-> ggml project. 2023-05-08 12:21:30 -04:00
Adam Treat
da5b057041 Only generate three words max. 2023-05-08 12:21:30 -04:00
Adam Treat
2b76fa6b20 Restore defaults for repeat penalty too. 2023-05-08 12:21:30 -04:00
Adam Treat
ee016e10ab Send info on how many are running into this error. 2023-05-08 08:31:35 -04:00