Commit Graph

1315 Commits

Author SHA1 Message Date
Zach Nussbaum
6c4f449b7a
fix: update train scripts and configs for other models (#1164)
* feat: falcon config

* feat: mpt config

* chore: gitignore

* refactor: step calculation

* fix: attention mask + shuffle on epoch end

* fix: return tensors

* fix: wait for everyone

* chore: config

* chore: ds config

* fix: remove ccols

* fix: logging and saving

* chore: add einops
2023-07-12 15:18:24 -04:00
Adam Treat
e8b19b8e82 Bump version to 2.4.14 and provide release notes. 2023-07-12 14:58:45 -04:00
Adam Treat
8eb0844277 Check if the trimmed version is empty. 2023-07-12 14:31:43 -04:00
Adam Treat
be395c12cc Make all system prompts empty by default if model does not include in training data. 2023-07-12 14:31:43 -04:00
Aaron Miller
6a8fa27c8d Correctly find models in subdirs of model dir
QDirIterator doesn't seem particular subdir aware, its path() returns
the iterated dir. This was the simplest way I found to get this right.
2023-07-12 14:18:40 -04:00
Adam Treat
8893db5896 Add wizard model and rename orca to be more specific. 2023-07-12 14:12:46 -04:00
Adam Treat
60627bd41f Prefer 7b models in order of default model load. 2023-07-12 12:50:18 -04:00
Aaron Miller
5df4f1bf8c codespell 2023-07-12 12:49:06 -04:00
Aaron Miller
10ca2c4475 center the spinner 2023-07-12 12:49:06 -04:00
Adam Treat
e9897518d1 Show busy if models.json download taking longer than expected. 2023-07-12 12:49:06 -04:00
Aaron Miller
432b7ebbd7 include windows.h just to be safe 2023-07-12 12:46:46 -04:00
Aaron Miller
95b8fb312e windows/msvc: use high level processor feature detection API
see https://learn.microsoft.com/en-us/windows/win32/api/processthreadsapi/nf-processthreadsapi-isprocessorfeaturepresent
2023-07-12 12:46:46 -04:00
Aaron Miller
ad0e7fd01f chatgpt: ensure no extra newline in header 2023-07-12 10:53:25 -04:00
Aaron Miller
f0faa23ad5
cmakelists: always export build commands (#1179)
friendly for using editors with clangd integration that don't also
manage the build themselves
2023-07-12 10:49:24 -04:00
Adam Treat
0d726b22b8 When we explicitly cancel an operation we shouldn't throw an error. 2023-07-12 10:34:10 -04:00
Adam Treat
13b2d47be5 Provide an error dialog if for any reason we can't access the settings file. 2023-07-12 08:50:21 -04:00
Adam Treat
e9d42fba35 Don't show first start more than once. 2023-07-11 18:54:53 -04:00
mvenditto
8a31239e90 bump version 2023-07-11 18:09:39 -04:00
mvenditto
b96b6ef38f pack metal files nuget 2023-07-11 18:09:39 -04:00
mvenditto
7efb43c2e4 copy metal kernels on macos builds 2023-07-11 18:09:39 -04:00
mvenditto
991b7468c9 fix native lib loader tests 2023-07-11 18:09:39 -04:00
mvenditto
4d0201ac33 copy metal kernels for macos 2023-07-11 18:09:39 -04:00
mvenditto
c92c1af697 nuget pack and push 2023-07-11 18:09:39 -04:00
mvenditto
f3b6f49684 fix workspace symlinks on unix, fix persist_workspace on windows and macos runtimes dir 2023-07-11 18:09:39 -04:00
mvenditto
2927d11a28 fix 2023-07-11 18:09:39 -04:00
mvenditto
620ccda696 try fix 2023-07-11 18:09:39 -04:00
mvenditto
cac18c273e More experiments 2023-07-11 18:09:39 -04:00
mvenditto
4a99e6662a fix csharp jobs deps 2023-07-11 18:09:39 -04:00
mvenditto
380bbcf18f fix cmakelist path 2023-07-11 18:09:39 -04:00
mvenditto
0277e8400a debug ls 2023-07-11 18:09:39 -04:00
mvenditto
9eb50cc115 refine runtimes persist + c# linux build 2023-07-11 18:09:39 -04:00
mvenditto
4b7b9975c5 add lib loading tests + remove dummy test 2023-07-11 18:09:39 -04:00
mvenditto
6d9575e103 copy only needed mingw dlls 2023-07-11 18:09:39 -04:00
mvenditto
113c25e4de fix win mingw dll path 2023-07-11 18:09:39 -04:00
mvenditto
d107cccf18 msvc dll path fix 2023-07-11 18:09:39 -04:00
mvenditto
51928cd6c3 fix msvc putting file in target dir 2023-07-11 18:09:39 -04:00
mvenditto
99ca80cf1a change build-bindings-backend when condition 2023-07-11 18:09:39 -04:00
mvenditto
289c96cdf8 remove bad cp 2023-07-11 18:09:39 -04:00
mvenditto
f4a0fc6cef add holds 2023-07-11 18:09:39 -04:00
mvenditto
ddd087dadb fix wrong cmake arg in macos job 2023-07-11 18:09:39 -04:00
mvenditto
2e131053e8 fix missing cmake in win msvc job 2023-07-11 18:09:39 -04:00
mvenditto
11ac85b01f add needed sudo in ubuntu machine scenario 2023-07-11 18:09:39 -04:00
mvenditto
021a388b38 typo again, should sleep 2023-07-11 18:09:39 -04:00
mvenditto
cd3bfea09b fix filters 2023-07-11 18:09:39 -04:00
mvenditto
fec2fd2832 try to fix error 2023-07-11 18:09:39 -04:00
mvenditto
3c126ffa03 typo 2023-07-11 18:09:39 -04:00
mvenditto
ec9148f52c further tests 2023-07-11 18:09:39 -04:00
mvenditto
422aecc5ba revert some bad changes 2023-07-11 18:09:39 -04:00
mvenditto
d151beb8bf fix 2023-07-11 18:09:39 -04:00
mvenditto
4697b968a8 better restore cache + some experimentation 2023-07-11 18:09:39 -04:00