Zach Nussbaum
|
6c4f449b7a
|
fix: update train scripts and configs for other models (#1164)
* feat: falcon config
* feat: mpt config
* chore: gitignore
* refactor: step calculation
* fix: attention mask + shuffle on epoch end
* fix: return tensors
* fix: wait for everyone
* chore: config
* chore: ds config
* fix: remove ccols
* fix: logging and saving
* chore: add einops
|
2023-07-12 15:18:24 -04:00 |
|
Adam Treat
|
e8b19b8e82
|
Bump version to 2.4.14 and provide release notes.
|
2023-07-12 14:58:45 -04:00 |
|
Adam Treat
|
8eb0844277
|
Check if the trimmed version is empty.
|
2023-07-12 14:31:43 -04:00 |
|
Adam Treat
|
be395c12cc
|
Make all system prompts empty by default if the model does not include one in its training data.
|
2023-07-12 14:31:43 -04:00 |
|
Aaron Miller
|
6a8fa27c8d
|
Correctly find models in subdirs of model dir
QDirIterator doesn't seem particularly subdir aware; its path() returns
the iterated dir. This was the simplest way I found to get this right.
|
2023-07-12 14:18:40 -04:00 |
|
Adam Treat
|
8893db5896
|
Add wizard model and rename orca to be more specific.
|
2023-07-12 14:12:46 -04:00 |
|
Adam Treat
|
60627bd41f
|
Prefer 7b models in order of default model load.
|
2023-07-12 12:50:18 -04:00 |
|
Aaron Miller
|
5df4f1bf8c
|
codespell
|
2023-07-12 12:49:06 -04:00 |
|
Aaron Miller
|
10ca2c4475
|
center the spinner
|
2023-07-12 12:49:06 -04:00 |
|
Adam Treat
|
e9897518d1
|
Show busy if models.json download is taking longer than expected.
|
2023-07-12 12:49:06 -04:00 |
|
Aaron Miller
|
432b7ebbd7
|
include windows.h just to be safe
|
2023-07-12 12:46:46 -04:00 |
|
Aaron Miller
|
95b8fb312e
|
windows/msvc: use high level processor feature detection API
see https://learn.microsoft.com/en-us/windows/win32/api/processthreadsapi/nf-processthreadsapi-isprocessorfeaturepresent
|
2023-07-12 12:46:46 -04:00 |
|
Aaron Miller
|
ad0e7fd01f
|
chatgpt: ensure no extra newline in header
|
2023-07-12 10:53:25 -04:00 |
|
Aaron Miller
|
f0faa23ad5
|
cmakelists: always export build commands (#1179)
friendly for using editors with clangd integration that don't also
manage the build themselves
|
2023-07-12 10:49:24 -04:00 |
|
Adam Treat
|
0d726b22b8
|
When we explicitly cancel an operation we shouldn't throw an error.
|
2023-07-12 10:34:10 -04:00 |
|
Adam Treat
|
13b2d47be5
|
Provide an error dialog if for any reason we can't access the settings file.
|
2023-07-12 08:50:21 -04:00 |
|
Adam Treat
|
e9d42fba35
|
Don't show first start more than once.
|
2023-07-11 18:54:53 -04:00 |
|
mvenditto
|
8a31239e90
|
bump version
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
b96b6ef38f
|
pack metal files nuget
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
7efb43c2e4
|
copy metal kernels on macos builds
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
991b7468c9
|
fix native lib loader tests
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
4d0201ac33
|
copy metal kernels for macos
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
c92c1af697
|
nuget pack and push
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
f3b6f49684
|
fix workspace symlinks on unix, fix persist_workspace on windows and macos runtimes dir
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
2927d11a28
|
fix
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
620ccda696
|
try fix
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
cac18c273e
|
More experiments
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
4a99e6662a
|
fix csharp jobs deps
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
380bbcf18f
|
fix cmakelist path
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
0277e8400a
|
debug ls
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
9eb50cc115
|
refine runtimes persist + c# linux build
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
4b7b9975c5
|
add lib loading tests + remove dummy test
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
6d9575e103
|
copy only needed mingw dlls
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
113c25e4de
|
fix win mingw dll path
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
d107cccf18
|
msvc dll path fix
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
51928cd6c3
|
fix msvc putting file in target dir
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
99ca80cf1a
|
change build-bindings-backend when condition
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
289c96cdf8
|
remove bad cp
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
f4a0fc6cef
|
add holds
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
ddd087dadb
|
fix wrong cmake arg in macos job
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
2e131053e8
|
fix missing cmake in win msvc job
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
11ac85b01f
|
add needed sudo in ubuntu machine scenario
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
021a388b38
|
typo again, should sleep
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
cd3bfea09b
|
fix filters
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
fec2fd2832
|
try to fix error
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
3c126ffa03
|
typo
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
ec9148f52c
|
further tests
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
422aecc5ba
|
revert some bad changes
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
d151beb8bf
|
fix
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
4697b968a8
|
better restore cache + some experimentation
|
2023-07-11 18:09:39 -04:00 |
|