Adam Treat
|
0c0a4f2c22
|
Add the docs.
|
2023-07-14 10:48:18 -04:00 |
|
Adam Treat
|
6656f0f41e
|
Fix the test to work and not do timings.
|
2023-07-14 09:48:57 -04:00 |
|
Adam Treat
|
bb2b82e1b9
|
Add docs and bump version since we changed python api again.
|
2023-07-14 09:48:57 -04:00 |
|
Aaron Miller
|
c77ab849c0
|
LLModel objects should hold a reference to the library
prevents llmodel lib from being gc'd before live model objects
|
2023-07-14 09:48:57 -04:00 |
|
Aaron Miller
|
1c4a244291
|
bump mem allocation a bit
|
2023-07-14 09:48:57 -04:00 |
|
Aaron Miller
|
936dcd2bfc
|
use default n_threads
|
2023-07-14 09:48:57 -04:00 |
|
Aaron Miller
|
15f1fe5445
|
rename embedder
|
2023-07-14 09:48:57 -04:00 |
|
Adam Treat
|
ee4186d579
|
Fixup bert python bindings.
|
2023-07-14 09:48:57 -04:00 |
|
cosmic-snow
|
6200900677
|
Fix Windows MSVC arch detection (#1194)
- in llmodel.cpp to fix AVX-only handling
Signed-off-by: cosmic-snow <134004613+cosmic-snow@users.noreply.github.com>
|
2023-07-13 14:44:17 -04:00 |
|
Adam Treat
|
4963db8f43
|
Bump the version numbers for both python and c backend.
|
2023-07-13 14:21:46 -04:00 |
|
Adam Treat
|
0efdbfcffe
|
Bert
|
2023-07-13 14:21:46 -04:00 |
|
Adam Treat
|
315a1f2aa2
|
Move it back as internal class.
|
2023-07-13 14:21:46 -04:00 |
|
Adam Treat
|
ae8eb297ac
|
Add sbert backend.
|
2023-07-13 14:21:46 -04:00 |
|
Adam Treat
|
1f749d7633
|
Clean up backend code a bit and hide impl. details.
|
2023-07-13 14:21:46 -04:00 |
|
Adam Treat
|
33557b1f39
|
Move the implementation out of llmodel class.
|
2023-07-13 14:21:46 -04:00 |
|
Adam Treat
|
64b409e0b8
|
keep trying
|
2023-07-13 13:57:22 -04:00 |
|
Adam Treat
|
e59946f05d
|
try again to unbreak circleci
|
2023-07-13 13:55:22 -04:00 |
|
Adam Treat
|
b72b409d40
|
try again to unbreak circlci
|
2023-07-13 13:52:55 -04:00 |
|
Adam Treat
|
59cae1132c
|
Try and unbreak circleci.
|
2023-07-13 13:45:47 -04:00 |
|
Adam Treat
|
a0dae86a95
|
Add bert to models.json
|
2023-07-13 13:37:12 -04:00 |
|
AT
|
18ca8901f0
|
Update README.md
Signed-off-by: AT <manyoso@users.noreply.github.com>
|
2023-07-12 16:30:56 -04:00 |
|
cosmic-snow
|
00a945eaee
|
Update gpt4all_faq.md
- Add information about AVX/AVX2.
- Update supported architectures.
Signed-off-by: cosmic-snow <134004613+cosmic-snow@users.noreply.github.com>
|
2023-07-12 15:19:26 -04:00 |
|
Zach Nussbaum
|
6c4f449b7a
|
fix: update train scripts and configs for other models (#1164)
* feat: falcon config
* feat: mpt config
* chore: gitignore
* refactor: step calculation
* fix: attention mask + shuffle on epoch end
* fix: return tensors
* fix: wait for everyone
* chore: config
* chore: ds config
* fix: remove ccols
* fix: logging and saving
* chore: add einops
|
2023-07-12 15:18:24 -04:00 |
|
Adam Treat
|
e8b19b8e82
|
Bump version to 2.4.14 and provide release notes.
|
2023-07-12 14:58:45 -04:00 |
|
Adam Treat
|
8eb0844277
|
Check if the trimmed version is empty.
|
2023-07-12 14:31:43 -04:00 |
|
Adam Treat
|
be395c12cc
|
Make all system prompts empty by default if model does not include in training data.
|
2023-07-12 14:31:43 -04:00 |
|
Aaron Miller
|
6a8fa27c8d
|
Correctly find models in subdirs of model dir
QDirIterator doesn't seem particular subdir aware, its path() returns
the iterated dir. This was the simplest way I found to get this right.
|
2023-07-12 14:18:40 -04:00 |
|
Adam Treat
|
8893db5896
|
Add wizard model and rename orca to be more specific.
|
2023-07-12 14:12:46 -04:00 |
|
Adam Treat
|
60627bd41f
|
Prefer 7b models in order of default model load.
|
2023-07-12 12:50:18 -04:00 |
|
Aaron Miller
|
5df4f1bf8c
|
codespell
|
2023-07-12 12:49:06 -04:00 |
|
Aaron Miller
|
10ca2c4475
|
center the spinner
|
2023-07-12 12:49:06 -04:00 |
|
Adam Treat
|
e9897518d1
|
Show busy if models.json download taking longer than expected.
|
2023-07-12 12:49:06 -04:00 |
|
Aaron Miller
|
432b7ebbd7
|
include windows.h just to be safe
|
2023-07-12 12:46:46 -04:00 |
|
Aaron Miller
|
95b8fb312e
|
windows/msvc: use high level processor feature detection API
see https://learn.microsoft.com/en-us/windows/win32/api/processthreadsapi/nf-processthreadsapi-isprocessorfeaturepresent
|
2023-07-12 12:46:46 -04:00 |
|
Aaron Miller
|
ad0e7fd01f
|
chatgpt: ensure no extra newline in header
|
2023-07-12 10:53:25 -04:00 |
|
Aaron Miller
|
f0faa23ad5
|
cmakelists: always export build commands (#1179)
friendly for using editors with clangd integration that don't also
manage the build themselves
|
2023-07-12 10:49:24 -04:00 |
|
Adam Treat
|
0d726b22b8
|
When we explicitly cancel an operation we shouldn't throw an error.
|
2023-07-12 10:34:10 -04:00 |
|
Adam Treat
|
13b2d47be5
|
Provide an error dialog if for any reason we can't access the settings file.
|
2023-07-12 08:50:21 -04:00 |
|
Adam Treat
|
e9d42fba35
|
Don't show first start more than once.
|
2023-07-11 18:54:53 -04:00 |
|
mvenditto
|
8a31239e90
|
bump version
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
b96b6ef38f
|
pack metal files nuget
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
7efb43c2e4
|
copy metal kernels on macos builds
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
991b7468c9
|
fix native lib loader tests
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
4d0201ac33
|
copy metal kernels for macos
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
c92c1af697
|
nuget pack and push
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
f3b6f49684
|
fix workspace symlinks on unix, fix persist_workspace on windows and macos runtimes dir
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
2927d11a28
|
fix
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
620ccda696
|
try fix
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
cac18c273e
|
More experiments
|
2023-07-11 18:09:39 -04:00 |
|
mvenditto
|
4a99e6662a
|
fix csharp jobs deps
|
2023-07-11 18:09:39 -04:00 |
|