Author | Commit | Message | Date
Pavel Samygin | 57f4e0a899 | add identity path | 2022-06-17 10:15:15 +03:00
justheuristic | 35310698f0 | newer hivemind version | 2022-06-17 10:12:09 +03:00
justheuristic | 8959727dea | add minimalistic benchmarks | 2022-06-14 15:18:38 +03:00
justheuristic | a798ea04a6 | add minimalistic benchmarks | 2022-06-14 15:18:11 +03:00
justheuristic | e3a7d5af30 | inference mode | 2022-06-14 14:51:06 +03:00
justheuristic | 3b16d6ffdb | black-isort | 2022-06-14 13:33:25 +03:00
justheuristic | a28ea0aa6f | Merge pull request #9 from learning-at-home/rpc | 2022-06-14 09:46:42 +03:00
    Rudimentary decentralization
justheuristic | 14e316b52a | black-isort | 2022-06-14 09:46:14 +03:00
justheuristic | eea9287182 | RemoteTransformerBlock | 2022-06-14 09:45:22 +03:00
justheuristic | 3e9fd63a02 | RemoteTransformerBlock | 2022-06-14 09:45:06 +03:00
justheuristic | 1cca611c9f | undo rename | 2022-06-14 09:23:32 +03:00
justheuristic | 7ce7cd7a97 | basic backend | 2022-06-14 08:49:42 +03:00
justheuristic | 1c49bcb741 | basic backend | 2022-06-14 08:26:05 +03:00
justheuristic | 3215945882 | use logger | 2022-06-14 08:25:06 +03:00
justheuristic | ce5dedd2c7 | rename | 2022-06-14 05:09:13 +03:00
justheuristic | 8f5d022d18 | save hidden size | 2022-06-14 05:09:07 +03:00
justheuristic | e2e9d0e94c | memory cache for attention KVs | 2022-06-13 00:47:55 +03:00
justheuristic | e5e8c9ed12 | expel all bloom-specific files to src.bloom | 2022-06-12 21:44:36 +03:00
justheuristic | 324ea2dc96 | save non-transformer params separately | 2022-06-12 09:35:58 +03:00
justheuristic | 54cf292374 | backend schema | 2022-06-12 08:40:33 +03:00
justheuristic | 3ccd0b5e2d | account for layer_past in alibi | 2022-06-12 07:21:17 +03:00
justheuristic | fb3bfbb78f | optimization TODOs for later | 2022-06-12 06:54:12 +03:00
justheuristic | 902bf6400a | [temp workaround] create alibi | 2022-06-12 06:50:19 +03:00
justheuristic | 05faa0b3c8 | add quantization script for cpu | 2022-06-12 05:59:11 +03:00
justheuristic | ffb56a65ed | keep hidden_size as property | 2022-06-12 05:11:28 +03:00
justheuristic | ffce7718ac | layer.py -> block.py | 2022-06-12 05:09:28 +03:00
justheuristic | 9c929b6f3b | layer.py -> bloom_block.py | 2022-06-12 05:08:18 +03:00
justheuristic | 79925fc5f7 | [more disclaimers] | 2022-06-12 05:03:16 +03:00
justheuristic | fde9d50850 | remove slow_but_exact, add quantization | 2022-06-12 05:00:01 +03:00
justheuristic | 43399d7898 | remove slow_but_exact, add quantization | 2022-06-12 04:56:26 +03:00
justheuristic | 51e96ac19b | adopt transformers bloom model | 2022-06-12 04:24:12 +03:00
    Co-authored-by: Thomwolf <thomwolf@gmail.com>
    Co-authored-by: Thomas Wolf <thomas@huggingface.co>
    Co-authored-by: thomasw21 <24695242+thomasw21@users.noreply.github.com>
    Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
    Co-authored-by: sIncerass <sheng.s@berkeley.edu>
    Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
    Co-authored-by: Niklas Muennighoff <n.muennighoff@gmail.com>
    Co-authored-by: Nicolas Patry <Narsil@users.noreply.github.com>
    Co-authored-by: thomasw21 <thomasw21@users.noreply.github.com>
    Co-authored-by: sgugger <sgugger@users.noreply.github.com>
    Co-authored-by: patrickvonplaten <patrickvonplaten@users.noreply.github.com>
    Co-authored-by: LysandreJik <LysandreJik@users.noreply.github.com>
    Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
    Co-authored-by: justheuristic <justheuristic@gmail.com>
    Co-authored-by: Stas Bekman <stas@stason.org>
justheuristic | 99059ae667 | install script | 2022-06-12 04:23:38 +03:00
justheuristic | 71717c5f00 | Merge remote-tracking branch 'origin/main' into main | 2022-06-12 03:19:02 +03:00
justheuristic | b370b43110 | freeze hivemind and transformers versions | 2022-06-12 03:18:53 +03:00
justheuristic | c1a908dc66 | Update README.md | 2022-06-12 03:13:40 +03:00
justheuristic | af04479cf9 | Initial commit | 2022-06-12 03:10:27 +03:00