Commit Graph

28 Commits

Author SHA1 Message Date
justheuristic
eea9287182 RemoteTransformerBlock 2022-06-14 09:45:22 +03:00
justheuristic
3e9fd63a02 RemoteTransformerBlock 2022-06-14 09:45:06 +03:00
justheuristic
1cca611c9f undo rename 2022-06-14 09:23:32 +03:00
justheuristic
7ce7cd7a97 basic backend 2022-06-14 08:49:42 +03:00
justheuristic
1c49bcb741 basic backend 2022-06-14 08:26:05 +03:00
justheuristic
3215945882 use logger 2022-06-14 08:25:06 +03:00
justheuristic
ce5dedd2c7 rename 2022-06-14 05:09:13 +03:00
justheuristic
8f5d022d18 save hidden size 2022-06-14 05:09:07 +03:00
justheuristic
e2e9d0e94c memory cache for attention KVs 2022-06-13 00:47:55 +03:00
justheuristic
e5e8c9ed12 expel all bloom-specific files to src.bloom 2022-06-12 21:44:36 +03:00
justheuristic
324ea2dc96 save non-transformer params separately 2022-06-12 09:35:58 +03:00
justheuristic
54cf292374 backend schema 2022-06-12 08:40:33 +03:00
justheuristic
3ccd0b5e2d account for layer_past in alibi 2022-06-12 07:21:17 +03:00
justheuristic
fb3bfbb78f optimization TODOs for later 2022-06-12 06:54:12 +03:00
justheuristic
902bf6400a [temp workaround] create alibi 2022-06-12 06:50:19 +03:00
justheuristic
05faa0b3c8 add quantization script for cpu 2022-06-12 05:59:11 +03:00
justheuristic
ffb56a65ed keep hidden_size as property 2022-06-12 05:11:28 +03:00
justheuristic
ffce7718ac layer.py -> block.py 2022-06-12 05:09:28 +03:00
justheuristic
9c929b6f3b layer.py -> bloom_block.py 2022-06-12 05:08:18 +03:00
justheuristic
79925fc5f7 [more disclaimers] 2022-06-12 05:03:16 +03:00
justheuristic
fde9d50850 remove slow_but_exact, add quantization 2022-06-12 05:00:01 +03:00
justheuristic
43399d7898 remove slow_but_exact, add quantization 2022-06-12 04:56:26 +03:00
justheuristic
51e96ac19b adopt transformers bloom model
Co-authored-by: Thomwolf <thomwolf@gmail.com>
Co-authored-by: Thomas Wolf <thomas@huggingface.co>
Co-authored-by: thomasw21 <24695242+thomasw21@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: sIncerass <sheng.s@berkeley.edu>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Niklas Muennighoff <n.muennighoff@gmail.com>
Co-authored-by: Nicolas Patry <Narsil@users.noreply.github.com>
Co-authored-by: thomasw21 <thomasw21@users.noreply.github.com>
Co-authored-by: sgugger <sgugger@users.noreply.github.com>
Co-authored-by: patrickvonplaten <patrickvonplaten@users.noreply.github.com>
Co-authored-by: LysandreJik <LysandreJik@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: justheuristic <justheuristic@gmail.com>
Co-authored-by: Stas Bekman <stas@stason.org>
2022-06-12 04:24:12 +03:00
justheuristic
99059ae667 install script 2022-06-12 04:23:38 +03:00
justheuristic
71717c5f00 Merge remote-tracking branch 'origin/main' into main 2022-06-12 03:19:02 +03:00
justheuristic
b370b43110 freeze hivemind and transformers versions 2022-06-12 03:18:53 +03:00
justheuristic
c1a908dc66 Update README.md 2022-06-12 03:13:40 +03:00
justheuristic
af04479cf9 Initial commit 2022-06-12 03:10:27 +03:00