Aleksandr Borzunov
f42e559c77
Update README.md
2 years ago
Alexander Borzunov
6beb686909
Add link to privacy & security Wiki (#144)
2 years ago
Alexander Borzunov
e99bf36647
Use common folder for all caches, make it a volume in Dockerfile (#141)
2 years ago
Alexander Borzunov
5f50ea9c79
Update Anaconda instructions (#140)
2 years ago
Alexander Borzunov
4cb0ac4718
Update texts in "Terms of use" and "Privacy and security" sections (#138)
2 years ago
Alexander Borzunov
a94c91d870
Add Docker commands, use permanent Discord links (#137)
2 years ago
Alexander Borzunov
66f1799d32
Set default --step_timeout to 5 min (#133)
2 years ago
Alexander Borzunov
b873d92ffa
Update README.md
2 years ago
Alexander Borzunov
5d5d2666b8
Mention parallel inference
2 years ago
Alexander Borzunov
955eae30b3
Mention 1 sec/token explicitly
2 years ago
Alexander Borzunov
33c210b973
Update Colab notebook
2 years ago
Max Ryabinin
3ca8b4f082
Fix typos with codespell (#126)
2 years ago
Alexander Borzunov
d29ef70c85
Update README.md
2 years ago
Alexander Borzunov
1d9aa77697
Update README.md
2 years ago
Alexander Borzunov
da36470a4b
Update README.md
2 years ago
Alexander Borzunov
81b94df14b
Rework readme, move code example to the top, link draft of Colab (#118)
2 years ago
Alexander Borzunov
7bd5916744
Make Petals a pip-installable package (attempt 2) (#102)
1. Petals can now be installed using `pip install git+https://github.com/bigscience-workshop/petals`
- If you have already cloned the repo, you can run `pip install .` or `pip install .[dev]`
2. Moved `src` => `src/petals`
- Replaced `from src.smth import smth` with `from petals.smth import smth`
3. Moved `cli` => `src/petals/cli`
- Replaced `python -m cli.run_smth` with `python -m petals.cli.run_smth` (all utilities are now available right after pip installation)
4. Moved the `requirements*.txt` contents to `setup.cfg` (`requirements.txt` for packages is not well supported by modern packaging utils)
5. Increased the package version from `0.2` to `1.0alpha1`
2 years ago
Alexander Borzunov
0c3781a89c
Shorten bullet points in readme
2 years ago
Alexander Borzunov
898f614515
Fix floating point issues in block_selection.py (#89)
2 years ago
Alexander Borzunov
c07a7e0812
Add "Terms of Use"
2 years ago
Artem Chumachenko
0d9c7de0bd
Add sst-2 ipynb example (#86)
- Add an sst-2 example of prompt-based training
- Make some enhancements to the persona-chat example
2 years ago
Alexander Borzunov
640bbc38a9
Make even smaller readability changes
2 years ago
Alexander Borzunov
d1b012b479
Make small readability & style changes to the instructions (#77)
2 years ago
justheuristic
fef48d7d99
Use bitsandbytes==0.34.0, update readme (#76)
* unlock bnb backward
* Fix bnb version in README
* Update requirements.txt
2 years ago
justheuristic
8caf1145a8
Quality of life changes: update readme, simplify run_server interface (#75)
- run_server now accepts model name as both positional and keyword argument
- changed names in README to account for interface updates
- moved model conversion from README to a separate wiki page
- updated requirements.txt
2 years ago
Alexander Borzunov
54ad745bed
Warn that current instructions involve 6B model but we will replace them soon (#63)
2 years ago
Alexander Borzunov
5f0c5329d4
Update readme with arxiv link and more discussions (#62)
Co-authored-by: justheuristic <justheuristic@gmail.com>
2 years ago
Alexander Borzunov
9bea7b9ea8
Update bullet points with feedback from Tim and other people (#61)
Co-authored-by: Tim Dettmers <tim.dettmers@gmail.com>
2 years ago
Alexander Borzunov
7653562aa1
Use latest version of Petals scheme, shrink Petals logo (#59)
2 years ago
Alexander Borzunov
2eb5843852
Update readme for the 1st public release (#57)
2 years ago
Pavel Samygin
0be21775af
remove transformer block, implement as sequential of size 1 (#54)
* remove transformer block, implement as sequence size 1
* reimplement get_remote_module
* fix readme
Co-authored-by: Alexander Borzunov <hxrussia@gmail.com>
Co-authored-by: Aleksandr Borzunov <borzunov.alexander@gmail.com>
2 years ago
justheuristic
d271b75dd4
Let users specify sequence length instead of assuming 2048 (#52)
- Maximum length is now provided in `.inference_session(max_length=100)`
- previously, we would always assume max length = 2048
- added a generic way to forward **kwargs to inference session
- for compatibility with #47
- Note to @borzunov: it does *not* pass them arbitrarily, but instead checks for kwarg names at the bottom level
- run_server can be started with a custom max_length for inference
- renamed --cache_size_bytes to --attention_cache_bytes (to avoid collision with --cache_dir)
- --attn_cache_bytes can now support human-readable file sizes (e.g. 300MB instead of 314572800)
- made some server-side errors more human-readable for the user (e.g. when max length is exceeded)
Co-authored-by: Aleksandr Borzunov <borzunov.alexander@gmail.com>
Co-authored-by: Alexander Borzunov <hxrussia@gmail.com>
2 years ago
Dmitry Baranchuk
11a424837f
integrate mixed-8bit model (#39)
* integrate mixed-8bit model
* Fix bug with model duplication in RAM
* set throughput=1.0 to fix zero throughput problem
* add revision support
* update hivemind and bitsandbytes
* update deploy scripts
* update installation instructions
2 years ago
Alexander Borzunov
7d39d46966
Use "PETALS" as the readme title (#40)
Since we've chosen the system name, let's use it in the repo name and the readme title.
2 years ago
justheuristic
ccdcefe405
Add instructions to test the full model (#25)
add instructions to test the full model
2 years ago
justheuristic
eb0a6be716
Clean up readme (#24)
Remove some deprecated sections of the README and turn on CI for the main branch
2 years ago
Alexander Borzunov
7e9f337a63
Remove excess line from readme
2 years ago
Alexander Borzunov
aba43f1308
Implement block selection on servers (#20)
2 years ago
justheuristic
4eadd00a2c
rm prefix from tests
2 years ago
justheuristic
2e90ac30a0
use default prefix in readme
2 years ago
justheuristic
4695071ad2
WIP: make DistributedBloom compliant with HF interface
2 years ago
justheuristic
8de7c1687b
list latest additions
2 years ago
justheuristic
d688cb0d22
stupid, slow, fragile, but correct full model inference
2 years ago
Aleksandr Borzunov
b78d713347
refactor, add swarm info
2 years ago
justheuristic
ca3c08acc1
Update README.md
2 years ago
justheuristic
1cdf8a77fb
basic chained inference (multiple blocks per one RPC call)
2 years ago
justheuristic
0e7afea026
Merge remote-tracking branch 'origin/main' into main
2 years ago
justheuristic
2eb47cbedd
support hosting multiple instances of the same block
2 years ago
justheuristic
14b6d04b0f
install hivemind from pip
2 years ago
justheuristic
f3722d52cf
Update README.md
2 years ago
justheuristic
aaaf0c2dad
better testing readme
2 years ago
justheuristic
2bf83b42e5
add testing guide
2 years ago
justheuristic
1ab5fb1630
fetch a specific bloom block without downloading the entire model
2 years ago
justheuristic
2d55e6e4fe
instructions to test distributed inference
2 years ago
justheuristic
9be7c81b78
instructions to test distributed inference
2 years ago
justheuristic
cc9a76625d
warn about long runtime
2 years ago
justheuristic
82214699f2
notes on hosting servers
2 years ago
justheuristic
1555d98f66
push converted model to hub
2 years ago
justheuristic
736f1d1085
push converted model to hub
2 years ago
justheuristic
7fba411dff
extended run_server example
2 years ago
justheuristic
5a15c13ca7
switch to hivemind-master
2 years ago
justheuristic
35310698f0
newer hivemind version
2 years ago
justheuristic
8959727dea
add minimalistic benchmarks
2 years ago
justheuristic
a798ea04a6
add minimalistic benchmarks
2 years ago
justheuristic
99059ae667
install script
2 years ago
justheuristic
c1a908dc66
Update README.md
2 years ago
justheuristic
af04479cf9
Initial commit
2 years ago