Commit Graph

51 Commits (7bd59167447adfcda646361f97dbfb47d4172072)

Author SHA1 Message Date
Alexander Borzunov 7bd5916744
Make Petals a pip-installable package (attempt 2) (#102)
1. Petals can be now installed using `pip install git+https://github.com/bigscience-workshop/petals`
    - In case if you already cloned the repo, you can do `pip install .` or `pip install .[dev]`
2. Moved `src` => `src/petals`
    - Replaced `from src.smth import smth` with `from petals.smth import smth`
3. Moved `cli` => `src/petals/cli`
    - Replaced `python -m cli.run_smth` with `python -m petals.cli.run_smth` (all utilities are now available right after pip installation)
4. Moved the `requirements*.txt` contents to `setup.cfg` (`requirements.txt` for packages is not supported well by modern packaging utils)
5. Increased the package version from `0.2` to `1.0alpha1`
1 year ago
Alexander Borzunov 0c3781a89c
Shorten bullet points in readme 1 year ago
Alexander Borzunov 898f614515
Fix floating point issues in block_selection.py (#89) 2 years ago
Alexander Borzunov c07a7e0812
Add "Terms of Use" 2 years ago
Artem Chumachenko 0d9c7de0bd
Add sst-2 ipynb example (#86)
- Add sst-2 example of a prompt-based training
- Have some enhancement in the persona-chat example
2 years ago
Alexander Borzunov 640bbc38a9
Make even smaller readability changes 2 years ago
Alexander Borzunov d1b012b479
Make small readability & style changes to the instructions (#77) 2 years ago
justheuristic fef48d7d99
Use bitsandbytes==0.34.0, update readme (#76)
* unlock bnb backward
* Fix bnb version in README
* Update requirements.txt
2 years ago
justheuristic 8caf1145a8
Quality of life changes: update readme, simplify run_server interface (#75)
- run_server now accepts model name as both positional and keyword argument
- changed names in README to account for interface updates
- moved model conversion from README to a separate wiki page
- updated requirements.txt
2 years ago
Alexander Borzunov 54ad745bed
Warn that current instructions involve 6B model but we will replace them soon (#63) 2 years ago
Alexander Borzunov 5f0c5329d4
Update readme with arxiv link and more discussions (#62)
Co-authored-by: justheuristic <justheuristic@gmail.com>
2 years ago
Alexander Borzunov 9bea7b9ea8
Update bullet points with feedback from Tim and other people (#61)
Co-authored-by: Tim Dettmers <tim.dettmers@gmail.com>
2 years ago
Alexander Borzunov 7653562aa1
Use latest version of Petals scheme, shrink Petals logo (#59) 2 years ago
Alexander Borzunov 2eb5843852
Update readme for the 1st public release (#57) 2 years ago
Pavel Samygin 0be21775af
remove transformer block, implement as sequential of size 1 (#54)
* remove transformer block, implement as sequence size 1
* reimplement get_remote_module
* fix readme

Co-authored-by: Alexander Borzunov <hxrussia@gmail.com>
Co-authored-by: Aleksandr Borzunov <borzunov.alexander@gmail.com>
2 years ago
justheuristic d271b75dd4
Let users specify sequence length instead of assuming 2048 (#52)
- Maximum length is now provided in `.inference_session(max_length=100)`
   - previously, we would always assume max length = 2048
- added a generic way to forward **kwargs to inference session
  - for compatibility with #47 
  - Note to @borzunov : it does *not* pass them arbitrarily, but instead checks for kwarg names at the bottom level
- run_server can be started with a custom max_length for inference
- renamed --cache_size_bytes to --attention_cache_bytes (to avoid collision with --cache_dir)
- --attn_cache_bytes can now support humane file sizes (e.g. 300MB instead of 314572800)
- made some server-side errors more human-readable to user (e.g. when max length is exceeded)

Co-authored-by: Aleksandr Borzunov <borzunov.alexander@gmail.com>
Co-authored-by: Alexander Borzunov <hxrussia@gmail.com>
2 years ago
Dmitry Baranchuk 11a424837f
integrate mixed-8bit model (#39)
* integrate mixed-8bit model
* Fix bug with model duplication in RAM
* set throughput=1.0 to fix zero throughput problem
* add revision support
* update hivemind and bitsandbytes
* update deploy scripts
* update installation instructions
2 years ago
Alexander Borzunov 7d39d46966
Use "PETALS" as the readme title (#40)
Since we've chosen the system name, let's use it in the repo name and the readme title.
2 years ago
justheuristic ccdcefe405
Add instructions to test the full model (#25)
add instructions to test the full model
2 years ago
justheuristic eb0a6be716
Clean up readme (#24)
Remove some deprecated sections of README and turns on CI for main branch
2 years ago
Alexander Borzunov 7e9f337a63
Remove excess line from readme 2 years ago
Alexander Borzunov aba43f1308
Implement block selection on servers (#20) 2 years ago
justheuristic 4eadd00a2c rm prefix from tests 2 years ago
justheuristic 2e90ac30a0 use default prefix in readme 2 years ago
justheuristic 4695071ad2 WIP: make DistributedBloom compliant with HF interface 2 years ago
justheuristic 8de7c1687b
list latest additions 2 years ago
justheuristic d688cb0d22 stupid, slow, fragile, but correct full model inference 2 years ago
Aleksandr Borzunov b78d713347 refactor, add swarm info 2 years ago
justheuristic ca3c08acc1
Update README.md 2 years ago
justheuristic 1cdf8a77fb basic chained inference (multiple blocks per one RPC call) 2 years ago
justheuristic 0e7afea026 Merge remote-tracking branch 'origin/main' into main 2 years ago
justheuristic 2eb47cbedd support hosting multiple instances of the same block 2 years ago
justheuristic 14b6d04b0f install hivemind from pip 2 years ago
justheuristic f3722d52cf
Update README.md 2 years ago
justheuristic aaaf0c2dad better testing readme 2 years ago
justheuristic 2bf83b42e5 add testing guide 2 years ago
justheuristic 1ab5fb1630 fetch a specific bloom block without downloading the entire model 2 years ago
justheuristic 2d55e6e4fe instructions to test distributed inference 2 years ago
justheuristic 9be7c81b78 instructions to test distributed inference 2 years ago
justheuristic cc9a76625d warn about long runtime 2 years ago
justheuristic 82214699f2 notes on hosting servers 2 years ago
justheuristic 1555d98f66 push converted model to hub 2 years ago
justheuristic 736f1d1085 push converted model to hub 2 years ago
justheuristic 7fba411dff extended run_serverexample 2 years ago
justheuristic 5a15c13ca7 switch to hivemind-master 2 years ago
justheuristic 35310698f0 newer hivemind version 2 years ago
justheuristic 8959727dea add minimalistic benchmarks 2 years ago
justheuristic a798ea04a6 add minimalistic benchmarks 2 years ago
justheuristic 99059ae667 install script 2 years ago
justheuristic c1a908dc66
Update README.md 2 years ago