
85 Commits (5d7395e1b55a9b4b308ab0a4b0df818152148514)

Author SHA1 Message Date
Alexander Borzunov 5d7395e1b5
Prompt-tuning notebooks: suggest to use a smaller model for faster prototyping (#234) 1 year ago
Muhtasham Oblokulov 0ebf6de117
Add citation to readme (#219)
Co-authored-by: Alexander Borzunov <borzunov.alexander@gmail.com>
1 year ago
Alexander Borzunov fa5ac6e3b4
Mention BLOOMZ in readme (#221) 1 year ago
Alexander Borzunov e651d73f11
Add one more link to the "Getting started" tutorial (#218)
Some people miss the "Try now in Colab" link or don't understand that it leads to the comprehensive tutorial, so I added one more explicit link.
1 year ago
Alexander Borzunov 37373a66c3
Update Anaconda installation commands (#205) 1 year ago
Alexander Borzunov 391c855208
Add readme subsections (#185) 1 year ago
Alexander Borzunov f344c7801b
Add link to health.petals.ml to readme (#184) 1 year ago
Alexander Borzunov 27406a9377
Add more links to BLOOM to readme (#183) 1 year ago
Alexander Borzunov e27706358c
Use slightly less memory in .generate() (#177) 1 year ago
Alexander Borzunov 356e099c3d
Make Docker command more visible (#175) 1 year ago
Alexander Borzunov 779959bc70
Add link to PyPI (#173) 1 year ago
Alexander Borzunov cdc3b6a25a
Add PyPI badge, update instructions and links in readme (#172) 1 year ago
justheuristic 4014442a0f
Fix instruction for developers (#170) 1 year ago
Alexander Borzunov 26e6120288
Fix code example in readme (#169)
Makes it closer to runnable code, except for imports and defining tokenizer & data loader.
1 year ago
Alexander Borzunov 0b0277ed6f
Add link to chat.petals.ml (#168) 1 year ago
Vadim Peretokin 50fb8205de
Correct grammar in readme (#166) 1 year ago
Alexander Borzunov 714da529e6
Update wording in readme (#165) 1 year ago
Alexander Borzunov 523a7cad33
Fix issues related to `petals` as a module (#159)
1. Added `from petals.client import *` to `petals/__init__.py`, so you can simply write:

    ```python
    from petals import DistributedBloomForCausalLM
    ```

    I didn't do the same with the server, since its classes are supposed to be used by `petals.cli.run_server`, not by end users. It's still possible to do `from petals.server.smth import smth` if necessary.

2. Fixed one more logging issue: log lines from hivemind were shown twice due to a bug in #156.

3. Removed the unused `runtime.py`, since the server actually uses `hivemind.moe.Runtime`, and `runtime.py` has no significant changes compared to it.
1 year ago
Aleksandr Borzunov f42e559c77
Update README.md 1 year ago
Alexander Borzunov 6beb686909
Add link to privacy & security Wiki (#144) 1 year ago
Alexander Borzunov e99bf36647
Use common folder for all caches, make it a volume in Dockerfile (#141) 1 year ago
Alexander Borzunov 5f50ea9c79
Update Anaconda instructions (#140) 1 year ago
Alexander Borzunov 4cb0ac4718
Update texts in "Terms of use" and "Privacy and security" sections (#138) 1 year ago
Alexander Borzunov a94c91d870
Add Docker commands, use permanent Discord links (#137) 1 year ago
Alexander Borzunov 66f1799d32
Set default --step_timeout to 5 min (#133) 1 year ago
Alexander Borzunov b873d92ffa
Update README.md 1 year ago
Alexander Borzunov 5d5d2666b8
Mention parallel inference 1 year ago
Alexander Borzunov 955eae30b3
Mention 1 sec/token explicitly 1 year ago
Alexander Borzunov 33c210b973
Update Colab notebook 1 year ago
Max Ryabinin 3ca8b4f082
Fix typos with codespell (#126) 1 year ago
Alexander Borzunov d29ef70c85
Update README.md 1 year ago
Alexander Borzunov 1d9aa77697
Update README.md 1 year ago
Alexander Borzunov da36470a4b
Update README.md 1 year ago
Alexander Borzunov 81b94df14b
Rework readme, move code example to the top, link draft of Colab (#118) 1 year ago
Alexander Borzunov 7bd5916744
Make Petals a pip-installable package (attempt 2) (#102)
1. Petals can now be installed using `pip install git+https://github.com/bigscience-workshop/petals`
    - If you have already cloned the repo, you can do `pip install .` or `pip install .[dev]`
2. Moved `src` => `src/petals`
    - Replaced `from src.smth import smth` with `from petals.smth import smth`
3. Moved `cli` => `src/petals/cli`
    - Replaced `python -m cli.run_smth` with `python -m petals.cli.run_smth` (all utilities are now available right after pip installation)
4. Moved the `requirements*.txt` contents to `setup.cfg` (`requirements.txt` for packages is not supported well by modern packaging utils)
5. Increased the package version from `0.2` to `1.0alpha1`
1 year ago
Alexander Borzunov 0c3781a89c
Shorten bullet points in readme 1 year ago
Alexander Borzunov 898f614515
Fix floating point issues in block_selection.py (#89) 2 years ago
Alexander Borzunov c07a7e0812
Add "Terms of Use" 2 years ago
Artem Chumachenko 0d9c7de0bd
Add sst-2 ipynb example (#86)
- Add an sst-2 example of prompt-based training
- Make some enhancements to the persona-chat example
2 years ago
Alexander Borzunov 640bbc38a9
Make even smaller readability changes 2 years ago
Alexander Borzunov d1b012b479
Make small readability & style changes to the instructions (#77) 2 years ago
justheuristic fef48d7d99
Use bitsandbytes==0.34.0, update readme (#76)
* unlock bnb backward
* Fix bnb version in README
* Update requirements.txt
2 years ago
justheuristic 8caf1145a8
Quality of life changes: update readme, simplify run_server interface (#75)
- run_server now accepts model name as both positional and keyword argument
- changed names in README to account for interface updates
- moved model conversion from README to a separate wiki page
- updated requirements.txt
2 years ago
Alexander Borzunov 54ad745bed
Warn that current instructions involve 6B model but we will replace them soon (#63) 2 years ago
Alexander Borzunov 5f0c5329d4
Update readme with arxiv link and more discussions (#62)
Co-authored-by: justheuristic <justheuristic@gmail.com>
2 years ago
Alexander Borzunov 9bea7b9ea8
Update bullet points with feedback from Tim and other people (#61)
Co-authored-by: Tim Dettmers <tim.dettmers@gmail.com>
2 years ago
Alexander Borzunov 7653562aa1
Use latest version of Petals scheme, shrink Petals logo (#59) 2 years ago
Alexander Borzunov 2eb5843852
Update readme for the 1st public release (#57) 2 years ago
Pavel Samygin 0be21775af
remove transformer block, implement as sequential of size 1 (#54)
* remove transformer block, implement as sequence size 1
* reimplement get_remote_module
* fix readme

Co-authored-by: Alexander Borzunov <hxrussia@gmail.com>
Co-authored-by: Aleksandr Borzunov <borzunov.alexander@gmail.com>
2 years ago
justheuristic d271b75dd4
Let users specify sequence length instead of assuming 2048 (#52)
- Maximum length is now provided in `.inference_session(max_length=100)`
   - previously, we would always assume max length = 2048
- added a generic way to forward **kwargs to inference session
  - for compatibility with #47 
  - Note to @borzunov: it does *not* pass them arbitrarily, but instead checks for kwarg names at the bottom level
- run_server can be started with a custom max_length for inference
- renamed --cache_size_bytes to --attention_cache_bytes (to avoid collision with --cache_dir)
- --attn_cache_bytes can now support human-readable file sizes (e.g. 300MB instead of 314572800)
- made some server-side errors more human-readable to the user (e.g. when max length is exceeded)
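
To illustrate the size-string bullet above, here is a minimal sketch of how a value such as `300MB` could be turned into a byte count. It is not the code from this commit: `parse_size` and `_UNITS` are hypothetical names, and binary multipliers (1 MB = 1024 * 1024 bytes) are assumed only because the example above equates 300MB with 314572800.

```python
import re

# Hypothetical helper, not the Petals implementation. Binary multipliers are an
# assumption inferred from the example "300MB instead of 314572800".
_UNITS = {"": 1, "B": 1, "KB": 1024, "MB": 1024**2, "GB": 1024**3, "TB": 1024**4}


def parse_size(text: str) -> int:
    """Convert a string such as '300MB' (or a plain byte count) to a number of bytes."""
    match = re.fullmatch(r"\s*(\d+(?:\.\d+)?)\s*([A-Za-z]*)\s*", text)
    if match is None:
        raise ValueError(f"Cannot parse size: {text!r}")
    number, unit = match.groups()
    unit = unit.upper()
    if unit not in _UNITS:
        raise ValueError(f"Unknown unit in size: {text!r}")
    # Truncate toward zero so fractional inputs still yield a whole byte count.
    return int(float(number) * _UNITS[unit])
```

With this sketch, a plain integer string is treated as a byte count, so both spellings from the bullet map to the same value.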

Co-authored-by: Aleksandr Borzunov <borzunov.alexander@gmail.com>
Co-authored-by: Alexander Borzunov <hxrussia@gmail.com>
2 years ago