petals

Commit Graph

Author	SHA1	Message	Date
justheuristic	e51e84631d	Update to petals.dev (#390 ) Since `petals.ml` DNS record is still unavailable, we're switching everything to https://petals.dev Co-authored-by: Aleksandr Borzunov <hxrussia@gmail.com>	10 months ago
Alexander Borzunov	b1ff8bdd6c	Bump version to 2.0.0.post1 (#384 )	10 months ago
Alexander Borzunov	895327a0ae	Fix readme code example, require Python < 3.11 until supported (#374 ) * Fix readme code example * Require Python < 3.11 until it's supported	11 months ago
Alexander Borzunov	c735dd7ba3	Update transformers to 4.31.0 and peft to 0.4.0 (#371 )	11 months ago
Alexander Borzunov	3b300c32e4	Update readme to show new models (#365 )	11 months ago
Alexander Borzunov	9517dd1e3d	Update readme and "Getting started" link (#360 ) This updates readme with the latest updates and fixes an old Colab link, as pointed out in #359.	11 months ago
Alexander Borzunov	294970fe18	Update Colab link	11 months ago
Alexander Borzunov	515a5120cb	Mention LLaMA in readme (#344 )	11 months ago
Alexander Borzunov	8f6342a861	Refactor RemoteSequenceManager (#309 ) This PR: 1. Extracts `SequenceManagerConfig` and `SequenceManagerState` subclasses. The config is provided by caller and never changed from inside `RemoteSequenceManager`. The state is a part of the `RemoteSequenceManager`'s state shared between the main manager and its slices. We fix some slicing bugs along the way. 2. Removes `dht_prefix` and `p2p` arguments, makes `dht` argument optional. `dht_prefix` can always be overridden using `config.dht_prefix`. `p2p` actually needed only under the hood of `RemoteSequenceManager`, so it can extract it by itself without exposing this low-level class to callers. If strictly necessary, a caller can provide `p2p` as a part of `SequenceManagerState`. `dht` is also needed only by `RemoteSequenceManager`, so we can make it optional in the parent classes and create it automatically when it's not provided. 3. Simplifies retry logic. Previously, we could have "nested" retry loops: one in `._update()`, another in inference/forward/backward steps. The loop in `._update()` could introduce issues to concurrent inference/forward/backward calls, since it blocks the entire class if its delay period becomes too high. Now this logic is simplified: `._update()` performs only one attempt to fetch the DHT info, any retries are triggered by the inference/forward/backward steps. 4. Removes deprecated `RemoteTransformerBlock`. `RemoteTransformerBlock` was deprecated a long time ago, before Petals 1.0.0. Its removal is long due. 5. Removes `dht_utils.get_remote_module()`, `dht_utils.get_remote_sequence()`. This functions duplicate the functionality of the `RemoteSequential` constructor. 6. (minor) Removes `RemoteSequential.is_subsequence` flag. This flag worked incorrectly and was never used. I am removing it for the sake of simplicity.	1 year ago
Alexander Borzunov	5c0b4286b2	Suggest commands for Docker first (#304 )	1 year ago
Alexander Borzunov	a7d3d02194	Fix invalid author email in setup.cfg (#287 )	1 year ago
Alexander Borzunov	8dab37c1a9	Add benchmarks to readme (#284 )	1 year ago
Alexander Borzunov	42594e5173	Link FAQ in readme (#260 )	1 year ago
Alexander Borzunov	5d7395e1b5	Prompt-tuning notebooks: suggest to use a smaller model for faster prototyping (#234 )	1 year ago
Muhtasham Oblokulov	0ebf6de117	Add citation to readme (#219 ) Co-authored-by: Alexander Borzunov <borzunov.alexander@gmail.com>	1 year ago
Alexander Borzunov	fa5ac6e3b4	Mention BLOOMZ in readme (#221 )	1 year ago
Alexander Borzunov	e651d73f11	Add one more link to the "Getting started" tutorial (#218 ) Some people miss the "Try now in Colab" link or don't understand that it leads to the comprehensive tutorial, so I added one more explicit link.	1 year ago
Alexander Borzunov	37373a66c3	Update Anaconda installation commands (#205 )	1 year ago
Alexander Borzunov	391c855208	Add readme subsections (#185 )	1 year ago
Alexander Borzunov	f344c7801b	Add link to health.petals.ml to readme (#184 )	1 year ago
Alexander Borzunov	27406a9377	Add more links to BLOOM to readme (#183 )	1 year ago
Alexander Borzunov	e27706358c	Use slightly less memory in .generate() (#177 )	1 year ago
Alexander Borzunov	356e099c3d	Make Docker command more visible (#175 )	1 year ago
Alexander Borzunov	779959bc70	Add link to PyPI (#173 )	1 year ago
Alexander Borzunov	cdc3b6a25a	Add PyPI badge, update instructions and links in readme (#172 )	1 year ago
justheuristic	4014442a0f	Fix instruction for developers (#170 )	1 year ago
Alexander Borzunov	26e6120288	Fix code example in readme (#169 ) Makes it closer to runnable code, except for imports and defining tokenizer & data loader.	1 year ago
Alexander Borzunov	0b0277ed6f	Add link to chat.petals.ml (#168 )	1 year ago
Vadim Peretokin	50fb8205de	Correct grammar in readme (#166 )	1 year ago
Alexander Borzunov	714da529e6	Update wording in readme (#165 )	1 year ago
Alexander Borzunov	523a7cad33	Fix issues related to `petals` as a module (#159 ) 1. Added `from petals.client import *` to `petals/__init__.py`, so you can write just that: ```python from petals import DistributedBloomForCausalLM ``` I didn't do the same with server, since its classes are supposed to by used by `petals.cli.run_server`, not end-users. Though it's still possible to do `from petals.server.smth import smth` if necessary. 2. Fixed one more logging issue: log lines from hivemind were shown twice due to a bug in #156. 3. Removed unused `runtime.py`, since the server actually uses `hivemind.moe.Runtime`, and `runtime.py` has no significant changes comparing to it.	1 year ago
Aleksandr Borzunov	f42e559c77	Update README.md	1 year ago
Alexander Borzunov	6beb686909	Add link to privacy & security Wiki (#144 )	1 year ago
Alexander Borzunov	e99bf36647	Use common folder for all caches, make it a volume in Dockerfile (#141 )	1 year ago
Alexander Borzunov	5f50ea9c79	Update Anaconda instructions (#140 )	1 year ago
Alexander Borzunov	4cb0ac4718	Update texts in "Terms of use" and "Privacy and security" sections (#138 )	1 year ago
Alexander Borzunov	a94c91d870	Add Docker commands, use permanent Discord links (#137 )	1 year ago
Alexander Borzunov	66f1799d32	Set default --step_timeout to 5 min (#133 )	2 years ago
Alexander Borzunov	b873d92ffa	Update README.md	2 years ago
Alexander Borzunov	5d5d2666b8	Mention parallel inference	2 years ago
Alexander Borzunov	955eae30b3	Mention 1 sec/token explicitly	2 years ago
Alexander Borzunov	33c210b973	Update Colab notebook	2 years ago
Max Ryabinin	3ca8b4f082	Fix typos with codespell (#126 )	2 years ago
Alexander Borzunov	d29ef70c85	Update README.md	2 years ago
Alexander Borzunov	1d9aa77697	Update README.md	2 years ago
Alexander Borzunov	da36470a4b	Update README.md	2 years ago
Alexander Borzunov	81b94df14b	Rework readme, move code example to the top, link draft of Colab (#118 )	2 years ago
Alexander Borzunov	7bd5916744	Make Petals a pip-installable package (attempt 2) (#102 ) 1. Petals can be now installed using `pip install git+https://github.com/bigscience-workshop/petals` - In case if you already cloned the repo, you can do `pip install .` or `pip install .[dev]` 2. Moved `src` => `src/petals` - Replaced `from src.smth import smth` with `from petals.smth import smth` 3. Moved `cli` => `src/petals/cli` - Replaced `python -m cli.run_smth` with `python -m petals.cli.run_smth` (all utilities are now available right after pip installation) 4. Moved the `requirements*.txt` contents to `setup.cfg` (`requirements.txt` for packages is not supported well by modern packaging utils) 5. Increased the package version from `0.2` to `1.0alpha1`	2 years ago
Alexander Borzunov	0c3781a89c	Shorten bullet points in readme	2 years ago
Alexander Borzunov	898f614515	Fix floating point issues in block_selection.py (#89 )	2 years ago

1 2

98 Commits (lru)