You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
petals/src/petals/bloom
Alexander Borzunov 2116df08bc
Fix deps, enable 8-bit by default for TP (#298)
This PR fixes issues of #290:

- hivemind bfloat16 codec crashed on dummy tensors (with 0 elements), see https://github.com/learning-at-home/hivemind/pull/560 (this PR makes Petals depend on the latest hivemind version from the repo, it's temporary)
- transformers version check mismatched with the version allowed in `setup.cfg`

Also:

- This PR enables 8-bit by default for TP. Even though TP in 8-bit may be slower, we currently prefer to host more blocks to increase the network's stability.
1 year ago
..
__init__.py Bump transformers to 4.25.1 (#151) 2 years ago
block.py Fix deps, enable 8-bit by default for TP (#298) 1 year ago
from_pretrained.py Speed up loading blocks using init with meta weights (#285) 1 year ago
modeling_utils.py Fix use_chunked_forward="auto" on non-x86_64 machines (#267) 1 year ago