You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
petals/src/petals
Artem Chumachenko d2fcbbc72e
Add Mixtral models (#553)
* Add somehow workable version

* Fix generation

* Fixes

* Choose right attn

* style

* fix bloom

* remove unnes

* Update src/petals/models/mixtral/model.py

Co-authored-by: Max Ryabinin <mryabinin0@gmail.com>

* fix order of init

---------

Co-authored-by: Max Ryabinin <mryabinin0@gmail.com>
2 months ago
..
cli Improve default arguments for clients and servers (#530) 8 months ago
client Bump transformers and accelerate versions (#554) 4 months ago
models Add Mixtral models (#553) 2 months ago
server Add Mixtral models (#553) 2 months ago
utils Bump transformers and accelerate versions (#554) 4 months ago
__init__.py Fix p2p pushing in rpc_inference (by @miaoqijun ) , support transformers 4.38.2 (#563) 3 months ago
constants.py Update to petals.dev (#390) 11 months ago
data_structures.py Store (start_block, end_block) in each DHT record for reliability (#510) 9 months ago
dht_utils.py Move SequenceManagerConfig -> ClientConfig, petals.dht_utils -> petals.utils.dht (#463) 10 months ago