You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
petals/src/petals/cli
Alexander Borzunov dd4a3230bc
Add Falcon support (#499)
This PR adds:

- Support for models based on `transformers.FalconModel` (the in-library format for Falcon). Tested on Falcon-40B.
- CI tests for Falcon-RW-1B.
- `--throughput dry_run` option to evaluate throughput and exit right away (implemented by @mryab).

Limitations:

- Backward pass support is broken for now, will be fixed in #500.

Co-authored-by: Max Ryabinin <mryabinin0@gmail.com>
9 months ago
..
__init__.py Make Petals a pip-installable package (attempt 2) (#102) 2 years ago
run_dht.py Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) 10 months ago
run_prod_server.sh Make server use smart defaults (#115) 2 years ago
run_server.py Add Falcon support (#499) 9 months ago