petals/.github/workflows
Artem Chumachenko d6f4f80f3f
Fix Mixtral-related issues (#570)
This PR fixes problems related to #569:
- block initialization
- throughput calculation and cache usage
- mixtral in tests

Beam search is removed for Mixtral and Llama for now. Those models use DynamicCache, which requires a special function to change the cache (see https://github.com/huggingface/transformers/blob/main/src/transformers/cache_utils.py#L161).
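
For context, beam search has to reorder the cached keys and values whenever beams are pruned or duplicated, which is why a cache object needs a dedicated function for it. A minimal sketch of that idea in plain Python follows; `LayerCache` and `reorder_cache` here are hypothetical names for illustration and are not the transformers or Petals API.

```python
from typing import List, Sequence, Tuple

# Hypothetical stand-in for a key/value cache: one (keys, values) pair per
# layer, where each list holds one entry per beam.
LayerCache = Tuple[List[str], List[str]]


def reorder_cache(cache: Sequence[LayerCache], beam_idx: Sequence[int]) -> List[LayerCache]:
    """Return a new cache whose i-th beam is the old beam_idx[i]-th beam."""
    return [
        ([keys[i] for i in beam_idx], [values[i] for i in beam_idx])
        for keys, values in cache
    ]


cache = [
    (["k0_a", "k0_b", "k0_c"], ["v0_a", "v0_b", "v0_c"]),  # layer 0
    (["k1_a", "k1_b", "k1_c"], ["v1_a", "v1_b", "v1_c"]),  # layer 1
]

# Suppose a beam search step keeps beam 2 twice and beam 0 once.
reordered = reorder_cache(cache, beam_idx=[2, 0, 2])
```

Every layer is reordered with the same indices, so each beam keeps a consistent history across layers.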

---------

Co-authored-by: Max Ryabinin <mryabinin0@gmail.com>
2 weeks ago
check-style.yaml Add Python 3.10 to CI (#299) 1 year ago
push-docker-image.yaml Clean disk space in push-docker-image.yaml (#558) 2 months ago
run-tests.yaml Fix Mixtral-related issues (#570) 2 weeks ago