mirror of
https://github.com/bigscience-workshop/petals
synced 2024-10-31 09:20:41 +00:00
5af04524dd
This PR is designed to avoid OOMs when processing long sequences that happen due to the huge attention logits matrices. Co-authored-by: Alexander Borzunov <borzunov.alexander@gmail.com> |
||
---|---|---|
.. | ||
check-style.yaml | ||
push-docker-image.yaml | ||
run-tests.yaml |