petals

mirror of https://github.com/bigscience-workshop/petals synced 2024-10-31 09:20:41 +00:00

History

justheuristic 5af04524dd Split long sequences into chunks (#403 ) This PR is designed to avoid OOMs when processing long sequences that happen due to the huge attention logits matrices. Co-authored-by: Alexander Borzunov <borzunov.alexander@gmail.com>		2023-07-22 23:10:46 +04:00
..
check-style.yaml	Add Python 3.10 to CI (#299 )	2023-03-29 04:41:07 +04:00
push-docker-image.yaml	Merge inference pools into one to increase inference speed (#225 )	2023-01-19 19:38:21 +04:00
run-tests.yaml	Split long sequences into chunks (#403 )	2023-07-22 23:10:46 +04:00