petals

mirror of https://github.com/bigscience-workshop/petals synced 2024-10-31 09:20:41 +00:00

Alexander Borzunov 21c3526ec1 Start SequenceManager's thread only after first .make_sequence() (#301)	2023-04-12 21:38:43 +04:00

Why?
- We'd like to avoid excess threads for the original sequence manager when we only use its slices (e.g. when we add adapters or need only a subset of model blocks).
- If we create a sequence manager just before a fork (e.g. in a web app backend or a multi-thread benchmark), we'd like to avoid excess threads in the original process and only use this thread in child processes where we actually call `.make_sequence()`.
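
The lazy thread start described in the commit message above can be illustrated with a minimal sketch. This is not the Petals implementation: `LazySequenceManager`, `block_indices`, `thread_started`, and `shutdown` are hypothetical names chosen for illustration, and the slice here is an independent object, whereas the real sequence manager may share state between a manager and its slices. The sketch only shows the idea that construction and slicing start no thread, and the first `.make_sequence()` call does.

```python
import threading


class LazySequenceManager:
    """Hypothetical stand-in for a sequence manager (not the Petals class)."""

    def __init__(self, block_indices):
        self.block_indices = list(block_indices)
        self._thread = None                # no background thread yet
        self._lock = threading.Lock()      # guards the one-time thread start
        self._stop = threading.Event()

    @property
    def thread_started(self):
        return self._thread is not None

    def __getitem__(self, key):
        # Slicing only builds a new manager over a subset of blocks;
        # it starts no thread in the original or in the slice.
        return LazySequenceManager(self.block_indices[key])

    def _run(self):
        # Placeholder for periodic background work; here it just waits
        # until shutdown() is called.
        self._stop.wait()

    def make_sequence(self):
        # The background thread is started on the first call only.
        with self._lock:
            if self._thread is None:
                self._thread = threading.Thread(target=self._run, daemon=True)
                self._thread.start()
        return list(self.block_indices)    # placeholder for a real route

    def shutdown(self):
        self._stop.set()
        if self._thread is not None:
            self._thread.join()


manager = LazySequenceManager(range(8))
sub = manager[2:5]            # no thread is started by slicing
route = sub.make_sequence()   # the slice's thread starts here
sub.shutdown()
```

With this layout, a manager created just before a fork carries no running thread into the fork; each process that actually calls `.make_sequence()` starts its own.
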
scripts	Fix arguments in remove_old_models.py (#153)	2022-12-13 19:01:12 +03:00
conftest.py	Fix logging: do not duplicate lines, enable colors in Colab (#156)	2022-12-15 09:12:18 +04:00
test_aux_functions.py	Speed up loading blocks using init with meta weights (#285)	2023-03-13 00:49:04 +03:00
test_block_exact_match.py	Speed up loading blocks using init with meta weights (#285)	2023-03-13 00:49:04 +03:00
test_chained_calls.py	Speed up loading blocks using init with meta weights (#285)	2023-03-13 00:49:04 +03:00
test_full_model.py	Speed up loading blocks using init with meta weights (#285)	2023-03-13 00:49:04 +03:00
test_priority_pool.py	Fix issues related to `petals` as a module (#159)	2022-12-16 09:09:06 +04:00
test_remote_sequential.py	Start SequenceManager's thread only after first .make_sequence() (#301)	2023-04-12 21:38:43 +04:00
test_sequence_manager.py	Start SequenceManager's thread only after first .make_sequence() (#301)	2023-04-12 21:38:43 +04:00
test_server_stats.py	Speed up loading blocks using init with meta weights (#285)	2023-03-13 00:49:04 +03:00
test_tensor_parallel.py	Speed up loading blocks using init with meta weights (#285)	2023-03-13 00:49:04 +03:00
test_utils.py	Implement RemoteSequential slicing and extra repr, add tests (#30)	2022-07-19 04:28:04 +03:00
test.id	Add automated tests (#23)	2022-07-16 01:59:23 +03:00