21c3526ec1 (1 year ago)

**Why?**

- We'd like to avoid excess threads for the original sequence manager when we only use its slices (e.g. when we add adapters or need only a subset of model blocks).
- If we create a sequence manager just before a fork (e.g. in a web app backend or a multi-thread benchmark), we'd like to avoid excess threads in the original process and use the thread only in child processes where we actually call `.make_sequence()`.

Name | Last updated | |
---|---|---|
scripts | 2 years ago | |
conftest.py | 2 years ago | |
test.id | 2 years ago | |
test_aux_functions.py | 2 years ago | |
test_block_exact_match.py | 2 years ago | |
test_chained_calls.py | 2 years ago | |
test_full_model.py | 2 years ago | |
test_priority_pool.py | 2 years ago | |
test_remote_sequential.py | 1 year ago | |
test_sequence_manager.py | 1 year ago | |
test_server_stats.py | 2 years ago | |
test_tensor_parallel.py | 2 years ago | |
test_utils.py | 2 years ago |
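
The commit message at the top of this listing describes deferring the sequence manager's background thread until `.make_sequence()` is actually called. The following is a minimal sketch of that lazy-start pattern, not the project's implementation: the class `LazySequenceManager`, its `_update_loop` placeholder, the `thread_started` property, and `shutdown()` are all hypothetical names introduced for illustration. Only `.make_sequence()` comes from the commit message, and its body here is a stand-in.

```python
import threading


class LazySequenceManager:
    """Hypothetical stand-in that starts its worker thread on first use."""

    def __init__(self):
        # No thread is created here, so constructing the object
        # (e.g. before a fork, or just to take slices) stays cheap.
        self._thread = None
        self._lock = threading.Lock()
        self._stop = threading.Event()

    def _update_loop(self):
        # Placeholder for periodic background work; it simply
        # blocks until shutdown() is called.
        self._stop.wait()

    def _ensure_thread(self):
        # The lock guards against two callers starting the thread twice.
        with self._lock:
            if self._thread is None:
                self._thread = threading.Thread(
                    target=self._update_loop, daemon=True
                )
                self._thread.start()

    @property
    def thread_started(self):
        return self._thread is not None

    def make_sequence(self):
        # The background thread is started only when a sequence is requested.
        self._ensure_thread()
        return []  # placeholder result

    def shutdown(self):
        self._stop.set()
        if self._thread is not None:
            self._thread.join()
```

With this layout, a process that only constructs the manager never pays for the thread; a child process that calls `make_sequence()` after a fork starts its own.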