You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
petals/tests
justheuristic a2634001e9
Reduce vocabulary size in test model, fix bug in routing when overlapped (#45)
This PR reduces this vocabulary size to save memory during conversion, keeping only the first 50k tokens
As a result, 

* tests that load client-side embeddings need significantly less RAM
* we can now run CI tests with 4 servers instead of 2 - needed to test routing - see bugs uncovered
* some of the servers now use load balancing
* CI convert_model now takes 4-5 minutes (was 6-7)
2 years ago
..
scripts Reduce vocabulary size in test model, fix bug in routing when overlapped (#45) 2 years ago
conftest.py Implement RemoteSequential slicing and extra repr, add tests (#30) 2 years ago
test.id Add automated tests (#23) 2 years ago
test_block_exact_match.py Implement RemoteSequential slicing and extra repr, add tests (#30) 2 years ago
test_chained_calls.py Implement RemoteSequential slicing and extra repr, add tests (#30) 2 years ago
test_full_model.py Reduce vocabulary size in test model, fix bug in routing when overlapped (#45) 2 years ago
test_remote_sequential.py Deep distributed prompt tuning (#42) 2 years ago
test_utils.py Implement RemoteSequential slicing and extra repr, add tests (#30) 2 years ago