petals/tests
justheuristic 5f58f00649
Return available cache size in rpc_info() (#191)
This PR makes servers return their free cache size, measured in tokens * layers so that the value is compression-agnostic.

It is intended for use when calling make_sequence(optimize="inference").
1 year ago
scripts Fix arguments in remove_old_models.py (#153) 1 year ago
conftest.py Fix logging: do not duplicate lines, enable colors in Colab (#156) 1 year ago
test.id Add automated tests (#23) 2 years ago
test_aux_functions.py Add local tensor-parallel fwd/bwd (#143) 1 year ago
test_block_exact_match.py Add local tensor-parallel fwd/bwd (#143) 1 year ago
test_chained_calls.py Make Petals a pip-installable package (attempt 2) (#102) 1 year ago
test_full_model.py Fix logging: do not duplicate lines, enable colors in Colab (#156) 1 year ago
test_linear8bitlt.py Support --load_in_8bit on pre-Turing GPUs (#113) 1 year ago
test_priority_pool.py Fix issues related to `petals` as a module (#159) 1 year ago
test_remote_sequential.py Add local tensor-parallel fwd/bwd (#143) 1 year ago
test_sequence_manager.py Use length-weighted sampling in routing for inference (#204) 1 year ago
test_server_stats.py Return available cache size in rpc_info() (#191) 1 year ago
test_tensor_parallel.py Increase tolerances in test_tp_block (#196) 1 year ago
test_utils.py Implement RemoteSequential slicing and extra repr, add tests (#30) 2 years ago