petals

History

justheuristic c08d09c4d3 Rewrite MemoryCache alloc_timeout logic (#434)	9 months ago

- rpc_inference: server will now accept allocation timeout from user, defaults to no timeout
- bugfix: inference timeout is now measured from the moment the request is received
    - previously, you would have to wait for your timeout plus the time it takes to sort through the queue (other users' timeout)
    - now, you get AllocationFailed if you had to wait for over (timeout) seconds - regardless of other users
- a request for inference with no timeout will now fail instantly if there is not enough memory available
- dtype number of bytes is now correctly determined for int, bool & other types

Co-authored-by: Your Name <you@example.com>
Co-authored-by: Alexander Borzunov <borzunov.alexander@gmail.com>
Co-authored-by: Aleksandr Borzunov <hxrussia@gmail.com>
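
The timeout semantics described in the commit message above can be illustrated with a small, self-contained sketch. This is not the Petals implementation: `ToyMemoryCache`, its `allocate`/`free` methods, and the `demo` helper are hypothetical names invented for illustration, and only the `AllocationFailed` name is taken from the commit message. The sketch assumes that the deadline is fixed when the request arrives, so time spent waiting behind other requests counts against the caller's own timeout, and that a request without a timeout fails immediately when memory is short.

```python
import asyncio
import time
from typing import List, Optional


class AllocationFailed(Exception):
    """Raised when memory cannot be reserved within the caller's timeout."""


class ToyMemoryCache:
    """Hypothetical stand-in for illustration; not the Petals MemoryCache API."""

    def __init__(self, max_size_bytes: int):
        self.max_size_bytes = max_size_bytes
        self.used_bytes = 0
        self._freed = asyncio.Condition()  # notified whenever memory is released

    async def allocate(self, size_bytes: int, timeout: Optional[float] = None) -> None:
        # Fix the deadline as soon as the request is received, so that waiting
        # behind other requests does not extend this caller's timeout.
        deadline = None if timeout is None else time.monotonic() + timeout
        async with self._freed:
            while self.used_bytes + size_bytes > self.max_size_bytes:
                if deadline is None:
                    # No timeout requested: fail instantly instead of queueing.
                    raise AllocationFailed("not enough memory and no timeout given")
                remaining = deadline - time.monotonic()
                if remaining <= 0:
                    raise AllocationFailed(f"could not allocate within {timeout} s")
                try:
                    await asyncio.wait_for(self._freed.wait(), remaining)
                except asyncio.TimeoutError:
                    raise AllocationFailed(f"could not allocate within {timeout} s") from None
            self.used_bytes += size_bytes

    async def free(self, size_bytes: int) -> None:
        async with self._freed:
            self.used_bytes -= size_bytes
            self._freed.notify_all()  # wake waiters so they re-check capacity


async def demo() -> List[str]:
    cache = ToyMemoryCache(max_size_bytes=100)
    await cache.allocate(80)
    outcomes = []
    try:
        await cache.allocate(50)  # no timeout: expected to fail immediately
    except AllocationFailed:
        outcomes.append("instant failure")
    try:
        await cache.allocate(50, timeout=0.1)  # waits up to 0.1 s, then fails
    except AllocationFailed:
        outcomes.append("timed out")
    await cache.free(80)
    await cache.allocate(50, timeout=0.1)  # enough memory now, succeeds at once
    outcomes.append("allocated")
    return outcomes


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

The cache is created inside the coroutine so that the `asyncio.Condition` is bound to the running event loop on older Python versions as well.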
bootstrap.id	Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)	10 months ago
conftest.py	Fix logging: do not duplicate lines, enable colors in Colab (#156)	1 year ago
server2.id	Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)	10 months ago
test_aux_functions.py	Add customizable input tensors (#445)	10 months ago
test_block_exact_match.py	Prioritize short inference, unmerge pools for long inference (#458)	10 months ago
test_cache.py	Rewrite MemoryCache alloc_timeout logic (#434)	9 months ago
test_chained_calls.py	Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)	10 months ago
test_dtype.py	Add LLaMA support (#323)	11 months ago
test_full_model.py	Make client compatible with transformers' GenerationMixin (#464)	10 months ago
test_peft.py	Support peft LoRA adapters (#335)	11 months ago
test_priority_pool.py	Fix issues related to `petals` as a module (#159)	1 year ago
test_remote_sequential.py	Add `blocked_servers` argument (#462)	10 months ago
test_sequence_manager.py	Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)	10 months ago
test_server_stats.py	Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)	10 months ago
test_tensor_parallel.py	Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)	10 months ago
test_utils.py	Support peft LoRA adapters (#335)	11 months ago