You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
petals/src/server
Alexander Borzunov c6e1b5a8e5
Add various server timeouts, lower --max_batch_size and --inference_max_length defaults (#97)
2 years ago
..
__init__.py basic backend 2 years ago
backend.py Add Beam Search decoding algorithm (#87) 2 years ago
block_selection.py Use public swarm by default (#92) 2 years ago
cache.py Add various server timeouts, lower --max_batch_size and --inference_max_length defaults (#97) 2 years ago
handler.py Add various server timeouts, lower --max_batch_size and --inference_max_length defaults (#97) 2 years ago
runtime.py Priority tasks (#47) 2 years ago
server.py Add various server timeouts, lower --max_batch_size and --inference_max_length defaults (#97) 2 years ago
task_pool.py Fix "Too many open files" during rebalancing (#83) 2 years ago
task_prioritizer.py Priority tasks (#47) 2 years ago
throughput.py Add automated tests (#23) 2 years ago