petals/src/petals/client/routing
Alexander Borzunov 2a150770a4
Prefer longer servers for fine-tuning, exclude unreachable (#448)
We prefer servers that hold longer spans of blocks to minimize the number of hops, but keep some randomization to distribute the load. We also exclude servers known to be unreachable.
10 months ago
__init__.py Optimize RemoteSequenceManager (#106) 2 years ago
sequence_info.py Report inference, forward, and network RPS separately (#358) 11 months ago
sequence_manager.py Prefer longer servers for fine-tuning, exclude unreachable (#448) 10 months ago
spending_policy.py Optimize RemoteSequenceManager (#106) 2 years ago