You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
petals/src/petals/server
Alexander Borzunov a9b0e9ff1a
Support loading weights from Safetensors on server (#473)
9 months ago
..
__init__.py Make Petals a pip-installable package (attempt 2) (#102) 2 years ago
backend.py Split long sequences into chunks (#403) 11 months ago
block_functions.py Make client compatible with transformers' GenerationMixin (#464) 10 months ago
block_selection.py Use get_logger(__name__) instead of get_logger(__file__) (#265) 1 year ago
block_utils.py Override float32 in config to bfloat16 (#431) 10 months ago
from_pretrained.py Support loading weights from Safetensors on server (#473) 9 months ago
handler.py Move SequenceManagerConfig -> ClientConfig, petals.dht_utils -> petals.utils.dht (#463) 10 months ago
memory_cache.py Move SequenceManagerConfig -> ClientConfig, petals.dht_utils -> petals.utils.dht (#463) 10 months ago
reachability.py Update to petals.dev (#390) 11 months ago
server.py Move SequenceManagerConfig -> ClientConfig, petals.dht_utils -> petals.utils.dht (#463) 10 months ago
task_pool.py Use get_logger(__name__) instead of get_logger(__file__) (#265) 1 year ago
task_prioritizer.py Make client compatible with transformers' GenerationMixin (#464) 10 months ago
throughput.py Fix missing torch.cuda.synchronize for computing throughput (#456) 10 months ago