You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
petals/src/petals/server
Max Ryabinin fa464dfc99 WIP Triton+QKV merge 9 months ago
..
__init__.py Make Petals a pip-installable package (attempt 2) (#102) 2 years ago
backend.py Rewrite MemoryCache alloc_timeout logic (#434) 9 months ago
block_functions.py Make client compatible with transformers' GenerationMixin (#464) 10 months ago
block_selection.py Use get_logger(__name__) instead of get_logger(__file__) (#265) 1 year ago
block_utils.py Rewrite MemoryCache alloc_timeout logic (#434) 9 months ago
from_pretrained.py Hide excess key message (#476) 9 months ago
handler.py Rewrite MemoryCache alloc_timeout logic (#434) 9 months ago
memory_cache.py Fix race condition in MemoryCache (#487) 9 months ago
reachability.py Support macOS (#477) 9 months ago
server.py Create model index in DHT (#491) 9 months ago
task_pool.py Remove no-op process in PrioritizedTaskPool (#484) 9 months ago
task_prioritizer.py Make client compatible with transformers' GenerationMixin (#464) 10 months ago
throughput.py WIP Triton+QKV merge 9 months ago