petals

You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

History

justheuristic ac9b546706 [Refactor] extract block forward, backward and inference into a separate file (#435 ) This PR does not change any functionality. It merely moves stuff around. List of changes: handler.py/_rpc_forward became block_methods/rpc_forward handler.py/_rpc_backward became block_methods/rpc_backward the math bits of rpc_inference were extracted into block_methods/iterate_rpc_inference --------- Co-authored-by: Your Name <you@example.com> Co-authored-by: artek0chumak <artek.chumak@gmail.com> Co-authored-by: Aleksandr Borzunov <borzunov.alexander@gmail.com>		10 months ago
..
__init__.py	Make Petals a pip-installable package (attempt 2) (#102 )	2 years ago
backend.py	Split long sequences into chunks (#403 )	11 months ago
block_functions.py	[Refactor] extract block forward, backward and inference into a separate file (#435 )	10 months ago
block_selection.py	Use get_logger(__name__) instead of get_logger(__file__) (#265 )	1 year ago
block_utils.py	Support loading blocks in 4-bit (QLoRA NF4 format, disabled by default) (#333 )	11 months ago
from_pretrained.py	Fix routing through relay, default network RPS, --token, logging, readme (#399 )	11 months ago
handler.py	[Refactor] extract block forward, backward and inference into a separate file (#435 )	10 months ago
memory_cache.py	Fix deadlocks in MemoryCache (#396 )	11 months ago
reachability.py	Update to petals.dev (#390 )	11 months ago
server.py	Fix typo and make blocks message more informative (#437 )	10 months ago
task_pool.py	Use get_logger(__name__) instead of get_logger(__file__) (#265 )	1 year ago
task_prioritizer.py	Merge inference pools into one to increase inference speed (#225 )	1 year ago
throughput.py	Penalize servers that use relays during rebalancing (#428 )	10 months ago