petals

You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

History

Artem Chumachenko b9f0a5467f Support peft LoRA adapters (#335 ) Implement an option to deploy PEFT adapters to a server. Clients can set active_adapter=... to use these adapters. --------- Co-authored-by: Aleksandr Borzunov <borzunov.alexander@gmail.com> Co-authored-by: justheuristic <justheuristic@gmail.com>		11 months ago
..
routing	Support peft LoRA adapters (#335 )	11 months ago
__init__.py	Add LLaMA support (#323 )	12 months ago
from_pretrained.py	Add LLaMA support (#323 )	12 months ago
inference_session.py	Implement direct server-to-server communication (#331 )	11 months ago
lm_head.py	Fix llama's lm_head.weight.requires_grad (#330 )	12 months ago
ptune.py	Fix llama's lm_head.weight.requires_grad (#330 )	12 months ago
remote_forward_backward.py	Lower payload size threshold for stream handlers (#251 )	1 year ago
remote_generation.py	Raise error for unexpected .generate() kwargs (#315 )	1 year ago
remote_sequential.py	Support peft LoRA adapters (#335 )	11 months ago
sequential_autograd.py	Remove unused imports and attributes (#324 )	1 year ago