You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
petals/src/petals/client
Artem Chumachenko b9f0a5467f
Support peft LoRA adapters (#335)
Implement an option to deploy PEFT adapters to a server. Clients can set active_adapter=... to use these adapters.

---------

Co-authored-by: Aleksandr Borzunov <borzunov.alexander@gmail.com>
Co-authored-by: justheuristic <justheuristic@gmail.com>
11 months ago
..
routing Support peft LoRA adapters (#335) 11 months ago
__init__.py Add LLaMA support (#323) 12 months ago
from_pretrained.py Add LLaMA support (#323) 12 months ago
inference_session.py Implement direct server-to-server communication (#331) 11 months ago
lm_head.py Fix llama's lm_head.weight.requires_grad (#330) 12 months ago
ptune.py Fix llama's lm_head.weight.requires_grad (#330) 12 months ago
remote_forward_backward.py Lower payload size threshold for stream handlers (#251) 1 year ago
remote_generation.py Raise error for unexpected .generate() kwargs (#315) 1 year ago
remote_sequential.py Support peft LoRA adapters (#335) 11 months ago
sequential_autograd.py Remove unused imports and attributes (#324) 1 year ago