petals/src/client
Alexander Borzunov ab41223b17
Fix dtype- and device-related client issues (#98)
This PR:

1. Makes the client's inference/forward/backward calls remember the dtype and device of the source tensors, then move/cast the outputs back to that dtype/device. This way:
    - Users don't need to change the code that launches `RemoteSequential` to run it on a different device.
    - `model.generate()` now supports both CPU and GPU.

2. Sets `low_cpu_mem_usage=True` by default and sets the client's request timeout to 20 sec.

3. Removes excess casts to float32 left in Dmitry's code.

4. (minor) Improves error messages.
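
The idea in item 1 can be illustrated with a minimal sketch. This is not the code from the PR: `restore_dtype_and_device` and `fake_remote_forward` are hypothetical names, and the "remote" call is simulated locally on CPU in float32. It only shows the pattern of recording the caller's dtype/device and casting the outputs back.

```python
import functools

import torch


def restore_dtype_and_device(fn):
    """Hypothetical decorator: return outputs in the caller's dtype and on its device."""

    @functools.wraps(fn)
    def wrapper(inputs: torch.Tensor, *args, **kwargs):
        # Remember where the source tensor lives and how it is typed.
        src_dtype, src_device = inputs.dtype, inputs.device
        outputs = fn(inputs, *args, **kwargs)
        # Move/cast the result back so the caller never sees a mismatch.
        return outputs.to(device=src_device, dtype=src_dtype)

    return wrapper


@restore_dtype_and_device
def fake_remote_forward(inputs: torch.Tensor) -> torch.Tensor:
    # Stand-in for a remote call (assumption): computes on CPU in float32.
    hidden = inputs.to(device="cpu", dtype=torch.float32)
    return hidden * 2.0


x = torch.ones(2, 3, dtype=torch.bfloat16)
y = fake_remote_forward(x)
print(y.dtype, y.device)
```

With a wrapper of this kind, the calling code stays the same whether `x` lives on CPU or GPU, which is the behavior item 1 describes for `RemoteSequential` and `model.generate()`.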
1 year ago
| File | Last commit | Updated |
|------|-------------|---------|
| `__init__.py` | Make inference, forward, and backward fully fault-tolerant (#91) | 1 year ago |
| `inference_session.py` | Fix dtype- and device-related client issues (#98) | 1 year ago |
| `remote_forward_backward.py` | Implement timeouts in forward/backward (#90) | 2 years ago |
| `remote_generation.py` | Add Beam Search decoding algorithm (#87) | 1 year ago |
| `remote_model.py` | Fix dtype- and device-related client issues (#98) | 1 year ago |
| `remote_sequential.py` | Fix dtype- and device-related client issues (#98) | 1 year ago |
| `sequence_manager.py` | Fix dtype- and device-related client issues (#98) | 1 year ago |
| `sequential_autograd.py` | Fix dtype- and device-related client issues (#98) | 1 year ago |
| `spending_policy.py` | Priority tasks (#47) | 2 years ago |