petals/cli
2022-06-20 10:07:42 -07:00
..
__init__.py add quantization script for cpu 2022-06-12 05:59:11 +03:00
config.json add minimalistic benchmarks 2022-06-14 15:18:11 +03:00
convert_model.py black-isort 2022-06-20 16:50:22 +03:00
inference_one_block.py Loading a bloom block working. 2022-06-20 10:07:42 -07:00
run_server.py fetch a specific bloom block without downloading the entire model 2022-06-20 15:33:17 +03:00