petals/cli
2022-06-20 16:50:22 +03:00
Name                    Last commit                                                        Date
__init__.py             add quantization script for cpu                                    2022-06-12 05:59:11 +03:00
config.json             add minimalistic benchmarks                                        2022-06-14 15:18:11 +03:00
convert_model.py        black-isort                                                        2022-06-20 16:50:22 +03:00
inference_one_block.py  black everything                                                   2022-06-19 17:23:08 +03:00
run_server.py           fetch a specific bloom block without downloading the entire model  2022-06-20 15:33:17 +03:00