petals/cli
2022-06-20 15:33:17 +03:00
..
__init__.py add quantization script for cpu 2022-06-12 05:59:11 +03:00
config.json add minimalistic benchmarks 2022-06-14 15:18:11 +03:00
convert_model.py push config and tokenizer separately 2022-06-20 14:28:31 +03:00
inference_one_block.py black everything 2022-06-19 17:23:08 +03:00
run_server.py fetch a specific bloom block without downloading the entire model 2022-06-20 15:33:17 +03:00