petals

mirror of https://github.com/bigscience-workshop/petals synced 2024-10-31 09:20:41 +00:00

justheuristic 1ab5fb1630 fetch a specific bloom block without downloading the entire model	2022-06-20 15:33:17 +03:00
__init__.py	add quantization script for cpu	2022-06-12 05:59:11 +03:00
config.json	add minimalistic benchmarks	2022-06-14 15:18:11 +03:00
convert_model.py	push config and tokenizer separately	2022-06-20 14:28:31 +03:00
inference_one_block.py	black everything	2022-06-19 17:23:08 +03:00
run_server.py	fetch a specific bloom block without downloading the entire model	2022-06-20 15:33:17 +03:00