petals

mirror of https://github.com/bigscience-workshop/petals synced 2024-10-31 09:20:41 +00:00

Max Ryabinin 03cbe90234 Optimize LLaMA for inference (#513): optimize LLaMA for inference; fix model type detection in tests	2023-11-14 20:14:19 +03:00
bootstrap.id	Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)	2023-08-08 19:10:27 +04:00
conftest.py	Fix logging: do not duplicate lines, enable colors in Colab (#156)	2022-12-15 09:12:18 +04:00
server2.id	Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)	2023-08-08 19:10:27 +04:00
test_aux_functions.py	Add customizable input tensors (#445)	2023-08-14 12:23:16 +04:00
test_block_exact_match.py	Prioritize short inference, unmerge pools for long inference (#458)	2023-08-11 09:24:33 +04:00
test_cache.py	Support macOS (#477)	2023-08-29 07:49:27 +04:00
test_chained_calls.py	Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)	2023-08-08 19:10:27 +04:00
test_dtype.py	Add LLaMA support (#323)	2023-06-23 15:46:10 +04:00
test_full_model.py	Fix `.generate(input_ids=...)` (#485)	2023-08-30 06:59:33 +04:00
test_optimized_layers.py	Optimize LLaMA for inference (#513)	2023-11-14 20:14:19 +03:00
test_peft.py	Support peft LoRA adapters (#335)	2023-07-12 15:22:28 +03:00
test_priority_pool.py	Support macOS (#477)	2023-08-29 07:49:27 +04:00
test_remote_sequential.py	Fix `.generate(input_ids=...)` (#485)	2023-08-30 06:59:33 +04:00
test_sequence_manager.py	Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)	2023-08-08 19:10:27 +04:00
test_server_stats.py	Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)	2023-08-08 19:10:27 +04:00
test_tensor_parallel.py	Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)	2023-08-08 19:10:27 +04:00
test_utils.py	Support peft LoRA adapters (#335)	2023-07-12 15:22:28 +03:00