petals/tests
Max Ryabinin 03cbe90234
Optimize LLaMA for inference (#513)
* Optimize LLaMa for inference
* Fix model type detection in tests
2023-11-14 20:14:19 +03:00
..
bootstrap.id Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) 2023-08-08 19:10:27 +04:00
conftest.py Fix logging: do not duplicate lines, enable colors in Colab (#156) 2022-12-15 09:12:18 +04:00
server2.id Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) 2023-08-08 19:10:27 +04:00
test_aux_functions.py Add customizable input tensors (#445) 2023-08-14 12:23:16 +04:00
test_block_exact_match.py Prioritize short inference, unmerge pools for long inference (#458) 2023-08-11 09:24:33 +04:00
test_cache.py Support macOS (#477) 2023-08-29 07:49:27 +04:00
test_chained_calls.py Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) 2023-08-08 19:10:27 +04:00
test_dtype.py Add LLaMA support (#323) 2023-06-23 15:46:10 +04:00
test_full_model.py Fix .generate(input_ids=...) (#485) 2023-08-30 06:59:33 +04:00
test_optimized_layers.py Optimize LLaMA for inference (#513) 2023-11-14 20:14:19 +03:00
test_peft.py Support peft LoRA adapters (#335) 2023-07-12 15:22:28 +03:00
test_priority_pool.py Support macOS (#477) 2023-08-29 07:49:27 +04:00
test_remote_sequential.py Fix .generate(input_ids=...) (#485) 2023-08-30 06:59:33 +04:00
test_sequence_manager.py Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) 2023-08-08 19:10:27 +04:00
test_server_stats.py Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) 2023-08-08 19:10:27 +04:00
test_tensor_parallel.py Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) 2023-08-08 19:10:27 +04:00
test_utils.py Support peft LoRA adapters (#335) 2023-07-12 15:22:28 +03:00