petals/tests
Alexander Borzunov 11f0d992d7
Report inference, forward, and network RPS separately (#358)
Inference RPS may be very different from forward RPS. E.g., currently bnb uses a completely different algorithm for NF4 inference. We report detailed RPS info that can then be used for shortest-path routing for inference.
2023-07-17 13:45:59 +04:00
scripts Fix arguments in remove_old_models.py (#153) 2022-12-13 19:01:12 +03:00
conftest.py Fix logging: do not duplicate lines, enable colors in Colab (#156) 2022-12-15 09:12:18 +04:00
test_aux_functions.py Report inference, forward, and network RPS separately (#358) 2023-07-17 13:45:59 +04:00
test_block_exact_match.py Add LLaMA support (#323) 2023-06-23 15:46:10 +04:00
test_chained_calls.py Add LLaMA support (#323) 2023-06-23 15:46:10 +04:00
test_dtype.py Add LLaMA support (#323) 2023-06-23 15:46:10 +04:00
test_full_model.py Support peft LoRA adapters (#335) 2023-07-12 15:22:28 +03:00
test_peft.py Support peft LoRA adapters (#335) 2023-07-12 15:22:28 +03:00
test_priority_pool.py Fix issues related to petals as a module (#159) 2022-12-16 09:09:06 +04:00
test_remote_sequential.py Support loading blocks in 4-bit (QLoRA NF4 format, disabled by default) (#333) 2023-07-03 20:13:04 +04:00
test_sequence_manager.py Test that bitsandbytes is not imported when it's not used (#351) 2023-07-14 18:40:47 +04:00
test_server_stats.py Add LLaMA support (#323) 2023-06-23 15:46:10 +04:00
test_tensor_parallel.py Add LLaMA support (#323) 2023-06-23 15:46:10 +04:00
test_utils.py Support peft LoRA adapters (#335) 2023-07-12 15:22:28 +03:00
test.id Add automated tests (#23) 2022-07-16 01:59:23 +03:00