bootstrap.id | Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) | 2023-08-08 19:10:27 +04:00
conftest.py | Fix logging: do not duplicate lines, enable colors in Colab (#156) | 2022-12-15 09:12:18 +04:00
server2.id | Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) | 2023-08-08 19:10:27 +04:00
test_aux_functions.py | Add customizable input tensors (#445) | 2023-08-14 12:23:16 +04:00
test_block_exact_match.py | Prioritize short inference, unmerge pools for long inference (#458) | 2023-08-11 09:24:33 +04:00
test_chained_calls.py | Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) | 2023-08-08 19:10:27 +04:00
test_dtype.py | Add LLaMA support (#323) | 2023-06-23 15:46:10 +04:00
test_full_model.py | Make client compatible with transformers' GenerationMixin (#464) | 2023-08-20 19:18:36 +04:00
test_peft.py | Support peft LoRA adapters (#335) | 2023-07-12 15:22:28 +03:00
test_priority_pool.py | priority pool | 2023-08-17 04:41:18 +03:00
test_remote_sequential.py | Add blocked_servers argument (#462) | 2023-08-14 10:41:13 +04:00
test_sequence_manager.py | Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) | 2023-08-08 19:10:27 +04:00
test_server_stats.py | WIP, switching to another PR | 2023-08-28 06:03:33 +03:00
test_tensor_parallel.py | Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) | 2023-08-08 19:10:27 +04:00
test_utils.py | Support peft LoRA adapters (#335) | 2023-07-12 15:22:28 +03:00