.. |
bootstrap.id
|
Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)
|
2023-08-08 19:10:27 +04:00 |
conftest.py
|
Fix logging: do not duplicate lines, enable colors in Colab (#156)
|
2022-12-15 09:12:18 +04:00 |
server2.id
|
Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)
|
2023-08-08 19:10:27 +04:00 |
test_aux_functions.py
|
Add customizable input tensors (#445)
|
2023-08-14 12:23:16 +04:00 |
test_block_exact_match.py
|
Prioritize short inference, unmerge pools for long inference (#458)
|
2023-08-11 09:24:33 +04:00 |
test_cache.py
|
Support macOS (#477)
|
2023-08-29 07:49:27 +04:00 |
test_chained_calls.py
|
Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)
|
2023-08-08 19:10:27 +04:00 |
test_dtype.py
|
Add LLaMA support (#323)
|
2023-06-23 15:46:10 +04:00 |
test_full_model.py
|
Fix .generate(input_ids=...) (#485)
|
2023-08-30 06:59:33 +04:00 |
test_optimized_layers.py
|
Optimize LLaMA for inference (#513)
|
2023-11-14 20:14:19 +03:00 |
test_peft.py
|
Support peft LoRA adapters (#335)
|
2023-07-12 15:22:28 +03:00 |
test_priority_pool.py
|
Support macOS (#477)
|
2023-08-29 07:49:27 +04:00 |
test_remote_sequential.py
|
Fix .generate(input_ids=...) (#485)
|
2023-08-30 06:59:33 +04:00 |
test_sequence_manager.py
|
Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)
|
2023-08-08 19:10:27 +04:00 |
test_server_stats.py
|
Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)
|
2023-08-08 19:10:27 +04:00 |
test_tensor_parallel.py
|
Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)
|
2023-08-08 19:10:27 +04:00 |
test_utils.py
|
Support peft LoRA adapters (#335)
|
2023-07-12 15:22:28 +03:00 |