petals/tests/test_priority_pool.py

import multiprocessing as mp
import platform
import time

import pytest
import torch
from hivemind.moe.server.runtime import Runtime

from petals.server.task_pool import PrioritizedTaskPool


def _submit_tasks(runtime_ready, pools, results_valid):
    runtime_ready.wait()

    futures = []
    futures.append(pools[0].submit_task(torch.tensor([0]), priority=1))
    futures.append(pools[0].submit_task(torch.tensor([1]), priority=1))
    time.sleep(0.01)
    futures.append(pools[1].submit_task(torch.tensor([2]), priority=1))
    futures.append(pools[0].submit_task(torch.tensor([3]), priority=2))
    futures.append(pools[0].submit_task(torch.tensor([4]), priority=10))
    futures.append(pools[0].submit_task(torch.tensor([5]), priority=0))
    futures.append(pools[0].submit_task(torch.tensor([6]), priority=1))
    futures.append(pools[1].submit_task(torch.tensor([7]), priority=11))
    futures.append(pools[1].submit_task(torch.tensor([8]), priority=1))
    for i, f in enumerate(futures):
        assert f.result()[0].item() == i**2
    results_valid.set()


@pytest.mark.skipif(platform.system() == "Darwin", reason="Flapping on macOS due to multiprocessing quirks")
@pytest.mark.forked
def test_priority_pools():
    outputs_queue = mp.SimpleQueue()
    runtime_ready = mp.Event()
    results_valid = mp.Event()

    def dummy_pool_func(args, kwargs):
        (x,) = args  # TODO modify the PriorityPool code such that dummy_pool_func can accept x directly
        time.sleep(0.1)
        y = x**2
        outputs_queue.put((x, y))
        return (y,)

    class DummyBackend:
        def __init__(self, pools):
            self.pools = pools

        def get_pools(self):
            return self.pools

    pools = (
        PrioritizedTaskPool(dummy_pool_func, name="A", max_batch_size=1),
        PrioritizedTaskPool(dummy_pool_func, name="B", max_batch_size=1),
    )

    # Simulate requests coming from ConnectionHandlers
    proc = mp.context.ForkProcess(target=_submit_tasks, args=(runtime_ready, pools, results_valid))
    proc.start()

    runtime = Runtime({str(i): DummyBackend([pool]) for i, pool in enumerate(pools)}, prefetch_batches=0)
    runtime.ready = runtime_ready
    runtime.start()

    proc.join()
    assert results_valid.is_set()

    ordered_outputs = []
    while not outputs_queue.empty():
        ordered_outputs.append(outputs_queue.get()[0].item())

    assert ordered_outputs == [0, 5, 1, 2, 6, 8, 3, 4, 7]
    #                          0 - first batch is loaded immediately, before everything else
    #                             5 - highest priority task overall
    #                                1 - first of several tasks with equal lowest priority (1)
    #                                   2 - second earliest task with priority 1, fetched from pool B
    #                                      6 - third earliest task with priority 1, fetched from pool A again
    #                                         8 - last priority-1 task, pool B
    #                                            3 - task with priority 2 from pool A
    #                                               4 - task with priority 10 from pool A
    #                                                  7 - task with priority 11 from pool B

    runtime.shutdown()
Priority tasks (#47) * priority in handlers and backend pools * simple points system on server side * priortize task in handler before submit task * fix tests * s/expert/block/g Co-authored-by: justheuristic <justheuristic@gmail.com> 2022-09-10 19:24:42 +00:00			`import multiprocessing as mp`
Support macOS (#477) This PR makes both clients and servers work on macOS. Specifically, it: - Follows https://github.com/learning-at-home/hivemind/pull/586 to run a macOS-compatible `p2pd` binary (both x86-64 and ARM64 are supported) - Fixes forking issues and tests on macOS, Python 3.10+ - Introduces basic support for serving model blocks on Apple M1/M2 GPUs (torch.mps) - Increases max number of open files by default (it's not enough on Linux and is really small on macOS) 2023-08-29 03:49:27 +00:00			`import platform`
Priority tasks (#47) * priority in handlers and backend pools * simple points system on server side * priortize task in handler before submit task * fix tests * s/expert/block/g Co-authored-by: justheuristic <justheuristic@gmail.com> 2022-09-10 19:24:42 +00:00			`import time`

			`import pytest`
			`import torch`
Fix issues related to `petals` as a module (#159) 1. Added `from petals.client import *` to `petals/__init__.py`, so you can write just that: ```python from petals import DistributedBloomForCausalLM ``` I didn't do the same with server, since its classes are supposed to by used by `petals.cli.run_server`, not end-users. Though it's still possible to do `from petals.server.smth import smth` if necessary. 2. Fixed one more logging issue: log lines from hivemind were shown twice due to a bug in #156. 3. Removed unused `runtime.py`, since the server actually uses `hivemind.moe.Runtime`, and `runtime.py` has no significant changes comparing to it. 2022-12-16 05:09:06 +00:00			`from hivemind.moe.server.runtime import Runtime`
Priority tasks (#47) * priority in handlers and backend pools * simple points system on server side * priortize task in handler before submit task * fix tests * s/expert/block/g Co-authored-by: justheuristic <justheuristic@gmail.com> 2022-09-10 19:24:42 +00:00
Make Petals a pip-installable package (attempt 2) (#102) 1. Petals can be now installed using `pip install git+https://github.com/bigscience-workshop/petals` - In case if you already cloned the repo, you can do `pip install .` or `pip install .[dev]` 2. Moved `src` => `src/petals` - Replaced `from src.smth import smth` with `from petals.smth import smth` 3. Moved `cli` => `src/petals/cli` - Replaced `python -m cli.run_smth` with `python -m petals.cli.run_smth` (all utilities are now available right after pip installation) 4. Moved the `requirements*.txt` contents to `setup.cfg` (`requirements.txt` for packages is not supported well by modern packaging utils) 5. Increased the package version from `0.2` to `1.0alpha1` 2022-11-30 06:41:13 +00:00			`from petals.server.task_pool import PrioritizedTaskPool`
Priority tasks (#47) * priority in handlers and backend pools * simple points system on server side * priortize task in handler before submit task * fix tests * s/expert/block/g Co-authored-by: justheuristic <justheuristic@gmail.com> 2022-09-10 19:24:42 +00:00

Support macOS (#477) This PR makes both clients and servers work on macOS. Specifically, it: - Follows https://github.com/learning-at-home/hivemind/pull/586 to run a macOS-compatible `p2pd` binary (both x86-64 and ARM64 are supported) - Fixes forking issues and tests on macOS, Python 3.10+ - Introduces basic support for serving model blocks on Apple M1/M2 GPUs (torch.mps) - Increases max number of open files by default (it's not enough on Linux and is really small on macOS) 2023-08-29 03:49:27 +00:00			`def _submit_tasks(runtime_ready, pools, results_valid):`
			`runtime_ready.wait()`

			`futures = []`
			`futures.append(pools[0].submit_task(torch.tensor([0]), priority=1))`
			`futures.append(pools[0].submit_task(torch.tensor([1]), priority=1))`
			`time.sleep(0.01)`
			`futures.append(pools[1].submit_task(torch.tensor([2]), priority=1))`
			`futures.append(pools[0].submit_task(torch.tensor([3]), priority=2))`
			`futures.append(pools[0].submit_task(torch.tensor([4]), priority=10))`
			`futures.append(pools[0].submit_task(torch.tensor([5]), priority=0))`
			`futures.append(pools[0].submit_task(torch.tensor([6]), priority=1))`
			`futures.append(pools[1].submit_task(torch.tensor([7]), priority=11))`
			`futures.append(pools[1].submit_task(torch.tensor([8]), priority=1))`
			`for i, f in enumerate(futures):`
			`assert f.result()[0].item() == i**2`
			`results_valid.set()`


			`@pytest.mark.skipif(platform.system() == "Darwin", reason="Flapping on macOS due to multiprocessing quirks")`
Priority tasks (#47) * priority in handlers and backend pools * simple points system on server side * priortize task in handler before submit task * fix tests * s/expert/block/g Co-authored-by: justheuristic <justheuristic@gmail.com> 2022-09-10 19:24:42 +00:00			`@pytest.mark.forked`
			`def test_priority_pools():`
			`outputs_queue = mp.SimpleQueue()`
Support macOS (#477) This PR makes both clients and servers work on macOS. Specifically, it: - Follows https://github.com/learning-at-home/hivemind/pull/586 to run a macOS-compatible `p2pd` binary (both x86-64 and ARM64 are supported) - Fixes forking issues and tests on macOS, Python 3.10+ - Introduces basic support for serving model blocks on Apple M1/M2 GPUs (torch.mps) - Increases max number of open files by default (it's not enough on Linux and is really small on macOS) 2023-08-29 03:49:27 +00:00			`runtime_ready = mp.Event()`
Priority tasks (#47) * priority in handlers and backend pools * simple points system on server side * priortize task in handler before submit task * fix tests * s/expert/block/g Co-authored-by: justheuristic <justheuristic@gmail.com> 2022-09-10 19:24:42 +00:00			`results_valid = mp.Event()`

priority pool 2023-08-17 01:41:18 +00:00			`def dummy_pool_func(args, kwargs):`
			`(x,) = args # TODO modify the PriorityPool code such that dummy_pool_func can accept x directly`
Priority tasks (#47) * priority in handlers and backend pools * simple points system on server side * priortize task in handler before submit task * fix tests * s/expert/block/g Co-authored-by: justheuristic <justheuristic@gmail.com> 2022-09-10 19:24:42 +00:00			`time.sleep(0.1)`
			`y = x**2`
			`outputs_queue.put((x, y))`
			`return (y,)`

			`class DummyBackend:`
			`def __init__(self, pools):`
			`self.pools = pools`

			`def get_pools(self):`
			`return self.pools`

			`pools = (`
			`PrioritizedTaskPool(dummy_pool_func, name="A", max_batch_size=1),`
			`PrioritizedTaskPool(dummy_pool_func, name="B", max_batch_size=1),`
			`)`

Support macOS (#477) This PR makes both clients and servers work on macOS. Specifically, it: - Follows https://github.com/learning-at-home/hivemind/pull/586 to run a macOS-compatible `p2pd` binary (both x86-64 and ARM64 are supported) - Fixes forking issues and tests on macOS, Python 3.10+ - Introduces basic support for serving model blocks on Apple M1/M2 GPUs (torch.mps) - Increases max number of open files by default (it's not enough on Linux and is really small on macOS) 2023-08-29 03:49:27 +00:00			`# Simulate requests coming from ConnectionHandlers`
			`proc = mp.context.ForkProcess(target=_submit_tasks, args=(runtime_ready, pools, results_valid))`
			`proc.start()`

Priority tasks (#47) * priority in handlers and backend pools * simple points system on server side * priortize task in handler before submit task * fix tests * s/expert/block/g Co-authored-by: justheuristic <justheuristic@gmail.com> 2022-09-10 19:24:42 +00:00			`runtime = Runtime({str(i): DummyBackend([pool]) for i, pool in enumerate(pools)}, prefetch_batches=0)`
Support macOS (#477) This PR makes both clients and servers work on macOS. Specifically, it: - Follows https://github.com/learning-at-home/hivemind/pull/586 to run a macOS-compatible `p2pd` binary (both x86-64 and ARM64 are supported) - Fixes forking issues and tests on macOS, Python 3.10+ - Introduces basic support for serving model blocks on Apple M1/M2 GPUs (torch.mps) - Increases max number of open files by default (it's not enough on Linux and is really small on macOS) 2023-08-29 03:49:27 +00:00			`runtime.ready = runtime_ready`
Priority tasks (#47) * priority in handlers and backend pools * simple points system on server side * priortize task in handler before submit task * fix tests * s/expert/block/g Co-authored-by: justheuristic <justheuristic@gmail.com> 2022-09-10 19:24:42 +00:00			`runtime.start()`

			`proc.join()`
			`assert results_valid.is_set()`

			`ordered_outputs = []`
			`while not outputs_queue.empty():`
			`ordered_outputs.append(outputs_queue.get()[0].item())`

			`assert ordered_outputs == [0, 5, 1, 2, 6, 8, 3, 4, 7]`
			`# 0 - first batch is loaded immediately, before everything else`
			`# 5 - highest priority task overall`
			`# 1 - first of several tasks with equal lowest priority (1)`
			`# 2 - second earliest task with priority 1, fetched from pool B`
			`# 6 - third earliest task with priority 1, fetched from pool A again`
			`# 8 - last priority-1 task, pool B`
			`# 3 - task with priority 2 from pool A`
			`# 4 - task with priority 10 from pool A`
			`# 7 - task with priority 11 from pool B`
Support macOS (#477) This PR makes both clients and servers work on macOS. Specifically, it: - Follows https://github.com/learning-at-home/hivemind/pull/586 to run a macOS-compatible `p2pd` binary (both x86-64 and ARM64 are supported) - Fixes forking issues and tests on macOS, Python 3.10+ - Introduces basic support for serving model blocks on Apple M1/M2 GPUs (torch.mps) - Increases max number of open files by default (it's not enough on Linux and is really small on macOS) 2023-08-29 03:49:27 +00:00
			`runtime.shutdown()`