You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
petals/etc/petals-inference.service

12 lines
271 B
INI

[Unit]
Description=Petals Inference
[Service]
#User=petals
#Group=petals
Environment=PJRT_DEVICE=TPU
ExecStart=/home/petals/venv/bin/python -m petals.cli.run_server --port 31330 petals-team/StableBeluga2 --device xla --num_blocks=4
[Install]
WantedBy=multi-user.target