![Piper logo](etc/logo.png)

A fast, local neural text to speech system that sounds great and is optimized for the Raspberry Pi 4.

Piper is used in a [variety of projects](#people-using-piper).
``` sh
echo 'Welcome to the world of speech synthesis!' | \
./piper --model en-us-blizzard_lessac-medium.onnx --output_file welcome.wav
```
[Listen to voice samples](https://rhasspy.github.io/piper-samples) and check out a [video tutorial by Thorsten Müller](https://youtu.be/rjq5eZoWWSo).

[![Sponsored by Nabu Casa](etc/nabu_casa_sponsored.png)](https://nabucasa.com)

Voices are trained with [VITS](https://github.com/jaywalnut310/vits/) and exported for use with [onnxruntime](https://onnxruntime.ai/).

Our goal is to support Home Assistant and the [Year of Voice](https://www.home-assistant.io/blog/2022/12/20/year-of-voice/).

## Voices
[Download voices](https://huggingface.co/rhasspy/piper-voices/tree/main) for the supported languages:
* Catalan (ca_ES)
* Danish (da_DK)
* German (de_DE)
* English (en_GB, en_US)
* Spanish (es_ES, es_MX)
* Finnish (fi_FI)
* French (fr_FR)
* Greek (el_GR)
* Icelandic (is_IS)
* Italian (it_IT)
* Georgian (ka_GE)
* Kazakh (kk_KZ)
* Nepali (ne_NP)
* Dutch (nl_BE, nl_NL)
* Norwegian (no_NO)
* Polish (pl_PL)
* Portuguese (pt_BR)
* Russian (ru_RU)
* Swedish (sv_SE)
* Swahili (sw_CD)
* Ukrainian (uk_UA)
* Vietnamese (vi_VN)
* Chinese (zh_CN)
## Installation
Download a release:
* [amd64](https://github.com/rhasspy/piper/releases/download/v1.0.0/piper_amd64.tar.gz) (64-bit desktop Linux)
* [arm64](https://github.com/rhasspy/piper/releases/download/v1.0.0/piper_arm64.tar.gz) (64-bit Raspberry Pi 4)
* [armv7](https://github.com/rhasspy/piper/releases/download/v1.0.0/piper_armv7.tar.gz) (32-bit Raspberry Pi 3/4)
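For example, to download and unpack the amd64 build from the link above (the archive is assumed to unpack into a top-level `piper/` directory):

``` sh
# Download and extract the 64-bit desktop Linux release
curl -LO 'https://github.com/rhasspy/piper/releases/download/v1.0.0/piper_amd64.tar.gz'
tar -xzf piper_amd64.tar.gz
cd piper
```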
If you want to build from source, see the [Makefile](Makefile) and [C++ source](src/cpp).
You must download and extract [piper-phonemize](https://github.com/rhasspy/piper-phonemize) to `lib/Linux-$(uname -m)/piper_phonemize` before building.
For example, `lib/Linux-x86_64/piper_phonemize/lib/libpiper_phonemize.so` should exist for AMD/Intel machines (as well as everything else from `libpiper_phonemize-amd64.tar.gz`).
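A minimal sketch of that setup on an x86_64 machine (the piper-phonemize release URL and archive layout below are assumptions; check its releases page for the actual asset name):

``` sh
# Place piper-phonemize where the Makefile expects it
mkdir -p "lib/Linux-$(uname -m)"
# Release tag and asset name are assumptions; adjust to the current release
curl -LO 'https://github.com/rhasspy/piper-phonemize/releases/download/v1.0.0/libpiper_phonemize-amd64.tar.gz'
# Assumes the archive contains a top-level piper_phonemize/ directory
tar -xzf libpiper_phonemize-amd64.tar.gz -C "lib/Linux-$(uname -m)"
make
```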
## Usage
1. [Download a voice](#voices) and extract the `.onnx` and `.onnx.json` files
2. Run the `piper` binary with text on standard input, `--model /path/to/your-voice.onnx`, and `--output_file output.wav`
For example:
``` sh
echo 'Welcome to the world of speech synthesis!' | \
./piper --model en-us-lessac-medium.onnx --output_file welcome.wav
```
For multi-speaker models, use `--speaker <number>` to change speakers (default: 0).
See `piper --help` for more options.
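For instance, to synthesize with the second speaker (id 1) of a multi-speaker voice (the voice file name below is just a placeholder):

``` sh
echo 'Hello from the second speaker.' | \
./piper --model multi-speaker-voice.onnx --speaker 1 --output_file hello.wav
```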
### JSON Input
The `piper` executable can accept JSON input when using the `--json-input` flag. Each line of input must be a JSON object with a `text` field. For example:
``` json
{ "text": "First sentence to speak." }
{ "text": "Second sentence to speak." }
```
Optional fields include:

* `speaker` - string
    * Name of the speaker to use from `speaker_id_map` in config (multi-speaker voices only)
* `speaker_id` - number
    * Id of the speaker to use, from 0 to the number of speakers - 1 (multi-speaker voices only; overrides `speaker`)
* `output_file` - string
    * Path to the output WAV file

The following example writes two sentences with different speakers to different files:
``` json
{ "text": "First speaker.", "speaker_id": 0, "output_file": "/tmp/speaker_0.wav" }
{ "text": "Second speaker.", "speaker_id": 1, "output_file": "/tmp/speaker_1.wav" }
```
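A batch like this can be piped straight into `piper` with `--json-input`; the `sentences.jsonl` file and voice name below are placeholders:

``` sh
# Each line of sentences.jsonl is a JSON object as shown above
cat sentences.jsonl | \
./piper --model multi-speaker-voice.onnx --json-input
```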
## People using Piper
Piper has been used in the following projects/papers:
* [Home Assistant](https://github.com/home-assistant/addons/blob/master/piper/README.md)
* [Rhasspy 3](https://github.com/rhasspy/rhasspy3/)
* [NVDA - NonVisual Desktop Access](https://www.nvaccess.org/post/in-process-8th-may-2023/#voices)
* [Image Captioning for the Visually Impaired and Blind: A Recipe for Low-Resource Languages](https://www.techrxiv.org/articles/preprint/Image_Captioning_for_the_Visually_Impaired_and_Blind_A_Recipe_for_Low-Resource_Languages/22133894)
* [Open Voice Operating System](https://github.com/OpenVoiceOS/ovos-tts-plugin-piper)
* [JetsonGPT](https://github.com/shahizat/jetsonGPT)
## Training
See the [training guide](TRAINING.md) and the [source code](src/python).
Pretrained checkpoints are available on [Hugging Face](https://huggingface.co/datasets/rhasspy/piper-checkpoints/tree/main).
## Running in Python
See [src/python_run](src/python_run).
Run `scripts/setup.sh` to create a virtual environment and install the requirements. Then run:
``` sh
echo 'Welcome to the world of speech synthesis!' | scripts/piper \
--model /path/to/voice.onnx \
--output_file welcome.wav
```
If you'd like to use a GPU, install the `onnxruntime-gpu` package:
``` sh
.venv/bin/pip3 install onnxruntime-gpu
```
and then run `scripts/piper` with the `--cuda` argument. You will need to have a functioning CUDA environment, such as what's available in [NVIDIA's PyTorch containers](https://catalog.ngc.nvidia.com/orgs/nvidia/containers/pytorch).
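Putting it together, GPU synthesis might look like this (the model path is a placeholder):

``` sh
echo 'This sentence is synthesized on the GPU.' | scripts/piper \
  --model /path/to/voice.onnx \
  --output_file welcome.wav \
  --cuda
```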