SoundGen - a text to speech generator

heading diagram

SounGen is a PowerShell script which converts text into sound files compatible with EdgeTX, OpenTX and Ethos. It runs on a PC without the need for external TTS services.

Most of the code for Sound Gen was generated using Claude AI from prompts by the author.

Requirements

Interactive or batch versions

There are two versions of the script

Voices

The scripts can use any of the voices listed in the Windows Speech settings (the RC-Soar templates use Zira).

Installation

To install the interactive or batch script, right click on the link (above) and save as a text file. Then follow the instructions at the head of each file.

To install FFmpeg go here; the 'essentials' package is sufficient. Make sure that the folder containing the .exe files are in your PATH environment variable.

Example outout .wav files

Click on a link to play:

Limitations

Though reliable in normal use, bear in mind that the code is AI generated. It may fall over if you provoke it for example by including non-speech characters like $,* etc. Also make sure to include the comma separator in the .CSV files.

Screenshots

Interactive version

Example phrase is 'don't forget to move the sticks'.

Comparison of trimmed and untrimmed silences

Upper: base file, untrimmed (play)
Lower: with silences trimmed and loudness boosted 4dB (play)