The eSpeak NG is a compact open source software text-to-speech synthesizer for Linux, Windows, Android and other operating systems. It supports more than 100 languages and accents. It is based on the eSpeak engine created by Jonathan Duddington.
eSpeak NG uses a "formant synthesis" method. This allows many languages to be provided in a small size. The speech is clear, and can be used at high speeds, but is not as natural or smooth as larger synthesizers which are based on human speech recordings. It also supports Klatt formant synthesis, and the ability to use MBROLA as backend speech synthesizer.
- Includes different Voices, whose characteristics can be altered.
- Can produce speech output as a WAV file.
- SSML (Speech Synthesis Markup Language) is supported (not complete),
and also HTML.
- Compact size. The program and its data, including many languages,
totals about few Mbytes.
- Can be used as a front-end to MBROLA diphone voices.
eSpeak NG converts text to phonemes with pitch and length information.
- Can translate text into phoneme codes, so it could be adapted as a
front end for another speech synthesis engine.
- Potential for other languages. Several are included in varying stages
of progress. Help from native speakers for these or other languages is