Published Jun 02, 2026
TTS Concepts
Pipeline and Architecture
Text and Pronunciation
Voice and Quality
Signal Processing
- Mel-Spectrograms — The frequency representation used by neural TTS models.
- Vocoder — Converting acoustic features into audio waveforms.
Generation and Performance