BOOK

Text-to-Speech Synthesis

Abstract

Text-to-speech (TTS) synthesis is the art of designing talking machines. This article introduces state-of-the-art TTS synthesis systems and presents both the natural language processing and the digital signal processing problems involved. It begins with a brief user-oriented description of a general TTS system and comments on its commercial applications. It then gives a functional diagram of a modern TTS system, highlights its components, and describes the morphosyntactic module. Next, it examines why sentence-level phonetization cannot be achieved by a sequence of dictionary look-ups and describes possible implementations of the phonetizer. The article then describes prosody generation, outlining how intonation and duration can be approximately computed from text; prosody refers to properties of the speech signal related to audible changes in pitch, loudness, and syllable length. Finally, the article introduces the two main categories of techniques for waveform generation: synthesis by rule and concatenative synthesis.

Keywords:
Speech synthesis; Prosody; Computer science; Speech recognition; Intonation (linguistics); Syllable; Sentence; Natural language processing; Waveform; Artificial intelligence; Linguistics
