JOURNAL ARTICLE

Aerodynamic modeling for concatenative speech synthesis.

Kevin B. McGowan

Year: 2009 Journal:   The Journal of the Acoustical Society of America Vol: 126 (4_Supplement)Pages: 2222-2222   Publisher: Acoustical Society of America

Abstract

Listeners can perceive and use a wide array of fine-grained phonetic details, including the detailed coarticulatory influences of adjacent sounds, when perceiving speech. Details like anticipatory nasalization in can, for example, potentially provide the listener with a rich network of informative cues and are a key to understanding listeners’ ability to disambiguate speech sounds from seemingly ambiguous input. Unfortunately, these coarticulatory cues are generally missing or contradictory in the output of speech synthesis systems. These systems work by concatenating variable-length sound units chosen from a large database of recorded speech. Units are chosen to minimize two functions: the cost of aligning a particular unit with the desired speech output (target cost) and the cost of adjoining the next sound to the most recently selected unit (join cost). Generally, these costs are calculated using features which can be automatically extracted from the acoustic speech signal. A unit selection database is created, automatically segmented and automatically labeled with nasal and oral airflow feature vectors. These aerodynamic features are used as a proxy for articulatory information in the calculation of join and cost functions. Listeners’ mean opinion scores are obtained on output from this system and a baseline acoustic system for comparison.

Keywords:
Computer science Speech recognition Speech synthesis Feature (linguistics)

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.07
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Phonetics and Phonology Research
Social Sciences →  Psychology →  Experimental and Cognitive Psychology

Related Documents

JOURNAL ARTICLE

Feature-domain concatenative speech synthesis

Dan ChazanRon Hoory

Journal:   The Journal of the Acoustical Society of America Year: 2007 Vol: 122 (1)Pages: 31-31
JOURNAL ARTICLE

Multi-lingual concatenative speech synthesis

Nick Campbell

Year: 1998 Pages: paper 0024-0
JOURNAL ARTICLE

Limitations to concatenative speech synthesis

Nick Campbell

Year: 2000 Pages: vol. 3, 416-419
JOURNAL ARTICLE

High Quality Arabic Concatenative Speech Synthesis

Abdelkader Chabchoub

Journal:   Signal & Image Processing An International Journal Year: 2011 Vol: 2 (4)Pages: 27-36
© 2026 ScienceGate Book Chapters — All rights reserved.