JOURNAL ARTICLE

Quantitative method for modeling context in concatenative synthesis using large speech database

Abstract

Modeling phonetic context is one of the key points to get natural sounding in concatenativc speech synthesis. In this paper, a new quantitative method to model context is proposed. In the proposed method, the context is measured as the distance between leafs of the top-down likelihood-based decision trees that have been grown during the construction of acoustic inventory. Unlike other context modeling methods, this method allows the unit selection algorithm to borrow unit occurrences from other contexts when their context distances are close. This is done by incorporating the measured distance as an element in the unit selection cost function. The motivation behind this method is that it reduces the required speech modification by using better unit occurrences from near context. This method also makes it easy to use long synthesis units, e.g. syllables or words, in the same unit selection framework.

Keywords:
Computer science Context (archaeology) Context model Selection (genetic algorithm) Speech synthesis Speech recognition Key (lock) Database Artificial intelligence

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
4
Refs
0.08
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and dialogue systems
Physical Sciences →  Computer Science →  Artificial Intelligence
Phonetics and Phonology Research
Social Sciences →  Psychology →  Experimental and Cognitive Psychology

Related Documents

JOURNAL ARTICLE

Aerodynamic modeling for concatenative speech synthesis.

Kevin B. McGowan

Journal:   The Journal of the Acoustical Society of America Year: 2009 Vol: 126 (4_Supplement)Pages: 2222-2222
JOURNAL ARTICLE

Context-adaptive smoothing for concatenative speech synthesis

Ki-Seung LeeSang-Ryoung Kim

Journal:   IEEE Signal Processing Letters Year: 2002 Vol: 9 (12)Pages: 422-425
JOURNAL ARTICLE

Database size and naturalness in concatenative speech synthesis

H. Timothy BunnellJames T. MantellJames B. Polikoff

Journal:   The Journal of the Acoustical Society of America Year: 2006 Vol: 120 (5_Supplement)Pages: 3037-3037
© 2026 ScienceGate Book Chapters — All rights reserved.