JOURNAL ARTICLE

Modeling segmental duration for Turkish text-to-speech

Abstract

Text-to-speech (TTS) synthesis is the automatic conversion of sentences from their text form into speech waveforms. The most crucial problem confronting TTS systems is the generation of natural-sounding speech. To obtain natural-sounding synthetic speech, prosodic attributes such as pitch frequency, duration and intensity must be modeled appropriately. This paper summarizes efforts to obtain duration models for Turkish TTS systems using machine-learning algorithms. In natural speech, segment durations are highly correlated with context: identical or similar phones differ in energy, duration and fundamental frequency depending on the context in which they occur. To obtain natural speech through TTS, these context-dependent prosodic variations must be modeled. Different methods of modeling duration have been applied over the years. Here, two corpus-based statistical methods, linear regression and the C4.5 decision tree, are employed to model segment durations in Turkish.
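As a rough illustration of the linear-regression approach named in the abstract, the sketch below fits segment duration as a weighted sum of binary context features. The abstract does not list the features or the corpus used, so the feature names (`is_vowel`, `is_stressed`, `is_phrase_final`), the toy durations, and all function names are hypothetical; this is a minimal sketch under those assumptions, not the authors' model.

```python
# Toy segment records: (is_vowel, is_stressed, is_phrase_final, duration_ms).
# Feature names and durations are invented for illustration only; they are
# not taken from the paper or from any Turkish speech corpus.
SEGMENTS = [
    (0, 0, 0, 60.0),
    (1, 0, 0, 100.0),
    (0, 1, 0, 75.0),
    (1, 1, 0, 115.0),
    (0, 0, 1, 90.0),
    (1, 0, 1, 130.0),
    (0, 1, 1, 105.0),
    (1, 1, 1, 145.0),
]


def solve_linear_system(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        # Move the row with the largest entry in this column into pivot position.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def fit_duration_model(segments):
    """Least-squares fit of duration on context features plus an intercept."""
    rows = [[1.0] + [float(v) for v in seg[:-1]] for seg in segments]
    targets = [seg[-1] for seg in segments]
    k = len(rows[0])
    # Normal equations: (X^T X) w = X^T y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(rows, targets)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def predict_duration(weights, features):
    """Predicted duration in ms: intercept plus weighted context features."""
    return weights[0] + sum(w * f for w, f in zip(weights[1:], features))


weights = fit_duration_model(SEGMENTS)
print([round(w, 3) for w in weights])
print(round(predict_duration(weights, (1, 1, 0)), 3))
```

Each fitted weight can be read as the number of milliseconds that a context feature adds to a baseline duration. A decision tree such as C4.5 would instead partition the same feature space into regions, which lets it capture interactions between features that a purely additive model cannot.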

Keywords:
Duration (music); Computer science; Speech recognition; Speech synthesis; Context (archaeology); Turkish; Artificial intelligence; Fundamental frequency; Context model; Natural (archaeology); Natural language processing; Speech processing; Waveform; Acoustics; Linguistics; Telecommunications

Metrics

Cited By: 1
FWCI (Field Weighted Citation Impact): 0.00
Refs: 18
Citation Normalized Percentile: 0.15

Topics

Speech Recognition and Synthesis
Physical Sciences → Computer Science → Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences → Computer Science → Artificial Intelligence
Speech and Dialogue Systems
Physical Sciences → Computer Science → Artificial Intelligence
