JOURNAL ARTICLE

Segmental Duration Modeling for Greek Speech Synthesis

Abstract

In this paper we cope with the task of modeling phoneme duration for Greek speech synthesis. In particular we apply well established machine learning approaches to the WCL-1 prosodic database for predicting segmental durations from shallow morphosyntactic and prosodic features. We employ decision trees, instance based learning and linear regression. Trained on a 5500 word database, both CART and linear regression models proved to be the most effective in terms for the task with a root mean square error off 0. 0252 and 0.0251 respectively.

Keywords:
Computer science Duration (music) Task (project management) Regression Speech recognition Speech synthesis Artificial intelligence Natural language processing Decision tree Cart Linear regression Training set Mean squared error Word (group theory) Regression analysis Machine learning Statistics Linguistics Mathematics Engineering

Metrics

18
Cited By
3.49
FWCI (Field Weighted Citation Impact)
26
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Modeling segmental duration in German text-to-speech synthesis

Bernd MöbiusJan P. H. van Santen

Journal:   4th International Conference on Spoken Language Processing (ICSLP 1996) Year: 1996 Pages: 2395-2398
DISSERTATION

Controlling segmental duration in speech synthesis systems

Marcel Riedi

University:   ETH Zürich Research Collection Year: 1998 Vol: 026
© 2026 ScienceGate Book Chapters — All rights reserved.