Segmental Duration Modeling for Greek Speech Synthesis

Alexandros Lazaridis; Panagiotis Zervas; G. Kokkinakis

doi:10.1109/ictai.2007.33

ScienceGate Book Chapters

JOURNAL ARTICLE

Segmental Duration Modeling for Greek Speech Synthesis

Alexandros Lazaridis Panagiotis Zervas G. Kokkinakis

Year: 2007 Vol: 19 Pages: 518-521

DOI: 10.1109/ictai.2007.33

Get Full-Text PDF Get Analytical Report

Abstract

In this paper we cope with the task of modeling phoneme duration for Greek speech synthesis. In particular we apply well established machine learning approaches to the WCL-1 prosodic database for predicting segmental durations from shallow morphosyntactic and prosodic features. We employ decision trees, instance based learning and linear regression. Trained on a 5500 word database, both CART and linear regression models proved to be the most effective in terms for the task with a root mean square error off 0. 0252 and 0.0251 respectively.

Keywords:

Computer science Duration (music) Task (project management) Regression Speech recognition Speech synthesis Artificial intelligence Natural language processing Decision tree Cart Linear regression Training set Mean squared error Word (group theory) Regression analysis Machine learning Statistics Linguistics Mathematics Engineering

Metrics

Cited By

3.49

FWCI (Field Weighted Citation Impact)

Refs

0.93

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Segmental Duration Modeling for Greek Speech Synthesis

Abstract

Metrics

Citation History

Topics

Related Documents

Modeling segmental duration in German text-to-speech synthesis

Modeling segmental duration in German text-to-speech synthesis

Controlling segmental duration in speech synthesis systems

Modeling segmental duration for turkish text-to-speech

Analysis of segmental duration for Thai speech synthesis