Analysis of segmental duration for Thai speech synthesis

Chatchawarn Hansakunbuntheung; Yoshinori Sagisaka

doi:10.21437/speechprosody.2004-110

Year: 2004 Pages: 479-482

Abstract

This paper presents a study of the characteristics of Thai segmental duration and applies the results to construct a Thai phone duration model for Thai speech synthesis. The study uses Hayashi's categorized linear regression model to analyze the effects of several factors: the current phoneme itself, the surrounding phonemes, phone position in the word, phone position in the phrase, part of speech, and Thai tone. These factors are combined to form a Thai phone duration model. The model achieves a fairly high correlation of 0.788. Although it has a fairly high RMS error of 33.14 ms, an evaluation shows that the model is highly consistent on unknown data.
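As a rough illustration of the kind of model the abstract describes, the sketch below predicts a duration as a global mean plus one offset per category of each factor, then scores the fit with the same two measures the abstract reports (correlation and RMS error). Everything specific here is an assumption made for illustration and does not come from the paper: the toy durations, the two factors (phoneme and position in word), and the names fit_additive, predict, rmse and pearson. The fitting procedure is plain backfitting, chosen so the example needs only the Python standard library; it is not claimed to be the solver the authors used.

```python
from collections import defaultdict
from math import sqrt


def fit_additive(samples, durations, n_iter=50):
    """Fit duration ~ mean + sum of per-factor category offsets (backfitting)."""
    mean = sum(durations) / len(durations)
    n_factors = len(samples[0])
    offsets = [defaultdict(float) for _ in range(n_factors)]
    for _ in range(n_iter):
        for f in range(n_factors):
            sums, counts = defaultdict(float), defaultdict(int)
            for x, y in zip(samples, durations):
                # Residual after removing the mean and all other factors' offsets.
                others = sum(offsets[g][x[g]] for g in range(n_factors) if g != f)
                sums[x[f]] += y - mean - others
                counts[x[f]] += 1
            # Each category's offset becomes the average residual it explains.
            for cat in sums:
                offsets[f][cat] = sums[cat] / counts[cat]
    return mean, offsets


def predict(model, x):
    """Predict a duration; unseen categories contribute an offset of zero."""
    mean, offsets = model
    return mean + sum(offsets[f].get(x[f], 0.0) for f in range(len(x)))


def rmse(pred, true):
    return sqrt(sum((p - t) ** 2 for p, t in zip(pred, true)) / len(true))


def pearson(a, b):
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / sqrt(va * vb)


# Invented toy data: (phoneme, position in word) -> duration in ms.
samples = [("a", "initial"), ("a", "final"), ("k", "initial"), ("k", "final")]
durations = [90.0, 110.0, 50.0, 70.0]

model = fit_additive(samples, durations)
preds = [predict(model, x) for x in samples]
print("correlation:", pearson(preds, durations))
print("RMS error (ms):", rmse(preds, durations))
```

The toy durations are exactly additive, so the fit is essentially perfect; real speech data would leave residual error, which is what figures such as the reported correlation and RMS error quantify.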

Keywords:

Duration (music); Phone; Phrase; Speech recognition; Consistency (knowledge bases); Construct (python library); Computer science; Word (group theory); Speech synthesis; Natural language processing; Linguistics; Artificial intelligence; Acoustics
