JOURNAL ARTICLE

Analysis of segmental duration for Thai speech synthesis

Abstract

This paper presents a study of the characteristics of Thai segmental duration and uses the results to construct a phone duration model for Thai speech synthesis. The study uses Hayashi's categorized linear regression model to analyze the effects of several factors: the current phoneme itself, the surrounding phonemes, phone position in the word, phone position in the phrase, part of speech, and Thai tone. These factors are combined to form a Thai phone duration model. The model achieves a fairly high correlation of 0.788. Although its RMS error of 33.14 ms is fairly high, an evaluation shows that the model is highly consistent on unseen data.
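A categorized linear regression of the kind named in the abstract can be read as an additive model: a predicted duration is a baseline plus one coefficient for the category that each factor takes. The sketch below illustrates that reading, together with the two figures of merit the abstract reports (correlation and RMS error). It is a minimal illustration under stated assumptions, not the paper's model: the baseline, the factor names, the categories, and every coefficient value are hypothetical placeholders, and only three of the six factors are shown.

```python
import math

# Hypothetical baseline and per-category coefficients in milliseconds.
# These numbers are illustrative placeholders, NOT values from the paper.
BASELINE_MS = 90.0
COEFFS = {
    "phoneme": {"a": 15.0, "aa": 60.0, "k": -20.0},
    "position_in_word": {"initial": -5.0, "final": 10.0},
    "tone": {"mid": 0.0, "falling": 8.0},
}


def predict_duration(features):
    """Additive categorical model: baseline plus one coefficient per factor."""
    total = BASELINE_MS
    for factor, category in features.items():
        total += COEFFS[factor][category]
    return total


def rms_error(predicted, observed):
    """Root-mean-square error between predicted and observed durations."""
    squared = [(p - o) ** 2 for p, o in zip(predicted, observed)]
    return math.sqrt(sum(squared) / len(squared))


def pearson(predicted, observed):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(observed)
    mean_p = sum(predicted) / n
    mean_o = sum(observed) / n
    cov = sum((p - mean_p) * (o - mean_o) for p, o in zip(predicted, observed))
    norm_p = math.sqrt(sum((p - mean_p) ** 2 for p in predicted))
    norm_o = math.sqrt(sum((o - mean_o) ** 2 for o in observed))
    return cov / (norm_p * norm_o)
```

In a real fit, the coefficients would be estimated by least squares over a labeled speech corpus, and the correlation and RMS error would be computed on held-out data to check consistency on unseen material.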

Keywords:
Duration (music), Phone, Phrase, Speech recognition, Consistency (knowledge bases), Construct (python library), Computer science, Word (group theory), Speech synthesis, Natural language processing, Linguistics, Artificial intelligence, Acoustics

Metrics

Cited By: 1
FWCI (Field Weighted Citation Impact): 2.11
Refs: 7
Citation Normalized Percentile: 0.89

Topics

Hermeneutics and Narrative Identity
Social Sciences →  Arts and Humanities →  Philosophy
Aging, Elder Care, and Social Issues
Health Sciences →  Health Professions →  General Health Professions
Health, Medicine and Society
Health Sciences →  Health Professions →  General Health Professions