Abstract

Duration modeling is a key task for every parametric speech synthesis system. Although such parametric systems have been adapted to many languages, no special attention has been paid to explicitly handling the characteristics of Arabic speech. In Arabic, phoneme duration plays a distinctive role because of consonant gemination and vowel quantity, so precise modeling of sound durations is critical. In this paper we compare several approaches to modeling phoneme durations (including duration modeling with the HTS and MERLIN toolkits), and we propose a new approach that relies on a set of models, each one being optimal for a given phoneme class (e.g., simple consonants, geminated consonants, short vowels, and long vowels). An objective evaluation carried out on a set of test sentences shows that the proposed approach leads to more accurate modeling of phoneme durations.
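The idea of keeping one best model per phoneme class can be illustrated with a minimal sketch. The abstract does not state the selection criterion, the candidate models, or the data format, so everything below is an assumption made for illustration: the model names `model_a` and `model_b`, the class labels, the use of RMSE as the criterion, and the toy durations (in milliseconds) are all hypothetical and are not results from the paper.

```python
import math
from collections import defaultdict

def rmse_by_class(records, model_names):
    """Return {class: {model: RMSE}} over records.

    Each record is a dict with 'cls' (phoneme class), 'ref' (reference
    duration) and one predicted duration per model name.
    """
    squared_error = defaultdict(lambda: defaultdict(float))
    count = defaultdict(int)
    for r in records:
        count[r["cls"]] += 1
        for m in model_names:
            squared_error[r["cls"]][m] += (r[m] - r["ref"]) ** 2
    return {
        c: {m: math.sqrt(squared_error[c][m] / count[c]) for m in model_names}
        for c in count
    }

def select_best_model(records, model_names):
    """Pick, for each phoneme class, the model with the lowest RMSE (assumed criterion)."""
    scores = rmse_by_class(records, model_names)
    return {c: min(model_names, key=lambda m: scores[c][m]) for c in scores}

def combined_prediction(record, best):
    """Predict a duration using the model selected for the record's class."""
    return record[best[record["cls"]]]

# Hypothetical development data: invented values, not taken from the paper.
dev = [
    {"cls": "short_vowel", "ref": 60.0, "model_a": 62.0, "model_b": 70.0},
    {"cls": "short_vowel", "ref": 80.0, "model_a": 77.0, "model_b": 90.0},
    {"cls": "geminated_consonant", "ref": 150.0, "model_a": 120.0, "model_b": 148.0},
    {"cls": "geminated_consonant", "ref": 170.0, "model_a": 140.0, "model_b": 173.0},
]
best = select_best_model(dev, ["model_a", "model_b"])
```

In a setup like this, the per-class selection would normally be made on held-out development data and the combined predictor then scored on separate test sentences, so that the selection step does not inflate the reported accuracy.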

Keywords:
Duration (music); Computer science; Speech recognition; Parametric statistics; Consonant; Set (abstract data type); Speech synthesis; Test set; Vowel; Parametric model; Arabic; Artificial intelligence; Natural language processing; Mathematics; Linguistics; Acoustics; Statistics

Metrics

Cited By: 12
FWCI (Field Weighted Citation Impact): 1.99
Refs: 15
Citation Normalized Percentile: 0.88

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
