JOURNAL ARTICLE

Cantonese text-to-speech synthesis using sub-syllable units

Abstract

This paper describes our recent investigation on the use of both intra-syllable and cross-syllable acoustic units for Cantonese text-to-speech synthesis. In our previous work, isolated monosyllable units were used for concatenative speech synthesis of Cantonese. The synthetic speech was considered to be unnatural in such a way that there was an obvious lack of perceptual continuity. The proposed system adopts an acoustic inventory that covers all legitimate intrasyllable and cross-syllable acoustic units. Synthetic speech produced via concatenation of such sub-syllable units better captures the pertinent transitory effects that are crucial to perceived naturalness. Different strategies are used to concatenate speech segments with different acoustic-phonetic properties. Subjective listening test shows a noticeable performance improvement that is accounted for mainly by smoother transition between sonorant segments.

Keywords:
Syllable Computer science Speech recognition Speech synthesis Natural language processing Artificial intelligence

Metrics

8
Cited By
2.20
FWCI (Field Weighted Citation Impact)
6
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech and dialogue systems
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Using cross-syllable units for Cantonese speech synthesis

Ka Man LawTan Lee

Year: 2000 Pages: vol. 2, 407-410
JOURNAL ARTICLE

Speech recognition using syllable-like units

Zhihong HuJohan SchalkwykEtienne BarnardRonald A. Cole

Journal:   4th International Conference on Spoken Language Processing (ICSLP 1996) Year: 1996 Pages: 1117-1120
© 2026 ScienceGate Book Chapters — All rights reserved.