JOURNAL ARTICLE

MANDARIN PROSODY BOUNDARY PREDICTION FOR IMPROVING MANDARIN LEARNING OF NON-NATIVE SPEAKERS

Hongwu YangYajing YanJiaolong Jiang

Year: 2020 Journal:   EDULEARN proceedings Vol: 1 Pages: 3328-3334   Publisher: International Academy of Technology, Education and Development

Abstract

Non-native Mandarin speakers always have some types of inherent intonation errors of pronunciation when they speak Mandarin, which is affected by their native language pronunciation habits. Mandarin prosodic structure makes learners speak Chinese sentences in cadence. Therefore, the prediction of prosodic structure from sentences is not only can help learners improving their Mandarin level but also is the key to improving the naturalness of Mandarin speech in the text-to-speech (TTS) system. The higher the accuracy of Mandarin prosody boundary prediction, the more accurate the pronunciation of non-native speakers using the TTS language education system. Most of the existing researches use the statistics-based machine learning method, especially deep learning-based technology such as BiLSTM, to predict the boundaries of the prosodic word and prosodic phrase from Chinese sentence. However, the predictive accuracy is not high, so that the synthesized Mandarin speech is not fluent enough. In this work, we proposed a sequence-to-sequence with attention mechanism (seq2seq+attention) model-based method to improve the prediction accuracy of the prosodic boundaries from Chinese sentences. Firstly, a large-scale text corpus is collected, including 100,000 Chinese sentences as the training corpus that was manually labeled the boundaries of the prosodic word and prosodic phrase under the guidance of a linguistic expert. We then proposed a new feature named syntactic hierarchical number (SHN) to describe the relationship between the syntactic structure and the prosodic structure of Chinese sentences. Finally, we trained the seq2seq+attention model that includes an input layer, an embedding layer, a BiLSTM-based encoder layer, a hidden layer, an LSTM-based decoder layer, and an output layer. The features used for the input layer include word embedding concatenated by part-of-speech, length of the word, and SHN. The experimental results show that the seq2seq+attention model with SHN feature achieves an F1-score of 98.14% in the prosodic word and 83.12% in the prosodic phrase, respectively. The F1-score of prosodic phrase increases by 0.24% compared with the result of the seq2seq+attention model without SHN and 7.02% compared with another method. Therefore, the proposed method can be applied to Mandarin education with artificial intelligence technologies, which uses speech synthesis technology to reduce the influence of native language pronunciation and improve the fluency of speaking Mandarin.

Keywords:
Mandarin Chinese Prosody Computer science Speech recognition Boundary (topology) Natural language processing Artificial intelligence Linguistics Mathematics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.15
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Educational Practices and Challenges
Social Sciences →  Social Sciences →  Education
Arabic Language Education Studies
Social Sciences →  Social Sciences →  Education
Technology-Enhanced Education Studies
Social Sciences →  Social Sciences →  Education

Related Documents

JOURNAL ARTICLE

Automatic tone assessment of non-native Mandarin speakers

Jian Cheng

Year: 2012 Pages: 1299-1302
JOURNAL ARTICLE

Improving Fluency of Spoken Mandarin for Nonnative Speakers by Prosodic Boundary Prediction Based on Deep Learning

Hongwu YangLi DongYajing Yan

Journal:   Wireless Communications and Mobile Computing Year: 2022 Vol: 2022 (1)
JOURNAL ARTICLE

English focus prosody processing and production by Mandarin speakers

Chikako TakahashiHyunah BaekSophia KaoAlex Hong-Lun YeungMarie K. HuffmanEllen BroselowJiwon Hwang

Journal:   The Journal of the Acoustical Society of America Year: 2017 Vol: 142 (4_Supplement)Pages: 2519-2519
JOURNAL ARTICLE

Mandarin Prosody Boundary Prediction based on Sequence-to-sequence Model

Yajing YanJiaolong JiangHongwu Yang

Journal:   2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC) Year: 2020 Pages: 1013-1017
© 2026 ScienceGate Book Chapters — All rights reserved.