Tone recognition of continuous speech of standard Chinese using neural network and tone nucleus model

Keikichi Hirose; Hui Hu; Xiaodong Wang; Nobuaki Minematsu

doi:10.21437/interspeech.2006-600

ScienceGate Book Chapters

JOURNAL ARTICLE

Tone recognition of continuous speech of standard Chinese using neural network and tone nucleus model

Keikichi Hirose Hui Hu Xiaodong Wang Nobuaki Minematsu

Year: 2006 Pages: paper 1929-Thu1FoP.10

DOI: 10.21437/interspeech.2006-600

Get Full-Text PDF Get Analytical Report

Abstract

A method is developed for recognizing lexical tone types of Standard Chinese syllables in continuous speech.Neural network (four-layered perceptron) is adopted as classifier.The method includes two steps; first recognizing tone types using prosodic features of voiced part, and then re-recognizing by viewing only on tone nucleus, which is a portion of the syllable showing rather stable fundamental frequency (F 0 ) contour regardless of tone types of the preceding and following syllables.The voiced part (or tone nucleus) is divided into 20 segments, and F 0 , delta-F 0 , F 0 slope and short-term energy of each segment are served as inputs to the neural network.In order to cope with tone coarticulation, prosodic feature parameters for the last 5 segments of the preceding syllable and the initial 5 segments of the following syllable are included in the neural network inputs.Information on syllable length is also added to the inputs.Tone recognition experiment was conducted for a female speaker's utterances included in HKU96 corpus.The average recognition rate was 86.5 % including neutral tone syllables, when the tone nucleus model was not used.It increased to 86.9 %, when the model was used.The obtained rate is higher by more than 3 points as compared to that obtained by the hidden-Markov-model-based tone recognizer developed by the authors formerly.

Keywords:

Speech recognition Tone (literature) Syllable Coarticulation Computer science Artificial neural network Hidden Markov model Artificial intelligence Vowel Linguistics

Metrics

Cited By

0.31

FWCI (Field Weighted Citation Impact)

Refs

0.63

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Phonetics and Phonology Research

Social Sciences → Psychology → Experimental and Cognitive Psychology

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Tone recognition of continuous speech of standard Chinese using neural network and tone nucleus model

Abstract

Metrics

Citation History

Topics

Related Documents

Tone Recognition of Continuous Mandarin Speech Based on Tone Nucleus Model and Neural Network

Tone recognition of Chinese continuous speech using tone critical segments

Tone nucleus modeling for Chinese lexical tone recognition

Study on Tone Recognition of Chinese Continuous Speech

New tone recognition methods for Chinese continuous speech