JOURNAL ARTICLE

Deep neural networks for Mandarin tone recognition

Abstract

This paper investigates the application of deep models including deep maxout networks(DMNs) to Mandarin tone recognition. Our focus is on the capacity of extracting high-level robust features and fusing different kinds of serially-concatenated features of deep models. Furthermore, Maxout networks have been proposed to integrate dropout naturally and achieve state-of-the-art results. Therefore, we investigate the advantage of DMNs when the training data is limited and imbalanced. Our experiments on the ASCCD corpus show that comparing with shallow models such as one-hidden layer multi-perception (MLP) and support vector machine(SVM), deep models improve Mandarin tone recognition significantly. Among the deep models, DMNs can get better performance comparing with other deep neural networks based on sigmoid units or rectified linear units(ReLU).

Keywords:
Mandarin Chinese Computer science Dropout (neural networks) Artificial intelligence Deep learning Support vector machine Tone (literature) Artificial neural network Deep neural networks Speech recognition Focus (optics) Pattern recognition (psychology) Feature extraction Machine learning

Metrics

13
Cited By
1.09
FWCI (Field Weighted Citation Impact)
25
Refs
0.79
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.