JOURNAL ARTICLE

Improved tone recognition for fluent Mandarin speech based on new inter-syllabic features and robust pitch extraction

Abstract

Tone recognition for fluent Mandarin speech has always been a very difficult problem, because the pitch contours vary seriously with the context conditions and the complicated tone behavior is difficult to analyze. A new set of four inter-syllabic features are identified to characterize quantitatively such pitch contour variation with respect to the context conditions. In addition, a robust pitch extraction method is proposed by integrating the adaptive Gabor representation (AGR) and instantaneous frequency amplitude spectrum (IFAS). Experimental results indicate that accurate pitch values can be extracted under various noisy conditions, and the tone recognition accuracy can be improved significantly.

Keywords:
Mandarin Chinese Tone (literature) Speech recognition Computer science Context (archaeology) Syllabic verse Feature extraction Set (abstract data type) Artificial intelligence Pattern recognition (psychology)

Metrics

11
Cited By
2.49
FWCI (Field Weighted Citation Impact)
14
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Image and Signal Denoising Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Blind Source Separation Techniques
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.