JOURNAL ARTICLE

CNN-BiGRU Speech Emotion Recognition Based on Attention Mechanism

Abstract

The speech emotion recognition system can significantly improve the efficiency of human-computer interaction by accurately recognizing emotional information in speech. This system typically includes two main steps: speech feature extraction and emotion classification. In order to improve accuracy, this article uses MFCC features, short-term energy features, and short-term average zero crossing rate as model inputs, and introduces a convolutional neural network based on attention mechanism and a bidirectional gated loop unit (BiGRU). This method can effectively focus on useful information in speech features. Compared to the CNN-BiLSTM network based on attention mechanism and the CNN-GRU network based on attention mechanism, this method can effectively improve the accuracy of speech emotion recognition when conducting experiments on the Chinese sentiment corpus CASIA.

Keywords:
Computer science Mechanism (biology) Speech recognition Artificial intelligence

Metrics

3
Cited By
0.81
FWCI (Field Weighted Citation Impact)
4
Refs
0.70
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
© 2026 ScienceGate Book Chapters — All rights reserved.