JOURNAL ARTICLE

Speech Emotion Recognition Using Convolutional- Recurrent Neural Networks with Attention Model

Abstract

Speech Emotion Recognition (SER) plays an important role in human-computer interface and assistant technologies. In this paper, a new method is proposed using distributed Convolution Neural Networks (CNN) to automatically learn affect-salient features from raw spectral information, and then applying Bidirectional Recurrent Neural Network (BRNN) to obtain the temporal information from the output of CNN. In the end, an Attention Mechanism is implemented on the output sequence of the BRNN to focus on target emotion-pertinent parts of an utterance. This attention mechanism not only improves the classification accuracy, but also provides model's interpretability. Experimental results show that this approach can gain 64.08% weighted accuracy and 56.41% unweighted accuracy for four-emotion classification in IEMOCAP dataset, which outperform previous results reported for this dataset.

Keywords:
Computer science Interpretability Convolutional neural network Artificial intelligence Utterance Focus (optics) Recurrent neural network Salient Speech recognition Convolution (computer science) Mechanism (biology) Emotion recognition Sequence (biology) Pattern recognition (psychology) Emotion classification Artificial neural network

Metrics

14
Cited By
0.81
FWCI (Field Weighted Citation Impact)
13
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Speech Emotion Recognition using Convolutional Neural Networks and Recurrent Neural Networks with Attention Model

Journal:   Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering Year: 2019
JOURNAL ARTICLE

3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition

Mingyi ChenXuanji HeJing YangHan Zhang

Journal:   IEEE Signal Processing Letters Year: 2018 Vol: 25 (10)Pages: 1440-1444
JOURNAL ARTICLE

Speech Emotion Recognition using Convolutional Neural Networks with Attention Mechanisms

A. PoongodaiY. V. NandiniT MounikaA J KarishmaN. K. Senthil Kumar

Journal:   International Research Journal of Innovations in Engineering and Technology Year: 2025 Vol: 09 (Special Issue ICCIS)Pages: 162-167
© 2026 ScienceGate Book Chapters — All rights reserved.