Speech Emotion Recognition using Convolutional Neural Networks with Attention Mechanisms

A. Poongodai; Y. V. Nandini; T Mounika; A J Karishma; N. K. Senthil Kumar

doi:10.47001/irjiet/2025.iccis-202526

JOURNAL ARTICLE

Year: 2025 Journal: International Research Journal of Innovations in Engineering and Technology Vol: 09 (Special Issue ICCIS) Pages: 162-167

Abstract

Speech Emotion Recognition (SER) is a crucial component of human-computer interaction, enabling machines to recognize and respond to human emotions effectively. This study proposes a novel SER framework using Convolutional Neural Networks (CNNs) augmented with attention mechanisms. The CNNs capture hierarchical and spatial features from spectrogram representations of speech signals, while the attention mechanisms focus on emotionally salient regions, improving interpretability and accuracy. The proposed model is evaluated on benchmark datasets, where it demonstrates superior performance compared to traditional methods. By prioritizing critical emotional features, the model improves its practical utility and reliability, which highlights the potential of combining CNNs with attention in real-world SER applications such as virtual assistants, customer support systems, and mental health monitoring. This work underlines the importance of deep learning techniques in developing SER technologies, paving the way for more intuitive and effective human-computer interaction.
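
The attention idea described in the abstract can be illustrated with a minimal sketch of attention pooling over per-frame features. This is not the authors' architecture, which the abstract does not specify. It assumes that a CNN has already turned a spectrogram into one feature vector per time frame, and it uses a hypothetical scoring vector `query` (in a trained model this would be learned). The names `softmax`, `attention_pool`, `frames`, and `query` are all illustrative.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(frames, query):
    """Collapse per-frame feature vectors into one utterance-level vector.

    frames: list of T feature vectors (each a list of D floats), standing in
            for CNN outputs computed from a spectrogram (assumed layout).
    query:  list of D floats, a hypothetical scoring vector (assumption).
    Returns (weights, context): T attention weights and a D-dim weighted sum.
    """
    # Score each frame by its dot product with the query vector.
    scores = [sum(q * f for q, f in zip(query, frame)) for frame in frames]
    # Normalize scores so the weights are positive and sum to 1.
    weights = softmax(scores)
    # Weighted sum of frames: high-scoring frames dominate the result.
    dim = len(frames[0])
    context = [
        sum(w * frame[d] for w, frame in zip(weights, frames))
        for d in range(dim)
    ]
    return weights, context


# Toy input: three frames with two features each; the second frame scores highest.
frames = [[0.1, 0.0], [2.0, 1.0], [0.2, 0.1]]
query = [1.0, 1.0]
weights, context = attention_pool(frames, query)
```

The returned `weights` show which frames the pooling step emphasized, which is one simple way such a model could expose the "emotionally salient regions" the abstract mentions; `context` would then feed a classifier over emotion labels.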

Keywords:

Interpretability; Computer science; Convolutional neural network; Artificial intelligence; Spectrogram; Benchmark (surveying); Salient; Machine learning; Deep learning; Focus (optics); Reliability (semiconductor)

Metrics

FWCI (Field Weighted Citation Impact): 0.00

Citation Normalized Percentile: 0.20

Topics

Emotion and Mood Recognition

Social Sciences → Psychology → Experimental and Cognitive Psychology

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence
