Abstract

This paper performs speech emotion recognition on short voice messages lasting less than three seconds, using one-dimensional convolutional neural networks. The Ravee dataset, voiced by professional actors, is exploited. The proposed convolutional neural network architecture for the speech emotion recognition system aims to improve accuracy and reduce the total processing cost of the speech emotion recognition model. Moreover, Mel-frequency cepstral coefficients are used as the main features for recognition purposes. Additionally, overfitting problems are avoided by utilizing data augmentation techniques and feature extraction algorithms, which enhance testing ac-curacy by increasing the number of training samples. Various simulations are conducted, through which it is observed that the proposed model provides recognition accuracy of up to 83%.

Keywords:
Overfitting Computer science Speech recognition Convolutional neural network Feature extraction Emotion recognition Mel-frequency cepstrum Artificial intelligence Feature (linguistics) Artificial neural network Speaker recognition Pattern recognition (psychology)

Metrics

6
Cited By
1.61
FWCI (Field Weighted Citation Impact)
30
Refs
0.81
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Speech Emotion Recognition Using Convolutional Neural Networks

Narsi Reddy

Journal:   International Journal for Research in Applied Science and Engineering Technology Year: 2024 Vol: 12 (8)Pages: 30-36
BOOK-CHAPTER

Speech Emotion Recognition Using Convolutional Neural Networks

Anunya SharmaKiran MalikPoonam Bansal

Communications in computer and information science Year: 2024 Pages: 90-101
BOOK-CHAPTER

Emotion Recognition from Speech Using Convolutional Neural Networks

Bayan MahfoodAshraf ElnagarFiruz Kamalov

Lecture notes in networks and systems Year: 2023 Pages: 719-731
© 2026 ScienceGate Book Chapters — All rights reserved.