JOURNAL ARTICLE

Emotion Recognition using Speech Data with Convolutional Neural Network

Abstract

Abstract—Identifying emotion from speech has a wide range of applications and has drawn special interests in research to improve the human-computer interaction experience. Traditional machine learning approaches usually face the challenge of selecting the optimal feature set for each application. Deep learning, on the other hand, allows end-to-end development of the models and inherent feature extraction. In this study, we evaluate the performance of Convolutional Neural Network on different kinds of spectral features of acoustic signal collections, from two popular open databases Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS) and Berlin Database of Emotional Speech (EmoDB). Two-to-eight classes of emotions (RAVDESS) and two-to-seven classes of emotions (EmoDB) are identified by the deep learning model. The results, in terms of unweighted average recall, are 0.888 (two classes) and 0.694 (eight classes) for the RAVDESS dataset. The corresponding results for the EmoDB dataset are 0.993 (two classes) and 0.764 (seven classes)

Keywords:
Computer science Convolutional neural network Speech recognition Recall Feature extraction Deep learning Artificial intelligence Set (abstract data type) Feature (linguistics) Artificial neural network Emotion recognition Data set Pattern recognition (psychology)

Metrics

11
Cited By
0.86
FWCI (Field Weighted Citation Impact)
22
Refs
0.75
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Speech emotion recognition using 2D-convolutional neural network

Fauzivy ReggiswarashariSari Widya Sihwi

Journal:   International Journal of Power Electronics and Drive Systems/International Journal of Electrical and Computer Engineering Year: 2022 Vol: 12 (6)Pages: 6594-6594
BOOK-CHAPTER

Efficient Speech to Emotion Recognition Using Convolutional Neural Network

R. Ganesh KumarN. M. Dhanya

Lecture notes in electrical engineering Year: 2021 Pages: 267-276
© 2026 ScienceGate Book Chapters — All rights reserved.