Abstract—Identifying emotion from speech has a wide range of applications and has drawn special interest in research aimed at improving the human-computer interaction experience. Traditional machine learning approaches usually face the challenge of selecting an optimal feature set for each application. Deep learning, on the other hand, allows end-to-end development of models with inherent feature extraction. In this study, we evaluate the performance of a convolutional neural network (CNN) on different kinds of spectral features of acoustic signals from two popular open databases: the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS) and the Berlin Database of Emotional Speech (EmoDB). The deep learning model identifies two to eight classes of emotions for RAVDESS and two to seven classes for EmoDB. In terms of unweighted average recall, the results are 0.888 (two classes) and 0.694 (eight classes) for the RAVDESS dataset; the corresponding results for the EmoDB dataset are 0.993 (two classes) and 0.764 (seven classes).
Fauzivy Reggiswarashari, Sari Widya Sihwi
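The abstract reports results as unweighted average recall (UAR), i.e., the mean of per-class recalls, so that every emotion class counts equally regardless of its sample count. A minimal sketch of the metric (the function name and the toy labels are hypothetical, not taken from the paper's data):

```python
import numpy as np

def unweighted_average_recall(y_true, y_pred):
    # UAR: average the recall of each class, weighting all classes equally
    classes = np.unique(y_true)
    recalls = [np.mean(y_pred[y_true == c] == c) for c in classes]
    return float(np.mean(recalls))

# toy two-class example (hypothetical labels)
y_true = np.array([0, 0, 0, 0, 1, 1])
y_pred = np.array([0, 0, 0, 1, 1, 0])
# class 0 recall = 3/4, class 1 recall = 1/2, so UAR = 0.625
print(unweighted_average_recall(y_true, y_pred))  # → 0.625
```

This is equivalent to macro-averaged recall (e.g., scikit-learn's `balanced_accuracy_score` for the two-class case), which is why UAR is preferred over plain accuracy on class-imbalanced emotion corpora.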