Speech emotion recognition using 2D-convolutional neural network

Fauzivy Reggiswarashari; Sari Widya Sihwi

doi:10.11591/ijece.v12i6.pp6594-6601

ScienceGate Book Chapters

JOURNAL ARTICLE

Speech emotion recognition using 2D-convolutional neural network

Fauzivy Reggiswarashari Sari Widya Sihwi

Year: 2022 Journal: International Journal of Power Electronics and Drive Systems/International Journal of Electrical and Computer Engineering Vol: 12 (6)Pages: 6594-6594 Publisher: Institute of Advanced Engineering and Science (IAES)

DOI: 10.11591/ijece.v12i6.pp6594-6601

Get Full-Text PDF Get Analytical Report

Abstract

<span lang="EN-US">This research proposes a speech emotion recognition model to predict human emotions using the convolutional neural network (CNN) by learning segmented audio of specific emotions. Speech emotion recognition utilizes the extracted features of audio waves to learn speech emotion characteristics; one of them is mel frequency cepstral coefficient (MFCC). Dataset takes a vital role to obtain valuable results in model learning. Hence this research provides the leverage of dataset combination implementation. The model learns a combined dataset with audio segmentation and zero padding using 2D-CNN. Audio segmentation and zero padding equalize the extracted audio features to learn the characteristics. The model results in 83.69% accuracy to predict seven emotions: neutral, happy, sad, angry, fear, disgust, and surprise from the combined dataset with the segmentation of the audio files.</span>

Keywords:

Computer science Mel-frequency cepstrum Speech recognition Convolutional neural network Segmentation Emotion recognition Artificial intelligence Surprise Leverage (statistics) Disgust Emotion classification Artificial neural network Pattern recognition (psychology) Feature extraction Psychology

Metrics

Cited By

1.97

FWCI (Field Weighted Citation Impact)

Refs

0.82

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Emotion and Mood Recognition

Social Sciences → Psychology → Experimental and Cognitive Psychology

Speech emotion recognition using 2D-convolutional neural network

Abstract

Metrics

Citation History

Topics

Related Documents

Speech Emotion Recognition Using 1D/2D Convolutional Neural Networks

Emotion Recognition using Speech Data with Convolutional Neural Network

Efficient Speech to Emotion Recognition Using Convolutional Neural Network

Multi-featured Speech Emotion Recognition Using Extended Convolutional Neural Network

Speech Emotion Recognition using Spectral Images and Convolutional Neural Network