Speech Emotion Recognition Using One-Dimensional Convolutional Neural Networks

Majd Saloumi; Omar Alshaltone; Sam Ansari; Soliman A. Mahmoud; Khawla A. Alnajjar; Eqab Almajali; Abir Hussain

doi:10.1109/tsp59544.2023.10197766

ScienceGate Book Chapters

JOURNAL ARTICLE

Speech Emotion Recognition Using One-Dimensional Convolutional Neural Networks

Majd Saloumi Omar Alshaltone Sam Ansari Soliman A. Mahmoud Khawla A. Alnajjar Eqab Almajali Abir Hussain

Year: 2023 Pages: 212-216

DOI: 10.1109/tsp59544.2023.10197766

Get Full-Text PDF Get Analytical Report

Abstract

This paper performs speech emotion recognition on short voice messages lasting less than three seconds, using one-dimensional convolutional neural networks. The Ravee dataset, voiced by professional actors, is exploited. The proposed convolutional neural network architecture for the speech emotion recognition system aims to improve accuracy and reduce the total processing cost of the speech emotion recognition model. Moreover, Mel-frequency cepstral coefficients are used as the main features for recognition purposes. Additionally, overfitting problems are avoided by utilizing data augmentation techniques and feature extraction algorithms, which enhance testing ac-curacy by increasing the number of training samples. Various simulations are conducted, through which it is observed that the proposed model provides recognition accuracy of up to 83%.

Keywords:

Overfitting Computer science Speech recognition Convolutional neural network Feature extraction Emotion recognition Mel-frequency cepstrum Artificial intelligence Feature (linguistics) Artificial neural network Speaker recognition Pattern recognition (psychology)

Metrics

Cited By

1.61

FWCI (Field Weighted Citation Impact)

Refs

0.81

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Emotion and Mood Recognition

Social Sciences → Psychology → Experimental and Cognitive Psychology

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech Emotion Recognition Using One-Dimensional Convolutional Neural Networks

Abstract

Metrics

Citation History

Topics

Related Documents

Speech Emotion Recognition Using Convolutional Neural Networks

Speech Emotion Recognition Using Convolutional Neural Networks

Emotion Recognition from Speech Using Convolutional Neural Networks

Speech Emotion Recognition Using Convolutional Recurrent Neural Networks

Speech Emotion Recognition Using Quaternion Convolutional Neural Networks