End-to-End Speech Emotion Recognition Based on One-Dimensional Convolutional Neural Network

Mengna Gao; Jing Dong; Dongsheng Zhou; Qiang Zhang; Deyun Yang

doi:10.1145/3319921.3319963

JOURNAL ARTICLE

Year: 2019 Pages: 78-82

Abstract

Real-time speech emotion recognition remains a challenging problem. To address it, we propose an end-to-end speech emotion recognition model based on a one-dimensional convolutional neural network that contains only three convolutional layers, two pooling layers, and one fully connected layer. Trained with the Adam optimization algorithm and backpropagation, the model progressively extracts more discriminative features. The model is simple in structure and completes the emotion classification task quickly. Unlike traditional methods, it requires no complex manual feature extraction: the model automatically learns emotional features from raw speech signals. In emotion recognition experiments on four speech databases (EMODB, CASIA, IEMOCAP, and CHEAVD), relatively high recognition rates were obtained. The experiments show that the proposed algorithm is of great benefit to the implementation of real-time speech emotion recognition.
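The layer count stated in the abstract (three convolutional layers, two pooling layers, one fully connected layer) can be illustrated with a minimal forward-pass sketch in plain Python. Everything beyond that layer count is an assumption made for illustration and is not taken from the paper: the layer ordering, kernel sizes, channel widths, ReLU activations, max pooling, the softmax output, the class count, and all function names. Training with Adam and backpropagation is omitted, and the weights are random.

```python
import math
import random

def conv1d(x, weights, biases, stride=1):
    """Valid 1-D convolution followed by ReLU (activation is an assumption).
    x is a list of channels; weights[o][i] is the kernel from input i to output o."""
    k = len(weights[0][0])
    out_len = (len(x[0]) - k) // stride + 1
    out = []
    for w_o, b in zip(weights, biases):
        row = []
        for t in range(out_len):
            s = b
            for x_i, w_i in zip(x, w_o):
                for j in range(k):
                    s += x_i[t * stride + j] * w_i[j]
            row.append(max(0.0, s))
        out.append(row)
    return out

def max_pool1d(x, size):
    """Non-overlapping max pooling over each channel."""
    return [[max(ch[t:t + size]) for t in range(0, len(ch) - size + 1, size)]
            for ch in x]

def dense_softmax(x, weights, biases):
    """Flatten all channels, apply one fully connected layer, then softmax."""
    flat = [v for ch in x for v in ch]
    logits = [sum(w * v for w, v in zip(row, flat)) + b
              for row, b in zip(weights, biases)]
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def init_conv(rng, out_ch, in_ch, k):
    """Small random kernels and zero biases (hypothetical initialisation)."""
    w = [[[rng.uniform(-0.1, 0.1) for _ in range(k)] for _ in range(in_ch)]
         for _ in range(out_ch)]
    return w, [0.0] * out_ch

def forward(signal, n_classes=4, seed=0):
    """Raw waveform in, class probabilities out.
    Assumed order: conv, pool, conv, pool, conv, fully connected."""
    rng = random.Random(seed)
    x = [list(signal)]                 # one input channel: the raw samples
    w1, b1 = init_conv(rng, 4, 1, 5)   # hypothetical sizes throughout
    x = max_pool1d(conv1d(x, w1, b1), 2)
    w2, b2 = init_conv(rng, 8, 4, 5)
    x = max_pool1d(conv1d(x, w2, b2), 2)
    w3, b3 = init_conv(rng, 8, 8, 3)
    x = conv1d(x, w3, b3)
    n_feat = len(x) * len(x[0])
    wd = [[rng.uniform(-0.1, 0.1) for _ in range(n_feat)]
          for _ in range(n_classes)]
    return dense_softmax(x, wd, [0.0] * n_classes)
```

Calling `forward(samples)` on a list of 64 raw samples returns one probability per assumed emotion class. In a trained model the class with the highest probability would be the predicted emotion; here the weights are untrained, so the output only demonstrates the data flow from raw signal to class scores.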

Keywords:

Computer science; Speech recognition; Discriminative model; Pooling; Convolutional neural network; Task (project management); Artificial intelligence; Convolution (computer science); Process (computing); Acoustic model; Feature extraction; Pattern recognition (psychology); End-to-end principle; Time delay neural network; Artificial neural network; Carry (investment); Speech processing

Metrics

FWCI (Field Weighted Citation Impact): 2.75

Citation Normalized Percentile: 0.88

Topics

Emotion and Mood Recognition

Social Sciences → Psychology → Experimental and Cognitive Psychology

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence
