Speech Emotion Recognition Using CNN

Zheng-Wei Huang; Ming Dong; Qirong Mao; Yongzhao Zhan

doi:10.1145/2647868.2654984

ScienceGate Book Chapters

JOURNAL ARTICLE

Speech Emotion Recognition Using CNN

Zheng-Wei Huang Ming Dong Qirong Mao Yongzhao Zhan

Year: 2014 Pages: 801-804

DOI: 10.1145/2647868.2654984

Get Full-Text PDF Get Analytical Report

Abstract

Deep learning systems, such as Convolutional Neural Networks (CNNs), can infer a hierarchical representation of input data that facilitates categorization. In this paper, we propose to learn affect-salient features for Speech Emotion Recognition (SER) using semi-CNN. The training of semi-CNN has two stages. In the first stage, unlabeled samples are used to learn candidate features by contractive convolutional neural network with reconstruction penalization. The candidate features, in the second step, are used as the input to semi-CNN to learn affect-salient, discriminative features using a novel objective function that encourages the feature saliency, orthogonality and discrimination. Our experiment results on benchmark datasets show that our approach leads to stable and robust recognition performance in complex scenes (e.g., with speaker and environment distortion), and outperforms several well-established SER features.

Keywords:

Computer science Convolutional neural network Discriminative model Pattern recognition (psychology) Artificial intelligence Orthogonality Salient Feature (linguistics) Benchmark (surveying) Speech recognition Representation (politics) Feature learning Categorization Mathematics

Metrics

388

Cited By

11.70

FWCI (Field Weighted Citation Impact)

Refs

0.99

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Emotion and Mood Recognition

Social Sciences → Psychology → Experimental and Cognitive Psychology

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Speech Emotion Recognition Using CNN

Abstract

Metrics

Citation History

Topics

Related Documents

Emotion Recognition Using Speech

Emotion Recognition using Imperfect Speech Recognition

Emotion recognition using imperfect speech recognition

Speech Emotion Recognition Using ANN

Speech Emotion Recognition Using LSTM