Speech Emotion Recognition Using Semi-supervised Learning with Ladder Networks

Jian Huang; Ya Li; Jianhua Tao; Zheng Lian; Mingyue Niu; Jiangyan Yi

doi:10.1109/aciiasia.2018.8470363

JOURNAL ARTICLE

Year: 2018 Pages: 1-5

Abstract

As a major branch of speech processing, speech emotion recognition has attracted considerable attention from researchers. Prior work has proposed a variety of models and feature sets for training such systems. In this paper, we propose using semi-supervised learning with ladder networks to generate robust feature representations for speech emotion recognition. In our method, the input to the ladder network is a set of normalized static acoustic features, which is mapped to high-level hidden representations. The model is trained by back-propagation to minimize the sum of the supervised and unsupervised cost functions simultaneously. The extracted hidden representations are then used as emotional features in an SVM model for speech emotion recognition. Experimental results on the IEMOCAP database show 2.6% higher performance than a denoising auto-encoder and 5.3% higher than the static acoustic features.
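The combined training objective described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the per-layer weights, and the toy values are all assumptions made for illustration, and the network itself (encoder, noise injection, decoder) is left out. The sketch assumes the usual ladder-network formulation, in which the supervised term is a cross-entropy over labeled utterances and the unsupervised term is a weighted squared error between clean and denoised layer activations.

```python
import math


def softmax(logits):
    """Numerically stable softmax over one list of class scores."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def supervised_cost(logits_batch, labels):
    """Mean cross-entropy, computed over labeled utterances only."""
    losses = [-math.log(softmax(logits)[label])
              for logits, label in zip(logits_batch, labels)]
    return sum(losses) / len(losses)


def unsupervised_cost(clean_layers, denoised_layers, layer_weights):
    """Weighted mean squared reconstruction error, summed over layers.

    Each layer is a batch of activation vectors (a list of lists).
    layer_weights are hypothetical per-layer importance factors.
    """
    total = 0.0
    for clean, denoised, weight in zip(clean_layers, denoised_layers, layer_weights):
        squared = [(c - d) ** 2
                   for c_row, d_row in zip(clean, denoised)
                   for c, d in zip(c_row, d_row)]
        total += weight * sum(squared) / len(squared)
    return total


def ladder_cost(logits_batch, labels, clean_layers, denoised_layers, layer_weights):
    """Sum of the supervised and unsupervised costs, minimized jointly."""
    return (supervised_cost(logits_batch, labels)
            + unsupervised_cost(clean_layers, denoised_layers, layer_weights))


# Toy example: one labeled utterance, two emotion classes, one hidden layer.
example = ladder_cost(
    logits_batch=[[0.0, 0.0]],
    labels=[0],
    clean_layers=[[[1.0, 0.0]]],
    denoised_layers=[[[0.0, 0.0]]],
    layer_weights=[0.5],
)
print(example)
```

In a semi-supervised setting the unsupervised term would be evaluated on both labeled and unlabeled utterances, while the supervised term uses only the labeled ones; this is how unlabeled speech can contribute to the learned representation.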

Keywords:

Computer science; Speech recognition; Feature (linguistics); Artificial intelligence; Artificial neural network; Emotion recognition; Feature learning; Encoder; Representation (politics); Feature extraction; Pattern recognition (psychology); Speech processing; Support vector machine; Acoustic model; Supervised learning

Metrics

FWCI (Field Weighted Citation Impact): 4.72

Citation Normalized Percentile: 0.94

Topics

Emotion and Mood Recognition

Social Sciences → Psychology → Experimental and Cognitive Psychology

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Music and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Related Documents

Semi-supervised Ladder Networks for Speech Emotion Recognition

Correction to: Semi-supervised Ladder Networks for Speech Emotion Recognition

Speech Emotion Recognition Using Semi-Supervised Learning with Efficient Labeling Strategies