Improving automatic emotion recognition from speech signals

Elif Bozkurt; Engin Erzin; Çiğdem Eroğlu Erdem; A. Tanju Erdem

doi:10.21437/interspeech.2009-106

JOURNAL ARTICLE

Year: 2009 Pages: 324-327

Abstract

We present a speech-signal-driven emotion recognition system. The system is trained and tested on the INTERSPEECH 2009 Emotion Challenge corpus, which contains spontaneous, emotionally rich recordings. The challenge includes classifier and feature sub-challenges with five-class and two-class classification problems. We investigate prosody-related, spectral and HMM-based features for emotion recognition with Gaussian mixture model (GMM) based classifiers. Spectral features consist of mel-frequency cepstral coefficients (MFCC), line spectral frequency (LSF) features and their derivatives, whereas prosody-related features consist of mean-normalized values of pitch, first derivative of pitch and intensity. Unsupervised training of HMM structures is employed to define prosody-related temporal features for the emotion recognition problem. We also investigate data fusion of different features and decision fusion of different classifiers, which are not well studied within the emotion recognition framework. Experimental results of automatic emotion recognition on the INTERSPEECH 2009 Emotion Challenge corpus are presented.

Index Terms: emotion recognition, prosody modeling
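The pipeline outlined in the abstract (mean-normalized prosody features, per-class GMM scoring, and decision fusion of classifiers) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the diagonal-covariance GMM form, the zero-padded first difference used as the pitch derivative, and the weighted-sum fusion rule with weight `alpha` are all assumptions made for illustration, and the abstract does not specify any of them.

```python
import math

def mean_normalize(values):
    """Subtract the utterance-level mean from each frame value."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]

def first_derivative(values):
    """Frame-to-frame difference; the first frame gets 0.0 so lengths match
    (an assumed convention, not taken from the paper)."""
    return [0.0] + [b - a for a, b in zip(values, values[1:])]

def prosody_features(pitch, intensity):
    """Per-frame vectors [normalized pitch, pitch derivative, normalized intensity]."""
    p = mean_normalize(pitch)
    dp = first_derivative(p)
    i = mean_normalize(intensity)
    return [list(frame) for frame in zip(p, dp, i)]

def gmm_log_likelihood(frames, gmm):
    """Average per-frame log-likelihood under a diagonal-covariance GMM.
    `gmm` is a list of (weight, means, variances) components (hypothetical layout)."""
    total = 0.0
    for x in frames:
        comp_logs = []
        for weight, means, variances in gmm:
            ll = math.log(weight)
            for xi, m, v in zip(x, means, variances):
                ll += -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
            comp_logs.append(ll)
        # log-sum-exp over components for numerical stability
        peak = max(comp_logs)
        total += peak + math.log(sum(math.exp(c - peak) for c in comp_logs))
    return total / len(frames)

def classify(frames, class_gmms):
    """Score the frames against one GMM per class and pick the best class."""
    scores = {label: gmm_log_likelihood(frames, g) for label, g in class_gmms.items()}
    return max(scores, key=scores.get), scores

def fuse_decisions(scores_a, scores_b, alpha=0.5):
    """Decision fusion as a weighted sum of two classifiers' per-class scores
    (one simple fusion rule among many; assumed here)."""
    fused = {k: alpha * scores_a[k] + (1.0 - alpha) * scores_b[k] for k in scores_a}
    return max(fused, key=fused.get), fused
```

In such a setup, one classifier could score prosody features and another spectral features, with `alpha` tuned on held-out data; the class labels and model parameters would come from training, which is omitted here.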

Keywords: Computer science; Prosody; Speech recognition; Mel-frequency cepstrum; Artificial intelligence; Hidden Markov model; Pattern recognition (psychology); Classifier (UML); Emotion recognition; Emotion classification; Mixture model; Feature extraction

Metrics

FWCI (Field Weighted Citation Impact): 3.59

Citation Normalized Percentile: 0.91

Topics

Emotion and Mood Recognition

Social Sciences → Psychology → Experimental and Cognitive Psychology

Face and Expression Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Related Documents

Emotion recognition from Mandarin speech signals

Emotion recognition by speech signals

Emotion Recognition using Speech Signals

Context-Independent Multilingual Emotion Recognition from Speech Signals

Improving Speech Emotion Recognition with Emotion Dynamics