JOURNAL ARTICLE

Improving automatic emotion recognition from speech signals

Abstract

We present a speech signal driven emotion recognition system. Our system is trained and tested with the INTERSPEECH 2009 Emotion Challenge corpus, which includes spontaneous and emotionally rich recordings. The challenge includes classifier and feature sub-challenges with five-class and two-class classification problems. We investigate prosody related, spectral and HMM-based features for the evaluation of emotion recognition with Gaussian mixture model (GMM) based classifiers. Spectral features consist of mel-scale cepstral coefficients (MFCC), line spectral frequency (LSF) features and their derivatives, whereas prosody-related features consist of mean normalized values of pitch, first derivative of pitch and intensity. Unsupervised training of HMM structures are employed to define prosody related temporal features for the emotion recognition problem. We also investigate data fusion of different features and decision fusion of different classifiers, which are not well studied for emotion recognition framework. Experimental results of automatic emotion recognition with the INTERSPEECH 2009 Emotion Challenge corpus are presented. Index Terms: emotion recognition, prosody modeling

Keywords:
Computer science Prosody Speech recognition Mel-frequency cepstrum Artificial intelligence Hidden Markov model Pattern recognition (psychology) Classifier (UML) Emotion recognition Emotion classification Mixture model Feature extraction

Metrics

34
Cited By
3.59
FWCI (Field Weighted Citation Impact)
8
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Emotion Recognition using Speech Signals

S. Harsha VardhanM. P. RahuPuttamreddy KavyasriA. Sraavani

Journal:   International Journal of Advanced Research in Science Communication and Technology Year: 2022 Pages: 126-132
JOURNAL ARTICLE

Context-Independent Multilingual Emotion Recognition from Speech Signals

Vladimir HozjanZdravko Kačič

Journal:   International Journal of Speech Technology Year: 2003 Vol: 6 (3)Pages: 311-320
© 2026 ScienceGate Book Chapters — All rights reserved.