JOURNAL ARTICLE

Revisiting Hidden Markov Models for Speech Emotion Recognition

Abstract

Hidden Markov models (HMMs) have a long tradition in automatic speech recognition (ASR) due to their capability of capturing temporal dynamic characteristics of speech. For emotion recognition from speech, three HMM based architectures are investigated and compared throughout the current paper, namely, the Gaussian mixture model based HMMs (GMM-HMMs), the subspace based Gaussian mixture model based HMMs (SGMM-HMMs) and the hybrid deep neural network HMMs (DNN-HMMs). Extensive emotion recognition experiments are carried out on these three architectures on the CASIA corpus, the Emo-DB corpus and the IEMOCAP database, respectively, and results are compared with those of state-of-the-art approaches. These HMM based architectures prove capable of constituting an effective model for speech emotion recognition. Also, the modeling accuracy is further enhanced by incorporating various advanced techniques from the ASR area. In particular, among all of the architectures, the SGMM-HMMs achieve the best performance in most of the experiments.

Keywords:
Hidden Markov model Computer science Speech recognition Subspace topology Mixture model Artificial intelligence Emotion recognition Pattern recognition (psychology) Artificial neural network Gaussian

Metrics

73
Cited By
6.30
FWCI (Field Weighted Citation Impact)
35
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Speech emotion recognition using hidden Markov models

Tin Lay NweSay Wei FooLiyanage C. De Silva

Journal:   Speech Communication Year: 2003 Vol: 41 (4)Pages: 603-623
JOURNAL ARTICLE

Hidden Markov model-based speech emotion recognition

Björn W. SchullerGerhard RigollM. Lang

Journal:   2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). Year: 2003 Vol: 2 Pages: II-1
JOURNAL ARTICLE

Hidden Markov Models for Speech Recognition

B.-H. JuangL. R. Rabiner

Journal:   Technometrics Year: 1991 Vol: 33 (3)Pages: 251-251
© 2026 ScienceGate Book Chapters — All rights reserved.