Revisiting Hidden Markov Models for Speech Emotion Recognition

Shuiyang Mao; Dehua Tao; Guangyan Zhang; P.C. Ching; Tan Lee

doi:10.1109/icassp.2019.8683172

ScienceGate Book Chapters

JOURNAL ARTICLE

Revisiting Hidden Markov Models for Speech Emotion Recognition

Shuiyang Mao Dehua Tao Guangyan Zhang P.C. Ching Tan Lee

Year: 2019 Pages: 6715-6719

DOI: 10.1109/icassp.2019.8683172

Get Full-Text PDF Get Analytical Report

Abstract

Hidden Markov models (HMMs) have a long tradition in automatic speech recognition (ASR) due to their capability of capturing temporal dynamic characteristics of speech. For emotion recognition from speech, three HMM based architectures are investigated and compared throughout the current paper, namely, the Gaussian mixture model based HMMs (GMM-HMMs), the subspace based Gaussian mixture model based HMMs (SGMM-HMMs) and the hybrid deep neural network HMMs (DNN-HMMs). Extensive emotion recognition experiments are carried out on these three architectures on the CASIA corpus, the Emo-DB corpus and the IEMOCAP database, respectively, and results are compared with those of state-of-the-art approaches. These HMM based architectures prove capable of constituting an effective model for speech emotion recognition. Also, the modeling accuracy is further enhanced by incorporating various advanced techniques from the ASR area. In particular, among all of the architectures, the SGMM-HMMs achieve the best performance in most of the experiments.

Keywords:

Hidden Markov model Computer science Speech recognition Subspace topology Mixture model Artificial intelligence Emotion recognition Pattern recognition (psychology) Artificial neural network Gaussian

Metrics

Cited By

6.30

FWCI (Field Weighted Citation Impact)

Refs

0.97

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Speech Recognition and Synthesis

Physical Sciences → Computer Science → Artificial Intelligence

Emotion and Mood Recognition

Social Sciences → Psychology → Experimental and Cognitive Psychology

Speech and Audio Processing

Physical Sciences → Computer Science → Signal Processing

Revisiting Hidden Markov Models for Speech Emotion Recognition

Abstract

Metrics

Citation History

Topics

Related Documents

Speech emotion recognition using hidden Markov models

Speech emotion recognition using hidden Markov models

Hidden Markov model-based speech emotion recognition

Hidden Markov model-based speech emotion recognition

Hidden Markov Models for Speech Recognition