JOURNAL ARTICLE

Audio-visual based emotion recognition using tripled hidden Markov model

Abstract

Emotion recognition is one of the latest challenges in intelligent human/machine communication. Most of previous work on emotion recognition focused on extracting emotions from visual or audio information separately. A novel approach is presented in this paper to recognize the human emotion which uses both visual and audio from video clips. A tripled hidden Markov model is introduced to perform the recognition which allows the state asynchrony of the audio and visual observation sequences while preserving their natural correlation over time. The experimental results show that this approach outperforms only using visual or audio separately.

Keywords:
Computer science Hidden Markov model Speech recognition Emotion recognition Audio visual Artificial intelligence Asynchrony (computer programming) Visualization Pattern recognition (psychology) Multimedia Asynchronous communication

Metrics

28
Cited By
0.26
FWCI (Field Weighted Citation Impact)
11
Refs
0.58
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.