JOURNAL ARTICLE

Training combination strategy of multi-stream fused hidden Markov model for audio-visual affect recognition

Abstract

To simulate the human ability to assess affects, an automatic affect recognition system should make use of multi-sensor information. In the framework of multi-stream fused hidden Markov model (MFHMM), we present a training combination strategy towards audio-visual affect recognition. Different from the weighting combination scheme, our approach is able to use a variety of learning methods to obtain a robust multi-stream fusion result. We evaluate our approach in personal-independent recognition of 11 affective states from 20 subjects. The experimental results suggest that MFHMM outperforms IHMM which assumes the independence among streams, and the training combination strategy has the superiority over the weighting combination under clean and varying audio channel noise condition.

Keywords:
Computer science Hidden Markov model Speech recognition Weighting Artificial intelligence Noise (video) Audio visual Independence (probability theory) Affect (linguistics) Pattern recognition (psychology) Machine learning Scheme (mathematics) Multimedia Image (mathematics)

Metrics

48
Cited By
3.66
FWCI (Field Weighted Citation Impact)
15
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Blind Source Separation Techniques
Physical Sciences →  Computer Science →  Signal Processing
© 2026 ScienceGate Book Chapters — All rights reserved.