To simulate the human ability to assess affects, an automatic affect recognition system should make use of multi-sensor information. In the framework of multi-stream fused hidden Markov model (MFHMM), we present a training combination strategy towards audio-visual affect recognition. Different from the weighting combination scheme, our approach is able to use a variety of learning methods to obtain a robust multi-stream fusion result. We evaluate our approach in personal-independent recognition of 11 affective states from 20 subjects. The experimental results suggest that MFHMM outperforms IHMM which assumes the independence among streams, and the training combination strategy has the superiority over the weighting combination under clean and varying audio channel noise condition.
Zhihong ZengJilin TuBrian PianfettiMing LiuTong ZhangZhang Zhen-qiuThomas S. HuangS. Levinson
Zhihong ZengJilin TuMing LiuThomas S. Huang
Jingxuan ZhaoXiao WuDongmei Jiang
Mingli SongChun ChenMingyu You