With the rapid development of social networks, the massive growth of multimodal data such as images and text has raised people's demands for information processing from an emotional perspective. Emotion recognition requires a computer to simulate high-level visual perception and understanding. However, existing methods often focus on a single modality. In this work, we propose a multimodal model based on factorized bilinear pooling (FBP) and adversarial learning for emotion recognition. In our model, a multimodal feature fusion network encodes inter-modality features under the guidance of FBP, so that the visual and textual representations learn from each other interactively. Beyond that, we propose an adversarial network that introduces two discriminative classification tasks: emotion recognition and multimodal fusion prediction. The entire method can be trained end-to-end within a deep neural network framework. Experimental results indicate that our proposed model achieves competitive performance on the extended FI dataset, and further results demonstrate its ability for emotion recognition against other single- and multi-modality works.
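As a minimal sketch of what an FBP fusion layer of this kind might look like (the module name, dimensions, and PyTorch framing are illustrative assumptions, not the paper's actual implementation): each modality vector is projected into a shared low-rank space, the projections are multiplied element-wise to capture bilinear interactions, and the result is sum-pooled and normalized.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FactorizedBilinearPooling(nn.Module):
    """Sketch of FBP fusion for a visual and a textual feature vector.

    Assumed hyperparameters (not from the paper): factor_dim output
    units, each sum-pooled over k low-rank factors.
    """
    def __init__(self, visual_dim, text_dim, factor_dim, k=5):
        super().__init__()
        self.k = k
        # Project each modality into a shared (factor_dim * k) space.
        self.proj_v = nn.Linear(visual_dim, factor_dim * k)
        self.proj_t = nn.Linear(text_dim, factor_dim * k)

    def forward(self, v, t):
        # Element-wise product of the projections approximates a full
        # bilinear interaction at low rank.
        joint = self.proj_v(v) * self.proj_t(t)           # (B, factor_dim * k)
        joint = joint.view(-1, joint.size(1) // self.k, self.k).sum(dim=2)
        # Signed square-root and l2 normalization, as is standard for FBP.
        joint = torch.sign(joint) * torch.sqrt(joint.abs() + 1e-8)
        return F.normalize(joint, dim=1)

# Usage: fuse a 2048-d visual vector with a 300-d text vector.
fbp = FactorizedBilinearPooling(visual_dim=2048, text_dim=300, factor_dim=512)
fused = fbp(torch.randn(4, 2048), torch.randn(4, 300))    # shape: (4, 512)
```

The fused vector could then feed both the emotion classifier and the adversarial branch described above; how those heads are wired is specific to the paper and not reproduced here.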