JOURNAL ARTICLE

Fusion with Hierarchical Graphs for Multimodal Emotion Recognition

Shuyun TangZhaojie LuoGuoshun NanJun BabaYuichiro YoshikawaHiroshi Ishiguro

Year: 2022 Journal:   2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

Abstract

Automatic emotion recognition (AER) based on enriched multimodal inputs, including text, speech, and visual clues, is crucial in the development of emotionally intelligent machines. Although complex modality relationships have been proven effective for AER, they are still largely underexplored because previous works predominantly relied on various fusion mechanisms with simply concatenated features to learn mul-timodal representations for emotion classification. This paper proposes a novel hierarchical fusion graph convolutional net-work (HFGCN) model that learns more informative multimodal representations by considering the modality dependencies during the feature fusion procedure. Specifically, the proposed model fuses multimodality inputs using a two-stage graph construction approach and encodes the modality dependencies into the con-versation representation. We verified the interpretable capabilities of the proposed method by projecting the emotional states to a 2D valence-arousal (VA) subspace. Extensive experiments showed the effectiveness of our proposed model for more accurate AER, which yielded state-of-the-art results on two public datasets, IEMOCAP and MELD.

Keywords:
Computer science Modality (human–computer interaction) Emotion recognition Artificial intelligence Subspace topology Convolutional neural network Valence (chemistry) Graph Affective computing Representation (politics) Pattern recognition (psychology) Machine learning Natural language processing Theoretical computer science

Metrics

10
Cited By
1.62
FWCI (Field Weighted Citation Impact)
46
Refs
0.84
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Sentiment Analysis and Opinion Mining
Physical Sciences →  Computer Science →  Artificial Intelligence
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multimodal Emotion Recognition Based on Hierarchical Feature Fusion

Yinggang XieNannan ZhouShijuan Zhu

Journal:   電腦學刊 Year: 2025 Vol: 36 (2)Pages: 281-296
JOURNAL ARTICLE

Multimodal Emotion Recognition Based on Cascaded Multichannel and Hierarchical Fusion

Xia LiuZhijing XuKan Huang

Journal:   Computational Intelligence and Neuroscience Year: 2023 Vol: 2023 (1)Pages: 9645611-9645611
JOURNAL ARTICLE

Hierarchical Attention‐Based Multimodal Fusion Network for Video Emotion Recognition

Xiaodong LiuSongyang LiMiao Wang

Journal:   Computational Intelligence and Neuroscience Year: 2021 Vol: 2021 (1)Pages: 5585041-5585041
JOURNAL ARTICLE

Multimodal emotion recognition with hierarchical memory networks

Helang LaiKeke WuLingli Li

Journal:   Intelligent Data Analysis Year: 2021 Vol: 25 (4)Pages: 1031-1045
© 2026 ScienceGate Book Chapters — All rights reserved.