JOURNAL ARTICLE

GCF2-Net: global-aware cross-modal feature fusion network for speech emotion recognition

Abstract

Emotion recognition plays an essential role in interpersonal communication. However, existing recognition systems use only features of a single modality for emotion recognition, ignoring the interaction of information from the different modalities. Therefore, in our study, we propose a global-aware Cross-modal feature Fusion Network (GCF2-Net) for recognizing emotion. We construct a residual cross-modal fusion attention module (ResCMFA) to fuse information from multiple modalities and design a global-aware module to capture global details. More specifically, we first use transfer learning to extract wav2vec 2.0 features and text features fused by the ResCMFA module. Then, cross-modal fusion features are fed into the global-aware module to capture the most essential emotional information globally. Finally, the experiment results have shown that our proposed method has significant advantages than state-of-the-art methods on the IEMOCAP and MELD datasets, respectively.

Keywords:
Fuse (electrical) Feature (linguistics) Emotion recognition Construct (python library) Modality (human–computer interaction) Fusion Feature extraction Pattern recognition (psychology) Sensor fusion

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.72
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Agriculture and Farm Safety
Life Sciences →  Agricultural and Biological Sciences →  Plant Science
Human-Animal Interaction Studies
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Genetics
Agriculture Sustainability and Environmental Impact
Physical Sciences →  Environmental Science →  Ecology

Related Documents

JOURNAL ARTICLE

GCF2-Net: global-aware cross-modal feature fusion network for speech emotion recognition

Feng LiJiusong LuoLingling WangWei LiuXiaoshuang Sang

Journal:   Frontiers in Neuroscience Year: 2023 Vol: 17 Pages: 1183132-1183132
BOOK-CHAPTER

Speech Emotion Recognition Using Global-Aware Cross-Modal Feature Fusion Network

Feng LiJiusong Luo

Lecture notes in computer science Year: 2023 Pages: 211-221
JOURNAL ARTICLE

Speech Emotion Recognition with Global-Aware Fusion on Multi-Scale Feature Representation

Wenjing ZhuXiang Li

Journal:   ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Year: 2022 Pages: 6437-6441
© 2026 ScienceGate Book Chapters — All rights reserved.