JOURNAL ARTICLE

GCF2-Net: global-aware cross-modal feature fusion network for speech emotion recognition

Feng LiJiusong LuoLingling WangWei LiuXiaoshuang Sang

Year: 2023 Journal:   Frontiers in Neuroscience Vol: 17 Pages: 1183132-1183132   Publisher: Frontiers Media

Abstract

Emotion recognition plays an essential role in interpersonal communication. However, existing recognition systems use only features of a single modality for emotion recognition, ignoring the interaction of information from the different modalities. Therefore, in our study, we propose a global-aware Cross-modal feature Fusion Network (GCF 2 -Net) for recognizing emotion. We construct a residual cross-modal fusion attention module (ResCMFA) to fuse information from multiple modalities and design a global-aware module to capture global details. More specifically, we first use transfer learning to extract wav2vec 2.0 features and text features fused by the ResCMFA module. Then, cross-modal fusion features are fed into the global-aware module to capture the most essential emotional information globally. Finally, the experiment results have shown that our proposed method has significant advantages than state-of-the-art methods on the IEMOCAP and MELD datasets, respectively.

Keywords:
Computer science Modal Fuse (electrical) Modalities Feature (linguistics) Emotion recognition Residual Modality (human–computer interaction) Artificial intelligence Construct (python library) Affective computing Speech recognition Machine learning Pattern recognition (psychology) Engineering Algorithm

Metrics

8
Cited By
3.33
FWCI (Field Weighted Citation Impact)
85
Refs
0.87
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Sentiment Analysis and Opinion Mining
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

BOOK-CHAPTER

Speech Emotion Recognition Using Global-Aware Cross-Modal Feature Fusion Network

Feng LiJiusong Luo

Lecture notes in computer science Year: 2023 Pages: 211-221
JOURNAL ARTICLE

Speech Emotion Recognition with Global-Aware Fusion on Multi-Scale Feature Representation

Wenjing ZhuXiang Li

Journal:   ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Year: 2022 Pages: 6437-6441
© 2026 ScienceGate Book Chapters — All rights reserved.