JOURNAL ARTICLE

Dual-Branch Multimodal Fusion Network for Driver Facial Emotion Recognition

Le WangYuchen ChangKaiping Wang

Year: 2024 Journal:   Applied Sciences Vol: 14 (20)Pages: 9430-9430   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

In the transition to fully automated driving, the interaction between drivers and vehicles is crucial as drivers’ emotions directly influence their behavior, thereby impacting traffic safety. Currently, relying solely on a backbone based on a convolutional neural network (CNN) to extract single RGB modal facial features makes it difficult to capture enough semantic information. To address this issue, this paper proposes a Dual-branch Multimodal Fusion Network (DMFNet). DMFNet extracts semantic features from visible–infrared (RGB-IR) image pairs effectively capturing complementary information between two modalities and achieving a more accurate understanding of the drivers’ emotional state at a global level. However, the accuracy of facial recognition is significantly affected by variations in the drivers’ head posture and light environment. Thus, we further propose a U-Shape Reconstruction Network (URNet) to focus on enhancing and reconstructing the detailed features of RGB modes. Additionally, we design a Detail Enhancement Block (DEB) embedded in a U-shaped reconstruction network for high-frequency filtering. Compared with the original driver emotion recognition model, our method improved the accuracy by 18.77% on the DEFE++ dataset, proving the superiority of the proposed method.

Keywords:
Computer science Artificial intelligence RGB color model Convolutional neural network Focus (optics) Block (permutation group theory) Computer vision Deep learning Pattern recognition (psychology)

Metrics

3
Cited By
3.29
FWCI (Field Weighted Citation Impact)
66
Refs
0.86
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Sleep and Work-Related Fatigue
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Emotion and Mood Recognition
Social Sciences →  Psychology →  Experimental and Cognitive Psychology
Gaze Tracking and Assistive Technology
Physical Sciences →  Computer Science →  Human-Computer Interaction
© 2026 ScienceGate Book Chapters — All rights reserved.