JOURNAL ARTICLE

MFFNet: Multi-Modal Feature Fusion Network for V-D-T Salient Object Detection

Bin WanXiaofei ZhouYaoqi SunTingyu WangChengtao LvShuai WangHaibing YinChenggang Yan

Year: 2023 Journal:   IEEE Transactions on Multimedia Vol: 26 Pages: 2069-2081   Publisher: Institute of Electrical and Electronics Engineers

Abstract

This article discusses the limitations of single- and two-modal salient object detection (SOD) methods and the emergence of multi-modal SOD techniques that integrate Visible, Depth, or Thermal information. However, current multi-modal methods often rely on simple fusion techniques such as addition, multiplication and concatenation, to combine the different modalities, which is ineffective for challenging scenes, such as low illumination and background messy. To address this issue, we propose a novel multi-modal feature fusion network (MFFNet) for V-D-T salient object detection, where the two key points are the triple-modal deep fusion encoder and the progressive feature enhancement decoder. The MFFNet's triple-modal deep fusion (TDF) module is designed to integrate the features of the three modalities and explore their complementarity by utilizing mutual optimization during the encoding phase. In addition, the progressive feature enhancement decoder consists of the weighted context-enhanced feature (WCF) module, region optimization (RO) module and boundary perception (BP) module to produce region-aware and contour-aware features. After that, a multi-scale fusion (MF) module is proposed to integrate these features and generate high-quality saliency maps. We conduct extensive experiments on the VDT-2048 dataset, and our results show that the proposed MFFNet outperforms 12 state-of-the-art multi-modal methods.

Keywords:
Computer science Artificial intelligence Concatenation (mathematics) Modal Feature (linguistics) Pattern recognition (psychology) Salient Encoder Fusion Computer vision Mathematics

Metrics

36
Cited By
6.55
FWCI (Field Weighted Citation Impact)
75
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

MFFNet: Multimodal feature fusion network for RGB-D transparent object detection

Li ZhuTuanjie LiYuming NingYan Zhang

Journal:   International Journal of Advanced Robotic Systems Year: 2024 Vol: 21 (5)
JOURNAL ARTICLE

Lightweight multi-level feature difference fusion network for RGB-D-T salient object detection

Kechen SongHan WangYing ZhaoLiming HuangHongwen DongYunhui Yan

Journal:   Journal of King Saud University - Computer and Information Sciences Year: 2023 Vol: 35 (8)Pages: 101702-101702
© 2026 ScienceGate Book Chapters — All rights reserved.