JOURNAL ARTICLE

Lightweight multi-level feature difference fusion network for RGB-D-T salient object detection

Kechen SongHan WangYing ZhaoLiming HuangHongwen DongYunhui Yan

Year: 2023 Journal:   Journal of King Saud University - Computer and Information Sciences Vol: 35 (8)Pages: 101702-101702   Publisher: Elsevier BV

Abstract

In recent years, bimodal salient object detection has developed rapidly. In view of the advanced performance of their robustness to extreme situations such as background similarity and illumination variation, researchers began to focus on RGB-Depth-Thermal salient object detection (RGB-D-T SOD). However, most existing bimodal methods usually need expensive computational costs to complete accurate prediction, and this situation is even more serious for three-modal methods, which undoubtedly limits their applicability. To solve this problem, we are the first to propose a lightweight multi-level feature difference fusion network (MFDF) for real-time RGB-D-T SOD. In view of the depth modality contains less useful information, we design an asymmetric three-stream encoder based on MobileNetV2. Due to the differences in semantics and details between high and low level features, using the same module without discrimination will lead to a large number of redundant parameters. On the contrary, in the coding stage, we introduce a cross-modal enhancement module (CME) and a cross-modal fusion module (CMF) to fuse low-level and high-level features respectively. In order to reduce redundant parameters, we design a low-level feature decoding module (LFD) and a multi-scale high-level feature fusion module (MHFF). A great deal of experiments proves that the proposed MFDF has more advantages than the 17 state-of-the-art methods. On the efficiency side, MFDF has a faster speed (124 FPS when the image size is 320 × 320) and much fewer parameters (8.9 M).

Keywords:
Computer science RGB color model Robustness (evolution) Artificial intelligence Encoder Feature (linguistics) Computer vision Object detection Fuse (electrical) Pattern recognition (psychology) Engineering

Metrics

21
Cited By
3.82
FWCI (Field Weighted Citation Impact)
87
Refs
0.92
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Gaze Tracking and Assistive Technology
Physical Sciences →  Computer Science →  Human-Computer Interaction
Face Recognition and Perception
Life Sciences →  Neuroscience →  Cognitive Neuroscience

Related Documents

JOURNAL ARTICLE

Middle-Level Feature Fusion for Lightweight RGB-D Salient Object Detection

Nianchang HuangQiang JiaoQiang ZhangJungong Han

Journal:   IEEE Transactions on Image Processing Year: 2022 Vol: 31 Pages: 6621-6634
JOURNAL ARTICLE

Transformer-based difference fusion network for RGB-D salient object detection

Zhiqiang CuiFeng WangZhengyong Feng

Journal:   Journal of Electronic Imaging Year: 2022 Vol: 31 (06)
JOURNAL ARTICLE

Edge-guided feature fusion network for RGB-T salient object detection

Yuanlin ChenZhenan SunCheng YanMing Zhao

Journal:   Frontiers in Neurorobotics Year: 2024 Vol: 18 Pages: 1489658-1489658
© 2026 ScienceGate Book Chapters — All rights reserved.