Xu Yi, Ruichao Hou, Ziheng Qi, Tongwei Ren
ABSTRACT

RGB and thermal salient object detection (RGB-T SOD) aims to accurately locate and segment salient objects in aligned visible and thermal image pairs. However, existing methods often struggle to produce complete masks and sharp boundaries in challenging scenarios because they insufficiently exploit the complementary features of the two modalities. In this paper, we propose a novel Mamba-based fusion network for the RGB-T SOD task, named Mamba4SOD, which integrates the strengths of the Swin Transformer and Mamba to construct robust multi-modal representations, effectively reducing pixel misclassification. Specifically, we leverage Swin Transformer V2 to establish long-range contextual dependencies and thoroughly analyse the impact of features at various levels on detection performance. Additionally, we develop a novel Mamba-based fusion module with linear complexity that boosts multi-modal enhancement and fusion. Experimental results on the VT5000, VT1000 and VT821 datasets demonstrate that our method outperforms state-of-the-art RGB-T SOD methods.
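The abstract describes fusing complementary RGB and thermal features with linear complexity. The paper's actual Mamba-based fusion module is not specified here; as a generic illustration only, the sketch below shows a simple cross-modal gated fusion in NumPy, where each modality is re-weighted by a sigmoid gate computed from the other modality before summation. The function name and gating scheme are assumptions for illustration, not the authors' method.

```python
import numpy as np

def gated_fusion(f_rgb, f_t):
    """Toy cross-modal gated fusion (illustrative, not Mamba4SOD's module).

    Each modality's features are re-weighted by a gate derived from the
    other modality, then summed. The cost is linear in the number of
    feature elements, loosely echoing the linear-complexity goal stated
    in the abstract.
    """
    gate_rgb = 1.0 / (1.0 + np.exp(-f_t))    # gate RGB features using thermal cues
    gate_t = 1.0 / (1.0 + np.exp(-f_rgb))    # gate thermal features using RGB cues
    return gate_rgb * f_rgb + gate_t * f_t   # fused multi-modal representation

# usage: fuse two 4x4 single-channel feature maps
f_rgb = np.ones((4, 4))
f_t = np.zeros((4, 4))
fused = gated_fusion(f_rgb, f_t)
print(fused.shape)  # (4, 4)
```

In real RGB-T SOD networks this per-pixel fusion would operate on multi-channel feature maps from the two encoder branches at each level; the gating here stands in for whatever learned enhancement the fusion module performs.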