Wavelet-Driven Multi-Band Feature Fusion for RGB-T Salient Object Detection

Jianxun Zhao; Xin Wen; Yu He; Xiaowei Yang; Kechen Song

doi:10.3390/s24248159

ScienceGate Book Chapters

JOURNAL ARTICLE

Wavelet-Driven Multi-Band Feature Fusion for RGB-T Salient Object Detection

Jianxun Zhao Xin Wen Yu He Xiaowei Yang Kechen Song

Year: 2024 Journal: Sensors Vol: 24 (24)Pages: 8159-8159 Publisher: Multidisciplinary Digital Publishing Institute

DOI: 10.3390/s24248159

Get Full-Text PDF Get Analytical Report

Abstract

RGB-T salient object detection (SOD) has received considerable attention in the field of computer vision. Although existing methods have achieved notable detection performance in certain scenarios, challenges remain. Many methods fail to fully utilize high-frequency and low-frequency features during information interaction among different scale features, limiting detection performance. To address this issue, we propose a method for RGB-T salient object detection that enhances performance through wavelet transform and channel-wise attention fusion. Through feature differentiation, we effectively extract spatial characteristics of the target, enhancing the detection capability for global context and fine-grained details. First, input features are passed through the channel-wise criss-cross module (CCM) for cross-modal information fusion, adaptively adjusting the importance of features to generate rich fusion information. Subsequently, the multi-scale fusion information is input into the feature selection wavelet transforme module (FSW), which selects beneficial low-frequency and high-frequency features to improve feature aggregation performance and achieves higher segmentation accuracy through long-distance connections. Extensive experiments demonstrate that our method outperforms 22 state-of-the-art methods.

Keywords:

Artificial intelligence Computer science RGB color model Pattern recognition (psychology) Wavelet Feature (linguistics) Computer vision Wavelet transform Image fusion Object detection Segmentation Context (archaeology) Channel (broadcasting) Feature selection Image (mathematics) Telecommunications

Metrics

Cited By

1.06

FWCI (Field Weighted Citation Impact)

Refs

0.71

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Face Recognition and Perception

Life Sciences → Neuroscience → Cognitive Neuroscience

Gaze Tracking and Assistive Technology

Physical Sciences → Computer Science → Human-Computer Interaction

Wavelet-Driven Multi-Band Feature Fusion for RGB-T Salient Object Detection

Abstract

Metrics

Citation History

Topics

Related Documents

Revisiting Feature Fusion for RGB-T Salient Object Detection

Multi-Modal Salient Feature Enhance for Rgb-T Salient Object Detection

Feature Enhancement and Fusion for RGB-T Salient Object Detection

Edge-guided feature fusion network for RGB-T salient object detection

Lightweight multi-level feature difference fusion network for RGB-D-T salient object detection