JOURNAL ARTICLE

S $^3$ Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection

Lei ZhuXiaoqiang WangPing LiXin YangQing ZhangWeiming WangCarola‐Bibiane SchönliebC. L. Philip Chen

Year: 2021 Journal:   IEEE Transactions on Multimedia Vol: 25 Pages: 676-689   Publisher: Institute of Electrical and Electronics Engineers

Abstract

RGB-D salient object detection aims to detect visually distinctive objects or regions from a pair of the RGB image and the depth image. State-of-the-art RGB-D saliency detectors are mainly based on convolutional neural networks but almost suffer from an intrinsic limitation relying on the labeled data, thus degrading detection accuracy in complex cases. In this work, we present a self-supervised self-ensembling network (S3 Net) for semi-supervised RGB-D salient object detection by leveraging the unlabeled data and exploring a self-supervised learning mechanism. To be specific, we first build a self-guided convolutional neural network (SG-CNN) as a baseline model by developing a series of three-layer cross-model feature fusion (TCF) modules to leverage complementary information among depth and RGB modalities and formulating an auxiliary task that predicts a self-supervised image rotation angle. After that, to further explore the knowledge from unlabeled data, we assign SG-CNN to a student network and a teacher network, and encourage the saliency predictions and self-supervised rotation predictions from these two networks to be consistent on the unlabeled data. Experimental results on seven widely-used benchmark datasets demonstrate that our network quantitatively and qualitatively outperforms the state-of-the-art methods.

Keywords:
Artificial intelligence Computer science RGB color model Convolutional neural network Pattern recognition (psychology) Leverage (statistics) Benchmark (surveying) Supervised learning Object detection Artificial neural network Machine learning

Metrics

21
Cited By
1.53
FWCI (Field Weighted Citation Impact)
79
Refs
0.84
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Face Recognition and Perception
Life Sciences →  Neuroscience →  Cognitive Neuroscience

Related Documents

JOURNAL ARTICLE

Self-Supervised Pretraining for RGB-D Salient Object Detection

Xiaoqi ZhaoYouwei PangLihe ZhangHuchuan LuXiang Ruan

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2022 Vol: 36 (3)Pages: 3463-3471
JOURNAL ARTICLE

Temporal Self-Ensembling Teacher for Semi-Supervised Object Detection

Cong ChenShouyang DongYe TianKunlin CaoLi LiuYuanhao Guo

Journal:   IEEE Transactions on Multimedia Year: 2021 Vol: 24 Pages: 3679-3692
© 2026 ScienceGate Book Chapters — All rights reserved.