JOURNAL ARTICLE

RGB-Depth Structure Similarity for Self-supervised Monocular Depth Estimation

Abstract

Monocular depth estimation is a fundamental technique for robots to perceive the real (unseen) scene. Supervised methods rely on large-scale datasets with groundtruth (GT) depth labels, which cannot be well generalized to other scenes. A dominant solution is to directly train the model on target scenes in self-supervised way with pseudo depth labels (e.g. generated by stereo matching). However, pseudo depth labels are often unreliable especially near object boundaries. It may disturb the training of the model and consequently decrease the depth quality in the inference. In this paper, we investigate the structure similarity of RGB-Depth based on Gaussian kernels, because the structure of RGB image is always reliable. Such RGB-Depth structure similarity measurement is then used to improve the self-supervised depth estimation in two aspects. It is first utilized to measure the confidence of pseudo depth labels and filter unreliable pixels. It is then utilized to limit the structure of predicted depth maps in the loss. Experiments on the KITTI Eigen Splits datasets verify that the proposed method achieves better or comparable quantitative results and always achieves better visual results with clear depth boundaries compared with five recent baselines.

Keywords:
Artificial intelligence RGB color model Computer science Depth map Monocular Computer vision Similarity (geometry) Pattern recognition (psychology) Supervised learning Image (mathematics) Artificial neural network

Metrics

1
Cited By
0.18
FWCI (Field Weighted Citation Impact)
27
Refs
0.43
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology
Optical measurement and interference techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Self-supervised monocular Depth estimation with multi-scale structure similarity loss

Chenggong HanDeqiang ChengQiqi KouXiaoyi WangLiangliang ChenJiamin Zhao

Journal:   Multimedia Tools and Applications Year: 2022 Vol: 82 (24)Pages: 38035-38050
BOOK-CHAPTER

Revisiting Self-supervised Monocular Depth Estimation

Ue-Hwan KimGyeong-Min LeeJong-Hwan Kim

Lecture notes in networks and systems Year: 2022 Pages: 336-350
BOOK-CHAPTER

Self-Distilled Self-Supervised Monocular Depth Estimation

Julio César Díaz MendozaHélio Pedrini

Series on language processing, pattern recognition, and intelligent systems Year: 2024 Pages: 165-185
JOURNAL ARTICLE

HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation

Xiaoyang LyuLiang LiuMengmeng WangXin KongLina LiuYong LiuXinxin ChenYi Yuan

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2021 Vol: 35 (3)Pages: 2294-2301
© 2026 ScienceGate Book Chapters — All rights reserved.