Monocular depth estimation is a fundamental technique for robots to perceive unseen real-world scenes. Supervised methods rely on large-scale datasets with ground-truth (GT) depth labels and therefore generalize poorly to other scenes. A dominant solution is to train the model directly on target scenes in a self-supervised way with pseudo depth labels (e.g., generated by stereo matching). However, pseudo depth labels are often unreliable, especially near object boundaries; such noise may disturb training and consequently degrade depth quality at inference. In this paper, we investigate the structure similarity between RGB and depth based on Gaussian kernels, since the structure of the RGB image is always reliable. This RGB-Depth structure similarity measurement is then used to improve self-supervised depth estimation in two aspects: it first measures the confidence of pseudo depth labels and filters out unreliable pixels, and it then serves as a loss term that constrains the structure of the predicted depth maps. Experiments on the KITTI Eigen split verify that the proposed method achieves better or comparable quantitative results, and consistently better visual results with clear depth boundaries, compared with five recent baselines.
Chenggong Han, Deqiang Cheng, Qiqi Kou, Xiaoyi Wang, Liangliang Chen, Jiamin Zhao