JOURNAL ARTICLE

Depth-Relative Self Attention for Monocular Depth Estimation

Abstract

Monocular depth estimation is very challenging because clues to the exact depth are incomplete in a single RGB image. To overcome the limitation, deep neural networks rely on various visual hints such as size, shade, and texture extracted from RGB information. However, we observe that if such hints are overly exploited, the network can be biased on RGB information without considering the comprehensive view. We propose a novel depth estimation model named RElative Depth Transformer (RED-T) that uses relative depth as guidance in self-attention. Specifically, the model assigns high attention weights to pixels of close depth and low attention weights to pixels of distant depth. As a result, the features of similar depth can become more likely to each other and thus less prone to misused visual hints. We show that the proposed model achieves competitive results in monocular depth estimation benchmarks and is less biased to RGB information. In addition, we propose a novel monocular depth estimation benchmark that limits the observable depth range during training in order to evaluate the robustness of the model for unseen depths.

Keywords:
Monocular Artificial intelligence RGB color model Computer science Robustness (evolution) Pixel Computer vision Benchmark (surveying) Geology

Metrics

5
Cited By
0.91
FWCI (Field Weighted Citation Impact)
39
Refs
0.71
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology
Optical measurement and interference techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

LAM-Depth: Laplace-Attention Module-Based Self-Supervised Monocular Depth Estimation

Jiansheng WeiShuguo PanWang GaoPeng Guo

Journal:   IEEE Transactions on Intelligent Transportation Systems Year: 2024 Vol: 25 (10)Pages: 13706-13716
JOURNAL ARTICLE

Visual Attention-Based Self-Supervised Monocular Depth Estimation

Deqiang ChengShuai XuChenggong HanChen LyuQiqi KouJianying Zhang

Journal:   Journal of Computer-Aided Design & Computer Graphics Year: 2024 Vol: 36 (12)Pages: 1920-1931
JOURNAL ARTICLE

Self-supervised monocular depth estimation with coordinate attention

yuhong chenHongfei YuLaide GuoYang Cao

Journal:   Third International Conference on Computer Vision and Data Mining (ICCVDM 2022) Year: 2023 Vol: 30 Pages: 146-146
© 2026 ScienceGate Book Chapters — All rights reserved.