JOURNAL ARTICLE

Self-Supervised Monocular Depth Estimation by Direction-aware Cumulative Convolution Network

Abstract

Monocular depth estimation is known as an ill-posed task in which objects in a 2D image usually do not contain sufficient information to predict their depth. Thus, it acts differently from other tasks (e.g., classification and segmentation) in many ways. In this paper, we find that self-supervised monocular depth estimation shows a direction sensitivity and environmental dependency in the feature representation. But the current backbones borrowed from other tasks pay less attention to handling different types of environmental information, limiting the overall depth accuracy. To bridge this gap, we propose a new Direction-aware Cumulative Convolution Network (DaCCN), which improves the depth feature representation in two aspects. First, we propose a direction-aware module, which can learn to adjust the feature extraction in each direction, facilitating the encoding of different types of information. Secondly, we design a new cumulative convolution to improve the efficiency for aggregating important environmental information. Experiments show that our method achieves significant improvements on three widely used benchmarks, KITTI, Cityscapes, and Make3D, setting a new state-of-the-art performance on the popular benchmarks with all three types of self-supervision. https://github.com/wencheng256/DaCCN.

Keywords:
Computer science Convolution (computer science) Monocular Artificial intelligence Feature (linguistics) Segmentation Feature extraction Representation (politics) Task (project management) Computer vision Pattern recognition (psychology) Convolutional neural network Artificial neural network Engineering

Metrics

23
Cited By
4.19
FWCI (Field Weighted Citation Impact)
61
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology
Image Enhancement Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Frequency-Aware Self-Supervised Monocular Depth Estimation

Xingyu ChenThomas H. LiRuonan ZhangGe Li

Journal:   2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Year: 2023 Pages: 5797-5806
JOURNAL ARTICLE

Self‐supervised monocular depth estimation via asymmetric convolution block

Lingling HuHao ZhangZhuping WangChao HuangChangzhu Zhang

Journal:   IET Cyber-Systems and Robotics Year: 2022 Vol: 4 (2)Pages: 131-138
© 2026 ScienceGate Book Chapters — All rights reserved.