JOURNAL ARTICLE

Attention-Based Dense Decoding Network for Monocular Depth Estimation

Jianrong WangGe ZhangMei YuTianyi XuTao Luo

Year: 2020 Journal:   IEEE Access Vol: 8 Pages: 85802-85812   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Depth estimation is a classic computer vision task and provides rich representation of objects and environment. In recent years, the performance of end-to-end depth estimation has been significantly improved. However, the stack of convolutions and pooling operations result in losing local detail spatial information, which is extremely important to monocular depth estimation. In order to overcome this problem, in this work, we propose an encoder-decoder framework with skip connections. Based on the self-attention mechanism, we apply the channel-spatial attention module as a transition layer, which captures the depth and spatial positional relationship and improves the presentation ability of channel and space. Then we propose a dense decoding module to make full use of the attention features of different scale ranges in the decoding process. It achieves a more massive and denser receptive field while obtaining multi-scale information. Finally, a novel distance-aware loss is introduced to predict more meticulous edges and local details in the distance. Experiments demonstrate that the proposed method outperforms the state-of-the-art on KITTI and NYU Depth V2 datasets.

Keywords:
Computer science Decoding methods Encoder Monocular Artificial intelligence Channel (broadcasting) Pooling Computer vision Process (computing) Task (project management) Representation (politics) Pattern recognition (psychology) Algorithm

Metrics

5
Cited By
0.42
FWCI (Field Weighted Citation Impact)
63
Refs
0.61
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology
Image Enhancement Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

DAttNet: monocular depth estimation network based on attention mechanisms

Armando AstudilloAlejandro BarreraCarlos GuindelAbdulla Al-KaffFernando García

Journal:   Neural Computing and Applications Year: 2023 Vol: 36 (7)Pages: 3347-3356
JOURNAL ARTICLE

Attention-based context aggregation network for monocular depth estimation

Yuru ChenHaitao ZhaoZhengwei HuJingchao Peng

Journal:   International Journal of Machine Learning and Cybernetics Year: 2021 Vol: 12 (6)Pages: 1583-1596
JOURNAL ARTICLE

Dual-Stream Multiscale Attention Monocular Depth Estimation Network

Ying ZouZhe ChenFuliang Yin

Journal:   IEEE Internet of Things Journal Year: 2025 Vol: 12 (13)Pages: 23073-23084
JOURNAL ARTICLE

Patch-Wise Attention Network for Monocular Depth Estimation

Sihaeng LeeJanghyeon LeeByungju KimEojindl YiJunmo Kim

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2021 Vol: 35 (3)Pages: 1873-1881
© 2026 ScienceGate Book Chapters — All rights reserved.