Abstract

In this paper, we propose a Bidirectional Attention Network (BANet), an end-to-end framework for monocular depth estimation (MDE) that addresses the limitation of effectively integrating local and global information in convolutional neural networks. The structure of this mechanism derives from a strong conceptual foundation of neural machine translation, and presents a light-weight mechanism for adaptive control of computation similar to the dynamic nature of recurrent neural networks. We introduce bidirectional attention modules that utilize the feed-forward feature maps and incorporate the global context to filter out ambiguity. Extensive experiments reveal the high degree of capability of this bidirectional attention model over feed-forward baselines and other state-of-the-art methods for monocular depth estimation on two challenging datasets - KITTI and DIODE. We show that our proposed approach either outperforms or performs at least on a par with the state-of-the-art monocular depth estimation methods with less memory and computational complexity.

Keywords:
Monocular Computer science Context (archaeology) Artificial intelligence Ambiguity Artificial neural network Feature (linguistics) Convolutional neural network Filter (signal processing) Computer vision

Metrics

67
Cited By
5.83
FWCI (Field Weighted Citation Impact)
55
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology
Optical measurement and interference techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Dual-Stream Multiscale Attention Monocular Depth Estimation Network

Ying ZouZhe ChenFuliang Yin

Journal:   IEEE Internet of Things Journal Year: 2025 Vol: 12 (13)Pages: 23073-23084
JOURNAL ARTICLE

Patch-Wise Attention Network for Monocular Depth Estimation

Sihaeng LeeJanghyeon LeeByungju KimEojindl YiJunmo Kim

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2021 Vol: 35 (3)Pages: 1873-1881
JOURNAL ARTICLE

DAttNet: monocular depth estimation network based on attention mechanisms

Armando AstudilloAlejandro BarreraCarlos GuindelAbdulla Al-KaffFernando García

Journal:   Neural Computing and Applications Year: 2023 Vol: 36 (7)Pages: 3347-3356
JOURNAL ARTICLE

Attention-based context aggregation network for monocular depth estimation

Yuru ChenHaitao ZhaoZhengwei HuJingchao Peng

Journal:   International Journal of Machine Learning and Cybernetics Year: 2021 Vol: 12 (6)Pages: 1583-1596
© 2026 ScienceGate Book Chapters — All rights reserved.