JOURNAL ARTICLE

Lightweight monocular absolute depth estimation based on attention mechanism

Jiayu JinBo TaoXinbo QianJiaxin HuGongfa Li

Year: 2024 Journal:   Journal of Electronic Imaging Vol: 33 (02)   Publisher: SPIE

Abstract

To solve the problem of obtaining a higher accuracy at the expense of redundant models, we propose a network architecture. We utilize a lightweight network that retains the high-precision advantage of the transformer and effectively combines it with convolutional neural network. By greatly reducing the training parameters, this approach achieves high precision, making it well suited for deployment on edge devices. A detail highlight module (DHM) is added to effectively fuse information from multiple scales, making the depth of prediction more accurate and clearer. A dense geometric constraints module is introduced to recover accurate scale factors in autonomous driving without additional sensors. Experimental results demonstrate that our model improves the accuracy from 98.1% to 98.3% compared with Monodepth2, and the model parameters are reduced by about 80%.

Keywords:
Computer science Artificial intelligence Monocular Computer vision Absolute (philosophy)

Metrics

1
Cited By
0.61
FWCI (Field Weighted Citation Impact)
54
Refs
0.59
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology
Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Enhancement Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.