JOURNAL ARTICLE

Monocular Depth Estimation With Multi-Scale Feature Fusion

Xianfa Xu, Zhe Chen, Fuliang Yin

Year: 2021
Journal: IEEE Signal Processing Letters
Vol: 28
Pages: 678-682
Publisher: Institute of Electrical and Electronics Engineers

Abstract

Depth estimation from a single image is a crucial but challenging task for reconstructing 3D structures and inferring scene geometry. However, most existing methods fail to extract fine detail and to estimate distant, small-scale objects well. In this paper, we propose a monocular depth estimation method based on multi-scale feature fusion. Specifically, to obtain input features at different scales, we first feed input images at different scales to pre-trained residual networks with shared weights. Then, an attention mechanism is used to learn the salient features at each scale, integrating detailed information from large-scale feature maps with scene information from small-scale feature maps. Furthermore, inspired by the dense atrous spatial pyramid pooling used in semantic segmentation, we build a multi-scale feature fusion dense pyramid to further improve feature extraction. Finally, a scale-invariant error loss is used to predict depth maps in log space. We evaluate our method on several public benchmark datasets (including NYU Depth V2 and KITTI). The experimental results show that the proposed method outperforms existing methods and achieves state-of-the-art results.
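The abstract names a scale-invariant error loss in log space but does not give its formula. The sketch below assumes the widely used form introduced by Eigen et al. (2014), L = (1/n) Σ d_i² − (λ/n²)(Σ d_i)² with d_i = log(pred_i) − log(target_i). The function name, the flat-list input, and the default `lam=0.5` are illustrative assumptions, not details taken from the paper.

```python
import math


def scale_invariant_log_loss(pred, target, lam=0.5):
    """Scale-invariant error in log space (assumed Eigen et al. 2014 form).

    pred, target: equal-length sequences of strictly positive depths.
    lam: weight of the scale-invariance term. Hypothetical default; the
         value used in the paper is not stated in the abstract.
    """
    if len(pred) != len(target) or len(pred) == 0:
        raise ValueError("pred and target must be non-empty and equal length")
    if any(p <= 0 for p in pred) or any(t <= 0 for t in target):
        raise ValueError("depths must be strictly positive to take logs")

    # Per-pixel difference between predicted and true depth in log space.
    d = [math.log(p) - math.log(t) for p, t in zip(pred, target)]
    n = len(d)

    mean_sq = sum(x * x for x in d) / n   # (1/n) * sum(d_i^2)
    sq_mean = (sum(d) / n) ** 2           # (1/n^2) * (sum(d_i))^2
    return mean_sq - lam * sq_mean
```

With `lam=1` the loss ignores any global scale factor: a prediction that is exactly twice the ground truth everywhere scores (numerically) zero. With `lam=0` it reduces to the mean squared error of the log depths.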

Keywords:
Artificial intelligence; Computer science; Pyramid (geometry); Feature extraction; Pattern recognition (psychology); Feature (linguistics); Scale (ratio); Monocular; Salient; Computer vision; Scale space; Segmentation; Benchmark (surveying); Depth map; Pooling; Robustness (evolution); Image (mathematics); Mathematics; Image processing; Geography

Metrics

Cited By: 28
FWCI (Field Weighted Citation Impact): 2.25
Refs: 38
Citation Normalized Percentile: 0.89

Topics

Advanced Vision and Imaging
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences → Engineering → Media Technology
Advanced Image Processing Techniques
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition