JOURNAL ARTICLE

Monocular Depth Estimation with Multiscale Feature Fusion Networks

Abstract

Monocular image depth estimation has some problems, such as fuzzy depth estimation, inaccurate distance information and incomplete details in complex scenes. Aiming at these problems, a monocular depth estimation method based on pyramid vision transformer network optimization is proposed. An encoder with a pyramid transformer as the skeleton network is used to segment the image and obtain the position information between each pixel block, while a lightweight decoder is used and feature fusion is improved. Experiments on the dataset demonstrate that the proposed network can enhance the edge details and improve the accuracy of depth estimation.

Keywords:
Artificial intelligence Computer vision Monocular Computer science Pyramid (geometry) Encoder Feature (linguistics) Pixel Transformer Pattern recognition (psychology) Mathematics Engineering

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
6
Refs
0.17
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology
Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Monocular Accumulated Depth Estimation with Recursive Feature Fusion

Qiuchen WangHui ShuaiQingshan Liu

Journal:   Journal of Computer-Aided Design & Computer Graphics Year: 2022 Vol: 34 (10)Pages: 1533-1541
JOURNAL ARTICLE

Monocular Depth Estimation With Multi-Scale Feature Fusion

Xianfa XuZhe ChenFuliang Yin

Journal:   IEEE Signal Processing Letters Year: 2021 Vol: 28 Pages: 678-682
© 2026 ScienceGate Book Chapters — All rights reserved.