JOURNAL ARTICLE

Monocular Image Depth Estimation Using a Conditional Generative Adversarial Net

Abstract

Depth estimation plays an essential part in understanding the three-dimensional (3D) geometric relations of a scene. Compared with other methods such as binocular vision, estimating depth from monocular image is much more challenging. In this paper, we propose a conditional generative adversarial net (cGAN) to tackle the problem of monocular image depth estimation. For enhancing the learning of our net in the training phrase, cycle consistency is applied to our network to form a closed loop. We use the network to model the mapping between the RGB images domain and the depth images domain. After training the network adequately, the model can output depth image according to the input RGB image. Experiments on NYU Depth v2 dataset demonstrate the proposed method outperforms state-of-art depth estimation approaches.

Keywords:
Monocular Artificial intelligence Computer science Computer vision Consistency (knowledge bases) RGB color model Image (mathematics) Domain (mathematical analysis) Mathematics

Metrics

2
Cited By
0.14
FWCI (Field Weighted Citation Impact)
31
Refs
0.46
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology
Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.