Cost Volume Pyramid Based Depth Inference for Multi-View Stereo

Jiayu Yang; Wei Mao; José M. Alvarez; Miaomiao Liu

doi:10.1109/tpami.2021.3082562

JOURNAL ARTICLE

Year: 2021; Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence; Vol: 44 (9); Pages: 1-1; Publisher: IEEE Computer Society

Abstract

We propose a cost volume-based neural network for depth inference from multi-view images. We demonstrate that building a cost volume pyramid in a coarse-to-fine manner, instead of constructing a cost volume at a fixed resolution, leads to a compact, lightweight network and allows us to infer high-resolution depth maps and achieve better reconstruction results. To this end, we first build a cost volume based on uniform sampling of fronto-parallel planes across the entire depth range at the coarsest resolution of an image. Then, given the current depth estimate, we construct new cost volumes iteratively to perform depth map refinement. We show that working on a cost volume pyramid can lead to a more compact yet efficient network structure compared with existing works. We further show that the (residual) depth sampling can be fully determined by analytical geometric derivation, which serves as a principle for building a compact cost volume pyramid. To demonstrate the effectiveness of our proposed framework, we extend our cost volume pyramid structure to handle the unsupervised depth inference scenario. Experimental results on benchmark datasets show that our model runs 6x faster than state-of-the-art methods with similar performance in the supervised scenario and demonstrates superior performance in the unsupervised scenario. Code is available at https://github.com/JiayuYANG/CVP-MVSNet.
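The coarse-to-fine sampling scheme described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (see the linked repository for that); it only shows the two sampling stages under stated assumptions. All function names, the `cost_fn` callback, the nearest-neighbour upsampling, the winner-take-all selection, and the fixed `interval` parameter are hypothetical choices made for illustration. In particular, the paper derives the residual sampling interval analytically from geometry, whereas here it is simply passed in.

```python
import numpy as np

def uniform_depth_hypotheses(d_min, d_max, num_planes):
    """Depths of fronto-parallel planes sampled uniformly over [d_min, d_max]."""
    return np.linspace(d_min, d_max, num_planes)

def upsample_nearest(depth, factor=2):
    """Nearest-neighbour upsampling of an (H, W) depth map (illustrative stand-in)."""
    return np.repeat(np.repeat(depth, factor, axis=0), factor, axis=1)

def residual_depth_hypotheses(depth, num_residuals, interval):
    """Per-pixel hypotheses centred on the current estimate; shape (num_residuals, H, W)."""
    offsets = (np.arange(num_residuals) - (num_residuals - 1) / 2.0) * interval
    return depth[None, :, :] + offsets[:, None, None]

def pick_depth(hypotheses, cost):
    """Winner-take-all: per pixel, keep the hypothesis with the lowest cost.
    (Assumption: stands in for whatever learned inference a real network uses.)"""
    idx = np.argmin(cost, axis=0)
    return np.take_along_axis(hypotheses, idx[None], axis=0)[0]

def coarse_to_fine(cost_fn, coarse_shape, num_levels, d_min, d_max,
                   num_planes, num_residuals, interval):
    """cost_fn(hypotheses, level) -> cost volume with the same shape as hypotheses.
    cost_fn is a hypothetical callback replacing the multi-view matching cost."""
    # Level 0: uniform sampling across the entire depth range at the coarsest resolution.
    planes = uniform_depth_hypotheses(d_min, d_max, num_planes)
    hyp = np.broadcast_to(planes[:, None, None], (num_planes,) + tuple(coarse_shape))
    depth = pick_depth(hyp, cost_fn(hyp, 0))
    # Finer levels: upsample the estimate and search only a narrow residual range around it.
    for level in range(1, num_levels):
        depth = upsample_nearest(depth)
        hyp = residual_depth_hypotheses(depth, num_residuals, interval)
        depth = pick_depth(hyp, cost_fn(hyp, level))
    return depth
```

A toy usage, with an oracle cost that measures distance to a known depth purely so the example runs end to end:

```python
truth = [np.full((2, 2), 3.2), np.full((4, 4), 3.2)]   # coarse and fine ground truth
oracle = lambda hyp, level: np.abs(hyp - truth[level])  # toy cost, not a real matching cost
depth = coarse_to_fine(oracle, (2, 2), 2, 1.0, 5.0, 5, 5, 0.1)
```

The point of the design is that only the coarsest level pays for a search over the full depth range; each finer level evaluates a handful of residual hypotheses per pixel, which is what keeps the cost volumes small.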

Keywords:

Artificial intelligence; Pyramid (geometry); Computer science; Inference; Volume (thermodynamics); Computer vision; Pattern recognition (psychology); Mathematics; Geometry

Metrics

FWCI (Field Weighted Citation Impact): 3.78

Citation Normalized Percentile: 0.94

Topics

Advanced Vision and Imaging

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Optical measurement and interference techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Computer Graphics and Visualization Techniques

Physical Sciences → Computer Science → Computer Graphics and Computer-Aided Design

Related Documents

Cost Volume Pyramid Based Depth Inference for Multi-View Stereo

Multi-metric Depth Pyramid and Bilateral Segmentation Based Depth Inference for Multi-View Stereo

Recurrent Multi-view Stereo Depth Inference with Pyramid of Images

ICV-Net: An identity cost volume network for multi-view stereo depth inference

Attention aware cost volume pyramid based multi-view stereo network for 3D reconstruction