JOURNAL ARTICLE

Recurrent MVSNet for High-Resolution Multi-View Stereo Depth Inference

Abstract

Deep learning has recently demonstrated its excellent performance for multi-view stereo (MVS). However, one major limitation of current learned MVS approaches is the scalability: The memory-consuming cost volume regularization makes the learned MVS hard to be applied to high-resolution scenes. In this paper, we introduce a scalable multi-view stereo framework based on the recurrent neural network. Instead of regularizing the entire 3D cost volume in one go, the proposed Recurrent Multi-view Stereo Network (R-MVSNet) sequentially regularizes the 2D cost maps along the depth direction via the gated recurrent unit (GRU). This reduces dramatically the memory consumption and makes high-resolution reconstruction feasible. We first show the state-of-the-art performance achieved by the proposed R-MVSNet on the recent MVS benchmarks. Then, we further demonstrate the scalability of the proposed method on several large-scale scenarios, where previous learned approaches often fail due to the memory constraint. Code is available at https://github.com/YoYo000/MVSNet. © 2019 IEEE.

Keywords:
Computer science Scalability Inference Regularization (linguistics) Artificial intelligence Recurrent neural network Constraint (computer-aided design) Deep learning Code (set theory) Volume (thermodynamics) Deep neural networks Artificial neural network

Metrics

635
Cited By
26.51
FWCI (Field Weighted Citation Impact)
45
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Vision and Imaging
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Optical measurement and interference techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

BOOK-CHAPTER

MVSNet: Depth Inference for Unstructured Multi-view Stereo

Yao YaoZixin LuoShiwei LiTian FangLong Quan

Lecture notes in computer science Year: 2018 Pages: 785-801
JOURNAL ARTICLE

DRI-MVSNet: A depth residual inference network for multi-view stereo images

Ying LiWenyue LiZhijie ZhaoJiahao Fan

Journal:   PLoS ONE Year: 2022 Vol: 17 (3)Pages: e0264721-e0264721
BOOK-CHAPTER

PSP-MVSNet: Deep Patch-Based Similarity Perceptual for Multi-view Stereo Depth Inference

Leiping JieHui Zhang

Lecture notes in computer science Year: 2022 Pages: 316-328
JOURNAL ARTICLE

DS-MVSNet: Unsupervised Multi-view Stereo via Depth Synthesis

Jingliang LiZhengda LuYiqun WangYing WangJun Xiao

Journal:   Proceedings of the 30th ACM International Conference on Multimedia Year: 2022 Pages: 5593-5601
© 2026 ScienceGate Book Chapters — All rights reserved.