ATLAS-MVSNet: Attention Layers for Feature Extraction and Cost Volume Regularization in Multi-View Stereo

Rafael Weilharter; Friedrich Fraundorfer

doi:10.1109/icpr56361.2022.9956633

ScienceGate Book Chapters

JOURNAL ARTICLE

ATLAS-MVSNet: Attention Layers for Feature Extraction and Cost Volume Regularization in Multi-View Stereo

Rafael Weilharter Friedrich Fraundorfer

Year: 2022 Journal: 2022 26th International Conference on Pattern Recognition (ICPR) Pages: 3557-3563

DOI: 10.1109/icpr56361.2022.9956633

Get Full-Text PDF Get Analytical Report

Abstract

We present ATLAS-MVSNet, an end-to-end deep learning architecture relying on local attention layers for depth map inference from multi-view images. Distinct from existing works, we introduce a novel module design for neural networks, which we termed hybrid attention block, that utilizes the latest insights into attention in vision models. We are able to reap the benefits of attention in both, the carefully designed multi-stage feature extraction network and the cost volume regularization network. Our new approach displays significant improvement over its counterpart based purely on convolutions. While many state-of-the-art methods need multiple high-end GPUs in the training phase, we are able to train our network on a single consumer grade GPU. ATLAS-MVSNet exhibits excellent performance, especially in terms of accuracy, on the DTU dataset. \nFurthermore, ATLAS-MVSNet ranks amongst the top published methods on the online Tanks and Temples benchmark.

Keywords:

Atlas (anatomy) Computer science Inference Regularization (linguistics) Artificial intelligence Deep learning Benchmark (surveying) Feature extraction Deep neural networks Architecture Network architecture Artificial neural network Machine learning Pattern recognition (psychology) Cartography

Metrics

Cited By

0.62

FWCI (Field Weighted Citation Impact)

Refs

0.76

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Vision and Imaging

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Robotics and Sensor-Based Localization

Physical Sciences → Engineering → Aerospace Engineering

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

ATLAS-MVSNet: Attention Layers for Feature Extraction and Cost Volume Regularization in Multi-View Stereo

Abstract

Metrics

Citation History

Topics

Related Documents

DSC-MVSNet: attention aware cost volume regularization based on depthwise separable convolution for multi-view stereo

SA-MVSNet: Spatial-aware Multi-view Stereo Network with Attention Cost Volume

DCV-MVSNet: Dynamic cost volume for complete multi-view stereo

EFD-MVSNet: Enhanced Feature Distinctiveness for Multi-View Stereo

Attention-enhanced multi-source cost volume multi-view stereo