JOURNAL ARTICLE

Large-scale building extraction in very high-resolution aerial imagery using Mask R-CNN

Abstract

Urban areas are hotspots of complex and dynamic alterations of the Earth’s surface. Using deep learning (DL) techniques in remote sensing applications can significantly contribute to document these tremendous changes. Open source building data at a very high level of detail are still scarce or incomplete for many regions, therefore, hindering research and policy to properly provide knowledge on urban structures. In this study, we use a convolutional neural network to extract buildings for the city of Santiago de Chile. We deploy the recently released Mask R-CNN and use a pretrained model (PM) which already has been trained with remote sensing imagery. We fine-tune PM with very high-resolution (VHR) airborne RGB images from our study region and generate the fine-tuned model (FM). To extend the number of training data, we test several data augmentation methods for training purposes and evaluate their performance in context of urban environments. We achieve highest overall accuracy of 92 % by using augmentations and the generated FM. Our findings encourage to use DL methods in the urban context. The presented method can be adapted and applied to other global urban regions, and, help to overcome lacks in open source building data to assess urban environments.

Keywords:
Computer science Convolutional neural network Context (archaeology) Deep learning RGB color model Remote sensing Artificial intelligence Urban planning Scale (ratio) Aerial image Image (mathematics) Cartography Geography Civil engineering

Metrics

31
Cited By
1.71
FWCI (Field Weighted Citation Impact)
20
Refs
0.82
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Remote Sensing and LiDAR Applications
Physical Sciences →  Environmental Science →  Environmental Engineering
Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

DISSERTATION

Automated Building Extraction from Aerial Imagery with Mask R-CNN

Zilong Yang

University:   OPAL (Open@LaTrobe) (La Trobe University) Year: 2020
JOURNAL ARTICLE

Extraction of building footprint using MASK-RCNN for high resolution aerial imagery

Jenila Vincent MP. Varalakshmi

Journal:   Environmental Research Communications Year: 2024 Vol: 6 (7)Pages: 075015-075015
JOURNAL ARTICLE

Benchmarking Vectorized Building Footprint Extraction from Very High Resolution Aerial Imagery

Mehmet BüyükdemircioğluSalim MalekElisa Mariarosaria FarellaSultan KocamanMartin KadaFabio Remondino

Journal:   ˜The œinternational archives of the photogrammetry, remote sensing and spatial information sciences/International archives of the photogrammetry, remote sensing and spatial information sciences Year: 2025 Vol: XLVIII-1/W6-2025 Pages: 47-54
© 2026 ScienceGate Book Chapters — All rights reserved.