Abstract

Extracting building footprints from satellite or aerial imagery is critical for many applications. Yet, the precise delineation of buildings from very high spatial resolution remotely sensed images remains challenging. This study investigated the potentiality of using Mask R-CNN based on the Swin Transformer and Feature Pyramid Network (FPN) in extracting building footprints from RGB images in heterogeneous urban landscapes. The Swin Transformer and FPN were used to extract multiscale features. The model's performance was compared with several instance segmentation models based on the ResNet-50 backbone, including Mask scoring R-CNN, YOLCAT, and SOLO. Results showed that the model successfully segmented building footprints with a mAP50 and F-measure of 0.85 and 0.89, respectively, outperformed the evaluated instance segmentation models.

Keywords:
Computer science Artificial intelligence RGB color model Segmentation Feature extraction Transformer Pyramid (geometry) Computer vision Image resolution Pattern recognition (psychology) Image segmentation Remote sensing Geography Engineering Mathematics

Metrics

6
Cited By
3.74
FWCI (Field Weighted Citation Impact)
27
Refs
0.87
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Automated Road and Building Extraction
Physical Sciences →  Engineering →  Ocean Engineering
Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology
Remote Sensing and LiDAR Applications
Physical Sciences →  Environmental Science →  Environmental Engineering

Related Documents

JOURNAL ARTICLE

Enhanced building footprint extraction from satellite imagery using Mask R-CNN and PointRend

Ahmed NourEldeenM. El-Sayed Wahed

Journal:   Bulletin of Electrical Engineering and Informatics Year: 2024 Vol: 13 (5)Pages: 3601-3608
JOURNAL ARTICLE

BUILDING SEGMENTATION FROM AIRBORNE VHR IMAGES USING MASK R-CNN

Kaibin ZhouYifan ChenIhor SmalRoderik Lindenbergh

Journal:   ˜The œinternational archives of the photogrammetry, remote sensing and spatial information sciences/International archives of the photogrammetry, remote sensing and spatial information sciences Year: 2019 Vol: XLII-2/W13 Pages: 155-161
JOURNAL ARTICLE

Building Extraction Using Mask Scoring R-CNN Network

Yiwen HuFenglin Guo

Journal:   Proceedings of the 3rd International Conference on Computer Science and Application Engineering Year: 2019 Pages: 1-5
© 2026 ScienceGate Book Chapters — All rights reserved.