JOURNAL ARTICLE

A Deformable Attention Network for High-Resolution Remote Sensing Images Semantic Segmentation

Renxiang ZuoGuangyun ZhangRongting ZhangXiuping Jia

Year: 2021 Journal:   IEEE Transactions on Geoscience and Remote Sensing Vol: 60 Pages: 1-14   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Deformable convolutional networks (DCNs) can mitigate the inherent limited geometric transformation. We reformulate the spatialwise attention mechanism using DCNs in this article for semantic segmentation of high-resolution remote sensing (HRRS) images. It combines the sparse spatial sampling strategy and the long-range relationship modeling capability, namely, deformable attention module (DAM). Such locality awareness, more adaptable to HRRS image structures, can capture each pixel's neighboring structural information. A reasonable multiscale deformable attention net (MDANet) is designed for the HRRS image semantic segmentation with a slightly increased computational cost based on the proposed DAM. Specifically, standard convolutional layers in the raw ResNet50 are equipped with a DAM to control sampling over a broader range of feature levels and aggregate multiscale context information. The experimental results evaluated on Vaihingen and DeepGlobe Land Cover Classification datasets show that the performance accuracy of MDANet is improved by 7.77% and 8.45% compared with the backbone network (ResNet50) in terms of Miou evaluation, respectively. Furthermore, a DAM can perform better than a global spatial attention mechanism with less computation on the $3 \times 64 \times 64$ feature map. In addition, the added ablation studies demonstrate the effectiveness and efficiency of the DAM and multiscale strategy, respectively. Moreover, the sensitivity of critical hyperparameters is analyzed.

Keywords:
Computer science Context (archaeology) Segmentation Feature (linguistics) Pixel Artificial intelligence Hyperparameter Image segmentation Remote sensing Pattern recognition (psychology) Range (aeronautics) Computation Computer vision Algorithm

Metrics

43
Cited By
3.38
FWCI (Field Weighted Citation Impact)
73
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.