JOURNAL ARTICLE

Scale-Aware Detailed Matching for Few-Shot Aerial Image Semantic Segmentation

Xiwen YaoQinglong CaoXiaoxu FengGong ChengJunwei Han

Year: 2021 Journal:   IEEE Transactions on Geoscience and Remote Sensing Vol: 60 Pages: 1-11   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Few-shot semantic segmentation, aiming to segment query images with a few annotated support samples, has drawn increasing attention. Most existing few-shot methods leverage the single prototype obtained from global average pooling to represent all support information and further use the extracted prototype to segment the query images in a matching manner. Although promising results for natural images have been reported, these methods cannot be directly applied on aerial images. The main reason comes from that the extracted single support prototype can only provide a coarse guidance for matching between query and support images and could not handle the large variance of objects’ appearances and scales. To deal with these challenges on aerial images, we propose a scale-aware few-shot semantic segmentation network to perform detailed matching with multiple prototypes. More specifically, the detailed matching module is first constructed to compute the pixel-level similarity between the query features and the extracted multiple support prototypes for providing more accurate parsing guidance. Subsequently, to address the problem of scale imbalance, the scale-aware focal loss is designed to dynamically down-weight the loss assigned to large well-parsed objects and focus training on tiny hard-parsed objects. To facilitate the reproducible research on the task of few-shot semantic segmentation in aerial images, we further provide a few-shot segmentation benchmark iSAID- $5^{\mathrm {i}}$ constructed from the large-scale iSAID dataset [1] . Comprehensive experiments and comparisons with the state-of-the-art few-shot segmentation methods on the iSAID- $5^{\mathrm {i}}$ dataset clearly demonstrate the superiority of our proposed method. The code and dataset are available at https://github.com/caoql98/SDM .

Keywords:
Computer science Parsing Segmentation Leverage (statistics) Artificial intelligence Matching (statistics) Focus (optics) Pooling Scale (ratio) Pattern recognition (psychology) Benchmark (surveying) Image segmentation Computer vision Pixel Similarity (geometry) Image (mathematics) Mathematics

Metrics

73
Cited By
5.83
FWCI (Field Weighted Citation Impact)
42
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Few-Shot Rotation-Invariant Aerial Image Semantic Segmentation

Qinglong CaoYuntian ChenChao MaXiaokang Yang

Journal:   IEEE Transactions on Geoscience and Remote Sensing Year: 2023 Vol: 62 Pages: 1-13
JOURNAL ARTICLE

Few-Shot Aerial Image Semantic Segmentation Leveraging Pyramid Correlation Fusion

Wei AoShunyi ZhengYan MengZhi Gao

Journal:   IEEE Transactions on Geoscience and Remote Sensing Year: 2023 Vol: 61 Pages: 1-12
© 2026 ScienceGate Book Chapters — All rights reserved.