A critical requirement for the success of supervised deep learning is a large number of annotated images, which is often difficult to obtain in remote sensing semantic segmentation tasks. Self-supervised contrastive learning (CL) offers a strategy for learning general feature representations by pre-training neural networks on vast amounts of unlabeled data and subsequently fine-tuning them on downstream tasks with limited annotations. However, the vast majority of CL methods are built on instance-discrimination pretext tasks, focusing solely on learning a global representation of the entire image while disregarding the spatial and semantic correlations that are crucial for semantic segmentation. To address these issues, in this paper we propose a spatial and semantic consistency contrastive learning (SSCCL) framework for the semantic segmentation of remote sensing images. Specifically, a consistency branch in SSCCL is designed to learn feature representations with spatial and semantic consistency by maximizing the similarity of the overlapping regions of two augmented views. Additionally, an instance branch is introduced to learn global representations by enforcing the similarity of two augmented views of one image. By integrating the consistency branch and the instance branch, the proposed SSCCL framework can learn robust and informative feature representations for semantic segmentation in remote sensing scenarios. The proposed method was evaluated on three publicly available remote sensing semantic segmentation datasets, and the experimental results show that our method achieves superior segmentation performance with limited annotations compared to state-of-the-art CL methods as well as ImageNet pre-training.
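The two branches described in the abstract can be illustrated with a minimal NumPy sketch. This is a hypothetical simplification, not the paper's implementation: the function names (`consistency_loss`, `instance_loss`, `sscc_loss`), the `(1 - cosine similarity)` form of each term, and the weighting factor `lam` are all assumptions made for illustration. The consistency term compares spatially aligned dense features inside the overlap of the two augmented views; the instance term compares globally pooled embeddings of the full views.

```python
import numpy as np

EPS = 1e-8  # numerical guard for the cosine denominator


def _cosine(a, b):
    """Row-wise cosine similarity between two (N, C) arrays."""
    num = np.sum(a * b, axis=-1)
    den = np.linalg.norm(a, axis=-1) * np.linalg.norm(b, axis=-1) + EPS
    return num / den


def consistency_loss(feat1, feat2, box1, box2):
    """Consistency branch (hypothetical form): 1 - mean cosine similarity
    between spatially aligned features in the overlap of two views.

    feat1, feat2: (H, W, C) dense feature maps of the two augmented views.
    box1, box2:   (y0, x0, y1, x1) coordinates of the shared overlap,
                  expressed in each view's own coordinate frame.
    """
    y0, x0, y1, x1 = box1
    r1 = feat1[y0:y1, x0:x1].reshape(-1, feat1.shape[-1])
    y0, x0, y1, x1 = box2
    r2 = feat2[y0:y1, x0:x1].reshape(-1, feat2.shape[-1])
    assert r1.shape == r2.shape, "overlap regions must align pixel-wise"
    return 1.0 - float(np.mean(_cosine(r1, r2)))


def instance_loss(feat1, feat2):
    """Instance branch (hypothetical form): 1 - cosine similarity of the
    globally average-pooled embeddings of the two views."""
    g1 = feat1.reshape(-1, feat1.shape[-1]).mean(axis=0)
    g2 = feat2.reshape(-1, feat2.shape[-1]).mean(axis=0)
    return 1.0 - float(_cosine(g1[None], g2[None])[0])


def sscc_loss(feat1, feat2, box1, box2, lam=1.0):
    """Assumed combined objective: consistency term plus a weighted
    instance term (the actual weighting in SSCCL may differ)."""
    return consistency_loss(feat1, feat2, box1, box2) + lam * instance_loss(
        feat1, feat2
    )
```

As a sanity check, two crops taken from the same feature grid have identical features in their overlap, so the consistency term vanishes there; in real training the two views are independently augmented and encoded, so both terms are nonzero and drive the encoder toward spatially and globally consistent representations.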
Haifeng Li, Yi Li, Guo Zhang, Ruoyun Liu, Haozhe Huang, Qing Zhu, Chao Tao