JOURNAL ARTICLE

In-Domain Self-Supervised Learning Improves Remote Sensing Image Scene Classification

Ivica DimitrovskiIvan KitanovskiNikola SimidjievskiDragi Kocev

Year: 2024 Journal:   IEEE Geoscience and Remote Sensing Letters Vol: 21 Pages: 1-5   Publisher: Institute of Electrical and Electronics Engineers

Abstract

We investigate the utility of in-domain self-supervised pre-training of vision models in the analysis of remote sensing imagery. Self-supervised learning (SSL) has emerged as a promising approach for remote sensing image classification due to its ability to exploit large amounts of unlabeled data. Unlike traditional supervised learning, SSL aims to learn representations of data without the need for explicit labels. This is achieved by formulating auxiliary tasks that can be used for pre-training models before fine-tuning them on a given downstream task. A common approach in practice to SSL pre-training is utilizing standard pre-training datasets, such as ImageNet. While relevant, such a general approach can have a sub-optimal influence on the downstream performance of models, especially on tasks from challenging domains such as remote sensing. In this paper, we analyze the effectiveness of SSL pre-training by employing the iBOT framework coupled with Vision transformers trained on Million-AID, a large and unlabeled remote sensing dataset. We present a comprehensive study of different self-supervised pre-training strategies and evaluate their effect across 14 downstream datasets with diverse properties. Our results demonstrate that leveraging large in-domain datasets for self-supervised pre-training consistently leads to improved predictive downstream performance, compared to the standard approaches found in practice.

Keywords:
Computer science Artificial intelligence Machine learning Exploit Training set Labeled data Supervised learning Task (project management) Transformer Domain (mathematical analysis) Semi-supervised learning Co-training Pattern recognition (psychology) Data mining Artificial neural network

Metrics

6
Cited By
3.69
FWCI (Field Weighted Citation Impact)
17
Refs
0.88
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.