JOURNAL ARTICLE

Recent advances in the application of vision transformers to remote sensing image scene classification

Monika KumariAjay Kaul

Year: 2023 Journal:   Remote Sensing Letters Vol: 14 (7)Pages: 722-732   Publisher: Taylor & Francis

Abstract

Researchers have investigated the potential of transformer-based models in remote sensing (RS) applications, such as scene categorization, after their recent success in natural language processing and computer vision tasks. In this review article, we provide an overview of the recent developments in vision transformer (ViT)-based models for remote sensing image scene classification (RSISC). We first introduce the basic architecture of transformer models and their extensions to computer vision tasks. Then, we summarize the current state-of-the-art ViT-based models for RSISC, including their architectures, training strategies, and performance evaluation. We also discuss the challenges and limitations of the existing ViT-based models. Finally, we outline some potential future directions for developing transformer-based models for RS applications. This review article intends to give a complete analysis of the current state-of-the-art and future research prospects for ViTs in RSISC, which can be used as a reference for researchers and practitioners in this field.

Keywords:
Computer science Transformer Categorization Architecture Artificial intelligence Machine learning Data science Electrical engineering Engineering Voltage

Metrics

9
Cited By
1.95
FWCI (Field Weighted Citation Impact)
28
Refs
0.85
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Remote Sensing and Land Use
Physical Sciences →  Earth and Planetary Sciences →  Atmospheric Science
© 2026 ScienceGate Book Chapters — All rights reserved.