The analysis of remote-sensing images can be of great importance, as it directly affects people's lives through applications such as monitoring environmental changes or traffic congestion in cities. Describing the content of a remote-sensing image in natural language and segmenting the image into semantic classes can support this analysis. Deep learning architectures have proved effective in many applications, including computer vision tasks. For segmentation, we use the U-Net deep learning model. For captioning, we test a CNN-Transformer-based image captioning model with different CNN configurations and model architectures. We also apply an image-processing step to the segmentation map produced by the U-Net to further analyze the image and augment the predicted caption with additional useful information. Our approach enriches the captions of remote-sensing images by exploiting the segmentation output.
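The abstract does not specify how the segmentation output augments the caption; one plausible reading is that per-class pixel statistics from the U-Net's label map are appended to the predicted caption. The sketch below illustrates that idea under stated assumptions: the class names, the `augment_caption` helper, and the coverage threshold are all hypothetical, not taken from the paper.

```python
import numpy as np

# Hypothetical land-cover classes; the actual classes depend on the dataset used.
CLASS_NAMES = {0: "background", 1: "buildings", 2: "roads", 3: "water", 4: "vegetation"}

def augment_caption(caption: str, mask: np.ndarray, min_fraction: float = 0.05) -> str:
    """Append coverage statistics derived from a segmentation mask to a caption.

    `mask` is a 2-D array of integer class labels, e.g. the per-pixel argmax
    of a U-Net's class probabilities.
    """
    total = mask.size
    parts = []
    for label, name in CLASS_NAMES.items():
        if name == "background":
            continue
        frac = np.count_nonzero(mask == label) / total
        if frac >= min_fraction:  # report only classes covering a meaningful area
            parts.append(f"{frac:.0%} {name}")
    if not parts:
        return caption
    return caption.rstrip(".") + " (" + ", ".join(parts) + " by area)."

# Toy 4x4 label map: a water region (3) above a vegetation region (4).
mask = np.array([
    [3, 3, 0, 0],
    [3, 3, 0, 0],
    [4, 4, 4, 4],
    [4, 4, 4, 4],
])
print(augment_caption("A river next to a green field", mask))
# → A river next to a green field (25% water, 50% vegetation by area).
```

In practice the same idea extends to connected-component counts or region locations, but the pixel-fraction summary is the simplest way to turn a segmentation map into extra caption text.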