Abstract

Image captioning is the task of generating a textual description that accurately reflects the content of an image. It is an important problem in deep learning with a wide range of applications. The task converts an image, represented as a sequence of pixels, into a sequence of words relevant to that image. Because both visual and linguistic information must be processed, it can be framed as an end-to-end, sequence-to-sequence problem. In this work, a convolutional neural network extracts feature vectors from the images, and a recurrent neural network handles the language processing. The findings of this study highlight the effectiveness of this approach and indicate that it can support diverse applications in image processing and description generation. The work encourages incorporating these methods into practical image captioning systems to produce more precise and contextually appropriate image descriptions.
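The encoder-decoder data flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy vocabulary, the function names (conv_features, rnn_step, caption, init_params), the layer sizes, and the greedy decoding strategy are all assumptions chosen for illustration, and the weights are random and untrained, so the generated words carry no meaning. The sketch only shows how convolutional features can initialise a recurrent decoder that emits one word per step.

```python
import math
import random

# Hypothetical toy vocabulary; a real system would build this from training captions.
VOCAB = ["<start>", "<end>", "a", "dog", "on", "grass"]


def conv_features(image, kernels):
    """Encoder sketch: slide each kernel over the image (cross-correlation,
    as in CNN layers), apply ReLU, then global-average-pool to one value."""
    h, w = len(image), len(image[0])
    feats = []
    for k in kernels:
        kh, kw = len(k), len(k[0])
        total, count = 0.0, 0
        for i in range(h - kh + 1):
            for j in range(w - kw + 1):
                s = sum(image[i + a][j + b] * k[a][b]
                        for a in range(kh) for b in range(kw))
                total += max(0.0, s)  # ReLU
                count += 1
        feats.append(total / count)
    return feats


def matvec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(x * y for x, y in zip(row, v)) for row in m]


def rnn_step(h, x, W_hh, W_xh):
    """One Elman RNN step: h' = tanh(W_hh h + W_xh x)."""
    return [math.tanh(a + b) for a, b in zip(matvec(W_hh, h), matvec(W_xh, x))]


def rand_matrix(rng, rows, cols):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def init_params(seed=0, n_kernels=4, hidden=8, embed_dim=5):
    """Random, untrained parameters (assumed sizes, for illustration only)."""
    rng = random.Random(seed)
    return {
        "kernels": [rand_matrix(rng, 3, 3) for _ in range(n_kernels)],
        "W_init": rand_matrix(rng, hidden, n_kernels),
        "embed": rand_matrix(rng, len(VOCAB), embed_dim),
        "W_hh": rand_matrix(rng, hidden, hidden),
        "W_xh": rand_matrix(rng, hidden, embed_dim),
        "W_out": rand_matrix(rng, len(VOCAB), hidden),
    }


def caption(image, params, max_len=10):
    """Decoder sketch: image features set the initial hidden state, then
    words are chosen greedily until <end> or max_len is reached."""
    feats = conv_features(image, params["kernels"])
    h = [math.tanh(v) for v in matvec(params["W_init"], feats)]
    token = VOCAB.index("<start>")
    words = []
    for _ in range(max_len):
        h = rnn_step(h, params["embed"][token], params["W_hh"], params["W_xh"])
        logits = matvec(params["W_out"], h)
        token = max(range(len(VOCAB)), key=logits.__getitem__)  # greedy argmax
        if VOCAB[token] == "<end>":
            break
        words.append(VOCAB[token])
    return words


# Usage: a 6x6 grayscale "image" with values in [0, 1].
image = [[((i * j) % 7) / 6.0 for j in range(6)] for i in range(6)]
print(caption(image, init_params(seed=0)))
```

In a trained system the kernels, embeddings, and recurrent weights would be learned jointly from image-caption pairs, and the decoder would typically be an LSTM or GRU with beam search in place of the greedy choice used here.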

Keywords:
Closed captioning; Computer science; Artificial intelligence; Convolutional neural network; Image (mathematics); Sequence (biology); Image processing; Feature (linguistics); Process (computing); Pixel; Natural language processing; Computer vision; Pattern recognition (psychology); Linguistics

Metrics

Cited By: 1
FWCI (Field Weighted Citation Impact): 0.18
Refs: 18
Citation Normalized Percentile: 0.47

Topics

Multimodal Machine Learning Applications
Advanced Image and Video Retrieval Techniques
Video Analysis and Summarization
Field (all topics): Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
