Image captioning is the task of generating a textual description that accurately reflects the content of an image. It is a significant problem in deep learning with a wide range of applications. The task maps an image, represented as a grid of pixels, to a sequence of words relevant to that image, and can therefore be framed as an end-to-end, sequence-to-sequence problem in which both visual and linguistic information must be processed. In the approach considered here, a convolutional neural network extracts feature vectors from the image, and a recurrent neural network generates the caption from those features. The findings of this study highlight the effectiveness of this method and demonstrate its applicability to diverse image-processing and description-generation tasks. This work supports incorporating these methods into practical image captioning systems in order to produce more precise and contextually appropriate image descriptions.
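The encoder-decoder pipeline described above can be sketched in miniature. The code below is a library-free illustration only: `cnn_encode` is a hypothetical stand-in for a CNN feature extractor, `rnn_decode` for an RNN language model, and the lookup table plays the role of learned weights; a real system would replace both with trained networks.

```python
START, END = "<start>", "<end>"

def cnn_encode(image):
    """Stand-in for a CNN encoder: collapse a 2-D pixel grid into a
    fixed-length feature vector (here, simple pooled statistics)."""
    flat = [p for row in image for p in row]
    return [sum(flat) / len(flat), max(flat), min(flat)]

def rnn_decode(features, step_fn, max_len=10):
    """Stand-in for an RNN decoder: emit one word per step, conditioning
    each step on the image features and the previously generated word."""
    caption, word, state = [], START, features
    for _ in range(max_len):
        word, state = step_fn(word, state)
        if word == END:
            break
        caption.append(word)
    return caption

# Toy step function: a fixed transition table instead of learned weights.
TRANSITIONS = {START: "a", "a": "dog", "dog": "on", "on": "grass", "grass": END}

def toy_step(prev_word, state):
    return TRANSITIONS[prev_word], state

image = [[0.1, 0.5], [0.9, 0.3]]          # tiny "image" of pixel intensities
features = cnn_encode(image)
print(" ".join(rnn_decode(features, toy_step)))  # a dog on grass
```

The structure mirrors the real pipeline: the encoder is run once per image, while the decoder loops, consuming its own previous output until an end token is produced.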
Ning Xu, An-An Liu, Jing Liu, Weizhi Nie, Yuting Su
Eslam Abdelrahman, Pengzhan Sun, Li Erran Li, Mohamed Elhoseiny
Xiaobao Yang, Yang Yang, Junsheng Wu, Wei Sun, Sugang Ma, Zhiqiang Hou
Xi Tian, Xiaobao Yang, Sugang Ma, Bohui Song, Ziqing He
Muneeb Nabi, Rohit Pachauri, Shouaib Ahmad, Kanishk Varshney, Prachi Goel, Apurva Jain