Text Detection and Language Identification in Natural Scene Images using YOLOv5

R.S. Latha; G. R. Sreekanth; R.C. Suganthe; R. Rajadevi; V.V. Jagadeeswaran; Logesh Ravi; A. Maheshvar

doi:10.1109/iccci56745.2023.10128400

ScienceGate Book Chapters

JOURNAL ARTICLE

Text Detection and Language Identification in Natural Scene Images using YOLOv5

R.S. Latha G. R. Sreekanth R.C. Suganthe R. Rajadevi V.V. Jagadeeswaran Logesh Ravi A. Maheshvar

Year: 2023 Pages: 1-7

DOI: 10.1109/iccci56745.2023.10128400

Get Full-Text PDF Get Analytical Report

Abstract

Deep learning has immensely evolved ever since digital era. Deep learning also includes feature extraction as a facet. Text snipping from a picture is a difficult task since the image comprises text in a variety of sizes, styles, orientations, alignments, low contrast, noise, and with a complicated backdrop structure. Transformation of an image into different perspective for feature identification is the first step towards text recognition. Scene texts provide rich contextual information that can be applied to several types of vision-based applications, hence over the last few years we have witnessed an increase in interest in the detection and recognition of scene texts. In order to address the issue of language detection from multilingual scene text photos, a deep learning-based solution is suggested in this paper. In this study, the underlying model of a Convolutional neural network is employed to detect objects in real-time with high accuracy. This study employs a single neural network "you only look once" known as YOLO, since it offers predictions with just a single forward propagation trip through the neural network to evaluate the full image. We used COCO 'Common Objects in Context' dataset which is a large-scale object detection, segmentation, and captioning dataset. To evaluate the image YOLO divides the image into smaller parts and forecasts boundary areas and probabilities for every part. The predicted probability weighs these region proposals. It then provides identified objects after non-max linear suppression. We used F1-score which combines accuracy and recall into a single metric by computing their harmonic means.

Keywords:

Computer science Identification (biology) Artificial intelligence Natural language Computer vision Natural (archaeology) Natural language processing Geography Archaeology

Metrics

Cited By

0.73

FWCI (Field Weighted Citation Impact)

Refs

0.65

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Vehicle License Plate Recognition

Physical Sciences → Engineering → Media Technology

Image Processing and 3D Reconstruction

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Text Detection and Language Identification in Natural Scene Images using YOLOv5

Abstract

Metrics

Citation History

Topics

Related Documents

Text Detection and Language Identification on Natural Scene Images using Faster R-CNN

Text detection and script identification in natural scene images using deep learning

Uyghur Text Detection in Natural Scene Images

Devanagari Text Detection From Natural Scene Images

Robust Text Detection in Natural Scene Images