Abstract

Deep learning has evolved rapidly throughout the digital era, and feature extraction is one of its core components. Extracting text from an image is difficult because the text can appear in a variety of sizes, styles, orientations, and alignments, often with low contrast, noise, and a complicated background. Transforming an image into a different perspective for feature identification is the first step towards text recognition. Scene text provides rich contextual information that can be applied to many vision-based applications, so interest in scene text detection and recognition has grown over the last few years. To address language detection from multilingual scene text images, this paper proposes a deep learning-based solution. The underlying model is a convolutional neural network that detects objects in real time with high accuracy. The study employs the single-network detector "You Only Look Once" (YOLO), since it produces predictions for the full image in a single forward pass through the network. We used the COCO (Common Objects in Context) dataset, a large-scale object detection, segmentation, and captioning dataset. To evaluate an image, YOLO divides it into a grid of cells and predicts bounding boxes and class probabilities for each cell. The bounding boxes are weighted by the predicted probabilities, and the final detections are returned after non-maximum suppression. We evaluated with the F1-score, which combines precision and recall into a single metric by computing their harmonic mean.
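
The abstract mentions two steps that are easy to illustrate in isolation: suppressing overlapping detections and scoring with F1. The following is a minimal sketch only, not the authors' implementation. It assumes a greedy IoU-based form of non-maximum suppression and boxes given as (x1, y1, x2, y2) corner coordinates. The function names and the 0.5 threshold are hypothetical choices made for illustration.

```python
from typing import List, Sequence, Tuple

# Assumed box layout (not stated in the abstract): corner coordinates.
Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def non_max_suppression(
    boxes: Sequence[Box],
    scores: Sequence[float],
    iou_threshold: float = 0.5,  # illustrative value, not from the paper
) -> List[int]:
    """Greedy NMS: visit boxes from highest to lowest score and keep a box
    only if it does not overlap an already kept box above the threshold.
    Returns the indices of the kept boxes."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


def f1_score(tp: int, fp: int, fn: int) -> float:
    """Harmonic mean of precision and recall from detection counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

For example, two heavily overlapping boxes and one distant box reduce to two detections, since the lower-scoring member of the overlapping pair is dropped. With equal precision and recall, the F1-score equals that shared value.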

Keywords:
Computer science, Identification (biology), Artificial intelligence, Natural language, Computer vision, Natural (archaeology), Natural language processing, Geography, Archaeology

Metrics

Cited By: 4
FWCI (Field Weighted Citation Impact): 0.73
Refs: 15
Citation Normalized Percentile: 0.65

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
