JOURNAL ARTICLE

Intelligent Deep Learning Empowered Text Detection Model from Natural Scene Images

S. Kiruthika DeviSubalalitha CN

Year: 2022 Journal:   International Journal on Advanced Science Engineering and Information Technology Vol: 12 (3)Pages: 1263-1263   Publisher: Insight Society

Abstract

The scene Text Recognition process has become a hot research topic and a challenging task owing to the complicated background, varying light intensities, colors, font styles, and sizes. Text extraction from natural scene images encompasses two main processes: text detection and text recognition. The latest advancements in Machine Learning (ML) and Deep Learning (DL) concepts can effectually automate the text detection and recognition process by training the model properly. In this view, this paper presents an Automated DL empowered Text Detection model from Natural Scene Images (ADLTD-NSI). The ADLTD-NSI technique includes two important processes: text detection and text recognition. Firstly, a single shot detector (SSD) with Inception-v2 as a baseline model is employed for text detection, an object detector based on the VGG-16 framework for feature map extraction followed by six convolution layers. Secondly, Convolutional Recurrent Neural Network (CRNN) technique is utilized for the text recognition process. Besides, the recurrent layers in the CRNN model utilize long short-term memory (LSTM) for encoding the sequence of feature vectors. Lastly, Connectionist Temporal Classification (CTC) loss is applied to predict text labels equivalent to the sequences from the recurrent layers. A wide range of experiments was carried out on benchmark COCO datasets, and the results are examined in several aspects. The experimental outcomes showcased the better performance of the ADLTD-NSI technique over the other compared methods with a maximum accuracy of 96.78%.

Keywords:
Computer science Artificial intelligence Benchmark (surveying) Text detection Recurrent neural network Convolutional neural network Pattern recognition (psychology) Feature extraction Text recognition Feature (linguistics) Task (project management) Deep learning Connectionism Detector Process (computing) Object detection Convolution (computer science) Artificial neural network Image (mathematics)

Metrics

6
Cited By
0.74
FWCI (Field Weighted Citation Impact)
22
Refs
0.67
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Deep learning for detection of text polarity in natural scene images

Pavan Kumar Perepu

Journal:   Neurocomputing Year: 2020 Vol: 431 Pages: 1-6
JOURNAL ARTICLE

Devanagari Text Detection From Natural Scene Images

Sankirti ShiravaleR. JayadevanSanjeev S. Sannakki

Journal:   International Journal of Computer Vision and Image Processing Year: 2020 Vol: 10 (3)Pages: 44-59
BOOK-CHAPTER

Text Detection from Natural Scene Images Using Scale Space Model

Qiaoyu SunYue Lu

Communications in computer and information science Year: 2012 Pages: 156-161
© 2026 ScienceGate Book Chapters — All rights reserved.