JOURNAL ARTICLE

Attention-Based Deep Neural Network and Its Application to Scene Text Recognition

Abstract

Recognize text in natural scenes is a challenging task. We proposed an attention-based deep neural network architecture for scene text recognition, which integrates feature extraction, feature attention, feature labeling and transcription into a unified framework. The primary advantages of the proposed model are: (1) it is an end-to-end model, does not require any segmentation of the input image. Convolutional neural network (CNN) is used as encoder to extract features, recurrent neural network (RNN) is used as decoder based on its characteristics of predict sequence, which composed a encoder-decoder architecture; (2) Soft Attention mechanism is introduced in, to further extract features in the input image, and allowing for end-to-end training within a standard back propagation framework; (3) Experiments are performed on several challenging scene text datasets, including IIIT5K, Street View Text, ICDAR2003 and ICDAR2013. Results of the experiments show that the proposed model is comparable or better than other models, which demonstrate the superiority of the proposed algorithm.

Keywords:
Computer science Artificial intelligence Convolutional neural network Recurrent neural network Feature extraction Encoder Feature (linguistics) Pattern recognition (psychology) Segmentation Artificial neural network Deep learning Time delay neural network Image segmentation

Metrics

3
Cited By
0.21
FWCI (Field Weighted Citation Impact)
27
Refs
0.55
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Deep neural network with attention model for scene text recognition

Shuo LiMin TangQiang GuoJun LeiJun Zhang

Journal:   IET Computer Vision Year: 2017 Vol: 11 (7)Pages: 605-612
JOURNAL ARTICLE

Enhanced scene text recognition using deep learning based hybrid attention recognition network

R.S. PatilGeeta HanjiRakesh Huded

Journal:   IAES International Journal of Artificial Intelligence Year: 2024 Vol: 13 (4)Pages: 4927-4927
JOURNAL ARTICLE

Scene Text Recognition Based on Bidirectional LSTM and Deep Neural Network

MVV Prasad KantipudiSandeep KumarAshish Kumar Jha

Journal:   Computational Intelligence and Neuroscience Year: 2021 Vol: 2021 (1)Pages: 2676780-2676780
JOURNAL ARTICLE

Context Attention Network for Scene Text Recognition

田荣 董

Journal:   Software Engineering and Applications Year: 2023 Vol: 12 (02)Pages: 345-353
© 2026 ScienceGate Book Chapters — All rights reserved.