JOURNAL ARTICLE

Recurrent Highway Networks with Attention Mechanism for Scene Text Recognition

Abstract

Scene Text Recognition is an extremely useful but challenging task and has drawn much attention in recent years. The best of previous model is CNN-LSTM model with attention mechanism, and it can recognize the whole text without character-level segmentation and recognition. Compared with LSTM, Recurrent Highway Networks (RHN), as a popular architecture because of its capability of training deep structure, can preform excellently in plenty of situations and has least parameters. Thus, we employ RHN as decoder and combine attention mechanism with it. Moreover, we integrate feature extraction, feature attention and sequence recognition into an end- to-end framework which can be jointly trained. Our proposed method is conducted on challenging public datasets, such as Street View Text and ICDAR 2003, and outperform the results of the best model in some datasets. Nevertheless, our model only contains 6.3 million parameters that is the minimal size of model for this problem.

Keywords:
Computer science Feature (linguistics) Artificial intelligence Task (project management) Segmentation Feature extraction Text recognition Pattern recognition (psychology) Mechanism (biology) Sequence (biology) Architecture Recurrent neural network Machine learning Artificial neural network Image (mathematics)

Metrics

6
Cited By
0.51
FWCI (Field Weighted Citation Impact)
35
Refs
0.70
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Music and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Convolutional Attention Networks for Scene Text Recognition

Hongtao XieShancheng FangZheng-Jun ZhaYating YangYan LiYongdong Zhang

Journal:   ACM Transactions on Multimedia Computing Communications and Applications Year: 2019 Vol: 15 (1s)Pages: 1-17
JOURNAL ARTICLE

An extended attention mechanism for scene text recognition

Zheng XiaoZhenyu NieChao SongAnthony T. Chronopoulos

Journal:   Expert Systems with Applications Year: 2022 Vol: 203 Pages: 117377-117377
JOURNAL ARTICLE

Flexible scene text recognition based on dual attention mechanism

Zhiqiang TianChunhui WangYouzi XiaoYuping Lin

Journal:   Concurrency and Computation Practice and Experience Year: 2020 Vol: 33 (22)
© 2026 ScienceGate Book Chapters — All rights reserved.