JOURNAL ARTICLE

Scene Text Recognition with Multi-decoders

Yao WangJong-Eun Ha

Year: 2021 Journal:   2021 21st International Conference on Control, Automation and Systems (ICCAS) Pages: 1523-1528

Abstract

In this article, we focus on the scene text recognition problem, which is one of the challenging sub-files of computer vision because of the random existence of scene text. Recently, scene text recognition has achieved state-of-art performance because of the improvement of deep learning. At present, encoder-decoder architecture was widely used for scene recognition tasks, which consist of feature extractor, sequence module. Specifically, at the decoder part, connectionist temporal classification(CTC), attention mechanism, and transformer(self-attention) are three main approaches used in recent research. CTC decoder is flexible and can handle sequences with large changes in length for its align sequences features with labels in a frame-wise manner. Attention decoder can learn better and deeper feature expression and get the better position information of each character. Attention decoder can get more robust and accurate performance for both regular and irregular scene text. Moreover, a novel decoder mechanism is introduced in our study. The proposed architecture has several advantages: the model can be trained using the end-to-end manner under the condition of multi decoders, and can deal with the sequences of arbitrary length and the images of arbitrary shape. Extensive experiments on standard benchmarks demonstrate that our model's performance is improved for regular and irregular text recognition.

Keywords:
Computer science Artificial intelligence Connectionism Encoder Decoding methods Feature (linguistics) Focus (optics) Pattern recognition (psychology) Text recognition Feature extraction Computer vision Speech recognition Artificial neural network Image (mathematics) Algorithm

Metrics

1
Cited By
0.06
FWCI (Field Weighted Citation Impact)
28
Refs
0.35
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Scene Text Recognition with Multi-Encoders

Yao WangJong-Eun Ha

Journal:   2022 22nd International Conference on Control, Automation and Systems (ICCAS) Year: 2022 Pages: 1615-1620
JOURNAL ARTICLE

Cascade 2D attentional decoders with context-enhanced encoder for scene text recognition

Hongmei ChiJiaxin CaiXinran Li

Journal:   Neural Computing and Applications Year: 2024 Vol: 36 (14)Pages: 7817-7827
JOURNAL ARTICLE

Scene Text Recognition with Transformer using Multi-patches

Yao WangJong-Eun Ha

Journal:   Journal of Institute of Control Robotics and Systems Year: 2022 Vol: 28 (10)Pages: 862-867
JOURNAL ARTICLE

Multi-scene ancient chinese text recognition

Kaili WangYaohua YiJunjie LiuLiqiong LuYing Song

Journal:   Neurocomputing Year: 2019 Vol: 377 Pages: 64-72
BOOK-CHAPTER

Multi-granularity Prediction for Scene Text Recognition

Peng WangCheng DaCong Yao

Lecture notes in computer science Year: 2022 Pages: 339-355
© 2026 ScienceGate Book Chapters — All rights reserved.