JOURNAL ARTICLE

Flexible scene text recognition based on dual attention mechanism

Zhiqiang TianChunhui WangYouzi XiaoYuping Lin

Year: 2020 Journal:   Concurrency and Computation Practice and Experience Vol: 33 (22)   Publisher: Wiley

Abstract

Summary Scene text recognition (STR) is a very popular topic in the field of computer vision, which can extract text from complex natural scenes. In this article, we propose an end‐to‐end trainable and flexible STR method based on a dual attention mechanism. The proposed method consists of four modules: a thin plate spline transformer for normalizing the original image, a Channel‐Att feature extractor for obtaining representative features, a bidirectional long short‐term memory encoder for encoding sequential context features, and a Self‐Att based decoder for predicting text labels. The results on seven different benchmark datasets IIIT, SVT, IC03, IC13, IC15, SVTP, and CUTE, show that the proposed method is comparable to 13 existing methods. Especially, the average text recognition accuracy of the proposed method is about 1.4% higher than the state‐of‐the‐art method.

Keywords:
Computer science Artificial intelligence Encoder Transformer Pattern recognition (psychology) Dual (grammatical number) Extractor Encoding (memory) Speech recognition Computer vision

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
45
Refs
0.05
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology

Related Documents

BOOK-CHAPTER

Scene Text Recognition Based on Corner Point and Attention Mechanism

Hui WangTao HuXiaowei GengKai Li

Lecture notes in computer science Year: 2024 Pages: 170-181
JOURNAL ARTICLE

An extended attention mechanism for scene text recognition

Zheng XiaoZhenyu NieChao SongAnthony T. Chronopoulos

Journal:   Expert Systems with Applications Year: 2022 Vol: 203 Pages: 117377-117377
JOURNAL ARTICLE

DBCAN: Dual-Branch Cross-Attention Network for Scene Text Recognition

Xinjian GaoYe PangYuyu LiuJun YuMaokun HanKai HouWei Wang

Journal:   2022 IEEE International Conference on Multimedia and Expo (ICME) Year: 2022 Pages: 1-6
© 2026 ScienceGate Book Chapters — All rights reserved.