Flexible scene text recognition based on dual attention mechanism

Zhiqiang Tian; Chunhui Wang; Youzi Xiao; Yuping Lin

doi:10.1002/cpe.5863

ScienceGate Book Chapters

JOURNAL ARTICLE

Flexible scene text recognition based on dual attention mechanism

Zhiqiang Tian Chunhui Wang Youzi Xiao Yuping Lin

Year: 2020 Journal: Concurrency and Computation Practice and Experience Vol: 33 (22) Publisher: Wiley

DOI: 10.1002/cpe.5863

Get Full-Text PDF Get Analytical Report

Abstract

Summary Scene text recognition (STR) is a very popular topic in the field of computer vision, which can extract text from complex natural scenes. In this article, we propose an end‐to‐end trainable and flexible STR method based on a dual attention mechanism. The proposed method consists of four modules: a thin plate spline transformer for normalizing the original image, a Channel‐Att feature extractor for obtaining representative features, a bidirectional long short‐term memory encoder for encoding sequential context features, and a Self‐Att based decoder for predicting text labels. The results on seven different benchmark datasets IIIT, SVT, IC03, IC13, IC15, SVTP, and CUTE, show that the proposed method is comparable to 13 existing methods. Especially, the average text recognition accuracy of the proposed method is about 1.4% higher than the state‐of‐the‐art method.

Keywords:

Computer science Artificial intelligence Encoder Transformer Pattern recognition (psychology) Dual (grammatical number) Extractor Encoding (memory) Speech recognition Computer vision

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.05

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Handwritten Text Recognition Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Retrieval and Classification Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Vehicle License Plate Recognition

Physical Sciences → Engineering → Media Technology

Flexible scene text recognition based on dual attention mechanism

Abstract

Metrics

Topics

Related Documents

Scene Text Recognition Based on Corner Point and Attention Mechanism

An extended attention mechanism for scene text recognition

SAM: Self Attention Mechanism for Scene Text Recognition Based on Swin Transformer

DBCAN: Dual-Branch Cross-Attention Network for Scene Text Recognition

Recurrent Highway Networks with Attention Mechanism for Scene Text Recognition