Dual Relation Network for Scene Text Recognition

Ming Li; Bin Fu; Han Chen; Junjun He; Yu Qiao

doi:10.1109/tmm.2022.3171108

ScienceGate Book Chapters

JOURNAL ARTICLE

Dual Relation Network for Scene Text Recognition

Ming Li Bin Fu Han Chen Junjun He Yu Qiao

Year: 2022 Journal: IEEE Transactions on Multimedia Vol: 25 Pages: 4094-4107 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/tmm.2022.3171108

Get Full-Text PDF Get Analytical Report

Abstract

Local visual and long-range contextual features yield two complementary cues for human reading text in natural scene. Existing scene text recognition methods mainly extract local features at a low level and then model long-range dependencies at a high level, this sequential pipeline may be sub-optimal to construct complete and effective representation. Except for high-level features, long-range contextual relation is of importance in low-level features as well since it can help separate different characters based on the intervals between characters and thus enhance the character features. To address this issue, we develop a dual relation module to extract complementary features in a parallel manner for scene text recognition, which consists of a local visual branch and a long-range contextual branch. The local visual branch employs a topological-aware operation to model intra-character characteristic and extract discriminative features of different characters. Meanwhile, the long-range contextual branch utilizes a simple but effective strategy to incorporate inter-character relations into feature maps. Our dual relation module is a plug-and-play block which can be easily incorporated into modern deep architectures. Experimental results demonstrate that our methods achieved top performance on several standard benchmarks. Code and models will become publicly available in the future.

Keywords:

Computer science Discriminative model Artificial intelligence Relation (database) Pipeline (software) Representation (politics) Feature (linguistics) Block (permutation group theory) Dual (grammatical number) Code (set theory) Pattern recognition (psychology) Range (aeronautics) Data mining

Metrics

Cited By

0.99

FWCI (Field Weighted Citation Impact)

Refs

0.73

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Retrieval and Classification Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Dual Relation Network for Scene Text Recognition

Abstract

Metrics

Citation History

Topics

Related Documents

A Multi-head Self-relation Network for Scene Text Recognition

DBCAN: Dual-Branch Cross-Attention Network for Scene Text Recognition

Scene Text Recognition With Dual Encoders

Review network for scene text recognition

Scene Text Recognition via Dual-path Network with Shape-driven Attention Alignment