JOURNAL ARTICLE

Dual Relation Network for Scene Text Recognition

Ming Li, Bin Fu, Han Chen, Junjun He, Yu Qiao

Year: 2022 Journal: IEEE Transactions on Multimedia Vol: 25 Pages: 4094-4107 Publisher: Institute of Electrical and Electronics Engineers

Abstract

Local visual features and long-range contextual features provide two complementary cues when humans read text in natural scenes. Existing scene text recognition methods mainly extract local features at a low level and then model long-range dependencies at a high level; this sequential pipeline may be sub-optimal for constructing a complete and effective representation. Long-range contextual relations matter not only in high-level features but also in low-level features, since they can help separate different characters based on the intervals between them and thus enhance the character features. To address this issue, we develop a dual relation module for scene text recognition that extracts complementary features in parallel; it consists of a local visual branch and a long-range contextual branch. The local visual branch employs a topological-aware operation to model intra-character characteristics and extract discriminative features of different characters. Meanwhile, the long-range contextual branch uses a simple but effective strategy to incorporate inter-character relations into feature maps. Our dual relation module is a plug-and-play block that can be easily incorporated into modern deep architectures. Experimental results demonstrate that our method achieves top performance on several standard benchmarks. Code and models will be made publicly available in the future.
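
The parallel two-branch idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the topological-aware operation or the contextual strategy, so the sketch substitutes a sliding-window average for the local branch and plain dot-product self-attention for the contextual branch, and fuses them with a residual sum. All function names (`local_branch`, `context_branch`, `dual_relation_block`) and the fusion rule are hypothetical, chosen only to show both branches reading the same input side by side rather than one after the other.

```python
import math

def local_branch(feats, k=3):
    """Assumed stand-in for the local visual branch:
    average each position with its k-wide neighbourhood."""
    n, d = len(feats), len(feats[0])
    half = k // 2
    out = []
    for i in range(n):
        window = feats[max(0, i - half):min(n, i + half + 1)]
        out.append([sum(v[j] for v in window) / len(window) for j in range(d)])
    return out

def context_branch(feats):
    """Assumed stand-in for the long-range contextual branch:
    scaled dot-product self-attention over all positions."""
    n, d = len(feats), len(feats[0])
    scale = math.sqrt(d)
    out = []
    for q in feats:
        scores = [sum(a * b for a, b in zip(q, key)) / scale for key in feats]
        m = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(s - m) for s in scores]
        z = sum(weights)
        out.append([sum(weights[t] * feats[t][j] for t in range(n)) / z
                    for j in range(d)])
    return out

def dual_relation_block(feats):
    """Hypothetical fusion: both branches see the same input (parallel,
    not sequential) and are combined with a residual sum."""
    loc = local_branch(feats)
    ctx = context_branch(feats)
    return [[x + l + c for x, l, c in zip(fx, fl, fc)]
            for fx, fl, fc in zip(feats, loc, ctx)]

# Four positions along the text line, two channels each (toy values).
feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
out = dual_relation_block(feats)
```

Because the output has the same shape as the input, a block of this kind could in principle be dropped between existing layers, which is the sense in which the abstract calls the module plug-and-play.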

Keywords:
Computer science; Discriminative model; Artificial intelligence; Relation (database); Pipeline (software); Representation (politics); Feature (linguistics); Block (permutation group theory); Dual (grammatical number); Code (set theory); Pattern recognition (psychology); Range (aeronautics); Data mining

Metrics

Cited By: 8
FWCI (Field Weighted Citation Impact): 0.99
Refs: 91
Citation Normalized Percentile: 0.73

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

DBCAN: Dual-Branch Cross-Attention Network for Scene Text Recognition

Xinjian Gao, Ye Pang, Yuyu Liu, Jun Yu, Maokun Han, Kai Hou, Wei Wang

Journal: 2022 IEEE International Conference on Multimedia and Expo (ICME) Year: 2022 Pages: 1-6

JOURNAL ARTICLE

Scene Text Recognition With Dual Encoders

Yao Wang, Jong-Eun Ha

Journal: Journal of Institute of Control Robotics and Systems Year: 2023 Vol: 29 (12) Pages: 973-979

JOURNAL ARTICLE

Review network for scene text recognition

Shuo Li, Anqi Han, Xu Chen, Xiaoqing Yin, Jun Zhang

Journal: Journal of Electronic Imaging Year: 2017 Vol: 26 (05) Pages: 1-1

JOURNAL ARTICLE

Scene Text Recognition via Dual-path Network with Shape-driven Attention Alignment

Yijie Hu, Bin Dong, Kaizhu Huang, Lei Ding, Wei Wang, Xiaowei Huang, Qiufeng Wang

Journal: ACM Transactions on Multimedia Computing Communications and Applications Year: 2023 Vol: 20 (4) Pages: 1-20