Scene text recognition, a highly active research area in computer vision in recent years, remains challenging due to the large variance of irregular text. Current methods typically treat recognition as a sequence-to-sequence task and solve it with an encoder-decoder framework. In this work, we propose DMDAN for robust scene text recognition. First, we employ deformable convolution to strengthen the network's ability to adapt to irregular text. Then, mixed-domain visual attention and self-attention are applied in the encoder and decoder, respectively, which effectively alleviates the problem of "attention drift". Finally, we integrate the center loss to reduce intra-class distances and make each class easier to distinguish. Extensive experiments show that our model outperforms the CRNN baseline by a large margin and achieves performance comparable to existing attention-based methods on both regular and irregular datasets.
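The center loss mentioned above penalizes the distance between each feature vector and the center of its ground-truth class, pulling same-class features together. A minimal NumPy sketch of this term is given below; the variable names (`features`, `labels`, `centers`) and the toy data are illustrative assumptions, not the paper's actual training setup.

```python
import numpy as np

def center_loss(features, labels, centers):
    """Center loss: 0.5 * mean squared Euclidean distance between each
    feature vector and the center of its ground-truth class.
    features: (N, D) array, labels: (N,) int array, centers: (C, D) array."""
    diffs = features - centers[labels]          # (N, D) per-sample offsets
    return 0.5 * np.mean(np.sum(diffs ** 2, axis=1))

# Hypothetical toy example: 4 samples, 2-D features, 2 classes.
feats = np.array([[1.0, 0.0], [0.8, 0.2], [-1.0, 0.0], [-0.9, -0.1]])
labels = np.array([0, 0, 1, 1])
centers = np.array([[0.9, 0.1], [-0.95, -0.05]])
loss = center_loss(feats, labels, centers)  # small, since features sit near their centers
```

In practice the class centers are learnable parameters updated alongside the network, and the center loss is added to the recognition loss with a weighting coefficient.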
Shuo Xu, Zeming Zhuang, Ming-Jun Li, Feng Su
Zhi Qiao, Xugong Qin, Yu Zhou, Fei Yang, Weiping Wang