An Attention-based Sequence Learning Model for Scene Text Recognition with Text Correction

Chen Fang; Guoqiang Xiao; Conghui Chen

doi:10.1145/3374587.3374645

JOURNAL ARTICLE

Year: 2019 Vol: 90 Pages: 215-220

Abstract

Recognizing text in images of natural scenes is a challenging task and an active research topic in computer vision. Unlike the text handled by traditional optical character recognition (OCR), words in natural images often have irregular appearance (e.g. arbitrary orientation, blurring, perspective distortion), which makes them difficult to recognize. In this paper, we develop a novel method, consisting of a text recognition network and a text correction component, that is more robust to irregular text. The text correction component rectifies the text in an input image into a more "readable" form. The text recognition network is a more "location-aware" attention-based sequence learning model that takes the rectified image as input and recognizes the text. The entire network is trained jointly using only images and word-level annotations. The standard Softmax loss function considers only the separability between classes and does not constrain the aggregation of features within each class. Therefore, we adopt a new loss function, based on the Softmax loss function, that enables the model to learn more discriminative features, reduce misjudgments, and improve accuracy. Extensive experiments on seven popular standard benchmarks demonstrate that the proposed method achieves performance comparable to the state of the art.
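
The abstract's remark about the Softmax loss can be illustrated with a small sketch. The abstract does not specify the new loss function, so the code below is not the authors' method. It only shows, on a toy linear classifier chosen for illustration, that two same-class features can lie far apart yet receive an identical Softmax loss. The helper `intra_class_penalty` (a squared distance to a class centre) is a hypothetical example of a term that would penalize that spread.

```python
import math


def softmax_cross_entropy(logits, label):
    """Standard Softmax loss for one sample: -log(softmax(logits)[label])."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]


def logits_for(feature):
    """Toy two-class linear classifier (an assumption for illustration):
    class 0 scores +x0 and class 1 scores -x0; x1 is ignored."""
    return [feature[0], -feature[0]]


def intra_class_penalty(feature, center):
    """Hypothetical compactness term: half the squared distance to a class centre."""
    return 0.5 * sum((f - c) ** 2 for f, c in zip(feature, center))


# Two features of class 0 that are far apart along the second axis.
a = (2.0, 0.0)
b = (2.0, 10.0)

loss_a = softmax_cross_entropy(logits_for(a), 0)
loss_b = softmax_cross_entropy(logits_for(b), 0)

# The class centre is taken as the mean of the two features.
center = tuple((p + q) / 2 for p, q in zip(a, b))
penalty = intra_class_penalty(a, center) + intra_class_penalty(b, center)

print(loss_a == loss_b)  # the Softmax loss cannot tell the two features apart
print(penalty)           # the compactness term does register the spread
```

In this toy setting the Softmax loss depends only on the logits, so any spread that leaves the logits unchanged goes unpenalized. An added intra-class term is one common way to address this, and it is shown here purely as an assumption.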

Keywords:

Computer science; Text recognition; Sequence (biology); Artificial intelligence; Natural language processing; Sequence learning; Speech recognition; Pattern recognition (psychology); Image (mathematics)

Metrics

FWCI (Field Weighted Citation Impact): 0.00

Citation Normalized Percentile: 0.19

Topics

Handwritten Text Recognition Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Processing and 3D Reconstruction

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Retrieval and Classification Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Triggered attention model for scene text recognition

Scene Text Recognition with Heuristic Local Attention

Scene Text Recognition with Cascade Attention Network

Sequential alignment attention model for scene text recognition

Memory-Augmented Attention Model for Scene Text Recognition