JOURNAL ARTICLE

An Attention-based Sequence Learning Model for Scene Text Recognition with Text Correction

Abstract

Recognizing text from images taken in natural scenes is a challenging task and a hot research topic in computer vision. Unlike traditional optical character recognition (OCR), words in natural images often possess irregular layout (e.g. arbitrarily orientation, blurring, perspective distortion) which are difficult to recognize. In this paper, we develop a novel method consisting of a text recognition network and a text correction component, which is more robust to irregular text. The text correction component rectify the text of an input image to a more "readable" text. The text recognition network is a more "location aware" attention-based sequence learning model that take the rectified image as input and recognize the text. The entire networks are trained jointly by only images and word-level annotations. The standard Softmax loss function only considers the separability between classes but does not restrict the aggregation within classes. Therefore, we adopt a new loss function based on the Softmax loss function to enable the model to learn more discriminative features, reduce misjudgments and improve accuracy. Extensive experiments on seven popular standard benchmarks, demonstrate the proposed method is comparable to state-of-the-art performance.

Keywords:
Computer science Text recognition Sequence (biology) Artificial intelligence Natural language processing Sequence learning Speech recognition Pattern recognition (psychology) Image (mathematics)

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
28
Refs
0.19
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Scene Text Recognition with Heuristic Local Attention

Tianlong MaXiangcheng DuYanlong WangXiu-Tao Cui

Journal:   2022 IEEE International Conference on Big Data (Big Data) Year: 2022 Vol: 32 Pages: 4187-4194
JOURNAL ARTICLE

Sequential alignment attention model for scene text recognition

Yan WuJiaxin FanRenshuai TaoJiakai WangHaotong QinAishan LiuXianglong Liu

Journal:   Journal of Visual Communication and Image Representation Year: 2021 Vol: 80 Pages: 103289-103289
© 2026 ScienceGate Book Chapters — All rights reserved.