JOURNAL ARTICLE

Scene Text Recognition With Linear Constrained Rectification

Abstract

Scene Text Recognition remains a challenging problem because of various text styles and image distortions. This paper proposed an end-to-end trainable model with a rectification module network.The rectification module adopts a polynomial based spatial transform network to rectify the distorted input image, the feature representation between the rectification and encoding step is shared. The model can be trained with the scene text images and the corresponding word labels. With the flexible rectifying and feature sharing, this model outperforms previous works through the extensive evaluation results on the standard benchmarks, especially on irregular datasets, 80.2% on IC15 and 85.4% on CUTE, more specifically.

Keywords:
Rectification Computer science Feature (linguistics) Artificial intelligence Representation (politics) Pattern recognition (psychology) Encoding (memory) Image (mathematics) Feature extraction Computer vision Word (group theory) Mathematics

Metrics

1
Cited By
0.10
FWCI (Field Weighted Citation Impact)
49
Refs
0.45
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.