DUAL-STAGE RECTIFICATION AND ATTENTION FRAMEWORK FOR ROBUST SCENE TEXT RECOGNITION

Dr. K. Siva Kumar; Chinnam Lavanya; Cheedella Sai Pranavi; Addagatla Sagari Sailaja Kumari

doi:10.5281/zenodo.15331728

JOURNAL ARTICLE

Year: 2025; Journal: Zenodo (CERN European Organization for Nuclear Research); Publisher: European Organization for Nuclear Research

Abstract

Recognizing scene text under irregular distortions demands robust rectification prior to decoding. We propose a Two-Level Rectification Attention Network (TRAN) that unites a Geometry-Level Rectification Network (GEO), which uses thin-plate spline (TPS) warping to correct global skew and curvature, with a Pixel-Level Rectification Network (PIX) that applies fine-grained per-pixel offsets to refine local deformations. To handle diverse character scales and appearances, we introduce a Channel-Kernel Attention Unit that dynamically weights feature channels and convolutional kernels. Implemented atop the ClovaAI deep-text-recognition benchmark framework with PyTorch and pretrained CNN–RNN backbones, TRAN demonstrates superior rectification and recognition performance. Large-scale experiments on benchmarks with curved, rotated, and perspective-warped text show that TRAN's two-stage rectification strategy outperforms single-stage rectification algorithms. Our results suggest that combining multi-level rectification with adaptive attention is a promising direction for more robust scene text recognition in real-world applications such as navigation systems and reading aids.
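To illustrate the pixel-level stage mentioned in the abstract, the following is a minimal, dependency-free Python sketch of resampling an image with per-pixel offsets through bilinear interpolation. It is not the authors' PyTorch implementation. The function names `bilinear_sample` and `apply_pixel_offsets`, the offset convention (each output pixel reads from its own position plus an offset), and the border clamping are all assumptions made for this example. In TRAN the offsets would be predicted by the PIX network; here they are written by hand.

```python
import math


def bilinear_sample(image, x, y):
    """Sample a grayscale image (list of rows) at real-valued (x, y).

    Coordinates outside the image are clamped to the border
    (an assumption of this sketch, not something stated in the abstract).
    """
    h, w = len(image), len(image[0])
    x = min(max(x, 0.0), w - 1.0)
    y = min(max(y, 0.0), h - 1.0)
    x0, y0 = int(math.floor(x)), int(math.floor(y))
    x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
    fx, fy = x - x0, y - y0
    # Interpolate along x on the two neighbouring rows, then along y.
    top = image[y0][x0] * (1 - fx) + image[y0][x1] * fx
    bottom = image[y1][x0] * (1 - fx) + image[y1][x1] * fx
    return top * (1 - fy) + bottom * fy


def apply_pixel_offsets(image, offsets):
    """Warp `image` with a dense offset field.

    offsets[y][x] is a (dx, dy) pair: output pixel (x, y) takes its value
    from the input at (x + dx, y + dy).
    """
    h, w = len(image), len(image[0])
    return [
        [
            bilinear_sample(image, x + offsets[y][x][0], y + offsets[y][x][1])
            for x in range(w)
        ]
        for y in range(h)
    ]


# Hypothetical 3x3 image and two hand-written offset fields.
image = [
    [0.0, 10.0, 20.0],
    [30.0, 40.0, 50.0],
    [60.0, 70.0, 80.0],
]
zero_offsets = [[(0.0, 0.0)] * 3 for _ in range(3)]
half_pixel_right = [[(0.5, 0.0)] * 3 for _ in range(3)]

unchanged = apply_pixel_offsets(image, zero_offsets)
shifted = apply_pixel_offsets(image, half_pixel_right)
```

With zero offsets the image is returned unchanged. With a constant offset of half a pixel, each output value is the average of two horizontal neighbours, except at the right edge, where the clamped coordinate repeats the border pixel. A geometry-level stage such as TPS warping would run before this step and handle the global shape, leaving only small local corrections to the offset field.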

Keywords:

Rectification; Image warping; Skew; Feature (linguistics); Convolutional neural network; Pattern recognition (psychology); Point (geometry); Character recognition; Offset (computer science)

Metrics

FWCI (Field Weighted Citation Impact): 0.00

Citation Normalized Percentile: 0.53
