JOURNAL ARTICLE

Lightweight Scene Text Recognition Based on Transformer

Xin LuanJinwei ZhangMiaomiao XuWushouer SilamuYanbing Li

Year: 2023 Journal:   Sensors Vol: 23 (9)Pages: 4490-4490   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Scene text recognition (STR) has been a hot research field in computer vision, aiming to recognize text in natural scenes using computers. Currently, attention-based encoder–decoder frameworks struggle to precisely align feature regions with the target object when dealing with complex and low-quality images, a phenomenon known as attention drift. Additionally, with the rise of Transformer, the increasing size of parameters results in higher computational costs. In order to solve the above problems, based on the latest research results of Vision Transformer (ViT), we utilize an additional position-enhancement branch to alleviate attention drift and dynamically fused position information with visual information to achieve better recognition accuracy. The experimental results demonstrate that our model achieves a 3% higher average recognition accuracy on the test set compared to the baseline. Meanwhile, our model maintains the advantage of a small number of parameters and fast inference speed, achieving a good balance between accuracy, speed, and computational load.

Keywords:
Computer science Transformer Encoder Artificial intelligence Inference Cognitive neuroscience of visual object recognition Computer vision Pattern recognition (psychology) Feature extraction Voltage Engineering

Metrics

11
Cited By
2.00
FWCI (Field Weighted Citation Impact)
38
Refs
0.84
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.