JOURNAL ARTICLE

Fine-grained Pseudo Labels for Scene Text Recognition

Abstract

Pseudo-Labeling based semi-supervised learning has shown promising advantages in Scene Text Recognition (STR). Most of them usually use a pre-trained model to generate sequence-level pseudo labels for text images and then re-train the model. Recently, conducting Pseudo-Labeling in a teacher-student framework (a student model is supervised by the pseudo labels from a teacher model) has become increasingly popular, which trains in an end-to-end manner and yields outstanding performance in semi-supervised learning. However, applying this framework directly to Pseudo-Labeling STR exhibits unstable convergence, as generating pseudo labels at the coarse-grained sequence-level leads to inefficient utilization of unlabelled data. Furthermore, the inherent domain shift between labeled and unlabeled data results in low quality of derived pseudo labels. To mitigate the above issues, we propose a novel Cross-domain Pseudo-Labeling (CPL) approach for scene text recognition, which makes better utilization of unlabeled data at the character-level and provides more accurate pseudo labels. Specifically, our proposed Pseudo-Labeled Curriculum Learning dynamically adjusts the thresholds for different character classes according to the model's learning status. Moreover, an Adaptive Distribution Regularizer is employed to bridge the domain gap and improve the quality of pseudo labels. Extensive experiments show that CPL boosts those representative STR models to achieve state-of-the-art results on six challenging STR benchmarks. Besides, it can be effectively generalized to handwritten text.

Keywords:
Computer science Sequence labeling Artificial intelligence Domain (mathematical analysis) Sequence (biology) Labeled data Character (mathematics) Pattern recognition (psychology) Quality (philosophy) Convergence (economics) Machine learning Speech recognition Natural language processing Task (project management) Mathematics

Metrics

2
Cited By
0.36
FWCI (Field Weighted Citation Impact)
31
Refs
0.55
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Knowledge Mining with Scene Text for Fine-Grained Recognition

Hao WangJunchao LiaoTianheng ChengZewen GaoHao LíuBo RenXiang BaiWenyu Liu

Journal:   2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Year: 2022
JOURNAL ARTICLE

Enhanced Adjacency-Constrained Hierarchical Clustering Using Fine-Grained Pseudo Labels

Jie YangChin‐Teng Lin

Journal:   IEEE Transactions on Emerging Topics in Computational Intelligence Year: 2024 Vol: 8 (3)Pages: 2481-2492
© 2026 ScienceGate Book Chapters — All rights reserved.