JOURNAL ARTICLE

Learning Generative Structure Prior for Blind Text Image Super-resolution

Abstract

Blind text image super-resolution (SR) is challenging as one needs to cope with diverse font styles and unknown degradation. To address the problem, existing methods perform character recognition in parallel to regularize the SR task, either through a loss constraint or intermediate feature condition. Nonetheless, the high-level prior could still fail when encountering severe degradation. The prob-lem is further compounded given characters of complex structures, e.g., Chinese characters that combine multiple pictographic or ideographic symbols into a single charac-ter. In this work, we present a novel prior that focuses more on the character structure. In particular, we learn to encapsulate rich and diverse structures in a StyleGAN and exploit such generative structure priors for restoration. To restrict the generative space of StyleGAN so that it obeys the structure of characters yet remains flexible in handling different font styles, we store the discrete features for each character in a codebook. The code subsequently drives the StyleGAN to generate high-resolution structural details to aid text SR. Compared to priors based on character recognition, the proposed structure prior ex-erts stronger character-specific guidance to restore faithful and precise strokes of a designated character. Extensive experiments on synthetic and real datasets demonstrate the compelling performance of the proposed generative structure prior in facilitating robust text SR. Our code is available at https://github.com/csxmli2016/MARCONet.

Keywords:
Codebook Generative grammar Computer science Character (mathematics) Prior probability Artificial intelligence Code (set theory) Generative model Constraint (computer-aided design) Image (mathematics) Feature (linguistics) Pattern recognition (psychology) Natural language processing Bayesian probability Mathematics Linguistics Set (abstract data type) Programming language

Metrics

27
Cited By
4.91
FWCI (Field Weighted Citation Impact)
101
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Digital Media Forensic Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology

Related Documents

JOURNAL ARTICLE

Enhanced Generative Structure Prior for Chinese Text Image Super-Resolution

Xiaoming LiWangmeng ZuoChen Change Loy

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2025 Vol: 47 (12)Pages: 11366-11381
JOURNAL ARTICLE

Text Prior Guided Scene Text Image Super-Resolution

Jianqi MaShi GuoLei Zhang

Journal:   IEEE Transactions on Image Processing Year: 2023 Vol: 32 Pages: 1341-1353
© 2026 ScienceGate Book Chapters — All rights reserved.