Learning Generative Structure Prior for Blind Text Image Super-resolution

Xiaoming Li; Wangmeng Zuo; Chen Change Loy

doi:10.1109/cvpr52729.2023.00974

ScienceGate Book Chapters

JOURNAL ARTICLE

Learning Generative Structure Prior for Blind Text Image Super-resolution

Xiaoming Li Wangmeng Zuo Chen Change Loy

Year: 2023

DOI: 10.1109/cvpr52729.2023.00974

Get Full-Text PDF Get Analytical Report

Abstract

Blind text image super-resolution (SR) is challenging as one needs to cope with diverse font styles and unknown degradation. To address the problem, existing methods perform character recognition in parallel to regularize the SR task, either through a loss constraint or intermediate feature condition. Nonetheless, the high-level prior could still fail when encountering severe degradation. The prob-lem is further compounded given characters of complex structures, e.g., Chinese characters that combine multiple pictographic or ideographic symbols into a single charac-ter. In this work, we present a novel prior that focuses more on the character structure. In particular, we learn to encapsulate rich and diverse structures in a StyleGAN and exploit such generative structure priors for restoration. To restrict the generative space of StyleGAN so that it obeys the structure of characters yet remains flexible in handling different font styles, we store the discrete features for each character in a codebook. The code subsequently drives the StyleGAN to generate high-resolution structural details to aid text SR. Compared to priors based on character recognition, the proposed structure prior ex-erts stronger character-specific guidance to restore faithful and precise strokes of a designated character. Extensive experiments on synthetic and real datasets demonstrate the compelling performance of the proposed generative structure prior in facilitating robust text SR. Our code is available at https://github.com/csxmli2016/MARCONet.

Keywords:

Codebook Generative grammar Computer science Character (mathematics) Prior probability Artificial intelligence Code (set theory) Generative model Constraint (computer-aided design) Image (mathematics) Feature (linguistics) Pattern recognition (psychology) Natural language processing Bayesian probability Mathematics Linguistics Set (abstract data type) Programming language

Metrics

Cited By

4.91

FWCI (Field Weighted Citation Impact)

101

Refs

0.94

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Image Processing Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Digital Media Forensic Detection

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Processing Techniques and Applications

Physical Sciences → Engineering → Media Technology

Learning Generative Structure Prior for Blind Text Image Super-resolution

Abstract

Metrics

Citation History

Topics

Related Documents

Enhanced Generative Structure Prior for Chinese Text Image Super-Resolution

GARDEN: Generative Prior Guided Network for Scene Text Image Super-Resolution

Blind Single Image Super-Resolution via Iterated Shared Prior Learning

Deep Image and Kernel Prior Learning for Blind Super-Resolution

Text Prior Guided Scene Text Image Super-Resolution