Efficient Scene Text Image Super-Resolution with Semantic Guidance

LeoWu TomyEnrique; Xiangcheng Du; Kangliang Liu; Han Yuan; Zhao Zhou; Cheng Jin

doi:10.1109/icassp48485.2024.10446964

ScienceGate Book Chapters

JOURNAL ARTICLE

Efficient Scene Text Image Super-Resolution with Semantic Guidance

LeoWu TomyEnrique Xiangcheng Du Kangliang Liu Han Yuan Zhao Zhou Cheng Jin

Year: 2024 Pages: 3160-3164

DOI: 10.1109/icassp48485.2024.10446964

Get Full-Text PDF Get Analytical Report

Abstract

Scene text image super-resolution has significantly improved the accuracy of scene text recognition. However, many existing methods emphasize performance over efficiency and ignore the practical need for lightweight solutions in deployment scenarios. Faced with the issues, our work proposes an efficient framework called SGENet to facilitate deployment on resource-limited platforms. SGENet contains two branches: super-resolution branch and semantic guidance branch. We apply a lightweight pre-trained recognizer as a semantic extractor to enhance the understanding of text information. Meanwhile, we design the visual-semantic alignment module to achieve bidirectional alignment between image features and semantics, resulting in the generation of high-quality prior guidance. We conduct extensive experiments on benchmark dataset, and the proposed SGENet achieves excellent performance with fewer computational costs.

Keywords:

Computer science Benchmark (surveying) Semantics (computer science) Software deployment Resource (disambiguation) Image (mathematics) Artificial intelligence Resolution (logic) Information retrieval Computer vision Software engineering Programming language

Metrics

Cited By

2.65

FWCI (Field Weighted Citation Impact)

Refs

0.82

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Image Processing Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Digital Media Forensic Detection

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Processing Techniques and Applications

Physical Sciences → Engineering → Media Technology

Efficient Scene Text Image Super-Resolution with Semantic Guidance

Abstract

Metrics

Citation History

Topics

Related Documents

Scene Text Image Super-Resolution with CLIP Prior Guidance

Scene text image super-resolution with semantic-aware interaction

Semantic and Gradient Guided Scene Text Image Super-Resolution

Efficient image super-resolution with semantic guidance and denoising modules

Scene Text Image Super-Resolution Via Semantic Distillation and Text Perceptual Loss