JOURNAL ARTICLE

OCR-friendly Image Synthesis using Generative Adversarial Networks

Abstract

With quality being a deciding factor in the accuracy of OCR predictions for a given source image, there comes the need of pre-processing methods to improve the quality of an image before undergoing this process. Towards that, we present a GAN-based method targeted at improving the quality of source image in the fields of image resolution, blur and noise. The model uses an encoder trained to generate latent image representations for a low-quality image, the representations corresponding to blur and noise types present in the image. These representations act as inputs to the constructed conditional GAN. Besides these labels, the generator takes a low-quality image as input and is trained to generate a high-quality image as represented by the target images.

Keywords:
Artificial intelligence Computer science Image quality Image restoration Computer vision Image (mathematics) Noise (video) Generator (circuit theory) Image processing Quality (philosophy) Process (computing) Pattern recognition (psychology) Encoder

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
12
Refs
0.19
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Generative Adversarial Networks and Image Synthesis
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image and Signal Denoising Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.