JOURNAL ARTICLE

Detecting Scene Text with Principal Component Analysis Enhanced Image Gradient Auto Encoding

Abstract

Text is rich in information. Scene text detection is still a challenging problem of machine vision due to variations such as script, font, color, scale, lighting, angle of view and other distortions present in the scene. Scene text reading generally requires high-performance computation platform, large training dataset and longer training process. We have attempted to train our auto encoder based text detector to precisely localize text with minimum training on a small dataset and limited computational resources. The idea involves computation of principal component analysis of image, morphological gradient to enhance text on the scene image and to feed it to a gradient auto encoder neural network to locate possible text components. Scripts belonging to multiple languages can be detect by the proposed detector and it is fairly robust against the variations such as color, lighting, scale, orientation and font. The proposed method is trained with only 167 training images of MRRC dataset. Experiments show that the method achieves an F-measure of 0.76 and 0.77 on MRRC dataset and MSRA-TD500 dataset respectively.

Keywords:
Computer science Artificial intelligence Pattern recognition (psychology) Principal component analysis Computer vision Encoding (memory) Computation Orientation (vector space) Encoder Scripting language Detector Algorithm Mathematics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
43
Refs
0.16
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

BOOK-CHAPTER

Scene Text Detection with Gradient Auto Encoders

S. RaveeshwaraB. H. Shekar

Communications in computer and information science Year: 2023 Pages: 350-361
JOURNAL ARTICLE

Auto-encoding and Distilling Scene Graphs for Image Captioning

Xu YangHanwang ZhangJianfei Cai

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2020 Vol: 44 (5)Pages: 1-1
JOURNAL ARTICLE

Gradient algorithms for principal component analysis

Robert MahonyUwe HelmkeJ.B. Moore

Journal:   The Journal of the Australian Mathematical Society Series B Applied Mathematics Year: 1996 Vol: 37 (4)Pages: 430-450
© 2026 ScienceGate Book Chapters — All rights reserved.