Detecting Scene Text with Principal Component Analysis Enhanced Image Gradient Auto Encoding

S. Raveeshwara; B. H. Shekar

doi:10.1109/aide57180.2022.10060375

ScienceGate Book Chapters

JOURNAL ARTICLE

Detecting Scene Text with Principal Component Analysis Enhanced Image Gradient Auto Encoding

S. Raveeshwara B. H. Shekar

Year: 2022 Vol: 12 Pages: 112-116

DOI: 10.1109/aide57180.2022.10060375

Get Full-Text PDF Get Analytical Report

Abstract

Text is rich in information. Scene text detection is still a challenging problem of machine vision due to variations such as script, font, color, scale, lighting, angle of view and other distortions present in the scene. Scene text reading generally requires high-performance computation platform, large training dataset and longer training process. We have attempted to train our auto encoder based text detector to precisely localize text with minimum training on a small dataset and limited computational resources. The idea involves computation of principal component analysis of image, morphological gradient to enhance text on the scene image and to feed it to a gradient auto encoder neural network to locate possible text components. Scripts belonging to multiple languages can be detect by the proposed detector and it is fairly robust against the variations such as color, lighting, scale, orientation and font. The proposed method is trained with only 167 training images of MRRC dataset. Experiments show that the method achieves an F-measure of 0.76 and 0.77 on MRRC dataset and MSRA-TD500 dataset respectively.

Keywords:

Computer science Artificial intelligence Pattern recognition (psychology) Principal component analysis Computer vision Encoding (memory) Computation Orientation (vector space) Encoder Scripting language Detector Algorithm Mathematics

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.16

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Handwritten Text Recognition Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Vehicle License Plate Recognition

Physical Sciences → Engineering → Media Technology

Image Processing and 3D Reconstruction

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Detecting Scene Text with Principal Component Analysis Enhanced Image Gradient Auto Encoding

Abstract

Metrics

Topics

Related Documents

Scene Text Detection with Gradient Auto Encoders

Auto-Encoding Scene Graphs for Image Captioning

Principal component analysis of image gradient orientations for face recognition

Auto-encoding and Distilling Scene Graphs for Image Captioning

Gradient algorithms for principal component analysis