Text Extraction from Document Images Using Edge Information

Sachin Grover; Kushal Arora; Suman K. Mitra

doi:10.1109/indcon.2009.5409409

ScienceGate Book Chapters

JOURNAL ARTICLE

Text Extraction from Document Images Using Edge Information

Sachin Grover Kushal Arora Suman K. Mitra

Year: 2009 Vol: 19 Pages: 1-4

DOI: 10.1109/indcon.2009.5409409

Get Full-Text PDF Get Analytical Report

Abstract

Detection of text from documents in which text is embedded in complex colored document images is a very challenging problem. There are a lot of potential uses of text extraction in image searching, archiving documents etc. In this paper, we propose a simple edge based feature to perform this task. It aims at detecting textual regions from the document and separating it from the graphics portion. The algorithm is based on the sharp edges of the characters which are missing in images. We find these edges and use them to classify text from images. This edge information can also be used for other image interpretation tasks.

Keywords:

Computer science Enhanced Data Rates for GSM Evolution Artificial intelligence Feature extraction Graphics Edge detection Document image processing Document layout analysis Task (project management) Feature (linguistics) Information retrieval Information extraction Image (mathematics) Pattern recognition (psychology) Interpretation (philosophy) Computer vision Image segmentation Image processing Computer graphics (images)

Metrics

Cited By

1.86

FWCI (Field Weighted Citation Impact)

Refs

0.90

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Retrieval and Classification Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Vehicle License Plate Recognition

Physical Sciences → Engineering → Media Technology

Text Extraction from Document Images Using Edge Information

Abstract

Metrics

Citation History

Topics

Related Documents

Text extraction from gray scale document images using edge information

Text extraction from degraded document images

Text extraction from graphical document images using sparse representation

Text Extraction from Document Images using CNN and LSTM

Text-line extraction from handwritten document images using GAN