Extracting table data from images using optical character recognition text

Mehmet Yasin Akpinar; Erdem Emekligil; Seçil Arslan

doi:10.1109/siu.2018.8404746

JOURNAL ARTICLE

Year: 2018 Pages: 1-4

Abstract

Optical character recognition (OCR) tools can convert image-based documents into digital, processable form quite successfully. However, preserving the format of the original document remains a problem, and reading tabular data is an important instance of it. This paper proposes a method that extracts the tabular content of hard-copy documents from the text and character positions returned by an OCR tool and transfers it to digital form. The method's performance is measured by the number of detected rows and columns and is presented alongside the results of other commercial products.
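
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea of rebuilding a table from OCR text and positions, not the authors' method. It assumes the OCR tool returns each word with a pixel bounding box; the `Word` type, the function names, and the `y_tol` and `min_gap` thresholds are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """One OCR token with its bounding box (hypothetical input format)."""
    text: str
    x0: float
    y0: float
    x1: float
    y1: float


def group_rows(words, y_tol=5.0):
    """Cluster words into rows by the vertical centre of their boxes."""
    rows = []
    for w in sorted(words, key=lambda w: (w.y0 + w.y1) / 2):
        cy = (w.y0 + w.y1) / 2
        if rows and abs(cy - rows[-1]["cy"]) <= y_tol:
            rows[-1]["words"].append(w)
        else:
            rows.append({"cy": cy, "words": [w]})
    # Within each row, order words left to right.
    return [sorted(r["words"], key=lambda w: w.x0) for r in rows]


def find_columns(words, min_gap=10.0):
    """Merge the horizontal extents of all words into column spans.

    A horizontal gap of at least min_gap starts a new column.
    """
    cols = []
    for x0, x1 in sorted((w.x0, w.x1) for w in words):
        if cols and x0 - cols[-1][1] < min_gap:
            cols[-1][1] = max(cols[-1][1], x1)
        else:
            cols.append([x0, x1])
    return cols


def to_table(words, y_tol=5.0, min_gap=10.0):
    """Assign every word to a (row, column) cell and join the cell text."""
    cols = find_columns(words, min_gap)
    table = []
    for row in group_rows(words, y_tol):
        cells = [[] for _ in cols]
        for w in row:
            cx = (w.x0 + w.x1) / 2
            idx = next(i for i, (a, b) in enumerate(cols) if a <= cx <= b)
            cells[idx].append(w.text)
        table.append([" ".join(c) for c in cells])
    return table


# Made-up sample data: a two-column table with a two-word cell.
sample = [
    Word("Item", 10, 10, 40, 20), Word("Price", 100, 10, 140, 20),
    Word("Green", 10, 30, 45, 40), Word("tea", 48, 30, 65, 40),
    Word("4.50", 100, 31, 130, 41),
]
print(to_table(sample))  # [['Item', 'Price'], ['Green tea', '4.50']]
```

The counts `len(table)` and `len(table[0])` correspond to the detected rows and columns that the abstract names as the evaluation measure. Fixed pixel thresholds are the simplest possible choice here; a practical system would likely scale them to the font size or page resolution.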

Keywords:

Optical character recognition; Character (mathematics); Computer science; Table (database); Row; Character recognition; Artificial intelligence; Reading (process); Information retrieval; Image (mathematics); Pattern recognition (psychology); Row and column spaces; Natural language processing; Computer graphics (images); Data mining; Database; Mathematics

Metrics

FWCI (Field Weighted Citation Impact): 0.58

Citation Normalized Percentile: 0.67

Topics

Handwritten Text Recognition Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Processing and 3D Reconstruction

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Retrieval and Classification Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Optical Character Recognition from Printed Text Images

An Optical Character Recognition Technique for Extracting Text from Blurred and Low-Quality Images Using TrOCR

Optical character recognition: transforming images into text

Optical Character Recognition from Images
