JOURNAL ARTICLE

Extracting table data from images using optical character recognition text

Abstract

The conversion of image-based documents into digital and processible forms can be accomplished quite successfully with optical character recognition (OCR) tools. However, there are still problems with preserving the format on the original document. An important one of these problems is the reading of the tabular data. In this paper, a method is proposed in which the tabular data contents of hard-copy documents is extracted from the text and character positions which are obtained from an OCR tool and transferred to digital forms. The performance of the method is measured by the number of detected rows and columns and presented with the results of other commercial products.

Keywords:
Optical character recognition Character (mathematics) Computer science Table (database) Row Character recognition Artificial intelligence Reading (process) Information retrieval Image (mathematics) Pattern recognition (psychology) Row and column spaces Natural language processing Computer graphics (images) Data mining Database Mathematics

Metrics

6
Cited By
0.58
FWCI (Field Weighted Citation Impact)
13
Refs
0.67
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Optical Character Recognition from Printed Text Images

T. Kameswara RaoK. Yashwanth ChowdaryI. Koushik ChowdaryK. Prasanna KumarCh. Ramesh

Journal:   International Journal of Scientific Research in Computer Science Engineering and Information Technology Year: 2019 Pages: 597-604
JOURNAL ARTICLE

Optical Character Recognition from Images

AngelJean Jisha .MVijayalakshmi Shivkhumar

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2024
JOURNAL ARTICLE

Optical Character Recognition from Images

AngelJean Jisha .MVijayalakshmi Shivkhumar

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2024
© 2026 ScienceGate Book Chapters — All rights reserved.