Automatic character labeling for camera captured document images

Wei Fan; Koichi Kise; Masakazu Iwamura

doi:10.1109/icip.2016.7532967

JOURNAL ARTICLE

Year: 2016 Vol: 321 Pages: 3284-3288

Abstract

Character-level ground truth for camera-captured documents is crucial for training and evaluating advanced OCR algorithms, but generating it manually is time-consuming and costly. This paper proposes a robust ground-truth generation method for camera-captured documents based on document retrieval and image registration. We fit the captured document image with an elastic non-rigid alignment method, which relaxes the flat-paper assumption made by conventional solutions. The proposed method makes it possible to build very large-scale labeled datasets of camera-captured documents without any human intervention. We construct a large labeled dataset of 1 million camera-captured Chinese character images. An evaluation of samples generated by our approach showed that 99.99% of the images were correctly labeled, even under camera-specific distortions such as blur, specularity, and perspective distortion.
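
The abstract gives no implementation details, so the following is only a minimal sketch of the label-transfer idea behind registration-based ground truth. It assumes the flat-paper case (a single homography between the source page and the photo), which is the conventional assumption the paper says its elastic non-rigid alignment relaxes; it is not the authors' method. The function names and the `(char, x0, y0, x1, y1)` box format are hypothetical.

```python
import numpy as np


def estimate_homography(src, dst):
    """Direct linear transform: find H such that dst ~ H @ src (homogeneous).

    Needs at least four point correspondences, no three of them collinear.
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    rows = []
    for (x, y), (u, v) in zip(src, dst):
        rows.append([-x, -y, -1, 0, 0, 0, u * x, u * y, u])
        rows.append([0, 0, 0, -x, -y, -1, v * x, v * y, v])
    # The homography is the null vector of the stacked constraints,
    # i.e. the right singular vector with the smallest singular value.
    _, _, vt = np.linalg.svd(np.asarray(rows))
    h = vt[-1].reshape(3, 3)
    return h / h[2, 2]


def project_points(h, pts):
    """Apply homography h to an (N, 2) array of points."""
    pts = np.asarray(pts, dtype=float)
    homog = np.hstack([pts, np.ones((len(pts), 1))])
    mapped = homog @ h.T
    return mapped[:, :2] / mapped[:, 2:3]


def transfer_labels(h, labeled_boxes):
    """Map (char, x0, y0, x1, y1) boxes from the source page into the photo.

    Returns (char, corners) pairs, where corners is a (4, 2) array, because
    an axis-aligned box becomes a general quadrilateral under perspective.
    """
    out = []
    for char, x0, y0, x1, y1 in labeled_boxes:
        corners = [(x0, y0), (x1, y0), (x1, y1), (x0, y1)]
        out.append((char, project_points(h, corners)))
    return out
```

In a pipeline of this kind, the correspondences passed to `estimate_homography` would come from matching the photo against the retrieved source document, and a non-rigid refinement step would then be needed to handle curved or folded paper.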

Keywords:

Computer science; Artificial intelligence; Computer vision; Specularity; Character (mathematics); Perspective distortion; Perspective (graphical); Distortion (music); Optical character recognition; Image (mathematics); Mathematics

Topics

Handwritten Text Recognition Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Processing and 3D Reconstruction

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Automatic dewarping of camera-captured comic document images

Automatic dewarping of Camera Captured Born-Digital Bangla Document Images

Mosaicing of camera-captured document images

Restoring camera-captured distorted document images
