JOURNAL ARTICLE

Automatic character labeling for camera captured document images

Abstract

Character groundtruth for camera captured documents is crucial for training and evaluating advanced OCR algorithms. Manually generating character level groundtruth is a time consuming and costly process. This paper proposes a robust groundtruth generation method based on document retrieval and image registration for camera captured documents. We use an elastic non-rigid alignment method to fit the captured document image which relaxes the flat paper assumption made by conventional solutions. The proposed method allows building very large scale labeled camera captured documents dataset, without any human intervention. We construct a large labeled dataset consisting of 1 million camera captured Chinese character images. Evaluation of samples generated by our approach showed that 99.99% of the images were correctly labeled, even with different distortions specific to cameras such as blur, specularity and perspective distortion.

Keywords:
Computer science Artificial intelligence Computer vision Specularity Character (mathematics) Perspective distortion Perspective (graphical) Distortion (music) Optical character recognition Image (mathematics) Mathematics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
14
Refs
0.13
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Automatic dewarping of camera-captured comic document images

Arpan GaraiArpita DuttaSamit Biswas

Journal:   Multimedia Tools and Applications Year: 2022 Vol: 82 (1)Pages: 1537-1552
JOURNAL ARTICLE

Mosaicing of camera-captured document images

Jian LiangDaniel DeMenthonDavid Doermann

Journal:   Computer Vision and Image Understanding Year: 2008 Vol: 113 (4)Pages: 572-579
JOURNAL ARTICLE

Restoring camera-captured distorted document images

Changsong LiuYu ZhangBaokang WangXiaoqing Ding

Journal:   International Journal on Document Analysis and Recognition (IJDAR) Year: 2014 Vol: 18 (2)Pages: 111-124
JOURNAL ARTICLE

Restoring camera-captured distorted document images

LiuChangsongZhangyuWangBaokangDINGXiao--qing

Journal:   International Journal on Document Analysis and Recognition (IJDAR) Year: 2015
© 2026 ScienceGate Book Chapters — All rights reserved.