Rectification and 3D reconstruction of curved document images

Yuandong Tian; Srinivasa G. Narasimhan

doi:10.1109/cvpr.2011.5995540

ScienceGate Book Chapters

JOURNAL ARTICLE

Rectification and 3D reconstruction of curved document images

Yuandong Tian Srinivasa G. Narasimhan

Year: 2011

DOI: 10.1109/cvpr.2011.5995540

Get Full-Text PDF Get Analytical Report

Abstract

Distortions in images of documents, such as the pages of books, adversely affect the performance of optical charac-ter recognition (OCR) systems. Removing such distortions requires the 3D deformation of the document that is often measured using special and precisely calibrated hardware (stereo, laser range scanning or structured light). In this paper, we introduce a new approach that automatically re-constructs the 3D shape and rectifies a deformed text doc-ument from a single image. We first estimate the 2D distor-tion grid in an image by exploiting the line structure and stroke statistics in text documents. This approach does not rely on more noise-sensitive operations such as image bina-rization and character segmentation. The regularity in the text pattern is used to constrain the 2D distortion grid to be a perspective projection of a 3D parallelogram mesh. Based on this constraint, we present a new shape-from-texture method that computes the 3D deformation up to a scale factor using SVD. Unlike previous work, this formu-lation imposes no restrictions on the shape (e.g., a devel-opable surface). The estimated shape is then used to re-move both geometric distortions and photometric (shading) effects in the image. We demonstrate our techniques on doc-uments containing a variety of languages, fonts and sizes. 1.

Keywords:

Computer science Artificial intelligence Computer vision Distortion (music) Photometric stereo Segmentation Perspective distortion Optical character recognition Projection (relational algebra) 3D reconstruction Grid Structured light Computer graphics (images) Image (mathematics) Geometry Mathematics Algorithm

Metrics

Cited By

2.56

FWCI (Field Weighted Citation Impact)

Refs

0.91

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Vision and Imaging

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Optical measurement and interference techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Computer Graphics and Visualization Techniques

Physical Sciences → Computer Science → Computer Graphics and Computer-Aided Design

Rectification and 3D reconstruction of curved document images

Abstract

Metrics

Citation History

Topics

Related Documents

Metric Rectification of Curved Document Images

Rectification of curved document images based on single view three-dimensional reconstruction

Curved document image rectification

Active Rectification of Curved Document Images Using Structured Beams

Geometric Rectification of Camera-Captured Document Images