JOURNAL ARTICLE

Foreground and Text-lines Aware Document Image Rectification

Abstract

This paper aims at the distorted document image rectification problem, the objective to eliminate the geometric distortion in the document images and realize document intelligence. Improving the readability of distorted documents is crucial to effectively extract information from deformed images. According to our observations, the foreground and text-line of the original warped image can represent the deformation tendency. However, previous distorted image rectification methods pay little attention to the readability of the warped paper. In this paper, we focus on the foreground and text-line regions of distorted paper and proposes a global and local fusion method to improve the rectification effect of distorted images and enhance the readability of document images. We introduce cross attention to capture the features of the foreground and text-lines in the warped document and effectively fuse them. The proposed method is evaluated quantitatively and qualitatively on the public DocUNet benchmark and DIR300 Dataset, which achieve state-of-the-art performances. Experimental analysis shows the proposed method can well perform overall geometric rectification of distorted images and effectively improve document readability (using the metrics of Character Error Rate and Edit Distance). The code is available at https://github.com/xiaomore/Document-Image-Dewarping.

Keywords:
Readability Computer science Image rectification Distortion (music) Artificial intelligence Rectification Benchmark (surveying) Computer vision Image (mathematics) Line (geometry) Focus (optics) Information retrieval Mathematics

Metrics

11
Cited By
2.00
FWCI (Field Weighted Citation Impact)
59
Refs
0.85
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image and Object Detection Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing Techniques and Applications
Physical Sciences →  Engineering →  Media Technology

Related Documents

JOURNAL ARTICLE

Document Image Rectification Method Based on Lines Space

LUO Xiaoping

Journal:   DOAJ (DOAJ: Directory of Open Access Journals) Year: 2017
JOURNAL ARTICLE

Multi‐scale document image rectification utilising text‐features

Riming SunShengfa WangLin JiZhenyu Wang

Journal:   Electronics Letters Year: 2018 Vol: 54 (8)Pages: 502-503
JOURNAL ARTICLE

Curved document image rectification

Dhanya M. DhanalakshmyHema P Menon

Year: 2017 Vol: 4 Pages: 783-786
JOURNAL ARTICLE

Deep Unrestricted Document Image Rectification

Hao FengShaokai LiuJiajun DengWengang ZhouHouqiang Li

Journal:   IEEE Transactions on Multimedia Year: 2023 Vol: 26 Pages: 6142-6154
© 2026 ScienceGate Book Chapters — All rights reserved.