JOURNAL ARTICLE

Document image rectification method combined with semantic segmentation

Abstract

Proposed a method combined with semantic segmentation for document image rectification to address the limited applicability of traditional correction methods and difficulties in data annotation. Firstly, diverse experimental data are synthesized to create a target document image and its corresponding mask data. Secondly, a semantic segmentation model based on DeepLabV3 is constructed using pre-trained MobileNetV3, ResNet50, and ResNet101 as the backbone networks, respectively, to separate the document page area(ROI) from the image. Then, the document page area is corrected using corner point detection and perspective transformation to complete the document image rectification. The evaluation of the model shows that the DeepLabV3 model with MobileNetV3 as the backbone network has high processing efficiency, with an IOU of 0.983 and a Total Loss of 0.064 on the validation set. Test results demonstrate that the proposed method has better generalization capabilities and can be easily extended to practical engineering.

Keywords:
Computer science Computer vision Rectification Artificial intelligence Image segmentation Segmentation Image (mathematics) Engineering Electrical engineering

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
21
Refs
0.22
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Computational Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Color-Depth Combined Semantic Image Segmentation Method

Man-Joung KimHyun‐Soo Kang

Journal:   The Journal of the Korean Institute of Information and Communication Engineering Year: 2014 Vol: 18 (3)Pages: 687-696
JOURNAL ARTICLE

A combined method for multi-class image semantic segmentation

Chao GaoXin ZhangHui Wang

Journal:   IEEE Transactions on Consumer Electronics Year: 2012 Vol: 58 (2)Pages: 596-604
JOURNAL ARTICLE

Document Image Rectification Method Based on Lines Space

LUO Xiaoping

Journal:   DOAJ (DOAJ: Directory of Open Access Journals) Year: 2017
JOURNAL ARTICLE

Curved document image rectification

Dhanya M. DhanalakshmyHema P Menon

Year: 2017 Vol: 4 Pages: 783-786
© 2026 ScienceGate Book Chapters — All rights reserved.