Document image rectification method combined with semantic segmentation

YongWei Wang; Shuangshuang Xu; Yao Xiao; YiGuang Yang; Hao Li; RongZheng Yang

doi:10.1109/mlbdbi60823.2023.10481989

JOURNAL ARTICLE

Year: 2023; Vol: 36; Pages: 307-314

Abstract

This paper proposes a document image rectification method combined with semantic segmentation to address the limited applicability of traditional correction methods and the difficulty of data annotation. First, diverse experimental data are synthesized to create target document images and their corresponding mask data. Second, a semantic segmentation model based on DeepLabV3 is constructed, using pre-trained MobileNetV3, ResNet50, and ResNet101 in turn as the backbone network, to separate the document page area (ROI) from the image. The document page area is then corrected using corner point detection and a perspective transformation to complete the rectification. Model evaluation shows that DeepLabV3 with a MobileNetV3 backbone has high processing efficiency, with an IoU of 0.983 and a total loss of 0.064 on the validation set. Test results demonstrate that the proposed method has better generalization capability and can be easily extended to practical engineering applications.
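The post-segmentation steps named in the abstract (IoU evaluation of the page mask, corner ordering, and perspective transformation) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `iou`, `order_corners`, `homography`, and `apply_h` are hypothetical, the segmentation model itself is omitted, and the four corner points are assumed to have already been detected on the predicted mask. The corner-ordering heuristic also assumes the page is not rotated by much more than about 45 degrees.

```python
import numpy as np


def iou(pred: np.ndarray, target: np.ndarray) -> float:
    """Intersection over union of two binary masks of the same shape."""
    pred, target = pred.astype(bool), target.astype(bool)
    union = np.logical_or(pred, target).sum()
    if union == 0:
        return 1.0  # both masks empty: treated as perfect agreement (a convention)
    return float(np.logical_and(pred, target).sum() / union)


def order_corners(pts) -> np.ndarray:
    """Order 4 (x, y) points as top-left, top-right, bottom-right, bottom-left.

    Uses image coordinates (y grows downward).
    """
    pts = np.asarray(pts, dtype=float)
    s = pts.sum(axis=1)        # x + y: smallest at top-left, largest at bottom-right
    d = pts[:, 1] - pts[:, 0]  # y - x: smallest at top-right, largest at bottom-left
    return np.array([pts[s.argmin()], pts[d.argmin()],
                     pts[s.argmax()], pts[d.argmax()]])


def homography(src, dst) -> np.ndarray:
    """Solve for the 3x3 perspective matrix mapping 4 src points onto 4 dst points."""
    rows, rhs = [], []
    for (x, y), (u, v) in zip(src, dst):
        # u = (h11*x + h12*y + h13) / (h31*x + h32*y + 1), and likewise for v
        rows.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        rhs.append(u)
        rows.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        rhs.append(v)
    h = np.linalg.solve(np.array(rows, dtype=float), np.array(rhs, dtype=float))
    return np.append(h, 1.0).reshape(3, 3)


def apply_h(H: np.ndarray, pts) -> np.ndarray:
    """Apply a perspective matrix to an array of (x, y) points."""
    pts = np.asarray(pts, dtype=float)
    homog = np.hstack([pts, np.ones((len(pts), 1))]) @ H.T
    return homog[:, :2] / homog[:, 2:3]


# Hypothetical detected page corners, in arbitrary order.
corners = order_corners([[320, 40], [30, 60], [350, 420], [10, 400]])
width, height = 300, 400  # assumed size of the rectified page
target = np.array([[0, 0], [width - 1, 0],
                   [width - 1, height - 1], [0, height - 1]], dtype=float)
H = homography(corners, target)
```

In practice the matrix would usually be obtained and applied to the image with OpenCV's `cv2.getPerspectiveTransform` and `cv2.warpPerspective`; the explicit linear system here only shows what those calls compute.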

Keywords:

Computer science; Computer vision; Rectification; Artificial intelligence; Image segmentation; Segmentation; Image (mathematics); Engineering; Electrical engineering

Topics

Image Processing and 3D Reconstruction

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Retrieval and Classification Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Computational Techniques and Applications

Physical Sciences → Computer Science → Artificial Intelligence
