JOURNAL ARTICLE

SEGMENTATION OF NON-TEXT FROM BILINGUAL REAL-TIME OFFICE DOCUMENT IMAGES USING U-NET ARCHITECTURE

SHIVAKUMAR GRAVIKUMAR MSAMPATHKUMAR SSHIVAPRASAD B J

Year: 2022 Journal:   Zenodo (CERN European Organization for Nuclear Research)   Publisher: European Organization for Nuclear Research

Abstract

Abstract In this work, we have presented an efficient approach for segmentation of non-text document information from real time office document images which are bilingual using a machine learning approach i.e., U-net architecture for experimentation purpose. We have created our own dataset containing 200 document images. Initially pre-processing is applied on the input document images proposed method is compared with other existing methods and obtained accuracy of 99% different performance measure i.e., (Specificity, Sensitivity, Precision, F1-Score) used in the experimentation.

Keywords:
Segmentation Measure (data warehouse) Architecture Image segmentation Historical document Document image processing Pattern recognition (psychology)

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.22
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.