JOURNAL ARTICLE

Robust Document Image Binarization Technique for Degraded Document Images

Bolan SuShijian LuChew Lim Tan

Year: 2012 Journal:   IEEE Transactions on Image Processing Vol: 22 (4)Pages: 1408-1417   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Segmentation of text from badly degraded document images is a very challenging task due to the high inter/intra-variation between the document background and the foreground text of different document images. In this paper, we propose a novel document image binarization technique that addresses these issues by using adaptive image contrast. The adaptive image contrast is a combination of the local image contrast and the local image gradient that is tolerant to text and background variation caused by different types of document degradations. In the proposed technique, an adaptive contrast map is first constructed for an input degraded document image. The contrast map is then binarized and combined with Canny's edge map to identify the text stroke edge pixels. The document text is further segmented by a local threshold that is estimated based on the intensities of detected text stroke edge pixels within a local window. The proposed method is simple, robust, and involves minimum parameter tuning. It has been tested on three public datasets that are used in the recent document image binarization contest (DIBCO) 2009 & 2011 and handwritten-DIBCO 2010 and achieves accuracies of 93.5%, 87.8%, and 92.03%, respectively, that are significantly higher than or close to that of the best-performing methods reported in the three contests. Experiments on the Bickley diary dataset that consists of several challenging bad quality document images also show the superior performance of our proposed method, compared with other techniques.

Keywords:
Artificial intelligence Computer science Pixel Historical document Contrast (vision) Pattern recognition (psychology) Computer vision Image (mathematics) Segmentation Image segmentation Image gradient Enhanced Data Rates for GSM Evolution Image texture

Metrics

277
Cited By
17.42
FWCI (Field Weighted Citation Impact)
45
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Handwritten Text Recognition Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Vehicle License Plate Recognition
Physical Sciences →  Engineering →  Media Technology
Image and Object Detection Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Document Image Binarization Technique for Degraded Document Images

Supriya Sunil LokhandeNitin Ashok Dawande

Journal:   International Journal of Computer Applications Year: 2015 Vol: 122 (22)Pages: 22-29
JOURNAL ARTICLE

Simple and Efficient Document Image Binarization Technique For Degraded Document Images

M. Kezia JosephJijina K.P Jijina

Journal:   International Journal of Scientific Research Year: 2012 Vol: 3 (5)Pages: 217-220
JOURNAL ARTICLE

Image Binarization for Degraded Document Images

N. SushilkumarB. UlhasS. R. Bhagyashree

Journal:   International Journal of Computer Applications Year: 2015 Vol: 128 (15)Pages: 38-43
© 2026 ScienceGate Book Chapters — All rights reserved.