JOURNAL ARTICLE

<title>Color, complex document segmentation and compression</title>

Hei Tao FungKevin J. Parker

Year: 1997 Journal:   Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE Vol: 3027 Pages: 180-191   Publisher: SPIE

Abstract

We propose a novel segmentation algorithm called SMART for color, complex documents. It decomposes a document image into 'binarizable' and 'non-binarizable' components. The segmentation procedure includes color transformation, halftone texture suppression, subdivision of the image into 8 by 8 blocks, classification of the 8 by 8 blocks as 'active' or 'inactive', formation of macroblocks from the active blocks, and classification of the macroblocks as binarizable or non-binarizable. The classification processes involve the DCT coefficients and a histogram analysis. SMART is compared to three well-known segmentation algorithms: CRLA, RXYC, and SPACE. SMART can handle image components of various shapes, multiple backgrounds of different gray levels, different relative grayness of text to this background, tilted image components, and text of different gray levels. To compress the segmented image, we apply JPEG4 to the non-binarizable macroblocks and the Group 4 coding scheme to the binary image representing the binarizable macroblocks and to the bitmap storing the configuration of all macroblocks. Data about the representative gray values, the color information, and other descriptors of the binarizable macroblocks and the background regions are also sent to allow image reconstruction. The gain is using our compression algorithm over using JPEG for the whole image is significant. This gain increases as the proportion of the size of the subjects prefer the reconstructed images from our compression algorithm to those form the bitrate-matching JPEG images. In a series of test images, this document segmentation and compression system enables compression ratios two times to six times improved over standard methods.

Keywords:
Artificial intelligence JPEG Computer science Computer vision Pattern recognition (psychology) Image segmentation Image compression Data compression Segmentation Image texture Binary image Image processing Image (mathematics)

Metrics

5
Cited By
1.32
FWCI (Field Weighted Citation Impact)
0
Refs
0.80
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Color Science and Applications
Physical Sciences →  Physics and Astronomy →  Atomic and Molecular Physics, and Optics
Advanced Data Compression Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

<title>Color document analysis</title>

Chunghui KuoA. Ravishankar RaoGerry Thompson

Journal:   Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE Year: 2001 Vol: 4663 Pages: 72-80
JOURNAL ARTICLE

<title>Color image segmentation</title>

Kimberley A. McCraeDennis W. RuckSteven K. RogersMark E. Oxley

Journal:   Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE Year: 1994 Vol: 2243 Pages: 306-315
JOURNAL ARTICLE

<title>Segmentation of document images</title>

Philip J. BonesTodd C. GriffinChris M. Carey-Smith

Journal:   Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE Year: 1990 Vol: 1258 Pages: 78-88
JOURNAL ARTICLE

<title>Segmentation of scanned document images for efficient compression</title>

Hei Tao FungKevin J. Parker

Journal:   Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE Year: 1996 Vol: 2727 Pages: 701-712
JOURNAL ARTICLE

<title>Color image segmentation and color constancy</title>

Ruzena BajcsySang W. LeeAleš Leonardis

Journal:   Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE Year: 1990 Vol: 1250 Pages: 245-255
© 2026 ScienceGate Book Chapters — All rights reserved.