JOURNAL ARTICLE

Pattern-baseb content lossless compression of Chinese document images

Abstract

Compression of scanned text document images is important in modern document management, communications and retrieval systems. However, most existing compression techniques have been studied extensively only for documents in English or similar alphabet-based languages. In this paper, we purpose a content-lossless scheme for compression of Chinese text documents. This method utilizes the radical characteristics, unique to Chinese characters, to minimize the size of compressed documents. Our method consists of two main parts. The first part is the development of a radical pattern library. The second part is to utilize the radical pattern library to match character patterns in a document. The technique has been tested with many Chinese text document images with good results.

Keywords:
Lossless compression Computer science Alphabet Information retrieval Compression (physics) Data compression Document management system Artificial intelligence Chinese characters Character (mathematics) Scheme (mathematics) Natural language processing Mathematics Linguistics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
5
Refs
0.18
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Algorithms and Data Compression
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Data Compression Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Content-lossless document image compression based on structural analysis and pattern matching

Yibing YangHong YanDonggang Yu

Journal:   Pattern Recognition Year: 2000 Vol: 33 (8)Pages: 1277-1293
BOOK-CHAPTER

Two-Stage Lossy/Lossless Compression of Grayscale Document Images

Kris PopatDan S. Bloomberg

Kluwer Academic Publishers eBooks Year: 2005 Pages: 361-370
JOURNAL ARTICLE

Compression of Chinese document images based on morphologic analysis and pattern matching

Hong Yan

Journal:   Optical Engineering Year: 2006 Vol: 45 (10)Pages: 107001-107001
© 2026 ScienceGate Book Chapters — All rights reserved.