JOURNAL ARTICLE

Let sleeping files lie: pattern matching in Z-compressed files

Amihood AmirGary BensonMartı́n Farach-Colton

Year: 1994 Journal:   Symposium on Discrete Algorithms Pages: 705-714

Abstract

The current explosion of stored information necessitates a new model of pattern matching, that of compressed matching. In this model one tries to find all occurrences of a pattern in a compressed text in time proportional to the compressed text size, i.e., without decompressing the text. The most effective general purpose compression algorithms are adaptive, in that the text represented by each compression symbol is determined dynamically by the data. As a result, the encoding of a substring depends on its location. Thus the same substring may ``look different'' every time it appears in the compressed text. In this paper we consider pattern matching without decompression in the UNIX Z-compression. This is a variant of the Lempel Ziv adaptive compression scheme. If n is the length of the compressed text and m is the length of the pattern, our algorithms find the first pattern occurrence in time O(n+m) or O(n log m+m). We also introduce a new criterion to measure compressed matching algorithms, that of extra space. We show how to modify our algorithms to achieve a trade-off between the amount of extra space used and the algorithm's time complexity. ] 1996 Academic Press, Inc.

Keywords:
Substring Pattern matching Compression (physics) Computer science Algorithm Data compression Compressed suffix array Matching (statistics) Compression ratio Encoding (memory) Time complexity Huffman coding String searching algorithm Data structure Mathematics Artificial intelligence Statistics

Metrics

60
Cited By
4.54
FWCI (Field Weighted Citation Impact)
11
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Algorithms and Data Compression
Physical Sciences →  Computer Science →  Artificial Intelligence
semigroups and automata theory
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Network Packet Processing and Optimization
Physical Sciences →  Computer Science →  Hardware and Architecture

Related Documents

JOURNAL ARTICLE

Let Sleeping Files Lie: Pattern Matching in Z-Compressed Files

Amihood AmirGary BensonMartı́n Farach-Colton

Journal:   Journal of Computer and System Sciences Year: 1996 Vol: 52 (2)Pages: 299-307
JOURNAL ARTICLE

Pattern Matching in LZW Compressed Files

Tao TaoAnupam Mukherjee

Journal:   IEEE Transactions on Computers Year: 2005 Vol: 54 (8)Pages: 929-938
JOURNAL ARTICLE

Multiple-pattern matching for LZW compressed files

Tao TaoAmar Mukherjee

Year: 2005 Vol: 1645 Pages: 91-96 Vol. 1
JOURNAL ARTICLE

Multiple-Pattern Matching In LZW Compressed Files Using Aho-Corasick Algorithm

Tao TaoAmar Mukherjee

Journal:   Data Compression Conference Year: 2005 Pages: 482-482
BOOK-CHAPTER

COMPRESSED FILES

Year: 2004 Pages: 471-474
© 2026 ScienceGate Book Chapters — All rights reserved.