JOURNAL ARTICLE

Finding Maximal Frequent Itemsets over Online Data Streams Adaptively

Abstract

Due to the characteristics of a data stream, it is very important to confine the memory usage of a data mining process regardless of the amount of information generated in the data stream. For this purpose, this paper proposes a CP-tree (compressed-prefix tree) that can be effectively used in finding either frequent or maximal frequent itemsets over an online data stream. Unlike a prefix tree, a node of a CP-tree can maintain the information of several item-sets together. Based on this characteristic, the size of a CP-tree can be flexibly controlled by merging or splitting nodes. In this paper, a mining method employing a CP-tree is proposed and an adaptive memory utilization scheme is also presented in order to maximize the mining accuracy of the proposed method for confined memory space at all times. Finally, the performance of the proposed method is analyzed by a series of experiments to identify its various characteristics.

Keywords:
Computer science Prefix Data mining Data stream mining Tree (set theory) Trie Data stream Node (physics) Process (computing) Scheme (mathematics) Tree structure Data structure Algorithm Binary tree Mathematics

Metrics

69
Cited By
17.22
FWCI (Field Weighted Citation Impact)
17
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data Mining Algorithms and Applications
Physical Sciences →  Computer Science →  Information Systems
Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Rough Sets and Fuzzy Logic
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
© 2026 ScienceGate Book Chapters — All rights reserved.