JOURNAL ARTICLE

A new approximate method for mining frequent itemsets from big data

Timur ValiullinZhexue HuangChenghao WeiJianfei YinDingming WuIuliia Egorova

Year: 2020 Journal:   Computer Science and Information Systems Vol: 18 (3)Pages: 641-656   Publisher: ComSIS Consortium

Abstract

Mining frequent itemsets in transaction databases is an important task in many applications. It becomes more challenging when dealing with a large transaction database because traditional algorithms are not scalable due to the limited main memory. In this paper, we propose a new approach for the approximately mining of frequent itemsets in a big transaction database. Our approach is suitable for mining big transaction databases since it uses the frequent itemsets from a subset of the entire database to approximate the result of the whole data, and can be implemented in a distributed environment. Our algorithm is able to efficiently produce high-accurate results, however it misses some true frequent itemsets. To address this problem and reduce the number of false negative frequent itemsets we introduce an additional parameter to the algorithm to discover most of the frequent itemsets contained in the entire data set. In this article, we show an empirical evaluation of the results of the proposed approach.

Keywords:
Computer science Database transaction Scalability Data mining Big data Set (abstract data type) Task (project management) Transaction data Database

Metrics

10
Cited By
1.42
FWCI (Field Weighted Citation Impact)
25
Refs
0.86
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data Mining Algorithms and Applications
Physical Sciences →  Computer Science →  Information Systems
Data Quality and Management
Social Sciences →  Decision Sciences →  Management Science and Operations Research
Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Mining Approximate Frequent Itemsets over Data Streams

Na SuZhe Hui WuJi Min LiuTai An LiuXin AnChang Qing Yan

Journal:   Applied Mechanics and Materials Year: 2014 Vol: 685 Pages: 536-539
BOOK-CHAPTER

An Approximate Approach for Mining Recently Frequent Itemsets from Data Streams

Jia-Ling KohShu-Ning Shin

Lecture notes in computer science Year: 2006 Pages: 352-362
BOOK-CHAPTER

Mining Frequent Itemsets from Uncertain Data

Chun-kit ChuiBen KaoEdward Hung

Lecture notes in computer science Year: 2007 Pages: 47-58
© 2026 ScienceGate Book Chapters — All rights reserved.