JOURNAL ARTICLE

Research on parallelization of Apriori algorithm in association rule mining

Huanbin WangYangjun Gao

Year: 2021 Journal:   Procedia Computer Science Vol: 183 Pages: 641-647   Publisher: Elsevier BV

Abstract

Aiming at the performance bottleneck of traditional Apriori algorithm when the data set is slightly large, this paper adopts the idea of parallelization and improves the Apriori algorithm based on MapReduce model. Firstly, the local frequent itemsets on each sub node in the cluster are calculated, then all the local frequent itemsets are merged into the global candidate itemsets, and finally, the frequent itemsets that meet the conditions are filtered according to the minimum support threshold. The advantage of the improved algorithm is that it only needs to scan the transaction database twice and calculate the frequent item set in parallel, which improves the efficiency of the algorithm.

Keywords:
Apriori algorithm Computer science Association rule learning Bottleneck Data mining Database transaction A priori and a posteriori Set (abstract data type) Node (physics) Algorithm Cluster (spacecraft) Transaction data Database

Metrics

42
Cited By
10.77
FWCI (Field Weighted Citation Impact)
12
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data Mining Algorithms and Applications
Physical Sciences →  Computer Science →  Information Systems
Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Big Data and Business Intelligence
Social Sciences →  Business, Management and Accounting →  Management Information Systems
© 2026 ScienceGate Book Chapters — All rights reserved.