JOURNAL ARTICLE

Improving Association Rule Mining Using Clustering-based Discretization of Numerical Data

Swee Chuan Tan

Year: 2018 Journal:   2018 International Conference on Intelligent and Innovative Computing Applications (ICONIC) Pages: 1-5

Abstract

Association rule mining is an important data mining technique that help discover interesting attribute relationships that are useful for decision making. Most association rule mining methods use item-set manipulation approach, whereby data type must be categorical in nature. When a dataset contains numerical attributes, they will need to be discretized before rule mining. At the moment, most unsupervised data discretization methods do not account for data distributions, and users have to try different methods and discretization settings in order to improve rule mining results. In this paper, we propose using TwoStep clustering for data discretization. Unlike simple discretization methods, TwoStep automatically determines the discretization intervals by taking into account the unique data distribution property of each attribute. In our experiments, we evaluated the performance of Apriori algorithm based on four datasets, whereby each dataset was pre-processed using TwoStep and three other commonly used discretization methods. Our results show that TwoStep produced the greatest number of high-quality rules, as compared to common discretization methods.

Keywords:
Discretization Association rule learning Cluster analysis Discretization of continuous features Data mining Categorical variable Apriori algorithm Computer science A priori and a posteriori Set (abstract data type) Moment (physics) Knowledge extraction Machine learning Mathematics Discretization error

Metrics

15
Cited By
3.69
FWCI (Field Weighted Citation Impact)
17
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data Mining Algorithms and Applications
Physical Sciences →  Computer Science →  Information Systems
Rough Sets and Fuzzy Logic
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Improving Association Rule Mining Using Clustering-Based Data Mining Model for Traffic Accidents

Mohamad Mohamad ShamieMuhammad Mazen Almustafa

Journal:   Review of Computer Engineering Studies Year: 2021 Vol: 8 (3)Pages: 65-70
JOURNAL ARTICLE

Disease prediction in data mining using association rule mining and keyword based clustering algorithms

Savitha RamasamyK. Nirmala

Journal:   International Journal of Computers and Applications Year: 2017 Vol: 42 (1)Pages: 1-8
JOURNAL ARTICLE

Improving RDF Data Through Association Rule Mining

Ziawasch AbedjanFelix Naumann

Journal:   Datenbank-Spektrum Year: 2013 Vol: 13 (2)Pages: 111-120
© 2026 ScienceGate Book Chapters — All rights reserved.