JOURNAL ARTICLE

Online Clustering of Evolving Data Streams Using a Density Grid-Based Method

Abstract

In recent years, a significant boost in data availability for persistent data streams has been observed.These data streams are continually evolving, with the clusters frequently forming arbitrary shapes instead of regular shapes in the data space.This characteristic leads to an exponential increase in the processing time of traditional clustering algorithms for data streams.In this study, we propose a new online method, which is a density grid-based method for data stream clustering.The primary objectives of the density grid-based method are to reduce the number of distant function calls and to improve the cluster quality.The method is conducted entirely online and consists of two main phases.The first phase generates the Core Micro-Clusters (CMCs), and the second phase combines the CMCs into macro clusters.The grid-based method was utilized as an outlier buffer in order to handle multi-density data and noises.The method was tested on real and synthetic data streams employing different quality metrics and was compared with the popular method of clustering evolving data streams into arbitrary shapes.The proposed method was demonstrated to be an effective solution for reducing the number of calls to the distance function and improving the cluster quality.

Keywords:
Cluster analysis Data stream mining Data stream Data stream clustering Outlier Function (biology) Anomaly detection Cluster (spacecraft) STREAMS

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.41
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Time Series Analysis and Forecasting
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Grid Density Based Evolving Clustering Algorithm for Data Streams

Dan LiWenjuan AnJianyi ZhangXi OuyangShoushan Luo -Xin Yang

Journal:   International Journal of Advancements in Computing Technology Year: 2012 Vol: 4 (9)Pages: 54-63
JOURNAL ARTICLE

Probability Density Grid-based Online Clustering for Uncertain Data Streams

Haitao HeLijuan ChenJiadong RenWenyan Guo

Journal:   INTERNATIONAL JOURNAL ON Advances in Information Sciences and Service Sciences Year: 2011 Vol: 3 (8)Pages: 204-211
JOURNAL ARTICLE

Incremental density-based ensemble clustering over evolving data streams

Imran KhanJoshua Zhexue HuangKamen Ivanov

Journal:   Neurocomputing Year: 2016 Vol: 191 Pages: 34-43
© 2026 ScienceGate Book Chapters — All rights reserved.