JOURNAL ARTICLE

Constraint-based discriminative dimension selection for high-dimensional stream clustering

Kitsana WaiyamaiThanapat Kangkachit

Year: 2018 Journal:   International Journal of Advances in Intelligent Informatics Vol: 4 (3)Pages: 167-167   Publisher: Ahmad Dahlan University

Abstract

Clustering data streams is one of active research topic in data mining. However, runtime of the existing stream clustering algorithms increases and their performance drop in the face of large number of dimensions. Complexity of the stream clustering methods is increased when perform on data with large number of dimensions. In order to reduce the clustering complexity, one possible solution consists in determining the appropriate subset of cluster dimensions via dimension projection. SED-Stream is an efficient clustering algorithm that supports high dimension data streams. The aim of this paper is to increase performance of SED-Stream in terms of both clustering quality and execution-time. In order to improve the clustering process, background or domain expert knowledge are integrated as “constraints” in SEDC-Stream. The new algorithm, SEDC-Stream, supports the evolving characteristics of the dynamic constraints which are activation, fading, outdating and prioritization. SEDC-Stream algorithm is able to reduce cluster splitting time, and place new incoming points to their suitable clusters. Compared to SED-Stream on the three real-world streams datasets, SEDC-Stream is able to generate a better clustering performance in terms of both purity and f-measure.

Keywords:
Cluster analysis Data stream clustering Computer science Data mining Constrained clustering Data stream mining CURE data clustering algorithm Clustering high-dimensional data Data stream Dimension (graph theory) Correlation clustering Artificial intelligence Mathematics

Metrics

4
Cited By
0.40
FWCI (Field Weighted Citation Impact)
31
Refs
0.68
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.