JOURNAL ARTICLE

K-Means Clustering with Feature Selection for Stream Data

Abstract

K-means clustering is popular for its efficiency and is often chosen for analyzing large-scale data. However, it is hard to deal with high-dimensional data, which often contain lots of redundant features. In addition, in real-world applications, we usually confront with massive data streams, such as transport system and social media, which are often periodically generated in high-dimensional space. Although existing K-means extensions have achieved great success on high-dimensional data by integrating with dimension reduction methods, they are limited to off-line data. To solve these problems, we propose a streaming Kmeans clustering with feature selection. The proposed algorithm divides the traditional clustering procedure into several related multiple clustering tasks and selects the representative features by the group sparsity regularization technique. Besides, within such framework, the shared information among neighbor streams can be properly explored. Experimental results on several benchmark datasets demonstrate the effectiveness of the proposed model.

Keywords:
Cluster analysis Computer science Data stream clustering Data mining Clustering high-dimensional data Benchmark (surveying) CURE data clustering algorithm Feature selection Data stream mining Selection (genetic algorithm) Correlation clustering Regularization (linguistics) Canopy clustering algorithm Affinity propagation Data stream Artificial intelligence

Metrics

2
Cited By
0.20
FWCI (Field Weighted Citation Impact)
10
Refs
0.62
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Deterministic Feature Selection for K-Means Clustering

Christos BoutsidisMalik Magdon-Ismail

Journal:   IEEE Transactions on Information Theory Year: 2013 Vol: 59 (9)Pages: 6099-6110
JOURNAL ARTICLE

K-means clustering based filter feature selection on high dimensional data

Dewi Pramudi IsmiShireen PanchooMurinto Murinto

Journal:   International Journal of Advances in Intelligent Informatics Year: 2016 Vol: 2 (1)Pages: 38-38
JOURNAL ARTICLE

Feature Selection of Interval Valued Data Through Interval K-Means Clustering

D. S. GuruN. Vinay KumarMahamad Suhil

Journal:   International Journal of Computer Vision and Image Processing Year: 2017 Vol: 7 (2)Pages: 64-80
© 2026 ScienceGate Book Chapters — All rights reserved.