K-Means Clustering with Feature Selection for Stream Data

Xiaodong Wang; Rung-Ching Chen; Fei Yan; Hendry Hendry

doi:10.1109/is3c.2018.00120

JOURNAL ARTICLE

Year: 2018 Vol: 6 Pages: 453-456

Abstract

K-means clustering is popular for its efficiency and is often chosen for analyzing large-scale data. However, it struggles with high-dimensional data, which often contain many redundant features. In addition, real-world applications such as transport systems and social media usually produce massive data streams that are generated periodically in a high-dimensional space. Although existing K-means extensions have achieved great success on high-dimensional data by integrating dimension reduction methods, they are limited to offline data. To solve these problems, we propose a streaming K-means clustering method with feature selection. The proposed algorithm divides the traditional clustering procedure into multiple related clustering tasks and selects representative features using group sparsity regularization. Within this framework, the information shared among neighboring streams can also be properly exploited. Experimental results on several benchmark datasets demonstrate the effectiveness of the proposed model.
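To make the ideas in the abstract concrete, the following is a minimal sketch and not the authors' algorithm, whose objective and update rules are not given here. It assumes a simplified stand-in: each stream chunk is clustered with plain Lloyd iterations warm-started from the previous chunk's centroids (one simple way to share information between neighboring chunks), and features are scored by applying group soft-thresholding, the proximal operator of the l2,1 norm, to the rows of the centered centroid matrix. All function names, the synthetic data, and the values of `k` and `lam` are hypothetical.

```python
import math
import random

def dist2(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def lloyd(points, centroids, iters=10):
    """Plain Lloyd iterations starting from the given centroids."""
    k, d = len(centroids), len(points[0])
    for _ in range(iters):
        sums = [[0.0] * d for _ in range(k)]
        counts = [0] * k
        for p in points:
            j = min(range(k), key=lambda c: dist2(p, centroids[c]))
            counts[j] += 1
            for f in range(d):
                sums[j][f] += p[f]
        # Keep the old centroid if a cluster received no points.
        centroids = [
            [s / counts[j] for s in sums[j]] if counts[j] else centroids[j]
            for j in range(k)
        ]
    return centroids

def group_soft_threshold(rows, lam):
    """Proximal operator of lam * l2,1 norm: shrink each row by its l2 norm."""
    out = []
    for row in rows:
        norm = math.sqrt(sum(v * v for v in row))
        scale = max(0.0, 1.0 - lam / norm) if norm > 0 else 0.0
        out.append([scale * v for v in row])
    return out

def select_features(points, centroids, lam):
    """Indices of features whose centered-centroid row survives shrinkage."""
    n, d = len(points), len(points[0])
    mean = [sum(p[f] for p in points) / n for f in range(d)]
    # Row f holds feature f's centroid offsets from the chunk mean, one per cluster.
    rows = [[c[f] - mean[f] for c in centroids] for f in range(d)]
    shrunk = group_soft_threshold(rows, lam)
    return [f for f, row in enumerate(shrunk) if any(v != 0.0 for v in row)]

def make_chunk(rng, n=200):
    """Synthetic chunk (assumption): features 0, 1 separate two clusters; 2, 3 are noise."""
    chunk = []
    for i in range(n):
        sign = 3.0 if i % 2 == 0 else -3.0
        chunk.append([sign + rng.gauss(0, 1), sign + rng.gauss(0, 1),
                      rng.gauss(0, 1), rng.gauss(0, 1)])
    return chunk

def cluster_stream(chunks, k=2, lam=1.0):
    """Cluster each chunk, warm-starting from the previous chunk's centroids."""
    centroids, results = None, []
    for chunk in chunks:
        if centroids is None:
            centroids = [list(p) for p in chunk[:k]]  # naive init, illustration only
        centroids = lloyd(chunk, centroids)
        results.append(select_features(chunk, centroids, lam))
    return centroids, results

rng = random.Random(0)
final_centroids, selected = cluster_stream([make_chunk(rng) for _ in range(3)])
print(selected)  # one list of retained feature indices per chunk
```

On this synthetic stream, the rows for the two noise features have small norms and should be shrunk to exactly zero, while the two informative features are retained. A joint formulation, as the abstract suggests, would instead learn the clustering and the sparse feature weights together across chunks rather than scoring features after the fact.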

Keywords:

Cluster analysis; Computer science; Data stream clustering; Data mining; Clustering high-dimensional data; Benchmark (surveying); CURE data clustering algorithm; Feature selection; Data stream mining; Selection (genetic algorithm); Correlation clustering; Regularization (linguistics); Canopy clustering algorithm; Affinity propagation; Data stream; Artificial intelligence

Metrics

FWCI (Field Weighted Citation Impact): 0.20

Citation Normalized Percentile: 0.62

Topics

Advanced Clustering Algorithms Research

Physical Sciences → Computer Science → Artificial Intelligence

Data Stream Mining Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Face and Expression Recognition

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Deterministic Feature Selection for K-Means Clustering

K-means clustering based filter feature selection on high dimensional data

Feature Selection of Interval Valued Data Through Interval K-Means Clustering

Particle Swarm Optimization with K-Means for Simultaneous Feature Selection and Data Clustering

Modified K-Means Clustering Algorithms for Feature Selection