Improved K‐means algorithm for clustering non‐spherical data

Honglei He; Yuxuan He; Wang Fang; Wenming Zhu

doi:10.1111/exsy.13062

ScienceGate Book Chapters

JOURNAL ARTICLE

Improved K‐means algorithm for clustering non‐spherical data

Honglei He Yuxuan He Wang Fang Wenming Zhu

Year: 2022 Journal: Expert Systems Vol: 39 (9) Publisher: Wiley

DOI: 10.1111/exsy.13062

Get Full-Text PDF Get Analytical Report

Abstract

Abstract As one of the commonly used data mining algorithms, K‐means has the advantage of fast clustering speed, but the disadvantage is that it is less effective for clustering non‐spherical data. An improved K‐means algorithm (IK‐means) is proposed to enhance clustering efficiency for non‐spherical data. The original dataset is clustered into a relatively larger number of high‐density sub‐clusters, and the final result is obtained by merging connected sub‐clusters respectively. The connectivity among sub‐clusters is evaluated by the sub‐clusters density and the nearest distance class between sub‐clusters. By testing on University of California, Irvine(UCI) datasets and several other artificial simulation datasets, the comparison of proposed IK‐means algorithm against DBSCAN, KGFCM shows its clustering capability for data of arbitrary shape. The clustering Adjusted Rand Index (ARI) value for 72,000 sizes data is 24% higher than DBSCAN, and 95.2% higher than KGFCM. For larger datasets, the IK‐means algorithm is faster than DBSCAN and KGFCM.

Keywords:

DBSCAN Cluster analysis Computer science Data mining CURE data clustering algorithm Determining the number of clusters in a data set Pattern recognition (psychology) Algorithm Correlation clustering Artificial intelligence

Metrics

Cited By

2.15

FWCI (Field Weighted Citation Impact)

Refs

0.85

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Advanced Clustering Algorithms Research

Physical Sciences → Computer Science → Artificial Intelligence

Data Management and Algorithms

Physical Sciences → Computer Science → Signal Processing

Data Mining Algorithms and Applications

Physical Sciences → Computer Science → Information Systems

Improved K‐means algorithm for clustering non‐spherical data

Abstract

Metrics

Citation History

Topics

Related Documents

Data Segmentation with Improved K-Means Clustering Algorithm

Improved K-Means Clustering Algorithm

Research on k-means Clustering Algorithm: An Improved k-means Clustering Algorithm

Improved k- means clustering algorithm for two dimensional data

Improved Parallel k-means Clustering Algorithm