JOURNAL ARTICLE

K-means clustering algorithm and Python implementation

Abstract

K-means is a commonly used algorithm in machine learning. It is an unsupervised learning algorithm. It is regularly used for data clustering. Only the number of clusters are needed to be specified for it to automatically aggregate the data into multiple categories, the similarity between data in the same cluster is high, thus, the similarity of data in different clusters is low. K-means algorithm is a typical distance-based clustering algorithm. It takes distance as the evaluation index of similarity, that is, the closer the distance between two objects, the greater similarity. Clustering is also extremely extensive in practical applications, such as: market segmentation, social network analysis, organized computing clusters, and astronomical data analysis. This paper is my own attempt to make K-means code and API, using Python and Java to jointly complete a project. The Python is mainly used to write the framework of the core algorithm of K-means, and the Java to create experimental data. In this research report, I will describe the simple data model provided by K-means, as well as the design and implementation of K-means.

Keywords:
Python (programming language) Computer science Cluster analysis Java Data mining Algorithm Fuzzy clustering Theoretical computer science Artificial intelligence Programming language

Metrics

11
Cited By
0.99
FWCI (Field Weighted Citation Impact)
4
Refs
0.80
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Implementation of Clustering using K-Means in Python

Ahmad Farhan AlShammari

Journal:   International Journal of Computer Applications Year: 2024 Vol: 186 (40)Pages: 12-17
JOURNAL ARTICLE

K-Means Algorithm Implementation for Project Health Clustering

Ajeng Arifa Chantika RinduRia AstriratmaAti Zaidiah

Journal:   Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Year: 2023 Vol: 7 (5)Pages: 1064-1076
JOURNAL ARTICLE

Research on K-means clustering algorithm and its implementation

Jianming CuiJianming LiuLiao Zhouyu

Journal:   Proceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013) Year: 2013
© 2026 ScienceGate Book Chapters — All rights reserved.