JOURNAL ARTICLE

A k-Means Algorithm with Automatic Outlier Detection

Guojun Gan

Year: 2025 Journal:   Electronics Vol: 14 (9)Pages: 1723-1723   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Data clustering is a fundamental machine learning task found in many real-world applications. However, real data usually contain noise or outliers. Handling outliers in a clustering algorithm can improve the clustering accuracy. In this paper, we propose a variant of the k-means algorithm to provide data clustering and outlier detection simultaneously. In the proposed algorithm, outlier detection is integrated with the clustering process and is achieved via a term added to the objective function of the k-means algorithm. The proposed algorithm generates two partition matrices: one provides cluster groups and the other can be used to detect outliers. We use both synthetic data and real data to demonstrate the effectiveness and efficiency of the proposed algorithm and show that the clustering performance of the proposed approach is better than other, similar methods.

Keywords:
Anomaly detection Outlier Computer science Algorithm Artificial intelligence Pattern recognition (psychology)

Metrics

1
Cited By
4.82
FWCI (Field Weighted Citation Impact)
0
Refs
0.92
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Fault Detection and Control Systems
Physical Sciences →  Engineering →  Control and Systems Engineering
Advanced Statistical Methods and Models
Physical Sciences →  Mathematics →  Statistics and Probability

Related Documents

BOOK-CHAPTER

Automatic K-Means Clustering Algorithm for Outlier Detection

Dajiang LeiQingsheng ZhuJun ChenHai Xiang LinPeng Yang

Lecture notes in electrical engineering Year: 2011 Pages: 363-372
JOURNAL ARTICLE

Outlier Detection Method based on Improved K-means Clustering Algorithm

Wenfen LiuNan WangYuehua Huang

Journal:   Proceedings of the 2021 5th International Conference on Electronic Information Technology and Computer Engineering Year: 2021 Pages: 1350-1355
BOOK-CHAPTER

A New K-means-Based Algorithm for Automatic Clustering and Outlier Discovery

Trushali JambudiSavita Gandhi

Smart innovation, systems and technologies Year: 2018 Pages: 457-467
© 2026 ScienceGate Book Chapters — All rights reserved.