JOURNAL ARTICLE

Safe Semi-Supervised Fuzzy ${C}$ -Means Clustering

Haitao Gan

Year: 2019 Journal:   IEEE Access Vol: 7 Pages: 95659-95664   Publisher: Institute of Electrical and Electronics Engineers

Abstract

With the rapid increase in the number of collected data samples, semi-supervised clustering (SSC) has become a useful mining tool to find an intrinsic data structure with the help of prior knowledge. The common used prior knowledge includes pair-wise constraints and cluster labels. In the past decades, many relevant methods are proposed to improve clustering performance of SSC by mining prior knowledge. In general, the prior knowledge is assumed to be beneficial to yielding desirable results. However, one can gather inappropriate prior knowledge in some scenarios, such as wrong cluster labels. In this case, prior knowledge can result in degenerating clustering performance. Therefore, how to raise safe semi-supervised clustering (S3C) should be investigated. A main goal of S3C is that the corresponding result is never inferior to that of the corresponding unsupervised clustering part. To achieve the goal, we propose safe semi-supervised Fuzzy c -Means clustering (S3FCM) which is extended from traditional semi-supervised FCM (SSFCM). In our algorithm, wrongly labeled samples are carefully explored by constraining the corresponding predictions to be those yielded by unsupervised clustering. Meanwhile, the predictions of the other labeled samples should approach to the given labels. Therefore the labeled samples are expected to be safely explored through a balance between unsupervised clustering and SSC. From the reported clustering results on different datasets, we can find that S3FCM can yield comparable, if not the best, performance among different unsupervised clustering and SSC methods even if the wrong ratio achieves 20%.

Keywords:
Cluster analysis Computer science Fuzzy clustering Artificial intelligence Data mining Fuzzy logic Cluster (spacecraft) Machine learning Pattern recognition (psychology)

Metrics

21
Cited By
1.54
FWCI (Field Weighted Citation Impact)
28
Refs
0.86
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Metric-Based Semi-Supervised Fuzzy C-Means Clustering

Xue YinQi HuangLiang Ming Li

Journal:   Advanced materials research Year: 2011 Vol: 268-270 Pages: 166-171
BOOK-CHAPTER

Semi-supervised Fuzzy c-Means Clustering of Biological Data

Michele CeccarelliAntonio Maratea

Lecture notes in computer science Year: 2006 Pages: 259-266
JOURNAL ARTICLE

A new Semi-Supervised Intuitionistic Fuzzy C-means Clustering

J. AroraMeena Tushir

Journal:   ICST Transactions on Scalable Information Systems Year: 2018 Vol: 0 (0)Pages: 159622-159622
© 2026 ScienceGate Book Chapters — All rights reserved.