JOURNAL ARTICLE

Enhanced Adjacency-Constrained Hierarchical Clustering Using Fine-Grained Pseudo Labels

Jie YangChin‐Teng Lin

Year: 2024 Journal:   IEEE Transactions on Emerging Topics in Computational Intelligence Vol: 8 (3)Pages: 2481-2492   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Hierarchical clustering is able to provide partitions of different granularity levels. However, most existing hierarchical clustering techniques perform clustering in the original feature space of the data, which may suffer from overlap, sparseness, or other undesirable characteristics, resulting in noncompetitive performance. In the field of deep clustering, learning representations using pseudo labels has recently become a research hotspot. Yet most existing approaches employ coarse-grained pseudo labels, which may contain noise or incorrect labels. Hence, the learned feature space does not produce a competitive model. In this paper, we introduce the idea of fine-grained labels of supervised learning into unsupervised clustering, giving rise to the enhanced adjacency-constrained hierarchical clustering (ECHC) model. The full framework comprises four steps. One, adjacency-constrained hierarchical clustering (CHC) is used to produce relatively pure fine-grained pseudo labels. Two, those fine-grained pseudo labels are used to train a shallow multilayer perceptron to generate good representations. Three, the corresponding representation of each sample in the learned space is used to construct a similarity matrix. Four, CHC is used to generate the final partition based on the similarity matrix. The experimental results show that the proposed ECHC framework not only outperforms 14 shallow clustering methods on eight real-world datasets but also surpasses current state-of-the-art deep clustering models on six real-world datasets. In addition, on five real-world datasets, ECHC achieves comparable results to supervised algorithms.

Keywords:
Cluster analysis Computer science Artificial intelligence Adjacency list Correlation clustering Pattern recognition (psychology) Adjacency matrix Hierarchical clustering Fuzzy clustering Feature vector Single-linkage clustering Canopy clustering algorithm Data mining Graph Algorithm Theoretical computer science

Metrics

4
Cited By
2.56
FWCI (Field Weighted Citation Impact)
61
Refs
0.85
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.