JOURNAL ARTICLE

Parallel Filtered Graphs for Hierarchical Clustering

Abstract

Given all pairwise weights (distances) among a set of objects, filtered graphs provide a sparse representation by only keeping an important subset of weights. Such graphs can be passed to graph clustering algorithms to generate hierarchical clusters. In particular, the directed bubble hierarchical tree (DBHT) algorithm on filtered graphs has been shown to produce good hierarchical clusters for time series data.We propose a new parallel algorithm for constructing triangulated maximally filtered graphs (TMFG), which produces valid inputs for DBHT, and a scalable parallel algorithm for generating DBHTs that is optimized for TMFG inputs. In addition to parallelizing the original TMFG construction, which has limited parallelism, we also design a new algorithm that inserts multiple vertices on each round to enable more parallelism. We show that the graphs generated by our new algorithm have similar quality compared to the original TMFGs, while being much faster to generate. Our new parallel algorithms for TMFGs and DBHTs are 136-2483x faster than state-of-the-art implementations, while achieving up to 41.56x self-relative speedup on 48 cores with hyper-threading, and achieve better clustering results compared to the standard average-linkage and complete-linkage hierarchical clustering algorithms. We show that on a stock data set, our algorithms produce clusters that align well with human experts' classification.

Keywords:
Computer science Cluster analysis Hierarchical clustering Parallel computing Theoretical computer science Artificial intelligence

Metrics

4
Cited By
0.86
FWCI (Field Weighted Citation Impact)
83
Refs
0.67
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Complex Network Analysis Techniques
Physical Sciences →  Physics and Astronomy →  Statistical and Nonlinear Physics
Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

BOOK-CHAPTER

Efficient Parallel Hierarchical Clustering

Manoranjan DashSimona PetrutiuPeter Scheuermann

Lecture notes in computer science Year: 2004 Pages: 363-371
JOURNAL ARTICLE

Efficient parallel hierarchical clustering algorithms

Sanguthevar Rajasekaran

Journal:   IEEE Transactions on Parallel and Distributed Systems Year: 2005 Vol: 16 (6)Pages: 497-502
JOURNAL ARTICLE

Parallel algorithms for hierarchical clustering

Clark F. Olson

Journal:   Parallel Computing Year: 1995 Vol: 21 (8)Pages: 1313-1325
© 2026 ScienceGate Book Chapters — All rights reserved.