JOURNAL ARTICLE

Efficient Active Algorithms for Hierarchical Clustering

Akshay Krishnamurthy, Sivaraman Balakrishnan, Min Xu, Aarti Singh

Year: 2012 Journal: arXiv (Cornell University) Pages: 267-274 Publisher: Cornell University

Abstract

Advances in sensing technologies and the growth of the internet have resulted in an explosion in the size of modern datasets, while storage and processing power continue to lag behind. This motivates the need for algorithms that are efficient, both in terms of the number of measurements needed and running time. To combat the challenges associated with large datasets, we propose a general framework for active hierarchical clustering that repeatedly runs an off-the-shelf clustering algorithm on small subsets of the data and comes with guarantees on performance, measurement complexity and runtime complexity. We instantiate this framework with a simple spectral clustering algorithm and provide concrete results on its performance, showing that, under some assumptions, this algorithm recovers all clusters of size Ω(log n) using O(n log² n) similarities and runs in O(n log³ n) time for a dataset of n objects. Through extensive experimentation we also demonstrate that this framework is practically alluring.
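
The framework described in the abstract (cluster a small random subset, then place the remaining objects, then recurse) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the two-way split by the sign of the Fiedler vector, the average-similarity placement rule, and the toy similarity function are all assumptions chosen to keep the example short.

```python
import numpy as np


def spectral_bipartition(W):
    """Split a symmetric similarity matrix in two by the sign of the Fiedler vector."""
    laplacian = np.diag(W.sum(axis=1)) - W
    _, vecs = np.linalg.eigh(laplacian)  # eigenvalues are returned in ascending order
    return vecs[:, 1] >= 0


def active_cluster(items, sim, subset_size, min_size, rng):
    """Build a hierarchy: leaves are sorted lists, internal nodes are 2-tuples."""
    items = list(items)
    if len(items) <= max(min_size, 2):
        return sorted(items)

    # 1. Draw a small random subset and query similarities only inside it.
    m = min(subset_size, len(items))
    sample = [items[i] for i in rng.choice(len(items), size=m, replace=False)]
    W = np.zeros((m, m))
    for i in range(m):
        for j in range(i + 1, m):
            W[i, j] = W[j, i] = sim(sample[i], sample[j])

    # 2. Run the off-the-shelf clustering routine on the subset.
    mask = spectral_bipartition(W)
    seeds = ([x for x, f in zip(sample, mask) if f],
             [x for x, f in zip(sample, mask) if not f])
    if not seeds[0] or not seeds[1]:
        return sorted(items)  # degenerate split: treat as a leaf

    # 3. Place each remaining object with the seed group it is
    #    most similar to on average (an assumed placement rule).
    groups = (list(seeds[0]), list(seeds[1]))
    sampled = set(sample)
    for x in items:
        if x in sampled:
            continue
        avg = [np.mean([sim(x, s) for s in seed]) for seed in seeds]
        groups[0 if avg[0] >= avg[1] else 1].append(x)

    # 4. Recurse on each side of the split.
    return tuple(active_cluster(g, sim, subset_size, min_size, rng) for g in groups)


def toy_sim(a, b):
    """Hypothetical similarity: objects 0-9 form one cluster, 10-19 the other."""
    return 1.0 if a // 10 == b // 10 else 0.1


tree = active_cluster(range(20), toy_sim, subset_size=12, min_size=10,
                      rng=np.random.default_rng(0))
```

On this toy input the top-level split queries 66 similarities inside the 12-object subset and 96 more to place the other 8 objects, 162 in total against 190 possible pairs. The saving is small here because the subset is large relative to n; the bounds quoted in the abstract concern subsets that grow only logarithmically with n.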

Keywords:
Cluster analysis, Computer science, Hierarchical clustering, Simple (philosophy), Time complexity, Algorithm, Computational complexity theory, Data mining, Artificial intelligence

Metrics

Cited By: 27
FWCI (Field Weighted Citation Impact): 4.17
Refs: 13
Citation Normalized Percentile: 0.95
Is in top 1%
Is in top 10%

Topics

Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing
Complex Network Analysis Techniques
Physical Sciences →  Physics and Astronomy →  Statistical and Nonlinear Physics

Related Documents

JOURNAL ARTICLE

Efficient parallel hierarchical clustering algorithms

Sanguthevar Rajasekaran

Journal: IEEE Transactions on Parallel and Distributed Systems Year: 2005 Vol: 16 (6) Pages: 497-502
JOURNAL ARTICLE

Efficient Algorithms for Hierarchical Agglomerative Clustering

Ajay Anandan

Journal: University of Alberta Library Year: 2013
JOURNAL ARTICLE

Efficient algorithms for agglomerative hierarchical clustering methods

W. H. Day, Herbert Edelsbrunner

Journal: Journal of Classification Year: 1984 Vol: 1 (1) Pages: 7-24
BOOK-CHAPTER

Efficient Hierarchical Clustering Algorithms Using Partially Overlapping Partitions

Manoranjan Dash, Huan Liu

Lecture notes in computer science Year: 2001 Pages: 495-506
BOOK-CHAPTER

CLUSTERING: HIERARCHICAL ALGORITHMS

Huadong Liu

WORLD SCIENTIFIC eBooks Year: 2006 Pages: 109-120