JOURNAL ARTICLE

Fast parallel cosine K-nearest neighbor graph construction

David C. AnastasiuGeorge Karypis

Year: 2016 Journal:   Irregular Applications: Architectures and Algorithms Pages: 50-53

Abstract

The k-nearest neighbor graph is an important structure in many data mining methods for clustering, advertising, recommender systems, and outlier detection. Constructing the graph requires computing up to n2 similarities for a set of n objects. This has led researchers to seek approximate methods, which find many but not all of the nearest neighbors. In contrast, we leverage shared memory parallelism and recent advances in similarity joins to solve the problem exactly, via a filtering based approach. Our method considers all pairs of potential neighbors but quickly filters those that could not be a part of the k-nearest neighbor graph, based on similarity upper bound estimates. We evaluated our solution on several real-world datasets and found that, using 16 threads, our method achieves up to 12.9x speedup over our exact baseline and is sometimes faster even than approximate methods. Moreover, an approximate version of our method is up to 21.7× more efficient than the best approximate state-of-the-art baseline at similar high recall.

Keywords:
Nearest neighbor search k-nearest neighbors algorithm Computer science Nearest neighbor graph Cosine similarity Speedup Leverage (statistics) Graph Cluster analysis Joins Theoretical computer science Algorithm Data mining Artificial intelligence Parallel computing

Metrics

5
Cited By
9.04
FWCI (Field Weighted Citation Impact)
16
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data-Driven Disease Surveillance
Health Sciences →  Medicine →  Epidemiology
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Graph Neural Networks
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.