JOURNAL ARTICLE

DC-Top-k: A Novel Top-k Selecting Algorithm and Its Parallelization

Abstract

Sorting is a basic computational task in computer science. As a variant of the sorting problem, top-k selection has been widely used. To our knowledge, the state-of-the-art top-k selection algorithm, Partial Quicksort, takes on average $C(n, k) = 2(n+1)H_n + 2n - 6k + 6 - 2(n+3-k)H_{n+1-k}$ comparisons and about $C(n, k)/6$ exchanges to select the largest k terms from n terms, where $H_n$ denotes the n-th harmonic number. In this paper, a novel top-k algorithm called DC-Top-k is proposed, based on a divide-and-conquer strategy. A theoretical analysis proves that the algorithm is competitive with the state-of-the-art top-k algorithm in the number of comparisons and significantly better in the number of exchanges. On average, DC-Top-k takes at most $(2 - 1/k)n + O(k \log_2 k)$ comparisons and $O(k \log_2 k)$ exchanges to select the largest k terms from n terms. The effectiveness of the proposed algorithm is verified by a number of experiments, which show that DC-Top-k is 1-3 times faster than Partial Quicksort and, moreover, notably more stable. As k increases, it is also significantly more efficient than the min-heap based top-k algorithm (U.S. Patent, 2012). Finally, DC-Top-k is naturally implemented in a parallel computing environment, and experiments also demonstrate better scalability than Partial Quicksort.
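To make the two comparison counts in the abstract concrete, the following is a minimal sketch that evaluates the Partial Quicksort formula exactly and the leading term of the DC-Top-k bound. The function names are hypothetical and chosen for illustration only. The constant hidden in the O(k log2 k) term is not given in the abstract, so it is left out; the second function is therefore only the dominant part of the bound, not the full bound.

```python
from fractions import Fraction


def harmonic(m: int) -> Fraction:
    """Return the m-th harmonic number H_m = 1 + 1/2 + ... + 1/m (H_0 = 0)."""
    return sum((Fraction(1, i) for i in range(1, m + 1)), Fraction(0))


def partial_quicksort_comparisons(n: int, k: int) -> Fraction:
    """Average comparisons of Partial Quicksort, as stated in the abstract:
    C(n, k) = 2(n+1)H_n + 2n - 6k + 6 - 2(n+3-k)H_{n+1-k}.
    """
    if not 1 <= k <= n:
        raise ValueError("expected 1 <= k <= n")
    return (
        2 * (n + 1) * harmonic(n)
        + 2 * n
        - 6 * k
        + 6
        - 2 * (n + 3 - k) * harmonic(n + 1 - k)
    )


def dc_top_k_leading_term(n: int, k: int) -> Fraction:
    """Leading term (2 - 1/k) * n of the DC-Top-k comparison bound.

    Assumption: the O(k log2 k) term is omitted because its constant
    is not stated in the abstract.
    """
    if not 1 <= k <= n:
        raise ValueError("expected 1 <= k <= n")
    return (2 - Fraction(1, k)) * n


if __name__ == "__main__":
    n = 1_000
    for k in (1, 10, 100):
        pq = float(partial_quicksort_comparisons(n, k))
        dc = float(dc_top_k_leading_term(n, k))
        print(f"n={n} k={k}: Partial Quicksort ~ {pq:.1f}, DC-Top-k leading term = {dc:.1f}")
```

As a sanity check on the reconstructed formula, setting k = n reduces C(n, k) to 2(n+1)H_n - 4n, the classical average comparison count of full quicksort.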

Keywords:
Quicksort; Algorithm; Computer science; Parallel computing; Scalability; Sorting algorithm; Heap (data structure); Sorting; Divide and conquer algorithms

Metrics

Cited By: 9
FWCI (Field Weighted Citation Impact): 0.56
Refs: 47
Citation Normalized Percentile: 0.87

Topics

Algorithms and Data Compression
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

SETJoin: a novel top-k similarity join algorithm

Hongya Wang, Lihong Yang, Yingyuan Xiao

Journal: Soft Computing Year: 2020 Vol: 24 (19) Pages: 14577-14592
BOOK-CHAPTER

K-Graphs: Selecting Top-k Data Sources for XML Keyword Queries

Khanh Nguyen, Jinli Cao

Lecture notes in computer science Year: 2011 Pages: 425-439
BOOK-CHAPTER

Top-k Algorithm Based on Extraction

Lingjuan Li, Xue Zeng, Guoyu Lu

Advances in intelligent and soft computing Year: 2012 Pages: 113-118