Abstract

In this work, we study practical heuristics to improve the performance of prefix-tree based algorithms for differentially private heavy hitter detection. Our model assumes each user has multiple data points and the goal is to learn as many of the most frequent data points as possible across all users' data with aggregate and local differential privacy. We propose an adaptive hyperparameter tuning algorithm that improves the performance of the algorithm while satisfying computational, communication and privacy constraints. We explore the impact of different data-selection schemes as well as the impact of introducing deny lists during multiple runs of the algorithm. We test these improvements using extensive experimentation on the Reddit dataset [Caldas et al., 2018] on the task of learning the most frequent words.

Keywords:
Computer science Differential privacy Heuristics Task (project management) Hyperparameter Aggregate (composite) Data mining Online analytical processing Tree (set theory) Big data Selection (genetic algorithm) Machine learning Data warehouse

Metrics

6
Cited By
3.83
FWCI (Field Weighted Citation Impact)
44
Refs
0.90
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Privacy-Preserving Technologies in Data
Physical Sciences →  Computer Science →  Artificial Intelligence
Cryptography and Data Security
Physical Sciences →  Computer Science →  Artificial Intelligence
Internet Traffic Analysis and Secure E-voting
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Locally Differentially Private Heavy Hitter Identification

Tianhao WangNinghui LiSomesh Jha

Journal:   IEEE Transactions on Dependable and Secure Computing Year: 2019 Vol: 18 (2)Pages: 982-993
JOURNAL ARTICLE

Federated Heavy Hitter Analytics with Local Differential Privacy

Yuemin ZhangQingqing YeHaibo Hu

Journal:   Proceedings of the ACM on Management of Data Year: 2025 Vol: 3 (1)Pages: 1-27
BOOK-CHAPTER

Federated Online Learning for Heavy Hitter Detection

Paula Raissa SilvaJoão VinagreJoão Gama

Frontiers in artificial intelligence and applications Year: 2024
JOURNAL ARTICLE

Local Differentially Private Heavy Hitter Detection in Data Streams with Bounded Memory

Xiaochen LiWeiran LiuJian LouYuan HongLei ZhangZhan QinKui Ren

Journal:   Proceedings of the ACM on Management of Data Year: 2024 Vol: 2 (1)Pages: 1-27
© 2026 ScienceGate Book Chapters — All rights reserved.