JOURNAL ARTICLE

Generating Poisson-Distributed Differentially Private Synthetic Data

Harrison Quick

Year: 2021 Journal: Journal of the Royal Statistical Society Series A (Statistics in Society) Vol: 184 (3) Pages: 1093-1108 Publisher: Royal Statistical Society

Abstract

The dissemination of synthetic data can be an effective means of making information from sensitive data publicly available with a reduced risk of disclosure. While mechanisms exist for synthesizing data that satisfy formal privacy guarantees, these mechanisms do not typically resemble the models an end-user might use to analyse the data. More recently, the use of methods from the disease mapping literature has been proposed to generate spatially referenced synthetic data with high utility but without formal privacy guarantees. The objective for this paper is to help bridge the gap between the disease mapping and the differential privacy literatures. In particular, we generalize an approach for generating differentially private synthetic data currently used by the US Census Bureau to the case of Poisson-distributed count data in a way that accommodates heterogeneity in population sizes and allows for the infusion of prior information regarding the underlying event rates. Following a pair of small simulation studies, we illustrate the utility of the synthetic data produced by this approach using publicly available, county-level heart disease-related death counts. This study demonstrates the benefits of the proposed approach’s flexibility with respect to heterogeneity in population sizes and event rates while motivating further research to improve its utility.

Metrics

Cited By: 7
FWCI (Field Weighted Citation Impact): 0.71
Refs: 27
Citation Normalized Percentile: 0.74

Topics

Privacy-Preserving Technologies in Data
Physical Sciences →  Computer Science →  Artificial Intelligence
Data-Driven Disease Surveillance
Health Sciences →  Medicine →  Epidemiology
Privacy, Security, and Data Protection
Social Sciences →  Social Sciences →  Sociology and Political Science

Related Documents

JOURNAL ARTICLE

Differentially private GANs for generating synthetic indoor location data

Vahideh Moghtadaiee, Mina Alishahi, Milad Rabiei

Journal: International Journal of Information Security Year: 2025 Vol: 24 (3)

JOURNAL ARTICLE

Private Sampling: A Noiseless Approach for Generating Differentially Private Synthetic Data

March T. Boedihardjo, Thomas Strohmer, Roman Vershynin

Journal: SIAM Journal on Mathematics of Data Science Year: 2022 Vol: 4 (3) Pages: 1082-1115

JOURNAL ARTICLE

Collaborative learning from distributed data with differentially private synthetic data

Lukas Prediger, Joonas Jälkö, Antti Honkela, Samuel Kaski

Journal: BMC Medical Informatics and Decision Making Year: 2024 Vol: 24 (1) Pages: 167-167

JOURNAL ARTICLE

Online Differentially Private Synthetic Data Generation

Yiyun He, Roman Vershynin, Yizhe Zhu

Journal: IEEE Transactions on Privacy Year: 2024 Vol: 1 Pages: 19-30