JOURNAL ARTICLE

Bloom Encodings in DGA Detection: Improving Machine Learning Privacy by Building on Privacy-Preserving Record Linkage

Lasse NitzAvikarsha Mandal

Year: 2024 Journal:   Munich Personal RePEc Archive (Ludwig Maximilian University of Munich)   Publisher: Ludwig-Maximilians-Universität München

Abstract

The use of machine learning has shown to benefit a wide range of applications, especially for classification tasks. As such, the detection of algorithmically generated domains to identify corrupted machines has proven itself to be a mature use case with good classification performance. The use of privacy and security sensitive data, however, raises concerns in scenarios that require interaction with external parties. As one of such scenarios, we consider the training of domain generation algorithm detection classifiers in a Machine-Learning-as-a-Service (MLaaS) scenario. We evaluate the use of a Bloom encoding approach from the area of privacy-preserving record linkage to prevent the MLaaS provider from getting to know the exact classification task as well as the data samples transmitted for training and classification. We investigate the threat associated with pattern mining attacks by performing a privacy analysis for two versions of these encodings (basic and randomized). We further identify sets of parameter values which we find to provide an adequate level of protection against these attacks. We see the potential for this approach in machine learning use cases dealing with sensitive data or tasks, especially for MLaaS scenarios dealing with short data samples that lack a clear structure.

Keywords:
Computer science Machine learning Task (project management) Record linkage Artificial intelligence Linkage (software) Domain (mathematical analysis) Encoding (memory) Data mining Computer security Engineering

Metrics

1
Cited By
0.64
FWCI (Field Weighted Citation Impact)
0
Refs
0.68
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Privacy-Preserving Technologies in Data
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Quality and Management
Social Sciences →  Decision Sciences →  Management Science and Operations Research
Cryptography and Data Security
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Privacy-preserving record linkage using Bloom filters

Rainer SchnellTobias BachtelerJörg Reiher

Journal:   BMC Medical Informatics and Decision Making Year: 2009 Vol: 9 (1)Pages: 41-41
JOURNAL ARTICLE

Secure Privacy Preserving Record Linkage of Large Databases by Modified Bloom Filter Encodings

Rainer SchnellChristian Borgs

Journal:   International Journal for Population Data Science Year: 2017 Vol: 1 (1)Pages: 29-29
© 2026 ScienceGate Book Chapters — All rights reserved.