JOURNAL ARTICLE

Efficient Pattern Mining Based Cryptanalysis for Privacy-Preserving Record Linkage

Abstract

Privacy-preserving record linkage (PPRL) is the process of identifying records that correspond to the same entities across several databases without revealing any sensitive information about these entities. One popular PPRL technique is Bloom filter (BF) encoding, and the first BF-based PPRL systems are now being employed in real-world linkage applications. Here we present a cryptanalysis attack that can re-identify attribute values encoded in BFs. Our method first applies maximal frequent itemset mining to a BF database to identify sets of frequently co-occurring bit positions that correspond to encoded frequent q-grams (character substrings extracted from plain-text values). Using a language model, we then identify additional q-grams by applying pattern mining to subsets of BFs that encode a previously identified frequent q-gram. Experiments on a real database show that our attack can successfully re-identify sensitive values even when each BF in a database is unique.
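The core observation behind the attack can be illustrated with a small sketch. It is not the authors' algorithm: it counts only co-occurring pairs of bit positions (frequent itemsets of size two) rather than mining maximal frequent itemsets, and it uses no language model. The double-hashing scheme, the filter length of 64 bits, the two hash functions, and all function names are assumptions chosen for illustration.

```python
import hashlib
from collections import Counter
from itertools import combinations


def qgrams(value, q=2):
    """Return the set of character q-grams of a string."""
    return {value[i:i + q] for i in range(len(value) - q + 1)}


def bit_positions(gram, num_bits=64, num_hashes=2):
    """Map one q-gram to bit positions (hypothetical double-hashing scheme)."""
    h1 = int(hashlib.sha1(gram.encode()).hexdigest(), 16)
    h2 = int(hashlib.sha256(gram.encode()).hexdigest(), 16)
    return {(h1 + i * h2) % num_bits for i in range(num_hashes)}


def encode(value, num_bits=64, num_hashes=2):
    """Encode a string as the set of 1-bit positions of its Bloom filter."""
    positions = set()
    for gram in qgrams(value):
        positions |= bit_positions(gram, num_bits, num_hashes)
    return frozenset(positions)


def frequent_pairs(filters, min_support):
    """Return pairs of bit positions set together in >= min_support filters."""
    counts = Counter()
    for bf in filters:
        # Every pair of 1-bits in this filter co-occurs once.
        counts.update(combinations(sorted(bf), 2))
    return {pair: c for pair, c in counts.items() if c >= min_support}


if __name__ == "__main__":
    # Toy "database": three values share the q-gram "an".
    names = ["anna", "hans", "joan", "mike"]
    filters = [encode(name) for name in names]
    for pair, support in sorted(frequent_pairs(filters, 3).items()):
        print(pair, support)
```

Because every value containing a given q-gram sets the same bit positions, those positions co-occur at least as often as the q-gram itself. An attacker who knows which q-grams are frequent in plain-text data can therefore try to align frequent position sets with frequent q-grams; hash collisions between different q-grams can also produce frequent pairs, which is one reason a fuller method needs more than pair counting.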

Keywords:
Computer science, Substring, Bloom filter, Encoding (memory), Linkage (software), Cryptanalysis, Data mining, Character (mathematics), Information retrieval, Algorithm, Data structure, Artificial intelligence, Cryptography, Mathematics, Gene

Metrics

Cited By: 37
FWCI (Field Weighted Citation Impact): 4.19
Refs: 16
Citation Normalized Percentile: 0.94

Topics

Data Quality and Management
Social Sciences →  Decision Sciences →  Management Science and Operations Research
Privacy-Preserving Technologies in Data
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Mining Algorithms and Applications
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Precise and Fast Cryptanalysis for Bloom Filter Based Privacy-Preserving Record Linkage

Peter Christen, Thilina Ranbaduge, Dinusha Vatsalan, Rainer Schnell

Journal: IEEE Transactions on Knowledge and Data Engineering Year: 2018 Vol: 31 (11) Pages: 2164-2177

JOURNAL ARTICLE

Differential Cryptanalysis of Bloom Filters for Privacy-Preserving Record Linkage

Weifeng Yin, Lifeng Yuan, Yizhi Ren, Weizhi Meng, Dong Wang, Qiuhua Wang

Journal: IEEE Transactions on Information Forensics and Security Year: 2024 Vol: 19 Pages: 6665-6678

JOURNAL ARTICLE

Semantic-based Privacy-preserving Record Linkage.

Lu Yang

Journal: International Journal for Population Data Science Year: 2022 Vol: 7 (3)