Embedding based Encoding Scheme for Privacy Preserving Record Linkage
Sirintra Vaiwsri, Thilina Ranbaduge

TL;DR
This paper introduces an embedding-based encoding scheme for privacy-preserving record linkage that improves matching accuracy and enhances privacy protection over existing methods, enabling secure data integration across sensitive databases.
Contribution
The paper proposes a novel embedding-based encoding technique for PPRL that converts q-grams into binary representations for accurate and privacy-preserving record linkage.
Findings
Better linkage accuracy than state-of-the-art techniques
Enhanced privacy protection against attacks
Effective on real-world datasets
Abstract
To discover new insights from data, there is a growing need to share information that is often held by different organisations. One key task in data integration is the calculation of similarities between records in different databases to identify pairs or sets of records that correspond to the same real-world entities. Due to privacy and confidentiality concerns, however, the owners of sensitive databases are often not allowed or willing to exchange or share their data with other organisations to allow such similarity calculations. Privacy-preserving record linkage (PPRL) is the process of matching records that refer to the same entity across sensitive databases held by different organisations while ensuring no information about the entities is revealed to the participating parties. In this paper, we study how embedding based encoding techniques can be applied in the PPRL context to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Privacy-Preserving Technologies in Data · Advanced Graph Neural Networks
