# AnonTokens: tracing re-identification attacks through decoy records

**Authors:** Spiros Antonatos, Stefano Braghin, Naoise Holohan, Pol MacAonghusa

arXiv: 1906.09829 · 2021-08-18

## TL;DR

This paper introduces AnonTokens, decoy records inserted into anonymized datasets to detect re-identification attacks, demonstrating feasibility and minimal impact on data utility through large-scale dataset evaluations.

## Contribution

First application of honeytokens concept to data privacy for tracing re-identification attacks with practical evaluation.

## Key findings

- AnonTokens can be inserted with minimal data utility loss
- The approach effectively detects re-identification attempts
- Feasibility demonstrated on large-scale datasets

## Abstract

Privacy is of the utmost concern when it comes to releasing data to third parties. Data owners rely on anonymization approaches to safeguard the released datasets against re-identification attacks. However, even with strict anonymization in place, re-identification attacks are still a possibility and in many cases a reality. Prior art has focused on providing better anonymization algorithms with minimal loss of information and how to prevent data disclosure attacks. Our approach tries to tackle the issue of tracing re-identification attacks based on the concept of honeytokens, decoy or "bait" records with the goal to lure malicious users. While the concept of honeytokens has been widely used in the security domain, this is the first approach to apply the concept on the data privacy domain. Records with high re-identification risk, called AnonTokens, are inserted into anonymized datasets. This work demonstrates the feasibility, detectability and usability of AnonTokens and provides promising results for data owners who want to apply our approach to real use cases. We evaluated our concept with real large-scale population datasets. The results show that the introduction of decoy tokens is feasible without significant impact on the released dataset.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.09829/full.md

## Figures

18 figures with captions in the complete paper: https://tomesphere.com/paper/1906.09829/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/1906.09829/full.md

---
Source: https://tomesphere.com/paper/1906.09829