A Tutorial on Computing $t$-Closeness
Richard Dosselmann, Mehdi Sadeqi, Howard J. Hamilton

TL;DR
This tutorial explains how to compute $t$-closeness, a privacy measure in data publishing, with detailed examples, calculations, and an efficient algorithm for practical implementation.
Contribution
It provides the first comprehensive tutorial with detailed examples and an efficient algorithm for computing $t$-closeness, enhancing understanding and practical application.
Findings
Detailed step-by-step examples of $t$-closeness computation
Introduction of an efficient algorithm for $t$-closeness calculation
Clarification of the use of earth mover's distance in privacy measures
Abstract
This paper presents a tutorial of the computation of -closeness. An established model in the domain of privacy preserving data publishing, -closeness is a measure of the earth mover's distance between two distributions of an anonymized database table. This tutorial includes three examples that showcase the full computation of -closeness in terms of both numerical and categorical attributes. Calculations are carried out using the definition of the earth mover's distance and weighted order distance. This paper includes detailed explanations and calculations not found elsewhere in the literature. An efficient algorithm to calculate the -closeness of a table is also presented.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Data Quality and Management · Cryptography and Data Security
