Evaluating the Impacts of Swapping on the US Decennial Census
Maria Ballesteros, Cynthia Dwork, Gary King, Conlan Olson, Manish, Raghavan

TL;DR
This paper develops and implements a parameterized swapping algorithm to evaluate and compare the privacy-utility impacts of swapping and TopDown differential privacy methods used by the US Census Bureau.
Contribution
It provides the first publicly available implementation of the swapping method, enabling detailed analysis of its effects on statistical data and comparison with TopDown.
Findings
Swapping and TopDown introduce errors of similar magnitude.
The bias directions of swapping and TopDown differ.
The implementation allows for analysis and correction of disclosure avoidance impacts.
Abstract
To meet its dual burdens of providing useful statistics and ensuring privacy of individual respondents, the US Census Bureau has for decades introduced some form of "noise" into published statistics. Initially, they used a method known as "swapping" (1990-2010). In 2020, they switched to an algorithm called TopDown that ensures a form of Differential Privacy. While the TopDown algorithm has been made public, no implementation of swapping has been released and many details of the deployed swapping methodology deployed have been kept secret. Further, the Bureau has not published (even a synthetic) "original" dataset and its swapped version. It is therefore difficult to evaluate the effects of swapping, and to compare these effects to those of other privacy technologies. To address these difficulties we describe and implement a parameterized swapping algorithm based on Census publications,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
