A Re-examination of the Census Bureau Reconstruction and Reidentification Attack
Krish Muralidhar

TL;DR
This paper critically re-examines the Census Bureau's claims of successful reidentification of individuals from reconstructed census data, demonstrating the existence of multiple reconstructions that challenge the validity of their conclusions.
Contribution
It reveals that the reconstruction process is highly ambiguous with infinitely many solutions, undermining prior claims of reidentification success.
Findings
Multiple reconstructions exist for the same data
Different reconstructions assign different identities
Claims of reidentification are not conclusively supported
Abstract
Recent analysis by researchers at the U.S. Census Bureau claims that by reconstructing the tabular data released from the 2010 Census, it is possible to reconstruct the original data and, using an accurate external data file with identity, reidentify 179 million respondents (approximately 58% of the population). This study shows that there are a practically infinite number of possible reconstructions, and each reconstruction leads to assigning a different identity to the respondents in the reconstructed data. The results reported by the Census Bureau researchers are based on just one of these infinite possible reconstructions and is easily refuted by an alternate reconstruction. Without definitive proof that the reconstruction is unique, or at the very least, that most reconstructions lead to the assignment of the same identity to the same respondent, claims of confirmed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCensus and Population Estimation · Electoral Systems and Political Participation · Survey Methodology and Nonresponse
