Lambda-randomization: multi-dimensional randomized response made easy
Nicolas Ruiz

TL;DR
This paper introduces Lambda-randomization, a novel multi-dimensional randomized response method that reduces computational costs and improves accuracy in estimating true distributions for high-dimensional data, while maintaining privacy guarantees.
Contribution
The paper develops a new theoretical framework and protocol for multi-dimensional randomized response that overcomes the curse of dimensionality with simple parameterizations and low computational complexity.
Findings
Lambda-randomization achieves efficient distribution estimation in high dimensions.
The protocol maintains privacy guarantees while reducing computational costs.
Empirical results demonstrate improved accuracy over traditional methods.
Abstract
Randomized response is a popular local anonymization approach that can deliver anonymized multi-dimensional data sets with rigorous privacy guarantees. At the same time, it can ensure validity for exploratory analysis and machine learning tasks as, under fairly general conditions, unbiased estimates of the underlying true distributions can be retrieved. However, and like for many other anonymization techniques, one of the main pitfalls of this approach is the curse of dimensionality. When coping with data sets with many attributes, one quickly runs into unsustainable computational costs for estimating true distributions, as well as a degradation in their accuracies. Relying on new theoretical insights developed in this paper, we propose an approach to multi-dimensional randomized response that avoids these traditional limitations. From simple yet intuitive parameterizations of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Random Matrices and Applications · Mobile Crowdsensing and Crowdsourcing
