A multiplicative masking method for preserving the skewness of the original micro-records
Nicolas Ruiz

TL;DR
This paper introduces a simple multiplicative masking method that preserves the skewness of original microdata, enhancing data privacy while maintaining key distributional properties, especially for positively skewed variables like income.
Contribution
The paper proposes a novel multiplicative masking technique that preserves skewness in microdata, addressing limitations of existing methods that assume normality.
Findings
The method effectively preserves skewness in continuous variables.
Numerical examples demonstrate reduced disclosure risk.
Applicable to administrative and business microdata.
Abstract
Masking methods for the safe dissemination of microdata consist of distorting the original data while preserving a pre-defined set of statistical properties in the microdata. For continuous variables, available methodologies rely essentially on matrix masking and in particular on adding noise to the original values, using more or less refined procedures depending on the extent of information that one seeks to preserve. Almost all of these methods make use of the critical assumption that the original datasets follow a normal distribution and/or that the noise has such a distribution. This assumption is, however, restrictive in the sense that few variables follow empirically a Gaussian pattern: the distribution of household income, for example, is positively skewed, and this skewness is essential information that has to be considered and preserved. This paper addresses these issues by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIncome, Poverty, and Inequality · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
