A multiplicative masking method for preserving the skewness of the   original micro-records

Nicolas Ruiz

arXiv:1712.02549·cs.CR·December 8, 2017

A multiplicative masking method for preserving the skewness of the original micro-records

Nicolas Ruiz

PDF

Open Access

TL;DR

This paper introduces a simple multiplicative masking method that preserves the skewness of original microdata, enhancing data privacy while maintaining key distributional properties, especially for positively skewed variables like income.

Contribution

The paper proposes a novel multiplicative masking technique that preserves skewness in microdata, addressing limitations of existing methods that assume normality.

Findings

01

The method effectively preserves skewness in continuous variables.

02

Numerical examples demonstrate reduced disclosure risk.

03

Applicable to administrative and business microdata.

Abstract

Masking methods for the safe dissemination of microdata consist of distorting the original data while preserving a pre-defined set of statistical properties in the microdata. For continuous variables, available methodologies rely essentially on matrix masking and in particular on adding noise to the original values, using more or less refined procedures depending on the extent of information that one seeks to preserve. Almost all of these methods make use of the critical assumption that the original datasets follow a normal distribution and/or that the noise has such a distribution. This assumption is, however, restrictive in the sense that few variables follow empirically a Gaussian pattern: the distribution of household income, for example, is positively skewed, and this skewness is essential information that has to be considered and preserved. This paper addresses these issues by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIncome, Poverty, and Inequality · Statistical Methods and Inference · Statistical Methods and Bayesian Inference