Tunable correlation retention: A statistical method for generating synthetic data
Nicklas J\"averg{\aa}rd, Rainey Lyons, Adrian Muntean, Jonas Forsman

TL;DR
This paper introduces a statistical method for generating synthetic data that preserves feature correlations and offers tunable privacy levels by controlling the depth of conditional distributions used.
Contribution
The paper presents a novel, flexible approach to synthetic data generation that maintains inter-feature correlations and allows adjustable privacy settings through a tunable parameter.
Findings
Method accurately reproduces correlation structures in synthetic data.
Tunable parameters effectively balance data fidelity and privacy.
Applicable to various datasets, including real-world energy data.
Abstract
We propose a method to generate statistically representative synthetic data from a given dataset. The main goal of our method is for the created data set to mimic the inter--feature correlations present in the original data, while also offering a tunable parameter to influence the privacy level. In particular, our method constructs a statistical map by using the empirical conditional distributions between the features of the original dataset. Part of the tunability is achieved by limiting the depths of conditional distributions that are being used. We describe in detail our algorithms used both in the construction of a statistical map and how to use this map to generate synthetic observations. This approach is tested in three different ways: with a hand calculated example; a manufactured dataset; and a real world energy-related dataset of consumption/production of households in Madeira…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsSparse Evolutionary Training
