Generation and analysis of synthetic data via Bayesian networks: a robust approach for uncertainty quantification via Bayesian paradigm
Larissa N. A. Martins, Fl\'avio B. Gon\c{c}alves, Thais P. Galletti

TL;DR
This paper introduces a Bayesian network-based method for generating synthetic data that accurately reproduces original data analysis results while robustly quantifying uncertainty, balancing data confidentiality and statistical reliability.
Contribution
It presents a fully Bayesian approach using posterior predictive distributions and penalizing priors for Bayesian networks to generate synthetic data with efficient uncertainty quantification.
Findings
Method effectively reproduces original data analysis results.
Algorithm achieves computational efficiency via Monte Carlo sampling.
Empirical tests demonstrate robustness and applicability to real data.
Abstract
Safe and reliable disclosure of information from confidential data is a challenging statistical problem. A common approach considers the generation of synthetic data, to be disclosed instead of the original data. Efficient approaches ought to deal with the trade-off between reliability and confidentiality of the released data. Ultimately, the aim is to be able to reproduce as accurately as possible statistical analysis of the original data using the synthetic one. Bayesian networks is a model-based approach that can be used to parsimoniously estimate the underlying distribution of the original data and generate synthetic datasets. These ought to not only approximate the results of analyses with the original data but also robustly quantify the uncertainty involved in the approximation. This paper proposes a fully Bayesian approach to generate and analyze synthetic data based on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems
