Bayesian Bootstrap based Gaussian Copula Model for Mixed Data with High Missing Rates
Seongmin Kim, Jeunghun Oh, Hungkuk Ko, Jeongmin Park, Jaeyong Lee

TL;DR
This paper introduces a Bayesian bootstrap-based Gaussian copula model that effectively handles mixed data with high missing rates by capturing dependency structures and quantifying uncertainty in marginal distributions.
Contribution
It presents a novel Bayesian bootstrap approach within the Gaussian copula framework to improve imputation accuracy for mixed data with high missingness.
Findings
Outperforms existing imputation methods across various missing rates.
Demonstrates superior performance on real-world datasets.
Effectively models complex dependencies in mixed data.
Abstract
Missing data is a common issue in various fields such as medicine, social sciences, and natural sciences, and it poses significant challenges for accurate statistical analysis. Although numerous imputation methods have been proposed to address this issue, many of them fail to adequately capture the complex dependency structure among variables. To overcome this limitation, models based on the Gaussian copula framework have been introduced. However, most existing copula-based approaches do not account for the uncertainty in the marginal distributions, which can lead to biased marginal estimates and degraded performance, especially under high missingness rates. In this study, we propose a Bayesian bootstrap-based Gaussian Copula model (BBGC) that explicitly incorporates uncertainty in the marginal distributions of each variable. The proposed BBGC combines the flexible dependency modeling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Bayesian Methods and Mixture Models · Statistical Methods and Inference
