MisGAN: Learning from Incomplete Data with Generative Adversarial Networks
Steven Cheng-Xian Li, Bo Jiang, Benjamin Marlin

TL;DR
This paper introduces MisGAN, a novel GAN-based framework capable of learning from incomplete high-dimensional data by modeling missing data distributions and imputing missing values effectively.
Contribution
MisGAN is the first framework to jointly learn data and missing data distributions using GANs, enabling effective imputation from incomplete datasets.
Findings
Successfully models complex missing data distributions
Achieves accurate data imputation under missing completely at random
Demonstrates effectiveness on various high-dimensional datasets
Abstract
Generative adversarial networks (GANs) have been shown to provide an effective way to model complex distributions and have obtained impressive results on various challenging tasks. However, typical GANs require fully-observed data during training. In this paper, we present a GAN-based framework for learning from complex, high-dimensional incomplete data. The proposed framework learns a complete data generator along with a mask generator that models the missing data distribution. We further demonstrate how to impute missing data by equipping our framework with an adversarially trained imputer. We evaluate the proposed framework using a series of experiments with several types of missing data processes under the missing completely at random assumption.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Gaussian Processes and Bayesian Inference
