Unsupervised Imputation of Non-ignorably Missing Data Using Importance-Weighted Autoencoders
David K. Lim, Naim U. Rashid, Junier B. Oliva, Joseph G. Ibrahim

TL;DR
This paper introduces NIMIWAE, a novel Variational Autoencoder architecture designed to effectively handle both ignorable and non-ignorable missing data patterns in biomedical datasets, improving imputation and analysis.
Contribution
The paper presents NIMIWAE, the first VAE model that flexibly models both missing data mechanisms, enhancing unsupervised learning and imputation in complex biomedical data.
Findings
Outperforms existing imputation methods in simulations
Accurately models non-ignorable missingness patterns
Effective in real-world ICU dataset analysis
Abstract
Deep Learning (DL) methods have dramatically increased in popularity in recent years. While its initial success was demonstrated in the classification and manipulation of image data, there has been significant growth in the application of DL methods to problems in the biomedical sciences. However, the greater prevalence and complexity of missing data in biomedical datasets present significant challenges for DL methods. Here, we provide a formal treatment of missing data in the context of Variational Autoencoders (VAEs), a popular unsupervised DL architecture commonly utilized for dimension reduction, imputation, and learning latent representations of complex data. We propose a new VAE architecture, NIMIWAE, that is one of the first to flexibly account for both ignorable and non-ignorable patterns of missingness in input features at training time. Following training, samples can be drawn…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Generative Adversarial Networks and Image Synthesis · AI in cancer detection
