Neural Network Training with Highly Incomplete Datasets
Yu-Wei Chang, Laura Natali, Oveis Jamialahmadi, Stefano Romeo, and Joana B. Pereira, Giovanni Volpe

TL;DR
GapNet is a novel deep learning method that effectively trains neural networks on highly incomplete datasets by creating specialized models for data subsets and combining them, improving predictive accuracy in medical applications.
Contribution
The paper introduces GapNet, a new approach that trains neural networks on incomplete data without imputation, enabling better utilization of real-world datasets with missing values.
Findings
Improves patient classification for Alzheimer's disease.
Enhances risk prediction for Covid-19 hospitalization.
Effective on real-world medical datasets with high missingness.
Abstract
Neural network training and validation rely on the availability of large high-quality datasets. However, in many cases only incomplete datasets are available, particularly in health care applications, where each patient typically undergoes different clinical procedures or can drop out of a study. Since the data to train the neural networks need to be complete, most studies discard the incomplete datapoints, which reduces the size of the training data, or impute the missing features, which can lead to artefacts. Alas, both approaches are inadequate when a large portion of the data is missing. Here, we introduce GapNet, an alternative deep-learning training approach that can use highly incomplete datasets. First, the dataset is split into subsets of samples containing all values for a certain cluster of features. Then, these subsets are used to train individual neural networks. Finally,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Machine Learning in Healthcare · Artificial Intelligence in Healthcare
