Prediction Models That Learn to Avoid Missing Values
Lena Stempfle, Anton Matsson, Newton Mwai, Fredrik D. Johansson

TL;DR
This paper introduces a framework called missingness-avoiding (MA) learning that trains models to minimize reliance on missing features at test time, enhancing interpretability and maintaining accuracy.
Contribution
The paper develops tailored MA algorithms for decision trees, linear models, and ensembles, incorporating regularization to reduce dependence on missing features.
Findings
MA models effectively reduce reliance on missing features
MA models maintain competitive predictive performance
Framework enhances interpretability with missing data
Abstract
Handling missing values at test time is challenging for machine learning models, especially when aiming for both high accuracy and interpretability. Established approaches often add bias through imputation or excessive model complexity via missingness indicators. Moreover, either method can obscure interpretability, making it harder to understand how the model utilizes the observed variables in predictions. We propose missingness-avoiding (MA) machine learning, a general framework for training models to rarely require the values of missing (or imputed) features at test time. We create tailored MA learning algorithms for decision trees, tree ensembles, and sparse linear models by incorporating classifier-specific regularization terms in their learning objectives. The tree-based models leverage contextual missingness by reducing reliance on missing values based on the observed context.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsForecasting Techniques and Applications
