Prediction Models That Learn to Avoid Missing Values

Lena Stempfle; Anton Matsson; Newton Mwai; Fredrik D. Johansson

arXiv:2505.03393·cs.LG·May 7, 2025

Prediction Models That Learn to Avoid Missing Values

Lena Stempfle, Anton Matsson, Newton Mwai, Fredrik D. Johansson

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a framework called missingness-avoiding (MA) learning that trains models to minimize reliance on missing features at test time, enhancing interpretability and maintaining accuracy.

Contribution

The paper develops tailored MA algorithms for decision trees, linear models, and ensembles, incorporating regularization to reduce dependence on missing features.

Findings

01

MA models effectively reduce reliance on missing features

02

MA models maintain competitive predictive performance

03

Framework enhances interpretability with missing data

Abstract

Handling missing values at test time is challenging for machine learning models, especially when aiming for both high accuracy and interpretability. Established approaches often add bias through imputation or excessive model complexity via missingness indicators. Moreover, either method can obscure interpretability, making it harder to understand how the model utilizes the observed variables in predictions. We propose missingness-avoiding (MA) machine learning, a general framework for training models to rarely require the values of missing (or imputed) features at test time. We create tailored MA learning algorithms for decision trees, tree ensembles, and sparse linear models by incorporating classifier-specific regularization terms in their learning objectives. The tree-based models leverage contextual missingness by reducing reliance on missing values based on the observed context.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

healthy-ai/malearn
pytorchOfficial

Videos

Prediction models that learn to avoid missing values· slideslive

Taxonomy

TopicsForecasting Techniques and Applications