Sharing pattern submodels for prediction with missing values

Lena Stempfle; Ashkan Panahi; Fredrik D. Johansson

arXiv:2206.11161·cs.LG·November 27, 2023·1 cites

Sharing pattern submodels for prediction with missing values

Lena Stempfle, Ashkan Panahi, Fredrik D. Johansson

PDF

Open Access 1 Video

TL;DR

This paper introduces sharing pattern submodels for robust prediction with missing data, balancing pattern-specific accuracy and shared information, and demonstrating improved performance and interpretability.

Contribution

It proposes a novel sharing pattern submodel approach with regularization, providing robustness, interpretability, and theoretical guarantees for missing data prediction.

Findings

01

Achieves a good tradeoff between pattern specialization and information sharing.

02

Demonstrates improved predictive performance on synthetic and real-world data.

03

Provides theoretical conditions for optimal sharing models.

Abstract

Missing values are unavoidable in many applications of machine learning and present challenges both during training and at test time. When variables are missing in recurring patterns, fitting separate pattern submodels have been proposed as a solution. However, fitting models independently does not make efficient use of all available data. Conversely, fitting a single shared model to the full data set relies on imputation which often leads to biased results when missingness depends on unobserved factors. We propose an alternative approach, called sharing pattern submodels, which i) makes predictions that are robust to missing values at test time, ii) maintains or improves the predictive power of pattern submodels, and iii) has a short description, enabling improved interpretability. Parameter sharing is enforced through sparsity-inducing regularization which we prove leads to consistent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Sharing Pattern Submodels for Prediction with Missing Values· underline

Taxonomy

TopicsMachine Learning and Data Classification · Hydrological Forecasting Using AI · Data Stream Mining Techniques

MethodsTest