Sharing pattern submodels for prediction with missing values
Lena Stempfle, Ashkan Panahi, Fredrik D. Johansson

TL;DR
This paper introduces sharing pattern submodels for robust prediction with missing data, balancing pattern-specific accuracy and shared information, and demonstrating improved performance and interpretability.
Contribution
It proposes a novel sharing pattern submodel approach with regularization, providing robustness, interpretability, and theoretical guarantees for missing data prediction.
Findings
Achieves a good tradeoff between pattern specialization and information sharing.
Demonstrates improved predictive performance on synthetic and real-world data.
Provides theoretical conditions for optimal sharing models.
Abstract
Missing values are unavoidable in many applications of machine learning and present challenges both during training and at test time. When variables are missing in recurring patterns, fitting separate pattern submodels have been proposed as a solution. However, fitting models independently does not make efficient use of all available data. Conversely, fitting a single shared model to the full data set relies on imputation which often leads to biased results when missingness depends on unobserved factors. We propose an alternative approach, called sharing pattern submodels, which i) makes predictions that are robust to missing values at test time, ii) maintains or improves the predictive power of pattern submodels, and iii) has a short description, enabling improved interpretability. Parameter sharing is enforced through sparsity-inducing regularization which we prove leads to consistent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Hydrological Forecasting Using AI · Data Stream Mining Techniques
MethodsTest
