Maximin effects in inhomogeneous large-scale data
Nicolai Meinshausen, Peter B\"uhlmann

TL;DR
This paper introduces the concept of maximin effects for regression in inhomogeneous large-scale data, providing a computationally efficient estimator with strong predictive performance across diverse data regimes.
Contribution
It proposes a new maximin effects estimator for reliable prediction in inhomogeneous data, offering faster computation than existing methods.
Findings
Estimator achieves good predictive accuracy across data regimes.
Computational speed is significantly faster than penalized regression methods.
Empirical examples demonstrate practical effectiveness.
Abstract
Large-scale data are often characterized by some degree of inhomogeneity as data are either recorded in different time regimes or taken from multiple sources. We look at regression models and the effect of randomly changing coefficients, where the change is either smoothly in time or some other dimension or even without any such structure. Fitting varying-coefficient models or mixture models can be appropriate solutions but are computationally very demanding and often return more information than necessary. If we just ask for a model estimator that shows good predictive properties for all regimes of the data, then we are aiming for a simple linear model that is reliable for all possible subsets of the data. We propose the concept of "maximin effects" and a suitable estimator and look at its prediction accuracy from a theoretical point of view in a mixture model with known or unknown…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
