Beyond Ridge Regression for Distribution-Free Data
Koby Bibas, Meir Feder

TL;DR
This paper introduces a distribution-free prediction method for linear regression that combines luckiness functions with pNML, leading to improved accuracy and robustness over traditional ridge regression, especially under distribution shifts.
Contribution
It proposes the LpNML approach, integrating luckiness with pNML for linear regression, which enhances prediction accuracy and robustness compared to standard ridge regression.
Findings
Reduces Ridge ERM error by up to 20% on PMLB datasets.
Achieves up to 4.9% greater robustness under distribution shift.
Deviates from Ridge ERM predictions in low eigenvalue subspaces.
Abstract
In supervised batch learning, the predictive normalized maximum likelihood (pNML) has been proposed as the min-max regret solution for the distribution-free setting, where no distributional assumptions are made on the data. However, the pNML is not defined for a large capacity hypothesis class as over-parameterized linear regression. For a large class, a common approach is to use regularization or a model prior. In the context of online prediction where the min-max solution is the Normalized Maximum Likelihood (NML), it has been suggested to use NML with ``luckiness'': A prior-like function is applied to the hypothesis class, which reduces its effective size. Motivated by the luckiness concept, for linear regression we incorporate a luckiness function that penalizes the hypothesis proportionally to its l2 norm. This leads to the ridge regression solution. The associated pNML with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · COVID-19 epidemiological studies
MethodsTest · Linear Regression
