Bayes beats Cross Validation: Efficient and Accurate Ridge Regression via Expectation Maximization
Shu Yu Tew, Mario Boley, Daniel F. Schmidt

TL;DR
The paper introduces a Bayesian EM-based method for tuning ridge regression hyper-parameters that is faster and often more accurate than traditional LOOCV, especially with sparse data.
Contribution
It proposes a novel Bayesian EM approach for hyper-parameter tuning in ridge regression that guarantees a unique solution and reduces computational complexity.
Findings
The method finds a unique optimal hyper-parameter for large enough data.
It reduces computational complexity from O(n^2) to O(n) per iteration.
The approach outperforms LOOCV in speed and accuracy in large-scale settings.
Abstract
We present a novel method for tuning the regularization hyper-parameter, , of a ridge regression that is faster to compute than leave-one-out cross-validation (LOOCV) while yielding estimates of the regression parameters of equal, or particularly in the setting of sparse covariates, superior quality to those obtained by minimising the LOOCV risk. The LOOCV risk can suffer from multiple and bad local minima for finite and thus requires the specification of a set of candidate , which can fail to provide good solutions. In contrast, we show that the proposed method is guaranteed to find a unique optimal solution for large enough , under relatively mild conditions, without requiring the specification of any difficult to determine hyper-parameters. This is based on a Bayesian formulation of ridge regression that we prove to have a unimodal posterior for large enough…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Fault Detection and Control Systems · Statistical Methods and Inference
MethodsSparse Evolutionary Training
