Bayes beats Cross Validation: Efficient and Accurate Ridge Regression   via Expectation Maximization

Shu Yu Tew; Mario Boley; Daniel F. Schmidt

arXiv:2310.18860·stat.ML·November 6, 2023·1 cites

Bayes beats Cross Validation: Efficient and Accurate Ridge Regression via Expectation Maximization

Shu Yu Tew, Mario Boley, Daniel F. Schmidt

PDF

Open Access 1 Repo 1 Video

TL;DR

The paper introduces a Bayesian EM-based method for tuning ridge regression hyper-parameters that is faster and often more accurate than traditional LOOCV, especially with sparse data.

Contribution

It proposes a novel Bayesian EM approach for hyper-parameter tuning in ridge regression that guarantees a unique solution and reduces computational complexity.

Findings

01

The method finds a unique optimal hyper-parameter for large enough data.

02

It reduces computational complexity from O(n^2) to O(n) per iteration.

03

The approach outperforms LOOCV in speed and accuracy in large-scale settings.

Abstract

We present a novel method for tuning the regularization hyper-parameter, $λ$ , of a ridge regression that is faster to compute than leave-one-out cross-validation (LOOCV) while yielding estimates of the regression parameters of equal, or particularly in the setting of sparse covariates, superior quality to those obtained by minimising the LOOCV risk. The LOOCV risk can suffer from multiple and bad local minima for finite $n$ and thus requires the specification of a set of candidate $λ$ , which can fail to provide good solutions. In contrast, we show that the proposed method is guaranteed to find a unique optimal solution for large enough $n$ , under relatively mild conditions, without requiring the specification of any difficult to determine hyper-parameters. This is based on a Bayesian formulation of ridge regression that we prove to have a unimodal posterior for large enough…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

marioboley/fastridge
none

Videos

Bayes beats Cross Validation: Efficient and Accurate Ridge Regression via Expectation Maximization· slideslive

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Fault Detection and Control Systems · Statistical Methods and Inference

MethodsSparse Evolutionary Training