Approximate cross-validation formula for Bayesian linear regression

Yoshiyuki Kabashima; Tomoyuki Obuchi; Makoto Uemura

arXiv:1610.07733·stat.ML·October 26, 2016·1 cites

Approximate cross-validation formula for Bayesian linear regression

Yoshiyuki Kabashima, Tomoyuki Obuchi, Makoto Uemura

PDF

Open Access

TL;DR

This paper introduces an approximate formula for efficiently estimating leave-one-out cross-validation error in Bayesian linear regression, reducing computational costs for large datasets.

Contribution

It develops a novel analytical approximation for CV error in Bayesian linear regression, avoiding the need for extensive re-computation.

Findings

01

The formula accurately estimates CV error in synthetic models.

02

Application to supernova data confirms practical usefulness.

03

Reduces computational cost significantly for large datasets.

Abstract

Cross-validation (CV) is a technique for evaluating the ability of statistical models/learning systems based on a given data set. Despite its wide applicability, the rather heavy computational cost can prevent its use as the system size grows. To resolve this difficulty in the case of Bayesian linear regression, we develop a formula for evaluating the leave-one-out CV error approximately without actually performing CV. The usefulness of the developed formula is tested by statistical mechanical analysis for a synthetic model. This is confirmed by application to a real-world supernova data set as well.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Fault Detection and Control Systems · Control Systems and Identification