Concentration Inequalities for Cross-validation in Scattered Data   Approximation

Felix Bartel; Ralf Hielscher

arXiv:2105.13790·math.NA·February 24, 2022

Concentration Inequalities for Cross-validation in Scattered Data Approximation

Felix Bartel, Ralf Hielscher

PDF

TL;DR

This paper develops a concentration inequality for cross-validation scores in scattered data approximation, providing non-asymptotic probabilistic bounds that enhance understanding of model selection reliability.

Contribution

It introduces a novel concentration inequality for cross-validation in scattered data approximation, offering non-asymptotic bounds with broad applicability and precise constants for Shepard's model.

Findings

01

Provides a high-probability bound on cross-validation error

02

Applies the inequality to Shepard's model with explicit constants

03

Numerical experiments demonstrate practical relevance

Abstract

Choosing models from a hypothesis space is a frequent task in approximation theory and inverse problems. Cross-validation is a classical tool in the learner's repertoire to compare the goodness of fit for different reconstruction models. Much work has been dedicated to computing this quantity in a fast manner but tackling its theoretical properties occurs to be difficult. So far, most optimality results are stated in an asymptotic fashion. In this paper we propose a concentration inequality on the difference of cross-validation score and the risk functional with respect to the squared error. This gives a pre-asymptotic bound which holds with high probability. For the assumptions we rely on bounds on the uniform error of the model which allow for a broadly applicable framework. We support our claims by applying this machinery to Shepard's model, where we are able to determine precise…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.