Pointwise confidence estimation in the non-linear $\ell^2$-regularized least squares

Ilja Kuzborskij; Yasin Abbasi Yadkori

arXiv:2506.07088·cs.LG·June 12, 2025

Pointwise confidence estimation in the non-linear $\ell^2$-regularized least squares

Ilja Kuzborskij, Yasin Abbasi Yadkori

PDF

Open Access

TL;DR

This paper develops a non-asymptotic, pointwise confidence estimation method for non-linear least squares with $ ext{l}^2$ regularization, accounting for the test input's similarity to training data, and demonstrates its effectiveness empirically.

Contribution

It introduces a novel confidence bound that scales with input similarity in the feature space and provides an efficient computation method, extending classical linear confidence intervals to non-linear settings.

Findings

01

The confidence bound adapts to the test input's distance from training data.

02

Empirical results show improved coverage/width trade-off over bootstrap methods.

03

The method is computationally efficient, close to gradient computation cost.

Abstract

We consider a high-probability non-asymptotic confidence estimation in the $ℓ^{2}$ -regularized non-linear least-squares setting with fixed design. In particular, we study confidence estimation for local minimizers of the regularized training loss. We show a pointwise confidence bound, meaning that it holds for the prediction on any given fixed test input $x$ . Importantly, the proposed confidence bound scales with similarity of the test input to the training data in the implicit feature space of the predictor (for instance, becoming very large when the test input lies far outside of the training data). This desirable last feature is captured by the weighted norm involving the inverse-Hessian matrix of the objective function, which is a generalized version of its counterpart in the linear setting, $x^{⊤} Cov^{- 1} x$ . Our generalized result can be regarded as a non-asymptotic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Statistical Methods and Inference · Sparse and Compressive Sensing Techniques