Pointwise confidence estimation in the non-linear $\ell^2$-regularized least squares
Ilja Kuzborskij, Yasin Abbasi Yadkori

TL;DR
This paper develops a non-asymptotic, pointwise confidence estimation method for non-linear least squares with $ ext{l}^2$ regularization, accounting for the test input's similarity to training data, and demonstrates its effectiveness empirically.
Contribution
It introduces a novel confidence bound that scales with input similarity in the feature space and provides an efficient computation method, extending classical linear confidence intervals to non-linear settings.
Findings
The confidence bound adapts to the test input's distance from training data.
Empirical results show improved coverage/width trade-off over bootstrap methods.
The method is computationally efficient, close to gradient computation cost.
Abstract
We consider a high-probability non-asymptotic confidence estimation in the -regularized non-linear least-squares setting with fixed design. In particular, we study confidence estimation for local minimizers of the regularized training loss. We show a pointwise confidence bound, meaning that it holds for the prediction on any given fixed test input . Importantly, the proposed confidence bound scales with similarity of the test input to the training data in the implicit feature space of the predictor (for instance, becoming very large when the test input lies far outside of the training data). This desirable last feature is captured by the weighted norm involving the inverse-Hessian matrix of the objective function, which is a generalized version of its counterpart in the linear setting, . Our generalized result can be regarded as a non-asymptotic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Statistical Methods and Inference · Sparse and Compressive Sensing Techniques
