Richer Bayesian Last Layers with Subsampled NTK Features
Sergio Calvo-Ordo\~nez, Jonathan Plenk, Richard Bergna, \'Alvaro Cartea, Yarin Gal, Jose Miguel Hern\'andez-Lobato, Kamil Ciosek

TL;DR
This paper enhances Bayesian Last Layers by incorporating subsampled Neural Tangent Kernel features to better estimate epistemic uncertainty with low computational overhead.
Contribution
It introduces a method that projects NTK features onto last-layer features, improving uncertainty estimation while maintaining efficiency.
Findings
Posterior variances are provably greater or equal to standard BLLs.
The method improves calibration and uncertainty estimates across various tasks.
Subsampling reduces computational cost with theoretical approximation bounds.
Abstract
Bayesian Last Layers (BLLs) provide a convenient and computationally efficient way to estimate uncertainty in neural networks. However, they underestimate epistemic uncertainty because they apply a Bayesian treatment only to the final layer, ignoring uncertainty induced by earlier layers. We propose a method that improves BLLs by leveraging a projection of Neural Tangent Kernel (NTK) features onto the space spanned by the last-layer features. This enables posterior inference that accounts for variability of the full network while retaining the low computational cost of inference of a standard BLL. We show that our method yields posterior variances that are provably greater or equal to those of a standard BLL, correcting its tendency to underestimate epistemic uncertainty. To further reduce computational cost, we introduce a uniform subsampling scheme for estimating the projection matrix…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Gaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis
