Pseudo-Bayesian Learning with Kernel Fourier Transform as Prior
Ga\"el Letarte, Emilie Morvant (LHC), Pascal Germain (MODAL)

TL;DR
This paper reinterprets kernel random Fourier features within a PAC-Bayesian framework, proposing new learning strategies that optimize generalization bounds and provide theoretical justification for kernel alignment methods.
Contribution
It introduces a PAC-Bayesian perspective on RFF, deriving bounds and proposing two novel learning strategies based on this interpretation.
Findings
Derived generalization bounds optimized by a pseudo-posterior
Proposed a landmarks-based data representation method
Provided PAC-Bayesian justification for kernel alignment
Abstract
We revisit Rahimi and Recht (2007)'s kernel random Fourier features (RFF) method through the lens of the PAC-Bayesian theory. While the primary goal of RFF is to approximate a kernel, we look at the Fourier transform as a prior distribution over trigonometric hypotheses. It naturally suggests learning a posterior on these hypotheses. We derive generalization bounds that are optimized by learning a pseudo-posterior obtained from a closed-form expression. Based on this study, we consider two learning strategies: The first one finds a compact landmarks-based representation of the data where each landmark is given by a distribution-tailored similarity measure, while the second one provides a PAC-Bayesian justification to the kernel alignment method of Sinha and Duchi (2016).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Bayesian Methods and Mixture Models · Machine Learning and Algorithms
