On Bochner's and Polya's Characterizations of Positive-Definite Kernels and the Respective Random Feature Maps
Jie Chen, Dehua Cheng, Yan Liu

TL;DR
This paper explores Polya's characterization of positive-definite kernels, introduces new kernels, and demonstrates that a random binning feature map derived from Polya's criterion outperforms the traditional random Fourier map in approximating kernels.
Contribution
It studies Polya's kernel characterization, derives novel kernels, and compares the effectiveness of random binning versus random Fourier feature maps.
Findings
Random binning map provides a closer Euclidean inner product to the kernel.
Random binning outperforms random Fourier in kernel approximation.
Empirical results confirm the superiority of the binning approach in regression and classification.
Abstract
Positive-definite kernel functions are fundamental elements of kernel methods and Gaussian processes. A well-known construction of such functions comes from Bochner's characterization, which connects a positive-definite function with a probability distribution. Another construction, which appears to have attracted less attention, is Polya's criterion that characterizes a subset of these functions. In this paper, we study the latter characterization and derive a number of novel kernels little known previously. In the context of large-scale kernel machines, Rahimi and Recht (2007) proposed a random feature map (random Fourier) that approximates a kernel function, through independent sampling of the probability distribution in Bochner's characterization. The authors also suggested another feature map (random binning), which, although not explicitly stated, comes from Polya's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Gaussian Processes and Bayesian Inference · Stochastic Gradient Optimization Techniques
