A Mean-Field Theory for Kernel Alignment with Random Features in Generative and Discriminative Models
Masoud Badiei Khuzani, Liyue Shen, Shahin Shahrampour, Lei Xing

TL;DR
This paper introduces a mean-field theory-based supervised kernel learning method for generative and discriminative models, improving kernel optimization in MMD GANs and SVMs through a distributionally robust approach and particle SGD.
Contribution
It develops a novel distributionally robust kernel optimization framework using mean-field analysis and particle SGD, with proven convergence and improved hypothesis testing power.
Findings
Enhanced kernel MMD test power for larger thresholds.
Empirical results show superior performance over existing kernel learning methods.
Theoretical proof of convergence and consistency of the proposed method.
Abstract
We propose a novel supervised learning method to optimize the kernel in the maximum mean discrepancy generative adversarial networks (MMD GANs), and the kernel support vector machines (SVMs). Specifically, we characterize a distributionally robust optimization problem to compute a good distribution for the random feature model of Rahimi and Recht. Due to the fact that the distributional optimization is infinite dimensional, we consider a Monte-Carlo sample average approximation (SAA) to obtain a more tractable finite dimensional optimization problem. We subsequently leverage a particle stochastic gradient descent (SGD) method to solve the derived finite dimensional optimization problem. Based on a mean-field analysis, we then prove that the empirical distribution of the interactive particles system at each iteration of the SGD follows the path of the gradient descent flow on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Gaussian Processes and Bayesian Inference
MethodsStochastic Gradient Descent
