A Mean-Field Theory for Learning the Sch\"{o}nberg Measure of Radial Basis Functions
Masoud Badiei Khuzani, Yinyu Ye, Sandy Napel, Lei Xing

TL;DR
This paper introduces a mean-field theoretical framework for learning the Schönberg measure of radial basis functions using a projected particle Langevin method, providing convergence guarantees and practical applications in image retrieval and classification.
Contribution
It develops a novel mean-field analysis of a Langevin-based optimization method for learning radial basis functions, including PDE characterization and steady-state analysis.
Findings
Convergence of empirical measure to a reflected Itô diffusion law.
Derivation of a McKean-Vlasov PDE with Robin boundary conditions.
Successful application to image retrieval and classification tasks.
Abstract
We develop and analyze a projected particle Langevin optimization method to learn the distribution in the Sch\"{o}nberg integral representation of the radial basis functions from training samples. More specifically, we characterize a distributionally robust optimization method with respect to the Wasserstein distance to optimize the distribution in the Sch\"{o}nberg integral representation. To provide theoretical performance guarantees, we analyze the scaling limits of a projected particle online (stochastic) optimization method in the mean-field regime. In particular, we prove that in the scaling limits, the empirical measure of the Langevin particles converges to the law of a reflected It\^{o} diffusion-drift process. Moreover, the drift is also a function of the law of the underlying process. Using It\^{o} lemma for semi-martingales and Grisanov's change of measure for the Wiener…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Stochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis
