Statistical Efficiency of Score Matching: The View from Isoperimetry
Frederic Koehler, Alexander Heckett, Andrej Risteski

TL;DR
This paper investigates the statistical efficiency of score matching for training energy-based models, revealing its dependence on the isoperimetric properties of the target distribution and comparing it to maximum likelihood.
Contribution
It establishes a theoretical connection between score matching efficiency and isoperimetric constants, providing conditions under which score matching is nearly optimal or substantially less efficient.
Findings
Score matching is efficient for distributions with small isoperimetric constants.
Large isoperimetric constants lead to lower efficiency of score matching compared to MLE.
The results extend to both finite sample and asymptotic regimes, with parallels in discrete settings.
Abstract
Deep generative models parametrized up to a normalizing constant (e.g. energy-based models) are difficult to train by maximizing the likelihood of the data because the likelihood and/or gradients thereof cannot be explicitly or efficiently written down. Score matching is a training method, whereby instead of fitting the likelihood for the training data, we instead fit the score function -- obviating the need to evaluate the partition function. Though this estimator is known to be consistent, its unclear whether (and when) its statistical efficiency is comparable to that of maximum likelihood -- which is known to be (asymptotically) optimal. We initiate this line of inquiry in this paper, and show a tight connection between statistical efficiency of score matching and the isoperimetric properties of the distribution being estimated -- i.e. the Poincar\'e,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference · Markov Chains and Monte Carlo Methods
