Statistical Efficiency of Score Matching: The View from Isoperimetry

Frederic Koehler; Alexander Heckett; Andrej Risteski

arXiv:2210.00726·cs.LG·December 26, 2022·6 cites

Statistical Efficiency of Score Matching: The View from Isoperimetry

Frederic Koehler, Alexander Heckett, Andrej Risteski

PDF

Open Access 1 Video

TL;DR

This paper investigates the statistical efficiency of score matching for training energy-based models, revealing its dependence on the isoperimetric properties of the target distribution and comparing it to maximum likelihood.

Contribution

It establishes a theoretical connection between score matching efficiency and isoperimetric constants, providing conditions under which score matching is nearly optimal or substantially less efficient.

Findings

01

Score matching is efficient for distributions with small isoperimetric constants.

02

Large isoperimetric constants lead to lower efficiency of score matching compared to MLE.

03

The results extend to both finite sample and asymptotic regimes, with parallels in discrete settings.

Abstract

Deep generative models parametrized up to a normalizing constant (e.g. energy-based models) are difficult to train by maximizing the likelihood of the data because the likelihood and/or gradients thereof cannot be explicitly or efficiently written down. Score matching is a training method, whereby instead of fitting the likelihood $lo g p (x)$ for the training data, we instead fit the score function $\nabla_{x} lo g p (x)$ -- obviating the need to evaluate the partition function. Though this estimator is known to be consistent, its unclear whether (and when) its statistical efficiency is comparable to that of maximum likelihood -- which is known to be (asymptotically) optimal. We initiate this line of inquiry in this paper, and show a tight connection between statistical efficiency of score matching and the isoperimetric properties of the distribution being estimated -- i.e. the Poincar\'e,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Statistical Efficiency of Score Matching: The View from Isoperimetry· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference · Markov Chains and Monte Carlo Methods