Sampling Multimodal Distributions with the Vanilla Score: Benefits of Data-Based Initialization
Frederic Koehler, Thuy-Duong Vuong

TL;DR
This paper demonstrates that using Langevin diffusion with early stopping, initialized at the empirical distribution and based on estimated score functions, effectively samples multimodal distributions, addressing a longstanding challenge in score-based modeling.
Contribution
It introduces a natural sampling method for multimodal distributions using vanilla score functions with early stopping and data-based initialization.
Findings
Langevin diffusion with early stopping can generate multimodal distributions.
Initialization at the empirical distribution improves sampling quality.
The method effectively samples mixtures of log-concave distributions.
Abstract
There is a long history, as well as a recent explosion of interest, in statistical and generative modeling approaches based on score functions -- derivatives of the log-likelihood of a distribution. In seminal works, Hyv\"arinen proposed vanilla score matching as a way to learn distributions from data by computing an estimate of the score function of the underlying ground truth, and established connections between this method and established techniques like Contrastive Divergence and Pseudolikelihood estimation. It is by now well-known that vanilla score matching has significant difficulties learning multimodal distributions. Although there are various ways to overcome this difficulty, the following question has remained unanswered -- is there a natural way to sample multimodal distributions using just the vanilla score? Inspired by a long line of related experimental works, we prove…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Bayesian Methods and Mixture Models · Time Series Analysis and Forecasting
MethodsDiffusion
