From optimal score matching to optimal sampling

Zehao Dou; Subhodh Kotekal; Zhehao Xu; Harrison H. Zhou

arXiv:2409.07032·stat.ML·September 12, 2024·2 cites

From optimal score matching to optimal sampling

Zehao Dou, Subhodh Kotekal, Zhehao Xu, Harrison H. Zhou

PDF

Open Access

TL;DR

This paper establishes the optimal statistical rates for score estimation in diffusion models, providing theoretical guarantees that improve understanding of their density estimation capabilities without extra assumptions.

Contribution

It derives the sharp minimax rate of score estimation for smooth densities, clarifying the theoretical limits of score matching in diffusion models.

Findings

01

Optimal score estimation rate derived for smooth densities.

02

Sample generated from diffusion model achieves minimax density estimation rate.

03

No extraneous logarithmic factors or early stopping needed for optimality.

Abstract

The recent, impressive advances in algorithmic generation of high-fidelity image, audio, and video are largely due to great successes in score-based diffusion models. A key implementing step is score matching, that is, the estimation of the score function of the forward diffusion process from training data. As shown in earlier literature, the total variation distance between the law of a sample generated from the trained diffusion model and the ground truth distribution can be controlled by the score matching risk. Despite the widespread use of score-based diffusion models, basic theoretical questions concerning exact optimal statistical rates for score estimation and its application to density estimation remain open. We establish the sharp minimax rate of score estimation for smooth, compactly supported densities. Formally, given \(n\) i.i.d. samples from an unknown…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Sports Analytics and Performance

MethodsEarly Stopping · Diffusion