Understanding Generalization in Diffusion Distillation via Probability Flow Distance

Huijie Zhang; Zijian Huang; Siyi Chen; Jinfan Zhou; Zekai Zhang; Peng Wang; Qing Qu

arXiv:2505.20123·cs.LG·February 13, 2026

Understanding Generalization in Diffusion Distillation via Probability Flow Distance

Huijie Zhang, Zijian Huang, Siyi Chen, Jinfan Zhou, Zekai Zhang, Peng Wang, Qing Qu

PDF

Open Access

TL;DR

This paper introduces probability flow distance (PFD), a practical and theoretically grounded metric for measuring generalization in diffusion distillation models, revealing key behaviors like scaling, double descent, and bias-variance dynamics.

Contribution

The work proposes PFD as a new metric for evaluating generalization in diffusion distillation, connecting theoretical insights with empirical observations.

Findings

01

Quantitative scaling from memorization to generalization

02

Epoch-wise double descent training dynamics

03

Bias-variance decomposition in diffusion models

Abstract

Diffusion distillation provides an effective approach for learning lightweight and few-steps diffusion models with efficient generation. However, evaluating their generalization remains challenging: theoretical metrics are often impractical for high-dimensional data, while no practical metrics rigorously measure generalization. In this work, we bridge this gap by introducing probability flow distance (\texttt{PFD}), a theoretically grounded and computationally efficient metric to measure generalization. Specifically, \texttt{PFD} quantifies the distance between distributions by comparing their noise-to-data mappings induced by the probability flow ODE. Using \texttt{PFD} under the diffusion distillation setting, we empirically uncover several key generalization behaviors, including: (1) quantitative scaling behavior from memorization to generalization, (2) epoch-wise double descent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models

MethodsDiffusion