Revisiting Diffusion Model Predictions Through Dimensionality

Qing Jin; Chaoyang Wang

arXiv:2601.21419·cs.LG·March 30, 2026

Revisiting Diffusion Model Predictions Through Dimensionality

Qing Jin, Chaoyang Wang

PDF

TL;DR

This paper develops a theoretical framework explaining why data prediction becomes optimal in high-dimensional diffusion models and introduces k-Diff, a data-driven method to select the best prediction target.

Contribution

It provides a formal analysis linking data geometry to prediction target optimality and proposes k-Diff to automatically learn this target without explicit dimension estimation.

Findings

01

Theoretical justification for x-prediction superiority in high ambient dimensions.

02

k-Diff outperforms fixed-target baselines in image generation tasks.

03

Demonstrates the importance of data geometry in diffusion model predictions.

Abstract

Recent advances in diffusion and flow matching models have highlighted a shift in the preferred prediction target -- moving from noise ( $ε$ ) and velocity (v) to direct data (x) prediction -- particularly in high-dimensional settings. However, a formal explanation of why the optimal target depends on the specific properties of the data remains elusive. In this work, we provide a theoretical framework based on a generalized prediction formulation that accommodates arbitrary output targets, of which $ε$ -, v-, and x-prediction are special cases. We derive the analytical relationship between data's geometry and the optimal prediction target, offering a rigorous justification for why x-prediction becomes superior when the ambient dimension significantly exceeds the data's intrinsic dimension. Furthermore, while our theory identifies dimensionality as the governing factor…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.