Biased Generalization in Diffusion Models

Jerome Garnier-Brun; Luca Biggio; Davide Beltrame; Marc M\'ezard; Luca Saglietti

arXiv:2603.03469·cs.LG·March 5, 2026

Biased Generalization in Diffusion Models

Jerome Garnier-Brun, Luca Biggio, Davide Beltrame, Marc M\'ezard, Luca Saglietti

PDF

Open Access

TL;DR

This paper reveals a phase of biased generalization in diffusion models where models favor training data proximity, challenging the assumption that stopping at test loss minimum ensures optimal generalization.

Contribution

It introduces a quantitative measure of bias in generative models, demonstrates its presence on real images, and explains its mechanism through a hierarchical data model.

Findings

01

Bias increases with training, favoring training data proximity.

02

Early stopping at test loss minimum may not prevent privacy risks.

03

Bias is linked to sequential feature learning in deep networks.

Abstract

Generalization in generative modeling is defined as the ability to learn an underlying distribution from a finite dataset and produce novel samples, with evaluation largely driven by held-out performance and perceived sample quality. In practice, training is often stopped at the minimum of the test loss, taken as an operational indicator of generalization. We challenge this viewpoint by identifying a phase of biased generalization during training, in which the model continues to decrease the test loss while favoring samples with anomalously high proximity to training data. By training the same network on two disjoint datasets and comparing the mutual distances of generated samples and their similarity to training data, we introduce a quantitative measure of bias and demonstrate its presence on real images. We then study the mechanism of bias, using a controlled hierarchical data model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Privacy-Preserving Technologies in Data · Face recognition and analysis