Self-Consuming Generative Models with Curated Data Provably Optimize   Human Preferences

Damien Ferbach; Quentin Bertrand; Avishek Joey Bose; Gauthier Gidel

arXiv:2407.09499·cs.CV·July 16, 2024·1 cites

Self-Consuming Generative Models with Curated Data Provably Optimize Human Preferences

Damien Ferbach, Quentin Bertrand, Avishek Joey Bose, Gauthier Gidel

PDF

Open Access 1 Video

TL;DR

This paper analyzes how curated human feedback in generative models influences iterative retraining, showing it acts as an implicit preference optimizer and affects model stability and bias amplification.

Contribution

It provides a theoretical framework for understanding data curation's role in self-consuming generative models and demonstrates its impact on reward maximization and bias.

Findings

01

Curated data can implicitly optimize human preferences.

02

Theoretical proof of reward maximization with curated data.

03

Bias amplification observed in experiments.

Abstract

The rapid progress in generative models has resulted in impressive leaps in generation quality, blurring the lines between synthetic and real data. Web-scale datasets are now prone to the inevitable contamination by synthetic data, directly impacting the training of future generated models. Already, some theoretical results on self-consuming generative models (a.k.a., iterative retraining) have emerged in the literature, showcasing that either model collapse or stability could be possible depending on the fraction of generated data used at each retraining step. However, in practice, synthetic data is often subject to human feedback and curated by users before being used and uploaded online. For instance, many interfaces of popular text-to-image generative models, such as Stable Diffusion or Midjourney, produce several variations of an image for a given query which can eventually be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Self-Consuming Generative Models with Curated Data Provably Optimize Human Preferences· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image and Video Retrieval Techniques · Computer Graphics and Visualization Techniques

MethodsDiffusion