GUDA: Counterfactual Group-wise Training Data Attribution for Diffusion Models via Unlearning

Naoki Murata; Yuhta Takida; Chieh-Hsin Lai; Toshimitsu Uesaka; Bac Nguyen; Stefano Ermon; and Yuki Mitsufuji

arXiv:2601.22651·cs.LG·February 2, 2026

GUDA: Counterfactual Group-wise Training Data Attribution for Diffusion Models via Unlearning

Naoki Murata, Yuhta Takida, Chieh-Hsin Lai, Toshimitsu Uesaka, Bac Nguyen, Stefano Ermon, and Yuki Mitsufuji

PDF

Open Access

TL;DR

GUDA introduces an efficient method for group-wise data attribution in diffusion models by approximating counterfactuals through unlearning, enabling reliable influence estimation without costly retraining.

Contribution

It proposes GUDA, a novel unlearning-based approach that approximates counterfactual models for group attribution, significantly reducing computational costs compared to LOGO retraining.

Findings

01

GUDA outperforms semantic similarity and gradient-based methods in identifying influential groups.

02

It achieves 100x speedup over LOGO retraining on CIFAR-10.

03

GUDA reliably attributes artistic styles in diffusion models.

Abstract

Training-data attribution for vision generative models aims to identify which training data influenced a given output. While most methods score individual examples, practitioners often need group-level answers (e.g., artistic styles or object classes). Group-wise attribution is counterfactual: how would a model's behavior on a generated sample change if a group were absent from training? A natural realization of this counterfactual is Leave-One-Group-Out (LOGO) retraining, which retrains the model with each group removed; however, it becomes computationally prohibitive as the number of groups grows. We propose GUDA (Group Unlearning-based Data Attribution) for diffusion models, which approximates each counterfactual model by applying machine unlearning to a shared full-data model instead of training from scratch. GUDA quantifies group influence using differences in a likelihood-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Aesthetic Perception and Analysis · Domain Adaptation and Few-Shot Learning