GenAI Confessions: Black-box Membership Inference for Generative Image Models

Matyas Bohacek; Hany Farid

arXiv:2501.06399·cs.CV·August 14, 2025

GenAI Confessions: Black-box Membership Inference for Generative Image Models

Matyas Bohacek, Hany Farid

PDF

1 Datasets

TL;DR

This paper introduces a black-box membership inference method to determine if specific images were part of a generative AI model's training data, aiding in model auditing and intellectual property protection.

Contribution

The paper presents a novel, efficient black-box approach for membership inference on generative image models without requiring model architecture or weights.

Findings

01

Method effectively identifies training images in black-box settings

02

Approach is computationally efficient and architecture-agnostic

03

Enables auditing and fair use assessment of generative models

Abstract

From a simple text prompt, generative-AI image models can create stunningly realistic and creative images bounded, it seems, by only our imagination. These models have achieved this remarkable feat thanks, in part, to the ingestion of billions of images collected from nearly every corner of the internet. Many creators have understandably expressed concern over how their intellectual property has been ingested without their permission or a mechanism to opt out of training. As a result, questions of fair use and copyright infringement have quickly emerged. We describe a method that allows us to determine if a model was trained on a specific image or set of images. This method is computationally efficient and assumes no explicit knowledge of the model architecture or weights (so-called black-box membership inference). We anticipate that this method will be crucial for auditing existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

faridlab/stroll
dataset· 76 dl
76 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSparse Evolutionary Training · OPT