Towards Black-Box Membership Inference Attack for Diffusion Models

Jingwei Li; Jing Dong; Tianxing He; Jingzhao Zhang

arXiv:2405.20771·cs.CR·August 14, 2025·1 cites

Towards Black-Box Membership Inference Attack for Diffusion Models

Jingwei Li, Jing Dong, Tianxing He, Jingzhao Zhang

PDF

Open Access

TL;DR

This paper presents a novel black-box membership inference attack on diffusion models that does not require internal model access, effectively identifying training data membership using only API calls and output comparisons.

Contribution

The work introduces a new black-box MIA method for diffusion models that operates solely via API, outperforming existing techniques and extending to Diffusion Transformer architectures.

Findings

01

Outperforms previous MIA methods on DDIM and Stable Diffusion

02

Effective in black-box setting without internal model access

03

Extends to Diffusion Transformer architecture

Abstract

Given the rising popularity of AI-generated art and the associated copyright concerns, identifying whether an artwork was used to train a diffusion model is an important research topic. The work approaches this problem from the membership inference attack (MIA) perspective. We first identify the limitation of applying existing MIA methods for proprietary diffusion models: the required access of internal U-nets. To address the above problem, we introduce a novel membership inference attack method that uses only the image-to-image variation API and operates without access to the model's internal U-net. Our method is based on the intuition that the model can more easily obtain an unbiased noise prediction estimate for images from the training set. By applying the API multiple times to the target image, averaging the outputs, and comparing the result to the original image, our approach can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning

MethodsAttention Is All You Need · Dense Connections · Label Smoothing · Dropout · Linear Layer · Layer Normalization · Byte Pair Encoding · Adam · Residual Connection · Softmax