Towards More Realistic Membership Inference Attacks on Large Diffusion   Models

Jan Dubi\'nski; Antoni Kowalczuk; Stanis{\l}aw Pawlak; Przemys{\l}aw; Rokita; Tomasz Trzci\'nski; Pawe{\l} Morawiecki

arXiv:2306.12983·cs.LG·November 17, 2023·1 cites

Towards More Realistic Membership Inference Attacks on Large Diffusion Models

Jan Dubi\'nski, Antoni Kowalczuk, Stanis{\l}aw Pawlak, Przemys{\l}aw, Rokita, Tomasz Trzci\'nski, Pawe{\l} Morawiecki

PDF

Open Access 1 Datasets 1 Video

TL;DR

This paper investigates the feasibility of membership inference attacks on large diffusion models like Stable Diffusion, highlighting the challenges and proposing a fair evaluation framework to assess attack effectiveness.

Contribution

It introduces a novel, fair evaluation methodology for membership inference attacks on diffusion models and demonstrates that such attacks remain a significant privacy challenge.

Findings

01

Previous evaluation setups are insufficient for understanding attack effectiveness.

02

Membership inference attacks on large diffusion models are still largely effective.

03

Privacy and copyright concerns persist due to the difficulty in defending against these attacks.

Abstract

Generative diffusion models, including Stable Diffusion and Midjourney, can generate visually appealing, diverse, and high-resolution images for various applications. These models are trained on billions of internet-sourced images, raising significant concerns about the potential unauthorized use of copyright-protected images. In this paper, we examine whether it is possible to determine if a specific image was used in the training set, a problem known in the cybersecurity community and referred to as a membership inference attack. Our focus is on Stable Diffusion, and we address the challenge of designing a fair evaluation framework to answer this membership question. We propose a methodology to establish a fair evaluation setup and apply it to Stable Diffusion, enabling potential extensions to other generative models. Utilizing this evaluation setup, we execute membership attacks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

antoniaaa/laion_mi
dataset· 44 dl
44 dl

Videos

Towards More Realistic Membership Inference Attacks on Large Diffusion Models· youtube

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis

MethodsFocus · Diffusion