The Stronger the Diffusion Model, the Easier the Backdoor: Data   Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline

Haonan Wang; Qianli Shen; Yao Tong; Yang Zhang; Kenji Kawaguchi

arXiv:2401.04136·cs.CR·May 28, 2024·2 cites

The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline

Haonan Wang, Qianli Shen, Yao Tong, Yang Zhang, Kenji Kawaguchi

PDF

Open Access

TL;DR

This paper introduces SilentBadDiffusion, a backdoor attack method that subtly poisons training data to induce copyright infringement in diffusion models, revealing vulnerabilities in current protection strategies.

Contribution

It formalizes a novel copyright infringement attack on generative AI and demonstrates its effectiveness without altering the training pipeline.

Findings

01

Poisoning data with 0.20% ratio enables copyright infringement.

02

More advanced diffusion models are more susceptible to the attack.

03

Stealthy poisoning data effectively induces copyrighted image generation.

Abstract

The commercialization of text-to-image diffusion models (DMs) brings forth potential copyright concerns. Despite numerous attempts to protect DMs from copyright issues, the vulnerabilities of these solutions are underexplored. In this study, we formalized the Copyright Infringement Attack on generative AI models and proposed a backdoor attack method, SilentBadDiffusion, to induce copyright infringement without requiring access to or control over training processes. Our method strategically embeds connections between pieces of copyrighted information and text references in poisoning data while carefully dispersing that information, making the poisoning data inconspicuous when integrated into a clean dataset. Our experiments show the stealth and efficacy of the poisoning data. When given specific text prompts, DMs trained with a poisoning ratio of 0.20% can produce copyrighted images.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Law in Society and Culture

MethodsInpainting · Diffusion