DIAGNOSIS: Detecting Unauthorized Data Usages in Text-to-image Diffusion Models

Zhenting Wang; Chen Chen; Lingjuan Lyu; Dimitris N. Metaxas; Shiqing Ma

arXiv:2307.03108 · cs.CV · April 10, 2024

TL;DR

This paper introduces a method for detecting unauthorized data usage in text-to-image diffusion models: unique, nearly imperceptible content is injected into the protected training images, and a model is then checked for memorization of that content.

Contribution

It proposes modifying protected images with stealthy image warping functions so that unauthorized data usage in diffusion models can be detected, validated across multiple training methods and models.

Findings

01. Effective detection of unauthorized data usage in diffusion models.

02. Works across various training and fine-tuning methods.

03. High detection accuracy demonstrated in experiments.

Abstract

Recent text-to-image diffusion models have shown surprising performance in generating high-quality images. However, concerns have arisen regarding unauthorized data usage during the training or fine-tuning process. One example is when a model trainer collects a set of images created by a particular artist and attempts to train a model capable of generating similar images without obtaining permission from or giving credit to the artist. To address this issue, we propose a method for detecting such unauthorized data usage by planting injected memorization in text-to-image diffusion models trained on the protected dataset. Specifically, we modify the protected images by adding unique content to them using stealthy image warping functions that are nearly imperceptible to humans but can be captured and memorized by diffusion models. By analyzing whether the model has…
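The warping step mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's warping function: the name `smooth_warp`, the sinusoidal displacement field, and the `amplitude` and `periods` parameters are assumptions chosen only to show what a small, smooth geometric warp of an image looks like in code.

```python
import numpy as np


def smooth_warp(image: np.ndarray, amplitude: float = 1.0, periods: float = 2.0) -> np.ndarray:
    """Resample an H x W (x C) float image along a small sinusoidal displacement field.

    Hypothetical illustration only: the displacement field here is an
    assumption, not the warping function used in the paper.
    """
    h, w = image.shape[:2]
    ys, xs = np.meshgrid(
        np.arange(h, dtype=np.float64),
        np.arange(w, dtype=np.float64),
        indexing="ij",
    )

    # Displacement of at most `amplitude` pixels in each direction.
    dx = amplitude * np.sin(2.0 * np.pi * periods * ys / h)
    dy = amplitude * np.sin(2.0 * np.pi * periods * xs / w)

    # Backward mapping: each output pixel reads from a shifted source location.
    src_x = np.clip(xs + dx, 0, w - 1)
    src_y = np.clip(ys + dy, 0, h - 1)

    # Bilinear interpolation between the four neighbouring source pixels.
    x0 = np.floor(src_x).astype(int)
    y0 = np.floor(src_y).astype(int)
    x1 = np.minimum(x0 + 1, w - 1)
    y1 = np.minimum(y0 + 1, h - 1)
    fx = src_x - x0
    fy = src_y - y0
    if image.ndim == 3:
        # Broadcast the interpolation weights over the channel axis.
        fx = fx[..., None]
        fy = fy[..., None]

    top = image[y0, x0] * (1.0 - fx) + image[y0, x1] * fx
    bottom = image[y1, x0] * (1.0 - fx) + image[y1, x1] * fx
    return top * (1.0 - fy) + bottom * fy


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    img = rng.random((64, 64, 3))  # stand-in for a protected image in [0, 1]
    warped = smooth_warp(img, amplitude=1.0)
    print("mean absolute pixel change:", float(np.abs(warped - img).mean()))
```

Keeping `amplitude` at a pixel or so keeps the geometric change small, and setting it to zero returns the input unchanged. Whether a warp of this particular form is imperceptible to humans or learnable by a diffusion model is not something this sketch establishes.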

Code & Models

Repositories

zhentingwang/diagnosis (PyTorch, official)

Taxonomy

Topics: Authorship Attribution and Profiling

Methods: Diffusion