MetaCloak: Preventing Unauthorized Subject-driven Text-to-image   Diffusion-based Synthesis via Meta-learning

Yixin Liu; Chenrui Fan; Yutong Dai; Xun Chen; Pan Zhou; and Lichao Sun

arXiv:2311.13127·cs.CV·April 30, 2024·1 cites

MetaCloak: Preventing Unauthorized Subject-driven Text-to-image Diffusion-based Synthesis via Meta-learning

Yixin Liu, Chenrui Fan, Yutong Dai, Xun Chen, Pan Zhou, and Lichao Sun

PDF

Open Access 1 Repo

TL;DR

MetaCloak introduces a meta-learning based method to craft robust, transferable perturbations that prevent unauthorized personalized image generation from diffusion models, outperforming existing defenses.

Contribution

The paper proposes MetaCloak, a novel meta-learning framework with transformation sampling to generate robust, transferable perturbations against personalized diffusion-based image synthesis.

Findings

01

MetaCloak outperforms existing defenses on VGGFace2 and CelebA-HQ datasets.

02

MetaCloak successfully fools online services like Replicate in black-box settings.

03

The method enhances robustness against simple data transformations like Gaussian filtering.

Abstract

Text-to-image diffusion models allow seamless generation of personalized images from scant reference photos. Yet, these tools, in the wrong hands, can fabricate misleading or harmful content, endangering individuals. To address this problem, existing poisoning-based approaches perturb user images in an imperceptible way to render them "unlearnable" from malicious uses. We identify two limitations of these defending approaches: i) sub-optimal due to the hand-crafted heuristics for solving the intractable bilevel optimization and ii) lack of robustness against simple data transformations like Gaussian filtering. To solve these challenges, we propose MetaCloak, which solves the bi-level poisoning problem with a meta-learning framework with an additional transformation sampling process to craft transferable and robust perturbation. Specifically, we employ a pool of surrogate diffusion…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liuyixin-louis/metacloak
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis

MethodsDiffusion