Evaluating the Robustness of Text-to-image Diffusion Models against   Real-world Attacks

Hongcheng Gao; Hao Zhang; Yinpeng Dong; Zhijie Deng

arXiv:2306.13103·cs.CR·June 26, 2023·5 cites

Evaluating the Robustness of Text-to-image Diffusion Models against Real-world Attacks

Hongcheng Gao, Hao Zhang, Yinpeng Dong, Zhijie Deng

PDF

Open Access 1 Models

TL;DR

This paper evaluates the robustness of text-to-image diffusion models against realistic, real-world input errors like typos and phonetic mistakes, revealing significant vulnerabilities through novel black-box attack methods.

Contribution

It introduces the first robustness evaluation of T2I diffusion models against realistic input errors and develops novel distribution-based black-box attack objectives.

Findings

01

T2I models are vulnerable to realistic input errors.

02

Proposed attacks effectively mislead popular T2I models.

03

Robustness issues are not limited to text encoders.

Abstract

Text-to-image (T2I) diffusion models (DMs) have shown promise in generating high-quality images from textual descriptions. The real-world applications of these models require particular attention to their safety and fidelity, but this has not been sufficiently explored. One fundamental question is whether existing T2I DMs are robust against variations over input texts. To answer it, this work provides the first robustness evaluation of T2I DMs against real-world attacks. Unlike prior studies that focus on malicious attacks involving apocryphal alterations to the input texts, we consider an attack space spanned by realistic errors (e.g., typo, glyph, phonetic) that humans can make, to ensure semantic consistency. Given the inherent randomness of the generation process, we develop novel distribution-based attack objectives to mislead T2I DMs. We perform attacks in a black-box manner…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
Shion1124/anime-character-lcm-lora
model· 14 dl
14 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Chaos-based Image/Signal Encryption

MethodsFocus · Diffusion