ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation
Yi Zhang, Yun Tang, Wenjie Ruan, Xiaowei Huang, Siddartha Khastgir,, Paul Jennings, Xingyu Zhao

TL;DR
ProTIP introduces a probabilistic framework with statistical guarantees to evaluate the robustness of text-to-image diffusion models against stochastic perturbations, addressing computational challenges and providing practical insights.
Contribution
It presents a novel probabilistic robustness verification method, ProTIP, with efficient statistical testing for high-dimensional generative models.
Findings
ProTIP effectively evaluates robustness with statistical guarantees.
The framework is more efficient than existing methods.
Application to defense ranking demonstrates practical utility.
Abstract
Text-to-Image (T2I) Diffusion Models (DMs) have shown impressive abilities in generating high-quality images based on simple text descriptions. However, as is common with many Deep Learning (DL) models, DMs are subject to a lack of robustness. While there are attempts to evaluate the robustness of T2I DMs as a binary or worst-case problem, they cannot answer how robust in general the model is whenever an adversarial example (AE) can be found. In this study, we first introduce a probabilistic notion of T2I DMs' robustness; and then establish an efficient framework, ProTIP, to evaluate it with statistical guarantees. The main challenges stem from: i) the high computational cost of the generation process; and ii) determining if a perturbed input is an AE involves comparing two output distributions, which is fundamentally harder compared to other DL tasks like classification where an AE is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Biomedical Text Mining and Ontologies · Machine Learning in Healthcare
MethodsDiffusion · Early Stopping · Autoencoders
