Low Resource Reconstruction Attacks Through Benign Prompts
Sol Yarkoni, Mahmood Sharif, Roi Livni

TL;DR
This paper introduces a low-resource, accessible attack method that identifies benign prompts capable of unintentionally reconstructing sensitive images from generative models, highlighting privacy risks even for non-expert users.
Contribution
The authors present a novel low-resource attack technique that uncovers risks of unintentional image reconstruction from benign prompts, exposing fundamental privacy vulnerabilities in generative models.
Findings
Benign prompts can generate sensitive images, including real individuals.
Reconstruction risks are linked to scraped e-commerce data and templated layouts.
The attack requires minimal resources and little access to training data.
Abstract
Recent advances in generative models, such as diffusion models, have raised concerns related to privacy, copyright infringement, and data stewardship. To better understand and control these risks, prior work has introduced techniques and attacks that reconstruct images, or parts of images, from training data. While these results demonstrate that training data can be recovered, existing methods often rely on high computational resources, partial access to the training set, or carefully engineered prompts. In this work, we present a new attack that requires low resources, assumes little to no access to the training data, and identifies seemingly benign prompts that can lead to potentially risky image reconstruction. We further show that such reconstructions may occur unintentionally, even for users without specialized knowledge. For example, we observe that for one existing model, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Adversarial Robustness in Machine Learning
