Low Resource Reconstruction Attacks Through Benign Prompts

Sol Yarkoni; Mahmood Sharif; Roi Livni

arXiv:2507.07947·cs.LG·January 8, 2026

Low Resource Reconstruction Attacks Through Benign Prompts

Sol Yarkoni, Mahmood Sharif, Roi Livni

PDF

Open Access

TL;DR

This paper introduces a low-resource, accessible attack method that identifies benign prompts capable of unintentionally reconstructing sensitive images from generative models, highlighting privacy risks even for non-expert users.

Contribution

The authors present a novel low-resource attack technique that uncovers risks of unintentional image reconstruction from benign prompts, exposing fundamental privacy vulnerabilities in generative models.

Findings

01

Benign prompts can generate sensitive images, including real individuals.

02

Reconstruction risks are linked to scraped e-commerce data and templated layouts.

03

The attack requires minimal resources and little access to training data.

Abstract

Recent advances in generative models, such as diffusion models, have raised concerns related to privacy, copyright infringement, and data stewardship. To better understand and control these risks, prior work has introduced techniques and attacks that reconstruct images, or parts of images, from training data. While these results demonstrate that training data can be recovered, existing methods often rely on high computational resources, partial access to the training set, or carefully engineered prompts. In this work, we present a new attack that requires low resources, assumes little to no access to the training data, and identifies seemingly benign prompts that can lead to potentially risky image reconstruction. We further show that such reconstructions may occur unintentionally, even for users without specialized knowledge. For example, we observe that for one existing model, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Adversarial Robustness in Machine Learning