Risks When Sharing LoRA Fine-Tuned Diffusion Model Weights
Dixi Yao

TL;DR
This paper investigates privacy risks associated with sharing LoRA fine-tuned diffusion model weights, revealing that private images can be reconstructed from model weights alone, and existing defenses are ineffective.
Contribution
It introduces a variational autoencoder approach to reconstruct private images from model weights and evaluates the ineffectiveness of current privacy-preserving methods.
Findings
Adversaries can generate images of private identities from model weights.
Existing privacy defenses, including differential privacy, do not prevent leakage.
Sharing fine-tuned diffusion model weights poses significant privacy risks.
Abstract
With the emerging trend in generative models and convenient public access to diffusion models pre-trained on large datasets, users can fine-tune these models to generate images of personal faces or items in new contexts described by natural language. Parameter efficient fine-tuning (PEFT) such as Low Rank Adaptation (LoRA) has become the most common way to save memory and computation usage on the user end during fine-tuning. However, a natural question is whether the private images used for fine-tuning will be leaked to adversaries when sharing model weights. In this paper, we study the issue of privacy leakage of a fine-tuned diffusion model in a practical setting, where adversaries only have access to model weights, rather than prompts or images used for fine-tuning. We design and build a variational network autoencoder that takes model weights as input and outputs the reconstruction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Advanced Data Processing Techniques · Neural Networks and Applications
MethodsDiffusion
