Differentially Private Adaptation of Diffusion Models via Noisy Aggregated Embeddings
Pura Peetathawatchai, Wei-Ning Chen, Berivan Isik, Sanmi Koyejo, Albert No

TL;DR
This paper introduces DPAgg-TI, a novel method for privately adapting diffusion models using noisy aggregation of textual inversion embeddings, which outperforms traditional DP-SGD fine-tuning especially with small, sensitive datasets.
Contribution
The paper presents DPAgg-TI, a new privacy-preserving adaptation technique that leverages noisy embedding aggregation, significantly improving utility over DP-SGD in small data regimes.
Findings
DPAgg-TI achieves high fidelity style adaptation with privacy guarantees.
DPAgg-TI outperforms DP-SGD in utility and robustness.
Method successfully adapts to sensitive artwork and pictograms.
Abstract
Personalizing large-scale diffusion models poses serious privacy risks, especially when adapting to small, sensitive datasets. A common approach is to fine-tune the model using differentially private stochastic gradient descent (DP-SGD), but this suffers from severe utility degradation due to the high noise needed for privacy, particularly in the small data regime. We propose an alternative that leverages Textual Inversion (TI), which learns an embedding vector for an image or set of images, to enable adaptation under differential privacy (DP) constraints. Our approach, Differentially Private Aggregation via Textual Inversion (DPAgg-TI), adds calibrated noise to the aggregation of per-image embeddings to ensure formal DP guarantees while preserving high output fidelity. We show that DPAgg-TI outperforms DP-SGD finetuning in both utility and robustness under the same privacy budget,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
MethodsDiffusion
