Embedding Space Selection for Detecting Memorization and Fingerprinting in Generative Models
Jack He, Jianxing Zhao, Andrew Bai, Cho-Jui Hsieh

TL;DR
This paper investigates how embedding layer choices in Vision Transformers affect memorization detection in generative models, introducing a fingerprinting method that improves identification accuracy of models involved in deepfake generation.
Contribution
It reveals the relationship between layer depth and memorization sensitivity in ViTs and proposes a novel fingerprinting technique based on layer-wise memorization score distributions.
Findings
Memorization scores decrease in deeper layers of ViTs.
Early layers are more sensitive to low-level memorization, later layers to high-level.
The proposed fingerprinting method improves identification accuracy by 30%.
Abstract
In the rapidly evolving landscape of artificial intelligence, generative models such as Generative Adversarial Networks (GANs) and Diffusion Models have become cornerstone technologies, driving innovation in diverse fields from art creation to healthcare. Despite their potential, these models face the significant challenge of data memorization, which poses risks to privacy and the integrity of generated content. Among various metrics of memorization detection, our study delves into the memorization scores calculated from encoder layer embeddings, which involves measuring distances between samples in the embedding spaces. Particularly, we find that the memorization scores calculated from layer embeddings of Vision Transformers (ViTs) show an notable trend - the latter (deeper) the layer, the less the memorization measured. It has been found that the memorization scores from the early…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling
MethodsDiffusion
