Embedding Space Selection for Detecting Memorization and Fingerprinting   in Generative Models

Jack He; Jianxing Zhao; Andrew Bai; Cho-Jui Hsieh

arXiv:2407.21159·cs.LG·August 1, 2024

Embedding Space Selection for Detecting Memorization and Fingerprinting in Generative Models

Jack He, Jianxing Zhao, Andrew Bai, Cho-Jui Hsieh

PDF

Open Access

TL;DR

This paper investigates how embedding layer choices in Vision Transformers affect memorization detection in generative models, introducing a fingerprinting method that improves identification accuracy of models involved in deepfake generation.

Contribution

It reveals the relationship between layer depth and memorization sensitivity in ViTs and proposes a novel fingerprinting technique based on layer-wise memorization score distributions.

Findings

01

Memorization scores decrease in deeper layers of ViTs.

02

Early layers are more sensitive to low-level memorization, later layers to high-level.

03

The proposed fingerprinting method improves identification accuracy by 30%.

Abstract

In the rapidly evolving landscape of artificial intelligence, generative models such as Generative Adversarial Networks (GANs) and Diffusion Models have become cornerstone technologies, driving innovation in diverse fields from art creation to healthcare. Despite their potential, these models face the significant challenge of data memorization, which poses risks to privacy and the integrity of generated content. Among various metrics of memorization detection, our study delves into the memorization scores calculated from encoder layer embeddings, which involves measuring distances between samples in the embedding spaces. Particularly, we find that the memorization scores calculated from layer embeddings of Vision Transformers (ViTs) show an notable trend - the latter (deeper) the layer, the less the memorization measured. It has been found that the memorization scores from the early…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling

MethodsDiffusion