Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities

Lorenzo Baraldi; Federico Cocchi; Marcella Cornia; Lorenzo Baraldi; Alessandro Nicolosi; Rita Cucchiara

arXiv:2407.20337·cs.CV·July 31, 2024

TL;DR

This paper introduces CoDE, a contrastive learning-based embedding space tailored for deepfake detection, leveraging global-local similarities, and demonstrates its superior accuracy and generalization on a large diffusion-generated image dataset.

Contribution

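The paper proposes CoDE (Contrastive Deepfake Embeddings), an embedding space trained specifically for deepfake detection with contrastive learning and global-local similarities. It targets a limitation of CLIP-based detectors: the CLIP embedding space is optimized for global image-to-text alignment, not for deepfake detection, and it neglects local image features.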

Findings

01. Achieves state-of-the-art accuracy on a large diffusion-generated image dataset.

02. Exhibits excellent generalization to unseen image generators.

03. Provides publicly available dataset, code, and models.

Abstract

Discerning between authentic content and that generated by advanced AI methods has become increasingly challenging. While previous research primarily addresses the detection of fake faces, the identification of generated natural images has only recently surfaced. This prompted the recent exploration of solutions that employ foundation vision-and-language models, like CLIP. However, the CLIP embedding space is optimized for global image-to-text alignment and is not inherently designed for deepfake detection, neglecting the potential benefits of tailored training and local image features. In this study, we propose CoDE (Contrastive Deepfake Embeddings), a novel embedding space specifically designed for deepfake detection. CoDE is trained via contrastive learning by additionally enforcing global-local similarities. To sustain the training of our model, we generate a comprehensive dataset…
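The idea of combining contrastive learning with global-local similarities can be illustrated with a minimal sketch. This is not the authors' loss or code: it is a generic supervised InfoNCE-style objective, written under the assumption that each image yields one "global" embedding and several "local" crop embeddings, each tagged real (0) or fake (1). The function names, the temperature value, and the toy vectors are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def global_local_contrastive_loss(global_embs, global_labels,
                                  local_embs, local_labels,
                                  temperature=0.1):
    """Hypothetical global-local contrastive loss (illustrative only).

    Each global embedding is an anchor. Local embeddings sharing its
    label (real/fake) are positives; all other locals are negatives.
    """
    total, used = 0.0, 0
    for anchor, anchor_label in zip(global_embs, global_labels):
        logits = [cosine(anchor, loc) / temperature for loc in local_embs]
        # Log-sum-exp over all local candidates, shifted for stability.
        peak = max(logits)
        log_denom = peak + math.log(sum(math.exp(x - peak) for x in logits))
        positives = [x for x, lab in zip(logits, local_labels)
                     if lab == anchor_label]
        if not positives:
            continue  # anchor has no same-label local crop; skip it
        total += -sum(p - log_denom for p in positives) / len(positives)
        used += 1
    return total / used if used else 0.0


# Toy data: 2-D embeddings, label 0 = real, label 1 = fake (assumed).
global_embs = [[1.0, 0.0], [0.0, 1.0]]
global_labels = [0, 1]
aligned_locals = [[0.9, 0.1], [0.1, 0.9]]     # locals near same-label global
misaligned_locals = [[0.1, 0.9], [0.9, 0.1]]  # locals near the other class
local_labels = [0, 1]

loss_aligned = global_local_contrastive_loss(
    global_embs, global_labels, aligned_locals, local_labels)
loss_misaligned = global_local_contrastive_loss(
    global_embs, global_labels, misaligned_locals, local_labels)
print(loss_aligned, loss_misaligned)
```

In this sketch the loss is small when local crops lie close to the global embedding of the same class and large when they lie close to the other class, which is the behavior a global-local contrastive objective is meant to encourage.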

Code & Models

Repositories

aimagelab/code
PyTorch · Official

Datasets

elsaEU/ELSA_D3
dataset · 2.3k downloads

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis

Methods: Diffusion · Contrastive Learning · Contrastive Language-Image Pre-training