Wavelet-Driven Generalizable Framework for Deepfake Face Forgery   Detection

Lalith Bharadwaj Baru; Rohit Boddeda; Shilhora Akshay Patel; Sai Mohan; Gajapaka

arXiv:2409.18301·cs.CV·January 8, 2025

Wavelet-Driven Generalizable Framework for Deepfake Face Forgery Detection

Lalith Bharadwaj Baru, Rohit Boddeda, Shilhora Akshay Patel, Sai Mohan, Gajapaka

PDF

Open Access 1 Repo

TL;DR

Wavelet-CLIP is a novel deepfake detection framework that combines wavelet transforms with CLIP-based features, significantly improving generalization and robustness against unseen deepfakes and sophisticated manipulations.

Contribution

This paper introduces Wavelet-CLIP, integrating wavelet analysis with CLIP features to enhance deepfake detection, especially for unseen and complex forgeries.

Findings

01

Achieves an average AUC of 0.749 in cross-dataset tests

02

Reaches 0.893 AUC in detecting unseen deepfakes

03

Outperforms existing state-of-the-art methods in robustness

Abstract

The evolution of digital image manipulation, particularly with the advancement of deep generative models, significantly challenges existing deepfake detection methods, especially when the origin of the deepfake is obscure. To tackle the increasing complexity of these forgeries, we propose \textbf{Wavelet-CLIP}, a deepfake detection framework that integrates wavelet transforms with features derived from the ViT-L/14 architecture, pre-trained in the CLIP fashion. Wavelet-CLIP utilizes Wavelet Transforms to deeply analyze both spatial and frequency features from images, thus enhancing the model's capability to detect sophisticated deepfakes. To verify the effectiveness of our approach, we conducted extensive evaluations against existing state-of-the-art methods for cross-dataset generalization and detection of unseen images generated by standard diffusion models. Our method showcases…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lalithbharadwajbaru/wavelet-clip
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Anomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis

MethodsContrastive Language-Image Pre-training · Diffusion