C2P-CLIP: Injecting Category Common Prompt in CLIP to Enhance Generalization in Deepfake Detection
Chuangchuang Tan, Renshuai Tao, Huan Liu, Guanghua Gu, Baoyuan Wu, Yao, Zhao, Yunchao Wei

TL;DR
This paper enhances deepfake detection by understanding CLIP's recognition mechanisms and introducing C2P-CLIP, which injects category prompts into the model, resulting in improved accuracy and state-of-the-art performance.
Contribution
The study decodes CLIP's detection features and proposes C2P-CLIP, a novel method that injects category prompts to improve deepfake detection without extra testing parameters.
Findings
C2P-CLIP improves detection accuracy by 12.41% over original CLIP.
Decoding CLIP reveals it detects deepfakes by recognizing similar concepts.
Experiments on two datasets with 20 models validate state-of-the-art performance.
Abstract
This work focuses on AIGC detection to develop universal detectors capable of identifying various types of forgery images. Recent studies have found large pre-trained models, such as CLIP, are effective for generalizable deepfake detection along with linear classifiers. However, two critical issues remain unresolved: 1) understanding why CLIP features are effective on deepfake detection through a linear classifier; and 2) exploring the detection potential of CLIP. In this study, we delve into the underlying mechanisms of CLIP's detection capabilities by decoding its detection features into text and performing word frequency analysis. Our finding indicates that CLIP detects deepfakes by recognizing similar concepts (Fig. \ref{fig:fig1} a). Building on this insight, we introduce Category Common Prompt CLIP, called C2P-CLIP, which integrates the category common prompt into the text encoder…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Anomaly Detection Techniques and Applications
MethodsContrastive Language-Image Pre-training
