Generalizable Face Forgery Detection via Separable Prompt Learning

Enrui Yang; Yuezun Li

arXiv:2604.17307·cs.CV·April 21, 2026

Generalizable Face Forgery Detection via Separable Prompt Learning

Enrui Yang, Yuezun Li

PDF

1 Repo

TL;DR

This paper introduces Separable Prompt Learning (SePL), a novel method leveraging CLIP's text modality to improve face forgery detection, achieving strong generalization across datasets and methods.

Contribution

It proposes a new SePL strategy that disentangles forgery-related and irrelevant information using cross-modality alignment, enhancing detection performance.

Findings

01

Achieves superior cross-dataset detection accuracy.

02

Demonstrates strong generalization to unseen forgery methods.

03

Outperforms existing methods in various evaluation settings.

Abstract

Detecting face forgeries using CLIP has recently emerged as a promising and increasingly popular research direction. Owing to its rich visual knowledge acquired through large-scale pretraining, most existing methods typically rely on the visual encoder of CLIP, while paying limited attention to the text modality. Given the instructive nature of the text modality, we posit that it can be leveraged to instruct Deepfake detection with meticulous design. Accordingly, we shift the focus from the visual modality to the text modality and propose a new Separable Prompt Learning strategy (SePL) that enables CLIP to serve as an effective face forgery detector. The core idea of SePL is to disentangle forgery-specific and forgery-irrelevant information in images via two types of prompt learning, with the former enhancing detection. To achieve this disentangle, we describe a cross-modality alignment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

OUC-YER/SePL-DeepfakeDetection
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.