MARE: Multimodal Alignment and Reinforcement for Explainable Deepfake Detection via Vision-Language Models

Wenbo Xu; Wei Lu; Xiangyang Luo; Jiantao Zhou

arXiv:2601.20433·cs.CV·February 2, 2026

Open Access

TL;DR

MARE combines multimodal alignment with reinforcement learning from human feedback (RLHF) to improve the accuracy and explainability of deepfake detection with vision-language models, and adds a forgery disentanglement module to capture intrinsic forgery traces.

Contribution

The paper introduces MARE, a novel framework combining multimodal alignment, reinforcement learning, and forgery disentanglement for enhanced explainable deepfake detection.

Findings

01. Achieves state-of-the-art accuracy in deepfake detection.

02. Provides explainable reasoning content aligned with human preferences.

03. Effectively captures intrinsic forgery traces from facial semantics.

Abstract

Deepfake detection is a widely researched topic that is crucial for combating the spread of malicious content, with existing methods mainly modeling the problem as classification or spatial localization. The rapid advancements in generative models impose new demands on Deepfake detection. In this paper, we propose multimodal alignment and reinforcement for explainable Deepfake detection via vision-language models, termed MARE, which aims to enhance the accuracy and reliability of Vision-Language Models (VLMs) in Deepfake detection and reasoning. Specifically, MARE designs comprehensive reward functions, incorporating reinforcement learning from human feedback (RLHF), to incentivize the generation of text-spatially aligned reasoning content that adheres to human preferences. Besides, MARE introduces a forgery disentanglement module to capture intrinsic forgery traces from high-level…
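The abstract does not spell out the reward functions, so the following is only a minimal sketch of what a composite reward for text-spatially aligned output could look like. It assumes, purely for illustration, that a response carries a real/fake label and one bounding box, and that the reward is a weighted sum of label correctness and box overlap (IoU). The names `iou` and `composite_reward`, the box format, and the weights are hypothetical and are not taken from the paper.

```python
from typing import Optional, Tuple

# Hypothetical box format (an assumption, not from the paper): (x1, y1, x2, y2).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def composite_reward(
    pred_label: str,
    pred_box: Optional[Box],
    true_label: str,
    true_box: Optional[Box],
    w_label: float = 1.0,  # illustrative weight, not from the paper
    w_box: float = 1.0,    # illustrative weight, not from the paper
) -> float:
    """Weighted sum of a label-correctness term and a spatial-overlap term."""
    label_term = 1.0 if pred_label == true_label else 0.0
    if pred_box is None or true_box is None:
        box_term = 0.0  # no spatial credit when either box is missing
    else:
        box_term = iou(pred_box, true_box)
    return w_label * label_term + w_box * box_term


if __name__ == "__main__":
    # Correct label, partially overlapping box.
    print(composite_reward("fake", (0, 0, 2, 2), "fake", (1, 0, 3, 2)))
```

In an RLHF-style setup, a scalar of this kind would be computed per sampled response and fed to the policy update; how MARE actually defines and combines its reward terms is described in the full paper rather than in this summary.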

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications