MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM

Tao Chen; Jingyi Zhang; Decheng Liu; Chunlei Peng

arXiv:2507.12232·cs.CV·July 17, 2025

MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM

Tao Chen, Jingyi Zhang, Decheng Liu, Chunlei Peng

PDF

Open Access

TL;DR

This paper introduces MGFFD-VLM, a novel framework that leverages multi-granularity prompt learning and auxiliary losses to improve face forgery detection and interpretability using visual large language models.

Contribution

It proposes a new forgery detection framework with multi-granularity prompts, attribute-driven training, and auxiliary losses, enhancing accuracy and interpretability over existing methods.

Findings

01

Outperforms existing methods in forgery classification accuracy.

02

Enhances interpretability through prompt-based explanations.

03

Achieves superior results on the DD-VQA+ dataset.

Abstract

Recent studies have utilized visual large language models (VLMs) to answer not only "Is this face a forgery?" but also "Why is the face a forgery?" These studies introduced forgery-related attributes, such as forgery location and type, to construct deepfake VQA datasets and train VLMs, achieving high accuracy while providing human-understandable explanatory text descriptions. However, these methods still have limitations. For example, they do not fully leverage face quality-related attributes, which are often abnormal in forged faces, and they lack effective training strategies for forgery-aware VLMs. In this paper, we extend the VQA dataset to create DD-VQA+, which features a richer set of attributes and a more diverse range of samples. Furthermore, we introduce a novel forgery detection framework, MGFFD-VLM, which integrates an Attribute-Driven Hybrid LoRA Strategy to enhance the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · COVID-19 diagnosis using AI · Face recognition and analysis

MethodsSparse Evolutionary Training