FaceShield: Explainable Face Anti-Spoofing with Multimodal Large Language Models

Hongyang Wang; Yichen Shi; Zhuofu Tao; Yuhao Gao; Liepiao Zhang; Xun Lin; Jun Feng; Xiaochen Yuan; Zitong Yu; Xiaochun Cao

arXiv:2505.09415·cs.CV·November 18, 2025

FaceShield: Explainable Face Anti-Spoofing with Multimodal Large Language Models

Hongyang Wang, Yichen Shi, Zhuofu Tao, Yuhao Gao, Liepiao Zhang, Xun Lin, Jun Feng, Xiaochen Yuan, Zitong Yu, Xiaochun Cao

PDF

Open Access 1 Video

TL;DR

FaceShield introduces a multimodal large language model tailored for face anti-spoofing, capable of interpretability, reasoning, and attack localization, significantly advancing the state-of-the-art in face presentation attack detection.

Contribution

This work presents FaceShield, the first comprehensive MLLM for FAS with specialized datasets, novel perception and masking strategies, and extensive benchmarking results.

Findings

01

Outperforms previous models on four FAS tasks

02

Demonstrates strong generalization ability

03

Provides interpretable reasoning and attack localization

Abstract

Face anti-spoofing (FAS) is crucial for protecting facial recognition systems from presentation attacks. Previous methods approached this task as a classification problem, lacking interpretability and reasoning behind the predicted results. Recently, multimodal large language models (MLLMs) have shown strong capabilities in perception, reasoning, and decision-making in visual tasks. However, there is currently no universal and comprehensive MLLM and dataset specifically designed for FAS task. To address this gap, we propose FaceShield, a MLLM for FAS, along with the corresponding pre-training and supervised fine-tuning (SFT) datasets, FaceShield-pre10K and FaceShield-sft45K. FaceShield is capable of determining the authenticity of faces, identifying types of spoofing attacks, providing reasoning for its judgments, and detecting attack areas. Specifically, we employ spoof-aware vision…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

FaceShield: Explainable Face Anti-Spoofing with Multimodal Large Language Models· underline

Taxonomy

TopicsFace recognition and analysis · Biometric Identification and Security