Exploring Task-Solving Paradigm for Generalized Cross-Domain Face Anti-Spoofing via Reinforcement Fine-Tuning

Fangling Jiang; Qi Li; Weining Wang; Gang Wang; Bing Liu; Zhenan Sun

arXiv:2506.21895·cs.CV·June 30, 2025

Exploring Task-Solving Paradigm for Generalized Cross-Domain Face Anti-Spoofing via Reinforcement Fine-Tuning

Fangling Jiang, Qi Li, Weining Wang, Gang Wang, Bing Liu, Zhenan Sun

PDF

Open Access

TL;DR

This paper introduces a reinforcement fine-tuning approach for face anti-spoofing that enhances cross-domain generalization and interpretability by enabling models to learn reasoning policies rather than memorizing patterns.

Contribution

The proposed method leverages reinforcement learning with novel rewards and optimization strategies to improve generalization and interpretability in face anti-spoofing across unseen domains.

Findings

01

Achieves state-of-the-art cross-domain generalization performance

02

Effectively detects diverse unknown attack types in unseen domains

03

Provides interpretable reasoning without extensive textual annotations

Abstract

Recently the emergence of novel presentation attacks has drawn increasing attention to face anti-spoofing. However, existing methods tend to memorize data patterns from the training set, resulting in poor generalization to unknown attack types across different scenarios and limited interpretability. To address these challenges, this paper presents a reinforcement fine-tuning-based face anti-spoofing method that stimulates the capabilities of multimodal large language models to think and learn how to solve the anti-spoofing task itself, rather than relying on the memorization of authenticity patterns. We design verifiable class consistent reward and reasoning consistent reward, and employ a GRPO-based optimization strategy to guide the model in exploring reasoning policies from multiple perspectives to maximize expected rewards. As a result, through iterative trial-and-error learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiometric Identification and Security · Face recognition and analysis · Adversarial Robustness in Machine Learning