Exploring Task-Solving Paradigm for Generalized Cross-Domain Face Anti-Spoofing via Reinforcement Fine-Tuning
Fangling Jiang, Qi Li, Weining Wang, Gang Wang, Bing Liu, Zhenan Sun

TL;DR
This paper introduces a reinforcement fine-tuning approach for face anti-spoofing that enhances cross-domain generalization and interpretability by enabling models to learn reasoning policies rather than memorizing patterns.
Contribution
The proposed method leverages reinforcement learning with novel rewards and optimization strategies to improve generalization and interpretability in face anti-spoofing across unseen domains.
Findings
Achieves state-of-the-art cross-domain generalization performance
Effectively detects diverse unknown attack types in unseen domains
Provides interpretable reasoning without extensive textual annotations
Abstract
Recently the emergence of novel presentation attacks has drawn increasing attention to face anti-spoofing. However, existing methods tend to memorize data patterns from the training set, resulting in poor generalization to unknown attack types across different scenarios and limited interpretability. To address these challenges, this paper presents a reinforcement fine-tuning-based face anti-spoofing method that stimulates the capabilities of multimodal large language models to think and learn how to solve the anti-spoofing task itself, rather than relying on the memorization of authenticity patterns. We design verifiable class consistent reward and reasoning consistent reward, and employ a GRPO-based optimization strategy to guide the model in exploring reasoning policies from multiple perspectives to maximize expected rewards. As a result, through iterative trial-and-error learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiometric Identification and Security · Face recognition and analysis · Adversarial Robustness in Machine Learning
