S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing with Statistical Tokens
Rizhao Cai, Zitong Yu, Chenqi Kong, Haoliang Li, Changsheng Chen,, Yongjian Hu, Alex Kot

TL;DR
This paper introduces S-Adapter, a novel statistical adapter for Vision Transformers that enhances cross-domain face anti-spoofing by capturing local statistical features and reducing domain style variance, leading to improved generalization.
Contribution
The paper proposes a new Statistical Adapter (S-Adapter) and Token Style Regularization (TSR) to improve cross-domain generalization of Vision Transformers in face anti-spoofing tasks.
Findings
S-Adapter outperforms state-of-the-art methods in zero-shot and few-shot cross-domain tests.
TSR effectively reduces domain style variance, enhancing model robustness.
The approach demonstrates significant improvements on benchmark datasets.
Abstract
Face Anti-Spoofing (FAS) aims to detect malicious attempts to invade a face recognition system by presenting spoofed faces. State-of-the-art FAS techniques predominantly rely on deep learning models but their cross-domain generalization capabilities are often hindered by the domain shift problem, which arises due to different distributions between training and testing data. In this study, we develop a generalized FAS method under the Efficient Parameter Transfer Learning (EPTL) paradigm, where we adapt the pre-trained Vision Transformer models for the FAS task. During training, the adapter modules are inserted into the pre-trained ViT model, and the adapters are updated while other pre-trained parameters remain fixed. We find the limitations of previous vanilla adapters in that they are based on linear layers, which lack a spoofing-aware inductive bias and thus restrict the cross-domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiometric Identification and Security · Face recognition and analysis · Forensic and Genetic Research
MethodsMulti-Head Attention · Attention Is All You Need · Adam · Byte Pair Encoding · Softmax · Dropout · Label Smoothing · Absolute Position Encodings · Layer Normalization · Position-Wise Feed-Forward Layer
