S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing with   Statistical Tokens

Rizhao Cai; Zitong Yu; Chenqi Kong; Haoliang Li; Changsheng Chen,; Yongjian Hu; Alex Kot

arXiv:2309.04038·cs.CV·June 21, 2024·1 cites

S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing with Statistical Tokens

Rizhao Cai, Zitong Yu, Chenqi Kong, Haoliang Li, Changsheng Chen,, Yongjian Hu, Alex Kot

PDF

Open Access 3 Repos

TL;DR

This paper introduces S-Adapter, a novel statistical adapter for Vision Transformers that enhances cross-domain face anti-spoofing by capturing local statistical features and reducing domain style variance, leading to improved generalization.

Contribution

The paper proposes a new Statistical Adapter (S-Adapter) and Token Style Regularization (TSR) to improve cross-domain generalization of Vision Transformers in face anti-spoofing tasks.

Findings

01

S-Adapter outperforms state-of-the-art methods in zero-shot and few-shot cross-domain tests.

02

TSR effectively reduces domain style variance, enhancing model robustness.

03

The approach demonstrates significant improvements on benchmark datasets.

Abstract

Face Anti-Spoofing (FAS) aims to detect malicious attempts to invade a face recognition system by presenting spoofed faces. State-of-the-art FAS techniques predominantly rely on deep learning models but their cross-domain generalization capabilities are often hindered by the domain shift problem, which arises due to different distributions between training and testing data. In this study, we develop a generalized FAS method under the Efficient Parameter Transfer Learning (EPTL) paradigm, where we adapt the pre-trained Vision Transformer models for the FAS task. During training, the adapter modules are inserted into the pre-trained ViT model, and the adapters are updated while other pre-trained parameters remain fixed. We find the limitations of previous vanilla adapters in that they are based on linear layers, which lack a spoofing-aware inductive bias and thus restrict the cross-domain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiometric Identification and Security · Face recognition and analysis · Forensic and Genetic Research

MethodsMulti-Head Attention · Attention Is All You Need · Adam · Byte Pair Encoding · Softmax · Dropout · Label Smoothing · Absolute Position Encodings · Layer Normalization · Position-Wise Feed-Forward Layer