Are Foundation Models All You Need for Zero-shot Face Presentation Attack Detection?
Lazaro Janier Gonzalez-Sole, Juan E. Tapia, Christoph Busch

TL;DR
This paper evaluates the potential of foundation models for zero-shot face presentation attack detection, demonstrating their effectiveness in challenging scenarios without extensive training.
Contribution
It assesses foundation models' generalisability for zero-shot PAD and proposes a simple framework that outperforms state-of-the-art methods on challenging datasets.
Findings
Foundation models achieve high performance in difficult scenarios.
Proposed framework requires minimal effort and training.
Outperforms state-of-the-art on SiW-Mv2 database.
Abstract
Although face recognition systems have undergone an impressive evolution in the last decade, these technologies are vulnerable to attack presentations (AP). These attacks are mostly easy to create and, by executing them against the system's capture device, the malicious actor can impersonate an authorised subject and thus gain access to the latter's information (e.g., financial transactions). To protect facial recognition schemes against presentation attacks, state-of-the-art deep learning presentation attack detection (PAD) approaches require a large amount of data to produce reliable detection performances and even then, they decrease their performance for unknown presentation attack instruments (PAI) or database (information not seen during training), i.e. they lack generalisability. To mitigate the above problems, this paper focuses on zero-shot PAD. To do so, we first assess the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Anomaly Detection Techniques and Applications · Digital Media Forensic Detection
