M3FAS: An Accurate and Robust MultiModal Mobile Face Anti-Spoofing System
Chenqi Kong, Kexin Zheng, Yibing Liu, Shiqi Wang, Anderson Rocha,, Haoliang Li

TL;DR
M3FAS is a multi-modal mobile face anti-spoofing system that combines visual and auditory data from common sensors to improve robustness and accuracy in real-world scenarios.
Contribution
The paper introduces a novel multi-modal anti-spoofing system using camera, speaker, and microphone data with a new neural network architecture and training strategy.
Findings
Demonstrates high accuracy in diverse environments
Shows robustness against missing or poor-quality modalities
Outperforms existing single-modal and multi-modal methods
Abstract
Face presentation attacks (FPA), also known as face spoofing, have brought increasing concerns to the public through various malicious applications, such as financial fraud and privacy leakage. Therefore, safeguarding face recognition systems against FPA is of utmost importance. Although existing learning-based face anti-spoofing (FAS) models can achieve outstanding detection performance, they lack generalization capability and suffer significant performance drops in unforeseen environments. Many methodologies seek to use auxiliary modality data (e.g., depth and infrared maps) during the presentation attack detection (PAD) to address this limitation. However, these methods can be limited since (1) they require specific sensors such as depth and infrared cameras for data capture, which are rarely available on commodity mobile devices, and (2) they cannot work properly in practical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiometric Identification and Security · Antenna Design and Analysis · Advanced Authentication Protocols Security
