La-SoftMoE CLIP for Unified Physical-Digital Face Attack Detection
Hang Zou, Chenxi Du, Hui Zhang, Yuan Zhang, Ajian Liu, Jun Wan, Zhen, Lei

TL;DR
This paper introduces La-SoftMoE CLIP, a novel model that effectively detects both physical and digital face attacks by adaptively handling diverse attack data within a unified framework, achieving state-of-the-art results.
Contribution
The paper presents a new La-SoftMoE CLIP model that uses a flexible self-adapting weighting mechanism within a Mixture of Experts framework for unified face attack detection.
Findings
Achieves state-of-the-art performance on attack detection tasks.
Effectively handles diverse physical and digital attack data.
Demonstrates superior adaptability and accuracy over existing methods.
Abstract
Facial recognition systems are susceptible to both physical and digital attacks, posing significant security risks. Traditional approaches often treat these two attack types separately due to their distinct characteristics. Thus, when being combined attacked, almost all methods could not deal. Some studies attempt to combine the sparse data from both types of attacks into a single dataset and try to find a common feature space, which is often impractical due to the space is difficult to be found or even non-existent. To overcome these challenges, we propose a novel approach that uses the sparse model to handle sparse data, utilizing different parameter groups to process distinct regions of the sparse feature space. Specifically, we employ the Mixture of Experts (MoE) framework in our model, expert parameters are matched to tokens with varying weights during training and adaptively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques
MethodsMixture of Experts · Contrastive Language-Image Pre-training
