Learning Representation and Synergy Invariances: A Povable Framework for Generalized Multimodal Face Anti-Spoofing

Xun Lin; Shuai Wang; Yi Yu; Zitong Yu; Jiale Zhou; Yizhong Liu; Xiaochun Cao; Alex Kot; Yefeng Zheng

arXiv:2511.14157·cs.CV·November 19, 2025

Learning Representation and Synergy Invariances: A Povable Framework for Generalized Multimodal Face Anti-Spoofing

Xun Lin, Shuai Wang, Yi Yu, Zitong Yu, Jiale Zhou, Yizhong Liu, Xiaochun Cao, Alex Kot, Yefeng Zheng

PDF

Open Access

TL;DR

This paper introduces RiSe, a framework that improves multimodal face anti-spoofing by learning invariant representations and disentangling modality synergy, leading to better cross-domain generalization.

Contribution

The paper proposes RiSe, a novel framework combining invariant risk minimization and synergy disentanglement to enhance cross-domain multimodal face anti-spoofing performance.

Findings

01

RiSe achieves state-of-the-art cross-domain results.

02

Theoretical analysis confirms reduced generalization error.

03

Experimental results demonstrate improved robustness to unseen attacks.

Abstract

Multimodal Face Anti-Spoofing (FAS) methods, which integrate multiple visual modalities, often suffer even more severe performance degradation than unimodal FAS when deployed in unseen domains. This is mainly due to two overlooked risks that affect cross-domain multimodal generalization. The first is the modal representation invariant risk, i.e., whether representations remain generalizable under domain shift. We theoretically show that the inherent class asymmetry in FAS (diverse spoofs vs. compact reals) enlarges the upper bound of generalization error, and this effect is further amplified in multimodal settings. The second is the modal synergy invariant risk, where models overfit to domain-specific inter-modal correlations. Such spurious synergy cannot generalize to unseen attacks in target domains, leading to performance drops. To solve these issues, we propose a provable framework,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiometric Identification and Security · Face recognition and analysis · Adversarial Robustness in Machine Learning