TL;DR
This paper reveals a causal link between fairness and generalization in deepfake detection, proposing a new framework that improves both by controlling for confounders and neutralizing biases.
Contribution
It formally defines the causal relationship between fairness and generalization and introduces DAID, a novel framework that enhances both in deepfake detectors.
Findings
DAID outperforms state-of-the-art detectors in fairness and generalization.
Controlling confounders improves deepfake detection robustness.
Theoretical analysis supports the effectiveness of DAID.
Abstract
Deepfake detection models face two critical challenges: generalization to unseen manipulations and demographic fairness among population groups. However, existing approaches often demonstrate that these two objectives are inherently conflicting, revealing a trade-off between them. In this paper, we, for the first time, uncover and formally define a causal relationship between fairness and generalization. Building on the back-door adjustment, we show that controlling for confounders (data distribution and model capacity) enables improved generalization via fairness interventions. Motivated by this insight, we propose Demographic Attribute-insensitive Intervention Detection (DAID), a plug-and-play framework composed of: i) Demographic-aware data rebalancing, which employs inverse-propensity weighting and subgroup-wise feature normalization to neutralize distributional biases; and ii)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
