TL;DR
Aletheia introduces physics-conditioned features into deepfake detection, significantly improving robustness and accuracy across multiple datasets and attack scenarios.
Contribution
It presents PhyLAA-X, a novel physics-informed attention mechanism that enhances deepfake detection by integrating physical invariants directly into the model.
Findings
Achieves 97.2% accuracy on FaceForensics++
Outperforms baseline by 4.1-7.3% in cross-generator tests
Maintains 79.4% accuracy under adversarial attacks
Abstract
State-of-the-art deepfake detectors achieve near-perfect in-domain accuracy yet degrade under cross-generator shifts, heavy compression, and adversarial perturbations. The core limitation remains the decoupling of semantic artifact learning from physical invariants: optical-flow discontinuities, specular-reflection inconsistencies, and cardiac-modulated reflectance (rPPG) are treated either as post-hoc features or ignored. We introduce PhyLAA-X, a novel physics-conditioned extension of Localized Artifact Attention (LAA-X). PhyLAA-X injects three end-to-end differentiable physics-derived feature volumes - optical-flow curl, specular-reflectance skewness, and spatially-upsampled rPPG power spectra - directly into the LAA-X attention computation via cross-attention gating and a resonance consistency loss. This forces the network to learn manipulation boundaries where semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
