OGAN: Disrupting Deepfakes with an Adversarial Attack that Survives Training
Eran Segalis, Eran Galili

TL;DR
This paper introduces OGAN, a novel training-resistant adversarial attack that disrupts face-swapping autoencoders by generating spatial-temporal distortions, challenging existing deepfake mitigation methods.
Contribution
The paper proposes a new attack, OGAN, optimized to be training-resistant, and demonstrates its effectiveness and transferability against face-swapping autoencoders.
Findings
OGAN outperforms previous distorting attacks in disrupting autoencoders.
OGAN's adversarial effects transfer across different models and faces.
Training-resistant attacks like OGAN pose new challenges for deepfake detection.
Abstract
Recent advances in autoencoders and generative models have given rise to effective video forgery methods, used for generating so-called "deepfakes". Mitigation research is mostly focused on post-factum deepfake detection and not on prevention. We complement these efforts by introducing a novel class of adversarial attacks---training-resistant attacks---which can disrupt face-swapping autoencoders whether or not its adversarial images have been included in the training set of said autoencoders. We propose the Oscillating GAN (OGAN) attack, a novel attack optimized to be training-resistant, which introduces spatial-temporal distortions to the output of face-swapping autoencoders. To implement OGAN, we construct a bilevel optimization problem, where we train a generator and a face-swapping model instance against each other. Specifically, we pair each input image with a target distortion,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection
MethodsSolana Customer Service Number +1-833-534-1729
