Hiding Faces in Plain Sight: Disrupting AI Face Synthesis with Adversarial Perturbations
Yuezun Li, Xin Yang, Baoyuan Wu, Siwei Lyu

TL;DR
This paper introduces adversarial perturbations to disrupt AI face synthesis and detection, aiming to protect individuals from fake videos by reducing face detection accuracy across various attack settings.
Contribution
It proposes novel adversarial attack schemes against DNN-based face detectors in white-box, gray-box, and black-box scenarios, demonstrating their effectiveness.
Findings
Effective disruption of state-of-the-art face detectors
Successful attack across multiple datasets
Applicable in white-box, gray-box, and black-box settings
Abstract
Recent years have seen fast development in synthesizing realistic human faces using AI technologies. Such fake faces can be weaponized to cause negative personal and social impact. In this work, we develop technologies to defend individuals from becoming victims of recent AI synthesized fake videos by sabotaging would-be training data. This is achieved by disrupting deep neural network (DNN) based face detection method with specially designed imperceptible adversarial perturbations to reduce the quality of the detected faces. We describe attacking schemes under white-box, gray-box and black-box settings, each with decreasing information about the DNN based face detectors. We empirically show the effectiveness of our methods in disrupting state-of-the-art DNN based face detectors on several datasets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Face recognition and analysis · Generative Adversarial Networks and Image Synthesis
