Metamorphic Testing-based Adversarial Attack to Fool Deepfake Detectors
Nyee Thoang Lim, Meng Yi Kuan, Muxin Pu, Mei Kuan Lim, Chun Yong Chong

TL;DR
This paper applies metamorphic testing to evaluate the robustness of deepfake detectors, revealing that makeup application can significantly degrade their performance, highlighting vulnerabilities in current models.
Contribution
It introduces a metamorphic testing approach to identify factors affecting deepfake detector robustness, specifically demonstrating makeup as an adversarial attack.
Findings
Deepfake detectors' performance drops up to 30% with makeup perturbation.
Metamorphic testing effectively uncovers vulnerabilities in deepfake detection models.
Makeup application can serve as an adversarial attack to fool deepfake detectors.
Abstract
Deepfakes utilise Artificial Intelligence (AI) techniques to create synthetic media where the likeness of one person is replaced with another. There are growing concerns that deepfakes can be maliciously used to create misleading and harmful digital contents. As deepfakes become more common, there is a dire need for deepfake detection technology to help spot deepfake media. Present deepfake detection models are able to achieve outstanding accuracy (>90%). However, most of them are limited to within-dataset scenario, where the same dataset is used for training and testing. Most models do not generalise well enough in cross-dataset scenario, where models are tested on unseen datasets from another source. Furthermore, state-of-the-art deepfake detection models rely on neural network-based classification models that are known to be vulnerable to adversarial attacks. Motivated by the need…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Digital Media Forensic Detection
