On Attribution of Deepfakes
Baiwu Zhang, Jin Peng Zhou, Ilia Shumailov, Nicolas Papernot

TL;DR
This paper presents a probabilistic method to attribute deepfakes to their source models by analyzing entropy sources, achieving high accuracy and robustness, and discusses ethical and legislative implications for transparent use of generative models.
Contribution
The authors introduce a novel entropy-based attribution technique for deepfakes, enabling high accuracy and robustness against adversarial attacks, and propose a framework for plausible deniability for model developers.
Findings
97.62% attribution accuracy on face synthesis deepfakes
Less sensitive to perturbations and adversarial examples
Framework supports plausible deniability for developers
Abstract
Progress in generative modelling, especially generative adversarial networks, have made it possible to efficiently synthesize and alter media at scale. Malicious individuals now rely on these machine-generated media, or deepfakes, to manipulate social discourse. In order to ensure media authenticity, existing research is focused on deepfake detection. Yet, the adversarial nature of frameworks used for generative modeling suggests that progress towards detecting deepfakes will enable more realistic deepfake generation. Therefore, it comes at no surprise that developers of generative models are under the scrutiny of stakeholders dealing with misinformation campaigns. At the same time, generative models have a lot of positive applications. As such, there is a clear need to develop tools that ensure the transparent use of generative modeling, while minimizing the harm caused by malicious…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Adversarial Robustness in Machine Learning
