On Attribution of Deepfakes

Baiwu Zhang; Jin Peng Zhou; Ilia Shumailov; Nicolas Papernot

arXiv:2008.09194·cs.LG·March 5, 2021·6 cites

On Attribution of Deepfakes

Baiwu Zhang, Jin Peng Zhou, Ilia Shumailov, Nicolas Papernot

PDF

Open Access

TL;DR

This paper presents a probabilistic method to attribute deepfakes to their source models by analyzing entropy sources, achieving high accuracy and robustness, and discusses ethical and legislative implications for transparent use of generative models.

Contribution

The authors introduce a novel entropy-based attribution technique for deepfakes, enabling high accuracy and robustness against adversarial attacks, and propose a framework for plausible deniability for model developers.

Findings

01

97.62% attribution accuracy on face synthesis deepfakes

02

Less sensitive to perturbations and adversarial examples

03

Framework supports plausible deniability for developers

Abstract

Progress in generative modelling, especially generative adversarial networks, have made it possible to efficiently synthesize and alter media at scale. Malicious individuals now rely on these machine-generated media, or deepfakes, to manipulate social discourse. In order to ensure media authenticity, existing research is focused on deepfake detection. Yet, the adversarial nature of frameworks used for generative modeling suggests that progress towards detecting deepfakes will enable more realistic deepfake generation. Therefore, it comes at no surprise that developers of generative models are under the scrutiny of stakeholders dealing with misinformation campaigns. At the same time, generative models have a lot of positive applications. As such, there is a clear need to develop tools that ensure the transparent use of generative modeling, while minimizing the harm caused by malicious…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Adversarial Robustness in Machine Learning