Do DeepFake Attribution Models Generalize?
Spiros Baxavanakis, Manos Schinas, Symeon Papadopoulos

TL;DR
This paper investigates the generalization capabilities of DeepFake attribution models, comparing binary and multi-class approaches across datasets, and explores contrastive methods to enhance cross-dataset performance.
Contribution
It provides a comprehensive evaluation of DeepFake attribution models, highlighting the impact of model size, data quality, and contrastive learning on generalization.
Findings
Binary models show better cross-dataset generalization.
Larger models and contrastive methods improve attribution accuracy.
Higher data quality enhances model performance.
Abstract
Recent advancements in DeepFake generation, along with the proliferation of open-source tools, have significantly lowered the barrier for creating synthetic media. This trend poses a serious threat to the integrity and authenticity of online information, undermining public trust in institutions and media. State-of-the-art research on DeepFake detection has primarily focused on binary detection models. A key limitation of these models is that they treat all manipulation techniques as equivalent, despite the fact that different methods introduce distinct artifacts and visual cues. Only a limited number of studies explore DeepFake attribution models, although such models are crucial in practical settings. By providing the specific manipulation method employed, these models could enhance both the perceived trustworthiness and explainability for end users. In this work, we leverage five…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Misinformation and Its Impacts · Generative Adversarial Networks and Image Synthesis
