Performance Decay in Deepfake Detection: The Limitations of Training on Outdated Data

Jack Richings; Margaux Leblanc; Ian Groves; Victoria Nockles

arXiv:2511.07009·cs.CV·November 11, 2025

Performance Decay in Deepfake Detection: The Limitations of Training on Outdated Data

Jack Richings, Margaux Leblanc, Ian Groves, Victoria Nockles

PDF

Open Access

TL;DR

Deepfake detection models perform exceptionally well on current data but rapidly decline in accuracy as deepfake technology advances, highlighting the need for ongoing data updates and improved frame-level analysis.

Contribution

This paper demonstrates the rapid performance decay of deepfake detectors over time and emphasizes the importance of continuous data curation and frame-level features for robustness.

Findings

01

Models trained on recent data achieve over 99.8% AUROC.

02

Performance drops over 30% when tested on deepfakes from six months later.

03

Static, frame-level artifacts are key to detection, not temporal inconsistencies.

Abstract

The continually advancing quality of deepfake technology exacerbates the threats of disinformation, fraud, and harassment by making maliciously-generated synthetic content increasingly difficult to distinguish from reality. We introduce a simple yet effective two-stage detection method that achieves an AUROC of over 99.8% on contemporary deepfakes. However, this high performance is short-lived. We show that models trained on this data suffer a recall drop of over 30% when evaluated on deepfakes created with generation techniques from just six months later, demonstrating significant decay as threats evolve. Our analysis reveals two key insights for robust detection. Firstly, continued performance requires the ongoing curation of large, diverse datasets. Second, predictive power comes primarily from static, frame-level artifacts, not temporal inconsistencies. The future of effective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Spam and Phishing Detection