SEED: A Large-Scale Benchmark for Provenance Tracing in Sequential Deepfake Facial Edits

Mengieong Hoi; Zhedong Zheng; Ping Liu; Wei Liu

arXiv:2604.10522·cs.CR·April 14, 2026

SEED: A Large-Scale Benchmark for Provenance Tracing in Sequential Deepfake Facial Edits

Mengieong Hoi, Zhedong Zheng, Ping Liu, Wei Liu

PDF

TL;DR

SEED is a comprehensive benchmark dataset designed to evaluate and improve methods for tracing the sequence of edits in facial images manipulated by diffusion-based deepfake techniques.

Contribution

The paper introduces SEED, a large-scale dataset with detailed annotations for sequential provenance analysis, and proposes FAITH, a frequency-aware Transformer model for improved editing event detection.

Findings

01

Spatial-only methods struggle with subtle diffusion artifacts.

02

High-frequency signals like wavelet components aid in identifying edits.

03

FAITH outperforms baseline approaches in sequential provenance tracing.

Abstract

Deepfake content on social networks is increasingly produced through multiple \emph{sequential} edits to biometric data such as facial imagery. Consequently, the final appearance of an image often reflects a latent chain of operations rather than a single manipulation. Recovering these editing histories is essential for visual provenance analysis, misinformation auditing, and forensic or platform moderation workflows that must trace the origin and evolution of AI-generated media. However, existing datasets predominantly focus on single-step editing and overlook the cumulative artifacts introduced by realistic multi-step pipelines. To address this gap, we introduce Sequential Editing in Diffusion (\textbf{SEED}), a large-scale benchmark for sequential provenance tracing in facial imagery. SEED contains over 90K images constructed via one to four sequential attribute edits using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.