SEED: A Large-Scale Benchmark for Provenance Tracing in Sequential Deepfake Facial Edits
Mengieong Hoi, Zhedong Zheng, Ping Liu, Wei Liu

TL;DR
SEED is a comprehensive benchmark dataset designed to evaluate and improve methods for tracing the sequence of edits in facial images manipulated by diffusion-based deepfake techniques.
Contribution
The paper introduces SEED, a large-scale dataset with detailed annotations for sequential provenance analysis, and proposes FAITH, a frequency-aware Transformer model for improved editing event detection.
Findings
Spatial-only methods struggle with subtle diffusion artifacts.
High-frequency signals like wavelet components aid in identifying edits.
FAITH outperforms baseline approaches in sequential provenance tracing.
Abstract
Deepfake content on social networks is increasingly produced through multiple \emph{sequential} edits to biometric data such as facial imagery. Consequently, the final appearance of an image often reflects a latent chain of operations rather than a single manipulation. Recovering these editing histories is essential for visual provenance analysis, misinformation auditing, and forensic or platform moderation workflows that must trace the origin and evolution of AI-generated media. However, existing datasets predominantly focus on single-step editing and overlook the cumulative artifacts introduced by realistic multi-step pipelines. To address this gap, we introduce Sequential Editing in Diffusion (\textbf{SEED}), a large-scale benchmark for sequential provenance tracing in facial imagery. SEED contains over 90K images constructed via one to four sequential attribute edits using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
