ActivityForensics: A Comprehensive Benchmark for Localizing Manipulated Activity in Videos
Peijun Bao, Anwei Luo, Gang Pan, Alex C. Kot, Xudong Jiang

TL;DR
ActivityForensics introduces a large-scale benchmark dataset and evaluation protocols for localizing manipulated activity segments in videos, addressing a gap in existing forgery detection research.
Contribution
It provides the first comprehensive benchmark for activity-level video forgery localization, including a new dataset, evaluation protocols, and baseline methods.
Findings
The dataset contains over 6,000 forged video segments with high visual consistency.
The proposed TADiff baseline effectively exposes artifact cues in manipulated videos.
Benchmarking various state-of-the-art methods reveals strengths and limitations in activity forgery localization.
Abstract
Temporal forgery localization aims to temporally identify manipulated segments in videos. Most existing benchmarks focus on appearance-level forgeries, such as face swapping and object removal. However, recent advances in video generation have driven the emergence of activity-level forgeries that modify human actions to distort event semantics, resulting in highly deceptive forgeries that critically undermine media authenticity and public trust. To overcome this issue, we introduce ActivityForensics, the first large-scale benchmark for localizing manipulated activity in videos. It contains over 6K forged video segments that are seamlessly blended into the video context, rendering high visual consistency that makes them almost indistinguishable from authentic content to the human eye. We further propose Temporal Artifact Diffuser (TADiff), a simple yet effective baseline that exposes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
