Spatio-temporal Co-attention Fusion Network for Video Splicing Localization
Man Lin, Gang Cao, Zijie Lou

TL;DR
This paper introduces SCFNet, a novel spatio-temporal co-attention fusion network that effectively detects video splicing forgeries by capturing manipulation traces across frames, outperforming existing methods.
Contribution
The paper presents a new three-stream encoder with co-attention modules and a lightweight decoder, along with a large-scale dataset for training and benchmarking video splicing localization.
Findings
Outperforms state-of-the-art methods in localization accuracy
Demonstrates strong generalization across datasets
Provides a new large-scale dataset for training and evaluation
Abstract
Digital video splicing has become easy and ubiquitous. Malicious users copy some regions of a video and paste them to another video for creating realistic forgeries. It is significant to blindly detect such forgery regions in videos. In this paper, a spatio-temporal co-attention fusion network (SCFNet) is proposed for video splicing localization. Specifically, a three-stream network is used as an encoder to capture manipulation traces across multiple frames. The deep interaction and fusion of spatio-temporal forensic features are achieved by the novel parallel and cross co-attention fusion modules. A lightweight multilayer perceptron (MLP) decoder is adopted to yield a pixel-level tampering localization map. A new large-scale video splicing dataset is created for training the SCFNet. Extensive tests on benchmark datasets show that the localization and generalization performances of our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning
