Spatio-temporal Co-attention Fusion Network for Video Splicing   Localization

Man Lin; Gang Cao; Zijie Lou

arXiv:2309.09482·cs.CV·September 19, 2023

Spatio-temporal Co-attention Fusion Network for Video Splicing Localization

Man Lin, Gang Cao, Zijie Lou

PDF

Open Access 1 Repo

TL;DR

This paper introduces SCFNet, a novel spatio-temporal co-attention fusion network that effectively detects video splicing forgeries by capturing manipulation traces across frames, outperforming existing methods.

Contribution

The paper presents a new three-stream encoder with co-attention modules and a lightweight decoder, along with a large-scale dataset for training and benchmarking video splicing localization.

Findings

01

Outperforms state-of-the-art methods in localization accuracy

02

Demonstrates strong generalization across datasets

03

Provides a new large-scale dataset for training and evaluation

Abstract

Digital video splicing has become easy and ubiquitous. Malicious users copy some regions of a video and paste them to another video for creating realistic forgeries. It is significant to blindly detect such forgery regions in videos. In this paper, a spatio-temporal co-attention fusion network (SCFNet) is proposed for video splicing localization. Specifically, a three-stream network is used as an encoder to capture manipulation traces across multiple frames. The deep interaction and fusion of spatio-temporal forensic features are achieved by the novel parallel and cross co-attention fusion modules. A lightweight multilayer perceptron (MLP) decoder is adopted to yield a pixel-level tampering localization map. A new large-scale video splicing dataset is created for training the SCFNet. Extensive tests on benchmark datasets show that the localization and generalization performances of our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

multimediafor/scfnet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning