VADER: Video Alignment Differencing and Retrieval
Alexander Black, Simon Jenni, Tu Bui, Md. Mehrab Tanjim, Stefano, Petrangeli, Ritwik Sinha, Viswanathan Swaminathan, John Collomosse

TL;DR
VADER is a novel method for detecting manipulated videos by matching, aligning, and summarizing differences in video content to verify authenticity and combat misinformation.
Contribution
The paper introduces VADER, a comprehensive framework combining spatio-temporal matching, alignment, and change detection for verifying video authenticity.
Findings
Effective in identifying manipulated regions in videos
Robust against temporal misalignments and non-editorial changes
Enables reliable video provenance verification
Abstract
We propose VADER, a spatio-temporal matching, alignment, and change summarization method to help fight misinformation spread via manipulated videos. VADER matches and coarsely aligns partial video fragments to candidate videos using a robust visual descriptor and scalable search over adaptively chunked video content. A transformer-based alignment module then refines the temporal localization of the query fragment within the matched video. A space-time comparator module identifies regions of manipulation between aligned content, invariant to any changes due to any residual temporal misalignments or artifacts arising from non-editorial changes of the content. Robustly matching video to a trusted source enables conclusions to be drawn on video provenance, enabling informed trust decisions on content encountered.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
VADER: Video Alignment Differencing and Retrieval· youtube
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Video Analysis and Summarization
