Counteracting temporal attacks in Video Copy Detection
Katarzyna Fojcik, Piotr Syga

TL;DR
This paper improves video copy detection by enhancing robustness against temporal attacks and reducing computational costs, making it more practical for real-world applications.
Contribution
It introduces a novel frame selection strategy based on local maxima of interframe differences, improving robustness and efficiency over existing methods.
Findings
Achieves 1.4 to 5.8 times efficiency increase over 1 FPS baseline.
Maintains comparable micro-average precision ($$AP) with improved robustness.
Reduces representation size by 56% and doubles inference speed.
Abstract
Video Copy Detection (VCD) plays a crucial role in copyright protection and content verification by identifying duplicates and near-duplicates in large-scale video databases. The META AI Challenge on video copy detection provided a benchmark for evaluating state-of-the-art methods, with the Dual-level detection approach emerging as a winning solution. This method integrates Video Editing Detection and Frame Scene Detection to handle adversarial transformations and large datasets efficiently. However, our analysis reveals significant limitations in the VED component, particularly in its ability to handle exact copies. Moreover, Dual-level detection shows vulnerability to temporal attacks. To address it, we propose an improved frame selection strategy based on local maxima of interframe differences, which enhances robustness against adversarial temporal modifications while significantly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Advanced Steganography and Watermarking Techniques
