Video Segment Copy Detection Using Memory Constrained Hierarchical Batch-Normalized LSTM Autoencoder
Arjun Krishna, A S Akil Arif Ibrahim

TL;DR
This paper presents a deep learning-based video hashing method for scalable and robust video segment copy detection, effectively handling temporal and spatial transformations, especially time cropping, demonstrated on a large dataset.
Contribution
We introduce a novel memory constrained hierarchical batch-normalized LSTM autoencoder for video hashing that captures temporal features and is resilient to time cropping attacks.
Findings
Effective detection of video segment copies with temporal and spatial transformations.
High accuracy demonstrated on a large dataset of 25,000 videos.
Robustness to temporal cropping in video copy detection.
Abstract
In this report, we introduce a video hashing method for scalable video segment copy detection. The objective of video segment copy detection is to find the video (s) present in a large database, one of whose segments (cropped in time) is a (transformed) copy of the given query video. This transformation may be temporal (for example frame dropping, change in frame rate) or spatial (brightness and contrast change, addition of noise etc.) in nature although the primary focus of this report is detecting temporal attacks. The video hashing method proposed by us uses a deep learning neural network to learn variable length binary hash codes for the entire video considering both temporal and spatial features into account. This is in contrast to most existing video hashing methods, as they use conventional image hashing techniques to obtain hash codes for a video after extracting features for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Advanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis
