Circulant temporal encoding for video retrieval and temporal alignment
Matthijs Douze, J\'er\^ome Revaud, Jakob Verbeek, Herv\'e J\'egou,, Cordelia Schmid

TL;DR
This paper introduces a circulant temporal encoding method for efficient video retrieval and alignment, enabling accurate localization of matching segments and robust global temporal alignment of videos in a computationally efficient manner.
Contribution
It proposes a novel circulant matrix-based encoding for joint appearance and temporal order representation, improving efficiency and accuracy in video retrieval and alignment tasks.
Findings
Significant reduction in computational complexity.
Accurate localization of matching video segments.
Robust global temporal alignment of multiple videos.
Abstract
We address the problem of specific video event retrieval. Given a query video of a specific event, e.g., a concert of Madonna, the goal is to retrieve other videos of the same event that temporally overlap with the query. Our approach encodes the frame descriptors of a video to jointly represent their appearance and temporal order. It exploits the properties of circulant matrices to efficiently compare the videos in the frequency domain. This offers a significant gain in complexity and accurately localizes the matching parts of videos. The descriptors can be compressed in the frequency domain with a product quantizer adapted to complex numbers. In this case, video retrieval is performed without decompressing the descriptors. We also consider the temporal alignment of a set of videos. We exploit the matching confidence and an estimate of the temporal offset computed for all pairs of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques
