Graph Based Temporal Aggregation for Video Retrieval
Arvind Srinivasan, Aprameya Bharadwaj, Aveek Saha, Subramanyam, Natarajan

TL;DR
This paper introduces a graph-based method for video retrieval using image queries from outside the dataset, leveraging node features from combined video frames to improve scalability and generalization.
Contribution
The paper proposes a novel graph-based approach for video retrieval through external image queries, addressing scalability and generalization issues of prior methods.
Findings
Effective retrieval on MSR-VTT dataset using external image queries
ResNet-152 outperforms ResNet-50 in retrieval accuracy
The approach achieves competitive P@5, P@10, P@20 metrics
Abstract
Large scale video retrieval is a field of study with a lot of ongoing research. Most of the work in the field is on video retrieval through text queries using techniques such as VSE++. However, there is little research done on video retrieval through image queries, and the work that has been done in this field either uses image queries from within the video dataset or iterates through videos frame by frame. These approaches are not generalized for queries from outside the dataset and do not scale well for large video datasets. To overcome these issues, we propose a new approach for video retrieval through image queries where an undirected graph is constructed from the combined set of frames from all videos to be searched. The node features of this graph are used in the task of video retrieval. Experimentation is done on the MSR-VTT dataset by using query images from outside the dataset.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Video Analysis and Summarization
MethodsGraphSAGE · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Convolution · Average Pooling · Bottleneck Residual Block · Batch Normalization · Residual Connection · Residual Block · Max Pooling
