STAR-GNN: Spatial-Temporal Video Representation for Content-based Retrieval
Guoping Zhao, Bingqing Zhang, Mingyu Zhang, Yaxian Li, Jiajun Liu, and Ji-Rong Wen

TL;DR
STAR-GNN introduces a graph neural network-based framework for video feature representation that captures spatial and temporal dynamics, improving content-based video retrieval accuracy and robustness.
Contribution
The paper presents a multi-scale lattice feature graph model of video, paired with a pluggable GNN component whose parameters are trained with a retrieval triplet loss.
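As a rough illustration of the triplet-loss objective mentioned above, the sketch below computes the standard hinge form of the loss for one (anchor, positive, negative) triple of embeddings. The Euclidean distance, the margin value, and the function names are assumptions made for this example; the summary does not state which distance or margin the paper uses.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Hinge triplet loss: max(0, d(a, p) - d(a, n) + margin).

    The loss is zero once the negative is at least `margin` farther
    from the anchor than the positive is.
    Assumption: Euclidean distance and margin=1.0 are illustrative choices.
    """
    d_pos = euclidean(anchor, positive)
    d_neg = euclidean(anchor, negative)
    return max(0.0, d_pos - d_neg + margin)


# A relevant video embedded at the anchor and an irrelevant one far away.
well_separated = triplet_loss([0.0, 0.0], [0.0, 0.0], [3.0, 4.0])
# The same triple with positive and negative swapped is penalized.
badly_separated = triplet_loss([0.0, 0.0], [3.0, 4.0], [0.0, 0.0])
```

In a retrieval setting, the anchor would be a query video embedding, the positive a relevant video, and the negative an irrelevant one; minimizing the loss pulls relevant pairs together in embedding space.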
Findings
Achieves state-of-the-art performance in video retrieval
Effectively models dynamic and semantically rich content
Robust to noise and redundancies
Abstract
We propose a video feature representation learning framework called STAR-GNN, which applies a pluggable graph neural network component on a multi-scale lattice feature graph. The essence of STAR-GNN is to exploit both the temporal dynamics and spatial contents as well as visual connections between regions at different scales in the frames. It models a video with a lattice feature graph in which the nodes represent regions of different granularity, with weighted edges that represent the spatial and temporal links. The contextual nodes are aggregated simultaneously by graph neural networks with parameters trained with retrieval triplet loss. In the experiments, we show that STAR-GNN effectively implements a dynamic attention mechanism on video frame sequences, resulting in an emphasis on dynamic and semantically rich content in the video, and is robust to noise and redundancies.…
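To make the lattice feature graph more concrete, here is a minimal sketch of one way such a graph could be built and aggregated. Everything specific in it is an assumption for illustration: the grid scales, the uniform edge weight of 1.0, the parent-child rule for cross-scale links, and the weighted-mean update standing in for a GNN layer. The abstract does not give the paper's actual construction or weighting scheme.

```python
from itertools import product


def build_lattice_graph(num_frames, scales=(1, 2)):
    """Build a toy multi-scale lattice graph.

    A node is (t, s, i, j): frame t, an s x s grid, cell (i, j).
    Edges map an undirected node pair (frozenset) to a weight.
    Assumption: all weights are 1.0; a real model would compute them.
    """
    nodes = [(t, s, i, j)
             for t in range(num_frames)
             for s in scales
             for i, j in product(range(s), repeat=2)]
    edges = {}
    for t, s, i, j in nodes:
        node = (t, s, i, j)
        # Spatial links: right and down neighbours in the same grid.
        for di, dj in ((0, 1), (1, 0)):
            if i + di < s and j + dj < s:
                edges[frozenset({node, (t, s, i + di, j + dj)})] = 1.0
        # Temporal link: the same region in the next frame.
        if t + 1 < num_frames:
            edges[frozenset({node, (t + 1, s, i, j)})] = 1.0
        # Cross-scale link: the enclosing cell at the next coarser scale.
        k = scales.index(s)
        if k > 0:
            sp = scales[k - 1]
            parent = (t, sp, i * sp // s, j * sp // s)
            edges[frozenset({node, parent})] = 1.0
    return nodes, edges


def aggregate(features, edges):
    """One weighted-mean aggregation round over each node and its neighbours."""
    sums = {n: list(f) for n, f in features.items()}  # self term, weight 1
    totals = {n: 1.0 for n in features}
    for pair, w in edges.items():
        u, v = tuple(pair)
        for src, dst in ((u, v), (v, u)):
            sums[dst] = [a + w * b for a, b in zip(sums[dst], features[src])]
            totals[dst] += w
    return {n: [x / totals[n] for x in sums[n]] for n in features}


nodes, edges = build_lattice_graph(num_frames=2)
features = {n: [1.0] for n in nodes}  # placeholder region features
updated = aggregate(features, edges)
```

With two frames and scales of 1x1 and 2x2, this yields 10 nodes and 21 edges (8 spatial, 5 temporal, 8 cross-scale). A trained GNN would replace the fixed mean with learned transformations, which is where the triplet loss would supply the training signal.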
Taxonomy
Topics: Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
Methods: Graph Neural Network
