Graph Unfolding and Sampling for Transitory Video Summarization via Gershgorin Disc Alignment
Sadid Sahami, Gene Cheung, Chia-Wen Lin

TL;DR
This paper introduces a fast graph sampling method based on Gershgorin disc alignment for efficient transitory video summarization, achieving comparable or better results with lower complexity.
Contribution
It proposes a novel graph unfolding and sampling algorithm utilizing Gershgorin disc alignment, with provable performance bounds for transitory video summarization.
Findings
Achieves comparable or better summarization performance than state-of-the-art methods.
Reduces computational complexity significantly.
Provides theoretical guarantees for graph unfolding procedures.
Abstract
User-generated videos (UGVs) uploaded from mobile phones to social media sites like YouTube and TikTok are short and non-repetitive. We summarize a transitory UGV into several keyframes in linear time via fast graph sampling based on Gershgorin disc alignment (GDA). Specifically, we first model a sequence of frames in a UGV as an -hop path graph for , where the similarity between two frames within time instants is encoded as a positive edge based on feature similarity. Towards efficient sampling, we then "unfold" to a -hop path graph , specified by a generalized graph Laplacian matrix , via one of two graph unfolding procedures with provable performance bounds. We show that maximizing the smallest eigenvalue of a coefficient matrix $\mathbf{B} =…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques
