SumGraph: Video Summarization via Recursive Graph Modeling
Jungin Park, Jiyoung Lee, Ig-Jae Kim, and Kwanghoon Sohn

TL;DR
SumGraph introduces a recursive graph modeling approach for video summarization, capturing semantic relationships among frames to improve keyframe selection, achieving state-of-the-art results in both supervised and unsupervised settings.
Contribution
It presents a novel recursive graph modeling network, SumGraph, for video summarization that considers frame relationships and can be adapted to unsupervised learning.
Findings
Achieved state-of-the-art performance on multiple benchmarks.
Effectively models semantic relationships among video frames.
Works well in both supervised and unsupervised scenarios.
Abstract
The goal of video summarization is to select keyframes that are visually diverse and can represent a whole story of an input video. State-of-the-art approaches for video summarization have mostly regarded the task as a frame-wise keyframe selection problem by aggregating all frames with equal weight. However, to find informative parts of the video, it is necessary to consider how all the frames of the video are related to each other. To this end, we cast video summarization as a graph modeling problem. We propose recursive graph modeling networks for video summarization, termed SumGraph, to represent a relation graph, where frames are regarded as nodes and nodes are connected by semantic relationships among frames. Our networks accomplish this through a recursive approach to refine an initially estimated graph to correctly classify each node as a keyframe by reasoning the graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
