Video Summarization via Actionness Ranking
Mohamed Elfeki, Ali Borji

TL;DR
This paper introduces a novel video summarization method that leverages the concept of actionness, or the intensity of deliberate motion, to improve the quality of generated summaries, validated through user studies and benchmark tests.
Contribution
It proposes a new approach that uses actionness ranking to guide video summary generation, differing from prior heuristic or complexity-based methods.
Findings
Actionness correlates strongly with human summary inclusion decisions.
The proposed method outperforms state-of-the-art summarization techniques.
User studies confirm the reliability of actionness as a key cue.
Abstract
To automatically produce a brief yet expressive summary of a long video, an automatic algorithm should start by resembling the human process of summary generation. Prior work proposed supervised and unsupervised algorithms to train models for learning the underlying behavior of humans by increasing modeling complexity or craft-designing better heuristics to simulate human summary generation process. In this work, we take a different approach by analyzing a major cue that humans exploit for the summary generation; the nature and intensity of actions. We empirically observed that a frame is more likely to be included in human-generated summaries if it contains a substantial amount of deliberate motion performed by an agent, which is referred to as actionness. Therefore, we hypothesize that learning to automatically generate summaries involves an implicit knowledge of actionness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Human Pose and Action Recognition · Human Motion and Animation
