Use of Affective Visual Information for Summarization of Human-Centric Videos
Berkay K\"opr\"u, Engin Erzin

TL;DR
This paper introduces affective information into supervised human-centric video summarization, leveraging emotion recognition and attention mechanisms to improve summary quality, especially for videos with significant human and emotional content.
Contribution
It proposes novel affective video summarization architectures that incorporate emotional attributes and attention mechanisms, enhancing performance on human-centric videos.
Findings
AVSUM-GRU with early fusion achieves strong results.
Temporal attention improves summarization quality.
Significant performance gains on human-centric videos.
Abstract
Increasing volume of user-generated human-centric video content and their applications, such as video retrieval and browsing, require compact representations that are addressed by the video summarization literature. Current supervised studies formulate video summarization as a sequence-to-sequence learning problem and the existing solutions often neglect the surge of human-centric view, which inherently contains affective content. In this study, we investigate the affective-information enriched supervised video summarization task for human-centric videos. First, we train a visual input-driven state-of-the-art continuous emotion recognition model (CER-NET) on the RECOLA dataset to estimate emotional attributes. Then, we integrate the estimated emotional attributes and the high-level representations from the CER-NET with the visual information to define the proposed affective video…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Music and Audio Processing · Advanced Image and Video Retrieval Techniques
MethodsGated Recurrent Unit
