Co-Regularized Deep Representations for Video Summarization
Olivier Mor\`ere, Hanlin Goh, Antoine Veillard, Vijay Chandrasekhar,, Jie Lin

TL;DR
This paper introduces a novel deep learning framework combining CNNs and RBMs with co-regularization to generate more relevant and attractive video summaries, especially effective with fewer keyframes.
Contribution
It presents a new multimodal deep learning approach with co-regularization for improved keyframe selection in video summarization, outperforming existing methods.
Findings
Outperforms baseline schemes in attractiveness and informativeness
More effective for smaller summaries
Consistently superior in user studies
Abstract
Compact keyframe-based video summaries are a popular way of generating viewership on video sharing platforms. Yet, creating relevant and compelling summaries for arbitrarily long videos with a small number of keyframes is a challenging task. We propose a comprehensive keyframe-based summarization framework combining deep convolutional neural networks and restricted Boltzmann machines. An original co-regularization scheme is used to discover meaningful subject-scene associations. The resulting multimodal representations are then used to select highly-relevant keyframes. A comprehensive user study is conducted comparing our proposed method to a variety of schemes, including the summarization currently in use by one of the most popular video sharing websites. The results show that our method consistently outperforms the baseline schemes for any given amount of keyframes both in terms of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Advanced Image and Video Retrieval Techniques · Music and Audio Processing
