Video Summarization Using Deep Neural Networks: A Survey
Evlampios Apostolidis, Eleni Adamantidou, Alexandros I. Metsai,, Vasileios Mezaris, Ioannis Patras

TL;DR
This survey reviews recent deep neural network methods for generic video summarization, highlighting advances, evaluation protocols, and future research directions in creating concise video summaries.
Contribution
It provides a comprehensive taxonomy, systematic review, and performance comparison of deep-learning-based video summarization techniques, guiding future research.
Findings
Deep learning methods have significantly advanced video summarization.
Evaluation protocols vary and impact performance assessment.
Future directions include data annotation and protocol standardization.
Abstract
Video summarization technologies aim to create a concise and complete synopsis by selecting the most informative parts of the video content. Several approaches have been developed over the last couple of decades and the current state of the art is represented by methods that rely on modern deep neural network architectures. This work focuses on the recent advances in the area and provides a comprehensive survey of the existing deep-learning-based methods for generic video summarization. After presenting the motivation behind the development of technologies for video summarization, we formulate the video summarization task and discuss the main characteristics of a typical deep-learning-based analysis pipeline. Then, we suggest a taxonomy of the existing algorithms and provide a systematic review of the relevant literature that shows the evolution of the deep-learning-based video…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
