Visual Summarization of Scholarly Videos using Word Embeddings and Keyphrase Extraction
Hang Zhou, Christian Otto, Ralph Ewerth

TL;DR
This paper proposes a method for creating visual summaries of scholarly videos by leveraging semantic word embeddings and keyphrase extraction from automatically generated annotations, aiding quick content understanding.
Contribution
It introduces a novel approach combining semantic embeddings and keyphrase extraction from speech and OCR annotations for video summarization.
Findings
Effective summarization of scholarly videos demonstrated
Improved quick content understanding for longer recordings
Utilizes automatic speech and OCR annotations
Abstract
Effective learning with audiovisual content depends on many factors. Besides the quality of the learning resource's content, it is essential to discover the most relevant and suitable video in order to support the learning process most effectively. Video summarization techniques facilitate this goal by providing a quick overview over the content. It is especially useful for longer recordings such as conference presentations or lectures. In this paper, we present an approach that generates a visual summary of video content based on semantic word embeddings and keyphrase extraction. For this purpose, we exploit video annotations that are automatically generated by speech recognition and video OCR (optical character recognition).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
