Visual Summarization of Scholarly Videos using Word Embeddings and   Keyphrase Extraction

Hang Zhou; Christian Otto; Ralph Ewerth

arXiv:1912.10809·cs.MM·December 24, 2019

Visual Summarization of Scholarly Videos using Word Embeddings and Keyphrase Extraction

Hang Zhou, Christian Otto, Ralph Ewerth

PDF

TL;DR

This paper proposes a method for creating visual summaries of scholarly videos by leveraging semantic word embeddings and keyphrase extraction from automatically generated annotations, aiding quick content understanding.

Contribution

It introduces a novel approach combining semantic embeddings and keyphrase extraction from speech and OCR annotations for video summarization.

Findings

01

Effective summarization of scholarly videos demonstrated

02

Improved quick content understanding for longer recordings

03

Utilizes automatic speech and OCR annotations

Abstract

Effective learning with audiovisual content depends on many factors. Besides the quality of the learning resource's content, it is essential to discover the most relevant and suitable video in order to support the learning process most effectively. Video summarization techniques facilitate this goal by providing a quick overview over the content. It is especially useful for longer recordings such as conference presentations or lectures. In this paper, we present an approach that generates a visual summary of video content based on semantic word embeddings and keyphrase extraction. For this purpose, we exploit video annotations that are automatically generated by speech recognition and video OCR (optical character recognition).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.