StreamHover: Livestream Transcript Summarization and Annotation

Sangwoo Cho; Franck Dernoncourt; Tim Ganter; Trung Bui and; Nedim Lipka; Walter Chang; Hailin Jin; Jonathan Brandt; Hassan; Foroosh; Fei Liu

arXiv:2109.05160·cs.CL·September 14, 2021

StreamHover: Livestream Transcript Summarization and Annotation

Sangwoo Cho, Franck Dernoncourt, Tim Ganter, Trung Bui and, Nedim Lipka, Walter Chang, Hailin Jin, Jonathan Brandt, Hassan, Foroosh, Fei Liu

PDF

Open Access 1 Repo

TL;DR

StreamHover introduces a large annotated dataset and a neural extractive summarization model tailored for livestream transcripts, addressing challenges posed by informal spoken language and enabling efficient content browsing.

Contribution

The paper presents a new annotated dataset of over 500 hours of livestream transcripts and a novel neural extractive summarization model using vector-quantized variational autoencoders.

Findings

01

Model outperforms strong baselines in summarization tasks.

02

Large-scale annotated dataset enhances research in livestream transcript summarization.

03

Proposes a generalizable approach for summarizing informal spoken content.

Abstract

With the explosive growth of livestream broadcasting, there is an urgent need for new summarization technology that enables us to create a preview of streamed content and tap into this wealth of knowledge. However, the problem is nontrivial due to the informal nature of spoken language. Further, there has been a shortage of annotated datasets that are necessary for transcript summarization. In this paper, we present StreamHover, a framework for annotating and summarizing livestream transcripts. With a total of over 500 hours of videos annotated with both extractive and abstractive summaries, our benchmark dataset is significantly larger than currently existing annotated corpora. We explore a neural extractive summarization model that leverages vector-quantized variational autoencoder to learn latent vector representations of spoken utterances and identify salient utterances from the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ucfnlp/streamhover
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Music and Audio Processing