Improved Spoken Document Summarization with Coverage Modeling Techniques
Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang

TL;DR
This paper introduces novel coverage-based techniques for spoken document summarization that directly enhance diversity and relevance, outperforming traditional redundancy-reduction methods like MMR.
Contribution
It proposes two new methods that increase diversity in extractive summaries and integrates various representation techniques to improve summarization quality.
Findings
Proposed methods outperform MMR in relevance and coverage.
Increased diversity leads to more comprehensive summaries.
Empirical results validate the effectiveness of the new techniques.
Abstract
Extractive summarization aims at selecting a set of indicative sentences from a source document as a summary that can express the major theme of the document. A general consensus on extractive summarization is that both relevance and coverage are critical issues to address. The existing methods designed to model coverage can be characterized by either reducing redundancy or increasing diversity in the summary. Maximal margin relevance (MMR) is a widely-cited method since it takes both relevance and redundancy into account when generating a summary for a given document. In addition to MMR, there is only a dearth of research concentrating on reducing redundancy or increasing diversity for the spoken document summarization task, as far as we are aware. Motivated by these observations, two major contributions are presented in this paper. First, in contrast to MMR, which considers coverage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
