SupMMD: A Sentence Importance Model for Extractive Summarization using Maximum Mean Discrepancy
Umanga Bista, Alexander Patrick Mathews, Aditya Krishna Menon, Lexing, Xie

TL;DR
SupMMD is a novel summarization technique that uses maximum mean discrepancy to effectively identify and extract new and relevant information from multiple documents, outperforming existing methods.
Contribution
It introduces SupMMD, combining supervised and unsupervised learning with multiple kernel learning for improved generic and update summarization.
Findings
Outperforms state-of-the-art on DUC-2004 and TAC-2009 datasets.
Effectively captures new information in update summarization.
Utilizes multiple similarity sources for better content coverage.
Abstract
Most work on multi-document summarization has focused on generic summarization of information present in each individual document set. However, the under-explored setting of update summarization, where the goal is to identify the new information present in each set, is of equal practical interest (e.g., presenting readers with updates on an evolving news topic). In this work, we present SupMMD, a novel technique for generic and update summarization based on the maximum mean discrepancy from kernel two-sample testing. SupMMD combines both supervised learning for salience and unsupervised learning for coverage and diversity. Further, we adapt multiple kernel learning to make use of similarity across multiple information sources (e.g., text features and knowledge based concepts). We show the efficacy of SupMMD in both generic and update summarization tasks by meeting or exceeding the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
