MeanSum: A Neural Model for Unsupervised Multi-document Abstractive Summarization
Eric Chu, Peter J. Liu

TL;DR
This paper introduces MeanSum, an unsupervised neural model for multi-document abstractive summarization that learns from unpaired reviews, generating fluent and relevant summaries without needing paired datasets.
Contribution
The paper presents a novel end-to-end neural architecture for unsupervised abstractive summarization using only review data, outperforming extractive baselines.
Findings
Generated summaries are highly abstractive and fluent.
The model effectively captures the average sentiment of reviews.
Outperforms strong extractive baseline in automated and human evaluations.
Abstract
Abstractive summarization has been studied using neural sequence transduction methods with datasets of large, paired document-summary examples. However, such datasets are rare and the models trained from them do not generalize to other domains. Recently, some progress has been made in learning sequence-to-sequence mappings with only unpaired examples. In our work, we consider the setting where there are only documents (product or business reviews) with no summaries provided, and propose an end-to-end, neural model architecture to perform unsupervised abstractive summarization. Our proposed model consists of an auto-encoder where the mean of the representations of the input reviews decodes to a reasonable summary-review while not relying on any review-specific features. We consider variants of the proposed architecture and perform an ablation study to show the importance of specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
