TL;DR
This paper introduces ConvoSumm, a new benchmark for online conversation summarization, and enhances abstractive summarization by integrating argument mining to better capture issues, viewpoints, and assertions.
Contribution
It creates four new annotated datasets for diverse online conversations and incorporates argument mining to improve summarization quality.
Findings
State-of-the-art models perform well on new datasets.
Argument mining improves summarization accuracy.
Benchmark establishes strong baselines for conversation summarization.
Abstract
While online conversations can cover a vast amount of information in many different formats, abstractive text summarization has primarily focused on modeling solely news articles. This research gap is due, in part, to the lack of standardized datasets for summarizing online discussions. To address this gap, we design annotation protocols motivated by an issues--viewpoints--assertions framework to crowdsource four new datasets on diverse online conversation forms of news comments, discussion forums, community question answering forums, and email threads. We benchmark state-of-the-art models on our datasets and analyze characteristics associated with the data. To create a comprehensive benchmark, we also evaluate these models on widely-used conversation summarization datasets to establish strong baselines in this domain. Furthermore, we incorporate argument mining through graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
