TL;DR
This paper introduces BigSurvey, a large-scale dataset for multi-document summarization of academic papers, and proposes CAST, a novel method that effectively generates structured summaries from numerous input documents.
Contribution
The paper presents BigSurvey, the first large-scale dataset for academic multi-document summarization, and introduces CAST, a new method for efficient, structured summarization of many papers.
Findings
CAST outperforms existing summarization methods
BigSurvey enables comprehensive academic paper summarization
Efficient processing of long texts achieved
Abstract
Writing a survey paper on one research topic usually needs to cover the salient content from numerous related papers, which can be modeled as a multi-document summarization (MDS) task. Existing MDS datasets usually focus on producing the structureless summary covering a few input documents. Meanwhile, previous structured summary generation works focus on summarizing a single document into a multi-section summary. These existing datasets and methods cannot meet the requirements of summarizing numerous academic papers into a structured summary. To deal with the scarcity of available data, we propose BigSurvey, the first large-scale dataset for generating comprehensive summaries of numerous academic papers on each topic. We collect target summaries from more than seven thousand survey papers and utilize their 430 thousand reference papers' abstracts as input documents. To organize the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Linear Layer · Attention Dropout · Layer Normalization · Multi-Head Attention · Refunds@Expedia|||How do I get a full refund from Expedia? · Adam · Softmax · Residual Connection · Weight Decay
