ASPECTNEWS: Aspect-Oriented Summarization of News Documents
Ojas Ahuja, Jiacheng Xu, Akshay Gupta, Kevin Horecka, Greg Durrett

TL;DR
This paper introduces AspectNews, a dataset for aspect-oriented news summarization, and evaluates methods for generating focused summaries that address specific subtopics without in-domain training.
Contribution
It presents a new dataset for aspect-oriented summarization and compares training schemes to improve focused summaries in news domains.
Findings
Synthetic training data improves aspect-focused summarization.
Keyword sensitivity affects summary quality.
Focused summaries outperform generic and keyword-based methods.
Abstract
Generic summaries try to cover an entire document and query-based summaries try to answer document-specific questions. But real users' needs often fall in between these extremes and correspond to aspects, high-level topics discussed among similar types of documents. In this paper, we collect a dataset of realistic aspect-oriented summaries, AspectNews, which covers different subtopics about articles in news sub-domains. We annotate data across two domains of articles, earthquakes and fraud investigations, where each article is annotated with two distinct summaries focusing on different aspects for each domain. A system producing a single generic summary cannot concisely satisfy both aspects. Our focus in evaluation is how well existing techniques can generalize to these domains without seeing in-domain training data, so we turn to techniques to construct synthetic training data that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques
