EntSUM: A Data Set for Entity-Centric Summarization
Mounica Maddela, Mayank Kulkarni, Daniel Preotiuc-Pietro

TL;DR
EntSUM is a new human-annotated dataset designed for entity-centric controllable summarization, highlighting the challenges and improving existing methods to generate summaries focused on specific named entities.
Contribution
The paper introduces EntSUM, a novel dataset for entity-centric summarization, and proposes extensions to current models that better generate entity-focused summaries.
Findings
Existing controllable summarization methods struggle with entity-centric tasks.
Proposed model extensions significantly improve entity-focused summarization performance.
The dataset reveals the complexity and challenges of entity-centric summarization.
Abstract
Controllable summarization aims to provide summaries that take into account user-specified aspects and preferences to better assist them with their information need, as opposed to the standard summarization setup which build a single generic summary of a document. We introduce a human-annotated data set EntSUM for controllable summarization with a focus on named entities as the aspects to control. We conduct an extensive quantitative analysis to motivate the task of entity-centric summarization and show that existing methods for controllable summarization fail to generate entity-centric summaries. We propose extensions to state-of-the-art summarization approaches that achieve substantially better results on our data set. Our analysis and results show the challenging nature of this task and of the proposed data set.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies
