SNaC: Coherence Error Detection for Narrative Summarization
Tanya Goyal, Junyi Jessy Li, Greg Durrett

TL;DR
This paper introduces SNaC, a framework for evaluating narrative coherence in long summaries, addressing gaps in current assessment methods by providing detailed annotations and a classifier for coherence errors.
Contribution
SNaC provides a novel taxonomy, annotation protocol, and automatic classifier for coherence errors in long summaries, advancing evaluation and modeling of narrative coherence.
Findings
Collected span-level annotations for 6.6k sentences across 150 summaries
Developed a strong classifier for localizing coherence errors
Characterized coherence errors in state-of-the-art summarization models
Abstract
Progress in summarizing long texts is inhibited by the lack of appropriate evaluation frameworks. When a long summary must be produced to appropriately cover the facets of that text, that summary needs to present a coherent narrative to be understandable by a reader, but current automatic and human evaluation methods fail to identify gaps in coherence. In this work, we introduce SNaC, a narrative coherence evaluation framework rooted in fine-grained annotations for long summaries. We develop a taxonomy of coherence errors in generated narrative summaries and collect span-level annotations for 6.6k sentences across 150 book and movie screenplay summaries. Our work provides the first characterization of coherence errors generated by state-of-the-art summarization models and a protocol for eliciting coherence judgments from crowd annotators. Furthermore, we show that the collected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Text Analysis Techniques
