Redundancy Aware Multi-Reference Based Gainwise Evaluation of Extractive Summarization
Mousumi Akter, Santu Karmaker

TL;DR
This paper introduces a redundancy-aware, multi-reference, gain-based evaluation metric for extractive summarization that better aligns with human judgments than traditional metrics like ROUGE.
Contribution
It proposes a revised Sem-nCG metric that incorporates redundancy awareness and supports multiple reference summaries, improving evaluation accuracy.
Findings
Revised Sem-nCG correlates more strongly with human judgments.
The new metric outperforms ROUGE and BERTScore in both single and multiple reference evaluations.
Redundancy awareness enhances the evaluation of extractive summaries.
Abstract
The ROUGE metric is commonly used to evaluate extractive summarization task, but it has been criticized for its lack of semantic awareness and its ignorance about the ranking quality of the extractive summarizer. Previous research has introduced a gain-based automated metric called Sem-nCG that addresses these issues, as it is both rank and semantic aware. However, it does not consider the amount of redundancy present in a model summary and currently does not support evaluation with multiple reference summaries. It is essential to have a model summary that balances importance and diversity, but finding a metric that captures both of these aspects is challenging. In this paper, we propose a redundancy-aware Sem-nCG metric and demonstrate how the revised Sem-nCG metric can be used to evaluate model summaries against multiple references as well which was missing in previous research.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
