CASPR: Automated Evaluation Metric for Contrastive Summarization
Nirupan Ananthamurugan, Dat Duong, Philip George, Ankita Gupta,, Sandeep Tata, Beliz Gunel

TL;DR
CASPR is an automated, NLI-based metric designed to reliably evaluate the contrastiveness of summaries in contrastive summarization tasks, outperforming existing token-overlap and BERTScore baselines.
Contribution
This work introduces CASPR, a novel lightweight NLI-based evaluation metric that effectively measures contrastiveness in summaries without human input.
Findings
CASPR more reliably captures contrastiveness than baselines.
CASPR outperforms Distinctiveness Score and BERTScore on CoCoTRIP dataset.
The metric is simple, efficient, and sensitive to meaning-preserving lexical variations.
Abstract
Summarizing comparative opinions about entities (e.g., hotels, phones) from a set of source reviews, often referred to as contrastive summarization, can considerably aid users in decision making. However, reliably measuring the contrastiveness of the output summaries without relying on human evaluations remains an open problem. Prior work has proposed token-overlap based metrics, Distinctiveness Score, to measure contrast which does not take into account the sensitivity to meaning-preserving lexical variations. In this work, we propose an automated evaluation metric CASPR to better measure contrast between a pair of summaries. Our metric is based on a simple and light-weight method that leverages natural language inference (NLI) task to measure contrast by segmenting reviews into single-claim sentences and carefully aggregating NLI scores between them to come up with a summary-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Data Quality and Management · Topic Modeling
MethodsSparse Evolutionary Training
