PerSEval: Assessing Personalization in Text Summarizers

Sourish Dasgupta; Ankush Chander; Parth Borad; Isha Motiyani; Tanmoy Chakraborty

arXiv:2407.00453·cs.CL·October 28, 2024

TL;DR

This paper introduces PerSEval, a new metric for evaluating personalization in text summarization, demonstrating its reliability and independence from accuracy-based measures through extensive benchmarking.

Contribution

It proposes PerSEval, a novel measure that effectively evaluates the degree of personalization in text summaries, addressing limitations of existing accuracy-based metrics.

Findings

1. PerSEval correlates well with human judgment (Pearson's r = 0.73).
2. PerSEval exhibits high rank-stability across models.
3. PerSEval provides a standalone ranking measure independent of EGISES.
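
The human-judgment agreement in the first finding is reported as Pearson's r. As a minimal sketch of how such a correlation is computed, the snippet below implements the standard Pearson formula in plain Python. The function name `pearson_r` and the two score lists are hypothetical, made up purely for illustration; they are not data or code from the paper.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations (unnormalized covariance).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations (unnormalized variances).
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical illustration only: automatic metric scores for five
# summarizers, paired with made-up averaged human ratings.
metric_scores = [0.10, 0.25, 0.40, 0.55, 0.70]
human_ratings = [1.0, 2.5, 2.0, 4.0, 4.5]

r = pearson_r(metric_scores, human_ratings)
print(f"Pearson's r = {r:.2f}")
```

A value near 1 means the metric orders and spaces the systems much as the human raters do; a value near 0 means no linear relationship.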

Abstract

Personalized summarization models cater to individuals' subjective understanding of saliency, as represented by their reading history and current topics of attention. Existing personalized text summarizers are primarily evaluated based on accuracy measures such as BLEU, ROUGE, and METEOR. However, a recent study argued that accuracy measures are inadequate for evaluating the degree of personalization of these models and proposed EGISES, the first metric to evaluate personalized text summaries. It was suggested that accuracy is a separate aspect and should be evaluated standalone. In this paper, we challenge the necessity of an accuracy leaderboard, suggesting that relying on accuracy-based aggregated results might lead to misleading conclusions. To support this, we delve deeper into EGISES, demonstrating both theoretically and empirically that it measures the degree of responsiveness, a…

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques