PerSEval: Assessing Personalization in Text Summarizers
Sourish Dasgupta, Ankush Chander, Parth Borad, Isha Motiyani, Tanmoy, Chakraborty

TL;DR
This paper introduces PerSEval, a new metric for evaluating personalization in text summarization, demonstrating its reliability and independence from accuracy-based measures through extensive benchmarking.
Contribution
It proposes PerSEval, a novel measure that effectively evaluates the degree of personalization in text summaries, addressing limitations of existing accuracy-based metrics.
Findings
PerSEval correlates well with human judgment (Pearson's r=0.73).
PerSEval exhibits high rank-stability across models.
PerSEval provides a standalone ranking measure independent of EGISES.
Abstract
Personalized summarization models cater to individuals' subjective understanding of saliency, as represented by their reading history and current topics of attention. Existing personalized text summarizers are primarily evaluated based on accuracy measures such as BLEU, ROUGE, and METEOR. However, a recent study argued that accuracy measures are inadequate for evaluating the degree of personalization of these models and proposed EGISES, the first metric to evaluate personalized text summaries. It was suggested that accuracy is a separate aspect and should be evaluated standalone. In this paper, we challenge the necessity of an accuracy leaderboard, suggesting that relying on accuracy-based aggregated results might lead to misleading conclusions. To support this, we delve deeper into EGISES, demonstrating both theoretically and empirically that it measures the degree of responsiveness, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
