PersonalSum: A User-Subjective Guided Personalized Summarization Dataset for Large Language Models
Lemei Zhang, Peng Liu, Marcus Tiedemann Oekland Henriksboe, Even W., Lauvrak, Jon Atle Gulla, Heri Ramampiaro

TL;DR
This paper introduces PersonalSum, a new dataset for personalized summarization that captures individual user preferences, revealing that existing large language models struggle to generate truly personalized summaries despite their generic capabilities.
Contribution
The paper presents the first manually annotated dataset for personalized summarization, focusing on user-specific preferences and analyzing key factors influencing personalized summaries generated by LLMs.
Findings
Entities/topics influence user preferences but are not the sole factor.
Personalized summarization remains a challenging task for current LLMs.
The dataset enables future research on aligning summaries with individual user needs.
Abstract
With the rapid advancement of Natural Language Processing in recent years, numerous studies have shown that generic summaries generated by Large Language Models (LLMs) can sometimes surpass those annotated by experts, such as journalists, according to human evaluations. However, there is limited research on whether these generic summaries meet the individual needs of ordinary people. The biggest obstacle is the lack of human-annotated datasets from the general public. Existing work on personalized summarization often relies on pseudo datasets created from generic summarization datasets or controllable tasks that focus on specific named entities or other aspects, such as the length and specificity of generated summaries, collected from hypothetical tasks without the annotators' initiative. To bridge this gap, we propose a high-quality, personalized, manually annotated abstractive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsFocus
