Step-Back Profiling: Distilling User History for Personalized Scientific Writing
Xiangru Tang, Xingyao Zhang, Yanjun Shao, Jie Wu, Yilun Zhao, Arman, Cohan, Ming Gong, Dongmei Zhang, Mark Gerstein

TL;DR
This paper introduces STEP-BACK PROFILING, a method that personalizes large language models for scientific writing by distilling user history into concise profiles, demonstrated on a new dataset with improved performance over baselines.
Contribution
The paper presents a novel profiling approach for LLM personalization in scientific writing and introduces the PSW dataset for multi-user experiments.
Findings
Effective user profiling improves personalized scientific writing.
Our method outperforms baselines by up to 3.6 points on LaMP.
Ablation studies confirm the importance of each component.
Abstract
Large language models (LLM) excel at a variety of natural language processing tasks, yet they struggle to generate personalized content for individuals, particularly in real-world scenarios like scientific writing. Addressing this challenge, we introduce STEP-BACK PROFILING to personalize LLMs by distilling user history into concise profiles, including essential traits and preferences of users. To conduct the experiments, we construct a Personalized Scientific Writing (PSW) dataset to study multi-user personalization. PSW requires the models to write scientific papers given specialized author groups with diverse academic backgrounds. As for the results, we demonstrate the effectiveness of capturing user characteristics via STEP-BACK PROFILING for collaborative writing. Moreover, our approach outperforms the baselines by up to 3.6 points on the general personalization benchmark (LaMP),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling
