Synthetic Interaction Data for Scalable Personalization in Large Language Models
Yuchen Ma, Yue Huang, Wenjie Wang, Xiaonan Luo, Xiangliang Zhang, Stefan Feuerriegel

TL;DR
This paper introduces PersonaGym, a synthetic data generation framework for personalized LLM interactions, and PPOpt, a prompt optimization method that enhances personalization without modifying the LLM, demonstrated through extensive experiments.
Contribution
The paper presents a novel synthetic data generation framework and a scalable prompt optimization method for personalized LLM deployment, addressing data scarcity and preference modeling challenges.
Findings
PersonaGym produces high-fidelity, diverse interaction data.
PPOpt improves personalization and robustness over baselines.
Enhanced task performance and preference alignment in experiments.
Abstract
Personalized prompting offers large opportunities for deploying large language models (LLMs) to diverse users, yet existing prompt optimization methods primarily focus on task-level optimization while largely overlooking user-specific preferences and latent constraints of individual users. This gap is primarily due to (i) the absence of high-quality, privacy-sensitive data that capture personalized user-LLM interactions at scale, and (ii) the lack of robust reward signals for individual preferences. To overcome existing data limitations, we introduce a high-fidelity synthetic data generation framework called PersonaGym. Unlike prior work that treats personalization as static persona-preference pairs, PersonaGym models a dynamic preference process via an agentic LLM system to simulate realistic preference behaviors and semantic-aware noise in order to generate personalized multi-turn…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPersona Design and Applications · Recommender Systems and Techniques · Topic Modeling
