Reflective Personalization Optimization: A Post-hoc Rewriting Framework for Black-Box Large Language Models
Teqi Hao, Xioayu Tan, Shaojie Shi, Yinghui Xu, Xihe Qiu

TL;DR
This paper introduces Reflective Personalization Optimization (RPO), a two-stage framework that improves personalization of black-box large language models by decoupling content generation from user alignment through explicit rewriting.
Contribution
RPO presents a novel post-hoc rewriting framework that separates content creation from personalization, outperforming existing context injection methods in quality and control.
Findings
RPO significantly outperforms state-of-the-art baselines on LaMP benchmark.
Explicit response shaping is more effective than implicit context injection.
RPO is model-agnostic and easily integrable with various base models.
Abstract
The personalization of black-box large language models (LLMs) is a critical yet challenging task. Existing approaches predominantly rely on context injection, where user history is embedded into the prompt to directly guide the generation process. However, this single-step paradigm imposes a dual burden on the model: generating accurate content while simultaneously aligning with user-specific styles. This often results in a trade-off that compromises output quality and limits precise control. To address this fundamental tension, we propose Reflective Personalization Optimization (RPO), a novel framework that redefines the personalization paradigm by decoupling content generation from alignment. RPO operates in two distinct stages: first, a base model generates a high-quality, generic response; then, an external reflection module explicitly rewrites this output to align with the user's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Recommender Systems and Techniques · Multimodal Machine Learning Applications
