Improving User Privacy in Personalized Generation: Client-Side Retrieval-Augmented Modification of Server-Side Generated Speculations
Alireza Salemi, Hamed Zamani

TL;DR
The paper introduces $P^3$, a client-server framework that enhances personalized language model outputs while preserving user privacy by limiting private data exposure during generation.
Contribution
It proposes a novel interactive approach where a small client model refines server-generated drafts using private data, balancing personalization and privacy.
Findings
$P^3$ outperforms non-personalized and personalized baselines by 7.4%-9%.
Recovers 90.3%-95.7% of full-profile utility.
Maintains low privacy leakage with only 1.5%-3.5% additional risk.
Abstract
Personalization is crucial for aligning Large Language Model (LLM) outputs with individual user preferences and background knowledge. State-of-the-art solutions are based on retrieval augmentation, where relevant context from a user profile is retrieved for LLM consumption. These methods deal with a trade-off between exposing retrieved private data to cloud providers and relying on less capable local models. We introduce , an interactive framework for high-quality personalization without revealing private profiles to server-side LLMs. In , a large server-side model generates a sequence of draft tokens based solely on the user query, while a small client-side model, with retrieval access to the user's private profile, evaluates and modifies these drafts to better reflect user preferences. This process repeats until an end token is generated. Experiments on LaMP-QA, a recent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Topic Modeling · Data Quality and Management
