Improving User Privacy in Personalized Generation: Client-Side Retrieval-Augmented Modification of Server-Side Generated Speculations

Alireza Salemi; Hamed Zamani

arXiv:2601.17569·cs.CL·January 27, 2026

Improving User Privacy in Personalized Generation: Client-Side Retrieval-Augmented Modification of Server-Side Generated Speculations

Alireza Salemi, Hamed Zamani

PDF

Open Access

TL;DR

The paper introduces $P^3$, a client-server framework that enhances personalized language model outputs while preserving user privacy by limiting private data exposure during generation.

Contribution

It proposes a novel interactive approach where a small client model refines server-generated drafts using private data, balancing personalization and privacy.

Findings

01

$P^3$ outperforms non-personalized and personalized baselines by 7.4%-9%.

02

Recovers 90.3%-95.7% of full-profile utility.

03

Maintains low privacy leakage with only 1.5%-3.5% additional risk.

Abstract

Personalization is crucial for aligning Large Language Model (LLM) outputs with individual user preferences and background knowledge. State-of-the-art solutions are based on retrieval augmentation, where relevant context from a user profile is retrieved for LLM consumption. These methods deal with a trade-off between exposing retrieved private data to cloud providers and relying on less capable local models. We introduce $P^{3}$ , an interactive framework for high-quality personalization without revealing private profiles to server-side LLMs. In $P^{3}$ , a large server-side model generates a sequence of $k$ draft tokens based solely on the user query, while a small client-side model, with retrieval access to the user's private profile, evaluates and modifies these drafts to better reflect user preferences. This process repeats until an end token is generated. Experiments on LaMP-QA, a recent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Topic Modeling · Data Quality and Management