Comparing Retrieval-Augmentation and Parameter-Efficient Fine-Tuning for Privacy-Preserving Personalization of Large Language Models

Alireza Salemi; Hamed Zamani

arXiv:2409.09510·cs.CL·June 27, 2025·3 cites

Comparing Retrieval-Augmentation and Parameter-Efficient Fine-Tuning for Privacy-Preserving Personalization of Large Language Models

Alireza Salemi, Hamed Zamani

PDF

Open Access 1 Repo

TL;DR

This paper systematically compares retrieval-augmented generation and parameter-efficient fine-tuning for privacy-preserving personalization of large language models, showing their individual and combined benefits across diverse datasets.

Contribution

It provides the first comprehensive analysis of PEFT for LLM personalization and compares it with RAG, revealing their complementary strengths and performance dynamics.

Findings

01

Both RAG and PEFT improve personalization over non-personalized LLMs.

02

Combining RAG with PEFT yields further performance gains.

03

RAG is more effective for users with limited personal data, while PEFT excels with more data.

Abstract

Despite its substantial impact on various search, recommendation, and question answering tasks, privacy-preserving methods for personalizing large language models (LLMs) have received relatively limited exploration. There is one primary approach in this area through retrieval-augmented generation (RAG), which generates personalized outputs by enriching the input prompt with information retrieved from the user's personal data. This paper studies an orthogonal approach to RAG that involves learning user-dependent LLM parameters through parameter-efficient fine-tuning (PEFT). This paper presents the first systematic study for exploration of PEFT for LLM personalization and provides an extensive comparisons between RAG- and PEFT-based solutions, across a broad set of seven diverse datasets from the LaMP benchmark. Our results demonstrate that, on average, both RAG- and PEFT-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lamp-benchmark/lamp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Data Quality and Management · Topic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Softmax · Layer Normalization · WordPiece · Dropout · Attention Dropout · BART · Dense Connections