Few-shot Personalization of LLMs with Mis-aligned Responses

Jaehyung Kim; Yiming Yang

arXiv:2406.18678·cs.LG·March 5, 2025·3 cites

Few-shot Personalization of LLMs with Mis-aligned Responses

Jaehyung Kim, Yiming Yang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Fermi, a novel few-shot personalization method for large language models that uses mis-aligned responses and user profiles to iteratively improve personalized prompts, enhancing response quality.

Contribution

Fermi is the first approach to leverage mis-aligned responses and user profiles for iterative prompt personalization in LLMs, improving personalization effectiveness.

Findings

01

Fermi significantly outperforms baseline methods on various benchmarks.

02

Iterative prompt refinement improves personalization accuracy.

03

Using mis-aligned responses enhances the learning of personalized prompts.

Abstract

As the diversity of users increases, the capability of providing personalized responses by large language models (LLMs) has become increasingly important. Existing approaches have only limited successes in LLM personalization, due to the absence of personalized learning or the reliance on shared personal data. This paper proposes a new approach for a few-shot personalization of LLMs with their mis-aligned responses (Fermi). Our key idea is to learn a set of personalized prompts for each user by progressively improving the prompts using LLMs, based on user profile (e.g., demographic information) and a few examples of previous opinions. During an iterative process of prompt improvement, we incorporate the contexts of mis-aligned responses by LLMs, which are especially crucial for the effective personalization of LLMs. In addition, we develop an effective inference method to further…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bbuing9/fermi
noneOfficial

Videos

Few-shot Personalization of LLMs with Mis-aligned Responses· underline

Taxonomy

TopicsNatural Language Processing Techniques

MethodsSparse Evolutionary Training