Few-shot Personalization of LLMs with Mis-aligned Responses
Jaehyung Kim, Yiming Yang

TL;DR
This paper introduces Fermi, a novel few-shot personalization method for large language models that uses mis-aligned responses and user profiles to iteratively improve personalized prompts, enhancing response quality.
Contribution
Fermi is the first approach to leverage mis-aligned responses and user profiles for iterative prompt personalization in LLMs, improving personalization effectiveness.
Findings
Fermi significantly outperforms baseline methods on various benchmarks.
Iterative prompt refinement improves personalization accuracy.
Using mis-aligned responses enhances the learning of personalized prompts.
Abstract
As the diversity of users increases, the capability of providing personalized responses by large language models (LLMs) has become increasingly important. Existing approaches have only limited successes in LLM personalization, due to the absence of personalized learning or the reliance on shared personal data. This paper proposes a new approach for a few-shot personalization of LLMs with their mis-aligned responses (Fermi). Our key idea is to learn a set of personalized prompts for each user by progressively improving the prompts using LLMs, based on user profile (e.g., demographic information) and a few examples of previous opinions. During an iterative process of prompt improvement, we incorporate the contexts of mis-aligned responses by LLMs, which are especially crucial for the effective personalization of LLMs. In addition, we develop an effective inference method to further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques
MethodsSparse Evolutionary Training
