Persona-DB: Efficient Large Language Model Personalization for Response   Prediction with Collaborative Data Refinement

Chenkai Sun; Ke Yang; Revanth Gangi Reddy; Yi R. Fung; Hou Pong Chan,; Kevin Small; ChengXiang Zhai; Heng Ji

arXiv:2402.11060·cs.CL·February 4, 2025·1 cites

Persona-DB: Efficient Large Language Model Personalization for Response Prediction with Collaborative Data Refinement

Chenkai Sun, Ke Yang, Revanth Gangi Reddy, Yi R. Fung, Hou Pong Chan,, Kevin Small, ChengXiang Zhai, Heng Ji

PDF

Open Access

TL;DR

Persona-DB introduces a novel framework for efficient LLM personalization by optimizing data representation and collaborative refinement, significantly improving response accuracy with less data and in cold-start scenarios.

Contribution

It proposes a hierarchical and collaborative data refinement approach to enhance retrieval efficiency and personalization in large language models.

Findings

01

Superior context efficiency with reduced retrieval size

02

Over 10% improvement in cold-start scenarios

03

Enhanced importance of collaborative knowledge with larger retrieval capacity

Abstract

The increasing demand for personalized interactions with large language models (LLMs) calls for methodologies capable of accurately and efficiently identifying user opinions and preferences. Retrieval augmentation emerges as an effective strategy, as it can accommodate a vast number of users without the costs from fine-tuning. Existing research, however, has largely focused on enhancing the retrieval stage and devoted limited exploration toward optimizing the representation of the database, a crucial aspect for tasks such as personalization. In this work, we examine the problem from a novel angle, focusing on how data can be better represented for more data-efficient retrieval in the context of LLM customization. To tackle this challenge, we introduce Persona-DB, a simple yet effective framework consisting of a hierarchical construction process to improve generalization across task…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPersona Design and Applications · Topic Modeling