BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in   Large Language Models Personalization

Gihun Lee; Minchan Jeong; Yujin Kim; Hojung Jung; Jaehoon Oh; Sangmook; Kim; Se-Young Yun

arXiv:2407.00693·cs.AI·October 1, 2024

BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization

Gihun Lee, Minchan Jeong, Yujin Kim, Hojung Jung, Jaehoon Oh, Sangmook, Kim, Se-Young Yun

PDF

Open Access

TL;DR

This paper introduces BAPO, a method for personalizing large language models that preserves general knowledge while adapting to diverse user preferences, addressing the challenge of knowledge forgetting in personalized alignment.

Contribution

BAPO is a novel approach that uses reference model responses to prevent forgetting during personalized preference optimization in LLMs.

Findings

01

BAPO effectively balances personalization and knowledge preservation.

02

It outperforms previous methods in diverse preference adaptation.

03

BAPO maintains global knowledge with minimal loss.

Abstract

While learning to align Large Language Models (LLMs) with human preferences has shown remarkable success, aligning these models to meet the diverse user preferences presents further challenges in preserving previous knowledge. This paper examines the impact of personalized preference optimization on LLMs, revealing that the extent of knowledge loss varies significantly with preference heterogeneity. Although previous approaches have utilized the KL constraint between the reference model and the policy model, we observe that they fail to maintain general knowledge and alignment when facing personalized preferences. To this end, we introduce Base-Anchored Preference Optimization (BAPO), a simple yet effective approach that utilizes the initial responses of reference model to mitigate forgetting while accommodating personalized alignment. BAPO effectively adapts to diverse user preferences…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Recommender Systems and Techniques

MethodsALIGN