Personalizing LLMs with Binary Feedback: A Preference-Corrected Optimization Framework
Xilai Ma, Liye Zhao, Weijun Yao, Haibing Di, Wenya Wang, Jing Li

TL;DR
This paper introduces C-BPO, a novel framework for personalizing large language models using preference-calibrated binary signals, effectively capturing inter-user differences and improving personalization accuracy.
Contribution
The paper presents a new PU-learning-based objective for LLM personalization that leverages binary feedback to distinguish user-specific preferences from shared knowledge.
Findings
C-BPO outperforms existing personalization methods across multiple tasks.
Preference calibration reduces negative bias and improves user alignment.
Empirical results demonstrate the effectiveness of binary signals in modeling inter-user differences.
Abstract
Large Language Model (LLM) personalization aims to align model behaviors with individual user preferences. Existing methods often focus on isolated user histories, neglecting the essential role of inter-user differences. We propose C-BPO, a framework that personalizes LLMs via preference-calibrated binary signals. By treating target user data as positive feedback and other users' data as an auxiliary set of implicit negative signals, C-BPO captures distinct inter-user differences. To mitigate the preference overlap issue, where shared task knowledge is erroneously penalized, we derive an objective grounded in Positive-Unlabeled (PU) learning theory. This approach purifies negative signals by subtracting ``positive bias'', ensuring alignment with unique idiosyncrasies without compromising general helpfulness. Empirical experiments across various personalization tasks and backbone LLMs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
