TL;DR
NextQuill introduces a causal preference modeling framework for LLM personalization, focusing on aligning model predictions with true user preferences through causal interventions, leading to improved personalization quality.
Contribution
It presents a novel causal perspective for LLM personalization, emphasizing alignment of causal preference effects rather than superficial data fitting.
Findings
Significantly improves personalization quality across multiple benchmarks.
Provides a causal foundation for more effective LLM adaptation.
Demonstrates the effectiveness of preference-based token focusing.
Abstract
Personalizing large language models (LLMs) for individual users has become increasingly important as they are progressively integrated into real-world applications to support users' daily lives. However, existing personalization approaches often fail to distinguish which components of model predictions and training data truly reflect user preferences, leading to superficial personalization alignment. In this paper, we introduce NextQuill, a novel LLM personalization alignment framework grounded in causal preference modeling. We approach personalization from a causal perspective, treating both model predictions and ground-truth data generation as outcomes influenced by user preferences, along with other factors. We define the true preference effect as the causal impact of user history (which reflects preferences) on each token prediction or data generation instance, estimated through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
