EXACT: Explicit Attribute-Guided Decoding-Time Personalization
Xin Yu, Hanwen Xing, Lingzhou Xue

TL;DR
EXACT introduces a decoding-time personalization method for large language models that uses interpretable attributes and preference feedback to adapt responses to individual users across evolving contexts.
Contribution
It proposes a novel attribute-guided decoding approach with theoretical guarantees and effective retrieval mechanisms for personalized language generation.
Findings
Outperforms baseline methods in preference accuracy.
Effectively adapts to shifting user preferences.
Provides theoretical approximation guarantees.
Abstract
Achieving personalized alignment requires adapting large language models to each user's evolving context. While decoding-time personalization offers a scalable alternative to training-time methods, existing methods largely rely on implicit, less interpretable preference representations and impose a rigid, context-agnostic user representation, failing to account for how preferences shift across prompts. We introduce EXACT, a new decoding-time personalization that aligns generation with limited pairwise preference feedback using a predefined set of interpretable attributes. EXACT first identifies user-specific attribute subsets by maximizing the likelihood of preferred responses in the offline stage. Then, for online inference, EXACT retrieves the most semantically relevant attributes for an incoming prompt and injects them into the context to steer generation. We establish theoretical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Recommender Systems and Techniques · Natural Language Processing Techniques
