Tuning-Free Personalized Alignment via Trial-Error-Explain In-Context Learning
Hyundong Cho, Karishma Sharma, Nicolaas Jedema, Leonardo F. R., Ribeiro, Alessandro Moschitti, Ravi Krishnan, Jonathan May

TL;DR
TICL is a tuning-free method that personalizes language models using a trial-error-explain process with minimal examples, significantly improving stylistic alignment in text generation tasks.
Contribution
It introduces TICL, a novel tuning-free approach that enhances personalized language model alignment through iterative prompt expansion with explanations and negative samples.
Findings
Achieves up to 91.5% win rate against state-of-the-art methods.
Outperforms tuning-free baselines in email, essay, and news article tasks.
Effectively learns user-specific styles with fewer than 10 examples.
Abstract
Language models are aligned to the collective voice of many, resulting in generic outputs that do not align with specific users' styles. In this work, we present Trial-Error-Explain In-Context Learning (TICL), a tuning-free method that personalizes language models for text generation tasks with fewer than 10 examples per user. TICL iteratively expands an in-context learning prompt via a trial-error-explain process, adding model-generated negative samples and explanations that provide fine-grained guidance towards a specific user's style. TICL achieves favorable win rates on pairwise comparisons with LLM-as-a-judge up to 91.5% against the previous state-of-the-art and outperforms competitive tuning-free baselines for personalized alignment tasks of writing emails, essays and news articles. Both lexical and qualitative analyses show that the negative samples and explanations enable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning in Healthcare · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis
MethodsALIGN
