Tuning-Free Personalized Alignment via Trial-Error-Explain In-Context   Learning

Hyundong Cho; Karishma Sharma; Nicolaas Jedema; Leonardo F. R.; Ribeiro; Alessandro Moschitti; Ravi Krishnan; Jonathan May

arXiv:2502.08972·cs.CL·April 8, 2025

Tuning-Free Personalized Alignment via Trial-Error-Explain In-Context Learning

Hyundong Cho, Karishma Sharma, Nicolaas Jedema, Leonardo F. R., Ribeiro, Alessandro Moschitti, Ravi Krishnan, Jonathan May

PDF

Open Access 1 Video

TL;DR

TICL is a tuning-free method that personalizes language models using a trial-error-explain process with minimal examples, significantly improving stylistic alignment in text generation tasks.

Contribution

It introduces TICL, a novel tuning-free approach that enhances personalized language model alignment through iterative prompt expansion with explanations and negative samples.

Findings

01

Achieves up to 91.5% win rate against state-of-the-art methods.

02

Outperforms tuning-free baselines in email, essay, and news article tasks.

03

Effectively learns user-specific styles with fewer than 10 examples.

Abstract

Language models are aligned to the collective voice of many, resulting in generic outputs that do not align with specific users' styles. In this work, we present Trial-Error-Explain In-Context Learning (TICL), a tuning-free method that personalizes language models for text generation tasks with fewer than 10 examples per user. TICL iteratively expands an in-context learning prompt via a trial-error-explain process, adding model-generated negative samples and explanations that provide fine-grained guidance towards a specific user's style. TICL achieves favorable win rates on pairwise comparisons with LLM-as-a-judge up to 91.5% against the previous state-of-the-art and outperforms competitive tuning-free baselines for personalized alignment tasks of writing emails, essays and news articles. Both lexical and qualitative analyses show that the negative samples and explanations enable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Tuning-Free Personalized Alignment via Trial-Error-Explain In-Context Learning· underline

Taxonomy

TopicsMachine Learning in Healthcare · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis

MethodsALIGN