KTO: Model Alignment as Prospect Theoretic Optimization
Kawin Ethayarajh, Winnie Xu, Niklas Muennighoff, Dan Jurafsky, Douwe, Kiela

TL;DR
This paper introduces KTO, a new model alignment method based on prospect theory, which directly maximizes human utility and outperforms existing preference-based methods across various scales.
Contribution
It proposes a novel human-aware loss function grounded in prospect theory and demonstrates its effectiveness in aligning large language models with human preferences.
Findings
KTO matches or exceeds performance of preference-based methods.
KTO works effectively from 1B to 30B scale models.
Different loss functions suit different settings depending on inductive biases.
Abstract
Kahneman & Tversky's tells us that humans perceive random variables in a biased but well-defined manner (1992); for example, humans are famously loss-averse. We show that objectives for aligning LLMs with human feedback implicitly incorporate many of these biases -- the success of these objectives (e.g., DPO) over cross-entropy minimization can partly be ascribed to them belonging to a family of loss functions that we call (HALOs). However, the utility functions these methods attribute to humans still differ from those in the prospect theory literature. Using a Kahneman-Tversky model of human utility, we propose a HALO that directly maximizes the utility of generations instead of maximizing the log-likelihood of preferences, as current methods do. We call this approach KTO, and it matches or exceeds the performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗ContextualAI/Contextual_KTO_Mistral_PairRMmodel· 122 dl· ♡ 32122 dl♡ 32
- 🤗asedmammad/Contextual_KTO_Mistral_PairRM-GGUFmodel· 222 dl· ♡ 2222 dl♡ 2
- 🤗openbmb/Eurus-7b-ktomodel· 39 dl· ♡ 1339 dl♡ 13
- 🤗SnakyMcSnekFace/Psyfighter2-13B-voremodel· 24 dl· ♡ 324 dl♡ 3
- 🤗openbmb/Eurux-8x22b-ktomodel· 23 dl· ♡ 823 dl♡ 8
- 🤗GritLM/GritLM-7B-KTOmodel· 8.1k dl· ♡ 48.1k dl♡ 4
- 🤗GritLM/GritLM-8x7B-KTOmodel· 8.1k dl· ♡ 38.1k dl♡ 3
- 🤗QuantFactory/Eurus-7b-kto-GGUFmodel· 126 dl126 dl
- 🤗sunatte/txt2sqlmodel
- 🤗RichardErkhov/GritLM_-_GritLM-8x7B-KTO-ggufmodel· 73 dl73 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGlobal trade and economics
