Beyond Ordinal Preferences: Why Alignment Needs Cardinal Human Feedback
Parker Whitfill, Stewy Slocum

TL;DR
This paper demonstrates that using only ordinal human preferences is fundamentally limited for aligning language models, and shows that collecting cardinal feedback significantly improves model alignment and performance.
Contribution
The paper proves the limitations of ordinal preferences for model alignment and introduces a new dataset of cardinal judgments to enhance fine-tuning methods.
Findings
Cardinal feedback enables better tradeoff resolution in model alignment.
Models fine-tuned with cardinal data outperform ordinal-only methods on benchmarks.
Collected 25,000 cardinal judgments using willingness-to-pay elicitation.
Abstract
Alignment techniques for LLMs rely on optimizing preference-based objectives -- where these preferences are typically elicited as ordinal, binary choices between responses. Recent work has focused on improving label quality or mitigating particular biases, but we identify a more fundamental limitation: these methods collect the wrong kind of data. We prove an impossibility result: no algorithm relying solely on ordinal comparisons can systematically recover the most preferred model. Intuitively, ordinal data lacks the information needed to resolve tradeoffs -- e.g., fixing a factual error on one prompt versus improving style on another. We show that selecting the optimal model requires recovering preferences over \emph{models} (rather than just responses), which can only be identified given cardinal feedback about response quality. To address this, we collect and publicly release a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Ethics and Social Impacts of AI
