Preference Models assume Proportional Hazards of Utilities

Chirag Nagpal

arXiv:2508.13189·stat.ML·August 20, 2025

Preference Models assume Proportional Hazards of Utilities

Chirag Nagpal

PDF

TL;DR

This paper links the Plackett-Luce preference model to the Cox Proportional Hazards model, providing new insights into preference estimation methods used in AI alignment tools.

Contribution

It establishes a novel connection between preference modeling and survival analysis, illuminating underlying assumptions in current AI preference estimation techniques.

Findings

01

Connects Plackett-Luce to Cox Proportional Hazards model

02

Provides insights into preference estimation assumptions

03

Suggests implications for AI alignment methods

Abstract

Approaches for estimating preferences from human annotated data typically involves inducing a distribution over a ranked list of choices such as the Plackett-Luce model. Indeed, modern AI alignment tools such as Reward Modelling and Direct Preference Optimization are based on the statistical assumptions posed by the Plackett-Luce model. In this paper, I will connect the Plackett-Luce model to another classical and well known statistical model, the Cox Proportional Hazards model and attempt to shed some light on the implications of the connection therein.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.