Mallows Model with Learned Distance Metrics: Sampling and Maximum Likelihood Estimation
Yeganeh Alimohammadi, Kiana Asgari

TL;DR
This paper introduces a generalized Mallows model that learns distance metrics from ranking data, providing efficient sampling and MLE algorithms with strong theoretical guarantees and empirical validation.
Contribution
It develops a novel Mallows model with learnable $L_eta$ distance metrics, along with efficient sampling and MLE algorithms, extending prior fixed-distance approaches.
Findings
Efficient FPTAS for sampling from the generalized Mallows model.
Consistent MLE estimators for central ranking, dispersion, and distance metric.
Empirical validation on sports ranking datasets.
Abstract
\textit{Mallows model} is a widely-used probabilistic framework for learning from ranking data, with applications ranging from recommendation systems and voting to aligning language models with human preferences~\cite{chen2024mallows, kleinberg2021algorithmic, rafailov2024direct}. Under this model, observed rankings are noisy perturbations of a central ranking , with likelihood decaying exponentially in distance from , i.e, where controls dispersion and is a distance function. Existing methods mainly focus on fixed distances (such as Kendall's distance), with no principled approach to learning the distance metric directly from data. In practice, however, rankings naturally vary by context; for instance, in some sports we regularly see long-range swaps (a low-rank team beating a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
