Mallows Model with Learned Distance Metrics: Sampling and Maximum Likelihood Estimation

Yeganeh Alimohammadi; Kiana Asgari

arXiv:2507.08108·stat.ML·July 14, 2025

Mallows Model with Learned Distance Metrics: Sampling and Maximum Likelihood Estimation

Yeganeh Alimohammadi, Kiana Asgari

PDF

TL;DR

This paper introduces a generalized Mallows model that learns distance metrics from ranking data, providing efficient sampling and MLE algorithms with strong theoretical guarantees and empirical validation.

Contribution

It develops a novel Mallows model with learnable $L_eta$ distance metrics, along with efficient sampling and MLE algorithms, extending prior fixed-distance approaches.

Findings

01

Efficient FPTAS for sampling from the generalized Mallows model.

02

Consistent MLE estimators for central ranking, dispersion, and distance metric.

03

Empirical validation on sports ranking datasets.

Abstract

\textit{Mallows model} is a widely-used probabilistic framework for learning from ranking data, with applications ranging from recommendation systems and voting to aligning language models with human preferences~\cite{chen2024mallows, kleinberg2021algorithmic, rafailov2024direct}. Under this model, observed rankings are noisy perturbations of a central ranking $σ$ , with likelihood decaying exponentially in distance from $σ$ , i.e, $P (π) \propto exp (- β \cdot d (π, σ)),$ where $β > 0$ controls dispersion and $d$ is a distance function. Existing methods mainly focus on fixed distances (such as Kendall's $τ$ distance), with no principled approach to learning the distance metric directly from data. In practice, however, rankings naturally vary by context; for instance, in some sports we regularly see long-range swaps (a low-rank team beating a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.