Statistical ranking and combinatorial Hodge theory
Xiaoye Jiang, Lek-Heng Lim, Yuan Yao, Yinyu Ye

TL;DR
This paper introduces a novel approach using combinatorial Hodge theory and the graph Helmholtzian to derive global rankings from incomplete and imbalanced pairwise data, providing a computationally efficient alternative to NP-hard methods.
Contribution
It applies combinatorial Hodge theory to ranking problems, decomposing pairwise rankings into meaningful components and enabling linear least squares solutions for global ranking.
Findings
Decomposition of ranking data into gradient and divergence-free flows.
Efficient linear least squares method for global ranking.
Analysis of local and global inconsistency in rankings.
Abstract
We propose a number of techniques for obtaining a global ranking from data that may be incomplete and imbalanced -- characteristics almost universal to modern datasets coming from e-commerce and internet applications. We are primarily interested in score or rating-based cardinal data. From raw ranking data, we construct pairwise rankings, represented as edge flows on an appropriate graph. Our statistical ranking method uses the graph Helmholtzian, the graph theoretic analogue of the Helmholtz operator or vector Laplacian, in much the same way the graph Laplacian is an analogue of the Laplace operator or scalar Laplacian. We study the graph Helmholtzian using combinatorial Hodge theory: we show that every edge flow representing pairwise ranking can be resolved into two orthogonal components, a gradient flow that represents the L2-optimal global ranking and a divergence-free flow (cyclic)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Voting Systems · Data Management and Algorithms · Topological and Geometric Data Analysis
