Dimensionality Reduction and (Bucket) Ranking: a Mass Transportation Approach
Mastane Achab, Anna Korba, Stephan Cl\'emen\c{c}on

TL;DR
This paper introduces a novel mass transportation-based framework for reducing parameters in ranking data by partitioning items into buckets, enabling sparse representation of permutation distributions with theoretical guarantees and empirical validation.
Contribution
It develops an original mass transportation approach for bucket-based ranking data reduction, including distortion measures, rate bounds, and complexity penalization techniques.
Findings
The proposed method effectively summarizes ranking distributions with high accuracy.
Theoretical analysis provides bounds on the distortion and performance.
Numerical experiments demonstrate practical relevance on real data.
Abstract
Whereas most dimensionality reduction techniques (e.g. PCA, ICA, NMF) for multivariate data essentially rely on linear algebra to a certain extent, summarizing ranking data, viewed as realizations of a random permutation on a set of items indexed by , is a great statistical challenge, due to the absence of vector space structure for the set of permutations . It is the goal of this article to develop an original framework for possibly reducing the number of parameters required to describe the distribution of a statistical population composed of rankings/permutations, on the premise that the collection of items under study can be partitioned into subsets/buckets, such that, with high probability, items in a certain bucket are either all ranked higher or else all ranked lower than items in another bucket. In this context, 's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Voting Systems · Random Matrices and Applications · Rough Sets and Fuzzy Logic
MethodsIndependent Component Analysis · Principal Components Analysis
