Multivariate Stochastic Dominance via Optimal Transport and Applications to Models Benchmarking
Gabriel Rioux, Apoorva Nitsure, Mattia Rigotti, Kristjan, Greenewald, Youssef Mroueh

TL;DR
This paper introduces a novel statistical method based on optimal transport to assess multivariate stochastic dominance, enabling robust comparison of models across multiple metrics with proven theoretical guarantees.
Contribution
It develops a new multivariate stochastic dominance test using optimal transport, including an entropic regularization and a bootstrap-based hypothesis testing framework.
Findings
The method effectively compares large language models across multiple metrics.
The proposed test captures dependencies between metrics for more informed decisions.
The approach is computationally efficient with Sinkhorn algorithm implementation.
Abstract
Stochastic dominance is an important concept in probability theory, econometrics and social choice theory for robustly modeling agents' preferences between random outcomes. While many works have been dedicated to the univariate case, little has been done in the multivariate scenario, wherein an agent has to decide between different multivariate outcomes. By exploiting a characterization of multivariate first stochastic dominance in terms of couplings, we introduce a statistic that assesses multivariate almost stochastic dominance under the framework of Optimal Transport with a smooth cost. Further, we introduce an entropic regularization of this statistic, and establish a central limit theorem (CLT) and consistency of the bootstrap procedure for the empirical statistic. Armed with this CLT, we propose a hypothesis testing framework as well as an efficient implementation using the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsEuropean Monetary and Fiscal Policies · Economic theories and models · Game Theory and Voting Systems
