How should we aggregate ratings? Accounting for personal rating scales via Wasserstein barycenters

Daniel Raban

arXiv:2410.00865·math.ST·November 19, 2025

How should we aggregate ratings? Accounting for personal rating scales via Wasserstein barycenters

Daniel Raban

PDF

Open Access 1 Repo

TL;DR

This paper introduces a Wasserstein barycenter-based method for aggregating individual ratings that accounts for personal rating scales, demonstrating its consistency and optimal convergence properties over traditional averaging.

Contribution

It proposes a novel non-parametric model using optimal transport to aggregate ratings, improving accuracy over simple averages and generalizing Kendall's W to numerical ratings.

Findings

01

Standard averaging is inconsistent for heterogeneous ratings.

02

Wasserstein barycenter estimator is asymptotically consistent.

03

The method achieves optimal convergence rates.

Abstract

A common method of comparing items is to collect numerical ratings on a linear scale and compare the average rating for each item. However, averaging ratings does not account for people rating according to differing personal rating scales. With this in mind, we investigate the problem of calculating aggregate numerical ratings from individual numerical ratings and propose a new, non-parametric model for the problem. We show that, with minimal modeling assumptions, the standard average is inconsistent for estimating the quality of items. Analyzing the problem of heterogeneous personal rating scales from the perspective of optimal transport, we derive an alternative rating estimator, which we show is asymptotically consistent almost surely and in L^p for estimating quality, with an optimal rate of convergence. Further, we generalize Kendall's W, a non-parametric coefficient of preference…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pillowmath/rating-estimator
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCredit Risk and Financial Regulations