Revealing subgroup structure in ranked data using a Bayesian WAND
Stephen R. Johnson, Daniel A. Henderson, Richard J. Boys

TL;DR
This paper introduces a Bayesian nonparametric model called WAND for analyzing ranked data, effectively revealing subgroup structures and assessing ranker reliability, especially when data are incomplete or heterogeneous.
Contribution
It proposes a novel WAND process mixture of Plackett-Luce models that captures subgroup heterogeneity and ranker differences with efficient inference and detailed posterior insights.
Findings
Successfully identifies subgroup structures in simulated data.
Effectively assesses ranker reliability and entity exchangeability.
Provides a flexible framework for incomplete and heterogeneous ranked data.
Abstract
Ranked data arise in many areas of application ranging from the ranking of up-regulated genes for cancer to the ranking of academic statistics journals. Complications can arise when rankers do not report a full ranking of all entities; for example, they might only report their top-- ranked entities after seeing some or all entities. It can also be useful to know whether rankers are equally informative, and whether some entities are effectively judged to be exchangeable. When there is important subgroup structure in the data, summaries such as aggregate (overall) rankings can be misleading. In this paper we propose a flexible Bayesian nonparametric model for identifying heterogeneous structure and ranker reliability in ranked data. The model is a Weighted Adapted Nested Dirichlet (WAND) process mixture of Plackett-Luce models and inference proceeds through a simple and efficient Gibbs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
