Statistical Models for Repeated Categorical Ratings: The R Package rater
Jeffrey M. Pullin, Lyle C. Gurrin, Damjan Vukcevic

TL;DR
The paper introduces an R package 'rater' that uses Bayesian models to analyze repeated categorical ratings, estimating true item classes and rater accuracy, with implementations in Stan.
Contribution
It provides the first comprehensive Bayesian R package for analyzing repeated categorical ratings, extending the Dawid-Skene model with detailed inference techniques.
Findings
Effective estimation of true item classes and rater accuracy.
Implementation of Bayesian models in Stan for flexible analysis.
Illustrative examples demonstrating the package's utility.
Abstract
A common problem in many disciplines is the need to assign a set of items into categories or classes with known labels. This is often done by one or more expert raters, or sometimes by an automated process. If these assignments or `ratings' are difficult to make accurately, a common tactic is to repeat them by different raters, or even by the same rater multiple times on different occasions. We present an R package `rater`, available on CRAN, that implements Bayesian versions of several statistical models for analysis of repeated categorical rating data. Inference is possible for the true underlying (latent) class of each item, as well as the accuracy of each rater. The models are extensions of, and include, the Dawid-Skene model, and we implemented them using the Stan probabilistic programming language. We illustrate the use of `rater` through a few examples. We also discuss in detail…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference
