Overall Agreement for Multiple Raters with Replicated Measurements
Tongrong Wang, Huiman X. Barnhart

TL;DR
This paper introduces new nonparametric agreement indices for multiple raters that preserve intuitive interpretation and accommodate replicated measurements, improving assessment of inter-rater reliability without distributional assumptions.
Contribution
It proposes a novel set of overall agreement indices based on maximum pairwise differences and a unified GEE-based estimation method that handles replications and maintains interpretability.
Findings
Proposed indices retain intuitive interpretation from pairwise agreement.
The GEE-based estimator achieves efficiency bounds under mild conditions.
Simulation studies demonstrate the method's effectiveness across scenarios.
Abstract
Multiple raters are often needed to be used interchangeably in practice for measurement or evaluation. Assessing agreement among these multiple raters via agreement indices are necessary before their participation. While the intuitively appealing agreement indices such as coverage probability and total deviation index, and relative area under coverage probability curve, have been extended for assessing overall agreement among multiple raters, these extensions have limitations. The existing overall agreement indices either require normality and homogeneity assumptions or did not preserve the intuitive interpretation of the indices originally defined for two raters. In this paper, we propose a new set of overall agreement indices based on maximum pairwise differences among all raters. The proposed new overall coverage probability, overall total deviation index and relative area under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReliability and Agreement in Measurement · Statistical Methods and Bayesian Inference · Hemodynamic Monitoring and Therapy
