Peer-review under review - A statistical study on proposal ranking at ESO. Part I: the pre-meeting phase
Ferdinando Patat

TL;DR
This study analyzes the reproducibility of proposal rankings in ESO's peer review process, revealing moderate agreement levels among referees and providing a statistical model to estimate process consistency.
Contribution
It offers a detailed statistical analysis of proposal ranking reproducibility and introduces a heuristic model to quantify agreement levels in peer review at ESO.
Findings
Approximately one third of top-ranked proposals by one referee are similarly ranked by others.
Agreement between panels of six referees is about 55% for top and bottom quartiles.
Model predictions align with boot-strapping results and NIPS experiment data.
Abstract
Peer review is the most common mechanism in place for assessing requests for resources in a large variety of scientific disciplines. One of the strongest criticisms to this paradigm is the limited reproducibility of the process, especially at largely oversubscribed facilities. In this and in a subsequent paper we address this specific aspect in a quantitative way, through a statistical study on proposal ranking at the European Southern Observatory. For this purpose we analysed a sample of about 15000 proposals, submitted by more than 3000 Principal Investigators over 8 years. The proposals were reviewed by more than 500 referees, who assigned over 140000 grades in about 200 panel sessions. After providing a detailed analysis of the statistical properties of the sample, the paper presents an heuristic model based on these findings, which is then used to provide quantitative estimates of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
