# Assessing the potential of a Bayesian ranking as an alternative to consensus meetings for decision making in research funding: A case study of Marie Skłodowska-Curie actions

**Authors:** Rachel Heyard, David G. Pina, Ivan Buljan, Ana Marušić, Annesha Sil, Irfan Ullah, Sergio Useche

PMC · DOI: 10.1371/journal.pone.0317772 · PLOS One · 2025-03-24

## TL;DR

This study explores whether a Bayesian ranking algorithm can replace consensus meetings in funding decisions, using data from Marie Skłodowska-Curie Actions.

## Contribution

The paper evaluates a Bayesian hierarchical model, fitted to the individual evaluation scores given before the consensus meeting, as a potential alternative to traditional consensus meetings in grant funding decisions.

## Key findings

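- Scores predicted by the Bayesian hierarchical model showed large discrepancies from the scores in the consensus reports.
- The discrepancies were less pronounced once scores were aggregated into funding rankings or decisions, and agreement was best for funding schemes with very low success rates.
- Individual scores assigned before consensus meetings are currently not useful for predicting final funding outcomes.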

## Abstract

Funding agencies rely on panel or consensus meetings to summarise individual evaluations of grant proposals into a final ranking. However, previous research has shown inconsistency in decisions and inefficiency of consensus meetings. Using data from the Marie Skłodowska-Curie Actions, we investigated the differences between an algorithmic approach to summarising the individual evaluations of grant proposals and the decisions reached after consensus meetings, and we present an exploratory comparative analysis. The algorithmic approach employed was a Bayesian hierarchical model resulting in a Bayesian ranking of the proposals using the individual evaluation reports submitted prior to the consensus meeting. Parameters from the Bayesian hierarchical model and the subsequent ranking were compared to the scores, ranking and decisions established in the consensus meeting reports. The results from the evaluation of 1,006 proposals submitted to three panels (Life Science, Mathematics, Social Sciences and Humanities) in two call years (2015 and 2019) were investigated in detail. Overall, we found large discrepancies between the consensus reports and the scores a Bayesian hierarchical model would have predicted. The discrepancies were less pronounced when the scores were aggregated into funding rankings or decisions. The best agreement between the final funding rankings was observed for funding schemes with very low success rates. While we set out to understand whether algorithmic approaches that summarise individual evaluation scores could replace consensus meetings, we concluded that currently individual scores assigned prior to the consensus meetings are not useful for predicting the final funding outcomes of the proposals. Following our results, we suggest using individual evaluations for triage, so that the weakest proposals are not discussed in panel or consensus meetings. This would allow a more nuanced evaluation of a smaller set of proposals and help minimise the uncertainty and biases when allocating funding.
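
To make the idea of ranking proposals from individual scores more concrete, the following is a minimal sketch and not the authors' model. It assumes a simple normal-normal partial-pooling scheme with plug-in (empirical Bayes) variance estimates as a stand-in for a full Bayesian hierarchical model. The function names `shrunken_ranking` and `triage`, the example scores, and the `keep_fraction` value are all hypothetical and chosen only for illustration.

```python
import math
from statistics import mean


def shrunken_ranking(scores):
    """Rank proposals by a partially pooled estimate of their quality.

    scores: dict mapping proposal id -> list of individual reviewer scores.
    Assumed model (illustrative only): score ~ N(theta_p, sigma2),
    theta_p ~ N(mu, tau2), with mu, sigma2, tau2 replaced by plug-in estimates.
    """
    all_scores = [s for v in scores.values() for s in v]
    mu = mean(all_scores)  # grand mean over all individual scores
    means = {p: mean(v) for p, v in scores.items()}

    # Pooled within-proposal variance (disagreement between reviewers).
    ss_within = sum((s - means[p]) ** 2 for p, v in scores.items() for s in v)
    df_within = sum(len(v) - 1 for v in scores.values())
    sigma2 = max(ss_within / df_within if df_within > 0 else 1.0, 1e-6)

    # Between-proposal variance by a rough method-of-moments estimate.
    n_bar = mean(len(v) for v in scores.values())
    var_means = sum((m - mu) ** 2 for m in means.values()) / max(len(means) - 1, 1)
    tau2 = max(var_means - sigma2 / n_bar, 1e-6)

    # Posterior mean: precision-weighted average of proposal mean and grand mean.
    post = {}
    for p, v in scores.items():
        w_data = len(v) / sigma2
        w_prior = 1.0 / tau2
        post[p] = (w_data * means[p] + w_prior * mu) / (w_data + w_prior)

    ranking = sorted(post, key=post.get, reverse=True)
    return ranking, post


def triage(ranking, keep_fraction):
    """Keep the top share of the ranking for discussion (threshold is arbitrary)."""
    n_keep = math.ceil(len(ranking) * keep_fraction)
    return ranking[:n_keep]


# Hypothetical scores from individual evaluation reports.
example_scores = {
    "A": [90, 92, 94],
    "B": [95],
    "C": [70, 75, 72],
    "D": [85, 80, 88],
}
ranking, post = shrunken_ranking(example_scores)
shortlist = triage(ranking, keep_fraction=0.5)
print(ranking, shortlist)
```

In this sketch, proposals with fewer or more divergent reviews are pulled more strongly towards the overall mean, which is the general behaviour one expects from hierarchical pooling. The `triage` helper mirrors the suggestion to leave the weakest proposals out of the panel discussion; the cut-off it uses is an assumption, not a value from the paper.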

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11970724/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11970724/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/PMC11970724/full.md

---
Source: https://tomesphere.com/paper/PMC11970724