A Supervised Machine Learning Approach for Assessing Grant Peer Review Reports
Gabriel Okasa, Alberto de Le\'on, Michaela Strinzel, Anne Jorstad,, Katrin Milzow, Matthias Egger, Stefan M\"uller

TL;DR
This paper presents a machine learning pipeline using NLP and transformer models to analyze and categorize grant peer review reports, aiming to improve transparency and quality in grant evaluation processes.
Contribution
The study introduces a novel annotated corpus and fine-tuned transformer models for classifying peer review report content at scale, validated through robustness checks.
Findings
Many categories can be reliably classified by humans and machines.
Classification performance varies significantly with the approach used.
High difficulty in identifying certain categories correlates with lower classification accuracy.
Abstract
Peer review in grant evaluation informs funding decisions, but the contents of peer review reports are rarely analyzed. In this work, we develop a thoroughly tested pipeline to analyze the texts of grant peer review reports using methods from applied Natural Language Processing (NLP) and machine learning. We start by developing twelve categories reflecting content of grant peer review reports that are of interest to research funders. This is followed by multiple human annotators' iterative annotation of these categories in a novel text corpus of grant peer review reports submitted to the Swiss National Science Foundation. After validating the human annotation, we use the annotated texts to fine-tune pre-trained transformer models to classify these categories at scale, while conducting several robustness and validation checks. Our results show that many categories can be reliably…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗snsf-data/specter2-review-track-recordmodel· 13 dl13 dl
- 🤗snsf-data/specter2-review-relevance-originality-topicalitymodel· 2 dl2 dl
- 🤗snsf-data/specter2-review-suitabilitymodel· 10 dl10 dl
- 🤗snsf-data/specter2-review-feasibilitymodel· 4 dl4 dl
- 🤗snsf-data/specter2-review-applicantmodel· 11 dl11 dl
- 🤗snsf-data/specter2-review-applicant-quantitymodel· 13 dl13 dl
- 🤗snsf-data/specter2-review-proposalmodel· 4 dl4 dl
- 🤗snsf-data/specter2-review-methodmodel· 4 dl4 dl
- 🤗snsf-data/specter2-review-positivemodel· 12 dl12 dl
- 🤗snsf-data/specter2-review-negativemodel· 11 dl11 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsscientometrics and bibliometrics research
