Recall, Risk, and Governance in Automated Proposal Screening for Research Funding: Evidence from a National Funding Programme

Chandan G. Nagarajappa; Moumita Koley; Avinash Kumar; Rabindra Panigrahy; Pramod Kumar Arya

arXiv:2602.07869·cs.DL·February 10, 2026

Recall, Risk, and Governance in Automated Proposal Screening for Research Funding: Evidence from a National Funding Programme

Chandan G. Nagarajappa, Moumita Koley, Avinash Kumar, Rabindra Panigrahy, Pramod Kumar Arya

PDF

Open Access

TL;DR

This study empirically compares rule-based and LLM-based automated proposal screening methods in a national research funding context, highlighting the importance of error profiles and institutional suitability for high-stakes decisions.

Contribution

It provides the first empirical comparison of automated screening approaches against committee decisions, emphasizing error asymmetry and institutional context in AI tool evaluation.

Findings

01

TF-IDF approach outperforms LLM in recall and false negatives

02

LLM-based system excludes more proposals, risking irrecoverable errors

03

Error profile and transparency are crucial for AI suitability in funding decisions

Abstract

Research funding agencies are increasingly exploring automated tools to support early-stage proposal screening. Recent advances in large language models (LLMs) have generated optimism regarding their use for text-based evaluation, yet their institutional suitability for high-stakes screening decisions remains underexplored. In particular, there is limited empirical evidence on how automated screening systems perform when evaluated against institutional error costs. This study compares two automated approaches for proposal screening against the priorities of a national funding call: A transparent, rule-based method using term frequency-inverse document frequency (TF-IDF) with domain-specific keyword engineering, and a semantic classification approach based on a large language model. Using selection committee decisions as ground truth for 959 proposals, we evaluate performance with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topicsscientometrics and bibliometrics research · Scientific Computing and Data Management · Computational and Text Analysis Methods