Recall, Risk, and Governance in Automated Proposal Screening for Research Funding: Evidence from a National Funding Programme
Chandan G. Nagarajappa, Moumita Koley, Avinash Kumar, Rabindra Panigrahy, Pramod Kumar Arya

TL;DR
This study empirically compares rule-based and LLM-based automated proposal screening methods in a national research funding context, highlighting the importance of error profiles and institutional suitability for high-stakes decisions.
Contribution
It provides the first empirical comparison of automated screening approaches against committee decisions, emphasizing error asymmetry and institutional context in AI tool evaluation.
Findings
TF-IDF approach outperforms LLM in recall and false negatives
LLM-based system excludes more proposals, risking irrecoverable errors
Error profile and transparency are crucial for AI suitability in funding decisions
Abstract
Research funding agencies are increasingly exploring automated tools to support early-stage proposal screening. Recent advances in large language models (LLMs) have generated optimism regarding their use for text-based evaluation, yet their institutional suitability for high-stakes screening decisions remains underexplored. In particular, there is limited empirical evidence on how automated screening systems perform when evaluated against institutional error costs. This study compares two automated approaches for proposal screening against the priorities of a national funding call: A transparent, rule-based method using term frequency-inverse document frequency (TF-IDF) with domain-specific keyword engineering, and a semantic classification approach based on a large language model. Using selection committee decisions as ground truth for 959 proposals, we evaluate performance with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsscientometrics and bibliometrics research · Scientific Computing and Data Management · Computational and Text Analysis Methods
