Automated Risk-of-Bias Assessment of Randomized Controlled Trials: A First Look at a GEPA-trained Programmatic Prompting Framework
Lingbo Li, Anuradha Mathrani, Teo Susnjak

TL;DR
This paper presents a structured, code-based framework using GEPA for automating risk-of-bias assessments in RCTs with large language models, improving reproducibility and performance over manual prompts.
Contribution
Introduces a programmable, transparent pipeline with GEPA for optimizing LLM prompts in risk-of-bias assessment, surpassing manual prompt performance and enhancing reproducibility.
Findings
GEPA-generated prompts outperform manual prompts in accuracy.
Performance improved by 30-40% in key domains.
Method is applicable to both open and commercial LLMs.
Abstract
Assessing risk of bias (RoB) in randomized controlled trials is essential for trustworthy evidence synthesis, but the process is resource-intensive and prone to variability across reviewers. Large language models (LLMs) offer a route to automation, but existing methods rely on manually engineered prompts that are difficult to reproduce, generalize, or evaluate. This study introduces a programmable RoB assessment pipeline that replaces ad-hoc prompt design with structured, code-based optimization using DSPy and its GEPA module. GEPA refines LLM reasoning through Pareto-guided search and produces inspectable execution traces, enabling transparent replication of every step in the optimization process. We evaluated the method on 100 RCTs from published meta-analyses across seven RoB domains. GEPA-generated prompts were applied to both open-weight models (Mistral Small 3.1 with GPT-oss-20b)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMeta-analysis and systematic reviews · Artificial Intelligence in Healthcare and Education · Advanced Causal Inference Techniques
