Automated title and abstract screening for scoping reviews using the GPT-4 Large Language Model
David Wilkins

TL;DR
This paper introduces GPTscreenR, an R package utilizing GPT-4 to automate literature screening in scoping reviews, aiming to reduce human effort while maintaining reasonable accuracy.
Contribution
The paper presents GPTscreenR, a novel software tool that applies GPT-4 with chain-of-thought prompting to automate scholarly source screening in reviews.
Findings
GPTscreenR achieved 71% sensitivity and 89% specificity.
Performance was comparable to zero-shot techniques.
Neither method matched human intraobserver agreement.
Abstract
Scoping reviews, a type of literature review, require intensive human effort to screen large numbers of scholarly sources for their relevance to the review objectives. This manuscript introduces GPTscreenR, a package for the R statistical programming language that uses the GPT-4 Large Language Model (LLM) to automatically screen sources. The package makes use of the chain-of-thought technique with the goal of maximising performance on complex screening tasks. In validation against consensus human reviewer decisions, GPTscreenR performed similarly to an alternative zero-shot technique, with a sensitivity of 71%, specificity of 89%, and overall accuracy of 84%. Neither method achieved perfect accuracy nor human levels of intraobserver agreement. GPTscreenR demonstrates the potential for LLMs to support scholarly work and provides a user-friendly software framework that can be integrated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Meta-analysis and systematic reviews
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Byte Pair Encoding · Dropout · Adam · Softmax · Label Smoothing · Dense Connections
