Expediting data extraction using a large language model (LLM) and scoping review protocol: a methodological study within a complex scoping review
James Stewart-Evans, Emma Wilson, Tessa Langley, Andrew Prayle, Angela Hands, Karen Exley, Jo Leonardi-Bee

TL;DR
This study explores using large language models with review protocols to expedite data extraction in complex scoping reviews, highlighting high accuracy for simple data but challenges with complex, subjective data items.
Contribution
It demonstrates the potential and limitations of LLMs in data extraction within scoping reviews and emphasizes the need for robust evaluation and reporting of LLM performance.
Findings
High accuracy for simple citation data extraction (83.3%-100%)
Lower accuracy for complex, subjective data (9.6%-15.8%)
LLM feedback can aid protocol refinement
Abstract
The data extraction stages of reviews are resource-intensive, and researchers may seek to expediate data extraction using online (large language models) LLMs and review protocols. Claude 3.5 Sonnet was used to trial two approaches that used a review protocol to prompt data extraction from 10 evidence sources included in a case study scoping review. A protocol-based approach was also used to review extracted data. Limited performance evaluation was undertaken which found high accuracy for the two extraction approaches (83.3% and 100%) when extracting simple, well-defined citation details; accuracy was lower (9.6% and 15.8%) when extracting more complex, subjective data items. Considering all data items, both approaches had precision >90% but low recall (<25%) and F1 scores (<40%). The context of a complex scoping review, open response types and methodological approach likely impacted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMeta-analysis and systematic reviews · scientometrics and bibliometrics research · Academic Writing and Publishing
