Accelerating the pace and accuracy of systematic reviews using AI: a validation study
Jiada Zhan, Kara Suvada, Muwu Xu, Wenya Tian, Kelly C. Cara, Taylor C. Wallace, Mohammed K. Ali

TL;DR
This study validates the use of AI in speeding up systematic reviews by comparing its accuracy to human decisions in screening research articles.
Contribution
The study evaluates the performance of Review Copilot (GPT-4) in systematic review screening tasks against human decisions.
Findings
Review Copilot showed high sensitivity (99.2%) but moderate specificity (83.6%) in title/abstract screening.
Full-text screening by Review Copilot had high sensitivity (97.6%) but lower specificity (47.4%).
AI screening was completed in one-quarter of the time compared to human screening.
Abstract
Artificial intelligence (AI) can greatly enhance efficiency in systematic literature reviews and meta-analyses, but its accuracy in screening titles/abstracts and full-text articles is uncertain. This study evaluated the performance metrics (sensitivity, specificity) of a GPT-4 AI program, Review Copilot, against human decisions (gold standard) in screening titles/abstracts and full-text articles from four published systematic reviews/meta-analyses. Participant data from four already-published systematic literature reviews were used for this validation study. This was a study comparing Review Copilot to human decision-making (gold standard) in screening titles/abstracts and full-text articles for systematic reviews/meta-analyses. The four studies that were used in this study included observational studies and randomized control trials. Review Copilot operates on the OpenAI, GPT-4…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Meta-analysis and systematic reviews · Digital Mental Health Interventions
