# A hybrid approach to large-scale systematic literature reviews: combining automated tools with text-mining techniques

**Authors:** Zhao Hui Koh, Armita Zarnegar, Jason Skues, Greg Murray

PMC · DOI: 10.1186/s13104-026-07651-7 · BMC Research Notes · 2026-01-30

## TL;DR

The paper introduces a hybrid method combining automated tools and text-mining to improve the efficiency and accuracy of large-scale systematic literature reviews.

## Contribution

The novel hybrid approach enhances semi-automated tools by using text-mining to generate better seed articles, reducing biases and improving learning efficiency.

## Key findings

- The hybrid approach effectively reduces and screens a large number of articles (N=90,871) for systematic reviews.
- Simulations show the method can create comprehensive seed articles covering broad subject areas.
- The approach increases transparency and reusability of keywords for future review updates.

## Abstract

Semi-automated tools used during the preliminary screening of articles in systematic reviews can start with a small set of seed articles and actively learn from human decisions to prioritise more relevant articles for subsequent screening. However, given that these tools are vulnerable to biases and lack clear stopping criteria, their performance in large-scale systematic reviews remains uncertain, especially in reviews covering broad subject areas that require a substantial number of representative seed articles. This article presents a hybrid approach that uses text-mining techniques combined with a semi-automated tool to effectively reduce, screen, and validate a large cohort of articles (N = 90,871).

A preliminary evaluation using simulations indicated that this approach has the potential to craft a comprehensive collection of seed articles that covers broad subject areas for semi-automated tools in a large-scale systematic review. The strengths and limitations of using a semi-automated tool alone in such a context are discussed. Our approach increases the efficiency of automated tools by providing a larger and more focused selection of articles to start with, optimising the learning process for those tools and reducing biases. Additionally, our approach could increase the transparency and reusability of keywords for future review updates.

The online version contains supplementary material available at 10.1186/s13104-026-07651-7.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12930779/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12930779/full.md

## References

1 references — full list in the complete paper: https://tomesphere.com/paper/PMC12930779/full.md

---
Source: https://tomesphere.com/paper/PMC12930779