# Use as directed? A comparison of software tools intended to check rigor and transparency of published work

**Authors:** Peter Eckmann, Adrian Barnett, Alexandra Bannach-Brown, Elisa Pilar Bascunan Atria, Guillaume Cabanac, Louise Delwen Owen Franzen, Małgorzata Anna Gazda, Kaitlyn Hair, James Howison, Halil Kilicoglu, Cyril Labbe, Sarah McCann, Vladislav Nachev, Martijn Roelandse, Maia Salholz-Hillel, Robert Schulz, Gerben ter Riet, Colby Vorland, Anita Bandrowski, Tracey Weissgerber

PMC · DOI: 10.1371/journal.pone.0342225 · PLOS One · 2026-02-13

## TL;DR

This paper compares 11 automated tools for checking scientific rigor and transparency, finding that combining tools improves performance in some areas.

## Contribution

The study provides a systematic comparison of 11 tools across 9 rigor criteria and identifies optimal strategies for their use.

## Key findings

- Some tools outperformed others in detecting open data.
- Combining tools improved detection of inclusion/exclusion criteria.
- Recommendations were made for improving tool development.

## Abstract

The causes of the reproducibility crisis include lack of standardization and transparency in scientific reporting. Checklists such as ARRIVE and CONSORT seek to improve transparency, but they are not always followed by authors and peer review often fails to identify missing items. To address these issues, there are several automated tools that have been designed to check different rigor criteria. We have conducted a broad comparison of 11 automated tools across 9 different rigor criteria from the ScreenIT group. We found some criteria, including detecting open data, where the combination of tools showed a clear winner, a tool which performed much better than other tools. In other cases, including detection of inclusion and exclusion criteria, the combination of tools exceeded the performance of any one tool. We also identified key areas where tool developers should focus their effort to make their tool maximally useful. We conclude with a set of insights and recommendations for stakeholders in the development of rigor and transparency detection tools. The code and data for the study is available at https://github.com/PeterEckmann1/tool-comparison.

## Full-text entities

- **Diseases:** cancer (MESH:D009369), LLM (MESH:D007806), hallucinations (MESH:D006212)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12904390/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12904390/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC12904390/full.md

---
Source: https://tomesphere.com/paper/PMC12904390