To Err Is Human: Systematic Quantification of Errors in Published AI Papers via LLM Analysis
Federico Bianchi, Yongchan Kwon, Zachary Izzo, Linjun Zhang, James Zou

TL;DR
This paper introduces a GPT-5 based system to systematically identify and correct objective errors in published AI research papers, revealing a rising trend in mistakes over recent years.
Contribution
It presents a novel AI-driven method for automatic error detection and correction in scientific literature, enhancing reproducibility and reliability of AI research publications.
Findings
Published AI papers contain a significant number of objective mistakes.
The average number of mistakes per paper has increased over recent years.
The AI checker achieves an 83.2% precision in identifying actual mistakes.
Abstract
How many mistakes do published AI papers contain? Peer-reviewed publications form the foundation upon which new research and knowledge are built. Errors that persist in the literature can propagate unnoticed, creating confusion in follow-up studies and complicating reproducibility. The accelerating pace of research and the increasing demands on the peer-review system make such mistakes harder to detect and avoid. To address this, we developed a Paper Correctness Checker based on GPT-5 to systematically identify mistakes in papers previously published at top AI conferences and journals. Our analysis focuses on objective mistakes-e.g., errors in formulas, derivations, calculations, figures, and tables-that have a clearly verifiable ground truth. We intentionally exclude subjective considerations such as novelty, importance, or writing quality. We find that published papers contain a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Academic Publishing and Open Access · Meta-analysis and systematic reviews
