Can We Mathematically Spot Possible Manipulation of Results in Research Manuscripts Using Benford's Law?
Teddy Lazebnik, Dan Gorlitsky

TL;DR
This paper presents a semi-automatic method using an adaptive version of Benford's law to detect potential manipulation in research results from manuscripts, validated on datasets and recent economic journal articles.
Contribution
It introduces a novel application of an adaptive Benford's law approach to identify possible data manipulation in research manuscripts without needing raw data.
Findings
Predicted 79% of datasets accurately using the method.
Detected a 3% potential manipulation rate in recent economic papers.
Method offers a new tool for verifying research integrity.
Abstract
The reproducibility of academic research has long been a persistent issue, contradicting one of the fundamental principles of science. What is even more concerning is the increasing number of false claims found in academic manuscripts recently, casting doubt on the validity of reported results. In this paper, we utilize an adaptive version of Benford's law, a statistical phenomenon that describes the distribution of leading digits in naturally occurring datasets, to identify potential manipulation of results in research manuscripts, solely using the aggregated data presented in those manuscripts. Our methodology applies the principles of Benford's law to commonly employed analyses in academic manuscripts, thus, reducing the need for the raw data itself. To validate our approach, we employed 100 open-source datasets and successfully predicted 79% of them accurately using our rules.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBenford’s Law and Fraud Detection
