Unlucky Number 13? Manipulating Evidence Subject to Snooping
Uwe Hassler, Marc-Oliver Pohle

TL;DR
This paper examines how secret data snooping practices, termed MESSing, can significantly distort statistical inference, demonstrated through empirical analysis of lottery data suggesting 13 is an unlucky number.
Contribution
It introduces the concept of MESSing, categorizes various snooping practices, and quantifies their impact on statistical conclusions through empirical and theoretical examples.
Findings
MESSing can dramatically alter statistical inferences
Empirical example shows potential bias in lottery data
Quantitative analysis highlights risks of secret data manipulation
Abstract
Questionable research practices like HARKing or p-hacking have generated considerable recent interest throughout and beyond the scientific community. We subsume such practices involving secret data snooping that influences subsequent statistical inference under the term MESSing (manipulating evidence subject to snooping) and discuss, illustrate and quantify the possibly dramatic effects of several forms of MESSing using an empirical and a simple theoretical example. The empirical example uses numbers from the most popular German lottery, which seem to suggest that 13 is an unlucky number.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBenford’s Law and Fraud Detection · Probability and Statistical Research · Statistics Education and Methodologies
