A randomized trial in a massive online open course shows people don't know what a statistically significant relationship looks like, but they can learn
Aaron Fisher, G. Brooke Anderson, Roger Peng, Jeff Leek

TL;DR
This study shows that most people struggle to visually identify statistically significant relationships in scatterplots, but with practice, their detection ability can improve, highlighting misconceptions about what significance looks like.
Contribution
It provides empirical evidence that laypeople and students have difficulty recognizing significant relationships visually and demonstrates that training can enhance their detection skills.
Findings
Participants correctly identified 47.4% of significant relationships.
Adding visual aids increased false positives regardless of actual significance.
Repeated attempts improved detection of significant relationships.
Abstract
Scatterplots are the most common way for statisticians, scientists, and the public to visually detect relationships between measured variables. At the same time, and despite widely publicized controversy, P-values remain the most commonly used measure to statistically justify relationships identified between variables. Here we measure the ability to detect statistically significant relationships from scatterplots in a randomized trial of 2,039 students in a statistics massive open online course (MOOC). Each subject was shown a random set of scatterplots and asked to visually determine if the underlying relationships were statistically significant at the P < 0.05 level. Subjects correctly classified only 47.4% (95% CI: 45.1%-49.7%) of statistically significant relationships, and 74.6% (95% CI: 72.5%-76.6%) of non-significant relationships. Adding visual aids such as a best fit line or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Data Analysis with R · Big Data and Business Intelligence
