Comparing the Pearson and Spearman Correlation Coefficients Across Distributions and Sample Sizes: A Tutorial Using Simulations and Empirical Data

J. C. F. de Winter, S. D. Gosling, and J. Potter

arXiv:2408.15979·stat.ME·August 29, 2024

TL;DR

This tutorial compares the Pearson (rp) and Spearman (rs) correlation coefficients across distributions and sample sizes on three criteria: variability, bias with respect to the population value, and robustness to an outlier. It closes with practical recommendations for psychological research.

Contribution

It combines simulations with sampling studies of empirical datasets to show where rp and rs differ, and to guide the choice between them based on the distribution of the data.

Findings

01 rs is more variable than rp for normally distributed variables.

02 rp is more variable than rs for variables with high kurtosis.

03 rs often better reflects the population correlation in heavy-tailed data.

Abstract

The Pearson product-moment correlation coefficient (rp) and the Spearman rank correlation coefficient (rs) are widely used in psychological research. We compare rp and rs on 3 criteria: variability, bias with respect to the population value, and robustness to an outlier. Using simulations across low (N = 5) to high (N = 1,000) sample sizes we show that, for normally distributed variables, rp and rs have similar expected values but rs is more variable, especially when the correlation is strong. However, when the variables have high kurtosis, rp is more variable than rs. Next, we conducted a sampling study of a psychometric dataset featuring symmetrically distributed data with light tails, and of 2 Likert-type survey datasets, 1 with light-tailed and the other with heavy-tailed distributions. Consistent with the simulations, rp had lower variability than rs in the psychometric dataset. In…
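
The variability comparison described in the abstract can be illustrated with a small Monte Carlo sketch. This is not the authors' code. The function names (`pearson`, `ranks`, `spearman`, `simulate`), the sample size, the correlation of 0.8, the number of replications, and the seed are all hypothetical choices made for illustration. The heavy-tailed case is produced by cubing normal variates, which is simply one convenient way to obtain high kurtosis and is an assumption here, not necessarily a distribution used in the paper.

```python
import math
import random
import statistics


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def ranks(values):
    """Ranks starting at 1, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=values.__getitem__)
    out = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            out[order[k]] = avg
        i = j + 1
    return out


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def simulate(n, rho, reps, transform=lambda v: v, seed=1):
    """Return (SD of rp, SD of rs) over `reps` samples of size `n`.

    Pairs are drawn from a bivariate normal with correlation `rho`, then
    passed through `transform` (an illustrative assumption, see lead-in).
    """
    rng = random.Random(seed)
    rp, rs = [], []
    for _ in range(reps):
        x, y = [], []
        for _ in range(n):
            z1, z2 = rng.gauss(0, 1), rng.gauss(0, 1)
            x.append(transform(z1))
            y.append(transform(rho * z1 + math.sqrt(1 - rho**2) * z2))
        rp.append(pearson(x, y))
        rs.append(spearman(x, y))
    return statistics.stdev(rp), statistics.stdev(rs)


if __name__ == "__main__":
    print("normal:       SD(rp), SD(rs) =", simulate(20, 0.8, 2000))
    print("heavy-tailed: SD(rp), SD(rs) =",
          simulate(20, 0.8, 2000, transform=lambda v: v**3))
```

Because cubing is a monotone transformation, it leaves the ranks (and so rs) unchanged while inflating the tails that rp is sensitive to, which makes the contrast between the two coefficients easy to see in this sketch.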
