Log-Paradox: Necessary and sufficient conditions for confounding statistically significant pattern reversal under the log-transform
Ben Cardoen, Hanene Ben Yedder, Sieun Lee, Ivan Robert Nabi, Ghassan, Hamarneh

TL;DR
This paper investigates the paradoxical phenomenon where applying a log-transform to data can reverse the direction of statistically significant trends, highlighting the importance of careful interpretation and robust analysis methods.
Contribution
It derives necessary and sufficient conditions for pattern reversal under log-transform and demonstrates how small dataset modifications can induce this paradox, especially in biomedical imaging.
Findings
Log-transform can cause significant trend reversals in data analysis.
Small dataset changes as little as 5% can induce pattern reversal.
Biomedical image quantification is highly susceptible to this paradox.
Abstract
The log-transform is a common tool in statistical analysis, reducing the impact of extreme values, compressing the range of reported values for improved visualization, enabling the usage of parametric statistical tests requiring normally distributed data, or enabling linear models on non-linear data. Practitioners are rarely aware that log-transformed results can reverse findings: a hypothesis test without the transform can show a negative trend, while with the log-transform, it can show a positive trend, both statistically significant. We derive necessary and sufficient conditions underlying this paradoxical pattern reversal using finite difference notation. We show that biomedical image quantification is very susceptible to these conditions. Using a novel heuristic maximizing the reversal, we show that statistical significance of the paradoxical pattern reversal can be easily induced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Analysis with R · Probabilistic and Robust Engineering Design
