Reply to Cox et al. and Kessler et al.: Data and code sharing is the way forward for fMRI
Anders Eklund, Thomas Nichols, Hans Knutsson

TL;DR
This paper advocates for data and code sharing in fMRI research, emphasizing the importance of non-parametric statistical methods as valid alternatives to parametric ones, supported by recent software developments.
Contribution
It highlights the adoption of non-parametric methods in major fMRI software, promoting transparency and robustness in data analysis.
Findings
Non-parametric methods are valid alternatives for fMRI data analysis.
Major software now support non-parametric group inference.
Sharing data and code enhances reproducibility in fMRI research.
Abstract
We are glad that our paper has generated intense discussions in the fMRI field, on how to analyze fMRI data and how to correct for multiple comparisons. The goal of the paper was not to disparage any specific fMRI software, but to point out that parametric statistical methods are based on a number of assumptions that are not always valid for fMRI data, and that non-parametric statistical methods are a good alternative. Through AFNIs introduction of non-parametric statistics in the function 3dttest++, the three most common fMRI softwares now all support non-parametric group inference (SPM through the toolbox SnPM, and FSL through the function randomise).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Reply to Cox et al. and Kessler et al.: Data and code sharing is the way forward for fMRI
We are glad that our paper [1] has generated intense discussions in the fMRI field, on how to analyze fMRI data and how to correct for multiple comparisons. The goal of the paper was not to disparage any specific fMRI software, but to point out that parametric statistical methods are based on a number of assumptions that are not always valid for fMRI data, and that non-parametric statistical methods [2] are a good alternative. Through AFNI’s introduction of nonparametric statistics in the function 3dttest++ [3, 4], the three most common fMRI softwares now all support non-parametric group inference (SPM through the toolbox SnPM (http://warwick.ac.uk/snpm), and FSL through the function randomise).
1 Reply to Cox et al.
Cox et al. [3] correctly point out that the bug in the AFNI function 3dClustSim only had a minor impact on the false positive rate (FPR). This was also covered in the original paper [1], ”We note that FWE rates are lower with the bug-fixed 3dClustSim function. As an example, the updated function reduces the degree of false positives from 31.0% to 27.1% for a CDT of P = 0.01, and from 11.5% to 8.6% for a CDT of P = 0.001.” It is unfortunate that several media outlets focused extensively on this bug, when the main problem was found to be violations of the assumptions in the statistical models.
The statement that AFNI had particularly high FPRs, compared to SPM and FSL, is for example supported by Supplementary Figure 1 a [1] (Beijing data, two-sample t-test with 20 subjects, cluster defining threshold p = 0.01). For 8 mm of smoothing, the FPR for AFNI is 23% - 31%, while it is 13% - 20% for SPM and 14% - 18% for FSL OLS. To understand the higher FPRs we investigated how the 3dClustSim function works, which eventually lead us to finding the bug in 3dClustSim. However, we agree that AFNI did not produce higher FPRs for all parameter combinations.
The 70% FPR comes from Supplementary Figure 9 c [1] (Oulu data, one-sample t-test with 40 subjects, cluster defining threshold p = 0.01, FSL OLS with 4 mm smoothing), and not, as some readers believed, from Figure 2 in the original paper [1] which shows results for the ad-hoc clustering approach. The main reason for using the highest observed FPR was to give the reader an idea of how severe the problem can be, but we agree that it lead to a too pessimistic view.
As pointed out by Cox et al. [3], the non-parametric approach also performed sub-optimal for the one-sample t-test, especially for the Oulu data. As discussed in our paper, the one-sample t-test has an assumption of symmetrically distributed errors that can be violated by outliers in small samples. Our current research is therefore focused on how to improve the non-parametric test for one-sample t-tests. Regarding the flexibility of the permutation testing, recent work has shown that virtually any regression model with independent errors can be accomodated [2], and even longitudinal and repeated measures data can be analyzed with a related bootstrap approach [5].
2 Reply to Kessler et al.
Kessler et al. [6] extend our evaluations to (non-parametric) cluster based false discovery rate (FDR) on task data, to better understand how existing parametric cluster p-values based on the familywise error rate (FWE) should be interpreted. For the problematic cluster defining threshold of p = 0.01, they conclude that a cluster FWE-corrected p-value smaller than p = 0.00001 survives FDR correction at q = 0.05. Indeed, this information makes it easier to interpret existing results in the fMRI literature, but it should be noted that it is not straightforward to generalize these results to other studies. For example, the fMRI software used, the MR sequence used (EPI or multiband), the degree of smoothing and the number of subjects are all likely to affect this cutoff. The only way to retrospectively evaluate existing results is, in our opinion, to re-analyze the original fMRI data (e.g. made available through OpenfMRI [7]) or to apply a new threshold to the statistical maps (e.g. made available through NeuroVault [8]).
3 Importance of data and code sharing
Cox et al. [3, 4] replicated and extended our findings with the same open fMRI data [9] as in our original paper (and made use of our processing scripts available on github, https://github.com/wanderine/ParametricMultisubjectfMRI), ultimately resulting in improvements to the AFNI software. Further, we never would have been able to identify the bug in 3dClustSim were AFNI not open source software. Kessler et al. [6] also used the same task datasets from OpenfMRI [7], to find the empirical cluster FDR. Together, these examples show the importance of data sharing [10, 11], open source software [12], code sharing [13, 14] and reproducibility [15].
Acknowledgements
This research was supported by the Neuroeconomic Research Initiative at Linköping University, by Swedish Research Council Grant 2013-5229 (”Statistical Analysis of fMRI Data”), the Information Technology for European Advancement 3 Project BENEFIT (better effectiveness and efficiency by measuring and modelling of interventional therapy), the Swedish Research Council Linnaeus Center CADICS (control, autonomy, and decision-making in complex systems), and the Wellcome Trust.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Eklund, T. Nichols, and H. Knutsson, “Cluster failure: why f MRI inferences for spatial extent have inflated false positive rates,” PNAS , vol. 113, pp. 7900–7905, 2016.
- 2[2] A. Winkler, G. Ridgway, M. Webster, S. Smith, and T. Nichols, “Permutation inference for the general linear model,” Neuro Image , vol. 92, pp. 381–397, 2014.
- 3[3] R.W. Cox, G. Chen, D.R. Glen, R.C. Reynolds, and P.A. Taylor, “FMRI Clustering and False Positive Rates,” PNAS , 2016.
- 4[4] Robert W Cox, Richard C Reynolds, and Paul A Taylor, “AFNI and Clustering: False Positive Rates Redux,” bio Rxiv, doi: http://dx.doi.org/10.1101/065862 , 2016.
- 5[5] Guillaume, B. et al., “Fast and accurate modelling of longitudinal and repeated measures neuroimaging data,” Neuro Image , vol. 94, pp. 287–302, 2014.
- 6[6] D. Kessler, M. Angstadt, and C. Sripada, “Reevaluating “Cluster Failure” Using Nonparametric Control of the False Discovery Rate,” PNAS , 2016.
- 7[7] Poldrack, R. et al., “Toward open sharing of task-based f MRI data: the Openf MRI project,” Frontiers in Neuroinformatics , vol. 7, no. 12, 2013.
- 8[8] Gorgolewski, K. et al., “Neuro Vault.org: A repository for sharing unthresholded statistical maps, parcellations, and atlases of the human brain,” Neuro Image , vol. 124, pp. 1242–1244, 2015.
