FAST Adaptive Smoothing and Thresholding for Improved Activation Detection in Low-Signal fMRI
Israel Almod\'ovar-Rivera, Ranjan Maitra

TL;DR
This paper introduces FAST, an automated adaptive smoothing and thresholding method that enhances activation detection in low-signal fMRI data, improving reliability in identifying brain regions during cognitive tasks.
Contribution
The paper presents a novel FAST algorithm combining smoothing and extreme value theory for better thresholding in low-signal fMRI analysis.
Findings
Effective in low-signal scenarios
Performs well in identifying sensory-specific brain regions
Shows promising results across diverse experiments
Abstract
Functional Magnetic Resonance Imaging is a noninvasive tool for studying cerebral function. Many factors challenge activation detection, especially in low-signal scenarios that arise in the performance of high-level cognitive tasks. We provide a fully automated fast adaptive smoothing and thresholding (FAST) algorithm that uses smoothing and extreme value theory on correlated statistical parametric maps for thresholding. Performance on experiments spanning a range of low-signal settings is very encouraging. The methodology also performs well in a study to identify the cerebral regions that perceive only-auditory-reliable or only-visual-reliable speech stimuli.
| Decreasing | Increasing | |
| 2 | (0.6, 0.3) | (0.3, 0.6) |
| 3 | (0.4, 0.3, 0.2) | (0.2, 0.3, 0.4) |
| 4 | (0.3, 0.25, 0.20, 0.15) | (0.15, 0.20, 0.25, 0.3) |
| 5 | (0.3, 0.25, 0.20, 0.10, 0.05) | (0.05, 0.10, 0.20, 0.25, 0.30) |
| Decreasing-Increasing | Increasing-Decreasing | |
| 2 | (0.6,0.3) | (0.3,0.6) |
| 3 | (0.4,0.1,0.4) | (0.1,0.7,0.1) |
| 4 | (0.4,0.05,0.05,0.4) | (0.05,0.4,0.4,0.05) |
| 5 | (0.25,0.15,0.1,0.15,0.25) | (0.1,0.15,0.4,0.15,0.1) |
| 0 | 0.01 | 0.025 | 0.05 | 0.075 | 0.1 | 0.25 | 0.5 | 0.75 | 0.99 | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.001 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0.025 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0.05 | 0 | 2 | 2 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | |
| 0.075 | 1 | 4 | 4 | 1 | 0 | 0 | 0 | 0 | 2 | 2 | |
| 0.1 | 3 | 5 | 6 | 1 | 0 | 0 | 0 | 0 | 2 | 3 |
| 0 | 0.01 | 0.025 | 0.05 | 0.075 | 0.1 | 0.25 | 0.5 | 0.75 | 0.99 | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.001 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0.025 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0.05 | 0 | 2 | 2 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | |
| 0.075 | 1 | 4 | 4 | 1 | 0 | 0 | 0 | 0 | 2 | 2 | |
| 0.1 | 3 | 5 | 6 | 1 | 0 | 0 | 0 | 0 | 2 | 3 |
| 0 | 0.01 | 0.025 | 0.05 | 0.075 | 0.1 | 0.25 | 0.5 | 0.75 | 0.99 | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.001 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0.025 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0.05 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0.075 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0.1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 0 | 0.01 | 0.025 | 0.05 | 0.075 | 0.1 | 0.25 | 0.5 | 0.75 | 0.99 | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.001 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 | |
| 0.01 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 | |
| 0.05 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 | |
| 0.05 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 | |
| 0.075 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 | |
| 0.1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
FAST Adaptive Smoothing and Thresholding for Improved
Activation Detection in Low-Signal fMRI
Israel Almodóvar-Rivera and Ranjan Maitra I. Almodóvar-Rivera is with the Department of Biostatistics and Epidemiology at the University of Puerto Rico, Medical Science Campus, San Juan, Puerto Rico, USA.R.Maitra is with the Department of Statistics, Iowa State University, Ames, Iowa, USA.An earlier version of this article won I. Almodóvar-Rivera a Student Paper Competition award sponsored by the American Statistical Association Section on Medical Devices and Diagnostics at the 2018 Joint Statistical Meetings.This research was supported in part by the National Institute of Biomedical Imaging and Bioengineering (NIBIB) of the National Institutes of Health (NIH) under its Award No. R21EB016212, I. Almodóvar-Rivera also acknowledges receipt of a fellowship from Iowa State University’s Alliance for Graduate Education and the Professoriate (AGEP) program for underrepresented graduate students in STEM fields. The content of this paper however is solely the responsibility of the authors and does not represent the official views of either the NIBIB or the NIH.©201? IEEE. Personal use of this material is permitted. However, permission to use this material for any other purposes must be obtained from the IEEE by sending a request to [email protected] received xxxx xx,201x; revised xxxxxxxx xx, 201x. Accepted xxxxxxxx xx, 201x. First published xxxxxxxx x, xxxx, current version published yyyyyyyy y, yyyyColor versions of one or more of the figures in this paper are available online at http://ieeexplore.org.Digital Object Identifier
Abstract
Functional Magnetic Resonance Imaging is a noninvasive tool for studying cerebral function. Many factors challenge activation detection, especially in low-signal scenarios that arise in the performance of high-level cognitive tasks. We provide a fully automated fast adaptive smoothing and thresholding (FAST) algorithm that uses smoothing and extreme value theory on correlated statistical parametric maps for thresholding. Performance on experiments spanning a range of low-signal settings is very encouraging. The methodology also performs well in a study to identify the cerebral regions that perceive only-auditory-reliable or only-visual-reliable speech stimuli.
Index Terms:
AM-FAST, AR-FAST, Adaptive Segmentation, AFNI, BIC, CNR, Cluster Thresholding, SPM, SUMA, TFCE
I Introduction
Functional Magnetic Resonance Imaging (fMRI) [1, 2, 3, 4, 5, 6, 7, 8, 9] studies the spatial characteristics and extent of brain function while at rest or, more commonly, while performing tasks or responding to external stimuli. The latter scenario is the setting for this paper. Here, the imaging modality acquires voxel-wise Blood-Oxygen-Level-Dependent (BOLD) measurements [10, 11] at rest and during stimulation or performance of a task. After pre-processing, a general linear or other statistical model [4, 12] is fit to the time course sequence against the expected BOLD response [13, 14, 15]. Statistical Parametric Mapping (SPM) [16] techniques provide voxel-wise test statistics summarizing the association between the time series response at each voxel and the expected BOLD response [3]. The map of test statistics is then thresholded to identify significantly activated voxels [17, 18, 19]. The analysis of fMRI datasets is challenged [20, 21, 22, 23] by factors such as scanner, inter- and intra-subject variability, voluntary/involuntary or stimulus-correlated motion and also the several-seconds delay in the BOLD response as the neural stimulus passes through the hemodynamic filter [24, 23, 25]. Pre-processing [22, 26] mitigates some of these effects, but additional challenges are presented by the fact that an fMRI study is expected to have no more than 1-3% activated voxels [27, 8]. Also, many activation studies involving high-level cognitive processes have low contrast-to-noise ratios (CNR), throwing up additional challenges as illustrated next.
I-A Activation Detection during Noisy Audiovisual Speech
The most important form of human communication is speech [28, 29, 30], which the brain is adept at understanding even in noisy surroundings. This ability may be due [31] to the brain’s capacity for multisensory integration of independently-acquired visual and auditory input information which reduces noise and allows for more accurate perception [32, 33]. Recently, [31] studied the role of the superior temporal sulcus (STS) in perceiving noisy speech, through fMRI and behavioral experiments, and established increased connectivity between the STS and the auditory or the visual cortex depending on whichever modality was more reliable, that is, less noisy.
[31] provided results on regions of interest (ROIs) drawn on the STS and the auditory and visual cortices. However, the full benefit of fMRI can be realized only if we move beyond assessing cerebral function at the ROI level to understanding it at the voxel level. Reliable voxel-wise activation detection in individual subjects may increase the adoption of fMRI in a clinical setting. All these are potential scenarios with low CNRs where accurate activation detection methods are needed.
I-B Background and Current Practices
Many thresholding methods [34, 35, 36, 37, 38] in fMRI address multiple testing issues in determining significance of test statistics but ignore spatial resolution. Acquired images are instead often spatially smoothed prior to analysis, but such non-adaptive smoothing reduces both the adaptive spatial resolution and the number of available independent tests for activation detection [39]. There are also iterative adaptive smoothing and segmentation methods such as propagation-separation (PS) [39] and adaptive-segmentation (AS) [40] that essentially segment the SPM into activated and inactivated voxels. PS approximately yields a random -field and uses Random Field Theory for segmentation while AS uses multi-scale testing. [40] argued for AS because of its more general development and fewer model assumptions. AS also requires no heuristic corrections for spatial correlation, provides decisions at prescribed significance levels and showed [40] better performance over PS in an auditory experiment. However, AS requires pre-specified bandwidth sequences and ignores correlation within the SPM. So Section of this paper develops theory and methodology for fully automated Fast Adaptive Smoothing and Thresholding (FAST) algorithms that account for correlation and obviate the need for setting all but one parameters. Performance evaluations on real datasets and large-scale simulation experiments are in Section III. Section IV revisits the dataset of Section I-A, while Section V provides discussion. A supplement with sections, figures and tables referenced using the prefix “S” is available.
II Theory and Methods
II-A Preliminaries
Let be the time series vector of the observed BOLD response at the th voxel obtained after preprocessing for registration and other corrections. It is common to relate to the expected BOLD response via the general linear model
[TABLE]
where is a th-order auto-regressive (AR) Gaussian error vector with AR coefficients and marginal variance . Without loss of generality (w.l.o.g.), assume that the design matrix has the intercept in the first column, the expected BOLD response for the stimulus levels in the next columns, and polynomial terms for the drift parameter in the remaining columns. Therefore, is a coefficient vector of length . We assume that the image volume has voxels, so in (1). The parameters s are usually estimated via generalized least squares or restricted maximum likelihood. A typical analysis approach then applies (voxel-wise) hypothesis tests with the null hypothesis specifying no activation owing to the stimulus or task. SPMs of the form with appropriate contrasts are then formulated at each voxel.
Many researchers use models that assume independent or AR(1) errors, while others pre-whiten the time series before fitting (1) under independence. Misspecified models can yield less precise SPMs [41, 42, 43, 44] so here we assume AR() errors, with assessed by the Bayes Information Criterion (BIC) [45, 46] that trades a fitted model’s complexity against its fidelity to the data. Tests on the SPM identify voxels that are activated with the application of the stimulus. Our objective is to develop an approach that adaptively and automatically smooths and thresholds the SPM while incorporating spatial correlation and the fact that the sequential thresholding results in SPMs from truncated distributions. Before detailing our methods, we provide some theoretical development.
II-B Theoretical Development
We assume -distributed SPMs with degrees of freedom large enough for them to be approximately standard normally distributed under the hypothesis of no activation. The SPM has a homogeneous correlation structure, a reasonable assumption with our use of radially symmetric smoothing kernels. We have
Theorem 1**.**
Let where and is a circulant correlation matrix with only nonnegative elements such that also has no negative entries. Writing , we let be the sum of the elements in any row of . Further, let be the maximum value of , that is, . Then the cumulative distribution function (CDF) of is given by , where is the CDF of the standard normal random variable. The equality holds when also has no negative entries.
Proof.
Let and have CDF . Then so that
[TABLE]
Now is also circulant and where is the sum of the elements of any row of and . If has no negative elements, and then equality holds in (LABEL:th1). ∎
Corollary 2**.**
For and as in Theorem 1, the limiting distribution of is bounded below by one that lies in the domain of attraction of the Gumbel distribution, and satisfies
[TABLE]
where and , with the standard normal probability density function (PDF).
Proof.
Each element of has CDF and PDF . Then has CDF that satisfies with and [47]. The result follows from Theorem 1. ∎
This paper uses for which can be shown, using Theorem 2 of [48], to have no negative entries. Then Corollary 2 provides a conservative bound for the quantiles of the limiting distribution of with the conservatism determined by the negative elements of .
The thresholding steps yield truncated (and correlated) random variables for potential thresholding in subsequent steps. We account for this added complication by deriving the limiting distribution of the maximum of a correlated sample from a right-truncated normal distribution.
Suppose that are independent identically distributed (IID) random variables from but truncated at , then each has PDF and CDF , where
[TABLE]
where is the indicator function. Then has CDF with limiting distribution as follows.
Theorem 3**.**
Let be a sample from (4). Then the limiting distribution of belongs to the domain of attraction of the reverse Weibull distribution and satisfies
[TABLE]
for some . Here and .
Proof.
Note that . From Theorem 10.5.2 in [49], for to be in the domain of attraction of the reverse Weibull, it sufficient to show that
[TABLE]
for some [50]. In our case, the limit holds because . Then upon using L’Hôpital’s rule, we have
[TABLE]
Thus the right-truncated normal distribution satisfies the reverse Weibull condition and converges to the reverse Weibull distribution with in (5). The constants in the theorem are as per extreme value theory [49, 47]. ∎
Theorem 4**.**
Let be a random vector from the density but that is right-truncated in each coordinate at , with and as in Theorem 1. Then the CDF of is .
Proof.
For IID random variables from (4), has CDF , where with a vector of IID random variables from . Then
[TABLE]
proving the statement of the theorem. ∎
Corollary 5**.**
Let and be as in Theorem 4. Then the limiting distribution of belongs to the domain of attraction of the reverse Weibull distribution, and satisfies:
[TABLE]
where and .
Proof.
The result is immediate from Theorems 3 and 4. ∎
II-C Fast Adaptive Smoothing and Thresholding
We propose our FAST algorithm that adaptively and, in sequence, smooths and identifies activated regions by thresholding. We estimate the amount of smoothing robustly or from the correlation structure that we assume is well-approximated by an ellipsoidally-symmetric 3D Gaussian kernel oriented along the three axes and with parameters . That is, under the null hypothesis (of no activation anywhere), we assume that the SPM where , with a circulant smoothing matrix [51]. Let . We estimate and by maximizing the loglikelihood function
[TABLE]
Both and are speedily computed using Fast Fourier Transforms (FFTs). Starting with the SPM , obtained as discussed in Section II-A, we propose the algorithm:
Initial Setup. At this point, assume that , where is the activation status of the th voxel. That is, assume that all voxels are inactive. Set . Also denote , and , where denotes the number of voxels for which . 2. 2.
Iterative Steps, For iterate as follows:
- (a)
Smoothing. Smooth in one of three ways:
- i.
Adaptive Likelihood maximization (ALL-FAST, pronounced ôl-fast): Maximize (7) given to obtain . Smooth with to get . 2. ii.
Adaptive Model-based smoothing (AM-FAST, pronounced ăm-fast): Use a Markov Random Field (MRF) prior model with parameters estimated using empirical Bayes methods as described in Section S1 to get from . 3. iii.
Adaptive Robust smoothing (AR-FAST, ahr-fast): Use [52] to smooth and get . 2. (b)
Standardization. Maximize (7) given to obtain , and . Standardize by scaling with a robust version of , say . 3. (c)
Adaptive Thresholding. This consists of two steps:
- i.
For , use Corollary 2 to obtain , otherwise (i.e. for ) use Corollary 5 to get . In both cases, use . 2. ii.
From the Gumbel (for ) or reverse Weibull distributions (for ), get
[TABLE]
where and are the upper-tail -values for the Gumbel and the reverse Weibull (with ) distributions, respectively. 3. iii.
Set if and if the th coordinate of exceeds . Let . 3. 3.
Termination.
Declare no activation and terminate if . Otherwise, let be the Jaccard Index [53, 54] of the activation maps in the th and th iterations. If , the algorithm terminates – the final activation map is .
Comments
A few comments are in order:
Correlation structure
A circulant correlation structure allows for spatial context in the association between the values of the voxel-wise test statistics, while having the added benefit of speedy computations via the use of FFTs.
Robust Estimation of
The estimate from (7) assumes no activation in . Ignoring the activation can inflate the estimate, so we obtain where is the estimated SD of , and is its biweight-estimated SD [55], both assuming a zero mean. Specifically, if has th component , we calculate
[TABLE]
where , the median absolute deviation of from 0, and .
Comparison with AS
Our FAST algorithms are similar to AS [40] in that they also smooth and threshold iteratively. But there are a few fundamental differences. The AS approach has a set user-specified sequence of bandwidths that smooths at each step. In contrast, ALL-, AM- and AR-FAST use likelihood, empirical Bayes and robust methods to optimally determine at each step. AS also thresholds but uses a general Fréchet extreme value distribution that ignores spatial context and the correlated truncated nature of the random variables that arise from the smoothing and thresholding at each iteration. Our development represents the procedure more accurately because we account for both the correlation structure (with the initial cut-off decided as per the Gumbel distribution) and the truncation (with subsequent cut-offs determined by the reverse Weibull distribution). Our more general allows for different amounts of smoothing in each axis. Finally, our method is entirely data-driven, with termination declared only if there is no initial activation or when between subsequent activation maps decreases.
Two-sided alternatives
Our development here builds from one-sided tests where large values are the extreme values of the SPM. For two-sided alternatives, we use the algorithm individually on the SPM and its negative, but replacing in (8) with . This provides two (disjoint) activation maps, the union of which is the two-sided activation map.
III Performance Evaluations
We studied performance of FAST relative to the most popular and relevant alternatives. Our evaluations were on real and simulated datasets and compared FAST with cluster thresholding (CT) applied with per [38], a second-order neighborhood and number of voxels in cluster determined by [56]’s 3dClustSim function, threshold-free cluster enhancement (TFCE) [37], permutation-based testing (PBT) [57], AS and PS (applied as AWS or adaptive-weighted smoothing [58]). We used and in FAST to obtain insight into the role of . We used R packages RFASTfMRI for FAST, fMRI for AS and AWS, AnalyzeFMRI for CT and permuco for TFCE and PBT.
III-A Finger-Tapping Experiments
Our first set of evaluations used the 12 replicated SPMs [24, 25] from the right-hand (RH) and left-hand (LH) finger tapping study of a RH-dominant male. For each method, Figure 1 displays
the summarized Jaccard similarity () between the activation maps from the 12 replications. For the RH experiments, AR-FAST showed the highest reliability of detected activation with . AR-FAST at and TFCE were marginally behind and AS and AWS also doing reasonably. TFCE was a bit better than AR-FAST for the LH experiments. The generally low for all methods points to potential issues in data quality and processing [54].
III-B Experiments on Simulated Phantom Data
Our next set of examples evaluated performance on phantom data simulated using (1) under different conditions.
III-B1 Motif and Stripes
We first study performance using the simulation setup of [40]. We thank K. Tabelow for readily sharing code that created the motif and three striped (, , ) phantoms of Figures 2LABEL:sub@motif-LABEL:sub@str64. The phantoms have 14, 50, 28 and 47% truly activated voxels, or more than the 1-3% expected in typical fMRI experiments. We used s as per [40] and CNRs of between 0.75 to 2.68 for the motif and 1 to 2 for the stripes. These examples are of two-sided alternatives. All simulations had AR(1) errors with . For AS and AWS, we adopted the maximum bandwidth sequence values (, 1, 2 and 3) in [40] for the four respective phantoms as the best-case specific choices. Figure 2LABEL:sub@JI-Tabelow summarizes performance. There is no clear overall winner but AM-FAST, PBT and TFCE find no activation at all (Figure S1). ALL-FAST and AR-FAST, in that order, perform creditably in some situations but not in others.
III-B2 Large-scale study with modified Hoffman phantom
The phantoms in [40], with uniform underlying structure () and no drift, but varying CNR, are not particularly representative of cerebral activation and do not provide much insight into performance of different activation methods. So we performed a large simulation study using a more realistic phantom and experimental setup that matches (1). We used a modified version of the digitized 2D Hoffman phantom [59] of Positron Emission Tomography, with 3465 in-brain pixels, representing two types (say, A and B) of anatomic structures – the latter has 138 deemed truly activated pixels in two distinct regions (Figure 3).
The th pixel in the phantom had values in (1) as per its location (see Figure 3).
As in (1), the design matrix had the intercept in the first column. The second column had the hemodynamic response function (HRF) [7] convolved with the input stimulus time series that alternated as 16 on-off blocks of 6 time-points each. The third column of represented linear drift and was set to (). As per (1), AR() Gaussian errors were simulated for different and at each pixel. Specifically, for each , we considered AR coefficients for a range of s with coefficients that were, with lag, (a) all equal, (b) decreasing, (c) increasing, (d) first decreasing, then increasing and (e) first increasing and then decreasing. We restricted to ensure stationary solutions. So for all AR(1) cases. For , for the equal AR coefficients scenario and as per Table I for the other cases.
Finally, to correspond to very low to moderate CNR = 0.50, 0.75, and 1.0. By design, our SNRs were 10 times our CNRs. Time series images were simulated using (1) and the setup of Fig. 3 and Table I.
For each pixel, we fit (1) with chosen from {0,1,2,3,4,5} using BIC. SPMs were generated as per Section II-A. Figure 4 provides sample SPMs and activation maps with the three top-performing methods: AR-FAST, AM-FAST and AS, for the three CNR settings with AR(2) errors and coefficients decreasing with order.
(See Figures S2 for activation maps using all methods and Figures S2–S5 for cases with other and/or decreasing order.) All methods do poorly for CNR=0.5 in this example, but AS correctly finds a few activated voxels. AM- and AR-FAST perform very well for CNR=0.75 and 1.0. Other methods – in particular ALL-FAST, CT, TFCE and PBT – barely find activation on this example.
To more fully understand performance, we replicated our experiment 25 times for each simulation setting. Figure S6 shows performance in estimating , with over-estimation and mild under-estimation for large and small values of the true AR order. The pattern broadly holds at all CNRs and s. We now discuss performance of activation detection methods on SPMs obtained upon fitting AR(). Figure 5 displays overall performance, in terms of , of all methods in the decreasing- cases. Performances with other types of s are similar (Figure S7). Figures S8–S12 display the number of activated voxels and the false and true positive rates (FPRs and TPRs). FAST methods are among the top performers at all CNRs: AM- and AR-FAST do very well for experiments with orders other than AR(1). (All methods do poorly at CNR=0.50 in the AR(1) case: we surmise that is because of highly-correlated noise obtained with .) AS and AWS are the next best performers but CT, TFCE and PBT perform very poorly with TPRs of below 25%. FAST methods have very high TPRs but FPRs of up to 0.2, 0.3 and 0.5% for ALL-, AM- and AR-FAST for , low CNR and . Overall, the slightly higher FPRs of the FAST methods are overwhelmed by their vastly higher TPRs, leading to their having the highest s.
Our threshold has a role, with smaller values performing better at higher CNRs and conversely. We suggest for low-CNR tasks and for high-CNR tasks. We suggest determining low or high CNR scenarios accordingly as whether the upper percentile of the estimated voxel-wise CNRs is less than the standard normal upper percentile (2.33) or not: the upper percentile of the estimated CNRs is chosen to include an activated voxel (if such exists) in the CNR determination. In our studies, ALL-FAST required more Step 2 iterations, but was computationally faster than AM-FAST AR-FAST and had lower TPR, FPR and , especially at low CNRs. Regardless, our methods were the fastest among all methods.
III-C Performance in Null Activation Scenarios
III-C1 Resting-State Dataset
A reviewer’s suggestion led us to apply our FAST algorithms on SPMs obtained upon fitting (1) to a resting state dataset [60, 61], with no activation identified even at . This zero FPR (when CNR=0) as opposed to the small FPR in low-CNR experiments may be due to Step 3 of our algorithm correctly allowing more Step 2 iterations to attenuate stray high-valued SPM voxels – termination is earlier in the low CNR cases given the spatially located weaker-signal peaks. For higher CNRs, Step 3 again adaptively admits more smoothing iterations that dampen stray high values in the SPM without substantially degrading the true high-signal peaks.
III-C2 Null-simulated SPMs
Another reviewer was concerned about multiple significance. Our use of is to set a threshold and not as a significance level: still, experiments on simulated null SPMs (Section S2) indicate no practical concerns.
IV Activation During Perception of Noisy Speech
The dataset, provided as data6 in the AFNI tutorial [56], is originally from an fMRI study [31] where
a subject heard and saw a female volunteer speak words, separately, in two different formats. The audio-reliable setting had the subject clearly hear the spoken word but see a degraded image of the speaker while the visual-reliable case had the subject clearly see the speaker vocalize the word but the audio was of reduced quality. There were three experimental runs, each consisting of a randomized design of 10 blocks, equally divided into blocks of audio-reliable and visual-reliable stimuli. -weighted images with volumes of (with voxels of dimension ) from echo-planar sequences (TR=2s) were obtained over time-points. Our interest was in determining activation corresponding to the audio () and visual () tasks. At each voxel, we fitted AR models for and chose with the highest BIC. Figure 6 uses AFNI and Surface Mapping (SUMA) to display activated regions obtained using AR-FAST on the SPM: see Figure S13 for maps drawn from ALL-FAST, AS, AWS and CT. We used because of the high (greater than 4) upper percentile of the voxel-wise estimated CNRs. Most of the activation occurs in Brodmann areas 18 and 19 (BA18 and BA19) which comprise the occipital cortex and the extrastriate (or peristriate) cortex. In humans with normal sight, this area is for visual association where feature-extraction, shape recognition, attentional and multimodal integrating functions occur. We also see increased activation in the STS, which recent studies [62] have related to distinguishing voices from environmental sounds, stories versus nonsensical speech, moving faces versus moving objects, biological motion and so on. ALL-FAST performs similarly as AR-FAST, while the other methods also identify the same regions but they identify a lot more activated voxels, some of which appear to be false positives. Although a detailed analysis of the results of this study is beyond the purview of this paper, we note that AR-FAST finds interpretable results even when applied to a single subject high-level cognition experiment.
V Discussion
We propose a new fully automated fast adaptive smoothing and thresholding algorithm suite called FAST with the ability to detect activation in low-signal settings. Three variants – ALL-FAST, AM-FAST and AR-FAST – are proposed with AR-FAST generally recommended because of its consistent good performance across a range of low-CNR experiments and real datasets. AM-FAST’s performance, while good, is more variable, while ALL-FAST appears to undersmooth but performs better in two-sided activation detection scenarios. Our methodology realistically accounts for both spatial correlation structure and is also developed under more accurate extreme value theory. Our algorithm suite is implemented in a R package RFASTfMRI available at https://github.com/ialmodovar/RFASTfMRI and is fully automated with one threshold choice for which we provide easily-implemented guidance. This contrasts with AS and AWS that require setting maximum smoothing bandwidths related to the expected diameter of activated regions [40] – a determination that may require considerable dexterity and is ambivalent when different-sized activation regions are expected.
A reviewer has pointed to the joint detection-estimation literature [63, 64] where estimation of the HRF and activation detection occur jointly. The FAST, AS and AWS algorithms can be placed in a related framework, with the distinction that the estimation step is of a more spatially consistent (smoothed) SPM. We also agree with another reviewer on other ways of ensuring spatial contiguity such as through Markov Random Field priors [65] and on the need to incorporate approaches also allowing for nonhomogeneous smoothing. Our algorithms converge by construction and are guaranteed to terminate. They are also seen to have good overall performance but, as observed by a reviewer, establishing the optimality properties and conditions and assumptions governing such properties may provide more solid theoretical grounding for FAST and improve its understanding and widen its applicability. Developing FAST for more sophisticated time series and spatial models, including in the context of complex-valued fMRI [66] as well as increased use of diagnostics in understanding activation and cognition are other important research areas and directions that would benefit from further attention.
Acknowledgment
The authors sincerely thank four anonymous reviewers and an Associate Editor whose helpful and insightful comments on an earlier version of this article greatly improved its content.
Supplementary Materials
S1 Model-based smoothing
In this section, we describe the model-based smoothing used in the AM-FAST algorithm. The starting point of our methodology is a SPM and our smoothed estimator is given by where is a smoothing matrix with parameter . We write = where
[TABLE]
with denoting the Kronecker product between matrices, is a identity matrix and is a tridiagonal matrix of the form
[TABLE]
with th eigenvalue given by for . In the above representation, is split into three components to represent the three axes of the volume image. Indeed represents the inverse of the dispersion matrix of a 3-D first-order Gaussian Markov Random Field with a neighborhood structure for each interior voxel given by the six nearest neighbors (one nearest in each of the two directions in each plane). Also with measuring the strength of the interaction between the neighbors in the th plane. For the edge voxels, the neighborhood structure is similar and given by the nearest neighboring voxels in each of the two directions in each plane, provided they exist in the imaging grid.
Our smoothed SPM is a penalized maximum likelihood estimator and is the same as the maximum a posterioiri (MAP) or posterior mean estimator of when we have and . So, the regularization parameter may be chosen using an empirical Bayes estimate. This estimate is given by
[TABLE]
We consider in our application. Then the eigenvalues of are all real and given by
[TABLE]
Let be the Type-2 1D Discrete Cosine Transform (DCT) matrices and be the inverse DCT. Then is the Type-2 DCT in the 3D volume of voxels and is the inverse 3D DCT. Then is the 3D DCT of with entries . Then (E11) reduces to
[TABLE]
Further, the smoothed estimator can be obtained by the inverse DCT of , where is the diagonal matrix with the diagonal element corresponding to the th voxel given by (E12).
S2 Performance of FAST Algorithms on Simulated Null-Activated SPMs
The second reviewer expressed continued concerns regarding multiple significance. Our use of is only to set a threshold: its connections with testing in deciding on this threshold notwithstanding, our algorithm is not built on, and does not use, the actual nominal significance level. Further, performance of FAST on the resting state dataset of Section III-C1 did not raise concerns of identifying false positives. Nevertheless, we also performed a comprehensive and thorough 2D simulation study to evaluate the issue of false positives. We generated SPMs under the null hypothesis of no activation, and used ALL-, AM- and AR-FAST for different and evaluated the incidence of even one pixel in an image (falsely) identified as activated. Specifically, we generated SPMs where is a 2D circulant matrix, with first order autocorrelation structure indexed in each dimension by . (For simplicity, we used a radial correlation structure in the simulation, but not in the estimation and activation detection.) We used to provide a comprehensive evaluation even though in our experience most SPMs from real datasets have estimated s substantially lower than 0.1. Our SPMs were of dimensions . For each , we simulated 1000 SPMs to account for simulation variability in drawing our conclusions and then applied both the FAST algorithms for a range of .
Table S1 represents the number of cases (out of 1000) where ALL-FAST, AM-FAST or AR-FAST for different and found (falsely) even one activated voxel. For , AR-FAST had no false positives. while for AM- and AR-FAST, there were (out of several thousands) only a handful of cases where even one pixel was falsely identified as activated and, in the case of AM-FAST, these all happened at values of . (Indeed, even with ALL-FAST, for , these cases with even one false positive identification mostly had 1-2 pixels in that image falsely identified as activated.) Thus, as with the case of resting state data of Section III-C1, and for all pratical fMRI settings, FAST performs very well in scenarios of no true activation.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. W. Belliveau, D. N. Kennedy, R. C. Mc Kinstry, B. R. Buchbinder, R. M. Weisskoff, M. S. Cohen, J. M. Vevea, T. J. Brady, and B. R. Rosen, “Functional mapping of the human visual cortex by magnetic resonance imaging,” Science , vol. 254, pp. 716–719, 1991.
- 2[2] K. K. Kwong, J. W. Belliveau, D. A. Chesler, I. E. Goldberg, R. M. Weisskoff, B. P. Poncelet, D. N. Kennedy, B. E. Hoppel, M. S. Cohen, R. Turner, H.-M. Cheng, T. J. Brady, and B. R. Rosen, “Dynamic magnetic resonance imaging of human brain activity during primary sensory stimulation,” Proceedings of the National Academy of Sciences of the United States of America , vol. 89, pp. 5675–5679, 1992.
- 3[3] P. A. Bandettini, A. Jesmanowicz, E. C. Wong, and J. S. Hyde, “Processing strategies for time-course data sets in functional MRI of the human brain,” Magnetic Resonance in Medicine , vol. 30, pp. 161–173, 1993.
- 4[4] K. J. Friston, A. P. Holmes, K. J. Worsley, J.-B. Poline, C. D. Frith, and R. S. J. Frackowiak, “Statistical parametric maps in functional imaging: A general linear approach,” Human Brain Mapping , vol. 2, pp. 189–210, 1995.
- 5[5] A. M. Howseman and R. W. Bowtell, “Functional magnetic resonance imaging: imaging techniques and contrast mechanisms,” Philosophical Transactional of the Royal Society, London , vol. 354, pp. 1179–94, 1999.
- 6[6] W. D. Penny, K. J. Friston, J. T. Ashburner, S. J. Kiebel, and T. E. Nichols, Eds., Statistical Parametric Mapping: The Analysis of Functional Brain Images , 1st ed. Academic Press, 2006.
- 7[7] M. A. Lindquist, “The statistical analysis of f MRI data,” Statistical Science , vol. 23, no. 4, pp. 439–464, 2008.
- 8[8] N. A. Lazar, The Statistical Analysis of Functional MRI Data . Springer, 2008.
