A Multitaper, Causal Decomposition for Stochastic, Multivariate Time Series: Application to High-Frequency Calcium Imaging Data
Andrew T. Sornborger, James D. Lauderdale

TL;DR
This paper introduces a multitaper, causal decomposition method for multivariate stochastic time series that considers the full lagged covariance, improving the analysis of neural data and circuit connectivity.
Contribution
It presents a novel multitaper-based decomposition technique that leverages the full lagged covariance for better causal analysis of multivariate time series.
Findings
Method outperforms standard zero-lag approaches in simulations
Reveals important dynamical information in neural imaging data
Enhances understanding of circuit connectivity
Abstract
Neural data analysis has increasingly incorporated causal information to study circuit connectivity. Dimensional reduction forms the basis of most analyses of large multivariate time series. Here, we present a new, multitaper-based decomposition for stochastic, multivariate time series that acts on the covariance of the time series at all lags, , as opposed to standard methods that decompose the time series, , using only information at zero-lag. In both simulated and neural imaging examples, we demonstrate that methods that neglect the full causal structure may be discarding important dynamical information in a time series.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlind Source Separation Techniques · Neural Networks and Applications · Neural dynamics and brain function
A Multitaper, Causal Decomposition for Stochastic, Multivariate Time Series: Application to High-Frequency Calcium Imaging Data
Andrew T. Sornborger
Department of Mathematics, University of California, Davis, CA
Email: [email protected]
James D. Lauderdale
Department of Cellular Biology, University of Georgia, Athens, GA
Email: [email protected]
Abstract
Neural data analysis has increasingly incorporated causal information to study circuit connectivity. Dimensional reduction forms the basis of most analyses of large multivariate time series. Here, we present a new, multitaper-based decomposition for stochastic, multivariate time series that acts on the covariance of the time series at all lags, , as opposed to standard methods that decompose the time series, , using only information at zero-lag. In both simulated and neural imaging examples, we demonstrate that methods that neglect the full causal structure may be discarding important dynamical information in a time series.
Index Terms:
Neural Imaging, Multivariate Time Series, Matrix Decomposition, Causality, Multitaper Methods, Spectral Analysis, Dimension Reduction.
I Introduction
The technology for imaging neural systems has improved to the point where we can rapidly be overcome by the sheer size of our datasets. For this reason, a significant amount of effort has been expended to develop dimensional reduction methods. Independent component analysis [1, 2], non-negative matrix factorization [3], generalized linear models [4], principal component analysis [5], wavelet analyses [6, 7], and other matrix factorizations [8] have all been used to advantage in the analysis of neural imaging data.
The lagged-covariance
[TABLE]
and its Fourier transform, the cross-spectrum,
[TABLE]
are fundamental to the description of a stationary random process, . For a stationary process, the cross-spectrum is uncorrelated as a function of frequency, . Therefore, statistical estimates of stationary processes are often performed in the frequency domain. Since estimates of the cross-spectrum are approximately independent and identically distributed (i.i.d.), confidence intervals can typically be calculated more easily in the frequency domain.
Despite the fact that the Fourier transform can decorrelate a stationary process, uncertainty principles are also fundamental to the analysis of time series. They are a statement that a function that has well-localized support in one set of coordinates, say time, becomes delocalized when transformed to another set of coordinates, i.e. frequency. Therefore, although the statistics of a stationary process are more tractable in frequency coordinates, if the covariance of the process, , has more localized temporal support than the cross-spectrum, , then frequency-by-frequency estimation of covariance matrices can result in a loss of statistical power, since the power can be spread across many frequencies. This statement also holds in the opposite direction if the cross-spectrum has more localized frequency support than the covariance.
The goal of this paper is to improve the detection and estimation of lagged-covariance or cross-spectral information using multivariate, multitaper methods [9, 10]. We show that these methods can capture information in the data that goes unseen with standard dimensional reduction methods based only on zero-lag information.
II Methods
To attack this problem, we set up a framework for the detection and estimation of statistically significant covariance or cross-spectral information in multivariate data. The dimensional reduction of covariance or cross-spectral tensors is a technique that we have found to be very useful in revealing the structure latent in time series data. We use multitaper methods to construct our estimates. Multitaper methods provide a set of approximately independent estimates of the covariance or cross-spectrum allowing us to form consistent estimates with low broad-band frequency bias.
II-A The Probabilistic Model
We will assume that the time series of interest may be represented, at least approximately or over a given time window, as a band-limited, weakly stationary process
[TABLE]
Here, represents a vector of length of orthogonal increment Cramér processes (c.f. [9, 11, 12, 13]) for which
[TABLE]
and
[TABLE]
With these assumptions, the covariance at lag may be written
[TABLE]
[TABLE]
giving
[TABLE]
Here and below, bolded letters denote vectors; capital, unbolded letters denote matrices; small, unbolded letters denote scalars; T denotes the matrix transpose, † denotes the Hermitian transpose (complex conjugated and transposed) and denotes an expectation value.
II-B Causal Decomposition: Decomposing the Lagged-Covariance and Cross-Spectral Tensors
The most commonly used method for decomposing a data matrix is the singular value decomposition (SVD). This decomposition is used to estimate the principal components of stochastic processes and is so useful in the study of system dynamics (among other things) that it was reinvented many times during the 20th century and is also called the proper orthogonal decomposition (POD), the Karhunen-Loève (KL) transform and the method of empirical eigenfunctions.
The SVD of a vector time series is a decomposition of the form
[TABLE]
where
[TABLE]
the are mutually orthogonal, the are listed in descending order and are the normalized projections , and are mutually orthogonal. Note that the operator on the left of (9) is just the zero-lag covariance matrix, . Other decompositions, such as those mentioned in the Introduction are also based on reductions that use zero-lag information.
Typically, the SVD is first used to find left, , and right, , eigenvector bases for . Then, for dimensional reduction, the standard procedure is to truncate the decomposition by setting a set of small singular values, , to zero and reconstructing in the subspace of the remaining left and right eigenvectors.
As we saw in the Introduction and above, for stationary processes, the fundamental quantity of interest for modeling a stochastic system is the set of lagged covariance matrices, , or the cross-spectral matrices at different frequencies, . These translationally-invariant quantities measure how the stochastic system evolves over time depending on its previous history. Therefore, instead of applying the SVD to the raw data, , we apply it to estimates of the tensor or (note that once or is discretized, these quantities are indeed rank-three tensors). The SVD is not unique for tensors of rank higher than two. However, since and are clearly special since they are ordered, we will decompose or along these indices. Thus, the decomposition of the covariance becomes
[TABLE]
where the are eigen-covariance matrices, the are singular values and the are time- or lag-like eigenvectors. Here, , where . An analogous treatment results in the decomposition of the cross-spectrum
[TABLE]
where the are eigen-cross spectral matrices, the are singular values and the are frequency-like eigenvectors.
Since these are decompositions of quantities that are fundamentally related to the causal structure of stationary processes, we will refer to Eqs. (10) and (11) as Causal Decompositions (CDs).
II-C Multitaper Estimation
To obtain consistent statistical estimates of and , we resort to multitaper spectral analysis [9]. Multitaper methods have been used successfully in many applications in the analysis of neural time series [14]. Multitaper analysis is based on the projection of a time series onto a set of orthogonal ‘tapers’ called Slepians (also called discrete prolate spheroidal sequences or DPSSs). Slepians are a complete set of functions that are ordered in terms of their frequency concentration and serve as optimal band-pass filters provided the maximum index . Here, is the number of points in the time series and is a user-defined frequency bandwidth. The data is first tapered, then Fourier transformed, resulting in a set of frequency estimates, so-called eigenestimates, with low broad-band bias (i.e. estimates of amplitudes at a given frequency are relatively uninfluenced by frequencies outside of the bandwidth ).
The data is averaged across the eigenestimates, resulting in smoothed spectral estimates with reduced variance (relative to a single, tapered periodogram).
Because multitaper techniques rely on a pre-defined bandwidth , the CDs described here may be smoothed by the user to different degrees. More smoothing gives rise to smaller confidence intervals, but estimates that have lower resolution in frequency, whereas less smoothing results in larger confidence intervals, but higher frequency resolution.
Define the tapered eigenestimate
[TABLE]
where is a Slepian function. A multitaper estimate of is given by
[TABLE]
may then be estimated via (7).
II-D Statistical Significance
The estimate is Wishart distributed with degrees of freedom [10]. However, the distribution of is more complicated, due to the Fourier transform in (7). Furthermore, we wish to obtain confidence intervals on the eigen-cross spectral matrices, singular values and frequency-like eigenvectors of the SVD of (and the corresponding objects for the decomposition of ). Jackknife (leave-one-out) estimates are well-suited for application in these circumstances. First, we outline the jackknife estimation procedure for . We define
[TABLE]
where indicates the sum over all tapers except the ’th taper. We now perform an SVD on resulting in sets of eigen-covariance matrices, , singular values, , and frequency-like eigenvectors, , where and .
From these sets, we calculate estimated means and variances for , , and . With estimates of the mean and variance, it is straightforward to calculate confidence intervals for the eigen-covariance matrices, eigenvectors and singular values.
This procedure for calculating mean and variance estimates of the objects resulting from the decomposition of may also be used to calculate mean and variance estimates of the objects resulting from the decomposition of . In this case, we perform SVDs on , where
[TABLE]
and calculate the jackknife estimates as we did for . Notice that even though is not independent of , the sample is independent of because it is calculated with Slepian tapers whose Fourier transform is orthogonal to the other Slepian tapers in the frequency domain. Thus the samples are approximately i.i.d.. This is an example of a transformation-based-bootstrap method [15], which leaves out independent samples in the frequency domain.
For practical purposes, the CDs in (10) and (11) may be computed as ‘raw’ estimates, i.e. without an attempt to estimate statistics of the process. The use of multitaper methods does increase the time required to calculate a CD. Therefore, for cursory or preliminary analyses, it should be remembered that the number of tapers may be decreased or multitaper methods can be done away with entirely. The CD may be computed by simply forming a ‘raw’ estimate of or , then performing the SVD on the raw estimate. In this case, we have no confidence intervals, but we can get a sense of which eigen-matrices, or , represent the dominant contributions to the covariance or cross-spectrum, the form of the singular value spectrum, and the form of the eigenvectors or .
III Results
We tested our method on both simulated and actual neural imaging data. The simulated data analysis is meant to demonstrate a situation in which a standard data reduction would not detect any statistically significant structure, but a CD should find statistically significant correlations in the data. The neural imaging data that we analyze are 1000Hz, fluorescence line-scan measurements of seizure-related calcium activity using the ratiometric, genetically-encoded calcium indicator cameleon YC2.1 in a larval zebrafish. We demonstrate that with this data set our method detects structure that would not have been detected with a standard analysis.
III-A Simulated Data
We used an autoregressive (AR) process to generate a stationary multivariate time series, . The process was
[TABLE]
where the matrix is depicted in Fig. 1B and is an i.i.d. Gaussian random vector. Note that with this matrix, there was no feedback in the time series. Time series through were simply added to time series at the fixed lags of , and time units. Therefore the covariance tensor of this time series was diagonal at (Fig. 1A) (capturing the variance of each time series), proportional to at lags , , and and proportional to at lags , , and . Because all variables in the process have the same variance, is proportional to the identity matrix and an SVD of this data, whose left singular vectors are eigenvectors of , outputs random right/left singular vectors with ordered, normally distributed singular values.
In Fig. 2, we present results from a CD of the simulated AR process. The first eigen-cross-spectral estimates of the multitaper CD captured the diagonal cross-spectral matrix (see Fig. 1A for the theoretical matrix, the first eigen-cross-spectral matrix estimate is not shown). The second eigen-cross-spectral matrix of the multitaper CD captured the covariance due to A (Fig. 2B, this matrix is ), with estimation improving as increased (Fig. 2C,D). Note also that, since the covariance tensor of this time series only has narrow (-function) support at the discrete lags [math], , , , the uncertainty principle predicts that the cross-spectral tensor will have broad support across many frequencies, but the covariance tensor will have sharp (-function) support as a function of time (lag). For this reason, the ‘raw’ CD in the time (lag) domain is more statistically efficient than low multitaper estimators (in the sense of smallest mean squared error, Fig. 2F) due to the few, high signal-to-noise peaks in covariance support (Fig. 1E, red point in F). However, we have no confidence intervals for such raw estimators. We would expect processes with narrow frequency support to be more efficiently computed in the frequency domain.
III-B High-frequency Calcium Imaging Data
Calcium imaging data taken at high frequencies are capable of detecting individual action potentials, but at the cost of a reduced signal-to-noise due to the reduced number of photons per image in an acquisition. Thus, dimensional reduction methods are important for the analysis of such data. Action potentials in vivo are typically modeled as a stochastic point process. Because individual neurons communicate with each other with synaptic propagation times on the order of tens of milliseconds, one expects that a data-reduction using an SVD would not give useful results since only zero-lag correlations are being taken into account. This is indeed the case with the data set that we analyzed here. However, using a multitaper CD, we found a number of statistically significant events that went undetected with the SVD analysis.
The line-scan calcium imaging data that we analyzed was taken from a line through the dorsal central nervous system of a 9 days-post-fertilization (dpf) larval zebrafish (Fig. 3A). The data was taken in a set of 60 contiguous 8.192 second segments (Fig. 3B) for a total of 491.52 seconds ( minutes). A multitaper CD was performed on each segment. In order to visualize statistically significant events, the spectrum of a data matrix with normally distributed noise of equal power to the data in a given segment was subtracted from the spectrum of the CD (Fig. 3C). A series of statistically significant events were detected, three of which are shown in Fig. 3D-F in more detail. Each of these panels shows the two most significant eigen-cross-spectral matrices and the logarithm of the absolute value of the eigen-cross-spectrum. Note that the first eigen-cross-spectral matrix is, to a very good approximation, the identity. This is the matrix that would be diagonalized by the right eigenvectors of an SVD and would fail to identify the cross-spectral information contained in the second (and subsequent) eigen-cross-spectral matrices and eigen-cross-spectra.
The eigen-matrices and eigen-spectra show different structure from event to event. We will not interpret this here because it is impossible to interpret causal relationships in line scan data; the whole neuronal circuit is not sampled. However, due to the variety of relationships between line-scan pixels in the data, it is clear that a rich variety of neural mechanisms could be at play. Given a complete sample of neurons, using, for instance light-sheet microscopy, one would be able to make better guesses at the neural circuits that underlie these structures.
IV Discussion and Conclusions
We have shown that the causal decomposition presented here can detect information that would normally go unnoticed in the standard multivariate dimensional reduction methods that are used to analyze imaging data.
The CDs described here are a useful way of summarizing the information contained in the covariance and cross-spectrum. Like the standard SVD, which may be truncated to denoise a dataset , the CD may be used to denoise or . The results of a CD may be readily visualized to give the user a clearer understanding of the underlying covariance or cross-spectral structure of a time series. They may also be used to obtain improved estimates of multivariate auto-regressive processes and, hence, improved predictions from measurements of the process. If the user wants to use the CD to dimensionally reduce the dataset (not the covariance or cross-spectrum), since the eigen-cross-spectra are hermitian, , (alternatively, , thus ) will have an orthogonal, set of real eigenvectors. These may be listed in order of their covariance, thresholded, and the dataset may be projected on them, giving a reconstruction of the data retaining causal information.
We have made no attempt to optimize the decompositions. Improvements could be made for instance because the matrices are hermitian and . Similarly, . We note that a CD may be also be useful when performed on the coherency, amplitude spectrum, or other objects often investigated in the analysis of time series, instead of the cross-spectrum.
By using multitaper techniques, our approach gives us statistics that may be used to obtain confidence intervals on the various objects (eigen-covariance, singular values and lag- or frequency-like eigenvectors) that result from the decomposition. They may also be used to test whether the objects resulting from the decomposition, for instance the singular value spectrum, of a given covariance or cross-spectrum are statistically different from others. As an example, one can form a two-sample -statistic to test for non-stationarity of a process by calculating CDs of two different time-windows of the process and testing whether the distribution of singular values differs between them.
Acknowledgment
This work was funded by the National Institutes of Health, CRCNS program NS090645 (ATS and JDL).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. J. Mc Keown, S. Makeig, G. G. Brown, T. P. Jung, S. S. Kindermann, A. J. Bell, and T. J. Sejnowski, “Analysis of f MRI data by blind separation into independent spatial components,” Hum Brain Mapp , vol. 6, no. 3, pp. 160–188, 1998.
- 2[2] A. Hyvarinen and E. Oja, “Independent component analysis: algorithms and applications,” Neural Netw , vol. 13, no. 4-5, pp. 411–430, 2000.
- 3[3] L. Tao, J. D. Lauderdale, and A. T. Sornborger, “Mapping Functional Connectivity between Neuronal Ensembles with Larval Zebrafish Transgenic for a Ratiometric Calcium Indicator,” Front Neural Circuits , vol. 5, p. 2, 2011.
- 4[4] S. J. Kiebel, J. B. Poline, K. J. Friston, A. P. Holmes, and K. J. Worsley, “Robust smoothness estimation in statistical parametric maps using standardized residuals from the general linear model,” Neuroimage , vol. 10, no. 6, pp. 756–766, Dec 1999.
- 5[5] R. M. Everson, A. K. Prashanth, M. Gabbay, B. W. Knight, L. Sirovich, and E. Kaplan, “Representation of spatial frequency and orientation in the visual cortex,” Proc. Natl. Acad. Sci. U.S.A. , vol. 95, no. 14, pp. 8334–8338, Jul 1998.
- 6[6] T. P. Patel, K. Man, B. L. Firestein, and D. F. Meaney, “Automated quantification of neuronal networks and single-cell calcium dynamics using calcium imaging,” J. Neurosci. Methods , vol. 243, pp. 26–38, Mar 2015.
- 7[7] C. Park, N. A. Lazar, J. Ahn, and A. Sornborger, “A multiscale analysis of the temporal characteristics of resting-state f MRI data,” J. Neurosci. Methods , vol. 193, no. 2, pp. 334–342, Nov 2010.
- 8[8] A. Sornborger, J. Broder, A. Majumder, G. Srinivasamoorthy, E. Porter, S. S. Reagin, C. Keith, and J. D. Lauderdale, “Estimating weak ratiometric signals in imaging data. II. Meta-analysis with multiple, dual-channel datasets,” J Opt Soc Am A Opt Image Sci Vis , vol. 25, no. 9, pp. 2185–2194, Sep 2008.
