Factor Analysis for Spectral Estimation
Joakim And\'en, Amit Singer

TL;DR
This paper introduces a subspace projection method for spectral estimation that improves accuracy for short signals by leveraging a statistical model of fixed sources, with demonstrated success in cryo-electron microscopy data.
Contribution
It proposes a novel subspace-based spectral estimation technique with theoretical guarantees, addressing limitations of existing methods for short signals.
Findings
Enhanced spectral estimation accuracy for short signals.
Theoretical guarantees for the proposed method.
Successful application to cryo-electron microscopy data.
Abstract
Power spectrum estimation is an important tool in many applications, such as the whitening of noise. The popular multitaper method enjoys significant success, but fails for short signals with few samples. We propose a statistical model where a signal is given by a random linear combination of fixed, yet unknown, stochastic sources. Given multiple such signals, we estimate the subspace spanned by the power spectra of these fixed sources. Projecting individual power spectrum estimates onto this subspace increases estimation accuracy. We provide accuracy guarantees for this method and demonstrate it on simulated and experimental data from cryo-electron microscopy.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Factor Analysis for Spectral Estimation
Joakim Andén
Program in Applied and Computational Mathematics
Princeton University
Princeton, NJ, 08544
Email: [email protected]
Amit Singer
Department of Mathematics and
Program in Applied and Computational Mathematics
Princeton University
Princeton, NJ, 08544
Abstract
Power spectrum estimation is an important tool in many applications, such as the whitening of noise. The popular multitaper method enjoys significant success, but fails for short signals with few samples. We propose a statistical model where a signal is given by a random linear combination of fixed, yet unknown, stochastic sources. Given multiple such signals, we estimate the subspace spanned by the power spectra of these fixed sources. Projecting individual power spectrum estimates onto this subspace increases estimation accuracy. We provide accuracy guarantees for this method and demonstrate it on simulated and experimental data from cryo-electron microscopy.111The authors were partially supported by Award Number R01GM090200 from the NIGMS, FA9550-12-1-0317 from AFOSR, Simons Investigator Award and Simons Collaboration on Algorithms and Geometry from Simons Foundation, and the Moore Foundation Data-Driven Discovery Investigator Award.
I Introduction
Estimating the power spectrum of a stationary field arises in many applications [1, 2, 3, 4]. For example, noise statistics play an important role in maximum likelihood estimation of signal parameters. Hence, its power spectrum must be estimated.
The most basic estimator of the power spectrum is the periodogram, which is asymptotically unbiased but has very high variance. To remedy this, spectral smoothness constraints or parametric models are often imposed [5, 6]. The multitaper estimator [7, 6, 8] is particularly popular, which multiplies the data with fixed tapers, computes their periodograms, and averages. This reduces variance at the expense of smoothing the estimated power spectrum, potentially increasing bias.
In this work, we consider the problem of estimating the power spectra of multiple independent signals. Given identically distributed signals, an ensemble average of their periodograms or multitaper estimates provides a low-variance power spectrum estimate. However, in certain applications, signals are not identically distributed, but combine a small number of identically distributed stochastic sources. For example, a non-stationary signal may be divided up into approximately stationary parts, each of which is a linear combination of some fixed random signals. Alternatively, noise signals may arise from measurements subject to differing experimental conditions which are described by combining various noise sources. This is the case for cryo-electron microscopy (cryo-EM) images, where large molecules are frozen in a thin layer of ice and then imaged by exposing them to an electron beam and recording the transmitted electrons [9, 10]. Variations in experimental conditions result in different noise characteristics. To whiten the projection images, accurate estimates of their noise power spectrum are therefore necessary.
While traditional methods such as multitaper estimates are applied in this context, an estimator which takes into account the low-dimensional variability of the noise distributions could yield improved accuracy. We propose a factor analysis model in which the -dimensional field is a linear combination of fixed fields
[TABLE]
where are independent random variables and are independent linear random fields. Typical values for include for stationary processes and for random images.
It follows that the power spectrum of conditioned on is a linear combination of the power spectra of . That is, the conditional power spectra reside in a low-dimensional subspace and are therefore determined by a small number of factors. We introduce a method for estimating this subspace and these factors given multiple realizations of . The method allows for estimation of the number of factors, which is small for most applications. Linear projection of individual power spectrum estimates onto the factor subspace then yields improved accuracy.
Section II discusses the power spectrum estimation problem for a single signal and describes the basic periodogram and multitaper methods. The factor analysis model and associated power spectrum estimation method are introduced and analyzed in Section III. Finally, Section IV presents numerical results on simulations and experimental data taken from cryo-EM images. Software implementing our proposed method and reproducing the figures of this paper is available at http://github.com/janden/fase/.
II Single Power Spectrum Estimation
Given a zero-mean stationary random field with finite second moments, we define its autocovariance as
[TABLE]
Its Fourier transform is the power spectrum of :
[TABLE]
The power spectrum is symmetric and non-negative.
While is defined over , we are only given samples over some finite domain , where . Let us denote this restriction of to by . Our task is then to estimate given .
II-A Periodogram
The most basic estimator for the power spectrum of a stationary field is the periodogram, given by
[TABLE]
Since is real, is symmetric and only needs to be computed on half of . Let us therefore choose a subdomain of such that and .
To study the properties of the periodogram, we first define a linear stationary field.
Definition 1**.**
A stationary field is linear if
[TABLE]
where is a stationary, zero-mean, white noise field such that and is a kernel satisfying .
For such a field, the periodogram is an asymptotically unbiased estimator for the power spectrum:
Lemma 1**.**
Let be a linear field and let be its restriction . Then we have, for ,
[TABLE]
The proof is obtained as a generalization of the case found in Proposition 10.3.1 of [5]. Recasting the periodogram as the Fourier transform of sums of pairs , the bias is due to not weighting these sums by the number of available pairs. Reweighting eliminates the bias, but at a significant increase in variance, increasing the mean squared error of the estimator. For this reason, the biased periodogram estimator is often preferred over its unbiased variant [6].
Despite its low bias for large , the periodogram is not suitable for power spectrum estimation due its high variance:
Lemma 2**.**
Let and be defined as in Lemma 1 and
[TABLE]
We then have, for all ,
[TABLE]
The proof again follows the one-dimensional case, Proposition 10.3.2 in [5]. It relies on the fact that as , converges to a -distributed random variable. Asymptotically, the standard deviation of is therefore approximately proportional to its expectation .
II-B The Multitaper Method
The multitaper method for spectral estimation was introduced by D. J. Thomson as an alternative to the periodogram with reduced variance at the cost of increased bias [7]. It relies on multiplying the sample by a number of data tapers, computing their periodograms, and averaging.
Given a target spectral resolution , the data tapers for are given by discrete prolate spheroidal sequences for , where . Abusing notation slightly, we denote their tensor products by
[TABLE]
where [11]. The associated multitaper estimator is then given by
[TABLE]
where for all .
The resolution parameter specifies the bias-variance trade-off of the estimator. As is increased, variance is reduced, but the estimate is smoothed, potentially increasing the bias of the estimator.
III Power Spectrum Factor Analysis
In certain applications, we do not have just one signal whose power spectrum we would like to estimate, but a set of signals. To estimate their power spectra, we can apply the methods described in the previous section to each signal individually. If in addition the signals are identically distributed, we can reduce the variance in the estimate by an ensemble average.
On the other hand, if the signals are not identically distributed, an ensemble average will destroy the variability in the power spectra. A different approach is applicable if the signals can be written as linear combinations of a small set of fixed random fields. In this case, factor analysis allows us to identify the subspace spanned by the power spectra of these fixed fields, allowing for increased accuracy by projecting multitaper estimates onto this space.
III-A Motivation: Cryo-Electron Microscopy
Single-particle cryo-EM images are typically very noisy, with signal-to-noise ratios typically below [9, 10]. Many reconstruction algorithms assume that this noise is close to white, but this is rarely the case. As a result, a whitening step is typically performed prior to reconstruction, in which the noise power spectrum is estimated and used to compute a whitening filter that is then applied. Power spectrum estimation is typically done by extracting image patches containing mostly noise and averaging their periodograms [12]. Other methods estimate the noise power spectrum as part of the reconstruction algorithm [13].
However, the noise distribution typically varies between projection images. Indeed, different projection images are obtained under different experimental conditions (ice thickness, electron dose, microscope defocus, electron scattering properties of the molecules, and so on) which affect the characteristics of the noise [15, 16, 17]. The variation in the noise signal can be modeled as a random linear combination of fixed noise sources due to background electrons, electrons scattered by the ice, and those inelastically scattered by the molecule. Experimental images with different noise power spectra are shown in Figure 1. As we will see in the following, the power spectra of these noise signals can be accurately estimated using the proposed factor analysis.
III-B Problem Setup
Let us denote the zero-mean stationary random field of interest by . We model it in (1) as a linear combination of fixed independent linear fields , with independent random coefficients .
Instead of a single field , we are concerned with a set of independent copies of defined by
[TABLE]
for . Here and are independent and identically distributed copies of and , respectively, for . The given data consists of restrictions of to .
The coefficient vectors are unknown, but fixed for each signal. As a result, the power spectrum of interest for is the conditional power spectrum , where we have replaced the expectation in (2) by the conditional expectation . Our problem is therefore estimating given .
One approach is to use the multitaper method described in Section II-B applied to each signal . However, we can obtain improved estimates by exploiting the fact that, for ,
[TABLE]
That is, the conditional power spectra are contained in a subspace of dimension at most spanned by . These power spectra can thus be described using a small set (at most ) of common factors. We can exploit this to increase the accuracy of a power spectrum estimate by projecting it onto this low-dimensional subspace. In the following, we propose a method for estimating this subspace.
III-C Power Spectrum Covariance Estimation
The non-trivial eigenvectors of the covariance matrix span the subspace containing the conditional power spectra. To simplify our expressions, we define in the same way as for and note that these are all identically distributed. We consider estimating the covariance matrix using the following theorem:
Theorem 1**.**
Let and be as in (1) and (11), respectively. Furthermore, let be the restrictions of to . In this case, the quantities
[TABLE]
and
[TABLE]
satisfy
[TABLE]
and
[TABLE]
where . The expected magnitudes and are each bounded by .
Proof.
First, we condition Lemma 1 on to obtain
[TABLE]
for . Taking expectation with respect to then gives
[TABLE]
Averaging then gives us,
[TABLE]
according to the central limit theorem, where . Plugging (18) into (19) then proves (15).
Conditioning Lemma 2 with respect to gives
[TABLE]
Consequently
[TABLE]
Taking the expectation with respect to now gives
[TABLE]
We now have
[TABLE]
Replacing the left-hand side expectations with sums using the central limit theorem yields (16). ∎
The theorem allows us to estimate the covariance matrix of the conditional power spectra. Traditionally, this would be done by forming . However, due to the distribution of the periodogram coordinates, the variances of are larger than they should be, so a correction is needed. That correction takes the form of .
Let us denote this covariance matrix estimate by
[TABLE]
for . Calculating its top eigenvectors allows us to estimate the low-dimensional subspace containing the conditional power spectrum . Specifically, these eigenvectors define a linear space that approximates the subspace spanned by .
III-D Linear Projection Estimators
We now use the information gained in the previous section to obtain a better estimate of the conditional power spectrum from the multitaper estimate . It satisfies
[TABLE]
where the size of the residual depends on and the regularity of with respect to the target multitaper resolution [6]. We know that is in the subspace spanned by the non-trivial eigenvectors of . We can use this to reduce the variance of the residual term.
While the exact covariance is not known to us, we have an estimator . Let be the leading eigenvectors of . We note that does not need to be known beforehand, but can be estimated from the eigenspectrum of . Here, we set to be the number of dominant eigenvalues of by locating the “knee” of the spectrum, where there is a significant drop in amplitude from the th eigenvalue to the th. In addition, since we expect to be contained in the span of , we can also check that the projection of to that span preserves most of its energy.
Projecting onto the span of yields
[TABLE]
This significantly reduces the estimation error since it will typically be almost orthogonal to and thus for . The orthogonality follows from the fact that is approximately isotropically distributed (its coordinates are close to independent random variables), so its inner product with a small set of fixed vectors is small with high probability. Note that is not guaranteed to be non-negative. To remedy this, negative components can be set to zero. However, this does not greatly affect the error of the estimate.
IV Numerical Results
To evaluate our method, we test it on noisy images simulated using (1) for . The first noise source has its energy concentrated in the low frequencies, with a power spectrum of , where is the rectangular function with support . The second source has a power spectrum of . These are combined with normally distributed coefficients in (1) to yield . Figure 2(a) shows two sample images of size .
We calculate noise images of different sizes to study the effect of this parameter on the accuracy of the estimation. Figure 2(b) shows the top eigenvalues of for a dataset of size with image size . There is a good separation of the top two eigenvalues from the bulk of the spectrum, with a spectral gap of of about .
Figure 2(c,d) shows the mean absolute error (MAE) of and with respect to the conditional power spectra as functions of for image sizes and , respectively. The unprojected multitaper estimator was computed with while the others used . For comparison, we plot the baseline estimator of assigning to each power spectrum and the oracle projection of the multitaper estimate, where the top eigenvectors of the population covariance are used in the definition of . For small values of , the covariance estimation fails, so the projected estimator performs worse compared to the unprojected variant . As increases, however, the linear projection improves results, eventually converging to the error obtained using the subspace obtained from the population covariance.
We also evaluate our algorithm using two experimental datasets from cryo-EM, one containing images of size depicting a 70S ribosome [14] and another containing images with depicting an 80S ribosome [18]. The top eigenvalues of are shown for the two datasets in Figure 3(a,b). As in the simulation, we see that a few eigenvalues dominate, with the estimation making up a separate bulk distribution. For the first dataset, a value of seems appropriate, while for the second dataset, we take . This number of components is in line with most noise models for cryo-EM, which typically include two or three noise components [15, 16, 17].
Two sample conditional power spectra estimates for each dataset are shown in Figure 3(c,d). The projection images corresponding to the two power spectra in 3(c) are the ones shown previously in Figure 1. We see that the estimated power spectra are quite reasonable, with the lower-frequency noise image corresponding to the power spectrum concentrated in the low frequencies while the image with “whiter” noise has a flatter estimated power spectrum.
V Conclusion
We have introduced a factor analysis model for noise fields. These arise when the nature of the noise varies between measurements such as in cryo-EM, where the noise is affected by various experimental factors. To estimate the individual power spectra of each noise image we introduce a new estimation method which relies on approximating the covariance of these spectra. The method is shown to provide accurate results in both simulated and experimental datasets.
Acknowledgment
The authors thank Fred Sigworth for discussions about cryo-EM noise models, which inspired this work. They also thank Frederik Simons for his helpful comments.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] P. Stoica and R. L. Moses, Introduction to Spectral Analysis . Prentice Hall, 1997, vol. 1.
- 2[2] G. Bond et al. , “A pervasive millennial-scale cycle in North Atlantic Holocene and glacial climates,” Science , 278(5341): 1257–1266 (1997).
- 3[3] C. Harig and F. J. Simons, “Mapping Greenland’s mass loss in space and time,” Proc. Natl. Acad. Sci. , 109(49): 19934–19937 (2012).
- 4[4] A. Delorme and S. Makeig, “EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis,” J. Neurosci. Meth. , 134(1): 9–21 (2004).
- 5[5] P. J. Brockwell and R. A. Davis, Time Series: Theory and Methods , 2nd ed. Springer-Verlag New York, 1991.
- 6[6] D. B. Percival and A. T. Walden, Spectral Analysis for Physical Applications . Cambridge University Press, 1993.
- 7[7] D. J. Thomson, “Spectrum estimation and harmonic analysis,” Proceedings of the IEEE , 70(9): 1055-1096 (1982).
- 8[8] L. D. Abreu and J. L. Romero, “MSE estimates for multitaper spectral estimates and off-grid compressive sensing,” ar Xiv:1703.08190 , 2017.
