Common lines ab-initio reconstruction of $D_2$-symmetric molecules
Eitan Rosen, Yoel Shkolnisky

TL;DR
This paper introduces a novel common lines algorithm specifically designed for ab-initio reconstruction of molecules with D2 symmetry in cryo-electron microscopy, leveraging all pairwise and within-image common lines.
Contribution
The work presents the first common lines algorithm tailored for D2-symmetric molecules, extending existing methods from cyclic to dihedral symmetry.
Findings
Successfully applied to experimental cryo-EM data
Demonstrates accurate structure determination for D2-symmetric molecules
Extends common lines methodology to new symmetry class
Abstract
Cryo-electron microscopy is a state-of-the-art method for determining high-resolution three-dimensional models of molecules, from their two-dimensional projection images taken by an electron microscope. A crucial step in this method is to determine a low-resolution model of the molecule using only the given projection images, without using any three-dimensional information, such as an assumed reference model. For molecules without symmetry, this is often done by exploiting common lines between pairs of images. Common lines algorithms have been recently devised for molecules with cyclic symmetry, but no such algorithms exist for molecules with dihedral symmetry. In this work, we present a common lines algorithm for determining the structure of molecules with symmetry. The algorithm exploits the common lines between all pairs of images simultaneously, as well as common linesâŠ
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsElectron and X-Ray Spectroscopy Techniques · Advanced Electron Microscopy Techniques and Applications · Advanced X-ray Imaging Techniques
Common lines ab-initio reconstruction of -symmetric molecules
Eitan Rosen and Yoel Shkolnisky
Abstract
Cryo-electron microscopy is a state-of-the-art method for determining high-resolution three-dimensional models of molecules, from their two-dimensional projection images taken by an electron microscope. A crucial step in this method is to determine a low-resolution model of the molecule using only the given projection images, without using any three-dimensional information, such as an assumed reference model. For molecules without symmetry, this is often done by exploiting common lines between pairs of images. Common lines algorithms have been recently devised for molecules with cyclic symmetry, but no such algorithms exist for molecules with dihedral symmetry.
In this work, we present a common lines algorithm for determining the structure of molecules with symmetry. The algorithm exploits the common lines between all pairs of images simultaneously, as well as common lines within each image. We demonstrate the applicability of our algorithm using experimental cryo-electron microscopy data.
1 Introduction
Cryo-electron microscopy (cryo-EM) is a technique for acquiring two-dimensional projection images of biological macromolecules [5]. In this technique, a large number of copies of the same molecule is rapidly frozen in a thin layer of ice, fixing each molecule in some random unknown orientation. The frozen specimen is then imaged by an electron microscope, producing a set of two-dimensional projection images (defined below). Once the imaging orientations of the frozen molecules which produced the images are obtained, the three-dimensional structure of the molecule can be recovered from the projection images by standard tomographic procedures.
Formally, choosing some arbitrary fixed coordinate system of , the orientations of the imaged molecules at the moment of freezing can be described by a set of rotation matrices
[TABLE]
where we denote by the set , and is the group of rotation matrices. We denote the density function of the molecule by , where , and by the image generated by the microscope when imaging a copy of rotated by . The image is then given by the line integrals of along the lines parallel to , namely
[TABLE]
The orthogonal unit vectors and , which span the plane perpendicular to , form the coordinate system for the image from the point of view of an observer looking from the direction of the electron beam. We refer to and its perpendicular plane as the beaming direction and the projection plane of , respectively.
We can now state the âorientation assignment problemâ as the task of finding a set of matrices such that (1.2) is satisfied for all , given only the images (in particular, in (1.2) is unknown). In this work, we address the task of determining the orientations of a set of projection images obtained from a molecule with symmetry.
In plain language, a -symmetric molecule has three mutually perpendicular symmetry axes, where after we rotate the molecule by about any of these axes, the molecule looks exactly the same. Formally, let
[TABLE]
denote the three rotation matrices by about the and axes, respectively. For notational convenience we also denote the identity matrix by . Choosing a coordinate system in which the rotational symmetry axes of the molecule coincide with the and axes, the symmetry property implies that
[TABLE]
for any . Considering any projection image , by (1.4) we have
[TABLE]
for . Equation (1.5) shows that a -symmetric molecule induces an ambiguity in which all orientation assignments of the form , are consistent with the same set of images .
An additional ambiguity inherent to cryo-EM arises from the well known fact that the handedness (chirality) of a molecule cannot be established from its projection images. Denoting by the reflection matrix through the -plane, we define by the mirror image of the molecule . Since , we also have that , and thus, by (1.2) we have
[TABLE]
where . Noting that , and changing the variable to we have
[TABLE]
which shows that any projection image of the molecule is identical to the projection of its mirror image molecule . Thus, both orientation assignments and are consistent with the same set of projection images .
The âorientation assignment problemâ for a -symmetric molecule can now be stated as the task of finding one of the sets of matrices or satisfying (1.2), where each matrix can be independently replaced by a matrix .
2 Common lines and their induced geometry
One of the principal approaches to solving the orientation assignment problem, which we also employ in this work, relies on the well know Fourier slice theorem [7], which states that the two-dimensional Fourier transform of a projection image is a central planar slice of the three-dimensional Fourier transform of the molecule . Formally, denoting by the three-dimensional Fourier transform of , and by the two-dimensional Fourier transform of the image , we have that
[TABLE]
Note that the central slice of in (2.1) is spanned by the vectors and (the first two columns of the rotation matrix of (1.1)). Since any two non-coinciding central planes intersect at a single line through the origin, it follows that any pair of transformed projection images and for which share a unique pair of identical central lines. Formally, consider the unit vector
[TABLE]
By the definition of the cross product, is perpendicular to the normal vectors of both projection planes of and , and thus it gives the direction of their common line (see [11] for a detailed discussion). We can express using its local coordinates on both projection planes by
[TABLE]
where and are the angles between and the local -axes of the planes. Using this notation, the common line property implies that
[TABLE]
Now, let and be a pair of images of a -symmetric molecule. By (1.5), the image (also denoted ) is identical to the three images and . However, each of these four images corresponds to a plane, and by (1.2) the four planes are different from each other since their rotation matrices are different. Each of these planes has a unique common line with the projection plane of , and thus, we conclude that there are four different pairs of identical central lines between the images and . The directions of all four common lines are given by the unit vectors
[TABLE]
where for convenience we denote , since is the identity matrix. We write the local coordinates of the common lines of the pairs of images and , on the respective projection plane of each image (see (2.3)), as
[TABLE]
for . Thus, the common line property for -symmetric molecules implies that
[TABLE]
for . Throughout the following sections, we refer to all four common lines as the common lines of the images and .
An additional feature of symmetric molecules, and in particular of -symmetric molecules, is self common lines. For any , the images and of a -symmetric molecule, are identical. However, each image corresponds to a different projection plane, and thus, the projection plane of has a unique common line with each of the planes of and . The directions of these common lines are given by
[TABLE]
Thus, there are three different pairs of central lines in , corresponding to the common lines between the pairs and , which satisfy
[TABLE]
for , where
[TABLE]
for . We refer to all three common lines in (2.9) as the self common lines of the image .
3 Related work
Many of the current common line methods for orientation assignment are based on the method of angular reconstitution by Van Heel [13]. The core idea of the angular reconstitution method is rooted at the observation that the intersection of any three non-coinciding central planes establishes the orientations of all three, relative to each other. In particular, Van Heel shows how given a triplet of projection images , one can obtain either of the sets of relative rotation matrices or , by using the common lines between and . Both choices of relative rotation matrices are equally consistent with the images and , and are just the manifestation of the handedness ambiguity discussed in the previous section. The angular reconstitution method then makes the assumption that (without loss of generality) , which immediately establishes and from and . The orientations of the rest of the images for are obtained by fixing the pair of images and , and applying the same method sequentially to each triplet of images to retrieve , which immediately establishes by . Note that since all relative rotations and subsequently the rotations themselves were obtained by combining each of the images with the same pair of images , the angular reconstitution method ensures that we obtain a hand-consistent assignment, i.e. we either obtain or . Thus, we can recover either the original molecule or its mirror image.
The most commonly used procedure for estimating the common line of a pair of images and , is to calculate their Fourier transforms and then find a maximally correlated pair of central lines between the transformed images, see e.g. [12]. As the images obtained by cryo-EM are contaminated with high levels of noise, in practice the probability of correctly detecting their common lines, and subsequently their rotations using the angular reconstitution method, is low. In [11], Shkolnisky and Singer describe an algorithm for estimating the rotations which achieves robustness to noise by employing an approach known as âsynchronizationâ. In this approach, the rotations are estimated using all the relative rotations together at once. The authors use the set to construct a block matrix known as a âsynchronization matrixâ, whose block of size is given by , that is
[TABLE]
Defining the matrix , we see that
[TABLE]
and thus we can obtain by factoring using SVD.
A method for estimating the set of relative rotations in a non-sequential manner is given in [12], and takes advantage of the following observation: the relative rotation of can be estimated from the common lines between these images and any of the images , where . The authors of [12] show how to obtain a robust estimate of the relative rotation of each pair and , by taking a majority vote over all these estimates. Note that in order to construct the synchronization matrix , one has to obtain a hand-consistent set of relative rotations, i.e. either the set or the set . However, if the estimates or are obtained independently for each pair of images, this cannot be guaranteed. A solution to this issue is given in [8].
A direct application of any of the common lines based methods described above to a symmetric molecule encounters substantial difficulties stemming from the ambiguity described by (2.7). Suppose that given a pair of images and of a -symmetric molecule, we wish to estimate the relative rotation by applying the angular reconstitution method. By (2.7), we can detect four different common lines between the images, corresponding to four different pairs of projection planes, and we have no way of knowing which common line corresponds to which pair of planes. Thus, combining the common line of , together with common lines with , gives rise to different possible combinations of common line triplets, many of which do not submit an intersection of three planes. For instance, we can erroneously consider a combination of the common lines between the pairs , , and a third pair , from which we cannot establish the relative rotations of and , since we are not considering the correct common lines triplet between the projection planes of these images.
In [12], the authors derive a simple condition by which one can determine whether a triplet of common lines can be realized as the intersection of three central planes. Still, even common line triplets which do satisfy this condition can generate any of the rotations , between which we cannot distinguish. Thus, to construct the synchronization matrix in (3.2), one would have to devise a way to obtain a set of estimates , in which for each all the relative rotations for collectively âagreeâ on the identity of .
A further difficulty stems from the fact that pairs of central lines which are adjacent to each common line between a pair of images are also highly correlated. Thus, attempting to estimate 4 common lines between a pair of noisy images and by simply trying to detect the 4 best correlated central lines is most likely to fail (see [9] for a detailed discussion).
In Section 4 we present a different approach for estimating all four common lines and respective relative rotations of a pair of projection images of a -symmetric molecule, inspired by maximum likelihood methods. In Section 5 we outline an algorithm for extracting the rotations of from the relative rotations estimated by the procedure described in Section 4. This approach encounters 3 major obstacles which are resolved in Sections 6, 7 and 8. In Section 9 we demonstrate the applicability of our method to experimental cryo-EM data. Finally, in Section 10 we summarize and discuss future work.
4 Relative rotations estimation
In this section we present a method for estimating the set of relative rotations for a pair of images and of a -symmetric molecule. Similarly to the maximal correlations approach described in the previous section, we begin by computing the 2D Fourier transform of each image . By (2.7), in the noiseless case, each pair of transformed images and has exactly four pairs of perfectly correlated central lines. Thus, in principle, we can detect these common lines by computing correlations between pairs of central lines in and , and choosing the four maximally correlated pairs. However, as was explained in the previous section, this approach encounters several substantial difficulties. We now describe a different approach inspired by maximum likelihood methods.
Consider the set
[TABLE]
of quadruplets of relative rotations generated from all pairs of rotations with non-coinciding beaming directions, and let us denote the members of by . Given a pair of images and , we now show how one can use the common lines of the images to assign a score to each element , which indicates how well it approximates the quadruplet . Since , it can be detected by searching over for a candidate with the best score . We henceforth refer to as the relative rotations search space.
First, to relate each candidate to the common lines of and , let us compute the vectors
[TABLE]
analogously to (2.5). If , then the set corresponds to the direction vectors of the common lines of and . We subsequently refer to as the set of common lines directions of the quadruplet (corresponding to the pair ). Next, for each candidate , we use the set to compute the coordinate vectors
[TABLE]
for , analogously to (2.6). If , the coordinates in (4.3) correspond to the local coordinates of the common lines of and on the respective projection planes of the images. Denote by
[TABLE]
the half line (known as a Fourier ray) in the direction which forms an angle with the -axis of the transformed image . We then compute the normalized cross correlations
[TABLE]
of each pair of rays given by the direction vectors in (4.3). We use rays instead of lines, since the correlation between each pair of rays and is identical to the correlation value of their anti-podal rays. We then assign to each quadruplet the score
[TABLE]
By (2.7), if for some , then . Thus, the quadruplet with is declared as .
We remark that since in practice we use a discretization of to estimate the relative rotations of pairs of noisy images (see Section 9.1 for details), is never exactly 1, and so we simply choose a candidate which maximizes as an approximation for .
However, we can obtain more robust estimates to by also combining self common lines into the score (4.5). As was explained in Section 2, each image of a -symmetric molecule has three self common lines, given by (2.9), which are the intersections of the projection plane of with the projection planes of , and . We now show how to adjust the score of each candidate to account for the self common lines of each image in the pair and .
For each candidate , we first compute the vectors
[TABLE]
analogously to (2.8). If , then the sets and correspond to the directions of the self common lines of the images and , respectively (see (2.8)). Next, we use and of (4.6) to compute the coordinates
[TABLE]
for , analogously to (2.10). If , then the coordinates in (4.7) correspond to the local coordinates of the self common lines of and , on their respective projection planes (see (2.10)). We then compute the set of normalized autocorrelations
[TABLE]
for , and if for some , then by (2.9) we have . We therefore redefine the score in (4.5) to be
[TABLE]
For each , we then set
[TABLE]
to be .
We remark, that as was explained in Section 1 and Section 3 (see (1.6)), the images and are identical. Thus, the self common lines of each image are identical to the self common lines of , and the common lines between each pair of images and are identical to the common lines between and . Hence, for each , the set which maximizes the score has the same score as . Thus, in (4.10), for each , we either estimate or , independently from other pairs of and .
The procedure for estimating of the sets of relative rotations for each is summarized in Algorithm 1.
5 Estimating the rotation matrices
In the previous section, we have shown how to estimate for each either or . In Section 6 below, we will show how to resolve the handedness ambiguity. Thus, in this section we will assume w.l.o.g that we have the sets for all , and outline how to recover the rotations in (1.2) from these sets.
To recover the matrices row by row, we use the following observation. For each , denote by the diagonal matrix
[TABLE]
and note that
[TABLE]
where were defined in (1.3). Thus, for any pair of matrices and we have that
[TABLE]
for , where and are the rows of the matrices and , respectively. We can also compute the matrices for and , by noting that since are rows of orthogonal matrices we have
[TABLE]
Since in practice the matrices are estimated from noisy images, we get a more robust estimate for by using all , that is, by setting
[TABLE]
for and .
Next, for each , we construct the matrix whose block is given by the rank 1 matrix , and note that
[TABLE]
That is, , , are rank 1 matrices. We can now factorize each matrix using SVD, to obtain either the vector or , hence retrieving either the set of rows or , for each . Then, we can use these sets of rows to assemble the matrices row by row, where is a diagonal matrix with on its diagonal. If , we simply multiply all by , and thus, we can assume w.l.o.g that is a rotation. The matrix is an inherent degree of freedom, since we can always ârotate the worldâ by any orthogonal matrix.
Unfortunately, the approach just described is not directly applicable, as we now explain. Recall from Section 3, that though we can recover the set of relative rotation matrices from the common lines of and , we have no way of knowing for each which of the recovered matrices in the latter set is . This implies, that for each , we can only obtain a permutation of the 4-tuple where is some unknown permutation of . In (5.3), we computed the 3-tuples by summing the first element of the 4-tuple with the rest of its elements. Suppose for example, that we have the 4-tuple for a given a pair of images and . One can easily verify by direct calculation that
[TABLE]
where is the group of all permutations of a 3-tuple. Now, observe that by (5.2) and (5.5) we have
[TABLE]
This implies that the summation in (5.3) of a permutation of the 4-tuple results in a permutation of the respective 3-tuple of rank 1 matrices, where some of the matrices have a spurious factor. The following proposition, the proof of which is given Appendix A.1, summarizes the effect of the aforementioned summation on a general permutation of a 4-tuple , that is, on the order and signs of the respective 3-tuple .
Proposition 5.1**.**
Let for some be a permutation of the -tuple .
If , then the corresponding 3-tuple of rank 1 matrices is given by , where . 2. 2.
If for , then the corresponding 3-tuple of rank 1 matrices is given by
[TABLE]
Proposition 5.1 implies that we can only obtain the ordered triplets
[TABLE]
where the permutations are unknown, and each of the matrices is multiplied by , which is also unknown and depends on . Thus, we cannot construct the matrices in (5.4) using the triplets in (5.6) directly. We will show how to construct using the triplets (5.6) in Sections 7 and 8.
The following three sections are organized as follows. In Section 6, we show a method for handedness synchronization for -symmetric molecules, which is adapted from a method proposed in [8] for non-symmetric molecules. In Section 7, we show how to partition the 3-tuples in (5.6) into three sets of the form for and some unknown signs . Then, in Section 8 we show how to correct the signs so that we can construct the matrices
[TABLE]
where for and . We can then factor the matrices using SVD to obtain the vectors (of length ), which give us the sets of rows for , and assemble the matrices
[TABLE]
Since for each either or is in , that is, for some , we can compute the matrices
[TABLE]
by replacing all matrices which have with . The resulting set of matrices satisfies for all , and are therefore a solution for the orientation assignment problem which was stated at the end of Section 1.
6 Handedness synchronization
Following the discussion in the previous section, we now assume we have obtained a set of 4-tuples
[TABLE]
for some unknown , by applying Algorithm 1. We now explain how to extract one of the hand-consistent sets
[TABLE]
from the set in (6.1).
For all , we denote by
[TABLE]
a 4-tuple of relative rotations consistent with a pair of images and of a -symmetric molecule. We also denote by , the relative rotation in the 4-tuple for , and by the set . We now show how the set in (6.1) can be partitioned into two disjoint sets
[TABLE]
Once we have the partition in (6.4), we can compute the set of 4-tuples . Then, one of the hand-consistent sets in (6.2) is given by .
The partition in (6.4) is derived by the following procedure. First, we construct a graph with vertices corresponding to the estimates in (6.3), and with edges that encode which pairs of estimates and are in the same set in (6.4), and which arenât (as will be explained shortly). Then, we derive the partition in (6.4) from the eigenvector of the leading eigenvalue of the adjacency matrix of . The procedure we present is an adaptation of an algorithm that was derived in [8] for non-symmetric molecules.
We next state a proposition, the proof of which is given in Appendix A.2, which allows us to determine which estimates , , belong to the same set in (6.4). Following the approach in [8], we look at triplets of estimates of the form for all triplets , and determine which members of each such triplet are in the same set of (6.4) and which arenât.
Proposition 6.1**.**
For any consider the triplet of estimates
[TABLE]
that is, and are all in the same set of (6.4). Then, exactly 16 of the matrix products in the set
[TABLE]
satisfy
[TABLE]
Now, consider a triplet of estimates
[TABLE]
and note that since each estimate is either in the set or in the set of (6.4), it must be that either all estimates are in the same set, or two estimates are in one set and the third estimate is in the other. We define the âset configurationâ of a triplet by the row vector
[TABLE]
and denote
[TABLE]
Note that if, for example, is in one set of (6.4) and and are in another, then we have that and are all in the same set. We remark that we have found experimentally that whenever three estimates , and are not in the same set of (6.4), then all the products in (6.7) are far from in norm. Thus, Proposition 6.1 suggests that we can find the set configuration of a triplet of estimates by the following procedure. First, we compute the four sets of norms
[TABLE]
where is the Frobenius norm. Next, we sort the norms in each set and in (6.11) in ascending order, and denote the resulting ascending sequences by and , respectively. Finally, we compute the scores
[TABLE]
By Proposition 6.1, there are exactly 16 norms with value 0 in the set of (6.11) which corresponds to the correct set configuration of a triplet (see (6.9)). Thus, we set for such that is the minimal score in (6.12).
Once we have computed for all , we construct a graph whose vertices correspond to the estimates in (6.3), and whose edges are defined by the adjacency matrix (which we also denote by )
[TABLE]
Finally, we compute the eigenvector which corresponds to the leading eigenvalue of the matrix . In [8], it is shown that the leading eigenvalue of is simple, and that is of the form (up to normalization), where the sign of each entry encodes the set membership in (6.4) of each estimate . The procedure for handedness synchronization for -symmetric molecules is summarized in Algorithm 2.
7 Rotationsâ rows synchronization
At this point, in light of Sections 5 and 6, we assume that we have obtained one of the hand-consistent sets of 4-tuples in (6.2). Let us assume without loss of generality that we have the set . As was explained in Section 5, for each , we now form a 3-tuple of matrices by summing the first element of with each of the rest of its elements. By Proposition 5.1, this results in a set of triplets
[TABLE]
which was defined in (5.6), where are unknown and the signs are also unknown. In this section, we will show how to partition this set of triplets into three disjoint sets
[TABLE]
where are the (unknown) signs of . That is, for each , the set contains all outer products between the rows of the rotation matrices and for , up to sign. This partition will be obtained by casting it as a graph partitioning problem.
In Section 7.1, we show how to encode the partition in (7.1) as a graph in which each vertex corresponds to one of the matrices in (5.6). In Section 7.2, we construct the adjacency matrix of the graph, and in Section 7.3, we show how to extract the partition in (7.1) from the leading eigenvectors of the graphsâ adjacency matrix.
7.1 Graph partitioning formulation
In what follows, we denote the matrices in (7.1) by
[TABLE]
We now construct a weighted graph from which the partition in (7.1) can be inferred. Each vertex in corresponds to one of the matrices in (7.2) (henceforth, we shall refer to both the matrix and its corresponding vertex in using the notation ). Thus, we have that (see (7.2))
[TABLE]
We define the set of weighted edges of the graph by its adjacency matrix, which we also denote by , as follows
[TABLE]
That is, we only connect by an edge vertices which have exactly one index in common. We give this edge a weight if its incident vertices are in the same set of (7.3), and weight otherwise (see Fig. 1).
Note that the weights on the edges of induce a partition of the vertex set into the sets of (7.3), by grouping together vertices which are connected by edges with a weight of . Recovering the partition in (7.3) corresponds to coloring the vertices of with 3 colors, say, red, green, and blue, where the vertices of are colored red, of green and of blue.
In what follows, we show that the partition of to the sets in (7.3) can be derived from eigenvectors of the matrix .
Definition 7.1**.**
Let the eigenvalues of an matrix A be , with their respective multiplicities given by . We denote the spectrum of A by , and write
[TABLE]
Theorem 7.2**.**
The spectrum of the matrix is given by
[TABLE]
The proof of Theorem 7.2 is given in Appendix A.3.
Definition 7.3**.**
Define , , and define the pair of vectors by
[TABLE]
where for any we denote by the entry of with the same index as the row of which corresponds to the vertex .
Throughout Section 7, for any column vector , we denote
[TABLE]
That is, is the column vector that corresponds to the entries of the triplet in .
Proposition 7.4**.**
The vectors and in Definition 7.3 are orthogonal eigenvectors of , corresponding to the eigenvalue .
We prove Proposition 7.4 in Appendix A.4. The immediate consequence of Theorem 7.2 and Proposition 7.4 is the following corollary.
Corollary 7.5**.**
The eigenspace of is spanned by and .
Note that is a unit vector which exactly encodes the partition in (7.3), where the entries and encode the color of each vertex . The unit vector is âcolor blindâ in the sense that it can only distinguish between 2 colors. Obviously, in any 3-coloring of a graph we can always permute the colors, e.g., switch the color of all red vertices to green, green vertices to blue, and blue vertices to red. This is manifested in the following proposition.
Proposition 7.6**.**
Define the â3-colorâ and â2-colorâ vectors by
[TABLE]
respectively. For any , we define the vectors by
[TABLE]
where and are the entries of and with the same index as the row of which corresponds to the vertex . Then, and are orthogonal eigenvectors of of (7.4) in the eigenspace of .
Proof.
Observe that is obtained from of Definition 7.3 by replacing all entries with , all 0 entries with , and all entries with . The vector is obtained from in a similar manner. The proposition then follows by repeating the method of proof applied in Proposition 7.4 with and .
â
Following Corollary 7.5, we recover (up to a color permutation, i.e, one of the vectors of (7.8)) in the following manner. We begin by constructing the matrix . Then, we compute a pair of orthogonal eigenvectors and spanning the eigenspace of (the leading eigenvalue of ). In general, each of these eigenvectors is an orthogonal linear combination of and , and thus, we cannot read the partition in (7.3) directly from any one of them. In Section 7.3, we show how to âunmixâ and and retrieve . In practice, due to noise, we can only compute an approximation of , and thus, we never get the exact values , [math] and . We explain how to deal with this issue in Section 7.3. In the following section, we show how to construct of (7.4) using the set of matrices in (5.6).
7.2 Constructing
We now derive a procedure for constructing the matrix of (7.4). For any two pairs of indices and , we denote by
[TABLE]
the matrix given by the rows of corresponding to the vertices and , and columns of corresponding to the vertices and . By (7.4) and (7.9), we have
[TABLE]
where is the zero matrix. We will now show how to construct block by block, by computing the blocks for which .
The following lemma, the proof of which is given in Appendix A.5, characterizes the indices of the non-zero entries in of (7.4).
Lemma 7.7**.**
Define
[TABLE]
and for define
[TABLE]
Moreover, for , define
[TABLE]
Then, we have that
[TABLE]
By (7.10) and the second equality in (7.14) of Lemma 7.7, to construct , we only need to determine the blocks
[TABLE]
which we now show how to do.
For each triplet of indices , we consider the vertices of corresponding to the pairs of indices , , and , written in the rows of the table
[TABLE]
For each pair of vertices from different rows in (7.16), we need to determine whether this pair belongs to the same set of (7.3) or to different sets. This corresponds to choosing between an edge with a weight of or for each of these pairs in . We therefore show how to determine all edge weights between the vertices in (7.16) simultaneously. This procedure is then repeated for each triplet of indices .
First, observe that for any pair of matrices and , given by (7.2), we have
[TABLE]
since the row vectors and are rows of the orthogonal matrix . This suggests that for each pair of vertices and with unknown , we can determine whether they belong to the same set of (7.3), by simply computing the norm of the product of the matrices which they represent.
Since in practice we work with noisy data, we next show how to get more robust estimates for the edge weights of , by leveraging the graph structure of in conjunction with (7.17). Denote by
[TABLE]
the transposed matrices of (7.2). By (7.17) and (7.18), for each triplet of matrices and , , we have
[TABLE]
and by (7.18), we also have that
[TABLE]
Note that a non-zero product of matrices in (7.19) corresponds to a triplet of vertices in in the same set of (7.3) (see Fig. 1(b)). We now infer which vertices in (7.16) belong to the same set of (7.3), by constructing a function that vanishes for all vertex triplets for which (7.19) and (7.20) hold, namely, for all vertex triplets that belong to the same set of (7.3). Specifically, for each triplet of indices , we minimize the function (which will be explained shortly) given by
[TABLE]
over all and in , and all choices of sign between the 2 terms in each norm (since by (7.19), the sign of the right term in each norm is unknown), independently between the norms.
The rationale of minimizing (7.21) can be demonstrated in the following manner. Writing down the vertices as in (7.16), we seek to rearrange the vertices in the second and third rows so that after rearrangement, the vertices in each column are in the same set of (7.3) (see example in Fig. 2). Whenever we choose a pair of permutations such that the vertices in each column in
[TABLE]
are in the same set of (7.3), by (7.19) and (7.20), we have that all the terms in the sum (7.21) equal zero. Otherwise, if there exists a column in (7.22) in which there is a pair of vertices in different sets of (7.3), then we have that (7.21) is strictly . For example, if in each column of (7.22), there is a pair vertices in different classes, then by (7.19), the left term inside each norm in (7.21) equals zero, while the right term in each norm equals , and we get that .
Once we compute for each a pair of permutations and minimizing in (7.21), the matrix is set block by block by computing all the blocks of (7.15) in the following manner. We first set the first three blocks of (7.15), that is, , and . Consider (7.22), which consists of all the vertices that are incident to the edges that constitute the aforementioned blocks (see (7.9)). The triplet of vertices and , which are all in the same column of (7.22), are in the same set of (7.3), . Thus, we assign a weight +1 to the edges of , , and , which correspond to the pairs of vertices , , and , for . All the edges of , , and which correspond to pairs of the form , , and where , are assigned a weight since they are in a different sets of (7.3). As for the last three blocks of (7.15), that is, , , and , by (7.4) we have that for all , , and . Thus, by (7.9) we have
[TABLE]
Thus, for every it holds that
[TABLE]
and thus, , and can be set according to (7.24) after we compute and .
The procedure for constructing of (7.4) is summarized in Algorithm 3. In the next section, we turn to the task of unmixing the eigenvectors corresponding to the maximal eigenvalue of , in order to extract of Definition 7.3.
7.3 Unmixing the eigenvectors of
By Corollary 7.5, the eigenspace of of (7.4) corresponding the eigenvalue is spanned by and of Definition 7.3. However, any orthogonal linear combination of and is also an eigenvector, and so we can only compute two orthogonal eigenvectors which are linear combinations of and . In this section, we show how to âunmixâ these linear combinations to retrieve .
Suppose that we have computed a pair of orthogonal unit eigenvectors
[TABLE]
spanning the eigenspace of . Since and are unit vectors, by Proposition 7.4 we have
[TABLE]
that is, is also a unit vector. Similarly, we have that . Furthermore, again by Proposition 7.4, we have
[TABLE]
i.e., the coefficients vectors and are unit orthogonal vectors. Denoting
[TABLE]
by (7.26) and (7.27) there exists an angle such that either
[TABLE]
and by (7.25) and (7.29), we have that
[TABLE]
However, it can be easily verified that
[TABLE]
and thus, (7.30) can be written as
[TABLE]
Equation (7.32) suggests that we can if we can recover , we can unmix and and recover either or . By Definition 7.3, is obtained from by switching places between all and values in , and so both and encode the same partition in (7.3) of the vertices of . We now show how to find .
For any angle , we write
[TABLE]
In addition, using the notation introduced in (7.6), for any vector and for all , we define
[TABLE]
and we define by the value of whose magnitude is between and . Then, we define the function by
[TABLE]
The following proposition, the proof of which is given in Appendix A.6, states that the minimum of (7.35) over is obtained at for which and (up to a permutation of the vectors and as in (7.8)).
Proposition 7.8**.**
Out of all orthogonal pairs of unit vectors in the eigenspace of , the 12 pairs of vectors are the unique minimizers of in (7.35), up to normalization.
Thus, if we denote by a minimizer of (7.35), then we declare to be . The partition in (7.3) is then read from . In practice, due to noise in the input data, never exactly equals , and so after we compute , we threshold its entries according to
[TABLE]
The estimation of is summarized in Algorithm 4.
8 Signs synchronization
Assuming we have obtained the partition in (7.1) (see also (7.3)), our final task is to adjust the signs in each set of (7.1), so that we can construct the rank 1 matrices of (5.7). As was explained in Section 5, the matrices can then be decomposed to retrieve all the rows of the matrices in (1.1), which can then be assembled from their constituent rows. Since all assertions we derive in this section apply identically and independently to each set in (7.1), throughout this section we refer to a single set of rank 1 matrices
[TABLE]
by dropping the superscript that indicates the set in (7.1) to which the matrices belong. Our goal is therefore to estimate up to an arbitrary sign. To that end, we will adjust the signs in (8.1) so that the matrix of size whose block of size is given by (8.1) has rank 1. The leading eigenvector of will then give the vectors as required. We next describe the âsigns adjustmentâ procedure and the construction of .
The task of adjusting the signs in (8.1) consists of three steps. The first step of the signs adjustment procedure is computing the matrices for all by observing that
[TABLE]
since and so . For notational convenience, for all we write instead , since the âsignâ of equals 1. In principle, (8.2) allows to compute for each by using a single matrix for some arbitrarily chosen . However, since in practice the input data is noisy, we obtain more robust estimates for by computing the averages
[TABLE]
followed by computing the best rank 1 approximation of (8.3) using SVD.
While in (8.1) are defined only for , for notational convenience we define whenever , and as explained above . Thus, are defined for all . We next outline steps 2 and 3 of the signs adjustment procedure, before giving a detailed description of these steps. In step 2 of the procedure, we construct rank 1 matrices , which admit the decompositions
[TABLE]
for unknown , . For each , the matrix in (8.4) is constructed block by block from the matrices in (8.1) and (8.3), in such a way that each is a rank 1 matrix which admits the decomposition (8.4). This construction relies on Proposition 8.1 stated below. Each of the matrices in (8.4) can then be decomposed to recover all the rows up to signs . Thus, in theory, we could construct only one of these matrices, say , and recover , which is our goal in this section. However, in practice, the estimates of (8.1) contain errors since they were estimated from noisy images. Moreover, for each particular , the set of estimates used to construct the matrix critically depends on the common lines of the single noisy image in (1.2) with each of the images for , which due to the noise in the input images, may be highly inaccurate, leading to large errors in of (8.4).
Thus, to use all available data in estimating of (8.4), we first decompose all matrices in (8.4), which results in independent estimates for each row . We then execute the third step of our signs adjustment procedure in which we use all estimates together (see (8.4)) to obtain a set of signs , which allows us to adjust the signs of (8.1) by multiplying each matrix by , such that the matrix of size whose block is has rank 1. This latter procedure exploits all available data at once, improving the robustness of the estimation of the matrices of (1.2) to noisy input data. This matrix admits the decomposition
[TABLE]
for some unknown signs . Recalling that we dropped the index from of (5.7), and constructed from the set in (8.1), we see that in fact we can construct of (5.7) using for each , as required. We can then decompose each , and recover the rotation matrices of (1.2), as was explained in Section 5, which is our task in this paper. We now complete the details of the signs adjustment procedure described above, i.e., the construction of the matrices of (8.4) and (8.5).
The following proposition, the proof of which is given in Appendix A.7, is the basis for the construction of the matrices of (8.4).
Proposition 8.1**.**
Let be a matrix whose block of size is given by of (8.1) if , by its transpose if , and by of (8.2) if . Then, H is rank 1 iff for each
[TABLE]
where as noted above, for we define . Furthermore, whenever (8.6) holds, we have
[TABLE]
Now, suppose we wish to construct of (8.4). Proposition 8.1, and in particular (8.6), suggest that we can construct using the set in (8.1), by applying the following sign correction procedure. Recall that are rows of orthogonal matrices, and thus for all . For each pair , we compute the norm
[TABLE]
and replace the matrix with if (8.8) is greater than zero. Let be the resulting set of rank 1 matrices after this signs correction procedure. By construction, it holds that
[TABLE]
Now, let be the matrix whose block of size is given by . By (8.9) and Proposition 8.1, admits the decomposition
[TABLE]
The matrices are obtained in a similar manner. In practice, (8.8) is never exactly zero due to errors stemming from noise, and thus we also compute , and replace with , if
[TABLE]
At this point, we can factor each of the matrices (e.g., using SVD), and obtain the set of vectors
[TABLE]
of (8.4), where are unknown. For each we have
[TABLE]
that is, we have estimates for the row , where each estimate has an unknown sign . This concludes step 2 of the signs adjustment procedure outlined above.
For the third and final step of the signs adjustment procedure, we define
[TABLE]
Since are unit row vectors, by (8.11) and (8.12), we have
[TABLE]
for all . Thus, we can obtain all the products in (8.13) by taking dot products of the vectors in (8.11).
Now, suppose we computed the set from the values in (8.13) (as will be explained shortly). Since , we have that
[TABLE]
Thus, we can multiply each matrix in (8.1) by , and obtain the set of matrices , and together with (8.3), we can construct the matrix , whose block of size is given by
[TABLE]
Then, admits the decomposition in (8.5), as required. Thus, it only remains to show how to extract the set from the values in (8.13).
Let us define the matrix
[TABLE]
where , , and the products are computed using (8.13). The following proposition, the proof of which is given in Appendix A.8, shows that the signs can be extracted from in (8.16).
Proposition 8.2**.**
The leading eigenvalue of in (8.16) is and it is simple. Moreover, define to be the vector of length with entries . Then, is an eigenvector of corresponding to the eigenvalue .
By Proposition 8.2, the eigenvector of in (8.16) gives the set . The procedure for the signs adjustment (and the construction of of (8.5)), is summarized in Algorithm 5.
9 Numerical experiments
We implemented Algorithms 1â5 in Matlab and tested them on a dataset of raw projection images of the beta-galactosidase enzyme [4], which has a symmetry. All tests were executed on a dual Intel Xeon E5-2683 CPU (32 cores in total), with 768GB of RAM running Linux, and four nVidia GTX TITAN XP GPUâs. Section 9.1 provides some of the implementation details for Algorithms 1â5, and Section 9.2 presents the results on the experimental dataset.
9.1 Implementation details
To execute Algorithm 1, we need to discretize the space of rotations . To that end, we generated a pseudo-uniform spherical grid of =1200 points on , using the Saaf-Kuijlaars algorithm [10]. Then, for each on the spherical grid, we computed the set of rotations
[TABLE]
where for , and the vectors and are given by
[TABLE]
It is easily verified that the vectors , and form an orthonormal set, and that . The third column of each rotation is the beaming direction corresponding to the rotation , and the vectors
[TABLE]
are the coordinate systems for the plane perpendicular to . Thus, each set of matrices where is a discretization of the set of rotations in with beaming direction . We found experimentally that choosing (together with ) is sufficient to obtain accurate results.
As for runtime, for a set of 500 projection images, it took 1512 seconds to compute all sets of relative rotations (Algorithm 1), 720 second to synchronize handedness (Algorithm 2), 5784 seconds to compute the partition in (7.1) (Algorithms 3 and 4), and 1378 seconds to adjust the signs of (7.1) (Algorithm 5).
9.2 Beta-galactosidase experimental results
We applied Algorithms 1â5 to the EMPIAR- dataset [1] from the EMPIAR archive [6]. The dataset consists of 41,123 raw particles images, each of size pixels, with pixel size of Ă . To generate class averages from this dataset, we used the ASPIRE software package [2] as follows. First, all images were phase-flipped (in order to remove the phase-reversals in the CTF), down-sampled to size pixels (hence with pixel size of  à ), and normalized so that the noise in each image has zero mean and unit variance. We then split the images into two independent sets, each consisting of 20,560 particle images, and all subsequent processing was applied to each set independently.
We next used the class-averaging procedure in ASPIRE [2] to generate 2000 class averages from each of the two sets of particle images (using the EM-based class averaging algorithm in ASPIRE). A sample of these class averages is shown in Fig. 3. The input to our algorithm was 500 out of the 2000 class averages (by selecting every 4th image).
Next, we used the algorithms described in this paper to estimate the rotation matrices that correspond to the 500 class averages, and reconstructed the three-dimensional density map using the class averages and their estimated rotation matrices. The resolution of the reconstructed volume, assessed by comparing the reconstructions from the two independent sets of class averages is 8.23 à , using the Fourier shell correlation (FSC) -criterion [14] (Fig. 4(a)). When comparing our reconstructions to a high resolution reconstruction of the molecule (EMD- [3]), the resolution estimated using the 0.5-criterion of the FSC is 9.88 à  (Fig. 4(b)).
10 Summary and future work
In this paper, we presented a procedure for estimating the orientations corresponding to a given set of projection images of a -symmetric molecule. We have shown that the set of relative rotations between all pairs of images admits a special graph structure, and demonstrated that this structure can be exploited to recover the rotations. We then demonstrated our method by reconstructing an ab-initio model from an experimental set of cryo-EM images.
An obvious future work is to extend the proposed method to for . Preliminary theoretical analysis suggests that this can be achieved by combining the method of the current paper with the algorithms derived in [9].
Acknowledgments
This research was supported by the European Research Council (ERC) under the European Unionâs Horizon 2020 research and innovation programme (grant agreement 723991 - CRYOMATH).
Appendix A Appendix
A.1 Proof of Proposition 5.1
Let be a permutation of the 4-tuple .
This follows immediately from (5.2) and (5.3), by noting that the tuple is a permutation of . 2. 2.
Suppose that . Then, is a permutation of . By (5.5) and (5.3), we have
[TABLE]
[TABLE]
as required. The proof for the cases is similar.
â
A.2 Proof of Proposition 6.1
First, we note that the set of (1.3) forms a multiplicative group of matrices, known in literature as the Klein four-group. In particular, is the identity element of the group, and we have
[TABLE]
Now, since and are in , we have
[TABLE]
By (A.1), each member of the group is its own inverse, and so for any triplet we have that if and only if
[TABLE]
Now observe, that since are permutations, we have
[TABLE]
from which it follows that there are 16 possible products on the left-hand side of (A.3), corresponding to the 16 products for . Thus, there are at most 16 triplets for which (A.3) is satisfied. On the other hand, combining the closure property of groups with (A.4) gives us that there always exists an element of such that (A.3) is satisfied. We conclude that are exactly 16 triplets such that (A.3) is satisfied, from which by (A.2), the proof is concluded.
â
A.3 Proof of Theorem 7.2
We assume without loss of generality, that the first rows of correspond to the vertices , the following rows correspond to the vertices , and the last rows correspond to . Let be the matrix given by
[TABLE]
In [8], it was shown that the spectrum of is given by
[TABLE]
We now show how to relate the spectrum of to that of .
Let for , be an matrix which consists of rows and columns of the matrix . That is, we partition into 9 block matrices of equal dimensions. By assumption, for any , both the rows and columns of the matrix correspond to vertices of the same set in (7.3). Thus, by (7.4) we have that for ,
[TABLE]
which by (A.5) gives that for all . Now, consider any matrix for , and note that its rows correspond to the vertices of the set , whereas its columns correspond the vertices of the set . Again, by (7.4), we have that for ,
[TABLE]
Thus, by (A.5), we have that for all , and we conclude that
[TABLE]
Now, let be any eigenvector of corresponding to an eigenvalue . Denote , and consider the column vectors of length
[TABLE]
By (A.9), we have
[TABLE]
This shows that if is an eigenvalue of , then and are eigenvalues of . Furthermore, if is an eigenvector of with eigenvalue , then in (A.10) is an eigenvector of corresponding to the eigenvalue of , and and in (A.10) are eigenvectors of corresponding to the eigenvalue of . Now, note that
[TABLE]
and thus and in (A.10) are independent for any . Furthermore, since all eigenvalues of are non-zero (see (A.6)), we have that , and so is in a different eigenspace of than and . Thus, all three vectors in (A.10) are independent eigenvectors of . Let us denote the multiplicity of an eigenvalue of the matrix by , and the multiplicity of an eigenvalue of , by . We also denote the three eigenvalues of by and . It is simple to verify, that if a pair of vectors and are independent, then so are the pairs of vectors , , and . Thus, by (A.11), if is an eigenvalue of , then the eigenvalues and of satisfy
[TABLE]
which gives us that
[TABLE]
On the other hand, since has dimensions , we have
[TABLE]
by which we have that
[TABLE]
for otherwise, by (A.12) we would have a strong inequality in (A.13), which is a contradiction to (A.14).
We conclude that the set of eigenvalues of is given by . Finally, the multiplicities of the eigenvalues of in (7.5) are computed by combining (A.6) with (A.15).
â
A.4 Proof of Proposition 7.4
We begin by introducing some notation and definitions which we use in the current and subsequent proofs. Let denote the permutation matrix of , i.e., satisfies for any . Thus, using the notation introduced in (7.6) we can write
[TABLE]
where and are given in Definition 7.3, and and are defined in (7.7).
Now, using the notation of (7.9), consider the block of for some . If it were the case that and are the identity permutations, then by (7.9) and (7.4), we would have that
[TABLE]
However, in general, is the block of which we get by taking the entries of in the rows corresponding to the triplet of vertices , and columns corresponding to the triplet of vertices . In other words, is obtained from the matrix in (A.17) by permuting its rows by and its columns by , that is
[TABLE]
By the same argument applied to whenever , we have that
[TABLE]
The reason that in (A.18) is transposed, is that in order to permute the columns of a matrix by , one has to multiply it on the right by . We now prove Proposition 7.4
Proof of Proposition 7.4.
Let us compute the Rayleigh quotient for . By the second equality in (7.14), we have
[TABLE]
By (A.16) and (A.18), and since permutation matrices are orthogonal, we have
[TABLE]
It is straightforward to check that each term in the sum (A.20) equals exactly . By Definition 7.3, , and since there are triplets , the sum in (A.20) amounts to
[TABLE]
By Theorem 7.2, is the leading eigenvalue of , and thus, maximizes the Rayleigh quotient of , by which we have that is in the eigenspace of . A similar calculation for shows that it is also in the same eigenspace. Finally, observe that since are orthogonal for all , we have
[TABLE]
â
A.5 Proof of Lemma 7.7
We begin by showing the first equality in (7.14). Fix some , and observe that for any such that , we have that
[TABLE]
where in the first case of (A.21) it cannot be that since then we would have that (and similarly for the other cases). From (A.21) and (7.12) we get that
[TABLE]
This shows that for fixed it holds that
[TABLE]
and so by taking a union over all , we get from (7.11) that
[TABLE]
Conversely, suppose that for some and , . Then, we have that for , and . Thus, by (7.11), for any such that and we have that , from which we have that . Applying a similar argument to and , we get that for all , from which we get that
[TABLE]
Taking a union over all we have
[TABLE]
which together with (A.22) proves the first equality in (7.14).
Let us now show the second equality in (7.14). Suppose that . Then, from (7.12) we have that
[TABLE]
By (A.24) and the definition of and in (7.13), we have
[TABLE]
Thus, taking the union over all gives us that
[TABLE]
Now, for each we have
[TABLE]
by which we have that (renaming the indices to those given in (7.12))
[TABLE]
Taking a union over all we get
[TABLE]
thus, by (A.25), we have
[TABLE]
where the last inequality follows from (A.23).
Conversely, take . By the first part of the proof, belongs to one of the sets in (7.12). If , then we have that either or , and thus
[TABLE]
that is, or . This shows that (renaming the indices to the order given in (7.13))
[TABLE]
for either or , by which we conclude that
[TABLE]
In the same manner one can show that
[TABLE]
for all . Thus, we have that
[TABLE]
for all . Taking a union over all and using the first equality in (7.14) gives
[TABLE]
which concludes the proof of the second equality in (7.14).
â
A.6 Proof of Proposition 7.8
Define the sets
[TABLE]
where and were defined in (7.7), and is the permutation matrix of . That is, is the set of all permutations of and is the set of all permutations of .
Lemma A.1**.**
Let be such that and for some . Suppose, that are such that for some , and , where was defined in (7.28). Then, there exists such that
[TABLE]
Proof.
First, since is orthogonal, then
[TABLE]
Define , and note that is a linear subspace of that contains and . Since and is an orthogonal matrix, by (A.28) we have that and are also orthogonal vectors, and by assumption and are in and , respectively, and so they are in . Now, since there exists such that . Since is an orthogonal matrix, we have
[TABLE]
Finally, since is of dimension 2, there are exactly two vectors perpendicular to in . Thus, it must be that , from which we get (A.27). â
We now prove Proposition 7.8.
Proof of Proposition 7.8.
The function , given in (7.35), satisfies for all . Suppose that is such that . Such a necessarily exists, since if we choose in (7.33) such that (which can be done due to (7.32)), then (7.35) equals zero. In the notation of (7.6), we now show that there exists such that either
[TABLE]
or that
[TABLE]
from which it follows that .
First, we show that
[TABLE]
Indeed, for each pair , looking at the first square brackets in (7.35), we have that
[TABLE]
This shows that must be a permutation of in (7.7), that is, . Similarly, looking at the second square brackets in (7.35), we have that
[TABLE]
[TABLE]
which is possible only if is a permutation of the vector in (7.7), i.e., , which shows (A.31).
Now, by (7.32) and (7.33), the vectors and are either given by
[TABLE]
or by
[TABLE]
for some , where and is the pair of orthogonal eigenvectors of defined in (7.25). Let us assume first the case (A.32). In the notation of (7.6), we define
[TABLE]
Also, denoting by
[TABLE]
the matrices with columns and (defined in (7.7)), we get that by Definition 7.3 and (7.6), for each pair we have
[TABLE]
In particular, for and we have
[TABLE]
Thus, by (A.35) and (A.36) it follows that for all we have
[TABLE]
Now, by (A.32) and (7.6) we have that
[TABLE]
By (A.36) we have that and , and by (A.31), we have that and . Thus, using (A.38), by Lemma A.1 there exists a permutation matrix such that either
[TABLE]
First, assume that the case on the left of (A.39) holds. It then follows from (7.6), (A.32), (A.37) and (A.36) that
[TABLE]
Writing , we get by (A.34) that for all
[TABLE]
By (7.8) and (7.6), we have that
[TABLE]
for all . The last two equations show that (A.29) holds, which proves the proposition for the case , when the identity on the left of (A.39) holds.
If holds as well as the case on the right of (A.39), where , then by repeating the latter calculation with and replaced by and , we get that
[TABLE]
for all , i.e., that (A.30) holds, which proves the proposition for the case when the identity on the right of (A.39) holds. This concludes the proof for the case .
In the case where holds, we get by the same method of proof that either (A.30) or (A.29) hold, which proves the proposition for this case.
To conclude, we have shown that given which minimizes (7.35), the vectors and defined in (7.33), must satisfy either or for some .
â
A.7 Proof of Proposition 8.1
Suppose that (8.6) holds, and fix an arbitrary . Then, the block of is given by . Thus, since for all , we have
[TABLE]
which gives (8.7) and shows that indeed has rank 1.
Now suppose that (8.6) does not hold, and assume without loss of generality that . Denote the entry of a row by , . The rank 1 matrix is non-zero, thus, there exist such that , that is, the and entries of the vectors and , respectively, are non-zero. By (8.2), we have that , and thus, the first three rows of are given by the matrix
[TABLE]
Similarly, the next three rows of are given by the matrix
[TABLE]
Thus, since each vector is of length 3, rows number and of are given by
[TABLE]
Multiplying (A.44) by and (A.45) by , by our assumption we get
[TABLE]
Since , the latter two vectors are linearly dependent only if , which is impossible. Therefore, rows number and of given in (A.44) and (A.45) are linearly independent, which implies that .
â
A.8 Proof of Proposition 8.2
Fix a pair of indices . We begin by deriving an expression for , which is the entry of corresponding to the row of . By (8.16) and the first equality in (7.14) of Lemma 7.7, we have
[TABLE]
where the 4 sums to the right of the first equality in (A.46) correspond to the 4 sets of Lemma 7.7. Since there are exactly indices for which , we conclude from (A.46) that for all . Thus, is an eigenvector of corresponding to the eigenvalue . Next, we show that is simple. Define a partition of the set of (8.12) into two disjoint sets
[TABLE]
and note that a pair and are in the same set of (A.47) iff . Thus, by (8.16) and (A.47), the matrix is given by
[TABLE]
In [8], it was shown that the leading eigenvalue of is simple and is given by , which concludes the proof.
â
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] 2.2 Ă resolution cryo-EM structure of beta-galactosidase in complex with a cell-permeant inhibitor. http://dx.doi.org/10.6019/EMPIAR-10061 .
- 2[2] Aspire: Algorithms for Single Particle Reconstruction software package. http://spr.math.princeton.edu/ .
- 3[3] Atomic resolution cryo-EM structure of beta-galactosidase. http://www.ebi.ac.uk/pdbe/entry/emdb/EMD-7770 .
- 4[4] A. Bartesaghi, A. Merk, S. Banerjee, D. Matthies, X. Wu, J.L.S Milne, and Subramaniam S. 2.2 Ă resolution cryo-EM structure of beta-galactosidase in complex with a cell-permeant inhibitor. Science , 348:1147â1151, 2015.
- 5[5] G. Frank. Three-Dimensional Electron Microscopy of Macromolecular Assemblies: Visualization of Biological Molecules in Their Native State . Oxford, 2006.
- 6[6] A. Iudin, P. K. Korir, J. Salavert-Torres, G. J. Kleywegt, and A. Patwardhan. Empiar: a public archive for raw electron microscopy image data. Nature Methods , 13(5):387â388, 2016.
- 7[7] F. Natterer. The Mathematics of Computerized Tomography . Classics in Applied Mathematics. SIAM, 2001.
- 8[8] G. Pragier, I. Greenberg, Xiuyuan C., and Y. Shkolnisky. A graph partitioning approach to simultaneous angular reconstitution. IEEE Transactions on Computational Imaging , 2(3):323â334, 2016.
