Statistics of ambiguous rotations
R. Arnold, P. E. Jupp, H. Schaeben

TL;DR
This paper develops statistical methods for analyzing ambiguous rotations, which are rotations known only up to symmetry, with applications in biomechanics, crystallography, and seismology.
Contribution
It introduces new tests and models for ambiguous rotations, including uniformity tests, parametric models, and regression approaches.
Findings
A test for uniformity of ambiguous rotations is proposed.
Parametric models for ambiguous rotations are developed.
An illustrative example with diopside crystal orientations demonstrates the methods.
Abstract
The orientation of a rigid object can be described by a rotation that transforms it into a standard position. For a symmetrical object the rotation is known only up to multiplication by an element of the symmetry group. Such ambiguous rotations arise in biomechanics, crystallography and seismology. We develop methods for analyzing data of this form. A test of uniformity is given. Parametric models for ambiguous rotations are presented, tests of location are considered, and a regression model is proposed. A brief illustrative example involving orientations of diopside crystals is given.
| Group | Name | Frame | |
|---|---|---|---|
| trivial | orthonormal, | ||
| cyclic | orthonormal | ||
| cyclic | coplanar, | ||
| known up to cyclic order, | |||
| for | |||
| dihedral | orthogonal axes | ||
| dihedral | coplanar, | ||
| known up to cyclic order and reversal, | |||
| for | |||
| tetrahedral | for | ||
| octahedral | orthogonal axes | ||
| = cubic | |||
| icosahedral | for | ||
| = dodecahedral |
| Group, | |
|---|---|
| odd | |
| even | |
| odd | |
| even | |
| Group, | Inner product |
|---|---|
| odd | |
| even | |
| odd | |
| even | |
| Group | ||
|---|---|---|
| 3 | 9 | |
| 5/3 | 8 | |
| odd | ||
| even | ||
| 10 | ||
| odd | ||
| even | ||
| 32/9 | 10 | |
| 6/5 | 9 | |
| 21 |
| Group | |
|---|---|
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMorphological variations and asymmetry · Statistical and numerical algorithms · Soil Geostatistics and Mapping
Statistics of ambiguous rotations
R. Arnold,
School of Mathematics and Statistics, Victoria University of Wellington,
PO Box 600, Wellington, New Zealand,
P.E. Jupp,
School of Mathematics and Statistics, University of St Andrews,
St Andrews, Fife KY16 9SS, UK,
H. Schaeben,
Geophysics and Geoscience Informatics, TU Bergakademie Freiberg,
Germany
Abstract
The orientation of a rigid object can be described by a rotation that transforms it into a standard position. For a symmetrical object the rotation is known only up to multiplication by an element of the symmetry group. Such ambiguous rotations arise in biomechanics, crystallography and seismology. We develop methods for analyzing data of this form. A test of uniformity is given. Parametric models for ambiguous rotations are presented, tests of location are considered, and a regression model is proposed. A brief illustrative example involving orientations of diopside crystals is given.
Keywords: Frame, Orientation, Regression, Symmetry, Tensor, Test of uniformity.
1 Introduction
Data that are rotations of occur in various areas of science, such as palaeo-magnetism (Pesonen et al., 2003; Villalaín et al., 2016; Koymans et al., 2016), plate tectonics and seismology (Stein & Wysession, 2003; Hardebeck, 2006; Arnold & Townend, 2007; Walsh et al., 2009; Khalil & McClay, 2016), biomechanics (Rivest, 2005; Lekadir et al., 2015; Spronck et al., 2016), crystallography (Hielscher et al., 2010; Griffiths et al., 2016) and texture analysis, i.e., analysis of orientations of crystalites (Kunze & Schaeben, 2004, 2005; Du et al., 2016). The sample space is the 3-dimensional rotation group, , and methods for handling such data are now an established part of directional statistics; see §13.2 of Mardia & Jupp (2000). In some contexts the presence of symmetry means that the rotations are observed subject to ambiguity, so that it is not possible to distinguish a rotation from for any rotation in some given subgroup of . From the mathematical point of view, the sample space is the quotient of by . Such spaces arise in many scientific contexts: the case in which is generated by the rotations through about the coordinate axes gives the orthogonal axial frames considered by Arnold & Jupp (2013), which can be used to describe aspects of earthquakes; many groups of low order occur as the symmetry groups of crystals; the icosahedral group is the symmetry group of some carborane molecules (Jemmis, 1982), of most closed-shell viruses (Harrison, 2013), of the natural quasicrystal, icosahedrite (Bindi et al., 2011), and of the blue phases of some liquid crystals (Seideman,1990, §6.1.2). The object of this paper is to give a unified account of some general tools for the analysis of data consisting of ambiguous rotations with a finite symmetry group.
2 Ambiguous rotations
2.1 Symmetry groups
The orientation of a rigid object in can be described by a rotation that transforms it into some standard position. If the object is asymmetrical then this rotation is unique, so that the orientations of the object correspond to elements of the rotation group . If the object is symmetrical then the set of rotations that have no visible effect on the object forms a subgroup of . Then the orientations of the object correspond to elements of the homogeneous space , i.e. the set of equivalence classes of elements of under the right action of . We shall consider the cases in which is finite. In particular, the orientations of T-shaped, X-shaped and -shaped objects in are elements of with and , respectively. For in we shall denote the equivalence class of in by .
The finite subgroups of are known also as the point groups of the first kind. The classification result for these groups, given e.g. in Miller (1972), states that any such group is isomorphic to one of the following: the cyclic groups, , for , the dihedral groups, , for , the tetrahedral group, , the octahedral group, , and the icosahedral group, . These groups are listed in Table 1, together with the frames of vectors that will be used to represent elements of the sample spaces . The group has one element, the identity, .
2.2 Frames and symmetric frames
For each point group of the first kind, every element of can be represented uniquely by a -frame, i.e. an equivalence class of a frame, meaning a set of vectors or axes in . For with or with , it is convenient to take the vectors of the frame to be unit normals to the sides of a regular -gon; for we take a unit vector and an axis orthogonal to it; for we take a pair of orthogonal axes; for , or , it is convenient to take the vectors to be unit normals to the sides of a regular tetrahedron, cube or dodecahedron, respectively. Permutation of the vectors of the frame by the action of leads to ambiguity. This ambiguity is removed by passing to the corresponding -frame, i.e. the equivalence class of the frame under such permutations. The -frames will be denoted by square brackets, e.g. for , denotes the -frame arising from . By a symmetric frame, we shall mean a -frame for some . The frames that we consider are listed in Table 1, together with an indication of the ambiguities.
Special cases of Table 1 include the 7 crystal systems: triclinic, monoclinic, trigonal, tetragonal, orthorhombic, hexagonal and cubic with symmetry groups , , , , , and , respectively.
3 Transforming symmetric frames to tensors
3.1 Embeddings of the sample spaces
In order to carry out statistics on , we shall take the embedding approach used in, e.g. §10.8 of Mardia & Jupp (2000). We shall embed in an inner-product space, , on which acts. The embedding will be a well-defined equivariant one-to-one function such that has expectation if is uniformly distributed on . For , the space of square-integrable functions on , a very wide class of such embeddings can be obtained by averaging over . Let be an embedding used in the Hilbert space approach to Sobolev tests of uniformity; see §§10.8, 13.2.2 of Mardia & Jupp (2000), Giné (1975) and §4 of Prentice (1978). Define by , where denotes the number of elements in . If is one-to-one then it is an embedding. In general, such are quite complicated, so in this paper, for each , we focus on a simple choice of embedding, , of into an appropriate space of symmetric tensors. These are given in Table 2. Corresponding expressions for with are given in Table 3. Here is the standard inner product on the appropriate tensor product.
Define by
[TABLE]
which has the same value for all in . Then embeds in the sphere of radius with centre the origin in the vector space .
Each symmetric frame can be represented by an element of . In the triclinic case, where , is unique and . We have restricted our attention to point groups, , of the first kind, i.e., excluding reflections. However, in situations where reflection symmetries are also present we can adopt a right-handed convention for all orientations, and then neglect reflections. For example, we can treat observations on in the same way as those on .
3.2 Sample mean
Observations in can usefully be summarized by the sample mean of their images by , i.e., by . The sample mean is defined as the in that maximizes . Although is not necessarily unique, it follows from Theorem 3.2 of Bhattacharya & Patrangenaru (2003) that if are generated by a continuous distribution then is unique with probability 1.
3.3 Sample dispersion
A sensible measure of dispersion is
[TABLE]
analogous to the quantity used for spherical data; see p. 164 of Mardia & Jupp (2000). The dispersion satisfies the inequalities , where is defined in (1). Since is one-to-one, if and only if . Transformation of to with in leaves unchanged. If then , where with , the sample mean of , as in p. 290 of Mardia & Jupp (2000). If then , where is one of the measures of dispersion defined in §2.3 of Arnold & Jupp (2013).
4 Tests of uniformity
4.1 A simple test
The uniform distribution on is the unique distribution that is invariant under the action of on in which in maps to . Since the embeddings were chosen so that for uniformly distributed on , it is intuitively reasonable to reject uniformity if is far from , i.e. if is large. Significance can be assessed using simulation from the uniform distribution. For large samples, the following asymptotic result can be used.
Proposition 1
Given a random sample on , define by*
[TABLE]
where and are given by (1) and (2), respectively, and is the dimension of .
- (i)
For with , or , under uniformity, the asymptotic distribution of is , as . 2. (ii)
For ,
[TABLE]
where is the Rayleigh statistic for uniformity of and is the Bingham statistic for uniformity of , being the mean resultant length of and being the sample scatter matrix of . Under uniformity, and are asymptotically independent with asymptotic distributions and , respectively. 3. (iii)
For with ,
[TABLE]
where the subscripts and refer respectively to -frames and the corresponding -frames obtained by replacing the directed normal to the plane of a -frame by the undirected normal, and is the Rayleigh statistic for uniformity of . Under uniformity, and are asymptotically independent with asymptotic distributions and , respectively.
Values of and are given in Table 4. In the case , is the Rayleigh statistic (Mardia & Jupp, 2000, p. 287) for testing uniformity on . In the case , is the statistic given in §3 of Arnold & Jupp (2013) for testing uniformity on .
4.2 Some consistent tests
The test of uniformity based on is consistent only against alternatives for which is non-zero. For example, in any equal mixture of two frame cardioid distributions with densities (8) having concentrations and , , and so, in asymptotically large samples, cannot distinguish between such mixtures and the uniform distribution.
Tests of uniformity on that are consistent against all alternatives can be obtained as follows by averaging over Prentice’s generalization to of Giné’s test of uniformity; see §4 of Prentice (1978) and §13.2.2 of Mardia & Jupp (2000). Given in , with representatives in , put
[TABLE]
cf. the construction in §2 of Jupp & Spurr (1983). Uniformity is rejected if is large compared with the randomization distribution obtained by replacing by , where are independent random rotations obtained from the uniform distribution on . The orthorhombic case, , is considered in §3 of Arnold & Jupp (2013). It follows from Theorem 3.1 of Jupp & Spurr (1983) and the consistency of Giné’s test on (Mardia & Jupp, 2000, p. 289) that the test based on is consistent against all alternatives. More general Sobolev statistics on can be obtained from Sobolev statistics on by averaging over , as in (4).
Permutational multi-sample tests, tests of symmetry, tests of independence, and goodness-of-fit tests for symmetric frames can be obtained by applying the machinery of Wellner (1979), Jupp & Spurr (1983), Jupp & Spurr (1985) and Jupp (2005), respectively, to the embedding . These tests of independence are considered in §7.
5 Distributions on
5.1 A general class of distributions
An appealing class of distributions on consists of those with densities of the form
[TABLE]
where is a suitable known function and . The parameter measures location and measures concentration. If is a strictly increasing function, as in (6) or León et al. (2006) with , then the mode is .
In the case , the densities (5) depend on only through and the axes and the rotation angles of the random rotations are independent, with the axes being uniformly distributed. These distributions were introduced by Bingham et al. (2009) under the name of uniform axis-random spin distributions and by Hielscher et al. (2010) under the name of radially symmetric distributions. For , elements of do not have well-defined axes and, in general, the distributions on with densities of the form do not have uniformly distributed axes.
Taking proportional to in (5) gives the densities of the form
[TABLE]
For , the mode is and the maximum likelihood estimate of is the sample mean. The family (6) is a subfamily of the crystallographic exponential family introduced by Boogaart (2002, §3.2). For , (6) is the density of the matrix Fisher distribution with parameter matrix and (Mardia & Jupp, 2000, §13.2.3). For , (6) is the density of the equal concentration frame Watson distributions considered in Arnold & Jupp (2013, §6.1). Taking proportional to in (5) gives the densities of the form
[TABLE]
For , these densities are those of the de la Vallée Poussin distributions introduced by Schaeben (1997), and, under the name of Cayley distributions, by León et al. (2006).
Taking with in (5) gives the densities
[TABLE]
of the frame cardioid distributions, which are analogous to the cardioid distributions on the circle (Mardia & Jupp, 2000, §3.5.5). Useful estimators of and in (8) are the moment estimators, and , where is the sample mean defined in §3.2 and with the sample variance of .
Distributions on can be identified with distributions on that are invariant under the action of . One way of generating such distributions is to average a given distribution on over . This averaging construction has been used by Walsh et al. (2009) in the orthorhombic case, Du et al. (2016) in the cubic case, and by Matthies (1982), Gorelova et al. (2014) and Niezgoda et al. (2016) in the general crystallographic case. Because the parameters of the distributions (5) are readily interpretable and the distributions (6), being exponential models, have pleasant inferential properties, we find these models more useful than many models obtained by averaging over , especially as the latter can be quite demanding numerically.
5.2 Concentrated distributions
A standard coordinate system on is given by the inverse of a restriction of the exponential map from the space of skew-symmetric matrices to . This can be modified to provide coordinate systems on . Let be an element of . There are neighbourhoods of in and of in such that each in can be written uniquely as , where
[TABLE]
with in . Define from to by , where . Then is a coordinate system on . Second-order Taylor expansion about of as a function of , together with some computer algebra, gives the high-concentration asymptotic distribution of .
Proposition 2
For near in put for near in . If has density (6) with as in Table 2 then the asymptotic distribution of as is normal with mean and variance , where is given in Table 5. If has density (7) with then has this asymptotic distribution. *
6 Tests of location
6.1 One-sample tests
Let be an element of which is some measure of location of a distribution on . There are various tests of the null hypothesis that , where is a given element of . The case with was considered by Arnold & Jupp (2013, §8).
Permutation tests can be based on the following symmetries of : For in , define as the transformation that takes to . Then is well-defined and preserves .
For a sample summarized by the sample mean of , an appealing measure of the squared distance between the sample and is . It is appropriate to reject the null hypothesis for large values of . If the distribution of is symmetric under then significance can be assessed by comparing the observed value of with its randomization distribution, which can be obtained by replacing by , where are independent and distributed uniformly on .
If is a sample from a concentrated distribution with density (6) and mode then it is sensible to test by applying Hotelling’s 1-sample test to , where is the projection onto the tangent space given in §5.2.
6.2 Two-sample tests
Suppose that two independent random samples and on are summarized by the sample means and of and . Then the squared distance between the two samples can be measured by . It is appropriate to reject the null hypothesis that the parent populations are the same if is large. Significance can be assessed by comparing the observed value of with its randomization distribution, obtained by sampling from the potential values corresponding to the partitions of the combined sample into samples of sizes and .
Suppose that and are samples from concentrated distributions with density (6) on . Let be the maximum likelihood estimate of the mode under the null hypothesis that the parent populations are the same. Then the null hypothesis can be tested by applying Hotelling’s 2-sample test to and , where is the projection onto the tangent space given in §5.2.
7 Independence, regression and misorientation
7.1 Independence
Let , for , be equivariant functions into some common inner-product space such that has expectation if is uniformly distributed on . Then association of random variables on and on can be measured in terms of association of and .
The general approach of Jupp & Spurr (1985) leads to the following test of independence. Given pairs in , independence of and is rejected for large values of . The observed value of this statistic is compared with the randomization distribution given , . An alternative randomization test rejects independence for large values of the correlation coefficient defined in (11). For and of Table 2, this is one of the tests considered by Rivest & Chang (2006).
7.2 Regression
A reasonable model for homoscedastic regression of in on in has regression function for some in and error distribution that is a mild generalization of (6), so that the density of given is
[TABLE]
For and of Table 2 , model (9) is a generalization of the spherical regression model of Chang (1986). It is the submodel of the models with regression function that were introduced by Prentice (1989) and explored by Chang & Rivest (2001) and Rivest & Chang (2006). For , it is not possible in general to extend the model (9) to have regression function of the form ]. If then the given in Table 2 are not suitable, since in many cases for all values of , and . Instead, it is sensible to take and the obtained by the averaging construction described in the first paragraph of §3.1.
For , the maximum likelihood estimate of is
[TABLE]
In general, is a well-defined element of , rather than an element of some quotient.
Put . If then define
[TABLE]
where is given by (10). Then and can be regarded as a form of uncorrected sample correlation of and . If , and is defined by then , where is the misorientation angle for introduced by Tape & Tape (2012). Application of Proposition 2 to the decomposition
[TABLE]
gives the following high-concentration asymptotic distributions.
Proposition 3
For from model (9),*
- (i)
Asymptotically, for large ,
[TABLE]
and the quantities in (12) and (13) are asymptotically independent. 2. (ii)
An approximate high-concentration confidence region for is
[TABLE]
7.3 Misorientation
The relationship between ambiguous rotations in and in can be described by the misorientation, which is an element of the double coset space . Since and are images of and in for any in and in , determines a well-defined element of . See p. 274 of Morawiec (1997). In crystallography it is usual to identify with an asymmetric domain, a neighbourhood of in that is in one-to-one correspondence, modulo null sets, with under followed by projection of to . Then the misorientation between and is taken as the element, , of the asymmetric domain that satisfies and has smallest rotation angle among all such rotations in the domain. In the case in which the conditional distribution of given is uniform the distributions of the angle and axis of the misorientation are given in Morawiec (2004, Ch. 7). For general pairs in , we define the mean misorientation as the element of defined in (10). An alternative definition of the mean misorientation is the element of that maximizes .
8 Example
To illustrate the estimators and tests introduced above, we consider some samples of orientations of diopside crystals. These crystals are monoclinic, so we can represent their orientations by -frames.
The stereonet in Fig. 1 shows the vectors and axes given by orientations of 100 diopside crystals. A randomization test of uniformity based on from (3) in §4.1 has value less than , leading to decisive rejection of uniformity.
The stereonets in Fig. 2 show the vectors and axes given by orientations of 34 crystals from one region of a specimen and 37 crystals from another region. The two-sample permutation test of §6.2 yields a -value of for equality of the populations of the orientations in the two regions, so the hypothesis of equality is not rejected.
Acknowledgements
We thank David Mainprice for providing the diopside data.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Arnold, R. & Jupp, P. E. (2013). Statistics of orthogonal axial frames. Biometrika 100 , 571–586.
- 2[2] R. Arnold & J. Townend. (2007). A Bayesian approach to estimating tectonic stress from seismological data. Geophys. J. Int. 170 , 1336–1356.
- 3[3] Bhattacharya, R. & Patrangenaru, V. (2003). Large sample theory of intrinsic and extrinsic sample means on manifolds. I. Ann. Math. Statist. 31 , 1-29.
- 4[4] Bindi, L., Steinhardt, P., Yao, N. & Lu, P. J. (2011). Icosahedrite, Al 6 Cu 24 Fe 13 subscript Al 6 subscript Cu 24 subscript Fe 13 \mathrm{Al}_{6}\mathrm{Cu}_{24}\mathrm{Fe}_{13} , the first natural quasicrystal. American Mineralogist 96 , 928–931.
- 5[5] Bingham, M A., Nordman, D. J. & Vardeman, S. B. (2009). Modeling and inference for measured crystal orientations and a tractable class of symmetric distributions for rotations in three dimensions. J. Amer. Statist. Assoc. 104 , 1385–1397.
- 6[6] Boogaart, K. G. van den (2002). Statistics for Individual Crystallographic Orientation Measurements . Aachen: Shaker Verlag.
- 7[7] Chang, T. (1986). Spherical regression. Ann. Statist. 14 , 907–924.
- 8[8] Chang, T. & Rivest, L.-P. (2001). M-estimation for location and regression models on Stiefel manifolds. Ann. Statist. 29 , 784–814.
