Signatures in Shape Analysis: an Efficient Approach to Motion Identification
Elena Celledoni, P{\aa}l Erik Lystad, Nikolas Tapia

TL;DR
This paper introduces a shape classification method using signatures, offering an efficient alternative to existing techniques like SRV transform and dynamic programming, with potential improvements in speed and accuracy.
Contribution
It presents a novel shape classification approach leveraging signatures, providing a reparametrization invariant method that enhances efficiency over current methods.
Findings
Signature-based classification is competitive with SRV transform.
The proposed method offers improved computational efficiency.
It effectively distinguishes shapes in various scenarios.
Abstract
Signatures provide a succinct description of certain features of paths in a reparametrization invariant way. We propose a method for classifying shapes based on signatures, and compare it to current approaches based on the SRV transform and dynamic programming.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Signatures in Shape Analysis: an Efficient Approach to Motion Identification
Elena Celledoni
[email protected] https://www.ntnu.edu/employees/elena.celledoni ,
Pål Erik Lystad
Department of Mathematical Sciences, Norwegian University of Science and Technology, 7491 Trondheim, Norway.
and
Nikolas Tapia
[email protected] https://www.wias-berlin.de/people/tapia Weierstraß-Institut Berlin, Mohrenstr. 39, 10117 Berlin, Germany.
Abstract.
Signatures provide a succinct description of certain features of paths in a reparametrization invariant way. We propose a method for classifying shapes based on signatures, and compare it to current approaches based on the SRV transform and dynamic programming.
Key words and phrases. Shape analysis, Signature, Motion identification
1. Introduction
Shape analysis is a broad and growing subject addressing the analysis of different types of data ranging from surfaces, landmarks, animation data etc. In this paper shapes are unparametrized curves. Mathematically a shape is an equivalence class of curves under reparameterization, that is, two curves are equivalent and determine the same shape if there exists a strictly increasing smooth bijection such that . For a given curve we denote by the corresponding shape.
The similarity between two shapes is then defined by creating a distance function on the space of shapes ,
[TABLE]
where is a suitable reparameterization invariant Riemannian distance on the manifold of parametrized curves.
Finding the optimal reparameterization is however computationally demanding, and in many applications simply unnecessary. This is specifically the case of applications where the optimal parametrization is not explicitly used for further calculations, e.g. problems of identification and classification. Ways of circumventing this step are therefore of great interest.
In recent years, after extensive work by Terry Lyons and collaborators, the theory of rough paths has gained considerable importance as a toolbox for mathematical analysis and for mathematical modeling in applications. In this context, the signature map provides a faithful representation of paths, capturing their essential global properties. A fundamental property of the signature is its invariance under reparameterization, surmising its importance for shapes.
In this paper, we define a measure of similarity between shapes in by means of the signature. We define a distance directly on We test the viability of this approach and use it to classify motion capture animations from the CMU motion capture database [7]. Indeed, this leads to an efficient technique that delivers results comparable to what is obtainable with methodologies based on the SRV transform, but at a much lower computational cost.
2. Shape analysis on Lie groups
In the following, will denote a finite-dimensional Lie group under multiplication with identity element denoted by . We let denote the corresponding right Lie algebra . For a fixed , left and right translation by will be denoted and respectively.
2.1. Shape Space
We consider the space of parameterized smooth curves on , i.e. smooth maps . To model the curves as unparameterized, or independent of parameterization, we define the shape space as the quotient space
[TABLE]
where is the group of orientation preserving diffeomorphisms of the parameter space . The elements of are equivalence classes of curves. The elements of the same class are curves which can be mapped to one another by changing their parameterization, that is, two curves are equal in shape space if there exists such that .
In the setting of our application, the search for optimal time parametrizations can be viewed as syncing up the animations, removing disturbances due to small pauses, different periodicity, or asynchronous starting and stopping, by shifting the movement of one character to match the other as closely as possible.
2.2. Geodesic Distances on Shape Space
Our goal is to introduce a meaningful and computable distance on to estimate the similarity between two shapes. This area of research started with the efforts of Younes [16]. We will restrict the space of curves to the space of immersions, i.e. curves with non-vanishing first derivative, which we denote by
[TABLE]
Let be a pseudo-metric on . We define , for two elements , by
[TABLE]
As shown in [3, Lemma 3.4], will be a pseudo-metric on if is a reparameterization invariant or, in other words, if for any two and any we have that
[TABLE]
An obvious choice of metric on is the familiar -metric. However, as shown by Michor and Mumford [13], this metric leads to vanishing geodesic distance which renders it useless. They further show in [14] that one solution to this problem is to consider metrics based on arc-length derivatives, creating a class of Sobolev-type metrics.
There are multiple possible metrics in this class. One option is based on what is usually referred to as the Square Root Velocity Transform (SRVT). This transform and accompanying metric was first introduced, in the context of shape analysis, by Srivastava et al. [15], who used the transformation when working with curves in Euclidian spaces. The transformation has later been adopted to more general shapes. Of particular interest is the formulation for shapes that are represented as Lie-group valued curves [3].
We define the SRVT by
[TABLE]
This transformation has the following useful properties [3, Lemma 3.6]:
- (1)
For every and , the following equivariant property holds:
[TABLE] 2. (2)
It is translation invariant: for all and
[TABLE]
A similar result is true for shapes with values in Euclidean spaces [15].
Further, one can obtain a Riemannian metric that coincides with the geodesic distance on a submanifold by using the SRVT to pull back the -metric on [3]. Further restricting the immersion space to , where is the identity element in , the distance turns out to be reparameterization invariant.
This invariance implies, in particular, that it will also yield a geodesic distance on [2]. The restriction to isn’t very troublesome as any curve can be transferred to this space by right translation by the inverse of its initial value, that is [3].
Using the equivariant property for the SRVT from equation (7) and defining for , the problem of calculating the metric for the shape space in equation (4) can be written as
[TABLE]
Finding this infimum will generally be very difficult. The usual approach is therefore to discretize the curves and solve instead a finite dimensional optimization problem. The most common methods used to solve this problem in shape analysis [15] are based on either the gradient descent method or a dynamic programming algorithm (DP). In our experiments we use the DP approach described in [1].
3. Signatures
Signatures, introduced by K.-T. Chen [4] for smooth paths and later generalized by Lyons [11] under the name of geometric rough paths, are an important tool for the study of the solutions of controlled differential equations, but have also proved useful for solving classification problems of time series, Machine Learning and Topological Data Analysis [6].
In the usual framework, signatures are defined for paths taking values in a Banach space. From a geometric point of view, and in light of our purposes, this setting has to be adapted. Luckily, Chen also considered signatures for curves taking values on a smooth manifold [4]. This definition is quite general and relies on the selection of a frame bundle. For Lie groups there is a canonical choice: the Maurer–Cartan form. This is the unique right-invariant one form such that , i.e. for [8, p. 311].
Below we denote, for a finite-dimensional vector space of dimension , the tensor algebra over ,
[TABLE]
We observe that is always infinite-dimensional. Its dual space is denoted by , and it may be identified with the ring of formal power series in noncommuting variables .
{dfn}
Let be a -dimensional Lie group and be a smooth curve and the Maurer-Cartan form on . The signature of is the family of linear maps on recursively defined by and
[TABLE]
In this definition, the notation denotes the -th component of the vector in a basis of the Lie algebra of .
The signature provides a compact description of certain features of a path [5]. One of its main advantages in our context is its reparameterization invariance: for any orientation-preserving diffeomorphism on we have that
[TABLE]
Other fundamental properties include:
- (1)
For each , the signature belongs to the set of group-like elements of , and for any , , the neutral element in the group. 2. (2)
Chen’s rule: For any three we have
[TABLE]
Using these properties, signatures may be efficiently computed for some restricted classes of paths. For example, if is a straight line in with base point direction , i.e. for , then
[TABLE]
A similar statement is true for geodesic curves on a finite-dimensional compact Lie group.
We may think of signatures as an infinite vector indexed by words over the alphabet . In particular, for a piecewise linear path the above formula means that if we want to know the component in (9) corresponding to the word then
[TABLE]
For a general piecewise linear path , we may use the above formula and Chen’s rule to deduce that
[TABLE]
where are the length of the time intervals where the path is sampled and are the slopes of the path in each of these intervals. The entries of this expression may be computed by using a Baker–Campbell–Hausdorff-type formula, for example.
Finally, we remark that the signature possesses another interesting property, namely it is an homomorphism from path space with concatenation to the tensor algebra . This means that if we are given two paths and , and we concatenate them to form a new path , then
[TABLE]
Moreover, if we reverse the path , i.e. we define then
[TABLE]
where the inverse is taken in the group-like elements of the tensor algebra.
It can be shown that actually, as a function of time the signature satisfies the differential equation
[TABLE]
in the tensor algebra. From this point of view, the signature map corresponds to the flow map of the vector field given by the base path. Thus, the signature belongs to an infinite-dimensional Lie group whose Lie algebra is the free Lie algebra over which we denote by . It does not, however, constitute a one-parameter subgroup. Therefore, for each fixed time interval we can map the signature to the free Lie algebra via a logarithm map, and we define
[TABLE]
This element, called the log-signature in the literature, provides a minimal description of the path, which is equivalent to the full signature.
There are many ways in which signatures can be used to compare shapes, but the essential feature is that since the map is reparameterization invariant, one obtains a way of directly comparing shapes instead of parameterized curves. For our experiments we chose a particular distance on (see next section for the precise formula), but this is by no means the only possible choice.
In making this choice one has to truncate the signature to obtain a finite-dimensional object. Due to the factorial decay of iterated integrals little information is lost in the process; still, some level has to be chosen and usually this done by running experiments. Once the truncation level is chosen, several choices of metric are available: the truncated tensor algebra becomes finite-dimensional so it has a nice linear structure and we are free to choose norms on it subject to some compatibility restrictions. There is also the notion of homogeneous norm on group-like elements, which takes into account the geometry of this group. Finally, the logarithm in this group maps signatures into a linear space (the free Lie algebra) in a bijective way, so no information is lost, but there is a substantial dimensional reduction.
According to our observations, is the last option which represents the most robust choice in terms of noise sensitivity, while also providing an accurate way of comparing signatures.
4. Experiments
Motion capture animations are usually recorded as the angle of every joint in a skeleton for every frame in an animation. A natural setting for the rotating joints is the Lie group of 3D rotations, . Every frame consists of independently rotating joints so the frame can be modeled as an element in , where is the Cartesian product of copies of . Interpolating between the frames will then allow us to model the animation as a parameterized curve.
We use an interpolation scheme in which one uses the log map to linearly interpolate on the Lie algebra, and then pull back to the Lie group with the exponential map. Let , we define the interpolation between and as
[TABLE]
Notice that and . Applying this interpolation component-wise to the frames in will enable us to construct a piece-wise interpolation between the frames of the animation. The Maurer–Cartan form along the interpolation is piece-wise constant, making it easy to compute SRV representations, -metrics, and signatures.
To test the effectiveness of the proposed frameworks we check whether they are able to identify different types of character motion. We have selected animations from the CMU motion capture database with descriptions "walk", "run/jog" and "forward jump". These are similar in length, and should produce results that conform with human intuition.
The test will calculate a distance matrix using the proposed similarity measures. From the distance matrix we produce a multidimensional scaling plot(MDS), depicting how similar, or dissimilar, the animations are. MDS tries to place the data points in 2-dimensional scatter plot while preserving the distances given by the distance matrix. See Kruskal [9] for more more information on this method.
In Figure 2 we calculate the distance matrix using the metric on interpolation curves in , and in Figure 3 we use the metric , equation (8), on the shapes generated by the curves in , where the optimal reparameterization is calculated with a DP algorithm. There are little to no patterns when projecting to the space , as seen in Figure 2. In Figure 3 however, we observe that modelling the curves as being parameterization invariant yields three easily distinguishable clusters of animations. Compared to Figure 2 we see a big benefit from this model assumption.
In Figure 1 the animations are projected to the shape space equipped with the distance function . While this figure does reveal the same structure as seen in figure 3, the clusters exhibit both a higher internal and a lower external variability. An important take away from this experiment is that this distance function in fact does preserve some of the structure of the shape space.
5. Concluding Remarks
Our preliminary experiments, show that classifying animations using a distance function on based on signatures produces very encouraging results. The proposed method is computationally very efficient, even though somewhat less accurate than known methods in shape analysis.
The Riemannian metric (4) requires calculating the optimal reparameterizations between every pair of animations. The proposed signature method instead only requires calculating the signature once for every animation, and then compares animations by computing inexpensive norms. The optimisation procedure is no longer necessary. 111A more thorough analysis of the run time complexities associated with these algorithms will be performed in future work.
In our experiments, the signature method outpreformed the optimal reparameterization metric by a factor of when classifying animations. A more precise comparison with the SRVT approach and other methods, see e.g. [10] goes beyond the scope of this work and will be considered in future work. Still our preliminary experiments give an idea of the possible performance benefits gained with the signature approach.
Increasing the accuracy of the signature method might also be possible by defining a more precise similarity measure. Nonetheless, our results can be seen as proof of concept for using signatures as an efficient way of classifying shapes.
5.1. Acknowledgements
This paper contains work done as part of P.E.L.’s master thesis. The master thesis will be published separately as part of NTNU’s Master of Science program in Applied Physics and Mathematics [12]. N.T. acknowledges that part of this work was carried out during his tenure of an ERCIM ‘Alain Bensoussan’ Fellowship Programme (contract number 2018-10) at NTNU. This work was supported by the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie, grant agreement No.691070.
The data used in this project was obtained from http://mocap.cs.cmu.edu. The database was created with funding from NSF EIA-0196217.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Bauer, M. Eslitzbichler, and M. Grasmair, “Landmark-Guided Elastic Shape Analysis of Human Character Motions,” Inverse Problems and Imaging 11 (2015) 601–621 . · doi ↗
- 2[2] M. Bruveris, P. Michor, and D. Mumford, “Geodesic Completeness for Sobolev Metrics on the Space of Immersed Plane Curves,” Forum of Mathematics, Sigma [electronic only] 2 (2013) . · doi ↗
- 3[3] E. Celledoni, M. Eslitzbichler, and A. Schmeding, “Shape Analysis on Lie Groups with Applications in Computer Animation,” Journal of Geometric Mechanics Vol. 8 No. 3 (2016), pp. 273-304 (2015) , ar Xiv:1506.00783 [math.DG] .
- 4[4] K.-T. Chen, “Iterated Integrals and Exponential Homomorphisms,” Proceedings of the London Mathematical Society s 3-4 no. 1, (1954) 502–512 . · doi ↗
- 5[5] K.-T. Chen, “Integration of Paths–A Faithful Representation of Paths by Noncommutative Formal Power Series,” Transactions of the American Mathematical Society 89 no. 2, (1958) 395–407. http://www.jstor.org/stable/1993193 .
- 6[6] I. Chevyrev, V. Nanda, and H. Oberhauser, “Persistence paths and signature features in topological data analysis,” IEEE Transactions on Pattern Analysis and Machine Intelligence (2018) . To appear. · doi ↗
- 7[7] CMU Graphics Lab, “CMU Graphics Lab Motion Capture Database,” url visit date: 2018-12-10. http://mocap.cs.cmu.edu/ .
- 8[8] J. Hilgert and K.-H. Neeb, Structure and Geometry of Lie Groups . Springer-Verlag New York, 2012.
