Complete set of translation invariant measurements with Lipschitz bounds
Jameson Cahill, Andres Contreras, Andres Contreras Hip

TL;DR
This paper constructs low-dimensional, stable, and invariant signal representations under finite group actions, with explicit Lipschitz bounds, addressing stability and discriminability issues in invariant signal processing.
Contribution
It introduces a novel method to create low-dimensional, complete, and Lipschitz-invariant representations for signals under finite unitary group actions, using algebraic geometry tools.
Findings
Constructed invariant representations with explicit Lipschitz bounds.
Established existence of complete $ ext{Z}_m$-invariant representations for any $m$.
Provided a stable, discriminative transform applicable to signal classification.
Abstract
In image and audio signal classification, a major problem is to build stable representations that are invariant under rigid motions and, more generally, to small diffeomorphisms. Translation invariant representations of signals in are of particular importance. The existence of such representations is intimately related to classical invariant theory, inverse problems in compressed sensing and deep learning. Despite an impressive body of litereature on the subject, most representations available are either: i) not stable due to the presence of high frequencies; ii) non discriminative; iii) non invariant when projected to finite dimensional subspaces. In the present paper, we construct low dimensional representations of signals in that are invariant under finite unitary group actions, as a special case we establish the existence of low-dimensional and completeβ¦
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Complete set of translation invariant measurements with Lipschitz bounds
Jameson Cahill and Andres Contreras and Andres Contreras Hip
Department of Mathematical Sciences, New Mexico State University, Las Cruces, New Mexico, USA
Department of Mathematical Sciences, New Mexico State University, Las Cruces, New Mexico, USA
Department of Mathematical Sciences, New Mexico State University, Las Cruces, New Mexico, USA
Abstract.
In image and audio signal classification, a major problem is to build stable representations that are invariant under rigid motions and, more generally, to small diffeomorphisms. Translation invariant representations of signals in are of particular importance. The existence of such representations is intimately related to classical invariant theory, inverse problems in compressed sensing and deep learning. Despite an impressive body of litereature on the subject, most representations available are either: i) not stable due to the presence of high frequencies; ii) non discriminative; iii) non invariant when projected to finite dimensional subspaces. In the present paper, we construct low dimensional representations of signals in that are invariant under finite unitary group actions, as a special case we establish the existence of low-dimensional and complete -invariant representations for any . Our construction yields a stable, discriminative transform with semi-explicit Lipschitz bounds on the dimension; this is particularly relevant for applications. Using some tools from Algebraic Geometry, we define a high dimensional homogeneous function that is injective. We then exploit the projective character of this embedding and see that the target space can be reduced significantly by using a generic linear transformation. Finally, we introduce the notion of non-parallel map, which is enjoyed by our function and employ this to construct a Lipschitz modification of it.
Keywords. invariant theory, signal classification, stable representation
1. Introduction
One of the most important problems in machine learning and signal processing is the classification of visual and audio signals, i.e., given an equivalence relation on one would like to find a map with the property that if (and ideally only if) . One would also like to be as small as possible and for to be relatively easy to compute. It is a nontrivial task and the study of this an other related problems has sparked a wealth of exciting developments in mathematics in recent years [19, 3, 6, 5, 16] see also [25] and references therein for more on the history of these types of problems. The sophisticated tools created and used for this purpose, bridge several, seemingly unrelated areas of mathematics.
One type of equivalence relation that comes up in many instances is when the equivalence classes are given by the orbits of some group action, that is, a group acting on and whenever there is a for which . In all the applications considered in our study, we will assume that our action comes from a unitary representation of , i.e., we have a group homomorphism
[TABLE]
In this case we write for the space of orbits (this is a slight abuse of notation as this space depends on the specific representation of ) and the quotient metric on is given by (see Lemma 3.3.6 in [7])
[TABLE]
where denotes the orbit of under the action of .
By some elementary considerations, using small separating sets, one can show the existence of an injective map
[TABLE]
that distinguishes orbits. This is essentially a restatement of Proposition 5.1.1 in [14], see Section 2.1. While this looks very good in principle because it solves the classification problem with a small number of measurements, it is not good enough for applications. In general one would like to obtain maps that are not only injective but also stable in the sense that the distortion of the representation is globally controlled. In addition, one would like to construct measurements that are robust enough to preserve injectivity under small perturbations, due to noise for example. To be more precise, one would want a bi-Lipschitz map , that is a map for which there are constants so that
[TABLE]
for every
In this work we study the classification problem under finite unitary actions of signals in We obtain a representation in low dimensions, the first of its kind in a nontrivial setting. Its simplicity makes it a good option for applications. We first obtain some general results that apply to arbitrary unitary actions of finite groups before specializing to the important case of discrete translations (cyclic groups) where our results can be made much more explicit.
One of the motivations of our study is the problem of finding a representation of functions in that is invariant under translations. One such representation is given by the modulus of the Fourier transform; however, it is not injective and furthermore it is known to yield instabilities in the presence of high frequencies and it is for this reason not suitable for implementation. The numerous works that have tackled different aspects of this problem in recent years rely mostly on statistical learning techniques and as such they inherit some of their problems and limitiations.To develop a more satisfactory approach to classification of translation invariant signals in , Mallat introduced [19] a scattering transform that is invariant under translations but more importantly, comes with a global upper-Lipschitz bound (the scattering transform is non-expansive). To deal with the problem of instabilities created by the presence of high frequencies, Mallat tames the contributions of fine scale oscillations by using a wavelet-based convolutional network; a limiting procedure gives the desired object at infinite depth. Although the scattering transform provides a non-expansive map, invariant under general small diffeomorphisms, it is not discriminative and is not actually invariant in finite dimensions (invariance is only achieved in the limit, while for implementation one needs to cut off the process after a few iterations). In applications one always works in large, but finite dimensions and thus an alternative to the scattering transform is desirable, one that can be readily applied in this setting with few measurements and with a guarantee of no misclassification. The goal of our work is to provide a different approach to the group invariant classification problem in the relevant finite dimensional setting.
Of particular interest is the case where is a cyclic group. The representation of in this case will simply be the powers of some matrix that satisfies . Since the minimal polynomial of divides it follows that is diagonalizable and all of its eigenvalues are th roots of unity. To better understand complete translation invariant measurements in this setting, we further specialize to where the action is given by
[TABLE]
It is well known is diagonalized by the discrete Fourier transform , and so in the Fourier domain we have
[TABLE]
Theorem 1.1 guarantees the existence of a Lipschitz map that distinguishes orbits under this finite translation action with a number of measurements that is linear on the dimension.
Another important application that is closely related to the problem of phase retrieval is the case where where is an th root of unity, see Section 4.1.
A consequence of our study gives a stable and discriminative nonlinear transform that is -invariant. More precisely, we prove the following:
Theorem 1.1**.**
There is a -invariant map that induces an injective map and a constant depending only on such that
[TABLE]
for every
We provide more analytic and geometric information about the map in Section 4, in particular we give explicit bounds on the optimal Lipschitz constant
Constructing complete invariant representations with few measurements
As mentioned, in this paper we follow a different approach to build translation invariant representations. Our perspective hinges upon the use of polynomial measurements (as oposed to an infinite chain of convolution-taking modulus operations). The alternative we propose here has the following advantages
- β’
it lends itself well for applications; this is because the measurement and coding of signals is usually treated in very high but finite dimensional spaces. The calculations involved in the transform are very easy to handle and do not require complex mathematical or computational operations.
- β’
the number of measurements needed to differentiate signals in different orbits is linear in the number of dimensions.
- β’
the map we construct is actually -invariant, as opposed to approximately invariant. As such, it is an accurate representation at the chosen level of precision(scale).
- β’
the representation really solves the classification problem, it is truly injective.
- β’
not only is the map produced stable, but it comes with almost explicit bounds.
Our representation is not entirely constructive since the argument relies on choosing a linear map in the dimension reduction part and this is done using an abstract result. This in itself does not limit the range of applicability of the transform which works for a generic choice.
The construction of an initial large set of polynomial measurements that allow us to separate orbits follows from simple and intuitive observations. However, this is just the starting point as a map consisting of such monomials does not satisfy any of the other properties we ask of our representations.
To better illustrate the ideas and challenges behind the proof of Theorem 1.1, let us restrict ourselves to the case where is where is a prime number and the action is given by translation as in (1.4). We let be the modulation operator which is the diagonalization of the translation operator, that is, . In the Fourier domain we can define
[TABLE]
One can see that if and only if for some Indeed, suppose first that where and Then and there are such that where On the other hand, we have
[TABLE]
But then,
However, one can readily see that this map does not separate orbits, because if then are completely free. Also, since the measurements are polynomials, we cannot expect to have global Lipschitz bounds. To solve the separating problem, we can add a monomial for each pair of variables, but then we have measurements. We will see in Section 3 that by taking generic linear combinations of these measurements we can reduce the dimension dramatically. Then the problem becomes how to turn the resulting map into a Lipschitz one without losing the discriminative property and without adding more measurements. It is at this point that a geometric property we introduce (see definition 3.2) becomes crucial for building a stable transform. We combine this property satisfied by our polynomials together with the fact that our actions are unitary to reduce all our measurements to the unit sphere.
Although the maps we construct are injective, we show in Section 4.2 that their inverses are almost never Lipschitz. This of course does not rule out the possibility that other representations may come with a lower Lipschitz bound. However we believe such representations must be non-algebraic and therefore essentially different from ours.
The paper is organized as follows. In the second section we introduce some necessary algebraic background and we also discuss the connection with the well-known problem of phase retrieval. In Section 3 we prove some general results concerning the construction of effective low dimensional discriminative measurements under natural assumptions. In Section 4 we verify that the examples of interest for us satisfy the conditions of the theorems in Section 3 and we obtain explicit Lipschitz bounds for the resulting transforms. We finish the paper with a discussion on open questions and a few remarks in Section 5.
2. Background
2.1. Algebraic invariants
Given an action of a group on we denote by the ring of polynomials in that are invariant under this action. It is well known that this ring is finitely generated as a -algebra, however the generating set could be arbitrarily large. Nonetheless one might hope that given a generating set then the map given by
[TABLE]
would induce a map that is injective on . Unfortunately this turns out not to be true in general. In fact, an example for this is the multiplicative group of acting on via scalar multiplication [12], example 2.3.1.
As in [20], we define
[TABLE]
Then a set is said to be separating if whenever for all we have that . It is known that separating sets can exist which are much smaller than the number of generators of , in fact in [14] the following is proved in Theorem 5.1.1:
Theorem 2.1**.**
If acts on then a separating set of size at most exists.
Furthermore, in [12], section 2.3, it is noted that if is a finite group action, then
[TABLE]
We now specialize to the case where Without loss of generality we can assume that the action of is given by the powers of a diagonal matrix where is an th root of unity for every . It is proved in Theorem 5.2.1 in [14] that the following set of monomials is a separating set:
[TABLE]
where and is a primitive th root of unity, and is minimal such that there exists a with invariant. From the set of measurements (2.2), an invariant map defined by
[TABLE]
induces an explicit injective map . We will reduce the dimension of the target space by showing that a suitably generic linear map is injective when restricted to the image of In general, this should be the optimal dimension, but we know already that for specific examples this can be reduced even further to as can be seen from the special cases covered by [14], Proposition 5.2.2 (see also the discussion on subsection 2.2 about real phase retrieval being a particular case of this problem).
2.2. Phase retrieval
Phase retrieval is the problem of recovering a signal in (or ) up to a global phase factor from a collection of intensity measurements . This type of problem comes up in many applications and has a rich history, but has seen considerable interest in the last decade or so since the publication of [1]. To state the problem in the setting of this paper let denote the one dimensional torus and let act on by scalar multiplication. Then given any collection of vectors the mapping given by
[TABLE]
is invariant under the action of so we can consider the induced map whose domain is . The first problem in phase retrieval is to understand when this map is injective.
Let denote the space of Hermitian matrices and note that is a vector space over the real numbers (not the complex numbers) of dimension . The Hilbert-Schmidt inner product on is given by
[TABLE]
Now consider the map from to given by . First note that if and only if for some , so this map is injective on , and the image of this map is the set of positive rank one matrices in (which looks like ). Next observe that for
[TABLE]
Given a collection of vectors define the linear map given by
[TABLE]
and note that , so is injective on if and only if is injective when restricted to . If is not injective on then there are so that and so . From this observation it is straightforward to prove the following (see Lemma 9 in [4]):
Lemma 2.2**.**
* is injective on if and only if every nonzero matrix in has rank at least 3.*
Since the dimension of the set of rank at most 2 Hermitian matrices is this led the authors of [4] to conjecture that was necessary for to be injective. If this were taking place over the complex numbers this would follow directly from the Projective Dimension Theorem, in [11] the conjecture was proven for infinitely many values of , however in [23] a counterexample is constructed with and .
In most applications the measurements will never be exact and will be corrupted by noise of some form. Therefore we we would like the map to be not just injective but bi-Lipschitz as defined in (1.3) (here we use the quotient metric on ), however, it is shown in [4] that this can never be bi-Lipschitz in this sense. There are (at least) two ways of dealing with this situation. The first, as done in [4], is to modify the map to get a new map that is bi-Lipschitz. Another alternative which is explored in [2] is to replace the quotient metric with a different metric on with respect to which is bi-Lipschitz. In this paper we will encounter a similar situation where we will have an initial -invariant map which is injective but not Lipschitz. Our approach will be to exploit a geometric property of this map to produce a new map which is still injective but also Lipschitz.
We can also study phase retrieval over the real numbers where we replace with and with , which corresponds to the action on given by . In this case the analysis above is still valid, but for we have that is real and symmetric, and the entries of are precisely the monomials in (see (2.2)). Therefore, from this perspective real phase retrieval can be thought of as a very special case of the type of cyclic group actions that we consider in this paper.
3. Main results
In this section we present a series of abstract results that yield a discriminative and stable representation It can be seen in the next section that the hypotheses needed for the existence of such are satisfied by our objects of interest; we actually believe that the abstract framework here provided can find further applications in signal processing. The construction of the map rests on discriminative polynomial measurements in some possibly high dimensional space. These are later mapped to an dimensional space via a generic linear map. Still, this map is not necessarily Lipschitz so we appeal to its geometric properties to modify it so that it becomes Lipschitz while preserving injectivity. In the rest of this section we will present the steps described.
3.1. Dimension reduction
We already have that the polynomial map defined in (LABEL:F_T) is separating. We look at the problem of reducing the dimension, and getting a Lipschitz bound. This has to be done carefully because we need to reduce the dimension while still preserving injectivity and finally controlling the distortion. The next theorem reduces the dimension.
Theorem 3.1**.**
Let act on and suppose is a polynomial -invariant map such that the induced map is injective. Then for is injective for a generic linear map .
Proof.
First, since the components of are polynomials, we can write and
[TABLE]
where is a monomial of degree Then one of these monomials achieves maximum degree Now we produce a new map as follows:
[TABLE]
Note that is homogeneous and regular (it is a polynomial map), and so we know that is a projective variety. Therefore
[TABLE]
For a linear map if is not injective then there are so that but . We now claim that if then for a generic we have which would prove the theorem. To this end let
[TABLE]
(here is a class in projective space ). It is easy to see that is projective. Let us note that we can assume without loss of generality that is irreducible(if not, reason like below on each irreducible component). Also let be the projections of , that is:
[TABLE]
Then therefore, By [17] corollary 11.13 we know that if we take then
[TABLE]
On the other hand, we know
[TABLE]
This implies that
[TABLE]
Next observe that
[TABLE]
Therefore if then is injective for a generic . Finally,
[TABLE]
so we require
[TABLE]
which means β
The main idea in the proof of Theorem 3.1 is very similar to that of Lemma 2.2 in that we want to show that a generic linear map of appropriate rank is injective when restricted to a particular algebraic variety which means that the kernel of the linear map needs to avoid differences of pairs of vectors that are on the variety. The main difference between these two results is that the varieties under consideration in the phase retrieval case they have a lot of structure, in particular they are projective, whereas ours are not. On the other hand, phase retrieval takes place over the real numbers (even in the complex case) which complicates the use of certain tools from algebraic geometry. We note that variants of these types of arguments have been used in other recovery problems [24, 22, 11].
3.2. Non-parallel maps induce Lipschitz invariant representations
As we anticipated, we will discuss other general theorems that let us make modifications to turn a map into a Lipschitz one, provided the map we started with satisfies a geometric condition. To that end, we introduce the following concept:
Definition 3.2**.**
Suppose acts on and is -invariant. We say has the non-parallel property if the following holds: If and for some then for some .
Remark 3.3**.**
Note that if satisfies the non-parallel property, and is a linear map such that
[TABLE]
then also satisfies the non-parallel property.
The following will be used to get a candidate for a Lipschitz map.
Definition 3.4**.**
For any map we define by
[TABLE]
Proposition 3.5**.**
*Suppose acts on according to (1.1) and is a -invariant map. Then as defined in (3.1) satisfies
(a) is also -invariant,
(b) If has the non-parallel property and if the induced map is injective then the corresponding induced map is also injective.*
Proof.
Throughout the proof, we write instead .
(a) Note that if and then
[TABLE]
Where we have used our general assumption (1.1). Since is invariant, we have that
[TABLE]
So is invariant.
(b) Suppose are such that
[TABLE]
If then
[TABLE]
which implies that Now if we have that
[TABLE]
So in particular and are parallel. Since satisfies the non-parallel property, we have that for some . This implies that
[TABLE]
and since is injective we conclude which means is also injective. β
Remark 3.6**.**
Note that in the proof of (b) we really show that when has the non-parallel property then if and only if regardless of whether or not is injective.
In what follows, will denote the unit sphere in
Theorem 3.7**.**
Let be a unitary group acting on according to (1.1). Let be a -invariant map satisfying the non-parallel property. Assume is Then the map defined in (3.1) is Lipschitz. Furthermore, the optimal Lipschitz constant such that satisfies where
[TABLE]
Proof.
We will drop the subscript in order to ease the notation. Let If and then
[TABLE]
Now if then
[TABLE]
Since is bounded on the sphere, we have that
[TABLE]
This implies that
[TABLE]
By symmetry on and we have that
[TABLE]
We know that is differentiable when restricted to the unit sphere . Suppose Therefore we can use the mean value theorem to find
[TABLE]
Hence
[TABLE]
Then if we let
[TABLE]
we will have that
[TABLE]
If then
[TABLE]
from where we obtain
[TABLE]
This implies that
[TABLE]
Since is invariant, we know that for all we have that
[TABLE]
therefore
[TABLE]
This gives the Lipschitz bound. β
As mentioned in the introduction, an ideal representation should not only be injective but also control the distance between classes in terms of a lower Lipschitz bound. Because the representation we construct is made of linear combination of polynomials with no particular structure (as opposed to phase retrieval where the measurements take the particular form ), we cannot expect to have a lower Lipschitz bound at our level of generality. Moreover, we will see in the next section that when the map is not bi-Lipschitz.
In phase retrieval the map sends the measurements to for some and since the map is homogeneous of degree 2, one can take the square root of each component without destroying separation while making the map bi-Lipschitz. Our separating measurements are essentially complex valued-the phases contain essential information one cannot ignore, and so we cannot appeal to a similar idea.
4. Applications to cyclic groups
In this section, we discuss a few applications of the theorems in the previous section. Recall that in the case of we can assume the action is given by the powers of a diagonal matrix where each is an th root of unity.
Theorem 4.1**.**
Suppose we have the group acting on then we have an injective map
[TABLE]
Moreover, has a Lipschitz constant
This theorem will be a consequence of the following lemmas:
Lemma 4.2**.**
* defined in (2.3) satisfies the non-parallel property.*
Proof.
Suppose we have and such that
[TABLE]
Let be the first th root of unity and
[TABLE]
We will prove that Let Note that
[TABLE]
To prove that there are two cases:
Case 1: One of is not in
In this case, one of is [math] (by definition of ) and by the definition of one of is Therefore,
[TABLE]
Case 2: Both
In this case, we know We have and so
[TABLE]
Then Since there is a such that which implies that
[TABLE]
We know that which implies that
[TABLE]
and since we have
[TABLE]
Taking the modulus on both sides yields hence
[TABLE]
Therefore,
[TABLE]
which implies that Thus for some we have that
[TABLE]
Since T is unitary, we have that
[TABLE]
Since (in fact ), we know that is increasing in Hence This implies that Since is injective, This proves the non-parallel property.
β
Lemma 4.3**.**
The map where is the map defined in (2.3) and is any map satisfying the conclusion of Theorem 3.1, satisfies the non-parallel property. In particular, the map defined in (3.1) for this choice of induces an injective map Furthermore, satisfies the Lipschitz bound
[TABLE]
where is the operator norm of
Proof.
Combining remark 3.3, theorem 3.1, and proposition 4.2, we obtain that satisfies the non-parallel property. Now, using proposition 3.5, we get
[TABLE]
β
Lemma 4.4**.**
If is defined as in (3.1), then
[TABLE]
Proof.
Note that
[TABLE]
But if then
[TABLE]
Since
[TABLE]
we have that
[TABLE]
β
Proof of Theorem 4.1.
We start with the map given by (2.3). It is proven that this map is separating in [14], chapter 5. By Lemma 4.2, it satisfies the non-parallel property. We can apply Theorem 3.1 to reduce the dimension of the target space to For a generic we have that
[TABLE]
is a separating map. Now, by Remark 3.3, we have that also satisfies the non-parallel property. Hence we can use Theorem 3.7 for and we obtain an injective map with a Lipschitz bound:
[TABLE]
where
[TABLE]
Using Lemma 4.4, we directly deduce that
[TABLE]
β
Before proceeding we make a few remarks.
Since the set of invariants we use in this case are monomials of the form then it could make sense to arrange these in an matrix. In fact, since we only have one monomial for each we can choose to make this matrix triangular, symmetric, or Hermitian if we so desire. As such this problem fits naturally into the broader context of matrix recovery problems. In this paper we do not need to exploit this point of view as our proofs do not benefit from it.
We now return to the original motivation of this work, which was to study translation invariant measurements on . In this case acts on via translation as in (1.4). In this case the matrix is not diagonal, but it is diagonalizable by the Fourier transform. In the Fourier domain our action is given by powers of the modulation matrix
[TABLE]
One of the main motivations that Mallat cites in [19] for the development of the scattering transform is that although the modulus of the Fourier transform is translation invariant, it is not stable. In our context by taking the modulus of the first entries of , i.e., the measurements of the form , we can recover the modulus of the Fourier transform, but even these are not enough to get injectivity which is why we need the rest of the measurements.
4.1. The homogeneous case
In this subsection we will make a few remarks about the case when is homogeneous. We begin by showing that this only happens when for some root of unity . Note that the case which corresponds to real phase retrieval is a special case of this.
Proposition 4.5**.**
* is homogeneous if and only if for some th root of unity .*
Proof.
If then
[TABLE]
and so is homogeneous of degree .
Conversely, suppose is homogeneous of degree and . We readily see that each must be a primitive th root of unity, if not we would have at least on pair of monomials of the form and with . Now observe that the monomial must satisfy . Then, since and are both primitive th roots of unity there is a so that . Now by the division algorithm and the minimality of we see that and . This implies and therefore . β
Note that in the proof of Theorem 3.1 we needed to introduce a new variable to homogenize the map , however when this is not necessary. This means we can slightly improve the conclusion of Theorem 1.1 as follows:
Theorem 4.6**.**
Let be an -th root of unity and let act on via . Then there exists a satisfying the same conclusions as Theorem 4.1.
As we noted above we can arrange our monomials into an matrix. In this case one such matrix would be
[TABLE]
where denotes the vector whose components are the components of raised to the power . Note that this is a rank one matrix. This means that just as in the case of phase retrieval, any linear map that has the property that every nonzero matrix in has rank at least 3 will satisfy the conclusion of Theorem 4.6. Therefore we can use such an to obtain the in theorem 4.1. However, the set of rank (at most) two matrices is a determinantal variety and is well known to have dimension , so by the Projective Dimension Theorem any linear map whose kernel avoids this variety must have rank at least whereas Theorem 3.1 says a generic linear map of rank will yield the desired conclusion. This is because we only need to avoid rank two matrices of the form which is a much smaller subvariety.
4.2. No lower Lipschitz bounds
A natural question is whether or not the map satisfies a lower Lipschitz bound. We show that we can not expect this to be the case in general. The next proposition shows, is never Lipschitz when is cyclic and
Proposition 4.7**.**
Let Let be a representation of acting on Then
[TABLE]
Proof.
Recall the definition of the map (2.3). Without loss of generality we may assume that for all Since we see that there is a pair with Again, without loss of generality, we assume is such a pair and that
Let and for small consider the points We know that for sufficiently small and therefore
[TABLE]
On the other hand, note that is equal to on the unit sphere by definition. Also, any nonzero component of is a linear combination of If is any such component, then
[TABLE]
which is because This implies that
[TABLE]
The conclusion follows from (4.4) and (4.5).
β
To illustrate the previous proposition, consider the example 5.2.1. in [14]: there acts on and one has the explicit given by:
[TABLE]
In this example we can take and
Although is not Lipschitz, it is always continuous.
Proposition 4.8**.**
Under the same hypotheses of Proposition 3.5, the map is continuous.
Proof.
Let and be such that Because is bounded away from zero on the sphere, we see that is bounded. Therefore, up to subsequence (not relabelled) converges to some By continuity of we see that which then implies We have shown that every subsequence of possesses a subsequence converging to thus the whole sequence converges to
β
5. Conclusions, open questions and final remarks
As it can be seen from our analysis, effective -invariant representations can be built in finite dimensions by exploiting underlying algebraic and geometric properties of polynomial invariants. Though our transforms are general enough to cover many problems of interest, there are still some natural open questions regarding the construction of complete measurements in the finite dimensional setting.
A first question has to do with the use of polynomial invariants. In the end our transform does not preserve any of the features or the the algebraic structure of the maps leading up to it, so it would be very enlightening to explore ways to bypass the use of the invariant polynomials, and to appeal to the geometric non-parallel property to produce the final transform. A more analytical and more flexible approach would probably yield embeddings into even lower dimensions with much more explicit controls.
About the dimension reduction as we perform it here, it is certainly worth studying how to make a constructive choice of a linear map than that provided by Theorem 3.1. Even though our maps do not have a lower Lipschitz bound the choice of should play a significant role in any computation of the inverse map. More specifically, since we know that must avoid nonzero vectors that are differences of elements of in order for to be injective, then it seems intuitively clear that having these vectors βbounded awayβ from in some sense should provide numerical stability. There are a variety of ways we could define βbounded awayβ in this context. We expect properties similar to the nullspace property [13] and the restricted isometry property [9] to be of use here.
Another question that has to do with the algebraic approach employed here is how to treat more general group actions. Our assumption that is a finite group is essential for the separating polynomials to be actually separating in the sense that they discriminate orbits (see section 2.1 for more details). Also, another assumption that one should try to do without is that the action is unitary. This assumption is certainly convenient due to the handy representation and particular set of separating monomials, but it also plays a role in the rest of the construction and it affects the Lipschitz bound in a significant way because without it would not be invariant. Thus, it should be clear that a different perspective is needed to treat more general situations.
One salient desired property that is absent in our construction is a quantitative control of the injectivity of for example our map does not come with a lower Lipschitz bound. Of course, this could be a matter of our use of polynomial invariants, but it could be that a much more delicate problem is at hand. We believe that even if complete sets of measurements could be constructed that make use of other types of invariants that would have a lower Lipschitz constant, the bounds obtained could be very bad and would probably go to [math] as This is known to be the case in phase retrieval [8].
Finally, one of the motivations behind this work was to provide an alternative to the scattering transform of Mallat [19] and the strategy preferred in the works [6, 5, 21, 18, 15, 10] based on neural networks; our main goal was to provide an approach better adapted to deal with finite dimensional problems. In [19] the transform obtained is non expansive, that is the Lipschitz constant is equal to This is equivalent to, in our setting, having a bound independent of the dimension. We note that in principle, we could obtain non expansive maps simply by normalizing (which depends on ) by the corresponding Lipschitz constant (which also depends on ). The challenge becomes then to understand the possible limits of these normalized transforms as and how they relate to the scattering transform. The authors anticipate studying some of these problems in the future.
Acknowledgements. The work of A. Contreras was partially supported by a grant from the Simons Foundation # 426318.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Radu Balan, Pete Casazza, and Dan Edidin. On signal reconstruction without phase. Applied and Computational Harmonic Analysis , 20(3):345β356, 2006.
- 2[2] Radu Balan and Dongmian Zou. On lipschitz analysis and lipschitz synthesis for the phase retrieval problem. Linear Algebra and its Applications , 496:152β181, 2016.
- 3[3] Afonso S Bandeira, Ben Blum-Smith, Joe Kileel, Amelia Perry, Jonathan Weed, and Alexander S Wein. Estimation under group actions: recovering orbits from invariants. ar Xiv preprint ar Xiv:1712.10163 , 2017.
- 4[4] Afonso S Bandeira, Jameson Cahill, Dustin G Mixon, and Aaron A Nelson. Saving phase: Injectivity and stability for phase retrieval. Applied and Computational Harmonic Analysis , 37(1):106β125, 2014.
- 5[5] Michael M Bronstein, Joan Bruna, Yann Le Cun, Arthur Szlam, and Pierre Vandergheynst. Geometric deep learning: going beyond euclidean data. IEEE Signal Processing Magazine , 34(4):18β42, 2017.
- 6[6] Joan Bruna and StΓ©phane Mallat. Invariant scattering convolution networks. IEEE transactions on pattern analysis and machine intelligence , 35(8):1872β1886, 2013.
- 7[7] Dmitri Burago and Yuri Burago. A course in metric geometry .
- 8[8] Jameson Cahill, Peter Casazza, and Ingrid Daubechies. Phase retrieval in infinite-dimensional hilbert spaces. Transactions of the American Mathematical Society, Series B , 3(3):63β76, 2016.
