Isomorphism problems for tensors, groups, and cubic forms: completeness and reductions
Joshua A. Grochow, Youming Qiao

TL;DR
This paper establishes the computational equivalence of various isomorphism problems involving tensors, groups, and forms, introduces the concept of TI-complete problems, and provides new reductions and algorithms for group isomorphism.
Contribution
It proves the polynomial-time equivalence of multiple isomorphism problems, introduces TI-complete class, and presents novel reductions and search-to-decision algorithms for group isomorphism.
Findings
All isomorphism problems are polynomial-time equivalent.
d-tensor isomorphism reduces to 3-tensor isomorphism for d ≥ 3.
A new search-to-decision reduction for p-group isomorphism of class 2.
Abstract
In this paper we consider the problems of testing isomorphism of tensors, -groups, cubic forms, algebras, and more, which arise from a variety of areas, including machine learning, group theory, and cryptography. These problems can all be cast as orbit problems on multi-way arrays under different group actions. Our first two main results are: 1. All the aforementioned isomorphism problems are equivalent under polynomial-time reductions, in conjunction with the recent results of Futorny-Grochow-Sergeichuk (Lin. Alg. Appl., 2019). 2. Isomorphism of -tensors reduces to isomorphism of 3-tensors, for any . Our results suggest that these isomorphism problems form a rich and robust equivalence class, which we call Tensor Isomorphism-complete, or TI-complete. We then leverage the techniques used in the above results to prove two first-of-their-kind results for Group…
| Font | Object | Space of objects |
|---|---|---|
| matrix | or | |
| matrix tuple | or | |
| matrix space | [Subspaces of or ] | |
| 3-way array |
| Notation | Name | Group Action | ||
|---|---|---|---|---|
|
||||
|
||||
| Matrix Space Conjugacy | ||||
| Trilinear Form Equivalence | ||||
| Algebra Isomorphism |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Isomorphism problems for tensors, groups,
and cubic forms: completeness and reductions
Joshua A. Grochow 111Departments of Computer Science and Mathematics, University of Colorado, Boulder. [email protected]
Youming Qiao 222Centre for Quantum Software and Information, University of Technology Sydney. [email protected]
Abstract
In this paper we consider the problems of testing isomorphism of tensors, -groups, cubic forms, algebras, and more, which arise from a variety of areas, including machine learning, group theory, and cryptography. These problems can all be cast as orbit problems on multi-way arrays under different group actions. Our first two main results are:
All the aforementioned isomorphism problems are equivalent under polynomial-time reductions, in conjunction with the recent results of Futorny–Grochow–Sergeichuk (Lin. Alg. Appl., 2019). 2. 2.
Isomorphism of -tensors reduces to isomorphism of -tensors, for any .
All but one of the reductions for the preceding contributions work over arbitrary fields. Together they suggest that the aforementioned isomorphism problems form a rich and robust equivalence class, which we call Tensor Isomorphism-complete, or TI-complete for short. Furthermore, this provides a unified viewpoint on these hard isomorphism testing problems arising from a variety of areas.
We then leverage the techniques used in the above results to prove two first-of-their-kind results for Group Isomorphism (GpI):
We give a reduction from testing isomorphism of -groups of exponent and small class () to isomorphism of -groups of exponent and class 2. The latter are widely believed to be the hardest cases of GpI, but as far as we know, this is the first reduction from any more general class of groups to this class. 2. 4.
We give a search-to-decision reduction for isomorphism of -groups of exponent and class in time . While search-to-decision reductions for Graph Isomorphism (GI) have been known for more than 40 years, as far as we know this is the first non-trivial search-to-decision reduction in the context of GpI.
Our main technique for (1), (3), and (4) is a linear-algebraic analogue of the classical graph coloring gadget, which was used to obtain the search-to-decision reduction for GI. This gadget construction may be of independent interest and utility. The technique for (2) gives a method for encoding an arbitrary tensor into an algebra.
1 Introduction
Isomorphism problems in light of Babai’s breakthrough on Graph
Isomorphism.
In late 2015, Babai presented a quasipolynomial-time algorithm for Graph Isomorphism (GI) [Bab16]. This is widely regarded as one of the major breakthroughs in theoretical computer science of the past decade. Indeed, GI has been at the heart of complexity theory nearly since its inception: both Cook and Levin were thinking about GI when they defined [AD17, Sec. 1], Graph (Non-)Isomorphism played a special role in the creation of the class [Bab85, GMR85, BM88], and it still stands today as one of the few natural candidates for a problem that is “-intermediate,” that is, in , but neither in nor -complete [Lad75] (see [Exc] for additional candidates). Beyond its practical applications (e. g., [SV17, Irn05] and references therein) and its naturality, part of its fascination comes from its universal property: GI is universal for isomorphism problems for “explicitly given” structures [ZKT85, Sec. 15], that is, first-order structures on a set where, e. g., a -ary relation on is given by listing out a subset .
In light of Babai’s breakthrough on GI [Bab16], it is natural to consider “what’s next?” for isomorphism problems. That is, what isomorphism problems stand as crucial bottlenecks to further improvements on GI, and what isomorphism problems should naturally draw our attention for further exploration? Of course, one of the main open questions in the area remains whether or not GI is in . Babai [Bab16, arXiv version, Sec. 13.2 and 13.4] already lists several isomorphism problems for further study, including Group Isomorphism, Linear Code Equivalence, and Permutation Group Conjugacy. In this paper we expand this list in what we argue is a very natural direction, namely to isomorphism problems for multi-way arrays, also known as tensors.333There have been some disputes on the terminologies; see the preface of [Lan12]. Our approach is to use multi-way arrays as the basic underlying object, and to use tensors as the multi-way arrays under a certain group action.
Group actions on 3-way arrays.
3-way arrays are simply arrays with 3 indices, generalizing the case of matrices (=2-way arrays). In this paper we consider entries of the arrays being from a field , so a 3-way array is just , , , , and .
Let be the general linear group of degree over , and let denote the set of matrices. There are three natural group actions on : for , (1) sends to , (2) sends to , and (3) sends to . These three actions then endow with different algebraic/geometric interpretations: (1) a linear map from a vector space to another vector space , (2) a linear map from to itself, and (3) a bilinear map from to .
Likewise, 3-way arrays , , can be naturally acted by in one way, by in two different ways, and by in two different ways. These five actions endow various families of 3-way arrays with different algebraic/geometric meanings, including 3-tensors, bilinear maps, matrix (associative or Lie) algebras, and trilinear forms (a.k.a. non-commutative cubic forms). (See Sec. 2 for detailed explanations.) Over finite fields, the associated isomorphism problems are in , following the essentially same protocol as for GI.
With these group actions in mind, 3-way arrays capture a variety of important structures in several mathematical and computational disciplines. They arise naturally in quantum mechanics (states are described by tensors), the complexity of matrix multiplication (matrix multiplication is described by a tensor, and its algebraic complexity is essentially its tensor rank), the Geometric Complexity Theory approach [Mul11] to the Permanent versus Determinant Conjecture [Val79] (tensors describe the boundary of the determinant orbit closure, e. g., [Lan12, Sec. 13.6.3] and [Gro12b, Sec. 3.5.1] for introductions, and [HL16, Hüt17] for applications), data analysis [KB09], machine learning [PSS18], computational group theory [LQ17, BMW18], and cryptography [Pat96, JQSY19].
Main results.
The five natural actions on 3-way arrays mentioned above each lead to a different isomorphism problem on 3-way arrays; we discuss these problems and their interpretations in Sec. 2. Our first main result, Thm. A, shows that these isomorphism problems for 3-way arrays are all equivalent under polynomial-time reductions. Due to the algebraic or geometric interpretations, these problems are further equivalent to isomorphism problems on certain classes of groups, cubic forms, trilinear forms (a.k.a. non-commutative cubic forms), associative algebras, and Lie algebras. One consequence of these results (Cor. P), along with those of [FGS19], is a reduction from GpI for -groups of exponent and class to GpI for -groups of exponent and class 2. Although the latter have long been believed to be the hardest cases of GpI, as far as we are aware, this is the first reduction from a more general class of groups to this class.
Although these equivalences may have been expected by some experts, it had not been immediately clear to us for some time during this project. To get a sense for the non-obviousness, let us postulate the following hypothetical question. Recall that two matrices are called equivalent if there exists such that , and they are conjugate if there exists such that . Can we reduce testing Matrix Conjugacy to testing Matrix Equivalence? Of course since they are both in there is a trivial reduction; to avoid this, let us consider only reductions which send a matrix to a matrix such that and are conjugate iff and are equivalent. Nearly all reductions between isomorphism problems that we are aware of have this form (so-called “kernel reductions” [FG11]; cf. functorial reductions [Bab14]). After some thought, we realize that this is essentially impossible. The reason is that the equivalence class of a matrix is completely determined by its rank, while the conjugacy class of a matrix is determined by its rational canonical form. Among matrices there are only equivalence classes, but there are at least rational canonical forms (coming from the choice of minimal polynomial/companion matrix). Even when is a finite field, such a reduction would thus require an exponential increase in dimension, and when is infinite, such a reduction is impossible (regardless of running time).
Nonetheless, one of our results is that for spaces of matrices (one form of 3-way arrays), conjugacy testing does indeed reduce to equivalence testing! This is in sharp contrast to the case of single matrices. In the above setting, it means that there exists a polynomial-time computable map from to subspaces of , such that are conjugate up to a scalar if and only if are equivalent as matrix spaces. Such a reduction may not be clear at first sight.
Our second main result reduces TI to 3TI, for any fixed . From one viewpoint, this can be seen as a linear algebraic analogue of the now-classical reduction from -uniform Hypergraph Isomorphism to GI (e. g., [ZKT85]). However, as the reader will see, the reduction here is quite a bit more involved, using quiver algebras and the Wedderburn–Mal’cev Theorem on complements of the Jacobson radical in associative algebras. From another viewpoint, this can be seen as a step towards showing that 3TI is not only universal among isomorphism problems on 3-way arrays [FGS19], but perhaps 3TI is already universal for isomorphism problems on -way arrays for any ; see Sec. 10.1. These first two results indicate the robustness and naturality of the notion of -completeness.
Our next set of results reduce Graph Isomorphism and Linear Code Equivalence to these isomorphism problems for 3-way arrays (Sec. 3.2). This shows that these isomorphism problems for 3-way arrays form a set of potentially harder problems than these two problems, as also supported by the current difference in their practical difficulties.444There is a heuristic algorithm for Linear Code Equivalence by Sendrier [Sen00], which is practically effective in many cases, though for self-dual codes it reverts to an exponential search. It currently seems unlikely to us that either Graph Isomorphism or Code Equivalence is -complete.
Finally, our third main contribution is to show a search-to-decision reduction for these tensor problems (Thm. C), which may be of independent interest, leveraging our technique from above. While such a reduction has long been known for GI, for Group Isomorphism in general this remains a long-standing open question. Our techniques allow us to give a search-to-decision reduction for isomorphism of -groups of class 2 and exponent in time in the model of matrix groups over finite fields. This group class is widely regarded to be the hardest cases of Group Isomorphism. As far as we know, this is the first non-trivial search-to-decision reduction for testing isomorphism of a class of finite groups.
Implications of main results for practical
computations.
Our first main result may partly help to explain the difficulties from various areas when dealing with these isomorphism problems. There is currently a significant difference between isomorphism problems for 3-way arrays and that for graphs. Namely, in sharp contrast to Graph Isomorphism—for which very effective practical algorithms have existed for some time [McK80, MP14]—the problems we consider here all still pose great difficulty even on relatively small examples in practice. Indeed, such problems have been proposed to be difficult enough for cryptographic purposes [Pat96, JQSY19]. As further evidence of their practical difficulty, current algorithms implemented for Alternating Matrix Space Isometry 555An matrix over is alternating if for every , . When is not of characteristic , this is equivalent to the skew-symmetry condition.—a problem we show is -complete—can handle the cases when the 3-way array is of size over , but absolutely not for 3-way arrays of size , even though in this case the input can still be stored in only a few megabytes.666We thank James B. Wilson, who maintains a suite of algorithms for -group isomorphism testing, for communicating this insight to us from his hands-on experience. We of course maintain responsibility for any possible misunderstanding, or lack of knowledge regarding the performance of other implemented algorithms. In [PSS18], motivated by machine learning applications, computations on one -complete problem were performed in Macaulay2 [GS], but these could not go beyond small examples either. Our results imply that the complexities of these problems arising in many fields—from computational group theory to cryptography to machine learning—are all equivalent.
Isomorphism problems for 3-way arrays as a bottleneck for graph
isomorphism.
In addition to their many incarnations and practical uses mentioned above, the isomorphism problems we consider on 3-way arrays can be further motivated by their relationship to GI. Specifically, these problems both form a key bottleneck to putting GI into , and pose a great challenge for extending techniques used to solve GI.
Isomorphism problems for 3-way arrays stand as a key bottleneck to put GI in . This is because, as Babai pointed out [Bab16], Group Isomorphism is a key bottleneck to putting GI into . Indeed, the current-best upper bounds on these two problems are now quite close: for Group Isomorphism (originally due to [FN70, Mil78]777Miller attributes this to Tarjan., with improved constants [Wil14, Ros13a, Ros13b]), and for GI [Bab16] (see [HBD17] for calculation of the exponent). Within Group Isomorphism, it is widely regarded, for several reasons (e. g., [Bae38, Hig60, Ser77, Wil15]), that the bottleneck is the class of -groups of class 2 and exponent (i.e., is abelian and for all , odd). Then 3-way arrays enter the picture by Baer’s Correspondence [Bae38], which shows that the isomorphism problem for these groups is equivalent to telling whether two linear spaces of skew-symmetric matrices over are equivalent up to transformations of the form . This is the Alternating Matrix Space Isometry problem, which we show in this paper is -complete.888Because of the difference in verbosity of inputs, solving Group Isomorphism for this class of groups in time is equivalent to solving Alternating Matrix Space Isometry in time for matrix spaces of dimension over . The current state of the art is , which corresponds to the nearly-trivial upper bound of on Group Isomorphism.
To see why the techniques for GI face great difficulty when dealing with isomorphism problems for multi-way arrays, recall that most algorithms for GI, including Babai’s [Bab16], are built on two families of techniques: group-theoretic, and combinatorial. One of the main differences is that the underlying group action for GI is a permutation group acting on a combinatorial structure, whereas the underlying group actions for isomorphism problems for 3-way arrays are matrix groups acting on (multi)linear structures.
Already in moving from permutation groups to matrix groups, we find many new computational difficulties that arise naturally in basic subroutines used in isomorphism testing. For example, the membership problem for permutation groups is well-known to be efficiently solvable by Sims’s algorithm [Sim78] (see, e. g., [Ser03] for a textbook treatment), while for matrix groups this was only recently shown to be solvable with a number-theoretic oracle over finite fields of odd characteristic [BBS09]. Correspondingly, when moving from combinatorial structures to (multi)linear algebraic structures, we also find severe limitation on the use of most combinatorial techniques, like individualizing a vertex. For example, it is quite expensive to enumerate all vectors in a vector space, while it is usually considered efficient to go through all elements in a set. Similarly, within a set, any subset has a unique complement, whereas within , a subspace can have up to complements.
Given all the differences between the combinatorial and linear-algebraic worlds, it may be surprising that combinatorial techniques for Graph Isomorphism can nonetheless be useful for Group Isomorphism. Indeed, guided by the postulate that alternating matrix spaces can be viewed as a linear algebraic analogue of graphs, Li and the second author [LQ17] adapted the individualisation and refinement technique, as used by Babai, Erdős and Selkow [BES80], to tackle Alternating Matrix Space Isometry over . This algorithm was recently improved [BGL*+*19]. However, this technique, though helpful to improve from the brute-force time, seems still limited to getting -time algorithms.
New techniques.
Our first new technique for the above results on 3-way arrays is to develop a linear-algebraic analogue of the coloring gadget used in the context of Graph Isomorphism (see, e. g., [KST93]). These gadgets help us to restrict to various subgroups of the general linear group. Recall that, in relating GI with other isomorphism problems, coloring is a very useful idea. Given a graph , a coloring of vertices is a function where is a set of “colors.” Colored isomorphism between two vertex-colored graphs asks only for isomorphisms that send vertices of one color to vertices of that same color. If we are interested in making a specific vertex special (“individualizing” that vertex), we can assign this vertex a unique color. To reduce Colored Graph Isomorphism to ordinary Graph Isomorphism uses certain gadgets, and we adapt this idea to the context of 3-way arrays. We note that [FGS19] construct a related such gadget. In this paper, we develop a new gadget which we use both by itself, and in combination with the gadget from [FGS19] (albeit in a new context), see Sec. 4 and Sec. 7.
Our second new technique, used to show the reduction from TI to 3TI, is a simultaneous generalization of our reduction from 3TI to Algebra Isomorphism and the technique Grigoriev used [Gri81] to show that isomorphism in a certain restricted class of algebras is equivalent to GI. In brief outline: a 3-way array specifies the structure constants of an algebra with basis via , and this is essentially how we use it in the reduction from 3TI to Algebra Isomorphism. For arbitrary , we would like to similarly use a -way array to specify how -tuples of elements in some algebra multiply. The issue is that for to be an algebra, our construction must still specify how pairs of elements multiply. The basic idea is to let pairs (and triples, and so on, up to -tuples) multiply “freely” (that is, without additional relations), and then to use to rewrite any product of generators as a linear combination of the original generators. While this construction as described already gives one direction of the reduction (if , then ), the other direction is trickier. For that, we modify the construction to an algebra in which short products (less than generators) do not quite multiply freely, but almost. After the fact, we found out that this construction generalizes the one used by Grigoriev [Gri81] to show that GI was equivalent Algebra Isomorphism for a certain class of algebras (see Sec. 4 for a comparison).
Organization.
We aim to reach as wide an audience as possible, so we start with a detailed introduction to the various isomorphism problems on 3-way arrays, and their algebraic and geometric interpretations in Sec. 2. We then describe our results in detail in Sec. 3 and consider related work in Sec. 4. An illustration of the key technique is in Sec. 5. These sections may be viewed as an extended abstract.
The remainder of the paper gives detailed proofs of all results. Sec. 6 contains additional preliminaries. In Sec. 7, we present those reductions which use the linear-algebraic coloring technique, thus proving Thm. A(2) and Thm. C. We then finish the proof of Thm. A by presenting the remaining reductions in Sec. 8. Thm. B is proved in Sec. 9. In Sec. 10, we put forward a theory of universality for basis-explicit linear structures, in analogy with [ZKT85]. While not yet complete, this seems to provide another justification for studying Tensor Isomorphism and related problems, and it motivates some interesting open questions. In Appendix A we give a reduction from Cubic Form Equivalence to Degree- Form Equivalence for any (for this is easy; for it requires some work).
2 Preliminaries: Group actions on 3-way arrays
The formulas for most natural group actions on 3-way arrays are somewhat unwieldy; our experience suggests that they are more easily digested when presented in the context of some of the natural interpretations of 3-way arrays as mathematical objects. To connect the interpretations with the formulas themselves, one technical tool is very useful, namely, given a 3-way array , we define its frontal slices to be the matrices defined by ; that is, we think of the box of as arranged so that the and axes lie in the page, while the -axis is perpendicular to the page. Similarly, its lateral slices (viewing the 3D box of “from the side”) are defined by . An 3-way array thus has frontal slices and lateral slices.
A natural action on arrays of size is that of by change of basis in each of the 3 directions, namely . We will see several interpretations of this action below.
3-tensors.
A 3-way array , where , , and , is naturally identified as a vector in . Letting denote the th standard basis vector of , a standard basis of is . Then represents the vector in . The natural action by above corresponds to changes of basis of the three vector spaces in the tensor product. The problem of deciding whether two 3-way arrays are the same under this action is called 3-Tensor Isomorphism.999Some authors call this Tensor Equivalence; we use “Isomorphism” both because this is the natural notion of isomorphism for such objects, and because we will be considering many different equivalence relations on essentially the same underlying objects.
Matrix spaces.
Given a 3-way array , it is natural to consider the linear span of its frontal slices, , also called a matrix space. One convenience of this viewpoint is that the action of becomes implicit: it corresponds to change of basis within the matrix space . This allows us to generalize the three natural equivalence relations on matrices to matrix spaces: (1) two matrix spaces and are equivalent if there exists such that , where ; (2) two matrix spaces are conjugate if there exists such that ; and (3) they are isometric if . The corresponding decision problems, when is given by a basis , are Matrix Space Equivalence, Matrix Space Conjugacy, and Matrix Space Isometry, respectively.
Nilpotent groups.
If are two subsets of a group , then denotes the subgroup generated by all elements of the form , for . The lower central series of a group is defined as follows: , . A group is nilpotent if there is some such that ; the smallest such is called the nilpotency class of , or sometimes just “class” when it is understood from context. A finite group is nilpotent if and only if it is the product of its Sylow subgroups; in particular, all groups of prime power order are nilpotent.
Bilinear maps, finite groups, and systems of polynomials.
While the matrix space viewpoint has the merit of drawing an analogy with the more familiar object of matrices, other interpretations lead to standard complexity problems that may be more familiar to some readers. For example, from an 3-way array , we can construct a bilinear map (=system of bilinear forms) , sending to , where the are the frontal slices of .101010In this paper elements in are column vectors. The group action defining Matrix Space Equivalence is equivalent to the action of on such bilinear maps.
When , the action in Matrix Space Isometry is equivalent to the natural action of on such bilinear maps. Two bilinear maps that are essentially the same up to such basis changes are sometimes called pseudo-isometric [BW12].
Bilinear maps of the form turn out to arise naturally in group theory and algebraic geometry. When are skew-symmetric over , an odd prime, Baer’s correspondence [Bae38] gives a bijection between finite -groups of class 2 and exponent , that is, in which for all and in which , and their corresponding bilinear maps , given by . Two such groups are isomorphic if and only if their corresponding bilinear maps are pseudo-isometric, if and only if, using the matrix space terminology, the matrix spaces they span are isometric. When are symmetric, by the classical correspondences between symmetric matrices and homogeneous quadratic forms, a symmetric bilinear map naturally yields a quadratic map from to . The two quadratic maps are isomorphic if and only if the corresponding bilinear maps are pseudo-isometric.
Cubic forms & trilinear forms.
From a 3-way array we can also construct a cubic form (=homogeneous degree 3 polynomial) , where are formal variables. If we consider the variables as commuting—or, equivalently, if is symmetric, meaning it is unchanged by permuting its three indices—we get an ordinary cubic form; if we consider them as non-commuting, we get a trilinear form (or “non-commutative cubic form”). In either case, the natural notion of isomorphism here comes from the action of on the variables , in which transforms the preceding form into . In terms of 3-way arrays, we get . The corresponding isomorphism problems are called Cubic Form Equivalence (in the commutative case) and Trilinear Form Equivalence.
Algebras.
We may also consider a 3-way array , , as the structure constants of an algebra (which need not be associative, commutative, nor unital), say with basis , and with multiplication given by , and then extended (bi)linearly. Here the natural notion equivalence comes from the action of by change of basis on the . Despite the seeming similarity of this action to that on cubic forms, it turns out to be quite different: given , let ; then we have . Thus becomes . The inverse in the third factor here is the crucial difference between this case and that of cubic or trilinear forms above, similar to the difference between matrix conjugacy and matrix isometry. The corresponding isomorphism problem is called Algebra Isomorphism.
Summary.
The isomorphism problems of the above structures all have 3-way arrays as the underlying object, but are determined by different group actions. It is not hard to see that there are essentially five group actions in total: 3-Tensor Isomorphism, Matrix Space Conjugacy, Matrix Space Isometry, Trilinear Form Equivalence, and Algebra Isomorphism. It turns out that these cover all the natural isomorphism problems on 3-way arrays in which the group acting is a product of (where is the side length of the arrays); see Sec. 6.1 for discussion.
3 Main results
3.1 Equivalence of isomorphism problems for 3-way arrays
Definition 3.1** ().**
For any field , denotes the class of problems that are polynomial-time Turing (Cook) reducible to -Tensor Isomorphism over .111111We follow a natural convention: when is finite, a fixed algebraic extension of a finite field such as , the rationals, or a fixed algebraic extension of the rationals such as , we consider the usual model of Turing machines; when is , , the -adic rationals , or other more “exotic” fields, we consider this in the Blum–Shub–Smale model over . When we write without mentioning the field, the result holds for any field. .
We now state our first main theorem.
Theorem A**.**
3-Tensor Isomorphism* reduces to each of the following problems in polynomial time.*
Group Isomorphism* for -groups exponent ( for all ) and class 2 ( is abelian) given by generating matrices over . Here we consider only where is an odd prime.* 2. 2.
Matrix Space Isometry*, even for alternating or symmetric matrix spaces.* 3. 3.
Matrix Space Conjugacy*, and even the special cases:*
- (a)
Matrix Lie Algebra Conjugacy*, for solvable Lie algebras of derived length 2.121212And even further, where . * 2. (b)
Associative Matrix Algebra Conjugacy*.131313Even for algebras whose Jacobson radical squares to zero and . * 4. 4.
Algebra Isomorphism*, and even the special cases:*
- (a)
Associative Algebra Isomorphism*, for algebras that are commutative and unital, and for algebras that are commutative and 3-nilpotent ( for all )* 2. (b)
Lie Algebra Isomorphism*, for 2-step nilpotent Lie algebras ( )* 5. 5.
Cubic Form Equivalence* and Trilinear Form Equivalence.*
The algebras in (3) are given by a set of matrices which linearly span the algebra, while in (4) they are given by structure constants (see “Algebras” in Sec. 2).
Remark 3.2**.**
Agrawal & Saxena [AS05, Thm. 5] gave a reduction from Cubic Form Equivalence over to Ring Isomorphism for commutative, unital, associative algebras over , when every element of has a cube root. For finite fields , the only such fields are those for which and , which is asymptotically half of all primes. As explained after the proof of [AS05, Thm. 5], the use of cube roots seems inherent in their reduction.
Using our results in conjunction with [FGS19], we get a new reduction from Cubic Form Equivalence to Ring Isomorphism (for the same class of rings) which works over any field of characteristic 0 or . Note that our reduction is very different from the one in [AS05].
Figure 1 below summarizes where the various parts of Thm. A are proven.
We then resolve an open question well-known to the experts:141414We asked several experts who knew of the question, but we were unable to find a written reference. Interestingly, Oldenburger [Old36] worked on what we would call -Tensor Isomorphism as far back as the 1930s. We would be grateful for any prior written reference to the question of whether TI reduces to 3TI.
Theorem B**.**
-Tensor Isomorphism* reduces to Algebra Isomorphism.*
Since the main result of [FGS19] reduces the problems in Theorem A to 3-Tensor Isomorphism (cf. [FGS19, Rmk. 1.1]), we have:
Corollary B**.**
Each of the problems listed in Theorem A is -complete.151515For Cubic Form Equivalence, we only show that it is in when or . In particular, and are equivalent.
Remark 3.3**.**
This phenomenon is reminiscent of the transition in hardness from 2 to 3 in -SAT, -Coloring, -Matching, and many other -complete problems. It is interesting that an analogous phenomenon—a transition to some sort of “universality” from 2 to 3—occurs in the setting of isomorphism problems, which we believe are not -complete over finite fields.
Remark 3.4**.**
Here is a brief summary of what is known about the complexity of some of these problems. Over a finite field , these problems are in . For 3-way arrays, the brute-force algorithms run in time , as can be enumerated in time . Note that polynomial-time in the input size here would be . Over any field , these problems are in in the Blum–Shub–Smale model. When the input arrays are over and we ask for isomorphism over or , these problems are in using quantifier elimination. By Koiran’s celebrated result on Hilbert’s Nullstellensatz, for equivalence over they are in assuming the Generalized Riemann Hypothesis [Koi96]. When the input is over and we ask for equivalence over , it is unknown whether these problems are even decidable; classically this is studied under Algebra Isomorphism for associative, unital algebras over (see, e. g., [AS06, Poo14]), but by Cor. B, the question of decidability is open for all of these problems.
Over finite fields, several of these problems can be solved efficiently when one of the side lengths of the array is small. For -dimensional spaces of matrices, Matrix Space Conjugacy and Isometry can be solved in time: once we fix an element of , the isomorphism problem reduces to solving linear systems of equations. Less trivially, Matrix Space Conjugacy can be solved in time and 3TI for tensors in time , since once we fix an element of , the isomorphism problem either becomes an instance of, or reduces to [IQ18], Module Isomorphism, which admits several polynomial-time algorithms [BL08, CIK97, IKS10, Ser00]. Finally, one can solve Matrix Space Isometry in time : once one fixes an element of , there is a rather involved algorithm [IQ18], which uses the -algebra technique originated from the study of computing with -groups [Wil09, BW12].
3.2 Relations with Graph Isomorphism and Code
Equivalence
We observe then Graph Isomorphism and Code Equivalence reduce to 3-Tensor Isomorphism. In particular, the class contains the classical graph isomorphism class .
Recall Code Equivalence asks to decide whether two linear codes are the same up to a linear transformation preserving the Hamming weights of codes. Here the linear codes are just subspaces of of dimension , represented by linear bases. Linear transformations preserving the Hamming weights include permutations and monomial transformations. Recall that the latter consists of matrices where every row and every column has exactly one non-zero entry. Indeed, over many fields this is without loss of generality, as Hamming-weight-preserving linear maps are always induced by monomial transformations (first proved over finite fields [Mac62], and more recently over much more general algebraic objects, e. g., [GNW04]). CodeEq has long been studied in the coding theory community; see e.g. [PR97, Sen00].
For Code Equivalence, we observe that previous results already combine to give:
Observation 3.5**.**
Code Equivalence* (under permutations) reduces to 3-Tensor Isomorphism.*
Proof.
Code Equivalence reduces to Matrix Lie Algebra Conjugacy [Gro12a], a special case of Matrix Space Conjugacy, which in turn reduces to 3TI [FGS19]. ∎
Using the linear-algebraic coloring gadget, we can extend this to equivalence of codes under monomial transformations (see Sec. 5). Given two matrices over of rank , the Monomial Code Equivalence problem is to decide whether there exist and a monomial matrix (product of a diagonal matrix and a permutation matrix) such that .
Proposition 3.6**.**
Monomial Code Equivalence* reduces to 3-Tensor Isomorphism.*
Since Graph Isomorphism reduces to Code Equivalence [Luk93] (see [Miy96]) and [PR97] (even over arbitrary fields [Gro12a]), by Obs. 3.5 and Thm. A, we have the following.
Corollary 3.7**.**
Graph Isomorphism* reduces to Alternating Matrix Space Isometry.*
Using our linear-algebraic gadgets, we also reprove this result using a much more direct reduction (see Prop. 7.1). Besides being a different construction, another reason for the additional proof is that the technique leads to the search-to-decision reduction, which we discuss below.
3.3 Application to Group Isomorphism: reducing the nilpotency class
For several reasons, the hardest cases of Group Isomorphism are believed to be -groups of class 2 and exponent ; recall that these are groups in which every element has order , the order of the group is , and is abelian. See Nilpotent groups above. While this belief has been widely held for many decades, we are not aware of any prior reduction from a more general class of groups to this class. However, by combining our results with the Lazard correspondence, we immediately get such a reduction.
Corollary P**.**
Let be an odd prime. For groups generated by matrices of size , Group Isomorphism for -groups of exponent and class reduces to Group Isomorphism for -groups of exponent and class in time .
Proof.
By the Lazard correspondence (reproduced as Thm. 6.4 below) two -groups of exponent and class are isomorphic if and only if their corresponding -Lie algebras are. By Prop. 6.5, we can construct a generating set for the corresponding Lie algebra by applying the power series for logarithm to the generating matrices of . This Lie algebra is thus a subalgebra of matrices, so we can generate the entire Lie algebra (using the linear-algebra version of breadth-first search; its dimension is ) and compute its structure constants in time polynomial in , , and . Then use [FGS19] to reduce isomorphism of Lie algebras to TI, and then apply Thm. A (specifically, Cor. 7.6) to reduce to isomorphism of -groups of exponent and class given by a matrix generating set. ∎
The only obstacle to getting this proof to work in the Cayley table model is that our reduction from TI to Alternating Matrix Space Isometry (Prop. 7.3) blows up the dimension quadratically, which means the size of the group becomes after the reduction. See Question 10.5.
3.4 Search to decision reductions
Reducing search problems to their associated decision problems is a classical and intriguing topic in complexity theory. Aside from the now-standard search-to-decision reduction for SAT, one of the earliest results in this direction was by Valiant in the 1970’s [Val76]. A celebrated result of Bellare and Goldwasser shows that, assuming , there exists a language in for which search does not reduce to decision under polynomial-time reductions [BG94]. However, as usual for such statements based on complexity-theoretic assumptions, the problems constructed by such a proof are considered somewhat unnatural. For natural problems, on the one hand, there are search-to-decision reductions for -complete problems and for GI. On the other hand, such is not known, nor expected to be the case, for Nash Equilibrium [CDT09] (for which decision is trivial).
Reducing search to decision is particularly intriguing for testing isomorphism of groups. One difficulty is that it is not clear how to guess a partial solution, and then make progress by restricting to a subgroup. In general, testing isomorphism of certain algebraic structures (groups, algebras, etc.) forms a large family of problems for which search-to-decision reductions are not known.
Because of the close relationship between 3TI and isomorphism of various algebraic structures, one might expect similar difficulties in reducing search to decision for 3TI, and thus for -complete problems as well. Nonetheless, for Alternating Matrix Space Isometry, we are able to use the linear-algebraic coloring gadgets to get a non-trivial search-to-decision reduction.
Theorem C**.**
There is a search-to-decision reduction for Alternating Matrix Space Isometry which, given alternating matrix spaces over , computes an isometry between them if they are isometric, in time . The reduction queries the decision oracle with inputs of dimension at most .
As a consequence, a -time decision algorithm would result in a -time search algorithm, in contrast with the brute-force running time. Note that in this context, the size of the input is , so a running time is still quite generous.
By the connection between Alternating Matrix Space Isometry and Group Isomorphism for -groups of class and exponent , we have the following. Note that the natural succinct input representation mentioned in the following result can have size .
Corollary C**.**
Let be an odd prime, and let GpIso2Exp() denote the isomorphism problem for -groups of class 2 and exponent in the model of matrix groups over . For groups of order , there is a search-to-decision reduction for GpIso2Exp() running in time .
4 Related work
The most closely related work is that of Futorny, Grochow, and Sergeichuk [FGS19]. They show that a large family of isomorphism problems on 3-way arrays—including those involving multiple 3-way arrays simultaneously, or 3-way arrays that are partitioned into blocks, or 3-way arrays where some of the blocks or sides are acted on by the same group (e. g., Matrix Space Isometry)—all reduce to 3TI. Our work complements theirs in that all our reductions for Thm. A go in the opposite direction, reducing 3TI to other problems. Some of our other results relate GI and Code Equivalence to 3TI; the latter problems were not considered in [FGS19]. Thm. B considers -tensors for any , which were not considered in [FGS19].
In [AS05, AS06], Agrawal and Saxena considered Cubic Form Equivalence and testing isomorphism of commutative, associative, unital algebras. They showed that GI reduces to Algebra Isomorphism; Commutative Algebra Isomorphism reduces to Cubic Form Equivalence; and Homogeneous Degree- Form Equivalence reduces to Algebra Isomorphism assuming that the underlying field has th root for every field element. By combining a reduction from [FGS19], Prop. 7.3, and Cor. 8.5, we get a new reduction from Cubic Form Equivalence to Algebra Isomorphism that works over any field in which is a unit, which is fields of characteristic [math] or .
There are several other works which consider related isomorphism problems. Grigorev [Gri81] showed that GI is equivalent to isomorphism of unital, associative algebras such that the radical squares to zero and is abelian. Interestingly, we show -completeness for conjugacy of matrix algebras with the same abstract structure (even when is only 1-dimensional). Note the latter problem is equivalent to asking whether two representations of are equivalent up to automorphisms of . In the proof of Thm. B, which uses algebras in which when reducing from TI, we use Grigoriev’s result.
Brooksbank and Wilson [BW15] showed a reduction from Associative Algebra Isomorphism (when given by structure constants) to Matrix Algebra Conjugacy. Grochow [Gro12a], among other things, showed that GI and CodeEq reduce to Matrix Lie Algebra Conjugacy, which is a special case of Matrix Space Conjugacy.
In [KS06], Kayal and Saxena considered testing isomorphism of finite rings when the rings are given by structure constants. This problem generalizes testing isomorphism of algebras over finite fields. They put this problem in [KS06, Thm. 4.1], reduce GI to this problem [KS06, Thm. 4.4], and prove that counting the number of ring automorphism (#RA) is in [KS06, Thm. 5.1]. They also present a reduction from GI to #RA, and show that the decision version of the ring automorphism problem is in .
To summarize this zoo of isomorphism problems and reductions, we include Figure 2 for reference.
5 Overview of one new technique, and one full proof
In this section we describe one of the key new techniques in this paper: a linear-algebraic coloring gadget. We exhibit this gadget by giving the full proof of Prop. 3.6 as an example. A related gadget was used in [FGS19] to show reductions to 3TI; our reductions all go in the opposite direction. Furthermore, whereas the gadgets used in [FGS19] were primarily to ensure that two different blocks could not be mixed, our gadgets allow us to ensure that certain slices of a tensor can be permuted, while disallowing more general linear transformations.
In the context of GI, there are many ways to reduce Colored GI to ordinary GI; here we give one example, which will serve as an analogy for our linear-algebraic gadget. To individualize a vertex (give it a unique color), attach to it a large “star”: if , add new vertices to and attach them all to ; call the resulting graph . This has the effect that any automorphism of must fix , since has a degree strictly larger than any other vertex. Furthermore, if is obtained by a similar construction, then there is an isomorphism which sends if and only if . Finally, if we attach stars of size to multiple vertices , then any automorphism of must permute the amongst themselves, and there is an isomorphism sending if and only if the corresponding enlarged graphs are isomorphic.
We adapt this idea to the context of 3-way arrays. Let be an 3-way array, with lateral slices (each an matrix). For any vector , we get an associated lateral matrix , which is a linear combination of the lateral slices as given, namely (note that when is the -th standard basis vector, the associated lateral matrix is indeed ). By analogy with adjacency matrices of graphs, is a natural analogue of the neighborhood of a vertex in a graph. Correspondingly, we get a notion of “degree,” which we may define as
[TABLE]
The last two characterizations are analogous to the fact that the degree of a vertex in a graph may be defined as the number of “in-neighbors” (nonzero entries the corresponding row of the adjacency matrix) or the number of “out-neighbors” (nonzero entries in the corresponding column).
To “individualize” , we can enlarge with a gadget to increase , as in the graph case. Note that because the lateral matrices are all of size . For notational simplicity, let us individualize . To individualize , we will increase its degree by . Extend to a new 3-way array of size ; in the “first” “corner”, we will have the original array , and then we will append to it an identity matrix in one slice to increase . More specifically, the lateral slices of will be
[TABLE]
Now we have that . This almost does what we want, but now note that any vector with has . We can nonetheless consider this a sort of linear-algebraic individualization.
Leveraging this trick, we can then individualize an entire basis of simultaneously, so that for any vector in our basis, and for any nonzero outside the basis (not a scalar multiple of one of the basis vectors), as we do in the following proof of Prop. 3.6. This is also a 3-dimensional analogue of the reduction from GI to CodeEq [Luk93, Miy96, PR97] (where they use Hamming weight instead of rank).
Proof of Prop. 3.6.
Without loss of generality we assume , as the problem is easily solvable when . We treat a matrix as a 3-way array of size , and then follow the outline proposed above, of individualizing the entire standard basis . Since the third direction only has length 1, the maximum degree of any column is 1, so it suffices to use gadgets of rank 2. More specifically, we build a 3-way array whose lateral slices are
[TABLE]
where the block is in the -th block of size 2 (that is, rows and columns ). It will also be useful to visualize the frontal slices of , as follows. Here each entry of the “matrix” below is actually a -dimensional vector, “coming out of the page”:
[TABLE]
(In we turn the vectors and “on their side” so they become perpendicular to the page. )
We claim that and are monomially equivalent as codes if and only if and are isomorphic as 3-tensors.
() Suppose where , and . Then by examining the frontal slices it is not hard to see that for (where denotes a block matrix, where the pattern of the nonzero blocks and the scalars are governed by , and each block is either zero or a scalar multiple of ) we have and , where is the permutation corresponding to . Thus and are isomorphic tensors, via the isomorphism .
() Suppose there exist , , and , such that . First, note that every lateral slice of is of rank either or , and the actions of and do not change the ranks of the lateral slices. Furthermore, any non-trivial linear combination of more than lateral slice results in a lateral matrix of rank . It follows that cannot take nontrivial linear combinations of the lateral slices, hence it must be monomial.
Now consider the frontal slices. Note that, as we assume , every frontal slice of , except the first one, is of rank . Therefore, must be of the form where is . Since is invertible, we must have , and the first frontal slice of contains all the rows of scaled by in its first rows. The first frontal slice of is a matrix that generates, by definition (and since we’ve shown is monomial), a code monomially equivalent to . Since the first frontal slices of and are equal, and the latter is just a scalar multiple of , we have that and are monomially equivalent as codes as well. ∎
6 Preliminaries
Vector spaces.
Let be a field. In this paper we only consider finite-dimensional vector spaces over . We use to denote the vector space of length- column vectors. The th standard basis vector of is denoted as . Depending on the context, may denote the zero vector space, a zero vector, or an all-zero matrix. Let be a subset of vectors. We use to denote the subspace spanned by elements in .
Some groups.
The general linear group of degree over a field is denoted by . The symmetric group of degree is denoted by . The natural embedding of into is to represent permutations by permutation matrices. A monomial matrix in is a matrix where each row and each column has exactly one non-zero entry. All monomial matrices form a subgroup of which we call the monomial subgroup, denoted by , which is isomorphic to the semi-direct product . The subgroup of consisting of block upper-triangular matrices with a fixed block structure is called a (standard) parabolic subgroup.
Matrices.
Let be the linear space of matrices over , and . Given , denotes the transpose of .
A matrix is symmetric, if for any , , or equivalently . That is, represents a symmetric bilinear form. A matrix is alternating, if for any , . That is, represents an alternating bilinear form. Note that in characteristic , alternating is the same as skew-symmetric, but in characteristic 2 they differ (in characteristic 2, skew-symmetric=symmetric). The linear space of alternating matrices over is denoted by .
The identity matrix is denoted by , and when is clear from the context, we may just write . The elementary matrix is the matrix with the th entry being , and other entries being [math]. The -th elementary alternating matrix is the matrix .
Matrix tuples.
We use to denote the linear space of -tuples of matrices. Boldface letters like and denote matrix tuples. Let . Given and , . Given , where .
Remark 6.1**.**
In particular, note that corresponds to the entries in the th column of . While this choice is immaterial (we could have chosen the opposite convention), all of our later calculations are consistent with this convention.
Given , we say that and are equivalent, if there exist and , such that . Let . Then and are conjugate, if there exists , such that . And and are isometric, if there exists , such that . Finally, and are pseudo-isometric, if there exist and , such that .
Matrix spaces.
Linear subspaces of are called matrix spaces. Calligraphic letters like and denote matrix spaces. By a slight abuse of notation, for , we use to denote the subspace spanned by those matrices in .
3-way arrays.
Let be the linear space of 3-way arrays over . We use the fixed-width teletypefont for 3-way arrays, like , , etc..
Given , we can think of as a 3-dimensional table, where the th entry is denoted as . We can slice along one direction and obtain several matrices, which are then called slices. For example, slicing along the first coordinate, we obtain the horizontal slices, namely matrices , where . Similarly, we also obtain the lateral slices by slicing along the second coordinate, and the frontal slices by slicing along the third coordinate.
We will often represent a 3-way array as a matrix whose entries are vectors. That is, given , we can write
[TABLE]
where , so that . Note that, while are column vectors, in the above representation of , we should think of them as along the direction “orthogonal to the paper.” Following [KB09], we call the tube fibers of . Similarly, we can have the row fibers such that , and the column fibers such that .
Given and , let be the -way array whose th frontal slice is . For , let be the -way array whose th frontal slice is . Note that these notations are consistent with the notations for matrix tuples above, when we consider the matrix tuple of frontal slices of .
Let be a 3-way array. We say that is non-degenerate as a 3-tensor if the horizontal slices of are linearly independent, the lateral slices are linearly independent, and the frontal slices are linearly independent. Let be a matrix tuple consisting of the frontal slices of . Then it is easy to see that the frontal slices of are linearly independent if and only if . The lateral (resp., horizontal) slices of are linearly independent if and only if the intersection of the right (resp., left) kernels of is zero.
Observation 6.2**.**
Given -way arrays and , we can construct non-degenerate 3-way arrays and in polynomial time, such that and are isomorphic as 3-tensors if and only if and are isomorphic as 3-tensors.
Multi-way arrays.
For , we use similar notation to 3-way arrays, which we will not belabor. Here we merely observe:
Observation 6.3**.**
For any , -TI reduces to -TI.
Proof.
Given an -way array , we embed it as a -way array of format . If as -tensors, say via , then as -tensors via . Conversely, if via , then via . That is, all that can “go wrong” under this embedding is multiplication by scalars, but those scalars can be absorbed into any one of the . ∎
Algebras and their algorithmic representations.
An algebra consists of a vector space and a bilinear map . This bilinear map defines the product in this algebra. Note that we do not assume to be unital (having an identity), associative, alternating, nor satisfying the Jacobi identity. In the literature, an algebra without such properties is sometimes called a non-associative algebra (but also, as usual, associative algebras are a special case of non-associative algebras).
As in Section 1, after fixing an ordered basis where of , this bilinear map can be represented by an 3-way array , such that . This is the structural constant representation of . Algorithms for associative algebras and Lie algebras have been studied intensively in this model, e. g., [IR99, dG00].
It is also natural to consider matrix spaces that are closed under multiplication or commutator. More specifically, let be a matrix space. If is closed under multiplication, that is, for any , , then is a matrix (associative) algebra with the product being the matrix multiplication. If is closed under commutator, that is, for any , , then is a matrix Lie algebra with the product being the commutator. Algorithms for matrix algebras and matrix Lie algebras have also been studied, e. g., [EG00, Iva00, IR99].
The Lazard correspondence for -groups.
The Lazard correspondence is a correspondence between certain classes of groups and Lie algebras, which extends the usual correspondence between Lie groups and Lie algebras (say, over ) to some groups and Lie algebras in positive characteristic. Here we state just enough to give a sense of it; for further details we refer to Khukhro’s book [Khu98] and Naik’s thesis [Nai13]. While the thesis is quite long, it also includes a reader’s guide, and collects many results scattered across the literature or well-known to the experts in one place, building the theory from the ground up and with many examples.
Recall that a Lie ring is an abelian group equipped with a bilinear map , called the Lie bracket, which is (1) alternating ( for all ) and (2) satisfies the Jacobi identity for all . Let , and , which is the subgroup (of the underlying additive group) generated by all elements of the form for . Then is nilpotent if for some finite ; the smallest such is the nilpotency class. (Lie algebras are just Lie rings over a field.)
The correspondence between Lie algebras and Lie groups over uses the Baker–Campbell–Hausdorff (BCH) formula to convert between a Lie algebra and a Lie group, so we start there. The BCH formula is the solution to the problem that for non-commuting matrices , in general (where the matrix exponential here is defined using the power series for ). Rather, using commutators , we have
[TABLE]
where the remaining terms are iterated commutators that all involve at least 5 s and s, and successive terms involve more and more. Applying the exponential function to a Lie algebra in characteristic zero yields a Lie group. The BCH formula can be inverted, giving the correspondence in the other direction.
In a nilpotent Lie algebra, the BCH formula has only finitely many nonzero terms, so issues of convergence disappear and we may consider applying the correspondence over finite fields or rings; the only remaining obstacle is that the denominators appearing in the formula must be units in the ring. It turns out that the correspondence continues to work in characteristic so long as one does not need to use the -th term of the BCH formula (which includes division by ), and the latter is avoided whenever a nilpotent group has class strictly less than . While the correspondence does apply more generally, here we only state the version for finite groups. For any fixed nilpotency class , computing the Lazard correspondence is efficient in theory; for how to compute it in practice when the groups are given by polycyclic presentations, see [CdGVL12].
Let denote the set of finite groups of order and class , and let denote the set of Lie rings of order and class . We note that for nilpotency class 2, the Baer correspondence is the same as the Lazard correspondence.
Theorem 6.4** (Lazard Correspondence for finite groups, see, e. g., [Khu98, Ch. 9 & 10] or [Nai13, Ch. 6]).**
For any prime and any , there are functions such that (1) and are inverses of one another, (2) two groups are isomorphic if and only if and are isomorphic, and (3) if has exponent , then the exponent of the underlying abelian group of has exponent . More strongly, is an isomorphism of categories .
Part (3) can be found as a special case of [Nai13, Lemma 6.1.2].
For -groups given by matrices over the finite field , we will need one additional fact about the correspondence, namely that it also results in a Lie algebra of matrices. (Being able to bound the dimension of the Lie algebra and work with it in a simple linear-algebraic way seems crucial for our reduction to work efficiently.) In fact, the BCH correspondence is easier to see for matrix groups using the matrix exponential and matrix logarithm; most of the work for BCH and Lazard is to get the correspondence to work even without the matrices. In some sense, this is thus the “original” setting of this correspondence. Though it is surely not new, we could not find a convenient reference for this fact about matrix groups over finite fields, so we state it formally here.
Proposition 6.5**.**
Let be a finite -subgroup of matrices over a finite field of characteristic . Then (from the Lazard correspondence) can be realized as a finite Lie subalgebra of matrices over . Given a generating set for of matrices, a generating set for can be constructed in time.
Proof sketch.
is conjugate in to a group of upper unitriangular matrices (upper triangular with all 1s on the diagonal); this is a standard fact that can be seen in several ways, for example, by noting that the group of all upper unitriangular matrices in is a Sylow -subgroup, and applying Sylow’s Theorem. (Note that we do not need to do this conjugation algorithmically, though it is possible to do so; this is only for the proof.) Thus we may write every as , where the sum here is the ordinary sum of matrices, denotes the identity matrix, and is strictly upper triangular. In particular, (ordinary exponentiation of matrices). Thus the Taylor series for the logarithm
[TABLE]
has only finitely many terms, so we may use it even over .
In the Lie algebra we would like addition to be ordinary matrix addition; however, it turns out that we can write this addition in terms of a formula involving only commutators of group elements. Deriving this formula—the so-called first BCH inverse formula—for the matrices will be the same, step for step, as deriving the first inverse BCH formula in general. Since the formulae are identical, the additive structures on (using the matrix logarithm) and (from the Lazard correspondence) are identical. Similar considerations apply to the matrix commutator , now using the second BCH inverse formula. Overall, we conclude that (using Lazard) and (using the matrix logarithm) are isomorphic Lie algebras.
Equivalently, we may note that the derivation of the inverse BCH formula in [Khu98, Nai13] uses a free nilpotent associative algebra as an ambient setting in which both the group and the corresponding Lie algebra live; in our case, we may replace the ambient free nilpotent associative algebra with the algebra of strictly upper-triangular matrices over , and all the derivations remain the same, mutatis mutandis. See, for example, [Khu98, p. 105, “Another remark…”]. ∎
6.1 Tensor notation
To see that those problems in Section 2 exhaust distinct isomorphism problems coming from change-of-basis on 3-way arrays (without introducing multiple arrays, or block structure, or going to subgroups of ), and to keep track of the relation between all the above problems, we use standard mathematical notation for spaces of tensors (however, we won’t actually need the full abstract definition here; for a formal introduction see, e. g., [Lan12]).
Much as the three natural equivalence relations on matrices differ by how the groups act on the rows and columns, the same is true for tensors, but on the rows, columns, and depths (the “row-like” sub-arrays which are “perpendicular to the page”). There are two aspects to the notation: first, we keep track of which group is acting where by introducing names for the different vector spaces involved (this is also the standard basis-free notation, e. g., [Lan12]) and the groups acting on them, viz. , etc. Thus, while it is possible that and thus , the notation helps make clear which group is acting where. Second, to take into account the contragradient (“inverse”) action, given a vector space , denotes its dual space, consisting of the linear functions . acts on by sending a linear function to the function . In this notation, the three different actions on matrices correspond to the notations
[TABLE]
When we have a matrix space instead of a single matrix , we introduce an additional vector space , which is naturally isomorphic to as a vector space. The action of on serves to change basis within the matrix space, while leaving the space itself unchanged. In this notation, the problems we mention above are listed in Table 2.
To see that the family of problems in Table 2 exhausts the possible isomorphism problems on (undecorated) 3-way arrays, we note that in this notation there are some “different-looking” isomorphism problems that are trivially equivalent. The first is re-ordering the spaces: the isomorphism problem for is trivially equivalent to that for , simply by permuting indices, viz. . The second is about dual vector spaces. Although a vector space and its dual are technically different, and the group action differs by an inverse transpose, we can choose bases in and so that there is a linear isomorphism which induces a bijection between orbits; for example, the orbits of the action are the same as the orbits of the action , even though technically the former corresponds to and the latter to . This means that if we are considering the isomorphism problem in a tensor space such as , we can dualize each of the vector spaces separately, so long as when we do so, we dualize all instances of that vector space. For example, the isomorphism problem in is trivially equivalent to that in , but is not obviously equivalent to that in (though we will show such a reduction in this paper). As a consequence, when the action on all three directions comes from the same group, there are only two choices: and ; the remaining choices are trivially equivalent to one of these two. Using these, we see that the Table 2 in fact covers all possibilities up to these trivial equivalences.
Special cases of interest.
As in the case of isometry of matrices, wherein skew-symmetric and symmetric matrices play a special role, the same is true for isometry of matrix spaces. We say a matrix space is symmetric if every matrix is symmetric, and similarly for skew-symmetric or alternating. Symmetric Matrix Space Isometry is equivalent to asking whether two polynomial maps from to specified by homogeneous quadratic forms are the same under the action of . This problem has been proposed by Patarin [Pat96] as the basis of security for certain identification and signature schemes. Alternating Matrix Space Isometry is a particular case of interest, being in many ways a linear-algebraic analogue of GI [LQ17] (in addition to its close relation with Group Isomorphism for -groups of class 2 and exponent ).
Among trilinear forms, we can identify commutative cubic forms as those for which the coefficient 3-way array is symmetric under all 6 permutations of its 3 indices . Over rings in which is a unit, cubic forms embed into trilinear forms via the standard map where , where denotes the coefficient of in . Thus, over such rings Cubic Form Equivalence, as studied by Agrawal and Saxena [AS05, AS06], is a special case of Trilinear Form Equivalence.
Special cases of Algebra Isomorphism that are of interest include those of unital, associative algebras (commutative, e. g., as studied in [AS05, AS06, KS06], and non-commutative, such as group algebras) and Lie algebras.
Interesting cases of Matrix Space Conjugacy arise naturally whenever we have an algebra (say, associative or Lie) that is given to us as a subalgebra of the algebra of matrices. Two such matrix algebras can be isomorphic as abstract algebras, but the more natural notion of “isomorphism of matrix algebras” is that of conjugacy, which respects both the algebra structure and the presentation in terms of matrices. This is the linear-algebraic analogue of permutational isomorphism (=conjugacy) of permutation groups, and has been studied for matrix Lie algebras [Gro12a] and associative matrix algebras [BW15]. (For those who know what a representation is: it also turns out to be equivalent to asking whether two representations of an algebra are equivalent up to automorphisms of , a problem which naturally arises as a subroutine in, e. g., Group Isomorphism, where it is often known as Action Compatibility, e. g., [GQ17].)
6.2 On the type of reduction
As these problems arise from several different fields, there are various properties one might hope for in the notion of reduction. Most of our reductions satisfy all of the following properties; see Remark 6.6 below for details.
Kernel reductions: there is a function from objects of one type to objects of the other such that if and only if . See [FG11] for some discussion on the relation between kernel reductions and more general reductions. 2. 2.
Efficiently computable: as above is computable in polynomial time. In fact, we believe, though have not checked fully, that all of our reductions are computable by uniform constant-depth (algebraic) circuits; over finite fields and algebraic number fields, we believe they are in uniform (the threshold gates are needed to do some simple arithmetic on the indices). That is, there is a small circuit which, given as input will output the entry of the output. 3. 3.
Polynomial-size projections (“p-projections”) [Val84]: each coordinate of the output is either one of the input variables or a constant, and the dimension of the output is polynomially bounded by the dimension of the input. (In fact, in many cases, the dimension of the output is only linearly larger than that of the input.) 4. 4.
Functorial. For each type of tensor there is naturally a category of such tensors (see [Mac71] for generalities on categories). For example, for 3TI, , the objects of the category are three-tensors, and a morphism between and is given by linear maps , , and such that . Isomorphism of 3-tensors is the special case when all three of are invertible. Analogous categories can be defined for the other problems we consider, such as . A functor between two categories is a pair of maps such that (1) maps objects of to objects of , (2) if is a morphism in , then is a morphism in , (3) for any , , and (4) if and are morphisms in , then .
All our reductions are functorial on the categories in which we only consider isomorphisms; we suspect that they are also functorial on the entire categories (that is, including non-invertible homomorphisms). Furthermore, all our reductions yield another map such that for any isomorphism , is an isomorphism , and for any isomorphism . If we only consider isomorphisms (and not other homomorphisms), we believe all known reductions between isomorphism problems have this form, cf. [Bab14]. 5. 5.
Containment in the sense used in the literature on wildness. There are several definitions in the literature which typically are equivalent when restricted to so-called matrix problems. For a few such definitions, see, e. g., [FGS19, Def. 1.2], [Ser00], or [SS07, Def. XIX.1.3]. For those problems in this paper to which the preceding definitions could apply, our reductions have the defined property. However, since we are working in a slightly more general setting, we would like to suggest the following natural generalization of these notions. Given two pairs and of algebraic groups acting on algebraic varieties , an algebraic containment is an algebraic map (each coordinate of the output is given by a polynomial in the coordinates of the input) that is also a kernel reduction. In our case, all our spaces are affine space for some , and our maps are in fact of degree 1. (It might be interesting to consider whether using higher degree allows for more efficient reductions.) We may also require it to be “functorial,” in the sense that there is a homomorphism of algebraic groups (simultaneously an algebraic map and a group homomorphism) such that
[TABLE]
and a section , such that and
[TABLE]
where the dashed arrow above indicates that need only be defined on a subset of , namely, those such that there exist with (but on this subset it should still act like a homomorphism, in the sense that ).
Remark 6.6**.**
We believe all of our reductions satisfy all of the above properties, with the possible exceptions that Prop. 7.3 and Prop. 8.1 are only projections (3) and algebraic containments (5) on the set of non-degenerate 3-tensors. These reductions still satisfy the other three properties on the set of all tensors: They are kernel reductions by construction; non-degeneracy presents no obstacle to polynomial-time computation (Observation 6.2); and two tensors are isomorphic iff their non-degenerate parts are isomorphic, so they are still functorial. The obstacle to being projections or algebraic containments on the set of all 3-tensors here is closely related to the fact that the map sending a matrix to its row echelon form (or even just zero-ing out a number of rows so that the remaining non-zero rows are linearly independent) is neither a projection nor an algebraic map. We would find it interesting if there were reductions for these results satisfying all of the above properties for all 3-tensors.
7 Reductions using the linear algebraic coloring
gadgets
In this section, we present the remaining reductions that use the linear algebraic coloring idea. We first reduce Graph Isomorphism to Alternating Matrix Space Isometry, using a gadget to restrict the full general linear group to the monomial matrix group, similar to that in Section 5. However, unlike in the case there, the use here requires slightly more care because of the alternating condition. We then reduce 3-Tensor Isomorphism to Alternating Matrix Space Isometry. The gadget there restricts the full general linear group to a parabolic subgroup. We note that such a gadget has appeared in [FGS19], while ours is a slight modification of that to be compatible with the alternating structure. Finally, we combine the two gadgets to give a search-to-decision reduction for Alternating Matrix Space Isometry over finite fields.
7.1 From Graph Isomorphism to Alternating Matrix Space Isometry
Proposition 7.1**.**
Graph Isomorphism* reduces to Alternating Matrix Space Isometry.*
For this proof we will need the concept of monomial isometry; see Some Groups above. Recall that a matrix is monomial if, equivalently, it can be written as where is a nonsingular diagonal matrix and is a permutation matrix. We say two matrix spaces are monomially isometric if there is some such that .
Proof.
For a graph , let be the alternating matrix tuple with where , and let be the alternating matrix space spanned by that tuple. If is a permutation matrix giving an isomorphism between two graphs and , then it is easy to see that , and thus the corresponding matrix spaces are isometric. The converse direction is not clear (and may even be false). Instead, we will first extend the spaces and by gadgets which enforce that and are isometric iff they are monomially isometric (Lemma 7.2). Given Lemma 7.2, it thus suffices to reduce GI to Alternating Matrix Space Monomial Isometry.
Let us establish the latter reduction. We will show that if and only if and are monomially isometric. The forward direction was handled above. For the converse, suppose where is diagonal and is a permutation matrix. We claim that in this case, in fact gives an isomorphism from to . First let us establish that alone gives an isometry between and . Note that for any diagonal matrix and any elementary alternating matrix , we have . Since has a basis of elementary alternating matrices, the action of on this basis is just to re-scale each basis element, and thus . Thus, we have .
Finally, note that , where is the permutation corresponding to , and by abuse of notation we write as well. Since the elementary alternating matrices are linearly independent, and has a basis of elementary alternating matrices, the only way for to be in is for it to be equal to one of the basis elements (one of the matrices in ). In other words, must be an edge of . As is invertible, we thus have that gives an isomorphism . ∎
Lemma 7.2**.**
Alternating Matrix Space Monomial Isometry* reduces to Alternating Matrix Space Isometry.*
More specifically, there is a -time algorithm taking alternating matrix tuples to alternating matrix tuples, such that for , the matrix spaces and are monomially isometric if and only if the matrix spaces and are isometric.
Proof.
For , define to be the alternating matrix tuple , where
For , . 2. 2.
For , , , is the elementary alternating matrix .
At this point, some readers may wish to look at the large matrix in Equation 1 and/or at Figure 3.
It is clear that can be computed in time . Given alternating matrix tuples , let be the corresponding matrix spaces they span, and let and . We claim that and are monomially isometric if and only if and are isometric.
To prove this, it will help to think of our matrix tuples , etc. as (corresponding to) 3-way arrays, and to view these 3-way arrays from two different directions. Towards this end, write the 3-way array corresponding to as
[TABLE]
where are vectors in (“coming out of the page”), namely . The frontal slices of this array are precisely the matrices .
The 3-way array corresponding to is then the array:
[TABLE]
where (here think of the vector as a column vector, not coming out of the page; in the above array we then lay the column vector “on its side” so that it is coming out of the page), and , which we can equivalently write as , where we think of here as a vector of length . Note that all the the nonzero blocks besides upper-left “” block only have nonzero entries that are strictly behind the nonzero entries in the upper-left block.
The second viewpoint, which we will also use below, is to consider the lateral slices of , or equivalently, to view from the side. When viewing from the side, we see the 3-way array:
[TABLE]
where every has only the first components being possibly non-zero, namely, for and for any .
For the only if direction,
suppose there exist and , such that . We can construct and such that . In fact, we will show that we can take where , and where . It is not hard to see that this form already ensures that the first matrices in the vector and those of are the same, since when are of this form, those first matrices are controlled entirely by the (resp., ) in the upper-left block of (resp., ).
The remaining question is then how to design appropriate and to take care of the last matrices in these tuples. This actually boils down to applying the following simple identity, but “in 3 dimensions:” Let be the permutation matrix corresponding to , so that , and . Let be a diagonal matrix. Then
[TABLE]
To see how Equation 3 helps in our setting, it is easier to focus attention on the lower right sub-array of , which can be represented as a symbolic matrix
[TABLE]
Here we think of the ’s as independent variables, whose indices correspond to “how far into the page” they are. That is, corresponds to the vector in , which is coming out of the page and has its only nonzero entry slices back from the page.
Then the action of permutes the ’s and multiplies them by some scalars, the action of is on the left-hand side, and the action of is on the right-hand side. Let be the permutation supporting . Then sends to
[TABLE]
So setting , the monomial matrix supported by with scalars being ’s, we have by Equation 3.
For the if direction,
suppose there exist and , such that . The key feature of these gadgets now comes into play: consider the lateral slices of , which are the frontal slices of (which may be easier to visualize by looking at Equation 2 and Figure 3). The first lateral slices of and are of rank and , while the other lateral slices are of rank (in fact, they are of rank 1; note that without loss of generality we may assume , for the only alternating matrix space is the zero space). Furthermore, left multiplying a lateral slice by and right multiplying it by does not change its rank. However, the action of here is by , and while the here corresponds to left multiplication on the lateral slices (=frontal slices of ), the on the right here corresponds to taking linear combinations of the lateral slices. In other words, just as is the “side view” of , is the side view of . Taking linear combinations of the lateral slices could, in principle, alter their rank; we will use the latter possibility to show that must be of a constrained form.
Write where is of size . We first claim that . For if not, then in (the side view), one of the last frontal slices receives a nonzero contribution from one of the first frontal slices of . Looking at the form of these slices from Equation 2, we see that any such nonzero combination will have rank , but this is a contradiction since the corresponding slice in has rank . Thus , and therefore must be invertible, since is.
Finally, we claim that has to be a monomial matrix. If not, then some frontal slice of among the first would have a contribution from more than one of these slices. Considering the lower-right sub-matrix of such a slice, we see that it would have rank exactly for some , which is again a contradiction since the first slices of all have rank . It follows that , , are in , and thus and are monomially isometric via . ∎
7.2 From 3-Tensor Isomorphism to Matrix Space Isometry and Matrix Group Isomorphism
Proposition 7.3**.**
3-Tensor Isomorphism* reduces to Alternating Matrix Space Isometry. Symbolically, isomorphism in reduces to isomorphism in (or even to ), where and , and .*
Proof.
We will exhibit a function from 3-way arrays to matrix tuples such that two 3-way arrays which are non-degenerate as 3-tensors, are isomorphic as 3-tensors if and only if the matrix spaces are isometric. Note that we can assume our input tensors are non-degenerate by Observation 6.2. The construction is a bit involved, so we will first describe the construction in detail, and then prove the desired statement.
The gadget construction.
Given a 3-way array , let denote the corresponding -tuple of matrices, . The first step is to construct , defined by where . Already, note that if , then and are pseudo-isometric matrix tuples (equivalently, and are isometric matrix spaces).
However, it is not clear whether the converse should hold. Indeed, suppose for some . If we write as a block matrix , where and , then by considering the (1,2) block we get that for all , whereas what we would want is the same equation but without the term. To remedy this, it would suffice if we could extend the tuple to so that any pseudo-isometry between and will have .
To achieve this, we start from , and construct as follows. Here we write it out symbolically, on the next page is the same thing in matrix format, and in Figure 4 is a picture of the construction. Let . Write , where are defined as follows:
- •
For , . Recall that .
- •
For the next slices, that is, , we can naturally represent by where , . We then let be the elementary alternating matrix .
- •
For the next slices, that is , we can naturally represent by where , . We then let be the elementary alternating matrix .
We may view the above construction is as follows. Write the frontal view of as
[TABLE]
where , which we think of as a column vector, but when place in the above array, we think of it as coming out of the page.
Let be the -way array whose frontal slices are , so . Then the frontal view of is
[TABLE]
where , , and .
We now examine the ranks of the lateral slices of . We claim:
[TABLE]
To see why these hold:
- •
For , the th lateral slice is block-diagonal with two non-zero blocks. One block is of size , and the other is . Therefore .
- •
For , the th lateral slice is also block-diagonal with two non-zero blocks. One block is of size , and the other is . Therefore .
- •
For , after rearranging the columns, the th lateral slice has one non-zero block which is is for the first slices, and for the next slices. Therefore or , and since we have assumed , in either case we have .
We then consider the ranks of the linear combinations of the lateral slices.
- •
As long as the linear combination involves for , then the resulting matrix has rank at least , because of the matrix in the last rows.
- •
If the linear combination does not involve for , then the resulting matrix has rank at most , because in this case, there are at most non-zero rows.
- •
If the linear combination involves for , then the resulting matrix has rank at least , because of the matrix in the th to the th rows.
We then prove that and are isomorphic as 3-tensors if and only if and are isometric as matrix spaces. At first glance, the only if direction seems the easy one, as one expects to extend a 3-tensor isomorphism between to to an isometry between and easily. However, it turns out that this direction becomes somewhat technical because of the gadget introduced. This is handled in the following.
For the if direction,
suppose , for some and . Write as , where is of size , is of size , and is of size . By the discussion on the ranks of the linear combinations of the lateral slices, we have , , , and . So , where , , are invertible. Then consider the action of such on the first frontal slices of . The first frontal slices of are of the form , where is of size . Then we have
[TABLE]
From the fact that is invertible and , by considering the block, we find that every frontal slice of lies in (since the gadget does not affect the block-(1,2) position), which gives an isomorphism of tensors, as desired.
For the only if direction,
suppose and are isomorphic as 3-tensors, that is, , for some , , and .
We show that there exist and such that setting
[TABLE]
we have
[TABLE]
which will demonstrate that and are pseudo-isometric.
Since we are claiming that works, and is block-diagonal, it suffices to consider the first frontal slices separately from the remaining slices. For the first frontal slices, we have:
[TABLE]
It follows from the fact that that the first frontal slices of and of are the same.
We now consider the remaining frontal slices separately. Towards that end, let be the 3-way array obtained by removing the first frontal slices from . That is, the th frontal slice of is the th frontal slice of . Similarly construct from . We are left to show that and are pseudo-isometric under some and . Note that and are from the isomorphism between and , while and are what we still need to design.
We first note that both and can be viewed as a block 3-way array of size , whose two frontal slices are the block matrices
[TABLE]
where is of size , and is of size . Although these are already identical in , the issue here is that and may alter the slices of when they act on , so we need a way to “undo” this action to bring it back to the same slices in .
We now claim that we may further handle these two block slices—the “” slices and the “”-slices—separately, that is, that we may take and where , , , and .
To handle , first note that we have
[TABLE]
where .
Now we examine the lateral slices of . The th lateral slice of (up to a suitable permutation) is
[TABLE]
where each is of size , is the th block, and there are block matrices in total. The action of on is by left multiplication. So it sends to . If we set to be the identity and , where there are copies of on the diagonal, then we have , and thus .
It is easy to check that can be handled in the same way, where now play the roles that played before, respectively. This produces the desired , , , and , and concludes the proof. ∎
Corollary 7.4**.**
3-Tensor Isomorphism* reduces to Symmetric Matrix Space Isometry.*
Proof.
In the proof of Proposition 7.3, we can easily replace with , and the elementary alternating matrices with the elementary symmetric matrices, and the resulting proof goes through mutatis mutandis. ∎
Finally, we show how to reduce to Group Isomorphism for matrix groups. We begin with a lemma that we also need for the search-to-decision reduction below. We believe this lemma to be classical, but have not found a reference stating it in quite the form we need.
Lemma 7.5** (Constructive version of Baer’s correspondence for matrix groups).**
Let be an odd prime. Over the finite field , Alternating Matrix Space Isometry is equivalent to Group Isomorphism for matrix groups over that are -groups of class and exponent . More precisely, there are functions computable in time :
- •
* and*
- •
**
such that: (1) for an alternating bilinear map , the group generated by is the Baer group corresponding to , (2) and Alt are mutually inverse, in the sense that the group generated by is isomorphic to the group generated by , and conversely is pseudo-isometric to .
Proof.
First, let be a -group of class and exponent given by generating matrices of size over . Then from the generating matrices of , we first compute a generating set of , by just computing all the commutators of the given generators. We can then remove those redundant elements from this generating set in time , using Luks’ result on computing with solvable matrix groups[Luk92]. We then compute a set of representatives of a non-redundant generating set of , again using Luks’s aforementioned result. From these data we can compute an alternating bilinear map representing the commutator map of in time .
Conversely, let an alternating bilinear map be given by . From , for , construct . That is, the th column of is the th column of . Then for , construct
[TABLE]
and for , construct
[TABLE]
Let be the matrix group generated by and . Then it can be verified easily that, is isomorphic to the Baer group corresponding to the alternating bilinear map defined by . In particular, (isomorphism of abelian groups), and . This construction can be done in time . ∎
Corollary 7.6**.**
Let be an odd prime. 3-Tensor Isomorphism over reduces to Group Isomorphism for -groups of class 2 and exponent given by matrices over , in time (where is the max of the dimensions of the 3-tensor).
Proof.
Combine Proposition 7.3 with Lemma 7.5. Note that for this direction of the reduction, we only need the function from Lemma 7.5, which can be computed in time . ∎
7.3 Search to decision reduction for -Group Isomorphism and Alternating Matrix Space Isometry
Theorem C.
Given an oracle deciding Alternating Matrix Space Isometry, there is a -time algorithm to find an isometry between two alternating matrix spaces , if it exists, using at most oracle queries each of size at most .
In particular, if Alternating Matrix Space Isometry can be decided in time, then isometries between such spaces can be found in time. See Question 10.5.
Proof.
As before, we first present the gadget construction, which is a combination of the two gadgets introduced in Sections 7.1 and 7.2, respectively. Then based on this gadget, we present the search-to-decision reduction.
Gadget construction.
Let be an ordered linear basis of , and let be the 3-way array constructed from , so we can write
[TABLE]
where , thought of as a vector coming out of the page.
We first consider a -tensor constructed from , for any , as
[TABLE]
Consider the lateral slices of .
- •
The first lateral slices have rank in . Note that the rank is strictly less than because some tube fibers (coming out of the page) are in the upper-left sub-array.
- •
The next lateral slices have rank in .
- •
The remaining lateral slices have rank in (since .)
By combining the arguments for the two gadgets introduced in Sections 7.1 and 7.2 respectively, we have the following. From Sec. 7.2, for invertible matrices and to satisfy , has to be of the form , where is of size , is of size , and is of size . Furthermore, from Sec. 7.1, is a monomial matrix. In particular, if such and exist, then it implies that and are isometric by a matrix of the form where is a monomial matrix of size . Note that the presence of , , does not interfere here, because of the argument in the if direction in the proof of Proposition 7.3. On the other hand, if and are isometric by a matrix of such form, then and are also isometric.
The search-to-decision reduction.
Given these preparations, we now present the search-to-decision reduction for Alternating Matrix Space Isometry. Recall that this requires us to use the decision oracle to compute an explicit isometry transformation , if and are indeed isometric. Think of as sending the standard basis to another basis , where and are in .
In the first step, we guess , the image of , and a complement subspace of , at the cost of . For each such guess, let be the matrix which sends and sends to the chosen complementary subspace in some fashion. We apply to , and call the resulting -way array in the following. Then construct and , and feed these two instances to the oracle . Note that, since (using notation as above) must be monomial, any equivalence between and must preserve our choice of up to scale. Thus, clearly, if and are indeed isometric and we guess the correct image of , then the oracle will return yes (and conversely).
In the second step, we guess , the image of , and a complement subspace of within , at the cost of . Note here that the previous step guarantees that there is an isometry respecting the direct sum decomposition , so we need only search for a complement of within , and not a more general complement of in all of . This is crucial for the runtime, as at the step, the latter strategy would result in searching through possibilities.
For each such guess, we apply the corresponding transformation to (and again call the resulting -way array ). Then construct and , and feed these two instances to the oracle . Clearly, if and are indeed isometric and we guess the correct image of (and from the previous step), then the oracle will return yes. However, there is a small caveat here, namely we may guess some image of , such that and are actually isometric by some matrix of the form where is a monomial matrix of size . But this is fine, as it still means that our choices of is correct as a set up to scaling. So we proceed.
In general, in the th step, we know that and are isometric by some where is a monomial matrix of size . We guess , the image of in , and a complement subspace of within . This cost is . For each such guess, we apply the corresponding transformation to (and call the resulting -way array ). Then construct and , and feed these two instances to the oracle . Once we guess correctly, we ensure that and are isometric by where is a monomial matrix of size .
So after the th step, we know that and are isometric by a monomial transformation. The number of all monomial transformations is by . Therefore we can enumerate all monomial transformations and check correspondingly.
Note that all the instances we feed into the oracle are of size . This concludes the proof. ∎
**Corollary C **(Search to decision for testing isomorphism of
a class of -groups).
Let be an odd prime. Given an oracle deciding isomorphism of -groups of class 2 and exponent given by generating matrices over of size , there is a -time algorithm to find an isomorphism between such groups, using at most oracle queries each of size at most .
Proof.
The result follows from Theorem C with the constructive version of Baer’s Correspondence in the model of matrix groups over finite fields (Lemma 7.5).
In more detail, given Lemma 7.5 we can follow the procedure in the proof of Theorem C. For the given -groups, we compute their commutator maps. Then whenever we need to feed the decision oracle, we transform from the alternating bilinear map to a generating set of a -group of class and exponent with this bilinear map as the commutator map. After getting the desired pseudo-isometry for the alternating bilinear maps, we can easily recover an isomorphism between the originally given -groups. This concludes the proof. ∎
8 Other reductions for the Main Theorem A
In this section, we present other reductions to finish the proof of Theorem A. The reductions here are based on the constructions which may be summarized as “putting the given 3-way array to an appropriate corner of a larger 3-way array.” Such an idea is quite classical in the context of matrix problems and wildness [GP69]; here we use the same idea for problems on 3-way arrays.
8.1 From 3-Tensor Isomorphism to Matrix Space Conjugacy
Proposition 8.1**.**
3-Tensor Isomorphism* reduces to Matrix Space Conjugacy. Symbolically, reduces to , where .*
Proof.
The construction. For a 3-way array , let be the matrix tuple consisting of frontal slices of . Construct from , where . See Figure 5.
Given two non-degenerate 3-way arrays which we wish to test for isomorphism (we can assume non-degeneracy without loss of generality, see Observation 6.2), we claim that as 3-tensors if and only if the matrix spaces and are conjugate.
For the only if direction, since and are isomorphic as 3-tensors, there exist , , and , such that . Let . Then . It follows that, , which just says that .
For the if direction, since and are conjugate, there exist , and , such that . Write , where , . Let , where . Then as , we have for every ,
[TABLE]
This in particular implies that for every , . In other words, every row of lies in the common left kernel of with . Since is non-degenerate, must be the zero matrix. It follows that , so and are both invertible matrices. By Equation 5, we have , where , , and , which just says that and are isomorphic as 3-tensors. ∎
Corollary 8.2**.**
3-Tensor Isomorphism* reduces to*
Matrix Lie Algebra Conjugacy*, where is commutative;* 2. 2.
Associative Matrix Algebra Conjugacy*, where is commutative (and in fact has the property that for all ; note that is not unital);* 3. 3.
Matrix Lie Algebra Conjugacy*, where is solvable of derived length 2, and ; and,* 4. 4.
Associative Matrix Algebra Conjugacy*, where the Jacobson radical squares to zero, and .*
Proof.
We use the notation from the proof of Proposition 8.1. Note that the matrix spaces constructed there, e. g., , are all subspaces of the matrix space .
For (1) and (2), observe that for any two matrices , we have , and thus as well. Thus any matrix subspace of is both a commutative matrix Lie algebra and a commutative associative matrix algebra with zero product.
For (3) and (4), we note that we can alter the construction of Proposition 8.1 by including the matrix in both matrix spaces and without disrupting the reduction. Indeed, for the forward direction we have that (again, following notation as above) .
For the reverse direction, we then have that for , we have . Let , where . Then as , we have for every ,
[TABLE]
Considering the (2,1) block of this equation, we find that if , then immediately . But even if , then we are back to the same argument as in Proposition 8.1, namely that by the non-degeneracy of , we still get by considering the (2,2) block. The remainder of the argument only depended on the (1,2) block of the preceding equation, which is the same as before.
Finally, to see the structure of the corresponding algebras, we must consider how our new element interacts with the others. Easy calculations reveal:
[TABLE]
(3) For the structure of the Lie algebra, we have from the above equations that any commutator is either 0 or lands in . And since , we have that is the subspace of that we started with before including . Since everything in that subspace commutes, we get that , and thus the Lie algebra is solvable of derived length 2. Moreover, is spanned by the image of , whence it is isomorphic to .
(4) Recall that for rings without an identity, the Jacobson radical can be characterized as [Lam91, p. 63]. Note that the only nontrivial cases to check are those for which , since otherwise and then we may take as well. So we have . But since is a left identity, this latter equation is just . For any , we may take , since then both sides of the equation are zero, and thus includes all the matrices in the original space from Proposition 8.1. However, , for there is no such that : any element of can be written for some . Writing this way, we are trying to solve the equation . Thus we conclude , and then we get that , a contradiction. So , and thus is spanned by the image of , whence it is isomorphic to . ∎
8.2 From Matrix Space Isometry to Algebra Isomorphism and Trilinear Form Equivalence
Proposition 8.3**.**
Matrix Space Isometry* reduces to Algebra Isomorphism and Trilinear Form Equivalence. Symbolically, reduces to and to , where .*
Proof.
The construction. Given a matrix space by an ordered linear basis , construct the 3-way array whose frontal slices are:
[TABLE]
Let denote the algebra whose structure constants are defined by , and let denote the trilinear form whose coefficients are given by .
Given two matrix spaces , we claim that and are isometric if and only if (isomorphism of algebras) if and only if and are equivalent as trilinear forms. The proofs are broken into the following two lemmas, which then complete the proof of the proposition. ∎
Lemma 8.4**.**
Let notation be as above. The matrix spaces are isometric if and only if and are isomorphic.
Proof.
Let be the ordered bases of , respectively. Recall that are isometric if and only if there exist such that . Also recall that and are isomorphic as algebras if and only if there exists such that . Since (resp. ) form a linear basis of (resp. ), we have that (resp. ) are linearly independent.
The only if direction
is easy to verify. Given an isometry between and , let . Let . Then for , . For , . Let . Then for , . For , is the th matrix in , which in turn equals by the assumption on and . This proves the only if direction.
For the if direction,
let be an algebra isomorphism, where is of size . Let , and . Since for , , we have . Therefore has to be , because ’s are linearly independent. It follows that , where and are invertible. So for , we have . Also the last matrices in are , where is the th matrix in . This implies that and together form an isometry between and . ∎
Corollary 8.5**.**
Matrix Space Isometry* reduces to*
Associative Algebra Isomorphism*, for algebras that are commutative and unital;* 2. 2.
Associative Algebra Isomorphism*, for algebras that are commutative and 3-nilpotent ( for all ); and,* 3. 3.
Lie Algebra Isomorphism*, for Lie algebras that are 2-step nilpotent ( for all ).*
Proof.
We follow the notation from the proof of Lemma 8.4. We begin by observing that is a 3-nilpotent algebra, and therefore is automatically associative. Let , where , , and, as a subspace of , has a basis given by and has a basis given by . Let denote the product in , so that . Note that because the lower rows and the rightmost columns of each frontal slice of are zero, we have that for any and any . Thus only way to get a nonzero product is of the form where , and here the product ends up in , since the only nonzero frontal slices are . Since any nonzero product ends up in , and anything in times anything at all is zero, we have that for all , that is, is 3-nilpotent. Any 3-nilpotent algebra is automatically associative, since the associativity condition only depends on products of three elements.
(2) If instead of general Matrix Space Isometry, we start from Symmetric Matrix Space Isometry (which is also -complete by Corollary 7.4), then we see that the algebra is commutative, for we then have , which corresponds to .
(1) As is standard, from the algebra , we may adjoin a unit by considering . In terms of vector spaces, we have , where the new summand is spanned by the identity . This standard algebraic construction has the property that two such algebras are isomorphic if and only if their corresponding unit-adjoined algebras are (see, e. g., [Dor32, Wik19]).
(3) By starting from an alternating matrix space (and noting that Alternating Matrix Space Isometry is still -complete, by Corollary 7.4), we get that is alternating, that is, . Since we still have that it is 3-nilpotent, , we find that automatically satisfies the Jacobi identity. An alternating product satisfying the Jacobi identity is, by definition, a Lie bracket (that is, we can define ), and thus we get a Lie algebra with structure constants . Translating the 3-nilpotency condition into the Lie bracket notation, we get , or in other words that the Lie algebra is nilpotent of class 2. ∎
Corollary 8.6**.**
3-Tensor Isomorphism* reduces to Cubic Form Equivalence.*
Proof.
Agrawal and Saxena [AS06] show that Commutative Algebra Isomorphism reduces to Cubic Form Equivalence. Combine with Corollary 8.5(1). ∎
The reduction from to is achieved by the same construction.
Lemma 8.7**.**
Let , and be as above. Then and are pseudo-isometric if and only if and are isomorphic as trilinear forms.
Proof.
Recall that and are pseudo-isometric if there exist such that . Also recall that and are equivalent as trilinear forms if there exists such that . Since (resp. ) form a linear basis of , we have that (resp. ) are linearly independent.
The only if direction
is easy to verify. Given an pseudo-isometry between and , let . Then it can be verified easily that is a trilinear form equivalence between and , following the same approach in the proof of Lemma 8.4.
For the if direction,
write be a trilinear form equivalence between and . We first observe that the last matrices in are still linearly independent. Then, because of the first matrices in are all zero matrices, has to be the zero matrix. It follows that , where and are invertible. Then it can be verified easily that and form an pseudo-isometry between and , following the same approach in the proof of Lemma 8.4. ∎
9 Reducing -Tensor Isomorphism to 3-Tensor Isomorphism
Theorem B.
-Tensor Isomorphism reduces to Algebra Isomorphism. If the input tensor has size , then the output algebra has dimension where .
Remark 9.1**.**
One cannot do too much better in terms of size of the output, as the following argument suggests. Over finite fields, we may count the number of orbits, which provides a rigorous lower bound on the size blow-up of any kernel reduction (see, e. g., [FG11, Sec. 4.2.4]). Over infinite fields, if we consider algebraic reductions, they must preserve dimension, so we can make a similar (albeit more heuristic) argument by considering the “dimension” of the set of orbits. Here we have put “dimension” in quotes because the set of orbits is not a well-behaved topological space (it is typically not even ), but even in this case the same argument should essentially hold. The space of -tensors has dimension , and the group has dimension , so the “dimension” of the set of orbits is at least (); over , the number of orbits is at least . For algebras of dimension , the space of such algebras is -dimensional, so the “dimension” of the set of orbits is at most ; over , the number of orbits is at most . Thus we need , whence .
Proof idea.
The idea here is similar to the reduction from 3TI to Algebra Isomorphism: we want to create an algebra in which all products eventually land in an ideal, and multiplication of algebra elements by elements in the ideal is described by the tensor we started with. For a 3-tensor this was very natural, as the structure constants of any algebra form a 3-tensor. In that case, we are using it to say how to write the product of 2 elements as a linear combination (the third factor of the tensor) of basis elements. With a -tensor for , we now want to use it to describe how to write the product of elements as a linear combination of basis elements. The tricky part here is that in an algebra we must still describe the product of any two elements. The idea is to create a set of generators, let them freely generate monomials up to degree , and then when we get a product of generators, rewrite it as a linear combination of generators according to the given tensor. This idea almost provides one direction of the reduction: if two -tensors are isomorphic, then the corresponding algebras are isomorphic. However, there is an issue with implementing this, namely that monomials are commutative, but our tensors need not be symmetric, and moreover, they need not even be “square” (have all side lengths equal). In [AS05, Thm. 5] they reduce Degree- Form Equivalence to Commutative Algebra Isomorphism along similar lines, but there the starting objects are themselves commutative, so this was not an issue. In our case, we will get around this using a certain noncommutative algebra where the only nonzero products are those which come “in the right order.”
Another potentially tricky aspect of the reduction is the converse: suppose we build our algebras as above from two -tensors, and are isomorphic; how can we guarantee that and are isomorphic? For this, we would like to be able to identify certain subsets of the algebras as characteristic (invariant under any automorphism), so that those characteristic subsets force the isomorphism to take a particular form, which we can then massage into an isomorphism between the tensors . Our way of doing this is to encode the “degree” structure into the path algebra of a graph, as described in the next section. If the graph has no automorphisms, then the path algebra has the advantage that for any two vertices , the subset of spanned by the paths from to is nearly characteristic in a way we make precise below. ∎
9.1 Preliminaries for Theorem B
To make the above proof idea precise, we will need a little background on Leavitt path algebras (a.k.a. quiver algebras) and their quotients. For a textbook reference on these algebras, see [ASS06, Ch. II], and for a textbook treatment of Wedderburn–Artin theory and the Jacobson radical, see [Lam91]. Aside from the definition of path algebra, most of this section will end up being used as a black box; we include it mostly for ease of reference.
We start with some important, classical results on the structure of associative algebras. The Jacobson radical of an associative algebra , here denoted , is the intersection of all maximal right ideals. Equivalently, . A unital algebra over a field is semisimple if ; in this case, by Wedderburn’s Theorem (see below), is isomorphic to a direct sum of matrix algebras over finite-degree division rings extending . An algebra is called separable if it is semisimple over every field extending , that is, is semisimple for all fields extending . Equivalently, is separable if it is isomorphic to , where each is a division ring extending such that the center is a separable field extension of . If has characteristic zero or is perfect (which includes all finite fields), then all its extensions are separable. For the algebra we construct, it will simply be a direct sum of copies of , so it is automatically separable over any field.
An element is idempotent if . An idempotent is primitive if it cannot be written as the sum of two nonzero idempotents. Two idempotents are orthogonal if . A complete set of primitive orthogonal idempotents of is a set of primitive idempotents which are pairwise orthogonal, and such that the set is maximal subject to this condition.
Theorem 9.2** (Wedderburn–Mal’cev, see, e. g., [Far05]).**
Let be an finite-dimensional, associative, unital algebra over a field . Then
* (as algebras), where each is a division ring of finite degree over .* 2. 2.
If is separable, then there exists a subalgebra such that (as -vector spaces). 3. 3.
If is any separable subalgebra, then there exists such that .
The last part of the preceding theorem is what we will use to show that the set of paths in our graph is “nearly characteristic;” that is, it is not characteristic, but it is characteristic up to conjugacy (=inner automorphisms).
Definition 9.3** (Leavitt path algebra).**
Given a directed multigraph (possibly with parallel edges and self-loops, a.k.a. quiver), its Leavitt path algebra is the algebra of paths in , where multiplication is given by concatenation of paths (when this is well-defined), and zero otherwise. That is, is generated by , where the generators are thought of as the “path of length [math]” at vertex . The defining relations in are that the product of two paths is their concatenation if the end of the first equals the start of the second, and zero otherwise. More formally, the relations are:
[TABLE]
where is the Kronecker delta: it is if and [math] otherwise.
Note that we are allowed to take formal linear combinations of paths in this algebra, as it is an -algebra (so in particular, it is an -vector space). The arrow ideal of is the two-sided ideal generated by the arrows, and has a basis consisting of all paths of length ; it is denoted .
Lemma 9.4** (See [ASS06, Cor. II.1.11]).**
If is finite, connected, and acyclic, then is the arrow ideal , and has a basis consisting of all paths of length , and the set is a complete set of primitive orthogonal idempotents.
Corollary 9.5**.**
Let be a finite, connected, acyclic graph, and an ideal of contained in ; let . Then (1) , (2) , whence is separable, and (3) is a complete set of primitive orthogonal idempotents, where is the image of under the quotient map .
Proof.
(1) This holds for any ideal contained in the radical of any finite-dimensional associative unital algebra [Lam91, Prop. 4.6].
(2) It is clear that as vector spaces, (where ), and the span of the is easily seen to be an algebra isomorphic to , where the -th copy of is spanned by , where is the natural projection. Thus . Since , we have . As a semisimple algebra, we thus have that , and as is always a separable extension over itself, is separable.
(3) The property of being a set of primitive orthogonal idempotents is preserved by homomorphisms, so there are only two things to check here: first, that none of the is zero modulo , and second, that there are no additional primitive idempotents in that are mutually orthogonal with every . To see that none of the are zero, note that factors through ; then since for any (from the previous paragraph), it must be the case that as well. Finally, we must show this is a complete set of primitive orthogonal idempotents. Suppose not; that is, suppose there is some such that is a primitive idempotent that is orthogonal in to every . First, we claim that . For, since is a finite acyclic graph, its arrow ideal is nilpotent: there are no paths longer than , so we must have , whence cannot contain any idempotents. Since is nilpotent, the same must be true of , whence cannot contain any idempotents, so cannot be in . But then the image of in is nonzero (since ), so is another primitive idempotent orthogonal to every in . But this is a contradiction, since is already a complete set of primitive orthogonal idempotents for . ∎
Finally, in the course of the proof, we will use the following construction of Grigoriev:
Theorem 9.6** (Grigoriev [Gri81, Theorem 1]).**
Graph Isomorphism* is equivalent to Algebra Isomorphism for algebras such that the radical squares to zero and is abelian.*
In our proof, all we will need aside from Grigoriev’s result is to see the construction itself, which we recall here in language consistent with ours.
Construction [Gri81].
Given a graph , construct an algebra as follows: it is generated by subject to the following relations: , , , when , is generated by , and the radical squares to zero. It is immediate that this is just . From any such algebra , Grigoriev recovers a corresponding weighted graph, where the weight on is . In our setting we use multiple parallel edges rather than weight, but the proof goes through mutatis mutandis. ∎
9.2 Proof of Theorem B
Proof.
Let be an -tensor. Let be the following directed multigraph (see Figure 6): it has vertices, labeled , and for , it has parallel arrows from vertex to vertex , and parallel arrows from to .
Because of the structure of this graph, we can index the generators of a little more mnemonically than in the preliminaries above: let the generators corresponding to the arrows from be for , and let the generators corresponding to the arrows be for . Let be the quotient of by the relation161616For those familiar with quiver algebras, we note that this ideal is not admissible, as it is not contained in . It can probably be made admissible by inserting new vertices in the middle of each edge . However, when we tried to do that in a naive way, we ran into problems verifying the reduction, as what should be a linear transformation either ends up being incorrect or ends up being quadratic, either of which caused issues.
[TABLE]
At the moment, we only have in terms of generators and relations; however, it will be easy to turn it into its basis representation. The key is to bound its dimension, which we do now. Except for paths of length (because of the nontrivial relations (6)), this is just counting the number of paths in the graph described above. The only nonzero monomials of degree are those of the form . For a given choice of , there are exactly such monomials, so we have
[TABLE]
Note that in the first line we can exactly specify , independent of itself (depending only on its dimensions). For any fixed , this dimension is polynomial in . By the linear-algebraic analogue of breadth-first search, we may thus list a basis for and its structure constants with respect to that basis.
We claim that the map is a reduction. Suppose is another tensor of the same dimension, and let be the associated algebra as above. We claim that as -tensors if and only if as algebras.
For the only if direction, suppose via , that is
[TABLE]
for all . Then we claim that the block-diagonal matrix (where ), together with mapping to , induces an isomorphism from to . Note that itself is not an isomorphism, as , but specifies a linear map on the generators of , which we may then exend to all of .
First let us see that indeed gives a well-defined homomorphism . Since is only defined on the generators and is, by definition, extended by distributivity, the only thing to check here is that sends the relations of into the relations of . Let denote the basis of as above. The map is defined by ,
[TABLE]
and
[TABLE]
By left multiplying by , we may rewrite this last equation as
[TABLE]
note the transpose.
To check the relations, let us write out the Leavitt path algebra relations explicitly for our graph, in our notation. The generators of are , and the relations are (6) and the quiver relations:
[TABLE]
Note that the set is linearly spanned by the paths in this graph.
The relations involving the are easy to verify, since they only depend on the first subscript of (resp., ), and does not alter this subscript.
For relation (7), we have:
[TABLE]
where the final inequality comes from the defining relations in .
The verification for remaining quiver relations is similar, since does not alter the start and end vertices of any arrow (though it may send a single arrow in to a linear combination of arrows in ).
We now verify the relation (6). We have
[TABLE]
as desired. Thus the map induced by is an algebra homomorphism.
Next, since is an isomorphism of -dimensional vector spaces, the map it induces is surjective on the generators of , whence it is surjective onto all of . Finally, since , any linear surjective map is automatically bijective, so this map is indeed an isomorphism of algebras.
For the if direction, suppose that is an isomorphism of algebras. Since the Jacobson radical is characteristic, we have . Then is a set of primitive orthogonal idempotents in , and their span is a separable subalgebra (isomorphic to ) such that . By the Wedderburn–Mal’cev Theorem (Theorem 9.2(3)), there is some such that . Since the are the only primitive idempotents in , we must have that for all and some permutation .
Next we will show that this permutation is in fact the identity, so that for all . For this, consider and similarly . These are precisely the algebras considered by Grigoriev [Gri81] (reproduced as Theorem 9.6 above). Since is characteristic, so is its square, and thus induces an isomorphism . By Theorem 1 of Grigoriev [Gri81], any isomorphism induces an isomorphism of the corresponding graphs, so this isomorphism must map to for each (since our graph has no automorphisms). Thus must be the identity, and for all .
Since conjugation is an automorphism, let be , where . By the above, for all . Thus . In particular, define to be the restriction of to for and to be the restriction of to . Then we have that is a linear bijection from the span of to the span of for all . We claim that is a tensor isomorphism , that is,
[TABLE]
From the fact that is an isomorphism, we have
[TABLE]
For each , equating the coefficient of gives
[TABLE]
Let be the natural row vector of length , and similarly for . Then we may rewrite the preceding set of equations (one for each choice of ) in matrix notation as
[TABLE]
Right multiplying by , we then get
[TABLE]
as claimed. ∎
10 Conclusion: universality and open questions
10.1 Towards universality for basis-explicit linear structures
A classic result is that GI is complete for isomorphism problems of explicitly given structures (see, e. g., [ZKT85, Section 15]). Here we formally state the linear-algebraic analogue of this result, and observe trivially that the results of [FGS19] already show that 3-Tensor Isomorphism is universal among what we call “basis-explicit” (multi)linear structures of degree 2.
First let us recall the statement of the result for GI, so we can develop the appropriate analogue for tensor isomorphism. A first-order signature is a list of positive integers ; a model of this signature consists of a set (colloquially referred to as “vertices”), relations , and functions . The numbers are thus the arities of the relations , and the are the arities of the functions .171717Sometimes one also includes constants in the definition, but these can be handled as relations of arity 1. While we could have done the same for functions, treating a function of arity as its graph, which is a relation of arity , distinguishing between relations and functions will be useful when we come to our linear-algebraic analogue. Two such models and are isomorphic if there is a bijection that sends to for all and to for all . In symbols, is an isomorphism if for all and all , and similarly if for all and all . By an “explicitly given structure” or “explicit model” we mean a model where each relation is given by a list of its elements and each function is given by listing all of its input-output pairs. Fixing a signature, the isomorphism problem for that signature is to decide, given two explicit models of that signature, whether they are isomorphic. This isomorphism problem is directly encoded into the isomorphism problem for edge-colored hypergraphs, which can then be reduced to GI using standard gadgets.
For example, the signature for directed graphs (possibly with self-loops) is simply —its models are simply binary relations. If one wants to consider graphs without self-loops, this is a special case of the isomorphism problem for the signature , namely, those explicit models in which for any . Note that a graph without self-loops is never isomorphic to a graph with self-loops, and two directed graphs without self-loops are isomorphic as directed graphs if and only if they are isomorphic as models of the signature . In other words, the isomorphism problem for simple directed graphs really is just a special case. The same holds for undirected graphs without self-loops, which are simply models of the signature in which and is symmetric. As another example, the signature for finite groups is : the first relation will be a singleton, indicating which element is the identity, the function is the inverse function , and the second function is the group multiplication . Of course, models of the signature can include many non-groups as well, but, as was the case with directed graphs, a group will never be isomorphic to a non-group, and two groups are isomorphic as models of iff they are isomorphic as groups.
A natural linear-algebraic analogue of the above is as follows. One additional feature we add here for purposes of generality is that we need to make room for dual vector spaces. A linear signature is then a list of pairs of nonnegative integers with the property that and for all . By the arity of the -th relation (resp., function) we mean the sum (resp., ).
Definition 10.1** (Linear signature, basis-explicit).**
Given a linear signature
[TABLE]
a linear model for over a field consists of an -vector space , and linear subspaces for and linear maps for . Two such linear models are isomorphic if there is a linear bijection that sends to for all and to for all (details below).
A basis-explicit linear model is given by a basis for each , and, for each element of a basis of the domain of , the value of on that element. Vectors here are written out in their usual dense coordinate representation.
In particular, this means that an element of —say, a basis element of —is written out as a vector of length . We will only be concerned with finite-dimensional linear models.
Given , let denote the linear map which is defined on basis vectors factor-wise: , and then extended to the whole space by linearity. (Recall that , so elements of are linear maps , and thus is a map from , i. e., an element of , as desired). Similarly, when we say that sends to , we mean that .
Remark 10.2**.**
We use the term “basis-explicit” rather than just “explicit,” because over a finite field, one may also consider a linear model of as an explicit model of a different signature (where the different signature additionally encodes the structure of a vector space on , namely, the addition and scalar multiplication), and then one may talk of a single mathematical object having explicit representations—where everything is listed out—and basis-explicit representations—where things are described in terms of bases. An example of this distinction arises when considering isomorphism of -groups of class : the “explicit” version is when they are given by their full multiplication table (which reduces to GI), while the “basis-explicit” version is when they are given by a generating set of matrices or a polycyclic presentation (which GI reduces to).
Theorem 10.3** (Futorny–Grochow–Sergeichuk [FGS19]).**
Given any linear signature where all relationship arities are at most 3 and all function arities are at most 2, the isomorphism problem for finite-dimensional basis-explicit linear models of reduces to 3-Tensor Isomorphism in polynomial time.
Because of the equivalence between -Tensor Isomorphism and 3-Tensor Isomorphism (Theorem B + [FGS19]), we expect the analogous result to hold for arbitrary . Thus an analogue of the results of [FGS19] for -tensors would yield the full analogue of the universality result for GI.
Open Question 10.4**.**
Is -Tensor Isomorphism universal for isomorphism problems on -way arrays? That is, prove the analogue of the results of [FGS19] for -way arrays for any .
10.2 Other open questions
Our search-to-decision reduction (Theorem C) produces instances of dimension from instances of dimension . As stated, this means that a simply-exponential (-time) decision algorithm would result only in a search algorithm, but the latter runtime is trivial. We note that it may be possible to alleviate this blow-up by attempting to generalize the logarithmic-size “coloring palette” construction for reducing Colored GI to GI from the graph case to the linear-algebraic case.
Open Question 10.5**.**
Is there a search-to-decision reduction for Alternating Matrix Space Isometry (and, consequently, isomorphism of -groups of class 2 and exponent , given in their natural succinct encoding) that runs in time , and produces instances of quasi-linear () dimension?
In Section 3.2 we gave several different reductions from GI to Alternating Matrix Space Isometry. To summarize, they are:
A direct reduction from GI to Alternating Matrix Space Isometry (Prop. 7.1) 2. 2.
GI Matrix Lie Algebra Conjugacy [Gro12a], which in turn reduces to 3TI [FGS19], and then to Alternating Matrix Space Isometry (Thm. A); 3. 3.
GI CodeEq [PR97, Luk93, Miy96], CodeEq Matrix Lie Algebra Conjugacy [Gro12a], and then follow the same reductions as in (1); 4. 4.
GI Monomial Code Equivalence (the same reduction from [PR97] works for monomial equivalence of codes, see [Gro12a]), which in turn reduces to 3TI (Prop. 3.6), and thence to Alternating Matrix Space Isometry (Thm. A) 5. 5.
GI Algebra Isomorphism [Gri81, AS05], which reduces to 3TI [FGS19], and then to Alternating Matrix Space Isometry (Thm. A).
Can one prove that these reductions are all distinct? Are some of them equivalent in some natural sense, e. g., up to a change of basis?
Next, most of our results hold for arbitrary fields, or arbitrary fields with minor restrictions. However, in all of our reductions, we reduce one problem over to another problem over the same field .
Open Question 10.6**.**
What is the relationship between over different fields? In particular, what is the relationship between and , between and for coprime , or between and ?
We note that even the relationship between and is not particularly clear. For matrix tuples (rather than spaces; equivalently, representations of finitely generated algebras) it is the case that for any extension field , two matrix tuples over are -equivalent (resp., conjugate) if and only if they are -equivalent [KL86] (see [dSP10] for a simplified proof). However, for equivalence of tensors this need not be the case. This seems closely related to the so-called “problem of forms” for various algebras, namely the existence of algebras that are not isomorphic over , but which become isomorphic over an extension field.
Example 10.7** (Non-isomorphic tensors isomorphic over an extension field).**
Over , let and let . Since these two matrices have different signatures, they are not isometric over ; since they have the same rank, they are isometric over . To turn this into an example of 3-tensors, first we consider the corresponding instance of Matrix Space Isometry given by and . Note that , so the signatures of all matrices in are , , or . Similarly, the signatures appearing in are , , and , so these two matrix spaces are not isometric over , though they are isometric over since and are. Finally, apply the reduction from Matrix Space Isometry to 3TI [FGS19] to get two 3-tensors . Since the reduction itself is independent of field, if we consider it over we find that and must not be isomorphic 3-tensors over , but if we consider the reduction over we find that they are isomorphic as 3-tensors over .
Similar examples can be constructed over finite fields of odd characteristic, taking and where is a non-square in (and replacing the role of with that of ). Instead of signature, isometry types of matrices over are characterized by their rank and whether their determinant is a square or not. In this case, since our matrices are even-dimensional diagonal matrices, scaling them multiplies their determinant by a square. Thus every matrix in will have its determinant being a square in , and every nonzero matrix in will not, but in they are all squares.
It would also be interesting to study the complexity of other group actions on tensors and how they relate to the problems here. For example, the action of unitary groups on classifies pure quantum states up to “local unitary operations,” and the action of on , over , is the well-studied action by stochastic local operations with classical communication (SLOCC) on quantum states (e. g., [GW13, Miy04, CD–07]). Isomorphism of -dimensional lattices in -dimensional space can be seen as the natural action of by left and right multiplication on matrices. As another example, orbits for several of the natural actions of on 3-tensors over , even for small values of , are the fundamental objects in Bhargava’s seminal work on higher composition laws [Bha04a, Bha04b, Bha04c, Bha08]. We note that while the orthogonal group is the stabilizer of a 2-form on (that is, an element of ) and is the stabilizer of the induced action on (by the determinant)—so gadgets similar to those in this paper might be useful— is not the stabilizer of any such structure.
In Remark 9.1 we observed that any reduction (in the sense of Sec. 6.2) from TI to 3TI must have a blow-up in dimension which is asymptotically , while our construction uses dimension .
Open Question 10.8**.**
Is there a reduction from TI to 3TI (as in Sec. 6.2) such that the dimension of the output is ?
Finally, in terms of practical algorithms, we wonder how well modern SAT solvers would do on instances of 3-Tensor Isomorphism over (or over other finite fields, encoded into bit-strings).
Acknowledgments
The authors would like to thanks James B. Wilson for related discussions, and Uriya First, Lek-Heng Lim, and J. M. Landsberg for help searching for references asking whether d could reduce to . J. A. G. would like to thank V. Futorny and V. V. Sergeichuk for their collaboration on the related work [FGS19]. Ideas leading to this work originated from the workshop “Wildness in computer science, physics, and mathematics” at the Santa Fe Institute. Both authors were supported by NSF grant DMS-1750319. Y. Q. was partly supported by Australian Research Council DECRA DE150100720.
Appendix A Reducing Cubic Form Equivalence to Degree- Form Equivalence
Proposition A.1**.**
Cubic Form Equivalence* reduces to Degree- Form Equivalence, for any .*
We suspect that a similar construction would give a reduction from Degree- Form Equivalence to Degree- Form Equivalence for any , but our argument relies on a case analysis that is somewhat specific to . Our argument might be adaptable to any fixed value of the prover desires, with a consequently more complicated case analysis, but to prove it for all simultaneously seems to require a different argument.
Proof.
The reduction itself is quite simple: , where is a new variable not appearing in . If is an equivalence between and —that is, —then is an equivalence from to . Conversely, suppose is equivalent to via . We split the proof into several cases.
If ,
then is not present so we already have that and are equivalent.
If is not divisible by for some linear form ,
then is the unique factor in both and which is raised do the power. Thus any equivalence between these two must map to itself, hence has the form
[TABLE]
(if we put last in our basis, and think of the matrix as acting on the left of the column vectors corresponding to the variables). However, since both and do not depend on , it must be the case that whatever contributions makes to , they all cancel. More precisely, all monomials involving in must cancel, so if we alter into that never includes (that is, if we make the stars in the last row above all zero), then , hence , so and are equivalent.
The preceding case always applies when , for then , but . We are left to handle the following cases:
and is a product of linear forms; 2. 2.
, is a product of a linear form and an irreducible quadratic form.
Suppose is a product of linear forms,
then let us define as the number of linearly independent linear forms appearing in the factorization of . Note that if , then for some , if , then (now we can absorb any constant into ), and if then with all linearly independent. Then we have that if and only if is also a product of linear forms of the same rank. For acts transitively on -tuples of linearly independent vectors for all , and in order to have linearly independent forms, we must have . Since we have supposed , by uniqueness of factorization must be a product of linear forms of the same rank as , and thus indeed .
If and where is linear and is an irreducible quadratic,
then to understand the situation we begin by first doing a change of basis on to put into a form in which its kernel is evident. Note that none of these simplifications are part of the reduction, but rather they are to help us prove that the reduction works. Thinking of as given by its matrix such that , we can always change basis to get into the form
[TABLE]
where . Since does not depend on , if we think of as a quadratic form on , then the matrices are the same, but larger by one additional zero row and column.
Next we will try to simplify as much as possible while maintaining the (new) form of . For this we first compute the stabilizer of the new form of . We can compute the stabilizer as the set of invertible matrices such that:
[TABLE]
This turns into the following equations on the blocks of :
[TABLE]
From the first equation and the fact that is full rank, we find that must be an invertible matrix. From the next equation and the fact that both and are full rank, we then find that . Thus the stabilizer of is:
[TABLE]
Now we simplify . Note that acts on as a column vector. Consider , with ; we will say “ contains ” if and only if . If contains some with , then by setting and , we may choose to be any invertible matrix which sends (recall the trailing for the coordinate) to , and thus without loss of generality we may assume that only contains with .
Next, note that if contains some for and , then we may use the action of to eliminate the . Namely, by taking , , and . This makes in contribute to the coordinate, eliminating . Thus, under the action of , we need only consider two cases for linear forms under the action of : a linear form is equivalent to either
- a.
one which contains some with , in which case we can bring it to a form in which it contains no with (and no ), or 2. b.
it contains no with , in which case we can use the action of to bring it to the form .
Let us call the corresponding linear forms “type (a)” and “type (b).” Note that the linear form is of type (b).
Now, write and , and assume that we have applied the preceding change of basis to bring to the form specified above. Recall that we are assuming , and need to show that . If, after applying the same change of basis to , we do not have , then and also —contrary to our assumption—since (resp., ) is the unique irreducible quadratic factor of (resp., ). So we may assume that, after this change of basis, , both of which have with .
Next, since we are assuming , and itself is of type (b), so it must be the case that the types of are the same. Thus we have two cases to consider: either they are both of type (a), or both of type (b).
Suppose both are of type (a).
In this case, the equivalence between and cannot send to and to , for both are of type (a), whereas is of type (b). Thus the equivalence between and must restrict to an equivalence between and (when we ignore , or set its contribution to the other variables to zero, as in the above case where was not divisible by ).
Suppose both are of type (b).
In this case, it is possible that the equivalence from to could send to and to (since all three of are in case (b)); however, we will see that in this case, even such a situation will not cause an issue. Without loss of generality, by the change of bases described above, we have and (the same ), where contains no with . Using elements of with , and , we then get an action of (via ) on linear forms in the variables . Since is linearly independent from (in particular, it does not contain ) and the action of is transitive on pairs of linearly independent vectors, we may use to fix and , and send to , giving the desired equivalence . ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AD 17] Eric Allender and Bireswar Das. Zero knowledge and circuit minimization. Inf. Comput. , 256:2–8, 2017. doi:10.1016/j.ic.2017.04.004 . · doi ↗
- 2[AS 05] Manindra Agrawal and Nitin Saxena. Automorphisms of finite rings and applications to complexity of problems. In STACS 2005, 22nd Annual Symposium on Theoretical Aspects of Computer Science, Proceedings , pages 1–17, 2005. doi:10.1007/978-3-540-31856-9_1 . · doi ↗
- 3[AS 06] Manindra Agrawal and Nitin Saxena. Equivalence of 𝔽 𝔽 \mathbb{F} -algebras and cubic forms. In STACS 2006, 23rd Annual Symposium on Theoretical Aspects of Computer Science, Proceedings , pages 115–126, 2006. doi:10.1007/11672142_8 . · doi ↗
- 4[ASS 06] Ibrahim Assem, Daniel Simson, and Andrzej Skowroński. Elements of the representation theory of associative algebras. Vol. 1 , volume 65 of London Mathematical Society Student Texts . Cambridge University Press, Cambridge, 2006. Techniques of representation theory. doi:10.1017/CBO 9780511614309 . · doi ↗
- 5[Bab 85] L Babai. Trading group theory for randomness. In Proceedings of the Seventeenth Annual ACM Symposium on Theory of Computing , STOC ’85, pages 421–429. ACM, 1985. doi:10.1145/22145.22192 . · doi ↗
- 6[Bab 14] László Babai. On the automorphism groups of strongly regular graphs I. In Proceedings of the 5th Conference on Innovations in Theoretical Computer Science , ITCS ’14, pages 359–368, 2014. doi:10.1145/2554797.2554830 . · doi ↗
- 7[Bab 16] László Babai. Graph isomorphism in quasipolynomial time [extended abstract]. In Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2016 , pages 684–697, 2016. ar Xiv:1512.03547 [cs.DS] version 2. doi:10.1145/2897518.2897542 . · doi ↗
- 8[Bae 38] Reinhold Baer. Groups with abelian central quotient group. Trans. AMS , 44(3):357–386, 1938. doi:10.1090/S 0002-9947-1938-1501972-1 . · doi ↗
