Isomorphism problems for tensors, groups, and cubic forms: completeness and reductions

Joshua A. Grochow; Youming Qiao

arXiv:1907.00309·cs.CC·June 18, 2025

Isomorphism problems for tensors, groups, and cubic forms: completeness and reductions

Joshua A. Grochow, Youming Qiao

PDF

TL;DR

This paper establishes the computational equivalence of various isomorphism problems involving tensors, groups, and forms, introduces the concept of TI-complete problems, and provides new reductions and algorithms for group isomorphism.

Contribution

It proves the polynomial-time equivalence of multiple isomorphism problems, introduces TI-complete class, and presents novel reductions and search-to-decision algorithms for group isomorphism.

Findings

01

All isomorphism problems are polynomial-time equivalent.

02

d-tensor isomorphism reduces to 3-tensor isomorphism for d ≥ 3.

03

A new search-to-decision reduction for p-group isomorphism of class 2.

Abstract

In this paper we consider the problems of testing isomorphism of tensors, $p$ -groups, cubic forms, algebras, and more, which arise from a variety of areas, including machine learning, group theory, and cryptography. These problems can all be cast as orbit problems on multi-way arrays under different group actions. Our first two main results are: 1. All the aforementioned isomorphism problems are equivalent under polynomial-time reductions, in conjunction with the recent results of Futorny-Grochow-Sergeichuk (Lin. Alg. Appl., 2019). 2. Isomorphism of $d$ -tensors reduces to isomorphism of 3-tensors, for any $d \geq 3$ . Our results suggest that these isomorphism problems form a rich and robust equivalence class, which we call Tensor Isomorphism-complete, or TI-complete. We then leverage the techniques used in the above results to prove two first-of-their-kind results for Group…

Tables2

Table 1. Table 1: Summary of notation related to 3-way arrays and tensors.

Font	Object	Space of objects
$A, B, \dots$	matrix	$M (n, 𝔽)$ or $M (ℓ \times n, 𝔽)$
$𝐀, 𝐁, \dots$	matrix tuple	$M {(n, 𝔽)}^{m}$ or $M {(ℓ \times n, 𝔽)}^{m}$
$𝒜, ℬ, \dots$	matrix space	[Subspaces of $M (n, 𝔽)$ or $Λ (n, 𝔽)$ ]
$𝙰, 𝙱, \dots$	3-way array	$T (ℓ \times n \times m, 𝔽)$

Table 2. Table 2: The cast of isomorphism problems on 3-way arrays. In Section 6.1 we show how this exhausts the possibilities.

Notation

Name

Group Action

U \otimes V \otimes W

Matrix Space Equivalence

3-Tensor Isomorphism

𝒜 \mapsto g ​ 𝒜 ​ h^{- 1}

V \otimes V \otimes W

Matrix Space Isometry

Bilinear Map Pseudo-Isometry

𝒜 \mapsto g ​ 𝒜 ​ g^{T}

V \otimes V^{*} \otimes W

Matrix Space Conjugacy

𝒜 \mapsto g ​ 𝒜 ​ g^{- 1}

V \otimes V \otimes V

Trilinear Form Equivalence

f ​ (\vec{x}) \mapsto f ​ (g^{- 1} ​ \vec{x})

V \otimes V \otimes V^{*}

Algebra Isomorphism

μ ​ (\vec{x}, \vec{y}) \mapsto g ​ μ ​ (g^{- 1} ​ \vec{x}, g^{- 1} ​ \vec{y})

Equations137

d

d

d

d

de g_{A} (v)

de g_{A} (v)

L_{1}^{'} = [L_{1} 0 0 I_{d}] and L_{j}^{'} = [L_{j} 0 00] (for j > 1) .

L_{1}^{'} = [L_{1} 0 0 I_{d}] and L_{j}^{'} = [L_{j} 0 00] (for j > 1) .

L_{j}=\left[\begin{array}[]{ccccccc}a_{1,j}&\mathbf{0}_{1\times 2}&\mathbf{0}_{1\times 2}&\cdots&\mathbf{0}_{1\times 2}&\cdots&\mathbf{0}_{1\times 2}\\ \vdots&\vdots&\vdots&\ddots&\vdots&\ddots&\vdots\\ a_{d,j}&\mathbf{0}_{1\times 2}&\mathbf{0}_{1\times 2}&\cdots&\mathbf{0}_{1\times 2}&\cdots&\mathbf{0}_{1\times 2}\\ \mathbf{0}_{2\times 1}&\mathbf{0}_{2\times 2}&\mathbf{0}_{2\times 2}&\cdots&\mathbf{0}_{2\times 2}&\cdots&\mathbf{0}_{2\times 2}\\ \vdots&\vdots&\vdots&\ddots&\vdots&\ddots&\vdots\\ \mathbf{0}_{2\times 1}&\mathbf{0}_{2\times 2}&\mathbf{0}_{2\times 2}&\cdots&I_{2}&\cdots&\mathbf{0}_{2\times 2}\\ \vdots&\vdots&\vdots&\ddots&\vdots&\ddots&\vdots\\ \mathbf{0}_{2\times 1}&\mathbf{0}_{2\times 2}&\mathbf{0}_{2\times 2}&\cdots&\mathbf{0}_{2\times 2}&\cdots&\mathbf{0}_{2\times 2}\\ \end{array}\right]

L_{j}=\left[\begin{array}[]{ccccccc}a_{1,j}&\mathbf{0}_{1\times 2}&\mathbf{0}_{1\times 2}&\cdots&\mathbf{0}_{1\times 2}&\cdots&\mathbf{0}_{1\times 2}\\ \vdots&\vdots&\vdots&\ddots&\vdots&\ddots&\vdots\\ a_{d,j}&\mathbf{0}_{1\times 2}&\mathbf{0}_{1\times 2}&\cdots&\mathbf{0}_{1\times 2}&\cdots&\mathbf{0}_{1\times 2}\\ \mathbf{0}_{2\times 1}&\mathbf{0}_{2\times 2}&\mathbf{0}_{2\times 2}&\cdots&\mathbf{0}_{2\times 2}&\cdots&\mathbf{0}_{2\times 2}\\ \vdots&\vdots&\vdots&\ddots&\vdots&\ddots&\vdots\\ \mathbf{0}_{2\times 1}&\mathbf{0}_{2\times 2}&\mathbf{0}_{2\times 2}&\cdots&I_{2}&\cdots&\mathbf{0}_{2\times 2}\\ \vdots&\vdots&\vdots&\ddots&\vdots&\ddots&\vdots\\ \mathbf{0}_{2\times 1}&\mathbf{0}_{2\times 2}&\mathbf{0}_{2\times 2}&\cdots&\mathbf{0}_{2\times 2}&\cdots&\mathbf{0}_{2\times 2}\\ \end{array}\right]

\mathtt{A}=\begin{bmatrix}\tilde{a}_{1,1}&\tilde{a}_{1,2}&\dots&\tilde{a}_{1,n}\\ \vdots&\vdots&\ddots&\vdots\\ \tilde{a}_{d,1}&\tilde{a}_{d,2}&\dots&\tilde{a}_{d,n}\\ e_{1,1}&\mathbf{0}&\dots&\mathbf{0}\\ e_{1,2}&\mathbf{0}&\dots&\mathbf{0}\\ \mathbf{0}&e_{2,1}&\dots&\mathbf{0}\\ \mathbf{0}&e_{2,2}&\dots&\mathbf{0}\\ \vdots&\vdots&\ddots&\vdots\\ \mathbf{0}&\mathbf{0}&\dots&e_{n,1}\\ \mathbf{0}&\mathbf{0}&\dots&e_{n,2}\end{bmatrix},\begin{array}[]{rcl}\lx@intercol\hfil\text{where}\hfil\lx@intercol\\ \tilde{a}_{i,j}&=&\begin{bmatrix}a_{i,j}\\ \mathbf{0}_{2n\times 1}\end{bmatrix}\in\mathbb{F}^{1+2n}\\ e_{i,j}&=&\vec{e}_{1+2(i-1)+j}\in\mathbb{F}^{1+2n}\text{ for }i\in[n],j\in[2]\\ \\ \lx@intercol\hfil\text{ and the frontal slices are}\hfil\lx@intercol\\ \\ A_{1}&=&\begin{bmatrix}A\\ \mathbf{0}_{2n\times n}\end{bmatrix}\\ A_{1+2(i-1)+j}&=&E_{d+2(i-1)+j,i}\qquad\text{ for }i\in[n],j\in[2]\end{array}

\mathtt{A}=\begin{bmatrix}\tilde{a}_{1,1}&\tilde{a}_{1,2}&\dots&\tilde{a}_{1,n}\\ \vdots&\vdots&\ddots&\vdots\\ \tilde{a}_{d,1}&\tilde{a}_{d,2}&\dots&\tilde{a}_{d,n}\\ e_{1,1}&\mathbf{0}&\dots&\mathbf{0}\\ e_{1,2}&\mathbf{0}&\dots&\mathbf{0}\\ \mathbf{0}&e_{2,1}&\dots&\mathbf{0}\\ \mathbf{0}&e_{2,2}&\dots&\mathbf{0}\\ \vdots&\vdots&\ddots&\vdots\\ \mathbf{0}&\mathbf{0}&\dots&e_{n,1}\\ \mathbf{0}&\mathbf{0}&\dots&e_{n,2}\end{bmatrix},\begin{array}[]{rcl}\lx@intercol\hfil\text{where}\hfil\lx@intercol\\ \tilde{a}_{i,j}&=&\begin{bmatrix}a_{i,j}\\ \mathbf{0}_{2n\times 1}\end{bmatrix}\in\mathbb{F}^{1+2n}\\ e_{i,j}&=&\vec{e}_{1+2(i-1)+j}\in\mathbb{F}^{1+2n}\text{ for }i\in[n],j\in[2]\\ \\ \lx@intercol\hfil\text{ and the frontal slices are}\hfil\lx@intercol\\ \\ A_{1}&=&\begin{bmatrix}A\\ \mathbf{0}_{2n\times n}\end{bmatrix}\\ A_{1+2(i-1)+j}&=&E_{d+2(i-1)+j,i}\qquad\text{ for }i\in[n],j\in[2]\end{array}

A = w_{1, 1} w_{2, 1} ⋮ w_{ℓ, 1} w_{1, 2} w_{2, 2} ⋱ w_{ℓ, 2} \dots \dots ⋱ \dots w_{1, n} w_{2, n} ⋮ w_{ℓ, n},

A = w_{1, 1} w_{2, 1} ⋮ w_{ℓ, 1} w_{1, 2} w_{2, 2} ⋱ w_{ℓ, 2} \dots \dots ⋱ \dots w_{1, n} w_{2, n} ⋮ w_{ℓ, n},

exp (X) exp (Y) = exp (X + Y + \frac{1}{2} [X, Y] + \frac{1}{12} ([X, [X, Y]] - [Y, [X, Y]]) - \frac{1}{24} [Y, [X, [X, Y]]] + \dots),

exp (X) exp (Y) = exp (X + Y + \frac{1}{2} [X, Y] + \frac{1}{12} ([X, [X, Y]] - [Y, [X, Y]]) - \frac{1}{24} [Y, [X, [X, Y]]] + \dots),

lo g (1 + n) = n - \frac{n ^{2}}{2} + \frac{n ^{3}}{3} - \dots

lo g (1 + n) = n - \frac{n ^{2}}{2} + \frac{n ^{3}}{3} - \dots

U \otimes V (left-right action) V \otimes V^{*} (conjugacy) V \otimes V (isometry).

U \otimes V (left-right action) V \otimes V^{*} (conjugacy) V \otimes V (isometry).

\overline{r} (g) \cdot r (v) = r (g \cdot v) .

\overline{r} (g) \cdot r (v) = r (g \cdot v) .

h \cdot r (v) = r (v^{'}) ⟹ \overline{s} (h) \circ v = v^{'},

h \cdot r (v) = r (v^{'}) ⟹ \overline{s} (h) \circ v = v^{'},

A = 0 - a_{1, 2} - a_{1, 3} ⋮ - a_{1, n} a_{1, 2} 0 - a_{2, 3} ⋱ - a_{2, n} a_{1, 3} a_{2, 3} 0 ⋱ - a_{3, n} \dots \dots \dots ⋱ \dots a_{1, n} a_{2, n} a_{3, n} ⋮ 0,

A = 0 - a_{1, 2} - a_{1, 3} ⋮ - a_{1, n} a_{1, 2} 0 - a_{2, 3} ⋱ - a_{2, n} a_{1, 3} a_{2, 3} 0 ⋱ - a_{3, n} \dots \dots \dots ⋱ \dots a_{1, n} a_{2, n} a_{3, n} ⋮ 0,

\tilde{\mathtt{A}}=\left[\begin{array}[]{ccccc;{2pt/2pt}ccc;{2pt/2pt}ccc;{2pt/2pt}c;{2pt/2pt}ccc}\mathbf{0}&\tilde{a}_{1,2}&\tilde{a}_{1,3}&\dots&\tilde{a}_{1,n}&e_{1,1}&\ldots&e_{1,n}&\mathbf{0}&\dots&\mathbf{0}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ -\tilde{a}_{1,2}&\mathbf{0}&\tilde{a}_{2,3}&\dots&\tilde{a}_{2,n}&\mathbf{0}&\ldots&\mathbf{0}&e_{2,1}&\dots&e_{2,n}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ \vdots&\ddots&\ddots&\ddots&\vdots&\ddots&\ddots&\ddots&\ddots&\ddots&\ddots&\ldots&\ddots&\ddots&\vdots\\ -\tilde{a}_{1,n}&-\tilde{a}_{2,n}&-\tilde{a}_{3,n}&\dots&\mathbf{0}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\ldots&e_{n,1}&\dots&e_{n,n}\\ \hdashline[2pt/2pt]-e_{1,1}&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ \vdots&\vdots&\vdots&\dots&\vdots&\vdots&\ldots&\vdots&\vdots&\dots&\vdots&\ldots&\vdots&\dots&\vdots\\ -e_{1,n}&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ \hdashline[2pt/2pt]\mathbf{0}&-e_{2,1}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ \vdots&\vdots&\vdots&\dots&\vdots&\vdots&\ldots&\vdots&\vdots&\dots&\vdots&\ldots&\vdots&\dots&\vdots\\ \mathbf{0}&-e_{2,n}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ \hdashline[2pt/2pt]\vdots&\vdots&\vdots&\dots&\vdots&\vdots&\ldots&\vdots&\vdots&\dots&\vdots&\ldots&\vdots&\dots&\vdots\\ \hdashline[2pt/2pt]\mathbf{0}&\mathbf{0}&\mathbf{0}&\dots&-e_{n,1}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ \vdots&\vdots&\vdots&\dots&\vdots&\vdots&\ldots&\vdots&\vdots&\dots&\vdots&\ldots&\vdots&\dots&\vdots\\ \mathbf{0}&\mathbf{0}&\mathbf{0}&\dots&-e_{n,n}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ \end{array}\right],

\tilde{\mathtt{A}}=\left[\begin{array}[]{ccccc;{2pt/2pt}ccc;{2pt/2pt}ccc;{2pt/2pt}c;{2pt/2pt}ccc}\mathbf{0}&\tilde{a}_{1,2}&\tilde{a}_{1,3}&\dots&\tilde{a}_{1,n}&e_{1,1}&\ldots&e_{1,n}&\mathbf{0}&\dots&\mathbf{0}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ -\tilde{a}_{1,2}&\mathbf{0}&\tilde{a}_{2,3}&\dots&\tilde{a}_{2,n}&\mathbf{0}&\ldots&\mathbf{0}&e_{2,1}&\dots&e_{2,n}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ \vdots&\ddots&\ddots&\ddots&\vdots&\ddots&\ddots&\ddots&\ddots&\ddots&\ddots&\ldots&\ddots&\ddots&\vdots\\ -\tilde{a}_{1,n}&-\tilde{a}_{2,n}&-\tilde{a}_{3,n}&\dots&\mathbf{0}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\ldots&e_{n,1}&\dots&e_{n,n}\\ \hdashline[2pt/2pt]-e_{1,1}&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ \vdots&\vdots&\vdots&\dots&\vdots&\vdots&\ldots&\vdots&\vdots&\dots&\vdots&\ldots&\vdots&\dots&\vdots\\ -e_{1,n}&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ \hdashline[2pt/2pt]\mathbf{0}&-e_{2,1}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ \vdots&\vdots&\vdots&\dots&\vdots&\vdots&\ldots&\vdots&\vdots&\dots&\vdots&\ldots&\vdots&\dots&\vdots\\ \mathbf{0}&-e_{2,n}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ \hdashline[2pt/2pt]\vdots&\vdots&\vdots&\dots&\vdots&\vdots&\ldots&\vdots&\vdots&\dots&\vdots&\ldots&\vdots&\dots&\vdots\\ \hdashline[2pt/2pt]\mathbf{0}&\mathbf{0}&\mathbf{0}&\dots&-e_{n,1}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ \vdots&\vdots&\vdots&\dots&\vdots&\vdots&\ldots&\vdots&\vdots&\dots&\vdots&\ldots&\vdots&\dots&\vdots\\ \mathbf{0}&\mathbf{0}&\mathbf{0}&\dots&-e_{n,n}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\ldots&\mathbf{0}&\dots&\mathbf{0}\\ \end{array}\right],

A

A

\mathtt{A}^{lat}=\left[\begin{array}[]{cccc;{2pt/2pt}ccc;{2pt/2pt}c;{2pt/2pt}ccc}\ell_{1,1}&\ell_{1,2}&\dots&\ell_{1,m}&e_{n+1}&\ldots&e_{2n}&\dots&0&\ldots&0\\ \vdots&\ddots&\ddots&\vdots&\vdots&\ddots&\vdots&\ddots&\vdots&\ddots&\vdots\\ \ell_{n,1}&\ell_{n,2}&\dots&\ell_{n,m}&0&\ldots&0&\dots&e_{n^{2}+1}&\ldots&e_{n^2+n}\\ \hdashline[2pt/2pt]0&0&\dots&0&e_{1}&\ldots&0&\dots&0&\ldots&0\\ \vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\dots&\vdots&\ddots&\vdots\\ 0&0&\dots&0&0&\ldots&e_{1}&\dots&0&\ldots&0\\ \hdashline[2pt/2pt]\vdots&\ddots&\ddots&\vdots&\vdots&\ddots&\vdots&\ddots&\vdots&\ddots&\vdots\\ \hdashline[2pt/2pt]0&0&\dots&0&0&\ldots&0&\dots&e_{n}&\ldots&0\\ \vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\dots&\vdots&\ddots&\vdots\\ 0&0&\dots&0&0&\ldots&0&\dots&0&\ldots&e_{n}\\ \end{array}\right],

\mathtt{A}^{lat}=\left[\begin{array}[]{cccc;{2pt/2pt}ccc;{2pt/2pt}c;{2pt/2pt}ccc}\ell_{1,1}&\ell_{1,2}&\dots&\ell_{1,m}&e_{n+1}&\ldots&e_{2n}&\dots&0&\ldots&0\\ \vdots&\ddots&\ddots&\vdots&\vdots&\ddots&\vdots&\ddots&\vdots&\ddots&\vdots\\ \ell_{n,1}&\ell_{n,2}&\dots&\ell_{n,m}&0&\ldots&0&\dots&e_{n^{2}+1}&\ldots&e_{n^2+n}\\ \hdashline[2pt/2pt]0&0&\dots&0&e_{1}&\ldots&0&\dots&0&\ldots&0\\ \vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\dots&\vdots&\ddots&\vdots\\ 0&0&\dots&0&0&\ldots&e_{1}&\dots&0&\ldots&0\\ \hdashline[2pt/2pt]\vdots&\ddots&\ddots&\vdots&\vdots&\ddots&\vdots&\ddots&\vdots&\ddots&\vdots\\ \hdashline[2pt/2pt]0&0&\dots&0&0&\ldots&0&\dots&e_{n}&\ldots&0\\ \vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\dots&\vdots&\ddots&\vdots\\ 0&0&\dots&0&0&\ldots&0&\dots&0&\ldots&e_{n}\\ \end{array}\right],

P^{t} D P = diag (α_{σ^{- 1} (1)}, \dots, α_{σ^{- 1} (n)}) .

P^{t} D P = diag (α_{σ^{- 1} (1)}, \dots, α_{σ^{- 1} (n)}) .

M = x_{1} I_{n} 0 ⋮ 0 0 x_{2} I_{n} ⋱ 0 \dots \dots ⋱ \dots 00 ⋮ x_{n} I_{n} .

M = x_{1} I_{n} 0 ⋮ 0 0 x_{2} I_{n} ⋱ 0 \dots \dots ⋱ \dots 00 ⋮ x_{n} I_{n} .

M^{P} = α_{σ (1)} x_{σ (1)} I_{n} 0 ⋮ 0 0 α_{σ (2)} x_{σ (2)} I_{n} ⋱ 0 \dots \dots ⋱ \dots 00 ⋮ α_{σ (n)} x_{σ (n)} I_{n} .

M^{P} = α_{σ (1)} x_{σ (1)} I_{n} 0 ⋮ 0 0 α_{σ (2)} x_{σ (2)} I_{n} ⋱ 0 \dots \dots ⋱ \dots 00 ⋮ α_{σ (n)} x_{σ (n)} I_{n} .

\mathtt{A}=\left[\begin{array}[]{ccc}a_{1,1}^{\prime}&\dots&a_{1,n}^{\prime}\\ \vdots&\ddots&\vdots\\ a_{\ell,1}^{\prime}&\dots&a_{\ell,n}^{\prime}\\ \end{array}\right],

\mathtt{A}=\left[\begin{array}[]{ccc}a_{1,1}^{\prime}&\dots&a_{1,n}^{\prime}\\ \vdots&\ddots&\vdots\\ a_{\ell,1}^{\prime}&\dots&a_{\ell,n}^{\prime}\\ \end{array}\right],

\tilde{\mathtt{A}}=\left[\begin{array}[]{ccc;{2pt/2pt}ccc;{2pt/2pt}ccc;{2pt/2pt}ccc}\mathbf{0}&\dots&\mathbf{0}&a_{1,1}&\ldots&a_{1,n}&e_{1,1}&\dots&e_{2n+1, 1}&\mathbf{0}&\dots&\mathbf{0}\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ \mathbf{0}&\dots&\mathbf{0}&a_{\ell, 1}&\ldots&a_{\ell,n}&e_{1,\ell}&\dots&e_{2n+1, \ell}&\mathbf{0}&\dots&\mathbf{0}\\ \hdashline[2pt/2pt]-a_{1,1}&\dots&-a_{\ell,1}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&f_{1,1}&\dots&f_{4n+2,1}\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ -a_{1,n}&\dots&-a_{\ell,n}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&f_{1,n}&\dots&f_{4n+2,n}\\ \hdashline[2pt/2pt]-e_{1,1}&\dots&-e_{1,\ell}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ -e_{2n+1,1}&\dots&-e_{2n+1,\ell}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}\\ \hdashline[2pt/2pt]\mathbf{0}&\dots&\mathbf{0}&-f_{1,1}&\ldots&-f_{1,n}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ \mathbf{0}&\dots&\mathbf{0}&-f_{4n+2, 1}&\ldots&-f_{4n+2,n}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}\end{array}\right],

\tilde{\mathtt{A}}=\left[\begin{array}[]{ccc;{2pt/2pt}ccc;{2pt/2pt}ccc;{2pt/2pt}ccc}\mathbf{0}&\dots&\mathbf{0}&a_{1,1}&\ldots&a_{1,n}&e_{1,1}&\dots&e_{2n+1, 1}&\mathbf{0}&\dots&\mathbf{0}\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ \mathbf{0}&\dots&\mathbf{0}&a_{\ell, 1}&\ldots&a_{\ell,n}&e_{1,\ell}&\dots&e_{2n+1, \ell}&\mathbf{0}&\dots&\mathbf{0}\\ \hdashline[2pt/2pt]-a_{1,1}&\dots&-a_{\ell,1}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&f_{1,1}&\dots&f_{4n+2,1}\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ -a_{1,n}&\dots&-a_{\ell,n}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&f_{1,n}&\dots&f_{4n+2,n}\\ \hdashline[2pt/2pt]-e_{1,1}&\dots&-e_{1,\ell}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ -e_{2n+1,1}&\dots&-e_{2n+1,\ell}&\mathbf{0}&\ldots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}\\ \hdashline[2pt/2pt]\mathbf{0}&\dots&\mathbf{0}&-f_{1,1}&\ldots&-f_{1,n}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ \mathbf{0}&\dots&\mathbf{0}&-f_{4n+2, 1}&\ldots&-f_{4n+2,n}&\mathbf{0}&\dots&\mathbf{0}&\mathbf{0}&\dots&\mathbf{0}\end{array}\right],

ℓ

ℓ

\begin{array}[]{rccclrcccl}&&\text{ For $i$...}&&&&&\mathrm{rk}(L_{i})\\ \hline\cr 1&\leq&i&\leq&\ell&2n+1&\leq&\mathrm{rk}(L_{i})&\leq&3n+1\\ \ell+1&\leq&i&\leq&\ell+n&4n+2&\leq&\mathrm{rk}(L_{i})&\leq&5n+2\\ \ell+n+1&\leq&i&\leq&\ell+n+6n+3&&&\mathrm{rk}(L_{i})&\leq&n\end{array}

\begin{array}[]{rccclrcccl}&&\text{ For $i$...}&&&&&\mathrm{rk}(L_{i})\\ \hline\cr 1&\leq&i&\leq&\ell&2n+1&\leq&\mathrm{rk}(L_{i})&\leq&3n+1\\ \ell+1&\leq&i&\leq&\ell+n&4n+2&\leq&\mathrm{rk}(L_{i})&\leq&5n+2\\ \ell+n+1&\leq&i&\leq&\ell+n+6n+3&&&\mathrm{rk}(L_{i})&\leq&n\end{array}

P_{1, 1}^{t} 00 0 P_{2, 2}^{t} 0 P_{3, 1}^{t} P_{3, 2}^{t} P_{3, 3}^{t} 0 - A_{i}^{t} 0 A_{i} 00 000 P_{1, 1} 0 P_{3, 1} 0 P_{2, 2} P_{3, 2} 00 P_{3, 3} = 0 - P_{2, 2}^{t} A_{i} P_{1, 1} 0 P_{1, 1}^{t} A_{i} P_{2, 2} 00 000 .

P_{1, 1}^{t} 00 0 P_{2, 2}^{t} 0 P_{3, 1}^{t} P_{3, 2}^{t} P_{3, 3}^{t} 0 - A_{i}^{t} 0 A_{i} 00 000 P_{1, 1} 0 P_{3, 1} 0 P_{2, 2} P_{3, 2} 00 P_{3, 3} = 0 - P_{2, 2}^{t} A_{i} P_{1, 1} 0 P_{1, 1}^{t} A_{i} P_{2, 2} 00 000 .

\begin{array}[]{rclcl}\tilde{Q}&=&\operatorname{diag}(P,Q,U)&\in&\mathrm{GL}(\ell+7n+3,\mathbb{F})\\ \tilde{R}&=&\operatorname{diag}(R,V)&\in&\mathrm{GL}(m+\ell(2n+1)+n(4n+2),\mathbb{F}),\end{array}

\begin{array}[]{rclcl}\tilde{Q}&=&\operatorname{diag}(P,Q,U)&\in&\mathrm{GL}(\ell+7n+3,\mathbb{F})\\ \tilde{R}&=&\operatorname{diag}(R,V)&\in&\mathrm{GL}(m+\ell(2n+1)+n(4n+2),\mathbb{F}),\end{array}

\tilde{Q}^{t} r (A) \tilde{Q} = r (B)^{\tilde{R}},

\tilde{Q}^{t} r (A) \tilde{Q} = r (B)^{\tilde{R}},

\tilde{Q}^{t} \tilde{A}_{i} \tilde{Q} = P^{t} 00 0 Q^{t} 0 00 U^{t} 0 - A_{i}^{t} 0 A_{i} 00 000 P 00 0 Q 0 00 U = 0 - Q^{t} A_{i}^{t} P 0 P^{t} A_{i} Q 00 000 .

\tilde{Q}^{t} \tilde{A}_{i} \tilde{Q} = P^{t} 00 0 Q^{t} 0 00 U^{t} 0 - A_{i}^{t} 0 A_{i} 00 000 P 00 0 Q 0 00 U = 0 - Q^{t} A_{i}^{t} P 0 P^{t} A_{i} Q 00 000 .

00 - E 0 0000 E 000 0000 and 0000 000 - F 0000 0 F 00,

00 - E 0 0000 E 000 0000 and 0000 000 - F 0000 0 F 00,

P^{t} R^{t} U_{1}^{t} U_{2}^{t} 00 - E^{t} 0 0000 E 000 0000 P R U_{1} U_{2} = 00 - U_{1}^{t} E^{t} P 0 0000 P^{t} E U_{1} 000 0000,

P^{t} R^{t} U_{1}^{t} U_{2}^{t} 00 - E^{t} 0 0000 E 000 0000 P R U_{1} U_{2} = 00 - U_{1}^{t} E^{t} P 0 0000 P^{t} E U_{1} 000 0000,

L_{i} = [0 \dots 0 I_{ℓ} 0 \dots 0],

L_{i} = [0 \dots 0 I_{ℓ} 0 \dots 0],

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Isomorphism problems for tensors, groups,

and cubic forms: completeness and reductions

Joshua A. Grochow 111Departments of Computer Science and Mathematics, University of Colorado, Boulder. [email protected]

Youming Qiao 222Centre for Quantum Software and Information, University of Technology Sydney. [email protected]

Abstract

In this paper we consider the problems of testing isomorphism of tensors, $p$ -groups, cubic forms, algebras, and more, which arise from a variety of areas, including machine learning, group theory, and cryptography. These problems can all be cast as orbit problems on multi-way arrays under different group actions. Our first two main results are:

All the aforementioned isomorphism problems are equivalent under polynomial-time reductions, in conjunction with the recent results of Futorny–Grochow–Sergeichuk (Lin. Alg. Appl., 2019). 2. 2.

Isomorphism of $d$ -tensors reduces to isomorphism of $3$ -tensors, for any $d\geq 3$ .

All but one of the reductions for the preceding contributions work over arbitrary fields. Together they suggest that the aforementioned isomorphism problems form a rich and robust equivalence class, which we call Tensor Isomorphism-complete, or TI-complete for short. Furthermore, this provides a unified viewpoint on these hard isomorphism testing problems arising from a variety of areas.

We then leverage the techniques used in the above results to prove two first-of-their-kind results for Group Isomorphism (GpI):

We give a reduction from testing isomorphism of $p$ -groups of exponent $p$ and small class ( $c<p$ ) to isomorphism of $p$ -groups of exponent $p$ and class 2. The latter are widely believed to be the hardest cases of GpI, but as far as we know, this is the first reduction from any more general class of groups to this class. 2. 4.

We give a search-to-decision reduction for isomorphism of $p$ -groups of exponent $p$ and class $2$ in time $|G|^{O(\log\log|G|)}$ . While search-to-decision reductions for Graph Isomorphism (GI) have been known for more than 40 years, as far as we know this is the first non-trivial search-to-decision reduction in the context of GpI.

Our main technique for (1), (3), and (4) is a linear-algebraic analogue of the classical graph coloring gadget, which was used to obtain the search-to-decision reduction for GI. This gadget construction may be of independent interest and utility. The technique for (2) gives a method for encoding an arbitrary tensor into an algebra.

1 Introduction

Isomorphism problems in light of Babai’s breakthrough on Graph

Isomorphism.

In late 2015, Babai presented a quasipolynomial-time algorithm for Graph Isomorphism (GI) [Bab16]. This is widely regarded as one of the major breakthroughs in theoretical computer science of the past decade. Indeed, GI has been at the heart of complexity theory nearly since its inception: both Cook and Levin were thinking about GI when they defined $\mathsf{NP}$ [AD17, Sec. 1], Graph (Non-)Isomorphism played a special role in the creation of the class $\mathsf{AM}$ [Bab85, GMR85, BM88], and it still stands today as one of the few natural candidates for a problem that is “ $\mathsf{NP}$ -intermediate,” that is, in $\mathsf{NP}$ , but neither in $\mathsf{P}$ nor $\mathsf{NP}$ -complete [Lad75] (see [Exc] for additional candidates). Beyond its practical applications (e. g., [SV17, Irn05] and references therein) and its naturality, part of its fascination comes from its universal property: GI is universal for isomorphism problems for “explicitly given” structures [ZKT85, Sec. 15], that is, first-order structures on a set $V$ where, e. g., a $k$ -ary relation on $V$ is given by listing out a subset $R\subseteq V^{k}$ .

In light of Babai’s breakthrough on GI [Bab16], it is natural to consider “what’s next?” for isomorphism problems. That is, what isomorphism problems stand as crucial bottlenecks to further improvements on GI, and what isomorphism problems should naturally draw our attention for further exploration? Of course, one of the main open questions in the area remains whether or not GI is in $\mathsf{P}$ . Babai [Bab16, arXiv version, Sec. 13.2 and 13.4] already lists several isomorphism problems for further study, including Group Isomorphism, Linear Code Equivalence, and Permutation Group Conjugacy. In this paper we expand this list in what we argue is a very natural direction, namely to isomorphism problems for multi-way arrays, also known as tensors.333There have been some disputes on the terminologies; see the preface of [Lan12]. Our approach is to use multi-way arrays as the basic underlying object, and to use tensors as the multi-way arrays under a certain group action.

Group actions on 3-way arrays.

3-way arrays are simply arrays with 3 indices, generalizing the case of matrices (=2-way arrays). In this paper we consider entries of the arrays being from a field $\mathbb{F}$ , so a 3-way array is just $\mathtt{A}=(a_{i,j,k})$ , $i\in[\ell]$ , $j\in[n]$ , $k\in[m]$ , and $a_{i,j,k}\in\mathbb{F}$ .

Let $\mathrm{GL}(n,\mathbb{F})$ be the general linear group of degree $n$ over $\mathbb{F}$ , and let $\mathrm{M}(n,\mathbb{F})$ denote the set of $n\times n$ matrices. There are three natural group actions on $\mathrm{M}(n,\mathbb{F})$ : for $A\in\mathrm{M}(n,\mathbb{F})$ , (1) $(P,Q)\in\mathrm{GL}(n,\mathbb{F})\times\mathrm{GL}(n,\mathbb{F})$ sends $A$ to $P^{t}AQ$ , (2) $P\in\mathrm{GL}(n,\mathbb{F})$ sends $A$ to $P^{-1}AP$ , and (3) $P\in\mathrm{GL}(n,\mathbb{F})$ sends $A$ to $P^{t}AP$ . These three actions then endow $A$ with different algebraic/geometric interpretations: (1) a linear map from a vector space $V$ to another vector space $W$ , (2) a linear map from $V$ to itself, and (3) a bilinear map from $V\times V$ to $\mathbb{F}$ .

Likewise, 3-way arrays $\mathtt{A}=(a_{i,j,k})$ , $i,j,k\in[n]$ , can be naturally acted by $\mathrm{GL}(n,\mathbb{F})\times\mathrm{GL}(n,\mathbb{F})\times\mathrm{GL}(n,\mathbb{F})$ in one way, by $\mathrm{GL}(n,\mathbb{F})\times\mathrm{GL}(n,\mathbb{F})$ in two different ways, and by $\mathrm{GL}(n,\mathbb{F})$ in two different ways. These five actions endow various families of 3-way arrays with different algebraic/geometric meanings, including 3-tensors, bilinear maps, matrix (associative or Lie) algebras, and trilinear forms (a.k.a. non-commutative cubic forms). (See Sec. 2 for detailed explanations.) Over finite fields, the associated isomorphism problems are in $\mathsf{NP}\cap\mathsf{coAM}$ , following the essentially same $\mathsf{coAM}$ protocol as for GI.

With these group actions in mind, 3-way arrays capture a variety of important structures in several mathematical and computational disciplines. They arise naturally in quantum mechanics (states are described by tensors), the complexity of matrix multiplication (matrix multiplication is described by a tensor, and its algebraic complexity is essentially its tensor rank), the Geometric Complexity Theory approach [Mul11] to the Permanent versus Determinant Conjecture [Val79] (tensors describe the boundary of the determinant orbit closure, e. g., [Lan12, Sec. 13.6.3] and [Gro12b, Sec. 3.5.1] for introductions, and [HL16, Hüt17] for applications), data analysis [KB09], machine learning [PSS18], computational group theory [LQ17, BMW18], and cryptography [Pat96, JQSY19].

Main results.

The five natural actions on 3-way arrays mentioned above each lead to a different isomorphism problem on 3-way arrays; we discuss these problems and their interpretations in Sec. 2. Our first main result, Thm. A, shows that these isomorphism problems for 3-way arrays are all equivalent under polynomial-time reductions. Due to the algebraic or geometric interpretations, these problems are further equivalent to isomorphism problems on certain classes of groups, cubic forms, trilinear forms (a.k.a. non-commutative cubic forms), associative algebras, and Lie algebras. One consequence of these results (Cor. P), along with those of [FGS19], is a reduction from GpI for $p$ -groups of exponent $p$ and class $<p$ to GpI for $p$ -groups of exponent $p$ and class 2. Although the latter have long been believed to be the hardest cases of GpI, as far as we are aware, this is the first reduction from a more general class of groups to this class.

Although these equivalences may have been expected by some experts, it had not been immediately clear to us for some time during this project. To get a sense for the non-obviousness, let us postulate the following hypothetical question. Recall that two matrices $A,B\in\mathrm{M}(n,\mathbb{F})$ are called equivalent if there exists $P,Q\in\mathrm{GL}(n,\mathbb{F})$ such that $P^{-1}AQ=B$ , and they are conjugate if there exists $P\in\mathrm{GL}(n,\mathbb{F})$ such that $P^{-1}AP=B$ . Can we reduce testing Matrix Conjugacy to testing Matrix Equivalence? Of course since they are both in $\mathsf{P}$ there is a trivial reduction; to avoid this, let us consider only reductions $r$ which send a matrix $A$ to a matrix $r(A)$ such that $A$ and $B$ are conjugate iff $r(A)$ and $r(B)$ are equivalent. Nearly all reductions between isomorphism problems that we are aware of have this form (so-called “kernel reductions” [FG11]; cf. functorial reductions [Bab14]). After some thought, we realize that this is essentially impossible. The reason is that the equivalence class of a matrix is completely determined by its rank, while the conjugacy class of a matrix is determined by its rational canonical form. Among $n\times n$ matrices there are only $n+1$ equivalence classes, but there are at least $|\mathbb{F}|^{n}$ rational canonical forms (coming from the choice of minimal polynomial/companion matrix). Even when $\mathbb{F}$ is a finite field, such a reduction would thus require an exponential increase in dimension, and when $\mathbb{F}$ is infinite, such a reduction is impossible (regardless of running time).

Nonetheless, one of our results is that for spaces of matrices (one form of 3-way arrays), conjugacy testing does indeed reduce to equivalence testing! This is in sharp contrast to the case of single matrices. In the above setting, it means that there exists a polynomial-time computable map $\phi$ from $\mathrm{M}(n,\mathbb{F})$ to subspaces of $\mathrm{M}(s,\mathbb{F})$ , such that $A,B$ are conjugate up to a scalar if and only if $\phi(A),\phi(B)\leq\mathrm{M}(s,\mathbb{F})$ are equivalent as matrix spaces. Such a reduction may not be clear at first sight.

Our second main result reduces $d$ TI to 3TI, for any fixed $d\geq 3$ . From one viewpoint, this can be seen as a linear algebraic analogue of the now-classical reduction from $d$ -uniform Hypergraph Isomorphism to GI (e. g., [ZKT85]). However, as the reader will see, the reduction here is quite a bit more involved, using quiver algebras and the Wedderburn–Mal’cev Theorem on complements of the Jacobson radical in associative algebras. From another viewpoint, this can be seen as a step towards showing that 3TI is not only universal among isomorphism problems on 3-way arrays [FGS19], but perhaps 3TI is already universal for isomorphism problems on $d$ -way arrays for any $d$ ; see Sec. 10.1. These first two results indicate the robustness and naturality of the notion of $\mathsf{TI}$ -completeness.

Our next set of results reduce Graph Isomorphism and Linear Code Equivalence to these isomorphism problems for 3-way arrays (Sec. 3.2). This shows that these isomorphism problems for 3-way arrays form a set of potentially harder problems than these two problems, as also supported by the current difference in their practical difficulties.444There is a heuristic algorithm for Linear Code Equivalence by Sendrier [Sen00], which is practically effective in many cases, though for self-dual codes it reverts to an exponential search. It currently seems unlikely to us that either Graph Isomorphism or Code Equivalence is $\mathsf{TI}$ -complete.

Finally, our third main contribution is to show a search-to-decision reduction for these tensor problems (Thm. C), which may be of independent interest, leveraging our technique from above. While such a reduction has long been known for GI, for Group Isomorphism in general this remains a long-standing open question. Our techniques allow us to give a search-to-decision reduction for isomorphism of $p$ -groups of class 2 and exponent $p$ in time $|G|^{O(\log\log|G|)}$ in the model of matrix groups over finite fields. This group class is widely regarded to be the hardest cases of Group Isomorphism. As far as we know, this is the first non-trivial search-to-decision reduction for testing isomorphism of a class of finite groups.

Implications of main results for practical

computations.

Our first main result may partly help to explain the difficulties from various areas when dealing with these isomorphism problems. There is currently a significant difference between isomorphism problems for 3-way arrays and that for graphs. Namely, in sharp contrast to Graph Isomorphism—for which very effective practical algorithms have existed for some time [McK80, MP14]—the problems we consider here all still pose great difficulty even on relatively small examples in practice. Indeed, such problems have been proposed to be difficult enough for cryptographic purposes [Pat96, JQSY19]. As further evidence of their practical difficulty, current algorithms implemented for Alternating Matrix Space Isometry 555An $n\times n$ matrix $A$ over $\mathbb{F}$ is alternating if for every $v\in\mathbb{F}^{n}$ , $v^{t}Av=0$ . When $\mathbb{F}$ is not of characteristic $2$ , this is equivalent to the skew-symmetry condition.—a problem we show is $\mathsf{TI}$ -complete—can handle the cases when the 3-way array is of size $10\times 10\times 10$ over $\mathbb{F}_{13}$ , but absolutely not for 3-way arrays of size $100\times 100\times 100$ , even though in this case the input can still be stored in only a few megabytes.666We thank James B. Wilson, who maintains a suite of algorithms for $p$ -group isomorphism testing, for communicating this insight to us from his hands-on experience. We of course maintain responsibility for any possible misunderstanding, or lack of knowledge regarding the performance of other implemented algorithms. In [PSS18], motivated by machine learning applications, computations on one $\mathsf{TI}$ -complete problem were performed in Macaulay2 [GS], but these could not go beyond small examples either. Our results imply that the complexities of these problems arising in many fields—from computational group theory to cryptography to machine learning—are all equivalent.

Isomorphism problems for 3-way arrays as a bottleneck for graph

isomorphism.

In addition to their many incarnations and practical uses mentioned above, the isomorphism problems we consider on 3-way arrays can be further motivated by their relationship to GI. Specifically, these problems both form a key bottleneck to putting GI into $\mathsf{P}$ , and pose a great challenge for extending techniques used to solve GI.

Isomorphism problems for 3-way arrays stand as a key bottleneck to put GI in $\mathsf{P}$ . This is because, as Babai pointed out [Bab16], Group Isomorphism is a key bottleneck to putting GI into $\mathsf{P}$ . Indeed, the current-best upper bounds on these two problems are now quite close: $n^{O(\log n)}$ for Group Isomorphism (originally due to [FN70, Mil78]777Miller attributes this to Tarjan., with improved constants [Wil14, Ros13a, Ros13b]), and $n^{O(\log^{2}n)}$ for GI [Bab16] (see [HBD17] for calculation of the exponent). Within Group Isomorphism, it is widely regarded, for several reasons (e. g., [Bae38, Hig60, Ser77, Wil15]), that the bottleneck is the class of $p$ -groups of class 2 and exponent $p$ (i.e., $G/Z(G)$ is abelian and $g^{p}=1$ for all $g$ , $p$ odd). Then 3-way arrays enter the picture by Baer’s Correspondence [Bae38], which shows that the isomorphism problem for these groups is equivalent to telling whether two linear spaces of skew-symmetric matrices over $\mathbb{F}_{p}$ are equivalent up to transformations of the form $A\mapsto P^{t}AP$ . This is the Alternating Matrix Space Isometry problem, which we show in this paper is $\mathsf{TI}$ -complete.888Because of the difference in verbosity of inputs, solving Group Isomorphism for this class of groups in time $\mathrm{poly}(|G|)$ is equivalent to solving Alternating Matrix Space Isometry in time $p^{O(n+m)}$ for $n\times n$ matrix spaces of dimension $m$ over $\mathbb{F}_{p}$ . The current state of the art is $p^{O(n^{2})}$ , which corresponds to the nearly-trivial upper bound of $|G|^{O(\log|G|)}$ on Group Isomorphism.

To see why the techniques for GI face great difficulty when dealing with isomorphism problems for multi-way arrays, recall that most algorithms for GI, including Babai’s [Bab16], are built on two families of techniques: group-theoretic, and combinatorial. One of the main differences is that the underlying group action for GI is a permutation group acting on a combinatorial structure, whereas the underlying group actions for isomorphism problems for 3-way arrays are matrix groups acting on (multi)linear structures.

Already in moving from permutation groups to matrix groups, we find many new computational difficulties that arise naturally in basic subroutines used in isomorphism testing. For example, the membership problem for permutation groups is well-known to be efficiently solvable by Sims’s algorithm [Sim78] (see, e. g., [Ser03] for a textbook treatment), while for matrix groups this was only recently shown to be solvable with a number-theoretic oracle over finite fields of odd characteristic [BBS09]. Correspondingly, when moving from combinatorial structures to (multi)linear algebraic structures, we also find severe limitation on the use of most combinatorial techniques, like individualizing a vertex. For example, it is quite expensive to enumerate all vectors in a vector space, while it is usually considered efficient to go through all elements in a set. Similarly, within a set, any subset has a unique complement, whereas within $\mathbb{F}_{q}^{n}$ , a subspace can have up to $q^{\Theta(n^{2})}$ complements.

Given all the differences between the combinatorial and linear-algebraic worlds, it may be surprising that combinatorial techniques for Graph Isomorphism can nonetheless be useful for Group Isomorphism. Indeed, guided by the postulate that alternating matrix spaces can be viewed as a linear algebraic analogue of graphs, Li and the second author [LQ17] adapted the individualisation and refinement technique, as used by Babai, Erdős and Selkow [BES80], to tackle Alternating Matrix Space Isometry over $\mathbb{F}_{q}$ . This algorithm was recently improved [BGL*+*19]. However, this technique, though helpful to improve from the brute-force $q^{n^{2}}\cdot\mathrm{poly}(n,\log q)$ time, seems still limited to getting $q^{O(n)}$ -time algorithms.

New techniques.

Our first new technique for the above results on 3-way arrays is to develop a linear-algebraic analogue of the coloring gadget used in the context of Graph Isomorphism (see, e. g., [KST93]). These gadgets help us to restrict to various subgroups of the general linear group. Recall that, in relating GI with other isomorphism problems, coloring is a very useful idea. Given a graph $G=(V,E)$ , a coloring of vertices is a function $c:V\to C$ where $C$ is a set of “colors.” Colored isomorphism between two vertex-colored graphs asks only for isomorphisms that send vertices of one color to vertices of that same color. If we are interested in making a specific vertex $v\in V$ special (“individualizing” that vertex), we can assign this vertex a unique color. To reduce Colored Graph Isomorphism to ordinary Graph Isomorphism uses certain gadgets, and we adapt this idea to the context of 3-way arrays. We note that [FGS19] construct a related such gadget. In this paper, we develop a new gadget which we use both by itself, and in combination with the gadget from [FGS19] (albeit in a new context), see Sec. 4 and Sec. 7.

Our second new technique, used to show the reduction from $d$ TI to 3TI, is a simultaneous generalization of our reduction from 3TI to Algebra Isomorphism and the technique Grigoriev used [Gri81] to show that isomorphism in a certain restricted class of algebras is equivalent to GI. In brief outline: a 3-way array $\mathtt{A}$ specifies the structure constants of an algebra with basis $x_{1},\dotsc,x_{n}$ via $x_{i}\cdot x_{j}:=\sum_{k}\mathtt{A}(i,j,k)x_{k}$ , and this is essentially how we use it in the reduction from 3TI to Algebra Isomorphism. For arbitrary $d\geq 3$ , we would like to similarly use a $d$ -way array $\mathtt{A}$ to specify how $d$ -tuples of elements in some algebra $\mathcal{A}$ multiply. The issue is that for $\mathcal{A}$ to be an algebra, our construction must still specify how pairs of elements multiply. The basic idea is to let pairs (and triples, and so on, up to $(d-2)$ -tuples) multiply “freely” (that is, without additional relations), and then to use $\mathtt{A}$ to rewrite any product of $d-1$ generators as a linear combination of the original generators. While this construction as described already gives one direction of the reduction (if $\mathtt{A}\cong\mathtt{B}$ , then $\mathcal{A}\cong\mathcal{B}$ ), the other direction is trickier. For that, we modify the construction to an algebra in which short products (less than $d-2$ generators) do not quite multiply freely, but almost. After the fact, we found out that this construction generalizes the one used by Grigoriev [Gri81] to show that GI was equivalent Algebra Isomorphism for a certain class of algebras (see Sec. 4 for a comparison).

Organization.

We aim to reach as wide an audience as possible, so we start with a detailed introduction to the various isomorphism problems on 3-way arrays, and their algebraic and geometric interpretations in Sec. 2. We then describe our results in detail in Sec. 3 and consider related work in Sec. 4. An illustration of the key technique is in Sec. 5. These sections may be viewed as an extended abstract.

The remainder of the paper gives detailed proofs of all results. Sec. 6 contains additional preliminaries. In Sec. 7, we present those reductions which use the linear-algebraic coloring technique, thus proving Thm. A(2) and Thm. C. We then finish the proof of Thm. A by presenting the remaining reductions in Sec. 8. Thm. B is proved in Sec. 9. In Sec. 10, we put forward a theory of universality for basis-explicit linear structures, in analogy with [ZKT85]. While not yet complete, this seems to provide another justification for studying Tensor Isomorphism and related problems, and it motivates some interesting open questions. In Appendix A we give a reduction from Cubic Form Equivalence to Degree- $d$ Form Equivalence for any $d\geq 3$ (for $d>6$ this is easy; for $d=4$ it requires some work).

2 Preliminaries: Group actions on 3-way arrays

The formulas for most natural group actions on 3-way arrays are somewhat unwieldy; our experience suggests that they are more easily digested when presented in the context of some of the natural interpretations of 3-way arrays as mathematical objects. To connect the interpretations with the formulas themselves, one technical tool is very useful, namely, given a 3-way array $\mathtt{A}(i,j,k)$ , we define its frontal slices to be the matrices $A_{k}$ defined by $A_{k}(i,j):=\mathtt{A}(i,j,k)$ ; that is, we think of the box of $\mathtt{A}$ as arranged so that the $i$ and $j$ axes lie in the page, while the $k$ -axis is perpendicular to the page. Similarly, its lateral slices (viewing the 3D box of $\mathtt{A}$ “from the side”) are defined by $L_{j}(i,k):=\mathtt{A}(i,j,k)$ . An $\ell\times n\times m$ 3-way array thus has $m$ frontal slices and $n$ lateral slices.

A natural action on arrays of size $\ell\times n\times m$ is that of $\mathrm{GL}(\ell,\mathbb{F})\times\mathrm{GL}(n,\mathbb{F})\times\mathrm{GL}(m,\mathbb{F})$ by change of basis in each of the 3 directions, namely $((P,Q,R)\cdot\mathtt{A})(i^{\prime},j^{\prime},k^{\prime})=\sum_{i,j,k}\mathtt{A}(i,j,k)P_{ii^{\prime}}Q_{jj^{\prime}}R_{kk^{\prime}}$ . We will see several interpretations of this action below.

3-tensors.

A 3-way array $\mathtt{A}(i,j,k)$ , where $i\in[\ell]$ , $j\in[n]$ , and $k\in[m]$ , is naturally identified as a vector in $\mathbb{F}^{\ell}\otimes\mathbb{F}^{n}\otimes\mathbb{F}^{m}$ . Letting $\vec{e_{i}}$ denote the $i$ th standard basis vector of $\mathbb{F}^{n}$ , a standard basis of $\mathbb{F}^{\ell}\otimes\mathbb{F}^{n}\otimes\mathbb{F}^{m}$ is $\{\vec{e_{i}}\otimes\vec{e_{j}}\otimes\vec{e_{k}}\}$ . Then $\mathtt{A}$ represents the vector $\sum_{i,j,k}\mathtt{A}(i,j,k)\vec{e_{i}}\otimes\vec{e_{j}}\otimes\vec{e_{j}}$ in $\mathbb{F}^{\ell}\otimes\mathbb{F}^{n}\otimes\mathbb{F}^{m}$ . The natural action by $\mathrm{GL}(\ell,\mathbb{F})\times\mathrm{GL}(n,\mathbb{F})\times\mathrm{GL}(m,\mathbb{F})$ above corresponds to changes of basis of the three vector spaces in the tensor product. The problem of deciding whether two 3-way arrays are the same under this action is called 3-Tensor Isomorphism.999Some authors call this Tensor Equivalence; we use “Isomorphism” both because this is the natural notion of isomorphism for such objects, and because we will be considering many different equivalence relations on essentially the same underlying objects.

Matrix spaces.

Given a 3-way array $\mathtt{A}$ , it is natural to consider the linear span of its frontal slices, $\mathcal{A}=\langle A_{1},\dotsc,A_{m}\rangle$ , also called a matrix space. One convenience of this viewpoint is that the action of $\mathrm{GL}(m,\mathbb{F})$ becomes implicit: it corresponds to change of basis within the matrix space $\mathcal{A}$ . This allows us to generalize the three natural equivalence relations on matrices to matrix spaces: (1) two $\ell\times n$ matrix spaces $\mathcal{A}$ and $\mathcal{B}$ are equivalent if there exists $(P,Q)\in\mathrm{GL}(\ell,\mathbb{F})\times\mathrm{GL}(n,\mathbb{F})$ such that $P\mathcal{A}Q=\mathcal{B}$ , where $P\mathcal{A}Q:=\{PAQ:A\in\mathcal{A}\}$ ; (2) two $n\times n$ matrix spaces $\mathcal{A},\mathcal{B}$ are conjugate if there exists $P\in\mathrm{GL}(n,\mathbb{F})$ such that $P\mathcal{A}P^{-1}=\mathcal{B}$ ; and (3) they are isometric if $P\mathcal{A}P^{t}=\mathcal{B}$ . The corresponding decision problems, when $\mathcal{A}$ is given by a basis $A_{1},\dotsc,A_{d}$ , are Matrix Space Equivalence, Matrix Space Conjugacy, and Matrix Space Isometry, respectively.

Nilpotent groups.

If $A,B$ are two subsets of a group $G$ , then $[A,B]$ denotes the subgroup generated by all elements of the form $[a,b]=aba^{-1}b^{-1}$ , for $a\in A,b\in B$ . The lower central series of a group $G$ is defined as follows: $\gamma_{1}(G)=G$ , $\gamma_{k+1}(G)=[\gamma_{k}(G),G]$ . A group is nilpotent if there is some $c$ such that $\gamma_{c+1}(G)=1$ ; the smallest such $c$ is called the nilpotency class of $G$ , or sometimes just “class” when it is understood from context. A finite group is nilpotent if and only if it is the product of its Sylow subgroups; in particular, all groups of prime power order are nilpotent.

Bilinear maps, finite groups, and systems of polynomials.

While the matrix space viewpoint has the merit of drawing an analogy with the more familiar object of matrices, other interpretations lead to standard complexity problems that may be more familiar to some readers. For example, from an $\ell\times n\times m$ 3-way array $\mathtt{A}$ , we can construct a bilinear map (=system of $m$ bilinear forms) $f_{\mathtt{A}}:\mathbb{F}^{\ell}\times\mathbb{F}^{n}\to\mathbb{F}^{m}$ , sending $(u,v)\in\mathbb{F}^{\ell}\times\mathbb{F}^{n}$ to $(u^{t}A_{1}v,\dots,u^{t}A_{m}v)^{t}$ , where the $A_{k}$ are the frontal slices of $\mathtt{A}$ .101010In this paper elements in $\mathbb{F}^{n}$ are column vectors. The group action defining Matrix Space Equivalence is equivalent to the action of $\mathrm{GL}(\ell,\mathbb{F})\times\mathrm{GL}(n,\mathbb{F})\times\mathrm{GL}(m,\mathbb{F})$ on such bilinear maps.

When $\ell=n$ , the action in Matrix Space Isometry is equivalent to the natural action of $\mathrm{GL}(n,\mathbb{F})\times\mathrm{GL}(m,\mathbb{F})$ on such bilinear maps. Two bilinear maps that are essentially the same up to such basis changes are sometimes called pseudo-isometric [BW12].

Bilinear maps of the form $V\times V\to W$ turn out to arise naturally in group theory and algebraic geometry. When $A_{k}$ are skew-symmetric over $\mathbb{F}_{p}$ , $p$ an odd prime, Baer’s correspondence [Bae38] gives a bijection between finite $p$ -groups of class 2 and exponent $p$ , that is, in which $g^{p}=1$ for all $g$ and in which $[G,G]\leq Z(G)$ , and their corresponding bilinear maps $G/Z(G)\times G/Z(G)\to[G,G]$ , given by $(gZ(G),hZ(G))\mapsto[g,h]=ghg^{-1}h^{-1}$ . Two such groups are isomorphic if and only if their corresponding bilinear maps are pseudo-isometric, if and only if, using the matrix space terminology, the matrix spaces they span are isometric. When $A_{k}$ are symmetric, by the classical correspondences between symmetric matrices and homogeneous quadratic forms, a symmetric bilinear map naturally yields a quadratic map from $\mathbb{F}^{n}$ to $\mathbb{F}^{m}$ . The two quadratic maps are isomorphic if and only if the corresponding bilinear maps are pseudo-isometric.

Cubic forms & trilinear forms.

From a 3-way array $\mathtt{A}$ we can also construct a cubic form (=homogeneous degree 3 polynomial) $\sum_{i,j,k}\mathtt{A}(i,j,k)x_{i}x_{j}x_{k}$ , where $x_{i}$ are formal variables. If we consider the variables as commuting—or, equivalently, if $\mathtt{A}$ is symmetric, meaning it is unchanged by permuting its three indices—we get an ordinary cubic form; if we consider them as non-commuting, we get a trilinear form (or “non-commutative cubic form”). In either case, the natural notion of isomorphism here comes from the action of $\mathrm{GL}(n,\mathbb{F})$ on the $n$ variables $x_{i}$ , in which $P\in\mathrm{GL}(n,\mathbb{F})$ transforms the preceding form into $\sum_{ijk}\mathtt{A}(i,j,k)(\sum_{i^{\prime}}P_{ii^{\prime}}x_{i^{\prime}})(\sum_{j^{\prime}}P_{jj^{\prime}}x_{j^{\prime}})(\sum_{k^{\prime}}P_{kk^{\prime}}x_{k^{\prime}})$ . In terms of 3-way arrays, we get $(P\cdot\mathtt{A})(i^{\prime},j^{\prime},k^{\prime})=\sum_{ijk}\mathtt{A}(i,j,k)P_{ii^{\prime}}P_{jj^{\prime}}P_{kk^{\prime}}$ . The corresponding isomorphism problems are called Cubic Form Equivalence (in the commutative case) and Trilinear Form Equivalence.

Algebras.

We may also consider a 3-way array $\mathtt{A}(i,j,k)$ , $i,j,k\in[n]$ , as the structure constants of an algebra (which need not be associative, commutative, nor unital), say with basis $x_{1},\dotsc,x_{n}$ , and with multiplication given by $x_{i}\cdot x_{j}=\sum_{k}\mathtt{A}(i,j,k)x_{k}$ , and then extended (bi)linearly. Here the natural notion equivalence comes from the action of $\mathrm{GL}(n,\mathbb{F})$ by change of basis on the $x_{i}$ . Despite the seeming similarity of this action to that on cubic forms, it turns out to be quite different: given $P\in\mathrm{GL}(n,\mathbb{F})$ , let $\vec{x}^{\prime}=P\vec{x}$ ; then we have $x_{i}^{\prime}\cdot x_{j}^{\prime}=(\sum_{i}P_{i^{\prime}i}x_{i})\cdot(\sum_{j}P_{j^{\prime}j}x_{j})=\sum_{i,j}P_{i^{\prime}i}P_{j^{\prime}j}x_{i}\cdot x_{j}$ $=\sum_{i,j,k}P_{i^{\prime}i}P_{j^{\prime}j}\mathtt{A}(i,j,k)x_{k}=\sum_{i,j,k}P_{i^{\prime}i}P_{j^{\prime}j}\mathtt{A}(i,j,k)\sum_{k^{\prime}}(P^{-1})_{kk^{\prime}}x_{k^{\prime}}$ . Thus $\mathtt{A}$ becomes $(P\cdot\mathtt{A})(i^{\prime},j^{\prime},k^{\prime})=\sum_{ijk}\mathtt{A}(i,j,k)P_{i^{\prime}i}P_{j^{\prime}j}(P^{-1})_{kk^{\prime}}$ . The inverse in the third factor here is the crucial difference between this case and that of cubic or trilinear forms above, similar to the difference between matrix conjugacy and matrix isometry. The corresponding isomorphism problem is called Algebra Isomorphism.

Summary.

The isomorphism problems of the above structures all have 3-way arrays as the underlying object, but are determined by different group actions. It is not hard to see that there are essentially five group actions in total: 3-Tensor Isomorphism, Matrix Space Conjugacy, Matrix Space Isometry, Trilinear Form Equivalence, and Algebra Isomorphism. It turns out that these cover all the natural isomorphism problems on 3-way arrays in which the group acting is a product of $\mathrm{GL}(n,\mathbb{F})$ (where $n$ is the side length of the arrays); see Sec. 6.1 for discussion.

3 Main results

3.1 Equivalence of isomorphism problems for 3-way arrays

Definition 3.1 ( $d\mathsf{TI},\mathsf{TI}$ ).

For any field $\mathbb{F}$ , $d\mathsf{TI}_{\mathbb{F}}$ denotes the class of problems that are polynomial-time Turing (Cook) reducible to $d$ -Tensor Isomorphism over $\mathbb{F}$ .111111We follow a natural convention: when $\mathbb{F}$ is finite, a fixed algebraic extension of a finite field such as $\overline{\mathbb{F}}_{p}$ , the rationals, or a fixed algebraic extension of the rationals such as $\overline{\mathbb{Q}}$ , we consider the usual model of Turing machines; when $\mathbb{F}$ is $\mathbb{R}$ , $\mathbb{C}$ , the $p$ -adic rationals $\mathbb{Q}_{p}$ , or other more “exotic” fields, we consider this in the Blum–Shub–Smale model over $\mathbb{F}$ . When we write $d\mathsf{TI}$ without mentioning the field, the result holds for any field. $\mathsf{TI}_{\mathbb{F}}=\bigcup_{d\geq 1}d\mathsf{TI}_{\mathbb{F}}$ .

We now state our first main theorem.

Theorem A.

3-Tensor Isomorphism* reduces to each of the following problems in polynomial time.*

Group Isomorphism* for $p$ -groups exponent $p$ ( $g^{p}=1$ for all $g$ ) and class 2 ( $G/Z(G)$ is abelian) given by generating matrices over $\mathbb{F}_{p^{e}}$ . Here we consider only $\mathsf{3TI}_{\mathbb{F}_{p^{e}}}$ where $p$ is an odd prime.* 2. 2.

Matrix Space Isometry*, even for alternating or symmetric matrix spaces.* 3. 3.

Matrix Space Conjugacy*, and even the special cases:*

(a)

Matrix Lie Algebra Conjugacy*, for solvable Lie algebras $L$ of derived length 2.121212And even further, where $L/[L,L]\cong\mathbb{F}$ . * 2. (b)

Associative Matrix Algebra Conjugacy*.131313Even for algebras $A$ whose Jacobson radical $J(A)$ squares to zero and $A/J(A)\cong\mathbb{F}$ . * 4. 4.

Algebra Isomorphism*, and even the special cases:*

(a)

Associative Algebra Isomorphism*, for algebras that are commutative and unital, and for algebras that are commutative and 3-nilpotent ( $abc=0$ for all $a,b,c,\in A$ )* 2. (b)

Lie Algebra Isomorphism*, for 2-step nilpotent Lie algebras ( $[u,[v,w]]=0$ $\forall u,v,w$ )* 5. 5.

Cubic Form Equivalence* and Trilinear Form Equivalence.*

The algebras in (3) are given by a set of matrices which linearly span the algebra, while in (4) they are given by structure constants (see “Algebras” in Sec. 2).

Remark 3.2.

Agrawal & Saxena [AS05, Thm. 5] gave a reduction from Cubic Form Equivalence over $\mathbb{F}$ to Ring Isomorphism for commutative, unital, associative algebras over $\mathbb{F}$ , when every element of $\mathbb{F}$ has a cube root. For finite fields $\mathbb{F}_{q}$ , the only such fields are those for which $q=p^{2e+1}$ and $p\equiv 2\pmod{3}$ , which is asymptotically half of all primes. As explained after the proof of [AS05, Thm. 5], the use of cube roots seems inherent in their reduction.

Using our results in conjunction with [FGS19], we get a new reduction from Cubic Form Equivalence to Ring Isomorphism (for the same class of rings) which works over any field of characteristic 0 or $p>3$ . Note that our reduction is very different from the one in [AS05].

Figure 1 below summarizes where the various parts of Thm. A are proven.

We then resolve an open question well-known to the experts:141414We asked several experts who knew of the question, but we were unable to find a written reference. Interestingly, Oldenburger [Old36] worked on what we would call $d$ -Tensor Isomorphism as far back as the 1930s. We would be grateful for any prior written reference to the question of whether $d$ TI reduces to 3TI.

Theorem B.

$d$ -Tensor Isomorphism* reduces to Algebra Isomorphism.*

Since the main result of [FGS19] reduces the problems in Theorem A to 3-Tensor Isomorphism (cf. [FGS19, Rmk. 1.1]), we have:

Corollary B.

Each of the problems listed in Theorem A is $\mathsf{TI}$ -complete.151515For Cubic Form Equivalence, we only show that it is in $\mathsf{TI}_{\mathbb{F}}$ when $\operatorname{char}\mathbb{F}>3$ or $\operatorname{char}\mathbb{F}=0$ . In particular, $d{\sc TI}$ and ${\sc 3TI}$ are equivalent.

Remark 3.3.

This phenomenon is reminiscent of the transition in hardness from 2 to 3 in $k$ -SAT, $k$ -Coloring, $k$ -Matching, and many other $\mathsf{NP}$ -complete problems. It is interesting that an analogous phenomenon—a transition to some sort of “universality” from 2 to 3—occurs in the setting of isomorphism problems, which we believe are not $\mathsf{NP}$ -complete over finite fields.

Remark 3.4.

Here is a brief summary of what is known about the complexity of some of these problems. Over a finite field $\mathbb{F}_{q}$ , these problems are in $\mathsf{NP}\cap\mathsf{coAM}$ . For $\ell\times n\times m$ 3-way arrays, the brute-force algorithms run in time $q^{O(\ell^{2}+n^{2}+m^{2})}$ , as $\mathrm{GL}(n,\mathbb{F}_{q})$ can be enumerated in time $q^{\Theta(n^{2})}$ . Note that polynomial-time in the input size here would be $\mathrm{poly}(\ell,n,m,\log q)$ . Over any field $\mathbb{F}$ , these problems are in $\mathsf{NP}_{\mathbb{F}}$ in the Blum–Shub–Smale model. When the input arrays are over $\mathbb{Q}$ and we ask for isomorphism over $\mathbb{C}$ or $\mathbb{R}$ , these problems are in $\mathsf{PSPACE}$ using quantifier elimination. By Koiran’s celebrated result on Hilbert’s Nullstellensatz, for equivalence over $\mathbb{C}$ they are in $\mathsf{AM}$ assuming the Generalized Riemann Hypothesis [Koi96]. When the input is over $\mathbb{Q}$ and we ask for equivalence over $\mathbb{Q}$ , it is unknown whether these problems are even decidable; classically this is studied under Algebra Isomorphism for associative, unital algebras over $\mathbb{Q}$ (see, e. g., [AS06, Poo14]), but by Cor. B, the question of decidability is open for all of these problems.

Over finite fields, several of these problems can be solved efficiently when one of the side lengths of the array is small. For $d$ -dimensional spaces of $n\times n$ matrices, Matrix Space Conjugacy and Isometry can be solved in $q^{O(n^{2})}\cdot\mathrm{poly}(d,n,\log q)$ time: once we fix an element of $\mathrm{GL}(n,\mathbb{F}_{q})$ , the isomorphism problem reduces to solving linear systems of equations. Less trivially, Matrix Space Conjugacy can be solved in time $q^{O(d^{2})}\cdot\mathrm{poly}(d,n,\log q)$ and 3TI for $n\times m\times d$ tensors in time $q^{O(d^{2})}\cdot\mathrm{poly}(d,n,m,\log q)$ , since once we fix an element of $\mathrm{GL}(d,\mathbb{F}_{q})$ , the isomorphism problem either becomes an instance of, or reduces to [IQ18], Module Isomorphism, which admits several polynomial-time algorithms [BL08, CIK97, IKS10, Ser00]. Finally, one can solve Matrix Space Isometry in time $q^{O(d^{2})}\cdot\mathrm{poly}(d,n,\log q)$ : once one fixes an element of $\mathrm{GL}(d,\mathbb{F}_{q})$ , there is a rather involved algorithm [IQ18], which uses the $*$ -algebra technique originated from the study of computing with $p$ -groups [Wil09, BW12].

3.2 Relations with Graph Isomorphism and Code

Equivalence

We observe then Graph Isomorphism and Code Equivalence reduce to 3-Tensor Isomorphism. In particular, the class $\mathsf{TI}$ contains the classical graph isomorphism class $\mathsf{GI}$ .

Recall Code Equivalence asks to decide whether two linear codes are the same up to a linear transformation preserving the Hamming weights of codes. Here the linear codes are just subspaces of $\mathbb{F}_{q}^{n}$ of dimension $d$ , represented by linear bases. Linear transformations preserving the Hamming weights include permutations and monomial transformations. Recall that the latter consists of matrices where every row and every column has exactly one non-zero entry. Indeed, over many fields this is without loss of generality, as Hamming-weight-preserving linear maps are always induced by monomial transformations (first proved over finite fields [Mac62], and more recently over much more general algebraic objects, e. g., [GNW04]). CodeEq has long been studied in the coding theory community; see e.g. [PR97, Sen00].

For Code Equivalence, we observe that previous results already combine to give:

Observation 3.5.

Code Equivalence* (under permutations) reduces to 3-Tensor Isomorphism.*

Proof.

Code Equivalence reduces to Matrix Lie Algebra Conjugacy [Gro12a], a special case of Matrix Space Conjugacy, which in turn reduces to 3TI [FGS19]. ∎

Using the linear-algebraic coloring gadget, we can extend this to equivalence of codes under monomial transformations (see Sec. 5). Given two $d\times n$ matrices $A,B$ over $\mathbb{F}$ of rank $d$ , the Monomial Code Equivalence problem is to decide whether there exist $Q\in\mathrm{GL}(d,\mathbb{F})$ and a monomial matrix $P\in\mathrm{Mon}(n,\mathbb{F})\leq\mathrm{GL}(n,\mathbb{F})$ (product of a diagonal matrix and a permutation matrix) such that $QAP=B$ .

Proposition 3.6.

Monomial Code Equivalence* reduces to 3-Tensor Isomorphism.*

Since Graph Isomorphism reduces to Code Equivalence [Luk93] (see [Miy96]) and [PR97] (even over arbitrary fields [Gro12a]), by Obs. 3.5 and Thm. A, we have the following.

Corollary 3.7.

Graph Isomorphism* reduces to Alternating Matrix Space Isometry.*

Using our linear-algebraic gadgets, we also reprove this result using a much more direct reduction (see Prop. 7.1). Besides being a different construction, another reason for the additional proof is that the technique leads to the search-to-decision reduction, which we discuss below.

3.3 Application to Group Isomorphism: reducing the nilpotency class

For several reasons, the hardest cases of Group Isomorphism are believed to be $p$ -groups of class 2 and exponent $p$ ; recall that these are groups in which every element has order $p$ , the order of the group is $p^{n}$ , and $G/Z(G)$ is abelian. See Nilpotent groups above. While this belief has been widely held for many decades, we are not aware of any prior reduction from a more general class of groups to this class. However, by combining our results with the Lazard correspondence, we immediately get such a reduction.

Corollary P.

Let $p$ be an odd prime. For groups generated by $m$ matrices of size $n\times n$ , Group Isomorphism for $p$ -groups of exponent $p$ and class $c<p$ reduces to Group Isomorphism for $p$ -groups of exponent $p$ and class $2$ in time $\mathrm{poly}(n,m,\log p)$ .

Proof.

By the Lazard correspondence (reproduced as Thm. 6.4 below) two $p$ -groups of exponent $p$ and class $c<p$ are isomorphic if and only if their corresponding $\mathbb{F}_{p}$ -Lie algebras are. By Prop. 6.5, we can construct a generating set for the corresponding Lie algebra by applying the power series for logarithm to the generating matrices of $G$ . This Lie algebra is thus a subalgebra of $n\times n$ matrices, so we can generate the entire Lie algebra (using the linear-algebra version of breadth-first search; its dimension is $\leq n^{2}$ ) and compute its structure constants in time polynomial in $n$ , $m$ , and $\log p$ . Then use [FGS19] to reduce isomorphism of Lie algebras to TI, and then apply Thm. A (specifically, Cor. 7.6) to reduce to isomorphism of $p$ -groups of exponent $p$ and class $2$ given by a matrix generating set. ∎

The only obstacle to getting this proof to work in the Cayley table model is that our reduction from TI to Alternating Matrix Space Isometry (Prop. 7.3) blows up the dimension quadratically, which means the size of the group becomes $|G|^{O(\log|G|)}$ after the reduction. See Question 10.5.

3.4 Search to decision reductions

Reducing search problems to their associated decision problems is a classical and intriguing topic in complexity theory. Aside from the now-standard search-to-decision reduction for SAT, one of the earliest results in this direction was by Valiant in the 1970’s [Val76]. A celebrated result of Bellare and Goldwasser shows that, assuming $\mathsf{EE}\neq\mathsf{NEE}$ , there exists a language in $\mathsf{NP}$ for which search does not reduce to decision under polynomial-time reductions [BG94]. However, as usual for such statements based on complexity-theoretic assumptions, the problems constructed by such a proof are considered somewhat unnatural. For natural problems, on the one hand, there are search-to-decision reductions for $\mathsf{NP}$ -complete problems and for GI. On the other hand, such is not known, nor expected to be the case, for Nash Equilibrium [CDT09] (for which decision is trivial).

Reducing search to decision is particularly intriguing for testing isomorphism of groups. One difficulty is that it is not clear how to guess a partial solution, and then make progress by restricting to a subgroup. In general, testing isomorphism of certain algebraic structures (groups, algebras, etc.) forms a large family of problems for which search-to-decision reductions are not known.

Because of the close relationship between 3TI and isomorphism of various algebraic structures, one might expect similar difficulties in reducing search to decision for 3TI, and thus for $\mathsf{TI}$ -complete problems as well. Nonetheless, for Alternating Matrix Space Isometry, we are able to use the linear-algebraic coloring gadgets to get a non-trivial search-to-decision reduction.

Theorem C.

There is a search-to-decision reduction for Alternating Matrix Space Isometry which, given $n\times n$ alternating matrix spaces $\mathcal{A},\mathcal{B}$ over $\mathbb{F}_{q}$ , computes an isometry between them if they are isometric, in time $q^{\tilde{O}(n)}$ . The reduction queries the decision oracle with inputs of dimension at most $O(n^{2})$ .

As a consequence, a $q^{\tilde{O}(\sqrt{n})}$ -time decision algorithm would result in a $q^{\tilde{O}(n)}$ -time search algorithm, in contrast with the brute-force $q^{O(n^{2})}$ running time. Note that in this context, the size of the input is $\mathrm{poly}(n,\log q)$ , so a $q^{\tilde{O}(\sqrt{n})}$ running time is still quite generous.

By the connection between Alternating Matrix Space Isometry and Group Isomorphism for $p$ -groups of class $2$ and exponent $p$ , we have the following. Note that the natural succinct input representation mentioned in the following result can have size $\mathrm{poly}(\ell,\log p)=\mathrm{poly}(\log|G|)$ .

Corollary C.

Let $p$ be an odd prime, and let GpIso2Exp( $p$ ) denote the isomorphism problem for $p$ -groups of class 2 and exponent $p$ in the model of matrix groups over $\mathbb{F}_{p}$ . For groups of order $p^{\ell}$ , there is a search-to-decision reduction for GpIso2Exp( $p$ ) running in time $|G|^{O(\log\log|G|)}=p^{\tilde{O}(\ell)}$ .

4 Related work

The most closely related work is that of Futorny, Grochow, and Sergeichuk [FGS19]. They show that a large family of isomorphism problems on 3-way arrays—including those involving multiple 3-way arrays simultaneously, or 3-way arrays that are partitioned into blocks, or 3-way arrays where some of the blocks or sides are acted on by the same group (e. g., Matrix Space Isometry)—all reduce to 3TI. Our work complements theirs in that all our reductions for Thm. A go in the opposite direction, reducing 3TI to other problems. Some of our other results relate GI and Code Equivalence to 3TI; the latter problems were not considered in [FGS19]. Thm. B considers $d$ -tensors for any $d\geq 3$ , which were not considered in [FGS19].

In [AS05, AS06], Agrawal and Saxena considered Cubic Form Equivalence and testing isomorphism of commutative, associative, unital algebras. They showed that GI reduces to Algebra Isomorphism; Commutative Algebra Isomorphism reduces to Cubic Form Equivalence; and Homogeneous Degree- $d$ Form Equivalence reduces to Algebra Isomorphism assuming that the underlying field has $d$ th root for every field element. By combining a reduction from [FGS19], Prop. 7.3, and Cor. 8.5, we get a new reduction from Cubic Form Equivalence to Algebra Isomorphism that works over any field in which $3!$ is a unit, which is fields of characteristic [math] or $p>3$ .

There are several other works which consider related isomorphism problems. Grigorev [Gri81] showed that GI is equivalent to isomorphism of unital, associative algebras $A$ such that the radical $R(A)$ squares to zero and $A/R(A)$ is abelian. Interestingly, we show $\mathsf{TI}$ -completeness for conjugacy of matrix algebras with the same abstract structure (even when $A/R(A)$ is only 1-dimensional). Note the latter problem is equivalent to asking whether two representations of $A$ are equivalent up to automorphisms of $A$ . In the proof of Thm. B, which uses algebras in which $R(A)^{d}=0$ when reducing from $d$ TI, we use Grigoriev’s result.

Brooksbank and Wilson [BW15] showed a reduction from Associative Algebra Isomorphism (when given by structure constants) to Matrix Algebra Conjugacy. Grochow [Gro12a], among other things, showed that GI and CodeEq reduce to Matrix Lie Algebra Conjugacy, which is a special case of Matrix Space Conjugacy.

In [KS06], Kayal and Saxena considered testing isomorphism of finite rings when the rings are given by structure constants. This problem generalizes testing isomorphism of algebras over finite fields. They put this problem in $\mathsf{NP}\cap\mathsf{coAM}$ [KS06, Thm. 4.1], reduce GI to this problem [KS06, Thm. 4.4], and prove that counting the number of ring automorphism (#RA) is in $\mathsf{FP}^{\mathsf{AM}\cap\mathsf{coAM}}$ [KS06, Thm. 5.1]. They also present a $\mathsf{ZPP}$ reduction from GI to #RA, and show that the decision version of the ring automorphism problem is in $\mathsf{P}$ .

To summarize this zoo of isomorphism problems and reductions, we include Figure 2 for reference.

5 Overview of one new technique, and one full proof

In this section we describe one of the key new techniques in this paper: a linear-algebraic coloring gadget. We exhibit this gadget by giving the full proof of Prop. 3.6 as an example. A related gadget was used in [FGS19] to show reductions to 3TI; our reductions all go in the opposite direction. Furthermore, whereas the gadgets used in [FGS19] were primarily to ensure that two different blocks could not be mixed, our gadgets allow us to ensure that certain slices of a tensor can be permuted, while disallowing more general linear transformations.

In the context of GI, there are many ways to reduce Colored GI to ordinary GI; here we give one example, which will serve as an analogy for our linear-algebraic gadget. To individualize a vertex $v\in G$ (give it a unique color), attach to it a large “star”: if $|V(G)|=n$ , add $n+1$ new vertices to $G$ and attach them all to $v$ ; call the resulting graph $G_{v}$ . This has the effect that any automorphism of $G_{v}$ must fix $v$ , since $v$ has a degree strictly larger than any other vertex. Furthermore, if $H_{w}$ is obtained by a similar construction, then there is an isomorphism $G\to H$ which sends $v\mapsto w$ if and only if $G_{v}\cong H_{w}$ . Finally, if we attach stars of size $n+1$ to multiple vertices $v_{1},\dotsc,v_{k}$ , then any automorphism of $G$ must permute the $v_{i}$ amongst themselves, and there is an isomorphism $G\to H$ sending $\{v_{1},\dotsc,v_{k}\}\mapsto\{w_{1},\dotsc,w_{k}\}$ if and only if the corresponding enlarged graphs are isomorphic.

We adapt this idea to the context of 3-way arrays. Let $\mathtt{A}$ be an $\ell\times n\times m$ 3-way array, with lateral slices $L_{1},L_{2},\dotsc,L_{n}$ (each an $\ell\times m$ matrix). For any vector $v\in\mathbb{F}^{n}$ , we get an associated lateral matrix $L_{v}$ , which is a linear combination of the lateral slices as given, namely $L_{v}:=\sum_{j=1}^{n}v_{j}L_{j}$ (note that when $v=\vec{e_{j}}$ is the $j$ -th standard basis vector, the associated lateral matrix is indeed $L_{j}$ ). By analogy with adjacency matrices of graphs, $L_{v}$ is a natural analogue of the neighborhood of a vertex in a graph. Correspondingly, we get a notion of “degree,” which we may define as

[TABLE]

The last two characterizations are analogous to the fact that the degree of a vertex $v$ in a graph $G$ may be defined as the number of “in-neighbors” (nonzero entries the corresponding row of the adjacency matrix) or the number of “out-neighbors” (nonzero entries in the corresponding column).

To “individualize” $v$ , we can enlarge $\mathtt{A}$ with a gadget to increase $\deg_{\mathtt{A}}(v)$ , as in the graph case. Note that $\deg_{\mathtt{A}}(v)\leq\min\{\ell,m\}$ because the lateral matrices are all of size $\ell\times m$ . For notational simplicity, let us individualize $v=\vec{e_{1}}=(1,0,\dotsc,0)^{t}$ . To individualize $v$ , we will increase its degree by $d=\min\{\ell,m\}+1>\max_{v\in\mathbb{F}^{n}}\deg_{\mathtt{A}}(v)$ . Extend $\mathtt{A}$ to a new 3-way array $\mathtt{A}_{v}$ of size $(\ell+d)\times n\times(m+d)$ ; in the “first” $\ell\times n\times m$ “corner”, we will have the original array $\mathtt{A}$ , and then we will append to it an identity matrix in one slice to increase $\deg(v)$ . More specifically, the lateral slices of $\mathtt{A}_{v}$ will be

[TABLE]

Now we have that $\deg_{\mathtt{A}_{v}}(v)\geq d$ . This almost does what we want, but now note that any vector $w=(w_{1},\dotsc,w_{n})$ with $w_{1}\neq 0$ has $\deg_{\mathtt{A}_{v}}(w)=\mathrm{rk}(w_{1}L_{1}^{\prime}+\sum_{j\geq 2}w_{j}L_{j})\geq d$ . We can nonetheless consider this a sort of linear-algebraic individualization.

Leveraging this trick, we can then individualize an entire basis of $\mathbb{F}^{n}$ simultaneously, so that $d\leq\deg(v)<2d$ for any vector $v$ in our basis, and $\deg(v^{\prime})\geq 2d$ for any nonzero $v^{\prime}$ outside the basis (not a scalar multiple of one of the basis vectors), as we do in the following proof of Prop. 3.6. This is also a 3-dimensional analogue of the reduction from GI to CodeEq [Luk93, Miy96, PR97] (where they use Hamming weight instead of rank).

Proof of Prop. 3.6.

Without loss of generality we assume $d>1$ , as the problem is easily solvable when $d=1$ . We treat a $d\times n$ matrix $A$ as a 3-way array of size $d\times n\times 1$ , and then follow the outline proposed above, of individualizing the entire standard basis $\vec{e_{1}},\dotsc,\vec{e_{n}}$ . Since the third direction only has length 1, the maximum degree of any column is 1, so it suffices to use gadgets of rank 2. More specifically, we build a $(d+2n)\times n\times(1+2n)$ 3-way array $\mathtt{A}$ whose lateral slices are

[TABLE]

where the $I_{2}$ block is in the $j$ -th block of size 2 (that is, rows $d+2(j-1)+\{1,2\}$ and columns $2(j-1)+\{1,2\}$ ). It will also be useful to visualize the frontal slices of $\mathtt{A}$ , as follows. Here each entry of the “matrix” below is actually a $(1+2n)$ -dimensional vector, “coming out of the page”:

[TABLE]

(In $\mathtt{A}$ we turn the vectors $\tilde{a}_{i,j}$ and $e_{i,j}$ “on their side” so they become perpendicular to the page. )

We claim that $A$ and $B$ are monomially equivalent as codes if and only if $\mathtt{A}$ and $\mathtt{B}$ are isomorphic as 3-tensors.

( $\Rightarrow$ ) Suppose $QADP=B$ where $Q\in\mathrm{GL}(n,\mathbb{F})$ , $D=\operatorname{diag}(\alpha_{1},\dotsc,\alpha_{n})$ and $P\in S_{n}\leq\mathrm{GL}(n,\mathbb{F})$ . Then by examining the frontal slices it is not hard to see that for $Q^{\prime}=\begin{bmatrix}Q&0\\ 0&(DP)^{-1}\otimes I_{2}\end{bmatrix}$ (where $DP^{-1}\otimes I_{2}$ denotes a $2n\times 2n$ block matrix, where the pattern of the nonzero blocks and the scalars are governed by $(DP)^{-1}$ , and each $2\times 2$ block is either zero or a scalar multiple of $I_{2}$ ) we have $Q^{\prime}A_{1}DP=B_{1}$ and $Q^{\prime}A_{1+2(i-1)+j}DP=B_{1+2(\pi(i)-1)+j}$ , where $\pi$ is the permutation corresponding to $P$ . Thus $\mathtt{A}$ and $\mathtt{B}$ are isomorphic tensors, via the isomorphism $(Q^{\prime},DP,\operatorname{diag}(I_{1},P))$ .

( $\Leftarrow$ ) Suppose there exist $Q\in\mathrm{GL}(d+2n,\mathbb{F})$ , $P\in\mathrm{GL}(n,\mathbb{F})$ , and $R\in\mathrm{GL}(1+2n,\mathbb{F})$ , such that $Q\mathtt{A}P=\mathtt{B}^{R}$ . First, note that every lateral slice of $\mathtt{A}$ is of rank either $2$ or $3$ , and the actions of $Q$ and $R$ do not change the ranks of the lateral slices. Furthermore, any non-trivial linear combination of more than $1$ lateral slice results in a lateral matrix of rank $\geq 4$ . It follows that $P$ cannot take nontrivial linear combinations of the lateral slices, hence it must be monomial.

Now consider the frontal slices. Note that, as we assume $d>1$ , every frontal slice of $Q\mathtt{A}P$ , except the first one, is of rank $1$ . Therefore, $R$ must be of the form $\begin{bmatrix}r_{1,1}&\mathbf{0}_{1\times(n-1)}\\ \vec{r^{\prime}}&R^{\prime}\end{bmatrix}$ where $R^{\prime}$ is $(n-1)\times(n-1)$ . Since $R$ is invertible, we must have $r_{1,1}\neq 0$ , and the first frontal slice of $\mathtt{B}^{R}$ contains all the rows of $B$ scaled by $r_{1,1}$ in its first $d$ rows. The first frontal slice of $Q\mathtt{A}P$ is a matrix that generates, by definition (and since we’ve shown $P$ is monomial), a code monomially equivalent to $A$ . Since the first frontal slices of $Q\mathtt{A}P$ and $\mathtt{B}^{R}$ are equal, and the latter is just a scalar multiple of $B_{1}$ , we have that $A$ and $B$ are monomially equivalent as codes as well. ∎

6 Preliminaries

Vector spaces.

Let $\mathbb{F}$ be a field. In this paper we only consider finite-dimensional vector spaces over $\mathbb{F}$ . We use $\mathbb{F}^{n}$ to denote the vector space of length- $n$ column vectors. The $i$ th standard basis vector of $\mathbb{F}^{n}$ is denoted as $\vec{e_{i}}$ . Depending on the context, $\mathbf{0}$ may denote the zero vector space, a zero vector, or an all-zero matrix. Let $S$ be a subset of vectors. We use $\langle S\rangle$ to denote the subspace spanned by elements in $S$ .

Some groups.

The general linear group of degree $n$ over a field $\mathbb{F}$ is denoted by $\mathrm{GL}(n,\mathbb{F})$ . The symmetric group of degree $n$ is denoted by $\mathrm{S}_{n}$ . The natural embedding of $\mathrm{S}_{n}$ into $\mathrm{GL}(n,\mathbb{F})$ is to represent permutations by permutation matrices. A monomial matrix in $\mathrm{M}(n,\mathbb{F})$ is a matrix where each row and each column has exactly one non-zero entry. All monomial matrices form a subgroup of $\mathrm{GL}(n,\mathbb{F})$ which we call the monomial subgroup, denoted by $\mathrm{Mon}(n,\mathbb{F})$ , which is isomorphic to the semi-direct product $\mathbb{F}^{n}\rtimes S_{n}$ . The subgroup of $\mathrm{GL}(n,\mathbb{F})$ consisting of block upper-triangular matrices with a fixed block structure is called a (standard) parabolic subgroup.

Matrices.

Let $\mathrm{M}(\ell\times n,\mathbb{F})$ be the linear space of $\ell\times n$ matrices over $\mathbb{F}$ , and $\mathrm{M}(n,\mathbb{F}):=\mathrm{M}(n\times n,\mathbb{F})$ . Given $A\in\mathrm{M}(\ell\times n,\mathbb{F})$ , $A^{t}$ denotes the transpose of $A$ .

A matrix $A\in\mathrm{M}(n,\mathbb{F})$ is symmetric, if for any $u,v\in\mathbb{F}^{n}$ , $u^{t}Av=v^{t}Au$ , or equivalently $A=A^{t}$ . That is, $A$ represents a symmetric bilinear form. A matrix $A\in\mathrm{M}(n,\mathbb{F})$ is alternating, if for any $u\in\mathbb{F}^{n}$ , $u^{t}Au=0$ . That is, $A$ represents an alternating bilinear form. Note that in characteristic $\neq 2$ , alternating is the same as skew-symmetric, but in characteristic 2 they differ (in characteristic 2, skew-symmetric=symmetric). The linear space of $n\times n$ alternating matrices over $\mathbb{F}$ is denoted by $\Lambda(n,\mathbb{F})$ .

The $n\times n$ identity matrix is denoted by $I_{n}$ , and when $n$ is clear from the context, we may just write $I$ . The elementary matrix $E_{i,j}$ is the matrix with the $(i,j)$ th entry being $1$ , and other entries being [math]. The $(i,j)$ -th elementary alternating matrix is the matrix $E_{i,j}-E_{j,i}$ .

Matrix tuples.

We use $\mathrm{M}(\ell\times n,\mathbb{F})^{m}$ to denote the linear space of $m$ -tuples of $\ell\times n$ matrices. Boldface letters like $\mathbf{A}$ and $\mathbf{B}$ denote matrix tuples. Let $\mathbf{A}=(A_{1},\dots,A_{m}),\mathbf{B}=(B_{1},\dots,B_{m})\in\mathrm{M}(\ell\times n,\mathbb{F})^{m}$ . Given $P\in\mathrm{M}(\ell,\mathbb{F})$ and $Q\in\mathrm{M}(n,\mathbb{F})$ , $P\mathbf{A}Q:=(PA_{1}Q,\dots,PA_{m}Q)\in\mathrm{M}(\ell,\mathbb{F})$ . Given $R=(r_{i,j})_{i,j\in[m]}\in\mathrm{M}(m,\mathbb{F})$ , $\mathbf{A}^{R}:=(A_{1}^{\prime},\dots,A_{m}^{\prime})\in\mathrm{M}(m,\mathbb{F})$ where $A_{i}^{\prime}=\sum_{j\in[m]}r_{j,i}A_{j}$ .

Remark 6.1.

In particular, note that $A_{i}^{\prime}$ corresponds to the entries in the $i$ th column of $R$ . While this choice is immaterial (we could have chosen the opposite convention), all of our later calculations are consistent with this convention.

Given $\mathbf{A},\mathbf{B}\in\mathrm{M}(\ell\times n,\mathbb{F})^{m}$ , we say that $\mathbf{A}$ and $\mathbf{B}$ are equivalent, if there exist $P\in\mathrm{GL}(\ell,\mathbb{F})$ and $Q\in\mathrm{GL}(n,\mathbb{F})$ , such that $P\mathbf{A}Q=\mathbf{B}$ . Let $\mathbf{A},\mathbf{B}\in\mathrm{M}(n,\mathbb{F})^{m}$ . Then $\mathbf{A}$ and $\mathbf{B}$ are conjugate, if there exists $P\in\mathrm{GL}(n,\mathbb{F})$ , such that $P^{-1}\mathbf{A}P=\mathbf{B}$ . And $\mathbf{A}$ and $\mathbf{B}$ are isometric, if there exists $P\in\mathrm{GL}(n,\mathbb{F})$ , such that $P^{t}\mathbf{A}P=\mathbf{B}$ . Finally, $\mathbf{A}$ and $\mathbf{B}$ are pseudo-isometric, if there exist $P\in\mathrm{GL}(n,\mathbb{F})$ and $R\in\mathrm{GL}(m,\mathbb{F})$ , such that $P^{t}\mathbf{A}P=\mathbf{B}^{R}$ .

Matrix spaces.

Linear subspaces of $\mathrm{M}(\ell\times n,\mathbb{F})$ are called matrix spaces. Calligraphic letters like $\mathcal{A}$ and $\mathcal{B}$ denote matrix spaces. By a slight abuse of notation, for $\mathbf{A}\in\mathrm{M}(\ell\times n,\mathbb{F})^{m}$ , we use $\langle\mathbf{A}\rangle$ to denote the subspace spanned by those matrices in $\mathbf{A}$ .

3-way arrays.

Let $\mathrm{T}(\ell\times n\times m,\mathbb{F})$ be the linear space of $\ell\times n\times m$ 3-way arrays over $\mathbb{F}$ . We use the fixed-width teletypefont for 3-way arrays, like $\mathtt{A}$ , $\mathtt{B}$ , etc..

Given $\mathtt{A}\in\mathrm{T}(\ell\times n\times m,\mathbb{F})$ , we can think of $\mathtt{A}$ as a 3-dimensional table, where the $(i,j,k)$ th entry is denoted as $\mathtt{A}(i,j,k)\in\mathbb{F}$ . We can slice $\mathtt{A}$ along one direction and obtain several matrices, which are then called slices. For example, slicing along the first coordinate, we obtain the horizontal slices, namely $\ell$ matrices $A_{1},\dots,A_{\ell}\in\mathrm{M}(n\times m,\mathbb{F})$ , where $A_{i}(j,k)=\mathtt{A}(i,j,k)$ . Similarly, we also obtain the lateral slices by slicing along the second coordinate, and the frontal slices by slicing along the third coordinate.

We will often represent a 3-way array as a matrix whose entries are vectors. That is, given $\mathtt{A}\in\mathrm{T}(\ell\times n\times m,\mathbb{F})$ , we can write

[TABLE]

where $w_{i,j}\in\mathbb{F}^{m}$ , so that $w_{i,j}(k)=\mathtt{A}(i,j,k)$ . Note that, while $w_{i,j}\in\mathbb{F}^{m}$ are column vectors, in the above representation of $\mathtt{A}$ , we should think of them as along the direction “orthogonal to the paper.” Following [KB09], we call $w_{i,j}$ the tube fibers of $\mathtt{A}$ . Similarly, we can have the row fibers $v_{i,k}\in\mathbb{F}^{n}$ such that $v_{i,k}(j)=\mathtt{A}(i,j,k)$ , and the column fibers $u_{j,k}\in\mathbb{F}^{\ell}$ such that $u_{j,k}(i)=\mathtt{A}(i,j,k)$ .

Given $P\in\mathrm{M}(\ell,\mathbb{F})$ and $Q\in\mathrm{M}(n,\mathbb{F})$ , let $P\mathtt{A}Q$ be the $\ell\times n\times m$ $3$ -way array whose $k$ th frontal slice is $PA_{k}Q$ . For $R=(r_{i,j})\in\mathrm{GL}(m,\mathbb{F})$ , let $\mathtt{A}^{R}$ be the $\ell\times n\times m$ $3$ -way array whose $k$ th frontal slice is $\sum_{k^{\prime}\in[m]}r_{k^{\prime},k}A_{k^{\prime}}$ . Note that these notations are consistent with the notations for matrix tuples above, when we consider the matrix tuple $\mathbf{A}=(A_{1},\dotsc,A_{k})$ of frontal slices of $\mathtt{A}$ .

Let $\mathtt{A}\in\mathrm{T}(\ell\times n\times m,\mathbb{F})$ be a 3-way array. We say that $\mathtt{A}$ is non-degenerate as a 3-tensor if the horizontal slices of $\mathtt{A}$ are linearly independent, the lateral slices are linearly independent, and the frontal slices are linearly independent. Let $\mathbf{A}=(A_{1},\dots,A_{m})\in\mathrm{M}(\ell\times n,\mathbb{F})^{m}$ be a matrix tuple consisting of the frontal slices of $\mathtt{A}$ . Then it is easy to see that the frontal slices of $\mathtt{A}$ are linearly independent if and only if $\dim(\langle\mathbf{A}\rangle)=m$ . The lateral (resp., horizontal) slices of $\mathtt{A}$ are linearly independent if and only if the intersection of the right (resp., left) kernels of $A_{i}$ is zero.

Observation 6.2.

Given $3$ -way arrays $\mathtt{A}$ and $\mathtt{B}$ , we can construct non-degenerate 3-way arrays $\mathtt{A}^{\prime}$ and $\mathtt{B}^{\prime}$ in polynomial time, such that $\mathtt{A}$ and $\mathtt{B}$ are isomorphic as 3-tensors if and only if $\mathtt{A}^{\prime}$ and $\mathtt{B}^{\prime}$ are isomorphic as 3-tensors.

Multi-way arrays.

For $d\geq 3$ , we use similar notation to 3-way arrays, which we will not belabor. Here we merely observe:

Observation 6.3.

For any $d^{\prime}\geq d$ , $d$ -TI reduces to $d^{\prime}$ -TI.

Proof.

Given an $n_{1}\times\dotsb\times n_{d}$ $d$ -way array $\mathtt{A}$ , we embed it as a $d^{\prime}$ -way array $\tilde{\mathtt{A}}$ of format $n_{1}\times\dotsb\times n_{d}\times 1\times 1\times\dotsb\times 1$ . If $\mathtt{A}\cong\mathtt{B}$ as $d$ -tensors, say via $(P_{1},\dotsc,P_{d})$ , then $\tilde{\mathtt{A}}\cong\tilde{\mathtt{B}}$ as $d^{\prime}$ -tensors via $(P_{1},\dotsc,P_{d},1,1,\dotsc,1)$ . Conversely, if $\tilde{\mathtt{A}}\cong\tilde{\mathtt{B}}$ via $(P_{1},\dotsc,P_{d},\alpha_{d+1},\dotsc,\alpha_{d^{\prime}})$ , then $\mathtt{A}\cong\mathtt{B}$ via $(\alpha_{d+1}\alpha_{d+2}\dotsb\alpha_{d^{\prime}}P_{1},\dotsc,P_{d})$ . That is, all that can “go wrong” under this embedding is multiplication by scalars, but those scalars can be absorbed into any one of the $P_{i}$ . ∎

Algebras and their algorithmic representations.

An algebra $A$ consists of a vector space $V$ and a bilinear map $\circ:V\times V\to V$ . This bilinear map defines the product $\circ$ in this algebra. Note that we do not assume $A$ to be unital (having an identity), associative, alternating, nor satisfying the Jacobi identity. In the literature, an algebra without such properties is sometimes called a non-associative algebra (but also, as usual, associative algebras are a special case of non-associative algebras).

As in Section 1, after fixing an ordered basis $(b_{1},\dots,b_{n})$ where $b_{i}\in\mathbb{F}^{n}$ of $V\cong\mathbb{F}^{n}$ , this bilinear map $\circ$ can be represented by an $n\times n\times n$ 3-way array $\mathtt{A}$ , such that $b_{i}\circ b_{j}=\sum_{k\in[n]}\mathtt{A}(i,j,k)b_{k}$ . This is the structural constant representation of $\mathtt{A}$ . Algorithms for associative algebras and Lie algebras have been studied intensively in this model, e. g., [IR99, dG00].

It is also natural to consider matrix spaces that are closed under multiplication or commutator. More specifically, let $\mathcal{A}\leq\mathrm{M}(n,\mathbb{F})$ be a matrix space. If $\mathcal{A}$ is closed under multiplication, that is, for any $A,B\in\mathcal{A}$ , $AB\in\mathcal{A}$ , then $\mathcal{A}$ is a matrix (associative) algebra with the product being the matrix multiplication. If $\mathcal{A}$ is closed under commutator, that is, for any $A,B\in\mathcal{A}$ , $[A,B]=AB-BA\in\mathcal{A}$ , then $\mathcal{A}$ is a matrix Lie algebra with the product being the commutator. Algorithms for matrix algebras and matrix Lie algebras have also been studied, e. g., [EG00, Iva00, IR99].

The Lazard correspondence for $p$ -groups.

The Lazard correspondence is a correspondence between certain classes of groups and Lie algebras, which extends the usual correspondence between Lie groups and Lie algebras (say, over $\mathbb{R}$ ) to some groups and Lie algebras in positive characteristic. Here we state just enough to give a sense of it; for further details we refer to Khukhro’s book [Khu98] and Naik’s thesis [Nai13]. While the thesis is quite long, it also includes a reader’s guide, and collects many results scattered across the literature or well-known to the experts in one place, building the theory from the ground up and with many examples.

Recall that a Lie ring is an abelian group $L$ equipped with a bilinear map $[,]$ , called the Lie bracket, which is (1) alternating ( $[x,x]=0$ for all $x\in L$ ) and (2) satisfies the Jacobi identity $[x,[y,z]]+[y,[z,x]]+[z,[x,y]]=0$ for all $x,y,z\in L$ . Let $L^{1}=L$ , and $L^{i+1}=[L,L^{i}]$ , which is the subgroup (of the underlying additive group) generated by all elements of the form $[x,y]$ for $x\in L,y\in L^{i}$ . Then $L$ is nilpotent if $L^{c+1}=0$ for some finite $c$ ; the smallest such $c$ is the nilpotency class. (Lie algebras are just Lie rings over a field.)

The correspondence between Lie algebras and Lie groups over $\mathbb{R}$ uses the Baker–Campbell–Hausdorff (BCH) formula to convert between a Lie algebra and a Lie group, so we start there. The BCH formula is the solution to the problem that for non-commuting matrices $X,Y$ , $e^{X}e^{Y}\neq e^{X+Y}$ in general (where the matrix exponential here is defined using the power series for $e^{x}$ ). Rather, using commutators $[A,B]=AB-BA$ , we have

[TABLE]

where the remaining terms are iterated commutators that all involve at least 5 $X$ s and $Y$ s, and successive terms involve more and more. Applying the exponential function to a Lie algebra in characteristic zero yields a Lie group. The BCH formula can be inverted, giving the correspondence in the other direction.

In a nilpotent Lie algebra, the BCH formula has only finitely many nonzero terms, so issues of convergence disappear and we may consider applying the correspondence over finite fields or rings; the only remaining obstacle is that the denominators appearing in the formula must be units in the ring. It turns out that the correspondence continues to work in characteristic $p$ so long as one does not need to use the $p$ -th term of the BCH formula (which includes division by $p$ ), and the latter is avoided whenever a nilpotent group has class strictly less than $p$ . While the correspondence does apply more generally, here we only state the version for finite groups. For any fixed nilpotency class $c$ , computing the Lazard correspondence is efficient in theory; for how to compute it in practice when the groups are given by polycyclic presentations, see [CdGVL12].

Let $\mathbf{Grp}_{p,n,c}$ denote the set of finite groups of order $p^{n}$ and class $c$ , and let $\mathbf{Lie}_{p,n,c}$ denote the set of Lie rings of order $p^{n}$ and class $c$ . We note that for nilpotency class 2, the Baer correspondence is the same as the Lazard correspondence.

Theorem 6.4 (Lazard Correspondence for finite groups, see, e. g., [Khu98, Ch. 9 & 10] or [Nai13, Ch. 6]).

For any prime $p$ and any $1\leq c<p$ , there are functions $\mathbf{log}\colon\mathbf{Grp}_{p,n,c}\leftrightarrow\mathbf{Lie}_{p,n,c}:\mathbf{exp}$ such that (1) $\mathbf{log}$ and $\mathbf{exp}$ are inverses of one another, (2) two groups $G,H\in\mathbf{Grp}_{p,n,c}$ are isomorphic if and only if $\mathbf{log}(G)$ and $\mathbf{log}(H)$ are isomorphic, and (3) if $G$ has exponent $p$ , then the exponent of the underlying abelian group of $\mathbf{log}(G)$ has exponent $p$ . More strongly, $\mathbf{log}$ is an isomorphism of categories $\mathbf{Grp}_{p,n,c}\cong\mathbf{Lie}_{p,n,c}$ .

Part (3) can be found as a special case of [Nai13, Lemma 6.1.2].

For $p$ -groups given by $d\times d$ matrices over the finite field $\mathbb{F}_{p^{e}}$ , we will need one additional fact about the correspondence, namely that it also results in a Lie algebra of $d\times d$ matrices. (Being able to bound the dimension of the Lie algebra and work with it in a simple linear-algebraic way seems crucial for our reduction to work efficiently.) In fact, the BCH correspondence is easier to see for matrix groups using the matrix exponential and matrix logarithm; most of the work for BCH and Lazard is to get the correspondence to work even without the matrices. In some sense, this is thus the “original” setting of this correspondence. Though it is surely not new, we could not find a convenient reference for this fact about matrix groups over finite fields, so we state it formally here.

Proposition 6.5.

Let $G\leq\mathrm{GL}(d,\mathbb{F}_{p^{e}})$ be a finite $p$ -subgroup of $d\times d$ matrices over a finite field of characteristic $p$ . Then $\mathbf{log}(G)$ (from the Lazard correspondence) can be realized as a finite Lie subalgebra of $d\times d$ matrices over $\mathbb{F}_{p^{e}}$ . Given a generating set for $G$ of $m$ matrices, a generating set for $\mathbf{log}(G)$ can be constructed in $\mathrm{poly}(d,n,\log p)$ time.

Proof sketch.

$G$ is conjugate in $\mathrm{GL}(d,\mathbb{F}_{p^{e}})$ to a group of upper unitriangular matrices (upper triangular with all 1s on the diagonal); this is a standard fact that can be seen in several ways, for example, by noting that the group $U$ of all upper unitriangular matrices in $\mathrm{GL}(d,\mathbb{F}_{p^{e}})$ is a Sylow $p$ -subgroup, and applying Sylow’s Theorem. (Note that we do not need to do this conjugation algorithmically, though it is possible to do so; this is only for the proof.) Thus we may write every $g\in G$ as $1+n$ , where the sum here is the ordinary sum of matrices, $1$ denotes the identity matrix, and $n$ is strictly upper triangular. In particular, $n^{d}=0$ (ordinary exponentiation of matrices). Thus the Taylor series for the logarithm

[TABLE]

has only finitely many terms, so we may use it even over $\mathbb{F}_{p^{e}}$ .

In the Lie algebra we would like addition to be ordinary matrix addition; however, it turns out that we can write this addition in terms of a formula involving only commutators of group elements. Deriving this formula—the so-called first BCH inverse formula—for the matrices will be the same, step for step, as deriving the first inverse BCH formula in general. Since the formulae are identical, the additive structures on $\log(G)$ (using the matrix logarithm) and $\mathbf{log}(G)$ (from the Lazard correspondence) are identical. Similar considerations apply to the matrix commutator $[\log(g),\log(h)]=\log(g)\log(h)-\log(h)\log(g)$ , now using the second BCH inverse formula. Overall, we conclude that $\mathbf{log}(G)$ (using Lazard) and $\log(G)$ (using the matrix logarithm) are isomorphic Lie algebras.

Equivalently, we may note that the derivation of the inverse BCH formula in [Khu98, Nai13] uses a free nilpotent associative algebra as an ambient setting in which both the group and the corresponding Lie algebra live; in our case, we may replace the ambient free nilpotent associative algebra with the algebra of $d\times d$ strictly upper-triangular matrices over $\mathbb{F}_{p^{e}}$ , and all the derivations remain the same, mutatis mutandis. See, for example, [Khu98, p. 105, “Another remark…”]. ∎

6.1 Tensor notation

To see that those problems in Section 2 exhaust distinct isomorphism problems coming from change-of-basis on 3-way arrays (without introducing multiple arrays, or block structure, or going to subgroups of $\mathrm{GL}(n,\mathbb{F})$ ), and to keep track of the relation between all the above problems, we use standard mathematical notation for spaces of tensors (however, we won’t actually need the full abstract definition here; for a formal introduction see, e. g., [Lan12]).

Much as the three natural equivalence relations on matrices differ by how the groups act on the rows and columns, the same is true for tensors, but on the rows, columns, and depths (the “row-like” sub-arrays which are “perpendicular to the page”). There are two aspects to the notation: first, we keep track of which group is acting where by introducing names $U,V,W$ for the different vector spaces involved (this is also the standard basis-free notation, e. g., [Lan12]) and the groups acting on them, viz. $\mathrm{GL}(U),\mathrm{GL}(V),\mathrm{GL}(W)$ , etc. Thus, while it is possible that $\dim U=\dim V$ and thus $\mathrm{GL}(U)\cong\mathrm{GL}(V)$ , the notation helps make clear which group is acting where. Second, to take into account the contragradient (“inverse”) action, given a vector space $V$ , $V^{*}$ denotes its dual space, consisting of the linear functions $V\to\mathbb{F}$ . $\mathrm{GL}(V)$ acts on $V^{*}$ by sending a linear function $\ell\in V^{*}$ to the function $(g\cdot\ell)(v)=\ell(g^{-1}(v))$ . In this notation, the three different actions on matrices correspond to the notations

[TABLE]

When we have a matrix space $\mathcal{A}\subseteq M(n\times m,\mathbb{F})$ instead of a single matrix $A$ , we introduce an additional vector space $W$ , which is naturally isomorphic to $\mathcal{A}$ as a vector space. The action of $\mathrm{GL}(W)$ on $W$ serves to change basis within the matrix space, while leaving the space itself unchanged. In this notation, the problems we mention above are listed in Table 2.

To see that the family of problems in Table 2 exhausts the possible isomorphism problems on (undecorated) 3-way arrays, we note that in this notation there are some “different-looking” isomorphism problems that are trivially equivalent. The first is re-ordering the spaces: the isomorphism problem for $V\otimes V\otimes W$ is trivially equivalent to that for $V\otimes W\otimes V$ , simply by permuting indices, viz. $\mathtt{A}^{\prime}(i,j,k)=\mathtt{A}(i,k,j)$ . The second is about dual vector spaces. Although a vector space $V$ and its dual $V^{*}$ are technically different, and the group action differs by an inverse transpose, we can choose bases in $V$ and $V^{*}$ so that there is a linear isomorphism $V\to V^{*}$ which induces a bijection between orbits; for example, the orbits of the action $g\cdot A=gAg^{t}$ are the same as the orbits of the action $g\cdot A=g^{-t}Ag^{-1}$ , even though technically the former corresponds to $V\otimes V$ and the latter to $V^{*}\otimes V^{*}$ . This means that if we are considering the isomorphism problem in a tensor space such as $V\otimes V\otimes W$ , we can dualize each of the vector spaces $V,W$ separately, so long as when we do so, we dualize all instances of that vector space. For example, the isomorphism problem in $V\otimes V\otimes W$ is trivially equivalent to that in $V^{*}\otimes V^{*}\otimes W$ , but is not obviously equivalent to that in $V\otimes V^{*}\otimes W$ (though we will show such a reduction in this paper). As a consequence, when the action on all three directions comes from the same group, there are only two choices: $V\otimes V\otimes V$ and $V\otimes V\otimes V^{*}$ ; the remaining choices are trivially equivalent to one of these two. Using these, we see that the Table 2 in fact covers all possibilities up to these trivial equivalences.

Special cases of interest.

As in the case of isometry of matrices, wherein skew-symmetric and symmetric matrices play a special role, the same is true for isometry of matrix spaces. We say a matrix space $\mathcal{A}$ is symmetric if every matrix $A\in\mathcal{A}$ is symmetric, and similarly for skew-symmetric or alternating. Symmetric Matrix Space Isometry is equivalent to asking whether two polynomial maps from $\mathbb{F}^{n}$ to $\mathbb{F}^{m}$ specified by homogeneous quadratic forms are the same under the action of $\mathrm{GL}(n,\mathbb{F})\times\mathrm{GL}(m,\mathbb{F})$ . This problem has been proposed by Patarin [Pat96] as the basis of security for certain identification and signature schemes. Alternating Matrix Space Isometry is a particular case of interest, being in many ways a linear-algebraic analogue of GI [LQ17] (in addition to its close relation with Group Isomorphism for $p$ -groups of class 2 and exponent $p$ ).

Among trilinear forms, we can identify commutative cubic forms as those for which the coefficient 3-way array $\mathtt{A}$ is symmetric under all 6 permutations of its 3 indices $\mathtt{A}(i,j,k)=\mathtt{A}(j,i,k)=\dotsb=\mathtt{A}(k,i,j)$ . Over rings in which $6$ is a unit, cubic forms embed into trilinear forms via the standard map $f\mapsto T$ where $T_{i_{1},i_{2},i_{3}}=\frac{1}{3!}\sum_{\pi\in S_{3}}[x_{i_{\pi(1)}}x_{i_{\pi(2)}}x_{i_{\pi(3)}}]f$ , where $[x^{e}]f$ denotes the coefficient of $x^{e}$ in $f$ . Thus, over such rings Cubic Form Equivalence, as studied by Agrawal and Saxena [AS05, AS06], is a special case of Trilinear Form Equivalence.

Special cases of Algebra Isomorphism that are of interest include those of unital, associative algebras (commutative, e. g., as studied in [AS05, AS06, KS06], and non-commutative, such as group algebras) and Lie algebras.

Interesting cases of Matrix Space Conjugacy arise naturally whenever we have an algebra $A$ (say, associative or Lie) that is given to us as a subalgebra of the algebra $\mathrm{M}(n,\mathbb{F})$ of $n\times n$ matrices. Two such matrix algebras can be isomorphic as abstract algebras, but the more natural notion of “isomorphism of matrix algebras” is that of conjugacy, which respects both the algebra structure and the presentation in terms of matrices. This is the linear-algebraic analogue of permutational isomorphism (=conjugacy) of permutation groups, and has been studied for matrix Lie algebras [Gro12a] and associative matrix algebras [BW15]. (For those who know what a representation is: it also turns out to be equivalent to asking whether two representations of an algebra $A$ are equivalent up to automorphisms of $A$ , a problem which naturally arises as a subroutine in, e. g., Group Isomorphism, where it is often known as Action Compatibility, e. g., [GQ17].)

6.2 On the type of reduction

As these problems arise from several different fields, there are various properties one might hope for in the notion of reduction. Most of our reductions satisfy all of the following properties; see Remark 6.6 below for details.

Kernel reductions: there is a function $r$ from objects of one type to objects of the other such that $A\sim_{1}B$ if and only if $r(A)\sim_{2}r(B)$ . See [FG11] for some discussion on the relation between kernel reductions and more general reductions. 2. 2.

Efficiently computable: $r$ as above is computable in polynomial time. In fact, we believe, though have not checked fully, that all of our reductions are computable by uniform constant-depth (algebraic) circuits; over finite fields and algebraic number fields, we believe they are in uniform $\mathsf{TC}^{0}$ (the threshold gates are needed to do some simple arithmetic on the indices). That is, there is a small circuit which, given $A,i,j,k$ as input will output the $(i,j,k)$ entry of the output. 3. 3.

Polynomial-size projections (“p-projections”) [Val84]: each coordinate of the output is either one of the input variables or a constant, and the dimension of the output is polynomially bounded by the dimension of the input. (In fact, in many cases, the dimension of the output is only linearly larger than that of the input.) 4. 4.

Functorial. For each type of tensor there is naturally a category of such tensors (see [Mac71] for generalities on categories). For example, for 3TI, $U\otimes V\otimes W$ , the objects of the category are three-tensors, and a morphism between $\mathtt{A}\in U\otimes V\otimes W$ and $\mathtt{B}\in U^{\prime}\otimes V^{\prime}\otimes W^{\prime}$ is given by linear maps $P:U\to U^{\prime}$ , $Q\colon V\to V^{\prime}$ , and $R\colon W\to W^{\prime}$ such that $(P,Q,R)\cdot\mathtt{A}=\mathtt{B}$ . Isomorphism of 3-tensors is the special case when all three of $P,Q,R$ are invertible. Analogous categories can be defined for the other problems we consider, such as $V\otimes V^{*}\otimes W$ . A functor between two categories $\mathcal{C},\mathcal{D}$ is a pair of maps $(r,\overline{r})$ such that (1) $r$ maps objects of $\mathcal{C}$ to objects of $\mathcal{D}$ , (2) if $f\colon A\to B$ is a morphism in $\mathcal{C}$ , then $\overline{r}(f)\colon r(A)\to r(B)$ is a morphism in $\mathcal{D}$ , (3) for any $A\in\mathcal{C}$ , $\overline{r}(\operatorname{id}_{A})=\operatorname{id}_{r(A)}$ , and (4) if $f\colon A\to B$ and $g\colon B\to C$ are morphisms in $\mathcal{C}$ , then $\overline{r}(g\circ f)=\overline{r}(g)\circ\overline{r}(f)$ .

All our reductions are functorial on the categories in which we only consider isomorphisms; we suspect that they are also functorial on the entire categories (that is, including non-invertible homomorphisms). Furthermore, all our reductions yield another map $\overline{s}$ such that for any isomorphism $f^{\prime}\colon r(A)\to r(B)$ , $\overline{s}(f)$ is an isomorphism $A\to B$ , and $\overline{s}(\overline{r}(f))=f$ for any isomorphism $f\colon A\to B$ . If we only consider isomorphisms (and not other homomorphisms), we believe all known reductions between isomorphism problems have this form, cf. [Bab14]. 5. 5.

Containment in the sense used in the literature on wildness. There are several definitions in the literature which typically are equivalent when restricted to so-called matrix problems. For a few such definitions, see, e. g., [FGS19, Def. 1.2], [Ser00], or [SS07, Def. XIX.1.3]. For those problems in this paper to which the preceding definitions could apply, our reductions have the defined property. However, since we are working in a slightly more general setting, we would like to suggest the following natural generalization of these notions. Given two pairs $(G,V)$ and $(H,W)$ of algebraic groups $G,H$ acting on algebraic varieties $V,W$ , an algebraic containment is an algebraic map $r\colon V\to W$ (each coordinate of the output is given by a polynomial in the coordinates of the input) that is also a kernel reduction. In our case, all our spaces $V,W$ are affine space $\mathbb{F}^{n}$ for some $n$ , and our maps $r$ are in fact of degree 1. (It might be interesting to consider whether using higher degree allows for more efficient reductions.) We may also require it to be “functorial,” in the sense that there is a homomorphism of algebraic groups $\overline{r}\colon G\to H$ (simultaneously an algebraic map and a group homomorphism) such that

[TABLE]

and a section $\overline{s}\colon H\dashrightarrow G$ , such that $\overline{s}\circ\overline{r}=\operatorname{id}_{G}$ and

[TABLE]

where the dashed arrow above indicates that $\overline{s}$ need only be defined on a subset of $H$ , namely, those $h\in H$ such that there exist $v,v^{\prime}\in V$ with $h\cdot r(v)=r(v^{\prime})$ (but on this subset it should still act like a homomorphism, in the sense that $\overline{s}(hh^{\prime})=\overline{s}(h)\overline{s}(h^{\prime})$ ).

Remark 6.6.

We believe all of our reductions satisfy all of the above properties, with the possible exceptions that Prop. 7.3 and Prop. 8.1 are only projections (3) and algebraic containments (5) on the set of non-degenerate 3-tensors. These reductions still satisfy the other three properties on the set of all tensors: They are kernel reductions by construction; non-degeneracy presents no obstacle to polynomial-time computation (Observation 6.2); and two tensors are isomorphic iff their non-degenerate parts are isomorphic, so they are still functorial. The obstacle to being projections or algebraic containments on the set of all 3-tensors here is closely related to the fact that the map sending a matrix to its row echelon form (or even just zero-ing out a number of rows so that the remaining non-zero rows are linearly independent) is neither a projection nor an algebraic map. We would find it interesting if there were reductions for these results satisfying all of the above properties for all 3-tensors.

7 Reductions using the linear algebraic coloring

gadgets

In this section, we present the remaining reductions that use the linear algebraic coloring idea. We first reduce Graph Isomorphism to Alternating Matrix Space Isometry, using a gadget to restrict the full general linear group to the monomial matrix group, similar to that in Section 5. However, unlike in the case there, the use here requires slightly more care because of the alternating condition. We then reduce 3-Tensor Isomorphism to Alternating Matrix Space Isometry. The gadget there restricts the full general linear group to a parabolic subgroup. We note that such a gadget has appeared in [FGS19], while ours is a slight modification of that to be compatible with the alternating structure. Finally, we combine the two gadgets to give a search-to-decision reduction for Alternating Matrix Space Isometry over finite fields.

7.1 From Graph Isomorphism to Alternating Matrix Space Isometry

Proposition 7.1.

Graph Isomorphism* reduces to Alternating Matrix Space Isometry.*

For this proof we will need the concept of monomial isometry; see Some Groups above. Recall that a matrix is monomial if, equivalently, it can be written as $DP$ where $D$ is a nonsingular diagonal matrix and $P$ is a permutation matrix. We say two matrix spaces $\mathcal{A},\mathcal{B}$ are monomially isometric if there is some $M\in\mathrm{Mon}(n,\mathbb{F})$ such that $M^{t}\mathcal{A}M=\mathcal{B}$ .

Proof.

For a graph $G=([n],E)$ , let $\mathbf{A}_{G}$ be the alternating matrix tuple $\mathbf{A}_{G}=(A_{1},\dotsc,A_{|E|})$ with $A_{e}=E_{i,j}-E_{j,i}$ where $e=\{i,j\}\in E$ , and let $\mathcal{A}_{G}=\langle\mathbf{A}_{G}\rangle$ be the alternating matrix space spanned by that tuple. If $P$ is a permutation matrix giving an isomorphism between two graphs $G$ and $H$ , then it is easy to see that $P^{t}\mathcal{A}_{G}P=\mathcal{A}_{H}$ , and thus the corresponding matrix spaces are isometric. The converse direction is not clear (and may even be false). Instead, we will first extend the spaces $\mathcal{A}_{G}$ and $\mathcal{A}_{H}$ by gadgets which enforce that $\mathcal{A}_{G}$ and $\mathcal{A}_{H}$ are isometric iff they are monomially isometric (Lemma 7.2). Given Lemma 7.2, it thus suffices to reduce GI to Alternating Matrix Space Monomial Isometry.

Let us establish the latter reduction. We will show that $G\cong H$ if and only if $\mathcal{A}_{G}$ and $\mathcal{A}_{H}$ are monomially isometric. The forward direction was handled above. For the converse, suppose $P^{t}D^{t}\mathcal{A}_{G}DP=\mathcal{A}_{H}$ where $D$ is diagonal and $P$ is a permutation matrix. We claim that in this case, $P$ in fact gives an isomorphism from $G$ to $H$ . First let us establish that $P$ alone gives an isometry between $\mathcal{A}_{G}$ and $\mathcal{A}_{H}$ . Note that for any diagonal matrix $D=\operatorname{diag}(\alpha_{1},\dotsc,\alpha_{n})$ and any elementary alternating matrix $E_{i,j}-E_{j,i}$ , we have $D^{t}(E_{i,j}-E_{j,i})D=\alpha_{i}\alpha_{j}(E_{i,j}-E_{j,i})$ . Since $\mathcal{A}_{G}$ has a basis of elementary alternating matrices, the action of $D$ on this basis is just to re-scale each basis element, and thus $D^{t}\mathcal{A}_{G}D=\mathcal{A}_{G}$ . Thus, we have $P^{t}\mathcal{A}_{G}P=\mathcal{A}_{H}$ .

Finally, note that $P^{t}(E_{i,j}-E_{j,i})P=E_{\pi(i),\pi(j)}-E_{\pi(j),\pi(i)}=A_{\pi(e)}$ , where $\pi\in\mathrm{S}_{n}$ is the permutation corresponding to $P$ , and by abuse of notation we write $\pi(e)=\pi(\{i,j\})=\{\pi(i),\pi(j)\}$ as well. Since the elementary alternating matrices are linearly independent, and $\mathcal{A}_{H}$ has a basis of elementary alternating matrices, the only way for $A_{\pi(e)}$ to be in $\mathcal{A}_{H}$ is for it to be equal to one of the basis elements (one of the matrices in $\mathbf{A}_{H}$ ). In other words, $\pi(e)$ must be an edge of $H$ . As $P$ is invertible, we thus have that $P$ gives an isomorphism $G\cong H$ . ∎

Lemma 7.2.

Alternating Matrix Space Monomial Isometry* reduces to Alternating Matrix Space Isometry.*

More specifically, there is a $\mathrm{poly}(n,m)$ -time algorithm $r$ taking alternating matrix tuples to alternating matrix tuples, such that for $\mathbf{A},\mathbf{B}\in\Lambda(n,\mathbb{F})^{m}$ , the matrix spaces $\mathcal{A}=\langle\mathbf{A}\rangle$ and $\mathcal{B}=\langle\mathbf{B}\rangle$ are monomially isometric if and only if the matrix spaces $\langle r(\mathbf{A})\rangle$ and $\langle r(\mathbf{B})\rangle$ are isometric.

Proof.

For $\mathbf{A}=(A_{1},\dotsc,A_{m})\in\Lambda(n,\mathbb{F})^{m}$ , define $r(\mathbf{A})$ to be the alternating matrix tuple $\tilde{\mathbf{A}}=(\tilde{A}_{1},\dots,\tilde{A}_{m+n^{2}})\in\Lambda(n+n^{2},\mathbb{F})^{m+n^{2}}$ , where

For $k=1,\dots,m$ , $\tilde{A}_{k}=\begin{bmatrix}A_{k}&\mathbf{0}\\ \mathbf{0}&\mathbf{0}\\ \end{bmatrix}$ . 2. 2.

For $k=m+(i-1)n+j$ , $i\in[n]$ , $j\in[n]$ , $\tilde{A}_{k}$ is the elementary alternating matrix $E_{i,in+j}-E_{in+j,i}$ .

At this point, some readers may wish to look at the large matrix in Equation 1 and/or at Figure 3.

It is clear that $r$ can be computed in time $\tilde{O}((m+n^{2})(n^{2}+n))=\mathrm{poly}(n,m)$ . Given alternating matrix tuples $\mathbf{A},\mathbf{B}$ , let $\mathcal{A},\mathcal{B}$ be the corresponding matrix spaces they span, and let $\tilde{\mathcal{A}}=\langle r(\mathbf{A})\rangle$ and $\tilde{\mathcal{B}}=\langle r(\mathbf{B})\rangle$ . We claim that $\mathcal{A}$ and $\mathcal{B}$ are monomially isometric if and only if $\tilde{\mathcal{A}}$ and $\tilde{\mathcal{B}}$ are isometric.

To prove this, it will help to think of our matrix tuples $\mathbf{A},\tilde{\mathbf{A}}$ , etc. as (corresponding to) 3-way arrays, and to view these 3-way arrays from two different directions. Towards this end, write the 3-way array corresponding to $\mathbf{A}$ as

[TABLE]

where $a_{i,j}$ are vectors in $\mathbb{F}^{m}$ (“coming out of the page”), namely $a_{i,j}(k)=A_{k}(i,j)$ . The frontal slices of this array are precisely the matrices $A_{1},\dotsc,A_{m}$ .

The 3-way array corresponding to $\tilde{\mathbf{A}}=r(\mathbf{A})$ is then the $(n+1)n\times(n+1)n\times(m+n^{2})$ array:

[TABLE]

where $\tilde{a}_{i,j}=\begin{bmatrix}a_{i,j}\\ \mathbf{0}\end{bmatrix}\in\mathbb{F}^{m+n^{2}}$ (here think of the vector $a_{i,j}$ as a column vector, not coming out of the page; in the above array we then lay the column vector $\tilde{a}_{i,j}$ “on its side” so that it is coming out of the page), and $e_{i,j}:=e_{m+(i-1)n+j}\in\mathbb{F}^{m+n^{2}}$ , which we can equivalently write as $\begin{bmatrix}\mathbf{0}_{m}\\ e_{i}\otimes e_{j}\end{bmatrix}$ , where we think of $e_{i}\otimes e_{j}$ here as a vector of length $n^{2}$ . Note that all the the nonzero blocks besides upper-left “ $\mathtt{A}$ ” block only have nonzero entries that are strictly behind the nonzero entries in the upper-left block.

The second viewpoint, which we will also use below, is to consider the lateral slices of $\mathtt{A}$ , or equivalently, to view $\mathtt{A}$ from the side. When viewing $\mathtt{A}$ from the side, we see the $(n+1)n\times(m+n^{2})\times(n+1)n$ 3-way array:

[TABLE]

where every $\ell_{i,k}\in\mathbb{F}^{n^{2}+n}$ has only the first $n$ components being possibly non-zero, namely, $\ell_{i,k}(j)=A_{k}(i,j)$ for $i\in[n],j\in[n],k\in[m]$ and $\ell_{i,k}(j)=0$ for any $j>n$ .

For the only if direction,

suppose there exist $P\in\mathrm{Mon}(n,\mathbb{F})$ and $Q\in\mathrm{GL}(m,\mathbb{F})$ , such that $P^{t}\mathbf{A}P=\mathbf{B}^{Q}$ . We can construct $\tilde{P}\in\mathrm{Mon}(n+n^{2},\mathbb{F})$ and $\tilde{Q}\in\mathrm{GL}(m+n^{2},\mathbb{F})$ such that $\tilde{P}^{t}\tilde{\mathbf{A}}\tilde{P}=\tilde{\mathbf{B}}^{\tilde{Q}}$ . In fact, we will show that we can take $\tilde{P}=\begin{bmatrix}P&\mathbf{0}\\ \mathbf{0}&P^{\prime}\end{bmatrix}$ where $P^{\prime}\in\mathrm{Mon}(n^{2},\mathbb{F})$ , and $\tilde{Q}=\begin{bmatrix}Q&\mathbf{0}\\ \mathbf{0}&Q^{\prime}\end{bmatrix}$ where $Q^{\prime}\in\mathrm{Mon}(n^{2},\mathbb{F})$ . It is not hard to see that this form already ensures that the first $m$ matrices in the vector $\tilde{P}^{t}\tilde{\mathbf{A}}\tilde{P}$ and those of $\tilde{\mathbf{B}}^{\tilde{Q}}$ are the same, since when $\tilde{P},\tilde{Q}$ are of this form, those first $m$ matrices are controlled entirely by the $P$ (resp., $Q$ ) in the upper-left block of $\tilde{P}$ (resp., $\tilde{Q}$ ).

The remaining question is then how to design appropriate $P^{\prime}$ and $Q^{\prime}$ to take care of the last $n^{2}$ matrices in these tuples. This actually boils down to applying the following simple identity, but “in 3 dimensions:” Let $P$ be the permutation matrix corresponding to $\sigma\in\mathrm{S}_{n}$ , so that $Pe_{i}=e_{\sigma(i)}$ , and $e_{i}^{t}P=e_{\sigma^{-1}(i)}^{t}$ . Let $D=\operatorname{diag}(\alpha_{1},\dots,\alpha_{n})$ be a diagonal matrix. Then

[TABLE]

To see how Equation 3 helps in our setting, it is easier to focus attention on the lower right $n^{2}\times n^{2}$ sub-array of $\mathtt{A}^{lat}$ , which can be represented as a symbolic matrix

[TABLE]

Here we think of the $x_{i}$ ’s as independent variables, whose indices correspond to “how far into the page” they are. That is, $x_{i}$ corresponds to the vector $\vec{e}_{i}$ in $\mathtt{A}^{lat}$ , which is coming out of the page and has its only nonzero entry $i$ slices back from the page.

Then the action of $P$ permutes the $x_{i}$ ’s and multiplies them by some scalars, the action of $P^{\prime}$ is on the left-hand side, and the action of $Q^{\prime}$ is on the right-hand side. Let $\sigma$ be the permutation supporting $P$ . Then $P$ sends $M$ to

[TABLE]

So setting $P^{\prime}=\sigma\otimes I_{n}$ , $Q^{\prime}$ the monomial matrix supported by $\sigma\otimes I_{n}$ with scalars being $1/\alpha_{i}$ ’s, we have $P^{\prime t}M^{P}Q^{\prime}=M$ by Equation 3.

For the if direction,

suppose there exist $\tilde{P}\in\mathrm{GL}(n+n^{2},\mathbb{F})$ and $\tilde{Q}\in\mathrm{GL}(m+n^{2},\mathbb{F})$ , such that $\tilde{P}^{t}\tilde{\mathbf{A}}\tilde{P}=\tilde{\mathbf{B}}^{\tilde{Q}}$ . The key feature of these gadgets now comes into play: consider the lateral slices of $\tilde{\mathtt{A}}$ , which are the frontal slices of $\mathtt{A}^{lat}$ (which may be easier to visualize by looking at Equation 2 and Figure 3). The first $n$ lateral slices of $\tilde{\mathtt{A}}$ and $\tilde{\mathtt{B}}$ are of rank $\geq n$ and $<2n$ , while the other lateral slices are of rank $<n$ (in fact, they are of rank 1; note that without loss of generality we may assume $n>1$ , for the only $1\times 1$ alternating matrix space is the zero space). Furthermore, left multiplying a lateral slice by $\tilde{P}^{t}$ and right multiplying it by $\tilde{Q}$ does not change its rank. However, the action of $\tilde{P}$ here is by $\tilde{P}^{t}\tilde{\mathbf{A}}\tilde{P}$ , and while the $\tilde{P}^{t}$ here corresponds to left multiplication on the lateral slices (=frontal slices of $\mathtt{A}^{lat}$ ), the $\tilde{P}$ on the right here corresponds to taking linear combinations of the lateral slices. In other words, just as $\mathtt{A}^{lat}$ is the “side view” of $\tilde{\mathtt{A}}$ , $(\tilde{P}^{t}\mathtt{A}^{lat}\tilde{Q})^{\tilde{P}}$ is the side view of $(\tilde{P}^{t}\tilde{\mathtt{A}}\tilde{P})^{\tilde{Q}}$ . Taking linear combinations of the lateral slices could, in principle, alter their rank; we will use the latter possibility to show that $\tilde{P}$ must be of a constrained form.

Write $\tilde{P}=\begin{bmatrix}P_{1,1}&P_{1,2}\\ P_{2,1}&P_{2,2}\end{bmatrix}$ where $P_{1,1}$ is of size $n\times n$ . We first claim that $P_{1,2}=\mathbf{0}$ . For if not, then in $(\mathtt{A}^{lat})^{\tilde{P}}$ (the side view), one of the last $n^{2}$ frontal slices receives a nonzero contribution from one of the first $n$ frontal slices of $\mathtt{A}^{lat}$ . Looking at the form of these slices from Equation 2, we see that any such nonzero combination will have rank $\geq n$ , but this is a contradiction since the corresponding slice in $\mathtt{B}^{lat}$ has rank $1$ . Thus $P_{1,2}=\mathbf{0}$ , and therefore $P_{1,1}$ must be invertible, since $\tilde{P}$ is.

Finally, we claim that $P_{1,1}$ has to be a monomial matrix. If not, then some frontal slice of $(\mathtt{A}^{lat})^{\tilde{P}}$ among the first $n$ would have a contribution from more than one of these $n$ slices. Considering the lower-right $n^{2}\times n^{2}$ sub-matrix of such a slice, we see that it would have rank exactly $kn$ for some $k\geq 2$ , which is again a contradiction since the first $n$ slices of $\mathtt{B}^{lat}$ all have rank $<2n$ . It follows that $P_{1,1}^{t}A_{i}P_{1,1}$ , $i\in[m]$ , are in $\mathcal{B}$ , and thus $\mathcal{A}$ and $\mathcal{B}$ are monomially isometric via $P_{1,1}$ . ∎

7.2 From 3-Tensor Isomorphism to Matrix Space Isometry and Matrix Group Isomorphism

Proposition 7.3.

3-Tensor Isomorphism* reduces to Alternating Matrix Space Isometry. Symbolically, isomorphism in $U\otimes V\otimes W$ reduces to isomorphism in $V^{\prime}\otimes V^{\prime}\otimes W^{\prime}$ (or even to $\bigwedge^{2}V^{\prime}\otimes W$ ), where $\ell=\dim U\leq n=\dim V$ and $m=\dim W$ , $\dim V^{\prime}=\ell+7n+3$ and $\dim W^{\prime}=m+\ell(2n+1)+n(4n+2)$ .*

Proof.

We will exhibit a function $r$ from 3-way arrays to matrix tuples such that two 3-way arrays $\mathtt{A},\mathtt{B}\in T(\ell\times n\times m,\mathbb{F})$ which are non-degenerate as 3-tensors, are isomorphic as 3-tensors if and only if the matrix spaces $\langle r(\mathtt{A})\rangle,\langle r(\mathtt{B})\rangle$ are isometric. Note that we can assume our input tensors are non-degenerate by Observation 6.2. The construction is a bit involved, so we will first describe the construction in detail, and then prove the desired statement.

The gadget construction.

Given a 3-way array $\mathtt{A}\in T(\ell\times n\times m,\mathbb{F})$ , let $\mathbf{A}$ denote the corresponding $m$ -tuple of matrices, $\mathbf{A}\in M(\ell\times n)^{m}$ . The first step is to construct $s(\mathtt{A})\in\Lambda(\ell+n,\mathbb{F})^{m}$ , defined by $s(\mathtt{A})=(A_{1}^{\Lambda},\dotsc,A_{m}^{\Lambda})$ where $A_{i}^{\Lambda}=\begin{bmatrix}\mathbf{0}&A_{i}\\ -A_{i}^{t}&\mathbf{0}\end{bmatrix}$ . Already, note that if $\mathtt{A}\cong\mathtt{B}$ , then $s(\mathtt{A})$ and $s(\mathtt{B})$ are pseudo-isometric matrix tuples (equivalently, $\langle s(\mathtt{A})\rangle$ and $\langle s(\mathtt{B})\rangle$ are isometric matrix spaces).

However, it is not clear whether the converse should hold. Indeed, suppose $Ps(\mathtt{A})P^{T}=s(\mathtt{B})^{Q}$ for some $P\in\mathrm{GL}(\ell+n,\mathbb{F}),Q\in\mathrm{GL}(m,\mathbb{F})$ . If we write $P$ as a block matrix $\begin{bmatrix}P_{11}&P_{12}\\ P_{21}&P_{22}\end{bmatrix}$ , where $P_{11}\in M(\ell,\mathbb{F})$ and $P_{22}\in M(n,\mathbb{F})$ , then by considering the (1,2) block we get that $P_{11}A_{i}P_{22}^{t}-P_{21}^{t}A_{i}^{t}P_{12}=\sum_{j=1}^{m}q_{ij}B_{j}$ for all $i=1,\dotsc,m$ , whereas what we would want is the same equation but without the $P_{21}^{t}A_{i}^{t}P_{12}$ term. To remedy this, it would suffice if we could extend the tuple $s(\mathtt{A})$ to $r(\mathtt{A})$ so that any pseudo-isometry $(P,Q)$ between $r(\mathtt{A})$ and $r(\mathtt{B})$ will have $P_{21}=0$ .

To achieve this, we start from $s(\mathtt{A})=\mathbf{A}^{\Lambda}\in\Lambda(n+\ell,\mathbb{F})^{m}$ , and construct $r(\mathtt{A})\in\Lambda(\ell+7n+3,\mathbb{F})^{m+\ell(2n+1)+n(4n+2)}$ as follows. Here we write it out symbolically, on the next page is the same thing in matrix format, and in Figure 4 is a picture of the construction. Let $s=m+\ell(2n+1)+n(4n+2)$ . Write $r(\mathtt{A})=(\tilde{A}_{1},\dots,\tilde{A}_{s})$ , where $\tilde{A}_{i}\in\Lambda(\ell+7n+3,\mathbb{F})$ are defined as follows:

•

For $1\leq i\leq m$ , $\tilde{A}_{i}=\begin{bmatrix}A_{i}^{\Lambda}&\mathbf{0}\\ \mathbf{0}&\mathbf{0}\end{bmatrix}$ . Recall that $A_{i}^{\Lambda}\in\Lambda(\ell+n,\mathbb{F})$ .

•

For the next $\ell(2n+1)$ slices, that is, $m+1\leq i\leq m+\ell(2n+1)$ , we can naturally represent $i-m$ by $(p,q)$ where $p\in[\ell]$ , $q\in[2n+1]$ . We then let $\tilde{A}_{i}$ be the elementary alternating matrix $E_{p,\ell+n+q}-E_{\ell+n+q,p}$ .

•

For the next $n(4n+2)$ slices, that is $m+\ell(2n+1)+1\leq i\leq m+\ell(n+1)+n(4n+2)$ , we can naturally represent $i-m-\ell(n+1)$ by $(p,q)$ where $p\in[n]$ , $q\in[4n+2]$ . We then let $\tilde{A}_{i}$ be the elementary alternating matrix $E_{\ell+p,n+\ell+2n+1+q}-E_{n+\ell+2n+1+q,\ell+p}$ .

We may view the above construction is as follows. Write the frontal view of $\mathtt{A}$ as

[TABLE]

where $a_{i,j}^{\prime}\in\mathbb{F}^{m}$ , which we think of as a column vector, but when place in the above array, we think of it as coming out of the page.

Let $\tilde{\mathtt{A}}$ be the $3$ -way array whose frontal slices are $\tilde{A}_{i}$ , so $\tilde{\mathtt{A}}\in\mathrm{T}((\ell+7n+3)\times(\ell+7n+3)\times(m+\ell(2n+1)+n(4n+2)),\mathbb{F})$ . Then the frontal view of $\tilde{\mathtt{A}}$ is

[TABLE]

where $a_{i,j}=\begin{bmatrix}a_{i,j}^{\prime}\\ \mathbf{0}\end{bmatrix}\in\mathbb{F}^{m+\ell(2n+1)+n(4n+2)}$ , $e_{i,j}=\vec{e}_{m+(j-1)(2n+1)+i}$ , and $f_{i,j}=\vec{e}_{m+\ell(2n+1)+(j-1)(4n+2)+i}$ .

We now examine the ranks of the lateral slices $L_{i}$ of $\tilde{\mathtt{A}}$ . We claim:

[TABLE]

To see why these hold:

•

For $1\leq i\leq\ell$ , the $i$ th lateral slice $L_{i}$ is block-diagonal with two non-zero blocks. One block is of size $n\times\ell$ , and the other is $-I_{2n+1}$ . Therefore $2n+1\leq\mathrm{rk}(L_{i})\leq 3n+1$ .

•

For $\ell+1\leq i\leq\ell+n$ , the $i$ th lateral slice $L_{i}$ is also block-diagonal with two non-zero blocks. One block is of size $\ell\times n$ , and the other is $-I_{4n+2}$ . Therefore $4n+2\leq\mathrm{rk}(L_{i})\leq 5n+2$ .

•

For $\ell+n+1\leq i\leq\ell+n+6n+3$ , after rearranging the columns, the $i$ th lateral slice $L_{i}$ has one non-zero block which is is $I_{\ell}$ for the first $2n+1$ slices, and $I_{n}$ for the next $4n+2$ slices. Therefore $\mathrm{rk}(L_{i})=\ell$ or $n$ , and since we have assumed $\ell\leq n$ , in either case we have $\mathrm{rk}(L_{i})\leq n$ .

We then consider the ranks of the linear combinations of the lateral slices.

•

As long as the linear combination involves $L_{i}$ for $\ell+1\leq i\leq\ell+n$ , then the resulting matrix has rank at least $4n+2$ , because of the matrix $-I_{4n+2}$ in the last $4n+2$ rows.

•

If the linear combination does not involve $L_{i}$ for $\ell+1\leq i\leq\ell+n$ , then the resulting matrix has rank at most $4n+1$ , because in this case, there are at most $\ell+n+2n+1\leq 4n+1$ non-zero rows.

•

If the linear combination involves $L_{i}$ for $1\leq i\leq\ell$ , then the resulting matrix has rank at least $2n+1$ , because of the matrix $-I_{2n+1}$ in the $(\ell+n+1)$ th to the $(\ell+3n+1)$ th rows.

We then prove that $\mathtt{A}$ and $\mathtt{B}$ are isomorphic as 3-tensors if and only if $\langle r(\mathtt{A})\rangle$ and $\langle r(\mathtt{B})\rangle$ are isometric as matrix spaces. At first glance, the only if direction seems the easy one, as one expects to extend a 3-tensor isomorphism between $\mathtt{A}$ to $\mathtt{B}$ to an isometry between $\langle r(\mathtt{A})\rangle$ and $\langle r(\mathtt{B})\rangle$ easily. However, it turns out that this direction becomes somewhat technical because of the gadget introduced. This is handled in the following.

For the if direction,

suppose $P^{t}\tilde{\mathtt{A}}P=\tilde{\mathtt{B}}^{Q}$ , for some $P\in\mathrm{GL}(\ell+7n+3,\mathbb{F})$ and $Q\in\mathrm{GL}(m+\ell(2n+1)+n(4n+2),\mathbb{F})$ . Write $P$ as $\begin{bmatrix}P_{1,1}&P_{1,2}&P_{1,3}\\ P_{2,1}&P_{2,2}&P_{2,3}\\ P_{3,1}&P_{3,2}&P_{3,3}\end{bmatrix}$ , where $P_{1,1}$ is of size $\ell\times\ell$ , $P_{2,2}$ is of size $n\times n$ , and $P_{3,3}$ is of size $(6n+3)\times(6n+3)$ . By the discussion on the ranks of the linear combinations of the lateral slices, we have $P_{2,1}=\mathbf{0}$ , $P_{1,2}=\mathbf{0}$ , $P_{1,3}=\mathbf{0}$ , and $P_{2,3}=\mathbf{0}$ . So $P=\begin{bmatrix}P_{1,1}&\mathbf{0}&\mathbf{0}\\ \mathbf{0}&P_{2,2}&\mathbf{0}\\ P_{3,1}&P_{3,2}&P_{3,3}\end{bmatrix}$ , where $P_{1,1}$ , $P_{2,2}$ , $P_{3,3}$ are invertible. Then consider the action of such $P$ on the first $m$ frontal slices of $\tilde{\mathtt{A}}$ . The first $m$ frontal slices of $\tilde{\mathtt{A}}$ are of the form $\begin{bmatrix}\mathbf{0}&A_{i}&\mathbf{0}\\ -A_{i}^{t}&\mathbf{0}&\mathbf{0}\\ \mathbf{0}&\mathbf{0}&\mathbf{0}\end{bmatrix}$ , where $A_{i}$ is of size $\ell\times n$ . Then we have

[TABLE]

From the fact that $Q$ is invertible and $P^{t}\tilde{\mathtt{A}}P=\tilde{\mathtt{B}}^{Q}$ , by considering the $(1,2)$ block, we find that every frontal slice of $P_{11}^{t}\mathtt{A}P_{22}$ lies in $\langle\mathbf{B}\rangle$ (since the gadget does not affect the block-(1,2) position), which gives an isomorphism of tensors, as desired.

For the only if direction,

suppose $\mathtt{A}$ and $\mathtt{B}$ are isomorphic as 3-tensors, that is, $P^{t}\mathtt{A}Q=\mathtt{B}^{R}$ , for some $P\in\mathrm{GL}(\ell,\mathbb{F})$ , $Q\in\mathrm{GL}(n,\mathbb{F})$ , and $R\in\mathrm{GL}(m,\mathbb{F})$ .

We show that there exist $U\in\mathrm{GL}(6n+3,\mathbb{F})$ and $V\in\mathrm{GL}(\ell(2n+1)+n(4n+2),\mathbb{F})$ such that setting

[TABLE]

we have

[TABLE]

which will demonstrate that $r(\mathtt{A})$ and $r(\mathtt{B})$ are pseudo-isometric.

Since we are claiming that $\tilde{R}=\operatorname{diag}(R,V)\in\mathrm{GL}(m,\mathbb{F})\times\mathrm{GL}(\ell(2n+1)+n(4n+2),\mathbb{F})$ works, and $\tilde{R}$ is block-diagonal, it suffices to consider the first $m$ frontal slices separately from the remaining slices. For the first $m$ frontal slices, we have:

[TABLE]

It follows from the fact that $P^{t}\mathtt{A}Q=\mathtt{B}^{R}$ that the first $m$ frontal slices of $\tilde{Q}^{t}r(\mathtt{A})\tilde{Q}$ and of $r(\mathtt{B})^{\tilde{R}}$ are the same.

We now consider the remaining frontal slices separately. Towards that end, let $\tilde{\mathtt{A}}^{\prime}\in\mathrm{T}((\ell+7n+3)\times(\ell+7n+3)\times(\ell(2n+1)+n(4n+2)),\mathbb{F})$ be the 3-way array obtained by removing the first $m$ frontal slices from $\tilde{\mathtt{A}}$ . That is, the $i$ th frontal slice of $\tilde{\mathtt{A}}^{\prime}$ is the $(m+i)$ th frontal slice of $\tilde{\mathtt{A}}$ . Similarly construct $\tilde{\mathtt{B}}^{\prime}$ from $\tilde{\mathtt{B}}$ . We are left to show that $\tilde{\mathtt{A}}^{\prime}$ and $\tilde{\mathtt{B}}^{\prime}$ are pseudo-isometric under some $\tilde{Q}=\operatorname{diag}(P,Q,U)$ and $V$ . Note that $P$ and $Q$ are from the isomorphism between $\mathtt{A}$ and $\mathtt{B}$ , while $U$ and $V$ are what we still need to design.

We first note that both $\tilde{\mathtt{A}}^{\prime}$ and $\tilde{\mathtt{B}}^{\prime}$ can be viewed as a block 3-way array of size $4\times 4\times 2$ , whose two frontal slices are the block matrices

[TABLE]

where $\mathtt{E}$ is of size $\ell\times(2n+1)\times\ell(2n+1)$ , and $\mathtt{F}$ is of size $n\times(4n+2)\times n(4n+2)$ . Although these are already identical in $\mathtt{A}^{\prime},\mathtt{B}^{\prime}$ , the issue here is that $P$ and $Q$ may alter the slices of $\tilde{\mathtt{A}}^{\prime}$ when they act on $\mathtt{A}$ , so we need a way to “undo” this action to bring it back to the same slices in $\mathtt{B}^{\prime}$ .

We now claim that we may further handle these two block slices—the “ $E$ ” slices and the “ $F$ ”-slices—separately, that is, that we may take $U=\operatorname{diag}(U_{1},U_{2})$ and $V=\operatorname{diag}(V_{1},V_{2})$ where $U_{1}\in\mathrm{GL}(2n+1,\mathbb{F})$ , $U_{2}\in\mathrm{GL}(4n+2,\mathbb{F})$ , $V_{1}\in\mathrm{GL}(\ell(2n+1),\mathbb{F})$ , and $V_{2}\in\mathrm{GL}(n(4n+2),\mathbb{F})$ .

To handle $\mathtt{E}$ , first note that we have

[TABLE]

where $E\in M(\ell\times(2n+1),\mathbb{F})$ .

Now we examine the lateral slices of $\mathtt{E}$ . The $i$ th lateral slice of $\mathtt{E}$ (up to a suitable permutation) is

[TABLE]

where each $\mathbf{0}$ is of size $\ell\times\ell$ , $I_{\ell}$ is the $i$ th block, and there are $2n+1$ block matrices in total. The action of $P$ on $L_{i}$ is by left multiplication. So it sends $L_{i}$ to $P^{t}L_{i}=\begin{bmatrix}\mathbf{0}&\dots&\mathbf{0}&P^{t}&\mathbf{0}&\dots&\mathbf{0}\end{bmatrix}$ . If we set $U_{1}$ to be the identity and $V_{1}=\operatorname{diag}(P^{t},\dotsc,P^{t})$ , where there are $(2n+1)$ copies of $P^{t}$ on the diagonal, then we have $L_{i}V_{1}=P^{t}L_{i}$ , and thus $P^{t}\mathtt{E}U_{1}=\mathtt{E}^{V_{1}}$ .

It is easy to check that $\mathtt{F}$ can be handled in the same way, where now $R,U_{2},V_{2}$ play the roles that $P,U_{1},V_{1}$ played before, respectively. This produces the desired $U_{1}$ , $U_{2}$ , $V_{1}$ , and $V_{2}$ , and concludes the proof. ∎

Corollary 7.4.

3-Tensor Isomorphism* reduces to Symmetric Matrix Space Isometry.*

Proof.

In the proof of Proposition 7.3, we can easily replace $A_{i}^{\Lambda}$ with $A_{i}^{s}=\begin{bmatrix}\mathbf{0}&A_{i}\\ A_{i}^{t}&\mathbf{0}\end{bmatrix}$ , and the elementary alternating matrices with the elementary symmetric matrices, and the resulting proof goes through mutatis mutandis. ∎

Finally, we show how to reduce to Group Isomorphism for matrix groups. We begin with a lemma that we also need for the search-to-decision reduction below. We believe this lemma to be classical, but have not found a reference stating it in quite the form we need.

Lemma 7.5 (Constructive version of Baer’s correspondence for matrix groups).

Let $p$ be an odd prime. Over the finite field $\mathbb{F}=\mathbb{F}_{p^{e}}$ , Alternating Matrix Space Isometry is equivalent to Group Isomorphism for matrix groups over $\mathbb{F}$ that are $p$ -groups of class $2$ and exponent $p$ . More precisely, there are functions computable in time $\mathrm{poly}(n,m,\log|\mathbb{F}|)$ :

•

$G\colon\Lambda(n,\mathbb{F})^{m}\to\mathrm{M}(n+m+1,\mathbb{F})^{n+m}$ * and*

•

$\text{Alt}\colon\mathrm{M}(n,\mathbb{F})^{m}\to\Lambda(m,\mathbb{F})^{O(m^{2})}$ **

such that: (1) for an alternating bilinear map $\mathbf{A}$ , the group generated by $G(\mathbf{A})$ is the Baer group corresponding to $\mathbf{A}$ , (2) $G$ and Alt are mutually inverse, in the sense that the group generated by $G(\text{Alt}(M_{1},\dotsc,M_{m}))$ is isomorphic to the group generated by $M_{1},\dotsc,M_{m}$ , and conversely $\text{Alt}(G(\mathbf{A}))$ is pseudo-isometric to $\mathbf{A}$ .

Proof.

First, let $G$ be a $p$ -group of class $2$ and exponent $p$ given by $m$ generating matrices of size $n\times n$ over $\mathbb{F}$ . Then from the generating matrices of $G$ , we first compute a generating set of $[G,G]$ , by just computing all the commutators of the given generators. We can then remove those redundant elements from this generating set in time $\mathrm{poly}(\log|[G,G]|,\log|\mathbb{F}|)$ , using Luks’ result on computing with solvable matrix groups[Luk92]. We then compute a set of representatives of a non-redundant generating set of $G/[G,G]$ , again using Luks’s aforementioned result. From these data we can compute an alternating bilinear map representing the commutator map of $G$ in time $\mathrm{poly}(n,m,\log|F|)$ .

Conversely, let an alternating bilinear map be given by $\mathbf{A}=(A_{1},\dots,A_{m})\in\Lambda(n,\mathbb{F})^{m}$ . From $\mathbf{A}$ , for $i\in[n]$ , construct $B_{i}=[A_{1}\vec{e_{i}},\dots,A_{m}\vec{e_{i}}]\in\mathrm{M}(n\times m,\mathbb{F})$ . That is, the $j$ th column of $B_{i}$ is the $i$ th column of $A_{j}$ . Then for $i\in[n]$ , construct

[TABLE]

and for $j\in[m]$ , construct

[TABLE]

Let $G(\mathbf{A})$ be the matrix group generated by $\tilde{B}_{i}$ and $\tilde{C}_{j}$ . Then it can be verified easily that, $G(\mathbf{A})$ is isomorphic to the Baer group corresponding to the alternating bilinear map defined by $\mathbf{A}$ . In particular, $[G,G]\cong\mathbb{F}^{m}\cong\mathbb{Z}_{p}^{em}$ (isomorphism of abelian groups), and $G/[G,G]\cong\mathbb{F}^{n}\cong\mathbb{Z}_{p}^{en}$ . This construction can be done in time $\mathrm{poly}(n,m,\log|\mathbb{F}|)$ . ∎

Corollary 7.6.

Let $p$ be an odd prime. 3-Tensor Isomorphism over $\mathbb{F}=\mathbb{F}_{p^{e}}$ reduces to Group Isomorphism for $p$ -groups of class 2 and exponent $p$ given by matrices over $\mathbb{F}$ , in time $\mathrm{poly}(n,\log|\mathbb{F}|)$ (where $n$ is the max of the dimensions of the 3-tensor).

Proof.

Combine Proposition 7.3 with Lemma 7.5. Note that for this direction of the reduction, we only need the function $G$ from Lemma 7.5, which can be computed in time $\mathrm{poly}(n,\log p)$ . ∎

7.3 Search to decision reduction for $p$ -Group Isomorphism and Alternating Matrix Space Isometry

Theorem C.

Given an oracle deciding Alternating Matrix Space Isometry, there is a $q^{O(n)}\cdot n!=q^{\tilde{O}(n)}$ -time algorithm to find an isometry between two alternating matrix spaces $\mathcal{A},\mathcal{B}\in\Lambda(n,\mathbb{F}_{q})$ , if it exists, using at most $q^{O(n)}$ oracle queries each of size at most $O(n^{2})$ .

In particular, if Alternating Matrix Space Isometry can be decided in $q^{\tilde{O}(\sqrt{n})}$ time, then isometries between such spaces can be found in $q^{\tilde{O}(n)}$ time. See Question 10.5.

Proof.

As before, we first present the gadget construction, which is a combination of the two gadgets introduced in Sections 7.1 and 7.2, respectively. Then based on this gadget, we present the search-to-decision reduction.

Gadget construction.

Let $\mathbf{A}=(A_{1},\dots,A_{m})$ be an ordered linear basis of $\mathcal{A}$ , and let $\mathtt{A}\in\mathrm{M}(n\times n\times m,\mathbb{F}_{q})$ be the 3-way array constructed from $\mathbf{A}$ , so we can write

[TABLE]

where $a_{i,j}\in\mathbb{F}^{m}$ , $1\leq i<j\leq n$ thought of as a vector coming out of the page.

We first consider a $3$ -tensor $\tilde{\mathtt{A}}_{i}$ constructed from $\mathtt{A}$ , for any $1\leq i\leq n-1$ , as $\tilde{\mathtt{A}}_{i}=$

[TABLE]

Consider the lateral slices of $\tilde{\mathtt{A}}_{i}$ .

•

The first $i$ lateral slices have rank in $[2n,3n)$ . Note that the rank is strictly less than $3n$ because some tube fibers (coming out of the page) are $\mathbf{0}$ in the upper-left $n\times n$ sub-array.

•

The next $n-i$ lateral slices have rank in $[n,2n)$ .

•

The remaining $2ni+n$ lateral slices have rank in $[1,n)$ (since $i\geq 1$ .)

By combining the arguments for the two gadgets introduced in Sections 7.1 and 7.2 respectively, we have the following. From Sec. 7.2, for invertible matrices $P$ and $Q$ to satisfy $P^{t}\tilde{\mathtt{A}}_{i}P=\tilde{\mathtt{B}}_{i}^{Q}$ , $P$ has to be of the form $\begin{bmatrix}P_{1,1}&\mathbf{0}&\mathbf{0}\\ \mathbf{0}&P_{2,2}&\mathbf{0}\\ P_{3,1}&P_{3,2}&P_{3,3}\end{bmatrix}$ , where $P_{1,1}$ is of size $i\times i$ , $P_{2,2}$ is of size $(n-i)\times(n-i)$ , and $P_{3,3}$ is of size $(2ni+n)\times(2ni+n)$ . Furthermore, from Sec. 7.1, $P_{1,1}$ is a monomial matrix. In particular, if such $P$ and $Q$ exist, then it implies that $\mathtt{A}$ and $\mathtt{B}$ are isometric by a matrix of the form $\begin{bmatrix}P_{1,1}&\mathbf{0}\\ \mathbf{0}&P_{2,2}\end{bmatrix}$ where $P_{1,1}$ is a monomial matrix of size $i\times i$ . Note that the presence of $P_{3,i}$ , $i=1,2,3$ , does not interfere here, because of the argument in the if direction in the proof of Proposition 7.3. On the other hand, if $\mathtt{A}$ and $\mathtt{B}$ are isometric by a matrix of such form, then $\tilde{\mathtt{A}}_{i}$ and $\tilde{\mathtt{B}}_{i}$ are also isometric.

The search-to-decision reduction.

Given these preparations, we now present the search-to-decision reduction for Alternating Matrix Space Isometry. Recall that this requires us to use the decision oracle $\mathcal{O}$ to compute an explicit isometry transformation $P\in\mathrm{GL}(n,q)$ , if $\mathcal{A}$ and $\mathcal{B}$ are indeed isometric. Think of $P$ as sending the standard basis $(\vec{e_{1}},\dots,\vec{e_{n}})$ to another basis $(v_{1},\dots,v_{n})$ , where $e_{i}$ and $v_{i}$ are in $\mathbb{F}_{q}^{n}$ .

In the first step, we guess $v_{1}$ , the image of $e_{1}$ , and a complement subspace of $\langle v_{1}\rangle$ , at the cost of $q^{O(n)}$ . For each such guess, let $P_{1}$ be the matrix which sends $e_{1}\mapsto v_{1}$ and sends $\langle e_{2},\dotsc,e_{n}\rangle$ to the chosen complementary subspace in some fashion. We apply $P_{1}$ to $\mathtt{A}$ , and call the resulting $3$ -way array $\mathtt{A}$ in the following. Then construct $\tilde{\mathtt{A}}_{1}$ and $\tilde{\mathtt{B}}_{1}$ , and feed these two instances to the oracle $\mathcal{O}$ . Note that, since $P_{1,1}$ (using notation as above) must be monomial, any equivalence between $\tilde{\mathtt{A}}_{1}$ and $\tilde{\mathtt{B}}_{1}$ must preserve our choice of $v_{1}$ up to scale. Thus, clearly, if $\mathtt{A}$ and $\mathtt{B}$ are indeed isometric and we guess the correct image of $e_{1}$ , then the oracle $\mathcal{O}$ will return yes (and conversely).

In the second step, we guess $v_{2}$ , the image of $e_{2}$ , and a complement subspace of $\langle v_{2}\rangle$ within $\langle e_{2},\dots,e_{n}\rangle$ , at the cost of $q^{O(n)}$ . Note here that the previous step guarantees that there is an isometry respecting the direct sum decomposition $\langle v_{1}\rangle\oplus\langle e_{2},\dotsc,e_{n}\rangle$ , so we need only search for a complement of $v_{2}$ within $\langle e_{2},\dotsc,e_{n}\rangle$ , and not a more general complement of $\langle v_{1},v_{2}\rangle$ in all of $\mathbb{F}_{q}^{n}$ . This is crucial for the runtime, as at the $n/2$ step, the latter strategy would result in searching through $q^{\Theta(n^{2})}$ possibilities.

For each such guess, we apply the corresponding transformation to $\mathtt{A}$ (and again call the resulting $3$ -way array $\mathtt{A}$ ). Then construct $\tilde{\mathtt{A}}_{2}$ and $\tilde{\mathtt{B}}_{2}$ , and feed these two instances to the oracle $\mathcal{O}$ . Clearly, if $\mathcal{A}$ and $\mathcal{B}$ are indeed isometric and we guess the correct image of $e_{2}$ (and $e_{1}$ from the previous step), then the oracle $\mathcal{O}$ will return yes. However, there is a small caveat here, namely we may guess some image of $e_{2}$ , such that $\mathcal{A}$ and $\mathcal{B}$ are actually isometric by some matrix $P$ of the form $\begin{bmatrix}P_{1,1}&\mathbf{0}\\ \mathbf{0}&P_{2,2}\end{bmatrix}$ where $P_{1,1}$ is a monomial matrix of size $2$ . But this is fine, as it still means that our choices of $\{v_{1},v_{2}\}$ is correct as a set up to scaling. So we proceed.

In general, in the $i$ th step, we know that $\mathcal{A}$ and $\mathcal{B}$ are isometric by some $P=\begin{bmatrix}P_{1,1}&\mathbf{0}\\ \mathbf{0}&P_{2,2}\end{bmatrix}$ where $P_{1,1}$ is a monomial matrix of size $(i-1)\times(i-1)$ . We guess $v_{i}$ , the image of $e_{i}$ in $\langle e_{i},\dots,e_{n}\rangle$ , and a complement subspace of $\langle v_{i}\rangle$ within $\langle e_{i},\dots,e_{n}\rangle$ . This cost is $q^{O(n)}$ . For each such guess, we apply the corresponding transformation to $\mathtt{A}$ (and call the resulting $3$ -way array $\mathtt{A}$ ). Then construct $\tilde{\mathtt{A}}_{i}$ and $\tilde{\mathtt{B}}_{i}$ , and feed these two instances to the oracle $\mathcal{O}$ . Once we guess correctly, we ensure that $\mathcal{A}$ and $\mathcal{B}$ are isometric by $P=\begin{bmatrix}P_{1,1}&\mathbf{0}\\ \mathbf{0}&P_{2,2}\end{bmatrix}$ where $P_{1,1}$ is a monomial matrix of size $i\times i$ .

So after the $(n-1)$ th step, we know that $\mathcal{A}$ and $\mathcal{B}$ are isometric by a monomial transformation. The number of all monomial transformations is by $(q-1)^{n}\cdot n!\leq q^{n}\cdot 2^{n\log n}=q^{\tilde{O}(n)}$ . Therefore we can enumerate all monomial transformations and check correspondingly.

Note that all the instances we feed into the oracle $\mathcal{O}$ are of size $O(n^{2})$ . This concludes the proof. ∎

Corollary C (Search to decision for testing isomorphism of

a class of $p$ -groups).

Let $p$ be an odd prime. Given an oracle deciding isomorphism of $p$ -groups of class 2 and exponent $p$ given by generating matrices over $\mathbb{F}_{p}$ of size $\mathrm{poly}(n)$ , there is a $|G|^{O(\log\log|G|)}$ -time algorithm to find an isomorphism between such groups, using at most $\mathrm{poly}(|G|)$ oracle queries each of size at most $\mathrm{poly}(n)$ .

Proof.

The result follows from Theorem C with the constructive version of Baer’s Correspondence in the model of matrix groups over finite fields (Lemma 7.5).

In more detail, given Lemma 7.5 we can follow the procedure in the proof of Theorem C. For the given $p$ -groups, we compute their commutator maps. Then whenever we need to feed the decision oracle, we transform from the alternating bilinear map to a generating set of a $p$ -group of class $2$ and exponent $p$ with this bilinear map as the commutator map. After getting the desired pseudo-isometry for the alternating bilinear maps, we can easily recover an isomorphism between the originally given $p$ -groups. This concludes the proof. ∎

8 Other reductions for the Main Theorem A

In this section, we present other reductions to finish the proof of Theorem A. The reductions here are based on the constructions which may be summarized as “putting the given 3-way array to an appropriate corner of a larger 3-way array.” Such an idea is quite classical in the context of matrix problems and wildness [GP69]; here we use the same idea for problems on 3-way arrays.

8.1 From 3-Tensor Isomorphism to Matrix Space Conjugacy

Proposition 8.1.

3-Tensor Isomorphism* reduces to Matrix Space Conjugacy. Symbolically, $U\otimes V\otimes W$ reduces to $V^{\prime}\otimes V^{\prime*}\otimes W$ , where $\dim V^{\prime}=\dim U+\dim V$ .*

Proof.

The construction. For a 3-way array $\mathtt{A}\in\mathrm{T}(\ell\times n\times m,\mathbb{F})$ , let $\mathbf{A}=(A_{1},\dots,A_{m})\in\mathrm{M}(\ell\times n,\mathbb{F})^{m}$ be the matrix tuple consisting of frontal slices of $\mathtt{A}$ . Construct $\tilde{\mathbf{A}}=(\tilde{A}_{1},\dots,\tilde{A}_{m})\in\mathrm{M}(\ell+n,\mathbb{F})^{m}$ from $\mathbf{A}$ , where $\tilde{A}_{i}=\begin{bmatrix}\mathbf{0}&A_{i}\\ \mathbf{0}&\mathbf{0}\end{bmatrix}$ . See Figure 5.

Given two non-degenerate 3-way arrays $\mathtt{A},\mathtt{B}$ which we wish to test for isomorphism (we can assume non-degeneracy without loss of generality, see Observation 6.2), we claim that $\mathtt{A}\cong\mathtt{B}$ as 3-tensors if and only if the matrix spaces $\langle\tilde{\mathbf{A}}\rangle$ and $\langle\tilde{\mathbf{B}}\rangle$ are conjugate.

For the only if direction, since $\mathtt{A}$ and $\mathtt{B}$ are isomorphic as 3-tensors, there exist $P\in\mathrm{GL}(\ell,\mathbb{F})$ , $Q\in\mathrm{GL}(n,\mathbb{F})$ , and $R\in\mathrm{GL}(m,\mathbb{F})$ , such that $P\mathbf{A}Q=\mathbf{B}^{R}=(B_{1}^{\prime},\dots,B_{m}^{\prime})\in\mathrm{M}(\ell\times n,\mathbb{F})^{m}$ . Let $\tilde{P}=\begin{bmatrix}P^{-1}&\mathbf{0}\\ \mathbf{0}&Q\end{bmatrix}$ . Then $\tilde{P}^{-1}\tilde{A}_{i}\tilde{P}=\begin{bmatrix}P&\mathbf{0}\\ \mathbf{0}&Q^{-1}\end{bmatrix}\cdot\begin{bmatrix}\mathbf{0}&A_{i}\\ \mathbf{0}&\mathbf{0}\end{bmatrix}\cdot\begin{bmatrix}P^{-1}&\mathbf{0}\\ \mathbf{0}&Q\end{bmatrix}=\begin{bmatrix}\mathbf{0}&PA_{i}Q\\ \mathbf{0}&\mathbf{0}\end{bmatrix}=\begin{bmatrix}\mathbf{0}&B_{i}^{\prime}\\ \mathbf{0}&\mathbf{0}\end{bmatrix}$ . It follows that, $\tilde{P}^{-1}\tilde{\mathbf{A}}\tilde{P}=\tilde{\mathbf{B}}^{R}$ , which just says that $\tilde{P}^{-1}\langle\tilde{\mathbf{A}}\rangle\tilde{P}=\langle\tilde{\mathbf{B}}\rangle$ .

For the if direction, since $\langle\tilde{\mathbf{A}}\rangle$ and $\langle\tilde{\mathbf{B}}\rangle$ are conjugate, there exist $\tilde{P}\in\mathrm{GL}(\ell+n,\mathbb{F})$ , and $\tilde{R}\in\mathrm{GL}(m,\mathbb{F})$ , such that $\tilde{P}^{-1}\tilde{\mathbf{A}}\tilde{P}=\tilde{\mathbf{B}}^{\tilde{R}}$ . Write $\tilde{\mathbf{B}}^{\tilde{R}}:=\tilde{\mathbf{B}}^{\prime}=(\tilde{B}_{1}^{\prime},\dots,\tilde{B}_{m}^{\prime})$ , where $\tilde{B}_{i}^{\prime}=\begin{bmatrix}\mathbf{0}&B_{i}^{\prime}\\ \mathbf{0}&\mathbf{0}\end{bmatrix}$ , $B_{i}^{\prime}\in\mathrm{M}(\ell\times n,\mathbb{F})$ . Let $\tilde{P}=\begin{bmatrix}P_{1,1}&P_{1,2}\\ P_{2,1}&P_{2,2}\end{bmatrix}$ , where $P_{1,1}\in\mathrm{M}(\ell,\mathbb{F})$ . Then as $\tilde{\mathbf{A}}\tilde{P}=\tilde{P}\tilde{\mathbf{B}}^{\prime}$ , we have for every $i\in[m]$ ,

[TABLE]

This in particular implies that for every $i\in[m]$ , $P_{2,1}A_{i}=\mathbf{0}$ . In other words, every row of $P_{2,1}$ lies in the common left kernel of $A_{i}$ with $i\in[m]$ . Since $\mathbf{A}$ is non-degenerate, $P_{2,1}$ must be the zero matrix. It follows that $\tilde{P}=\begin{bmatrix}P_{1,1}&P_{1,2}\\ \mathbf{0}&P_{2,2}\end{bmatrix}\in\mathrm{GL}(\ell+n,\mathbb{F})$ , so $P_{1,1}$ and $P_{2,2}$ are both invertible matrices. By Equation 5, we have $P_{1,1}\mathbf{A}=\mathbf{B}^{\tilde{R}}P_{2,2}$ , where $P_{1,1}\in\mathrm{GL}(\ell,\mathbb{F})$ , $P_{2,2}\in\mathrm{GL}(n,\mathbb{F})$ , and $\tilde{R}\in\mathrm{GL}(m,\mathbb{F})$ , which just says that $\mathtt{A}$ and $\mathtt{B}$ are isomorphic as 3-tensors. ∎

Corollary 8.2.

3-Tensor Isomorphism* reduces to*

Matrix Lie Algebra Conjugacy*, where $L$ is commutative;* 2. 2.

Associative Matrix Algebra Conjugacy*, where $A$ is commutative (and in fact has the property that $ab=0$ for all $a,b\in A$ ; note that $A$ is not unital);* 3. 3.

Matrix Lie Algebra Conjugacy*, where $L$ is solvable of derived length 2, and $L/[L,L]\cong\mathbb{F}$ ; and,* 4. 4.

Associative Matrix Algebra Conjugacy*, where the Jacobson radical $J(A)$ squares to zero, and $A/J(A)\cong\mathbb{F}$ .*

Proof.

We use the notation from the proof of Proposition 8.1. Note that the matrix spaces constructed there, e. g., $\tilde{\mathbf{A}}$ , are all subspaces of the $(\ell+n)\times(\ell+n)$ matrix space $\mathcal{U}:=\begin{bmatrix}\mathbf{0}&M(\ell\times n,\mathbb{F})\\ \mathbf{0}&\mathbf{0}\end{bmatrix}$ .

For (1) and (2), observe that for any two matrices $A,A^{\prime}\in\mathcal{U}$ , we have $AA^{\prime}=0$ , and thus $[A,A^{\prime}]=AA^{\prime}-A^{\prime}A=0$ as well. Thus any matrix subspace of $\mathcal{U}$ is both a commutative matrix Lie algebra and a commutative associative matrix algebra with zero product.

For (3) and (4), we note that we can alter the construction of Proposition 8.1 by including the matrix $M_{0}=\begin{bmatrix}I_{\ell}&\mathbf{0}\\ \mathbf{0}&\mathbf{0}\end{bmatrix}$ in both matrix spaces $\tilde{\mathcal{A}}$ and $\tilde{\mathcal{B}}$ without disrupting the reduction. Indeed, for the forward direction we have that (again, following notation as above) $\tilde{P}^{-1}\begin{bmatrix}I_{\ell}&\mathbf{0}\\ \mathbf{0}&\mathbf{0}\end{bmatrix}\tilde{P}=\begin{bmatrix}P&\mathbf{0}\\ \mathbf{0}&Q^{-1}\end{bmatrix}\begin{bmatrix}I_{\ell}&\mathbf{0}\\ \mathbf{0}&\mathbf{0}\end{bmatrix}\begin{bmatrix}P^{-1}&\mathbf{0}\\ \mathbf{0}&Q\end{bmatrix}=\begin{bmatrix}I_{\ell}&\mathbf{0}\\ \mathbf{0}&\mathbf{0}\end{bmatrix}$ .

For the reverse direction, we then have that for $\tilde{\mathbf{B}}^{\prime}=\tilde{\mathbf{B}}^{\tilde{R}}$ , we have $\tilde{B}^{\prime}_{i}=\begin{bmatrix}\alpha I_{d}&B_{i}^{\prime}\\ \mathbf{0}&\mathbf{0}\end{bmatrix}$ . Let $\tilde{P}=\begin{bmatrix}P_{1,1}&P_{1,2}\\ P_{2,1}&P_{2,2}\end{bmatrix}$ , where $P_{1,1}\in\mathrm{M}(\ell,\mathbb{F})$ . Then as $\tilde{\mathbf{A}}\tilde{P}=\tilde{P}\tilde{\mathbf{B}}^{\prime}$ , we have for every $i\in[m]$ ,

[TABLE]

Considering the (2,1) block of this equation, we find that if $\alpha\neq 0$ , then immediately $P_{2,1}=\mathbf{0}$ . But even if $\alpha=0$ , then we are back to the same argument as in Proposition 8.1, namely that by the non-degeneracy of $\mathbf{A}$ , we still get $P_{2,1}=\mathbf{0}$ by considering the (2,2) block. The remainder of the argument only depended on the (1,2) block of the preceding equation, which is the same as before.

Finally, to see the structure of the corresponding algebras, we must consider how our new element $M_{0}$ interacts with the others. Easy calculations reveal:

[TABLE]

(3) For the structure of the Lie algebra, we have from the above equations that any commutator is either 0 or lands in $\mathcal{U}$ . And since $[M_{0},\tilde{A}_{i}]=\tilde{A}_{i}$ , we have that $[L,L]$ is the subspace of $\mathcal{U}$ that we started with before including $M_{0}$ . Since everything in that subspace commutes, we get that $[[L,L],[L,L]]=0$ , and thus the Lie algebra is solvable of derived length 2. Moreover, $L/[L,L]$ is spanned by the image of $M_{0}$ , whence it is isomorphic to $\mathbb{F}$ .

(4) Recall that for rings without an identity, the Jacobson radical can be characterized as $J(A)=\{a\in A|(\forall b\in A)(\exists c\in A)[c+ba=cba]\}$ [Lam91, p. 63]. Note that the only nontrivial cases to check are those for which $b=M_{0}$ , since otherwise $ba=0$ and then we may take $c=0$ as well. So we have $J(A)=\{a\in A|(\exists c\in A)[c+M_{0}a=cM_{0}a]\}$ . But since $M_{0}$ is a left identity, this latter equation is just $c+a=ca$ . For any $a\in\mathcal{U}$ , we may take $c=-a$ , since then both sides of the equation are zero, and thus $J(A)$ includes all the matrices in the original space from Proposition 8.1. However, $M_{0}\notin J(A)$ , for there is no $c$ such that $c+M_{0}=cM_{0}$ : any element of $A$ can be written $\alpha M_{0}+u$ for some $u\in\mathcal{U}$ . Writing $c$ this way, we are trying to solve the equation $\alpha M_{0}+u+M_{0}=(\alpha M_{0}+u)M_{0}=\alpha M_{0}$ . Thus we conclude $u=0$ , and then we get that $\alpha+1=\alpha$ , a contradiction. So $M_{0}\notin J(A)$ , and thus $A/J(A)$ is spanned by the image of $M_{0}$ , whence it is isomorphic to $\mathbb{F}$ . ∎

8.2 From Matrix Space Isometry to Algebra Isomorphism and Trilinear Form Equivalence

Proposition 8.3.

Matrix Space Isometry* reduces to Algebra Isomorphism and Trilinear Form Equivalence. Symbolically, $V\otimes V\otimes W$ reduces to $V^{\prime}\otimes V^{\prime}\otimes V^{\prime*}$ and to $V^{\prime}\otimes V^{\prime}\otimes V^{\prime}$ , where $\dim V^{\prime}=\dim V+\dim W$ .*

Proof.

The construction. Given a matrix space $\mathcal{A}$ by an ordered linear basis $\mathbf{A}=(A_{1},\dotsc,A_{m})$ , construct the 3-way array $\mathtt{A}^{\prime}\in T((n+m)\times(n+m)\times(n+m),\mathbb{F})$ whose frontal slices are:

[TABLE]

Let $\text{Alg}(\mathtt{A}^{\prime})$ denote the algebra whose structure constants are defined by $\mathtt{A}^{\prime}$ , and let $f_{\mathtt{A}^{\prime}}$ denote the trilinear form whose coefficients are given by $\mathtt{A}^{\prime}$ .

Given two matrix spaces $\mathcal{A},\mathcal{B}$ , we claim that $\mathcal{A}$ and $\mathcal{B}$ are isometric if and only if $\text{Alg}(\mathtt{A}^{\prime})\cong\text{Alg}(\mathtt{B}^{\prime})$ (isomorphism of algebras) if and only if $f_{\mathtt{A}^{\prime}}$ and $f_{\mathtt{A}^{\prime}}$ are equivalent as trilinear forms. The proofs are broken into the following two lemmas, which then complete the proof of the proposition. ∎

Lemma 8.4.

Let notation be as above. The matrix spaces $\mathcal{A},\mathcal{B}$ are isometric if and only if $\text{Alg}(\mathtt{A}^{\prime})$ and $\text{Alg}(\mathtt{B}^{\prime})$ are isomorphic.

Proof.

Let $\mathbf{A},\mathbf{B}$ be the ordered bases of $\mathcal{A},\mathcal{B}$ , respectively. Recall that $\mathcal{A},\mathcal{B}$ are isometric if and only if there exist $(P,R)\in\mathrm{GL}(n,\mathbb{F})\times\mathrm{GL}(m,\mathbb{F})$ such that $P^{t}\mathbf{A}P=\mathbf{B}^{R}$ . Also recall that $\text{Alg}(\mathtt{A}^{\prime})$ and $\text{Alg}(\mathtt{B}^{\prime})$ are isomorphic as algebras if and only if there exists $\tilde{P}\in\mathrm{GL}(n+m,\mathbb{F})$ such that $\tilde{P}^{t}\mathbf{A}^{\prime}\tilde{P}=\mathbf{B}^{\prime\tilde{P}}$ . Since $A_{i}$ (resp. $B_{i}$ ) form a linear basis of $\mathcal{A}$ (resp. $\mathcal{B}$ ), we have that $A_{i}$ (resp. $B_{i}$ ) are linearly independent.

The only if direction

is easy to verify. Given an isometry $(P,R)$ between $\mathcal{A}$ and $\mathcal{B}$ , let $\tilde{P}=\begin{bmatrix}P&\mathbf{0}\\ \mathbf{0}&R\end{bmatrix}$ . Let $\tilde{P}^{t}\mathbf{A}^{\prime}\tilde{P}=(A_{1}^{\prime\prime},\dots,A_{n+m}^{\prime\prime})$ . Then for $i\in[n]$ , $A_{i}^{\prime\prime}=\mathbf{0}$ . For $n+1\leq i\leq n+m$ , $A_{i}^{\prime\prime}=\begin{bmatrix}P^{t}A_{i}P&\mathbf{0}\\ \mathbf{0}&\mathbf{0}\end{bmatrix}$ . Let $\mathbf{B}^{\prime\tilde{P}}=(B_{1}^{\prime\prime},\dots,B_{n+m}^{\prime\prime})$ . Then for $i\in[n]$ , $B_{i}^{\prime\prime}=\mathbf{0}$ . For $n+1\leq i\leq n+m$ , $B_{i}^{\prime\prime}$ is the $(i-n)$ th matrix in $\mathbf{B}^{R}$ , which in turn equals $P^{t}A_{i}P$ by the assumption on $P$ and $R$ . This proves the only if direction.

For the if direction,

let $\tilde{P}=\begin{bmatrix}P&X\\ Y&R\end{bmatrix}\in\mathrm{GL}(n+m,\mathbb{F})$ be an algebra isomorphism, where $P$ is of size $n\times n$ . Let $\tilde{P}\mathbf{A}^{\prime}\tilde{P}^{t}=(A_{1}^{\prime\prime},\dots,A_{n+m}^{\prime\prime})$ , and $\mathbf{B}^{\prime\tilde{P}}=(B_{1}^{\prime\prime},\dots,B_{n+m}^{\prime\prime})$ . Since for $i\in[n]$ , $A_{i}^{\prime}=\mathbf{0}$ , we have $A_{i}^{\prime\prime}=\mathbf{0}=B_{i}^{\prime\prime}$ . Therefore $Y$ has to be $\mathbf{0}$ , because $B_{i}$ ’s are linearly independent. It follows that $\tilde{P}=\begin{bmatrix}P&X\\ \mathbf{0}&R\end{bmatrix}$ , where $P$ and $R$ are invertible. So for $1\leq i\leq m$ , we have $\tilde{P}^{t}A_{i+n}^{\prime}\tilde{P}=\begin{bmatrix}P^{t}&\mathbf{0}\\ X^{t}&R^{t}\end{bmatrix}\begin{bmatrix}A_{i}&\mathbf{0}\\ \mathbf{0}&\mathbf{0}\end{bmatrix}\begin{bmatrix}P&X\\ \mathbf{0}&R\end{bmatrix}=\begin{bmatrix}P^{t}A_{i}P&P^{t}A_{i}X\\ X^{t}A_{i}P&X^{t}A_{i}X\end{bmatrix}$ . Also the last $m$ matrices in $\mathbf{B}^{\prime\tilde{P}}$ are $\begin{bmatrix}B_{i}^{\prime\prime}&\mathbf{0}\\ \mathbf{0}&\mathbf{0}\end{bmatrix}$ , where $B_{i}^{\prime\prime}$ is the $i$ th matrix in $\mathbf{B}^{R}$ . This implies that $P\in\mathrm{GL}(n,\mathbb{F})$ and $R\in\mathrm{GL}(m,\mathbb{F})$ together form an isometry between $\mathcal{A}$ and $\mathcal{B}$ . ∎

Corollary 8.5.

Matrix Space Isometry* reduces to*

Associative Algebra Isomorphism*, for algebras that are commutative and unital;* 2. 2.

Associative Algebra Isomorphism*, for algebras that are commutative and 3-nilpotent ( $abc=0$ for all $a,b,c\in A$ ); and,* 3. 3.

Lie Algebra Isomorphism*, for Lie algebras that are 2-step nilpotent ( $[u,[v,w]]=0$ for all $u,v,w\in L$ ).*

Proof.

We follow the notation from the proof of Lemma 8.4. We begin by observing that $\text{Alg}(\mathtt{A}^{\prime})$ is a 3-nilpotent algebra, and therefore is automatically associative. Let $V^{\prime}=V\oplus W$ , where $\dim V=n$ , $\dim W=m$ , and, as a subspace of $V^{\prime}\cong\mathbb{F}^{n+m}$ , $V$ has a basis given by $e_{1},\dotsc,e_{n}$ and $W$ has a basis given by $e_{n+1},\dotsc,e_{n+m}$ . Let $\circ$ denote the product in $\text{Alg}(\mathtt{A}^{\prime})$ , so that $x_{i}\circ x_{j}=\sum_{k}\mathtt{A}^{\prime}(i,j,k)x_{k}$ . Note that because the lower $m$ rows and the rightmost $m$ columns of each frontal slice of $\mathtt{A}^{\prime}$ are zero, we have that $w\circ x=x\circ w=0$ for any $w\in W$ and any $x\in V^{\prime}$ . Thus only way to get a nonzero product is of the form $v\circ v^{\prime}$ where $v,v^{\prime}\in V$ , and here the product ends up in $W$ , since the only nonzero frontal slices are $n+1,\dotsc,n+m$ . Since any nonzero product ends up in $W$ , and anything in $W$ times anything at all is zero, we have that $abc=0$ for all $a,b,c\in\text{Alg}(\mathtt{A}^{\prime})$ , that is, $\text{Alg}(\mathtt{A}^{\prime})$ is 3-nilpotent. Any 3-nilpotent algebra is automatically associative, since the associativity condition only depends on products of three elements.

(2) If instead of general Matrix Space Isometry, we start from Symmetric Matrix Space Isometry (which is also $\mathsf{3TI}$ -complete by Corollary 7.4), then we see that the algebra is commutative, for we then have $\mathtt{A}^{\prime}(i,j,k)=\mathtt{A}^{\prime}(j,i,k)$ , which corresponds to $x_{i}\circ x_{j}=x_{j}\circ x_{i}$ .

(1) As is standard, from the algebra $A=\text{Alg}(\mathtt{A}^{\prime})$ , we may adjoin a unit by considering $A^{\prime}=A[e]/(e\circ x=x\circ e=x|x\in A^{\prime})$ . In terms of vector spaces, we have $A^{\prime}\cong A\oplus\mathbb{F}$ , where the new $\mathbb{F}$ summand is spanned by the identity $e$ . This standard algebraic construction has the property that two such algebras $A,B$ are isomorphic if and only if their corresponding unit-adjoined algebras $A^{\prime},B^{\prime}$ are (see, e. g., [Dor32, Wik19]).

(3) By starting from an alternating matrix space $\mathcal{A}$ (and noting that Alternating Matrix Space Isometry is still $\mathsf{3TI}$ -complete, by Corollary 7.4), we get that $\text{Alg}(\mathtt{A}^{\prime})$ is alternating, that is, $v\circ v=0$ . Since we still have that it is 3-nilpotent, $a\circ b\circ c=0$ , we find that $\circ$ automatically satisfies the Jacobi identity. An alternating product satisfying the Jacobi identity is, by definition, a Lie bracket (that is, we can define $[v,w]:=v\circ w$ ), and thus we get a Lie algebra with structure constants $\mathtt{A}^{\prime}$ . Translating the 3-nilpotency condition $a\circ b\circ c=0$ into the Lie bracket notation, we get $[a,[b,c]]=0$ , or in other words that the Lie algebra is nilpotent of class 2. ∎

Corollary 8.6.

3-Tensor Isomorphism* reduces to Cubic Form Equivalence.*

Proof.

Agrawal and Saxena [AS06] show that Commutative Algebra Isomorphism reduces to Cubic Form Equivalence. Combine with Corollary 8.5(1). ∎

The reduction from $V\otimes V\otimes W$ to $V^{\prime}\otimes V^{\prime}\otimes V^{\prime}$ is achieved by the same construction.

Lemma 8.7.

Let $\mathbf{A},\mathbf{B},\mathbf{A}^{\prime}$ , and $\mathbf{B}^{\prime}$ be as above. Then $\mathbf{A}$ and $\mathbf{B}$ are pseudo-isometric if and only if $\mathbf{A}^{\prime}$ and $\mathbf{B}^{\prime}$ are isomorphic as trilinear forms.

Proof.

Recall that $\mathbf{A}$ and $\mathbf{B}$ are pseudo-isometric if there exist $P\in\mathrm{GL}(n,\mathbb{F}),R\in\mathrm{GL}(m,\mathbb{F})$ such that $P^{t}\mathbf{A}P=\mathbf{B}^{R}$ . Also recall that $\mathbf{A}^{\prime}$ and $\mathbf{B}^{\prime}$ are equivalent as trilinear forms if there exists $\tilde{P}\in\mathrm{GL}(n+m,\mathbb{F})$ such that $\tilde{P}^{t}\mathbf{A}^{\prime\tilde{P}}\tilde{P}=\mathbf{B}^{\prime}$ . Since $A_{i}$ (resp. $B_{i}$ ) form a linear basis of $\mathcal{A}$ , we have that $A_{i}$ (resp. $B_{i}$ ) are linearly independent.

The only if direction

is easy to verify. Given an pseudo-isometry $P,R$ between $\mathbf{A}$ and $\mathbf{B}$ , let $\tilde{P}=\begin{bmatrix}P&\mathbf{0}\\ \mathbf{0}&R^{-1}\end{bmatrix}$ . Then it can be verified easily that $\tilde{P}$ is a trilinear form equivalence between $\mathbf{A}^{\prime}$ and $\mathbf{B}^{\prime}$ , following the same approach in the proof of Lemma 8.4.

For the if direction,

write $\tilde{P}=\begin{bmatrix}P&X\\ Y&R\end{bmatrix}\in\mathrm{GL}(n+m,\mathbb{F})$ be a trilinear form equivalence between $\mathbf{A}^{\prime}$ and $\mathbf{B}^{\prime}$ . We first observe that the last $m$ matrices in $\tilde{P}^{t}\mathbf{A}^{\prime}\tilde{P}$ are still linearly independent. Then, because of the first $n$ matrices in $\mathbf{B}^{\prime}$ are all zero matrices, $Y$ has to be the zero matrix. It follows that $\tilde{P}=\begin{bmatrix}P&X\\ \mathbf{0}&R\end{bmatrix}$ , where $P$ and $R$ are invertible. Then it can be verified easily that $P$ and $R^{-1}$ form an pseudo-isometry between $\mathbf{A}$ and $\mathbf{B}$ , following the same approach in the proof of Lemma 8.4. ∎

9 Reducing $d$ -Tensor Isomorphism to 3-Tensor Isomorphism

Theorem B.

$d$ -Tensor Isomorphism reduces to Algebra Isomorphism. If the input tensor has size $n_{1}\times n_{2}\times\dotsb\times n_{d}$ , then the output algebra has dimension $O(d^{2}n^{d-1})$ where $n=\max\{n_{i}\}$ .

Remark 9.1.

One cannot do too much better in terms of size of the output, as the following argument suggests. Over finite fields, we may count the number of orbits, which provides a rigorous lower bound on the size blow-up of any kernel reduction (see, e. g., [FG11, Sec. 4.2.4]). Over infinite fields, if we consider algebraic reductions, they must preserve dimension, so we can make a similar (albeit more heuristic) argument by considering the “dimension” of the set of orbits. Here we have put “dimension” in quotes because the set of orbits is not a well-behaved topological space (it is typically not even $T_{1}$ ), but even in this case the same argument should essentially hold. The space of $n\times n\times\dotsb\times n$ $d$ -tensors has dimension $n^{d}$ , and the group $\mathrm{GL}_{n}\times\dotsb\times\mathrm{GL}_{n}$ has dimension $dn^{2}$ , so the “dimension” of the set of orbits is at least $n^{d}-dn^{2}\sim n^{d}$ ( $d\geq 3$ ); over $\mathbb{F}_{q}$ , the number of orbits is at least $q^{n^{d}-dn^{2}}$ . For algebras of dimension $N$ , the space of such algebras is $\leq N^{3}$ -dimensional, so the “dimension” of the set of orbits is at most $N^{3}$ ; over $\mathbb{F}_{q}$ , the number of orbits is at most $q^{N^{3}}$ . Thus we need $N^{3}\gtrsim n^{d}$ , whence $N\gtrsim n^{d/3}$ .

Proof idea.

The idea here is similar to the reduction from 3TI to Algebra Isomorphism: we want to create an algebra in which all products eventually land in an ideal, and multiplication of algebra elements by elements in the ideal is described by the tensor we started with. For a 3-tensor this was very natural, as the structure constants of any algebra form a 3-tensor. In that case, we are using it to say how to write the product of 2 elements as a linear combination (the third factor of the tensor) of basis elements. With a $d$ -tensor for $d\geq 3$ , we now want to use it to describe how to write the product of $d-1$ elements as a linear combination of basis elements. The tricky part here is that in an algebra we must still describe the product of any two elements. The idea is to create a set of generators, let them freely generate monomials up to degree $d-2$ , and then when we get a product of $d-1$ generators, rewrite it as a linear combination of generators according to the given tensor. This idea almost provides one direction of the reduction: if two $d$ -tensors $\mathtt{A},\mathtt{B}$ are isomorphic, then the corresponding algebras $\mathcal{A},\mathcal{B}$ are isomorphic. However, there is an issue with implementing this, namely that monomials are commutative, but our tensors $\mathtt{A},\mathtt{B}$ need not be symmetric, and moreover, they need not even be “square” (have all side lengths equal). In [AS05, Thm. 5] they reduce Degree- $d$ Form Equivalence to Commutative Algebra Isomorphism along similar lines, but there the starting objects are themselves commutative, so this was not an issue. In our case, we will get around this using a certain noncommutative algebra where the only nonzero products are those which come “in the right order.”

Another potentially tricky aspect of the reduction is the converse: suppose we build our algebras $\mathcal{A},\mathcal{B}$ as above from two $d$ -tensors, and $\mathcal{A},\mathcal{B}$ are isomorphic; how can we guarantee that $\mathtt{A}$ and $\mathtt{B}$ are isomorphic? For this, we would like to be able to identify certain subsets of the algebras as characteristic (invariant under any automorphism), so that those characteristic subsets force the isomorphism to take a particular form, which we can then massage into an isomorphism between the tensors $\mathtt{A},\mathtt{B}$ . Our way of doing this is to encode the “degree” structure into the path algebra of a graph, as described in the next section. If the graph has no automorphisms, then the path algebra has the advantage that for any two vertices $i,j$ , the subset of $\mathcal{A}$ spanned by the paths from $i$ to $j$ is nearly characteristic in a way we make precise below. ∎

9.1 Preliminaries for Theorem B

To make the above proof idea precise, we will need a little background on Leavitt path algebras (a.k.a. quiver algebras) and their quotients. For a textbook reference on these algebras, see [ASS06, Ch. II], and for a textbook treatment of Wedderburn–Artin theory and the Jacobson radical, see [Lam91]. Aside from the definition of path algebra, most of this section will end up being used as a black box; we include it mostly for ease of reference.

We start with some important, classical results on the structure of associative algebras. The Jacobson radical of an associative algebra $A$ , here denoted $R(A)$ , is the intersection of all maximal right ideals. Equivalently, $R(A)=\{a\in A:\text{every element of }1+AxA\text{ is invertible}\}$ . A unital algebra $A$ over a field $\mathbb{F}$ is semisimple if $R(A)=0$ ; in this case, by Wedderburn’s Theorem (see below), $A$ is isomorphic to a direct sum of matrix algebras over finite-degree division rings extending $\mathbb{F}$ . An algebra $A$ is called separable if it is semisimple over every field extending $\mathbb{F}$ , that is, $A\otimes_{\mathbb{F}}\mathbb{K}$ is semisimple for all fields $\mathbb{K}$ extending $\mathbb{F}$ . Equivalently, $A$ is separable if it is isomorphic to $\bigoplus_{i=1}^{d}\mathrm{M}(d_{i},\mathbb{F}_{i})$ , where each $\mathbb{F}_{i}$ is a division ring extending $\mathbb{F}$ such that the center $Z(\mathbb{F}_{i})$ is a separable field extension of $\mathbb{F}$ . If $\mathbb{F}$ has characteristic zero or is perfect (which includes all finite fields), then all its extensions are separable. For the algebra we construct, it will simply be a direct sum of copies of $\mathbb{F}$ , so it is automatically separable over any field.

An element $a\in A$ is idempotent if $a^{2}=a$ . An idempotent $e$ is primitive if it cannot be written as the sum of two nonzero idempotents. Two idempotents $e,f$ are orthogonal if $ef=fe=0$ . A complete set of primitive orthogonal idempotents of $A$ is a set $\{e_{1},\dotsc,e_{n}\}$ of primitive idempotents which are pairwise orthogonal, and such that the set is maximal subject to this condition.

Theorem 9.2 (Wedderburn–Mal’cev, see, e. g., [Far05]).

Let $A$ be an finite-dimensional, associative, unital algebra over a field $\mathbb{F}$ . Then

$A/R(A)\cong\bigoplus_{i=1}^{d}\mathrm{M}(d_{i},\mathbb{F}_{i})$ * (as algebras), where each $\mathbb{F}_{i}$ is a division ring of finite degree over $\mathbb{F}$ .* 2. 2.

If $A/R(A)$ is separable, then there exists a subalgebra $S\subseteq A$ such that $A=S\oplus R(A)$ (as $\mathbb{F}$ -vector spaces). 3. 3.

If $T\subseteq A$ is any separable subalgebra, then there exists $r\in R(A)$ such that $(1+r)T(1+r)^{-1}\subseteq S$ .

The last part of the preceding theorem is what we will use to show that the set of paths $i\to j$ in our graph is “nearly characteristic;” that is, it is not characteristic, but it is characteristic up to conjugacy (=inner automorphisms).

Definition 9.3 (Leavitt path algebra).

Given a directed multigraph $G$ (possibly with parallel edges and self-loops, a.k.a. quiver), its Leavitt path algebra $\text{Path}(G)$ is the algebra of paths in $G$ , where multiplication is given by concatenation of paths (when this is well-defined), and zero otherwise. That is, $\text{Path}(G)$ is generated by $\{e_{v}:v\in V(G)\}\cup\{x_{a}:a\in E(G)\}$ , where the generators $e_{v}$ are thought of as the “path of length [math]” at vertex $v$ . The defining relations in $\text{Path}(G)$ are that the product of two paths is their concatenation if the end of the first equals the start of the second, and zero otherwise. More formally, the relations are:

[TABLE]

where $\delta_{x,y}$ is the Kronecker delta: it is $1$ if $x=y$ and [math] otherwise.

Note that we are allowed to take formal linear combinations of paths in this algebra, as it is an $\mathbb{F}$ -algebra (so in particular, it is an $\mathbb{F}$ -vector space). The arrow ideal of $\text{Path}(G)$ is the two-sided ideal generated by the arrows, and has a basis consisting of all paths of length $\geq 1$ ; it is denoted $R_{G}$ .

Lemma 9.4 (See [ASS06, Cor. II.1.11]).

If $G$ is finite, connected, and acyclic, then $R(\text{Path}(G))$ is the arrow ideal $R_{G}$ , and has a basis consisting of all paths of length $\geq 1$ , and the set $\{e_{v}:v\in V(G)\}$ is a complete set of primitive orthogonal idempotents.

Corollary 9.5.

Let $G$ be a finite, connected, acyclic graph, and $I$ an ideal of $\text{Path}(G)$ contained in $R_{G}$ ; let $A=\text{Path}(G)/I$ . Then (1) $R(A)=R_{G}/I$ , (2) $A/R(A)\cong\mathbb{F}^{\oplus|V(G)|}$ , whence $A/R(A)$ is separable, and (3) $\{\overline{e}_{v}:v\in V(G)\}$ is a complete set of primitive orthogonal idempotents, where $\overline{e}_{v}$ is the image of $e_{v}$ under the quotient map $\text{Path}(G)\to\text{Path}(G)/I=A$ .

Proof.

(1) This holds for any ideal contained in the radical of any finite-dimensional associative unital algebra [Lam91, Prop. 4.6].

(2) It is clear that as vector spaces, $\text{Path}(G)=\langle e_{1},\dotsc,e_{n}\rangle\oplus R_{G}$ (where $n=|V(G)|$ ), and the span of the $e_{i}$ is easily seen to be an algebra isomorphic to $\mathbb{F}^{n}$ , where the $i$ -th copy of $\mathbb{F}$ is spanned by $\pi(e_{i})$ , where $\pi\colon\text{Path}(G)\to\text{Path}(G)/R_{G}$ is the natural projection. Thus $\text{Path}(G)/R_{G}\cong\mathbb{F}^{n}$ . Since $R(A)=R_{G}/I$ , we have $A/R(A)=(\text{Path}(G)/I)/(R_{G}/I)\cong\text{Path}(G)/R_{G}\cong\mathbb{F}^{n}$ . As a semisimple algebra, we thus have that $A/R(A)\cong\bigoplus\mathrm{M}(1,\mathbb{F})$ , and as $\mathbb{F}$ is always a separable extension over itself, $A/R(A)$ is separable.

(3) The property of being a set of primitive orthogonal idempotents is preserved by homomorphisms, so there are only two things to check here: first, that none of the $\overline{e}_{v}$ is zero modulo $I$ , and second, that there are no additional primitive idempotents in $A$ that are mutually orthogonal with every $\overline{e}_{v}$ . To see that none of the $\overline{e}_{v}$ are zero, note that $\pi\colon\text{Path}(G)\to\text{Path}(G)/R_{G}$ factors through $A$ ; then since $\pi(e_{v})\neq 0$ for any $v$ (from the previous paragraph), it must be the case that $\overline{e}_{v}\neq 0$ as well. Finally, we must show this is a complete set of primitive orthogonal idempotents. Suppose not; that is, suppose there is some $e\notin\{\overline{e}_{v}:v\in V(G)\}$ such that $e$ is a primitive idempotent that is orthogonal in $A$ to every $\overline{e}_{v}$ . First, we claim that $e\notin R(A)=R_{G}/I$ . For, since $G$ is a finite acyclic graph, its arrow ideal $R_{G}$ is nilpotent: there are no paths longer than $n-1=|V(G)-1|$ , so we must have $R_{G}^{n}=0$ , whence $R_{G}$ cannot contain any idempotents. Since $R_{G}$ is nilpotent, the same must be true of $R_{G}/I$ , whence $R_{G}$ cannot contain any idempotents, so $e$ cannot be in $R_{G}$ . But then the image of $e$ in $A/R_{G}$ is nonzero (since $e\notin R_{G}$ ), so $e$ is another primitive idempotent orthogonal to every $\pi(e_{v})$ in $\text{Path}(G)/R_{G}=A/R(A)$ . But this is a contradiction, since $\{\pi(e_{v})\}$ is already a complete set of primitive orthogonal idempotents for $A/R(A)$ . ∎

Finally, in the course of the proof, we will use the following construction of Grigoriev:

Theorem 9.6 (Grigoriev [Gri81, Theorem 1]).

Graph Isomorphism* is equivalent to Algebra Isomorphism for algebras $A$ such that the radical squares to zero and $A/R(A)$ is abelian.*

In our proof, all we will need aside from Grigoriev’s result is to see the construction itself, which we recall here in language consistent with ours.

Construction [Gri81].

Given a graph $G$ , construct an algebra $\mathcal{A}_{G}$ as follows: it is generated by $\{e_{i}:i\in V(G)\}\cup\{e_{ij}:(i,j)\in E(G)\}$ subject to the following relations: $e_{i}e_{j}=\delta_{ij}e_{i}$ , $e_{i}e_{kj}=\delta_{ik}e_{kj}$ , $e_{kj}e_{i}=\delta{ij}e_{kj}$ , $e_{ij}e_{kl}=0$ when $j\neq k$ , $R(\mathcal{A}_{G})$ is generated by $\{e_{ij}\}$ , and the radical squares to zero. It is immediate that this is just $\text{Path}(G)/R_{G}^{2}$ . From any such algebra $\mathcal{A}$ , Grigoriev recovers a corresponding weighted graph, where the weight on $(i,j)$ is $\dim e_{i}\mathcal{A}e_{j}$ . In our setting we use multiple parallel edges rather than weight, but the proof goes through mutatis mutandis. ∎

9.2 Proof of Theorem B

Proof.

Let $\mathtt{A}$ be an $n_{1}\times n_{2}\times\dotsb\times n_{d}$ $d$ -tensor. Let $G$ be the following directed multigraph (see Figure 6): it has $d$ vertices, labeled $1,\dotsc,d$ , and for $i=1,\dotsc,d-1$ , it has $n_{i}$ parallel arrows from vertex $i$ to vertex $i+1$ , and $n_{d}$ parallel arrows from $1$ to $d$ .

Because of the structure of this graph, we can index the generators of $\text{Path}(G)$ a little more mnemonically than in the preliminaries above: let the generators corresponding to the $n_{i}$ arrows from $i\to(i+1)$ be $x_{i,a}$ for $a=1,\dotsc,n_{i}$ , and let the generators corresponding to the $n_{d}$ arrows $1\to d$ be $x_{d,a}$ for $a=1,\dotsc,n_{d}$ . Let $\mathcal{A}$ be the quotient of $\text{Path}(G)$ by the relation161616For those familiar with quiver algebras, we note that this ideal is not admissible, as it is not contained in $R_{G}^{2}$ . It can probably be made admissible by inserting new vertices in the middle of each edge $1\to d$ . However, when we tried to do that in a naive way, we ran into problems verifying the reduction, as what should be a linear transformation either ends up being incorrect or ends up being quadratic, either of which caused issues.

[TABLE]

At the moment, we only have $\mathcal{A}$ in terms of generators and relations; however, it will be easy to turn it into its basis representation. The key is to bound its dimension, which we do now. Except for paths of length $d-1$ (because of the nontrivial relations (6)), this is just counting the number of paths in the graph described above. The only nonzero monomials of degree $k+1$ are those of the form $x_{i,a_{i}}x_{i+1,a_{i+1}}x_{i+2,a_{i+2}}\dotsb x_{i+k,a_{i+k}}$ . For a given choice of $i\in\{1,\dotsc,d-1-k\}$ , there are exactly $n_{i}n_{i+1}\dotsb n_{i+k}$ such monomials, so we have

[TABLE]

Note that in the first line we can exactly specify $\dim\mathcal{A}$ , independent of $\mathtt{A}$ itself (depending only on its dimensions). For any fixed $d$ , this dimension is polynomial in $n$ . By the linear-algebraic analogue of breadth-first search, we may thus list a basis for $\mathcal{A}$ and its structure constants with respect to that basis.

We claim that the map $\mathtt{A}\mapsto\mathcal{A}$ is a reduction. Suppose $\mathtt{B}$ is another tensor of the same dimension, and let $\mathcal{B}$ be the associated algebra as above. We claim that $\mathtt{A}\cong\mathtt{B}$ as $d$ -tensors if and only if $\mathcal{A}\cong\mathcal{B}$ as algebras.

For the only if direction, suppose $\mathtt{A}\cong\mathtt{B}$ via $(P_{1},P_{2},\dotsc,P_{d})\in\mathrm{GL}(n_{1},\mathbb{F})\times\dotsb\times\mathrm{GL}(n_{d},\mathbb{F})$ , that is

[TABLE]

for all $i_{1},\dotsc,i_{d}$ . Then we claim that the block-diagonal matrix $P=\operatorname{diag}(P_{1},P_{2},\dotsc,P_{d-1},P_{d}^{-1})\in\mathrm{GL}(n,\mathbb{F})$ (where $n=\sum_{i=1}^{d}n_{i}$ ), together with mapping $e_{i}$ to $e_{i}$ , induces an isomorphism from $\mathcal{A}$ to $\mathcal{B}$ . Note that $P$ itself is not an isomorphism, as $\dim\mathcal{A}\approx n^{d}$ , but $P$ specifies a linear map on the generators of $\mathcal{A}$ , which we may then exend to all of $\mathcal{A}$ .

First let us see that $P$ indeed gives a well-defined homomorphism $\mathcal{A}\to\mathcal{B}$ . Since $P$ is only defined on the generators and is, by definition, extended by distributivity, the only thing to check here is that $P$ sends the relations of $\mathcal{A}$ into the relations of $\mathcal{B}$ . Let $y_{1,1},\dotsc,y_{1,n_{1}},\dotsc,y_{d,n_{d}},e_{1},\dotsc,e_{d}$ denote the basis of $\mathcal{B}$ as above. The map $P$ is defined by $P(e_{i})=e_{i}$ ,

[TABLE]

and

[TABLE]

By left multiplying by $P_{d}^{t}$ , we may rewrite this last equation as

[TABLE]

note the transpose.

To check the relations, let us write out the Leavitt path algebra relations explicitly for our graph, in our notation. The generators of $\mathcal{A}$ are $x_{1,1},x_{1,2},\dotsc,x_{1,n_{1}},x_{2,1},x_{2,2},\dotsc,x_{2,n_{2}},\dotsc,x_{d,n_{d}},e_{1},\dotsc,e_{d}$ , and the relations are (6) and the quiver relations:

[TABLE]

Note that the set $e_{i}\mathcal{A}e_{j}$ is linearly spanned by the paths $i\to j$ in this graph.

The relations involving the $e_{i}$ are easy to verify, since they only depend on the first subscript of $x_{i,a}$ (resp., $y_{j,b}$ ), and $P$ does not alter this subscript.

For relation (7), we have:

[TABLE]

where the final inequality comes from the defining relations $y_{i,a^{\prime}}y_{d,b^{\prime}}=0$ in $\mathcal{B}$ .

The verification for remaining quiver relations is similar, since $P$ does not alter the start and end vertices of any arrow (though it may send a single arrow $i\to j$ in $\mathcal{A}$ to a linear combination of arrows $i\to j$ in $\mathcal{B}$ ).

We now verify the relation (6). We have

[TABLE]

as desired. Thus the map $\mathcal{A}\to\mathcal{B}$ induced by $P$ is an algebra homomorphism.

Next, since $P$ is an isomorphism of $(d+n)$ -dimensional vector spaces, the map it induces $\mathcal{A}\to\mathcal{B}$ is surjective on the generators of $\mathcal{B}$ , whence it is surjective onto all of $\mathcal{B}$ . Finally, since $\dim\mathcal{A}=\dim\mathcal{B}<\infty$ , any linear surjective map $\mathcal{A}\to\mathcal{B}$ is automatically bijective, so this map is indeed an isomorphism of algebras.

For the if direction, suppose that $f\colon\mathcal{A}\to\mathcal{B}$ is an isomorphism of algebras. Since the Jacobson radical is characteristic, we have $f(R(\mathcal{A}))=R(\mathcal{B})$ . Then $\{f(e_{v}):v\in V\}$ is a set of primitive orthogonal idempotents in $\mathcal{B}$ , and their span $T=\langle f(e_{v}):v\in V\rangle$ is a separable subalgebra (isomorphic to $\mathbb{F}^{n}$ ) such that $\mathcal{B}=T\oplus R(\mathcal{B})$ . By the Wedderburn–Mal’cev Theorem (Theorem 9.2(3)), there is some $r\in R(\mathcal{B})$ such that $(1+r)T(1+r)^{-1}=\langle e_{1},\dotsc,e_{n}\rangle=:S$ . Since the $e_{i}$ are the only primitive idempotents in $S$ , we must have that $(1+r)f(e_{i})(1+r)^{-1}=e_{\pi(i)}$ for all $i$ and some permutation $\pi\in S_{n}$ .

Next we will show that this permutation is in fact the identity, so that $(1+r)f(e_{i})(1+r)^{-1}=e_{i}$ for all $i$ . For this, consider $\mathcal{A}^{\prime}=\mathcal{A}/R(\mathcal{A})^{2}$ and similarly $\mathcal{B}^{\prime}$ . These are precisely the algebras considered by Grigoriev [Gri81] (reproduced as Theorem 9.6 above). Since $R(\mathcal{A})$ is characteristic, so is its square, and thus $f$ induces an isomorphism $\mathcal{A}^{\prime}\stackrel{{\scriptstyle\cong}}{{\to}}\mathcal{B}^{\prime}$ . By Theorem 1 of Grigoriev [Gri81], any isomorphism $\mathcal{A}^{\prime}\to\mathcal{B}^{\prime}$ induces an isomorphism of the corresponding graphs, so this isomorphism must map $e_{i}$ to $e_{i}$ for each $i$ (since our graph $G$ has no automorphisms). Thus $\pi$ must be the identity, and $(1+r)f(e_{i})(1+r)^{-1}=e_{i}$ for all $i$ .

Since conjugation is an automorphism, let $f^{\prime}\colon\mathcal{A}\to\mathcal{B}$ be $c_{1+r}\circ f$ , where $c_{1+r}(b)=(1+r)b(1+r)^{-1}$ . By the above, $f^{\prime}(e_{i})=e_{i}$ for all $i$ . Thus $f^{\prime}(e_{i}\mathcal{A}e_{j})=e_{i}\mathcal{B}e_{j}$ . In particular, define $P_{i}$ to be the restriction of $f^{\prime}$ to $e_{i}\mathcal{A}e_{i+1}$ for $i=1,\dotsc,d-1$ and $P_{d}$ to be the restriction of $f^{\prime}$ to $e_{1}\mathcal{A}e_{d}$ . Then we have that $P_{i}$ is a linear bijection from the span of $x_{i,1},\dotsc,x_{i,n_{i}}$ to the span of $y_{i,1},\dotsc,y_{i,n_{i}}$ for all $i$ . We claim that $P=(P_{1},\dotsc,P_{d-1},P_{d}^{-t})$ is a tensor isomorphism $\mathtt{A}\to\mathtt{B}$ , that is,

[TABLE]

From the fact that $f^{\prime}$ is an isomorphism, we have

[TABLE]

For each $j_{d}\in\{1,\dotsc,n_{d}\}$ , equating the coefficient of $y_{d,j_{d}}$ gives

[TABLE]

Let $\mathtt{A}(i_{1},\dotsc,i_{d-1},-)$ be the natural row vector of length $n_{d}$ , and similarly for $\mathtt{B}(j_{1},\dotsc,j_{d-1},-)$ . Then we may rewrite the preceding set of $n_{d}$ equations (one for each choice of $j_{d}$ ) in matrix notation as

[TABLE]

Right multiplying by $P_{d}^{-1}$ , we then get

[TABLE]

as claimed. ∎

10 Conclusion: universality and open questions

10.1 Towards universality for basis-explicit linear structures

A classic result is that GI is complete for isomorphism problems of explicitly given structures (see, e. g., [ZKT85, Section 15]). Here we formally state the linear-algebraic analogue of this result, and observe trivially that the results of [FGS19] already show that 3-Tensor Isomorphism is universal among what we call “basis-explicit” (multi)linear structures of degree 2.

First let us recall the statement of the result for GI, so we can develop the appropriate analogue for tensor isomorphism. A first-order signature is a list of positive integers $(r_{1},r_{2},\dotsc,r_{k};f_{1},\dotsc,f_{\ell})$ ; a model of this signature consists of a set $V$ (colloquially referred to as “vertices”), $k$ relations $R_{i}\subseteq V^{r_{i}}$ , and $\ell$ functions $F_{i}\colon V^{f_{i}}\to V$ . The numbers $r_{i}$ are thus the arities of the relations $R_{i}$ , and the $f_{i}$ are the arities of the functions $F_{i}$ .171717Sometimes one also includes constants in the definition, but these can be handled as relations of arity 1. While we could have done the same for functions, treating a function of arity $f$ as its graph, which is a relation of arity $f+1$ , distinguishing between relations and functions will be useful when we come to our linear-algebraic analogue. Two such models $(V;R_{1},\dotsc,R_{k};F_{1},\dotsc,F_{\ell})$ and $(V^{\prime};R_{1}^{\prime},\dotsc,R_{k}^{\prime};F_{1}^{\prime},\dotsc,F_{\ell}^{\prime})$ are isomorphic if there is a bijection $\varphi\colon V\to V^{\prime}$ that sends $R_{i}$ to $R_{i}^{\prime}$ for all $i$ and $F_{i}$ to $F_{i}^{\prime}$ for all $i$ . In symbols, $\varphi$ is an isomorphism if $(v_{1},\dotsc,v_{r_{i}})\in R_{i}\Leftrightarrow(\varphi(v_{1}),\dotsc,\varphi(v_{r_{i}}))\in R_{i}^{\prime}$ for all $i$ and all $v_{*}\in V$ , and similarly if $\varphi(F_{i}(v_{1},\dotsc,v_{f_{i}}))=F_{i}^{\prime}(\varphi(v_{1}),\dotsc,\varphi(v_{f_{i}}))$ for all $i$ and all $v_{*}\in V$ . By an “explicitly given structure” or “explicit model” we mean a model where each relation $R_{i}$ is given by a list of its elements and each function is given by listing all of its input-output pairs. Fixing a signature, the isomorphism problem for that signature is to decide, given two explicit models of that signature, whether they are isomorphic. This isomorphism problem is directly encoded into the isomorphism problem for edge-colored hypergraphs, which can then be reduced to GI using standard gadgets.

For example, the signature for directed graphs (possibly with self-loops) is simply $\sigma=(2;)$ —its models are simply binary relations. If one wants to consider graphs without self-loops, this is a special case of the isomorphism problem for the signature $\sigma$ , namely, those explicit models in which $(v,v)\notin R_{1}$ for any $v$ . Note that a graph without self-loops is never isomorphic to a graph with self-loops, and two directed graphs without self-loops are isomorphic as directed graphs if and only if they are isomorphic as models of the signature $\sigma$ . In other words, the isomorphism problem for simple directed graphs really is just a special case. The same holds for undirected graphs without self-loops, which are simply models of the signature $\sigma$ in which $(v,v)\notin R_{1}$ and $R_{1}$ is symmetric. As another example, the signature for finite groups is $\gamma=(1;1,2)$ : the first relation $R_{1}$ will be a singleton, indicating which element is the identity, the function $F_{1}$ is the inverse function $F_{1}(g)=g^{-1}$ , and the second function $F_{2}$ is the group multiplication $F_{2}(g,h)=gh$ . Of course, models of the signature $\gamma$ can include many non-groups as well, but, as was the case with directed graphs, a group will never be isomorphic to a non-group, and two groups are isomorphic as models of $\gamma$ iff they are isomorphic as groups.

A natural linear-algebraic analogue of the above is as follows. One additional feature we add here for purposes of generality is that we need to make room for dual vector spaces. A linear signature is then a list of pairs of nonnegative integers $((r_{1},r_{1}^{*}),\dotsc,(r_{k},r_{k}^{*});(f_{1},f_{1}^{*}),\dotsc,(f_{\ell},f_{\ell}^{*}))$ with the property that $r_{i}+r_{i}^{*}>0$ and $f_{i}+f_{i}^{*}>0$ for all $i$ . By the arity of the $i$ -th relation (resp., function) we mean the sum $r_{i}+r_{i}^{*}$ (resp., $f_{i}+f_{i}^{*}$ ).

Definition 10.1 (Linear signature, basis-explicit).

Given a linear signature

[TABLE]

a linear model for $\sigma$ over a field $\mathbb{F}$ consists of an $\mathbb{F}$ -vector space $V$ , and linear subspaces $R_{i}\leq V^{\otimes r_{i}}\otimes(V^{*})^{\otimes r_{i}^{*}}$ for $1\leq i\leq k$ and linear maps $F_{i}\colon V^{\otimes f_{i}}\otimes(V^{*})^{\otimes f_{i}^{*}}\to V$ for $1\leq i\leq\ell$ . Two such linear models $(V;R_{1},\dotsc,R_{k};F_{1},\dotsc,F_{\ell}),(V^{\prime};R_{1}^{\prime},\dotsc,R_{k}^{\prime};F_{1}^{\prime},\dotsc,F_{\ell}^{\prime})$ are isomorphic if there is a linear bijection $\varphi\colon V\to V^{\prime}$ that sends $R_{i}$ to $R_{i}^{\prime}$ for all $i$ and $F_{i}$ to $F_{i}^{\prime}$ for all $i$ (details below).

A basis-explicit linear model is given by a basis for each $R_{i}$ , and, for each element of a basis of the domain of $F_{i}$ , the value of $F_{i}$ on that element. Vectors here are written out in their usual dense coordinate representation.

In particular, this means that an element of $V^{\otimes r}$ —say, a basis element of $R_{1}$ —is written out as a vector of length $(\dim V)^{r}$ . We will only be concerned with finite-dimensional linear models.

Given $\varphi\colon V\to V^{\prime}$ , let $\varphi^{\otimes r_{i}\otimes r_{i}^{*}}$ denote the linear map $\varphi^{\otimes r_{i}\otimes r_{i}^{*}}\colon V^{\otimes r_{i}}\otimes(V^{*})^{\otimes r_{i}^{*}}\to V^{\prime\otimes r_{i}}\otimes(V^{\prime*})^{\otimes r_{i}^{*}}$ which is defined on basis vectors factor-wise: $\varphi^{\otimes r_{i}\otimes r_{i}^{*}}(v_{1}\otimes\dotsb\otimes v_{r_{i}}\otimes\ell_{1}\otimes\dotsb\otimes\ell_{r_{i}^{*}})=\varphi(v_{1})\otimes\dotsb\otimes\varphi(v_{r_{i}})\otimes\varphi^{*}(\ell_{1})\otimes\dotsb\otimes\varphi^{*}(\ell_{r_{i}^{*}})$ , and then extended to the whole space by linearity. (Recall that $V^{*}=\mathrm{Hom}(V,\mathbb{F})$ , so elements of $V^{*}$ are linear maps $\ell\colon V\to\mathbb{F}$ , and thus $\varphi^{*}(\ell):=\ell\circ\varphi^{-1}$ is a map from $V^{\prime}\to V\to\mathbb{F}$ , i. e., an element of $V^{\prime*}$ , as desired). Similarly, when we say that $\varphi$ sends $F_{i}$ to $F_{i}^{\prime}$ , we mean that $\varphi(F_{i}(v_{1}\otimes\dotsb\otimes v_{f_{i}}\otimes\ell_{1}\otimes\dotsb\otimes\ell_{f_{i}^{*}}))=F_{i}^{\prime}(\varphi^{\otimes f_{i}\otimes f_{i}^{*}}(v_{1}\otimes\dotsb\otimes v_{f_{i}}\otimes\ell_{1}\otimes\dotsb\otimes\ell_{f_{i}^{*}}))$ .

Remark 10.2.

We use the term “basis-explicit” rather than just “explicit,” because over a finite field, one may also consider a linear model of $\sigma$ as an explicit model of a different signature (where the different signature additionally encodes the structure of a vector space on $V$ , namely, the addition and scalar multiplication), and then one may talk of a single mathematical object having explicit representations—where everything is listed out—and basis-explicit representations—where things are described in terms of bases. An example of this distinction arises when considering isomorphism of $p$ -groups of class $2$ : the “explicit” version is when they are given by their full multiplication table (which reduces to GI), while the “basis-explicit” version is when they are given by a generating set of matrices or a polycyclic presentation (which GI reduces to).

Theorem 10.3 (Futorny–Grochow–Sergeichuk [FGS19]).

Given any linear signature $\sigma$ where all relationship arities are at most 3 and all function arities are at most 2, the isomorphism problem for finite-dimensional basis-explicit linear models of $\sigma$ reduces to 3-Tensor Isomorphism in polynomial time.

Because of the equivalence between $d$ -Tensor Isomorphism and 3-Tensor Isomorphism (Theorem B + [FGS19]), we expect the analogous result to hold for arbitrary $d$ . Thus an analogue of the results of [FGS19] for $d$ -tensors would yield the full analogue of the universality result for GI.

Open Question 10.4.

Is $d$ -Tensor Isomorphism universal for isomorphism problems on $d$ -way arrays? That is, prove the analogue of the results of [FGS19] for $d$ -way arrays for any $d\geq 3$ .

10.2 Other open questions

Our search-to-decision reduction (Theorem C) produces instances of dimension $O(n^{2})$ from instances of dimension $n$ . As stated, this means that a simply-exponential ( $q^{\tilde{O}(n)}$ -time) decision algorithm would result only in a $q^{\tilde{O}(n^{2})}$ search algorithm, but the latter runtime is trivial. We note that it may be possible to alleviate this blow-up by attempting to generalize the logarithmic-size “coloring palette” construction for reducing Colored GI to GI from the graph case to the linear-algebraic case.

Open Question 10.5.

Is there a search-to-decision reduction for Alternating Matrix Space Isometry (and, consequently, isomorphism of $p$ -groups of class 2 and exponent $p$ , given in their natural succinct encoding) that runs in time $q^{\tilde{O}(n)}$ , and produces instances of quasi-linear ( $\tilde{O}(n)$ ) dimension?

In Section 3.2 we gave several different reductions from GI to Alternating Matrix Space Isometry. To summarize, they are:

A direct reduction from GI to Alternating Matrix Space Isometry (Prop. 7.1) 2. 2.

GI $\leq$ Matrix Lie Algebra Conjugacy [Gro12a], which in turn reduces to 3TI [FGS19], and then to Alternating Matrix Space Isometry (Thm. A); 3. 3.

GI $\leq$ CodeEq [PR97, Luk93, Miy96], CodeEq $\leq$ Matrix Lie Algebra Conjugacy [Gro12a], and then follow the same reductions as in (1); 4. 4.

GI $\leq$ Monomial Code Equivalence (the same reduction from [PR97] works for monomial equivalence of codes, see [Gro12a]), which in turn reduces to 3TI (Prop. 3.6), and thence to Alternating Matrix Space Isometry (Thm. A) 5. 5.

GI $\leq$ Algebra Isomorphism [Gri81, AS05], which reduces to 3TI [FGS19], and then to Alternating Matrix Space Isometry (Thm. A).

Can one prove that these reductions are all distinct? Are some of them equivalent in some natural sense, e. g., up to a change of basis?

Next, most of our results hold for arbitrary fields, or arbitrary fields with minor restrictions. However, in all of our reductions, we reduce one problem over $\mathbb{F}$ to another problem over the same field $\mathbb{F}$ .

Open Question 10.6.

What is the relationship between $\mathsf{TI}$ over different fields? In particular, what is the relationship between $\mathsf{TI}_{\mathbb{F}_{p}}$ and $\mathsf{TI}_{\mathbb{F}_{p^{e}}}$ , between $\mathsf{TI}_{\mathbb{F}_{p}}$ and $\mathsf{TI}_{\mathbb{F}_{q}}$ for coprime $p,q$ , or between $\mathsf{TI}_{\mathbb{F}_{p}}$ and $\mathsf{TI}_{\mathbb{Q}}$ ?

We note that even the relationship between $\mathsf{TI}_{\mathbb{F}_{p}}$ and $\mathsf{TI}_{\mathbb{F}_{p^{e}}}$ is not particularly clear. For matrix tuples (rather than spaces; equivalently, representations of finitely generated algebras) it is the case that for any extension field $\mathbb{K}\supseteq\mathbb{F}$ , two matrix tuples over $\mathbb{F}$ are $\mathbb{F}$ -equivalent (resp., conjugate) if and only if they are $\mathbb{K}$ -equivalent [KL86] (see [dSP10] for a simplified proof). However, for equivalence of tensors this need not be the case. This seems closely related to the so-called “problem of forms” for various algebras, namely the existence of algebras that are not isomorphic over $\mathbb{F}$ , but which become isomorphic over an extension field.

Example 10.7 (Non-isomorphic tensors isomorphic over an extension field).

Over $\mathbb{R}$ , let $M_{1}=I_{4}$ and let $M_{2}=\operatorname{diag}(1,1,1,-1)$ . Since these two matrices have different signatures, they are not isometric over $\mathbb{R}$ ; since they have the same rank, they are isometric over $\mathbb{C}$ . To turn this into an example of 3-tensors, first we consider the corresponding instance of Matrix Space Isometry given by $\mathcal{M}_{1}=\langle M_{1}\rangle$ and $\mathcal{M}_{2}=\langle M_{2}\rangle$ . Note that $\mathcal{M}_{1}=\{\lambda I_{4}:\lambda\in\mathbb{R}\}$ , so the signatures of all matrices in $\mathcal{M}_{1}$ are $(4,0)$ , $(0,0)$ , or $(0,4)$ . Similarly, the signatures appearing in $\mathcal{M}_{2}$ are $(3,1)$ , $(0,0)$ , and $(1,3)$ , so these two matrix spaces are not isometric over $\mathbb{R}$ , though they are isometric over $\mathbb{C}$ since $M_{1}$ and $M_{2}$ are. Finally, apply the reduction from Matrix Space Isometry to 3TI [FGS19] to get two 3-tensors $\mathtt{A}_{1},\mathtt{A}_{2}$ . Since the reduction itself is independent of field, if we consider it over $\mathbb{R}$ we find that $\mathtt{A}_{1}$ and $\mathtt{A}_{2}$ must not be isomorphic 3-tensors over $\mathbb{R}$ , but if we consider the reduction over $\mathbb{C}$ we find that they are isomorphic as 3-tensors over $\mathbb{C}$ .

Similar examples can be constructed over finite fields $\mathbb{F}$ of odd characteristic, taking $M_{1}=I_{2}$ and $M_{2}=\operatorname{diag}(1,\alpha)$ where $\alpha$ is a non-square in $\mathbb{F}$ (and replacing the role of $\mathbb{C}$ with that of $\mathbb{K}=\mathbb{F}[x]/(x^{2}-\alpha)$ ). Instead of signature, isometry types of matrices over $\mathbb{F}$ are characterized by their rank and whether their determinant is a square or not. In this case, since our matrices are even-dimensional diagonal matrices, scaling them multiplies their determinant by a square. Thus every matrix in $\mathcal{M}_{1}$ will have its determinant being a square in $\mathbb{F}$ , and every nonzero matrix in $\mathcal{M}_{2}$ will not, but in $\mathbb{K}$ they are all squares.

It would also be interesting to study the complexity of other group actions on tensors and how they relate to the problems here. For example, the action of unitary groups $U(\mathbb{C}^{n_{1}})\times\dotsb\times U(\mathbb{C}^{n_{d}})$ on $\mathbb{C}^{n_{1}}\otimes\dotsb\otimes\mathbb{C}^{n_{d}}$ classifies pure quantum states up to “local unitary operations,” and the action of $\mathrm{SL}(U_{1})\times\dotsb\times\mathrm{SL}(U_{d})$ on $U_{1}\otimes\dotsb\otimes U_{d}$ , over $\mathbb{C}$ , is the well-studied action by stochastic local operations with classical communication (SLOCC) on quantum states (e. g., [GW13, Miy04, CD–07]). Isomorphism of $m$ -dimensional lattices in $n$ -dimensional space can be seen as the natural action of $O_{n}(\mathbb{R})\times\mathrm{GL}_{m}(\mathbb{Z})$ by left and right multiplication on $n\times m$ matrices. As another example, orbits for several of the natural actions of $\mathrm{GL}_{n}(\mathbb{Z})\times\mathrm{GL}_{m}(\mathbb{Z})\times\mathrm{GL}_{r}(\mathbb{Z})$ on 3-tensors over $\mathbb{Z}$ , even for small values of $n,m,r$ , are the fundamental objects in Bhargava’s seminal work on higher composition laws [Bha04a, Bha04b, Bha04c, Bha08]. We note that while the orthogonal group $O(V)$ is the stabilizer of a 2-form on $V$ (that is, an element of $V\otimes V$ ) and $\mathrm{SL}(V)$ is the stabilizer of the induced action on $\bigwedge^{\dim V}V$ (by the determinant)—so gadgets similar to those in this paper might be useful— $\mathrm{GL}_{n}(\mathbb{Z})$ is not the stabilizer of any such structure.

In Remark 9.1 we observed that any reduction (in the sense of Sec. 6.2) from $d$ TI to 3TI must have a blow-up in dimension which is asymptotically $n^{d/3}$ , while our construction uses dimension $O(d^{2}n^{d-1})$ .

Open Question 10.8.

Is there a reduction from $d$ TI to 3TI (as in Sec. 6.2) such that the dimension of the output is $\mathrm{poly}(d)\cdot n^{d/3(1+o(1))}$ ?

Finally, in terms of practical algorithms, we wonder how well modern SAT solvers would do on instances of 3-Tensor Isomorphism over $\mathbb{F}_{2}$ (or over other finite fields, encoded into bit-strings).

Acknowledgments

The authors would like to thanks James B. Wilson for related discussions, and Uriya First, Lek-Heng Lim, and J. M. Landsberg for help searching for references asking whether d ${\sc TI}$ could reduce to ${\sc 3TI}$ . J. A. G. would like to thank V. Futorny and V. V. Sergeichuk for their collaboration on the related work [FGS19]. Ideas leading to this work originated from the workshop “Wildness in computer science, physics, and mathematics” at the Santa Fe Institute. Both authors were supported by NSF grant DMS-1750319. Y. Q. was partly supported by Australian Research Council DECRA DE150100720.

Appendix A Reducing Cubic Form Equivalence to Degree- $d$ Form Equivalence

Proposition A.1.

Cubic Form Equivalence* reduces to Degree- $d$ Form Equivalence, for any $d\geq 3$ .*

We suspect that a similar construction would give a reduction from Degree- $d^{\prime}$ Form Equivalence to Degree- $d$ Form Equivalence for any $d^{\prime}\leq d$ , but our argument relies on a case analysis that is somewhat specific to $d^{\prime}=3$ . Our argument might be adaptable to any fixed value of $d^{\prime}$ the prover desires, with a consequently more complicated case analysis, but to prove it for all $d^{\prime}$ simultaneously seems to require a different argument.

Proof.

The reduction itself is quite simple: $f\mapsto z^{d-3}f$ , where $z$ is a new variable not appearing in $f$ . If $A$ is an equivalence between $f$ and $g$ —that is, $f(x)=g(Ax)$ —then $\operatorname{diag}(A,1_{z})$ is an equivalence from $z^{d-3}f$ to $z^{d-3}g$ . Conversely, suppose $\tilde{f}=z^{d-3}f$ is equivalent to $\tilde{g}=z^{d-3}g$ via $\tilde{f}(x)=\tilde{g}(Bx)$ . We split the proof into several cases.

If $d=3$ ,

then $z$ is not present so we already have that $f$ and $g$ are equivalent.

If $f$ is not divisible by $\ell^{d-3}$ for some linear form $\ell$ ,

then $z^{d-3}$ is the unique factor in both $z^{d-3}f$ and $z^{d-3}g$ which is raised do the $d-3$ power. Thus any equivalence $B$ between these two must map $z$ to itself, hence has the form

[TABLE]

(if we put $z$ last in our basis, and think of the matrix as acting on the left of the column vectors corresponding to the variables). However, since both $f$ and $g$ do not depend on $z$ , it must be the case that whatever contributions $z$ makes to $g(Bx)$ , they all cancel. More precisely, all monomials involving $z$ in $g(Bx)$ must cancel, so if we alter $B$ into $\tilde{B}$ that $\tilde{B}x_{i}$ never includes $z$ (that is, if we make the stars in the last row above all zero), then $g(\tilde{B}x)=g(Bx)$ , hence $f(x)=g(\tilde{B}x)$ , so $f$ and $g$ are equivalent.

The preceding case always applies when $d>6$ , for then $d-3>3$ , but $\deg f=3$ . We are left to handle the following cases:

$d\leq 6$ and $f$ is a product of linear forms; 2. 2.

$d=4$ , $f$ is a product of a linear form and an irreducible quadratic form.

Suppose $f$ is a product of linear forms,

then let us define $\mathrm{rk}(f)$ as the number of linearly independent linear forms appearing in the factorization of $f$ . Note that if $\mathrm{rk}(f)=1$ , then $f=\alpha\ell^{3}$ for some $\alpha\in\mathbb{F}$ , if $\mathrm{rk}(f)=2$ , then $f=\ell_{1}^{2}\ell_{2}$ (now we can absorb any constant into $\ell_{2}$ ), and if $\mathrm{rk}(f)=3$ then $f=\ell_{1}\ell_{2}\ell_{3}$ with all $\ell_{i}$ linearly independent. Then we have that $f\sim g$ if and only if $g$ is also a product of linear forms of the same rank. For $\mathrm{GL}_{n}$ acts transitively on $k$ -tuples of linearly independent vectors for all $k\leq n$ , and in order to have $\mathrm{rk}(f)$ linearly independent forms, we must have $n\geq\mathrm{rk}(f)$ . Since we have supposed $z^{d-3}f\sim z^{d-3}g$ , by uniqueness of factorization $g$ must be a product of linear forms of the same rank as $f$ , and thus indeed $f\sim g$ .

If $d=4$ and $f=\ell\varphi$ where $\ell$ is linear and $\varphi$ is an irreducible quadratic,

then to understand the situation we begin by first doing a change of basis on $f$ to put $\varphi$ into a form in which its kernel is evident. Note that none of these simplifications are part of the reduction, but rather they are to help us prove that the reduction works. Thinking of $\varphi$ as given by its matrix $M_{\varphi}$ such that $\varphi(x)=x^{t}M_{\varphi}x$ , we can always change basis to get $M_{\varphi}$ into the form

[TABLE]

where $r=\mathrm{rk}(M_{\varphi})=\mathrm{rk}(M^{\prime})$ . Since $\varphi$ does not depend on $z$ , if we think of $\varphi$ as a quadratic form on $\{x_{1},\dotsc,x_{n},z\}$ , then the matrices are the same, but larger by one additional zero row and column.

Next we will try to simplify $\ell$ as much as possible while maintaining the (new) form of $M_{\varphi}=\operatorname{diag}(M^{\prime},\mathbf{0})$ . For this we first compute the stabilizer of the new form of $M_{\varphi}$ . We can compute the stabilizer as the set of invertible matrices $A$ such that:

[TABLE]

This turns into the following equations on the blocks of $X$ :

[TABLE]

From the first equation and the fact that $M^{\prime}$ is full rank, we find that $A_{11}$ must be an invertible $r\times r$ matrix. From the next equation and the fact that both $M$ and $A_{11}$ are full rank, we then find that $A_{12}=0$ . Thus the stabilizer of $M_{\varphi}$ is:

[TABLE]

Now we simplify $\ell$ . Note that $S$ acts on $\ell$ as a column vector. Consider $\ell=\sum_{i=1}^{n}\ell_{i}x_{i}$ , with $\ell_{i}\in\mathbb{F}$ ; we will say “ $\ell$ contains $x_{i}$ ” if and only if $\ell_{i}\neq 0$ . If $\ell$ contains some $x_{r+k}$ with $k\geq 1$ , then by setting $A_{11}=I_{r}$ and $A_{21}=0$ , we may choose $A_{22}$ to be any invertible matrix which sends $(\ell_{r+1},\dotsc,\ell_{n},\ell_{n+1})$ (recall the trailing $\ell_{n+1}$ for the $z$ coordinate) to $(1,0,\dotsc,0)$ , and thus without loss of generality we may assume that $\ell$ only contains $x_{i}$ with $1\leq i\leq r+1$ .

Next, note that if $\ell$ contains some $x_{i}$ for $1\leq i\leq r$ and $x_{r+1}$ , then we may use the action of $S$ to eliminate the $x_{r+1}$ . Namely, by taking $A_{11}=I_{r}$ , $A_{22}=I_{n+1}$ , and $A_{21}=(-\ell_{r+1}/\ell_{i})E_{1i}$ . This makes $\ell_{i}x_{i}$ in $\ell$ contribute $-\ell_{r+1}$ to the $x_{r+1}$ coordinate, eliminating $x_{r+1}$ . Thus, under the action of $S$ , we need only consider two cases for linear forms under the action of $S$ : a linear form is equivalent to either

a.

one which contains some $x_{i}$ with $1\leq i\leq r$ , in which case we can bring it to a form in which it contains no $x_{r+j}$ with $j\geq 1$ (and no $z$ ), or 2. b.

it contains no $x_{i}$ with $1\leq i\leq r$ , in which case we can use the action of $S$ to bring it to the form $\ell=x_{r+1}$ .

Let us call the corresponding linear forms “type (a)” and “type (b).” Note that the linear form $z$ is of type (b).

Now, write $f=\ell\varphi$ and $g=\ell^{\prime}\varphi^{\prime}$ , and assume that we have applied the preceding change of basis to bring $f$ to the form specified above. Recall that we are assuming $\tilde{f}\sim\tilde{g}$ , and need to show that $f\sim g$ . If, after applying the same change of basis to $g$ , we do not have $M_{\varphi^{\prime}}=M_{\varphi}$ , then $f\not\sim g$ and also $\tilde{f}\not\sim\tilde{g}$ —contrary to our assumption—since $\varphi$ (resp., $\varphi^{\prime}$ ) is the unique irreducible quadratic factor of $\tilde{f}$ (resp., $\tilde{g}$ ). So we may assume that, after this change of basis, $\varphi=\varphi^{\prime}$ , both of which have $M_{\varphi}=\operatorname{diag}(M^{\prime},0_{n-r+1})$ with $r=\operatorname{rank}(M_{\varphi})$ .

Next, since we are assuming $\tilde{f}\sim\tilde{g}$ , and $z$ itself is of type (b), so it must be the case that the types of $\ell,\ell^{\prime}$ are the same. Thus we have two cases to consider: either they are both of type (a), or both of type (b).

Suppose both $\ell,\ell^{\prime}$ are of type (a).

In this case, the equivalence between $\tilde{f}$ and $\tilde{g}$ cannot send $z$ to $\ell^{\prime}$ and $\ell$ to $z$ , for both $\ell,\ell^{\prime}$ are of type (a), whereas $z$ is of type (b). Thus the equivalence between $\tilde{f}$ and $\tilde{g}$ must restrict to an equivalence between $f$ and $g$ (when we ignore $z$ , or set its contribution to the other variables to zero, as in the above case where $f$ was not divisible by $\ell^{d-3}$ ).

Suppose both $\ell,\ell^{\prime}$ are of type (b).

In this case, it is possible that the equivalence from $\tilde{f}$ to $\tilde{g}$ could send $z$ to $\ell^{\prime}$ and $\ell$ to $z$ (since all three of $\ell,\ell^{\prime},z$ are in case (b)); however, we will see that in this case, even such a situation will not cause an issue. Without loss of generality, by the change of bases described above, we have $\tilde{f}=zx_{r+1}\varphi$ and $\tilde{g}=z\ell^{\prime}\varphi$ (the same $\varphi$ ), where $\ell^{\prime}$ contains no $x_{i}$ with $1\leq i\leq r$ . Using elements of $S$ with $A_{11}=I_{r}$ , and $A_{21}=0$ , we then get an action of $\mathrm{GL}_{n-r+1}$ (via $A_{22}$ ) on linear forms in the variables $x_{r+1},\dotsc,x_{n},z$ . Since $\ell^{\prime}$ is linearly independent from $z$ (in particular, it does not contain $z$ ) and the action of $\mathrm{GL}$ is transitive on pairs of linearly independent vectors, we may use $S$ to fix $\varphi$ and $z$ , and send $x_{r+1}$ to $\ell^{\prime}$ , giving the desired equivalence $f\sim g$ . ∎

Bibliography99

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[AD 17] Eric Allender and Bireswar Das. Zero knowledge and circuit minimization. Inf. Comput. , 256:2–8, 2017. doi:10.1016/j.ic.2017.04.004 . · doi ↗
2[AS 05] Manindra Agrawal and Nitin Saxena. Automorphisms of finite rings and applications to complexity of problems. In STACS 2005, 22nd Annual Symposium on Theoretical Aspects of Computer Science, Proceedings , pages 1–17, 2005. doi:10.1007/978-3-540-31856-9_1 . · doi ↗
3[AS 06] Manindra Agrawal and Nitin Saxena. Equivalence of 𝔽 𝔽 \mathbb{F} -algebras and cubic forms. In STACS 2006, 23rd Annual Symposium on Theoretical Aspects of Computer Science, Proceedings , pages 115–126, 2006. doi:10.1007/11672142_8 . · doi ↗
4[ASS 06] Ibrahim Assem, Daniel Simson, and Andrzej Skowroński. Elements of the representation theory of associative algebras. Vol. 1 , volume 65 of London Mathematical Society Student Texts . Cambridge University Press, Cambridge, 2006. Techniques of representation theory. doi:10.1017/CBO 9780511614309 . · doi ↗
5[Bab 85] L Babai. Trading group theory for randomness. In Proceedings of the Seventeenth Annual ACM Symposium on Theory of Computing , STOC ’85, pages 421–429. ACM, 1985. doi:10.1145/22145.22192 . · doi ↗
6[Bab 14] László Babai. On the automorphism groups of strongly regular graphs I. In Proceedings of the 5th Conference on Innovations in Theoretical Computer Science , ITCS ’14, pages 359–368, 2014. doi:10.1145/2554797.2554830 . · doi ↗
7[Bab 16] László Babai. Graph isomorphism in quasipolynomial time [extended abstract]. In Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2016 , pages 684–697, 2016. ar Xiv:1512.03547 [cs.DS] version 2. doi:10.1145/2897518.2897542 . · doi ↗
8[Bae 38] Reinhold Baer. Groups with abelian central quotient group. Trans. AMS , 44(3):357–386, 1938. doi:10.1090/S 0002-9947-1938-1501972-1 . · doi ↗

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Isomorphism problems for tensors, groups,

Abstract

1 Introduction

Isomorphism problems in light of Babai’s breakthrough on Graph

Group actions on 3-way arrays.

Main results.

Implications of main results for practical

Isomorphism problems for 3-way arrays as a bottleneck for graph

New techniques.

Organization.

2 Preliminaries: Group actions on 3-way arrays

3-tensors.

Matrix spaces.

Nilpotent groups.

Bilinear maps, finite groups, and systems of polynomials.

Cubic forms & trilinear forms.

Algebras.

Summary.

3 Main results

3.1 Equivalence of isomorphism problems for 3-way arrays

Definition 3.1** (dTI,TId\mathsf{TI},\mathsf{TI}dTI,TI).**

Theorem A**.**

Remark 3.2**.**

Theorem B**.**

Corollary B**.**

Remark 3.3**.**

Remark 3.4**.**

3.2 Relations with Graph Isomorphism and Code

Observation 3.5**.**

Proof.

Proposition 3.6**.**

Corollary 3.7**.**

3.3 Application to Group Isomorphism: reducing the nilpotency class

Corollary P**.**

Proof.

3.4 Search to decision reductions

Theorem C**.**

Corollary C**.**

4 Related work

5 Overview of one new technique, and one full proof

Proof of Prop. 3.6.

6 Preliminaries

Vector spaces.

Some groups.

Matrices.

Matrix tuples.

Remark 6.1**.**

Matrix spaces.

3-way arrays.

Observation 6.2**.**

Multi-way arrays.

Observation 6.3**.**

Proof.

Algebras and their algorithmic representations.

The Lazard correspondence for ppp-groups.

Theorem 6.4** (Lazard Correspondence for finite groups, see, e. g., [Khu98, Ch. 9 & 10] or [Nai13, Ch. 6]).**

Proposition 6.5**.**

Proof sketch.

6.1 Tensor notation

Special cases of interest.

6.2 On the type of reduction

Remark 6.6**.**

7 Reductions using the linear algebraic coloring

7.1 From Graph Isomorphism to Alternating Matrix Space Isometry

Proposition 7.1**.**

Proof.

Lemma 7.2**.**

Proof.

For the only if direction,

For the if direction,

7.2 From 3-Tensor Isomorphism to Matrix Space Isometry and Matrix Group Isomorphism

Proposition 7.3**.**

Proof.

The gadget construction.

Definition 3.1 ( $d\mathsf{TI},\mathsf{TI}$ ).

Theorem A.

Remark 3.2.

Theorem B.

Corollary B.

Remark 3.3.

Remark 3.4.

Observation 3.5.

Proposition 3.6.

Corollary 3.7.

Corollary P.

Theorem C.

Corollary C.

Remark 6.1.

Observation 6.2.

Observation 6.3.

The Lazard correspondence for $p$ -groups.

Theorem 6.4 (Lazard Correspondence for finite groups, see, e. g., [Khu98, Ch. 9 & 10] or [Nai13, Ch. 6]).

Proposition 6.5.

Remark 6.6.

Proposition 7.1.

Lemma 7.2.

Proposition 7.3.

Corollary 7.4.

Lemma 7.5 (Constructive version of Baer’s correspondence for matrix groups).

Corollary 7.6.

7.3 Search to decision reduction for $p$ -Group Isomorphism and Alternating Matrix Space Isometry

Corollary C (Search to decision for testing isomorphism of

Proposition 8.1.

Corollary 8.2.

Proposition 8.3.

Lemma 8.4.

Corollary 8.5.

Corollary 8.6.

Lemma 8.7.

9 Reducing $d$ -Tensor Isomorphism to 3-Tensor Isomorphism

Remark 9.1.

Theorem 9.2 (Wedderburn–Mal’cev, see, e. g., [Far05]).

Definition 9.3 (Leavitt path algebra).

Lemma 9.4 (See [ASS06, Cor. II.1.11]).

Corollary 9.5.

Theorem 9.6 (Grigoriev [Gri81, Theorem 1]).

Definition 10.1 (Linear signature, basis-explicit).

Remark 10.2.

Theorem 10.3 (Futorny–Grochow–Sergeichuk [FGS19]).

Open Question 10.4.

Open Question 10.5.

Open Question 10.6.

Example 10.7 (Non-isomorphic tensors isomorphic over an extension field).

Open Question 10.8.

Appendix A Reducing Cubic Form Equivalence to Degree- $d$ Form Equivalence

Proposition A.1.

If $d=3$ ,

If $f$ is not divisible by $\ell^{d-3}$ for some linear form $\ell$ ,

Suppose $f$ is a product of linear forms,

If $d=4$ and $f=\ell\varphi$ where $\ell$ is linear and $\varphi$ is an irreducible quadratic,