Improved method for finding optimal formulae for bilinear maps in a   finite field

Svyatoslav Covanov (CARAMBA)

arXiv:1705.07728·cs.DS·December 10, 2018

Improved method for finding optimal formulae for bilinear maps in a finite field

Svyatoslav Covanov (CARAMBA)

PDF

TL;DR

This paper introduces a new pruning criterion that enhances the search for optimal bilinear map formulae over finite fields, leading to new optimal solutions and insights into matrix product decompositions.

Contribution

It presents a novel pruning criterion for exhaustive search, enabling discovery of new optimal formulae and proving uniqueness of certain matrix product decompositions.

Findings

01

New optimal formulae for short product modulo X^5 and circulant product modulo (X^5 - 1)

02

Proof of uniqueness of the optimal decomposition for 3x2 by 2x3 matrix products

03

Enhanced search efficiency for bilinear map formulae

Abstract

In 2012, Barbulescu, Detrey, Estibals and Zimmermann proposed a new framework to exhaustively search for optimal formulae for evaluating bilinear maps, such as Strassen or Karatsuba formulae. The main contribution of this work is a new criterion to aggressively prune useless branches in the exhaustive search, thus leading to the computation of new optimal formulae, in particular for the short product modulo X 5 and the circulant product modulo (X 5 -- 1). Moreover , we are able to prove that there is essentially only one optimal decomposition of the product of 3 x 2 by 2 x 3 matrices up to the action of some group of automorphisms.

Tables7

Table 1. Table 1: Comparison of the cardinality for ℓ = 3 ℓ 3 \ell=3 of three coverings of T 𝑇 {T} for K = 𝔽 2 𝐾 subscript 𝔽 2 K=\mathbb{F}_{2} .

set	cardinality
$𝒮_{2} (Span (\emptyset)) = 𝒮_{2}$	$980$
$𝒮_{3} (Span (Φ_{0}))$	$28$
$𝒮_{4} (Span (Φ_{0}, Φ_{1}))$	$6$

Table 2. Table 2: Timings for our approach to compute the sets Ω d subscript Ω 𝑑 \Omega_{d} over K = 𝔽 2 𝐾 subscript 𝔽 2 K=\mathbb{F}_{2} on a single core of a 3.3 GHz Intel Core i5-4590.

set	$Ω_{1}$	$Ω_{2}$	$Ω_{3}$	$Ω_{4}$	$Ω_{5}$	$Ω_{6}$	$Ω_{7}$	$Ω_{8}$
cardinality	$1$	$3$	$9$	$31$	$141$	$969$	$11, 289$	$265, 577$
upper bound	$1$	$9$	$4.4 \cdot 10^{2}$	$9.9 \cdot 10^{4}$	$9.5 \cdot 10^{7}$	$3.8 \cdot 10^{11}$	$6.1 \cdot 10^{15}$	$4.0 \cdot 10^{20}$
time (s)	$0$	$4.0 \cdot 10^{- 2}$	$6.0 \cdot 10^{- 2}$	$1.8 \cdot 10^{- 1}$	$1.5$	$1.8 \cdot 10$	$4.7 \cdot 10^{2}$	$1.8 \cdot 10^{4}$

Table 3. Table 3: Timings obtained with Algorithm BDEZ and BDEZStab for various bilinear maps over K = 𝔽 2 𝐾 subscript 𝔽 2 K=\mathbb{F}_{2} .

bilinear map	rank	algorithm	nb. of tests	time (s)
${MatProduct}_{(2, 2, 2)}$	$7$	BDEZ	$1.05 \cdot 10^{6}$	$8.5 \cdot 10$
${MatProduct}_{(2, 2, 2)}$	$7$	BDEZStab	$6.8 \cdot 10^{3}$	$5.0 \cdot 10^{- 1}$
${MatProduct}_{(3, 2, 3)}$	$15$	BDEZ	$9.2 \cdot 10^{19}$ (est.)	$1.1 \cdot 10^{17}$ (est.)
${MatProduct}_{(3, 2, 3)}$	$15$	BDEZStab	$2.6 \cdot 10^{13}$ (est.)	$3.4 \cdot 10^{10}$ (est.)
		CoveringSetsMethod	$1.6 \cdot {𝟏𝟎}^{𝟗}$	$8.5 \cdot {𝟏𝟎}^{𝟓}$
${MatProduct}_{(2, 3, 2)}$	$11$	BDEZ	$2.3 \cdot 10^{23}$ (est.)	$2.7 \cdot 10^{20}$ (est.)
		BDEZStab	$4.6 \cdot 10^{18}$ (est.)	$5.4 \cdot 10^{15}$ (est.)
		CoveringSetsMethod	$6.3 \cdot {𝟏𝟎}^{𝟏𝟎}$	$4.1 \cdot {𝟏𝟎}^{𝟔}$
${ShortProduct}_{3}$	$5$	BDEZ	$5.9 \cdot 10^{2}$	$1.4 \cdot 10^{- 1}$
${ShortProduct}_{3}$	$5$	BDEZStab	$3.4 \cdot 10$	$0.0$
${ShortProduct}_{4}$	$8$	BDEZ	$5.2 \cdot 10^{7}$	$4.3 \cdot 10^{3}$
		BDEZStab	$3.1 \cdot 10^{5}$	$2.7 \cdot 10$
		CoveringSetsMethod	$2.8 \cdot {𝟏𝟎}^{𝟐}$	$3.0$
${ShortProduct}_{5}$	$11$	BDEZ	$1.8 \cdot 10^{16}$ (est.)	$5.7 \cdot 10^{12}$ (est.)
		BDEZStab	$6.9 \cdot 10^{11}$ (est.)	$2.2 \cdot 10^{8}$ (est.)
		CoveringSetsMethod	$6.3 \cdot {𝟏𝟎}^{𝟔}$	$2.4 \cdot {𝟏𝟎}^{𝟑}$
${ShortProduct}_{6}$	$14$	BDEZ	$3.9 \cdot 10^{26}$ (est.)	$4.7 \cdot 10^{23}$ (est.)
${ShortProduct}_{6}$	$14$	BDEZStab	$2.0 \cdot 10^{19}$ (est.)	$2.7 \cdot 10^{16}$ (est.)
${CirculantProduct}_{3}$	$4$	BDEZ	$36$	$0.0$
${CirculantProduct}_{3}$	$4$	BDEZStab	$6$	$0.1 \cdot 10^{- 2}$
${CirculantProduct}_{4}$	$8$	BDEZ	$5.2 \cdot 10^{7}$	$4.3 \cdot 10^{3}$
${CirculantProduct}_{4}$	$8$	BDEZStab	$3.1 \cdot 10^{5}$	$2.7 \cdot 10$
${CirculantProduct}_{5}$	$10$	BDEZ	$4.0 \cdot 10^{13}$ (est.)	$1.2 \cdot 10^{10}$ (est.)
		BDEZStab	$1.0 \cdot 10^{10}$ (est.)	$3.5 \cdot 10^{6}$ (est.)
		CoveringSetsMethod	$8.8 \cdot {𝟏𝟎}^{𝟖}$	$5.4 \cdot {𝟏𝟎}^{𝟑}$
${CirculantProduct}_{6}$	$12$	BDEZ	$1.0 \cdot 10^{20}$ (est.)	$1.3 \cdot 10^{17}$ (est.)
${CirculantProduct}_{6}$	$12$	BDEZStab	$1.1 \cdot 10^{15}$ (est.)	$1.5 \cdot 10^{12}$ (est.)

Table 4. Table 4: Computation of elements of 𝒮 15 ( T 3 , 2 , 3 ) subscript 𝒮 15 subscript 𝑇 3 2 3 \mathscr{S}_{{15}}(T_{3,2,3}) .

set	cardinality	nb. tests	time (s)	nb. of solutions found
${\tilde{ℰ}}_{0}$	$8.8 \cdot 10$	$1.2 \cdot 10^{8}$	$2.0 \cdot 10^{5}$	$5$
${\tilde{ℰ}}_{1}$	$7.5 \cdot 10^{5}$	$2.2 \cdot 10^{7}$	$3.3 \cdot 10^{5}$	$13$
${\tilde{ℰ}}_{2}$	$1.0 \cdot 10^{4}$	$2.8 \cdot 10^{5}$	$4.1 \cdot 10^{2}$	$1$
${\tilde{ℰ}}_{3}$	$2.7 \cdot 10^{5}$	$5.9 \cdot 10^{8}$	$9.1 \cdot 10^{5}$	$46$
${\tilde{ℰ}}_{4}^{'}$	$2.5 \cdot 10^{7}$	$9.1 \cdot 10^{8}$	$1.3 \cdot 10^{6}$	$2$

Table 5. Table 5: Computation of 𝒮 11 ( T 2 , 3 , 2 ) subscript 𝒮 11 subscript 𝑇 2 3 2 \mathscr{S}_{{11}}(T_{2,3,2}) .

set	cardinality	nb. tests	time (s)	nb. of solutions found
${\tilde{ℰ}}_{0}$	$139$	$5.0 \cdot 10^{4}$	$6.2 \cdot 10^{4}$	$44$
${\tilde{ℰ}}_{1}$	$3.8 \cdot 10^{8}$	$6.3 \cdot 10^{10}$	$4.1 \cdot 10^{6}$	$5, 614$

Table 6. Table 6: Computation of 𝒮 r ( T ) subscript 𝒮 𝑟 𝑇 \mathscr{S}_{{r}}(T) .

bilinear map	nb. of tests	time (s)	nb. of solutions	equivalence classes
${ShortProduct}_{4}$	$2.8 \cdot 10^{2}$	$3.0$	$1, 440$	$220$
${ShortProduct}_{5}$	$6.3 \cdot 10^{6}$	$2.4 \cdot 10^{3}$	$146, 944$	$11, 424$

Table 7. Table 7: Computation of 𝒮 10 ( T ) subscript 𝒮 10 𝑇 \mathscr{S}_{{10}}(T) .

set	cardinality	nb. tests	time (s)	nb. of solutions found
${\tilde{ℰ}}_{0}$	$5.2 \cdot 10$	$8.7 \cdot 10^{7}$	$3.1 \cdot 10^{3}$	$0$
${\tilde{ℰ}}_{1}$	$2.0 \cdot 10^{3}$	$6.7 \cdot 10^{5}$	$2.4 \cdot 10^{2}$	$264$

Equations208

A = a_{0} + a_{1} X and B = b_{0} + b_{1} X .

A = a_{0} + a_{1} X and B = b_{0} + b_{1} X .

Φ = Φ_{0} Φ_{1} Φ_{2} : (a, b) \mapsto a_{0} b_{0} a_{0} b_{1} + a_{1} b_{0} a_{1} b_{1} .

Φ = Φ_{0} Φ_{1} Φ_{2} : (a, b) \mapsto a_{0} b_{0} a_{0} b_{1} + a_{1} b_{0} a_{1} b_{1} .

Φ = ϕ_{0} \cdot 100 + ϕ_{1} \cdot 010 + ϕ_{2} \cdot 010 + ϕ_{3} \cdot 001,

Φ = ϕ_{0} \cdot 100 + ϕ_{1} \cdot 010 + ϕ_{2} \cdot 010 + ϕ_{3} \cdot 001,

Φ = ϕ_{0} \cdot 1 - 1 0 + ψ \cdot 010 + ϕ_{2} \cdot 0 - 1 1 .

Φ = ϕ_{0} \cdot 1 - 1 0 + ψ \cdot 010 + ϕ_{2} \cdot 0 - 1 1 .

Φ = 0 \leq t < r \sum ϕ_{t} \cdot c_{t} .

Φ = 0 \leq t < r \sum ϕ_{t} \cdot c_{t} .

\forall h \in {0, \dots, ℓ - 1}, M_{h} \in Span ({N_{0}, \dots, N_{r - 1}}) .

\forall h \in {0, \dots, ℓ - 1}, M_{h} \in Span ({N_{0}, \dots, N_{r - 1}}) .

C = a_{0} b_{0} + (a_{0} b_{1} + a_{1} b_{0}) X + (a_{0} b_{2} + a_{1} b_{1} + a_{2} b_{0}) X^{2} .

C = a_{0} b_{0} + (a_{0} b_{1} + a_{1} b_{0}) X + (a_{0} b_{2} + a_{1} b_{1} + a_{2} b_{0}) X^{2} .

Φ_{0} : Φ_{1} : Φ_{2} : (a, b) \mapsto a_{0} b_{0}, (a, b) \mapsto a_{0} b_{1} + a_{1} b_{0}, (a, b) \mapsto a_{0} b_{2} + a_{1} b_{1} + a_{2} b_{0} .

Φ_{0} : Φ_{1} : Φ_{2} : (a, b) \mapsto a_{0} b_{0}, (a, b) \mapsto a_{0} b_{1} + a_{1} b_{0}, (a, b) \mapsto a_{0} b_{2} + a_{1} b_{1} + a_{2} b_{0} .

M_{0} = 100000000, M_{1} = 010100000, M_{2} = 001010100 .

M_{0} = 100000000, M_{1} = 010100000, M_{2} = 001010100 .

Φ \circ σ : (a, b) \mapsto Φ (μ (a), ν (b)) .

Φ \circ σ : (a, b) \mapsto Φ (μ (a), ν (b)) .

\forall a, b, ((Φ \circ σ) \circ σ^{'}) (a, b) = (Φ \circ σ) (μ^{'} (a), ν^{'} (b)) = Φ (μ (μ^{'} (a)), ν (ν^{'} (b))) = (Φ \circ (σ \circ σ^{'})) (a, b) .

\forall a, b, ((Φ \circ σ) \circ σ^{'}) (a, b) = (Φ \circ σ) (μ^{'} (a), ν^{'} (b)) = Φ (μ (μ^{'} (a)), ν (ν^{'} (b))) = (Φ \circ (σ \circ σ^{'})) (a, b) .

rk (T \circ σ) = rk (T) .

rk (T \circ σ) = rk (T) .

M_{1} = (1000), M_{2} = (0010) .

M_{1} = (1000), M_{2} = (0010) .

M_{1}^{'} = M_{1} \cdot σ = X^{T} \cdot M_{1} \cdot Y = (0001), M_{2}^{'} = M_{2} \cdot σ = X^{T} \cdot M_{2} \cdot Y = (0100) .

M_{1}^{'} = M_{1} \cdot σ = X^{T} \cdot M_{1} \cdot Y = (0001), M_{2}^{'} = M_{2} \cdot σ = X^{T} \cdot M_{2} \cdot Y = (0100) .

Stab (T) = {σ \in GL (K^{m}) \times GL (K^{n}) ∣ T \circ σ = T} .

Stab (T) = {σ \in GL (K^{m}) \times GL (K^{n}) ∣ T \circ σ = T} .

\forall σ \in Stab (T), S_{r} (T) \circ σ = S_{r} (T),

\forall σ \in Stab (T), S_{r} (T) \circ σ = S_{r} (T),

Φ : a_{0} ⋮ a_{ℓ - 1}, b_{0} ⋮ b_{ℓ - 1} \mapsto c_{0} ⋮ c_{ℓ - 1}

Φ : a_{0} ⋮ a_{ℓ - 1}, b_{0} ⋮ b_{ℓ - 1} \mapsto c_{0} ⋮ c_{ℓ - 1}

\forall j \in {0, \dots, ℓ - 1}, M (0, \dots, 0, 1, j zeros 0, \dots, 0) = N^{ℓ - 1 - j},

\forall j \in {0, \dots, ℓ - 1}, M (0, \dots, 0, 1, j zeros 0, \dots, 0) = N^{ℓ - 1 - j},

(Ψ \circ σ, Ψ^{'} \circ σ) = (I, N);

(Ψ \circ σ, Ψ^{'} \circ σ) = (I, N);

Φ_{p, q, r} : M_{p, q} (K) \times M_{q, r} (K) (A, B) ⟶ ⟼ M_{p, r} (K) A \cdot B .

Φ_{p, q, r} : M_{p, q} (K) \times M_{q, r} (K) (A, B) ⟶ ⟼ M_{p, r} (K) A \cdot B .

T_{p, q, r} = Span ({Φ_{i, j}}_{i, j}) .

T_{p, q, r} = Span ({Φ_{i, j}}_{i, j}) .

1000010000000000, 0000000010000100, 0010000100000000, 0000000000100001,

1000010000000000, 0000000010000100, 0010000100000000, 0000000000100001,

(1000) \otimes I_{2}, (0010) \otimes I_{2}, (0100) \otimes I_{2}, (0001) \otimes I_{2},

(1000) \otimes I_{2}, (0010) \otimes I_{2}, (0100) \otimes I_{2}, (0001) \otimes I_{2},

Φ_{i, j} = 0 \leq h < q \sum e_{i} \otimes f_{h} \otimes f_{h} \otimes g_{j} .

Φ_{i, j} = 0 \leq h < q \sum e_{i} \otimes f_{h} \otimes f_{h} \otimes g_{j} .

1000010010100101, 1000010000100001, 0010000110000100, 1000010000100001,

1000010010100101, 1000010000100001, 0010000110000100, 1000010000100001,

1000110000100011, 1000110000100011, 0100100000010010, 0100100000010010,

1000110000100011, 1000110000100011, 0100100000010010, 0100100000010010,

1000010000100001, 1000010010100101, 1000010000100001, 0010000110000100 .

1000010000100001, 1000010010100101, 1000010000100001, 0010000110000100 .

1000010000000000 .

1000010000000000 .

1000010000100001 .

1000010000100001 .

S_{r} (T) \subset {T + V ∣ \exists i \in {0, \dots, g - 1}, V \in E_{i, r} \circ Stab (T)} .

S_{r} (T) \subset {T + V ∣ \exists i \in {0, \dots, g - 1}, V \in E_{i, r} \circ Stab (T)} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Improved method for finding optimal formulae for bilinear maps in a finite field

Svyatoslav Covanov

[email protected]

Université de Lorraine, LORIA, UMR 7503, Vandoeuvre-lès-Nancy, F-54506, France

Inria, Villers-lès-Nancy, F-54600, France

CNRS, LORIA, UMR 7503, Vandoeuvre-lès-Nancy, F-54506, France

Abstract

In 2012, Barbulescu, Detrey, Estibals and Zimmermann proposed a new framework to exhaustively search for optimal formulae for evaluating bilinear maps over finite fields, such as Strassen or Karatsuba formulae. The main contribution of this work is a new criterion to aggressively prune useless branches in the exhaustive search, thus leading to the computation of new optimal formulae. We apply in particular our approach to the short product modulo $X^{5}$ and the circulant product modulo $(X^{5}-1)$ . Moreover, we are able to prove that there is essentially only one optimal decomposition of the product of $3\times 2$ by $2\times 3$ matrices up to the action of some group of automorphisms.

keywords:

bilinear rank, optimal formulae, polynomial multiplication, matrix multiplication, finite field arithmetic, bilinear map

1 Introduction

Finding optimal formulae for computing bilinear maps is a problem of algebraic complexity theory [7, 6, 26, 15], initiated by the discoveries of Karatsuba and Ofman [16] and Strassen [26]. It consists in determining almost optimal algorithms for important problems of complexity theory, among which the well studied complexity of matrix multiplication [26, 21, 9, 18] and the complexity of polynomial multiplication [16, 27, 24, 11].

As far as polynomial multiplication is concerned, the first improvement over the schoolbook method came from Karatsuba and Ofman [16] in 1962, who proposed a decomposition of the bilinear map associated to the product of two polynomials of degree $1$

[TABLE]

Using the schoolbook algorithm, computing the product $A\cdot B$ requires ${4}$ multiplications over the coefficient ring: $a_{0}b_{0}$ , $a_{1}b_{0}$ , $a_{0}b_{1}$ , $a_{1}b_{1}$ . With the algorithm proposed by Karatsuba, the coefficients of the product $A\cdot B$ can be retrieved from the computation of the ${3}$ following multiplications: $a_{0}b_{0}$ , $(a_{0}+a_{1})(b_{0}+b_{1})$ , $a_{1}b_{1}$ . In particular, Karatsuba’s algorithm can be applied recursively to improve the binary complexity of the multiplication of two $n$ -bit integers: instead of $O(n^{2})$ with the naive schoolbook algorithm, we obtain $O(n^{\log_{2}3})$ .

In 1969, Strassen [26] proposed formulae improving on the cost of the product of two $2\times 2$ matrices. When applied recursively on large matrices, this leads to a binary complexity of $O(n^{\log_{2}7})$ instead of $O(n^{\log_{2}8})=O(n^{3})$ . Smirnov describes in [25] practical algorithms for matrices of higher dimensions. One can notice that, most of the time, optimal algorithms for matrix multiplication are unknown. For example, it is possible compute the product of $3\times 3$ matrices over $\mathbb{C}$ with $23$ multiplications [17], but the best known lower bound is still $19$ [4].

State of the art

An obstacle to finding optimal formulae is the fact that the decomposition of bilinear maps is known to be NP-hard [14]. In terms of method, the least-squares method seems to be one of the most popular [25]. Another way to decompose a bilinear map consists in using ingredients from geometry [3] and to find a generalization of the decomposition of singular value decomposition for matrices to general tensors. However, these methods are essentially used over an algebraically closed field $K$ (e.g. $K=\mathbb{C}$ ) and are not meant to produce all the possible decompositions for a bilinear map. In our context, we are looking for a method computing optimal formulae (for the bilinear rank) over a finite field $K$ . These formulae can be used for the same bilinear map over any extension of $K$ . Thus, they can be used in the context of the asymptotic multiplication of polynomials over a finite field for example. Furthermore, for a set of formulae of $\mathbb{Q}$ , we can deduce formulae over a finite field $K$ . It should be possible, given all the optimal formulae over $K$ to obtain formulae over $\mathbb{Q}$ . In other terms, finding optimal formulae over finite fields can be used to improve on the multiplication algorithms over larger fields.

Montgomery proposed in [19] an algorithm to compute such a decomposition for the particular case of polynomials of small degree over a finite field. The author takes advantage of the fact that the number of possible formulae is always finite on a finite field. He obtains new formulae for the multiplication of polynomials of degree $4$ , $5$ and $6$ over $\mathbb{F}_{2}$ . In [20], Oseledets proposes a heuristic approach to solve the bilinear rank problem for the polynomial product over $\mathbb{F}_{2}$ . Later, Barbulescu et al. proposed in [1] a unified framework, extending the idea proposed by Oseledets. This allows the authors to compute the bilinear rank of different applications, such as the short product or the middle product over a finite field. Their algorithm allows one to generate all the possible rank decompositions of any bilinear map over a finite field. We extend this work in the current article.

Contributions

The work presented is an improvement to the algorithm introduced in [1], allowing one to increase the family of bilinear maps over a finite field for which we are able to compute all the optimal formulae. Our algorithm relies on the automorphism group stabilizing a bilinear map, and on the notion of “stem” of a vector space associated to such a bilinear map. The main theorem of this work is Theorem 27 and it states that Algorithm 4 is able to find all decompositions of a bilinear map over a finite field. It can be used for proving lower bounds on the rank of a bilinear map and it has applications for improving upper bounds on the Chudnovsky-Chudnovsky algorithms [8, 23, 22]. Specifically, we compute all the decompositions for the short product of polynomials $P$ and $Q$ modulo $X^{5}$ and the product of $3\times 2$ by $2\times 3$ matrices. The latter problem was out of reach with the method used in [1]. We prove, in particular, that the set of possible decompositions for this matrix product is essentially unique, up to the action of the automorphism group. It is difficult to propose a complexity analysis showing the impact of our method, since it takes into account intrinsic properties of the bilinear maps that are considered.

Roadmap

This article is organized as follows. In Section 2, we present the theoretical tools and the framework for this article, corresponding to the framework introduced in [1]. In Section 3, we present, with kind permission of the authors, unpublished improvements [2] taking into account the symmetries of bilinear maps. In Section 4, we describe the algebraic structure of specific bilinear maps. This section can be skipped on a first read, because it is only required in proofs of the following section. In Section 5, we describe the theoretical aspect of our main contribution, which relies on the construction of coverings, and illustrate it with the examples of the short product and the matrix product. We discuss specific algorithmic aspects in Section 6: this part is quite technical and can be skipped on a first read. Finally, experimental timings are given in Section 7.

2 Preliminaries

We present in this section the definition of the mathematical objects that we manipulate in this work and we define the bilinear rank. We choose the characterization given by de Groote [10] or Bürgisser et al. [7, Ch. 14]. In particular, we introduce here the framework of [1] and the underlying linear algebra problem.

2.1 Problem statement

Let ${K}$ be a field. Given a bilinear map $\mathbf{\Phi}:{K}^{m}\times{K}^{n}\rightarrow{K}^{\ell}$ , the bilinear rank problem consists in finding the minimal number of multiplications between scalars used for evaluating $\mathbf{\Phi}$ . The set $\mathcal{L}(K^{m},K^{n};K^{\ell})$ denotes the set of bilinear maps from $K^{m}\times K^{n}$ to $K^{\ell}$ . Any bilinear map $\mathbf{\Phi}$ from $K^{m}\times K^{n}$ to $K^{\ell}$ can be seen as an element of $\mathcal{L}(K^{m},K^{n};K)^{\ell}$ , whose coordinates are the bilinear forms $(\Phi_{h})_{0\leq h<\ell}$ .

Example 1 (Multiplication of linear polynomials).

Let $A=a_{0}+a_{1}X$ and $B=b_{0}+b_{1}X$ be two polynomials over $K$ . The product $A\cdot B$ is associated to the bilinear map $\mathbf{\Phi}$ taking as input the vectors $\mathbf{a}=(a_{0},a_{1})$ and $\mathbf{b}=(b_{0},b_{1})$ such that

[TABLE]

Denoting by $\phi_{0}$ , $\phi_{1}$ , $\phi_{2}$ and $\phi_{3}$ the bilinear forms $(\mathbf{a},\mathbf{b})\mapsto a_{0}b_{0}$ , $(\mathbf{a},\mathbf{b})\mapsto a_{0}b_{1}$ , $(\mathbf{a},\mathbf{b})\mapsto a_{1}b_{0}$ and $(\mathbf{a},\mathbf{b})\mapsto a_{1}b_{1}$ , respectively, we have

[TABLE]

which corresponds to the schoolbook algorithm.

Let $\psi$ be an element of $\mathcal{L}(K^{2},K^{2};K)$ such that $\psi:(\mathbf{a},\mathbf{b})\mapsto(a_{0}+a_{1})(b_{0}+b_{1})$ . Then, since $\phi_{1}+\phi_{2}=\psi-\phi_{0}-\phi_{3}$ , we can rewrite $\mathbf{\Phi}$ as

[TABLE]

The bilinear forms $\phi_{0}$ , $\psi$ and $\phi_{2}$ each correspond to exactly one multiplication over $K$ . This decomposition corresponds to the Karatsuba algorithm. Thus, we can deduce that the bilinear rank of $\mathbf{\Phi}$ is at most $3$ . Actually, one can show that the bilinear rank of $\mathbf{\Phi}$ is equal to $3$ .

Formally, a bilinear form $\phi\in\mathcal{L}(K^{m},K^{n};K)$ is said to have rank one if there exist two linear forms $\alpha\in\mathcal{L}(K^{m};K)$ and $\beta\in\mathcal{L}(K^{n};K)$ such that $\phi(\mathbf{a},\mathbf{b})=\alpha(\mathbf{a})\cdot\beta(\mathbf{b})$ . For $i\in\{0,\ldots,m-1\}$ and $j\in\{0,\ldots,n-1\}$ , we denote by $e_{i,j}$ the bilinear forms $e_{i,j}:(\mathbf{a},\mathbf{b})\mapsto a_{i}b_{j}$ . The $e_{i,j}$ ’s have rank one and form the canonical basis of $\mathcal{L}(K^{m},K^{n};K)$ . This implies that any bilinear form can be expressed as a linear combination of bilinear forms of rank one.

Definition 2 (Bilinear rank).

The rank of a bilinear form $\Phi$ , denoted by $\operatorname{rk}(\Phi)$ , is defined as the minimal number of bilinear forms $\phi_{t}$ of rank one such that $\Phi$ is a linear combination of the $\phi_{t}$ ’s. Then, a family $(\phi_{t})_{t}$ of cardinality $\operatorname{rk}(\Phi)$ is said to be an optimal decomposition of $\Phi$ .

We extend this definition to bilinear maps $\mathbf{\Phi}\in\mathcal{L}(K^{m},K^{n};K)^{\ell}$ : the rank $r$ of $\mathbf{\Phi}$ is the cardinality of a minimal set of bilinear forms $(\phi_{t})_{0\leq t<r}$ of rank one for which there exist vectors $\mathbf{c}_{t}\in K^{\ell}$ such that

[TABLE]

We have a matrix equivalent of Definition 2. Indeed, for $\Phi\in\mathcal{L}(K^{m},K^{n};K)$ , there exists a matrix $M\in\mathcal{M}_{m,n}(K)$ such that $\Phi(\mathbf{a},\mathbf{b})={\mathbf{a}}^{\mathrm{T}}\cdot M\cdot\mathbf{b}$ for $\mathbf{a}\in K^{m}$ and $\mathbf{b}\in K^{n}$ . In this situation, the usual matrix rank of $M$ is equal to the rank of $\Phi$ defined as above. Let $\mathbf{\Phi}=(\Phi_{0},\ldots,\Phi_{\ell-1})$ be a bilinear map of rank $r$ , for which each $\Phi_{h}$ for $0\leq h<\ell$ is represented by $M_{h}\in\mathcal{M}_{m,n}(K)$ . Consequently, there exists a set of $r$ matrices $N_{t}\in\mathcal{M}_{m,n}(K)$ of rank one such that

[TABLE]

Example 3 (Short product of polynomials of degree $2$ ).

We describe in this example the matrices associated to the short product of two polynomials of degree $2$ .

Let $A$ and $B$ be the polynomials $A=a_{0}+a_{1}X+a_{2}X^{2}$ and $B=b_{0}+b_{1}X+b_{2}X^{2}$ . We denote by $C$ the polynomial $A\cdot B\bmod X^{3}$ :

[TABLE]

We consider $A$ and $B$ as vectors of $K^{3}$ denoted by $\mathbf{a}$ and $\mathbf{b}$ , respectively. Let $\Phi_{0}$ , $\Phi_{1}$ and $\Phi_{2}$ be bilinear forms defined as

[TABLE]

In order to represent the corresponding matrices, we use the canonical basis for $\mathcal{L}(K^{3},K^{3};K)$ , i.e. the bilinear forms $e_{i,j}$ satisfying $e_{i,j}:(\mathbf{a},\mathbf{b})\mapsto a_{i}b_{j}$ , for $0\leq i,j<3$ . Then, the matrices $M_{h}$ associated to $\Phi_{h}$ are

[TABLE]

2.2 A linear algebra problem

The approach of [1] consists in computing the rank of a bilinear map $\mathbf{\Phi}=(\Phi_{0},\ldots,\Phi_{\ell-1})$ by considering $T=\operatorname{Span}(\{\Phi_{0},\ldots,\Phi_{\ell-1}\})$ , which is a subspace of $\mathcal{L}(K^{m},K^{n};K)$ . Indeed, finding formulas for computing the $\Phi_{t}$ ’s is equivalent to finding a family of rank-one bilinear forms generating $T$ . Thus, we need to extend the definition of the rank to subspaces of $\mathcal{L}(K^{m},K^{n};K)$ .

Notation 4.

For $T$ a subspace of $\mathcal{L}(K^{m},K^{n};K)$ , we denote by $\mathscr{S}_{{m,n,r}}(T)$ the set of subspaces $V\subset\mathcal{L}(K^{m},K^{n};K)$ spanned by a free family of rank-one bilinear forms of size $r$ such that $T\subset V$ .

When $T=\operatorname{Span}(\emptyset)$ , $\mathscr{S}_{{m,n,r}}(T)$ is the set of subspaces $V\in\mathcal{L}(K^{m},K^{n};K)$ spanned by a free family of rank-one bilinear forms of size $r$ and we denote it simply by $\mathscr{S}_{{m,n,r}}$ .

When $m$ and $n$ are clear from the context, these sets are simply denoted by $\mathscr{S}_{{r}}(T)$ and $\mathscr{S}_{{r}}$ .

We use Notation 4 to define the rank of a subspace $T\in\mathcal{L}(K^{m},K^{n};K)$ in Definition 5.

Definition 5 (Rank of a subspace of $\mathcal{L}(K^{m},K^{n};K)$ ).

Let $T$ be a subspace of $\mathcal{L}(K^{m},K^{n};K)$ . We denote by $\operatorname{rk}(T)$ the smallest $r$ such that $\mathscr{S}_{{r}}(T)\neq\emptyset$ . The set $\mathscr{S}_{{\operatorname{rk}(T)}}(T)$ is the said to be the set of optimal decompositions of $T$ .

We observe that $\operatorname{rk}(T)\geq\dim(T)$ .

Let $\mathbf{\Phi}=(\Phi_{0},\ldots,\Phi_{\ell-1})\in\mathcal{L}(K^{m},K^{n};K)^{\ell}$ and $T=\operatorname{Span}(\{\Phi_{0},\ldots,\Phi_{\ell-1}\})\subset\mathcal{L}(K^{m},K^{n};K)$ . Decomposing a bilinear map $\mathbf{\Phi}\in\mathcal{L}(K^{m},K^{n};K)^{\ell}$ into linear combination of $r$ rank-one bilinear forms is equivalent to computing $\mathscr{S}_{{r}}(T)$ . Our approach focuses on the latter point of view, which is also the point of view taken by Algorithm [1, Alg. 1].

General strategy for computing the bilinear rank

Taking into account the formalism proposed in Section 2.2, the algorithmic strategy we use to compute the bilinear rank of a bilinear map is stated as follows.

Let $T=\operatorname{Span}(\{\Phi_{0},\ldots,\Phi_{\ell-1}\})\subset\mathcal{L}(K^{m},K^{n};K)$ of dimension $\ell$ ;

2.

start with the known lower bound $r=\ell$ on the bilinear rank;

3.

compute $\mathscr{S}_{{r}}(T)$ ;

4.

if $\mathscr{S}_{{r}}(T)=\emptyset$ , increment $r$ and return to the previous step;

5.

if $\mathscr{S}_{{r}}(T)\neq\emptyset$ , $r$ is the bilinear rank and $\mathscr{S}_{{r}}(T)$ the set of optimal decompositions.

2.3 The BDEZ Algorithm (Barbulescu, Detrey, Estibals, Zimmermann)

We describe in this section Algorithm [1, Alg. 1], which is a recursive method to solve the bilinear rank problem for a bilinear map $\mathbf{\Phi}=(\Phi_{0},\ldots,\Phi_{\ell-1})$ over a finite field. As described above, this is essentially equivalent to computing $\mathscr{S}_{{r}}(T)$ for $T=\operatorname{Span}(\{\Phi_{0},\ldots,\Phi_{\ell-1}\})$ of dimension $\ell$ .

In order to get all the vector spaces $V\in\mathscr{S}_{{r}}$ such that $T\subset V$ , we compute the vector spaces $W\in\mathscr{S}_{{r-\ell}}$ such that $T\oplus W\in\mathscr{S}_{{r}}$ . In other terms, instead of enumerating all the elements of $\mathscr{S}_{{r}}$ , we rather enumerate complementary subspaces of $T$ in $\mathscr{S}_{{r-\ell}}$ . This restriction can be done thanks to Proposition [1, Prop. 1], reformulated as Proposition 6 using the formalism of Section 2.2.

Proposition 6.

Let $T$ be a subspace of dimension $\ell$ of $\mathcal{L}(K^{m},K^{n};K)$ , let $r\geq\ell$ be an integer. For any $V\in\mathscr{S}_{{r}}(T)$ , there exists $W\in\mathscr{S}_{{r-\ell}}$ such that $T\oplus W=V$ .

Proof.

Let $\mathcal{B}$ be a basis of $V$ composed of rank-one matrices. We define inductively a sequence of subspaces $(W_{t})_{0\leq t\leq r-\ell}$ , such that for any $t$ we have $W_{t}\in\mathscr{S}_{{t}}$ , as follows.

The set $W_{0}$ is the null subspace and satisfies $T\oplus W_{0}\subset V$ and $\dim{T\oplus W_{0}}=\ell$ .

2.

For $t\in\left\{1,\ldots,r-\ell\right\}$ , assuming that $T\oplus W_{t-1}\subset V$ and $\dim{(T\oplus W_{t-1})}=\ell+t-1$ , there exists $\Phi\in\mathcal{B}$ such that $\Phi\not\in T\oplus W_{t-1}$ (otherwise $T\oplus W_{t-1}=V$ and $\dim{V}\leq r-1$ , which is a contradiction). Then, we define $W_{t}$ as $W_{t}=W_{t-1}\oplus\operatorname{Span}(\{\Phi\})$ . The subspace $W_{t}$ satisfies $T\oplus W_{t}\subset V$ , $\dim{(T\oplus W_{t})}=\ell+t$ and $W_{t}\in\mathscr{S}_{{t}}$ .

Taking $W=W_{r-\ell}$ , Proposition 6 is proved. ∎

We denote by $\mathcal{G}$ the set of rank-one bilinear forms up to a multiplicative factor, isomorphic to $\mathscr{S}_{{m,n,1}}$ . In a finite field, $\mathcal{G}$ is a finite set of cardinality ${(\#K^{m}-1)(\#K^{n}-1)}/{(\#K-1)^{2}}$ . Algorithm BDEZ requires a test to determine whether, for $V\in\mathcal{L}(K^{m},K^{n};K)$ of dimension $r$ , we have $V\in\mathscr{S}_{{r}}$ : we denote by HasRankOneBasis this test. A naive method to perform this test is described in Algorithm 1. We could think of other methods based on solving bilinear systems, but it does not seem efficient in our applications. However, an optimized version of this algorithm is used for particular bilinear maps (such as product of $2\times 3$ by $3\times 2$ matrices, for example).

Algorithm BDEZ can be described as a recursive optimized version of the backtracking method constructing all the sets of cardinality $r-\ell$ of independent bilinear forms of rank one. The input of the first call to BDEZ is: a target subspace $T$ of dimension $\ell$ and an integer $r$ ( $r$ is a lower bound on the rank of $T$ , as explained at the end of Section 2.2).

Algorithm BDEZ takes into account, on Line 9, the equivalence relation “modulo $V$ ”: two distinct elements $\phi$ and $\phi^{\prime}$ of $\mathcal{H}$ may be such that $V+\operatorname{Span}(\{\phi\})=V+\operatorname{Span}(\{\phi^{\prime}\})$ . Reducing each element of $\mathcal{H}$ against $V$ (via Gauss reduction) allows us to consider a single representative for each such equivalence class modulo $V$ . A similar reduction is performed on Line 15 to compute $\mathcal{G}\bmod T$ .

The recursive calls of this algorithm can be represented by a tree in which each node at depth $r-\ell$ corresponds to a vector space $T\oplus W_{u_{1},u_{2},\ldots,u_{r-\ell}}$ of dimension $r$ generated by a basis of $T$ and rank-one matrices $\phi_{u_{1}},\phi_{u_{2}},\ldots,\phi_{u_{r-\ell}}$ . For example, assuming that the initial set of rank-one bilinear forms is $\mathcal{G}=\{\phi_{0},\phi_{1},\phi_{2},\phi_{3}\}$ and ignoring the reductions computed on Line 9, we would obtain generically, for $r-\ell=3$ , the tree given in Figure 1.

3 Improving on BDEZ using symmetries

We present in this section, with kind permission from the authors, an unpublished improvement [2] to Algorithm BDEZ. This improvement takes into account the fact that we can define rank-preserving automorphisms of $\mathcal{L}(K^{m},K^{n};K)$ . Their action is defined in Section 3.1.

3.1 Action of automorphisms on $\mathcal{L}(K^{m},K^{n};K)$

We work with subspaces of $\mathcal{L}(K^{m},K^{n};K)$ rather than with bilinear maps, as in Section 2.2. We describe in this section the rank-preserving group of automorphisms $\sigma$ acting on subspaces $T\subset\mathcal{L}(K^{m},K^{n};K)$ , also referred to as the $\operatorname{RP}$ -automorphisms group.

Definition 7.

An element $\sigma=(\mu,\nu)\in\operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}})$ acts on $\mathcal{L}(K^{m},K^{n};K)$ via

[TABLE]

Such an element is called $\operatorname{RP}$ -automorphism.

Proposition 8.

The action of $\operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}})$ is a group action and its elements are all invertible.

Proof.

For $\sigma=(\mu,\nu),\sigma^{\prime}=(\mu^{\prime},\nu^{\prime})\in\operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}})$ and $\Phi\in\mathcal{L}(K^{m},K^{n};K)$ , we have

[TABLE]

Thus, the action that we defined is indeed a group action. Since all the elements of $\operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}})$ are invertible, we have automorphisms. ∎

Proposition 9 ( $\operatorname{RP}$ -automorphisms preserve the rank).

Let $\sigma\in\operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}})$ .

For any $\Phi\in\mathcal{L}(K^{m},K^{n};K)$ , we have $\operatorname{rk}(\Phi\operatorname{\mathop{\circ}}\sigma)=\operatorname{rk}(\Phi)$ .

2.

For any subspace $T\subset\mathcal{L}(K^{m},K^{n};K)$ , we also have $\operatorname{rk}(T\operatorname{\mathop{\circ}}\sigma)=\operatorname{rk}(T)$ .

Proof.

First, let $\phi\in\mathcal{L}(K^{m},K^{n};K)$ of rank one. There exist $\alpha\in\mathcal{L}(K^{m};K)$ and $\beta\in\mathcal{L}(K^{n};K)$ such that $\phi:(\mathbf{a},\mathbf{b})\mapsto\alpha(\mathbf{a})\cdot\beta(\mathbf{b})$ . There exist $\mu\in\operatorname{GL}(K^{m})$ and $\nu\in\operatorname{GL}(K^{n})$ such that $\phi\operatorname{\mathop{\circ}}\sigma:(\mathbf{a},\mathbf{b})\mapsto\alpha(\mu(\mathbf{a}))\cdot\beta(\nu(\mathbf{b}))$ . Since $\alpha\operatorname{\mathop{\circ}}\mu\in\mathcal{L}(K^{m};K)$ and $\beta\operatorname{\mathop{\circ}}\nu\in\mathcal{L}(K^{n};K)$ , $\phi\operatorname{\mathop{\circ}}\sigma$ is a rank-one bilinear form.

Since the $\operatorname{RP}$ -automorphisms in Definition 7 preserve the rank of rank-one bilinear forms, by linearity and by definition of the rank of a bilinear form, it preserves the rank of any bilinear form. For any subspace $T\subset\mathcal{L}(K^{m},K^{n};K)$ and any $\sigma\in\operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}})$ , we have

[TABLE]

∎

Remark 10.

Note that, when $m=n$ , Proposition 7 is not the most general notion of $\operatorname{RP}$ -automorphisms that we may have: for simplicity, we do not take into account the possible transposition $\tau$ acting on any $\Phi\in\mathcal{L}(K^{m},K^{m};K)$ , via $\Phi\circ\tau:(\mathbf{a},\mathbf{b})\mapsto\Phi(\mathbf{b},\mathbf{a})$ .

Notation 11 (Group action on matrices).

The group $\operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}})$ is isomorphic to the group $\operatorname{GL}_{m}(K)\times\operatorname{GL}_{n}(K)$ , acting on matrices $M$ via $M\cdot(X,Y)={X}^{\mathrm{T}}\cdot M\cdot Y.$ Thus, we often consider elements of $\operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}})$ as elements of $\operatorname{GL}_{m}(K)\times\operatorname{GL}_{n}(K)$ and vice versa.

Example 12 (Action of $\operatorname{GL}({K^{2}})\times\operatorname{GL}({K^{2}})$ ).

Let us consider the subspace $V$ of $\mathcal{L}(K^{2},K^{2};K)$ generated by the bilinear forms represented by the matrices $M_{1}$ and $M_{2}$ defined as

[TABLE]

We take $\sigma=(X,Y)$ such that $X=Y=\begin{pmatrix}0&1\\ 1&0\end{pmatrix}$ .

The subspace $V^{\prime}=V\circ\sigma$ is generated by $M^{\prime}_{1}$ and $M^{\prime}_{2}$ , defined as

[TABLE]

Since we will often refer to subgroups of $\operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}})$ stabilizing elements of $\mathcal{L}(K^{m},K^{n};K)$ in the following, we define the notion of setwise stabilizer.

Definition 13 (Setwise stabilizer).

For a subset $\mathcal{T}\subset\mathcal{L}(K^{m},K^{n};K)$ , we denote by $\operatorname{Stab}(\mathcal{T})$ the subgroup of $\operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}})$ stabilizing $\mathcal{T}$ :

[TABLE]

We use the same notation for a subspace $T\subset\mathcal{L}(K^{m},K^{n};K)$ .

In the rest of this work, we often refer to the “stabilizer” of a given set $\mathcal{T}$ . Each time, we exclusively mean the setwise stabilizer of $\mathcal{T}$ , which is, in general, different from the pointwise stabilizer of $\mathcal{T}$ . Indeed, the pointwise stabilizer of $\mathcal{T}$ is defined as $\{\sigma\in\operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}})\ |\ \forall\Phi\in\mathcal{T},\ \Phi\circ\sigma=\Phi\}$ .

The algorithmic improvement originally presented in [2] comes from the fact that, for any target space $T\subset\mathcal{L}(K^{m},K^{n};K)$ of dimension $\ell$ and any integer $r\geq\ell$ , we have

[TABLE]

because $\sigma$ preserves the rank. Thus, we can restrict our interest to the computation of the quotient $\hbox{$ \mathscr{S}_{{r}}(T) $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{Stab}(T) $}$ instead of $\mathscr{S}_{{r}}(T)$ .

3.2 BDEZ with stabilizer

In order to find all the elements of $\mathscr{S}_{{r}}(T)$ , it is sufficient to obtain one representative per equivalence class of $\hbox{$ \mathscr{S}{{r}}(\operatorname{Span}(T)) $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{Stab}(T) $}$ , from which one can recover the whole orbits through the group action of $\operatorname{Stab}(T)$ . Moreover, we can compute $\hbox{$ \mathscr{S}{{r}}(T) $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{Stab}(T) $}$ faster than $\mathscr{S}_{{r}}(T)$ . Thus, we adapt our general strategy to this idea.

General strategy for computing the bilinear rank using $\operatorname{RP}$ -automorphisms

The new algorithmic strategy we are considering is stated as follows, for a target subspace $T\subset\mathcal{L}(K^{m},K^{n};K)$ of dimension $\ell$ and the associated subgroup $\operatorname{Stab}(T)$ of $\operatorname{RP}$ -automorphisms stabilizing $T$ :

start with an initial guess $r=\ell$ ;

2.

compute $\hbox{$ \mathscr{S}_{{r}}(T) $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{Stab}(T) $}$ (the set $\mathscr{S}_{{r}}(T)$ up to the action of $\operatorname{Stab}(T)$ );

3.

if $\hbox{$ \mathscr{S}_{{r}}(T) $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{Stab}(T) $}=\emptyset$ , increment $r$ and return to the previous step;

4.

enumerate $\mathscr{S}_{{r}}(T)$ using the action of $\operatorname{Stab}(T)$ ;

5.

at the end, $r$ is the rank and $\mathscr{S}_{{r}}(T)$ the set of optimal decompositions.

Algorithm BDEZStab is a recursive approach for the computation of one representative per equivalence class. The input of the first call to BDEZStab is: a target subspace $T$ of dimension $\ell$ , the group $\operatorname{Stab}(T)$ and an integer $r\geq\ell$ .

Figure 2 describes this recursive approach using a tree and illustrates how some branches are pruned, relying on Proposition 14. We assume that the initial set of rank-one bilinear forms is $\{\phi_{0},\phi_{1},\phi_{2},\phi_{3}\}$ and that we have $\sigma\in\operatorname{Stab}(T)$ such that $\sigma(\phi_{0})=\phi_{1}$ , $\sigma(\phi_{1})=\phi_{0}$ , $\sigma(\phi_{2})=\phi_{3}$ and $\sigma(\phi_{3})=\phi_{2}$ .

Proposition 14.

Let $T$ and $V$ be subspaces of $\mathcal{L}(K^{m},K^{n};K)$ such that $V\in\mathscr{S}_{{r}}(T)$ . Then, given the orbit $\phi\circ\operatorname{Stab}(T)$ of a bilinear form $\phi$ of rank one, if $V$ satisfies $V\cap\left(\phi\circ\operatorname{Stab}(T)\right)\neq\emptyset$ , then there exists an element $V^{\prime}$ in the equivalence class of $V$ for the action of $\operatorname{Stab}(T)$ and such that $\phi\in V^{\prime}$ .

Proof.

There exists $\sigma\in\operatorname{Stab}(T)$ such that $\phi\circ\sigma\in V$ . We can then take $V^{\prime}=V\circ(\sigma^{-1})$ , which meets all the conditions. ∎

The particularity of BDEZStab is that, instead of enumerating all the elements of $\mathcal{H}$ as in BDEZ, we restrict the enumeration to one element per equivalence class for the action of $U\subset\operatorname{Stab}(V)$ . We use in particular the fact that the additional computations such as stabilizers on Line 11 are negligible, compared to the speed-up obtained by pruning branches in BDEZ. Heuristically, BDEZStab is faster than BDEZ by a factor $\#\operatorname{Stab}{(T)}$ . This method constitutes the state of the art for the current work: our contribution is compared to the performance of this algorithm.

4 Algebraic structure of some bilinear maps

In this section, we describe the structure of some vector spaces corresponding to bilinear maps that are considered in our applications. This section can be skipped on a first read. It is needed to prove properties of specific bilinear maps that are stated in Section 5. In particular, we need to know the structure of the stabilizer of a vector space in order to be able to improve on the exhaustive search.

4.1 Short product

For the purpose of this section, we restrict our discussion to the specific case defined as follows. Let $\ell$ be a positive integer, let $\boldsymbol{\Phi}$ be the bilinear map $\boldsymbol{\Phi}\in\mathcal{L}(K^{\ell},K^{\ell};K^{\ell})$ defined by the short product

[TABLE]

such that $\sum_{0\leq i<\ell}c_{i}X^{\ell-1-i}=(\sum_{0\leq i<\ell}a_{i}X^{i})(\sum_{0\leq i<\ell}b_{i}X^{\ell-1-i})\mod X^{\ell}$ . Let $T$ be the subspace of $\mathcal{L}(K^{\ell},K^{\ell};K)$ spanned by the $\ell$ bilinear forms that are the coordinates of $\boldsymbol{\Phi}$ , denoted by $\Phi_{0},\ldots,\Phi_{\ell-1}$ .

The matrix representing the element $\sum_{0\leq i<\ell}m_{i}\Phi_{i}\in T$ , where $m_{i}\in K$ , is

$M(m_{0},\ldots,m_{\ell-1})=$

${m_{0}}$${m_{1}}$${m_{\ell-1}}$${0}$${m_{1}}$${0}$${0}$${m_{0}}$$\left.\vbox{\hrule height=30.22662pt,depth=30.22662pt,width=0.0pt}\right]$$\left[\vbox{\hrule height=30.22662pt,depth=30.22662pt,width=0.0pt}\right.$ ,

in the canonical basis. This matrix is an upper triangular Toeplitz matrix.

Let $N$ be the matrix $M(0,1,\ldots,0)$ . The matrix $N$ is a nilpotent matrix such that

[TABLE]

and $N^{\ell}=0$ . The elements of the algebra $K[N]$ are the upper triangular Toeplitz matrices and $K[N]\cong K[X]/(X^{\ell})$ .

We provide in Theorem 15 a useful property describing the action of $\operatorname{Stab}(T)$ on $T$ .

Theorem 15.

Let any integer $\ell\geq 2$ :

the orbit of the identity matrix $I=N^{0}$ for the action of $\operatorname{Stab}(T)$ is the set of invertible matrices of $T$ ; 2. 2.

the orbit of $N$ for the action of $\operatorname{Stab}(T)\cap\operatorname{Stab}(I)$ is the set of nilpotent matrices of $T$ ; 3. 3.

for any pair $(\Psi,\Psi^{\prime})$ of elements of $T$ such that $\operatorname{rk}(\Psi)=\ell$ and $\operatorname{rk}(\Psi^{\prime})=\ell-1$ , there exists $\sigma\in\operatorname{Stab}(T)$ such that

[TABLE] 4. 4.

we have $\operatorname{Stab}(I)\cap\operatorname{Stab}(N)\subset\operatorname{Stab}(T)$ and the cardinality of $\operatorname{Stab}(T)$ is $(\#K)^{3\ell-4}(\#K-1)^{3}$ .

Proof.

See A.1. ∎

4.2 Matrix product

We denote by $\boldsymbol{\Phi}_{p,q,r}$ the bilinear map corresponding to the $p\times q$ by $q\times r$ matrix product:

[TABLE]

We denote by $\Phi_{i,j}$ the bilinear forms such that $\Phi_{i,j}(A,B)$ is the coefficient $(i,j)$ of $\boldsymbol{\Phi}_{p,q,r}(A,B)$ for $i\in\left\{0,\ldots,p-1\right\},j\in\left\{0,\ldots,r-1\right\}$ . The elements $\Phi_{i,j}$ satisfy $\Phi_{i,j}(A,B)=\sum_{0\leq h<q}a_{i,h}b_{h,j}$ .

The bilinear map $\boldsymbol{\Phi}_{p,q,r}$ is represented by a subspace of $\mathcal{L}(K^{pq},K^{qr};K)$ denoted by

[TABLE]

In order to represent the elements of $T_{p,q,r}$ in terms of matrices of $\mathcal{M}_{pq,qr}$ , we need an order on the $a_{i,h}$ ’s and $b_{h,j}$ ’s.

For the $a_{i,h}$ ’s, we fix the following order: $a_{i,h}\leq a_{i^{\prime},h^{\prime}}$ if $i\leq i^{\prime}$ or $i=i^{\prime}$ and $h\leq h^{\prime}$ , which is the row-major order.

2.

For the $b_{h,j}$ ’s, we fix the following order: $b_{h,j}\leq b_{h^{\prime},j^{\prime}}$ if $j\leq j^{\prime}$ or $j=j^{\prime}$ and $h\leq h^{\prime}$ , which is the column-major order.

Then, in the bases of $\mathcal{M}_{p,q}$ and $\mathcal{M}_{q,r}$ given by the $a_{i,h}$ ’s and $b_{h,j}$ ’s ordered as above, the elements of $T_{p,q,r}$ can be represented as matrices of $\mathcal{M}_{pq,qr}$ divided in blocks of size $q\times q$ equal to $I_{q}$ the identity matrix of $\mathcal{M}_{q,q}$ . Consequently, this space is isomorphic to $\mathcal{M}_{p,r}\otimes I_{q}$ and all the elements of $T_{p,q,r}$ have a rank which is multiple of $q$ .

Example 16 (Matrix representation of elements of $T_{2,2,2}$ ).

The elements of $T_{2,2,2}$ are represented by matrices of $\mathcal{M}_{4,4}$ spanned by

[TABLE]

corresponding to the coefficients $a_{0,0}b_{0,0}+a_{0,1}b_{1,0}$ , $a_{0,0}b_{0,1}+a_{0,1}b_{1,1}$ , $a_{1,0}b_{0,0}+a_{1,1}b_{1,0}$ and $a_{1,0}b_{0,1}+a_{1,1}b_{1,1}$ , respectively. The previous matrices can also be expressed as

[TABLE]

respectively.

Let $(e_{i})$ , $(f_{h})$ and $(g_{j})$ be the canonical bases of $K^{p}$ , $K^{q}$ and $K^{r}$ . The subspace $T_{p,q,r}$ can be easily characterized with the tensor notation: it is generated by the vectors, for $i\in\left\{0,\ldots,p-1\right\},j\in\left\{0,\ldots,r-1\right\}$ ,

[TABLE]

Theorem 17.

For the group action $M\cdot(X,Y)\mapsto{X}^{\mathrm{T}}MY$ , the subgroup stabilizing the vector space $T_{p,q,r}$ can be described as the group given by the pairs $({P\otimes{R}^{\mathrm{T}}},Q\otimes(R^{-1}))$ for $P\in\operatorname{GL}_{p}$ , $R\in\operatorname{GL}_{q}$ , and $Q\in\operatorname{GL}_{r}$ .

Proof.

See A.2. ∎

Corollary 18.

The elements of $T_{p,q,r}$ of a given rank lie in the same orbit under the action of $\operatorname{Stab}(T_{p,q,r})$ .

Example 19 (Action of the stabilizer of $T_{2,2,2}$ ).

The stabilizer of $T_{2,2,2}$ is generated by the following elements of $\operatorname{GL}({K^{4}})\times\operatorname{GL}({K^{4}})$ :

[TABLE]

The vector space of $T_{2,2,2}$ is isomorphic to $\mathcal{M}_{2,2}\otimes I_{2}$ . Thus the elements of $T_{2,2,2}$ have rank [math], $2$ or $4$ .

Via the action of $\operatorname{Stab}(T_{2,2,2})$ , all the elements of rank $2$ can all be mapped to the element

[TABLE]

Similarly, via the action of $\operatorname{Stab}(T_{2,2,2})$ , all the elements of rank $4$ can all be mapped to the element

[TABLE]

5 Coverings of subspaces of bilinear forms

Our contribution consists in reducing the number of vector spaces $W$ that we need to enumerate in order to get those that satisfy $T\oplus W\in\mathscr{S}_{{r}}$ , where $T$ is the vector space representing a given bilinear map. To this effect, we restrict the enumeration to vector spaces $W$ satisfying some properties which are intrinsic to $T$ . In this section, the definition and theoretical aspects of the set of vector spaces satisfying these properties are treated, illustrated via the example of the short product and the matrix product. In Section 6, we deal with practical and computational aspects.

5.1 Theoretical aspect

Our strategy consists, first, for any $r\geq\ell$ , in constructing $g$ sets $\mathcal{E}_{i,r}$ for $i\in\left\{0,\ldots,g-1\right\}$ , that are all subsets of $\mathscr{S}_{{r-\ell+k_{i}}}$ , where $k_{i}$ is a nonnegative integer, and that satisfy some property described in Definition 20.

Definition 20 (Covering of a vector space).

Let $r$ be a nonnegative integer, and $\{k_{i}\}$ a set of nonnegative integers such that $k_{i}\leq\ell$ . Let $T$ be a subspace of $\mathcal{L}(K^{m},K^{n};K)$ of dimension $\ell$ . Let $\{\mathcal{E}_{i,r}\}_{0\leq i<g}$ be a set of subsets where $\mathcal{E}_{i,r}\subset\mathscr{S}_{{r-\ell+k_{i}}}$ , for all $i\in\left\{0,\ldots,g-1\right\}$ . Then, $(\mathcal{E}_{i,r})_{0\leq i<g}$ is said to be a covering of $T$ if and only if, for any vector space $W\in\mathscr{S}_{{r-\ell}}$ such that $T\oplus W\in\mathscr{S}_{{r}}$ , there exist an index $i\in\left\{0,\ldots,g-1\right\}$ , a subspace $V\in\mathcal{E}_{i,r}$ , and an $\operatorname{RP}$ -automorphism $\sigma\in\operatorname{Stab}(T)$ such that $T+(V\circ\sigma)=T\oplus W$ .

Proposition 21.

Given $T\subset\mathcal{L}(K^{m},K^{n};K)$ as above and a covering $(\mathcal{E}_{i,r})_{0\leq i<g}$ of ${T}$ , then, for any $r\geq\ell$ , we have

[TABLE]

Proof.

Let $V\in\mathscr{S}_{{r}}(T)$ . By Proposition 6, there exists $W\in\mathscr{S}_{{r-\ell}}$ such that $T\oplus W=U$ . Then, by Definition 20, there exist an index $i\in\left\{0,\ldots,g-1\right\}$ , a subspace $V\in\mathcal{E}_{i,r}$ , and an $\operatorname{RP}$ -automorphism $\sigma\in\operatorname{Stab}(T)$ such that $T+(V\circ\sigma)=T\oplus W$ . Taking $V^{\prime}=V\circ\sigma$ , we thus have $U=T+V^{\prime}$ and $V^{\prime}\in\mathcal{E}_{i,r}\circ\operatorname{Stab}(T)$ , which proves the inclusion. ∎

Thus, assuming that we have a method for computing the $\mathcal{E}_{i,r}$ ’s, we are able to cover the whole set $\mathscr{S}_{{r}}(T)$ . For example, the set composed of the single set $\mathcal{E}_{0,r}=\hbox{$ \mathscr{S}_{{r-\ell}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{Stab}(T) $}$ is a covering of ${T}$ and can be enumerated using BDEZStab. We describe below how we construct the $\mathcal{E}_{i,r}$ ’s that we use in practice.

Definition 22 (Stem of a vector space).

For a vector space $T$ , a set $\{F_{i}\}_{0\leq i<g}$ of $g$ subspaces $F_{i}\subset T$ of dimension $k_{i}$ is said to be a stem of $T$ if and only if, for any basis $\mathcal{B}$ of $T$ , there exist $i\in\left\{0,\ldots,g-1\right\}$ , an $\operatorname{RP}$ -automorphism $\sigma\in\operatorname{Stab}(T)$ and a free family $\mathcal{F}\subset\mathcal{B}$ of size $k_{i}$ such that

[TABLE]

Proposition 23.

For a vector space $T\subset\mathcal{L}(K^{m},K^{n};K)$ , a stem of $T$ given by $g$ subspaces $F_{i}\subset T$ , and $g$ subgroups $U_{i}\subset\operatorname{Stab}(T)\cap\operatorname{Stab}(F_{i})$ , the set $\{\mathcal{E}_{i,r}\}_{0\leq i<g}$ , where each $\mathcal{E}_{i,r}$ is a set of representatives of the quotient $\hbox{$ \mathscr{S}{{r-\ell+k{i}}}(F_{i}) $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ U_{i} $}$ , is a covering of $T$ .

Proof.

Let $W\in\mathscr{S}_{{r-\ell}}$ be such that $T\oplus W\in\mathscr{S}_{{r}}$ . Take a basis $\mathcal{W}$ of $W$ , and complete it into a basis of $T\oplus W$ using $\ell$ rank-one bilinear forms, denoted by $\{\psi_{i}\}_{0\leq i<\ell}$ . For all $i\in\left\{0,\ldots,\ell-1\right\}$ , write $\psi_{i}=t_{i}+w_{i}$ , with $t_{i}\in T$ and $w_{i}\in W$ .

The $t_{i}$ ’s are linearly independent. Otherwise, there would exist coefficients $(\lambda_{i})_{0\leq i<\ell}$ such that $\sum_{i=0}^{\ell-1}\lambda_{i}t_{i}=0$ , whence $\sum_{i=0}^{\ell-1}\lambda_{i}\psi_{i}=\sum_{i=0}^{\ell-1}\lambda_{i}w_{i}$ , which would then contradict the fact that $\{\psi_{i}\}_{0\leq i<\ell}$ completes $\mathcal{W}$ into a basis of $T\oplus W$ .

Consequently, $\mathcal{B}=\{t_{i}\}_{0\leq i<\ell}$ is a free family of $\ell$ vectors of $T$ and, as $\dim(T)=\ell$ , $\mathcal{B}$ is a basis of $T$ . Then, by Definition 22, there exist an index $i\in\left\{0,\ldots,g-1\right\}$ , a subset $\mathcal{F}\subset\mathcal{B}$ of size $k_{i}=\dim(F_{i})$ , and an $\operatorname{RP}$ -automorphism $\sigma\in\operatorname{Stab}(T)$ such that $\operatorname{Span}(\mathcal{F})\circ\sigma=F_{i}$ .

Let $V=W\oplus\operatorname{Span}(\mathcal{F})$ . Writing $\mathcal{F}=\{t_{i}\}_{i\in I}$ , with $I\subset\left\{0,\ldots,\ell-1\right\}$ , we define $\mathcal{F}^{\prime}=\{\psi_{i}\}_{i\in I}$ . Since $\psi_{i}=t_{i}+w_{i}$ and $\operatorname{Span}(\mathcal{F}^{\prime})\in\mathscr{S}_{{k_{i}}}$ , we have $V=W\oplus\operatorname{Span}(\mathcal{F})=W\oplus\operatorname{Span}(\mathcal{F}^{\prime})\in\mathscr{S}_{{r-\ell+k_{i}}}$ .

Now, consider $V^{\prime}=V\circ\sigma=(W\oplus\operatorname{Span}(\mathcal{F}))\circ\sigma$ : we also have $V^{\prime}\in\mathscr{S}_{{r-\ell+k_{i}}}$ , as $\operatorname{RP}$ -automorphisms preserve the bilinear rank, and $F_{i}=\operatorname{Span}(\mathcal{F})\circ\sigma\subset V^{\prime}$ , whence $V^{\prime}\in\mathscr{S}_{{r-\ell+k_{i}}}(F_{i})$ .

Finally, let $V^{\prime\prime}\in\mathcal{E}_{i,r}$ be a representative of the equivalence class of $V^{\prime}$ in the quotient set $\hbox{$ \mathscr{S}{{r-\ell+k{i}}}(F_{i}) $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ U_{i} $}$ : there exists an $\operatorname{RP}$ -automorphism $\gamma\in U_{i}$ such that $V^{\prime\prime}=V^{\prime}\circ\gamma$ . We then have

[TABLE]

where the last equality comes from the fact that $\operatorname{Span}(\mathcal{F})\subset T$ . Finally, as $\gamma^{-1}\circ\sigma^{-1}\in\operatorname{Stab}(T)$ , this proves the result. ∎

Given $T$ and a stem of $T$ , we can derive a new algorithm that computes $\mathscr{S}_{{r}}(T)$ via the computation of some intermediate sets $\mathcal{E}_{i,r}=\hbox{$ \mathscr{S}{{r-\ell+k{i}}}(F_{i}) $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ U_{i} $}$ for $i\in\left\{0,\ldots,g-1\right\}$ .

Example 24 (Two examples of stems).

For any vector space $T$ , let $\mathcal{B}$ be a basis of $T$ . There exists a subset of $\mathcal{B}$ generating $T$ (namely, $\mathcal{B}$ ): $\{T\}$ is a stem of $T$ . There exists also a subset of $\mathcal{B}$ generating $\operatorname{Span}(\emptyset)$ (namely, $\emptyset$ ): $\{\operatorname{Span}(\emptyset)\}$ is a stem of $T$ .

An enumeration algorithm that uses $\{T\}$ as a stem amounts to computing $\hbox{$ \mathscr{S}_{{r}}(T) $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{Stab}(T) $}$ . In this case, we did not decompose the original problem into simpler problems.

2.

If the stem chosen is the set $\{\operatorname{Span}(\emptyset)\}$ , this is equivalent to enumerate a set of representatives of the quotient $\hbox{$ \mathscr{S}_{{r-\ell}}(\operatorname{Span}(\emptyset)) $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{Stab}(T) $}$ . For this purpose, no better methods than BDEZStab is known.

Thus, BDEZStab can be seen as an approach derived from the stem $\{\operatorname{Span}(\emptyset)\}$ . We propose here other strategies that are derived from stems, given by sets of subspaces $F_{i}\subset T$ of dimension $k_{i}$ . The enumeration of a set $\mathscr{S}_{{r-\ell+k_{i}}}(F_{i})$ is interesting in practice if its cardinality is less than $\#\mathscr{S}_{{r-\ell}}$ . However, its cost depends also on the algorithms used for the computation of quotients and stabilizers and on how large $k_{i}$ is, which is detailed below.

No automatic method is known to determine, how to choose a stem for a given vector space $T$ : we have to provide a stem for each $T$ . This task has to be done by hand specifically for each bilinear map. We will actually do so in Section 5.2 and 5.3 for the examples of the short product and the matrix. To this end, the determination of the stabilizer, as done in Section 4, plays a key role.

In order to compute a set of the form $\hbox{$ \mathscr{S}{{r-\ell+k{i}}}(F_{i}) $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ U_{i} $}$ , we proceed in two steps. Let $\mathcal{F}_{i}$ be a basis of $F_{i}$ . Our strategy assumes that we have a finite representation of a group $U_{i}$ such that $U_{i}\subset\operatorname{Stab}(T)\cap\operatorname{Stab}(F_{i})$ . In Proposition 23, the larger the groups $U_{i}$ are, the smaller the $\mathcal{E}_{i,r}$ ’s are. And we prefer to keep the $\mathcal{E}_{i,r}$ ’s as small as possible, since it gives smaller sets to enumerate. Thus, this should lead us to choose $U_{i}=\operatorname{Stab}(T)\cap\operatorname{Stab}(F_{i})$ . However, in practice, the method used in our implementation is specialized to the choice $U_{i}=\operatorname{Stab}(T)\cap\operatorname{Stab}(\mathcal{F}_{i})\subset\operatorname{Stab}(T)\cap\operatorname{Stab}(F_{i})$ (we have $\operatorname{Stab}(\mathcal{F}_{i})\subset\operatorname{Stab}(F_{i})$ ) because only in this case do we have a practical algorithm to enumerate a set of representatives for the quotient $\hbox{$ \mathscr{S}{{r-\ell+k{i}}}(F_{i}) $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ U_{i} $}$ .

Notation 25.

For a free family $\mathcal{F}$ of $k$ bilinear forms and a positive integer $d$ , we let

[TABLE]

In order to enumerate sets of the form $\hbox{$ \mathscr{S}{{r-\ell+k{i}}}(F_{i}) $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{Stab}(T)\cap\operatorname{Stab}(\mathcal{F}_{i}) $}$ , we adopt a two-step strategy.

Remark 26.

This strategy requires the precomputation of a set of representatives of the quotient

[TABLE]

Section 6.3 describes how to compute such a set.

However, there is a pratical limit on their dimension $k_{i}$ , due to the precomputations that are used in our method and that constitute a bottleneck. Assuming that

[TABLE]

behaves as $(d!)^{1.1}$ over $\mathbb{F}_{2}$ (which is an empirical estimate), storing a set of representatives of

[TABLE]

for $d=13$ would require $15$ terabytes for instance. Consequently, given the largest “ $d$ ” for which we are able to compute in practice

[TABLE]

we have a practical constraint on how large the $r-\ell+k_{i}$ ’s may be: we should have $r-\ell+k_{i}\leq d$ for all $i$ .

Thus, we precompute the quotient $\hbox{$ \mathscr{S}{{r-\ell+k{i}}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}}) $}$ . The first step consists in computing $\tilde{\mathscr{S}}_{r-\ell+k_{i}}(\mathcal{F}_{i})$ and is detailed in Section 6.1. The second step applies the action of the left transversal

[TABLE]

which can be computed using the algorithms proposed in [12] for example.

We describe in Algorithm CoveringSetsMethod the global strategy to find optimal formulae for $T$ in the sense of the bilinear rank, that is, to enumerate $\mathscr{S}_{{r}}(T)$ given a stem. We assume that we are given a subspace $T$ and a set of $g$ free families $\mathcal{F}_{0},\ldots,\mathcal{F}_{g-1}$ of $T$ such that $\{\operatorname{Span}(\mathcal{F}_{i})\}_{i}$ forms a stem of $T$ .

Theorem 27.

Let $R$ be the rank of $T$ . For any positive integer $r$ , Algorithm CoveringSetsMethod proves either that

$r<R$ ** 2. 2.

or $R\leq r$ .

In the case where $R\leq r$ , any element of $\mathscr{S}_{{r}}(T)$ is included in the set returned by Algorithm CoveringSetsMethod.

The computation of the quotient $\mathcal{Q}$ on Line 5 is detailed in Section 6.1.

5.2 A stem for the short product

We use the same notations as in Section 4.1: we denote by $\Phi_{0},\ldots,\Phi_{\ell-1}$ the bilinear forms such that

[TABLE]

and by $T$ the subspace $\operatorname{Span}(\{\Phi_{0},\ldots,\Phi_{\ell-1}\})$ .

In order to produce a covering of the vector spaces $W$ satisfying $T\oplus W\in\mathscr{S}_{{r}}(T)$ that we compute with CoveringSetsMethod, we need a stem of $T$ . This stem is given in Proposition 28.

Proposition 28 (Stem for the short product).

For any $\ell\geq 2$ the singleton $\{\operatorname{Span}(\{\Phi_{0},\Phi_{1}\})\}$ is a stem of ${T}$ : for any basis $\mathcal{B}$ of $T$ , there exists $\sigma\in\operatorname{Stab}(T)$ and $\mathcal{F}\subset\mathcal{B}$ of cardinality $2$ such that

[TABLE]

Proof.

We first observe that for any $\Phi\in\operatorname{Span}(\{\Phi_{\ell-1-i},\ldots,\Phi_{\ell-1}\})$ , $\operatorname{rk}(\Phi)\leq i+1$ . Therefore, any element of rank $\ell$ in $T$ has a nonzero coordinate over $\Phi_{0}$ in its decomposition over the basis $(\Phi_{0},\ldots,\Phi_{\ell-1})$ and, reciprocally, any element having a nonzero coordinate over $\Phi_{0}$ has rank $\ell$ . Thus, a basis $\mathcal{B}$ of $T$ necessarily contains an element of rank $\ell$ denoted by $\Psi$ . The element $\Psi$ has a nonzero coordinate over $\Phi_{0}$ , when we decompose it over $\left\{\Phi_{0},\ldots,\Phi_{\ell-1}\right\}$ . Similarly, there exist $\Psi^{\prime}\in\mathcal{B}$ and $\lambda\in K$ for which $\Psi^{\prime}-\lambda\Psi$ has rank $\ell-1$ .

We then use Theorem 15 to find an element $\sigma\in\operatorname{Stab}(T)$ such that

[TABLE]

which concludes. ∎

We give in Table 1 the cardinality of coverings of $\mathscr{S}_{{r}}(T)$ given by Proposition 28.

In conclusion, we need to compute the following set: $\tilde{\mathscr{S}}_{r-\ell+2}(\{\Phi_{0},\Phi_{1}\})$ . We describe in Section 6 how we perform Line 5 of Algorithm CoveringSetsMethod. The set $\mathcal{L}$ on Line 6 of CoveringSetsMethod is, for the short product, a set containing one element, which is the identity element of $\operatorname{GL}({K^{\ell}})\times\operatorname{GL}({K^{\ell}})$ .

5.3 A stem for the matrix product $3\times 2$ by $2\times 3$ over $\mathbb{F}_{2}$

We focus here on the special case given by the bilinear map

[TABLE]

over $K=\mathbb{F}_{2}$ . The rank of this bilinear map is known to be $15$ [13]. However, all the optimal formulae are not known. We denote by $\Phi_{i,j}$ the bilinear forms such that $\Phi_{i,j}(A,B)$ is the coefficient $(i,j)$ of $\boldsymbol{\Phi}_{3,2,3}(A,B)$ for $i,j\in\left\{0,1,2\right\}$ . The elements $\Phi_{i,j}$ satisfy $\Phi_{i,j}(A,B)=a_{i,0}b_{0,j}+a_{i,1}b_{1,j}$ .

The target subspace of $\mathcal{L}(K^{6},K^{6};K)$ considered is denoted by

[TABLE]

The approach proposed in this section can be generalized to any matrix product (albeit at the expense of combinatorial blowup).

We use the stem of $T_{3,2,3}$ given by Proposition 29.

Proposition 29 (Covering of the matrix product).

The set

[TABLE]

is a covering of $T_{3,2,3}$ : for any basis $\mathcal{B}$ of $T_{3,2,3}$ , there exists $\mathcal{F}\subset\mathcal{B}$ and $\sigma\in\operatorname{Stab}(T_{3,2,3})$ such that

[TABLE]

Proof.

Let $\mathcal{B}$ be a basis of $T_{3,2,3}$ .

If there exists an element $\Phi$ of rank $6$ in $\mathcal{B}$ , then, according to Corollary 18, there exists $\sigma\in\operatorname{Stab}(T_{3,2,3})$ such that $\Phi_{0,0}+\Phi_{1,1}+\Phi_{2,2}\in\mathcal{B}\circ\sigma$ . Otherwise, any element $\Phi$ of $\mathcal{B}$ has rank smaller or equal to $4$ and we have to distinguish two cases.

2.

If there exists an element $\Phi$ of rank $4$ , there exists $\sigma$ such that $\Phi_{0,0}+\Phi_{1,1}\in\mathcal{B}\circ\sigma$ and, consequently, there exists another element $\Phi^{\prime}\in\mathcal{B}$ of rank $2$ or $4$ whose coordinate over $\Phi_{2,2}$ in the basis $(\Phi_{i,j})_{i,j}$ is nonzero: we need to look at the possible orbits in which $\Phi^{\prime}$ is included under the action of the subgroup of $\operatorname{Stab}(T_{3,2,3})$ preserving the fact that $\Phi$ is in the orbit of $\Phi_{0,0}+\Phi_{1,1}$ . We can prove that there exist $3$ such orbits and that there exists $\sigma\in\operatorname{Stab}(T_{3,2,3})$ and $\mathcal{F}\subset\mathcal{B}$ of cardinality $2$ such that

[TABLE]

3.

Otherwise, all the elements of $\mathcal{B}$ have rank $2$ and there exists $\mathcal{F}\subset\mathcal{B}$ and $\sigma\in\operatorname{Stab}(T_{3,2,3})$ such that

[TABLE]

∎

6 How to compute subspaces containing specific bilinear forms

We propose in this section a method for computing a covering of ${\mathscr{S}_{{r}}(T)}$ , where $T$ is a target space of dimension $\ell$ . The covering is a set of subspaces containing a specific set of bilinear forms described as in Section 5.2 or 5.3. More specifically, we are interested in computing sets defined as $\tilde{\mathscr{S}}_{r-\ell+k}(\{\Psi_{0},\ldots,\Psi_{k-1}\})$ , for $\Psi_{0},\ldots,\Psi_{k-1}$ bilinear forms of $\mathcal{L}(K^{m},K^{n};K)$ . Those can be described as sets of subspaces of rank $r-\ell+k$ containing a prescribed set $\left\{\Psi_{0},\ldots,\Psi_{k-1}\right\}$ of bilinear forms, up to the action of $\operatorname{Stab}(\{\Psi_{0},\ldots,\Psi_{k-1}\})$ .

6.1 General approach

First, our strategy consists in precomputing the quotient $\hbox{$ \mathscr{S}_{{m,n,r-\ell+k}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}}) $}$ . This quotient is smaller than $\mathscr{S}_{{m,n,r-\ell+k}}$ by construction. We explain how to compute it in Section 6.3.

Algorithm 5 explains how we compute the quotient $\mathcal{Q}$ in Algorithm CoveringSetsMethod.

Correctness of Algorithm 5.

By construction, according to Line 6, any element of $\mathcal{Q}$ is an element of

[TABLE]

First, we prove that any orbit of $\tilde{\mathscr{S}}_{r-\ell+k}(\{\Psi_{0},\ldots,\Psi_{k-1}\})$ has a representative in $\mathcal{Q}$ .

Let $W^{\prime}$ be a representative of an orbit in $\tilde{\mathscr{S}}_{r-\ell+k}(\{\Psi_{0},\ldots,\Psi_{k-1}\}).$ There exist $\sigma\in\operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}})$ and $W$ a representative of an element of $\hbox{$ \mathscr{S}_{{m,n,r-\ell+k}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}}) $}$ such that $W\circ\sigma=W^{\prime}$ . Thus, we have $\{\Psi_{0},\ldots,\Psi_{k-1}\}\circ\sigma^{-1}\subset W$ and the set

[TABLE]

satisfies the predicate on Line 6. Any $\sigma^{\prime}$ such that $\{\Psi_{0},\ldots,\Psi_{k-1}\}\circ\sigma^{-1}\circ\sigma^{\prime}=\{\Psi_{0},\ldots,\Psi_{k-1}\}$ satisfies

[TABLE]

which means that an element of $W\circ\sigma\circ\operatorname{Stab}(\{\Psi_{0},\ldots,\Psi_{k-1}\})=W^{\prime}\circ\operatorname{Stab}(\{\Psi_{0},\ldots,\Psi_{k-1}\})$ is included in the list returned by Algorithm 5. Thus, the list returned contains at least one representative per orbit of $\tilde{\mathscr{S}}_{r-\ell+k}(\{\Psi_{0},\ldots,\Psi_{k-1}\})$ .

2.

In the following, we prove that each orbit of $\tilde{\mathscr{S}}_{r-\ell+k}(\{\Psi_{0},\ldots,\Psi_{k-1}\})$ has a unique representative in $\mathcal{Q}$ .

Assume that there exist $W,W^{\prime}\in\mathcal{Q}$ and $\gamma\in\operatorname{Stab}(\{\Psi_{0},\ldots,\Psi_{k-1}\})$ such that $W=W^{\prime}\circ\gamma$ . By construction, there exists $W_{0},W^{\prime}_{0}\in\mathscr{S}_{{r-\ell+k}}$ and $\sigma,\sigma^{\prime}\in\operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}})$ such that $W=W_{0}\circ\sigma$ and $W^{\prime}=W^{\prime}_{0}\circ\sigma^{\prime}$ . Then $W^{\prime}_{0}=W_{0}\circ\sigma\circ\gamma^{-1}\circ\sigma^{\prime-1}$ , whence $W^{\prime}_{0}=W_{0}$ as on Line 4 of Algorithm 5 we enumerate only one representative of each orbit of $\hbox{$ \mathscr{S}_{{r-\ell+k}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}}) $}$ . Thus, $\sigma\circ\gamma^{-1}\circ\sigma^{\prime-1}\in\operatorname{Stab}(W_{0})$ .

Still by construction, there exists $\{\Phi_{0},\ldots,\Phi_{k-1}\}$ and $\{\Phi^{\prime}_{0},\ldots,\Phi^{\prime}_{k-1}\}\subset W_{0}$ such that

[TABLE]

and

[TABLE]

Then,

[TABLE]

and $\{\Phi_{0},\ldots,\Phi_{k-1}\}$ is in the same orbit as $\{\Phi^{\prime}_{0},\ldots,\Phi^{\prime}_{k-1}\}$ under the action of $\operatorname{Stab}(W_{0})$ , which is contradictory with the definition of the quotient on Line 5.

∎

Testing the predicate on Line 6 is a problem generalizing the problem of [7, Ch. 19] and [15]: given two pairs $(M_{0},M_{1})$ and $(N_{0},N_{1})$ of $(\mathcal{M}_{m,n})^{2}$ , determine whether there exists two invertible matrices $X$ and $Y$ such that $({X}^{\mathrm{T}}M_{0}Y,{X}^{\mathrm{T}}M_{1}Y)=(N_{0},N_{1})$ , which is done by computing a Weierstrass–Kronecker canonical form for $(M_{0},M_{1})$ . When we consider more than two matrices, for example three matrices $(M_{0},M_{1},M_{2})$ mapped to $(N_{0},N_{1},N_{2})$ , we compute $(X,Y)$ such that $(M_{0},M_{1})$ is mapped to $(N_{0},N_{1})$ and we compose it with elements of $\hbox{$ \operatorname{Stab}{M_{0}}\cap\operatorname{Stab}{M_{1}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{Stab}{M_{2}} $}$ , computed with the algorithms proposed in [12] for example. The complexity for finding all the $\operatorname{RP}$ -automorphisms $\sigma$ in IntermediateSetViaQuotientComputation is bounded by the cardinality of $\mathscr{S}_{{r-\ell+k}}$ (which is comparable to BDEZ) by construction, and is hard to estimate more precisely. In our applications, it appears to be negligible compared to BDEZ.

6.2 Application to the short product

We come back to the example given in Section 5.2 corresponding to the short product. We recall that $T$ is the subspace obtained from the bilinear map given by the short product modulo $\ell$ and that we need to compute the set $\mathcal{Q}=\tilde{\mathscr{S}}_{r-\ell+2}(\{\Phi_{0},\Phi_{1}\})$ for a given integer $r$ .

If we take $\ell=3$ , we can represent $\Phi_{0}$ and $\Phi_{1}$ by the matrices

[TABLE]

Thus, for a given couple $(M_{0},M_{1})$ of matrices representing bilinear forms of a subspace $W\in\hbox{$ \mathscr{S}_{{r-\ell+2}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{GL}({K^{\ell}})\times\operatorname{GL}({K^{\ell}}) $}$ , we are looking for invertible matrices $X$ and $Y$ such that

[TABLE]

which is done in Algorithm 6. As it is precised on Line 6 of Algorithm 6, we find $X$ and $Y$ such that ${X}^{\mathrm{T}}M_{0}Y=I$ via Gauss reduction. Then, we need to check whether ${X}^{\mathrm{T}}M_{1}Y$ and $N$ are similar or not ( $({X}^{\mathrm{T}}M_{1}Y)^{\ell}$ should be the null matrix for this purpose), as done on Line 9 of Algorithm 6.

Once we have computed $\mathcal{Q}$ , it remains to compute the left transversal

[TABLE]

and to compute $\mathcal{Q}\circ\mathcal{L}$ . According to Theorem 15, we have $\#\mathcal{L}=1$ , which means that Algorithm 6 actually returns $\tilde{\mathscr{S}}_{r-\ell+2}(\{I,N\})\circ\mathcal{L}$ .

In terms of complexity, we do not have explicit bounds. However, we can state that the complexity depends linearly on $\#\hbox{$ \mathscr{S}{{r-\ell+2}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{GL}({K^{\ell}})\times\operatorname{GL}({K^{\ell}}) $}$ and on the number of pairs of bilinear forms $(\Phi,\Psi)$ per element of $\hbox{$ \mathscr{S}{{r-\ell+2}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{GL}({K^{\ell}})\times\operatorname{GL}({K^{\ell}}) $}$ such that $\operatorname{rk}(\Phi)=\ell$ and $\operatorname{rk}(\Psi)=\ell-1$ .

6.3 Computing the orbits of vector spaces of bilinear forms

In this section, we propose an approach for computing the set $\hbox{$ \mathscr{S}_{{m,n,d}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}}) $}$ , required by the algorithm described in Section 6.1. Its cost is at least exponential in $d$ , $m$ and $n$ and difficult to estimate.

Notation 30.

We denote by ${\Omega}_{d}$ the quotient $\hbox{$ \mathscr{S}_{{d,d,d}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{GL}({K^{d}})\times\operatorname{GL}({K^{d}}) $}$ for any $d\geq 1$ .

First, we describe how we represent elements of $\mathscr{S}_{{m,n,d}}$ and we prove that given the knowledge of $\Omega_{d}$ we can deduce the elements of $\hbox{$ \mathscr{S}_{{m,n,d}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{GL}({K^{m}})\times\operatorname{GL}({K^{n}}) $}$ for any $m$ and $n$ from this precomputation.

Let $W$ be an element of $\mathscr{S}_{{m,n,d}}$ . There exist $d$ rank-one bilinear forms $\phi_{t}:(\mathbf{a},\mathbf{b})\mapsto\alpha_{t}(\mathbf{a})\cdot\beta_{t}(\mathbf{b})$ such that $W=\operatorname{Span}\left(\{\phi_{i}\}_{i\in\left\{0,\ldots,d-1\right\}}\right)$ . In the canonical basis of $K^{m}$ and $K^{n}$ , we represent $\alpha_{t}$ and $\beta_{t}$ as matrices of $\mathcal{M}_{1,m}$ and $\mathcal{M}_{1,n}$ . Thus, there exist two matrices $U\in\mathcal{M}_{d,m}$ and $V\in\mathcal{M}_{d,n}$ , whose rows are given by the linear forms $\alpha_{t}$ and $\beta_{t}$ respectively, and $W$ can be represented by the pair $(U,V)$ . Such a representation is not unique (for example, any permutation of the rows of $(U,V)$ gives a valid representation). In particular, for a pair of matrices $(U,V)$ representing some vector space $W$ , there exists $\sigma=\mu\times\nu$ in $\operatorname{GL}(K^{m})\times\operatorname{GL}(K^{n})$ such that the pair of matrices $U^{\prime},V^{\prime}$ , such that $(U^{\prime},V^{\prime})=(U\circ\mu,V\circ\nu)$ represents $W\circ\sigma$ , are the reduced column echelon form of the matrices $U$ and $V$ , respectively.

Example 31.

Let us consider the vector space $W$ of $\mathscr{S}_{{3,4,6}}$ generated by the rank-one bilinear forms represented by

[TABLE]

The pair of matrices $(U,V)$ associated to $W$ is

[TABLE]

Assuming that we have a representation of the elements of $\hbox{$ \mathscr{S}_{{d,d,d}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{GL}({K^{d}})\times\operatorname{GL}({K^{d}}) $}$ in terms of pairs of matrices $(U,V)\in\mathcal{M}_{d,d}\times\mathcal{M}_{d,d}$ in reduced column echelon form, we obtain all the elements of

[TABLE]

by considering the subset $\Omega^{\prime}_{d}$ of $\Omega_{d}=\hbox{$ \mathscr{S}_{{d,d,d}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{GL}({K^{d}})\times\operatorname{GL}({K^{d}}) $}$ of elements represented by matrices $(U,V)$ in reduced column echelon form such that $\operatorname{rk}(U)\leq\min(m,d)$ and $\operatorname{rk}(V)\leq\min(n,d)$ . Given $m$ and $n$ , a set of representatives for

[TABLE]

can be seen as matrices $(U^{\prime},V^{\prime})\in\mathcal{M}_{d,m}\times\mathcal{M}_{d,n}$ in reduced column echelon form and for which there exists matrices $(U,V)\in\mathcal{M}_{d,d}^{2}$ , representing an element of $\Omega^{\prime}_{d}$ , obtained by adding $d-m$ an $d-n$ zero columns to $U^{\prime}$ and $V^{\prime}$ , respectively, or by removing zero columns if $d<m$ or $d<n$ .

Our strategy consists in deducing $\Omega_{d}$ from the computation of $\Omega_{d-1}$ . Algorithm 7 describes this strategy: for each vector space $W$ of $\Omega_{d-1}$ , we extend it to a vector space of $\mathcal{L}(K^{d},K^{d};K)$ by padding with zeros, and we consider the vector spaces $W\oplus\operatorname{Span}(\{\phi\})$ that can be obtained by adding an element $\phi$ of rank one. We remove from the set of $W\oplus\operatorname{Span}(\{\phi\})$ the vector spaces that are isomorphic via an isomorphism test. We determine whether two vector spaces $W^{\prime}$ and $W$ are isomorphic if there exists a basis of $W^{\prime}$ of rank-one bilinear forms such that the corresponding couple of matrices $(U^{\prime},V^{\prime})$ in reduced column echelon form is equal to $(U,V)$ . The complexity of this approach depends on the number of bases of rank-one bilinear forms of $W$ , which, compared to $d$ , is not large generically. However, there are degenerate cases for which the nomber of bases is very large (exponential in $d^{2}$ ). These cases require specific code to recognize them and to treat them separately.

The naive algorithm which checks for each pair of elements of the set $\mathcal{L}$ whether or not they are isomorphic, computed in Line 11 of Algorithm 7, can be improved. Indeed, we propose to compute invariants for the group action induced by $\operatorname{GL}({K^{d}})\times\operatorname{GL}({K^{d}})$ and to compare subspaces having the same invariants. For example, for $W\in\mathscr{S}_{{d,d,d}}$ , we consider the polynomial $P_{W}=\sum_{0\leq t\leq d}p_{t}(W)X^{t}$ such that

[TABLE]

Therefore, for any $\sigma\in\operatorname{GL}({K^{d}})\times\operatorname{GL}({K^{d}})$ , $P_{W\circ\sigma}=P_{W}$ .

We have been able to compute $\Omega_{d}$ for $d\in\left\{1,\ldots,8\right\}$ and $K=\mathbb{F}_{2}$ with an implementation in Magma V2.21-3 [5]111The code of this implementation can be found at the address http://karancode.gforge.inria.fr. The timings are described in Table 2.

It would be interesting to obtain an upper bound on $\#\Omega_{d}$ with the good order of magnitude. Indeed, we are able to say for instance that $\#\Omega_{d}$ is bounded by the quantity

[TABLE]

corresponding to the number of possible rank-one bilinear forms that we add to elements of $\Omega_{d-1}$ to obtain an element of $\Omega_{d}$ . This formula leads recursively to the following bound:

[TABLE]

However, this upper bound differs by a huge factor from the true cardinality of $\Omega_{d}$ and cannot consequently be used in a complexity analysis.

To conclude, we show in Figure 3 how the subspaces of $\Omega_{3}$ over $\mathbb{F}_{2}$ are related to $\Omega_{2}$ and $\Omega_{1}$ by using its partially ordered set structure. Each element of $\Omega_{d}$ is represented by the corresponding couple of matrices $(U,V)$ of $\mathcal{M}_{d,d}^{2}$ .

7 Experimental results

An implementation in Magma V2.21-3 [5] of the algorithms presented in the previous sections has been done1. We compare in this section the timings obtained from various instances of the bilinear rank problem for these different algorithms. Our Magma implementation of the algorithm described in [1] is clearly slower than the original C version. However, since we are interested in the speed-up obtained from our work, we need a fair approach. We show in particular that Algorithm BDEZStab, although it is neither multithreaded nor written in C, improves considerably on the timings estimated in [1]. The new algorithm proposed in the current article is denoted by CoveringSetsMethod: compared to Algorithm BDEZStab, it constitutes a huge speed-up on particular instances of the bilinear rank problem among which the matrix product, discussed in Sections 7.2 and 7.3, and the short product, discussed in Section 7.4. All the timings presented in this section have been done on a single core of a 3.3 GHz Intel Core i5-4590 processor.

7.1 Recursive approach

We need a few notations to denote the various bilinear maps we are interested in:

$\textsf{MatProduct}_{(p,q,r)}$ denotes the product of matrices $p\times q$ by $q\times r$ ,

2.

$\textsf{ShortProduct}_{\ell}$ denotes the product of polynomials modulo $X^{\ell}$ ,

3.

$\textsf{CirculantProduct}_{\ell}$ denotes the product of polynomials modulo $X^{\ell}-1$ .

We give in Table 3 timings for various bilinear maps and for the implementations of BDEZ and BDEZStab. The number of tests represents the number of calls to HasRankOneBasis.

It is possible to estimate the time it would take to obtain a result for a bilinear rank problem out of reach for BDEZ or BDEZStab. We denote by $\mathcal{N}_{t}$ the number of calls to HasRankOneBasis in these algorithms when the input $r$ is equal to $\ell+t$ . ( $\ell$ is the dimension of the vector space $T$ corresponding to the bilinear map). Since when $r$ is too large, BDEZ is too expensive, there is a practical limit on the known values of $\mathcal{N}_{t}$ , $t$ being a positive integer. We consider the ratio $\lceil\frac{\mathcal{N}_{t}}{\mathcal{N}_{t-1}}\rceil$ to estimate $\mathcal{N}_{t+1}$ . Assuming that this ratio decreases with $t$ , which seems to hold empirically, we have

[TABLE]

$t$ being a positive integer of $\left\{1,\ldots,r-\ell\right\}$ .

Thus, we are able to predict timings for bilinear maps indicated in Table 3 via to this assumption, which allows us to compare Algorithm BDEZ to other approaches for problems of larger sizes. We estimate the number of tests by computing

[TABLE]

where $r-\ell$ is the difference $\operatorname{rk}(T)-\dim(T)$ for $T$ representing a bilinear map and $t$ is the largest integer for which we are able to compute $\mathcal{N}_{t}$ . The time can be estimated with a similar technique. We observe that the speed-up seems to match with $\#\operatorname{Stab}(T)$ , as expected. The estimated values in Table 3 relying on BDEZStab have not been effectively done because the implementation of CoveringSetsMethod allowed us to obtain more results, more efficiently. The estimations rely on the heuristic given by the Inequality 1. In the global strategy, we increase progressively the lower bound $r$ on the rank, before running BDEZ, BDEZStab or CoveringSetsMethod. For $r<\operatorname{rk}(T)$ , the time spent in those algorithms is negligible, because of the exponential growth of their complexity.

It is not clear how to estimate timings for our approach CoveringSetsMethod beyond what has been done and reported in Table 3. However, for the set of bilinear maps for which CoveringSetsMethod allows one to compute all the optimal formulae, we observe a clear speed-up compared to BDEZStab.

In order to compute bilinear maps of larger degrees using this method, we need to be able to compute and store all the elements of

[TABLE]

for $\textsf{ShortProduct}_{6}$ (and even more for other bilinear maps), which has not been done yet and requires a specific effort for an optimized implementation of the algorithm described in Section 6.3. Moreover, being able to decompose a matrix product of larger dimensions, such as $3\times 3$ by $3\times 3$ , requires to improve on the theoretical aspect of our strategy, since the size of the required set

[TABLE]

is expected to be too large, based on the apparent exponential growth of the progression of the sets described in Table 2.

In the following, we describe how we computed optimal formulae for bilinear maps given in Table 3 via our approach using the stems. We provide some technical details, specific to each bilinear map, necessary for an implementation.

7.2 Matrix product $3\times 2$ by $2\times 3$

We give in this section the timings obtained with our approach for computing the bilinear rank of the matrix product $(3,2,3)$ over $\mathbb{F}_{2}$ . We use the same notations as in Section 5.3. We recall that we denote by $\boldsymbol{\Phi}_{3,2,3}$ the bilinear map

[TABLE]

We denote by $\Phi_{i,j}$ the bilinear forms such that $\Phi_{i,j}(A,B)$ is the coefficient $(i,j)$ of $\boldsymbol{\Phi}_{3,2,3}(A,B)$ . The subspace $T_{3,2,3}$ is defined by

[TABLE]

As described in Section 6.1, we need to precompute the quotients

[TABLE]

for $k\in\left\{1,2,3\right\}$ , and, given the stem that is used, we can restrict the enumeration to subspaces containing at least one element of rank $6$ . The techniques for computing theses subsets are described in Section 6.3.

The intermediate sets, corresponding to the quotient $\mathcal{Q}$ computed using IntermediateSetViaQuotientComputation in Section 6, were computed in $1.6\cdot 10^{5}$ seconds. They are defined as the following sets: $\tilde{\mathcal{E}}_{0}=\tilde{\mathscr{S}}_{7}(\{\Phi_{0,0}+\Phi_{1,1}+\Phi_{2,2}\})$ , $\tilde{\mathcal{E}}_{1}=\tilde{\mathscr{S}}_{8}(\{\Phi_{0,0}+\Phi_{1,1},\Phi_{0,1}+\Phi_{2,2}\})$ , $\tilde{\mathcal{E}}_{2}=\tilde{\mathscr{S}}_{8}(\{\Phi_{0,0}+\Phi_{1,1},\Phi_{1,1}+\Phi_{2,2}\})$ , $\tilde{\mathcal{E}}_{3}=\tilde{\mathscr{S}}_{8}(\{\Phi_{0,0}+\Phi_{1,1},\Phi_{2,2}\})$ , $\tilde{\mathcal{E}}_{4}=\tilde{\mathscr{S}}_{9}(\{\Phi_{0,0},\Phi_{1,1},\Phi_{2,2}\})$ . For the set $\tilde{\mathcal{E}}_{4}$ , we actually used an additional trick, described in B, which allowed us to consider only a much smaller subset $\tilde{\mathcal{E}}^{\prime}_{4}$ .

We give in Table 4 the time required to compute the second step of Section 6.1, which corresponds to $\mathcal{Q}\circ\mathcal{L}$ calls to HasRankOneBasis.

In conclusion, we are able to decompose $\boldsymbol{\Phi}_{3,2,3}$ over $\mathbb{F}_{2}$ and to give all the possible optimal decompositions. We have a speed-up of $10^{4}$ compared to our implementation of Algorithm BDEZStab. Although the rank of this bilinear map was already known thanks to Hopcroft and Kerr [13], determining all the possible optimal decompositions was not a well studied problem to our knowledge.

We prove with our algorithm that there is only one class of equivalence of vector spaces $W\in\mathscr{S}_{{6,6,15}}$ containing $T_{3,2,3}$ , for the group action induced by $\operatorname{Stab}(T_{3,2,3})$ . It is interesting to note that this is also the case for $T_{2,2,2}$ . We do not have this kind of result for the short product for example.

7.3 Matrix product $2\times 3$ by $3\times 2$

We denote by $\boldsymbol{\Phi}_{2,3,2}$ the bilinear map

[TABLE]

and $\Phi_{i,j}$ its coefficients.

We compute the following sets, corresponding to the quotient $\mathcal{Q}$ computed with IntermediateSetViaQuotientComputation in Section 6, within $1.5\cdot 10^{6}$ seconds:

$\tilde{\mathcal{E}}_{0}=\tilde{\mathscr{S}}_{9}(\{\Phi_{0,0}+\Phi_{1,1},\Phi_{0,0}+\Phi_{0,1}+\Phi_{1,0}\})$ ,

2.

$\tilde{\mathcal{E}}_{1}=\tilde{\mathscr{S}}_{9}(\{\Phi_{0,0}+\Phi_{1,1},\Phi_{0,0}\})$ .

We used, in particular, the fact that for any basis $\mathcal{B}$ of $T_{2,3,2}$ , there exist two elements $\Phi$ and $\Psi$ of $\mathcal{B}$ such that there exists an element in $\operatorname{Span}(\{\Phi,\Psi\})$ whose decomposition over $(\Phi_{0,0},\Phi_{1,1},\Phi_{0,1},\Phi_{1,0})$ has the following shape:

[TABLE]

The timings for the second step of the method proposed in Section 6 are described in Table 5.

We obtained a speed-up of $10^{9}$ compared to our implementation of BDEZStab, and we found $1{,}096{,}452$ elements of $\mathscr{S}_{{11}}(T_{2,3,2})$ , divided in $196$ equivalence classes of solutions with respect to the action of $\operatorname{Stab}(T_{2,3,2})$ . The computations described in Table 5 used an improved basic test HasRankOneBasis specialized for $T_{2,3,2}$ . This test uses the fact that, given a subspace $W$ of $\tilde{\mathcal{E}}_{0}$ or $\tilde{\mathcal{E}}_{1}$ , we have two elements $t_{0}$ and $t_{1}$ in $T_{2,3,2}$ such that there exist $w_{0},w_{1}\in W$ such that $t_{0}-w_{0}$ and $t_{1}-w_{1}$ have rank one. We enumerate the elements $w\in W$ such that the rank of $t_{0}-w$ or $t_{1}-w$ is one, instead of enumerating the whole set of rank-one bilinear forms.

7.4 Short product

We present in this section the timings obtained with our method for the decomposition of the short product. We managed to obtain all the elements of $\mathscr{S}_{{r}}(T)$ , where $T$ is the vector space generated by the bilinear forms associated to $\textsf{ShortProduct}_{\ell}$ for $\ell=4$ and $\ell=5$ and $r=\operatorname{rk}(T)$ .

The last column of Table 6 describes the number of equivalence classes of vector spaces in $\mathscr{S}_{{r}}(T)$ , with respect to the group $\operatorname{Stab}(T)$ .

7.5 Circulant product

We present in this section how to find, with our approach, optimal decompositions of the polynomial product modulo $(X^{5}-1)$ . We denote by $T$ the target space spanned by the coefficients $\Phi_{i}$ of the bilinear map

[TABLE]

The structure of $T$ allows us to gain an interesting speed-up. Indeed, $T$ has the following structure: there exists, up to a constant mulitplicative factor, a unique element of rank one $\phi=\Phi_{0}+\Phi_{1}+\Phi_{2}+\Phi_{3}+\Phi_{4}$ and a hyperplane $H$ such that $H$ contains all the elements of rank $4$ and such that all the elements of rank $5$ are included in $\operatorname{Span}(\{\phi\})\oplus H$ . Moreover, the action of $\operatorname{Stab}(T)$ on $H-\{0\}$ is transitive (proved by an exhaustive enumeration in $\mathbb{F}_{2}$ ), which means that all the elements of rank $4$ are in the same orbit. Consequently, it is also transitive on $\operatorname{Span}(\{\phi\})\oplus H$ and all the elements of rank $5$ are in the same orbit.

Let $\mathcal{B}=\{\Phi_{0},\ldots,\Phi_{4}\}$ be a basis of $T$ . We distinguish then $2$ cases: either there exists $i$ such that $\Phi_{i}$ has rank $5$ , or there is no such $i$ , which implies that $\phi\in\mathcal{B}$ . We deduce from these observations the following sets to compute:

$\tilde{\mathcal{E}}_{0}=\tilde{\mathscr{S}}_{6}(\{\Phi_{4}\})$ and

2.

$\mathcal{E}_{1}=\hbox{$ \mathscr{S}_{{9}}(\operatorname{Span}(H)) $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{Stab}(H) $}$ (any element $V\in\mathcal{E}_{1}$ satisfies $T\subset V+\operatorname{Span}(\{\phi\})\in\mathscr{S}_{{10}}$ ).

We obtain the set $\mathcal{E}_{1}$ via the computation of a covering of $\mathcal{E}_{1}$ obtained with

[TABLE]

We have in Table 7 the timings for the second step of the procedure described in Section 6.1. The set $\mathscr{S}_{{10}}(T)$ contains $2025$ elements divided in $9$ equivalence classes of solutions. Interestingly, the set $\tilde{\mathcal{E}}_{0}$ does not correspond to any element of $\mathscr{S}_{{10}}(T)$ . It means that, for a basis $\mathcal{B}$ of bilinear forms of rank one containing $\phi$ and generating a subspace of $\mathscr{S}_{{10}}(T)$ , the coordinate of the elements of rank $4$ on $\phi$ is zero.

8 Conclusions

One of the most challenging problems in the field of bilinear complexity is the decomposition of the bilinear map given by the product of $3\times 3$ matrices. Currently, our approach cannot be used to tackle this problem. However, we believe that it could be approached by further research in the direction of the Hamming weight idea developed in B. An important obstacle is the fact that, assuming that the rank is $21$ , it would require to compute $\hbox{$ \mathscr{S}_{{15}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{GL}({K^{9}})\times\operatorname{GL}({K^{9}}) $}$ , which is very large.

Another aspect which is not well understood currently for our approach is to establish a realistic complexity analysis. It requires a theoretical understanding of how the cardinality of the quotients $\hbox{$ \mathscr{S}_{{d}} $}\kern 0.0pt/\kern-1.00006pt\lower 1.50696pt\hbox{$ \operatorname{GL}({K^{d}})\times\operatorname{GL}({K^{d}}) $}$ behave asymptotically and a classification of their representatives.

Further research could focus on symmetric decompositions of bilinear maps, which have applications for the multiplication of polynomials over “small” finite fields (such as $\mathbb{F}_{2}$ ). Especially, we can improve on the upper bounds on the rank of the product of two polynomials of fixed degrees by improving on the bilinear complexity of the multiplication algorithms used in the Chudnovsky-Chudnovsky approach [8, 23, 22].

Finally, the approach proposed in this work allows one to compute exhaustively the optimal formulae for new bilinear maps, which was not feasible with [1]. Moreover, it uses combinatorial objects which are not well documented in the litterature, which may rekindle curiosity for them.

Acknowledgements

The author is grateful to Jérémie Detrey and Emmanuel Thomé for their helpful comments and suggestions.

References

[1]

R. Barbulescu, J. Detrey, N. Estibals, and P. Zimmermann.

Finding optimal formulae for bilinear maps.

Arithmetic of finite fields: 4th International Workshop, WAIFI 2012, Bochum, Germany, July 16-19, 2012. Proceedings, pages 168–186, 2012.

doi:10.1007/978-3-642-31662-3_12.

[2]

R. Barbulescu, J. Detrey, N. Estibals, and P. Zimmermann.

Finding optimal formulae for bilinear maps.

AriC Seminar, Mar. 2012.

URL: https://hal.inria.fr/hal-01413162.

[3]

A. Bernardi, J. Brachat, P. Comon, and B. Mourrain.

General tensor decomposition, moment matrices and applications.

Journal of Symbolic Computation, 52:51–71, 2013.

International Symposium on Symbolic and Algebraic Computation.

doi:10.1016/j.jsc.2012.05.012.

[4]

M. Bläser.

On the complexity of the multiplication of matrices of small formats.

Journal of Complexity, 19(1):43–60, 2003.

doi:10.1016/S0885-064X(02)00007-9.

[5]

W. Bosma, J. Cannon, and C. Playoust.

The Magma algebra system. I. The user language.

J. Symbolic Comput., 24(3-4):235–265, 1997.

Computational algebra and number theory (London, 1993).

doi:10.1006/jsco.1996.0125.

[6]

R. W. Brockett and D. Dobkin.

On the optimal evaluation of a set of bilinear forms.

Linear Algebra and its Applications, 19(3):207–235, 1978.

doi:10.1016/0024-3795(78)90012-5.

[7]

P. Bürgisser, M. Clausen, and M. A. Shokrollahi.

Algebraic Complexity Theory.

Springer, 1st edition, 2010.

[8]

D. Chudnovsky and G. Chudnovsky.

Algebraic complexities and algebraic curves over finite fields.

Journal of Complexity, 4(4):285–316, 1988.

doi:10.1016/0885-064X(88)90012-X.

[9]

D. Coppersmith and S. Winograd.

Computational algebraic complexity editorial matrix multiplication via arithmetic progressions.

Journal of Symbolic Computation, 9(3):251–280, 1990.

doi:10.1016/S0747-7171(08)80013-2.

[10]

H. F. de Groote.

Lectures on the Complexity of Bilinear Problems.

Springer-Verlag, 1987.

[11]

D. Harvey, J. van der Hoeven, and G. Lecerf.

Even faster integer multiplication.

Technical report, ArXiv, 2014.

arXiv:1407.3360.

[12]

D. F. Holt, B. Eick, and E. A. O’Brien.

Handbook of computational group theory.

Discrete mathematics and its applications. Chapman & Hall/CRC, Boca Raton, 2005.

URL: http://opac.inria.fr/record=b1102239.

[13]

J. E. Hopcroft and L. R. Kerr.

On minimizing the number of multiplications necessary for matrix multiplication.

SIAM Journal on Applied Mathematics, 20(1):30–36, 1971.

doi:10.1137/0120004.

[14]

J. Håstad.

Tensor rank is NP-complete.

Journal of Algorithms, 11(4):644–654, 1990.

doi:10.1016/0196-6774(90)90014-6.

[15]

J. JáJá.

Optimal evaluation of pairs of bilinear forms.

SIAM Journal on Computing, 8(3):443–462, 1979.

doi:10.1137/0208037.

[16]

A. Karatsuba and Y. Ofman.

Multiplication of multidigit numbers on automata.

Soviet Physics-Doklady, 7:595–596, 1963.

(English translation).

[17]

J. D. Laderman.

A noncommutative algorithm for multiplying $3\times 3$ matrices using $23$ multiplications.

Bull. Amer. Math. Soc., 82(1):126–128, 1976.

[18]

F. Le Gall.

Powers of tensors and fast matrix multiplication.

In Proceedings of the 39th International Symposium on Symbolic and Algebraic Computation, ISSAC ’14, pages 296–303. ACM, 2014.

doi:10.1145/2608628.2608664.

[19]

P. Montgomery.

Five, six, and seven-term Karatsuba-like formulae.

IEEE Transactions on Computers, 54(3):362–369, 2005.

doi:10.1109/TC.2005.49.

[20]

I. Oseledets.

Optimal Karatsuba-like formulae for certain bilinear forms in GF(2).

Linear Algebra and its Applications, 429(8–9):2052–2066, 2008.

doi:10.1016/j.laa.2008.06.004.

[21]

V. Y. Pan.

Strassen’s algorithm is not optimal trilinear technique of aggregating, uniting and canceling for constructing fast algorithms for matrix operations.

In Proceedings of the 19th Annual Symposium on Foundations of Computer Science, SFCS ’78, pages 166–176, Washington, DC, USA, 1978. IEEE Computer Society.

URL: http://dx.doi.org/10.1109/SFCS.1978.34, doi:10.1109/SFCS.1978.34.

[22]

M. Rambaud.

Finding optimal Chudnovsky-Chudnovsky multiplication algorithms.

Arithmetic of Finite Fields: 5th International Workshop, WAIFI 2014, Gebze, Turkey, September 27-28, 2014. Revised Selected Papers, pages 45–60, 2015.

doi:10.1007/978-3-319-16277-5_3.

[23]

H. Randriambololona.

Bilinear complexity of algebras and the Chudnovsky–Chudnovsky interpolation method.

Journal of Complexity, 28(4):489–517, 2012.

doi:10.1016/j.jco.2012.02.005.

[24]

A. Schönhage and V. Strassen.

Schnelle Multiplikation großer Zahlen.

Computing, 7(3-4):281–292, 1971.

doi:10.1007/BF02242355.

[25]

A. V. Smirnov.

The bilinear complexity and practical algorithms for matrix multiplication.

Computational Mathematics and Mathematical Physics, 53(12):1781–1795, 2013.

doi:10.1134/S0965542513120129.

[26]

V. Strassen.

Gaussian elimination is not optimal.

Numerische Mathematik, 13(4):354–356, 1969.

doi:10.1007/BF02165411.

[27]

A. L. Toom.

The complexity of a scheme of functional elements realizing the multiplication of integers.

Soviet Mathematics Doklady, 3:714–716, 1963.

(English translation).

Appendix A Computation of stabilizers

A.1 Stabilizer of the short product

In this section, we prove Theorem 15, using the notations of Section 4.1: the bilinear map

[TABLE]

is the bilinear map corresponding to the short product modulo $X^{\ell}$ , $\Phi_{0},\ldots,\Phi_{\ell-1}$ are the bilinear forms such that

[TABLE]

and $T$ is the subspace $\operatorname{Span}(\{\Phi_{0},\ldots,\Phi_{\ell-1}\})$ .

We recall that $T$ is represented by the ring $K[N]$ of polynomials of degree less than or equal to $\ell-1$ evaluated in the matrix $N$ , which is a nilpotent matrix. For example, for $\ell=4$ ,

[TABLE]

We observe that the bilinear forms of rank exactly $\ell$ within $T$ are described by the matrices that can be expressed as $P(N)$ where $P$ is a polynomial of degree smaller or equal to $\ell-1$ over $K$ such that $P(0)\neq 0$ . See 15

Proof.

First, we prove that, for any element $M\in T$ of rank $\ell$ , there exists $R\in K[N]$ a polynomial of degree at most $\ell-1$ such that $R(0)\neq 0$ and $R(N)=M$ , and that

[TABLE]

Any element in the orbit of $I_{\ell}$ has rank $\ell$ and any element of rank $\ell$ in $T$ is associated to a polynomial $R\in K[N]$ evaluated in $N$ of degree $\ell-1$ such that $R(0)\neq 0$ . It remains to prove that the orbit of $I_{\ell}$ corresponds exactly to the set of rank- $\ell$ elements. Given $R\in K[N]$ such that $R(0)\neq 0$ , we denote by $M_{1}(R)$ the element $(I_{\ell},R(N))$ . This element is in $\operatorname{Stab}(T)$ because, for any $S$ , we have $R(N)S(N)=(RS\bmod X^{\ell})(N)$ , which is a polynomial evaluated in $N$ of degree at most $\ell-1$ . We have:

[TABLE]

2.

We prove that, for any element $M\in T$ of rank $\ell-1$ , there exists $R\in K[N]$ a polynomial of degree at most $\ell-1$ such that $R(0)=0$ , $R^{\prime}(0)\neq 0$ and $R(N)=M$ , and that

[TABLE]

An element of the orbit of $N$ is an element of rank $\ell-1$ and an element of rank $\ell-1$ in $T$ is associated to a polynomial $R\in K[N]$ evaluated in $N$ of degree at most $\ell-1$ such that $R(0)=0$ and $R^{\prime}(0)\neq 0$ . It remains to prove that $N$ can mapped to any element of rank $\ell-1$ via the action of $\operatorname{Stab}(I_{\ell})\cap\operatorname{Stab}(T)$ .

Let $e_{\ell}$ be the vector $(0,\cdots,0,1)$ , such that $R(N)\cdot e_{\ell}$ corresponds to the last colum of $R(N)$ . We have $R(N)^{\ell-1}e_{\ell}\neq 0$ . Thus, let $P(N)$ be the matrix whose columns are given by the tuple $(R(N)^{\ell-1}\cdot e_{\ell},R(N)^{\ell-2}\cdot e_{\ell},\ldots,R(N)\cdot e_{\ell},e_{\ell})$ . We have $R(N)P(N)=(0,R(N)^{\ell-1}\cdot e_{\ell},\ldots,R(N)^{2}\cdot e_{\ell},R(N)\cdot e_{\ell})=R(N)$ and $P(N)N=(0,R(N)^{\ell-1}\cdot e_{\ell},R(N)^{\ell-2}\cdot e_{\ell},\ldots,R(N)\cdot e_{\ell})$ . Consequently, we have $R(N)P(N)=P(N)N$ and $P(N)^{-1}R(N)P(N)=N$ . We take $M_{2}(R)=({P(N)}^{\mathrm{T}},P(N)^{-1})$ :

[TABLE]

3.

Let $(\Psi,\Psi^{\prime})$ be a couple of elements of $T$ such that $\operatorname{rk}(\Psi)=\ell$ and $\operatorname{rk}(\Psi^{\prime})=\ell-1$ . Let $(P,P^{\prime})$ be the corresponding matrices. According to the previous points, there exist $M_{1}\in\operatorname{Stab}(T)$ such that $I_{\ell}\cdot M_{1}=P$ and $M_{2}\in\operatorname{Stab}(T)\cap\operatorname{Stab}(I_{\ell})$ such that $N\cdot M_{2}=P^{\prime}\cdot M_{1}^{-1}$ . Consequently, we have

[TABLE]

4.

We prove that we have $\operatorname{Stab}(I_{\ell})\cap\operatorname{Stab}(N)\subset\operatorname{Stab}(T)$ and that, for any $M_{3}\in\operatorname{Stab}(I_{\ell})\cap\operatorname{Stab}(N)$ , there exists $R\in K[N]$ a polynomial of degree at most $\ell-1$ such that $R(0)\neq 0$ and

[TABLE]

Let $M_{3}\in\operatorname{Stab}(I_{\ell})\cap\operatorname{Stab}(N)$ . Since $M_{3}\in\operatorname{Stab}(I_{\ell})$ , there exists $P\in\operatorname{GL}_{\ell}$ such that $M_{3}=({(P^{-1})}^{\mathrm{T}},P)$ and, since $M_{3}\in\operatorname{Stab}(N)$ , $P^{-1}NP=N$ . We have $PN=NP$ .

Multiplying a matrix by $N$ on the left shifts the rows upward and multiplying $N$ on the right shifts the columns on the right. Therefore, denoting by $p_{ij}$ the coefficients of $P$ , with $p_{00}\neq 0$ and $p_{i0}=0$ for $i\geq 1$ , we have

[TABLE]

More particularly, $P$ is equal to the evaluation in $N$ of a polynomial $R$ such that $R(0)\neq 0$ , from which we deduce that

[TABLE]

Given the form of the elements of $\operatorname{Stab}(I_{\ell})\cap\operatorname{Stab}(N)$ , its cardinality is equal to the number of polynomials $R$ of degree at most $\ell-1$ such that $R(0)\neq 0$ , which is $\#K^{\ell-1}(K-1)$ . Combining with the fact that there are $\#K^{\ell-1}(K-1)\cdot\#K^{\ell-2}(K-1)$ pairs $(\Psi,\Psi^{\prime})$ of elements of $T$ such that $\operatorname{rk}(\Psi)=\ell$ and $\operatorname{rk}(\Psi^{\prime})=\ell-1$ , we have $\#\operatorname{Stab}(T)=\#K^{3\ell-4}(\#K-1)^{3}$ .

∎

A.2 Stabilizer of the matrix product

We denote by $T_{p,q,r}$ the vector space given by the product of matrices $p\times q$ by $q\times r$ , which is isomorphic to $\mathcal{M}_{p,r}\otimes I_{q}$ (we do not use the canonical basis for this representation). For the group action $M\cdot(X,Y)\mapsto{X}^{\mathrm{T}}MY$ , we want to prove Theorem 17. See 17

Proof.

Let $(X,Y)$ be a pair of invertible matrices such that ${X}^{\mathrm{T}}T_{p,q,r}Y=T_{p,q,r}$ . For any $i\in\left\{0,\ldots,p-1\right\}$ and $j\in\left\{0,\ldots,q-1\right\}$ , we denote by $M_{i,j}$ the matrix ${X}^{\mathrm{T}}\cdot(e_{i,j})\cdot Y$ , where $e_{i,j}$ is the canonical basis of $\mathcal{M}_{p,r}$ . Denoting by $X_{i,h}$ the $q\times q$ blocks of $X$ and $Y_{\ell,j}$ the $q\times q$ blocks of $Y$ , we have $M_{i,j}=(X_{i,h}Y_{j,\ell})_{h,\ell}$ for any $i$ and $j$ . Consequently, since ${X}^{\mathrm{T}}\cdot(e_{i,j})\cdot Y\in T_{p,q,r}$ , we have

[TABLE]

Let $(i,h)$ such that $X_{i,h}$ is not null and $j$ any integer in $\left\{0,\ldots,q-1\right\}$ . We have the inclusion

[TABLE]

and, since $Y$ is invertible, we even have the equality. Thus, for any $(i,h)$ such that $X_{i,h}$ is not null, we have shown that $X_{i,h}$ is invertible. We have the same property for the blocks of $Y$ .

Combining the fact that the blocks of $X$ and $Y$ that are not null are invertible and Equation (2), we can conclude that the stabilizer of $T_{p,q,r}$ is generated by matrices $(X,Y)$ such that there exists $g\in\operatorname{GL}_{q}$ satisfying

[TABLE]

∎

Appendix B Using the Hamming weight for the matrix product

We describe in this section a trick allowing one to speed-up the execution of our approach for the matrix product. However, this part is technical and can be skipped on a first read.

We still denote by $T$ the subspace of $\mathcal{L}(K^{6},K^{6};K)$ corresponding to the coefficients of the product of $3\times 2$ by $2\times 3$ matrices. We recall the stem of $T$ that we consider:

[TABLE]

We define the following sets:

$\mathcal{E}_{0}=\mathscr{S}_{{7}}(\operatorname{Span}(\Phi_{0,0}+\Phi_{1,1}+\Phi_{2,2}))$ ,

2.

$\mathcal{E}_{1}=\mathscr{S}_{{8}}(\operatorname{Span}(\Phi_{0,0}+\Phi_{1,1},\Phi_{0,1}+\Phi_{2,2}))$ ,

3.

$\mathcal{E}_{2}=\mathscr{S}_{{8}}(\operatorname{Span}(\Phi_{0,0}+\Phi_{1,1},\Phi_{1,1}+\Phi_{2,2}))$ ,

4.

$\mathcal{E}_{3}=\mathscr{S}_{{8}}(\operatorname{Span}(\Phi_{0,0}+\Phi_{1,1},\Phi_{2,2}))$ and

5.

$\mathcal{E}_{4}=\mathscr{S}_{{9}}(\operatorname{Span}(\Phi_{0,0},\Phi_{1,1},\Phi_{2,2}))$ .

In theory, we have to enumerate the elements of the sets $\tilde{\mathscr{S}}_{7}(\{\Phi_{0,0}+\Phi_{1,1}+\Phi_{2,2}\})$ , $\tilde{\mathscr{S}}_{8}(\{\Phi_{0,0}+\Phi_{1,1},\Phi_{0,1}+\Phi_{2,2}\})$ , $\tilde{\mathscr{S}}_{8}(\{\Phi_{0,0}+\Phi_{1,1},\Phi_{1,1}+\Phi_{2,2}\})$ , $\tilde{\mathscr{S}}_{8}(\{\Phi_{0,0}+\Phi_{1,1},\Phi_{2,2}\})$ and $\tilde{\mathscr{S}}_{9}(\{\Phi_{0,0},\Phi_{1,1},\Phi_{2,2}\})$ , denoted by $\tilde{\mathcal{E}}_{0}$ , $\tilde{\mathcal{E}}_{1}$ , $\tilde{\mathcal{E}}_{2}$ , $\tilde{\mathcal{E}}_{3}$ and $\tilde{\mathcal{E}}_{4}$ , respectively. However, one can notice that, given $V\in\mathscr{S}_{{15}}(T)$ such that

[TABLE]

it may happen that there exists $W^{\prime}\subset V$ such that

[TABLE]

and

[TABLE]

In other terms, the $V$ ’s corresponding to the $5$ sets to enumerate do not form a partition of $\mathscr{S}_{{15}}(T)$ .

Thus, we propose, if possible, to enumerate a subset of $\mathcal{E}_{4}$ , rather than the whole set, without losing exhaustivity. The strategy that is proposed is related to the notion of Hamming weight of the elements $\Phi_{0,0}$ , $\Phi_{1,1}$ and $\Phi_{2,2}$ .

Definition 32 (Hamming weight for $\mathscr{S}_{{d}}$ ).

Let $W\in\mathscr{S}_{{d}}$ and $\mathcal{B}=(\psi_{0},\ldots,\psi_{d-1})$ a basis of rank-one bilinear forms of $W$ . Any $x\in W$ has a unique decomposition over $\mathcal{B}$ :

[TABLE]

We define its Hamming weight over $\mathcal{B}$ as

[TABLE]

We can extend the definition of the Hamming weight to any subset $\mathcal{S}$ of $W$ :

[TABLE]

The Hamming weight over some basis has a useful property related to the bilinear rank stated in Lemma 33.

Lemma 33.

Let $W\in\mathscr{S}_{{d}}$ and $\mathcal{B}$ a basis of $W$ composed of rank-one bilinear forms. For any subset $\mathcal{S}$ of $W$ , we have

[TABLE]

Proof.

Clear from the definition of the rank of a set $\mathcal{S}$ given in Definition 5. ∎

We describe in Theorem 34 what is the subset of $\tilde{\mathcal{E}}_{4}$ that we consider.

Theorem 34.

Let $W$ be a subspace such that $W\in\tilde{\mathcal{E}}_{4}\ \text{and}\ T_{3,2,3}+W\in\mathscr{S}_{{15}}$ and let $\mathcal{B}$ be a basis of $W$ composed of rank-one bilinear forms. Let $\tilde{\mathcal{E}}^{\prime}$ be the subset of elements $W\in\tilde{\mathcal{E}}_{4}$ such that

[TABLE]

We obtain all the elements of $\mathscr{S}_{{15}}(T)$ via the enumeration of $\tilde{\mathcal{E}}_{0}$ , $\tilde{\mathcal{E}}_{1}$ , $\tilde{\mathcal{E}}_{2}$ , $\tilde{\mathcal{E}}_{3}$ and $\tilde{\mathcal{E}}^{\prime}$ .

We prove Theorem 34 within $2$ steps:

We prove in Lemma 35 that if $\mathbb{H}_{\mathcal{B}}(\Phi_{0,0}+\Phi_{1,1}+\Phi_{2,2})\neq\mathbb{H}_{\mathcal{B}}(\Phi_{0,0})+\mathbb{H}_{\mathcal{B}}(\Phi_{1,1})+\mathbb{H}_{\mathcal{B}}(\Phi_{2,2})$ , a subspace $V$ obtained as $V=T+W\circ\sigma$ can also be obtained as $V=T+W^{\prime}\circ\sigma$ , with $W^{\prime}\in\tilde{\mathcal{E}}_{0}$ , $\tilde{\mathcal{E}}_{1}$ or $\tilde{\mathcal{E}}_{3}$ . 2. 2.

Otherwise, if $\mathbb{H}_{\mathcal{B}}(\Phi_{0,0}+\Phi_{1,1}+\Phi_{2,2})=\mathbb{H}_{\mathcal{B}}(\Phi_{0,0})+\mathbb{H}_{\mathcal{B}}(\Phi_{1,1})+\mathbb{H}_{\mathcal{B}}(\Phi_{2,2})$ , it remains to prove that we do not lose in generality if we assume that $\mathbb{H}_{\mathcal{B}}(\Phi_{0,0}+\Phi_{1,1}+\Phi_{2,2})>6$ , which is done in Lemma 36.

Lemma 35.

Let $W\in\mathscr{S}_{{9}}$ and let $V=T+W$ . We assume that $T+W\in\mathscr{S}_{{15}}(T)$ and $\operatorname{Span}(\{\Phi_{0,0},\Phi_{1,1},\Phi_{2,2}\})\subset W$ . Let $\mathcal{B}$ be a basis of rank-one bilinear forms of $W$ . If $\mathbb{H}_{\mathcal{B}}(\Phi_{0,0}+\Phi_{1,1}+\Phi_{2,2})\neq\mathbb{H}_{\mathcal{B}}(\Phi_{0,0})+\mathbb{H}_{\mathcal{B}}(\Phi_{1,1})+\mathbb{H}_{\mathcal{B}}(\Phi_{2,2})$ , there exists $W^{\prime}\subset W$ such that $V=T+W^{\prime}$ and there exists $\sigma^{\prime}\in\operatorname{Stab}(T)$ such that $W^{\prime}\circ\sigma^{\prime}\in{\mathcal{E}}_{0},{\mathcal{E}}_{1}$ or ${\mathcal{E}}_{3}$ .

Proof.

We have by hypothesis $\mathbb{H}_{\mathcal{B}}(\Phi_{0,0}+\Phi_{1,1}+\Phi_{2,2})<\mathbb{H}_{\mathcal{B}}(\Phi_{0,0})+\mathbb{H}_{\mathcal{B}}(\Phi_{1,1})+\mathbb{H}_{\mathcal{B}}(\Phi_{2,2})$ . Thus, there exist two elements $\Psi\in\mathcal{B}$ and $\Phi\in\{\Phi_{0,0},\Phi_{1,1},\Phi_{2,2}\}$ such that the coordinate of $\Phi$ on $\Psi$ is not zero and the coordinates of $\Phi_{0,0}+\Phi_{1,1}+\Phi_{2,2}$ on $\Psi$ is zero. By considering the vector space $W^{\prime}=\operatorname{Span}(\mathcal{B}-\{\Psi\})$ , we have $W^{\prime}\in\mathscr{S}_{{8}}$ . Moreover, we have $W=\operatorname{Span}(\{\Psi\})\oplus W^{\prime}=\operatorname{Span}(\{\Phi\})\oplus W^{\prime}$ and $T+(\operatorname{Span}(\{\Phi\})\oplus W^{\prime})=\subset T+W^{\prime}$ . Thus, $\dim(T+W^{\prime})=\dim(T+W)=15$ . Consequently, $\dim(T\cap W^{\prime})=2$ .

If there exists in $T\cap W^{\prime}$ two elements $\Phi_{1}$ and $\Phi_{2}$ of rank smaller or equal to $4$ such that $\Phi_{1}+\Phi_{2}=\Phi_{0,0}+\Phi_{1,1}+\Phi_{2,2}$ , then

[TABLE]

Otherwise, there exists $W^{\prime\prime}\subset W^{\prime}$ such that

[TABLE]

and $T+W^{\prime\prime}\in\mathscr{S}_{{15}}$ , which concludes. ∎

Lemma 36.

Let $V\in\mathscr{S}_{{15}}(T)$ . The subspace $V$ satisfies hypotheses H1 and H2 state as follows.

H1:

For any $W\subset V$ such that there exists $\sigma\in\operatorname{Stab}(T)$ satisfying $W\circ\sigma\in\mathscr{S}_{{9}}(\operatorname{Span}(\Phi_{0,0},\Phi_{1,1},\Phi_{2,2}))$ and $T+W\in\mathscr{S}_{{15}}$ , we have, for any basis $\mathcal{B}$ of rank-one bilinear forms of $W$ ,

[TABLE]

H2:

There do not exist $W\subset V$ and $\sigma\in\operatorname{Stab}(T)$ such that $W\circ\sigma\in{\mathcal{E}}_{0},{\mathcal{E}}_{1}$ , $\mathcal{E}_{2}$ or ${\mathcal{E}}_{3}$ and $V=T+W$ (in other terms, $V$ can not be obtained via the enumeration of $\tilde{\mathcal{E}}_{0}$ , $\tilde{\mathcal{E}}_{1}$ , $\tilde{\mathcal{E}}_{2}$ or $\tilde{\mathcal{E}}_{3}$ ).

Then, there exists $W^{\prime}\subset V$ and $\sigma^{\prime}\in\operatorname{Stab}(T)$ such that $W^{\prime}\circ\sigma^{\prime}\in\mathscr{S}_{{9}}(\operatorname{Span}(\Phi_{0,0},\Phi_{1,1},\Phi_{2,2}))$ , $T+W^{\prime}\in\mathscr{S}_{{15}}$ , and $W^{\prime}$ has a basis $\mathcal{B}^{\prime}$ of rank-one bilinear forms such that

[TABLE]

Proof.

Let $W\in\mathscr{S}_{{6}}$ be such that $T\oplus W\in\mathscr{S}_{{15}}$ . Take a basis $\mathcal{W}$ of $W$ , and complete it into a basis $\mathcal{B}$ of $T\oplus W$ using $9$ rank-one bilinear forms, denoted by $\{\psi_{i}\}_{0\leq i<9}$ . For all $i\in\left\{0,\ldots,8\right\}$ , write $\psi_{i}=\Phi_{i}+\Psi_{i}$ , with $\Phi_{i}\in T$ and $\Psi_{i}\in W$ . The $\Phi_{i}$ ’s form a basis of $T$ . In our context, we are concerned by the case $\operatorname{rk}(\Phi_{i})=2$ for any $i$ (otherwise H2 is not satisfied). Since we assume Hypothesis H1, it is enough to prove that there exists $i$ such that $\mathbb{H}_{\mathcal{B}}(\Phi_{i})>2$ .

There is necessarily a couple $(\Phi_{i},\Phi_{i^{\prime}})$ such that $\mathbb{H}_{\mathcal{B}}(\Phi_{i}+\Phi_{i^{\prime}})<\mathbb{H}_{\mathcal{B}}(\Phi_{i})+\mathbb{H}_{\mathcal{B}}(\Phi_{i^{\prime}}).$ Otherwise, we would have $\#\mathcal{B}\geq 2\cdot 9=18\neq 15=\operatorname{rk}(T)$ .

If $\mathbb{H}_{\mathcal{B}}(\Phi_{i})=2$ and $\mathbb{H}_{\mathcal{B}}(\Phi_{i^{\prime}})=2$ , then

[TABLE]

and, by Lemma 33, $\operatorname{rk}(\{\Phi_{i},\Phi_{i^{\prime}}\})\leq 3$ . Henceforth, we prove that this is contradictory, because $\operatorname{rk}(\{\Phi_{i},\Phi_{i^{\prime}}\})=4$ . Indeed, there are two cases.

If $\Phi_{i}+\Phi_{i^{\prime}}$ has rank $4$ , the conclusion follows.

2.

If $\operatorname{Span}(\{\Phi_{i},\Phi_{i^{\prime}}\})$ is isomorphic to $T_{2,2,1}$ , whose rank is equal to $4$ : $T_{2,2,1}$ and $T_{2,1,2}$ have the same rank according to [10] and $T_{2,1,2}$ is a vector space of dimension one generated by a bilinear form of rank $4$ .

Consequently, $\mathbb{H}_{\mathcal{B}}(\Phi_{i})>2$ or $\mathbb{H}_{\mathcal{B}}(\Phi_{i^{\prime}})>2$ . ∎

Bibliography27

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] R. Barbulescu, J. Detrey, N. Estibals, and P. Zimmermann. Finding optimal formulae for bilinear maps. Arithmetic of finite fields: 4th International Workshop, WAIFI 2012, Bochum, Germany, July 16-19, 2012. Proceedings , pages 168–186, 2012. doi:10.1007/978-3-642-31662-3_12 . · doi ↗
2[2] R. Barbulescu, J. Detrey, N. Estibals, and P. Zimmermann. Finding optimal formulae for bilinear maps. Ari C Seminar, Mar. 2012. URL: https://hal.inria.fr/hal-01413162 .
3[3] A. Bernardi, J. Brachat, P. Comon, and B. Mourrain. General tensor decomposition, moment matrices and applications. Journal of Symbolic Computation , 52:51–71, 2013. International Symposium on Symbolic and Algebraic Computation. doi:10.1016/j.jsc.2012.05.012 . · doi ↗
4[4] M. Bläser. On the complexity of the multiplication of matrices of small formats. Journal of Complexity , 19(1):43–60, 2003. doi:10.1016/S 0885-064X(02)00007-9 . · doi ↗
5[5] W. Bosma, J. Cannon, and C. Playoust. The Magma algebra system. I. The user language. J. Symbolic Comput. , 24(3-4):235–265, 1997. Computational algebra and number theory (London, 1993). doi:10.1006/jsco.1996.0125 . · doi ↗
6[6] R. W. Brockett and D. Dobkin. On the optimal evaluation of a set of bilinear forms. Linear Algebra and its Applications , 19(3):207–235, 1978. doi:10.1016/0024-3795(78)90012-5 . · doi ↗
7[7] P. Bürgisser, M. Clausen, and M. A. Shokrollahi. Algebraic Complexity Theory . Springer, 1st edition, 2010.
8[8] D. Chudnovsky and G. Chudnovsky. Algebraic complexities and algebraic curves over finite fields. Journal of Complexity , 4(4):285–316, 1988. doi:10.1016/0885-064X(88)90012-X . · doi ↗

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Improved method for finding optimal formulae for bilinear maps in a finite field

Abstract

keywords:

1 Introduction

State of the art

Contributions

Roadmap

2 Preliminaries

2.1 Problem statement

Example 1** (Multiplication of linear polynomials).**

Definition 2** (Bilinear rank).**

Example 3** (Short product of polynomials of degree 222).**

2.2 A linear algebra problem

Notation 4**.**

Definition 5** (Rank of a subspace of L(Km,Kn;K)\mathcal{L}(K^{m},K^{n};K)L(Km,Kn;K)).**

General strategy for computing the bilinear rank

2.3 The BDEZ Algorithm (Barbulescu, Detrey, Estibals, Zimmermann)

Proposition 6**.**

Proof.

3 Improving on BDEZ using symmetries

3.1 Action of automorphisms on L(Km,Kn;K)\mathcal{L}(K^{m},K^{n};K)L(Km,Kn;K)

Definition 7**.**

Proposition 8**.**

Proof.

Proposition 9** (RP⁡\operatorname{RP}RP-automorphisms preserve the rank).**

Proof.

Remark 10**.**

Notation 11** (Group action on matrices).**

Example 12** (Action of GL⁡(K2)×GL⁡(K2)\operatorname{GL}({K^{2}})\times\operatorname{GL}({K^{2}})GL(K2)×GL(K2)).**

Definition 13** (Setwise stabilizer).**

3.2 BDEZ with stabilizer

General strategy for computing the bilinear rank using RP⁡\operatorname{RP}RP-automorphisms

Proposition 14**.**

Proof.

4 Algebraic structure of some bilinear maps

4.1 Short product

Theorem 15**.**

Proof.

4.2 Matrix product

Example 16** (Matrix representation of elements of T2,2,2T_{2,2,2}T2,2,2​).**

Theorem 17**.**

Proof.

Corollary 18**.**

Example 19** (Action of the stabilizer of T2,2,2T_{2,2,2}T2,2,2​).**

5 Coverings of subspaces of bilinear forms

5.1 Theoretical aspect

Definition 20** (Covering of a vector space).**

Proposition 21**.**

Proof.

Definition 22** (Stem of a vector space).**

Proposition 23**.**

Proof.

Example 24** (Two examples of stems).**

Notation 25**.**

Remark 26**.**

Theorem 27**.**

5.2 A stem for the short product

Proposition 28** (Stem for the short product).**

Proof.

5.3 A stem for the matrix product 3×23\times 23×2 by 2×32\times 32×3 over F2\mathbb{F}_{2}F2​

Proposition 29** (Covering of the matrix product).**

Proof.

6 How to compute subspaces containing specific bilinear forms

6.1 General approach

Correctness of Algorithm 5.

6.2 Application to the short product

6.3 Computing the orbits of vector spaces of bilinear forms

Notation 30**.**

Example 31**.**

7 Experimental results

7.1 Recursive approach

7.2 Matrix product 3×23\times 23×2 by 2×32\times 32×3

7.3 Matrix product 2×32\times 32×3 by 3×23\times 23×2

7.4 Short product

Example 1 (Multiplication of linear polynomials).

Definition 2 (Bilinear rank).

Example 3 (Short product of polynomials of degree $2$ ).

Notation 4.

Definition 5 (Rank of a subspace of $\mathcal{L}(K^{m},K^{n};K)$ ).

Proposition 6.

3.1 Action of automorphisms on $\mathcal{L}(K^{m},K^{n};K)$

Definition 7.

Proposition 8.

Proposition 9 ( $\operatorname{RP}$ -automorphisms preserve the rank).

Remark 10.

Notation 11 (Group action on matrices).

Example 12 (Action of $\operatorname{GL}({K^{2}})\times\operatorname{GL}({K^{2}})$ ).

Definition 13 (Setwise stabilizer).

General strategy for computing the bilinear rank using $\operatorname{RP}$ -automorphisms

Proposition 14.

Theorem 15.

Example 16 (Matrix representation of elements of $T_{2,2,2}$ ).

Theorem 17.

Corollary 18.

Example 19 (Action of the stabilizer of $T_{2,2,2}$ ).

Definition 20 (Covering of a vector space).

Proposition 21.

Definition 22 (Stem of a vector space).

Proposition 23.

Example 24 (Two examples of stems).

Notation 25.

Remark 26.

Theorem 27.

Proposition 28 (Stem for the short product).

5.3 A stem for the matrix product $3\times 2$ by $2\times 3$ over $\mathbb{F}_{2}$

Proposition 29 (Covering of the matrix product).

Notation 30.

Example 31.

7.2 Matrix product $3\times 2$ by $2\times 3$

7.3 Matrix product $2\times 3$ by $3\times 2$

Definition 32 (Hamming weight for $\mathscr{S}_{{d}}$ ).

Lemma 33.

Theorem 34.

Lemma 35.

Lemma 36.