Efficient evaluation of noncommutative polynomials using tensor and   noncommutative Waring decompositions

Eric Evert; J. William Helton; Shiyuan Huang; Jiawang Nie

arXiv:1903.05910·math.FA·February 24, 2022

Efficient evaluation of noncommutative polynomials using tensor and noncommutative Waring decompositions

Eric Evert, J. William Helton, Shiyuan Huang, Jiawang Nie

PDF

Open Access

TL;DR

This paper explores efficient evaluation methods for noncommutative polynomials using tensor and Waring decompositions, aiming to reduce matrix multiplications and improve computational speed.

Contribution

It introduces a noncommutative Waring decomposition framework, compares it with classical approaches, and provides methods for computing these decompositions to enhance evaluation efficiency.

Findings

01

Decomposition reduces matrix multiplications needed for evaluation.

02

Comparison shows noncommutative polynomials differ from commutative ones in decomposability.

03

Proposed methods improve evaluation speed for generic noncommutative polynomials.

Abstract

This paper analyses a Waring type decomposition of a noncommuting (NC) polynomial $p$ with respect to the goal of evaluating $p$ efficiently on tuples of matrices. Such a decomposition can reduce the number of matrix multiplications needed to evaluate a noncommutative polynomial and is valuable when a single polynomial must be evaluated on many matrix tuples. In pursuit of this goal we examine a noncommutative analog of the classical Waring problem and various related decompositions. For example, we consider a "Waring decomposition" in which each product of linear terms is actually a power of a single linear NC polynomial or more generally a power of a homogeneous NC polynomial. We describe how NC polynomials compare to commutative ones with regard to these decompositions, describe a method for computing the NC decompositions and compare the effect of various decompositions on the…

Tables1

Table 1. Table 1. Break even points for evaluation of a homogeneous NC polynomial using LPS to be more efficient than Horner’s method or naive evaluation.

	no. of evals. for LPS to break even vs.				generic	tensor decomp.
(g,d)	Horner		naive		tensor	time	rel.
	$20 \times 20$	$100 \times 100$	$20 \times 20$	$100 \times 100$	rank	(s)	error
(3,3)⁴⁴4The space ${(ℂ^{3})}^{\otimes 3}$ is defective and the generic rank for tensors of this size is $5$ rather than the expected $4$ .	9,000	430	409	20	5	0.025	$1 * 10^{- 14}$
(4,4)	54,751	2,618	1,856	89	20	1.85	$8 * 10^{- 4}$
(8,4)	139,136	6,654	1,853	89	142	30.9	$7 * 10^{- 4}$
(5,5)	290,994	13,916	4,498	215	149	75.3	$1 * 10^{- 3}$
(3,6)	171,110	8,183	3,972	190	57	18.8	$3 * 10^{- 4}$

Equations218

x = (x_{1}, x_{2}, ..., x_{g})

x = (x_{1}, x_{2}, ..., x_{g})

L_{s} (x) := A_{1}^{(s)} x_{1} + A_{2}^{(s)} x_{2} + ... A_{g}^{(s)} x_{g},

L_{s} (x) := A_{1}^{(s)} x_{1} + A_{2}^{(s)} x_{2} + ... A_{g}^{(s)} x_{g},

x^{α} = x_{α_{1}} x_{α_{2}} x_{α_{3}} ... x_{α_{d}} .

x^{α} = x_{α_{1}} x_{α_{2}} x_{α_{3}} ... x_{α_{d}} .

p (x) = α \sum P_{α} x^{α}

p (x) = α \sum P_{α} x^{α}

p (X) = ∣ α ∣ \leq d \sum P_{α} X^{α}

p (X) = ∣ α ∣ \leq d \sum P_{α} X^{α}

p (x) = ∣ α ∣ = d \sum T (α) x^{α}

p (x) = ∣ α ∣ = d \sum T (α) x^{α}

T_{p} = s = 1 \sum r A^{(s)} (1) \otimes A^{(s)} (2) \otimes \dots \otimes A^{(s)} (d) w h er e A^{(s)} (i) = A_{1}^{(s)} (i) ⋮ A_{g}^{(s)} (i) \in C^{g}

T_{p} = s = 1 \sum r A^{(s)} (1) \otimes A^{(s)} (2) \otimes \dots \otimes A^{(s)} (d) w h er e A^{(s)} (i) = A_{1}^{(s)} (i) ⋮ A_{g}^{(s)} (i) \in C^{g}

p (x)

p (x)

\displaystyle=\sum_{|\alpha|=d}\bigg{(}\sum_{s=1}^{r}\prod_{i=1}^{d}A^{(s)}_{\alpha_{i}}(i)\bigg{)}x^{\alpha}.

\begin{array}[]{rclcl}p(x)&=&20x_{1}x_{1}x_{1}+50x_{1}x_{2}x_{1}+20x_{1}x_{3}x_{1}-30x_{2}x_{1}x_{1}-75x_{2}x_{2}x_{1}\\ &&-30x_{2}x_{3}x_{1}-10x_{3}x_{1}x_{1}-25x_{3}x_{2}x_{1}-10x_{3}x_{3}x_{1}-8x_{1}x_{1}x_{2}\\ &&-62x_{1}x_{2}x_{2}-35x_{1}x_{3}x_{2}+46x_{2}x_{1}x_{2}+59x_{2}x_{2}x_{2}+10x_{2}x_{3}x_{2}\\ &&+26x_{3}x_{1}x_{2}+9x_{3}x_{2}x_{2}-10x_{3}x_{3}x_{2}+44x_{1}x_{1}x_{3}+26x_{1}x_{2}x_{3}\\ &&-10x_{1}x_{3}x_{3}+2x_{2}x_{1}x_{3}-107x_{2}x_{2}x_{3}-70x_{2}x_{3}x_{3}+22x_{3}x_{1}x_{3}\\ &&-57x_{3}x_{2}x_{3}-50x_{3}x_{3}x_{3}.\end{array}

\begin{array}[]{rclcl}p(x)&=&20x_{1}x_{1}x_{1}+50x_{1}x_{2}x_{1}+20x_{1}x_{3}x_{1}-30x_{2}x_{1}x_{1}-75x_{2}x_{2}x_{1}\\ &&-30x_{2}x_{3}x_{1}-10x_{3}x_{1}x_{1}-25x_{3}x_{2}x_{1}-10x_{3}x_{3}x_{1}-8x_{1}x_{1}x_{2}\\ &&-62x_{1}x_{2}x_{2}-35x_{1}x_{3}x_{2}+46x_{2}x_{1}x_{2}+59x_{2}x_{2}x_{2}+10x_{2}x_{3}x_{2}\\ &&+26x_{3}x_{1}x_{2}+9x_{3}x_{2}x_{2}-10x_{3}x_{3}x_{2}+44x_{1}x_{1}x_{3}+26x_{1}x_{2}x_{3}\\ &&-10x_{1}x_{3}x_{3}+2x_{2}x_{1}x_{3}-107x_{2}x_{2}x_{3}-70x_{2}x_{3}x_{3}+22x_{3}x_{1}x_{3}\\ &&-57x_{3}x_{2}x_{3}-50x_{3}x_{3}x_{3}.\end{array}

T_{p} (:, :, 1) = 20 - 30 - 10 50 - 75 - 25 20 - 30 - 10 an d T_{p} (:, :, 2) = - 8 4626 - 62 599 - 35 10 - 10

T_{p} (:, :, 1) = 20 - 30 - 10 50 - 75 - 25 20 - 30 - 10 an d T_{p} (:, :, 2) = - 8 4626 - 62 599 - 35 10 - 10

T_{p} (:, :, 3) = 44222 26 - 107 - 57 - 10 - 70 - 50

T_{p} (:, :, 3) = 44222 26 - 107 - 57 - 10 - 70 - 50

T = - 3 - 4 - 4 \otimes - 4 45 \otimes 012 + - 2 31 \otimes 252 \otimes - 5 5 - 5 .

T = - 3 - 4 - 4 \otimes - 4 45 \otimes 012 + - 2 31 \otimes 252 \otimes - 5 5 - 5 .

\begin{array}[]{rclcl}p(x)&=&(-3x_{1}-4x_{2}-4x_{3})(-4x_{1}+4x_{2}+5x_{3})(x_{2}+2x_{3})\\ &&+(-2x_{1}+3x_{2}+x_{3})(2x_{1}+5x_{2}+2x_{3})(-5x_{1}+5x_{2}-5x_{3})\end{array}

\begin{array}[]{rclcl}p(x)&=&(-3x_{1}-4x_{2}-4x_{3})(-4x_{1}+4x_{2}+5x_{3})(x_{2}+2x_{3})\\ &&+(-2x_{1}+3x_{2}+x_{3})(2x_{1}+5x_{2}+2x_{3})(-5x_{1}+5x_{2}-5x_{3})\end{array}

p (x) = c + i = 1 \sum g x_{i} p_{i} (x)

p (x) = c + i = 1 \sum g x_{i} p_{i} (x)

\begin{array}[]{lclcl}x_{1}(x_{1}(20x_{1}-8x_{2}+44x_{3})+x_{2}(50x_{1}-62x_{2}+26x_{3})+5x_{3}(4x_{1}-7x_{3}-2x_{3}))\\ -x_{2}(x_{1}(30x_{1}-46x_{2}-2x_{3})+x_{2}(75x_{1}-59x_{2}+107x_{3})+10x_{3}(3x_{1}-x_{2}+7x_{3}))\\ -x_{3}(x_{1}(10x_{1}-26x_{2}-22x_{3})+x_{2}(25x_{1}-9x_{2}+57x_{3})+10x_{3}(x_{1}+x_{2}-5x_{3}).\end{array}

\begin{array}[]{lclcl}x_{1}(x_{1}(20x_{1}-8x_{2}+44x_{3})+x_{2}(50x_{1}-62x_{2}+26x_{3})+5x_{3}(4x_{1}-7x_{3}-2x_{3}))\\ -x_{2}(x_{1}(30x_{1}-46x_{2}-2x_{3})+x_{2}(75x_{1}-59x_{2}+107x_{3})+10x_{3}(3x_{1}-x_{2}+7x_{3}))\\ -x_{3}(x_{1}(10x_{1}-26x_{2}-22x_{3})+x_{2}(25x_{1}-9x_{2}+57x_{3})+10x_{3}(x_{1}+x_{2}-5x_{3}).\end{array}

L_{s} (x) := A_{1}^{(s)} x_{1} + A_{2}^{(s)} x_{2} + ... + A_{g}^{(s)} x_{g}

L_{s} (x) := A_{1}^{(s)} x_{1} + A_{2}^{(s)} x_{2} + ... + A_{g}^{(s)} x_{g}

p (x) = s = 1 \sum r (L_{s} (x))^{d}

p (x) = s = 1 \sum r (L_{s} (x))^{d}

G_{s} (x) = ∣ α ∣ = δ \sum A_{α}^{(s)} x^{α}

G_{s} (x) = ∣ α ∣ = δ \sum A_{α}^{(s)} x^{α}

p (x) = s = 1 \sum r (G_{s} (x))^{d}

p (x) = s = 1 \sum r (G_{s} (x))^{d}

\mathbbm 1_{j}^{α_{i}} = {10 if α_{i} = j if α_{i} \neq = j .

\mathbbm 1_{j}^{α_{i}} = {10 if α_{i} = j if α_{i} \neq = j .

\mathbbm 1_{j}^{α} := i = 1 \sum d \mathbbm 1_{j}^{α_{i}} .

\mathbbm 1_{j}^{α} := i = 1 \sum d \mathbbm 1_{j}^{α_{i}} .

⌈ \frac{( d g + d - 1 )}{g} ⌉,

⌈ \frac{( d g + d - 1 )}{g} ⌉,

p_{T} (X) = ∣ α ∣ = d \sum T_{α} X^{α} .

p_{T} (X) = ∣ α ∣ = d \sum T_{α} X^{α} .

T = s = 1 \sum r A^{(s)} \otimes \dots \otimes A^{(s)} where d copies of A^{(s)} appear in each tensor product.

T = s = 1 \sum r A^{(s)} \otimes \dots \otimes A^{(s)} where d copies of A^{(s)} appear in each tensor product.

p_{T} (X) = s = 1 \sum r (i = 1 \sum g A_{i}^{(s)} X_{i})^{d} .

p_{T} (X) = s = 1 \sum r (i = 1 \sum g A_{i}^{(s)} X_{i})^{d} .

\begin{array}[]{rclcl}p(x)&=&x_{1}^{3}-4x_{2}^{3}-4x_{3}^{3}+5x_{1}x_{1}x_{2}+5x_{1}x_{2}x_{1}+5x_{2}x_{1}x_{1}\\ &&-3x_{1}x_{1}x_{3}-3x_{1}x_{3}x_{1}-3x_{3}x_{1}x_{1}+7x_{2}x_{2}x_{1}+7x_{2}x_{1}x_{2}+7x_{1}x_{2}x_{2}\\ &&-11x_{2}x_{2}x_{3}-11x_{2}x_{3}x_{2}-11x_{3}x_{2}x_{2}+6x_{3}x_{3}x_{1}+6x_{3}x_{1}x_{3}+6x_{1}x_{3}x_{3}\\ &&-6x_{3}x_{3}x_{2}-6x_{3}x_{2}x_{3}-6x_{2}x_{3}x_{3}+x_{1}x_{2}x_{3}+x_{1}x_{3}x_{2}+x_{2}x_{1}x_{3}\\ &&+x_{2}x_{3}x_{1}+x_{3}x_{1}x_{2}+x_{3}x_{2}x_{1}.\end{array}

\begin{array}[]{rclcl}p(x)&=&x_{1}^{3}-4x_{2}^{3}-4x_{3}^{3}+5x_{1}x_{1}x_{2}+5x_{1}x_{2}x_{1}+5x_{2}x_{1}x_{1}\\ &&-3x_{1}x_{1}x_{3}-3x_{1}x_{3}x_{1}-3x_{3}x_{1}x_{1}+7x_{2}x_{2}x_{1}+7x_{2}x_{1}x_{2}+7x_{1}x_{2}x_{2}\\ &&-11x_{2}x_{2}x_{3}-11x_{2}x_{3}x_{2}-11x_{3}x_{2}x_{2}+6x_{3}x_{3}x_{1}+6x_{3}x_{1}x_{3}+6x_{1}x_{3}x_{3}\\ &&-6x_{3}x_{3}x_{2}-6x_{3}x_{2}x_{3}-6x_{2}x_{3}x_{3}+x_{1}x_{2}x_{3}+x_{1}x_{3}x_{2}+x_{2}x_{1}x_{3}\\ &&+x_{2}x_{3}x_{1}+x_{3}x_{1}x_{2}+x_{3}x_{2}x_{1}.\end{array}

T (:, :, 1) = 15 - 3 571 - 3 16 and T (:, :, 2) = 571 7 - 4 - 11 1 - 11 - 6

T (:, :, 1) = 15 - 3 571 - 3 16 and T (:, :, 2) = 571 7 - 4 - 11 1 - 11 - 6

T (:, :, 3) = - 3 16 1 - 11 - 6 6 - 6 - 4,

T (:, :, 3) = - 3 16 1 - 11 - 6 6 - 6 - 4,

T = v_{1} \otimes v_{1} \otimes v_{1} + v_{2} \otimes v_{2} \otimes v_{2} + v_{3} \otimes v_{3} \otimes v_{3} + v_{4} \otimes v_{4} \otimes v_{4}

T = v_{1} \otimes v_{1} \otimes v_{1} + v_{2} \otimes v_{2} \otimes v_{2} + v_{3} \otimes v_{3} \otimes v_{3} + v_{4} \otimes v_{4} \otimes v_{4}

v_{1} \approx - 0.081 0.409 1.890 v_{2} \approx 3.165 - 3.910 - 3.654

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTensor decomposition and applications · Matrix Theory and Algorithms · graph theory and CDMA systems

Full text

Efficient evaluation of noncommutative polynomials using tensor and noncommutative Waring decompositions

Eric Evert1

Eric Evert, Group Science, Engineering and Technology

KU Leuven Kulak,

E. Sabbelaan 53, 8500 Kortrijk, Belgium

and

Electrical Engineering ESAT/STADIUS

KU Leuven,

Kasteelpark Arenberg 10, 3001 Leuven, Belgium

[email protected]

,

J. William Helton1

J. William Helton, Department of Mathematics

University of California

San Diego

[email protected]

,

Shiyuan Huang1

Shiyuan Huang, Department of Computer Science

Columbia University

New York City

[email protected]

and

Jiawang Nie

Jiawang Nie, Department of Mathematics

University of California

San Diego

[email protected]

Abstract.

This paper analyses a Waring type decomposition of a noncommuting (NC) polynomial $p$ with respect to the goal of evaluating $p$ efficiently on tuples of matrices. Such a decomposition can reduce the number of matrix multiplications needed to evaluate a noncommutative polynomial and is valuable when a single polynomial must be evaluated on many matrix tuples.

In pursuit of this goal we examine a noncommutative analog of the classical Waring problem and various related decompositions. For example, we consider a “Waring decomposition” in which each product of linear terms is actually a power of a single linear NC polynomial or more generally a power of a homogeneous NC polynomial. We describe how NC polynomials compare to commutative ones with regard to these decompositions, describe a method for computing the NC decompositions and compare the effect of various decompositions on the speed of evaluation of generic NC polynomials.

Key words and phrases:

noncommutative polynomials, Waring problem, sums of powers, matrix variables, symmetric tensors

2010 Mathematics Subject Classification:

Primary 11P05, 46L52. Secondary 15A69, 47A56

1Research supported by the NSF grant DMS-1500835

1. Introduction

This paper concerns decompositions of noncommutative polynomials as sums of products of linear polynomials. The goal is to find ways of quickly evaluating noncommutative polynomials on tuples of matrices.

A place where efficient evaluations matter comes in numerical solution of problems arising in linear systems and control. Problems which are completely specified by signal flow diagrams having $L^{2}$ signals all take the form of solving collections of matrix inequalities based on polynomial matrix inequalities. For example, see [CHS06].

After changes of variables, some basic problems of this type convert to solving Linear Matrix Inequalities (whose coefficients are functions of the given system parameters) and for these there are numerous numerical optimization schemes [WSV00]. As with all optimization algorithms these require very many function evaluations.

[CHS06] showed how, using NC symbolic software, one could produce optimization algorithms whose linear subproblem has coefficients which are NC polynomials in the current iterate $\chi^{(k)}$ . As $\chi^{(1)},\chi^{(2)},\dots$ progresses toward the optimum, many function evaluations of NC polynomials are required.

The striking fact is that the NC polynomials $p_{1},\dots,p_{s}$ which must be evaluated depend only on the signal flow diagram and on the numerical optimization algorithm in the package. They do not depend on what is being designed, e.g.. a ship controller, airplane controller or helicopter controller (not to mention which ship, which plane etc).

Thus in the lifetime of a popular software toolbox a few specific polynomials must be evaluated billions (at least) of times on matrices of various sizes.

Pursuits involving noncommutative polynomials are in the spirit of the burgeoning area called free analysis. Here one takes classical problems and works out analogues with noncommutative variables, which are free of constraints. These free analogues typically have interpretations for matrix or operator variables and their development often impacts various areas.

One of the original efforts here was Voiculescu’s free probability, which started by developing a notion of entropy for operator variables and which has a become a big area having many associations to random matrix theory, [MS17]. Some other directions are free analytic function theory, cf. [KVV14] and free real algebraic geometry [BKP16] with some consequences for system engineering being [HMPV09]. Our paper concerns and gives applications for the noncommutative variant of the classical Waring problem.

1.1. Noncommutative polynomials

We work with functions of $g$ noncommutative variables

[TABLE]

and are interested in powers of linear functions

[TABLE]

where $s$ is an index and $A^{(s)}_{i}\in\mathbb{R}\ or\ \mathbb{C}$ for $1\leq i\leq g$ .

For any (index) tuple $\alpha=(\alpha_{1},\alpha_{2},...,\alpha_{d})$ , where $\alpha_{i}$ for $1\leq i\leq d$ are integers between 1 and g, we denote

[TABLE]

We say the monomial $x^{\alpha}$ has degree $d$ . For example, if $\alpha=(1,2,1,3)$ , then $x^{\alpha}=x_{1}x_{2}x_{1}x_{3}$ is a degree $4$ monomial.

A noncommutative (NC) polynomial is a formal sum of the form

[TABLE]

where $P_{\alpha}\in{\mathbb{R}}$ or ${\mathbb{C}}$ for each $\alpha$ and only finitely many of the $P_{\alpha}$ are nonzero. The degree of a NC polynomial is equal to that of its highest degree monomial which has a nonzero coefficient. If all monomials of a NC polynomial with nonzero coefficients have the same degree, then the NC polynomial is homogeneous.

Let $p(x)=\sum_{|\alpha|\leq d}P_{\alpha}x^{\alpha}$ be a noncommutative polynomial in $g$ noncommutative variables. Then for any $n$ and for any $g$ -tuple of $n\times n$ matrices $X=(X_{1},\dots,X_{g})$ , we define the evaluation of $p$ on $X$ by

[TABLE]

where $X^{0}=I_{n}$ . A question of practical interest is how to efficiently evaluate a NC polynomial on a collection of matrix tuples.

In this article we show that tensor decompositions may be used to significantly reduce the number of matrix multiplications needed to evaluate a noncommutative polynomial. Here a tensor is a multiindexed array $T\in(\mathbb{C}^{g})^{\otimes d}$ with entries $T(\alpha)\in\mathbb{C}$ where $\alpha=(\alpha_{1},\dots,\alpha_{d})$ is a $d$ -tuple of integers between $1$ and $g$ .

Our general strategy is as follows. First one associates a homogeneous noncommutative polynomial $p$ to a tensor $T_{p}\in(\mathbb{C}^{g})^{\otimes d}$ . By computing the tensor decomposition of the associated tensor, one gets a decomposition that expresses the NC polynomial as a sum of products of linear terms. This reduces the number of matrix multiplications needed to evaluate $p$ .

The nonhomogeneous setting can easily be handled can easily be handled by sorting $p$ as a sum of homogeneous polynomials. Additionally, one could homogenize the polynomials with a dummy variable (say $x_{0}$ ), then replace $x_{0}$ with $1$ after a factorization is obtained.

1.1.1. Evaluation using tensor decompositions

Let

[TABLE]

be a homogeneous degree $d$ noncommutative polynomial in $g$ variables $x=(x_{1},\dots,x_{g})$ . We can associate $p$ to the tensor $T_{p}=(T(\alpha))_{|\alpha|=d}$ . Suppose that $T_{p}$ has a rank $r$ decomposition

[TABLE]

for each $i$ and $s$ . Then we have

[TABLE]

We call a decomposition of the form (1.1.2) a linear product sum for the NC polynomial $p$ . Additionally, if $r$ is as small as possible, we say $p$ has product sum rank $r$ . Before continuing we give an example.

1.1.2. Example

Consider the noncommutative polynomial

[TABLE]

Think of its coefficients $p_{ijk}$ for $i,j,k=1,2,3$ as entries of a tensor $T_{p}$ with frontal slices

[TABLE]

and

[TABLE]

where $T_{p}(:,:,i)$ is the standard Matlab index notation. One can check that $p$ has the rank 2 decomposition

[TABLE]

It follows from (1.1.4) that $p$ has the rank $2$ linear product sum decomposition

[TABLE]

which one can check using NCAlgebra [OHMS17].

In this case, evaluating $p$ as it is written in equation (1.1.3) requires $54$ matrix multiplications and $26$ matrix additions. However, using equation (1.1.5) one needs only $4$ matrix multiplications and $12$ matrix additions, so our complexity is reduced by an order of $10$ .

As we illustrate later, a low rank tensor decomposition like (1.1.4) can be computed by standard numerical software packages such as Tensorlab. Accuracy of the decompositions will be discussed in section 2.3.4.

1.1.3. A basic NC Horner method

One may also evaluate a NC polynomial using a basic extension of Horner’s method to the NC setting. Given a degree $d$ NC polynomial $p(x)$ in $g$ variables, one may first write

[TABLE]

where $c$ is a constant and the degree of $p_{i}$ is less than $d$ for each $i$ . One may then recursively apply this method to each $p_{i}$ until all polynomials appearing in the summation have degree equal to one. For example, for the polynomial $p(x)$ in equation (1.1.3), is equal to

[TABLE]

Writing $p$ in this form allows $p$ to be evaluated using $12$ matrix multiplications and $26$ matrix additions, thus this method offers a significant improvement over naive evaluation. While this basic Horner method greatly improves on naive evaluation, the linear product sum decomposition for this NC polynomial is still notably more efficient.

Section 2.2 contains a more detailed comparison of the computational complexity of these three methods for generic homogeneous NC polynomials. We thank a referee for urging us to compare this method to linear product sums. Schrempf in [S19] subsequent to this paper introduced an interesting and natural method for evaluation. It heavily uses ‘linear system realizations’, known in the algebra community as ‘linearization’ or the ‘linearization trick’.

1.2. Waring decompositions of noncommutative polynomials

The case where a homogeneous noncommutative polynomial can be expressed as a sum of powers of linear forms adds further advantage for efficient numerical evaluation, as the $d$ th power of a matrix can be computed more efficiently than the product of $d$ matrices. This calls for a natural noncommutative generalization of the classical Waring problem.

The problem is as follows: Given a NC polynomial $p$ in the NC indeterminates $x=(x_{1},\dots,x_{g})$ , determine if there exist linear functions

[TABLE]

such that

[TABLE]

where $A_{i}^{(s)}\in{\mathbb{R}}$ or ${\mathbb{C}}$ for $1=1,\dots,g$ . We call a decomposition of the form (1.2.1) a rank $r$ real (resp. complex) Waring decomposition of $p$ . If $r$ is as small as possible then we say $p$ has Waring rank $r$ .

In the spirit of the NC Waring problem, we also consider the more general problem of determining if a homogeneous NC polynomial of degree $\delta d$ can be decomposed as a sum of $d$ th powers of homogeneous degree $\delta$ NC polynomials. That is, supposing $p$ is a homogeneous degree $\delta d$ NC polynomial, we wish to determine if there are homogeneous degree $\delta$ polynomials

[TABLE]

such that

[TABLE]

where each $A_{\alpha}$ is in ${\mathbb{R}}$ or ${\mathbb{C}}$ . We call a decomposition of the form (1.2.2) a rank $r$ real (resp. complex) $(\delta,d)$ -Waring decomposition of $p$ or sometimes a general Waring decomposition.

The NC Waring problem reduces to the classical commutative variable Waring problem, thereby effectively solving it over ${\mathbb{C}}$ . In a similar spirit, we reduce the NC general Waring problem to a classical Waring problem, but in more variables, see Section 4.

1.3. The NC Waring decomposition

Before stating a result we need a definition. Define an indicator function on an index $d$ -tuple $\alpha=(\alpha_{1},\dots,\alpha_{d})$ by first defining

[TABLE]

Then the indicator function $\mathbbm{1}_{j}^{\alpha}$ which gives the number of $j$ ’s appearing in $\alpha$ is

[TABLE]

We caution the reader that the superscript appearing on the indicator function $\mathbbm{1}_{j}^{\alpha}$ is not interpreted as a power.

A corollary for ${\delta}=1$ of Theorem 3.5 is:

Corollary 1.1.

Suppose a NC homogeneous polynomial $p(x)=\sum_{\alpha}{P_{\alpha}x^{\alpha}}$ , where $P_{\alpha}=P_{\alpha_{1},\alpha_{2},...,\alpha_{d}}$ $\in\mathbb{C}$ , satisfies $P_{\alpha}=P_{\tilde{\alpha}}$ for any index sets $\alpha,\tilde{\alpha}$ such that $\mathbbm{1}_{j}^{\alpha}=\mathbbm{1}_{j}^{\tilde{\alpha}}$ for all $1\leq j\leq g$ . Then $p$ has a NC complex coefficient Waring decomposition with linear powers. Moreover, for a generic NC homogeneous polynomial, the number of terms needed is

[TABLE]

except in the cases

•

$d=2$ , where $g$ terms are needed

•

$(d,g)=(3,5),(4,3),(4,4),(4,5)$ * where $\lceil\frac{1}{g}{{g+d-1}\choose d}\rceil+1$ terms are needed.*

Proof.

This corollary is a combination of Theorem 3.5, the main result in Section 3.3.2, and the solutions for the classical Waring Problem [AH95, OO12]. ∎

Here the term generic means that the set of exceptions is contained in a proper closed algebraic variety, i.e., in the zero set of a nontrivial system of polynomial equations.

Each term in a Waring decomposition of a NC polynomial can be evaluated by computing the $d$ th power of a matrix rather than computing the product of $d$ different matrices. This gives Waring decompositions an additional computational advantage over linear product sum decompositions when the number of terms needed for each decomposition is the same as one typically expects.

The authors thank Ignat Domanov for discussion related to NC Waring decompositions and efficient polynomial evaluations.

1.4. Guide to readers

In Section 2 we examine in more detail the use of linear product sum decompositions to evaluate NC polynomials on matrix variables. We then discus computation of NC Waring and linear product sum decompositions. Additionally we estimate the expected computational savings when evaluating a NC polynomial using one of these decompositions and provide timing comparisons for naive evaluation and evaluation using and linear product sum decompositions.

Section 3 shows that the NC Waring problem reduces to the classical Waring problem. The section begins by introducing a compatibility condition which is necessary for a NC homogeneous polynomial $p$ to have a Waring decomposition. Theorem 3.5 shows that a NC homogeneous polynomial $p$ has a $t$ -term Waring decomposition if and only if it satisfies our compatibility condition and its commutative collapse has a $t$ -term Waring decomposition.

Section 4 considers the general NC Waring problem. Similar to the $\delta=1$ case, we begin by introducing a general $\delta$ -compatibility condition which is necessary for the existence of a $(\delta,d)$ -NC Waring decomposition. Theorem 4.9 shows that, under the $\delta$ -compatibility condition, the general NC Waring problem is equivalent to a commutative Waring problem for a polynomial with an increased number of variables. We end with Section 4.5 which illustrates that an increase in our number of variables is necessary to reduce the general NC Waring decomposition to a commutative Waring decomposition.

2. Accelerating NC polynomial evaluation using tensor and Waring decompositions

In this section we will establish a connection between general tensor decompositions and decompositions of noncommutative polynomials. Using this connection we describe how to use tensor decomposition to efficiently evaluate noncommutative polynomials on matrix variables. Having discussed general tensor decompositions in the introduction, we first consider polynomials with a NC Waring decomposition. Next in Section 2.2 we compare the computational cost of using the various decompositions. Also we discus issues of accuracy.

2.1. NC Waring decompositions and symmetric tensors

It is well known that the classical polynomial Waring problem is equivalent to the problem of symmetric tensor decomposition. Let $T\in(\mathbb{C}^{g})^{\otimes d}$ be a symmetric tensor, i.e. a symmetric multiidexed array, with entries $T_{\alpha}\in\mathbb{C}$ where $\alpha=(\alpha_{1},\dots,\alpha_{d})$ is a $d$ -tuple of integers between $1$ and $g$ . Here symmetric means that for any permutation $\pi\in\mathcal{S}_{d}$ , we have $T_{\alpha}=T_{\pi(\alpha)}$ where $\pi(\alpha)=(\alpha_{\pi(1)},\dots,\alpha_{\pi(d)})$ . We may associate $T$ to a homogeneous degree d polynomial $p_{T}(x)$ in the commutative variables $X=(X_{1},\dots,X_{g})$ by setting

[TABLE]

Suppose $T$ has rank $r$ symmetric tensor decomposition

[TABLE]

Here $A^{(s)}=(A_{1}^{(s)},\dots,A_{g}^{(s)})^{T}\in\mathbb{C}^{g}$ for each $s$ . Then it is straightforward to check that

[TABLE]

That is, a rank $r$ symmetric tensor decomposition of $T$ corresponds to a rank $r$ Waring decomposition for $p_{T}(z)$ . By reversing this correspondence one sees that a rank $r$ Waring decomposition for a homogeneous polynomial gives a rank $r$ symmetric tensor decomposition for the associated symmetric tensor. The fact that the tensor corresponding to a NC polynomial with a Waring decomposition is symmetric is a consequence of Theorem 3.5.

2.1.1. Numerical computation of NC Waring decompositions

We now give an example which computes an NC Waring decomposition by using popular tensor decomposition software. Consider the homogeneous noncommutative polynomial

[TABLE]

We associate $p(x)$ to the symmetric tensor $T$ defined by its frontal slices

[TABLE]

and

[TABLE]

Using Tensorlab111A matlab script which computes this decomposition using Tensorlab is avaliable on GitHub at https://github.com/NCAlgebra/UserNCNotebooks. [VDSBL16] we compute that $T$ is a rank $4$ tensor and has symmetric tensor decomposition

[TABLE]

where

[TABLE]

and

[TABLE]

It follows that $p$ has the rank 4 NC Waring decomposition

[TABLE]

This is easy to numerically verify using NCAlgebra [OHMS17].

A naive evaluation of $p$ on a matrix tuple using the original definition of $p$ requires $54$ matrix multiplications. In contrast, evaluating $p$ on a matrix tuple using its NC Waring decomposition only requires $8$ matrix multiplications.

2.2. Computational savings

We now examine the computational costs for each of the the Waring, linear product sum, and basic Horner methods for NC polynomial evaluation.

2.2.1. Linear product sum

The maximum rank of a tensor $T\in(\mathbb{C}^{g})^{\otimes d}$ is not known, however it is conjectured [AOP09] that the rank of a generic tensor $T\in(\mathbb{C}^{g})^{\otimes d}$ is equal to

[TABLE]

except in a small number of defective spaces where most commonly one additional term is needed.

Each term in the linear product sum decomposition is an NC monomial of degree $d$ and may be evaluated in $(d-1)$ multiplications. Therefore, if this conjecture holds, then it follows that generic homogeneous noncommutative polynomials of degree $d$ in $g$ variables may be evaluated using approximately

[TABLE]

matrix multiplications.

2.2.2. Waring

We now consider the case where $p$ has a $t$ -term degree $d$ NC Waring decomposition as in equation (1.2.1). In this case, for any matrix tuple $X$ we may evaluate $p(X)$ using $tg-1$ matrix additions and $t$ matrix exponentiations of degree $d$ , where for generic NC polynomials $t\leq\lceil\frac{1}{g}{{g+d-1}\choose d}\rceil+1$ by Corollary 1.1. We note that powers of a matrix may be efficiently computed either by decomposing the exponent as a sum of powers of two, or by first computing the Jordan form of the matrix.

Using repeated squaring methods, a matrix exponentiation of degree $d$ can be evaluated with at most $2\lfloor\log_{2}d\rfloor$ matrix multiplications. In addition, using Stirling’s approximation one can show that

[TABLE]

It follows that if an NC polynomial $p$ has an NC Waring decomposition, then one may evaluate $p$ on matrix variables using approximately

[TABLE]

matrix multiplications. Here $e\approx 2.718$ .

2.2.3. Horner’s method

Using equation (1.1.6), one sees that if $h(g,d-1)$ denotes the number of matrix multiplications needed to evaluate a degree $d-1$ NC polynomial in $g$ variables using this basic Horner method, then

[TABLE]

Using $h(g,1)=0$ , one then has

[TABLE]

with equality for generic NC polynomials.

2.2.4. The $d=2$ case

In the case that $p$ is a homogeneous NC polynomial of degree $2$ in $g$ variables, the tensor $T_{p}$ corresponding to $p$ is in fact a $g\times g$ matrix. It follows that $p$ has linear product sum rank less than or equal to $g$ , hence $p$ may be evaluated using at most $g$ matrix multiplications. Horner’s method also generically requires $g$ matrix multiplications in the $d=2$ case, while naive evaluation generically requires $g^{2}$ matrix multiplications.

2.3. Comparison of computational costs

In this subsection we compare computational costs for the various methods.

2.3.1. Comparison of efficiency: Linear product sum vs. Horner

We now briefly compare the various methods. Supposing that the generic rank of a tensor $T\in({\mathbb{C}}^{g})^{\otimes d}$ is in fact given by equation (2.2.1) and using the approximation in equation (2.2.2), one finds that for generic homogeneous NC polynomials, the basic Horner method requires approximately

[TABLE]

more matrix multiplications to evaluate a NC polynomial than the linear product sum method. The above shows that evaluation with linear product sum is more efficient for generic homogeneous NC polynomials than evaluation with Horner for all $(g,d)$ provided $g,d\geq 3$ . This increased efficiency leads to a notable improvement for NC polynomials requiring millions of evaluations, see Table 1.

While the more practical point is that linear product sum is more efficient than Horner for all fixed $(g,d)$ , it gives perspective to look at extremes of ratios. The asymptotic ratio of the number of matrix multiplications needed by linear product sum to that of Horner approaches $1$ as $d$ tends to infinity. Thus, for high degree homogeneous NC polynomials requiring smaller numbers of evaluations, Horner is likely more appropriate due to the computational cost associated with computing a linear product sum decomposition. In contrast, for fixed $d$ this ratio approaches $(d-1)/d$ as $g$ tends to infinity.

2.3.2. Comparison of efficiency: Waring vs. linear product sum

The main advantage of a Waring decomposition compared to linear product sum is that a $d$ th power of a linear form may be evaluated (by repeated squaring) using no more than $2\lfloor\log_{2}d\rfloor$ matrix multiplications. In contrast, the product of $d$ distinct linear forms naively requires $d-1$ matrix multiplications to evaluate.

The rank of a tensor is necessarily less than or equal to the symmetric rank of a tensor. It follows that if $p$ has a Waring decomposition, hence the corresponding tensor $T$ is symmetric, then the ratio of the number of matrix multiplications needed by the Waring method and the linear product sum method is bounded below by

[TABLE]

with equality if the rank of $T$ is equal to the symmetric rank of $T$ .

An example of a tensor whose rank is strictly less than its symmetric rank has only recently been produced [S18]. The example is of a symmetric tensor of size $800\times 800\times 800$ with rank 903 and symmetric rank greater than 903. The corresponding NC polynomial $p$ is a homogeneous degree $d=3$ polynomial in $g=800$ variables.

Since the degree of $p$ is $3$ , in both the Waring method and the linear product sum method, each monomial requires $2$ matrix multiplications to evaluate. As a consequence, in this example, using the linear product sum decomposition allows for $p$ to be evaluated in strictly fewer matrix multiplications than the Waring decomposition.

Although an example of a tensor with rank less than symmetric rank is known, there are various results showing that rank is equal to symmetric rank for generic tensors having small rank, e.g. see [COV17, F16]. Additionally, we note that it remains unknown if generic symmetric tensors have rank equal to symmetric rank.

In the case $d=3$ , there is no advantage of using a Waring decomposition over a linear product sum decomposition in terms of number of multiplications required for evaluation. However, we expect that as $d$ grows large, even if there is a gap between the Waring rank and linear product sum rank of a given NC polynomial, a NC Waring decomposition will outperform a linear product sum decomposition in terms of efficiency due to the ability to efficiently evaluate matrix powers.

It is also worth pointing out that the generic symmetric rank for symmetric tensors in $({\mathbb{C}}^{g})^{\otimes d}$ is strictly less than the generic rank for arbitrary tensors in $({\mathbb{C}}^{g})^{\otimes d}$ provided $g,d\geq 3$ , with the gap becoming increasingly significant as $g$ and $d$ grow. In contrast, Horner’s method sees no notable improvement when used on NC polynomials which have a Waring decomposition. Thus both Waring and linear product sum decompositions significantly outperform Horner’s method in this setting.

2.3.3. Comparison to naive

All three methods offer a serious improvement over naive evaluation. Since a naive evaluation of a single degree $d$ NC monomial requires $d-1$ matrix multiplications, the naive approach to evaluating a NC polynomial on a matrix tuple generically requires

[TABLE]

matrix multiplications. It follows that for a NC polynomial with linear product sum rank given by equation (2.2.1), the ratio of the number of matrix multiplications used in the linear product sum method to those in the naive method is approximately

[TABLE]

Similarly, for NC polynomials with a Waring decomposition, the ratio of the number of matrix multiplications used in the Waring method to those in the naive method is then approximately bounded above by

[TABLE]

a quantity that rapidly approaches zero as $g$ or $d$ increase, provided $3\leq d,g$ .

2.3.4. Accuracy of computations

The tensor in Example 1.1.3 has a unique rank $2$ decomposition (up to scaling) which can be shown using Kruskal’s condition for uniqueness of tensor decompositions [K77]. Indeed, when the example is treated with Tensorlab a rank $2$ decomposition which is the same (up to scaling) as the decomposition in (1.1.4) is produced.

For display purposes in equation (2.1.1) and above we have truncated the coefficients in the decompositions for $T$ and $p$ at the thousandths place which leads to a small round off error. If we use the long form coefficients computed by Tensorlab, then the decomposition for $T$ and $p$ is highly accurate. Note that $T$ has infinitely many rank $4$ tensor decompositions. The computed tensor decomposition depends on the initialization of the algorithm used in the computation.

Although highly accurate decompositions can be computed for small tensors, when working with large tensors of generic rank, one should not expect to exactly compute a tensor decomposition. However, in early steps of noncommutative optimization algorithms, a small amount of error in the computed descent directions is unlikely to cause serious difficulty. Exact evaluations may be used in later steps when near an optimum. Amounts of relative error averaged over our experiments in tensor decompositions for tensors of the various selected $g$ and $d$ are reported in Table 1.

2.3.5. Experiment comparing run times of linear product sum to other methods

We now give a brief illustration of experimental timing where we for evaluating homogeneous NC polynomials on $20\times 20$ and $100\times 100$ matrices using linear product sums , Horner, and naive evaluation.

Table 1 selects several values of $g$ and $d$ , in column $1$ , and presents properties of the tensor decomposition in the space $({\mathbb{C}}^{g})^{\otimes d}$ in the last $3$ columns: generic tensor rank, time to find a decomposition, and accuracy of the decomposition. This is the tensor decomposition used for the linear product sum method.

Columns $2$ and $3$ list how many polynomial evaluations are needed for linear product sum to overcome its tensor decomposition cost, and hence to outperform Horner’s method222The cost of computing a Horner decomposition is assumed to be negligible in this comparison.. Similarly, columns $4$ and $5$ show when linear product sum breaks even with the naive method333The estimates are generated as follows: We randomly generate $1000$ pairs of $n\times n$ matrices and compute the average amount of time needed for a single multiplication of a pair $n\times n$ matrices. The number of matrix multiplications needed for a generic rank linear product sum evaluation or a naive evaluation is multiplied by the average amount of time needed for a single matrix multiplication to compute the expected time needed for a single evaluation on $n\times n$ matrices. Using this methodology the average time needed for multiplication of a pair of $20\times 20$ matrices or $100\times 100$ matrices was found to be $1.4056*10^{-6}$ seconds or $2.9392*10^{-5}$ seconds, respectively..

In the case that a NC polynomial has low Waring or linear product sum rank, evaluation using these methods will be much more efficient. Also, the tensor decomposition needed to compute the NC polynomial decomposition takes significantly less time to compute and the error in the decomposition will be significantly lower.

3. The noncommutative Waring problem

In this section we examine when a noncommutative polynomial has a NC Waring decomposition. Two approaches are considered. First we consider a noncommutative algebra approach. In this approach, we show that if a noncommutative polynomial $p$ has a Waring decomposition, then its coefficients must satisfy a compatibility condition. If this condition is satisfied, then we prove that $p$ has a $t$ -term Waring decomposition if and only if the restriction of $p$ to commuting variables has a classical $t$ -term Waring decomposition.

The second approach makes use of identification of noncommutative polynomials and tensors and known results for tensor decompositions. To an expert in both tensor theory and in NC polynomials the use of this approach and results on NC Waring decompositions may not come as a surprise. However, for our (main) NC polynomial audience we include a self contained NC polynomial proof.

Before proceeding with proofs we briefly discuss the history of the polynomial Waring problem.

3.1. History of the Waring decomposition

The polynomial Waring problem concerns the question whether a given polynomial, $f(x_{1},x_{2},\dots,x_{n})$ , can be represented by sums of powers of polynomials, where $x_{i}$ ’s are variables which commute. In this form, the Waring problem is closely related to symmetric tensor decomposition. The polynomial Waring problem for powers of linear forms was treated successfully in [AH95] and subsequently in [RS00] and [FOS12] and has been studied extensively, as is shown, for example, in [BC13] and [GV08].

3.2. A basic definition

Noncommutative Waring decompositions are associated with commutative Waring decompositions through a correspondence we now describe.

For a NC polynomial $p$ , the associated commutative collapse, $p_{c}$ , is the commutative polynomial obtained by considering the variables of $p$ to be commutative. Our notation for commutative collapse for a NC monomial $x^{\alpha}=x_{\alpha_{1}}x_{\alpha_{2}}\dots x_{\alpha_{d}}$ is $X^{\alpha}=X_{\alpha_{1}}X_{\alpha_{2}}\dots X_{\alpha_{d}}$ . For example, when $\alpha=(1,2,1,2)$ , $x^{\alpha}=x_{1}x_{2}x_{1}x_{2}$ collapses to $X^{\alpha}=X_{1}^{2}X_{2}^{2}$ .

We impose an equivalence relation $\sim_{c}$ on NC monomials by saying that $x^{\alpha}$ and $x^{\tilde{\alpha}}$ are commutative equivalent if they have the same commutative collapse:

[TABLE]

Moreover, we say two index tuples $\alpha$ and $\tilde{\alpha}$ are commutative equivalent, denoted $\alpha\sim_{c}{\tilde{\alpha}}$ , iff $x^{\alpha}\sim_{c}x^{\tilde{\alpha}}$ . Note that

[TABLE]

3.3. NC polynomial proof of the NC Waring decomposition

Our presentation contains two parts. First we state a compatibility condition necessary for the existence of a Waring decomposition, §3.3.1. Second, if the compatibility condition holds, we reduce the NC Waring problem to the classical commutative Waring problem, §3.3.2.

3.3.1. The Compatibility Condition

As we next see the following condition is necessary for existence of a NC Waring decomposition.

Definition 3.1.

We say a noncommutative homogeneous degree $d$ polynomial

[TABLE]

satisfies the compatibility condition if

[TABLE]

Sometimes we say that $p$ is compatible. ∎

We note that a noncommutative homogeneous polynomial $p$ satisfies the compatibility condition if and only if the corresponding tensor described in Section 1.1.1 is symmetric. To see this, given a tuple $\alpha=(\alpha_{1},\alpha_{2},\dots,\alpha_{d})$ of length $d$ and a permutation $\pi\in\mathcal{S}_{d}$ define

[TABLE]

It is straight forward to check that $x^{\alpha}\sim_{c}x^{\tilde{\alpha}}$ and $\alpha\sim_{c}\tilde{\alpha}$ if and only if there is a permutation $\pi\in\mathcal{S}_{d}$ such that $\pi(\alpha)=\tilde{\alpha}$ .

Extend the action of $\mathcal{S}_{d}$ to noncommutative homogeneous polynomials of degree $d$ by

[TABLE]

Then $p$ meets the compatibility condition if and only if

[TABLE]

for all permutations $\pi\in\mathcal{S}_{d}$ . That is, for all $\alpha$ and all $\pi\in\mathcal{S}_{d}$ , we have $P_{\alpha}=P_{\pi(\alpha)}$ . It follows that the corresponding tensor is symmetric.

The following lemma shows that the compatibility condition is necessary for existence of a NC Waring decomposition.

Lemma 3.2.

If a NC homogeneous polynomial of degree $d$ has a $t$ -term NC Waring decomposition, then the compatibility condition (3.3.1) holds. Moreover, if $p$ meets the compatibility condition, then $p$ has a $t$ -term NC Waring decomposition over the complex numbers (resp. real numbers) if and only if

[TABLE]

has a solution $A_{j}^{(s)}\in\mathbb{C}(\textrm{resp. }A_{j}^{(s)}\in\mathbb{R}).$

Proof.

By definition, $p$ has a $t$ -term Waring decomposition if and only if

[TABLE]

Comparing the coefficients of $x^{\alpha}$ on both sides, we get

[TABLE]

This also implies $P_{\alpha}=P_{\tilde{\alpha}}$ if $\mathbbm{1}_{j}^{\alpha}=\mathbbm{1}_{j}^{\tilde{\alpha}}$ for all $1\leq j\leq g$ . ∎

Example 3.3.

A NC homogeneous polynomial $p(x)=\sum_{\alpha}{P_{\alpha}x^{\alpha}}$ has the complex (resp. real) $2$ -term Waring decomposition

[TABLE]

if and only if $p$ is compatible and

[TABLE]

has a solution $a,b,c,d\in\mathbb{C}$ (resp. $\mathbb{R}$ ). ∎

3.3.2. Reduction of NC Waring to Classical Waring

We see in this section that the NC Waring problem reduces to the commutative one.

Lemma 3.4.

For an index tuple $\alpha$ , denote $\eta[\alpha]$ as the number of $\tilde{\alpha}$ ’s that satisfy $\mathbbm{1}^{\alpha}_{j}=\mathbbm{1}^{\tilde{\alpha}}_{j}$ for all $1\leq j\leq g$ . Then

[TABLE]

Proof.

The problem is equivalent to calculating how many d-tuples can be formed by elements from $\alpha=(\alpha_{1},\alpha_{2},\dots,\alpha_{d})$ , which is equivalent to

[TABLE]

∎

Theorem 3.5.

Suppose $p$ is a homogeneous NC polynomial which satisfies the compatibility conditions (3.3.1). Then the commutative collapse $p_{c}$ has the Waring decomposition

[TABLE]

(with $X_{i}$ being commuting variables) if and only if $p$ has the NC Waring decomposition

[TABLE]

Note that the number of terms is the same and the real coefficients (resp. complex coefficients) $A_{j}^{(s)}$ are the same.

Proof.

The proof begins by laying out the algebraic connection between $p$ and $p_{c}$ . Let ${\mathcal{R}}$ denote a set consisting of one representative from each $\sim_{c}$ equivalence class. Then from (3.3.1), the NC polynomial $p(x)=\sum_{|\alpha|=d}P_{\alpha}x^{\alpha}$ has commutative collapse satisfying

[TABLE]

where $P_{c,\alpha}=\sum_{\tilde{\alpha}\sim_{c}\alpha}P_{\tilde{\alpha}}$ .

Thus if $p$ satisfies the compatibility condition (3.3.1), then

[TABLE]

Therefore, $p_{c}$ is the commutative collapse of a compatible NC homogeneous degree $d$ polynomial $p$ iff $P_{c,\alpha}=\eta[\alpha]P_{\alpha}$ for all index tuples $\alpha\in{\mathcal{R}}$ of length $d$ .

Now we proceed to prove our theorem. Assume $p$ has the NC Waring decomposition (3.3.6), we shall obtain a reversible formula for the Waring decomposition of $p_{c}$ . By equation (3.3.7) and Lemma 3.2, the commutative collapse $p_{c}$ is

[TABLE]

Thus

[TABLE]

On the other hand, suppose $p$ ’s commutative collapse, $p_{c}$ , has the commutative Waring decomposition (3.3.5), then the calculations in (3.3.8) and (3.3.9) can be reversed. By comparing coefficients, this is equivalent to

[TABLE]

for all $\alpha\in{\mathcal{R}}$ . Therefore by (3.3.7), $p$ satisfies

[TABLE]

for all index tuples $\alpha$ of length $d$ . Hence by Lemma 3.2, $p$ has the Waring decomposition (3.3.6). Thus under the compatibility condition (3.3.1), the NC polynomial $p$ has a Waring decomposition iff its commutative collapse $p_{c}$ has the same Waring decomposition. ∎

3.4. NC Waring decompositions and symmetric tensors

A tensor based approach to the noncommutative Waring problem that can be used to prove Theorem 3.5 is as follows. By considering the correspondence of NC polynomials and tensors described in Section 2 as well as the relationship between NC polynomial decompositions and tensor decompositions, one sees that a NC polynomial has a NC Waring decomposition if and only if the corresponding tensor has a symmetric tensor decomposition.

It is well known that a tensor has a symmetric tensor decomposition if and only if the tensor itself is symmetric, e.g. see [CGLM08, Lemma 4.2] . Therefore, a NC polynomial $p$ has a NC Waring decomposition if and only if the corresponding tensor $T_{p}$ is symmetric. One may check that the tensor $T_{p}$ is symmetric if and only if $p$ satisfies the compatibility condition.

4. The general noncommutative Waring problem

We now consider a more general situation of which the problem in the preceding section is the base case. As you will see, the bookkeeping and notation is formidable, so it is very helpful to have done a simpler case. In the previous section our focus was to determine if a degree $d$ noncommutative homogeneous polynomial can be expressed as sums of powers of linear terms. Now we examine when a degree ${\delta}d$ noncommutative homogeneous polynomial can be expressed as sums of powers of homogeneous degree ${\delta}$ terms.

As in the last section, we consider both noncommutative algebra and (for the tensor proficient) tensor based approaches.

4.1. Classical General Waring Problem.

The classical commutative Waring problem can be generalized from representation by sums of powers of linear functions to representation by sums of powers of homogeneous polynomials. The generalized classical Waring problem has also been well studied. According to Theorem 4 in [FOS12], there is an upper bound for the number of terms needed for such problems:

Theorem 4.1.

A general homogeneous polynomial of degree $\delta d$ in $g$ variables, where $d\geq 2$ , can be expressed as a sum of at most $d^{g-1}$ $d^{th}$ powers of degree $\delta$ homogeneous complex coefficient polynomials. Moreover, for a fixed $g$ , this bound is sharp for all sufficiently large $\delta$ .

4.2. Problem formulation and notation

Let $S^{g}_{\delta}$ be the set of all possible $\delta$ -tuples whose elements are integers between $1$ and $g$ , i.e.,

[TABLE]

Additionally, define $(S^{g}_{\delta})^{d}$ by

[TABLE]

That is, $(S^{g}_{\delta})^{d}$ is the set of $d$ -tuples of $\delta$ tuples of indices. For any $\alpha=(\alpha_{1},\dots,\alpha_{d})\in(S^{g}_{\delta})^{d}$ , where $\alpha_{i}=(\alpha_{i}^{(1)},\dots,\alpha_{i}^{({\delta})})\in S^{g}_{\delta},$ we can write

[TABLE]

That is, $x^{\alpha}$ is the monomial

[TABLE]

Recall our notation for a degree $\delta$ homogeneous polynomial

[TABLE]

where $A_{\beta}=A_{(\beta^{(1)},\beta^{(2)},\dots,\beta^{(\delta)})}\in\mathbb{C}$ .

Remark 4.2.

For any $\alpha=(\alpha_{1},\alpha_{2},\dots,\alpha_{d})\in(S^{g}_{\delta})^{d}$ , we can identify

[TABLE]

with

[TABLE]

On the other hand, for any element of $S^{g}_{\delta d}$ , we can reverse this identification and form groups of size $\delta$ to get a $d$ -tuple of $\delta$ -tuples. We let $\tau$ denote the bijection

[TABLE]

which accomplishes this grouping. ∎

The General NC Waring Problem:

Given a NC homogeneous degree $\delta d$ polynomial p, does it have a t-term $d^{th}$ power real NC Waring (resp. complex NC Waring) decomposition of degree $\delta$ . That is, can $p(x)$ be written as

[TABLE]

We call this problem the $({\delta},d)$ -NC Waring problem and say a decomposition of the form (4.2.1) is a $t$ -term $({\delta},d)$ -NC Waring decomposition. Similarly for a commutative polynomial $p_{c}$ , we say a decomposition of the form (4.2.1) (with $x^{\beta}$ replaced by $X^{\beta}$ ) is a $t$ -term $({\delta},d)$ -Waring decomposition. Note that the problem treated in Section 3 is exactly the $(1,d)$ -NC Waring problem.

An obvious fact is, if $p$ is a degree $\delta d$ NC homogeneous polynomial and $p$ has a $t$ -term $(\delta,d)$ -NC Waring decomposition, then its commutative collapse $p_{c}$ has a $t$ -term $(\delta,d)$ -Waring decomposition. For a conjecture on the generic value of $t$ in this commutative case, see [LORS19, Conjecture 1.2].

4.2.1. Tuple indicator functions

We now extend the notion of indicator function to tuples of ${\delta}$ -tuples. For two $\delta-$ tuples $\beta,\gamma\in S^{g}_{\delta}$ , denote

[TABLE]

Then for an index tuple $\mu\in(S^{g}_{\delta})^{d}$ , the number of times a particular $\delta-$ tuple $\beta\in S^{g}_{\delta}$ appears in $\mu$ is

[TABLE]

Furthermore, denote

[TABLE]

as the number of integers $i$ appearing in all the $\delta$ -tuples in $\alpha$ .

4.3. Main results on the general Waring decomposition

Similar to Section 2, we first state a compatibility condition which is necessary for the existence of a generalized NC Waring decomposition. We then prove that, if this condition holds, then we can reduce the generalized NC Waring problem to a commutative one at the price of increasing our number of variables.

4.3.1. The Compatibility Condition

The generalized version of the $\delta=1$ compatibility condition is defined as follows:

Definition 4.3.

We say a noncommutative homogeneous polynomial of degree $\delta d$ in g variables of the form

[TABLE]

satisfies the $\delta$ -compatibility condition* if*

[TABLE]

for all index sets, $\alpha$ , $\tilde{\alpha}\in{S^{g}_{\delta d}}$ such that $\mathbbm{1}^{\tau(\alpha)}_{\beta}=\mathbbm{1}^{\tau(\tilde{\alpha})}_{\beta}\text{ for all }\beta\in S^{g}_{\delta}$ . Consistent with this, we define the $\delta$ -equivalence relation*, denoted $\sim_{\delta}$ , on ${S^{g}_{\delta d}}$ by*

[TABLE]

for all $\beta\in S^{g}_{\delta}$ . ∎

Remark 4.4.

Here are a few bookkeeping properties of $\delta$ -equivalences.

(1)

We have $\alpha\sim_{1}\tilde{\alpha}$ if and only if $\alpha\sim_{c}\tilde{\alpha}$ . 2. (2)

Let $\delta_{1},\delta_{2}\in\mathbb{N}$ and let $\alpha,\tilde{\alpha}\in S_{\delta_{2}d}^{g}$ . If $\delta_{2}$ divides $\delta_{1}$ , then $\alpha\sim_{\delta_{1}}\tilde{\alpha}$ implies $\alpha\sim_{\delta_{2}}\tilde{\alpha}$ . In the case where $\delta_{2}=1$ this follows from equation (4.2.2). The general case is similar. 3. (3)

Let $\delta_{1},\delta_{2},d\in\mathbb{N}$ and let $p$ be a degree $\delta_{1}d$ NC homogeneous polynomial. If $\delta_{2}$ divides $\delta_{1}$ and $p$ satisfies the $\delta_{2}$ -compatibility condition then $p$ satisfies the $\delta_{1}$ -compatibility condition.

Items (2) and (3) highlight that, as $\delta$ grows, it becomes increasingly difficult for fixed monomials $\alpha$ and $\tilde{\alpha}$ of degree divisible by $\delta$ to be $\delta$ -equivalent. As an immediate consequence, as $\delta$ grows, it become more likely that a fixed NC homogeneous polynomial $p$ of degree divisible by $\delta$ satisfies the $\delta$ -compatibility condition. In the extreme case, monomials $x^{\alpha}$ and $x^{\tilde{\alpha}}$ of degree $\delta$ are $\delta$ -equivalent if and only if $\alpha=\tilde{\alpha}$ . As a result, every degree $\delta$ NC homogeneous polynomial satisfies the $\delta$ -compatibility condition. ∎

Example 4.5.

Let

[TABLE]

Then

[TABLE]

Now let $p$ be the degree four homogeneous NC polynomial

[TABLE]

Then $p$ satisfies the $2$ -compatibility condition and the $4$ -compatibility condition. However, $p$ does not satisfy the $1$ -compatibility condition, since the coefficient of $x_{1}x_{1}x_{2}x_{2}$ in $p$ is [math] but the coefficient of $x_{1}x_{2}x_{2}x_{1}$ is $1$ and

[TABLE]

The following lemma shows that the $\delta$ -compatibility condition is necessary for the general NC Waring problem.

Lemma 4.6.

Suppose a NC homogeneous polynomial $p$ of degree $\delta d$ in $g$ variables has a $t$ -term $({\delta},d)$ -NC Waring decomposition, then $p$ satisfies the $\delta$ -compatibility condition. That is, $P_{\alpha}=P_{\widetilde{\alpha}}$ if ${\alpha}\sim_{\delta}{\widetilde{\alpha}}$ . Here $p$ has coefficients $P_{\alpha}$ .

Moreover, the $({\delta},d)$ -NC Waring problem has a solution over the complex numbers (resp. real numbers) if and only if the equation

[TABLE]

has a solution $A_{\beta}^{(s)}\in\mathbb{C}$ (resp. $A_{\beta}^{(s)}\in\mathbb{R}$ ).

Proof.

The polynomial $p$ has a $t$ -term $(\delta,d)$ -NC Waring decomposition iff $\exists$ $\delta^{th}$ degree homogeneous polynomials, $H_{1},H_{2},\dots,H_{t}$ satisfying

[TABLE]

Comparing coefficients we see, equivalent to the $(\delta,d)$ -NC Waring decomposition is:

[TABLE]

yielding (4.3.3).

As a consequence $P_{\alpha}=P_{\tilde{\alpha}}$ for any $\alpha$ satisfying $\mathbbm{1}^{\tau(\alpha)}_{\beta}=\mathbbm{1}^{\tau(\tilde{\alpha})}_{\beta}$ for every $\beta\in S^{g}_{\delta}$ , yielding the first assertion of the theorem. ∎

Example 4.7.

Let

[TABLE]

Then $p$ is an example where there is no $({\delta},d)=(2,2)$ -NC Waring decomposition; indeed the $2$ -compatibility condition is violated because $P_{(1,1,1,2)}=0\neq 1=P_{(1,2,1,1)}$ . However, its commutative collapse does have the (2,2)-Waring decomposition:

[TABLE]

4.4. Reduction to classical Waring in more variables

To solve the general $(\delta,d)$ -noncommutative Waring problem we reduce to the $\delta=1$ case solved by Theorem 3.5. This reduction is accomplished by identifying a monomial $x^{\beta}$ with a new variable $z_{\beta}$ . Namely, fix $\delta$ and define the map $\phi$ on monomials of the form $x^{\beta}$ for $\beta\in S_{\delta}^{g}$ by

[TABLE]

where the $z_{\beta}$ are noncommutative indeterminates indexed by elements of $S_{\delta}^{g}$ .

We extend our definition of $\phi$ to a noncommutative homogeneous polynomial

[TABLE]

of degree $\delta d$ by

[TABLE]

Lemma 4.8.

The map $\phi$ as defined in equation (4.4.1) defines an algebra isomorphism on the algebra of noncommutative homogeneous polynomials of degree divisible by $\delta$ in the noncommutative indeterminate $x=(x_{1},x_{2},\dots,x_{g})$ which maps to the algebra of noncommutative homogeneous polynomials in the noncommutative indeterminates $\{z_{\beta}\}_{\beta\in S_{\delta}^{g}}$ .

Proof.

This is straightforward from the definition of $\phi$ on a noncommutative homogeneous polynomial of degree $d\delta$ . ∎

Note that in the case of commutative $X$ , substitution of $X^{\beta}$ by a commutative $Z_{\beta}$ is sometimes used, however, the isomorphism property in Lemma 4.8 fails, so conclusions are much less precise than what we get here.

We now give our main result for the $(\delta,d)$ -NC Waring problem.

Theorem 4.9.

Let $p$ be a noncommutative homogeneous polynomial of degree $\delta d$ in the indeterminate $x=(x_{1},\dots,x_{g})$ , and let $\phi$ be as defined in equation (4.4.1). Then we have the following.

(1)

$p(x)$ * has a $t$ -term $({\delta},d)$ -noncommutative Waring decomposition if and only if $\phi(p(x))$ has a $t$ -term $(1,d)$ -noncommutative Waring decomposition.* 2. (2)

$p(x)$ * satisfies the ${\delta}$ -compatibility condition if and only if $\phi(p(x))$ satisfies the $1$ -compatibility condition.* 3. (3)

$p(x)$ * has a $t$ -term $({\delta},d)$ -noncommutative Waring decomposition if and only if $p(x)$ satisfies the ${\delta}$ -compatibility condition and the commutative collapse of $\phi(p(x))$ has a $t$ -term $(1,d)$ -Waring decomposition.*

Proof.

To prove item (1), assume $p(x)$ has a $t$ -term $(\delta,d)$ -noncommutative Waring decomposition

[TABLE]

By Lemma 4.8, $\phi$ is an algebra isomorphism so

[TABLE]

This shows $\phi(p(x))$ has a $t$ -term $(1,d)$ noncommutative Waring decomposition. The reverse direction follows the same reasoning using $\phi^{-1}$ instead of $\phi$ .

To prove item (2) let

[TABLE]

Then

[TABLE]

Observe

[TABLE]

where the $\mu_{j}$ are viewed as elements of the index set $S_{\delta}^{g}$ if and only if

[TABLE]

where the $\mu_{j}$ are viewed as as $\delta$ tuples of elements of $S_{\delta}^{g}$ . It follows that

[TABLE]

where the $\mu_{j}$ are viewed as elements of the index set $S_{\delta}^{g}$ if and only if

[TABLE]

where the $\mu_{j}$ are viewed as as $\delta$ tuples of elements of $S_{\delta}^{g}$ .

Item (3) is an immediate consequence of items (1) and (2) with Theorem 3.5, our main result for $(1,d)$ -NC Waring decompositions. ∎

4.5. Additional variables are necessary for the reduction

It is tempting to try to solve the general $({\delta},d)$ -NC Waring problem by reducing to the commutative case without introducing additional variables. This section will show that this is not possible.

One may hope that the following are true:

(1)

If $p$ is a degree $\delta d$ NC homogeneous polynomial, which satisfies the $\delta$ -compatibility condition (4.3.2), then its commutative collapse $p_{c}$ has the Waring decomposition

[TABLE]

(with $X_{i}$ being commuting variables) if and only if $p$ has the NC Waring decomposition

[TABLE] 2. (2)

The commutative collapse $p_{c}$ of $p$ has a $t$ -term $({\delta},d)$ -NC Waring decomposition iff the commutative collapse $\phi(p)_{c}$ of $\phi(p)$ has a $t$ -term $(1,d)$ -NC Waring decomposition.

The following polynomial gives a counter example to both items. Let

[TABLE]

and let $\delta=d=2$ . Then $p$ satisfies the $2$ -compatibility condition. We will show that the commutative collapse of $p$ has a two term $(2,2)$ -Waring decomposition but that $p$ does not have a two term $(2,2)$ -NC Waring decomposition.

It is straight forward to check

[TABLE]

Item (1) would imply that

[TABLE]

which is contradiction. This shows that item (1) cannot be correct.

In fact, $p$ does not have a two term $(2,2)$ -NC Waring decomposition. To check this set

[TABLE]

Then $\phi(p)(z)=z_{(1,1)}^{2}+z_{(1,2)}z_{(2,1)}+z_{(2,1)}z_{(1,2)}+z_{(2,2)}^{2}$ satisfies the $1$ -compatibility condition but $\phi(p)$ does not have a two term $(1,2)$ -Waring decomposition. To see this, note that the tensor corresponding to $\phi(p)$ is a $4\times 4$ symmetric matrix which has rank $4$ , hence a NC Waring decomposition for $\phi(p)$ requires four terms. It follows from Theorem 4.9 (1) that $p$ does not have a two term $(2,2)$ -NC Waring decomposition.

4.6. General NC Waring and tensors

Standard tensor techniques can also be used to address the general NC Waring problem and to derive Theorem 4.9. One may identify the space of NC homogeneous polynomials of degree $\delta d$ with the space of tensors $\mathbb{(}\mathbb{C}^{g})^{\otimes d\delta}\cong\mathbb{(}(\mathbb{C}^{g})^{\otimes\delta})^{\otimes d}$ . Requiring that a NC polynomial $p$ satisfies the $\delta$ -compatibility condition then corresponds to requiring that the corresponding tensor $T_{p}$ satisfies a restricted symmetry condition. In standard tensor notation one must have $T_{p}\in S^{d}((\mathbb{C}^{g})^{\otimes\delta})$ . In words, $T_{p}$ is a symmetric tensor in the space $V^{\otimes d}$ where $V$ is the space $(\mathbb{C}^{g})^{\otimes\delta}$ . The result again follows from the fact that a tensor in $V^{\otimes d}$ has a symmetric tensor decomposition if and only if it is symmetric.

While this is an expedient approach for those familiar with tensor methods, we expect the noncommutative algebra approach to be more clear for NC algebra experts who are not familiar with tensor methods. Furthermore, the tensor based approach does not easily convert to a condensed statement of Theorem 4.9 which only uses the language of noncommutative polynomials.

Bibliography24

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[AH 95] J. Alexander and A. Hirschowitz. Polynomial interpolation in several variables. J. Algebraic Geom., 4 (1995), pp. 201-222.
2[AOP 09] H. Abo, G. Ottaviani and C. Peterson. Induction for secant varieties of Segre varieties. Trans. Am. Math. Soc., 361 (2009), pp. 767-792.
3[B 02] D.J. Bernstein, Pippenger’s Exponentiation Algorithm. Preprint, (2002). http://cr.yp.to/papers.html#pippenger
4[BC 13] A. Bodin and M. Car, Waring’s problem for polynomials in two variables. Proc. Amer. Math. Soc., 141 (2013), pp. 1577–1589.
5[BKP 16] S. Burgdorf, I. Klep and J. Povh, Optimization of Polynomials in Noncommuting Variables. Springer, 2016.
6[CHS 06] J.F. Camino, J.W. Helton and R.E. Skelton, Solving matrix inequalities whose unknowns are matrices. SIAM Jour. of Optimization, 17 (2006), no 1, pp. 1-36.
7[COV 17] L. Chiantini, G. Ottaviani, and N. Vannieuwenhoven, Effective Criteria for Specific Identifiability of Tensors and Forms. SIAM J. Matrix Anal. Appl., 38 (2017), pp. 656-681.
8[CGLM 08] P. Comon, G.H. Golub, L.-H. Lim, B. Mourrain, Symmetric tensors and symmetric tensor rank. SIAM J. Matrix Anal. Appl., 30 (2008), no. 3, pp. 1254–1279.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Efficient evaluation of noncommutative polynomials using tensor and noncommutative Waring decompositions

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

1.1. Noncommutative polynomials

1.1.1. Evaluation using tensor decompositions

1.1.2. Example

1.1.3. A basic NC Horner method

1.2. Waring decompositions of noncommutative polynomials

1.3. The NC Waring decomposition

Corollary 1.1**.**

Proof.

1.4. Guide to readers

2. Accelerating NC polynomial evaluation using tensor and Waring decompositions

2.1. NC Waring decompositions and symmetric tensors

2.1.1. Numerical computation of NC Waring decompositions

2.2. Computational savings

2.2.1. Linear product sum

2.2.2. Waring

2.2.3. Horner’s method

2.2.4. The d=2d=2d=2 case

2.3. Comparison of computational costs

2.3.1. Comparison of efficiency: Linear product sum vs. Horner

2.3.2. Comparison of efficiency: Waring vs. linear product sum

2.3.3. Comparison to naive

2.3.4. Accuracy of computations

2.3.5. Experiment comparing run times of linear product sum to other methods

3. The noncommutative Waring problem

3.1. History of the Waring decomposition

3.2. A basic definition

3.3. NC polynomial proof of the NC Waring decomposition

3.3.1. The Compatibility Condition

Definition 3.1**.**

Lemma 3.2**.**

Proof.

Example 3.3**.**

3.3.2. Reduction of NC Waring to Classical Waring

Lemma 3.4**.**

Proof.

Theorem 3.5**.**

Proof.

3.4. NC Waring decompositions and symmetric tensors

4. The general noncommutative Waring problem

4.1. Classical General Waring Problem.

Theorem 4.1**.**

4.2. Problem formulation and notation

Remark 4.2**.**

4.2.1. Tuple indicator functions

4.3. Main results on the general Waring decomposition

4.3.1. The Compatibility Condition

Definition 4.3**.**

Remark 4.4**.**

Example 4.5**.**

Lemma 4.6**.**

Proof.

Example 4.7**.**

4.4. Reduction to classical Waring in more variables

Lemma 4.8**.**

Proof.

Theorem 4.9**.**

Proof.

4.5. Additional variables are necessary for the reduction

4.6. General NC Waring and tensors

Corollary 1.1.

2.2.4. The $d=2$ case

Definition 3.1.

Lemma 3.2.

Example 3.3.

Lemma 3.4.

Theorem 3.5.

Theorem 4.1.

Remark 4.2.

Definition 4.3.

Remark 4.4.

Example 4.5.

Lemma 4.6.

Example 4.7.

Lemma 4.8.

Theorem 4.9.