Efficient Black-Box Identity Testing over Free Group Algebra
V.Arvind, Abhranil Chatterjee, Rajit Datta, Partha, Mukhopadhyay

TL;DR
This paper develops randomized algorithms for identity testing of noncommutative rational functions in free group algebra, extending classical theorems and providing efficient solutions for a specific subclass of rational expressions.
Contribution
It introduces randomized and deterministic algorithms for identity testing in free group algebra, generalizing the Amitsur-Levitzki theorem to this setting.
Findings
Randomized polynomial-time algorithm for identity testing in free group algebra.
Deterministic polynomial-time algorithm based on sparsity for identity testing.
Extension of Amitsur-Levitzki theorem to noncommutative rational functions.
Abstract
Hrube\v{s} and Wigderson [HW14] initiated the study of noncommutative arithmetic circuits with division computing a noncommutative rational function in the free skew field, and raised the question of rational identity testing. It is now known that the problem can be solved in deterministic polynomial time in the white-box model for noncommutative formulas with inverses, and in randomized polynomial time in the black-box model [GGOW16, IQS18, DM18], where the running time is polynomial in the size of the formula. The complexity of identity testing of noncommutative rational functions remains open in general (when the formula size is not polynomially bounded). We solve the problem for a natural special case. We consider polynomial expressions in the free group algebra where , a subclass of rational expressions of inversion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Cryptography and Data Security · Advanced Graph Theory Research
Efficient Black-Box Identity Testing for Free Group Algebra
V. Arvind Institute of Mathematical Sciences (HBNI), Chennai, India, email: [email protected]
Abhranil Chatterjee Institute of Mathematical Sciences (HBNI), Chennai, India, email: [email protected]
Rajit Datta Chennai Mathematical Institute, Chennai, India, email: [email protected]
Partha Mukhopadhyay Chennai Mathematical Institute, Chennai, India, email: [email protected]
Abstract
Hrubeš and Wigderson [HW14] initiated the study of noncommutative arithmetic circuits with division computing a noncommutative rational function in the free skew field, and raised the question of rational identity testing. It is now known that the problem can be solved in deterministic polynomial time in the white-box model for noncommutative formulas with inverses, and in randomized polynomial time in the black-box model [GGOW16, IQS18, DM18], where the running time is polynomial in the size of the formula.
The complexity of identity testing of noncommutative rational functions remains open in general (when the formula size is not polynomially bounded). We solve the problem for a natural special case. We consider polynomial expressions in the free group algebra 111We use to denote . where , a subclass of rational expressions of inversion height one. Our main results are the following.
Given a degree expression in as a black-box, we obtain a randomized algorithm to check whether is an identically zero expression or not. We obtain this by generalizing the Amitsur-Levitzki theorem [AL50] to . This also yields a deterministic identity testing algorithm (and even an expression reconstruction algorithm) that is polynomial time in the sparsity of the input expression. 2. 2.
Given an expression in of degree at most , and sparsity , as black-box, we can check whether is identically zero or not in randomized time.
1 Introduction
Noncommutative computation is an important sub-area of arithmetic circuit complexity. In the usual arithmetic circuit model for noncommutative computation, the arithmetic operations are addition and multiplication. However, the multiplication gates respect the input order since the variables are noncommuting. Analogous to commutative arithmetic computation, the central questions are to show lower bounds for explicit polynomials and derandomization of polynomial identity testing (PIT) for noncommutative polynomial rings. Exploiting the limited cancellations, strong lower bounds and PIT results are known for noncommutative computations (in contrast to the commutative setting). Nisan[Nis91] has shown that any algebraic branching program (ABP) computing the noncommutative Determinant or Permanent polynomial requires exponential (in ) size. On the PIT front, Raz and Shpilka [RS05] have shown a deterministic polynomial-time PIT for noncommutative ABPs in the white-box model. A quasi-polynomial time derandomization is also known for the black-box model [FS12]. However, for general circuits there are no better results (either lower bound or PIT) than known in the commutative setting.
The randomized polynomial-time PIT algorithm for noncommutative circuits computing a polynomial of polynomially bounded degree [BW05] follows from Amitsur-Levitzki theorem [AL50]. The Amitsur-Levitzki theorem states that a nonzero noncommutative polynomial of degree cannot be an identity for the matrix ring . Additionally, it is shown that a nonzero noncommutative polynomial does not vanish on matrices of dimension logarithmic in the sparsity of the polynomial, yielding a randomized polynomial time algorithm for noncommutative circuits computing a nonzero polynomial of exponential degree and exponential sparsity [AJMR17].
Hrubeš and Wigderson [HW14] initiated the study of noncommutative computation with inverses. In the commutative world, it suffices to consider additions and multiplications. By Strassen’s result [Str73] (extended to finite fields [HY11]), divisions can be efficiently replaced by polynomially many additions and multiplications. However, divisions in noncommutative computation are more complex [HW14]. In the same paper [HW14] the authors introduce rational identity testing: Given a noncommutative formula involving addition, multiplication and division gates, efficiently check if the resulting rational expression is identically zero in the free skew-field of noncommutative rational functions. They show that the rational identity testing problem reduces to the following SINGULAR problem:
Given a matrix where the entries are linear forms over noncommuting variables , is invertible in the free skew-field?
In the white-box model the problem is in deterministic polynomial time, and in randomized polynomial time in the black-box model [GGOW16, IQS18, DM18]. Specifically, for rational formulas of size , random matrix substitutions of dimension linear in suffices to test if the rational expression is identically zero [DM18].
The complexity of identity testing for general rational expressions remains open. For example, given a noncommutative circuit involving addition, multiplication and division gates, no efficient algorithm is known to check if the resulting rational expression is identically zero in the free skew-field of noncommutative rational functions. In order to precisely formulate the problem, we define classes of rational expressions based on Bergman’s definition [Ber76] of inversion height which we now recall and elaborate upon with some notation.
Definition 1**.**
[Ber76]* Let be a set of free noncommuting variables. Polynomials in the free ring are defined to be rational expressions of height [math]. A rational expression of height is inductively defined to be a polynomial in rational expressions of height at most , and inverses of such expressions.*
Let denote all polynomials of degree at most in the free ring . We inductively define rational expressions in as follows: Let and be rational expressions in in the variables . Let be a degree- polynomial in . Then is a rational expression (of inversion height ) in .
Black-box identity testing for rational expressions is not well understood in general. Bergman has shown [Ber76, Proposition 5.1] that there are rational expressions that are nonzero over a dense subset of matrices but evaluate to zero on dense subsets of matrices. This makes it difficult to formulate an Amitsur-Levitzki type of theorem[AL50] for rational expressions.
Remark 1**.**
In this connection, we note that Hrubeš and Wigderson [HW14] have observed that testing if a ‘correct’ rational expression is not identically zero is equivalent to testing if the rational expression is ‘correct’. I.e. testing if a correct rational expression of inversion height is identically zero or not can be reduced to testing if a rational expression of inversion height is correct or not. Furthermore, testing if a rational expression of inversion height one is correct can be done by applying (to each inversion operation in this expression) a theorem of Amitsur (see [Row80, LZ09]) which implies that a nonzero degree noncommutative polynomial evaluated on matrices will be invertible with high probability. However, this does not yield an efficient randomized identity testing algorithm for rational expressions of inversion height one. Because that seems to require testing correctness of expressions of inversion height two which is a question left open in their paper [HW14, Section 9].
The Free Group Algebra
This motivates the study of black-box identity testing for rational expressions in the free group algebra .
We consider expressions in the free group algebra , where denotes the free group generated by the generators and their inverses
[TABLE]
Elements of the free group are words in . The only relations satisfied by the generators is for all . Thus, the elements in the free group are the reduced words which are words to which the above relations are not applicable.
The elements of the free group algebra are -linear combinations of the form
[TABLE]
where each is a reduced word. The degree of the expression is defined as the maximum length of a word such that . The expression is said to have sparsity if there are many reduced words such that in . We also use the notation to denote the coefficient of the reduced word in the expression .
The free noncommutative ring is a subalgebra of . Clearly, the elements of are a special case of rational expressions of inversion height one. I.e., we note that:
Proposition 1**.**
.
Note that the rational expressions in allows inverses only of the variables , whereas the free skew field contains all possible rational expressions (with inverses at any nested level).
Our results
The main goal of the current paper is to obtain black-box identity tests for rational expressions in the free group algebra .
Our first result is a generalization of the Amitsur-Levitzki theorem[AL50] to . Let be an associative algebra with identity over . An expression is an identity for if
[TABLE]
for all such that is defined for each .
Theorem 1**.**
Let be any field of characteristic zero and be a nonzero expression of degree . Then is not an identity for the matrix algebra .
The following corollary is immediate.
Corollary 1** (Black-box identity testing for circuits in free group algebra).**
There is a black-box randomized identity test for degree expressions in .
If the black-box contains a sparse expression, we show efficient deterministic algorithms for identity testing and interpolation algorithm.
Theorem 2** (Black-box identity testing and reconstruction for sparse expressions in free group algebra).**
Let be any field of characteristic zero and is an expression in of degree and sparsity given as black-box. Then we can reconstruct in deterministic time with matrix-valued queries to the black-box.
Our next result is another generalization of the Amitsur-Levitzki theorem [AL50] extending a result of [AJMR17] to free group algebras. We show that a nonzero expression of degree and sparsity does not vanish on dimensional matrices. It yields a randomized polynomial-time identity test if the black-box contains an expression of exponential degree and exponential sparsity.
Theorem 3**.**
Let be any field of characteristic zero. Then, a degree- expression of sparsity is not an identity for the matrix algebra for .
Corollary 2** (Black-box identity testing for expoential sparse expressions with exponential degree in free group algebra).**
Given a degree- expression of sparsity as black-box, we can check whether is identically zero or not in randomized time.
Remark 2**.**
We state our results for fields of characteristic zero only for simplicity. However, by suitable modifications, we can extend our results for fields of positive characteristic.
Organization
The paper is organized as follows. In Section 2, we prove Theorem 1, Corollary 1, and Theorem 2. In Section 3, we prove Theorem 3 and Corollary 2. Finally, in Section 4, we discuss suitable modifications to extend our results over finite fields.
2 A Generalization of Amitsur-Levitzki Theorem for Free Group Algebra
The main idea in our proof is to efficiently encode expressions in as polynomials in a suitable commutative ring preserving the identity. Let denote the commutative ring for , where and .
Definition 2**.**
Define a map to be a map such that is identity on , and for each reduced word ,
[TABLE]
where if and otherwise.
By linearity the map is defined on all expressions in . We observe the following properties of .
The map is injective on the reduced words . I.e., it maps each reduced word to a unique monomial over the commuting variables . 2. 2.
Consequently, is identity preserving. I.e., an expression in is identically zero if and only if its image is the zero polynomial in . 3. 3.
preserves the sparsity of the expression. I.e., in is -sparse iff in is -sparse. 4. 4.
Given the image in its sparse description (i.e., as a linear combination of monomials), we can efficiently recover the sparse description of .
Given polynomials , we say and are weakly equivalent, if for each monomial , if and only if , where denotes the coefficient of monomial in .
Given a black-box expression in , we show how to evaluate it on suitable matrices and obtain a polynomial in that is weakly equivalent to as a specific entry of the resulting matrix. The matrix substitutions are based on automata constructions. Similar ideas have been used earlier to design PIT algorithms for noncommutative polynomials [AMS10]. However, since we are dealing with rational expressions, some difficulties arise. The matrix substitutions for the variables are obtained as the corresponding transition matrices of the automaton. The matrix substitution for will be . Therefore, we need to ensure that the transition matrices are invertible and sufficiently structured to be useful for the identity testing.
We first illustrate our construction for an example degree- expression , where .
The basic “building block” for the transition matrix is the block matrix
[TABLE]
whose inverse is
[TABLE]
When the block is the diagonal block in , the corresponding automaton will go from state to state replacing by (or if occurs, it will replace it by ).
We will keep the transition matrix for a block diagonal matrix with such invertible blocks as the principal minors along the diagonal. In order to ensure this we introduce two new variables and substitute by the word in the expression. This will ensure that we do not have two consecutive in the resulting reduced words. In fact, between two variables (or their inverses) we will have inserted exactly two variables (or their inverses). Now, we define for the above example as
[TABLE]
The corresponding transitions of the automaton is shown in Figure 1.
We now describe the transition matrices for . The matrix is also a block diagonal matrix. There are three blocks along the diagonal. The first and third are blocks of the identity. The second one is a block for -transitions from state to state . It ensures that for any subword , , in the resulting product matrix the entry of the block is nonzero. The corresponding transitions of the automaton is depicted in Figure 2.
[TABLE]
Hence, evaluating we obtain (a polynomial weakly equivalent to) at the entry. The complete automaton is depicted in figure 3.
We now explain the general construction. For let denote the degree- homogeneous part of . We will denote by an arbitrary polynomial in weakly equivalent to .
Lemma 1**.**
Let be a nonzero expression of degree . There is an -tuple of matrices whose entries are either scalars, or variables , or their inverses , such that
[TABLE]
Furthermore, for each degree- reduced word of in ,
[TABLE]
Proof.
Let , for , be the elementary matrix in : its entry is and other entries are [math].
We now define the transition matrices of the NFA for variables and . For each , define matrix . Now is a matrix defined as the block diagonal matrix,
[TABLE]
[TABLE]
Each is the block diagonal matrix where each block is a matrix defined as . Their inverses have a similar structure.
[TABLE]
[TABLE]
The corresponding NFA is depicted in Figure 4. We substitute each by the matrix . Each is substituted by its inverse matrix .
Correctness.
Consider a degree- reduced word .
Following the automaton construction of Figure 4, occurring at position is substituted by . Moreover, for each position , the adjacent pair produces a scalar factor due to the product . Consequently, it follows that
[TABLE]
As is a linear map, the lemma follows. ∎
2.1 Black-box identity testing for circuits in free group algebra
Theorem 1 follows easily from Lemma 1. Lemma 1 says that if is nonzero of degree then the entry of the matrix is a nonzero polynomial in . Hence can not be an identity for .
It also immediately gives an identity testing algorithm. We can randomly substitute for the variables and apply the Schwartz-Zippel-Demillo-Lipton Theorem [Sch80, Zip79, DL78]. This completes the proof of the Corollary 1.
2.2 Reconstruction of sparse expressions in free group algebra
If the black-box contains an -sparse expression in , we give a deterministic interpolation algorithm (which also gives a deterministic identity testing for such expressions). We use a result of Klivans-Spielman [KS01, Theorem11] that constructs a test set in deterministic polynomial time for sparse commutative polynomials, which is used for the interpolation algorithm.
Proof of Theorem 2
Let the black-box expression be -sparse of degree . By Lemma 1, a polynomial in is obtained at the entry of the matrix , where is as defined in Lemma 1. By Definition 2, is -sparse and has variables. Let be the corresponding test set from [KS01] to interpolate a polynomial of degree and -sparse over variables. Querying the black-box on for each we can interpolate the commutative polynomial and obtain an expression for as a sum of monomials.
We now need to adjust the extra scalar factors in to obtain . We can perform this adjustment for each monomial as Lemma 1 shows that the extra scalar factor for the word is just . So the algorithm constructs the expression . We can remove the factors for each monomial and invert the map (using the property of Definition 2) on every monomial to obtain as a sum of degree reduced words. This yields the expression for highest degree homogeneous component of . We can repeat the above procedure on and reconstruct the remaining homogeneous components of . ∎
3 Black-box Identity Testing for Expressions of Exponential Degree and Exponential Sparsity
In this section, we prove a different generalization of Amitsur-Levitzki theorem [AL50] for free group algebras, based on ideas from [AJMR17]. We show that the dimension of the matrix algebra for which a nonzero input expression does not vanish is logarithmic in the sparsity of . It yields a randomized time identity testing algorithm when the black-box contains an expression of degree and sparsity .
We first recall the notion of isolating index set from [AJMR17].
Definition 3**.**
Let be a subset of reduced words of degree . An index set is an isolating index set for if there is a word such that for each there is an index for which . I.e. no other word in agrees with on all positions in the index set . We say is an isolated word.
In the following lemma we show that has an isolating index set of size . The proof is identical to [AJMR17]. Nevertheless, we give the simple details for completeness because we deal with both variables and their inverses.
Lemma 2**.**
[AJMR17]* Let be reduced degree- words. Then has an isolating index set of size which is bounded by .*
Proof.
The words are indexed, where denotes the variable (or the inverse of a variable) in the position of . Let be the first index such that not all words agree on the position. Let
[TABLE]
For some , or is of size at most . Let denote that subset, . We replace by and repeat the same argument for at most steps. Clearly, by this process, we identify a set of indices , such that the set shrinks to a singleton set . Clearly, is an isolating index set as witnessed by the isolating word . ∎
Proof of Theorem 3
Let where is the size of the isolating set . As in Section 2, we substitute each by , where are new variables. The transition matrices for and are denoted by and respectively.
For , we define matrix as a block diagonal matrix of many matrices where .
[TABLE]
[TABLE]
Notice that
[TABLE]
We now define the transition matrix as a block diagonal matrix,
[TABLE]
[TABLE]
These matrices can be seen as the transitions of a suitable NFA. We sketch the construction of this NFA.
Let be an isolating set such that . Intuitively, the NFA does one of two operations on each symbol (a variable or its inverse) of the input expression: a Skip or an Encode. In a Skip stage, the NFA deals with positions that are not part of the (guessed) isolating index set. In this stage, the NFA substitutes the variables by suitable scalars (coming from the matrices) and variables by block variables . The NFA nondeterministically decides whether the Skip stage is over and it enters the Encode stage for a guessed index of the isolating set. It substitutes and variables by and respectively. Fig. 5 summarizes the action of the NFA.
Define in to be rational function we obtain at the 222Recall that where is the size of an isolating set. entry by evaluating the expression . Notice that, the isolating word of degree will be of following form where each subword is of length , where some of the could be the empty word as well.
We refer to an NFA transition as a forward edge if and a backward edge if . We classify the backward edges in three categories based on the substitution on the edge-label. We say, a backward edge is of type A if a variable is substituted by a scalar value; a backward edge is of type B if a variable is substituted by for some ; a backward edge is of type C if a variable is substituted by or for some .
Consider a walk of the NFA on an input word that reaches state using only type A backward edges. In that case, is substituted by where is a monomial over of same degree,
[TABLE]
and is some nonzero constant obtained as a product of with the scalars obtained as substitutions from the edges involving the variables in the Skip stages. Indeed, as we can see from the entries of product matrices , where , the scalar is a product of with terms of the form , for , each of which is nonzero for any reduced word.
Claim 1**.**
[TABLE]
Proof.
It suffices to show that for any word , where has degree , no walks of the NFA accepting generate after substitution. We now argue that no other walks in the NFA can generate . For a computation path , the monomial in has two parts, let us call it and where is a monomial over and is a monomial over . If the computation path (which is different from the computation path described above for ) uses only type A backward edges, then necessarily from the definition of isolating index set. This argument is analogous to the argument given in [AJMR17].
Now consider a walk which involves backward edges of other types. Let us first consider those walks that take backward edges only of type A and type B. Such a walk still produces a monomial over and because division only by variables occur in the resulting expression. Since is of highest degree, the total degree of these monomials is strictly lesser than degree of . For those walks that take at least one backward edge of type C, a rational expression in and is produced (as there is division by or variables). As the sum of the degree of the numerator and degree of the numerator is bounded by the total degree, the degree of the numerator is smaller than degree of .
Thus the entry of the output matrix is of the form where are monomials arising from different walks (w.l.o.g. assume that ) and are the rational expressions from the other walks (due to the backward edges of type C). Note that, denominator in each is a monomial over of degree at most . Let . Now, we have,
[TABLE]
Since for any and degree of each degree of for any , the numerator of the final expression is a nonzero polynomial in . ∎
The above proof shows that the matrix is nonzero with rational entries in . Each entry is a linear combination of terms of the form , where and are monomials in of degree bounded by . This completes the proof. ∎
To get an identity testing algorithm, we can do random substitutions.The matrix dimension is and the overall running time of the algorithm is . This also proves Corollary 2. ∎
Remark 3**.**
For algorithmic purposes, we note that Theorem 1 is sometimes preferable to Theorem 3. For instance, the encoding used in Theorem 3 does not preserve the sparsity of the polynomial as required in the sparse reconstruction result (Theorem 2).
4 Adaptation for Fields of Positive Characteristic
Let be any finite field of characteristic . We need to ensure that for each word in the free group algebra, the scalar (see Equation 1) produced by the automaton described in Section 2 is not zero in . Recall that, reading for two consecutive positions, the automaton produces a scalar where . Moreover, this is the only way the automaton produces a scalar and for each , is a product of such terms. Hence, all we need to ensure is that for each pair , . Similarly, it ensures that the scalar produced by the automaton described in Section 3 is non-zero.
We note that, if is more than then each term where and . This results in a dependence on the characteristic of the base field for the analogous statements of Theorems 1, 3 over finite field. Additionally, for Theorem 1, the entry of the output matrix is a polynomial of degree , and for Theorem 3, the degrees of the numerator polynomials in the rational expression of the output matrix is bounded by some scalar multiple of . This lower bounds the size of the fields in the application. We summarize the above discussion in the following.
Observation 1**.**
We can obtain results analogous to Theorem 1 and Theorem 3 over finite fields of characteristic more than and sizes at least or respectively.
However, the algorithms presented in Theorem 2 and Corollaries 1, 2 can be modified to work for finite fields of any characteristic. To this end, we first notice the following simple fact.
Proposition 2**.**
Let be a finite field of characteristic . In We can find elements from a suitable (deterministically constructed) small extension field of in deterministic time, such that for any we have
[TABLE]
Let as given by the above proposition. We modify the matrix in the proof of Theorem 2 and Corollary 1 as
[TABLE]
and in Corollary 2 we modify as
[TABLE]
For each pair , by Proposition 2. Thus, for each word , the scalar produced by the automata are nonzero in the extension field as well. Furthermore, the test set of [KS01] works for all fields. Hence Theorem 2 holds for all finite fields too. To obtain Corollaries 1 and 2, we need to do the random substitution from suitable small degree extension fields and use Schwartz-Zippel-Demillo-Lipton Theorem [Sch80, Zip79, DL78]. In summary, our algorithms in the paper can be adapted to work over all fields.
Proof of Proposition 2. Define polynomial as
[TABLE]
We substitute for . Then is a univariate polynomial of degree at most . Using standard techniques, in deterministic polynomial time we can construct an extension field of such that is of size. We can find an element such that and set . ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AJMR 17] Vikraman Arvind, Pushkar S. Joglekar, Partha Mukhopadhyay, and S. Raja. Randomized polynomial time identity testing for noncommutative circuits. In Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2017, Montreal, QC, Canada, June 19-23, 2017 , pages 831–841, 2017.
- 2[AL 50] A. S. Amitsur and J. Levitzki. Minimal identities for algebras. Proceedings of the American Mathematical Society , 1(4):449–463, 1950.
- 3[AMS 10] Vikraman Arvind, Partha Mukhopadhyay, and Srikanth Srinivasan. New results on noncommutative and commutative polynomial identity testing. Computational Complexity , 19(4):521–558, 2010.
- 4[Ber 76] George M Bergman. Rational relations and rational identities in division rings. Journal of Algebra , 43(1):252 – 266, 1976.
- 5[BW 05] Andrej Bogdanov and Hoeteck Wee. More on noncommutative polynomial identity testing. In 20th Annual IEEE Conference on Computational Complexity (CCC 2005), 11-15 June 2005, San Jose, CA, USA , pages 92–99, 2005.
- 6[DL 78] Richard A. Demillo and Richard J. Lipton. A probabilistic remark on algebraic program testing. Information Processing Letters , 7(4):193 – 195, 1978.
- 7[DM 18] Harm Derksen and Visu Makam. Algorithms for orbit closure separation for invariants and semi-invariants of matrices. Co RR , abs/1801.02043, 2018.
- 8[FS 12] Michael Forbes and Amir Shpilka. Quasipolynomial-time identity testing of non-commutative and read-once oblivious algebraic branching programs. Foundations of Computer Science, 1975., 16th Annual Symposium on , 09 2012.
