Linearizing the Word Problem in (some) Free Fields
Konrad Schrempf

TL;DR
This paper presents a linear algebra-based method for solving the word problem in free fields using minimal linear representations, enabling rational identity testing and inverse construction.
Contribution
It introduces a linear algebra approach leveraging Cohn and Reutenauer's normal form for free fields, with a new method for minimal linear representation construction.
Findings
Effective solution to the word problem in free fields.
Method for testing rational identities.
Construction of minimal linear representations for inverses.
Abstract
We describe a solution of the word problem in free fields (coming from non-commutative polynomials over a commutative field) using elementary linear algebra, provided that the elements are given by minimal linear representations. It relies on the normal form of Cohn and Reutenauer and can be used more generally to (positively) test rational identities. Moreover we provide a construction of minimal linear representations for the inverse of non-zero elements.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Linearizing the Word Problem
in (some) Free Fields
Konrad Schrempf111Contact: [email protected], Universität Wien, Fakultät für Mathematik, Oskar-Morgenstern-Platz 1, 1090 Wien, Austria. Supported by the Austrian FWF Project P25510-N26 “Spectra on Lamplighter groups and Free Probability”
Abstract
We describe a solution of the word problem in free fields (coming from non-commutative polynomials over a commutative field) using elementary linear algebra, provided that the elements are given by minimal linear representations. It relies on the normal form of Cohn and Reutenauer and can be used more generally to (positively) test rational identities. Moreover we provide a construction of minimal linear representations for the inverse of non-zero elements.
Keywords: word problem, minimal linear representation, linearization, realization, admissible linear system, rational series
AMS Classification: 16K40, 03B25, 16S10, 15A22
Introduction
Free (skew) fields arise as universal objects when it comes to embed the ring of non-commutative polynomials, that is, polynomials in (a finite number of) non-commuting variables, into a skew field [Coh85, Chapter 7]. The notion of “free fields” goes back to Amitsur [Ami66]. A brief introduction can be found in [Coh03, Section 9.3], for details we refer to [Coh95, Section 6.4]. In the present paper we restrict the setting to commutative ground fields, as a special case. See also [Rob84]. In [CR94], Cohn and Reutenauer introduced a normal form for elements in free fields in order to extend results from the theory of formal languages. In particular they characterize minimality of linear representations in terms of linear independence of the entries of a column and a row vector, generalizing the concept of “controllability and observability matrix” (Section 3).
It is difficult to solve the word problem, that is, given linear representations (LRs for short) of two elements and , to decide whether . In [CR99] the authors describe an answer of this question (for free fields coming from non-commutative polynomials over a commutative field). In practice however, this technique (using Gröbner bases) can be impractical even for representations of small dimensions.
Fortunately, it turns out that the word problem is equivalent to the solvability of a linear system of equations if both elements are given by minimal linear representations. Constructions of the latter are known for regular elements (non-commutative rational series), but in general non-linear techniques are necessary. This is considered in future work. Here we present a simple construction of minimal LRs for the inverses of arbitrary non-zero elements given by minimal LRs. In particular this applies to the inverses of non-zero polynomials with vanishing constant coefficient (which are not regular anymore). This is of interest especially for those polynomials which are identities [AL50], for example (which vanishes identically on commutative rings).
In any case, positive testing of rational identities becomes easy. Furthermore, the implementation in computer (algebra) software needs only a basic data structure for matrices (linear matrix pencil) and an exact solver for linear systems.
Section 1 introduces the required notation concerning linear representations and admissible linear systems in free fields. Rational operations on representation level are formulated and the related concepts of linearization and realization are briefly discussed. Section 2 describes the word problem. Theorem 2.4 shows that the (in general non-linear) problem of finding appropriate transformation matrices can be reduced to a linear system of equations if the given LRs are minimal. Examples can be constructed for regular elements (rational series) as special cases (of elements in the free field), which are summarized in Section 3. Here algorithms for obtaining minimal LRs are already known. Section 4 provides a first step in the construction of minimal LRs (with linear techniques), namely for the inverses of non-zero elements given itself by a minimal linear representations. This is formulated in Theorem 4.20.
The main result is Theorem 2.4, the “linear” word problem. Although it is rather elementary, it opens the possibility to work directly on linear representations (instead of the spaces they “span”). Or, using Bergman’s words [Ber78]: “The main results in this paper are trivial. But what is trivial when described in the abstract can be far from clear in the context of a complicated situation where it is needed.”
1 Representing Elements
Although there are several ways for representing elements in (a subset of) the free field (linear representation [CR99], linearization [Coh85], realization [HMV06], proper linear system [SS78], etc.) the concept of a linear representation seems to be the most convenient. It has the advantage (among others) that in the special case of regular elements, the general definition of the rank coincides with the Hankel rank [Fli74], [SS78, Section II.3].
Closely related to LRs are admissible linear systems (ALS for short) [Coh85], which could be seen as a special case. Both notations will be used synonymously. Depending on the context an ALS will be written as a triple, for example or as linear system , sometimes as . Like the rational operations defined on linear representation level [CR99], similar constructions can be done easily on ALS level. Thus, starting from systems for monomials (Proposition 4.1) only, a representation for each element in the free field can be constructed recursively.
Notation. Zero entries in matrices are usually replaced by (lower) dots to stress the structure of the non-zero entries unless they result from transformations where there were possibly non-zero entries before. We denote by the identity matrix and the permutation matrix that reverses the order of rows/columns of size . If the size is clear from the context, and are used respectively.
Let be a commutative field and be a finite alphabet. denotes the free associative algebra (or “algebra of non-commutative polynomials”) and denotes the universal field of fractions (or “free field”) of [Coh95], [CR99]. In the examples the alphabet is usually .
Definition 1.1** **(Inner Rank, Full Matrix, Hollow Matrix
Given a matrix , the inner rank of is the smallest number such that there exists a factorization with and . The matrix is called full if , non-full otherwise. It is called hollow if it contains a zero submatrix of size with .
Definition 1.2** **(Associated and Stably Associated Matrices
[Coh95]).
Two matrices and over (of the same size) are called associated over a subring if there exist invertible matrices over such that . and (not necessarily of the same size) are called stably associated if and are associated for some unit matrices and . Here by we denote the diagonal sum \bigl{[}\begin{smallmatrix}C&.\\ .&D\end{smallmatrix}\bigr{]}.
In general it is hard to decide whether a matrix is full or not. For a linear matrix, that is, a matrix of the form with over , the following criterion is known, which is used in (the proof of) Theorem 2.1. If a matrix over is not linear, then Higman’s trick [Coh85, Section 5.8] can be used to linearize it by enlargement. The inner rank is also discussed in [FR04].
Lemma 1.3** ([Coh95, Corollary 6.3.6]).**
A linear square matrix over which is not full is associated over to a linear hollow matrix.
Definition 1.4** **(Linear Representations
[CR94]).
Let . A linear representation of is a triple with , , and such that is full, that is, is invertible over the free field , and . The dimension of the representation is . It is called minimal if has the smallest possible dimension among all linear representations of . The “empty” representation is the minimal representation for with .
Remark. In Definition 1.15 it can be seen that is (up to sign) the Schur complement of the linearization \bigl{[}\begin{smallmatrix}0&u\\ v&A\end{smallmatrix}\bigr{]} with respect to the upper left block.
Definition 1.5** ([CR99]).**
Two linear representations are called equivalent if they represent the same element.
Definition 1.6** **(Rank
[CR99]).
Let and be a minimal representation of . Then the rank of is defined as .
Remark. The connection to the related concepts of inversion height and depth can be found in [Reu96], namely inversion height depth rank. Additional discussion about the depth appears in [Coh06, Section 7.7].
Definition 1.7**.**
Let . An element in is called regular, if it has a linear representation with , that is, in Definition 1.4, or equivalently, if is regular (invertible).
Definition 1.8** **(Left and Right Families
[CR94]).
Let be a linear representation of of dimension . The families with and with are called left family and right family respectively. and denote their linear spans.
Remark. The left family (respectively the right family ) and the solution vector of (respectively of ) will be used synonymously.
Proposition 1.9** ([CR94], Proposition 4.7).**
A representation of an element is minimal if and only if both, the left family and the right family, are -linearly independent.
Definition 1.10** **(Admissible Linear Systems
[Coh72]).
A linear representation of is called admissible linear system (for ), denoted by , if . The element is then the first component of the (unique) solution vector .
Remark. In [Coh85], Cohn defines admissible linear systems with with , and . Writing as block of size the first columns of serve as numerator and the last columns of as denominator. However, in this setting, for regular elements, the dimension of such a minimal system could differ from the Hankel rank [Fli74], [SS78, Section II.3].
Definition 1.11** (Admissible Transformations).**
Given a linear representation of dimension of and invertible matrices , the transformed is again a linear representation (of ). If is an ALS, the transformation is called admissible if the first row of is .
Remark 1.12** (Elementary Transformations).**
In practice, transformations can be done by elementary row- and column operations (with respect to the system matrix ). If we add -times row to row in , we also have to do this in . If we add -times column to column we have to subtract -times row from row in . Since it is not allowed to change the first entry of , column 1 cannot be used to eliminate entries in other columns! As an example consider the ALS
[TABLE]
for the element . Adding column 1 to column 2, that is, Q=\bigl{[}\begin{smallmatrix}1&1\\ .&1\end{smallmatrix}\bigr{]} (and ), yields the ALS
[TABLE]
for the element .
Proposition 1.13** **(Rational Operations
[CR99, Section 1]).
Let be given by the admissible linear systems , and respectively, with and let . Then admissible linear systems for the rational operations can be obtained as follows:
The scalar multiplication is given by
[TABLE]
The sum is given by
[TABLE]
The product is given by
[TABLE]
And the inverse is given by
[TABLE]
Remark. One can easily verify that the solution vectors for the admissible linear systems defined above are
[TABLE]
respectively. It remains to check that the system matrices are full. For the sum and the product this is clear from the fact that the free associative algebra —being a free ideal ring (FIR)— has unbounded generating number (UGN) and therefore the diagonal sum of full matrices is full, see [Coh85, Section 7.3]. The system matrix for the inverse is full because and therefore the linearization of is full, compare [Coh95, Section 4.5].
Remark 1.14**.**
For the rational operations from Proposition 1.13 we observe that the left families satisfy the relations
[TABLE]
And similarly for the right families we have
[TABLE]
Definition 1.15** **(Linearization
Let . A linearization of is a matrix , with , of the form
[TABLE]
such that is invertible over the free field and is the Schur complement, that is, . If then is called a pure linearization. The size of the linearization is , the dimension is .
Proposition 1.16** ([BMS17, Proposition 3.2]).**
Let and , , and be given and assume that is invertible in . Then the matrix \bigl{[}\begin{smallmatrix}A&B\\ C&D\end{smallmatrix}\bigr{]} is invertible in if and only if the Schur complement is invertible in . In this case
[TABLE]
Remark 1.17**.**
(i) Let be given by the linearization . Then is the -quasideterminant [GGRW05] of .
(ii) Given a linear representation of , then L=\bigl{[}\begin{smallmatrix}.&u\\ -v&A\end{smallmatrix}\bigr{]} is a pure linearization of .
(iii) Talking about a minimal linearization, one has to specify which class of matrices is considered: Scalar entries in the first row and column? Pure? And, if applicable, selfadjoint?
Proposition 1.18**.**
Let
[TABLE]
be a linearization of size for some element and define another element by the pure linearization
[TABLE]
of size . Then .
Proof.
Using Proposition 1.16 —taking the Schur complement with respect to the block entry — and , the inverse of \tilde{A}=\bigl{[}\begin{smallmatrix}L&b^{\!\top}\\ b&.\end{smallmatrix}\bigr{]} can be written as
[TABLE]
Hence
[TABLE]
If the first row or column of a linearization for some contains non-scalar entries, then Proposition 1.18 can be used to construct a linear representation of . On the other hand, given a linear representation of dimension (of ) which can be brought to such a form, a linearization of size can be obtained. The characterization of minimality for linearizations will be considered in future work.
Example 1.19**.**
For the anticommutator a minimal ALS is given by
[TABLE]
Permuting the columns and multiplying the system matrix by we get the linearization
[TABLE]
which is of the form in Proposition 1.18 and yields a minimal (pure) linearization of the anticommutator
[TABLE]
Definition 1.20** **(Realization
[HMV06]).
A realization of a matrix is a quadruple with , , , and such that is invertible over the free field and . The dimension of the realization is .
Remark. A realization could be written in block form
[TABLE]
Here, the definition is such that is the -block-quasideterminant [GGRW05] with respect to block . For we obtain the descriptor realization in [HMV06]. Realizations where and/or contain non-scalar entries are sometimes called “butterfly realizations” [HMV06]. Minimality with respect to realizations is investigated in [Vol18].
2 The Word Problem
Let be given by the linear representations and of dimension and respectively and define the matrix
[TABLE]
which is a linearization of , of size . Then if and only if is not full [Coh95, Section 4.5]. For the word problem see also [Coh95, Section 6.6]. Whether is full or not can be decided by the following theorem. For and the commutative polynomial ring is
[TABLE]
Theorem 2.1** ([CR99, Theorem 4.1]).**
For each , denote by the ideal of generated by the polynomials , and the coefficients of each in the entries of the matrix for , . Then the linear matrix is full if and only if for each , the ideal is trivial.
Remark. Notice that there is a misprint in [CR99] and the coefficients of are omitted.
So far we were not able to apply this theorem practically for , where or more unknowns are involved. However, if we have any ALS (or linear representation) for , say from Proposition 1.13, then we could check whether it can be (admissibly) transformed into a smaller system, for example . For polynomials (with and upper triangular and nilpotent) this could be done row by row. In general the pivot blocks (the blocks in the diagonal) can be arbitrarily large. Therefore this elimination has to be done blockwise by setting up a single linear system for row and column operations. This idea is used in the following lemma. Note that the existence of a solution for this linear system is invariant under admissible transformations (on the subsystems). This is a key requirement since the normal form [CR94] is defined modulo similarity transformations (more general by stable association, Definition 1.2).
Theorem 2.2** ([CR99, Theorem 1.4]).**
If and are equivalent (pure) linear representations, of which the first is minimal, then the second is isomorphic to a representation which has the block decomposition
[TABLE]
Lemma 2.3**.**
Let be given by the admissible linear systems and of dimension and respectively. If there exist matrices such that , and , then .
Proof.
The difference can be represented by the admissible linear system with
[TABLE]
Defining the (invertible) transformations
[TABLE]
and , and we get a new system :
[TABLE]
Invertibility of over the free field implies , in particular
[TABLE]
because . ∎
Let be the number of letters in the alphabet , and . To determine the transformation matrices from the lemma we just have to solve a linear system of equations in unknowns. If there is a solution then . Neither nor have to be minimal. Computer experiments show, that Hua’s identity [Ami66]
[TABLE]
can be tested positively by Lemma 2.3 when the left hand side is constructed by the rational operations from Proposition 1.13. However, without assuming minimality, the fact that there is no solution does not imply, that , see Example 2.5 below.
Theorem 2.4** (“Linear solution” of the Word Problem).**
Let be given by the minimal admissible linear systems and of dimension respectively. Then if and only if there exist matrices such that , and .
Proof.
If then, since admissible linear systems correspond to (pure) linear representations, by Theorem 2.2 there exist invertible matrices such that and . Let and . The admissible linear systems are minimal. Hence, the left family is -linearly independent. Since the first component of is equal to the first component of and the left family is -linearly independent, the first row of must be . Therefore . Clearly and
[TABLE]
The other implication follows from Lemma 2.3. ∎
Example 2.5**.**
Let and be given by the admissible linear systems
[TABLE]
respectively. Then the ALS
[TABLE]
represents .
While it is obvious here that the second component of the solution vector is zero, it is not clear in general how one can exclude such “pathological” LRs without minimality assumption. One might ask, for which class of constructions (rational operations [CR99], Higman’s trick [Hig40, Coh85], selfadjoint linearization trick [And13], etc.) there are sufficient conditions for the existence of matrices (over ) in Lemma 2.3 if . Unfortunately this seems to be impossible except for some specific examples as the following ALS (constructed by the rational operations from Proposition 1.13) for suggests (some zeros are kept to emphasize the block structure):
[TABLE]
There are no and therefore no (admissible, with blocks ) such that has a upper right block of zeros and the first two components of are zero. Therefore we would like to construct minimal linear representations directly. For regular elements, algorithms are known, see Section 3. How to proceed in general is open except for the inverse. This is discussed in Section 4.
3 Regular Elements
For regular elements (Definition 1.7) in the free field minimal linear representations can be obtained via the Extended Ho-Algorithm [FM80] from the Hankel matrix or by minimizing a given linear representation via the algorithm in [CC80] by detecting linearly dependent rows in the controllability matrix and linearly dependent columns in the observability matrix, see Definition 3.3. The basic idea goes back to Schützenberger [Sch61]. Controllability and observability is discussed in [KFA69, Chapter 10].
For an alphabet (finite, non-empty set), the free monoid generated by is denoted by . A formal power series (in non-commuting variables) is a mapping from to a commutative field , written as formal sum
[TABLE]
with coefficients . In general, could be replaced by a ring or a skew field. [SS78] and [BR11] contain detailed introductions. On the set of formal power series the following rational operations are defined for with , and :
The scalar multiplication
[TABLE]
The sum
[TABLE]
The product
[TABLE]
And the quasiinverse
[TABLE]
The set of non-commutative (nc) rational series is the smallest rationally closed (that is, closed under scalar multiplication, sum, product and quasiinverse) subset of containing the nc polynomials . A series is called recognizable if there exists a natural number , a monoid homomorphism and two vectors , such that can be written as
[TABLE]
The triple is called a linear representation [SS78, Section II.2].
Theorem 3.1** ([Sch61]).**
A series is rational if and only if it is recognizable.
A rational series can be represented by a proper linear system (PLS for short) where is the first component of the unique solution vector (with , , for some ). Rational operations are then formulated on this level [SS78, Section II.1]. Clearly, every proper linear system gives rise to an admissible linear system with . When is a linear representation of a recognizable series, then with
[TABLE]
is a linear representation of . For a PLS the solution vector can be computed by the quasiinverse :
[TABLE]
Definition 3.3** (Controllability and Observability Matrix).**
Let be a proper linear system of dimension (for some nc rational series). Then the controllability matrix and the observability matrix are defined as
[TABLE]
respectively.
Remark. Note that the monomials (in the polynomials) in have length . The matrices and are over . A priori these matrices would have an infinite number of columns and rows respectively. However, by [Coh95, Lemma 6.6.3], it suffices to use the columns of and rows of only. This gives the connection to [BR11, Section I.2] and could be used for minimizing proper linear systems [SS78]. In other words: Instead of identifying -linear dependence of the left family , we can restrict to the “approximated” left family .
Now let denote the set of words of length and use from (3.2) to define with columns for and with rows for . Then the controllability matrix and the observability matrix can be defined alternatively as
[TABLE]
with entries in . Note that the rank of is at most while the number of columns of the blocks is . So —for an alphabet with more than one letter— most of the columns are not needed. For and we have and . Hence and can be written as
[TABLE]
Compare with [KFA69, Section 6.3]. For controllability and observability in connection with realizations (Definition 1.20) see also [HMV06].
4 Minimizing the Inverse
Having solved the word problem, our next goal is “minimal arithmetics” in the free field . That is, given elements by minimal admissible linear systems, to compute minimal ones for the rational operations. For the scalar multiplication this is trivial. For the inverse some preparation is necessary. The result is presented in Theorem 4.20. The “minimal sum” and the “minimal product” are considered in future work. The main difficulty is not minimality but the restriction to linear techniques.
Proposition 4.1**.**
Let and be a monomial in . Then
[TABLE]
is a minimal ALS of dimension .
Proof.
The system matrix of is full. For row indices and column indices the Hankel matrix [Fli74], [SS78, Section II.3] of is
[TABLE]
and has rank . ∎
Remark. Trivially, is a minimal ALS for the unit element (empty word).
The following proposition is a variant of the inverse in Proposition 1.13 and is motivated by inverting the inverse of a monomial, for example, . A minimal ALS for is given by
[TABLE]
Minimality is clear immediately by also checking the -linear independence of the right family. Using the construction of the inverse from Proposition 1.13 we get the system
[TABLE]
for . To obtain the form of Proposition 4.1 we have to reverse the rows and the columns and multiply the rows by .
Proposition 4.2** (Standard Inverse).**
Let be given by the admissible linear system of dimension . Then an admissible linear system of dimension for is given by
[TABLE]
(Recall that the permutation matrix reverses the order of rows/columns.)
Definition 4.4** (Standard Inverse).**
Let be an ALS for a non-zero element. We call the ALS (4.3) the standard inverse of , denoted by .
Proof.
The reader can easily verify that the solution vector of is
[TABLE]
Compare with Proposition 1.13. ∎
We proceed with the calculation of minimal admissible linear systems for the inverse. We distinguish four types of ALS according to the form of the system matrix. Later, in the remark following Lemma 4.18 and 4.19, we will see how to bring a system matrix to one of these forms depending on the left and right families.
Lemma 4.5** (Inverse Type ).**
Assume that has a minimal admissible linear system of dimension of the form
[TABLE]
Then a minimal ALS for of dimension is given by
[TABLE]
with and .
Proof.
The standard inverse of is
[TABLE]
Adding row 4 to row 3 and -times column 2 to column 1 gives
[TABLE]
It follows that and does not contribute to the solution and thus the first and the last row as well as the second and the last column can be removed. If were not minimal, then there would exist a system of dimension for . The standard inverse would give a system of dimension for , contradicting minimality of . It remains to show that and . Let be the right family of which is (due to minimality) -linearly independent. Then the right family of is , that after the row operation becomes . Removing the first and the last component (corresponding to the first and the last row) yields the right family . Therefore , because otherwise , contradicting -linear independence of . Similar arguments show that . ∎
Lemma 4.8** (Inverse Type ).**
Assume that has a minimal admissible linear system of dimension of the form
[TABLE]
with . Then a minimal ALS for of dimension is given by
[TABLE]
with .
Proof.
The standard inverse of is
[TABLE]
and has dimension . After adding row to row we can remove row and column because does not contribute to the solution . Then we divide the first row by and obtain (4.10). It remains to show that is minimal and . Let be the left family of which is (due to minimality) -linearly independent. Then the left family of is . Note that (admissible) row operations do not affect the left family. Since we removed the last entry , the left family of is . By assumption . Therefore , hence is -linearly independent. Clearly, because . Similarly, let be the right family of which is -linearly independent. Then the right family of is , that after the row operation is . Since we removed the last entry, the right family of is which is clearly -linearly independent. Proposition 1.9 gives minimality of . ∎
Lemma 4.11** (Inverse Type ).**
Assume that has a minimal admissible linear system of dimension of the form
[TABLE]
with . Then a minimal ALS for of dimension is given by
[TABLE]
with .
Proof.
The standard inverse of is
[TABLE]
After adding -times column 2 to column 1 we can remove row 1 and column 2, because . Showing minimality and is similar to the proof of Lemma 4.8 (column operations affect the left family). ∎
Lemma 4.14** (Inverse Type ).**
Let be a minimal admissible linear system of dimension for with and . Then the standard inverse is a minimal ALS of dimension for .
Proof.
If were not minimal, then there would be a system of dimension for . Applying Lemma 4.5 (Inverse Type ), we would get a system of dimension for , contradicting minimality of . ∎
Example 4.15**.**
Taking the minimal ALS (for the anticommutator) from Example 1.19, we get, by Lemma 4.5 (Inverse Type ), a minimal ALS for :
[TABLE]
Lemma 4.14 (Inverse Type ) gives again a minimal system for .
Since it is possible to construct minimal linear representations for regular elements (see Section 3), this is in particular true for polynomials. These are of the form (4.6), since with nilpotent , which can be choosen upper triangular. This can be seen either by looking at proper linear systems (Section 3) where admissible transformations are conjugations (of the system matrix such that the first component of the solution vector is left untouched) or by the following proposition.
Proposition 4.16** ([CR99, Proposition 2.1]).**
Let .
(i) is a power series if and only if in any minimal representation, the constant term of its system matrix is invertible. There is then a minimal representation which is unital, that is, .
(ii) is a polynomial if and only if in any unital minimal representation, the matrix is nilpotent. There is then a minimal representation with a unitriangular (ones on and zeros below the diagonal) system matrix.
Example 4.17**.**
The element admits the following minimal ALS
[TABLE]
with right family and left family . Now Lemma 4.8 (Inverse Type ) can be applied to get the minimal ALS
[TABLE]
for . Note that we can apply Lemma 4.8 again, because .
If an admissible linear system is of the form (4.12) in Lemma 4.11 (Inverse Type ), then it follows immediately that . Conversely, assuming , the proof of the existence of an admissible transformation such that is of the form (4.12) is a bit more involved and requires minimality.
Lemma 4.18** (for Inverse Type ).**
Let be a minimal admissible linear system with and . Then there exists an admissible transformation such that is of the form (4.12).
Proof.
Without loss of generality, assume that and the left family is . Otherwise it can be brought to this form by some admissible transformation . Now let denote the upper left block of , let and write as
[TABLE]
This system is equivalent to
[TABLE]
Since the left family is -linearly independent (by minimality of ), the matrix \tilde{A}=\bigl{[}\begin{smallmatrix}\bar{A}&b\\ c&d-1\end{smallmatrix}\bigr{]} cannot be full. We claim that there is only one possibility to transform to a hollow matrix, namely with zero last row. If we cannot produce a block of zeros (by invertible transformations) in the first rows of , then we cannot get blocks of zeros of size and we are done.
Now assume that there are invertible matrices and (admissible) with , such that contains a zero block of size for some . There are two cases. If the first entries in column 1 cannot be made zero, we construct an upper right zero block:
[TABLE]
where has size . If were not full, then would not be full (the last row is not involved in the transformation). Hence this pivot block is invertible over the free field. Therefore . Otherwise we construct an upper left zero block in . But then . Both contradict -linear independence of the left family.
So there is only one block left, which can make non-full. Hence, by Lemma 1.3, the modified (system) matrix is associated over to a linear hollow matrix with a block of zeros, say in the last row (the columns and the first rows are left unouched):
[TABLE]
Hence we have and therefore
[TABLE]
Clearly . The transformation
[TABLE]
does the job. ∎
Lemma 4.19** (for Inverse Type ).**
Let be a minimal admissible linear system with and . Then there exists an admissible transformation such that is of the form (4.9).
Proof.
The proof is similar to the previous one switching the role of left and right family. ∎
Remark. If for some minimal ALS , say , then, by Lemma 4.19, there is an admissible transformation such that the first column in is . So, if the first column of is not in this form, an admissible transformation can be found in two steps: Firstly, we can set up a linear system to determine an -duple of scalars such that is in for . Secondly, we use elementary row transformations (Gaussian elimination in the first column) and —if necessary— permutations to get the desired form of the first column. Together, these transformations give some (admissible) transformation .
An analogous procedure can be applied if . And it can be combined for the case as in Lemma 4.5 (Inverse Type ). It works more generally for non-minimal systems, but can fail in “pathological” cases. Compare with Example 2.5.
Theorem 4.20** (Minimal Inverse).**
Let be given by the minimal system of dimension . Then a minimal admissible linear system for is given by
[TABLE]
provided that the necessary transformations according to Lemma 4.18 and 4.19 are done before.
Proof.
See Lemma 4.5, 4.8, 4.11 and 4.14. ∎
There are two immediate consequences: The first follows from Proposition 4.16, that is, the inverse type applies in particular to polynomials. The second can be used to distinguish between “trivial” units (in the ground field ) and “non-trivial” units, that is, elements in .
Corollary 4.21**.**
Let with . Then .
Corollary 4.22**.**
Let . Then if and only if .
Acknowledgement
Special thanks go to Marek Bożejko and Victor Vinnikov for the hospitality and all the motivating discussions in Wrocław and Beer-Sheva, respectively. However, I am very grateful to Franz Lehner. He gave me the chance to enter the world of non-commutative mathematics. Without the freedom, support and advice he offered this work would not have been possible. Additonally I thank the anonymous referees for the valuable comments.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AL 50] A. S. Amitsur and J. Levitzki. Minimal identities for algebras. Proc. Amer. Math. Soc. , 1:449–463, 1950.
- 2[Ami 66] S. A. Amitsur. Rational identities and applications to algebra and geometry. J. Algebra , 3:304–359, 1966.
- 3[And 13] G. W. Anderson. Convergence of the largest singular value of a polynomial in independent Wigner matrices. Ann. Probab. , 41(3B):2103–2181, 2013.
- 4[Ber 78] G. M. Bergman. The diamond lemma for ring theory. Adv. in Math. , 29(2):178–218, 1978.
- 5[BMS 17] S. T. Belinschi, T. Mai, and R. Speicher. Analytic subordination theory of operator-valued free additive convolution and the solution of a general random matrix problem. J. Reine Angew. Math. , 732:21–53, 2017.
- 6[BR 11] J. Berstel and C. Reutenauer. Noncommutative rational series with applications , volume 137 of Encyclopedia of Mathematics and its Applications . Cambridge University Press, Cambridge, 2011.
- 7[CC 80] A. Cardon and M. Crochemore. Détermination de la représentation standard d’une série reconnaissable. RAIRO Inform. Théor. , 14(4):371–379, 1980.
- 8[Coh 72] P. M. Cohn. Generalized rational identities. In Ring theory (Proc. Conf., Park City, Utah, 1971) , pages 107–115. Academic Press, New York, 1972.
