Characteristic free description of semi-invariants of $2\times 2$ matrices
M. Domokos

TL;DR
This paper provides a minimal generating system for the algebra of semi-invariants of tuples of 2x2 matrices over various fields and rings, and also identifies a characteristic-independent separating system.
Contribution
It introduces a characteristic-free minimal generating system for semi-invariants of 2x2 matrices and relates it to vector invariants of the special orthogonal group.
Findings
Minimal homogeneous generating system for semi-invariants over characteristic two and integers.
A characteristic-independent separating system of semi-invariants.
Connection to vector invariants of the special orthogonal group.
Abstract
A minimal homogeneous generating system of the algebra of semi-invariants of tuples of two-by-two matrices over an infinite field of characteristic two or over the ring of integers is given. In an alternative interpretation this yields a minimal system of homogeneous generators for the vector invariants of the special orthogonal group of degree four over a field of characteristic two or over the ring of integers. An irredundant separating system of semi-invariants of tuples of two-by-two matrices is also determined, it turns out to be independent of the characteristic.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Characteristic free description of semi-invariants of matrices
M. Domokos
Alfréd Rényi Institute of Mathematics, Reáltanoda utca 13-15, 1053 Budapest, Hungary
Abstract.
A minimal homogeneous generating system of the algebra of semi-invariants of tuples of two-by-two matrices over an infinite field of characteristic two or over the ring of integers is given. In an alternative interpretation this yields a minimal system of homogeneous generators for the vector invariants of the special orthogonal group of degree four over a field of characteristic two or over the ring of integers. An irredundant separating system of semi-invariants of tuples two-by-two matrices is also determined, it turns out to be independent of the characteristic.
2010 Mathematics Subject Classification:
Primary: 13A50; Secondary: 14L30, 15A72, 16R30.
This research was partially supported by National Research, Development and Innovation Office, NKFIH K 119934.
1. Introduction
For positive integers denote by the space of -tuples of matrices over an infinite field . The group (the direct product of two copies of the special linear group over ) acts on this space as follows:
[TABLE]
for , . Write for the corresponding algebra of polynomial invariants; that is, consists of the polynomial functions on that are constant along the -orbits. We call the algebra of semi-invariants of -tuples of matrices (and sometimes we shall denote it by , when the indication of the base field seems desirable). Interest in this algebra was raised recently by connections to algebraic complexity theory, see for example [18], [21], [22]. Another motivation comes from representation theory of quivers: semi-invariants of tuples of matrices are special instances of semi-invariants associated to representation spaces of quivers, which are used to construct moduli spaces parametrizing quiver representations, see [25].
-vector space spanning sets of were given independently by Derksen and Weyman [7], Domokos and Zubkov [14], and in characteristic zero by Schofield and Van den Bergh [30] (these papers deal with the more general setup of semi-invariants of quiver representations). To deduce a finite -algebra generating set of one needs an explicit degree bound for the generators. Good degree bounds were given recently by Derksen and Makam [4], [5]. These results imply that the -form of (see Section 5) is generated as a ring by an explicitly given finite set of elements (see Theorem 5.4), moreover, generators of yield generators in by base change (see Proposition 5.1). This raises the question of finding an explicit minimal homogeneous system of generators of the ring (as an optimal characteristic free description of the generators of ). In the present paper we answer the special case of this question.
In fact an explicit minimal homogeneous generating system of is known in very few cases only. In the special case and , a minimal homogeneous generating system of was determined by Domokos [9]. The generators have degree and . (As a byproduct of our calculations here we get a new proof of this statement, see Theorem 4.3.) It was pointed out by Domokos and Drensky [10] that the result remains valid for any infinite field with thanks to the characteristic free description of vector invariants of the special orthogonal group due to De Concini and Procesi [2]. Apart from that a minimal homogeneous generating system of is known only when by [8, Theorem 4.3] for (see also [28, Proposition 11.49]) and by [11, Theorem 3] for arbitrary infinite or for .111After publication of the present work came to my attention that this account is not complete, see Section 8 at the end of the paper.
In the present paper we determine a minimal homogeneous generating system of when and (see Theorem 2.1). Simultaneously we get in Theorem 2.2 a minimal system of -algebra generators in the -form of . An irredundant separating system of -invariants on is provided for an algebraically closed base field of arbitrary characteristic in Theorem 6.1.
Additional interest in the calculations of this paper stems from a connection to the open problem of generating vector invariants of the orthogonal group in characteristic two. This is discussed in Section 7, where we deduce that in the -dimensional case, the exotic vector invariants of (with ) constructed by Domokos and Frenkel [13] together with the well-known quadratic invariants generate the corresponding algebra of invariants.
2. The main result
Denote by the function mapping to the -entry of . Then is the -variable polynomial algebra . Write for the matrix over whose -entry is (; ). So (called sometimes generic matrices) are elements of the noncommutative ring . The algebra is graded in the standard way, namely the generators have degree . The subalgebra is generated by homogeneous elements. Write for the subspace of spanned by its homogeneous elements of positive degree (clearly is an ideal in and ). By the graded Nakayama lemma some homogeneous elements in form a minimal homogeneous -algebra generating system of if and only if they constitute an -vector space basis in an -vector space direct complement of the ideal in . A homogeneous semi-invariant is said to be indecomposable if is not contained in (equivalently, can not be expressed as a polynomial of semi-invariants of strictly smaller degree).
Now suppose that , and let us construct some semi-invariants. Clearly the determinant is a semi-invariant for , and we also have its linearization; that is, for define by the equality
[TABLE]
(where are additional commuting variables). That is,
[TABLE]
Fix an integer with , take additional commuting variables , and define as the coefficient of in the determinant
[TABLE]
where we put the zero matrix in those positions of the above block matrix that are not indicated (so in (1) we take the determinant of a matrix over ). If are matrices with entries in an arbitrary commutative -algebra , we shall write for the image of under the unique -algebra homomorphism from that maps to the -entry of . Now we are in position to state the main results of the paper:
Theorem 2.1**.**
Let be an infinite field of characteristic . Then the following set is a minimal homogeneous generating system of the algebra :
[TABLE]
Theorem 2.2**.**
The ring is minimally generated by the elements (2) (defined over ).
3. A spanning set of semi-invariants
We need to develop some notation in order to recall a known -vector space spanning set of . Given an matrix and a matrix with entries in some commutative ring we write for their Kronecker product. That is, is a block matrix with blocks of size , and the block in the position is .
In the following discussion is arbitrary, and recall that . Fix another positive integer and set . Then the coordinate ring of is
[TABLE]
where is the function mapping an element to the -entry of . For set
[TABLE]
Then and are both matrices over , so their Kronecker product is an matrix with entries in the ring . We shall write
[TABLE]
More generally, for any -tuple of matrices over we shall use the notation
[TABLE]
For set
[TABLE]
An element can be viewed as a polynomial over with variables , and denote by the coefficient of in :
[TABLE]
A well known construction of semi-invariants is based on the multiplicativity of the determinant:
Lemma 3.1**.**
For any and we have that belongs to .
Proof.
For write for the element of obtained by glueing copies of along the diagonal. Given we have the matrix equality
[TABLE]
Take determinants, and use the multiplicativity of the determinant to conclude that
[TABLE]
The latter equality shows that the coefficients of (viewed as a polynomial in the with coefficients in ) are -invariant. ∎
Our starting point is the following result of Derksen and Weyman [7], Domokos and Zubkov [14], and (in characteristic zero) of Schofield and Van den Bergh [30]; in positive characteristic this result was obtained using the theory of modules with good filtration (see [16] and the references there):
Theorem 3.2**.**
Let be an infinite field. Then the degree of a non-zero homogeneous element in is a multiple of , and for , the degree homogeneous component of the algebra of matrix semi-invariants is spanned as an -vector space by
[TABLE]
4. The main reductions
In this section we turn to the case , write for and for .
Lemma 4.1**.**
Take and such that is indecomposable in . Then the following hold:
- (i)
For all we have .
- (ii)
The semi-invariant up to sign agrees with , where are elements of .
Proof.
(i) When we expand the determinant of , each of the terms contains at most two entries from each block of . Since , it follows that for all we have . Suppose contrary to our statement (i) that for some . By symmetry we may assume that . Laplace expansion of the determinant of along the first two rows shows that
[TABLE]
where is obtained from by removing the first row and column of each component . The two factors on the right hand side above both belong to by Lemma 3.1. Since is indecomposable, this is a contradiction. Thus for all .
(ii) Consider the set of pairs . Each of the terms of the expansion of the determinant of the matrix contains exactly two factors from the first two rows (respectively columns), two factors from the second two rows (respectively columns), etc. Taking into account (i), we get that for each we have , and for each we have . A theorem of Kőnig [26] (asserting that a regular bipartite graph has a perfect matching) implies that there exist permutations of the set such that for each , and . By symmetry (or simultaneously renumbering the rows of the matrix components of ) we may assume that is the identity permutation, and so is fix point free. We claim that is in fact a -cycle. Suppose to the contrary that preserves a proper non-empty subset of . Again after a possible simultaneous renumbering of both the rows and columns of the matrix components of we may assume that preserves the subset (and then necessarily preserves the complement ). This means that each component of is block diagonal, so , where and . Set , where is the left upper block of , and set , where is the complementary block to in . The Laplace expansion of the determinant of along the first rows shows that
[TABLE]
where and . Both factors on the right hand side above belong to by Lemma 3.1, so this is a contradiction. This proves our claim that is a -cycle. Moreover, after a possible simultaneous renumbering of both the rows and columns of all components of we may assume that is the -cycle . Thus there exist unique elements such that
[TABLE]
For this we have . Since in the process of reducing to this special we had to renumber rows and columns of the components of , the original semi-invariant we started with agrees with this latter up to sign. ∎
For write if . In particular, a homogeneous element of is not indecomposable if and only if .
Lemma 4.2**.**
Let be an integer with .
- (i)
For any permutation of the set we have
[TABLE]
- (ii)
If are not distinct, then .
Proof.
We first investigate what happens if we make the substitution in . Using standard column operations on matrices (with entries in the field of fractions of the polynomial ring ) we have the following:
[TABLE]
By Laplace expansion along the first two rows the determinant on the right hand side above equals
[TABLE]
This implies that substituting and by the same matrix we get
[TABLE]
for and
[TABLE]
In particular, setting we conclude that
[TABLE]
Now note that is an -graded subalgebra of , where the -grading is given by setting the degree of the entries of to be the standard basis vector of . Therefore any multihomogeneous component of the left hand side of (4) must belong to . Observe also that is additive in each of its matrix arguments. So taking the multihomogeneus component of multidegree of (4) we get
[TABLE]
Similar argument shows that changes sign if we interchange any two matrix arguments that appear in the same row or in the same column of the block matrix in (1). Since any permutation of the matrix arguments can be obtained by applying an appropriate sequence of such transpositions, we obtain that (i) holds.
In view of statement (i), it is sufficient to prove (ii) in the special case when . The multihomogeneous component of (4) of multidegree gives
[TABLE]
from which the desired statement follows by making the substitution , . ∎
Proof of Theorem 2.1. Combining Theorem 3.2, Lemma 4.1, and Lemma 4.2 we get that the homogeneous semi-invariants in the statement of Theorem 2.1 indeed generate the -algebra . To prove minimality of this generating system, note that each of the given generators are multihomogeneous with respect to the -grading of mentioned in the proof of Lemma 4.2. Therefore it is sufficient to show that each generator is indecomposable. This is obvious for and since they have minimal possible total degree. Indecomposability of for and was proved by Domokos and Frenkel [12, Proposition 2.5].
We finish this section by pointing out that Theorem 3.2, Lemma 4.1 and Lemma 4.2 allow an alternative deduction of the zero characteristic case of the following known minimal generating system of :
Theorem 4.3**.**
For an infinite field of characteristic different from ,
[TABLE]
is a minimal homogeneous generating system of .
Proof.
As explained in the introduction, this is known by [2], [9], [10]. We give a new argument for the case .
Combining Theorem 3.2, Lemma 4.1, and Lemma 4.2 we get that the semi-invariants (2) generate the -algebra . Therefore to prove that (5) is already a generating system of , it is sufficient to show that is decomposable in for each .
Suppose to the contrary that . We may assume that . Denote by the subspace of spanned by the elements having multidegree . The symmetric group acts linearly on via -algebra automorphisms, namely a permutation sends to . Obviously both and are -invariant subspaces in . By Lemma 4.2 (i) we have that the image of in spans a one-dimensional -invariant subspace on which acts via its sign representation. In particular, the -module has the sign representation as a direct summand. This is a contradiction. Indeed, is the primitive central idempotent in the group algebra corresponding to the sign representation, and when .
To see minimality of this homogeneous generating system, note that the given generators are multihomogeneous with distinct multidegrees, so it is sufficient to show that each of them is indecomposable. The quadratic generators have minimal possible degree in , therefore they are indecomposable. The degree four generators are not contained in the subalgebra of generated by the quadratic generators. Indeed, suppose to the contrary that is contained in the subalgebra generated by the quadratic generators. Since the quadratic generators take the same value on and the transposed -tuple , the same holds then for . However, this is not the case, as one can easily check. ∎
5. -algebra generators
The algebra contains the subring , and contains the subring
[TABLE]
For an infinite field identify with in the obvious way (here and later the symbol stands for tensor product of -modules), and denote by
[TABLE]
Since can be interpreted as a ring of invariants of the affine group scheme over , we have
[TABLE]
(see the book of Jantzen [23, I.2.10] for the relevant material on invariants of group schemes). Even more, we have the following:
Proposition 5.1**.**
Let be an infinite field.
- (i)
The algebra is spanned as an -vector space by .
- (ii)
Any system of generators of the ring is mapped to an -algebra generating system of .
Proof.
For the elements in (3) belong to , and Theorem 3.2 asserts that for an arbitrary infinite field their images span as an -vector space. So (i) holds, and it implies (ii). ∎
The polynomial rings , are graded in the standard way (the variables have degree ), and the subrings , are generated by homogeneous elements. Below for a graded ring we whall write for the degree homogeneous component of .
Lemma 5.2**.**
Suppose that is a graded -submodule of such that spans for any infinite field . Then .
Proof.
Note that the factor -module is torsion-free. Indeed, if for some positive integer and , then . It follows that for any non-negative integer , is a finitely generated free -module. Therefore is a -module direct summand in , hence the embedding induces an isomorphism
[TABLE]
Now tensor with the short exact sequence
[TABLE]
to get the exact sequence
[TABLE]
The image of above is by assumption, which by Proposition 5.1 (i) and by (6) is . So is surjective, and thus the exactness of (7) means that . This holds for any infinite field .
Now is a finitely generated -module. The equality implies that is a torsion -module, hence it is a finite abelian group. We claim that it is zero. Assume on the contrary that it is non-zero, and let be a prime divisor of its order. Then for any field whose characteristic is , we have , a contradiction. So we have holds for all . This means that . ∎
Remark 5.3**.**
A similar argument was used by Donkin [15, Page 399] to get generators in the -form of the algebra of simultaneous conjugation invariants of -tuples of matrices.
As a special case of a general result of Seshadri [31] one gets that the ring is finitely generated. An explicit finite generating system is given below:
Theorem 5.4**.**
The ring is generated by the elements in (3) with , where .
Proof.
Denote by the subring of generated by the elements in (3) with , where . For any infinite field , is a subalgebra of . By Theorem 3.2 it contains the homogeneous components of of degree at most . Derksen and Makam [5] proved that these components generate the -algebra . Thus we have for any infinite field . Consequently, by Lemma 5.2 we conclude . ∎
Remark 5.5**.**
Similar arguments together with the results of [5], [7], [14] yield finite generating systems of the -forms of rings of semi-invariants on spaces of quiver representations. We omit the details.
Next we return to the special case :
Proof of Theorem 2.2. Denote by the subring of generated by the elements (2). The conditions of Lemma 5.2 hold for by Theorem 3.2, Lemma 4.1 and Lemma 4.2, hence by Lemma 5.2 we get that .
Minimality of this generating system follows from the minimality statement in Theorem 2.1 (which follows from [12]) and Proposition 5.1 (ii).
6. Minimal system of separating invariants
In this section we assume that the base field is algebraically closed. Following Derksen and Kemper [3], a subset of is called separating if for any , if there exists an with , then there exists an with .
Theorem 6.1**.**
Let be an algebraically closed field of arbitrary characteristic.
- (i)
The following is a separating system of :
[TABLE]
- (ii)
The above separating system is irredundant.
Proof of Theorem 6.1 (i). In the special case , is even a generating system by Theorem 2.1 and Theorem 4.3. Since , a general result of Draisma, Kemper and Wehlau [17] or Grosshans [20, Theorem 7] implies that the polarizations of a separating system in constitute a separating system in for an arbitrary . Now for the polarizations of the subset are contained in the subalgebra generated by .
In order to prove Theorem 6.1 (ii) it is helpful to recall a connection between the ring of matrix semi-invariants and another ring of invariants. Namely the group acts on by simultaneous conjugation: for and we set
[TABLE]
For consider the embedding
[TABLE]
where stands for the identity matrix.
Proposition 6.2**.**
[8, Proposition 4.1]** The comorphism of maps surjectively onto the algebra of simultaneous conjugation invariants of matrices.
This statement has the following immediate corollary (cf. [6, Proposition 3.2]):
Corollary 6.3**.**
Any separating system of -invariants on is mapped by to a separating system of simultaneous conjugation -invariants on .
Proof.
Let be a separating system. Suppose that for and there exists an with . By Proposition 6.2 there exists a with . Then we have , therefore there exists an with . Now separates and . ∎
For sake of completeness we recall that for a representation of a reductive group, two points can be separated by polynomial invariants if and only if the unique Zariski closed orbits in their orbit closures are different, see for example [19, Theorem A]. Therefore Corollary 6.3 follows from the following geometric refinement of Proposition 6.2. The group acts faithfully on via , hence we can form the asociated fibre bundle (see for example [1, Lemma 5.16] for the properties of associated fibre bundles needed below).
Proposition 6.4**.**
(special case of [14, Lemma 3.1]) Set
[TABLE]
The map
[TABLE]
is -invariant, and factors through an isomorphism
[TABLE]
In particular, the map induces a bijection between -invariant subvarieties in and -invariant subvarieties in . This bijection respects inclusions and Zariski closures, so restricts to a bijection between Zariski closed -orbits in and Zariski closed -orbits in .
Remark 6.5**.**
In a recent preprint Derksen and Makam [6] gave an algorithm that decides whether the -orbit closures of intersect.
We need the following observation of I. Kaygorodov, A. Lopatin and Y. Popov [24, Lemma 3.1]:
Lemma 6.6**.**
The following is an irredundant separating system of conjugation -invariants on :
[TABLE]
Note that we have
[TABLE]
Proof of Theorem 6.1 (ii). Assume to the contrary that omitting an element from we still have a separating system. Now depends only on at most matrix components. By symmetry we may assume that does not depend , so . Identify with the subspace of consisting of the tuples whose last matrix components are zero. Since all elements in vanish identically on the subspace in , we conclude that is a separating system in .
Suppose first that . Denote by the embedding (where is the identity matrix). Then (8) shows that is contained in the subalgebra of generated by the elements of degree at most two in the separating system given in Lemma 6.6. Therefore by Lemma 6.6, is not a separating system on with respect to the conjugation action of , contrary to Corollary 6.3.
If does not depend on all the four matrix components, say does not depend on , then we get that a proper subset of is a separating system on with respect to the action of . This is obviously a contradiction.
The main result of [24] can be obtained as a consequence of Theorem 6.1 (i) and Lemma 6.6:
Corollary 6.7**.**
(Kaygorodov, Lopatin and Popov [24, Theorem 1.1]) For an algebraically closed field of arbitrary characteristic and a positive integer , the set
[TABLE]
is an irredundant separating system of -invariants on .
Proof.
We have the equality
[TABLE]
Indeed, consider (1) in the special case , and substitute by the identity matrix :
[TABLE]
Now (8) gives (9), which implies by Theorem 6.1 (i) and Corollary 6.3 (applied for instead of ) that the set in our statement is indeed a separating system. Its irredundancy is an immediate consequence of Lemma 6.6. ∎
Remark 6.8**.**
We note that similarly to the proof of Theorem 5.4, using Lemma 5.2 one gets form the results of [15] and [5] that the -form of the -algebra of simultaneous conjugation invariants of matrices is generated by coefficients of the characteristic polynomials of monomials in the generic matrices , that have total degree at most .
7. Vector invariants of the special orthogonal group
Up to base change there is only one non-singular quadratic form on , namely the form (). The orthogonal group therefore can be defined as the subgroup of consisting of the linear transformations that preserve the determinant. The connected component of the identity in is an index two subgroup, called the special orthogonal group , and it is well known that , where is the representation of on given by . So the representation of on can be thought of as the sum of copies of the defining representation of , and the algebra is the so-called algebra of vector invariants of the special orthogonal group of degree .
Generators of the algebra have a uniform description in characteristic zero, and this is one of the theorems usually referred to as the First Fundamental Theorem of Invariant Theory, see [32]. The result extends verbatim (but with essentially different proof) for odd positive characteristic by [2]. However, in the case , for all even an indecomposable multilinear -invariant on was constructed in [13, Theorem 7]. This led Procesi [29, page 551] to write that ”in this case the invariant theory has to be deeply modified”. In particular, the following question is open:
Question 7.1**.**
Do the invariants constructed in [13] together with the quadratic invariants generate when ?
Now as an immediate consequence of Theorem 2.1 we get that the answer to Question 7.1 is affirmative in the special case :
Corollary 7.2**.**
Let be an algebraically closed field of characteristic . The elements given in [13, Theorem 7] together with the usual quadratic invariants constitute a minimal generating system of the algebra of vector invariants of the special orthogonal group on .
Moreover, formally the same elements as in Corollary 7.2 generate the appropriate -form of by an argument similar to the proof of Theorem 2.2.
8. Note added after publication
Lopatin in [27, Theorems 2.3 and 2.20] determines a minimal system of generators of the algebra of semi-invariants of an arbitrary quiver with a dimension vector taking value at each vertex of the quiver, over an arbitrary infinite base field. In particular, in [27, Lemma 8.6] Lopatin gives an explicit minimal generating system of semi-invariants of quivers having two vertices with dimension vector taking value at both vertices. So as a special case of Lopatin’s result, a minimal generating system of the algebra of semi-invariants of matrices was known prior to our Theorem 2.1 also in characteristic .
Theorem 2.1 is obtained by different calculations than the corresponding result in [27], and the other topics of the present paper (separating invariants and the connection to the orthogonal group) are not discussed in [27].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] K. Bongartz, Some geometric aspects of representation theory, Algebras and Modules I, CMS Conf. Proc. 23 (1998), 1-27, (I. Reiten, S.O. Smalø, Ø. Solberg, editors).
- 2[2] C. De Concini and C. Procesi, A characteristic free approach to invariant theory, Adv. Math. 21 (1976), 330-354.
- 3[3] H. Derksen, G. Kemper, Computational Invariant Theory, Encyclopaedia of Mathematical Sciences 130, Springer-Verlag, 2002.
- 4[4] H. Derksen and V. Makam, Polynomial degree bounds for matrix semi-invariants, Adv. Math. 310 (2017), 44-63.
- 5[5] H. Derksen and V. Makam, Generating invariant rings of quivers in arbitrary characteristic, J. Algebra 489 (2017), 435-445.
- 6[6] H. Derksen and V. Makam, Algorithms for orbit closure separation for invariants and semi-invariants of matrices, ar Xiv:1801.02043
- 7[7] H. Derksen and J. Weyman, Semi-invariants of quivers and saturation for Littlewood-Richardson coefficients, J. Amer. Math. Soc. 13 (2000), 467-479.
- 8[8] M. Domokos, Relative invariants of 3 × 3 3 3 3\times 3 matrix triples, Lin. Multilin. Algebra 47 (2000), 175-190.
