Invariant bilinear forms on $W$-graph representations and linear algebra over integral domains
Meinolf Geck, J\"urgen M\"uller

TL;DR
This paper introduces a new algorithmic method for computing invariant bilinear forms on W-graph representations, enhancing the computational tools available for Lie-theoretic structures like those of type E8.
Contribution
A novel algorithm for efficiently computing invariant bilinear forms on W-graph representations, facilitating advanced analysis in Lie theory and related algebraic structures.
Findings
Effective computation of invariant bilinear forms achieved
Algorithm applied successfully to complex Lie-theoretic structures
Enhanced tools for studying decomposition numbers in Hecke algebras
Abstract
Lie-theoretic structures of type (e.g., Lie groups and algebras, Hecke algebras and Kazhdan-Lusztig cells, ...) are considered to serve as a `gold standard' when it comes to judging the effectiveness of a general algorithm for solving a computational problem in this area. Here, we address a problem that occurred in our previous work on decomposition numbers of Iwahori-Hecke algebras, namely, the computation of invariant bilinear forms on so-called -graph representations. We present a new algorithmic solution which makes it possible to produce and effectively use the main results in further applications.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Algebra and Geometry · Algebraic Geometry and Number Theory · Algebraic structures and combinatorial models
∎
11institutetext: Meinolf Geck 22institutetext: IAZ-Lehrstuhl für Algebra, Universität Stuttgart, Pfaffenwaldring 57, 70569 Stuttgart, Germany 22email: [email protected] 33institutetext: Jürgen Müller 44institutetext: Arbeitsgruppe Algebra und Zahlentheorie, Bergische Universität Wuppertal, Gauß-Straße 20, 42119 Wuppertal, Germany 44email: [email protected]
Invariant bilinear forms on
-graph representations and
linear algebra over integral domains
Meinolf Geck and Jürgen Müller
Abstract
Lie-theoretic structures of type (e.g., Lie groups and algebras, Iwahori–Hecke algebras and Kazhdan–Lusztig cells, ) are considered to serve as a “gold standard” when it comes to judging the effectiveness of a general algorithm for solving a computational problem in this area. Here, we address a problem that occurred in our previous work on decomposition numbers of Iwahori–Hecke algebras, namely, the computation of invariant bilinear forms on so-called -graph representations. We present a new algorithmic solution which makes it possible to produce and effectively use the main results in further applications.
1 Introduction
This paper is concerned with the representation theory of Iwahori–Hecke algebras. Such an algebra is a certain deformation of the group algebra of a finite Coxeter group . In myedin , the notion of “balanced representations” of was introduced, which has turned out to be useful in several applications. We mention here the construction of cellular structures on (see, e.g., (geja, , Chap. 2)), the determination of decomposition numbers of (see gemu ), and the computation of Lusztig’s function (see (geha, , §4)). To check whether a given representation of is balanced or not is a computationally hard problem; it involves the construction of a certain invariant bilinear form on the underlying -module. It has been conjectured in myedin that so-called “-graph representations” of are always balanced. But even if such a theoretical result were known to be true, certain applications (e.g., the determination of decomposition numbers) would still require the explicit knowledge of the Gram matrices of the invariant bilinear forms. In this paper, we discuss algorithms for the construction of these Gram matrices for of exceptional type. The biggest challenge—by far—is the case where is of type . (The distinguished role of when it comes to performing explicit computations is highlighted in various recent survey articles; see, e.g., Garibaldi gari , Lusztig shaw , Vogan VoE8 ).
In the situations of interest to us, the algebra is defined over the field of rational functions (where is an indeterminate); it has a natural basis . Explicit models for the irreducible representations of are known by the work of Naruse Naruse0 , Howlett and Yin How , HowYin . Now let us fix an irreducible matrix representation . In order to show that is balanced, one needs to determine a non-zero symmetric matrix such that
[TABLE]
this matrix then has to satisfy certain additional properties. Thus, the computation of essentially amounts to solving a system of linear equations; for theoretical reasons, we know that this system has a unique solution up to multiplication by a scalar. Rescaling a given solution by a suitable non-zero polynomial in , we can assume that all entries of are in and that their greatest common divisor is ; then is unique up to sign and is called a “primitive Gram matrix”. The general theory also shows that a particular solution is given by
[TABLE]
Thus, if the matrices () are known and if is not too large, then we can simply perform the above summation and obtain ; rescaling yields a primitive Gram matrix . This procedure works for types , , for example.
Already for type , one needs to use a more sophisticated approach as described in (gemu, , §4.3), based on Parker’s “standard basis algorithm” parker1 , in combination with interpolation and modular techniques. This also works for type , but it is efficient only for irreducible representations of dimension up to about . In our previous work on decomposition numbers, this was sufficient to obtain the desired results for type ; see (gemu, , Remark 4.10). In principle, one could have run the above procedure on all irreducible representations of type , but experiments showed that this would have needed a total of nearly one year of CPU time. On the other hand, from a strictly logical point of view, one does not need to know exactly how the Gram matrices have been obtained, because as an independent verification one can simply check that they form a solution to the above system of linear equations. However, to store the various primitive Gram matrices requires about GB of disk space, and even the verification alone is a major task as it involves the computation of products of (large) matrices with polynomial entries. — In any case, this raises a serious issue of making sure that our results are reliable and reproducible.
In our view, the solution to deal with this issue is to develop better mathematical tools which make it possible to reproduce the results efficiently as needed, and this is what we will do in this paper. Indeed, for example, in order to deal with the irreducible representation of largest dimension for type (which is ), the old approach would have needed roughly seven weeks of CPU time, while the one described here requires only about hours, which amounts to a factor of almost . (See Section 9.1 for more details.) In view of the complexity of the task, and the experiences made elsewhere with explicit computations in type (see the references cited above), it was clear that developing efficient methods would not be a standard, let alone press-button application of existing tools from computer algebra. Maier et al. mllt proposed an approach based on parallel techniques, but type still seems to be a major challenge there. Hence one of the purposes of this paper is to give a systematic description of the (serial) methods we have used for the computation of Gram matrices of invariant bilinear forms for Iwahori–Hecke algebras.
The basic strategy in our approach is to reduce computational linear algebra over the Laurent polynomial ring to linear algebra over the integers. Thus, generally speaking, we are faced with the problem of devising efficient tools to do computational linear algebra over integral domains, not just over fields. In order to do so, we build on general ideas from computational representation theory, more precisely on the celebrated so-called MeatAxe philosophy parker1 , which comprises of specially tailored, highly efficient techniques for computational linear algebra over (small) finite fields. Attempts to generalize these ideas to linear algebra over the (infinite) field of rational numbers, and further to linear algebra over the integers have been coined the IntegralMeatAxe parker2 . The last word on this has not been said yet, and in this paper we are trying to contribute here as well. (As future work, we are planning to develop a full IntegralMeatAxe package along the present lines.) But we are additionally going one step further by setting out to extend these ideas to linear algebra over the univariate polynomial rings over the rationals or the integers.
To do so, the basic idea is to reduce to linear algebra over the integers by evaluating polynomials with rational coefficients at integral places, where we are using as few “small” places as possible, and to recover the polynomials in question by a Chinese remainder technique. Hence this strategy, fitting nicely into the IntegralMeatAxe philosophy, differs from those known to the literature, inasmuch we are neither using modular methods (which would mean to go over to polynomial rings over finite fields), nor are we in a position to use interpolation (which would mean to use lots of places to evaluate at). Thus another purpose of this paper is to give a detailed description of the new computational tasks arising in pursuing this strategy, and how we have accomplished them. Although the choice of the material presented is governed by our application to Iwahori–Hecke algebras, it is exhibited with a view towards general applicability.
Here is an outline of the paper: In Section 2 we recall some basic facts about representations of finite Coxeter groups and Iwahori–Hecke algebras, in particular the notions of -graphs, balancedness, and invariant bilinear forms. We conclude with Theorem 2.10 saying that for the representations afforded by the -graphs given by Naruse Naruse0 , Howlett and Yin How , HowYin are actually balanced, and in Tables LABEL:Mmaxd0 and LABEL:Mmaxd we list some numerical data associated with their primitive Gram matrices.
In the subsequent sections we describe our general approach towards linear algebra over integral domains, which consists of a cascade of steps: In Section 3 we first deal with linear algebra over . We discuss the key tasks of rational number recovery and of finding integral linear dependencies. Both tasks are known to the literature, but for the former we provide a variant containing a new feature, while for the latter we proceed along another strategy, within the IntegralMeatAxe philosophy. Subsequently, we apply this to computing nullspaces, inverses, and the so-called “exponents” of matrices over . In Section 4 we then describe our general approach to deal with polynomials, in view of our aim to do linear algebra over polynomial rings. The key task is to recover a polynomial with rational coefficients from some of its evaluations at integral places. Here, we are aiming at using as few “small” places as possible, whence we are not in a position to apply interpolation, but we are using a Chinese remainder technique instead. Moreover, we devise a method to recover a polynomial from some of its evaluations where the latter are “rescaled” by unknown scalars; the necessity of being able to solve this task is closely related to our use of the IntegralMeatAxe, hence to our knowledge this method is new as well. In Section 5 we proceed to show how linear algebra over and polynomial recovery, as discussed in earlier sections, can now be combined to do linear algebra over and , by devising methods to computing nullspaces, inverses, exponents and products of matrices using this new approach. In Section 6 we finally recall the “standard basis algorithm” originally developed in parker1 for computations over finite fields. We present a general variant for absolutely irreducible matrix representations over an arbitrary field, show how this can be used to compute homomorphisms between such representations, and discuss how the necessary computations are facilitated over the fields and , using the tools we have developed.
Having the general tools in place, in Section 7 we return to our particular application of computing Gram matrices of invariant bilinear forms for -graph representations of Iwahori–Hecke algebras. We proceed along the strategy which has already been indicated in (gemu, , Section 4.3), where here we take the opportunity to provide full details. We begin by computing standard bases for the representations and , where the latter is given by , for . In order to find suitable seed vectors to start with, we use an observation on restrictions of representations of Iwahori–Hecke algebras to parabolic subalgebras, which naturally leads to certain distinguished elements of having actions of co-rank one on and . To actually run the standard basis algorithm subsequently, we again revert to a specialization technique. In Section 8 we proceed by collecting a few observations on the standard bases and of the representations and thus obtained. Indeed, the matrix entries occurring seem to be much less arbitrary than expected from general principles, but this has only been verified experimentally for the representations under consideration here, while a priori proofs are largely missing (so far). The final computational step then essentially is to determine the product , which up to rescaling is a Gram matrix as desired. To do this efficiently, apart from the general tools developed above, we make heavy use of the special form of the matrix entries of just mentioned. In the concluding Section 9 we provide running times and workspace requirements for our computations in types and , and present an explicit (tiny) example for type .
It should be clear from the above description that to pursue our novel approach we had to solve quite a few tasks for which there was no pre-existing implementation, let alone in one and the same computer algebra system. To develop the necessary new code, as our computational platform we have chosen the computer algebra system GAP GAP . This system provides efficient arithmetics for the various basic objects we need: (i) rational integers and rational numbers, which in turn are handled by the GMP library GMP ; (ii) row vectors and matrices over the integers, the rationals or (small) finite fields, where in this context the entries of row vectors are actually treated as immediate objects; (iii) floating point numbers, where the limited built-in facilities are sufficient for our purposes. Moreover, the necessary input data on Iwahori–Hecke algebras and their representations is provided by the computer algebra system CHEVIE jmich , which conveniently is a branch of GAP.
[TABLE]
2 Iwahori–Hecke algebras and balanced representations
We begin by recalling some basic facts about representations of finite Coxeter groups and Iwahori–Hecke algebras; see gepf , geja , Lusztig03 for further details.
We fix a finite Coxeter group with set of simple reflections ; for , we denote by the length of with respect to . Let be a weight function as in Lusztig03 , that is, we have whenever satisfy . Such a weight function is uniquely determined by its values for . We will assume throughout that
[TABLE]
Let be a subring and be the ring of Laurent polynomials over in the indeterminate . Let be the corresponding generic Iwahori–Hecke algebra. Thus, is an associative -algebra which is free over with a basis ; the multiplication is given by the following rule, where and :
[TABLE]
Let be the field of fractions of and assume that is a splitting field for . (For example, we could take since is known to be a splitting field for .) Let be the set of simple -modules (up to isomorphism); we shall use the following notation:
[TABLE]
where is a finite index set. Let be the field of fractions of and be the -algebra obtained by extension of scalars from to . Then is a split semisimple algebra and there is a bijection between and , the set of simple -modules (up to isomorphism). Given , we denote by a simple -module corresponding to . Then is uniquely determined (up to isomorphism) by the following property. For , we have
[TABLE]
The algebra is symmetric, with trace form given by and for . The basis dual to is given by . By the general theory of symmetric algebras, there are well-defined elements ( such that the following orthogonality relations hold for :
[TABLE]
As observed by Lusztig, we can write each uniquely in the form
[TABLE]
where is a strictly positive real number and is a non-negative integer. The “-invariants” will play a major role in the sequel; these numbers are explicitly known for all types of and all choices of (see (geja, , §1.3), (Lusztig03, , Chap. 22)). Alternatively, can be characterized as follows:
[TABLE]
Let be the localization of in the prime ideal , that is, consists of all fractions of the form where and . Let be a matrix representation afforded by . Following myedin , we say that is balanced if
[TABLE]
This concept plays a crucial role in the study of “cellular structures” on (see myedin ) and the determination of Kazhdan–Lusztig cells (see (geha, , §4)). It is known that every affords a balanced representation. Note that, given some matrix representation afforded by , the above condition is hard to verify since it involves representing matrices for all . Much better for practical purposes is the following condition.
Proposition 2.5 (See (myedin, , Prop. 4.3, Remark 4.4))
Assume that . Let and be a matrix representation afforded by . Then is balanced if and only if there exists a symmetric matrix such that
[TABLE]
Remark 2.6
Note that, if a matrix satisfies (), then it immediately follows that
[TABLE]
Thus, is the Gram matrix of a symmetric bilinear form which is -invariant in the sense that
[TABLE]
Remark 2.7
Assume that . Let and be a matrix representation afforded by . Let be the set of all such that for . Since is irreducible, Schur’s Lemma implies that all matrices in are scalar multiples of each other. By (geja, , Remark 1.4.9), there is a specific element given by
[TABLE]
furthermore, we have . By the Schur Relations (see (gepf, , 7.2.1)), we have
[TABLE]
Using the relation for all , we deduce that
[TABLE]
This provides a direct criterion for checking if a given matrix equals . Furthermore, if is an element of , then for some and so .
The following concept was introduced by Kazhdan–Lusztig KaLu in the equal parameter case (where for all ); for the general case see (geja, , §1.4).
Definition 2.8
Let be an -module with . We say that is afforded by a -graph if there exist
- •
a basis of ,
- •
subsets for ,
- •
and elements , where and ,
such that the following hold. First, we require that
[TABLE]
Furthermore, for , the action of on is given by
[TABLE]
Thus, if is afforded by a -graph representation, then the action of on is given by matrices of a particularly simple form.
It has been conjectured in myedin (see also (geja, , 1.4.14)) that, if the simple -module is afforded by a -graph, then the corresponding matrix representation is balanced. We now turn to the problem of explicitly verifying if a given irreducible matrix representation of is balanced or not.
We shall assume from now that is a finite Weyl group and that we are in the equal parameter case where for all ; we may take , in the above discussion. (The remaining cases have been dealt with in (myedin, , Examples 4.5, 4.6).) It is known that every simple -module is afforded by a -graph; see (geja, , Theorem 2.7.2) and the references there. As far as of exceptional type is concerned, such -graphs have been determined explicitly, by Naruse Naruse0 , Howlett and Yin How , HowYin . They are available in electronic form through Michel’s development version of the CHEVIE system; see jmich . Now let us fix and assume that is a corresponding representation afforded by a -graph. Concretely, this will mean that we are given the collection of matrices . Our aim is to find a matrix such that
[TABLE]
This is a system of homogeneous linear equations for the unknown entries of . (Recall that is symmetric.) We know that is uniquely determined up to scalar multiples. Rescaling a given solution by a suitable non-zero polynomial in , we can assume that all entries of are in and that their greatest common divisor is ; then is unique up to a sign. Such a solution will be called a primitive Gram matrix for . As in 2.7, a specific solution can be singled out by the condition that . We claim that
- •
the matrix has entries in , and
- •
the non-zero entries of have degree at most .
Here, denotes the longest element of . Indeed, since all the entries of the matrices () are in , the same will be true for as well. The formulae in 2.8 show that each matrix () has entries in . Hence, all matrices have entries in and so has entries in . Furthermore, the non-zero entries of each matrix have degree [math], or . This yields the degree bound for the entries of .
Since the entries of are integer polynomials of bounded degree, we can determine by interpolation and modular techniques (Chinese remainder). Combining this with the techniques described in (gemu, , §4.3), one obtains an algorithm which can be implemented in GAP in a straightforward way. Rescaling these matrices by suitable non-zero polynomials in , we obtain primitive Gram matrices as solutions of (). This approach readily produces primitive Gram matrices for of type , and in a few hours of computing time. As was already advertised in Section 1, we also succeeded in obtaining primitive Gram matrices for type , where it is one of the purposes of this paper to describe the methods involved.
Tables LABEL:Mmaxd0 and LABEL:Mmaxd contain some information about these primitive Gram matrices :
[TABLE]
We note that the primes in the 5th column are so-called “bad primes” for (as in (geja, , 1.5.11)). In particular, the fact that always has a non-zero determinant means that (see Proposition 2.5). Thus, we can conclude:
Theorem 2.10
Let be of type , , or and for all . Then the -graph representations of Naruse Naruse0 , Howlett and Yin How , HowYin are balanced.
3 Linear algebra over the integers
As was already mentioned in Section 1, the basic strategy of our approach to determine Gram matrices of invariant bilinear forms for representations of Iwahori–Hecke algebras is to reduce computational linear algebra over the polynomial rings or , where from now on denotes our favorite indeterminate, to computational linear algebra over the integers . Thus in this section we begin by describing how we deal with matrices over , where we restrict ourselves to the aspects needed for our present application.
Let us fix the following convention: For , not both zero, let denote the positive greatest common divisor of and . A vector , where , is called primitive, if actually , and for the greatest common divisor of its entries we have . Clearly greatest common divisor computations in yield a -multiple of which is primitive. Similarly, a matrix , where , is called primitive, if actually , and for the greatest common divisor of its entries we have .
Continued fractions and the Euclidean algorithm. The first computational task we are going to discuss, in Section 3.4 below, is rational number recovery. This has been discussed in the literature at various places, see for example dixon ; mon ; parker2 or (vzG, , Section 5.10). (We also gratefully acknowledge additional private discussions with R. Parker on this topic.) Although the ideas pursued in these references are closely related to ours, none of them completely coincides with our approach, and proofs (if given at all) are not too elucidating. Hence we present our approach in detail, for which we need a few preparations first:
Continued fraction expansions. We recall a few notions from the theory of continued fraction expansions; as a general reference see for example (hw, , Chapter 10): Given such that , let
[TABLE]
be its (regular) continued fraction expansion, where and for . This is obtained by letting , and, as long as , proceeding recursively with instead of . This process terminates, after steps say, if and only if ; otherwise we let . Truncating at yields the -th convergent of , hence we may write , where such that and . Letting additionally and , as well as and , for we get by induction
[TABLE]
Hence the sequences and are strongly increasing.
Now let , where . Then the continued fraction expansion of can be computed by the extended Euclidean algorithm, see (cohen, , Algorithm 1.3.6), as follows: Setting and , for let recursively and
[TABLE]
where is defined by but ; actually we have for , and of course . Hence the sequence has non-negative entries and is strongly decreasing. Moreover, setting and , as well as and , and for letting recursively
[TABLE]
we get . Then it is immediate by induction that and , for , and hence
[TABLE]
Hence the sequences and have positive entries and are strongly increasing. Finally, a direct computation yields
[TABLE]
Another view on the Euclidean algorithm. For we consider the -lattice
[TABLE]
Then we have , and it is immediate that is an element of if and only if . Note that if is primitive, then we necessarily have . Moreover, the extended Euclidean algorithm shows that , for all . We collect a few properties of :
Lemma 3.2
(a) For all we have .
(b) We have , where , if and only if .
Proof
We first show that whenever such that , for some , then : We may assume that . Let such that
[TABLE]
where we may assume that , which entails as well. Since , this implies . Since the sequence has positive entries, we get , as asserted.
(a) We may assume that . Moreover, for letting , it is immediate from that divides . Hence we may assume , too. Then let be a divisor of such that . Then we have and , contradicting the statement above.
(b) It follows from (a) that there are such that and . Hence we get , and since the sequence has non-negative entries and is strongly decreasing, we infer and . ∎
Note that the statement in (b) is trivial if is primitive, that is . But this is not always fulfilled, as the example in (vzG, , Example 5.27) shows.
Proposition 3.3
(a) Let such that and . Then we have , for a unique . In particular, if is primitive then we have or .
(b) Assume there is such that . Then there is a unique such that , and the shortest non-zero elements of are precisely and .
Proof
(a) Since there is such that . Then we have
[TABLE]
Thus by Legendre’s Theorem, see (hw, , Section 10.15, Theorem 184), we infer that occurs as a convergent in the continued fraction expansion of , that is, there is such that . This yields
[TABLE]
Hence we have , and thus from Lemma 3.2 we get , together with the uniqueness statement.
(b) Assume first that , then by Lemma 3.2 we infer that divides , and hence , a contradiction. Hence we have . Moreover, from we get , hence from (a) we see that there is such that . Thus in particular we have .
In order to show uniqueness, and the statement on shortest elements, let such that . Then, as above, there is such that , hence in particular we have . Then Hadamard’s inequality, see (vzG, , Theorem 16.6), implies that
[TABLE]
Since divides \det\Big{(}\begin{bmatrix}s_{i}&r_{i}\\ s_{j}&r_{j}\\ \end{bmatrix}\Big{)} this entails , and hence by Lemma 3.2. ∎
A comparison of the above treatment with the references already mentioned seems to be in order: The statement of Proposition 3.3(a) is roughly equivalent to (dixon, , Theorem) and (mon, , Theorem 1), respectively. Alone, the proof given in dixon appears to be too concise, and provides a slightly worse bound for to be large enough. And (mon, , Theorem 1) is attributed in turn to davguywan , while for a proof the reader is referred to vzG . Unfortunately, (vzG, , Theorem 5.26) is not immediately conclusive for the statements under consideration here.
The main difference between the above-mentioned approaches and ours is the break condition used to actually determine the index referred to in Proposition 3.3(a): In davguywan ; dixon ; vzG a bound on the residues is used, while in (mon, , Section 3) the quotients are considered instead (yielding a randomized algorithm). In contrast, in our decisive Proposition 3.3(b) we are using the minimum of the lattice , which hence treats both the and (in other words the the unknown numbers and ) on a “symmetric” footing. To our knowledge, this point of view is new, its algorithmic relevance being explained below.
Recovering rational numbers. We are now prepared to describe our first computational task, which will appear both in computations over in Section 3.5, and over the polynomial ring in Section 4.2:
Let and such that . Assume we are given such that and ; note that since is invertible modulo we may write instead, which we will feel free to do if convenient. Now, if is large enough compared to and , the task is to recover from its congruence class .
In view of Proposition 3.3(b), this is straightforward: Assuming that , the -lattice has precisely two shortest non-zero elements, namely the primitive elements . In other words, the rational number can be found by computing a shortest non-zero element of . This in turn can be done algorithmically by the Gauß reduction algorithm for -lattices of rank , see (cohen, , Algorithm 1.3.14). Moreover, compared to the general case, for the particular lattice we have a better break condition: We may stop early as soon as we have found an element such that . If then is primitive, the rational number fulfills all assumptions made, where of course its correctness has to be verified independently. Otherwise, if is not primitive, or the shortest element found fulfills , then we report failure. Thus, in practice, we choose small, and rerun the above algorithm with increasing, until we find a valid candidate passing independent verification.
At this stage, we should point out the algorithmic advantage of our approach, compared to the other ones mentioned: The latter refer to the convergents of continued fraction expansions, and thus to the full sequence of non-negative residues of the extended Euclidean algorithm. In contrast, the Gauß reduction algorithm to find a lattice minimum proceeds by iterated pair reduction, starting with the pair and . Although this is essentially equivalent to running the extended Euclidean algorithm on and , here we are allowed to use best approximation. This amounts to using numerically smallest residues, instead of non-negative ones as was necessary in the context of continued fraction expansions. Although we have not carried out a detailed comparison, it is well-known that this saves a non-negligible amount of quotient and remainder steps.
Finding linear combinations. We are now going to describe the basic task we are faced with in order to be able to do computational linear algebra over . To do so, we of course avoid the Gauß algorithm over , but we also do not refer to pure “lattice algorithms”, as they are called in (cohen, , Section 2.1), for example those to compute Hermite normal forms or reduced lattice bases described in (cohen, , Section 2.4–2.7). Instead, we use a modular technique, which is a keystone to make use of the ideas of the MeatAxe in the framework of the IntegralMeatAxe. To our knowledge, this has only been discussed very briefly in the literature, for example in dixon ; parker2 . Moreover, our approach differs from those cited, at least in detail; in particular, dixon only allows for regular square matrices.
To describe the computational task, we again need some preparations first: Given a (rectangular) matrix , with -linearly independent rows , where , let
[TABLE]
be the -lattice spanned by the rows of , and let be its pure closure in , that is the smallest pure -sublattice of containing . Then the index is finite; of course, if then we have . Thus for any vector , we have if and only of there is such that ; in this case, if is chosen minimal then it divides .
Now, given , the task is to decide whether or not , and if this is the case to compute and such that and
[TABLE]
in this case and the are uniquely determined.
The -adic decomposition algorithm. To do so, we choose a (large) prime . Then reduction modulo yields the matrix over the prime field . We assume that the rows of are -linearly independent as well; otherwise we choose another prime . By the structure theory of finitely generated modules over principal ideal domains, this condition is equivalent to saying , which in turn is equivalent to not dividing . In particular, the independence condition on is fulfilled for all but finitely many primes .
Thus we have if and only if , solving the decision problem. Furthermore, if then set , and for proceed successively as follows: Since , there are such that for all , and
[TABLE]
Then we let
[TABLE]
Hence we have as well, and we may recurse. This yields
[TABLE]
or equivalently
[TABLE]
Thus, if , or equivalently , then since there is some such that , implying that , for all , without further independent verification necessary. Otherwise, if , then applying rational number recovery for some large enough, see Section 3.4, reveals the vector ; note that under the assumptions made does not divide . In the latter case correctness is independently verified by computing and checking whether it equals .
Modular computations. In practice, to check for -linear independence, and to compute the vectors we use ideas taken from the MeatAxe. In particular, in order to keep the depth needed smallish, but still to be able to make efficient use of fast arithmetic over small finite prime fields, we choose the prime amongst the largest primes smaller than . (In our application we for example use as the default prime.)
Nullspace. In the framework of the IntegralMeatAxe there is a general method to compute a -basis of the row kernel of a matrix with entries in , see parker2 . But in view of the application to row kernels of matrices over in Section 5.1, here we only deal with the following restricted nullspace problem:
Given a matrix , where , such that , where denotes the row kernel of , compute a primitive vector such that ; then is unique up to sign.
To do so, by going over to a suitable -multiple we may assume that . Let be the rows of . We may assume that , since otherwise we trivially set . Then for we successively check, using the -adic decomposition algorithm in Section 3.5, whether or not . If this is not the case, that is is -linearly independent, then if turns out to be -linearly independent we increment , while otherwise we return failure in order to choose another prime . If is -linearly dependent, then the -adic decomposition algorithm returns and such that and . Thus is primitive.
Inverse. Matrix inversion over , from the point of view of reducing to computations over as much as possible, can be formulated as the following task:
Given a matrix , where , such that , compute and , such that and the overall greatest common divisor of the entries of and equals ; then is unique.
To do so, by going over to a suitable -multiple we may assume that . Then the equation , where denotes the identity matrix, implies that divides , and hence is necessarily primitive. Solving the equations , for the unknown matrix , amounts to writing the rows of the identity matrix as -linear combinations of the rows of , which is done using the -adic decomposition algorithm in Section 3.5; recall that the rows of indeed are assumed to be -linearly independent.
The exponent of a matrix. Given a square matrix such that as above, the number found in the expression , where is chosen to be primitive, turns out to have another interpretation:
Let be the -span of the rows of . By the structure theory of finitely generated modules over principal ideal domains, the annihilator of the -module is a non-zero ideal of , the positive generator of which is called the exponent of . Moreover, divides , which in turn divides some power of . Thus the prime divisors of are precisely the primes such that is not invertible.
Now, actually and coincide: From we conclude that , hence divides ; conversely, since there is such that , implying that , which by the primitivity of shows that divides . In other words, computing the inverse of as described in Section 3.7 also yields a method to compute .
4 Computing with polynomials
Having the necessary pieces of linear algebra over the integers in place, in this section we describe computational aspects of single polynomials, before we turn to linear algebra over polynomials rings in Section 5.
Polynomial arithmetic. As our general strategy is to use linear algebra over or to do linear algebra over or , for all arithmetically heavy computations we recurse to or . Consequently, for the remaining pieces of explicit computation in or we may use a simple straightforward approach:
We use our own standard arithmetic for polynomials over or , where a polynomial is just represented by its coefficient list of length , where . Thus we avoid structural overhead as much as possible, and may use directly the facilities to handle row vectors provided by GAP. But we would like to stress that this is just tailored for our aim of doing linear algebra over polynomial rings, and not intended to become a new general-purpose polynomial arithmetic. For example, we are not providing asymptotically fast multiplication, as is for example described in (vzG, , Section 8.3).
In particular, we only rarely need to compute polynomial greatest common divisors. Hence we avoid sophisticated (modular) techniques, as are for example described and compared in (vzG, , Chapter 6), but we are content with a simple variant of the Euclidean algorithm: Assuming that the operands have integral coefficients, by going over to -multiples if necessary, in order to avoid coefficient explosion we just use denominator-free pseudo-division as described in (cohen, , Algorithm 3.1.2), and Collins’s sub-resultant algorithm given in (cohen, , Algorithm 3.3.1), albeit the latter without intermediate primitivisation.
On the other hand, we very often have to evaluate polynomials at various places, where our strategy is to use as few of these specializations as possible, so that evaluation at distinct places is done step by step. Thus we are not in a position to use multi-point evaluation techniques, as are for example described in (vzG, , Section 10.1). Hence we are just using the Horner scheme, which under these circumstances is well-known to need the optimal number of multiplications.
We now describe the special tasks needed to be solved in our approach:
Recovering polynomials. The aim is to recover a polynomial with rational coefficients, which we are able to evaluate at arbitrary integral places, from as few such evaluations (at “small” places) as possible. More precisely:
Let be a polynomial of degree , having coefficients , where and such that . Then the task is to find pairwise coprime places , for some (small) , such that the degree and the coefficients of can be computed from the values alone. Note that, in particular, we do not assume that , so that polynomial interpolation is not applicable. (Actually, in our application we often enough have , where for example , but .)
To this end, let , and assume that we have and for all . Hence the congruence classes and are well-defined, and for the constant coefficient of we get
[TABLE]
Thus by the Chinese Remainder Theorem, see for example (cohen, , Theorem 1.3.9), there is a unique congruence class , where , such that . To compute , we let such that
[TABLE]
An application of Chinese remainder lifting in to the congruence classes yields the congruence class , and by our choice of applying rational number recovery as described in Section 3.4 reveals . Now we recurse to , whose value at the place can of course be determined directly from as .
Chinese remainder lifting. Hence, apart from rational number recovery, the key computational task to be solved is to perform Chinese remainder lifting in :
We are using the straightforward approach based on the extended Euclidean algorithm, as is described in (cohen, , Section 1.3.3). Since we are computing many lifts with respect to the same places , we make use of a precomputation step, as in (cohen, , Algorithm 1.3.11). But, since again for reasons of time and memory efficiency we are choosing small places , the specially tailored approach in (cohen, , Algorithm 1.3.11) to keep the intermediate numbers occurring small, at the expense of needing more multiplications, does not pay off as experiments show. Moreover, as we are computing the values for step by step, where even the number of places is not determined in advance, we cannot take advantage of fast Chinese remainder lifting techniques, as are described for example in (vzG, , Section 10.3), either.
Our strategy is to rerun the above algorithm with increasing, choosing small integral , and to discard quickly erroneous guesses by an independent verification, until the correct answer passing the verification is found. By the above discussion, this happens after finitely many iterations. Before that, if is too small, or not coprime to all the denominators , the Chinese remainder lifting process does not terminate, or it terminates with a wrong guess. To catch the first case, we impose a degree bound, and stop the lifting process with a failure message if it is exceeded, in order to increment . (In our application, turned out to be a suitable degree bound in all cases.)
To catch the second case, we only allow for denominators dividing an imposed bound. This is justified, since rational number recovery as described in Section 3.4 is a trade-off between finding the numerator and the denominator of the rational number to be reconstructed: In practice, we typically encounter small denominators and large numerators , which escape the Gauß reduction algorithm if is chosen too small, since then the latter tends to return a larger denominator and a smaller numerator . (In our application, denominator bounds such as small -powers, or , or turned out to be sufficient in all cases.)
Degree detection. We keep the setting of Section 4.2. The technique to be described now has arisen out of an attempt to determine the degree of without determining its coefficients. Actually, it deals with the following more general situation (whose relevance for our computations will be explained in Section 4.5 below):
Assume that instead of the values we are only able to compute “rescaled values” , with scalar factors such that , which are only known to come from a finite pool of positive rational numbers associated with . Thus the task now becomes to find and coprime places as above, allowing to determine up to some positive rational scalar multiple, that is to find , for some such that ; note that this also determines all the quotients .
To this end, we let be the complex roots of , and set . Moreover, since is a finite set, we have
[TABLE]
Now, let , and for the places we additionally assume that
[TABLE]
hence, in particular, the are non-zero and have the same sign. The necessity of these choices will become clear below. But this forces us to show that for all and all and there actually exist pairwise coprime integers such that and \ln\big{(}\frac{b_{k}}{b_{1}}\big{)}<\delta. Indeed, we are going to show that the latter can always be chosen to be primes (where the mere existence proof to follow is impractical, but in practice considering small primes works well, see Example 4.4):
Let be the sequence of all primes exceeding , and assume to the contrary that for all -subsets thereof, say, we have \ln\big{(}\frac{q_{k}}{q_{1}}\big{)}\geqslant\delta. Then we have , and thus , for all . Using the prime number function this implies
[TABLE]
From this we get
[TABLE]
contradicting the Prime Number Theorem, see (hw, , Section 1.8, Theorem 6), saying that .
Growth behavior of polynomials. We now consider the growth behavior of the polynomial . For we have
[TABLE]
implying
[TABLE]
Thus, for , by the mean value theorem for derivatives there is such that
[TABLE]
Since by assumption , we have
[TABLE]
for all . All differences having positive real parts, we get
[TABLE]
Moreover, by assumption we have , hence
[TABLE]
Now, letting denote the integer nearest to , we set
[TABLE]
for all such that ; note that . Hence from the above estimates we infer that if and only if . In particular, all these numbers coincide if and only if , hence in this case immediately determining .
Combinatorial translation. Thus our task can now be rephrased in combinatorial terms as follows: For let be the undirected graph on the vertex set , whose edges are the -subsets such that .
Then by the above discussion the connected components of are complete graphs, whose vertex sets coincide with the sets of such that the associated scalars assume one and the same value. On the other hand, if , for some , has a complete connected component with vertices , then for all such that we have
[TABLE]
Thus we infer that the sequence is strictly increasing if , and strictly decreasing if . In particular this implies that . In other words, as soon as we find a complete connected component of a graph having more than elements, then we may conclude that , and we have determined . Moreover, if than this case actually happens.
Our algorithm to determine the degree of , and for some , is now straightforward: Again our strategy is to increase step by step, and to choose places such that is growing and tends to zero. Having made a choice, we compute the numbers for all ; note that here we do not see a way to avoid using non-exact floating point arithmetic (to evaluate logarithms), while everywhere else we are computing exactly. For all numbers thus occurring we then determine the graph . Amongst all the graphs found we choose one, again say, having a complete connected component of maximal cardinality, with vertex set say. Then we run polynomial recovery, see Section 4.2, using the places and the values , with degree bound .
An example. Here is an example to illustrate the above process. (It is a modified version of an example which actually occurred in our application.) Assume as places , for , we have chosen the rational primes between and , and evaluating the unknown polynomial has resulted in the list of values given in Table 1; the scalars are of course not known either.
Then it turns out that the numbers , where , come from an -element subset of . For seven of them the associated graph has a connected component with at least three vertices, but only for two of them we find a complete connected component amongst them: The graph has a complete connected component consisting of the vertices , while the graph consists of three connected components, which all are complete, having the vertices
[TABLE]
Running polynomial recovery, see Section 4.2, using the places fails by exceeding the degree bound. But running it using yields , where
[TABLE]
while running it using and yields and , respectively. Thus we indeed have , and assuming that we have determined the scalars , for , as well. Note that the bounds assumed in Section 4.2 are fulfilled; and the roots of turning out to be complex roots of unity, implying , the bounds assumed in Section 4.3 are fulfilled as well.
It should be noted that for the preceding discussion we have chosen large enough to exhibit the occurrence of the erroneous set , for which we indeed observe that the associated scalars are pairwise distinct. But this also reveals another practical observation, at least for polynomials occurring in the applications in Section 5: The scalars , here coming from the three-element set , typically are not uniformly distributed throughout , but the scalar occurs much more frequently than the other ones.
As was already mentioned, in practice we instead increase step by step. Then for the smallest such that the graph has a complete connected component with at least three vertices, that is for , we find the set of places, indeed being associated to the case . Now polynomial recovery using readily returns ; note that the bounds assumed in Section 4.2 are still fulfilled.
Catching projectivities. We now have to explain where the conditions imposed in Section 4.3 come from: Typically, for example for the tasks described in Sections 5.1 and 5.2, our aim is to determine a matrix over or by computing various specializations first, that is evaluating at certain places , performing some linear algebra over or , as described in Section 3, for each of the specializations, and then lifting back to polynomials as explained in Section 4.2. But the linear algebra step in between might only be unique up to a scalar in , which additionally depends on the particular specialization considered. On the other hand, the matrix we are looking for might also only be unique up to a scalar in .
Let us now, again, agree on the following convention: Given , not both zero, let denote the polynomial greatest common divisor of and with positive leading coefficient. A vector , where , is called primitive, if actually , and for the greatest common divisor of its entries we have . Clearly greatest common divisor computations in and in yield a -multiple of which is primitive. Similarly, a matrix , where , is called primitive, if actually , and for the greatest common divisor of its entries we have .
Specializing primitive vectors. Hence, in the above context the task is to recover a primitive vector not from specializations , for , but from “rescaled” versions instead. This places us in the setting of Section 4.3, but it remains to justify the assumption that the scalars involved indeed come from a finite pool:
Proposition 4.6
Let , where , such that . Then there is a finite set such that for all we have
[TABLE]
Proof
Note first that by assumption do not have any common zeroes, so that is well-defined for any . We proceed by induction on . For we have , and we may let . Hence let , where we may assume that all the , for , are non-constant. Letting we have Letting for , we have , thus by induction let be a set as asserted associated with . Now, given , we may write
[TABLE]
as , where , and divides . Hence divides , and thus divides an element of . Moreover, from we infer that the resultant is different from zero, see (vzG, , Corollary 6.20), which by (vzG, , Corollary 6.21) implies that divides . Thus the set of all positive divisors of the elements of is as desired. ∎
5 Linear algebra over polynomial rings
As was already mentioned, our general strategy to determine matrices over or is to specialize first at integral places, to apply linear algebra techniques as described in Section 3 to the matrices over or thus obtained, and subsequently to recover the polynomial entries in question by the Chinese remainder lifting technique described in Section 4.2, applying degree detection as described in Section 4.3 if necessary. In this section we describe how we can do linear algebra over or using this approach.
Since we are faced with both sparse and dense matrices, we keep two corresponding formats for matrices over polynomial rings. (In our application, representing matrices for -graph representations, see Definition 2.8, are extremely sparse, while Gram matrices for them, see Remark 2.6, typically are dense; see also Example 9.2). We have conversion and multiplication routines between them, but whenever it comes to linear algebra computations we always use the dense matrix format. From the arithmetical side, we are only using standard matrix multiplication, but no asymptotically faster methods, as are for example indicated in (vzG, , Section 12.1).
Nullspace. We have developed a solution to the following restricted nullspace problem only (which is sufficient for our application):
Given a matrix , where , such that , the task is to determine a primitive vector such that ; then the vector is unique up to sign.
To do so, by going over to a suitable -multiple we may assume that is primitive. Then we specialize the matrix successively at integral places , yielding matrices . Since the rank condition on is equivalent to saying that for all -submatrices of , while there is an -submatrix of such that , we have for any , and for all but finitely many such we indeed have . Thus we may assume that all the chosen specializations also fulfill . Note that this provides an implicit check whether the rank condition on indeed holds.
Hence we are in a position to compute the row kernels as described in Section 3.6, where the are primitive, for all . Thus the latter are of the form , where , and is the desired primitive solution vector from above. By Proposition 4.6 we conclude that the scalars involved indeed come from a finite pool only depending on .
Now applying degree detection, see Section 4.3, and polynomial recovery, see Section 4.2, yields candidate vectors , which by going over to a suitable -multiple can be assumed to be primitive. Then the correctness of can be independently verified by explicitly computing and checking whether this is zero.
Inverse. Given a matrix , where , such that , the task is to find and , such that and the overall greatest common divisor of the entries of and equals ; then the pair is unique up to sign.
To do so, by going over to a suitable -multiple we may assume that . Thus the equation implies that divides , and hence is primitive. Then we specialize the matrix successively at integral places , yielding matrices . Since for all but finitely many we have , we may assume that all the chosen specializations indeed also fulfill . Note that this provides an implicit check whether the invertibility condition on indeed holds.
Hence we are in a position to compute the inverses as described in Section 3.7, yielding and , such that is primitive and , for all . Thus, if and are the desired solutions from above, we infer
[TABLE]
By Proposition 4.6 we conclude that the scalars involved indeed come from a finite pool only depending on and .
Now applying degree detection, see Section 4.3, and polynomial recovery, see Section 4.2, yields candidate solutions and , for which by going over to a suitable -multiple we may assume that and is primitive. Then the correctness of can be independently verified by explicitly computing and checking whether it equals .
The exponent of a matrix. In view of the discussion in Section 3.8, and noting that is a principal ideal domain as well, we pursue the analogy between matrix inverses over and over still a little further. Indeed, given a square matrix such that as above, the polynomial in the expression , where is chosen primitive, again has another interpretation:
Let the exponent of be a primitive generator of the annihilator of the -module , where is the -span of the rows of ; then is unique up to sign. Then, similar to Section 3.8, we conclude that and are associated in , and thus the primitivity of yields
[TABLE]
In other words, computing the inverse of as described in Section 5.2 also yields a method to compute the exponent of as . Moreover, governs modular invertibility of as follows:
Proposition 5.4
We keep the notation of Section 5.3. Let be a prime ideal, let be the field of fractions of the integral domain , and let be the matrix obtained from by reduction modulo . Then is invertible in if and only if .
Proof
The prime ideals of being well-understood, we are in precisely one of the following cases: (i) We have , where is a prime; then we have , a rational function field; (ii) we have , where is non-constant and irreducible, hence in particular is primitive; then we have , an algebraic number field; (iii) we have , where and are as above; then we have , a finite field.
Now is non-invertible in if and only if , which holds if and only if there is an irreducible divisor of being contained in . Thus is suffices to determine (i) the primes , and (ii) the non-constant irreducible polynomials dividing in .
(i) From , where is the adjoint matrix of , we infer that divides in . Hence any prime dividing also divides in . Conversely, if does not divide , then -modular reduction yields , hence . Hence the primes we are looking for are precisely the prime divisors of .
(ii) This is equivalent to finding the irreducible polynomials in dividing in . Again similar to Section 3.8 we conclude that the latter are precisely the irreducible polynomials dividing . Hence the polynomials we are looking for are precisely the non-constant irreducible divisors of . ∎
Product. Given matrices and , where , the task is to compute their product .
This is straightforwardly done: Again, by going over to suitable -multiples we may assume that and . Then we specialize the matrices and successively at integral places , yielding matrices and , whose products we compute. Now applying polynomial recovery, see Section 4.2, yields candidate solutions . (Note that since no “rescaling” takes place here it is not necessary to apply degree detection.)
As for correctness, there are a few necessary conditions which can be used as break conditions in polynomial recovery: All entries of must be polynomials with integer coefficients, and the degrees of the entries of the input matrices yield bounds on the degrees of those of . But these conditions are far from being sufficient, so that, in contrast to the tasks in Sections 5.1 and 5.2, here we do not have a general way of independently verifying correctness. (In our application, as a very efficient break condition we have used the fact that the entries of have to be of a particular form, see Section 8.4.)
An alternative approach. The idea of our approach is, essentially, to reduce computations over to computations over , where lifting back to polynomials is done in one step by combining specialization and Chinese remainder lifting. In consequence, we almost entirely use arithmetic in characteristic zero (except the use of a large prime field in the -adic decomposition algorithm in Section 3.5). But it seems to be worth-while to say a few more words on the following “two-step” approach, which was already mentioned briefly in Sections 1 and 2.9:
Assume our aim is to determine a matrix , where . To this end, we choose pairwise distinct places , for some such that , where is the maximum of the degrees of the non-zero entries of . Thus, if we are able to compute the specializations , for , we may recover the entries of by polynomial interpolation, as for example is described in (vzG, , Section 10.2). In turn, to find the specializations we choose pairwise distinct primes , for some , such that the denominators of all the entries of are coprime to , for all and . Then reduction modulo the chosen primes yields matrices . Hence, if is large enough, and we are able to compute the modular reductions , for , then rational number recovery, see Section 3.4, reveals the entries of . Hence this reduces finding the matrix to finding the matrices over prime fields, for which we in turn may use techniques of the MeatAxe.
Thus here specialization and Chinese remainder lifting are done in two separate steps, aiming at taking advantage of the efficiency of computations in prime characteristic. But the “two-step” approach has severe disadvantages: The number of places to specialize at is at least as large as the degree of the polynomials in question, hence many more and larger than in our approach are needed, increasing time and memory requirements, presumably drastically. (In our application this means .) Moreover, in order to use rational number recovery, the number of primes used for modular reduction must not be too small, at the expense of possibly loosing the very fast arithmetic over small finite fields, which otherwise is a major advantage of the MeatAxe.
Actually, apart from our own experiences, this kind of approach is pursued in mllt , and the figures on timings and memory consumption given there seem to support the above comments. But it should be stressed that the emphasis of mllt is on parallelizing this kind of computations, which we here do not consider at all.
6 Computing with representations
As was already mentioned in Section 1, in our application we will make use of a suitable variant of the “standard basis algorithm”, which was originally used in parker1 for computations over finite fields. In this section we present the necessary ideas from computational representation theory, which can be formulated in terms of the following general setting:
Standard bases. Let be a -algebra, where is a field, being generated by the (ordered) set , where . Moreover, let be an absolutely irreducible matrix representation of , where . Then the task is to find a “canonical” -basis of the row space with respect to the representation , where we consider right actions, as is common in the computational world.
To this end, let such that ; note that whenever is irreducible such an element exists if and only if is absolutely irreducible. This leads to the following breadth-first search algorithm; see also parker1 : Choose a seed vector , let and , and set . As long as does not exceed the cardinality of , let be the -th element of . Then for let successively , and check whether or not . If so, then discard ; if not, then append to , and append to . Having done this for all , increment and recurse.
Since the growing set is -linearly independent throughout, this algorithms terminates after at most loops. After termination, is a non-zero submodule of the irreducible -module , and thus indeed is a -basis. (Of course, we may terminate early, without any further checking, as soon as the cardinality of equals , since from this point on would not change anymore anyway.) The (ordered) set is called a standard basis of with respect to the representation , the generators , and the distinguished element , and the “bookkeeping list” is called the associated Schreier tree.
Strictly speaking, also depends on the chosen seed vector, but it is essentially unique in the following sense: If gives rise to the standard basis with Schreier tree , then we have , for some , and thus and . Moreover, using the Schreier tree , we may recover , up to a scalar, without any searching as follows: Choose , and for let successively .
In practice. We are able to run the above standard basis algorithm in the following particular cases: If is a (small) finite field, then this can of course be done using ideas from the MeatAxe, as is already described in parker1 .
More important from our point of view is the case . Then we may assume that , and if additionally , for all , then we have , hence the key step in the above algorithm, to decide whether or not , can be done using the -adic decomposition algorithm in Section 3.5, where whenever is enlarged we also check whether its -modular reduction is -linearly independent; if not, then we return failure in order to choose another prime . (Note that this is reminiscent of the strategy in Section 3.6.)
Computing homomorphisms. We return to the general setting in Section 6.1, and let be a matrix representation of , which is equivalent to . Then a standard basis of with respect to the representation is found by choosing and just applying the Schreier tree already known from the standard basis computation for by letting successively , for ; note that by assumption we indeed have .
Now let be an -homomorphism from to , that is we have
[TABLE]
of course, it suffices to require this condition for the generators only. Since is absolutely irreducible, it follows that and is unique up to a scalar. Moreover, we have , and thus going over from the standard bases and with respect to and , respectively, to the associated invertible matrices and with rows and , respectively, we get , or equivalently
[TABLE]
Thus to determine we have to perform the following steps: find such that ; compute and ; compute a Schreier tree with respect to and ; apply the Schreier tree in order to compute standard bases and of with respect to and , respectively; going over to matrices, compute the inverse ; and compute the product .
In practice. If , the nullspaces required can be found as described in Section 5.1, where we may assume that and are primitive. Moreover, computing matrix inverses and matrix products can be done as described in Sections 5.2 and 5.5, respectively; by multiplying with a suitable element of we may assume that is primitive as well, then is unique up to sign. Hence for our application it remains to describe how a distinguished element and a Schreier tree can be found, and we have to give an efficient break condition for the algorithm in Section 5.5.
7 Finding standard bases for -graph representations
We have now described the necessary infrastructure from linear algebra over integral domains, and some relevant general ideas how to compute with representations, to proceed to the explicit determination of Gram matrices of invariant bilinear forms for balanced representations of Iwahori–Hecke algebras. We recall the setting of Section 2.9, which we keep from now on:
Let be a finite Coxeter group, and let be the associated generic Iwahori–Hecke algebras with equal parameters over the ring and the field , respectively, being generated by . Moreover, let , where , be a -graph representation associated with , and let
[TABLE]
As far as computer implementations are concerned, it is more convenient and more efficient to work with row vectors instead of column vectors. Therefore, we will now work throughout with right actions rather than left actions as in Section 2. Our aim is to find a primitive Gram matrix for , that is, using the language of right actions, a primitive matrix such that
[TABLE]
Thus the task is to find a non-zero -homomorphism from to . In order to use the approach described in Section 6.2, we proceed as follows, where the basic idea of this strategy has already been indicated in (gemu, , Section 4.3):
Finding seed vectors. To find a suitable seed vector for the standard basis algorithm with respect to , we proceed as follows:
Specializing we from recover the group algebra , and corresponds to an irreducible representation . In particular, the index and sign representations of , given by and , respectively, for all , correspond to the trivial and sign representations of , given by and , respectively.
As was observed by Benson and Curtis (see (gepf, , Section 6.3) and the references there), there is a subset (depending on , and in general not being unique), such that the restriction of to the parabolic subgroup associated with fulfills
[TABLE]
Note that and if and only if equals and , respectively. Letting be the parabolic subalgebra associated with , this implies
[TABLE]
In other words, we equivalently have
[TABLE]
Now we are going to use the fact that is a -graph representation: Using the -sets associated with , see Definition 2.8, we conclude that for all , where denotes the -th “unit” vector. This implies
[TABLE]
Hence we may let , where is the unique index such that .
Note that this conversely also yields a way to find all subsets of fulfilling the Benson–Curtis condition: We run through all subsets , and just check whether there is precisely one index such that .
Finding a distinguished element. The above immediate approach strongly uses the fact that is a -graph representation. Thus, in order to find a suitable seed vector for the standard basis algorithm with respect to we specify a distinguished element such that . Let
[TABLE]
Hence we have , and it remains to be shown that :
Assume to the contrary that \dim_{K}(\ker\big{(}{\mathfrak{X}}^{\lambda}(T^{\lambda})))\geqslant 2. Then letting
[TABLE]
specializing shows that as well. Since for any vector we have , for all , Lemma 7.3 proven below implies that is -invariant and carries the sign representation. Thus we have , a contradiction.
Lemma 7.3
For let . Moreover, let
[TABLE]
Then, with respect to the natural topology on , we have
[TABLE]
Proof
We consider the Markov chain with (finite) state space , and transition matrix , where denotes the regular matrix representation of . In other words, the matrix entry , where , is given as
[TABLE]
Now, since for all and , we conclude that induces Markov chains on both and . Moreover, since any element of can be written as a word of length at most in the generators , we infer that has positive entries in both the block submatrices belonging to and , respectively. Hence the induced Markov chains are both irreducible and aperiodic. They thus converge towards stationary distributions, which since is doubly-stochastic are both equal to the respective uniform distributions. Thus, in particular, the initial state yields
[TABLE]
∎
Finding standard bases. The distinguished element can now be used to find a primitive vector . Next, having both seed vectors and in place, we aim at computing the associated standard bases with respect to , and with respect to , for the -algebra generated by . But since we do not have a standard basis algorithm available for representations over the field , we again use suitable specializations:
Given a place , let be the representation of obtained by specializing , that is, considering as the -algebra generated by we have
[TABLE]
thus in particular for , identifying with , we recover .
Now we compare a putative run of the standard basis algorithm, as described in Section 6.1, with respect to the seed vector and the generators , with a run with respect to the specialized seed vector and the generators . These successively produce standard bases and , respectively. We show by induction on the cardinality of the intermediate sets , that for all but finitely many the set is obtained by specializing , and that the Schreier trees found in both runs coincide:
Indeed, the key steps are to decide for some whether or not , and similarly for its specialization whether or not . Identifying and with matrices and , respectively, we have . Considering the matrix obtained by concatenating and , we have if and only if there is an -submatrix of such that . Similarly, we have if and only if there is an -submatrix of such that . Hence, whenever we also have , and conversely for all but finitely many from we may conclude that . (We have used a similar argument in Section 5.1.)
Thus assuming that is suitably chosen, we may just run the standard basis algorithm for the seed vector , the -th “unit” vector, and the generators , as described in Section 6.1, yielding a Schreier tree . Letting , and , if is the -th entry in , for , we thus obtain reduced expressions of the elements , and hence the number of steps needed to find the -th element of equals the length . (In practice, it turns out that choosing either or is sufficient, where actually almost always works.)
Applying the Schreier tree to and this yields a standard basis of . Similarly, applying to and we get a standard basis of . But note that this does not ensure that the -lattices and are invariant under the -algebras generated by and , respectively. (In practice they are not, typically.)
8 Finding Gram matrices for -graph representations
We keep the setting of Section 7; in particular still is a -graph representation. Having found standard bases and for and , respectively, we proceed by writing them as matrices and , respectively, where by construction both and are primitive. In order to complete the final task of computing the product efficiently, we need a few preparations.
Palindromicity. Let be the involutory field automorphism given by . Hence is -invariant, and by entry-wise application we get involutory module automorphisms on and , and algebra automorphisms on and , all of which will also be denoted by .
A polynomial is called (-)palindromic, for some , if , and is called (-)skew-palindromic if . In these cases, letting be the maximum power of dividing in , we have . Hence is palindromic or skew-palindromic if and only if and are associated in . Moreover, if is -skew-palindromic, then specializing we get , implying that divides in ; similarly, if is -palindromic, then specializing we get , implying that is even, or divides in .
Proposition 8.2
(a) Let be a primitive Gram matrix for . Then we have , where is even and coincides with the maximum of the degrees of the non-zero entries of .
(b) For the primitive seed vector we have , where is even and coincides with the maximum of the degrees of the non-zero entries of . (Trivially, the analogous statement holds for with .)
Proof
Letting be the identity matrix, by Definition 2.8 for we have
[TABLE]
In particular, this yields
[TABLE]
(a) We consider the matrix : For all we have
[TABLE]
[TABLE]
Now as above is minimal such that , hence we infer that is a primitive Gram matrix for as well, and thus we have or . Assume the latter case holds, then all non-zero entries of are -skew-palindromic, implying that divides , contradicting the primitivity of . Hence we have , that is all non-zero entries of are -palindromic. Assume that is odd, then we infer that divides , again contradicting the primitivity of . Hence is even.
(b) We consider the vector : We have
[TABLE]
[TABLE]
Now as above is minimal such that , hence we infer that is primitive. Thus from we conclude that or . Now we argue as above. ∎
Properties of the standard bases. We have a closer look at the standard bases and , and the associated matrices and , where we assume to be chosen according to Section 7.4. The facts collected are largely due to experimental observation, and will be helpful in the final computational steps in Section 8.4. Still, these properties seem to be stronger than expected from general principles, and it should be worth-while to try and prove the particular observations specified below. (In particular, we have checked the standard bases associated with all subsets fulfilling the Benson–Curtis condition, see Section 7.1, for the types , and .)
Recall that for all we have
[TABLE]
hence by the proof of Proposition 8.2 we get
[TABLE]
The elements of . For any , where , we have , for some and . This yields
[TABLE]
We conclude that and are associated in . Hence by recursion, since is primitive, we infer that for some .
Moreover, we have . Since , this implies for all , where is as in Section 7.4. (Experiments show that all three cases actually occur.) But the growth behavior of the seems to be more restricted than given by these bounds: Considering the case , we have for some such that the “unit” vector is not an eigenvector of , hence using the shape of we conclude that .
Now, experimentally, we have made the following
Observation 1
We have , for all .
(Actually, almost always we have got , for all , where often we have even seen equality throughout; the only cases found where actually , for some , are for type , the representation labeled by , and two out of the twelve Benson–Curtis subsets of generators.)
The matrix . Letting and be as above, we get
[TABLE]
Since the standard basis algorithm is a breadth-first search, from we conclude that there is lower unitriangular matrix and a diagonal matrix , such that
[TABLE]
(Note that if the -lattice was invariant under the -algebra generated by , then we even had .)
In particular, letting , we infer that
[TABLE]
hence is palindromic. Letting denote the exponent of in the sense of Section 5.3, it follows from Proposition 5.4 that the non-constant irreducible polynomials dividing are precisely those dividing . Now, experimentally, we have made the following
Observation 2
Any irreducible divisor of in is monic and palindromic.
(Actually, in general the entries of the matrix are neither palindromic nor skew-palindromic; moreover, quite often is a product of cyclotomic polynomials, but this does not always happen.)
In particular, if denotes the -th column of , for , then divides , hence is palindromic as well. (Actually, contrary to , in general the are not just powers of .)
The elements of . The recursion used in the standard basis algorithm only depends on the Schreier tree , but is independent of the representation considered. Hence for , where , and is primitive, we get for some . Moreover, if and are as above, we get and . Actually, the seem to be closely related to the from above, inasmuch experimentally we have made the following
Observation 3
We have , for all .
The matrix . Again by the fact that the recursion used in the standard basis algorithm only depends on , and using , where is as in Proposition 8.2, we get
[TABLE]
for the same matrices and . In particular, it follows that is palindromic. (In general neither and , nor and are associated in , so that and are inequivalent -sublattices of , which typically are not included in each other.) Again, experimentally we have made the following
Observation 4
Any irreducible divisor of in is monic and palindromic.
In particular, similarly, if denotes the -th column of , for , then is palindromic.
The product . In combination the above yields
[TABLE]
Hence the non-zero entries of are palindromic.
Letting and primitive such that , we get
[TABLE]
where is a primitive Gram matrix, and . In particular, since by Observation 2 the exponent is palindromic, we conclude that the non-zero entries of are palindromic as well.
Moreover, letting such that , we get
[TABLE]
Hence from , where is as in Proposition 8.2, we get
[TABLE]
providing an upper bound on the degrees of the non-zero entries of .
The final product. We are now prepared to do the last computational steps. To do so, we could quite straightforwardly compute first the inverse , that is essentially , and then the product . But it will substantially add to the efficiency if we keep the degrees of the non-zero entries of the matrices involved as small as possible. Now we have already observed above that the rows of and are far from being primitive, and it turns out in practice that this also holds for their columns. We take advantage of this as follows:
Keeping the notation of Section 8.3, let . Then the rows of are primitive. As for its columns, letting denote the -th column of , for , let
[TABLE]
Since by Observation 2 the polynomial is palindromic, using the particular form of , we conclude that the are palindromic as well. We let and be primitive such that . The latter are of course straightforwardly computed, where both and the diagonal entries of are palindromic.
Then we get such that , where now all the rows and all the columns of are primitive. We use the algorithm in Section 5.2 to compute and primitive such that , Since by Observation 2 the exponent is palindromic, using the particular form of and , we conclude that is palindromic as well. Thus altogether we have
[TABLE]
Similarly, let and
[TABLE]
where denotes the -th column of , for . As above, using Observation 4 implying the palindromicity of , we conclude that the diagonal entries of are palindromic as well, and thus those of are too. Then we get such that , where now all the rows and all the columns of are primitive.
In combination this yields
[TABLE]
By the above considerations we conclude that the non-zero entries of are palindromic, which entails that those of are as well. Now by Observation 3 we have , hence this simplifies to
[TABLE]
where the non-zero entries of are palindromic.
In practice. To find , finally, we apply the matrix multiplication algorithm in Section 5.5 to compute the product . As was already mentioned, in order to apply it efficiently we need good break conditions to discard erroneous guesses quickly: Apart from requiring that rational number recovery, see Section 3.4, returns only integral coefficients but not rational ones, it turns out that checking for palindromicity is highly effective in this respect.
Having found a good candidate for , multiplying with the diagonal matrices and is straightforward. Note that, since the result is expected to be a symmetric matrix, it is sufficient to compute only the lower triangular half of the product. Thus we get a candidate for a primitive Gram matrix from . (In many cases already is primitive, but this does not happen always, in which cases typically has a smallish degree.)
As independent verification we of course just explicitly check whether the candidate fulfills the condition
[TABLE]
9 Timings
We conclude by providing running times and workspace requirements for our computations in types and , and by presenting an explicit example for type .
Timings. In Table 2, we give the running time (on a single processor running at a clock speed of ) and GAP workspace requirements needed to compute primitive Gram matrices for types and , and the irreducible -graph representations of given in How , HowYin . The figures for should be compared with those given in Section 2.9 for the approach used there. Recalling that in (gemu, , Remark 4.10) degree was the limit of feasibility, in Table 3 we present the resources now needed for the individual representations of degree at least , where for comparison we repeat the first three columns of the relevant part of Table LABEL:Mmaxd.
Finally, in Table 4 we give some details about the various steps in the computation for the unique representation of largest degree, which is labeled by . In the two last columns we indicate the actual size of the object under consideration in the GAP workspace, and the disc space needed to store it (as an uncompressed text file), respectively; the difference is accounted for by the space consumption of the data structure we are using within GAP, where matrices with polynomial entries are kept as lists of lists of (short) lists of (small long) integers. In particular, in the workspace needed to compute the product, next to the matrices and and (the lower triangular half of) the product , we also keep various specializations of the right hand factor , which have a cumulative size of . Hence to compute a primitive Gram matrix for the representation labeled by we need a running time of and a workspace of size .
An explicit example. We conclude by revisiting the (tiny) example already presented in (gemu, , Example 4.9) (which of course in practice runs in a fraction of a second): Let be of type with Dynkin diagram
s_{1}$$s_{3}$$s_{4}$$s_{2}$$s_{5}$$s_{6}
We consider the irreducible -graph representation of , see Naruse0 , labeled by the representation of , which is the unique one of degree , see Table LABEL:Mmaxd0. The -graph in question is depicted in (gemu, , Example 4.9), hence we do not repeat it here. But to illustrate the shape, and in particular the sparseness of the representing matrices for the generators we present a few of them:
[TABLE]
As it turns out, there are possible choices of a distinguished subset . We choose , in accordance with (gepf, , Table C.4). Then associated primitive seed vectors and are as given below, in the first row of the matrices and , respectively. Running the standard basis algorithm on the specialization of the above -graph representation with respect to yields the following Schreier tree , which we depict as an oriented graph, whose vertices correspond to the vectors in the (ordered) standard bases, and where an arrow from vertex to vertex with label says that is the -th entry of :
1$$s_{4}$$2$$s_{2}$$5$$s_{5}$$4$$s_{3}$$8$$s_{5}$$3$$s_{3}$$7$$s_{5}$$6$$s_{5}$$9$$s_{4}$$10
We find the standard basis with associated matrix as shown below. (It is not always the case that the entries of are only monomials.) Hence we have , where , and is the identity matrix. Thus we get the matrix , and from that and the matrix as also shown below. Note that the entries of are not necessarily palindromic or skew-palindromic, and that the maximum degree of the non-zero entries of , and equals , and , respectively:
[TABLE]
[TABLE]
Similarly, we find the standard basis with associated matrix . As it turns out we indeed have , and is the identity matrix. This yields the matrix as shown below. Note that the entries of are not necessarily palindromic or skew-palindromic, and that the maximum degree of the non-zero entries of is :
[TABLE]
[TABLE]
From this we get . As it turns out we already have , thus we may let be as shown below. Indeed, independent verification shows that is a primitive Gram matrix as desired, coinciding with the one already given in (gemu, , Example 4.9). Note that indeed is a completely dense matrix, all of whose entries are -palindromic, where the maximum degree occurring is , and that in accordance with Table LABEL:Mmaxd0 the largest coefficient occurring has absolute value , and that the specialization yields the identity matrix:
[TABLE]
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) Cohen, H.: A course in computational algebraic number theory. Graduate Texts in Mathematics 138 . Springer-Verlag, 1993.
- 2(2) Davenport, J., Guy, M., Wang, P.: P 𝑃 P -adic reconstruction of rational numbers. SIGSAM Bulletin 16 (2) (1982), 2–33.
- 3(3) Dixon, J.: Exact solution of linear equations using p 𝑝 p -adic expansions. Numer. Math. 40 (1982), 137–141.
- 4(4) The GAP Group: GAP — Groups, Algorithms, Programming — A System for Computational Discrete Algebra. Version 4.8.5 (2016). http://www.gap-system.org .
- 5(5) von zur Gathen, J., Gerhard, J.: Modern computer algebra. Third edition. Cambridge University Press, 2013.
- 6(6) Garibaldi, S.: E 8 subscript 𝐸 8 E_{8} , the most exceptional group. Bull. Amer. Math. Soc. 53 (2016), 643–671.
- 7(7) Geck, M.: Leading coefficients and cellular bases of Hecke algebras. Proc. Edinburgh Math. Soc. 52 (2009), 653–677.
- 8(8) Geck, M., Halls, A,: On the Kazhdan–Lusztig cells in type E 8 subscript 𝐸 8 E_{8} . Math. Comp. 84 (2015), 3029–3049.
