Invariant bilinear forms on $W$-graph representations and linear algebra   over integral domains

Meinolf Geck; J\"urgen M\"uller

arXiv:1701.02331·math.RT·May 9, 2017

Invariant bilinear forms on $W$-graph representations and linear algebra over integral domains

Meinolf Geck, J\"urgen M\"uller

PDF

Open Access

TL;DR

This paper introduces a new algorithmic method for computing invariant bilinear forms on W-graph representations, enhancing the computational tools available for Lie-theoretic structures like those of type E8.

Contribution

A novel algorithm for efficiently computing invariant bilinear forms on W-graph representations, facilitating advanced analysis in Lie theory and related algebraic structures.

Findings

01

Effective computation of invariant bilinear forms achieved

02

Algorithm applied successfully to complex Lie-theoretic structures

03

Enhanced tools for studying decomposition numbers in Hecke algebras

Abstract

Lie-theoretic structures of type $E_{8}$ (e.g., Lie groups and algebras, Hecke algebras and Kazhdan-Lusztig cells, ...) are considered to serve as a `gold standard' when it comes to judging the effectiveness of a general algorithm for solving a computational problem in this area. Here, we address a problem that occurred in our previous work on decomposition numbers of Iwahori-Hecke algebras, namely, the computation of invariant bilinear forms on so-called $W$ -graph representations. We present a new algorithmic solution which makes it possible to produce and effectively use the main results in further applications.

Tables4

Table 1. Table 1: An example for degree detection

\begin{matrix} j & b_{j} & a_{j} ​ f ​ (b_{j}) & a_{j} \\ 1 & 29 & 471132000262895400 & \frac{1}{25} ​ 2 & 31 & 5556161802048405504 & \frac{1}{5} ​ 3 & 37 & 271378870503231142344 & 1 ​ 4 & 41 & 203982274364082601464 & \frac{1}{5} ​ 5 & 43 & 1885780898401789278912 & 1 ​ 6 & 47 & 5946135224244400779264 & 1 ​ 7 & 53 & 28077873950889396256392 & 1 ​ 8 & 59 & 4493456499569142283200 & \frac{1}{25} ​ 9 & 61 & 34577756822169042208584 & \frac{1}{5} ​ 10 & 67 & 581970465933078043504704 & 1 ​ 11 & 71 & 246522309921169431519744 & \frac{1}{5} ​ 12 & 73 & 1766015503219395154436952 & 1 ​ 13 & 79 & 196427398952317706342400 & \frac{1}{25} ​ \end{matrix}

Table 2. Table 2: Time and space consumption

\begin{matrix} degree & no. repr. & time & workspace E_{7} & all & 60 & 4 ​ min & 0.2 ​ GB E_{8} & ⩽1000 & 50 & 30 ​ min & 0.7 ​ GB 1000 ​ — ​ 2000 20 137 ​ min 2.2 ​ GB 2000 ​ — ​ 2500 10 329 ​ min 4.3 ​ GB 2500 ​ — ​ 3000 5 350 ​ min 5.9 ​ GB 3000 ​ — ​ 4000 7 874 ​ min 11.6 ​ GB 4000 ​ — ​ 5000 13 3175 ​ min 16.3 ​ GB 5000 ​ — ​ 7000 6 2784 ​ min 23.2 ​ GB ⩾7000 1 1183 ​ min 31.5 ​ GB \end{matrix}

Table 3. Table 3: Time and space consumption for degree ⩾ 2500 absent 2500 \geqslant 2500

\begin{matrix} ​ E_{8} ​ & ​ m_{P} ​ & abs.val. & time & workspace 2688_{y} & 24 & 169180 & 39 ​ min & 3.9 ​ GB 2800_{z} & 20 & 38038 & 61 ​ min & 3.7 ​ GB 2800_{z}^{'} & 30 & 882222 & 116 ​ min & 5.9 ​ GB 2835_{x} & 24 & 1344484 & 52 ​ min & 3.1 ​ GB 2835_{x}^{'} & 32 & 5391418 & 82 ​ min & 5.3 ​ GB 3150_{y} & 26 & 6166994 & 72 ​ min & 5.8 ​ GB 3200_{x} & 24 & 266284 & 79 ​ min & 4.9 ​ GB 3200_{x}^{'} & 30 & 587345 & 104 ​ min & 6.1 ​ GB 3240_{z} & 16 & 25586 & 60 ​ min & 4.0 ​ GB 3240_{z}^{'} & 48 & 33653538 & 326 ​ min & 11.6 ​ GB 3360_{z} & 20 & 29722 & 74 ​ min & 5.1 ​ GB 3360_{z}^{'} & 32 & 775084 & 159 ​ min & 8.1 ​ GB 4096_{x} & 22 & 531634 & 156 ​ min & 8.0 ​ GB 4096_{x}^{'} & 44 & 234956568 & 392 ​ min & 16.0 ​ GB 4096_{z} & 22 & 531634 & 143 ​ min & 8.1 ​ GB 4096_{z}^{'} & 44 & 234956568 & 428 ​ min & 16.1 ​ GB 4200_{y} & 28 & 58249760 & 171 ​ min & 10.1 ​ GB 4200_{x} & 24 & 5413484 & 171 ​ min & 9.8 ​ GB 4200_{x}^{'} & 36 & 129331224 & 277 ​ min & 13.3 ​ GB 4200_{z} & 26 & 728053 & 183 ​ min & 10.4 ​ GB 4200_{z}^{'} & 28 & 1298612 & 199 ​ min & 10.3 ​ GB 4480_{y} & 32 & 85556320920 & 239 ​ min & 13.9 ​ GB 4536_{y} & 28 & 3887856 & 180 ​ min & 11.7 ​ GB 4536_{z} & 24 & 2728756 & 217 ​ min & 11.4 ​ GB 4536_{z}^{'} & 38 & 50779421 & 419 ​ min & 16.3 ​ GB 5600_{w} & 26 & 372230 & 331 ​ min & 16.6 ​ GB 5600_{z} & 26 & 3115126 & 335 ​ min & 15.4 ​ GB 5600_{z}^{'} & 30 & 3848044 & 473 ​ min & 17.5 ​ GB 5670_{y} & 30 & 10762741 & 351 ​ min & 21.7 ​ GB 6075_{x} & 26 & 894864 & 542 ​ min & 19.5 ​ GB 6075_{x}^{'} & 34 & 10488013 & 752 ​ min & 23.2 ​ GB 7168_{w} & 32 & 1190470476 & 1183 ​ min & 31.5 ​ GB \end{matrix}

Table 4. Table 4: Time and space consumption for 7168 w subscript 7168 𝑤 7168_{w}

\begin{matrix} ​ 7168_{w} ​ & time & workspace & space & disc 𝔗 & 9 ​ min & 0.6 ​ GB u_{1}^{'} & 5 ​ min & 1.3 ​ GB \hat{B} & 925 ​ min & 7.6 ​ GB & 1.7 ​ GB & 0.3 ​ GB {\tilde{B}}^{'} & 29 ​ min & 17.5 ​ GB & 12.6 ​ GB & 4.7 ​ GB \hat{B} \cdot {\tilde{B}}^{'} & 207 ​ min & 31.5 ​ GB & 5.8 ​ GB & 2.4 ​ GB P & 8 ​ min & 7.9 ​ GB & 5.8 ​ GB & 2.5 ​ GB \end{matrix}

Equations202

P\,{\mathfrak{X}}(T_{w})={\mathfrak{X}}(T_{w^{-1}})^{\operatorname{tr}}\,P\qquad\mbox{for all $w\in W$};

P\,{\mathfrak{X}}(T_{w})={\mathfrak{X}}(T_{w^{-1}})^{\operatorname{tr}}\,P\qquad\mbox{for all $w\in W$};

P_{0} = w \in W \sum X (T_{w})^{tr} X (T_{w}) \in K^{d \times d} .

P_{0} = w \in W \sum X (T_{w})^{tr} X (T_{w}) \in K^{d \times d} .

L(s)>0\qquad\mbox{for all $s\in S$}.

L(s)>0\qquad\mbox{for all $s\in S$}.

T_{s}T_{w}=\left\{\begin{array}[]{cl}T_{sw}&\qquad\mbox{if $l(sw)=l(w)+1$},\\ T_{sw}+(v^{L(s)}-v^{-L(s)})T_{s}&\qquad\mbox{if $l(sw)=l(w)-1$}.\end{array}\right.

T_{s}T_{w}=\left\{\begin{array}[]{cl}T_{sw}&\qquad\mbox{if $l(sw)=l(w)+1$},\\ T_{sw}+(v^{L(s)}-v^{-L(s)})T_{s}&\qquad\mbox{if $l(sw)=l(w)-1$}.\end{array}\right.

Irr (W) = {E^{λ} ∣ λ \in Λ} \mbox an d d_{λ} = dim E^{λ} (λ \in Λ),

Irr (W) = {E^{λ} ∣ λ \in Λ} \mbox an d d_{λ} = dim E^{λ} (λ \in Λ),

\mbox t r a ce (T_{w}, E_{v}^{λ}) \in F [v, v^{- 1}] \mbox an d \mbox t r a ce (w, E^{λ}) = \mbox t r a ce (T_{w}, E_{v}^{λ}) ∣_{v \mapsto 1} .

\mbox t r a ce (T_{w}, E_{v}^{λ}) \in F [v, v^{- 1}] \mbox an d \mbox t r a ce (w, E^{λ}) = \mbox t r a ce (T_{w}, E_{v}^{λ}) ∣_{v \mapsto 1} .

\sum_{w\in W}\mbox{trace}(T_{w},E_{v}^{\lambda})\mbox{trace}(T_{w^{-1}},E_{v}^{\mu})=\left\{\begin{array}[]{cl}d_{\lambda}{\mathbf{c}}_{\lambda}&\qquad\mbox{if $\lambda=\mu$},\\ 0&\qquad\mbox{if $\lambda\neq\mu$}.\end{array}\right.

\sum_{w\in W}\mbox{trace}(T_{w},E_{v}^{\lambda})\mbox{trace}(T_{w^{-1}},E_{v}^{\mu})=\left\{\begin{array}[]{cl}d_{\lambda}{\mathbf{c}}_{\lambda}&\qquad\mbox{if $\lambda=\mu$},\\ 0&\qquad\mbox{if $\lambda\neq\mu$}.\end{array}\right.

{\mathbf{c}}_{\lambda}=f_{\lambda}v^{-2{\mathbf{a}}_{\lambda}}+\mbox{linear combination of larger powers of $v$},

{\mathbf{c}}_{\lambda}=f_{\lambda}v^{-2{\mathbf{a}}_{\lambda}}+\mbox{linear combination of larger powers of $v$},

{\mathbf{a}}_{\lambda}=\min\{i\geqslant 0\mid v^{i}\mbox{trace}(T_{w},E_{v}^{\lambda})\in F[v]\mbox{ for all $w\in W$}\}.

{\mathbf{a}}_{\lambda}=\min\{i\geqslant 0\mid v^{i}\mbox{trace}(T_{w},E_{v}^{\lambda})\in F[v]\mbox{ for all $w\in W$}\}.

v^{{\mathbf{a}}_{\lambda}}{\mathfrak{X}}^{\lambda}(T_{w})\in{\mathscr{O}}^{d_{\lambda}\times d_{\lambda}}\qquad\mbox{for all $w\in W$}.

v^{{\mathbf{a}}_{\lambda}}{\mathfrak{X}}^{\lambda}(T_{w})\in{\mathscr{O}}^{d_{\lambda}\times d_{\lambda}}\qquad\mbox{for all $w\in W$}.

\Omega^{\lambda}\,{\mathfrak{X}}^{\lambda}(T_{s})={\mathfrak{X}}^{\lambda}(T_{s})^{\operatorname{tr}}\,\Omega^{\lambda}\qquad\mbox{for all $s\in S$}.

\Omega^{\lambda}\,{\mathfrak{X}}^{\lambda}(T_{s})={\mathfrak{X}}^{\lambda}(T_{s})^{\operatorname{tr}}\,\Omega^{\lambda}\qquad\mbox{for all $s\in S$}.

\Omega^{\lambda}\,{\mathfrak{X}}^{\lambda}(T_{w^{-1}})={\mathfrak{X}}^{\lambda}(T_{w})^{\operatorname{tr}}\,\Omega^{\lambda}\qquad\mbox{for all $w\in W$}.

\Omega^{\lambda}\,{\mathfrak{X}}^{\lambda}(T_{w^{-1}})={\mathfrak{X}}^{\lambda}(T_{w})^{\operatorname{tr}}\,\Omega^{\lambda}\qquad\mbox{for all $w\in W$}.

\langle T_{w}.e,e^{\prime}\rangle_{\lambda}=\langle e,T_{w^{-1}}.e^{\prime}\rangle_{\lambda}\qquad\mbox{for all $e,e^{\prime}\in E_{v}^{\lambda}$ and $w\in W$}.

\langle T_{w}.e,e^{\prime}\rangle_{\lambda}=\langle e,T_{w^{-1}}.e^{\prime}\rangle_{\lambda}\qquad\mbox{for all $e,e^{\prime}\in E_{v}^{\lambda}$ and $w\in W$}.

P_{0} := w \in W \sum X^{λ} (T_{w})^{tr} X^{λ} (T_{w}) \in K^{d_{λ} \times d_{λ}};

P_{0} := w \in W \sum X^{λ} (T_{w})^{tr} X^{λ} (T_{w}) \in K^{d_{λ} \times d_{λ}};

w \in W \sum X^{λ} (T_{w^{- 1}}) P_{0}^{- 1} X^{λ} (T_{w}) = \mbox t r a ce (P_{0}^{- 1}) c_{λ} I_{d_{λ}} .

w \in W \sum X^{λ} (T_{w^{- 1}}) P_{0}^{- 1} X^{λ} (T_{w}) = \mbox t r a ce (P_{0}^{- 1}) c_{λ} I_{d_{λ}} .

\mbox t r a ce (P_{0}^{- 1}) c_{λ} = 1.

\mbox t r a ce (P_{0}^{- 1}) c_{λ} = 1.

v^{L(s)}m_{ij}^{s}\in vR[v]\quad\mbox{and}\quad m_{ij}^{s}=m_{ij}^{s}|_{v\mapsto v^{-1}}\quad\mbox{for all $1\leqslant i,j\leqslant d$, $s\in I_{i}\setminus I_{j}$}.

v^{L(s)}m_{ij}^{s}\in vR[v]\quad\mbox{and}\quad m_{ij}^{s}=m_{ij}^{s}|_{v\mapsto v^{-1}}\quad\mbox{for all $1\leqslant i,j\leqslant d$, $s\in I_{i}\setminus I_{j}$}.

T_{s}.e_{j}=\left\{\begin{array}[]{ll}\displaystyle{v^{L(s)}\,e_{j}+\sum_{1\leqslant i\leqslant d:\,s\in I_{i}}m_{ij}^{s}\,e_{i}}&\qquad\mbox{if $s\not\in I_{j}$},\\ -v^{-L(s)}\,e_{j}&\qquad\mbox{if $s\in I_{j}$}.\end{array}\right.

T_{s}.e_{j}=\left\{\begin{array}[]{ll}\displaystyle{v^{L(s)}\,e_{j}+\sum_{1\leqslant i\leqslant d:\,s\in I_{i}}m_{ij}^{s}\,e_{i}}&\qquad\mbox{if $s\not\in I_{j}$},\\ -v^{-L(s)}\,e_{j}&\qquad\mbox{if $s\in I_{j}$}.\end{array}\right.

PX_{s}=X_{s}^{\operatorname{tr}}\,P\qquad\mbox{for all $s\in S$}.

PX_{s}=X_{s}^{\operatorname{tr}}\,P\qquad\mbox{for all $s\in S$}.

cf [q_{1}, q_{2}, \dots] = q_{1} + \frac{1}{q _{2} + \frac{1}{⋱}}

cf [q_{1}, q_{2}, \dots] = q_{1} + \frac{1}{q _{2} + \frac{1}{⋱}}

σ_{i} = q_{i} σ_{i - 1} + σ_{i - 2} and τ_{i} = q_{i} τ_{i - 1} + τ_{i - 2} .

σ_{i} = q_{i} σ_{i - 1} + σ_{i - 2} and τ_{i} = q_{i} τ_{i - 1} + τ_{i - 2} .

r_{i + 1} := r_{i - 1} - q_{i} r_{i} \in N_{0} such that r_{i + 1} < r_{i},

r_{i + 1} := r_{i - 1} - q_{i} r_{i} \in N_{0} such that r_{i + 1} < r_{i},

s_{i + 1} := s_{i - 1} - q_{i} s_{i} and t_{i + 1} := t_{i - 1} - q_{i} t_{i},

s_{i + 1} := s_{i - 1} - q_{i} s_{i} and t_{i + 1} := t_{i - 1} - q_{i} t_{i},

ρ_{i} = - \frac{t _{i + 1}}{s _{i + 1}}, where g cd (s_{i + 1}, t_{i + 1}) = 1, for 1 ⩽ i ⩽ l .

ρ_{i} = - \frac{t _{i + 1}}{s _{i + 1}}, where g cd (s_{i + 1}, t_{i + 1}) = 1, for 1 ⩽ i ⩽ l .

ρ - ρ_{i} = \frac{a}{b} - \frac{σ _{i}}{τ _{i}} = \frac{τ _{i} a - σ _{i} b}{τ _{i} b} = \frac{s _{i + 1} a + t _{i + 1} b}{s _{i + 1} b} = \frac{r _{i + 1}}{b s _{i + 1}}, for 1 ⩽ i ⩽ l .

ρ - ρ_{i} = \frac{a}{b} - \frac{σ _{i}}{τ _{i}} = \frac{τ _{i} a - σ _{i} b}{τ _{i} b} = \frac{s _{i + 1} a + t _{i + 1} b}{s _{i + 1} b} = \frac{r _{i + 1}}{b s _{i + 1}}, for 1 ⩽ i ⩽ l .

L_{a, b} := ⟨[1, a], [0, b] ⟩_{Z} \subseteq Z^{2} .

L_{a, b} := ⟨[1, a], [0, b] ⟩_{Z} \subseteq Z^{2} .

[x, y] = [c, d] \cdot [s_{i} s_{i + 1} r_{i} r_{i + 1}],

[x, y] = [c, d] \cdot [s_{i} s_{i + 1} r_{i} r_{i + 1}],

\Big{|}\frac{a}{b}-\frac{z}{x}\Big{|}=\frac{|y|}{b\cdot|x|}=\frac{|x|\cdot|y|}{b\cdot|x|^{2}}\leqslant\frac{1}{2\cdot|x|^{2}}.

\Big{|}\frac{a}{b}-\frac{z}{x}\Big{|}=\frac{|y|}{b\cdot|x|}=\frac{|x|\cdot|y|}{b\cdot|x|^{2}}\leqslant\frac{1}{2\cdot|x|^{2}}.

\frac{y}{x} = \frac{x a - z b}{x} = a - \frac{z b}{x} = a - b ρ_{i - 1} = b (ρ - ρ_{i - 1}) = \frac{r _{i}}{s _{i}} .

\frac{y}{x} = \frac{x a - z b}{x} = a - \frac{z b}{x} = a - b ρ_{i - 1} = b (ρ - ρ_{i - 1}) = \frac{r _{i}}{s _{i}} .

\det\Big{(}\begin{bmatrix}s_{i}&r_{i}\\ s_{j}&r_{j}\\ \end{bmatrix}\Big{)}\leqslant\|[s_{i},r_{i}]\|\cdot\|[s_{j},r_{j}]\|<b.

\det\Big{(}\begin{bmatrix}s_{i}&r_{i}\\ s_{j}&r_{j}\\ \end{bmatrix}\Big{)}\leqslant\|[s_{i},r_{i}]\|\cdot\|[s_{j},r_{j}]\|<b.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Algebra and Geometry · Algebraic Geometry and Number Theory · Algebraic structures and combinatorial models

Full text

∎

11institutetext: Meinolf Geck 22institutetext: IAZ-Lehrstuhl für Algebra, Universität Stuttgart, Pfaffenwaldring 57, 70569 Stuttgart, Germany 22email: [email protected] 33institutetext: Jürgen Müller 44institutetext: Arbeitsgruppe Algebra und Zahlentheorie, Bergische Universität Wuppertal, Gauß-Straße 20, 42119 Wuppertal, Germany 44email: [email protected]

Invariant bilinear forms on

$W$ -graph representations and

linear algebra over integral domains

Meinolf Geck and Jürgen Müller

Abstract

Lie-theoretic structures of type $E_{8}$ (e.g., Lie groups and algebras, Iwahori–Hecke algebras and Kazhdan–Lusztig cells, $\ldots$ ) are considered to serve as a “gold standard” when it comes to judging the effectiveness of a general algorithm for solving a computational problem in this area. Here, we address a problem that occurred in our previous work on decomposition numbers of Iwahori–Hecke algebras, namely, the computation of invariant bilinear forms on so-called $W$ -graph representations. We present a new algorithmic solution which makes it possible to produce and effectively use the main results in further applications.

1 Introduction

This paper is concerned with the representation theory of Iwahori–Hecke algebras. Such an algebra ${\mathscr{H}}$ is a certain deformation of the group algebra of a finite Coxeter group $W$ . In myedin , the notion of “balanced representations” of ${\mathscr{H}}$ was introduced, which has turned out to be useful in several applications. We mention here the construction of cellular structures on ${\mathscr{H}}$ (see, e.g., (geja, , Chap. 2)), the determination of decomposition numbers of ${\mathscr{H}}$ (see gemu ), and the computation of Lusztig’s function ${\mathbf{a}}\colon W\rightarrow{\mathbb{Z}}$ (see (geha, , §4)). To check whether a given representation of ${\mathscr{H}}$ is balanced or not is a computationally hard problem; it involves the construction of a certain invariant bilinear form on the underlying ${\mathscr{H}}$ -module. It has been conjectured in myedin that so-called “ $W$ -graph representations” of ${\mathscr{H}}$ are always balanced. But even if such a theoretical result were known to be true, certain applications (e.g., the determination of decomposition numbers) would still require the explicit knowledge of the Gram matrices of the invariant bilinear forms. In this paper, we discuss algorithms for the construction of these Gram matrices for $W$ of exceptional type. The biggest challenge—by far—is the case where $W$ is of type $E_{8}$ . (The distinguished role of $E_{8}$ when it comes to performing explicit computations is highlighted in various recent survey articles; see, e.g., Garibaldi gari , Lusztig shaw , Vogan VoE8 ).

In the situations of interest to us, the algebra ${\mathscr{H}}$ is defined over the field of rational functions $K={\mathbb{Q}}(v)$ (where $v$ is an indeterminate); it has a natural basis $\{T_{w}\mid w\in W\}$ . Explicit models for the irreducible representations of ${\mathscr{H}}$ are known by the work of Naruse Naruse0 , Howlett and Yin How , HowYin . Now let us fix an irreducible matrix representation ${\mathfrak{X}}\colon{\mathscr{H}}\rightarrow K^{d\times d}$ . In order to show that ${\mathfrak{X}}$ is balanced, one needs to determine a non-zero symmetric matrix $P\in K^{d\times d}$ such that

[TABLE]

this matrix $P$ then has to satisfy certain additional properties. Thus, the computation of $P$ essentially amounts to solving a system of linear equations; for theoretical reasons, we know that this system has a unique solution up to multiplication by a scalar. Rescaling a given solution by a suitable non-zero polynomial in ${\mathbb{Q}}[v]$ , we can assume that all entries of $P$ are in ${\mathbb{Z}}[v]$ and that their greatest common divisor is $\pm 1$ ; then $P$ is unique up to sign and is called a “primitive Gram matrix”. The general theory also shows that a particular solution is given by

[TABLE]

Thus, if the matrices ${\mathfrak{X}}(T_{w})$ ( $w\in W$ ) are known and if $|W|$ is not too large, then we can simply perform the above summation and obtain $P_{0}$ ; rescaling $P_{0}$ yields a primitive Gram matrix $P$ . This procedure works for types $F_{4}$ , $E_{6}$ , for example.

Already for type $E_{7}$ , one needs to use a more sophisticated approach as described in (gemu, , §4.3), based on Parker’s “standard basis algorithm” parker1 , in combination with interpolation and modular techniques. This also works for type $E_{8}$ , but it is efficient only for irreducible representations of dimension up to about $2500$ . In our previous work on decomposition numbers, this was sufficient to obtain the desired results for type $E_{8}$ ; see (gemu, , Remark 4.10). In principle, one could have run the above procedure on all irreducible representations of type $E_{8}$ , but experiments showed that this would have needed a total of nearly one year of CPU time. On the other hand, from a strictly logical point of view, one does not need to know exactly how the Gram matrices have been obtained, because as an independent verification one can simply check that they form a solution to the above system of linear equations. However, to store the various primitive Gram matrices requires about $28$ GB of disk space, and even the verification alone is a major task as it involves the computation of products of (large) matrices with polynomial entries. — In any case, this raises a serious issue of making sure that our results are reliable and reproducible.

In our view, the solution to deal with this issue is to develop better mathematical tools which make it possible to reproduce the results efficiently as needed, and this is what we will do in this paper. Indeed, for example, in order to deal with the irreducible representation of largest dimension for type $E_{8}$ (which is $7168$ ), the old approach would have needed roughly seven weeks of CPU time, while the one described here requires only about $20$ hours, which amounts to a factor of almost $60$ . (See Section 9.1 for more details.) In view of the complexity of the task, and the experiences made elsewhere with explicit computations in type $E_{8}$ (see the references cited above), it was clear that developing efficient methods would not be a standard, let alone press-button application of existing tools from computer algebra. Maier et al. mllt proposed an approach based on parallel techniques, but type $E_{8}$ still seems to be a major challenge there. Hence one of the purposes of this paper is to give a systematic description of the (serial) methods we have used for the computation of Gram matrices of invariant bilinear forms for Iwahori–Hecke algebras.

The basic strategy in our approach is to reduce computational linear algebra over the Laurent polynomial ring ${{\mathbb{Q}}}[v,v^{-1}]$ to linear algebra over the integers. Thus, generally speaking, we are faced with the problem of devising efficient tools to do computational linear algebra over integral domains, not just over fields. In order to do so, we build on general ideas from computational representation theory, more precisely on the celebrated so-called MeatAxe philosophy parker1 , which comprises of specially tailored, highly efficient techniques for computational linear algebra over (small) finite fields. Attempts to generalize these ideas to linear algebra over the (infinite) field of rational numbers, and further to linear algebra over the integers have been coined the IntegralMeatAxe parker2 . The last word on this has not been said yet, and in this paper we are trying to contribute here as well. (As future work, we are planning to develop a full IntegralMeatAxe package along the present lines.) But we are additionally going one step further by setting out to extend these ideas to linear algebra over the univariate polynomial rings over the rationals or the integers.

To do so, the basic idea is to reduce to linear algebra over the integers by evaluating polynomials with rational coefficients at integral places, where we are using as few “small” places as possible, and to recover the polynomials in question by a Chinese remainder technique. Hence this strategy, fitting nicely into the IntegralMeatAxe philosophy, differs from those known to the literature, inasmuch we are neither using modular methods (which would mean to go over to polynomial rings over finite fields), nor are we in a position to use interpolation (which would mean to use lots of places to evaluate at). Thus another purpose of this paper is to give a detailed description of the new computational tasks arising in pursuing this strategy, and how we have accomplished them. Although the choice of the material presented is governed by our application to Iwahori–Hecke algebras, it is exhibited with a view towards general applicability.

Here is an outline of the paper: In Section 2 we recall some basic facts about representations of finite Coxeter groups and Iwahori–Hecke algebras, in particular the notions of $W$ -graphs, balancedness, and invariant bilinear forms. We conclude with Theorem 2.10 saying that for the representations afforded by the $W$ -graphs given by Naruse Naruse0 , Howlett and Yin How , HowYin are actually balanced, and in Tables LABEL:Mmaxd0 and LABEL:Mmaxd we list some numerical data associated with their primitive Gram matrices.

In the subsequent sections we describe our general approach towards linear algebra over integral domains, which consists of a cascade of steps: In Section 3 we first deal with linear algebra over ${\mathbb{Z}}$ . We discuss the key tasks of rational number recovery and of finding integral linear dependencies. Both tasks are known to the literature, but for the former we provide a variant containing a new feature, while for the latter we proceed along another strategy, within the IntegralMeatAxe philosophy. Subsequently, we apply this to computing nullspaces, inverses, and the so-called “exponents” of matrices over ${\mathbb{Z}}$ . In Section 4 we then describe our general approach to deal with polynomials, in view of our aim to do linear algebra over polynomial rings. The key task is to recover a polynomial with rational coefficients from some of its evaluations at integral places. Here, we are aiming at using as few “small” places as possible, whence we are not in a position to apply interpolation, but we are using a Chinese remainder technique instead. Moreover, we devise a method to recover a polynomial from some of its evaluations where the latter are “rescaled” by unknown scalars; the necessity of being able to solve this task is closely related to our use of the IntegralMeatAxe, hence to our knowledge this method is new as well. In Section 5 we proceed to show how linear algebra over ${\mathbb{Z}}$ and polynomial recovery, as discussed in earlier sections, can now be combined to do linear algebra over ${\mathbb{Z}}[X]$ and ${\mathbb{Q}}[X]$ , by devising methods to computing nullspaces, inverses, exponents and products of matrices using this new approach. In Section 6 we finally recall the “standard basis algorithm” originally developed in parker1 for computations over finite fields. We present a general variant for absolutely irreducible matrix representations over an arbitrary field, show how this can be used to compute homomorphisms between such representations, and discuss how the necessary computations are facilitated over the fields ${\mathbb{Q}}$ and ${\mathbb{Q}}(X)$ , using the tools we have developed.

Having the general tools in place, in Section 7 we return to our particular application of computing Gram matrices of invariant bilinear forms for $W$ -graph representations ${\mathfrak{X}}$ of Iwahori–Hecke algebras. We proceed along the strategy which has already been indicated in (gemu, , Section 4.3), where here we take the opportunity to provide full details. We begin by computing standard bases for the representations ${\mathfrak{X}}$ and ${\mathfrak{X}}^{\prime}$ , where the latter is given by ${\mathfrak{X}}^{\prime}(T_{w}):={\mathfrak{X}}(T_{w^{-1}})^{\operatorname{tr}}$ , for $w\in W$ . In order to find suitable seed vectors to start with, we use an observation on restrictions of representations of Iwahori–Hecke algebras to parabolic subalgebras, which naturally leads to certain distinguished elements of ${\mathscr{H}}$ having actions of co-rank one on ${\mathfrak{X}}$ and ${\mathfrak{X}}^{\prime}$ . To actually run the standard basis algorithm subsequently, we again revert to a specialization technique. In Section 8 we proceed by collecting a few observations on the standard bases $B$ and $B^{\prime}$ of the representations ${\mathfrak{X}}$ and ${\mathfrak{X}}^{\prime}$ thus obtained. Indeed, the matrix entries occurring seem to be much less arbitrary than expected from general principles, but this has only been verified experimentally for the representations under consideration here, while a priori proofs are largely missing (so far). The final computational step then essentially is to determine the product $B^{-1}\cdot B^{\prime}$ , which up to rescaling is a Gram matrix as desired. To do this efficiently, apart from the general tools developed above, we make heavy use of the special form of the matrix entries of $B^{-1}\cdot B^{\prime}$ just mentioned. In the concluding Section 9 we provide running times and workspace requirements for our computations in types $E_{7}$ and $E_{8}$ , and present an explicit (tiny) example for type $E_{6}$ .

It should be clear from the above description that to pursue our novel approach we had to solve quite a few tasks for which there was no pre-existing implementation, let alone in one and the same computer algebra system. To develop the necessary new code, as our computational platform we have chosen the computer algebra system GAP GAP . This system provides efficient arithmetics for the various basic objects we need: (i) rational integers and rational numbers, which in turn are handled by the GMP library GMP ; (ii) row vectors and matrices over the integers, the rationals or (small) finite fields, where in this context the entries of row vectors are actually treated as immediate objects; (iii) floating point numbers, where the limited built-in facilities are sufficient for our purposes. Moreover, the necessary input data on Iwahori–Hecke algebras and their representations is provided by the computer algebra system CHEVIE jmich , which conveniently is a branch of GAP.

[TABLE]

2 Iwahori–Hecke algebras and balanced representations

We begin by recalling some basic facts about representations of finite Coxeter groups and Iwahori–Hecke algebras; see gepf , geja , Lusztig03 for further details.

We fix a finite Coxeter group $W$ with set of simple reflections $S$ ; for $w\in W$ , we denote by $l(w)$ the length of $w$ with respect to $S$ . Let $L\colon W\rightarrow{\mathbb{Z}}$ be a weight function as in Lusztig03 , that is, we have $L(ww^{\prime})=L(w)+L(w^{\prime})$ whenever $w,w^{\prime}\in W$ satisfy $l(ww^{\prime})=l(w)+l(w^{\prime})$ . Such a weight function is uniquely determined by its values $L(s)$ for $s\in S$ . We will assume throughout that

[TABLE]

Let $R\subseteq{\mathbb{C}}$ be a subring and $A=R[v,v^{-1}]$ be the ring of Laurent polynomials over $R$ in the indeterminate $v$ . Let ${\mathscr{H}}={\mathscr{H}}_{A}(W,L)$ be the corresponding generic Iwahori–Hecke algebra. Thus, ${\mathscr{H}}$ is an associative $A$ -algebra which is free over $A$ with a basis $\{T_{w}\mid w\in W\}$ ; the multiplication is given by the following rule, where $s\in S$ and $w\in W$ :

[TABLE]

Let $F\subseteq{\mathbb{C}}$ be the field of fractions of $R$ and assume that $F$ is a splitting field for $W$ . (For example, we could take $R=F={\mathbb{R}}$ since ${\mathbb{R}}$ is known to be a splitting field for $W$ .) Let ${\operatorname{Irr}}(W)$ be the set of simple $F[W]$ -modules (up to isomorphism); we shall use the following notation:

[TABLE]

where $\Lambda$ is a finite index set. Let $K=F(v)$ be the field of fractions of $A$ and ${\mathscr{H}}_{K}=K\otimes_{A}{\mathscr{H}}$ be the $K$ -algebra obtained by extension of scalars from $A$ to $K$ . Then ${\mathscr{H}}_{K}$ is a split semisimple algebra and there is a bijection between ${\operatorname{Irr}}(W)$ and ${\operatorname{Irr}}({\mathscr{H}}_{K})$ , the set of simple ${\mathscr{H}}_{K}$ -modules (up to isomorphism). Given $\lambda\in\Lambda$ , we denote by $E_{v}^{\lambda}$ a simple ${\mathscr{H}}_{K}$ -module corresponding to $E^{\lambda}$ . Then $E_{v}^{\lambda}$ is uniquely determined (up to isomorphism) by the following property. For $w\in W$ , we have

[TABLE]

The algebra ${\mathscr{H}}_{K}$ is symmetric, with trace form $\tau\colon{\mathscr{H}}_{K}\rightarrow K$ given by $\tau(T_{1})=1$ and $\tau(T_{w})=0$ for $1\neq w\in W$ . The basis dual to $\{T_{w}\mid w\in W\}$ is given by $\{T_{w^{-1}}\mid w\in W\}$ . By the general theory of symmetric algebras, there are well-defined elements $0\neq{\mathbf{c}}_{\lambda}\in A$ ( $\lambda\in\Lambda)$ such that the following orthogonality relations hold for $\lambda,\mu\in\Lambda$ :

[TABLE]

As observed by Lusztig, we can write each ${\mathbf{c}}_{\lambda}$ uniquely in the form

[TABLE]

where $f_{\lambda}$ is a strictly positive real number and ${\mathbf{a}}_{\lambda}$ is a non-negative integer. The “ $a$ -invariants” ${\mathbf{a}}_{\lambda}$ will play a major role in the sequel; these numbers are explicitly known for all types of $W$ and all choices of $L$ (see (geja, , §1.3), (Lusztig03, , Chap. 22)). Alternatively, ${\mathbf{a}}_{\lambda}$ can be characterized as follows:

[TABLE]

Let ${\mathscr{O}}\subseteq K$ be the localization of $F[v]$ in the prime ideal $(v)$ , that is, ${\mathscr{O}}$ consists of all fractions of the form $f/g\in K$ where $f,g\in F[v]$ and $g(0)\neq 0$ . Let ${\mathfrak{X}}^{\lambda}\colon{\mathscr{H}}_{K}\rightarrow K^{d_{\lambda}\times d_{\lambda}}$ be a matrix representation afforded by $E_{v}^{\lambda}$ . Following myedin , we say that ${\mathfrak{X}}^{\lambda}$ is balanced if

[TABLE]

This concept plays a crucial role in the study of “cellular structures” on ${\mathscr{H}}$ (see myedin ) and the determination of Kazhdan–Lusztig cells (see (geha, , §4)). It is known that every $E_{v}^{\lambda}$ affords a balanced representation. Note that, given some matrix representation afforded by $E_{v}^{\lambda}$ , the above condition is hard to verify since it involves representing matrices for all $w\in W$ . Much better for practical purposes is the following condition.

Proposition 2.5 (See (myedin, , Prop. 4.3, Remark 4.4))

Assume that $F\subseteq{\mathbb{R}}$ . Let $\lambda\in\Lambda$ and ${\mathfrak{X}}^{\lambda}\colon{\mathscr{H}}_{K}\rightarrow K^{d_{\lambda}\times d_{\lambda}}$ be a matrix representation afforded by $E_{v}^{\lambda}$ . Then ${\mathfrak{X}}^{\lambda}$ is balanced if and only if there exists a symmetric matrix $\Omega^{\lambda}\in{\operatorname{GL}}_{d_{\lambda}}({\mathscr{O}})$ such that

[TABLE]

Remark 2.6

Note that, if a matrix $\Omega^{\lambda}$ satisfies ( $*$ ), then it immediately follows that

[TABLE]

Thus, $\Omega^{\lambda}$ is the Gram matrix of a symmetric bilinear form $\langle\;,\;\rangle_{\lambda}\colon E_{v}^{\lambda}\times E_{v}^{\lambda}\rightarrow K$ which is ${\mathscr{H}}_{K}$ -invariant in the sense that

[TABLE]

Remark 2.7

Assume that $F\subseteq{\mathbb{R}}$ . Let $\lambda\in\Lambda$ and ${\mathfrak{X}}^{\lambda}\colon{\mathscr{H}}_{K}\rightarrow K^{d_{\lambda}\times d_{\lambda}}$ be a matrix representation afforded by $E_{v}^{\lambda}$ . Let ${\mathscr{E}}({\mathfrak{X}}^{\lambda})$ be the set of all $P\in K^{d_{\lambda}\times d_{\lambda}}$ such that $P\,{\mathfrak{X}}^{\lambda}(T_{s})={\mathfrak{X}}^{\lambda}(T_{s})^{\operatorname{tr}}\,P$ for $s\in S$ . Since ${\mathfrak{X}}^{\lambda}$ is irreducible, Schur’s Lemma implies that all matrices in ${\mathscr{E}}({\mathfrak{X}}^{\lambda})$ are scalar multiples of each other. By (geja, , Remark 1.4.9), there is a specific element $P_{0}\in{\mathscr{E}}({\mathfrak{X}}^{\lambda})$ given by

[TABLE]

furthermore, we have $\det(P_{0})\neq 0$ . By the Schur Relations (see (gepf, , 7.2.1)), we have

[TABLE]

Using the relation $P_{0}\,{\mathfrak{X}}^{\lambda}(T_{w^{-1}})={\mathfrak{X}}^{\lambda}(T_{w})^{\operatorname{tr}}\,P_{0}$ for all $w\in W$ , we deduce that

[TABLE]

This provides a direct criterion for checking if a given matrix $P\in{\mathscr{E}}({\mathfrak{X}}^{\lambda})$ equals $P_{0}$ . Furthermore, if $P\neq 0$ is an element of ${\mathscr{E}}({\mathfrak{X}}^{\lambda})$ , then $P=cP_{0}$ for some $0\neq c\in K$ and so ${\mathbf{c}}_{\lambda}\mbox{trace}(P^{-1})P={\mathbf{c}}_{\lambda}\mbox{trace}(P_{0}^{-1})P_{0}=P_{0}$ .

The following concept was introduced by Kazhdan–Lusztig KaLu in the equal parameter case (where $L(s)=1$ for all $s\in S$ ); for the general case see (geja, , §1.4).

Definition 2.8

Let $V$ be an ${\mathscr{H}}_{K}$ -module with $d:=\dim V<\infty$ . We say that $V$ is afforded by a $W$ -graph if there exist

•

a basis $\{e_{1},\ldots,e_{d}\}$ of $V$ ,

•

subsets $I_{i}\subseteq S$ for $1\leqslant i\leqslant d$ ,

•

and elements $m_{ij}^{s}\in A$ , where $1\leqslant i,j\leqslant d$ and $s\in I_{i}\setminus I_{j}$ ,

such that the following hold. First, we require that

[TABLE]

Furthermore, for $s\in S$ , the action of $T_{s}$ on $V$ is given by

[TABLE]

Thus, if $V$ is afforded by a $W$ -graph representation, then the action of $T_{s}$ on $V$ is given by matrices of a particularly simple form.

It has been conjectured in myedin (see also (geja, , 1.4.14)) that, if the simple ${\mathscr{H}}_{K}$ -module $E_{v}^{\lambda}$ is afforded by a $W$ -graph, then the corresponding matrix representation is balanced. We now turn to the problem of explicitly verifying if a given irreducible matrix representation of ${\mathscr{H}}_{K}$ is balanced or not.

We shall assume from now that $W$ is a finite Weyl group and that we are in the equal parameter case where $L(s)=1$ for all $s\in S$ ; we may take $R={\mathbb{Z}}$ , $F={\mathbb{Q}}$ in the above discussion. (The remaining cases have been dealt with in (myedin, , Examples 4.5, 4.6).) It is known that every simple ${\mathscr{H}}_{K}$ -module $E_{v}^{\lambda}$ is afforded by a $W$ -graph; see (geja, , Theorem 2.7.2) and the references there. As far as $W$ of exceptional type is concerned, such $W$ -graphs have been determined explicitly, by Naruse Naruse0 , Howlett and Yin How , HowYin . They are available in electronic form through Michel’s development version of the CHEVIE system; see jmich . Now let us fix $\lambda\in\Lambda$ and assume that ${\mathfrak{X}}^{\lambda}\colon{\mathscr{H}}_{K}\rightarrow K^{d_{\lambda}\times d_{\lambda}}$ is a corresponding representation afforded by a $W$ -graph. Concretely, this will mean that we are given the collection of matrices $\{X_{s}:={\mathfrak{X}}^{\lambda}(T_{s})\mid s\in S\}$ . Our aim is to find a matrix $P=(p_{ij})_{1\leqslant i,j\leqslant d_{\lambda}}$ such that

[TABLE]

This is a system of $|S|d_{\lambda}^{2}$ homogeneous linear equations for the $d_{\lambda}(d_{\lambda}+1)/2$ unknown entries of $P$ . (Recall that $P$ is symmetric.) We know that $P$ is uniquely determined up to scalar multiples. Rescaling a given solution by a suitable non-zero polynomial in ${\mathbb{Q}}[v]$ , we can assume that all entries of $P$ are in ${\mathbb{Z}}[v]$ and that their greatest common divisor is $\pm 1$ ; then $P$ is unique up to a sign. Such a solution $P$ will be called a primitive Gram matrix for ${\mathfrak{X}}^{\lambda}$ . As in 2.7, a specific solution $P_{0}$ can be singled out by the condition that $\mbox{trace}(P_{0}^{-1}){\mathbf{c}}_{\lambda}=1$ . We claim that

•

the matrix $P_{0}^{\prime}:=v^{2l(w_{0})}P_{0}$ has entries in ${\mathbb{Z}}[v]$ , and

•

the non-zero entries of $P_{0}^{\prime}$ have degree at most $2l(w_{0})$ .

Here, $w_{0}$ denotes the longest element of $W$ . Indeed, since all the entries of the matrices $X_{s}$ ( $s\in S$ ) are in ${\mathbb{Z}}[v,v^{-1}]$ , the same will be true for $P_{0}$ as well. The formulae in 2.8 show that each matrix $vX_{s}$ ( $s\in S$ ) has entries in ${\mathbb{Z}}[v]$ . Hence, all matrices $v^{l(w_{0})}{\mathfrak{X}}^{\lambda}(T_{w})$ have entries in ${\mathbb{Z}}[v]$ and so $P_{0}^{\prime}$ has entries in ${\mathbb{Z}}[v]$ . Furthermore, the non-zero entries of each matrix $vX_{s}$ have degree [math], $1$ or $2$ . This yields the degree bound for the entries of $P_{0}^{\prime}$ .

Since the entries of $P_{0}^{\prime}$ are integer polynomials of bounded degree, we can determine $P_{0}^{\prime}$ by interpolation and modular techniques (Chinese remainder). Combining this with the techniques described in (gemu, , §4.3), one obtains an algorithm which can be implemented in GAP in a straightforward way. Rescaling these matrices by suitable non-zero polynomials in ${\mathbb{Q}}[v]$ , we obtain primitive Gram matrices as solutions of ( $*$ ). This approach readily produces primitive Gram matrices for $W$ of type $F_{4}$ , $E_{6}$ and $E_{7}$ in a few hours of computing time. As was already advertised in Section 1, we also succeeded in obtaining primitive Gram matrices for type $E_{8}$ , where it is one of the purposes of this paper to describe the methods involved.

Tables LABEL:Mmaxd0 and LABEL:Mmaxd contain some information about these primitive Gram matrices $P$ :

[TABLE]

We note that the primes in the 5th column are so-called “bad primes” for $W$ (as in (geja, , 1.5.11)). In particular, the fact that $P|_{v\rightarrow 0}$ always has a non-zero determinant means that $\det(P)\in{\mathscr{O}}^{\times}$ (see Proposition 2.5). Thus, we can conclude:

Theorem 2.10

Let $W$ be of type $F_{4}$ , $E_{6}$ , $E_{7}$ or $E_{8}$ and $L(s)=1$ for all $s\in S$ . Then the $W$ -graph representations of Naruse Naruse0 , Howlett and Yin How , HowYin are balanced.

3 Linear algebra over the integers

As was already mentioned in Section 1, the basic strategy of our approach to determine Gram matrices of invariant bilinear forms for representations of Iwahori–Hecke algebras is to reduce computational linear algebra over the polynomial rings ${\mathbb{Z}}[X]$ or ${\mathbb{Q}}[X]$ , where from now on $X$ denotes our favorite indeterminate, to computational linear algebra over the integers ${\mathbb{Z}}$ . Thus in this section we begin by describing how we deal with matrices over ${\mathbb{Z}}$ , where we restrict ourselves to the aspects needed for our present application.

Let us fix the following convention: For $x,y\in{\mathbb{Z}}$ , not both zero, let $\gcd(x,y)\in{\mathbb{Z}}$ denote the positive greatest common divisor of $x$ and $y$ . A vector $0\neq v\in{\mathbb{Q}}^{m}$ , where $m\in{\mathbb{N}}$ , is called primitive, if actually $v\in{\mathbb{Z}}^{m}$ , and for the greatest common divisor $\gcd(v)$ of its entries we have $\gcd(v)=1$ . Clearly greatest common divisor computations in ${\mathbb{Z}}$ yield a ${\mathbb{Q}}$ -multiple of $v$ which is primitive. Similarly, a matrix $0\neq A\in{\mathbb{Z}}^{m\times n}$ , where $m,n\in{\mathbb{N}}$ , is called primitive, if actually $A\in{\mathbb{Z}}^{m\times n}$ , and for the greatest common divisor $\gcd(A)$ of its entries we have $\gcd(A)=1$ .

Continued fractions and the Euclidean algorithm. The first computational task we are going to discuss, in Section 3.4 below, is rational number recovery. This has been discussed in the literature at various places, see for example dixon ; mon ; parker2 or (vzG, , Section 5.10). (We also gratefully acknowledge additional private discussions with R. Parker on this topic.) Although the ideas pursued in these references are closely related to ours, none of them completely coincides with our approach, and proofs (if given at all) are not too elucidating. Hence we present our approach in detail, for which we need a few preparations first:

Continued fraction expansions. We recall a few notions from the theory of continued fraction expansions; as a general reference see for example (hw, , Chapter 10): Given $\rho\in{\mathbb{R}}$ such that $\rho\geqslant 0$ , let

[TABLE]

be its (regular) continued fraction expansion, where $q_{1}\in{\mathbb{N}}_{0}$ and $q_{i}\in{\mathbb{N}}$ for $i\geqslant 2$ . This is obtained by letting $q_{1}:=\lfloor{\rho}\rfloor$ , and, as long as $\rho\neq q_{1}$ , proceeding recursively with $\frac{1}{\rho-q_{1}}$ instead of $\rho$ . This process terminates, after $l\geqslant 1$ steps say, if and only if $\rho\in{\mathbb{Q}}$ ; otherwise we let $l:=\infty$ . Truncating at $i\leqslant l$ yields the $i$ -th convergent $\rho_{i}:=\textrm{cf}[q_{1},\ldots,q_{i}]\in{\mathbb{Q}}$ of $\rho$ , hence we may write $\rho_{i}:=\frac{\sigma_{i}}{\tau_{i}}$ , where $\sigma_{i},\tau_{i}\in{\mathbb{N}}_{0}$ such that $\tau_{i}\geqslant 1$ and $\gcd(\sigma_{i},\tau_{i})=1$ . Letting additionally $\sigma_{-1}:=0$ and $\tau_{-1}:=1$ , as well as $\sigma_{0}:=1$ and $\tau_{0}:=0$ , for $i\geqslant 1$ we get by induction

[TABLE]

Hence the sequences $[\sigma_{1},\sigma_{2},\ldots,\sigma_{l}]$ and $[\tau_{2},\tau_{3},\ldots,\tau_{l}]$ are strongly increasing.

Now let $\rho=\frac{a}{b}\in{\mathbb{Q}}$ , where $a,b\in{\mathbb{N}}$ . Then the continued fraction expansion of $\rho$ can be computed by the extended Euclidean algorithm, see (cohen, , Algorithm 1.3.6), as follows: Setting $r_{0}:=a$ and $r_{1}:=b$ , for $1\leqslant i\leqslant l$ let recursively $q_{i}\in{\mathbb{N}}_{0}$ and

[TABLE]

where $l\geqslant 1$ is defined by $r_{l}>0$ but $r_{l+1}=0$ ; actually we have $q_{i}\geqslant 1$ for $i\geqslant 2$ , and of course $r_{l}=\gcd(a,b)$ . Hence the sequence $[r_{1},\ldots,r_{l+1}]$ has non-negative entries and is strongly decreasing. Moreover, setting $s_{0}:=1$ and $t_{0}:=0$ , as well as $s_{1}:=0$ and $t_{1}:=1$ , and for $1\leqslant i\leqslant l$ letting recursively

[TABLE]

we get $r_{i}=s_{i}a+t_{i}b$ . Then it is immediate by induction that $\sigma_{i}=(-1)^{i}\cdot t_{i+1}$ and $\tau_{i}=(-1)^{i+1}\cdot s_{i+1}$ , for $i\geqslant 1$ , and hence

[TABLE]

Hence the sequences $[-s_{3},s_{4},-s_{5}\ldots,\pm s_{l+1}]$ and $[-t_{2},t_{3},-t_{4}\ldots,\pm t_{l+1}]$ have positive entries and are strongly increasing. Finally, a direct computation yields

[TABLE]

Another view on the Euclidean algorithm. For $a,b\in{\mathbb{N}}$ we consider the ${\mathbb{Z}}$ -lattice

[TABLE]

Then we have $|\det(L_{a,b})|=b$ , and it is immediate that $[x,y]\in{\mathbb{Z}}^{2}$ is an element of $L_{a,b}$ if and only if $y\equiv ax\pmod{b}$ . Note that if $0\neq[x,y]\in L_{a,b}$ is primitive, then we necessarily have $\gcd(x,b)=1$ . Moreover, the extended Euclidean algorithm shows that $L_{a,b}=\langle[s_{i},r_{i}],[s_{i+1},r_{i+1}]\rangle_{\mathbb{Z}}$ , for all $0\leqslant i\leqslant l$ . We collect a few properties of $L_{a,b}$ :

Lemma 3.2

(a) For all $0\leqslant i\leqslant l+1$ we have $\langle[s_{i},r_{i}]\rangle_{\mathbb{Q}}\cap L_{a,b}=\langle[s_{i},r_{i}]\rangle_{\mathbb{Z}}$ .

(b) We have $\langle[s_{i},r_{i}]\rangle_{\mathbb{Q}}=\langle[s_{j},r_{j}]\rangle_{\mathbb{Q}}$ , where $1\leqslant i,j\leqslant l+1$ , if and only if $i=j$ .

Proof

We first show that whenever $[x,y]\in L_{a,b}$ such that $0<|y|<r_{i}$ , for some $0\leqslant i\leqslant l$ , then $|x|\geqslant|s_{i+1}|$ : We may assume that $i\geqslant 2$ . Let $c,d\in{\mathbb{Z}}$ such that

[TABLE]

where we may assume that $c\neq 0$ , which entails $d\neq 0$ as well. Since $r_{i}>r_{i+1}\geqslant 0$ , this implies $c\cdot d<0$ . Since the sequence $[s_{2},-s_{3},s_{4},-s_{5}\ldots,\pm s_{l+1}]$ has positive entries, we get $|x|=|cs_{i}+ds_{i+1}|=|c|\cdot|s_{i}|+|d|\cdot|s_{i+1}|\geqslant|s_{i+1}|$ , as asserted.

(a) We may assume that $i\geqslant 2$ . Moreover, for $i=l+1$ letting $[x,0]\in L_{a,b}$ , it is immediate from $ax\equiv 0\pmod{b}$ that $|s_{l+1}|=\frac{b}{r_{l}}=\frac{b}{\gcd(a,b)}$ divides $x$ . Hence we may assume $i\leqslant l$ , too. Then let $d\neq 1$ be a divisor of $\gcd(s_{i},r_{i})$ such that $\frac{1}{d}\cdot[s_{i},r_{i}]\in L_{a,b}$ . Then we have $0<|\frac{r_{i}}{d}|<r_{i}$ and $|\frac{s_{i}}{d}|<|s_{i}|\leqslant|s_{i+1}|$ , contradicting the statement above.

(b) It follows from (a) that there are $c,d\in{\mathbb{Z}}$ such that $[s_{j},r_{j}]=c\cdot[s_{i},r_{i}]$ and $[s_{i},r_{i}]=d\cdot[s_{j},r_{j}]$ . Hence we get $cd=1$ , and since the sequence $[r_{1},\ldots,r_{l+1}]$ has non-negative entries and is strongly decreasing, we infer $r_{i}=r_{j}$ and $i=j$ . ∎

Note that the statement in (b) is trivial if $[s_{i},r_{i}]$ is primitive, that is $\gcd(s_{i},r_{i})=1$ . But this is not always fulfilled, as the example in (vzG, , Example 5.27) shows.

Proposition 3.3

(a) Let $[x,y]\in L_{a,b}$ such that $x\neq 0$ and $|x|\cdot|y|\leqslant\frac{b}{2}$ . Then we have $[x,y]\in\langle[s_{i},r_{i}]\rangle_{\mathbb{Z}}$ , for a unique $2\leqslant i\leqslant l+1$ . In particular, if $[x,y]$ is primitive then we have $[x,y]=[s_{i},r_{i}]$ or $[x,y]=-[s_{i},r_{i}]$ .

(b) Assume there is $0\neq[x,y]\in L_{a,b}$ such that $\|[x,y]\|:=\sqrt{x^{2}+y^{2}}<\sqrt{b}$ . Then there is a unique $2\leqslant i\leqslant l+1$ such that $\|[s_{i},r_{i}]\|<\sqrt{b}$ , and the shortest non-zero elements of $L_{a,b}$ are precisely $[s_{i},r_{i}]$ and $-[s_{i},r_{i}]$ .

Proof

(a) Since $[x,y]\in L_{a,b}$ there is $z\in{\mathbb{Z}}$ such that $y=xa-zb$ . Then we have

[TABLE]

Thus by Legendre’s Theorem, see (hw, , Section 10.15, Theorem 184), we infer that $\frac{z}{x}$ occurs as a convergent in the continued fraction expansion of $\rho=\frac{a}{b}$ , that is, there is $2\leqslant i\leqslant l+1$ such that $\frac{z}{x}=\rho_{i-1}$ . This yields

[TABLE]

Hence we have $[x,y]\in\langle[s_{i},r_{i}]\rangle_{\mathbb{Q}}$ , and thus from Lemma 3.2 we get $[x,y]\in\langle[s_{i},r_{i}]\rangle_{\mathbb{Z}}$ , together with the uniqueness statement.

(b) Assume first that $x=0$ , then by Lemma 3.2 we infer that $b$ divides $y$ , and hence $\|[x,y]\|\geqslant b\geqslant\sqrt{b}$ , a contradiction. Hence we have $x\neq 0$ . Moreover, from $(x-y)^{2}=x^{2}+y^{2}-2xy\geqslant 0$ we get $2\cdot|x|\cdot|y|\leqslant x^{2}+y^{2}=\|[x,y]\|^{2}<b$ , hence from (a) we see that there is $2\leqslant i\leqslant l+1$ such that $[x,y]=\langle[s_{i},r_{i}]\rangle_{\mathbb{Z}}$ . Thus in particular we have $\|[s_{i},r_{i}]\|<\sqrt{b}$ .

In order to show uniqueness, and the statement on shortest elements, let $0\neq[x^{\prime},y^{\prime}]\in L_{a,b}$ such that $\|[x^{\prime},y^{\prime}]\|<\sqrt{b}$ . Then, as above, there is $2\leqslant i\leqslant l+1$ such that $[x^{\prime},y^{\prime}]=\langle[s_{j},r_{j}]\rangle_{\mathbb{Z}}$ , hence in particular we have $\|[s_{j},r_{j}]\|<\sqrt{b}$ . Then Hadamard’s inequality, see (vzG, , Theorem 16.6), implies that

[TABLE]

Since $|\det(L_{a,b})|=b$ divides $\det\Big{(}\begin{bmatrix}s_{i}&r_{i}\\ s_{j}&r_{j}\\ \end{bmatrix}\Big{)}$ this entails $\langle[s_{i},r_{i}]\rangle_{\mathbb{Q}}=\langle[s_{j},r_{j}]\rangle_{\mathbb{Q}}$ , and hence $i=j$ by Lemma 3.2. ∎

A comparison of the above treatment with the references already mentioned seems to be in order: The statement of Proposition 3.3(a) is roughly equivalent to (dixon, , Theorem) and (mon, , Theorem 1), respectively. Alone, the proof given in dixon appears to be too concise, and provides a slightly worse bound for $b$ to be large enough. And (mon, , Theorem 1) is attributed in turn to davguywan , while for a proof the reader is referred to vzG . Unfortunately, (vzG, , Theorem 5.26) is not immediately conclusive for the statements under consideration here.

The main difference between the above-mentioned approaches and ours is the break condition used to actually determine the index $i$ referred to in Proposition 3.3(a): In davguywan ; dixon ; vzG a bound on the residues $r_{i}$ is used, while in (mon, , Section 3) the quotients $q_{i}$ are considered instead (yielding a randomized algorithm). In contrast, in our decisive Proposition 3.3(b) we are using the minimum of the lattice $L_{a,b}$ , which hence treats both the $r_{i}$ and $s_{i}$ (in other words the the unknown numbers $y$ and $x$ ) on a “symmetric” footing. To our knowledge, this point of view is new, its algorithmic relevance being explained below.

Recovering rational numbers. We are now prepared to describe our first computational task, which will appear both in computations over ${\mathbb{Z}}$ in Section 3.5, and over the polynomial ring ${\mathbb{Q}}[X]$ in Section 4.2:

Let $x\in{\mathbb{N}}$ and $0\neq y\in{\mathbb{Z}}$ such that $\gcd(x,y)=1$ . Assume we are given $a,b\in{\mathbb{N}}$ such that $\gcd(x,b)=1$ and $y\equiv ax\pmod{b}$ ; note that since $x$ is invertible modulo $b$ we may write $\frac{y}{x}\equiv a\pmod{b}$ instead, which we will feel free to do if convenient. Now, if $b$ is large enough compared to $x$ and $|y|$ , the task is to recover $\frac{y}{x}\in{\mathbb{Q}}$ from its congruence class $a\pmod{b}$ .

In view of Proposition 3.3(b), this is straightforward: Assuming that $x^{2}+y^{2}<b$ , the ${\mathbb{Z}}$ -lattice $L_{a,b}=\langle[1,a],[0,b]\rangle_{\mathbb{Z}}\subseteq{\mathbb{Z}}^{2}$ has precisely two shortest non-zero elements, namely the primitive elements $\pm[x,y]$ . In other words, the rational number $\frac{y}{x}\in{\mathbb{Q}}$ can be found by computing a shortest non-zero element of $L_{a,b}$ . This in turn can be done algorithmically by the Gauß reduction algorithm for ${\mathbb{Z}}$ -lattices of rank $2$ , see (cohen, , Algorithm 1.3.14). Moreover, compared to the general case, for the particular lattice $L_{a,b}$ we have a better break condition: We may stop early as soon as we have found an element $[x,y]\in L_{a,b}$ such that $x^{2}+y^{2}<b$ . If then $[x,y]$ is primitive, the rational number $\frac{y}{x}$ fulfills all assumptions made, where of course its correctness has to be verified independently. Otherwise, if $[x,y]$ is not primitive, or the shortest element $[x^{\prime},y^{\prime}]\in L_{a,b}$ found fulfills $x^{\prime 2}+y^{\prime 2}\geqslant b$ , then we report failure. Thus, in practice, we choose $b$ small, and rerun the above algorithm with $b$ increasing, until we find a valid candidate passing independent verification.

At this stage, we should point out the algorithmic advantage of our approach, compared to the other ones mentioned: The latter refer to the convergents of continued fraction expansions, and thus to the full sequence of non-negative residues of the extended Euclidean algorithm. In contrast, the Gauß reduction algorithm to find a lattice minimum proceeds by iterated pair reduction, starting with the pair $[0,b]$ and $[1,a]$ . Although this is essentially equivalent to running the extended Euclidean algorithm on $a$ and $b$ , here we are allowed to use best approximation. This amounts to using numerically smallest residues, instead of non-negative ones as was necessary in the context of continued fraction expansions. Although we have not carried out a detailed comparison, it is well-known that this saves a non-negligible amount of quotient and remainder steps.

Finding linear combinations. We are now going to describe the basic task we are faced with in order to be able to do computational linear algebra over ${\mathbb{Z}}$ . To do so, we of course avoid the Gauß algorithm over ${\mathbb{Q}}$ , but we also do not refer to pure “lattice algorithms”, as they are called in (cohen, , Section 2.1), for example those to compute Hermite normal forms or reduced lattice bases described in (cohen, , Section 2.4–2.7). Instead, we use a modular technique, which is a keystone to make use of the ideas of the MeatAxe in the framework of the IntegralMeatAxe. To our knowledge, this has only been discussed very briefly in the literature, for example in dixon ; parker2 . Moreover, our approach differs from those cited, at least in detail; in particular, dixon only allows for regular square matrices.

To describe the computational task, we again need some preparations first: Given a (rectangular) matrix $A\in{\mathbb{Z}}^{m\times n}$ , with ${\mathbb{Q}}$ -linearly independent rows $w_{1},\ldots,w_{m}\in{\mathbb{Z}}^{n}$ , where $m,n\in{\mathbb{N}}$ , let

[TABLE]

be the ${\mathbb{Z}}$ -lattice spanned by the rows of $A$ , and let $L\leqslant\widehat{L}\leqslant{\mathbb{Z}}^{n}$ be its pure closure in ${\mathbb{Z}}^{n}$ , that is the smallest pure ${\mathbb{Z}}$ -sublattice of ${\mathbb{Z}}^{n}$ containing $L$ . Then the index $\det(L):=[\widehat{L}\colon L]$ is finite; of course, if $m=n$ then we have $\det(L)=|\det(A)|$ . Thus for any vector $v\in{\mathbb{Z}}^{n}$ , we have $v\in\widehat{L}$ if and only of there is $a\in{\mathbb{N}}$ such that $av\in L$ ; in this case, if $a$ is chosen minimal then it divides $\det(L)$ .

Now, given $v\in{\mathbb{Z}}^{n}$ , the task is to decide whether or not $v\in\widehat{L}$ , and if this is the case to compute $a_{1},\ldots,a_{m}\in{\mathbb{Z}}$ and $a\in{\mathbb{N}}$ such that $\gcd(a,a_{1},\ldots,a_{m})=1$ and

[TABLE]

in this case $a$ and the $a_{i}$ are uniquely determined.

The $p$ -adic decomposition algorithm. To do so, we choose a (large) prime $p$ . Then reduction modulo $p$ yields the matrix $\overline{A}\in{\mathbb{F}}_{p}^{m\times n}$ over the prime field ${\mathbb{F}}_{p}$ . We assume that the rows $\overline{w}_{1},\ldots,\overline{w}_{m}\in{\mathbb{F}}_{p}^{n}$ of $\overline{A}$ are ${\mathbb{F}}_{p}$ -linearly independent as well; otherwise we choose another prime $p$ . By the structure theory of finitely generated modules over principal ideal domains, this condition is equivalent to saying $\overline{\widehat{L}}=\overline{L}$ , which in turn is equivalent to $p$ not dividing $\det(L)$ . In particular, the independence condition on $\overline{w}_{1},\ldots,\overline{w}_{m}\in{\mathbb{F}}_{p}^{n}$ is fulfilled for all but finitely many primes $p$ .

Thus we have $v\in\widehat{L}$ if and only if $\overline{v}\in\overline{L}=\langle\overline{w}_{1},\ldots,\overline{w}_{m}\rangle_{{\mathbb{F}}_{p}}$ , solving the decision problem. Furthermore, if $v\in\widehat{L}$ then set $v_{0}:=v$ , and for $d\in{\mathbb{N}}_{0}$ proceed successively as follows: Since $v_{d}\in\widehat{L}$ , there are $[a_{d,1},\ldots,a_{d,m}]\in{\mathbb{Z}}^{m}$ such that $-\frac{p}{2}<a_{d,j}\leqslant\frac{p}{2}$ for all $1\leqslant j\leqslant m$ , and

[TABLE]

Then we let

[TABLE]

Hence we have $v_{d+1}\in\widehat{L}$ as well, and we may recurse. This yields

[TABLE]

or equivalently

[TABLE]

Thus, if $v\in L$ , or equivalently $a=1$ , then since $-\frac{p^{d+1}}{2}<\sum_{i=0}^{d}p^{i}a_{i,j}\leqslant\frac{p^{d+1}}{2}$ there is some $d\in{\mathbb{N}}_{0}$ such that $v_{d+1}=0$ , implying that $a_{j}=\sum_{i=0}^{d}p^{i}a_{i,j}$ , for all $1\leqslant j\leqslant m$ , without further independent verification necessary. Otherwise, if $v\in\widehat{L}\setminus L$ , then applying rational number recovery for some $d\in{\mathbb{N}}_{0}$ large enough, see Section 3.4, reveals the vector $\frac{1}{a}\cdot[a_{1},\ldots,a_{m}]\in{\mathbb{Q}}^{m}$ ; note that under the assumptions made $p$ does not divide $a$ . In the latter case correctness is independently verified by computing $[a_{1},\ldots,a_{m}]\cdot A\in{\mathbb{Z}}^{n}$ and checking whether it equals $av\in{\mathbb{Z}}^{n}$ .

Modular computations. In practice, to check $\overline{w}_{1},\ldots,\overline{w}_{m}\in{\mathbb{F}}_{p}^{n}$ for ${\mathbb{F}}_{p}$ -linear independence, and to compute the vectors $[\overline{a}_{d,1},\ldots,\overline{a}_{d,m}]\in{\mathbb{F}}_{p}^{m}$ we use ideas taken from the MeatAxe. In particular, in order to keep the depth $d$ needed smallish, but still to be able to make efficient use of fast arithmetic over small finite prime fields, we choose the prime $p$ amongst the largest primes smaller than $2^{8}=256$ . (In our application we for example use $p=251$ as the default prime.)

Nullspace. In the framework of the IntegralMeatAxe there is a general method to compute a ${\mathbb{Z}}$ -basis of the row kernel of a matrix with entries in ${\mathbb{Z}}$ , see parker2 . But in view of the application to row kernels of matrices over ${\mathbb{Q}}[X]$ in Section 5.1, here we only deal with the following restricted nullspace problem:

Given a matrix $A\in{\mathbb{Q}}^{m\times n}$ , where $m,n\in{\mathbb{N}}$ , such that $\dim_{{\mathbb{Q}}}(\ker(A))=1$ , where $\ker(A)$ denotes the row kernel of $A$ , compute a primitive vector $v\in{\mathbb{Z}}^{m}$ such that $\ker(A)=\langle v\rangle_{{\mathbb{Q}}}$ ; then $v$ is unique up to sign.

To do so, by going over to a suitable ${\mathbb{Q}}$ -multiple we may assume that $A\in{\mathbb{Z}}^{m\times n}$ . Let $w_{1},\ldots,w_{m}\in{\mathbb{Z}}^{n}$ be the rows of $A$ . We may assume that $w_{1}\neq 0$ , since otherwise we trivially set $v:=[1,0,\ldots,0]\in{\mathbb{Z}}^{m}$ . Then for $2\leqslant i\leqslant m$ we successively check, using the $p$ -adic decomposition algorithm in Section 3.5, whether or not $w_{i}\in\langle w_{1},\ldots,w_{i-1}\rangle_{{\mathbb{Q}}}$ . If this is not the case, that is $\{w_{1}\ldots,w_{i}\}$ is ${\mathbb{Q}}$ -linearly independent, then if $\overline{w}_{1},\ldots,\overline{w}_{i}\in{\mathbb{F}}_{p}^{n}$ turns out to be ${\mathbb{F}}_{p}$ -linearly independent we increment $i$ , while otherwise we return failure in order to choose another prime $p$ . If $\{w_{1}\ldots,w_{i}\}$ is ${\mathbb{Q}}$ -linearly dependent, then the $p$ -adic decomposition algorithm returns $a_{1},\ldots,a_{i-1}\in{\mathbb{Z}}$ and $a\in{\mathbb{N}}$ such that $\gcd(a,a_{1},\ldots,a_{i-1})=1$ and $w_{i}=\frac{1}{a}\cdot\sum_{j=1}^{i-1}a_{j}w_{j}$ . Thus $v:=[a_{1},\ldots,a_{i-1},-a,0,\ldots,0]\in\ker(A)\leqslant{\mathbb{Z}}^{m}$ is primitive.

Inverse. Matrix inversion over ${\mathbb{Q}}$ , from the point of view of reducing to computations over ${\mathbb{Z}}$ as much as possible, can be formulated as the following task:

Given a matrix $A\in{\mathbb{Q}}^{n\times n}$ , where $n\in{\mathbb{N}}$ , such that $\det(A)\neq 0$ , compute $B\in{\mathbb{Z}}^{n\times n}$ and $c\in{\mathbb{N}}$ , such that $A^{-1}=\frac{1}{c}\cdot B\in{\mathbb{Q}}^{n\times n}$ and the overall greatest common divisor $\gcd(B,c)$ of the entries of $B$ and $c$ equals $\gcd(B,c)=1$ ; then $(B,c)$ is unique.

To do so, by going over to a suitable ${\mathbb{Q}}$ -multiple we may assume that $A\in{\mathbb{Z}}^{n\times n}$ . Then the equation $BA=c\cdot E_{n}$ , where $E_{n}$ denotes the identity matrix, implies that $\gcd(B)$ divides $c$ , and hence $B$ is necessarily primitive. Solving the equations $\mathscr{X}A=E_{n}$ , for the unknown matrix $\mathscr{X}\in{\mathbb{Q}}^{n\times n}$ , amounts to writing the rows of the identity matrix as ${\mathbb{Q}}$ -linear combinations of the rows of $A$ , which is done using the $p$ -adic decomposition algorithm in Section 3.5; recall that the rows of $A$ indeed are assumed to be ${\mathbb{Q}}$ -linearly independent.

The exponent of a matrix. Given a square matrix $A\in{\mathbb{Z}}^{n\times n}$ such that $\det(A)\neq 0$ as above, the number $c\in{\mathbb{N}}$ found in the expression $A^{-1}=\frac{1}{c}\cdot B$ , where $B\in{\mathbb{Z}}^{n\times n}$ is chosen to be primitive, turns out to have another interpretation:

Let $\textrm{im}(A)\leqslant{\mathbb{Z}}^{n}$ be the ${\mathbb{Z}}$ -span of the rows of $A$ . By the structure theory of finitely generated modules over principal ideal domains, the annihilator of the ${\mathbb{Z}}$ -module ${\mathbb{Z}}^{n}/\textrm{im}(A)$ is a non-zero ideal of ${\mathbb{Z}}$ , the positive generator $\exp(A)$ of which is called the exponent of $A$ . Moreover, $\exp(A)$ divides $\det(A)$ , which in turn divides some power of $\exp(A)$ . Thus the prime divisors of $\exp(A)$ are precisely the primes $p\in{\mathbb{Z}}$ such that $\overline{A}\in{\mathbb{F}}_{p}^{n\times n}$ is not invertible.

Now, actually $\exp(A)$ and $c$ coincide: From $BA=c\cdot E_{n}$ we conclude that $(c{\mathbb{Z}})^{n}\leqslant\textrm{im}(A)$ , hence $\exp(A)$ divides $c$ ; conversely, since $(\exp(A)\cdot{\mathbb{Z}})^{n}\leqslant\textrm{im}(A)$ there is $B^{\prime}\in{\mathbb{Z}}^{n\times n}$ such that $B^{\prime}A=\exp(A)\cdot E_{n}$ , implying that $\exp(A)\cdot B=c\cdot B^{\prime}$ , which by the primitivity of $B$ shows that $c$ divides $\exp(A)$ . In other words, computing the inverse of $A$ as described in Section 3.7 also yields a method to compute $\exp(A)$ .

4 Computing with polynomials

Having the necessary pieces of linear algebra over the integers in place, in this section we describe computational aspects of single polynomials, before we turn to linear algebra over polynomials rings in Section 5.

Polynomial arithmetic. As our general strategy is to use linear algebra over ${\mathbb{Z}}$ or ${\mathbb{Q}}$ to do linear algebra over ${\mathbb{Z}}[X]$ or ${\mathbb{Q}}[X]$ , for all arithmetically heavy computations we recurse to ${\mathbb{Z}}$ or ${\mathbb{Q}}$ . Consequently, for the remaining pieces of explicit computation in ${\mathbb{Z}}[X]$ or ${\mathbb{Q}}[X]$ we may use a simple straightforward approach:

We use our own standard arithmetic for polynomials over ${\mathbb{Z}}$ or ${\mathbb{Q}}$ , where a polynomial $0\neq f=\sum_{i=0}^{d}z_{i}X^{i}\in{\mathbb{Q}}[X]$ is just represented by its coefficient list $[z_{0},\ldots,z_{d}]\in{\mathbb{Q}}^{d+1}$ of length $d+1$ , where $d=\deg(f)$ . Thus we avoid structural overhead as much as possible, and may use directly the facilities to handle row vectors provided by GAP. But we would like to stress that this is just tailored for our aim of doing linear algebra over polynomial rings, and not intended to become a new general-purpose polynomial arithmetic. For example, we are not providing asymptotically fast multiplication, as is for example described in (vzG, , Section 8.3).

In particular, we only rarely need to compute polynomial greatest common divisors. Hence we avoid sophisticated (modular) techniques, as are for example described and compared in (vzG, , Chapter 6), but we are content with a simple variant of the Euclidean algorithm: Assuming that the operands have integral coefficients, by going over to ${\mathbb{Q}}$ -multiples if necessary, in order to avoid coefficient explosion we just use denominator-free pseudo-division as described in (cohen, , Algorithm 3.1.2), and Collins’s sub-resultant algorithm given in (cohen, , Algorithm 3.3.1), albeit the latter without intermediate primitivisation.

On the other hand, we very often have to evaluate polynomials at various places, where our strategy is to use as few of these specializations as possible, so that evaluation at distinct places is done step by step. Thus we are not in a position to use multi-point evaluation techniques, as are for example described in (vzG, , Section 10.1). Hence we are just using the Horner scheme, which under these circumstances is well-known to need the optimal number of multiplications.

We now describe the special tasks needed to be solved in our approach:

Recovering polynomials. The aim is to recover a polynomial with rational coefficients, which we are able to evaluate at arbitrary integral places, from as few such evaluations (at “small” places) as possible. More precisely:

Let $0\neq f:=\sum_{i=0}^{d}z_{i}X^{i}\in{\mathbb{Q}}[X]$ be a polynomial of degree $d=\deg(f)\in{\mathbb{N}}_{0}$ , having coefficients $z_{i}=\frac{y_{i}}{x_{i}}\in{\mathbb{Q}}$ , where $x_{i}\in{\mathbb{N}}$ and $y_{i}\in{\mathbb{Z}}$ such that $\gcd(x_{i},y_{i})=1$ . Then the task is to find pairwise coprime places $b_{1},\ldots,b_{k}\in{\mathbb{Z}}\setminus\{0,\pm 1\}$ , for some (small) $k\in{\mathbb{N}}$ , such that the degree $d$ and the coefficients $z_{0},\ldots,z_{d}$ of $f$ can be computed from the values $f(b_{1}),\ldots,f(b_{k})\in{\mathbb{Q}}$ alone. Note that, in particular, we do not assume that $k>d$ , so that polynomial interpolation is not applicable. (Actually, in our application we often enough have $k\ll d$ , where for example $k\sim 5$ , but $d\lesssim 200$ .)

To this end, let $b:=\prod_{j=1}^{k}|b_{j}|\in{\mathbb{N}}$ , and assume that we have $\gcd(x_{i},b)=1$ and $x_{i}^{2}+y_{i}^{2}<b$ for all $0\leqslant i\leqslant d$ . Hence the congruence classes $z_{i}\equiv\frac{y_{i}}{x_{i}}\pmod{b_{j}}$ and $f(b_{j})\pmod{b_{j}}$ are well-defined, and for the constant coefficient of $f$ we get

[TABLE]

Thus by the Chinese Remainder Theorem, see for example (cohen, , Theorem 1.3.9), there is a unique congruence class $a\pmod{b}$ , where $a\in{\mathbb{Z}}$ , such that $a\equiv z_{0}\pmod{b}$ . To compute $a\in{\mathbb{Z}}$ , we let $a_{j}\in{\mathbb{Z}}$ such that

[TABLE]

An application of Chinese remainder lifting in ${\mathbb{Z}}$ to the congruence classes $a_{1}\pmod{b_{1}},\ldots,a_{k}\pmod{b_{k}}$ yields the congruence class $a\pmod{b}$ , and by our choice of $b$ applying rational number recovery as described in Section 3.4 reveals $z_{0}\in{\mathbb{Q}}$ . Now we recurse to $\widetilde{f}:=\frac{f-z_{0}}{X}\in{\mathbb{Q}}[X]$ , whose value at the place $b_{j}$ can of course be determined directly from $f(b_{j})$ as $\widetilde{f}(b_{j})=\frac{f(b_{j})-z_{0}}{b_{j}}\in{\mathbb{Q}}$ .

Chinese remainder lifting. Hence, apart from rational number recovery, the key computational task to be solved is to perform Chinese remainder lifting in ${\mathbb{Z}}$ :

We are using the straightforward approach based on the extended Euclidean algorithm, as is described in (cohen, , Section 1.3.3). Since we are computing many lifts with respect to the same places $b_{1},\ldots,b_{k}$ , we make use of a precomputation step, as in (cohen, , Algorithm 1.3.11). But, since again for reasons of time and memory efficiency we are choosing small places $b_{j}$ , the specially tailored approach in (cohen, , Algorithm 1.3.11) to keep the intermediate numbers occurring small, at the expense of needing more multiplications, does not pay off as experiments show. Moreover, as we are computing the values $f(b_{j})$ for $1\leqslant j\leqslant k$ step by step, where even the number $k$ of places is not determined in advance, we cannot take advantage of fast Chinese remainder lifting techniques, as are described for example in (vzG, , Section 10.3), either.

Our strategy is to rerun the above algorithm with $k$ increasing, choosing small integral $2\leqslant b_{1}<b_{2}<\cdots<b_{k}$ , and to discard quickly erroneous guesses by an independent verification, until the correct answer passing the verification is found. By the above discussion, this happens after finitely many iterations. Before that, if $b=|\prod_{j=1}^{k}b_{j}|$ is too small, or not coprime to all the denominators $x_{i}$ , the Chinese remainder lifting process does not terminate, or it terminates with a wrong guess. To catch the first case, we impose a degree bound, and stop the lifting process with a failure message if it is exceeded, in order to increment $k$ . (In our application, $200$ turned out to be a suitable degree bound in all cases.)

To catch the second case, we only allow for denominators $x_{i}$ dividing an imposed bound. This is justified, since rational number recovery as described in Section 3.4 is a trade-off between finding the numerator $y$ and the denominator $x$ of the rational number $\frac{y}{x}$ to be reconstructed: In practice, we typically encounter small denominators $x$ and large numerators $y$ , which escape the Gauß reduction algorithm if $b$ is chosen too small, since then the latter tends to return a larger denominator $x^{\prime}>x$ and a smaller numerator $|y^{\prime}|<|y|$ . (In our application, denominator bounds such as small $2$ -powers, or $12$ , or $20$ turned out to be sufficient in all cases.)

Degree detection. We keep the setting of Section 4.2. The technique to be described now has arisen out of an attempt to determine the degree of $f$ without determining its coefficients. Actually, it deals with the following more general situation (whose relevance for our computations will be explained in Section 4.5 below):

Assume that instead of the values $f(b_{1}),\ldots,f(b_{k})$ we are only able to compute “rescaled values” $a_{1}f(b_{1}),\ldots,a_{k}f(b_{k})\in{\mathbb{Q}}$ , with scalar factors $a_{j}\in{\mathbb{Q}}$ such that $a_{j}>0$ , which are only known to come from a finite pool $\mathscr{R}$ of positive rational numbers associated with $f$ . Thus the task now becomes to find $k\in{\mathbb{N}}$ and coprime places $b_{1},\ldots,b_{k}\in{\mathbb{Z}}\setminus\{0,\pm 1\}$ as above, allowing to determine $f$ up to some positive rational scalar multiple, that is to find $af\in{\mathbb{Q}}[X]$ , for some $a\in{\mathbb{Q}}$ such that $a>0$ ; note that this also determines all the quotients $\frac{a_{j}}{a}$ .

To this end, we let $\alpha_{1},\ldots,\alpha_{d}\in{\mathbb{C}}$ be the complex roots of $f$ , and set $\mu:=\max\{0,|\alpha_{1}|,\ldots,|\alpha_{d}|\}$ . Moreover, since $\mathscr{R}$ is a finite set, we have

[TABLE]

Now, let $k\geqslant 2$ , and for the places $b_{1},\ldots,b_{k}$ we additionally assume that

[TABLE]

hence, in particular, the $f(b_{j})$ are non-zero and have the same sign. The necessity of these choices will become clear below. But this forces us to show that for all $k\geqslant 2$ and all $x>0$ and $\delta>0$ there actually exist pairwise coprime integers $b_{1}<\cdots<b_{k}$ such that $x<b_{1}$ and $\ln\big{(}\frac{b_{k}}{b_{1}}\big{)}<\delta$ . Indeed, we are going to show that the latter can always be chosen to be primes (where the mere existence proof to follow is impractical, but in practice considering small primes works well, see Example 4.4):

Let $p_{0}<p_{1}<\cdots$ be the sequence of all primes exceeding $x$ , and assume to the contrary that for all $k$ -subsets thereof, $q_{1}<\cdots<q_{k}$ say, we have $\ln\big{(}\frac{q_{k}}{q_{1}}\big{)}\geqslant\delta$ . Then we have $p_{k-1}\geqslant e^{\delta}\cdot p_{0}$ , and thus $p_{j(k-1)}\geqslant e^{j\delta}\cdot p_{0}$ , for all $j\in{\mathbb{N}}$ . Using the prime number function $\pi(x):=|\{p\in{\mathbb{N}};p\textrm{ prime},p\leqslant x\}|$ this implies

[TABLE]

From this we get

[TABLE]

contradicting the Prime Number Theorem, see (hw, , Section 1.8, Theorem 6), saying that $\lim_{x\rightarrow\infty}\frac{\pi(x)\cdot\ln(x)}{x}=1$ .

Growth behavior of polynomials. We now consider the growth behavior of the polynomial $f$ . For $x>\mu$ we have

[TABLE]

implying

[TABLE]

Thus, for $1\leqslant i<j\leqslant k$ , by the mean value theorem for derivatives there is $b_{i}<\beta<b_{j}$ such that

[TABLE]

Since by assumption $b_{i}>(1+2d)\cdot\mu\geqslant(1+2d)\cdot|\alpha_{r}|$ , we have

[TABLE]

for all $1\leqslant r\leqslant d$ . All differences $\beta-\alpha_{r}\in{\mathbb{C}}$ having positive real parts, we get

[TABLE]

Moreover, by assumption we have $0<\ln(b_{j})-\ln(b_{i})<\delta\leqslant|\ln(a_{j})-\ln(a_{i})|$ , hence

[TABLE]

Now, letting $\lfloor x\rceil:=\lfloor x+\frac{1}{2}\rfloor\in{\mathbb{Z}}$ denote the integer nearest to $x\in{\mathbb{R}}$ , we set

[TABLE]

for all $1\leqslant i,j\leqslant k$ such that $i\neq j$ ; note that $d_{ij}=d_{ji}$ . Hence from the above estimates we infer that $d_{ij}=d$ if and only if $a_{i}=a_{j}$ . In particular, all these numbers $d_{ij}$ coincide if and only if $a_{1}=\cdots=a_{k}$ , hence in this case immediately determining $d$ .

Combinatorial translation. Thus our task can now be rephrased in combinatorial terms as follows: For $c\in{\mathbb{Z}}$ let $\Gamma_{d+c}$ be the undirected graph on the vertex set $\{1,\ldots,k\}$ , whose edges are the $2$ -subsets $\{i,j\}\subseteq\{1,\ldots,k\}$ such that $d_{ij}=d+c$ .

Then by the above discussion the connected components of $\Gamma_{d}$ are complete graphs, whose vertex sets coincide with the sets of $j\in\{1,\ldots,k\}$ such that the associated scalars $a_{j}$ assume one and the same value. On the other hand, if $\Gamma_{d+c}$ , for some $c\neq 0$ , has a complete connected component with $r\geqslant 2$ vertices $b_{j_{1}}<\cdots<b_{j_{r}}$ , then for all $i,j\in\{j_{1},\ldots,j_{r}\}$ such that $i<j$ we have

[TABLE]

Thus we infer that the sequence $a_{j_{1}},\ldots,a_{j_{r}}$ is strictly increasing if $c>0$ , and strictly decreasing if $c<0$ . In particular this implies that $r\leqslant|\mathscr{R}|$ . In other words, as soon as we find a complete connected component of a graph $\Gamma_{d+c}$ having more than $|\mathscr{R}|$ elements, then we may conclude that $c=0$ , and we have determined $d$ . Moreover, if $k>|\mathscr{R}|^{2}$ than this case actually happens.

Our algorithm to determine the degree $d$ of $f$ , and $af$ for some $a>0$ , is now straightforward: Again our strategy is to increase $k$ step by step, and to choose places $2\leqslant b_{1}<b_{2}<\cdots<b_{k}$ such that $b_{1}$ is growing and $\ln(b_{k})-\ln(b_{1})$ tends to zero. Having made a choice, we compute the numbers $d_{ij}\in{\mathbb{Z}}$ for all $1\leqslant i<j\leqslant k$ ; note that here we do not see a way to avoid using non-exact floating point arithmetic (to evaluate logarithms), while everywhere else we are computing exactly. For all numbers $d^{\prime}\in{\mathbb{Z}}$ thus occurring we then determine the graph $\Gamma_{d^{\prime}}$ . Amongst all the graphs found we choose one, again $\Gamma_{d^{\prime}}$ say, having a complete connected component of maximal cardinality, with vertex set $\mathscr{J}\subseteq\{1,\ldots,k\}$ say. Then we run polynomial recovery, see Section 4.2, using the places $\{b_{j};j\in\mathscr{J}\}$ and the values $\{a_{j}f(b_{j});j\in\mathscr{J}\}$ , with degree bound $d^{\prime}$ .

An example. Here is an example to illustrate the above process. (It is a modified version of an example which actually occurred in our application.) Assume as places $b_{j}$ , for $1\leqslant j\leqslant k=13$ , we have chosen the rational primes between $29$ and $79$ , and evaluating the unknown polynomial $f$ has resulted in the list of values $a_{j}f(b_{j})$ given in Table 1; the scalars $a_{j}$ are of course not known either.

Then it turns out that the numbers $d^{\prime}\in{\mathbb{Z}}$ , where $1\leqslant i<j\leqslant 13$ , come from an $34$ -element subset of $\{-27,\ldots,71\}$ . For seven of them the associated graph $\Gamma_{d^{\prime}}$ has a connected component with at least three vertices, but only for two of them we find a complete connected component amongst them: The graph $\Gamma_{7}$ has a complete connected component consisting of the vertices $\mathscr{B}_{0}:=\{47,61,79\}$ , while the graph $\Gamma_{13}$ consists of three connected components, which all are complete, having the vertices

[TABLE]

Running polynomial recovery, see Section 4.2, using the places $\mathscr{B}_{0}$ fails by exceeding the degree bound. But running it using $\mathscr{B}_{1}$ yields $af=\sum_{i=0}^{13}z_{i}X^{i}\in{\mathbb{Z}}[X]$ , where

[TABLE]

while running it using $\mathscr{B}_{2}$ and $\mathscr{B}_{3}$ yields $\frac{1}{5}\cdot af\in{\mathbb{Q}}[X]$ and $\frac{1}{25}\cdot af\in{\mathbb{Q}}[X]$ , respectively. Thus we indeed have $d=\deg(f)=13$ , and assuming that $a=1$ we have determined the scalars $a_{j}$ , for $1\leqslant j\leqslant 13$ , as well. Note that the bounds assumed in Section 4.2 are fulfilled; and the roots of $f$ turning out to be complex roots of unity, implying $\mu=1$ , the bounds assumed in Section 4.3 are fulfilled as well.

It should be noted that for the preceding discussion we have chosen $k$ large enough to exhibit the occurrence of the erroneous set $\mathscr{B}_{0}$ , for which we indeed observe that the associated scalars $a_{j}$ are pairwise distinct. But this also reveals another practical observation, at least for polynomials occurring in the applications in Section 5: The scalars $a_{j}$ , here coming from the three-element set $\mathscr{R}=\{1,\frac{1}{5},\frac{1}{25}\}$ , typically are not uniformly distributed throughout $\mathscr{R}$ , but the scalar $a_{j}=1$ occurs much more frequently than the other ones.

As was already mentioned, in practice we instead increase $k$ step by step. Then for the smallest $k\geqslant 3$ such that the graph $\Gamma_{13}$ has a complete connected component with at least three vertices, that is for $k=6$ , we find the set $\mathscr{B}:=\{37,43,47\}$ of places, indeed being associated to the case $a_{j}=1$ . Now polynomial recovery using $\mathscr{B}$ readily returns $f$ ; note that the bounds assumed in Section 4.2 are still fulfilled.

Catching projectivities. We now have to explain where the conditions imposed in Section 4.3 come from: Typically, for example for the tasks described in Sections 5.1 and 5.2, our aim is to determine a matrix over ${\mathbb{Z}}[X]$ or ${\mathbb{Q}}[X]$ by computing various specializations first, that is evaluating at certain places $b_{1},\ldots,b_{k}$ , performing some linear algebra over ${\mathbb{Z}}$ or ${\mathbb{Q}}$ , as described in Section 3, for each of the specializations, and then lifting back to polynomials as explained in Section 4.2. But the linear algebra step in between might only be unique up to a scalar in ${\mathbb{Q}}$ , which additionally depends on the particular specialization considered. On the other hand, the matrix we are looking for might also only be unique up to a scalar in ${\mathbb{Q}}(X)$ .

Let us now, again, agree on the following convention: Given $f,g\in{\mathbb{Z}}[X]$ , not both zero, let $\gcd(f,g)\in{\mathbb{Z}}[X]$ denote the polynomial greatest common divisor of $f$ and $g$ with positive leading coefficient. A vector $0\neq v\in{\mathbb{Q}}[X]^{m}$ , where $m\in{\mathbb{N}}$ , is called primitive, if actually $v\in{\mathbb{Z}}[X]^{m}$ , and for the greatest common divisor $\gcd(v)$ of its entries we have $\gcd(v)=1$ . Clearly greatest common divisor computations in ${\mathbb{Z}}$ and in ${\mathbb{Z}}[X]$ yield a ${\mathbb{Q}}(X)$ -multiple of $v$ which is primitive. Similarly, a matrix $A\in{\mathbb{Q}}[X]^{m\times n}$ , where $m,n\in{\mathbb{N}}$ , is called primitive, if actually $A\in{\mathbb{Z}}[X]^{m\times n}$ , and for the greatest common divisor $\gcd(A)$ of its entries we have $\gcd(A)=1$ .

Specializing primitive vectors. Hence, in the above context the task is to recover a primitive vector $[f_{1},\ldots,f_{m}]\in{\mathbb{Z}}[X]^{m}$ not from specializations $[f_{1}(b_{j}),\ldots,f_{m}(b_{j})]\in{\mathbb{Z}}^{m}$ , for $1\leqslant j\leqslant k$ , but from “rescaled” versions $[a_{j}f_{1}(b_{j}),\ldots,a_{j}f_{m}(b_{j})]\in{\mathbb{Q}}^{m}$ instead. This places us in the setting of Section 4.3, but it remains to justify the assumption that the scalars $a_{j}\in{\mathbb{Q}}$ involved indeed come from a finite pool:

Proposition 4.6

Let $f_{1},\ldots,f_{m}\in{\mathbb{Z}}[X]$ , where $m\in{\mathbb{N}}$ , such that $\gcd(f_{1},\ldots,f_{m})=1\in{\mathbb{Z}}[X]$ . Then there is a finite set $\mathscr{P}\subseteq{\mathbb{N}}$ such that for all $b\in{\mathbb{Z}}$ we have

[TABLE]

Proof

Note first that by assumption $f_{1},\ldots,f_{m}$ do not have any common zeroes, so that $\gcd(f_{1}(b),\ldots,f_{m}(b))\in{\mathbb{N}}$ is well-defined for any $b\in{\mathbb{Z}}$ . We proceed by induction on $m\in{\mathbb{N}}$ . For $m=1$ we have $f_{1}=\pm 1$ , and we may let $\mathscr{P}:=\{\pm 1\}$ . Hence let $m\geqslant 2$ , where we may assume that all the $f_{i}$ , for $1\leqslant i\leqslant m$ , are non-constant. Letting $g:=\gcd(f_{1},\ldots,f_{m-1})\in{\mathbb{Z}}[X]$ we have $\gcd(g,f_{m})=1$ Letting $g_{i}:=f_{i}/g\in{\mathbb{Z}}[X]$ for $1\leqslant i\leqslant m-1$ , we have $\gcd(g_{1},\ldots,g_{m-1})=1$ , thus by induction let $\mathscr{Q}\subseteq{\mathbb{N}}$ be a set as asserted associated with $g_{1},\ldots,g_{m-1}$ . Now, given $b\in{\mathbb{Z}}$ , we may write

[TABLE]

as $x=yz$ , where $y=\gcd(g(b),f_{m}(b))$ , and $z$ divides $\gcd(g_{1}(b),\ldots,g_{m-1}(b),f_{m}(b))$ . Hence $z$ divides $\gcd(g_{1}(b),\ldots,g_{m-1}(b))$ , and thus divides an element of $\mathscr{Q}$ . Moreover, from $\gcd(g,f_{m})=1$ we infer that the resultant $\rho:=\textrm{res}(g,f_{m})\in{\mathbb{Z}}$ is different from zero, see (vzG, , Corollary 6.20), which by (vzG, , Corollary 6.21) implies that $y=\gcd(g(b),f_{m}(b))$ divides $\rho$ . Thus the set $\mathscr{P}$ of all positive divisors of the elements of $\rho\mathscr{Q}:=\{\rho r\in{\mathbb{N}};r\in\mathscr{Q}\}$ is as desired. ∎

5 Linear algebra over polynomial rings

As was already mentioned, our general strategy to determine matrices over ${\mathbb{Z}}[X]$ or ${\mathbb{Q}}[X]$ is to specialize first at integral places, to apply linear algebra techniques as described in Section 3 to the matrices over ${\mathbb{Z}}$ or ${\mathbb{Q}}$ thus obtained, and subsequently to recover the polynomial entries in question by the Chinese remainder lifting technique described in Section 4.2, applying degree detection as described in Section 4.3 if necessary. In this section we describe how we can do linear algebra over ${\mathbb{Z}}[X]$ or ${\mathbb{Q}}[X]$ using this approach.

Since we are faced with both sparse and dense matrices, we keep two corresponding formats for matrices over polynomial rings. (In our application, representing matrices for $W$ -graph representations, see Definition 2.8, are extremely sparse, while Gram matrices for them, see Remark 2.6, typically are dense; see also Example 9.2). We have conversion and multiplication routines between them, but whenever it comes to linear algebra computations we always use the dense matrix format. From the arithmetical side, we are only using standard matrix multiplication, but no asymptotically faster methods, as are for example indicated in (vzG, , Section 12.1).

Nullspace. We have developed a solution to the following restricted nullspace problem only (which is sufficient for our application):

Given a matrix $A\in{\mathbb{Q}}[X]^{m\times n}$ , where $m,n\in{\mathbb{N}}$ , such that $\textrm{rk}_{{\mathbb{Q}}[X]}(\ker(A))=1$ , the task is to determine a primitive vector $v\in{\mathbb{Z}}[X]^{m}$ such that $\ker(A)=\langle v\rangle_{{\mathbb{Q}}[X]}$ ; then the vector $v$ is unique up to sign.

To do so, by going over to a suitable ${\mathbb{Q}}(X)$ -multiple we may assume that $A\in{\mathbb{Z}}[X]^{m\times n}$ is primitive. Then we specialize the matrix $A$ successively at integral places $b_{1},\ldots,b_{k}$ , yielding matrices $A(b_{j})\in{\mathbb{Z}}^{m\times n}$ . Since the rank condition on $A$ is equivalent to saying that $\det(A^{\prime})=0$ for all $(m\times m)$ -submatrices $A^{\prime}$ of $A$ , while there is an $((m-1)\times(m-1))$ -submatrix $A^{\prime\prime}$ of $A$ such that $\det(A^{\prime\prime})\neq 0$ , we have $\textrm{rk}_{{\mathbb{Z}}}(\ker(A(b)))\geqslant 1$ for any $b\in{\mathbb{Z}}$ , and for all but finitely many such $b$ we indeed have $\textrm{rk}_{{\mathbb{Z}}}(\ker(A(b)))=1$ . Thus we may assume that all the chosen specializations $A(b_{j})$ also fulfill $\textrm{rk}_{{\mathbb{Z}}}(\ker(A(b_{j})))=1$ . Note that this provides an implicit check whether the rank condition on $A$ indeed holds.

Hence we are in a position to compute the row kernels $\ker(A(b_{j}))=\langle v_{j}\rangle_{\mathbb{Z}}\leqslant{\mathbb{Z}}^{m}$ as described in Section 3.6, where the $v_{j}\in{\mathbb{Z}}^{m}$ are primitive, for all $1\leqslant j\leqslant k$ . Thus the latter are of the form $v_{j}=\frac{1}{a_{j}}\cdot v(b_{j})$ , where $a_{j}=\gcd(v(b_{j}))\in{\mathbb{N}}$ , and $v\in{\mathbb{Z}}[X]^{m}$ is the desired primitive solution vector from above. By Proposition 4.6 we conclude that the scalars $a_{j}$ involved indeed come from a finite pool only depending on $v$ .

Now applying degree detection, see Section 4.3, and polynomial recovery, see Section 4.2, yields candidate vectors $0\neq\widetilde{v}\in{\mathbb{Q}}[X]^{m}$ , which by going over to a suitable ${\mathbb{Q}}$ -multiple can be assumed to be primitive. Then the correctness of $\widetilde{v}$ can be independently verified by explicitly computing $\widetilde{v}A$ and checking whether this is zero.

Inverse. Given a matrix $A\in{\mathbb{Q}}[X]^{n\times n}$ , where $n\in{\mathbb{N}}$ , such that $\det(A)\neq 0$ , the task is to find $B\in{\mathbb{Z}}[X]^{n\times n}$ and $c\in{\mathbb{Z}}[X]$ , such that $A^{-1}=\frac{1}{c}\cdot B\in{\mathbb{Q}}(X)^{n\times n}$ and the overall greatest common divisor $\gcd(B,c)\in{\mathbb{Z}}[X]$ of the entries of $B$ and $c$ equals $\gcd(B,c)=1$ ; then the pair $(B,c)$ is unique up to sign.

To do so, by going over to a suitable ${\mathbb{Q}}$ -multiple we may assume that $A\in{\mathbb{Z}}[X]^{n\times n}$ . Thus the equation $BA=c\cdot E_{n}$ implies that $\gcd(B)$ divides $c$ , and hence $B$ is primitive. Then we specialize the matrix $A$ successively at integral places $b_{1},\ldots,b_{k}$ , yielding matrices $A(b_{j})\in{\mathbb{Z}}^{n\times n}$ . Since for all but finitely many $b\in{\mathbb{Z}}$ we have $\det(A(b))\neq 0$ , we may assume that all the chosen specializations $A(b_{j})$ indeed also fulfill $\det(A(b_{j}))\neq 0$ . Note that this provides an implicit check whether the invertibility condition on $A$ indeed holds.

Hence we are in a position to compute the inverses $A(b_{j})^{-1}\in{\mathbb{Q}}^{n\times n}$ as described in Section 3.7, yielding $B_{j}\in{\mathbb{Z}}^{n\times n}$ and $c_{j}\in{\mathbb{Z}}$ , such that $B_{j}$ is primitive and $A(b_{j})^{-1}=\frac{1}{c_{j}}\cdot B_{j}$ , for all $1\leqslant j\leqslant k$ . Thus, if $B\in{\mathbb{Z}}[X]^{n\times n}$ and $c\in{\mathbb{Z}}[X]$ are the desired solutions from above, we infer

[TABLE]

By Proposition 4.6 we conclude that the scalars $a_{j}$ involved indeed come from a finite pool only depending on $B$ and $c$ .

Now applying degree detection, see Section 4.3, and polynomial recovery, see Section 4.2, yields candidate solutions $\widetilde{B}\in{\mathbb{Q}}[X]^{n\times n}$ and $\widetilde{c}\in{\mathbb{Q}}[X]^{n}$ , for which by going over to a suitable ${\mathbb{Q}}$ -multiple we may assume that $\widetilde{c}\in{\mathbb{Z}}[X]^{n}$ and $\widetilde{B}\in{\mathbb{Z}}[X]^{n\times n}$ is primitive. Then the correctness of $(\widetilde{B},\widetilde{c})$ can be independently verified by explicitly computing $A\widetilde{B}$ and checking whether it equals $\widetilde{c}\cdot E_{n}$ .

The exponent of a matrix. In view of the discussion in Section 3.8, and noting that ${\mathbb{Q}}[X]$ is a principal ideal domain as well, we pursue the analogy between matrix inverses over ${\mathbb{Z}}$ and over ${\mathbb{Q}}[X]$ still a little further. Indeed, given a square matrix $A\in{\mathbb{Z}}[X]^{n\times n}$ such that $\det(A)\neq 0$ as above, the polynomial $c\in{\mathbb{Z}}[X]$ in the expression $A^{-1}=\frac{1}{c}\cdot B$ , where $B\in{\mathbb{Z}}[X]^{n\times n}$ is chosen primitive, again has another interpretation:

Let the exponent $\exp(A)\in{\mathbb{Z}}[X]$ of $A$ be a primitive generator of the annihilator of the ${\mathbb{Q}}[X]$ -module ${\mathbb{Q}}[X]^{n}/\textrm{im}(A)$ , where $\textrm{im}(A)\leqslant{\mathbb{Q}}[X]^{n}$ is the ${\mathbb{Q}}[X]$ -span of the rows of $A$ ; then $\exp(A)$ is unique up to sign. Then, similar to Section 3.8, we conclude that $\exp(A)$ and $c$ are associated in ${\mathbb{Q}}[X]$ , and thus the primitivity of $\exp(A)$ yields

[TABLE]

In other words, computing the inverse of $A$ as described in Section 5.2 also yields a method to compute the exponent of $A$ as $\exp(A)=\frac{1}{\gcd(c)}\cdot c$ . Moreover, $c$ governs modular invertibility of $A$ as follows:

Proposition 5.4

We keep the notation of Section 5.3. Let $\{0\}\neq\mathfrak{p}\lhd{\mathbb{Z}}[X]$ be a prime ideal, let $Q_{\mathfrak{p}}:=\operatorname{Quot}({\mathbb{Z}}[X]/\mathfrak{p})$ be the field of fractions of the integral domain ${\mathbb{Z}}[X]/\mathfrak{p}$ , and let $A_{\mathfrak{p}}\in({\mathbb{Z}}[X]/\mathfrak{p})^{n\times n}$ be the matrix obtained from $A$ by reduction modulo $\mathfrak{p}$ . Then $A_{\mathfrak{p}}$ is invertible in $Q_{\mathfrak{p}}^{n\times n}$ if and only if $c\not\in\mathfrak{p}$ .

Proof

The prime ideals of ${\mathbb{Z}}[X]$ being well-understood, we are in precisely one of the following cases: (i) We have $\mathfrak{p}=(p)$ , where $p\in{\mathbb{Z}}$ is a prime; then we have $Q_{\mathfrak{p}}\cong\textrm{Quot}({\mathbb{F}}_{p}[X])={\mathbb{F}}_{p}(X)$ , a rational function field; (ii) we have $\mathfrak{p}=(f)$ , where $f\in{\mathbb{Z}}[X]$ is non-constant and irreducible, hence in particular is primitive; then we have $Q_{\mathfrak{p}}\cong{\mathbb{Q}}[X]/(f)$ , an algebraic number field; (iii) we have $\mathfrak{p}=(p,f)$ , where $p$ and $f$ are as above; then we have $Q_{\mathfrak{p}}={\mathbb{Z}}[X]/\mathfrak{p}\cong{\mathbb{F}}_{p}[X]/(\overline{f})$ , a finite field.

Now $A_{\mathfrak{p}}$ is non-invertible in $Q_{\mathfrak{p}}^{n\times n}$ if and only if $\det(A)\in\mathfrak{p}$ , which holds if and only if there is an irreducible divisor of $\det(A)$ being contained in $\mathfrak{p}$ . Thus is suffices to determine (i) the primes $p\in{\mathbb{Z}}$ , and (ii) the non-constant irreducible polynomials $f\in{\mathbb{Z}}[X]$ dividing $\det(A)$ in ${\mathbb{Z}}[X]$ .

(i) From $A^{-1}=\frac{1}{\det(A)}\cdot\textrm{adj}(A)\in{\mathbb{Q}}(X)^{n\times n}$ , where $\textrm{adj}(A)\in{\mathbb{Z}}[X]^{n\times n}$ is the adjoint matrix of $A$ , we infer that $c$ divides $\det(A)$ in ${\mathbb{Z}}[X]$ . Hence any prime $p\in{\mathbb{Z}}$ dividing $\gcd(c)$ also divides $\det(A)$ in ${\mathbb{Z}}[X]$ . Conversely, if $p$ does not divide $\gcd(c)$ , then $p$ -modular reduction yields $\overline{AB}=\overline{cE_{n}}\neq 0\in{\mathbb{F}}_{p}[X]^{n\times n}$ , hence $\det(\overline{A})\neq 0\in{\mathbb{F}}_{p}[X]$ . Hence the primes $p\in{\mathbb{Z}}$ we are looking for are precisely the prime divisors of $\gcd(c)$ .

(ii) This is equivalent to finding the irreducible polynomials in ${\mathbb{Q}}[X]$ dividing $\det(A)$ in ${\mathbb{Q}}[X]$ . Again similar to Section 3.8 we conclude that the latter are precisely the irreducible polynomials dividing $\exp(A)$ . Hence the polynomials $f\in{\mathbb{Z}}[X]$ we are looking for are precisely the non-constant irreducible divisors of $\frac{1}{\gcd(c)}\cdot c$ . ∎

Product. Given matrices $A\in{\mathbb{Q}}[X]^{l\times m}$ and $B\in{\mathbb{Q}}[X]^{m\times n}$ , where $l,m,n\in{\mathbb{N}}$ , the task is to compute their product $AB\in{\mathbb{Q}}[X]^{l\times n}$ .

This is straightforwardly done: Again, by going over to suitable ${\mathbb{Q}}$ -multiples we may assume that $A\in{\mathbb{Z}}[X]^{l\times m}$ and $B\in{\mathbb{Z}}[X]^{m\times n}$ . Then we specialize the matrices $A$ and $B$ successively at integral places $b_{1},\ldots,b_{k}$ , yielding matrices $A(b_{j})\in{\mathbb{Z}}^{l\times m}$ and $B(b_{j})\in{\mathbb{Z}}^{m\times n}$ , whose products $A(b_{j})B(b_{j})\in{\mathbb{Z}}^{l\times n}$ we compute. Now applying polynomial recovery, see Section 4.2, yields candidate solutions $\widetilde{C}\in{\mathbb{Q}}[X]^{l\times n}$ . (Note that since no “rescaling” takes place here it is not necessary to apply degree detection.)

As for correctness, there are a few necessary conditions which can be used as break conditions in polynomial recovery: All entries of $\widetilde{C}$ must be polynomials with integer coefficients, and the degrees of the entries of the input matrices yield bounds on the degrees of those of $\widetilde{C}$ . But these conditions are far from being sufficient, so that, in contrast to the tasks in Sections 5.1 and 5.2, here we do not have a general way of independently verifying correctness. (In our application, as a very efficient break condition we have used the fact that the entries of $\widetilde{C}$ have to be of a particular form, see Section 8.4.)

An alternative approach. The idea of our approach is, essentially, to reduce computations over ${\mathbb{Q}}[X]$ to computations over ${\mathbb{Z}}$ , where lifting back to polynomials is done in one step by combining specialization and Chinese remainder lifting. In consequence, we almost entirely use arithmetic in characteristic zero (except the use of a large prime field in the $p$ -adic decomposition algorithm in Section 3.5). But it seems to be worth-while to say a few more words on the following “two-step” approach, which was already mentioned briefly in Sections 1 and 2.9:

Assume our aim is to determine a matrix $0\neq A\in{\mathbb{Q}}[X]^{m\times n}$ , where $m,n\in{\mathbb{N}}$ . To this end, we choose pairwise distinct places $b_{1},\ldots,b_{k}\in{\mathbb{Z}}$ , for some $k\in{\mathbb{N}}$ such that $k>d$ , where $d\in{\mathbb{N}}_{0}$ is the maximum of the degrees of the non-zero entries of $A$ . Thus, if we are able to compute the specializations $A(b_{j})\in{\mathbb{Q}}^{n\times n}$ , for $1\leqslant j\leqslant k$ , we may recover the entries of $A$ by polynomial interpolation, as for example is described in (vzG, , Section 10.2). In turn, to find the specializations $A(b_{j})$ we choose pairwise distinct primes $p_{1},\ldots,p_{l}\in{\mathbb{N}}$ , for some $l\in{\mathbb{N}}$ , such that the denominators of all the entries of $A(b_{j})$ are coprime to $p_{i}$ , for all $1\leqslant j\leqslant k$ and $1\leqslant i\leqslant l$ . Then reduction modulo the chosen primes yields matrices $A_{p_{i}}(b_{j})\in{\mathbb{F}}_{p_{i}}^{m\times n}$ . Hence, if $\prod_{i=1}^{l}p_{i}$ is large enough, and we are able to compute the modular reductions $A_{p_{i}}(b_{j})$ , for $1\leqslant i\leqslant l$ , then rational number recovery, see Section 3.4, reveals the entries of $A(b_{j})$ . Hence this reduces finding the matrix $A$ to finding the matrices $A_{p_{i}}(b_{j})$ over prime fields, for which we in turn may use techniques of the MeatAxe.

Thus here specialization and Chinese remainder lifting are done in two separate steps, aiming at taking advantage of the efficiency of computations in prime characteristic. But the “two-step” approach has severe disadvantages: The number $k$ of places to specialize at is at least as large as the degree of the polynomials in question, hence many more and larger $b_{j}$ than in our approach are needed, increasing time and memory requirements, presumably drastically. (In our application this means $k\lesssim 200$ .) Moreover, in order to use rational number recovery, the number $l$ of primes used for modular reduction must not be too small, at the expense of possibly loosing the very fast arithmetic over small finite fields, which otherwise is a major advantage of the MeatAxe.

Actually, apart from our own experiences, this kind of approach is pursued in mllt , and the figures on timings and memory consumption given there seem to support the above comments. But it should be stressed that the emphasis of mllt is on parallelizing this kind of computations, which we here do not consider at all.

6 Computing with representations

As was already mentioned in Section 1, in our application we will make use of a suitable variant of the “standard basis algorithm”, which was originally used in parker1 for computations over finite fields. In this section we present the necessary ideas from computational representation theory, which can be formulated in terms of the following general setting:

Standard bases. Let $\mathscr{A}$ be a $K$ -algebra, where $K$ is a field, being generated by the (ordered) set $A_{1},\ldots,A_{r}$ , where $r\in{\mathbb{N}}_{0}$ . Moreover, let $\mathfrak{X}\colon\mathscr{A}\rightarrow K^{n\times n}$ be an absolutely irreducible matrix representation of $\mathscr{A}$ , where $n\in{\mathbb{N}}$ . Then the task is to find a “canonical” $K$ -basis of the row space $K^{n}$ with respect to the representation ${\mathfrak{X}}$ , where we consider right actions, as is common in the computational world.

To this end, let $A_{0}\in\mathscr{A}$ such that $\dim_{K}(\ker({\mathfrak{X}}(A_{0})))=1$ ; note that whenever ${\mathfrak{X}}$ is irreducible such an element $A_{0}$ exists if and only if ${\mathfrak{X}}$ is absolutely irreducible. This leads to the following breadth-first search algorithm; see also parker1 : Choose a seed vector $0\neq u\in\ker({\mathfrak{X}}(A_{0}))$ , let $\mathfrak{B}:=[u]$ and $\mathfrak{T}:=[[0,0]]$ , and set $i:=1$ . As long as $i$ does not exceed the cardinality of $\mathfrak{B}$ , let $v$ be the $i$ -th element of $\mathfrak{B}$ . Then for $1\leqslant j\leqslant r$ let successively $w:=v\cdot{\mathfrak{X}}(A_{j})$ , and check whether or not $w\in\langle\mathfrak{B}\rangle_{K}$ . If so, then discard $w$ ; if not, then append $w$ to $\mathfrak{B}$ , and append $[i,j]$ to $\mathfrak{T}$ . Having done this for all $j$ , increment $i$ and recurse.

Since the growing set $\mathfrak{B}$ is $K$ -linearly independent throughout, this algorithms terminates after at most $n$ loops. After termination, $\langle\mathfrak{B}\rangle_{K}$ is a non-zero submodule of the irreducible $\mathscr{A}$ -module $K^{n}$ , and thus $\mathfrak{B}$ indeed is a $K$ -basis. (Of course, we may terminate early, without any further checking, as soon as the cardinality of $\mathfrak{B}$ equals $n$ , since from this point on $\mathfrak{B}$ would not change anymore anyway.) The (ordered) set $\mathfrak{B}$ is called a standard basis of $K^{n}$ with respect to the representation ${\mathfrak{X}}$ , the generators $A_{1},\ldots,A_{r}$ , and the distinguished element $A_{0}$ , and the “bookkeeping list” $\mathfrak{T}$ is called the associated Schreier tree.

Strictly speaking, $\mathfrak{B}$ also depends on the chosen seed vector, but it is essentially unique in the following sense: If $0\neq\widetilde{u}\in\ker({\mathfrak{X}}(A_{0}))$ gives rise to the standard basis $\widetilde{\mathfrak{B}}$ with Schreier tree $\widetilde{\mathfrak{T}}$ , then we have $\widetilde{u}=c\cdot u$ , for some $0\neq c\in K$ , and thus $\widetilde{\mathfrak{B}}=c\cdot\mathfrak{B}$ and $\widetilde{\mathfrak{T}}=\mathfrak{T}$ . Moreover, using the Schreier tree $\mathfrak{T}=[[i_{1},j_{j}],\ldots,[i_{n},j_{n}]]$ , we may recover $\mathfrak{B}=[u_{1},\ldots,u_{n}]$ , up to a scalar, without any searching as follows: Choose $0\neq u_{1}\in\ker({\mathfrak{X}}(A_{0}))$ , and for $2\leqslant k\leqslant n$ let successively $u_{k}:=u_{i_{k}}\cdot{\mathfrak{X}}(A_{j_{k}})$ .

In practice. We are able to run the above standard basis algorithm in the following particular cases: If $K$ is a (small) finite field, then this can of course be done using ideas from the MeatAxe, as is already described in parker1 .

More important from our point of view is the case $K={\mathbb{Q}}$ . Then we may assume that $u\in{\mathbb{Z}}^{n}$ , and if additionally ${\mathfrak{X}}(A_{i})\in{\mathbb{Z}}^{n\times n}$ , for all $1\leqslant i\leqslant r$ , then we have $\mathfrak{B}\subseteq{\mathbb{Z}}^{n}$ , hence the key step in the above algorithm, to decide whether or not $w\in\langle\mathfrak{B}\rangle_{\mathbb{Q}}$ , can be done using the $p$ -adic decomposition algorithm in Section 3.5, where whenever $\mathfrak{B}$ is enlarged we also check whether its $p$ -modular reduction $\overline{\mathfrak{B}}\subseteq{\mathbb{F}}_{p}^{n}$ is ${\mathbb{F}}_{p}$ -linearly independent; if not, then we return failure in order to choose another prime $p$ . (Note that this is reminiscent of the strategy in Section 3.6.)

Computing homomorphisms. We return to the general setting in Section 6.1, and let ${\mathfrak{X}}^{\prime}\colon\mathscr{A}\rightarrow K^{n\times n}$ be a matrix representation of $\mathscr{A}$ , which is equivalent to ${\mathfrak{X}}$ . Then a standard basis $\mathfrak{B}^{\prime}=[v^{\prime}_{1},\ldots,v^{\prime}_{n}]$ of $K^{n}$ with respect to the representation ${\mathfrak{X}}^{\prime}$ is found by choosing $0\neq v^{\prime}_{1}\in\ker({\mathfrak{X}}^{\prime}(A_{0}))$ and just applying the Schreier tree $\mathfrak{T}=[[i_{1},j_{j}],\ldots,[i_{n},j_{n}]]$ already known from the standard basis computation for ${\mathfrak{X}}$ by letting successively $v^{\prime}_{k}:=v^{\prime}_{i_{k}}\cdot{\mathfrak{X}}^{\prime}(A_{j_{k}})$ , for $2\leqslant k\leqslant n$ ; note that by assumption we indeed have $\dim_{K}(\ker({\mathfrak{X}}^{\prime}(A_{0})))=1$ .

Now let $0\neq C\in K^{n\times n}$ be an $\mathscr{A}$ -homomorphism from ${\mathfrak{X}}$ to ${\mathfrak{X}}^{\prime}$ , that is we have

[TABLE]

of course, it suffices to require this condition for the generators $A_{1},\ldots,A_{r}$ only. Since ${\mathfrak{X}}$ is absolutely irreducible, it follows that $C\in{\operatorname{GL}}_{n}(K)$ and is unique up to a scalar. Moreover, we have $\ker({\mathfrak{X}}(A_{0}))\cdot C=\ker({\mathfrak{X}}^{\prime}(A_{0}))$ , and thus going over from the standard bases $\mathfrak{B}$ and $\mathfrak{B}^{\prime}$ with respect to ${\mathfrak{X}}$ and ${\mathfrak{X}}^{\prime}$ , respectively, to the associated invertible matrices $B$ and $B^{\prime}$ with rows $v_{1},\ldots,v_{n}\in K^{n}$ and $v^{\prime}_{1},\ldots,v^{\prime}_{n}\in K^{n}$ , respectively, we get $B\cdot C=B^{\prime}$ , or equivalently

[TABLE]

Thus to determine $C$ we have to perform the following steps: find $A_{0}\in\mathscr{A}$ such that $\dim_{K}(\ker({\mathfrak{X}}(A_{0})))=1$ ; compute $\ker({\mathfrak{X}}(A_{0}))\leqslant K^{n}$ and $\ker({\mathfrak{X}}^{\prime}(A_{0}))\leqslant K^{n}$ ; compute a Schreier tree $\mathfrak{T}$ with respect to ${\mathfrak{X}}\cong{\mathfrak{X}}^{\prime}$ and $A_{0}$ ; apply the Schreier tree $\mathfrak{T}$ in order to compute standard bases $\mathfrak{B}$ and $\mathfrak{B}^{\prime}$ of $K^{n}$ with respect to ${\mathfrak{X}}$ and ${\mathfrak{X}}^{\prime}$ , respectively; going over to matrices, compute the inverse $B^{-1}\in{\operatorname{GL}}_{n}(K)$ ; and compute the product $C=B^{-1}\cdot B^{\prime}\in{\operatorname{GL}}_{n}(K)$ .

In practice. If $K={\mathbb{Q}}(X)$ , the nullspaces required can be found as described in Section 5.1, where we may assume that $v_{1}$ and $v^{\prime}_{1}$ are primitive. Moreover, computing matrix inverses and matrix products can be done as described in Sections 5.2 and 5.5, respectively; by multiplying with a suitable element of $K$ we may assume that $C$ is primitive as well, then $C$ is unique up to sign. Hence for our application it remains to describe how a distinguished element and a Schreier tree can be found, and we have to give an efficient break condition for the algorithm in Section 5.5.

7 Finding standard bases for $W$ -graph representations

We have now described the necessary infrastructure from linear algebra over integral domains, and some relevant general ideas how to compute with representations, to proceed to the explicit determination of Gram matrices of invariant bilinear forms for balanced representations of Iwahori–Hecke algebras. We recall the setting of Section 2.9, which we keep from now on:

Let $(W,S)$ be a finite Coxeter group, and let ${\mathscr{H}}_{A}\subseteq{\mathscr{H}}_{K}$ be the associated generic Iwahori–Hecke algebras with equal parameters over the ring $A={\mathbb{Z}}[v,v^{-1}]$ and the field $K={\mathbb{Q}}(v)$ , respectively, being generated by $\{T_{s};s\in S\}$ . Moreover, let ${\mathfrak{X}}^{\lambda}\colon{\mathscr{H}}_{K}\rightarrow K^{n\times n}$ , where $n=d_{\lambda}$ , be a $W$ -graph representation associated with $\lambda\in\Lambda$ , and let

[TABLE]

As far as computer implementations are concerned, it is more convenient and more efficient to work with row vectors instead of column vectors. Therefore, we will now work throughout with right actions rather than left actions as in Section 2. Our aim is to find a primitive Gram matrix $P\in{\mathbb{Z}}[v]^{n\times n}$ for ${\mathfrak{X}}^{\lambda}$ , that is, using the language of right actions, a primitive matrix such that

[TABLE]

Thus the task is to find a non-zero ${\mathscr{H}}_{K}$ -homomorphism from ${\mathfrak{X}}^{\lambda}$ to $({\mathfrak{X}}^{\lambda})^{\prime}$ . In order to use the approach described in Section 6.2, we proceed as follows, where the basic idea of this strategy has already been indicated in (gemu, , Section 4.3):

Finding seed vectors. To find a suitable seed vector $u_{1}\in K^{n}$ for the standard basis algorithm with respect to ${\mathfrak{X}}^{\lambda}$ , we proceed as follows:

Specializing $v\mapsto 1$ we from ${\mathscr{H}}_{A}$ recover the group algebra ${\mathbb{Q}}[W]$ , and ${\mathfrak{X}}^{\lambda}$ corresponds to an irreducible representation $\mathfrak{Y}^{\lambda}\colon{\mathbb{Q}}[W]\rightarrow{\mathbb{Q}}^{n\times n}$ . In particular, the index and sign representations of ${\mathscr{H}}_{K}$ , given by $\operatorname{ind}_{\mathscr{H}}\colon T_{s}\mapsto v$ and $\operatorname{sgn}_{\mathscr{H}}\colon T_{s}\mapsto-v^{-1}$ , respectively, for all $s\in S$ , correspond to the trivial and sign representations of ${\mathbb{Q}}[W]$ , given by $\operatorname{1}_{W}\colon s\mapsto 1$ and $\operatorname{sgn}_{W}\colon s\mapsto-1$ , respectively.

As was observed by Benson and Curtis (see (gepf, , Section 6.3) and the references there), there is a subset $J\subseteq S$ (depending on $\lambda$ , and in general not being unique), such that the restriction of $\mathfrak{Y}^{\lambda}$ to the parabolic subgroup $\widetilde{W}:=W_{J}\leqslant W$ associated with $J$ fulfills

[TABLE]

Note that $J=\emptyset$ and $J=S$ if and only if $\mathfrak{Y}^{\lambda}$ equals $1_{W}$ and $\operatorname{sgn}_{W}$ , respectively. Letting $\widetilde{\mathscr{H}}_{K}\subseteq{\mathscr{H}}_{K}$ be the parabolic subalgebra associated with $J$ , this implies

[TABLE]

In other words, we equivalently have

[TABLE]

Now we are going to use the fact that ${\mathfrak{X}}^{\lambda}$ is a $W$ -graph representation: Using the $I$ -sets associated with ${\mathfrak{X}}^{\lambda}$ , see Definition 2.8, we conclude that $\ker({\mathfrak{X}}^{\lambda}(T_{s}+v^{-1}))=\langle e_{i};s\in I_{i}\rangle_{K}$ for all $s\in S$ , where $e_{i}\in K^{n}$ denotes the $i$ -th “unit” vector. This implies

[TABLE]

Hence we may let $u_{1}:=e_{i}$ , where $1\leqslant i\leqslant n$ is the unique index such that $J\subseteq I_{i}$ .

Note that this conversely also yields a way to find all subsets of $S$ fulfilling the Benson–Curtis condition: We run through all subsets $J\subseteq S$ , and just check whether there is precisely one index $1\leqslant i\leqslant n$ such that $J\subseteq I_{i}$ .

Finding a distinguished element. The above immediate approach strongly uses the fact that ${\mathfrak{X}}^{\lambda}$ is a $W$ -graph representation. Thus, in order to find a suitable seed vector $u^{\prime}_{1}\in K^{n}$ for the standard basis algorithm with respect to $({\mathfrak{X}}^{\lambda})^{\prime}$ we specify a distinguished element $T^{\lambda}\in{\mathscr{H}}_{K}$ such that $\dim_{K}(\ker({\mathfrak{X}}^{\lambda}(T^{\lambda})))=1$ . Let

[TABLE]

Hence we have $\bigcap_{s\in J}\ker({\mathfrak{X}}^{\lambda}(T_{s}+v^{-1}))\leqslant\ker({\mathfrak{X}}^{\lambda}(T^{\lambda}))$ , and it remains to be shown that $\dim_{K}(\ker({\mathfrak{X}}^{\lambda}(T^{\lambda})))=1$ :

Assume to the contrary that $\dim_{K}(\ker\big{(}{\mathfrak{X}}^{\lambda}(T^{\lambda})))\geqslant 2$ . Then letting

[TABLE]

specializing $v\mapsto 1$ shows that $\dim_{{\mathbb{Q}}}(\ker(\mathfrak{Y}^{\lambda}(1+\sigma_{J})))\geqslant 2$ as well. Since for any vector $u\in\ker(\mathfrak{Y}^{\lambda}(1+\sigma_{J}))$ we have $u\cdot\mathfrak{Y}^{\lambda}(\sigma_{J}^{k})=(-1)^{k}\cdot u$ , for all $k\in{\mathbb{N}}_{0}$ , Lemma 7.3 proven below implies that $\langle u\rangle_{\mathbb{Q}}\leqslant K^{n}$ is ${\mathbb{Q}}[\widetilde{W}]$ -invariant and carries the sign representation. Thus we have $\dim_{\mathbb{Q}}({\operatorname{Hom}}_{{\mathbb{Q}}[\widetilde{W}]}(\operatorname{sgn}_{\widetilde{W}},\mathfrak{Y}^{\lambda}))\geqslant 2$ , a contradiction.

Lemma 7.3

For $\epsilon\in\{0,1\}$ let $W_{\epsilon}:=\{w\in W;\operatorname{sgn}(w)=(-1)^{\epsilon}\}$ . Moreover, let

[TABLE]

Then, with respect to the natural topology on ${\mathbb{Q}}[W]\cong{\mathbb{Q}}^{|W|}$ , we have

[TABLE]

Proof

We consider the Markov chain with (finite) state space $W=W_{0}\stackrel{{\scriptstyle.}}{{\cup}}W_{1}$ , and transition matrix $M=\operatorname{reg}_{W}(\sigma_{S})\in{\mathbb{Q}}^{|W|\times|W|}$ , where $\operatorname{reg}_{W}\colon{\mathbb{Q}}[W]\rightarrow{\mathbb{Q}}^{|W|\times|W|}$ denotes the regular matrix representation of ${\mathbb{Q}}[W]$ . In other words, the matrix entry $M_{w,w^{\prime}}$ , where $w,w^{\prime}\in W$ , is given as

[TABLE]

Now, since $\operatorname{sgn}(ws)=-\operatorname{sgn}(w)$ for all $w\in W$ and $s\in S$ , we conclude that $M^{2}=\operatorname{reg}_{W}(\sigma_{S}^{2})$ induces Markov chains on both $W_{0}$ and $W_{1}$ . Moreover, since any element of $W$ can be written as a word of length at most $l(w_{0})$ in the generators $S$ , we infer that $M^{2l(w_{0})}$ has positive entries in both the block submatrices belonging to $W_{0}$ and $W_{1}$ , respectively. Hence the induced Markov chains are both irreducible and aperiodic. They thus converge towards stationary distributions, which since $M$ is doubly-stochastic are both equal to the respective uniform distributions. Thus, in particular, the initial state $\sigma_{S}^{\epsilon}\in\langle W_{\epsilon}\rangle_{\mathbb{Q}}$ yields

[TABLE]

∎

Finding standard bases. The distinguished element $T^{\lambda}$ can now be used to find a primitive vector $u^{\prime}_{1}\in\ker(({\mathfrak{X}}^{\lambda})^{\prime}(T^{\lambda}))$ . Next, having both seed vectors $u_{1}$ and $u^{\prime}_{1}$ in place, we aim at computing the associated standard bases $\mathfrak{B}$ with respect to ${\mathfrak{X}}^{\lambda}$ , and $\mathfrak{B}^{\prime}$ with respect to $({\mathfrak{X}}^{\lambda})^{\prime}$ , for the $A$ -algebra generated by $\{vT_{s};s\in S\}$ . But since we do not have a standard basis algorithm available for representations over the field $K$ , we again use suitable specializations:

Given a place $0\neq b\in{\mathbb{Z}}$ , let $\mathfrak{Y}^{\lambda}_{b}\colon{\mathscr{H}}_{\mathbb{Q}}\rightarrow{\mathbb{Q}}^{n\times n}$ be the representation of ${\mathscr{H}}_{\mathbb{Q}}$ obtained by specializing $v\mapsto b$ , that is, considering ${\mathscr{H}}_{\mathbb{Q}}$ as the ${\mathbb{Q}}$ -algebra generated by $\{bT_{s};s\in S\}$ we have

[TABLE]

thus in particular for $b=1$ , identifying ${\mathscr{H}}_{\mathbb{Q}}$ with ${\mathbb{Q}}[W]$ , we recover $\mathfrak{Y}^{\lambda}_{1}=\mathfrak{Y}^{\lambda}$ .

Now we compare a putative run of the standard basis algorithm, as described in Section 6.1, with respect to the seed vector $u_{1}\in{\mathbb{Z}}[v]^{n}$ and the generators $\{{\mathfrak{X}}^{\lambda}(vT_{s})\in{\mathbb{Z}}[v]^{n\times n};s\in S\}$ , with a run with respect to the specialized seed vector $u_{1}(b)\in{\mathbb{Z}}^{n}$ and the generators $\{\mathfrak{Y}_{b}^{\lambda}(bT_{s})\in{\mathbb{Z}}^{n\times n};s\in S\}$ . These successively produce standard bases $\mathfrak{B}\subseteq{\mathbb{Z}}[v]^{n}$ and $\mathfrak{C}\subseteq{\mathbb{Z}}^{n}$ , respectively. We show by induction on the cardinality $0\leqslant m\leqslant n$ of the intermediate sets $\mathfrak{B}$ , that for all but finitely many $b$ the set $\mathfrak{C}$ is obtained by specializing $\mathfrak{B}$ , and that the Schreier trees found in both runs coincide:

Indeed, the key steps are to decide for some $w:=u\cdot{\mathfrak{X}}^{\lambda}(vT_{s})\in{\mathbb{Z}}[v]^{n}$ whether or not $w\in\langle\mathfrak{B}\rangle_{K}$ , and similarly for its specialization $w(b):=u(b)\cdot\mathfrak{Y}_{b}^{\lambda}(bT_{s})\in{\mathbb{Z}}^{n}$ whether or not $w(b)\in\langle\mathfrak{C}\rangle_{\mathbb{Q}}$ . Identifying $\mathfrak{B}$ and $\mathfrak{C}$ with matrices $B\in{\mathbb{Z}}[v]^{m\times n}$ and $C\in{\mathbb{Z}}^{m\times n}$ , respectively, we have $C=B(b)$ . Considering the matrix $B_{w}\in{\mathbb{Z}}[v]^{(m+1)\times n}$ obtained by concatenating $B$ and $w$ , we have $w\not\in\langle\mathfrak{B}\rangle_{K}$ if and only if there is an $((m+1)\times(m+1))$ -submatrix $B^{\prime}$ of $B_{w}$ such that $\det(B^{\prime}_{w})\neq 0$ . Similarly, we have $w(b)\not\in\langle\mathfrak{C}\rangle_{\mathbb{Q}}$ if and only if there is an $((m+1)\times(m+1))$ -submatrix $C^{\prime}$ of $C_{w(b)}=B_{w}(b)\in{\mathbb{Z}}^{(m+1)\times n}$ such that $\det(C^{\prime})\neq 0$ . Hence, whenever $w(b)\not\in\langle\mathfrak{C}\rangle_{\mathbb{Q}}$ we also have $w\not\in\langle\mathfrak{B}\rangle_{K}$ , and conversely for all but finitely many $b$ from $w\not\in\langle\mathfrak{B}\rangle_{K}$ we may conclude that $w(b)\not\in\langle\mathfrak{C}\rangle_{\mathbb{Q}}$ . (We have used a similar argument in Section 5.1.)

Thus assuming that $0\neq b\in{\mathbb{Z}}$ is suitably chosen, we may just run the standard basis algorithm for the seed vector $u_{1}(b)=u_{1}=e_{i}\in{\mathbb{Z}}^{n}$ , the $i$ -th “unit” vector, and the generators $\mathfrak{Y}_{b}^{\lambda}(bT_{s})\in{\mathbb{Z}}^{n\times n}$ , as described in Section 6.1, yielding a Schreier tree $\mathfrak{T}$ . Letting $w_{1}:=1\in W$ , and $w_{i}:=w_{j}\cdot s\in W$ , if $[j,s]$ is the $i$ -th entry in $\mathfrak{T}$ , for $2\leqslant i\leqslant n$ , we thus obtain reduced expressions of the elements $w_{i}\in W$ , and hence the number of steps needed to find the $i$ -th element of $\mathfrak{C}$ equals the length $l(w_{i})\in{\mathbb{N}}_{0}$ . (In practice, it turns out that choosing either $b=1$ or $b=2$ is sufficient, where actually almost always $b=1$ works.)

Applying the Schreier tree $\mathfrak{T}$ to $u_{1}$ and $\{{\mathfrak{X}}^{\lambda}(vT_{s});s\in S\}$ this yields a standard basis $\mathfrak{B}\subseteq{\mathbb{Z}}[v]^{n}$ of $K^{n}$ . Similarly, applying $\mathfrak{T}$ to $u^{\prime}_{1}\in{\mathbb{Z}}[v]^{n}$ and $\{({\mathfrak{X}}^{\lambda})^{\prime}(vT_{s})\in{\mathbb{Z}}[v]^{n\times n};s\in S\}$ we get a standard basis $\mathfrak{B}^{\prime}\subseteq{\mathbb{Z}}[v]^{n}$ of $K^{n}$ . But note that this does not ensure that the $A$ -lattices $\langle\mathfrak{B}\rangle_{A}$ and $\langle\mathfrak{B}^{\prime}\rangle_{A}$ are invariant under the $A$ -algebras generated by $\{{\mathfrak{X}}^{\lambda}(vT_{s});s\in S\}$ and $\{({\mathfrak{X}}^{\lambda})^{\prime}(vT_{s});s\in S\}$ , respectively. (In practice they are not, typically.)

8 Finding Gram matrices for $W$ -graph representations

We keep the setting of Section 7; in particular ${\mathfrak{X}}^{\lambda}$ still is a $W$ -graph representation. Having found standard bases $\mathfrak{B}$ and $\mathfrak{B}^{\prime}$ for ${\mathfrak{X}}^{\lambda}$ and $({\mathfrak{X}}^{\lambda})^{\prime}$ , respectively, we proceed by writing them as matrices $B\in{\mathbb{Z}}[v]^{n\times n}$ and $B^{\prime}\in{\mathbb{Z}}[v]^{n\times n}$ , respectively, where by construction both $B$ and $B^{\prime}$ are primitive. In order to complete the final task of computing the product $B^{-1}\cdot B^{\prime}\in{\mathbb{Z}}[v]^{n\times n}$ efficiently, we need a few preparations.

Palindromicity. Let $\ast\colon K\rightarrow K$ be the involutory field automorphism given by $\ast\colon v\mapsto v^{-1}$ . Hence $A$ is $\ast$ -invariant, and by entry-wise application we get involutory module automorphisms on $K^{n}$ and $A^{n}$ , and algebra automorphisms on $K^{n\times n}$ and $A^{n\times n}$ , all of which will also be denoted by $\ast$ .

A polynomial $0\neq f\in{\mathbb{Z}}[v]$ is called ( $k$ -)palindromic, for some $k\in{\mathbb{N}}_{0}$ , if $v^{k}\cdot f^{\ast}=f\in A$ , and $f$ is called ( $k$ -)skew-palindromic if $v^{k}\cdot f^{\ast}=-f\in A$ . In these cases, letting $\delta(f)\in{\mathbb{N}}_{0}$ be the maximum power of $v$ dividing $f$ in ${\mathbb{Z}}[v]$ , we have $k=\delta(f)+\deg(f)$ . Hence $f$ is palindromic or skew-palindromic if and only if $f\in{\mathbb{Z}}[v]$ and $f^{\ast}\in{\mathbb{Z}}[v^{-1}]$ are associated in $A$ . Moreover, if $f$ is $k$ -skew-palindromic, then specializing $v\mapsto 1$ we get $f(1)=-f(1)$ , implying that $v-1$ divides $f$ in ${\mathbb{Z}}[v]$ ; similarly, if $f$ is $k$ -palindromic, then specializing $v\mapsto-1$ we get $(-1)^{k}\cdot f(-1)=f(-1)$ , implying that $k$ is even, or $v+1$ divides $f$ in ${\mathbb{Z}}[v]$ .

Proposition 8.2

(a) Let $P\in{\mathbb{Z}}[v]^{n\times n}$ be a primitive Gram matrix for ${\mathfrak{X}}^{\lambda}$ . Then we have $v^{m}\cdot P^{\ast}=P$ , where $m=m_{P}\in{\mathbb{N}}$ is even and coincides with the maximum of the degrees of the non-zero entries of $P$ .

(b) For the primitive seed vector $u^{\prime}_{1}\in{\mathbb{Z}}[v]^{n}$ we have $v^{m}\cdot(u^{\prime}_{1})^{\ast}=u^{\prime}_{1}$ , where $m=m_{u^{\prime}_{1}}\in{\mathbb{N}}_{0}$ is even and coincides with the maximum of the degrees of the non-zero entries of $u^{\prime}_{1}$ . (Trivially, the analogous statement holds for $u_{1}\in{\mathbb{Z}}[v]^{n}$ with $m_{u_{1}}=0$ .)

Proof

Letting $E_{n}\in A^{n\times n}$ be the identity matrix, by Definition 2.8 for $s\in S$ we have

[TABLE]

In particular, this yields

[TABLE]

(a) We consider the matrix $P^{\ast}\in{\mathbb{Z}}[v^{-1}]^{n\times n}$ : For all $s\in S$ we have

[TABLE]

Now $m=m_{P}\in{\mathbb{N}}$ as above is minimal such that $v^{m}P^{\ast}\in{\mathbb{Z}}[v]^{n\times n}$ , hence we infer that $v^{m}P^{\ast}$ is a primitive Gram matrix for ${\mathfrak{X}}^{\lambda}$ as well, and thus we have $v^{m}P^{\ast}=P$ or $v^{m}P^{\ast}=-P$ . Assume the latter case holds, then all non-zero entries of $P$ are $m$ -skew-palindromic, implying that $v-1$ divides $\gcd(P)$ , contradicting the primitivity of $P$ . Hence we have $v^{m}P^{\ast}=P$ , that is all non-zero entries of $P$ are $m$ -palindromic. Assume that $m$ is odd, then we infer that $v+1$ divides $\gcd(P)$ , again contradicting the primitivity of $P$ . Hence $m$ is even.

(b) We consider the vector $(u^{\prime}_{1})^{\ast}\in{\mathbb{Z}}[v^{-1}]^{n}$ : We have

[TABLE]

Now $m=m_{u^{\prime}_{1}}\in{\mathbb{N}}_{0}$ as above is minimal such that $v^{m}\cdot(u^{\prime}_{1})^{\ast}\in{\mathbb{Z}}[v]^{n}$ , hence we infer that $v^{m}\cdot(u^{\prime}_{1})^{\ast}$ is primitive. Thus from $\dim_{K}(\ker(({\mathfrak{X}}^{\lambda})^{\prime}(T^{\lambda})))=1$ we conclude that $v^{m}\cdot(u^{\prime}_{1})^{\ast}=u^{\prime}_{1}$ or $v^{m}\cdot(u^{\prime}_{1})^{\ast}=-u^{\prime}_{1}$ . Now we argue as above. ∎

Properties of the standard bases. We have a closer look at the standard bases $\mathfrak{B}$ and $\mathfrak{B}^{\prime}$ , and the associated matrices $B$ and $B^{\prime}$ , where we assume $\mathfrak{B}$ to be chosen according to Section 7.4. The facts collected are largely due to experimental observation, and will be helpful in the final computational steps in Section 8.4. Still, these properties seem to be stronger than expected from general principles, and it should be worth-while to try and prove the particular observations specified below. (In particular, we have checked the standard bases associated with all subsets $J\subseteq S$ fulfilling the Benson–Curtis condition, see Section 7.1, for the types $E_{6}$ , $E_{7}$ and $E_{8}$ .)

Recall that for all $s\in S$ we have

[TABLE]

hence by the proof of Proposition 8.2 we get

[TABLE]

The elements of $\mathfrak{B}$ . For any $u_{i}\in\mathfrak{B}$ , where $2\leqslant i\leqslant n$ , we have $u_{i}=u_{j}\cdot{\mathfrak{X}}^{\lambda}(vT_{s})$ , for some $1\leqslant j<i$ and $s\in S$ . This yields

[TABLE]

We conclude that $\gcd(u_{i})\in{\mathbb{Z}}[v]$ and $\gcd(u_{j})\in{\mathbb{Z}}[v]$ are associated in $A$ . Hence by recursion, since $u_{1}$ is primitive, we infer that $\gcd(u_{i})=v^{d_{i}}\in{\mathbb{Z}}[v]$ for some $d_{i}\in{\mathbb{N}}_{0}$ .

Moreover, we have $d_{j}\leqslant d_{i}\leqslant d_{j}+2$ . Since $d_{1}=0=l(w_{1})$ , this implies $d_{i}\leqslant 2l(w_{i})$ for all $1\leqslant i\leqslant n$ , where $w_{i}\in W$ is as in Section 7.4. (Experiments show that all three cases $d_{i}\in\{d_{j},d_{j}+1,d_{j}+2\}$ actually occur.) But the growth behavior of the $d_{i}$ seems to be more restricted than given by these bounds: Considering the case $l(w_{i})=1$ , we have $w_{i}=s$ for some $s\in S$ such that the “unit” vector $u_{1}$ is not an eigenvector of $T_{s}$ , hence using the shape of ${\mathfrak{X}}^{\lambda}(vT_{s})$ we conclude that $d_{i}=1=l(w_{i})$ .

Now, experimentally, we have made the following

Observation 1

We have $d_{i}\leqslant l(w_{i})+1$ , for all $1\leqslant i\leqslant n$ .

(Actually, almost always we have got $d_{i}\leqslant l(w_{i})$ , for all $1\leqslant i\leqslant n$ , where often we have even seen equality throughout; the only cases found where actually $d_{i}=l(w_{i})+1$ , for some $i$ , are for type $E_{8}$ , the representation labeled by $3200_{x}$ , and two out of the twelve Benson–Curtis subsets of generators.)

The matrix $B$ . Letting $1\leqslant j<i\leqslant n$ and $s\in S$ be as above, we get

[TABLE]

Since the standard basis algorithm is a breadth-first search, from $u_{1}^{\ast}=u_{1}$ we conclude that there is lower unitriangular matrix $U\in K^{n\times n}$ and a diagonal matrix $D=\operatorname{diag}[v^{2l(w_{1})},\ldots,v^{2l(w_{n})}]\in{\mathbb{Z}}[v]^{n\times n}$ , such that

[TABLE]

(Note that if the $A$ -lattice $\langle\mathfrak{B}\rangle_{A}$ was invariant under the $A$ -algebra generated by $\{{\mathfrak{X}}^{\lambda}(vT_{s});s\in S\}$ , then we even had $U\in A^{n\times n}$ .)

In particular, letting $l:=\sum_{i=1}^{n}l(w_{i})\in{\mathbb{N}}_{0}$ , we infer that

[TABLE]

hence $\det(B)\in{\mathbb{Z}}[v]$ is palindromic. Letting $\exp(B)\in{\mathbb{Z}}[v]$ denote the exponent of $B$ in the sense of Section 5.3, it follows from Proposition 5.4 that the non-constant irreducible polynomials dividing $\det(B)$ are precisely those dividing $\exp(B)$ . Now, experimentally, we have made the following

Observation 2

Any irreducible divisor of $\exp(B)$ in ${\mathbb{Z}}[v]$ is monic and palindromic.

(Actually, in general the entries of the matrix $B$ are neither palindromic nor skew-palindromic; moreover, quite often $\exp(B)$ is a product of cyclotomic polynomials, but this does not always happen.)

In particular, if $\widehat{u}_{k}^{\operatorname{tr}}\in{\mathbb{Z}}[v]^{1\times n}$ denotes the $k$ -th column of $B$ , for $1\leqslant k\leqslant n$ , then $\gcd(\widehat{u}_{k})\in{\mathbb{Z}}[v]$ divides $\det(B)$ , hence $\gcd(\widehat{u}_{k})$ is palindromic as well. (Actually, contrary to $\gcd(u_{k})=v^{d_{k}}$ , in general the $\gcd(\widehat{u}_{k})$ are not just powers of $v$ .)

The elements of $\mathfrak{B}^{\prime}$ . The recursion used in the standard basis algorithm only depends on the Schreier tree $\mathfrak{T}$ , but is independent of the representation considered. Hence for $u^{\prime}_{i}\in\mathfrak{B}^{\prime}$ , where $1\leqslant i\leqslant n$ , and $u^{\prime}_{1}$ is primitive, we get $\gcd(u^{\prime}_{i})=v^{d^{\prime}_{i}}\in{\mathbb{Z}}[v]$ for some $d^{\prime}_{i}\in{\mathbb{N}}_{0}$ . Moreover, if $1\leqslant j<i\leqslant n$ and $s\in S$ are as above, we get $d^{\prime}_{j}\leqslant d^{\prime}_{i}\leqslant d^{\prime}_{j}+2$ and $d^{\prime}_{i}\leqslant 2l(w_{i})$ . Actually, the $d^{\prime}_{i}$ seem to be closely related to the $d_{i}$ from above, inasmuch experimentally we have made the following

Observation 3

We have $d^{\prime}_{i}=d_{i}$ , for all $1\leqslant i\leqslant n$ .

The matrix $B^{\prime}$ . Again by the fact that the recursion used in the standard basis algorithm only depends on $\mathfrak{T}$ , and using $v^{m}\cdot(u^{\prime}_{1})^{\ast}=u^{\prime}_{1}$ , where $m=m_{u^{\prime}_{1}}\in{\mathbb{N}}_{0}$ is as in Proposition 8.2, we get

[TABLE]

for the same matrices $U$ and $D$ . In particular, it follows that $\det(B^{\prime})$ is palindromic. (In general neither $\det(B^{\prime})$ and $\det(B)$ , nor $\exp(B^{\prime})$ and $\exp(B)$ are associated in $A$ , so that $\langle\mathfrak{B}\rangle_{A}$ and $\langle\mathfrak{B}^{\prime}\rangle_{A}$ are inequivalent $A$ -sublattices of $A^{n}$ , which typically are not included in each other.) Again, experimentally we have made the following

Observation 4

Any irreducible divisor of $\exp(B^{\prime})$ in ${\mathbb{Z}}[v]$ is monic and palindromic.

In particular, similarly, if $\widehat{u}_{k}^{\prime\operatorname{tr}}\in{\mathbb{Z}}[v]^{1\times n}$ denotes the $k$ -th column of $B^{\prime}$ , for $1\leqslant k\leqslant n$ , then $\gcd(\widehat{u}_{k}^{\prime})\in{\mathbb{Z}}[v]$ is palindromic.

The product $B^{-1}\cdot B^{\prime}$ . In combination the above yields

[TABLE]

Hence the non-zero entries of $B^{-1}\cdot B^{\prime}$ are palindromic.

Letting $0\neq b\in{\mathbb{Z}}$ and $\widehat{B}\in{\mathbb{Z}}[v]^{n\times n}$ primitive such that $B^{-1}=\frac{1}{b\cdot\exp(B)}\cdot\widehat{B}$ , we get

[TABLE]

where $P\in{\mathbb{Z}}[v]^{n\times n}$ is a primitive Gram matrix, and $0\neq c\in{\mathbb{Z}}[v]$ . In particular, since by Observation 2 the exponent $\exp(B)$ is palindromic, we conclude that the non-zero entries of $\widehat{B}\cdot B^{\prime}$ are palindromic as well.

Moreover, letting $\widetilde{m}=m_{\exp(B)}\in{\mathbb{N}}_{0}$ such that $v^{\widetilde{m}}\cdot\exp(B)^{\ast}=\exp(B)$ , we get

[TABLE]

Hence from $v^{m_{P}}\cdot P^{\ast}=P$ , where $m_{P}\in{\mathbb{N}}_{0}$ is as in Proposition 8.2, we get

[TABLE]

providing an upper bound on the degrees of the non-zero entries of $P$ .

The final product. We are now prepared to do the last computational steps. To do so, we could quite straightforwardly compute first the inverse $B^{-1}$ , that is essentially $\widehat{B}$ , and then the product $\widehat{B}\cdot B^{\prime}$ . But it will substantially add to the efficiency if we keep the degrees of the non-zero entries of the matrices involved as small as possible. Now we have already observed above that the rows of $B$ and $B^{\prime}$ are far from being primitive, and it turns out in practice that this also holds for their columns. We take advantage of this as follows:

Keeping the notation of Section 8.3, let $R:=\operatorname{diag}[v^{d_{1}},\ldots,v^{d_{n}}]\in{\mathbb{Z}}[v]^{n\times n}$ . Then the rows of $R^{-1}\cdot B\in{\mathbb{Z}}[v]^{n\times n}$ are primitive. As for its columns, letting $\widetilde{u}_{k}^{\operatorname{tr}}\in{\mathbb{Z}}[v]^{1\times n}$ denote the $k$ -th column of $R^{-1}\cdot B$ , for $1\leqslant k\leqslant n$ , let

[TABLE]

Since by Observation 2 the polynomial $\gcd(\widehat{u}_{k})$ is palindromic, using the particular form of $R$ , we conclude that the $\gcd(\widetilde{u}_{k})$ are palindromic as well. We let $0\neq\widehat{c}\in{\mathbb{Z}}[v]$ and $\widehat{C}\in{\mathbb{Z}}[v]^{n\times n}$ be primitive such that $C^{-1}=\frac{1}{\widehat{c}}\cdot\widehat{C}$ . The latter are of course straightforwardly computed, where both $\widehat{c}$ and the diagonal entries of $\widehat{C}$ are palindromic.

Then we get $\widetilde{B}\in{\mathbb{Z}}[v]^{n\times n}$ such that $B=R\cdot\widetilde{B}\cdot C$ , where now all the rows and all the columns of $\widetilde{B}$ are primitive. We use the algorithm in Section 5.2 to compute $0\neq\widehat{b}\in{\mathbb{Z}}[v]$ and $\widehat{B}\in{\mathbb{Z}}[v]^{n\times n}$ primitive such that $\widetilde{B}^{-1}=\frac{1}{\widehat{b}}\cdot\widehat{B}$ , Since by Observation 2 the exponent $\exp(B)$ is palindromic, using the particular form of $R$ and $C$ , we conclude that $\widehat{b}$ is palindromic as well. Thus altogether we have

[TABLE]

Similarly, let $R^{\prime}:=\operatorname{diag}[v^{d^{\prime}_{1}},\ldots,v^{d^{\prime}_{n}}]\in{\mathbb{Z}}[v]^{n\times n}$ and

[TABLE]

where $\widetilde{u}_{k}^{\prime\operatorname{tr}}\in{\mathbb{Z}}[v]^{1\times n}$ denotes the $k$ -th column of $(R^{\prime})^{-1}\cdot B^{\prime}$ , for $1\leqslant k\leqslant n$ . As above, using Observation 4 implying the palindromicity of $\gcd(\widehat{u}^{\prime}_{k})$ , we conclude that the diagonal entries of $C^{\prime}$ are palindromic as well, and thus those of $(C^{\prime})^{-1}$ are too. Then we get $\widetilde{B}^{\prime}\in{\mathbb{Z}}[v]^{n\times n}$ such that $B^{\prime}=R^{\prime}\cdot\widetilde{B}^{\prime}\cdot C^{\prime}$ , where now all the rows and all the columns of $\widetilde{B}$ are primitive.

In combination this yields

[TABLE]

By the above considerations we conclude that the non-zero entries of $Q$ are palindromic, which entails that those of $\widehat{B}\cdot R^{-1}\cdot R^{\prime}\cdot\widetilde{B}^{\prime}$ are as well. Now by Observation 3 we have $R^{\prime}=R$ , hence this simplifies to

[TABLE]

where the non-zero entries of $\widehat{B}\cdot\widetilde{B}^{\prime}\in{\mathbb{Z}}[v]^{n\times n}$ are palindromic.

In practice. To find $Q$ , finally, we apply the matrix multiplication algorithm in Section 5.5 to compute the product $\widehat{B}\cdot\widetilde{B}^{\prime}$ . As was already mentioned, in order to apply it efficiently we need good break conditions to discard erroneous guesses quickly: Apart from requiring that rational number recovery, see Section 3.4, returns only integral coefficients but not rational ones, it turns out that checking for palindromicity is highly effective in this respect.

Having found a good candidate for $\widehat{B}\cdot\widetilde{B}^{\prime}\in{\mathbb{Z}}[v]^{n\times n}$ , multiplying with the diagonal matrices $\widehat{C}\in{\mathbb{Z}}[v]^{n\times n}$ and $C^{\prime}\in{\mathbb{Z}}[v]^{n\times n}$ is straightforward. Note that, since the result is expected to be a symmetric matrix, it is sufficient to compute only the lower triangular half of the product. Thus we get a candidate for a primitive Gram matrix $P$ from $Q=\gcd(Q)\cdot P\in{\mathbb{Z}}[v]^{n\times n}$ . (In many cases $Q$ already is primitive, but this does not happen always, in which cases $\gcd(Q)$ typically has a smallish degree.)

As independent verification we of course just explicitly check whether the candidate $P$ fulfills the condition

[TABLE]

9 Timings

We conclude by providing running times and workspace requirements for our computations in types $E_{7}$ and $E_{8}$ , and by presenting an explicit example for type $E_{6}$ .

Timings. In Table 2, we give the running time (on a single processor running at a clock speed of $3.5\textrm{GHz}$ ) and GAP workspace requirements needed to compute primitive Gram matrices for types $E_{7}$ and $E_{8}$ , and the irreducible $W$ -graph representations of ${\mathscr{H}}_{K}$ given in How , HowYin . The figures for $E_{7}$ should be compared with those given in Section 2.9 for the approach used there. Recalling that in (gemu, , Remark 4.10) degree $2500$ was the limit of feasibility, in Table 3 we present the resources now needed for the individual representations of degree at least $2500$ , where for comparison we repeat the first three columns of the relevant part of Table LABEL:Mmaxd.

Finally, in Table 4 we give some details about the various steps in the computation for the unique representation of largest degree, which is labeled by $7168_{w}$ . In the two last columns we indicate the actual size of the object under consideration in the GAP workspace, and the disc space needed to store it (as an uncompressed text file), respectively; the difference is accounted for by the space consumption of the data structure we are using within GAP, where matrices with polynomial entries are kept as lists of lists of (short) lists of (small long) integers. In particular, in the workspace needed to compute the product, next to the matrices $\widehat{B}$ and $\widetilde{B}^{\prime}$ and (the lower triangular half of) the product $\widehat{B}\cdot\widetilde{B}^{\prime}$ , we also keep various specializations of the right hand factor $\widetilde{B}^{\prime}$ , which have a cumulative size of $7.1\textrm{GB}$ . Hence to compute a primitive Gram matrix for the representation labeled by $7168_{w}$ we need a running time of $1183\textrm{min}\sim 20\textrm{h}$ and a workspace of size $31.5\textrm{GB}$ .

An explicit example. We conclude by revisiting the (tiny) example already presented in (gemu, , Example 4.9) (which of course in practice runs in a fraction of a second): Let $W$ be of type $E_{6}$ with Dynkin diagram

$s_{1}$$s_{3}$$s_{4}$$s_{2}$$s_{5}$$s_{6}$

We consider the irreducible $W$ -graph representation of ${\mathscr{H}}_{K}$ , see Naruse0 , labeled by the representation $10_{s}$ of ${\mathbb{Q}}[W]$ , which is the unique one of degree $10$ , see Table LABEL:Mmaxd0. The $W$ -graph in question is depicted in (gemu, , Example 4.9), hence we do not repeat it here. But to illustrate the shape, and in particular the sparseness of the representing matrices for the generators $vT_{s_{1}},\ldots,vT_{s_{6}}$ we present a few of them:

[TABLE]

As it turns out, there are $22$ possible choices of a distinguished subset $J\subseteq S$ . We choose $J:=\{s_{1},s_{2},s_{3},s_{5},s_{6}\}$ , in accordance with (gepf, , Table C.4). Then associated primitive seed vectors $u_{1}$ and $u^{\prime}_{1}$ are as given below, in the first row of the matrices $B$ and $\widetilde{B}^{\prime}$ , respectively. Running the standard basis algorithm on the specialization of the above $W$ -graph representation with respect to $v\mapsto 1$ yields the following Schreier tree $\mathfrak{T}$ , which we depict as an oriented graph, whose vertices $1,\ldots,10$ correspond to the vectors in the (ordered) standard bases, and where an arrow from vertex $j$ to vertex $i$ with label $s_{k}$ says that $[j,s_{k}]$ is the $i$ -th entry of $\mathfrak{T}$ :

$1$$s_{4}$$2$$s_{2}$$5$$s_{5}$$4$$s_{3}$$8$$s_{5}$$3$$s_{3}$$7$$s_{5}$$6$$s_{5}$$9$$s_{4}$$10$

We find the standard basis $\mathfrak{B}$ with associated matrix $B$ as shown below. (It is not always the case that the entries of $B$ are only monomials.) Hence we have $R=\operatorname{diag}[v^{d_{1}},\ldots,v^{d_{10}}]$ , where $[d_{1},\ldots,d_{10}]=[0,1,2,2,2,3,3,3,4,5]=[l(w_{1}),\ldots,l(w_{10})]$ , and $C$ is the identity matrix. Thus we get the matrix $\widetilde{B}$ , and from that $\widehat{b}=1$ and the matrix $\widehat{B}$ as also shown below. Note that the entries of $\widehat{B}$ are not necessarily palindromic or skew-palindromic, and that the maximum degree of the non-zero entries of $B$ , $\widetilde{B}$ and $\widehat{B}$ equals $8$ , $3$ and $5$ , respectively:

[TABLE]

Similarly, we find the standard basis $\mathfrak{B}^{\prime}$ with associated matrix $B^{\prime}$ . As it turns out we indeed have $R^{\prime}=R$ , and $C^{\prime}$ is the identity matrix. This yields the matrix $\widetilde{B}^{\prime}$ as shown below. Note that the entries of $\widetilde{B}^{\prime}$ are not necessarily palindromic or skew-palindromic, and that the maximum degree of the non-zero entries of $\widetilde{B}^{\prime}$ is $9$ :

[TABLE]

From this we get $Q=\widehat{B}\cdot\widetilde{B}^{\prime}$ . As it turns out we already have $\gcd(Q)=1$ , thus we may let $P=-Q$ be as shown below. Indeed, independent verification shows that $P$ is a primitive Gram matrix as desired, coinciding with the one already given in (gemu, , Example 4.9). Note that indeed $P$ is a completely dense matrix, all of whose entries are $6$ -palindromic, where the maximum degree occurring is $6$ , and that in accordance with Table LABEL:Mmaxd0 the largest coefficient occurring has absolute value $3$ , and that the specialization $v\mapsto 0$ yields the identity matrix:

[TABLE]

Bibliography26

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1(1) Cohen, H.: A course in computational algebraic number theory. Graduate Texts in Mathematics 138 . Springer-Verlag, 1993.
2(2) Davenport, J., Guy, M., Wang, P.: P 𝑃 P -adic reconstruction of rational numbers. SIGSAM Bulletin 16 (2) (1982), 2–33.
3(3) Dixon, J.: Exact solution of linear equations using p 𝑝 p -adic expansions. Numer. Math. 40 (1982), 137–141.
4(4) The GAP Group: GAP — Groups, Algorithms, Programming — A System for Computational Discrete Algebra. Version 4.8.5 (2016). http://www.gap-system.org .
5(5) von zur Gathen, J., Gerhard, J.: Modern computer algebra. Third edition. Cambridge University Press, 2013.
6(6) Garibaldi, S.: E 8 subscript 𝐸 8 E_{8} , the most exceptional group. Bull. Amer. Math. Soc. 53 (2016), 643–671.
7(7) Geck, M.: Leading coefficients and cellular bases of Hecke algebras. Proc. Edinburgh Math. Soc. 52 (2009), 653–677.
8(8) Geck, M., Halls, A,: On the Kazhdan–Lusztig cells in type E 8 subscript 𝐸 8 E_{8} . Math. Comp. 84 (2015), 3029–3049.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Invariant bilinear forms on

Abstract

1 Introduction

2 Iwahori–Hecke algebras and balanced representations

Proposition 2.5 (See (myedin, , Prop. 4.3, Remark 4.4))

Remark 2.6

Remark 2.7

Definition 2.8

Theorem 2.10

3 Linear algebra over the integers

Lemma 3.2

Proof

Proposition 3.3

Proof

4 Computing with polynomials

Proposition 4.6

Proof

5 Linear algebra over polynomial rings

Proposition 5.4

Proof

6 Computing with representations

7 Finding standard bases for WWW-graph representations

Lemma 7.3

Proof

8 Finding Gram matrices for WWW-graph representations

Proposition 8.2

Proof

Observation 1

Observation 2

Observation 3

Observation 4

9 Timings

7 Finding standard bases for $W$ -graph representations

8 Finding Gram matrices for $W$ -graph representations