Algorithms for arithmetic groups with the congruence subgroup property
A. S. Detinko, D. L. Flannery, A. Hulpke

TL;DR
This paper introduces practical algorithms for computing with arithmetic groups in SL(n,Q) for n>2, including membership testing, subgroup analysis, and orbit-stabilizer problems, implemented in GAP.
Contribution
It presents new methods for effective computation with arithmetic groups with the congruence subgroup property, focusing on constructing principal congruence subgroups.
Findings
Algorithms successfully tested in GAP
Effective membership testing developed
Subnormal structure analysis enabled
Abstract
We develop practical techniques to compute with arithmetic groups for . Our approach relies on constructing a principal congruence subgroup in . Problems solved include testing membership in , analyzing the subnormal structure of , and the orbit-stabilizer problem for . Effective computation with subgroups of is vital to this work. All algorithms have been implemented in GAP.
| Group | gens | Index in | Time | |||
|---|---|---|---|---|---|---|
| Group | Time | ||||
|---|---|---|---|---|---|
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinite Group Theory Research · Coding theory and cryptography · Advanced Algebra and Geometry
Algorithms for arithmetic
groups with the congruence subgroup property
A. S. Detinko
,
D. L. Flannery
and
A. Hulpke
Abstract.
We develop practical techniques to compute with arithmetic groups for . Our approach relies on constructing a principal congruence subgroup in . Problems solved include testing membership in , analyzing the subnormal structure of , and the orbit-stabilizer problem for . Effective computation with subgroups of is vital to this work. All algorithms have been implemented in GAP.
In [8, 9, 10] we established methods for computing with finitely generated linear groups over an infinite field, based on the use of congruence homomorphisms. These have been applied to test virtual solvability and answer questions about solvable-by-finite (SF) linear groups.
Computing with finitely generated linear groups that are not SF is a largely unexplored topic. These groups comprise a wide class in which certain algorithmic problems are undecidable [6, Section 3]. We might be more confident of progress if we restrict ourselves to arithmetic subgroups of linear algebraic groups. Decision problems for such groups were investigated by Grunewald and Segal [17]; see also [7]. We note renewed activity focussed on deciding arithmeticity [32].
This paper is a revision of [11], which provides a starting point for computation with semisimple arithmetic groups that have the congruence subgroup property (CSP). A prominent example is , . Recall that is arithmetic if has finite index in both and (in particular, finite index subgroups of are arithmetic). Each arithmetic group contains a principal congruence subgroup for some , namely the kernel of the congruence homomorphism induced by natural surjection [3, 27]. So if we know that then we can transfer much of the computing to , for which efficient machinery is available [20]. We give a method to construct in . This implies that membership testing and other fundamental problems are decidable.
We pay special attention to subnormality and the orbit-stabilizer problem. Aside from their computational importance, these were the earliest questions considered for arithmetic groups. The study of subnormal subgroups of originated in the late 19th century and led up to formulation of the Congruence Subgroup Problem. In turn, the solution of that problem used knowledge of -orbits in [21, §17].
The paper is organized as follows. Section 1 covers background on arithmetic groups: basic facts; material about principal congruence subgroups (their generating sets, maximality); and subnormal structure. Section 2 details relevant theory of matrix groups over and computing in . In Section 3 we give a suite of algorithms for arithmetic groups in . After verifying decidability, we describe computing a maximal principal congruence subgroup; membership testing; and aspects of subnormality, e.g., testing whether an arithmetic group is subnormal or normal, and constructing the normal closure of a subgroup of . In Section 4 we solve the orbit-stabilizer problem for arithmetic groups in acting on . Our solution draws on a comprehensive description of orbits and stabilizers for a principal congruence subgroup acting on . Section 5 shows how to extend results from to . Finally, we examine the performance of our GAP [16] implementation of the algorithms.
We remark that the scope of this paper may be widened to other groups with the CSP, such as or for and , where is the ring of integers of a number field that is not totally imaginary [3].
1. Arithmetic subgroups of : background
1.1. Preliminaries
Let be a commutative ring with , and be an ideal. The natural surjection induces a congruence homomorphism . Let and . The kernel of on or is a principal congruence subgroup (PCS) of level . Such a subgroup of will be denoted . We set . If then for some non-negative integer , and the subscript ‘’ is replaced by ‘’.
For computational purposes, and should be finitely generated, and proper quotients of should be finite. The latter is true if or is the univariate polynomial ring over the finite field of size . These are two major types of ambient ring encountered when computing with finitely generated linear groups.
Define , where has in position and zeros everywhere else. The matrices for distinct , are transvections. The subgroup
[TABLE]
of is the elementary group of level . We write , , for , , respectively.
Lemma 1.1**.**
- (i)
For all , .
- (ii)
If are pairwise distinct then and .
- (iii)
If and then commutes with .
Proposition 1.2**.**
In each of the following situations, : (i) and is Euclidean or semi-local; (ii) and is a Hasse domain of a global field.
Proof.
See [19, 4.3.9, pp. 172–173]. ∎
Remark 1.3*.*
is a Hasse domain of a global field, is Euclidean, and is semi-local.
Proposition 1.2 implies that maps onto . However, may not be surjective.
Proposition 1.4**.**
Let or . If or then , , and are finitely generated. None of the groups , , or is finitely generated when .
Proof.
If then is finitely generated by [19, 4.3.11, p. 174]; hence so too is , by [19, 1.2.17, p. 29] and Dirichlet’s unit theorem. See [19, 4.3.16, p. 175] and subsequent comments for the remaining claims. ∎
The notation means that is of finite index in the group . For , has the congruence subgroup property: is equivalent to containing some [3, 27]. On the other hand, does not have the CSP [35, §1.1].
1.2. Generators of congruence subgroups
Let . We first discuss generating sets for and , and thus for their homomorphic images , .
By Lemma 1.1 (ii), the transvections constitute a generating set for . In fact has a generating set of minimal size : and
[TABLE]
see [31, p. 107]. Adding the diagonal matrix produces a generating set for of size (better still, it is known that is -generated). Similarly, two generators of , together with all diagonal matrices as runs over a generating set for the unit group of , generate . If or an odd prime power then is -generated. For all , is -generated, and is -generated.
The normal closure of in is denoted . Let be the permutation matrix obtained by swapping rows and of .
Lemma 1.5**.**
For any , .
Proof.
Put . We prove that for all . By Lemma 1.1 (ii),
[TABLE]
so . Then if . Since where with in position , this concludes the proof. ∎
Proposition 1.6**.**
If and then (hence ).
Proof.
Remark 1.7*.*
For , is not normal in .
Remark 1.8*.*
m_{2}\hskip 0.56905pt\big{|}\hskip 0.56905ptm_{1}.
A PCS in for is the image under of a PCS in .
Corollary 1.9**.**
Let be an ideal of , so for some divisor of . If then the kernel of on is
[TABLE]
Furthermore, for any and .
Proposition 1.10**.**
If then has generating set
[TABLE]
where
[TABLE]
Proof.
See [36]. ∎
We emphasize that the number of generators in (1) does not depend on . The minimal size of a generating set for is unknown. However, by Lemma 2.10 below, this size can be no less than . As Professor A. Lubotzky has pointed out to us, [33, Theorem 1] and Lemma 2.10 imply that has a generating set of size . In [23] it is conjectured that for contains a -generator subgroup of finite index (cf. [22, p. 412]). This conjecture has been settled affirmatively (see [26]), so is -generated.
Let denote the minimal size of a generating set of . Although can be arbitrarily large [36, pp. 355–356], we have
Lemma 1.11**.**
Suppose that and . Then is bounded above by a function of , only.
Proof.
This is clear from Proposition 1.10 and the fact that . ∎
1.3. Constructing a PCS in an arithmetic subgroup
Let . Our overall strategy rests on knowing some in the arithmetic group . We show that such a PCS can always be constructed.
Proposition 1.12**.**
, so is finite.
Proof.
Let and . Then is generated by the for and their conjugates as in Proposition 1.10. Our goal is to prove that these all lie in , i.e., that they can be expressed as words in the . Since where and , it suffices to look at conjugation by for . Furthermore, if then and commute: thus it suffices to consider conjugation of by , , , .
First we suppose that the conjugating element has subscript or . For and ,
[TABLE]
If we have
[TABLE]
For ,
[TABLE]
while and commute.
Now suppose that the subscript of the conjugating element is or . For ,
[TABLE]
If then and (2) applies.
If then
[TABLE]
and if , again as noted above, and commute. ∎
The group has a (finite) presentation where consists of all commutator relations , from Lemma 1.1 (ii) and (iii), with a single extra relation [29, Corollary 10.3].
Lemma 1.13**.**
Given we can find an elementary group in .
Proof.
Express each generator of as a product of transvections (for which see, e.g., [21, p. 99]). Then the Todd-Coxeter procedure with input and terminates, returning . So for all , and known we have ( say). Hence . ∎
Using Proposition 1.12, we rescue one item (slightly generalized) from the proof of Lemma 1.13.
Lemma 1.14**.**
If then where .
Proposition 1.12 and Lemma 1.13 yield the promised
Corollary 1.15**.**
Construction of a PCS in is decidable.
1.4. Maximal congruence subgroups
In this subsection and .
Lemma 1.16**.**
Let , be positive integers, , and . Then
- (i)
.
- (ii)
.
Proof.
(i) For and integers , such that ,
[TABLE]
Thus by Proposition 1.6.
(ii) Certainly . The reverse containment is just the Chinese Remainder Theorem. ∎
Corollary 1.17**.**
If then contains a unique maximal PCS (of ): there is a positive integer such that , and .
Remark 1.18*.*
If has maximal PCS and then . Hence we know such that for all primes ; cf. the query raised at the foot of [24, p. 126].
Remark 1.19*.*
Although similarly contains a unique maximal elementary subgroup , the -normal closure of need not be the maximal PCS in , nor even be in .
Remark 1.20*.*
Lemma 1.14 provides an upper bound on such that is the maximal PCS of an arithmetic group in ; cf. [25, Proposition 6.1.1, p. 115].
Lemma 1.21**.**
Each subgroup of contains a (perhaps trivial) unique maximal PCS of . In more detail, suppose that and is the maximal PCS in ; then is the maximal PCS in .
Proof.
Since , we have that divides , and so is a PCS in . Corollary 1.9 tells us that each PCS in has the form for some k\hskip 0.56905pt\big{|}\hskip 0.56905ptm. Moreover , because contains . Hence is as claimed. ∎
1.5. Subnormal structure
Let denote the full preimage of the center (scalar subgroup) of in under . As per [37, p. 166], the level of is the ideal of generated by
[TABLE]
Then for . So is the smallest ideal such that . When is a principal ideal ring we write in place of . For or , may be defined unambiguously as the non-negative integer or integer modulo that generates ; e.g., .
Lemma 1.22**.**
If then .
Proof.
It is evident from the definitions that and for , . Since by [37, Lemma 1], as required. ∎
From now on in this subsection, and or . We write to denote that is subnormal. The defect of is the least such that there exists a series .
Theorem 1.23**.**
* if and only if*
[TABLE]
for some , . If (7) holds then where is the defect of , and the least possible is bounded above by a function of and only.
Proof.
See [37, Corollary 3]. ∎
Although non-scalar subnormal subgroups of have finite index, this is not true for ; the normal closure of in has infinite index for [27, p. 31].
Theorem 1.24**.**
Let be a subgroup of of level , with maximal PCS . Then if and only if r\hskip 0.56905pt\big{|}\hskip 0.56905ptl^{e} for some . In that event, the defect of is bounded above by where is the least such .
Proof.
If is subnormal then and for , as in Theorem 1.23; so k\hskip 0.56905pt\big{|}\hskip 0.56905ptl and r\hskip 0.56905pt\big{|}\hskip 0.56905ptk^{e}. Conversely, if r\hskip 0.56905pt\big{|}\hskip 0.56905ptl^{e} then satisfies (7) with . ∎
Lemma 1.25** ([37], p. 165).**
* is nilpotent of class at most .*
We now consider normality.
Lemma 1.26**.**
If then and the level of the maximal PCS in .
Proof.
We first observe that . Let be the maximal PCS in . Then r\hskip 0.56905pt\big{|}\hskip 0.56905ptl; and l\hskip 0.56905pt\big{|}\hskip 0.56905ptr because . ∎
Lemma 1.27**.**
Suppose that has level . Then
- (i)
.
- (ii)
.
Proof.
(i) The inclusion is clear. If has level then by Theorems 1 and 4 of [5]. As a consequence, . Now this part is assured by Proposition 1.6 and Corollary 1.9.
(ii) Let . Since (Lemma 1.26), . Also by (i). ∎
Corollary 1.28**.**
* if and only if is the level of the maximal PCS in .*
Proposition 1.29**.**
Lemma 1.27 remains true with replaced by . That is, , and so is normal in precisely when it is normal in .
2. Matrix groups over
2.1. Relevant theoretical results
Let where the are distinct primes and . We define a ring isomorphism by where , , and .
Lemma 2.1**.**
- (i)
* extends to an isomorphism of onto \oplus_{i=1}^{t}\mathrm{Mat}\big{(}n,\mathbb{Z}_{p_{i}^{k_{i}}}\big{)}, which restricts to isomorphisms \mathrm{GL}(n,\mathbb{Z}_{m})\rightarrow\times_{i=1}^{t}\mathrm{GL}\big{(}n,\mathbb{Z}_{p_{i}^{k_{i}}}\big{)} and \mathrm{SL}(n,\mathbb{Z}_{m})\rightarrow\times_{i=1}^{t}\mathrm{SL}\big{(}n,\mathbb{Z}_{p_{i}^{k_{i}}}\big{)}.*
- (ii)
Let be an ideal of , and let be the ideal of generated by . Denote by , the kernels of , on , \mathrm{GL}\big{(}n,\mathbb{Z}_{p_{i}^{k_{i}}}\big{)} respectively. Then
[TABLE]
For ,
[TABLE]
are normal subgroups of .
Lemma 2.2** (Cf. Corollary 1.9).**
If is the ideal of generated by , then and are surjective, with kernels , respectively.
The notation , supersedes our earlier notation for principal congruence subgroups in this special case. Let .
Lemma 2.3**.**
Suppose that and . Then , and has a subgroup isomorphic to .
Proof.
Treating as an additive group, we confirm that defined by is a homomorphism with kernel . Now and , so is surjective. Since contains , the second assertion follows too. ∎
Lemma 2.4**.**
.
Proof.
(Cf. Lemma 1.25.) Let and . For some , and , such that and , we have
[TABLE]
Therefore . Also ; thus by Corollary 1.9. ∎
Lemma 2.5**.**
- (i)
.
- (ii)
.
Proof.
Lemma 2.3 takes care of (i). By Lemma 2.2, we then get (ii). ∎
Corollary 2.6**.**
If then is abelian of exponent .
The next two corollaries use Lemma 2.1. Let where . Note that generates the ideal of . Set and .
Corollary 2.7**.**
- (i)
|\mathrm{GL}(n,\mathbb{Z}_{m})|=\prod_{i=1}^{t}\big{(}|\mathrm{GL}(n,p_{i})|\cdot p_{i}^{n^{2}(k_{i}-1)}\big{)}.
- (ii)
The PCS of of level has order .
Lemma 2.8**.**
- (i)
.
- (ii)
For , and .
Proof.
The unit group of has order , so Lemma 2.5 (ii) gives (i). By Lemma 2.3, . Thus, if for some then , which contradicts (i) by Lemma 2.2. ∎
Corollary 2.9**.**
- (i)
|\mathrm{SL}(n,\mathbb{Z}_{m})|=\prod_{i=1}^{t}\big{(}|\mathrm{SL}(n,p_{i})|\cdot p_{i}^{(n^{2}-1)(k_{i}-1)}\big{)}.
- (ii)
The PCS of of level has order .
Define subsets
[TABLE]
of and
[TABLE]
of . We see that if and only if is a unit of .
Lemma 2.10**.**
Suppose that .
- (i)
* has minimal size generating set , so .*
- (ii)
Unless , , and , and has minimal size generating set .
- (iii)
* for has minimal size generating set of size .*
Proof.
In the proof of Lemma 2.3 we saw that . Since is nilpotent with derived group by Lemma 2.4, we have . So by Lemma 2.8 (ii).
The rest of the proof is along similar lines. Note that , and is trivial when , or cyclic of order generated by the coset of otherwise. Also unless , , and ; whereas for . Therefore in (ii). Since and has rank , this proves (ii). The verification of (iii) is left as an exercise. ∎
Proposition 2.11**.**
Let , be non-trivial principal congruence subgroups of level in , respectively, where for all . Then
- (i)
; unless and the Sylow -subgroup of is , in which case .
- (ii)
.
Proof.
If , are groups of coprime order with generating sets and of minimal size, where , then . Indeed
[TABLE]
The result follows from Lemmas 2.1 (ii) and 2.10. ∎
Remark 2.12*.*
- (i)
If for any then a full GL (-generated) or SL (-generated) appears as a factor in or .
- (ii)
The proof of Proposition 2.11 shows how to construct minimal size generating sets for and with the aid of Lemma 2.10. Note that we get a generating set for a PCS in by reducing (1) in Proposition 1.10 modulo .
2.2. Computing in
As above, suppose that has prime factorization . Let be the isomorphism introduced just before Lemma 2.1. We identify with .
To compute with , we use composition tree methods and the data structure from [20]. The latter consists of an effective homomorphism into whose kernel is the solvable radical of , and a polycyclic generating sequence (PCGS) for . Data structures for the images of the projections of modulo can be combined into a data structure for . We therefore assume that .
Clearly is isomorphic to a quotient of , and a PCGS for gives the initial terms of a PCGS for ; the rest are found by reductions modulo (cf. Subsection 2.1). As we have seen, if is the kernel of reduction modulo and the kernel of reduction modulo , then is described by matrices for , which multiply by addition of their -parts. A PCGS for the elementary abelian group can be determined easily by linear algebra.
2.3. Subnormal structure
Let . We adhere to previous notation and conventions.
Let be a function that returns for a subgroup of ; see Lemma 1.22.
Input: .
Output: a generating set for a maximal PCS of in .
- (1)
. 2. (2)
If then return ,
else return a generating set for the PCS of level in as given by Proposition 2.11,
where is minimal subject to dividing , dividing , and .
Step (2) requires membership testing. As an application of , we have
Input: .
Output: if ; otherwise.
- If then return
- else return .
The following reiterates Theorem 1.24.
Input: .
Output: and an upper bound on the defect of if ; otherwise.
- (1)
, . 2. (2)
If such that l_{2}\hskip 0.56905pt\big{|}\hskip 0.56905ptl_{1}^{e} then return ,
else return and where the least such that l_{2}\hskip 0.56905pt\big{|}\hskip 0.56905ptl_{1}^{e}.
Remark 2.13*.*
Let . Obviously if and only if . The defect of as a subnormal subgroup of is either equal to or one less than its defect as a subgroup of .
returns the normal closure of in according to Lemma 1.27. tests whether , returning if and only if (Corollary 1.28).
By Proposition 1.29, also returns the normal closure in of , and tests whether .
We can list the subnormal subgroups of in .
Input: and a positive integer .
Output: all normal subgroups of in of level .
- (1)
. 2. (2)
If does not divide then return . 3. (3)
a list of all subgroups of . 4. (4)
Return the full preimage of in under .
We sketch a more general method. Let be the list of all such that . Define where ranges over the multiples of dividing , and is maximal subject to r\hskip 0.56905pt\big{|}\hskip 0.56905ptk^{t}. Then is a complete list of subnormal subgroups of in . By Lemma 1.25, consists of preimages of subgroups of the nilpotent group . Redundancies in are removed using where , by Lemma 1.16.
3. Computing with arithmetic groups in
3.1. Decidability
An arithmetic subgroup of an algebraic -group is ‘explicitly given’ if (i) an upper bound on is known, and (ii) membership testing in is possible; i.e., for any it can be decided whether [17, pp. 531–532]. Conditions (i) and (ii) were assumed in [17] to prove decidability of algorithmic problems for . As the next lemma shows, these conditions are equivalent to knowing a PCS in . Such a PCS can always be found: see Corollary 1.15.
Lemma 3.1**.**
Let . The following are equivalent.
- (i)
A positive integer such that is known.
- (ii)
An upper bound on is known, and testing membership of in is decidable.
Proof.
(i) (ii). , and if and only if .
(ii) (i). Suppose that . For as in Proposition 1.10 and each pair , after no more than rounds we are guaranteed to find positive integers such that . Thus, if is any common multiple of the then . ∎
Proposition 3.2**.**
If is a finite index subgroup of specified by a finite generating set then testing membership of any in is decidable.
Proof.
This follows from Corollary 1.15 and Lemma 3.1. ∎
Of course, a key problem is (AT), arithmeticity testing: if is a finitely generated subgroup of , determine whether is finite. We are unaware of any proof that (AT) is decidable—although it seems not to be [28]. Nonetheless, (AT) is decidable when is solvable [7].
3.2. Algorithms for arithmetic groups
Now we design algorithms for , , given by a finite generating set.
By Corollary 1.15 (and the proof of Lemma 1.13), we have a procedure that returns the level of a PCS in . It depends on representing elements of as products of transvections. Once we know , say, then returns a generating set for as in Proposition 1.10.
Let . Lemma 1.21 underpins the following, which finds the maximal PCS in . (To improve efficiency we could substitute for in algorithms of this section.)
Input: such that .
Output: a generating set for the maximal PCS in .
- (1)
. 2. (2)
Return .
Remember that the level of a finitely generated subgroup of is calculated straightforwardly by Lemma 1.22. returns if has level and otherwise.
We mention a few more sample procedures.
returns .
tests whether a finitely generated subgroup of is contained in , returning if and only if .
. Suppose that , , . Let . This procedure returns , which by Lemma 1.16 (ii) is the full preimage in under of .
returns and a bound on the defect of if ; otherwise it returns . The steps mimic those of from Subsection 2.3, but are now carried out over . The same comment applies to normality testing of .
: as before, immediate from Lemma 1.27. We need not know a PCS in .
returns , the full preimage in of . Note that is either trivial if is odd or if is even, because is absolutely irreducible over .
returns all normal subgroups of in containing : this is the full preimage of the list as ranges over the divisors of . All subnormal subgroups of in containing are extracted similarly from the corresponding list in .
4. The orbit-stabilizer problem
Let be a commutative ring with , and let . This section addresses the orbit-stabilizer problem: for arbitrary , ,
- (I)
decide whether there is such that , and find a if it exists;
- (II)
determine .
The element and a generating set for should be written as words over . We solve (I) and (II) for and . Along the way, partial results for subgroups of are proved as well.
4.1. Preliminaries
Suppose that . We denote images under by overlining.
Lemma 4.1**.**
Let , and let be the full preimage of in . Then
- (i)
* if and only if and for any such that .*
- (ii)
.
Proposition 4.2**.**
If we can solve the orbit-stabilizer problem for (acting on ), then we can solve it for .
Proof.
(Cf. [14, p. 255] and [15, Lemma 3.1].) First, note that permutes the -orbits in . Let be a set of representatives for the -orbit of . In the notation of Lemma 4.1,
[TABLE]
Secondly, we can find (Schreier) generators of ; and find such that , . Then
[TABLE]
As suggested by Proposition 4.2, we first aim to solve the orbit-stabilizer problem for a PCS in .
Let , and let denote the ideal of generated by the .
Lemma 4.3**.**
* for any ; thus, if and are in the same -orbit then .*
A vector such that is said to be unimodular. By Lemma 4.3, permutes the unimodular vectors among themselves.
4.2. -orbits in
Suppose that has prime factorization , and write each as , .
Lemma 4.4**.**
If is unimodular then is a unit of for some .
Lemma 4.4 is proved in [21, p. 104]. We summarize the proof as follows.
Input: unimodular .
Output: as in Lemma 4.4.
- (1)
For do
let be the least index such that ;
and for . 2. (2)
Return .
Lemma 4.5**.**
If is unimodular then for some .
Proof.
By Lemma 4.4,
[TABLE]
where is a unit of . Further,
[TABLE]
Finally,
[TABLE]
Corollary 4.6**.**
The set of all unimodular vectors is a -orbit in .
Proposition 4.7**.**
Non-zero vectors , are in the same -orbit if and only if .
Proof.
Suppose that ; so and for some dividing , , and unimodular , . Now the result is apparent by Lemma 4.3 and Corollary 4.6. ∎
Corollary 4.8**.**
The map defined by is a bijection between the set of -orbits in and the set of ideals of .
4.3. Orbits in
4.3.1. -orbits
Lemma 4.9**.**
Let and let be the gcd of the non-zero entries of . Then for some .
Proof.
(Cf. [34, Lemma 3, pp. 72–73].) Say the non-zero entries of are where . If then
[TABLE]
So the lemma holds for , and we may assume that and .
Formally, the proof is by induction on . We manufacture by applying the Euclidean algorithm repeatedly to pairs of adjacent nonzero entries of . To begin, put , ; then for and while , let , be the integers such that and . If is the last non-zero remainder then
[TABLE]
where
[TABLE]
At the next stage we put , , and repeat the above. Continuing in this fashion ultimately gives as desired. ∎
Proposition 4.10** **(cf.
[34], Corollary 1, p. 73).
Vectors , belong to the same -orbit if and only if .
Proof.
In the notation of Lemma 4.9, . ∎
Corollary 4.11**.**
There is a one-to-one correspondence between the set of -orbits in and the set of ideals of .
accepts and (as per the proof of Lemma 4.9) returns a pair where , is the gcd of all non-zero entries of , and .
By Proposition 4.10, the next procedure solves the orbit problem for acting on .
Input: , .
Output: such that , or if , are not in the same -orbit.
- (1)
, 2. . 3. (2)
If then return ,
else return .
4.3.2. -orbits
Lemma 4.12** ([21], Lemma 2, p. 105).**
Let , . Suppose that there is a non-empty subset such that for and for , where . Then , are in the same -orbit.
We outline the proof of Lemma 4.12 in the form of an algorithm.
Input: , , as in Lemma 4.12.
Output: such that .
- (1)
For each and , find such that . 2. (2)
Return .
Theorem 4.13**.**
Let , where . Then and are in the same -orbit if and only if and , .
Proof.
See the theorem on p. 101 of [21] for . Suppose that , , and . Then and for some and , such that , say . Consequently where
[TABLE]
The procedure below incorporates the method for in [21, pp. 105–106]. Lines beginning ‘’ contain explanatory comments.
Input: , , .
Output: such that , or if , are not in the same -orbit.
- (1)
If then return .
- (2)
If for some , where , then return ,
else .
is now unimodular, , and for .
- (3)
Apply to find such that is coprime to , where , .
unimodular unimodular mod .
- (4)
If then
,
, , and satisfy the hypotheses of Lemma 4.12.
.
Lemma 4.12 again, with .
- (5)
If then as in the proof of Theorem 4.13,
else where .
because .
- (6)
Return .
and for the original input , .
4.4. Stabilizers in and
Suppose that and . As an arithmetic subgroup of an algebraic group, is finitely generated [18, p. 744]. Indeed, where and is the affine group
[TABLE]
Hence is generated by , , and , where , are the generators of given in Subsection 1.2. Next,
[TABLE]
Plainly is generated by as ranges over a generating set of (for which see Proposition 1.10), together with . We denote by the procedure that returns the set of -conjugates of these matrices for input .
4.5. Solution of the orbit-stabilizer problem
for arithmetic groups
With Proposition 4.2 and its proof in mind, we now describe the main algorithms of this section.
As , the orbits of form a block system for . All vectors in a block have the same reduction modulo (but vectors with equal reduction may not be in the same block). We first check for equivalence of vectors under the action by , and compute generators for stabilizers in . Then we represent each -orbit by a vector in and use to test orbit equality. We shall write for ; that is, if and only if is not .
To determine stabilizers (and thereby eliminate surplus generators) in we calculate the induced action of and then take preimages.
If stabilizes then we put . Hence is generated by together with the corrected elements .
We state the algorithms below.
Input: , and such that .
Output: such that , if ; otherwise.
- (1)
Determine and .
If then return ,
else select such that and replace by . 2. (2)
Determine the -orbit of , where is the full preimage of in .
If then return ,
else select such that and replace by . 3. (3)
. 4. (4)
Return .
Input: and such that .
Output: a generating set for .
- (1)
the full preimage of in . 2. (2)
. 3. (3)
for each generator of ,
A:=\{g_{h}^{-1}h\mid h\mbox{\ a generator of L}\}. 4. (4)
Return .
4.6. Remarks on and refinements of the algorithms
The stabilizer calculations for and are done in via the data structure of Subsection 2.2. We use the solvable radical of to deal with orbits, as in [20]. Typically the main obstacle is that can be very long. To ameliorate this we take orbits of for an increasing sequence of divisors of .
A further refinement (as with any linear action) is given by the imprimitivity system arising from the relation of vectors being unit multiples of each other. Here acts on blocks projectively; i.e., as where . We implement this action by representing each block by a normalized vector. For prime , this means scaling the vector so that its first nonzero entry is . If the original entry has a common divisor with greater than , then a minimal associate will be different from and will usually have a nontrivial stabilizer. This stabilizer is then used to minimize entries in subsequent positions.
4.7. Preimages under
A basic operation when utilizing congruence homomorphisms is to find preimages: for find such that (any preimage will do because ). We cannot simply treat as an integer matrix; it need not have determinant over .
Matrix group recognition [1] maintains a history of how each element of was obtained as a word in congruence images of generators of . Long product expressions tend to build up when constructing a composition tree for using pseudo-random products. Evaluating these expressions back in characteristic [math] leads to large matrix entries.
We could write as a product of transvections in and then form the same product over . Similarly, suppose that has Smith Normal Form where , and . Thus and is a suitable preimage. Still, these approaches sometimes produced larger matrix entries than in the following heuristic.
Let be the transposed adjugate . Adding to for adds to . If and is positive of smaller absolute value, then add to . Repeat with updated . If no such exists (all entries of are larger in absolute value than ), then we can try to use instead the gcd of two entries of in the same row or column. Eventually , or we must resort to the other methods.
5. Generalizing to any arithmetic group in
Let be arithmetic. We explain how to compute such that . Our algorithms may therefore be modified to accept any arithmetic group in ; i.e., not necessarily given by a generating set of integer matrices.
Lemma 5.1**.**
The following are equivalent, for a finitely generated subgroup of .
- •
* has finite index in .*
- •
* is -conjugate to a subgroup of .*
- •
There exists a positive integer such that .
- •
.
Proof.
See [7, Section 3] and [2, Theorem 2.4]. ∎
An integer as in Lemma 5.1 is a common denominator for . Suppose that is arithmetic. Hence exists. Let be a basis of the enveloping algebra , and let be a common multiple of the denominators of all entries in the . By the proof of [2, Theorem 2.4] we can take . A basis can be found by, e.g., a standard ‘spinning-up’ process. However, when we know such that is in the finite index subgroup of , we can write down directly. Let be the block diagonal matrix with
[TABLE]
in rows/columns , , and s elsewhere on the main diagonal. Then
[TABLE]
is a basis with .
With a common denominator in hand, we invoke from [7, Section 3] with input , . If is any matrix whose columns are the elements of then and .
6. Implementation
Our algorithms have been implemented in GAP [16]. For matrix group recognition, we rely on the package [30] of Max Neunhöffer and Ákos Seress.
To demonstrate practicality, and the effect that input parameters (degree , number of generators, size of matrix entries, index in ) have on performance, we ran experiments on a range of arithmetic groups. Except for the elementary groups (see Proposition 1.12), we chose a value of that exposed a nontrivial quotient but which we cannot yet prove to be maximal; that is, the groups all contain .
In Table 1, ‘ gens’ is the number of generators outside , and is the decadic logarithm of the largest generator entry. Times (in seconds on a 3.7GHz Quad-Core late 2013 Mac Pro with 32GB memory) are for computing the index in .
Each group is generated by and products of transvections of level dividing (see http://www.math.colostate.edu/~hulpke/examples/arithmetic.html for the explicit matrices). They seem to be different from any elementary group.
The and are -closures of their namesakes from [22, p. 414]. Apart from , these are arithmetic [22, Theorems 3.1 and 4.1]. We discovered that has larger index than the lower bound in [22].
For a second batch of examples we tested our orbit-stabilizer algorithms on groups from Table 1. Times in Table 2 are solely for , and include the setup for . Here is the length of , and is the length of the orbit of under the preimage of . While the look rather specific, random choices of do not alter runtimes appreciably. The magnitude of likewise has minor impact; if is composite then the calculation of can be separated into orbits modulo divisors of .
What does have an impact is divisibility of entries in by divisors of , which yields longer orbits of . The reason that this affects runtime appears to be twofold. First, we must compare representatives for using . The number of comparisons is quadratic in orbit length. Moreover, integer entries grow quickly even for modest examples (it can happen that stabilizer elements have entries with – digits). As the auxiliary operations entail iterated gcd calculations and integer factorization, each equivalence test becomes relatively expensive.
We do not report on other procedures from Subsection 3.2 that are essentially computations in .
Postscript. For developments in the area since the publication of [11] (including further experiments with groups from [22]), see, e.g., [12, 13].
Acknowledgments
The authors received support from Science Foundation Ireland grant 11/RFP.1/ MTH3212 (Detinko and Flannery) and Simons Foundation Collaboration Grant 244502 (Hulpke). We are grateful to Professors A. Lubotzky, C. F. Miller III, and T. N. Venkataramana for helpful advice. We also thank Steffen Kionke, who detected an error in Proposition 2.11 of [11].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] H. Bäärnhielm, D. F. Holt, C. R. Leedham-Green, and E.A. O’Brien, A practical model for computation with matrix groups , J. Symbolic Comput. 68 (2015), 27–60.
- 2[2] L. Babai, R. Beals, and D. N. Rockmore, Deciding finiteness of matrix groups in deterministic polynomial time , Proc. of International Symposium on Symbolic and Algebraic Computation ISSAC ’93 (ACM Press), 1993, pp. 117–126.
- 3[3] H. Bass, M. Lazard, and J-P. Serre, Sous-groupes d’indice fini dans SL ( n , 𝐙 ) SL 𝑛 𝐙 {\rm SL}(n,\,{\bf Z}) , Bull. Amer. Math. Soc. 70 (1964), 385–392.
- 4[4] H. Bass, J. Milnor, and J.-P. Serre, Solution of the congruence subgroup problem for SL n ( n ≥ 3 ) subscript SL 𝑛 𝑛 3 {\rm SL}_{n}\,(n\geq 3) and Sp 2 n ( n ≥ 2 ) subscript Sp 2 𝑛 𝑛 2 {\rm Sp}_{2n}\,(n\geq 2) , Inst. Hautes Études Sci. Publ. Math. (1967), no. 33, 59–137.
- 5[5] J. Brenner, The linear homogeneous group, III , Ann. of Math. (2) 71 (1960), no. 2, 210–223.
- 6[6] A. S. Detinko, B. Eick, and D. L. Flannery, Computing with matrix groups over infinite fields , London Math. Soc. Lecture Note Ser. 387 , 256–270, 2011.
- 7[7] A. S. Detinko, D. L. Flannery, and W. de Graaf, Integrality and arithmeticity of solvable linear groups , J. Symbolic Comput. 68 (2015), part 1, 138–145.
- 8[8] A. S. Detinko, D. L. Flannery, and E. A. O’Brien, Algorithms for the Tits alternative and related problems , J. Algebra 344 (2011), 397–406.
