Generalised shuffle groups
Carmen Amarra, Luke Morgan, Cheryl E. Praeger
Carmen Amarra
Institute of Mathematics, University of the Philippines Diliman, C. P. Garcia Avenue, Diliman, Quezon City 1101, Philippines
[email protected]
Luke Morgan
University of Primorska, UP FAMNIT, Glagoljaška 8, 6000 Koper, Slovenia, and University of Primorska, UP IAM, Muzejski trg 2, 6000 Koper, Slovenia.
[email protected]
Cheryl E. Praeger
Centre for the Mathematics of Symmetry and Computation, Department of Mathematics and Statistics, The University of Western Australia, 35 Stirling Highway, Crawley, 6009, Western Australia, Australia
[email protected]
Abstract.
The mathematics of shuffling a deck of 2n cards with two “perfect shuffles” was brought into clarity by Diaconis, Graham and Kantor. Here we consider a generalisation of this problem, with a so-called “many handed dealer” shuffling kn cards by cutting into k piles with n cards in each pile and using k! shuffles. A conjecture of Medvedoff and Morrison suggests that all possible permutations of the deck of cards are achieved, so long as k=4 and n is not a power of k. We confirm this conjecture for three doubly infinite families of integers: all (k,n) with k>n; all (k,n)∈{(ℓe,ℓf)∣ℓ⩾2,ℓe>4,f \mboxnotamultipleof e}; and all (k,n) with
k=2e⩾4 and n not a power of 2. We open up a more general study of shuffle groups, which admit an arbitrary subgroup of shuffles.
Key words and phrases:
card shuffling; permutation groups; primitive groups
2010 Mathematics Subject Classification:
Primary 20B25; Secondary 05E18
The first author was supported by a Post Doctoral Research Award (FRASDP) of the University of the Philippines. The second author acknowledges the Australian Research Council Grant DE160100081 and the Slovenian Research Agency (research program P1-0285).
1. Introduction
The crux of a card trick performed with a deck of cards usually depends on understanding how shuffles of the deck change the order of the cards. By understanding which permutations are possible, one knows if a given card may be brought into a certain position. The two standard “perfect” shuffles of a deck of 52 cards involve “cutting” the deck into two piles, holding one pile in each hand, and then perfectly interlacing the two piles. Depending on whether the card from the left or right hand pile ends up on top, these two shuffles are referred to as the in-shuffle and out-shuffle respectively.
To understand fully what permutations can be achieved on a deck of cards via unlimited repetitions of these two operations, we have to know the permutation group generated by the two shuffles. If we have a set of 2n cards (for an integer n) labelled with integers, and write σ and δ for the permutations induced by the in- and out-shuffle, respectively, we are asking about the structure of the subgroup ⟨σ,δ⟩ of Sym(2n). Diaconis, Graham and Kantor [8] were the first to answer this problem completely – although there were partial results earlier for certain values of n (see the discussion in [8, §3]).
They observed first that the group ⟨σ,δ⟩ preserves “central symmetry”, that is, the group preserves a partition of the 2n cards into n parts each of size two, and so ⟨σ,δ⟩ is a subgroup of the Weyl group of type Bn, which we denote simply by Bn; the group Bn≅C2≀Sym(n) (an example of a wreath product in imprimitive action). For a precise description of ⟨σ,δ⟩ as a subgroup of Bn, we require some definitions.
The derived subgroup of Bn has index four, and the quotient is elementary abelian. Thus there are three index two subgroups, which are the kernels of sign-like homomorphisms. If g∈Bn, sgn(g) is the sign of g as a permutation of 2n cards and sgn(g) is the sign of g as a permutation of the n parts of size two. Both maps are homomorphisms from Bn to {−1,1}, so there is a third map which is their product sgnsgn(g):=sgn(g)sgn(g).
With these definitions, the structure found by Diaconis et al. is as follows.
Theorem 1.1**.**
[8*, Theorem 1]**
The structure of the shuffle group ⟨σ,δ⟩ on 2n points, where n⩾2, is given in Table 1.
*
The appearance of Mathieu’s group M12 in the list is quite remarkable. In the case where n=2f, the wreath product is acting in product action on 2f+1 points (see Section 2.1). All the groups in Table 1 are imprimitive subgroups of Sym(2n) preserving a partition with n parts of size 2.
A natural question concerning the mathematics of shuffling is raised at the end of the paper of Diaconis et al. We could divide a pack of kn cards into k piles (with the first n cards in the first pile, the second n cards in the second pile, and so on), and then perform a shuffle. There are now more shuffles to consider – because there are more permutations of the piles of cards. We again denote by σ the standard shuffle, where the cards are picked up from left to right without permuting the piles at all. For a permutation τ of the piles, we have an induced permutation ρτ∈Sym(kn) of the set of cards. The corresponding shuffle obtained by first permuting the piles according to τ and then picking up the cards is therefore equal to ρτσ, and σ=ρτσ with τ the identity element of Sym(k). Thus if P is a group of permutations on the set of k piles, we define
[TABLE]
which we call a generalised shuffle group. Note that since the identity permutation is in P, we have Sh(P,n)=⟨σ,ρτ∣τ∈P⟩.
In this notation, the group described by Theorem 1.1 is Sh(Sym(2),n).
For a positive integer m, we write [m]={0,…,m−1} and use this as a set of residues modulo m. It is convenient to denote the set of piles by [k] and the set of cards by [kn].
The case where P=Sym(k) was considered by Medvedoff and Morrison [19] (they investigated the group Sh(Sym(k),n), which they called Gkn,n). As a generalisation of the case where n=2f in Theorem 1.1, they showed:
Theorem 1.2**.**
[19*, Theorem 2]**
Suppose that n=kf for some positive integers k and f with k⩾2. Then the elements of [kn] can be identified with the set [k]f+1 of (f+1)-tuples of elements of [k], and Sh(Sym(k),kf)=Sym(k)≀Cf+1 in product action on [k]f+1.*
Depending on the congruences of k and n modulo four, it is easy to decide whether Sh(Sym(k),n) is contained in Alt(kn) or not – see [19, Theorem 1]. Computations by the authors of [19] for k=3 and k=4, for small values of n, showed that when n is not a power of 3 or 2 respectively, the shuffle group Sh(Sym(k),n) is actually equal to the symmetric group Sym(kn), or to Alt(kn) if it is contained there. Based on this evidence, they posed conjectures for the structure of the group when k=3 or k=4. Later in [5], the case of k=4 and n=2f for f an odd integer was examined further (the case n=22f=4f being covered by the above theorem). The main result [5, Theorem 2.6] shows that Sh(Sym(4),2f) is the full affine group of degree 2f+2 when f is an odd integer. The authors of [5] then made the following conjecture for general k, which could be summarised as saying “Sh(Sym(k),n) is as large as possible”.
Conjecture 1.3**.**
[5*, Conjecture 1.1]**
Assume that k⩾3, that n is not a power of k and if k=4 that n=2f for any f. Let k(4),n(4)∈[4] be the residues of k and n modulo 4 respectively.
Then*
[TABLE]
We were motivated to study the validity of this conjecture. More generally, we wondered what might be said about the structure of Sh(P,n) for any subgroup P of Sym(k). It is reasonable to guess that Conjecture 1.3 might be true, as a plethora of results show that it is “easy” to generate a subgroup of Sym(m) that contains Alt(m). For example, a result of Dixon [9] shows that as m→∞, the probability that a randomly chosen pair of elements of Sym(m) generate a subgroup that contains Alt(m) tends to 1. On the other hand, the groups that we consider here are far from randomly generated. In the next section we describe our progress.
1.1. Main results
To frame our first main result, we briefly discuss structures preserved by permutation groups. For a prime p, the set [pe] is in bijection with the vector space Fpe. This gives an embedding of AGL(e,p)=Cpe⋊GL(e,p), the group of affine transformations of Fpe, into Sym(pe). We say that a permutation group P⩽Sym(pe) preserves an affine structure on [pe] if P is a subgroup of AGL(e,p) (relative to some some bijection [pe]→Fpe). For integers ℓ and e, the set [ℓe] is in bijection with the cartesian product [ℓ]e. On the latter set, the wreath product Sym(ℓ)≀Sym(e) acts in product action: if (g1,…,ge)∈Sym(ℓ)e then (g1,…,ge):(ω1,…,ωe)↦(ω1g1,…,ωege) and if σ∈Sym(e) then σ:(ω1,…,ωe)↦(ω1σ−1,…,ωeσ−1). As in the affine case, we say that a permutation group P⩽Sym(ℓe) preserves a product structure on [ℓe] if P is a subgroup Sym(ℓ)≀Sym(e) acting in product action (for some bijection). There is also the imprimitive action of a wreath product Sym(ℓ)≀Sym(e) on the set [ℓ]×[e] where (g1,…,ge)σ:(a,b)↦(agb,bσ). This action is imprimitive whenever both ℓ>1 and e>1.
Theorem 1.2 implies that Sh(Sym(k),kf) preserves a product structure on [kf+1], and [5, Theorem 2.6] shows that Sh(Sym(4),2f) preserves an affine structure on [2f+2]. Since Sym(4)=AGL(2,2) is an affine group, and we can regard Sym(k) as preserving the “trivial” product structure [k]1, the aforementioned results may be interpreted as saying that, if P preserves a structure on [k], then for appropriate n the shuffle group Sh(P,n) also preserves such a structure on [kn]. In our first result, Theorem 1.4, we show that this is the case in general.
Theorem 1.4**.**
Let ℓ, e and f be positive integers with ℓ⩾2. The following hold:
- (1)
If e∣f, then Sh(P,ℓf)=P≀C1+f/e, for any P⩽Sym(ℓe).
2. (2)
*If e∤f and P=Sym(ℓ)≀Sym(e) acts in product action on [ℓ]e, then Sh(P,ℓf) acts in product action on [ℓe+f] and Sh(P,ℓf)=Sym(ℓ)≀Sym(e+f).
*
3. (3)
*If e∤f, ℓ is prime and P=AGL(e,ℓ), then Sh(P,ℓf)=AGL(e+f,ℓ).
*
Theorem 1.4 is proved in Section 3 where we also make precise statements about the structure of Sh(P,ℓf) for P⩽Sym(ℓ)≀Sym(e) and for P⩽AGL(e,ℓ). We have the following application of Theorem 1.4 which establishes Conjecture 1.3 for a doubly infinite family of integers.
Corollary 1.5**.**
Suppose that k=ℓe and n=ℓf for some positive integers ℓ, e and f with ℓ⩾2. If f is not a multiple of e and k=4, then Sh(Sym(k),n)=Alt(kn) if ℓ is even and Sh(Sym(k),n)=Sym(kn) if ℓ is odd.
We note that Corollary 1.5 with (k,n)=(9,3) is proved in [5, Theorem 3.6], using different methods.
In our goal of proving Conjecture 1.3, we are motivated to consider properties of Sh(P,n) for P⩽Sym(k) that should hold if the conjecture is valid. The most basic concerns transitivity, and is established in Lemma 2.5. A more important notion is that of primitivity, which we address below.
Theorem 1.6**.**
If P is primitive and non-regular on [k], then Sh(P,n) is primitive on [kn].
Moreover, Sh(Sym(k),n) is primitive on [kn] if and only if k⩾3.
The non-regularity assumption for P cannot be removed in general, since P=Sym(2) is primitive and regular on two points and Theorem 1.1 implies that, for all n, the group Sh(Sym(2),n) is imprimitive. Similarly the primitivity assumption on P cannot be relaxed, since if P is an imprimitive group of degree k, then Theorem 1.4 shows that Sh(P,kf)=P≀Cf+1 in product action on [k]f+1 – and this action is imprimitive whether P is non-regular or not. Hence both parts of the hypothesis are necessary. We note also that, if P=Ck for an odd prime k, so that P is primitive and regular on [k], then if n=kf, Theorem 1.4 implies that Sh(Ck,n)⩽Sh(AGL(1,k),n)=AGL(1,k)≀C1+f. Now AGL(1,k)≀C1+f is imprimitive on [kn] since the diagonal subgroup of Ck1+f is an intransitive normal subgroup, and therefore also Sh(Ck,n) is imprimitive in this case. However we have verified computationally that, if k⩽13, k<n⩽1000,
and n is not a power of k, then Sh(Ck,n) contains Alt(kn) (see Section 1.3).
Conjecture 1.7**.**
If k is an odd prime, n>k, and n is not a power of k, then Sh(Ck,n) contains Alt(kn).
If Conjecture 1.7 is true then for n=kf we would have a stronger form of Theorem 1.6, but even our current
Theorem 1.6 has a number of applications in this work, notably in Section 6.
The next property we sought to explore is that of 2-transitivity. In part (1) of the theorem below we see that Conjecture 1.3 holds whenever k>n, that is, when there are more piles than cards in a pile. A group G is almost simple if there is a nonabelian finite simple group T such that G is isomorphic to a subgroup of Aut(T) containing the inner automorphism group of T. A classical result of Burnside says that 2-transitive groups are either affine or almost simple, [10, Theorem 4.1B]. The latter have been classified using the Classification of the Finite Simple Groups; we have reproduced a list of them in Section 5.
Theorem 1.8**.**
Suppose that k>n⩾2 and that P⩽Sym(k) is 2-transitive. Let k(4),n(4)∈[4] be such that k≡k(4)(mod4) and n≡n(4)(mod4). Then Sh(P,n) is 2-transitive and the following hold:
- (1)
If P is Alt(k) or Sym(k) then either (k,n)=(4,2) and Sh(P,n)=AGL(3,2), or (k,n)=(4,2) and Sh(P,n)=Alt(kn) or Sym(kn). (Details in Lemma 5.2.)
2. (2)
If P is almost simple, then also Sh(P,n) is almost simple.
3. (3)
If P is affine with k=pe⩾3 for some prime p and positive integer e, then either n=pf for some positive integer f and Sh(P,n) is affine, or n=pf for any f and Sh(P,n)=Alt(kn) or Sym(kn).
The condition that k is greater than n is necessary in Theorem 1.8, for Theorem 1.4 shows that Sh(Sym(k),k)=Sym(k)≀C2, which is not 2-transitive on [k2].
1.2. Cascading shuffle groups
Different values of k and n with the same product kn can yield different shuffle groups acting on the same set. A priori these groups have nothing to do with each other, yet we have found that for certain values of k, there is a deep connection between the groups.
Fix k=2e for some integer e⩾2 and let n be arbitrary. Then, for each t∈{1,2,…,e}, define Vt to be the elementary abelian group of order 2t acting regularly on the set [2t]. Further, set Gt=Sh(Vt,2e−tn), so that each of the e shuffle groups G1, …, Ge acts on 2en points. Note that G1=Sh(Sym(2),2e−1n) is a shuffle group appearing in the theorem of Diaconis et al, Theorem 1.1.
Theorem 1.9**.**
Suppose that n is not a power of 2 and k=2e with e⩾2. The following hold:
- (1)
if (k,n)=(4,3), then G1=C26⋊Sym(5) and G2=C25⋊Sym(5);
2. (2)
if (k,n)=(4,6), then G1=G2 and G1≅C211⋊M12;
3. (3)
if (k,n)=(8,3), then G1=G2=G3 and G1≅C211⋊M12;
4. (4)
if k=4 and n⩾5 is odd, then G1=C2≀Sym(2n) and G2=ker(sgn);
5. (5)
in all other cases, G1=G2=⋯=Ge and G1=ker(sgn)∩ker(sgn).
Although the connection between these cascading shuffle groups is striking in its own right, our principal goal for investigating their structure is to provide tools to analyse their overgroups. In particular, since Ge=Sh(Ve,n)⩽Sh(Sym(k),n), we are able to use the previous theorem to obtain:
Corollary 1.10**.**
*Suppose that n is not a power of 2 and that k=2e for some e⩾2. Then Sh(Sym(k),n) is the alternating group or the symmetric group if n is even or odd, respectively.
*
The results on cascading shuffle groups are proved in Section 6. Using Theorem 1.1 for k=2, Theorem 1.2 and [5, Theorem 2.6] for k=4 and n a power of 2, Corollary 1.5 for k=2e⩾8 and n a power of 2 and Corollary 1.10 for k=2e⩾4 and n not a power of 2, the structure of the shuffle group Sh(Sym(2e),n) is now known for all integers e and n.
1.3. Summary and questions
Concerning Conjecture 1.3 we have shown that the conjecture holds, that is, that Sh(Sym(k),n)⩾Alt(kn), in the following instances:
- (1)
k>n,
2. (2)
k=2e for some e⩾2 and n=2f for any f,
3. (3)
k=ℓe and n=ℓf for some positive integer ℓ where e∤f and k=4.
The first extension of this work would be to the case that k=3e and n=3f for any f. Some arguments that we have given in Section 6 might generalise, but we note that for such n there is no information concerning the structure of Sh(Sym(3),3e−1n) and thus the endgame to exploit the potential connection between Sh(Sym(3e),n) and Sh(Sym(3),3e−1n) cannot yet be achieved.
Concerning computations, at several points we have made use of Magma [3]. We have verified
the following remarkable fact: for k∈{3,…,13} and n∈{k+1,…,1000} with n=kf for any f, we have that Sh(Ck,n) contains Alt(kn).
This suggests that Conjecture 1.3 might be “overwhelmingly true”.
Considering what might be the converse of the conjecture, if P⩽Sym(k) is such that Sh(P,n) contains Alt(kn), then P need not even be transitive. There are many examples, for small values of k and n, of intransitive groups P with Alt(kn)⩽Sh(P,n). On the other hand, in Remark 2.6 we give infinitely many easy examples of nontrivial intransitive subgroups P with Sh(P,n) intransitive on [kn].
Thus the precise conditions on k, n and P⩽Sym(k) to guarantee that Sh(P,n) is not the full alternating group or symmetric group seem to be quite delicate.
Question 1.11**.**
Is it possible to classify the pairs (P,M) such that P<Sym(k) and M<Sym(kn) are both maximal subgroups, and Sh(P,n)≤M?
Theorems 1.4 and 1.8 give some examples of (P,M). An answer in the general case would be very interesting.
Acklowledgements
We thank Kent Morrison for sharing the slides of his talk at the 2009 MathFest on this subject and general enthusiasm about our work.
2. Preliminaries
Throughout this paper we use the following notation: for any positive integer u, let
[TABLE]
and let Sym(u) and Alt(u) denote the symmetric and alternating groups on the set [u].
Let k and n be fixed integers greater than 1, and let [kn] be a deck of cards divided into k piles with n cards each. Thus the cards are labelled 0,…,kn−1 and the piles as 0,…,k−1 (from left to right, say), with an,an+1,…,(a+1)n−1 (from top to bottom) being the cards in pile a. Thus each card in [kn] has a unique label an+b for some a∈[k] and b∈[n].
Suppose that the piles are arranged in a single row. The standard shuffle is performed by picking up the top card from each of the piles from left to right and repeating until all cards have been picked up. As a permutation on [kn] we denote the standard shuffle by σ. Let i∈[kn] and suppose that i=an+b, with a∈[k] and b∈[n]. Then the card i is the bth card in the ath pile; thus under the shuffle σ, the bk cards in the top b rows have already been picked up prior to card i and i is the ath card in its row to be picked up. Hence, taking [n] and [k] as sets of residues modulo n and k, respectively, we have
[TABLE]
Each permutation τ of the k piles induces a permutation ρτ of the kn cards. Since ρτ does not change the order of the cards within each pile, it follows that for each a∈[k] and b∈[n],
[TABLE]
If P is a permutation group on the set of piles, we denote by Pρ the group {ρτ ∣ τ∈P} of corresponding permutations of [kn] (in effect, ρ may be thought of as an embedding of P into Sym(kn)).
This leads to the following definition.
Definition 2.1**.**
Let k and n be integers greater than 1, view [kn] a deck of kn cards divided into k piles with n cards each, and let P⩽Sym(k). The generalised shuffle group on [kn] is the group Sh(P,n) defined by
[TABLE]
The following is fundamental.
Lemma 2.2**.**
[19, Proposition 1]**
The shuffle σ fixes both [math] and kn−1. For i∈[kn] with i=kn−1 we have
[TABLE]
taking {0,…,kn−2} to be a set of residues modulo kn−1.
Proof.
Since the first card is labelled [math] and is picked up first, we have 0σ=0≡k⋅0(modkn−1). Similarly, the last card is labelled kn−1 and is picked up last, so (kn−1)σ=kn−1. Now let i∈[kn] with i=0,kn−1. Let a,b∈N be such that i=an+b. Then iσ=a+bk by (1). Now observe that, since i=kn−1, at least one of a<k−1 or b<n−1 holds and hence a+bk<(k−1)+(n−1)k=kn−1, so a+bk=iσ(modkn−1). Using this fact, we observe that ik=(an+b)k=akn+bk≡a+bk(modkn−1). Thus we have established that iσ≡ki(modkn−1) as required.
∎
Lemma 2.3** ([19, Lemmas 1 and 2]).**
The following hold:
- (1)
The shuffle σ∈Alt(kn) if and only if either n(4)∈{0,1} or k(4)∈{0,1}.
2. (2)
For any τ∈Sym(k), ρτ∈Alt(kn) if and only if either n is even or τ∈Alt(k).
Corollary 2.4**.**
Sh(P,n)⩽Alt(kn)* if and only if one of the following holds:*
- (1)
n(4)=0, or
2. (2)
n(4)=2* and k(4)∈{0,1}, or*
3. (3)
n* is odd, P⩽Alt(k), and either n(4)=1 or k(4)∈{0,1};*
We extend [19, Proposition 3] to the case where P is any transitive subgroup of Sym(k) (so the proof cannot use arbitrary permutations in Sym(k)).
Lemma 2.5**.**
If P is a transitive subgroup of Sym(k), then Sh(P,n) is a transitive subgroup of Sym(kn).
Proof.
Let G=Sh(P,n) and let i=an+b for a∈[k] and b∈[n]. We claim that there is an element g∈G such that 0g=b. If the claim is true, then since P is transitive, there is τ∈P such that 0τ=a (recall that τ acts on [k]). Hence 0gρτ=bρτ=(0⋅n+b)ρτ=(0τ)n+b=an+b=i (using (2)). Thus if the claim holds then G is transitive.
We now prove the claim. Let m be such that km⩽n−1<km+1. Since b∈[n] we may write b=∑r=0mirkr, where 0⩽ir⩽k−1 for each r. Since P is transitive on [k], for each r∈{0,…,m} there is an element τr∈P such that 0τr=ir. Now for any c∈[n], it follows from (2) that
cρτr=(0⋅n+c)ρτr=irn+c, so that, using (1),
[TABLE]
If m=0 then b=i0 and 0ρτ0σ=i0=b, as required. If m⩾1 then for any ℓ⩽m−1,
[TABLE]
and for ℓ=m,
[TABLE]
Using this fact together with (2) and (3) we have
[TABLE]
This establishes the claim, and hence the lemma.
∎
Remark 2.6*.*
A shuffle group can be intransitive. Let k and n be arbitrary. If P⩽Sym(k) is such that 0τ=0 for all τ∈P, then in the action of Pρ on [kn] we have 0ρτ=0 for all τ∈P. Since 0σ=0, the group Sh(P,n) fixes [math], and is therefore intransitive.
2.1. Permutation group concepts and results
Let A be a permutation group acting on a finite set Δ. For an element a∈A, we denote by suppΔ(a) the set of points of Δ that are moved by a, that is, points δ∈Δ such that δa=δ, and we call ∣suppΔ(a)∣ the degree or support size of a. The minimal degree of A, denoted mindegΔ(A), is the minimum of ∣suppΔ(a)∣ over all non-identity elements a∈A. The action of A is said to be semiregular if the stabiliser of any point is the identity.
Let Σ=Δc for some integer c⩾2, and denote the ith factor of Σ by Δi. Each of the groups Ac and Sym(c) has a natural action on Σ: the former by acting componentwise, and the latter by permutation of the coordinates i. The product action of the wreath product A≀Sym(c)=Ac⋊Sym(c) on Σ is the combination of these actions of Ac and Sym(c). More precisely, for any δ=(δ1,…,δc)∈Σ, g=(a1,…,ac)∈Ac, and ρ∈Sym(c),
[TABLE]
where i′=iρ−1 for each i.
Lemma 2.7**.**
Let Δ be a set with ∣Δ∣=d, and A⩽Sym(Δ) such that A=1. Let X=A≀Sym(c)=Ac⋊Sym(c) with c⩾2, in product action on Σ=Δc. Let m=mindegΔ(A). Then
[TABLE]
In particular mindegΣ(X)⩽dc−1(d−1).
Proof.
Note that m⩽d−1 if A is not semiregular on Δ. First we exhibit an element of X with support size dc−1m when A is not semiregular. By definition A contains an element with support size m in Δ. Let a be such an element, and let g:=(a,1,…,1)∈Ac⩽X. Then a point δ=(δ1,…,δc)∈Σ is moved by g if and only if δ1a=δ1. Thus there are no restrictions on the δi for i>1 but we must have δ1∈suppΔ(a), and hence ∣suppΣ(g)∣=dc−1m.
Next we show that each g=(g1,…,gc)∈Ac∖{1} has support size at least dc−1m in Σ. This holds if A is semiregular on Δ since in this case m=d and Ac is semiregular on Δc. If A is not semiregular, then since g=1, we must have gi=1 for some i, and without loss of generality we may assume that i=1. Thus g1=1 so m′:=∣suppΔ(g1)∣⩾m. As in the previous paragraph, each point δ=(δ1,…,δc)∈Σ such that δ1∈suppΔ(g1) is moved by g, and this implies that ∣suppΣ(g)∣⩾dc−1m′⩾dc−1m.
Finally we examine the supports of elements g∈X∖Ac. Such elements have the form g=(g1,…,gc)α with the gi∈A and α a non-identity element of Sym(c). Since α=1, there exist distinct i,j∈{1,…,c} such that j=iα=i. For δ=(δ1,…,δc)∈Σ, we have
[TABLE]
The jth entry of δg is δjα−1gjα−1=δigi, so if δj=δigi then δg=δ (regardless of the choices of the δu for u=j). Hence g has support size at least dc−1(d−1). If A is not semiregular then this is at least dc−1m because m⩽d−1, and hence mindegΣ(X)=dc−1m.
If A is semiregular then all g∈Ac have support size at least dc−1(d−1), and if g=α is the transposition (i,j) then the discussion shows that g has support size precisely dc−1(d−1), and hence mindegΣ(X)=dc−1(d−1).
∎
2.2. Primitive prime divisors
We need the following results in Section 5.
For a prime p and positive integer n, we denote by np the highest power of p dividing n; np is called the p-part of n. For a prime power q and an integer d, the prime p is called a primitive prime divisor of qd−1 if p divides qd−1 and does not divide qe−1 for any e<d.
Lemma 2.8** (Zsigmondy’s Theorem [25]).**
Let q be a prime power. Then there exists a primitive prime divisor of qd−1, unless (q,d)=(2,6) or q is odd, d=2 and q+1=2c for some c.
Lemma 2.9** ([18, Lemma 4.1], [20, Lemma 4.1]).**
Let q be a prime power, p a prime not dividing q, and e the smallest positive integer such that p divides qe−1. Then p divides qd−1 if and only if e divides d. Furthermore:
- (1)
If p=2, then (q^{d}-1)_{2}=(q^{d_{2}}-1)_{2}=\left\{\begin{aligned} &(q^{2}-1)_{2}\cdot(d/2)_{2}&&\text{if dis even}\\
&(q-1)_{2}&&\text{ifd is odd.}\end{aligned}\right.
2. (2)
If p=2 and e divides d, then (qd−1)p=(qe−1)p⋅(d/e)p.
Remark 2.10*.*
For a primitive prime divisor p of qd−1, the Sylow p-subgroups of GL(d,q) are cyclic. Indeed, the Sylow p-subgroups are contained in the so-called Singer cycle subgroups which are cyclic of order qd−1 (see [12, Kapitel I, 7.3]). This observation also applies to the simple sections of GL(d,q), such as PSL(d,q), PSp(d,q) (if d is even) and PSU(d,q1/2) (if q is a square), etc.
3. Product identification in the “power case”
In this section we prove Theorem 1.4.
Let us say that we are in the power case if there is an integer ℓ⩾2 such that k=ℓe and n=ℓf for some positive integers e and f.
In the power case we can define an identification of [kn]=[ℓe+f] with [ℓ]e+f as follows. For an integer m and i∈[ℓm], there are unique integers ir∈[ℓ] such that
[TABLE]
and we identify i∈[ℓm] with an element in [ℓ]m as follows:
[TABLE]
Lemma 3.1**.**
Assume that we are in the power case with k=ℓe and n=ℓf.
The shuffle respects the identification of [ℓe+f] with [ℓ]e+f in (5) and for each i∈[kn] we have
[TABLE]
That is, the effect of σ is to shift the coordinates of any card e places to the left (modulo e+f). Moreover, the order of σ is gcd(e,f)e+f.
Proof.
Let i∈[kn] and let i∼(i0,…,ie+f−1) as in (5). We compute the effect of the shuffle σ on i. If i=0 or i=kn−1, then i∼(0,…,0) or i∼(ℓ−1,…,ℓ−1), respectively. Now iσ=i, and since all of the coordinates of the representation of i as in (5) are equal, we see that shifting the entries to the left by e places (modulo e+f) also fixes this representation of i. Thus we may assume that 0=i=kn−1. By Lemma 2.2 we have iσ≡ik(modℓe+f−1). Now i=∑r=0e+f−1irℓe+f−1−r, and to evaluate ik, we write i=A+B where A=∑r=0e−1irℓe+f−1−r and B=∑r=ee+f−1irℓe+f−1−r. First,
[TABLE]
Now ∑r=0e−1irℓe−1−r=∑r=fe+f−1ir−f(mode+f)ℓe+f−1−r=∑r=fe+f−1ie+r(mode+f)ℓe+f−1−r.
Secondly, we write B=∑r=ee+f−1irℓe+f−1−r=∑r=0f−1ie+rℓf−1−r and then
[TABLE]
Hence ik≡∑r=0e+f−1ie+r(mode+f)ℓe+f−1−r(modℓe+f−1), and so
[TABLE]
as required. For each j, σj permutes the coordinates by moving all coordinates of a tuple to the left by ej places, modulo e+f, and so the order of σ is the least positive integer q such that eq is a multiple of e+f, namely q=gcd(e,f)e+f.
∎
Lemma 3.2**.**
Assume we are in the power case with k=ℓe and n=ℓf. If P⩽Sym(ℓ)≀Sym(e) respects the identification of [k] with [ℓ]e as in (5), then Pρ respects the identification of [kn] with [ℓ]e+f as in (5) and hence Pρ⩽Sym(ℓ)≀Sym(e+f) in product action on [ℓ]e+f. Further, if τ=(x0,…,xe−1)α∈P, then ρτ=(x0,…,xe−1,1,…,1)α~, where α~∈Sym(e+f) is defined by α~:r↦rα for 0⩽r⩽e−1 and α~:r↦r for e⩽r⩽e+f−1.
Proof.
Write each card i∈[kn] in its base-ℓ expansion as in (4), and note that
[TABLE]
where a=∑r=0e−1irℓe−r−1∈[k] and b=∑r=ee+f−1irℓe+f−r−1∈[n]
(as in the proof of Lemma 3.1). By (5), a∈[k] is identified with the tuple (i0,…,ie−1) in [ℓ]e. Let τ∈P. Then τ=(x0,…,xe−1)α for some α∈Sym(e) and x0,…,xe−1∈Sym(ℓ). By (2) we have iρτ=(an+b)ρτ=(aτ)n+b, where
[TABLE]
and so, with α~ as in the statement, setting r′=rα~−1 for each r∈[e+f],
[TABLE]
Hence
[TABLE]
and so ρτ is the element (x0,…,xe−1,1,…,1)α~ of Sym(ℓ)≀Sym(e+f), as required.
∎
Proposition 3.3**.**
Assume we are in the power case with k=ℓe and n=ℓf. For Q⩽Sym(ℓ) and T⩽Sym(e) let P=Q≀T=S⋊T act in product action on [ℓe] with respect to the identification in (5) and set G=Sh(P,n).
Let W=Q≀Sym(e+f)=X⋊Y act in product action on [ℓe+f] with respect to the identification in (5), where X=X0×⋯×Xe+f−1≅Qe+f, Xr≅Q for each r∈[e+f], and Y≅Sym(e+f). Then G⩽W and the following hold:
- (1)
Pρ∩X=X0×⋯×Xe−1* and Pρ∩Y=Tρ⩽Sym(e)×1<Sym(e+f);*
2. (2)
the shuffle σ lies in Y;
3. (3)
G∩Y=⟨ρτ,σ∣τ∈T⟩=⟨Tρ,σ⟩;
4. (4)
if T is transitive on [e], then G∩Y is a transitive subgroup of Y, X⩽G, and consequently G=X⋊(G∩Y)=Q≀(G∩Y) in product action on [ℓ]e+f;
5. (5)
if f=ce for some c∈N, then G∩Y≅T≀C1+c (acting imprimitively on [e+f]);
6. (6)
if f∤e and T=Sym(e), then G∩Y=Sym(e+f).
Proof.
By Lemmas 3.1 and 3.2, Tρ and σ both lie in the top group Sym(e+f) of W, and we comment on this in more detail. By Lemma 3.2, each element (x0,…,xe−1) of S is mapped by ρ to the element (x0,…,xe−1,1,…,1) of X, and each element of T is is mapped by ρ to an element of Y with support in [e]={0,…,e−1}. Thus Sρ≅S is the subgroup X0×⋯×Xe−1 of X, and Tρ≅T is the subgroup {y∈Y ∣ supp(y)=[e]} of Y. Therefore Pρ∩X=Sρ and Pρ∩Y=Tρ, which proves (1). By Lemma 3.1, the action of σ on [kn] induces a permutation of the coordinates of points of [ℓ]e+f which sends each r∈[e+f] to (r+e)(mode+f), so σ∈Y. This proves (2).
Now G=⟨Pρ,σ⟩, Pρ=(Pρ∩X)⋊(Pρ∩Y) by part (1), and σ∈Y by part (2). Thus G∩Y=⟨Pρ,σ⟩∩Y=⟨Pρ∩Y,σ⟩=⟨Tρ,σ⟩,
proving part (3). Now suppose that T is transitive on [e]. Let r∈[e+f] and write r=pe+q for some p,q∈Z with 0⩽q⩽e−1. By Lemma 3.1, σp maps an arbitrary card i∼(i0,…,ie+f−1) to iσp∼(ipe,…,ie+f−1,i0,…,ipe−1) so in iσp the entry ir appears in position q∈[e+f]. Then since T is transitive on [e] and q<e, some element τ∈T satisfies qτ=0, and hence by Lemma 3.2, ρτ permutes the entries of the tuple iσp in such a way that iσpρτ has entry ir in position [math]. Thus ⟨Tρ,σ⟩ is transitive on [e+f], proving the first part of (4).
To complete the proof of (4), note that G∩X⩾Pρ∩X=Sρ=X0×⋯×Xe−1 by part (1), and we have just proved that G∩Y is transitive on [e+f]. Now the action of G∩Y on [e+f] is permutationally isomorphic to the conjugation action of G∩Y on the set {X0,…,Xe+f−1} of direct factors of X. It follows that each Xj⩽G∩X and hence X⩽G. Thus G=X⋊(G∩Y)=Q≀(G∩Y) and (4) is proved.
For (5) and (6) set E:=Tρ, and for any integer r set Er:=Eσ−r and Δr:=supp(Er)={re,re+1,…,(r+1)e−1}, reading entries of Δr modulo e+f.
Assume first that f is a multiple of e. By Lemma 3.1, σ sends each r∈[e+f] to (r−e)(mode+f), and σ has order (e+f)/e=1+f/e. For distinct r,s∈[1+f/e] we see that Δr∩Δs=∅, so [Er,Es]=1. In fact ∏r=0f/eEr≅T1+f/e, and this group is normalised by σ. Clearly, G∩Y=⟨E,σ⟩=⟨Er,σ ∣ 0⩽r⩽f/e⟩≅T≀C1+f/e. Hence (5) is proved.
Now assume that f is not a multiple of e, and that T=Sym(e). Then Er=Sym(Δr) for each r∈[e+f]. Write e+f=pe+q for some p,q∈Z with 0⩽q⩽e−1, and note that q⩾1 since f is not a multiple of e. Now [e+f]=⋃r=0pΔr. Note that if, for finite sets Δ and Δ′ we have Δ∩Δ′=∅, then ⟨Sym(Δ),Sym(Δ′)⟩=Sym(Δ∪Δ′). For 0⩽j⩽p−1, we have Δp+j+1={(p+j+1)e,…,(p+j+2)e−1} (modulo e+f), which is
{(j+1)e−q,…,(j+2)e−q−1} (modulo e+f) since e+f=pe+q, and so Δp+j+1∩Δj contains (j+1)e−1 (since q⩾1) and Δp+j+1∩Δj+1 contains (j+1)e (since q⩽e−1). Thus (applying our observation twice), ⟨Ej,Ej+1,Ep+j+1⟩=Sym(Δj∪Δj+1∪Δp+j+1) and it follows that \langle E_{j}\ |\ 0\leqslant j\leqslant p\rangle=\mathrm{Sym}\big{(}\bigcup_{j=0}^{p}\Delta_{j}\big{)}=\mathrm{Sym}([e+f])\cong\mathrm{Sym}(e+f). Since each Ej⩽G∩Y we conclude that G∩Y≅Sym(e+f), proving (6).
∎
If ℓ is prime, then the identification in (5) also defines an identification of [ℓm] with the vector space Fℓm, relative to a given basis of Fℓm.
Proposition 3.4**.**
Assume we are in the power case with k=ℓe, n=ℓf and that ℓ is prime. Suppose that P=V⋊T⩽AGL(V)=AGL(e,ℓ) is an affine subgroup of Sym(k) preserving the identification of [k] with Fℓe as in (5). Then there is an identification of [kn] with W:=Fpe+f such that Sh(P,ℓf)⩽AGL(W). Further,
- (1)
Sh(P,n)=W⋊S, where S⩽GL(W) and S=⟨Tρ,σ⟩,
2. (2)
if f=ce for some c∈N, then S≅T≀Cc+1, an imprimitive linear group on W,
3. (3)
if e∤f and T=GL(V), then S=GL(W).
Proof.
Recall from (4) and (5) that each i∈[kn] can identified with the (e+f)-tuple (i0,…,ie+f−1) of coefficients of its base-ℓ expansion i=∑r=0e+f−1irℓe+f−1−r, where 0⩽ir⩽ℓ−1 for each r. This establishes a bijection of [kn] with W and this particular bijection defines a subgroup AGL(W) of Sym(kn). Similarly, P respects the identification of [k] with V=Fℓe as in (5). We can view each i∈[kn] as a vector v+u∈W where v=(v0,…,ve+f−1) with vr=ir for r=0,…,e−1 and vr=0 for r=e,…,e+f−1 and u=(u0,…,ue+f−1) with ur=0 for r=0,…,e−1 and ur=ir for r=e,…,e+f−1. Also
[TABLE]
where a=∑r=0e−1irℓe−r−1∈[ℓe] and b=∑r=0f−1ie+rℓf−r−1∈[ℓf]. Since b<ℓf=n, we see that the first e coordinates of the representation of i determine the column a that card i lies in and the last f coordinates determine the row b.
We work with the following bases: let w0=(1,0,…,0), w1=(0,1,0,…,0), …, we+f−1=(0,…,0,1) be basis vectors for W, and let v0=(1,0,…,0), …, ve−1=(0,…,1) be basis vectors for V. We write elements of GL(W),GL(V) as matrices relative to these bases.
By Lemma 3.1, the shuffle σ preserves the above identification of [kn] with W, and permutes the coordinates of a card by moving each entry e places to the left (reading subscripts modulo e+f). Thus σ fixes 0∼(0,…,0) and acts as a linear map on W corresponding to a permutation matrix, so σ∈GL(W).
Let τ∈P, and consider the action of ρτ on a typical element i=an+b, as above, so a∼(a0,…,ae−1)∈V, and i=an+b∼(a0,…,ae+f−1)∈W. Then iρτ=(an+b)ρτ=aτn+b by (2).
If τ=tv is a translation by some vector v=(v0,…,ve−1) in V, then aτ∼(a0,…,ae−1)+(v0,…,ve−1) and hence iρτ∼(a0,…,ae+f−1)+(v0,…,ve−1,0,…,0), so ρτ is a translation by a vector of W.
On the other hand, if τ∈GL(V) corresponds to multiplication by a matrix A, then aτ=aA=(a0′,…,ae−1′), say, and so iρτ∼(a0′,…,ae−1′,ae,…,ae+f−1) is the image of (a0,…,ae+f−1) under the matrix \left[\begin{array}[]{cc}A&0\\
0&I_{f}\end{array}\right] in GL(W). Hence ρτ∈GL(W). Thus S=⟨Tρ,σ⟩=⟨ρτ,σ∣τ∈T⟩⩽GL(W).
From the previous paragraph we have that, for each r∈[e], if τ=tvr is the translation by vr, then ρτ=twr is the translation by wr. We claim that the span ⟨(Vρ)σi∣i∈[e+f]⟩=W.
For an arbitrary i∈[e+f], write i=pe+q with q∈[e] and p∈[f]. Then Lemma 3.1 implies that the conjugate
[TABLE]
and this lies in (Vρ)σ−p=(Vρ)σe+f−p. Hence the claim is proved. Now Sh(P,n)=⟨Vρ,Tρ,σ⟩⩽WS, and since W⩽Sh(P,n), we have equality. This establishes part (1).
If f=ce, then σ has order c+1 and the c+1 conjugates of Tρ under σ pairwise commute. Hence S≅T≀Cc+1 is an imprimitive linear group, preserving a decomposition W=⨁i=1c+1Vi where each Vi has dimension e and is in bijection with (Vρ)σ−i. Thus (2) holds.
Suppose now that e∤f and that T=GL(V). Similarly to the proof of Proposition 3.3(6), we use the fact that if W can be written as X+Y with X∩Y={0}, then, for the obvious embeddings of GL(X) and GL(Y) in GL(W), we have GL(W)=⟨GL(X),GL(Y)⟩. Since e∤f, there are integers p,q with 1⩽q⩽e−1 such that e+f=pe+q. Thus q≡−pe(mode+f) and we set
[TABLE]
For r=0,…,p−1, Wr∩Wp−r′ contains wre+q and hence is non-zero, and also, if r⩽p−2, then
Wr+1∩Wp−r′ contains w(r+1)e and hence is non-zero. Now
(Tρ)σ−r and (Tρ)σp−r are isomorphic to and induce the full general linear groups on Wr and Wp−r′ respectively. Hence, repeatedly applying the fact mentioned above to W0+Wp′, W0+Wp′+W1, W0+Wp′+W1+Wp−1′, etc.,
we obtain S=GL(W), proving part (3).
∎
Proof of Theorem 1.4.
Part (1) follows from setting P=Sym(ℓ)≀Sym(e) in Proposition 3.3. Part (2) follows from setting P=AGL(e,ℓ) in Proposition 3.4.
∎
Proof of Corollary 1.5.
We assume that f is not a multiple of e and that k=4. In particular, this means that e⩾2 and e+f⩾3. Without loss of generality, we may assume that ℓ is not itself a proper power. If ℓ⩾5 set P=Sym(ℓ)≀Sym(e) and otherwise (i.e. if ℓ=2 or 3), set P=AGL(e,ℓ). Now, let G=Sh(Sym(k),n) and H=Sh(P,n). By Theorem 1.4, if ℓ⩾5, then H=Sym(ℓ)≀Sym(e+f), otherwise ℓ=2 or 3 and H=AGL(e+f,ℓ). Since P⩽Sym(k), we have H⩽G.
Claim 1. H⩽Alt(kn) if and only if ℓ is even; and G⩽Alt(kn) if and only if ℓ is even. If ℓ is even, then both k and n are even, and since e⩾2, k≡0(mod4). Hence by Corollary 2.4(1) and (2),
G⩽Alt(kn) and hence also H⩽Alt(kn).
Suppose now that ℓ is odd. We will show that HAlt(kn), and hence also that GAlt(kn). Since ℓ is odd, both k and n are odd. Further, if ℓ⩾5 then P=Sym(ℓ)≀Sym(e)⩽Alt(ℓe), and also if ℓ=3 then P=AGL(e,3)Alt(3e). Hence by Corollary 2.4(3), HAlt(kn).
Claim 2. One of the following occurs: (i) G=H; (ii) G=Sym(kn) (ℓ odd) or Alt(kn) (ℓ even).
Let X=Sym(kn) if ℓ is odd or Alt(kn) if ℓ is even. By Claim 1, H⩽G⩽X. If H is maximal in X then G is as in (i) or (ii), so assume that H<G<X. If ℓ⩾5 then since e+f⩾3, it follows from [16, Theorem, and Remark 2] that H is maximal in X, contradiction. Thus ℓ=2 or 3 and H=AGL(e+f,ℓ). Then, since e⩾2 and k=4, again [16, Theorem] implies that H is maximal in X, a contradiction.
It remains to prove that case (i) of Claim 2 does not arise.
Case (i) of Claim 2 does not arise if ℓ⩾5. Suppose that G=H. Let τ∈Sym(k) be a transposition. Then ρτ∈G has support size 2n=2ℓf. Thus mindeg[ℓ]e+f(G)⩽2ℓf. On the other hand, by Lemma 2.7, mindeg[ℓ]e+f(G)=2ℓe+f−1, and hence e=1, which is a contradiction.
Case (i) of Claim 2 does not arise if ℓ=2 or 3. Suppose that ℓ=2 or 3 and that G=H so G=AGL(e+f,ℓ). Note that G is not regular and consider a non-identity element g∈G with at least one fixed point. Without loss of generality we may assume that g fixes the zero vector of the vector space so that g lies in GL(e+f,ℓ). The set of fixed points of g forms a subspace, and so \big{|}\mathrm{supp}_{[\ell]^{e+f}}(g)\big{|}=\ell^{e+f}-\ell^{j}, for some j⩾0. Now choose g=ρτ for some τ∈Sym(k) moving u points of [k] with u⩾2. Then \big{|}\mathrm{supp}_{[\ell^{e+f}]}(g)\big{|}=un=u\ell^{f}, so ℓe+f−ℓj=uℓf and hence j⩾f and ℓe−ℓj−f=u. If we choose τ to be a transposition in Sym(k) then u=2, which is coprime to 3. Hence if ℓ=3 then j must be equal to f and 3e−1=2, which
implies that e=1, contradiction. Thus ℓ=2 and we have 2e−2j−f=2. Since e⩾2 this implies that j=f+1 and 2e=4, a contradiction since k=4.
∎
4. Primitivity of Sh(P,n) for k⩾3
The situation where k=2 was completely dealt with in [8] (see Theorem 1.1). A first observation is that each group Sh(Sym(2),n) is imprimitive. On the other hand it follows from the first assertion of Theorem 1.6 (which we prove in this section) that
Sh(Sym(k),n) is primitive for each k⩾3. In particular, to prove Theorem 1.6, we only need to establish its first assertion.
Recall that [u]={0,1,…,u−1} for u a positive integer.
For a∈[k] and b∈[n], define
[TABLE]
Thus the sets Ca are the piles of cards in [kn], and for each b, the set Rb consists of the bth card from each pile.
Let P be a transitive subgroup of Sym(k) and set G=Sh(P,n). Recall that G is a permutation group on [kn], and that the action of its subgroup Pρ on {Ca ∣ a∈[k]} is permutationally isomorphic to the action of P on [k].
Our next result uses the notion of an orbital digraph: for a∈[k]∖{0}, the P-orbit on ordered pairs containing Δ=(0,a)P is called an orbital, and its associated orbital digraph G(Δ) has vertex set [k] and set of directed edges Δ. By definition P acts arc-transitively on G(Δ).
Proof of Theorem 1.6.
Assume for the sake of contradiction that G is imprimitive on [kn] for some k⩾3,n⩾2, and let B be a nontrivial block of imprimitivity for G containing [math]. Recall that σ∈G0. Thus σ fixes B setwise.
Claim 1. If τ∈P is such that 0τ=0 in [k] and Bρτ=B in [kn], then Rb⊆B for all b∈B∩C0. To prove this let b∈B∩C0 and r∈[k], and set a:=0τ, and consider the orbital Δ:=(0,a)P in [k]×[k]. Since P is primitive on [k], the orbital digraph G(Δ) for Δ is connected, see [10, Theorem 3.2A]. We shall prove that rn+b∈B by induction on the distance d=dΔ(0,r) from [math] to r in G(Δ). This is true if d=0 by assumption. Suppose then that d⩾1, and that sn+b∈B whenever dΔ(0,s)⩽d−1. Then there exists t∈[k] such that dΔ(0,t)=d−1 and dΔ(t,r)=1. Now both (0,a) and (t,r) are arcs of G(Δ), so for some δ∈P we have 0δ=t and aδ=r. By (2), the image of b∈B under ρδ is equal to bρδ=(0.n+b)ρδ=0δn+b=tn+b, and since tn+b∈B by induction, it follows that B∩Bρδ contains tn+b and hence ρδ fixes B setwise. Thus also ρτρδ=ρτδ fixes B setwise, and hence B contains bρτδ. Again using (2),
[TABLE]
Thus, by induction, B contains rn+b for all r∈[k], proving the claim.
Claim 2. If τ∈P is such that 0τ=0 in [k], then B∩Bρτ=∅. Assume to the contrary that 0τ=0 in [k], and Bρτ=B. Then R0⊆B by Claim 1. Since σ fixes B setwise, we have R0σ⊆B, and hence, using (1), R0σ={0,…,k−1}⊆B. Setting m1=min{n−1,k−1} it follows again from Claim 1 that ⋃b=0m1Rb⊆B. For r⩾1, set mr:=min{n−1,kr−1}. Let r be maximal such that ⋃b=0mrRb⊆B, and note that we have shown that this holds for r=1. If mr=n−1 then B contains ⋃b=0n−1Rb=[kn], contradicting the fact that B is nontrivial. Thus mr=kr−1<n−1, and again using the fact that σ fixes B setwise, it follows that, for all b satisfying 0⩽b⩽kr−1,
[TABLE]
In particular, {0,…,kr+1−1}⊆B. If kr+1−1⩽n−1 then applying Claim 1, ⋃b=0mr+1Rb⊆B, contradicting the maximality of r. Hence kr+1−1>n−1, so mr+1=n−1, B contains C0, and applying Claim 1, [kn]=⋃b=0n−1Rb⊆B, a contradiction.
Claim 3. B⊆C0. Indeed, let a∈[k]∖{0}. Since P is primitive and non-regular the only point fixed by Pa is a, so some τ∈Pa does not fix [math]. Thus Bρτ∩B=∅ by Claim 2. This means that ρτ must move every point of B, and hence B⊆supp(ρτ)=⋃r∈supp(τ)Cr. Thus, since τ fixes a, we have B∩Ca=∅, and since this holds for all a∈[k]∖{0},
it follows that B⊆C0, proving the claim.
To obtain a final contradiction, let b be the maximum element of B. Since B is nontrivial, and using Claim 3, we have 0<b⩽n−1. Now since σ fixes B setwise, B contains bσ, and by (1), bσ=bk. Note that bk⩽kn−1 since b⩽n−1, so bσ=bk∈[kn]. However bk>b (since b=0), contradicting the maximality of b. Thus no nontrivial blocks exist and therefore G must be primitive.
∎
5. 2-transitivity of Sh(P,n) when k>n
Assume throughout this section that k>n⩾2. For any a∈[k] and b∈[n] let Ca and Rb be as in (6) and (7), respectively. In addition let
[TABLE]
The next lemma proves the first assertion of Theorem 1.8.
Lemma 5.1**.**
Suppose that 2⩽n<k. If P is 2-transitive then Sh(P,n) is 2-transitive.
Proof.
Let G=Sh(P,n), and let Δ=nG0, the G0-orbit containing n. We will show that Δ=[kn]∖{0},
from which it follows that G is 2-transitive..
First observe that if τ∈P0 (in the action of P on [k]), then ρτ fixes C0 pointwise, and so ρτ fixes [math]. In particular, since P is 2-transitive, for any a∈[k]∖{0} there exists τ∈P0 such that 1τ=a. Hence, by (2), we have (n+b)ρτ=1τn+b=an+b, which shows that Rb′⊆(n+b)G0 for each b∈[n], and in particular that R0′⊆Δ. Also, recall from Lemma 2.2 that σ∈G0, and that for any b∈[n]∖{0} we have, since b<k, that (bn)σ=b by (1). It follows since Δ contains R0′ that Δ also contains
[TABLE]
Suppose that k⩾2n. Then, for n⩽i⩽2n−1, we have i∈C1 and, since n<k, in∈R0′⊆Δ. Again (in)σ≡ikn(modkn−1)=i by Lemma 2.2, and it follows, as above, that C1⊆Δ. Hence, for b⩽n−1, n+b∈Δ so (n+b)G0=Δ, and we showed above that Rb′⊆(n+b)G0. Thus Δ contains ∪b∈[n]Rb′ as well as C0′, so Δ=[kn]∖{0}. Therefore G is 2-transitive on [kn].
Finally suppose that n+1⩽k<2n, so k=n+q, where 1⩽q⩽n−1. Let b∈[n]. If 1⩽b⩽q−1 then (n+b)n∈R0, and by Lemma 2.2, ((n+b)n)σ=n+b. Since R0′⊆Δ, it follows that n+b∈Δ for 1⩽b⩽q−1, and hence that Rb′⊆Δ for 1⩽b⩽q−1. Suppose first that q⩾2.
Then this gives in particular R1′⊆Δ, and as C0′⊆Δ we have R1⊆Δ.
Let 0⩽a⩽n−q−1 and set b=q+a. Then an+1∈R1⊂Δ so that (an+1)σ∈Δ,
and by (1), (an+1)σ=1⋅k+a=n+q+a=n+b, so n+b∈Δ. Now q⩽b⩽n−1 so n+b∈Rb′,
and hence Rb′⊆(n+b)G0=Δ. Since this is true for each b=q,…,n−1, it follows as in the previous case that
Δ=[kn]∖{0}, and G is 2-transitive on [kn].
This leaves the case where k=n+1. Let Δ′:=(n+1)G0. We have already shown that C0′∪R0′⊆Δ.
Claim. ⋃i=1n−1Ri′⊆Δ′. Let 1⩽b⩽n−1, so b∈[k] and n+b∈Rb′⊆(n+b)G0. Using (1) we have (n+b)σ=bk+1=bn+(b+1). Note that b+1⩽n<k so b+1∈[k], and hence (n+b)σ∈Rb+1′. Applying this for each b∈{1,…,n−1} shows that each Rb+1′ is contained in (R1′)G0⊆(n+1)G0=Δ′, proving the claim.
Thus the union Δ∪Δ′=[kn]∖{0}, and to complete the proof we simply need to find a single point in Δ∩Δ′. Consider 1∈C0′⊂Δ. Then Δ also contains 1σ=k=n+1∈Rb′⊂Δ′. Thus the orbits Δ,Δ′ intersect nontrivially, and hence are equal. Thus Δ=[kn]∖{0}, completing the proof.
∎
Lemma 5.2** (Theorem 1.8 (1)).**
Suppose that 2⩽n<k and let P=Sym(k) or Alt(k). Then either (k,n)=(4,2) and Sh(P,n)=AGL(3,2), or (k,n)=(4,2) and Sh(P,n)=Sym(kn) or Alt(kn). Moreover, Sh(P,n)=Alt(kn) if and only if one of the following holds:
- (1)
n(4)=0, or
2. (2)
n(4)=2* and k(4)∈{0,1}, or*
3. (3)
n* is odd, P⩽Alt(k), and either n(4)=1 or k(4)∈{0,1};*
and Sh(P,n)=Sym(kn) if and only if one of the following holds:
- (4)
n(4),k(4)∈{2,3}, or
2. (5)
n* is odd and P=Sym(k).*
Proof.
Let G=Sh(P,n) where P=Sym(k) or Alt(k). If (k,n)=(4,2) then it was found that G=AGL(3,2) in [19, Table 2].
So we may assume that (k,n)=(4,2). Suppose for a contradiction that GAlt(kn). Pick 1=τ∈P of minimal support μ(P) on [k]. Then the element ρτ of G moves μ(P)n cards, and so
μ(G)⩽μ(P)n.
Since G is 2-transitive and doesn’t contain the alternating group, it follows that μ(G)⩾(kn/4)−1 by a result of Bochert [1] which can also be found with more details in de Séguier’s book [7, pp. 52–54]. Using the fact that μ(G)⩽μ(P)n and rearranging, we have k⩽4μ(P)+4/n. Now μ(P)=2 or 3 and since n⩾2, we have k⩽14. Since 2⩽n<k⩽14, these cases may be checked by computer. We used Magma [3] which showed that G=Alt(kn) or G=Sym(kn) in the respective cases, which gives a contradiction. Hence G contains Alt(kn). The conditions follow from Corollary 2.4.
∎
Recall that Burnside’s Theorem states that a 2-transitive group is either of affine or almost simple type. Thus, both P and the 2-transitive group Sh(P,n) appearing in Lemma 5.1 have one of these types. In the remainder of this section we explore the connection between the type of P and the type of Sh(P,n).
Lemma 5.3** (Theorem 1.8 (2)).**
Suppose that 2⩽n<k, and suppose that P is 2-transitive on the set of piles. If Sh(P,n) is affine then P is affine.
Proof.
Suppose that Sh(P,n) is affine, so that there is a prime p and integers e and f such that k=pe and n=pf, with e>f since k>n. Let us assume for a contradiction that P is almost simple, and let T=soc(P). So T is a nonabelian finite simple group; moreover, T is a transitive subgroup of Sym(k) since T is normal in P and P is 2-transitive. Hence, for any stabiliser H in T (in the action of T on [k]), we see that ∣T:H∣=k=pe. Thus, by [13, Theorem 1], T and H belong to one of five cases, which we consider separately below.
Case (a). T=Alt(k) and H≅Alt(k−1), where k⩾5. Then P=Alt(k) or Sym(k). Since k⩾5, Sh(P,n)=Alt(kn) or Sym(kn) by Lemma 5.2. However kn⩾5, so neither Alt(kn) nor Sym(kn) is affine, a contradiction.
Cases (c) and (d). T=PSL(2,11) and H≅Alt(5); or T=M23 and H≅M22; or T=M11 and H≅M10. In these cases the degree of T is prime, and so k=p∈{11,23}. Since pf=n<k=p, this gives a contradiction.
Case (e). T=PSU(4,2) and ∣T:H∣=27. Since k=27 either n=3 or n=9. Using Magma [3] we see that Sh(P,n) contains Alt(kn), a contradiction.
Case (b). T=PSL(d,q) for some prime power q and prime d, and H is the stabiliser of a line or hyperplane. In this case k⩾5 and
[TABLE]
Let V=soc(Sh(P,n)), so that V is elementary abelian of order pe+f. Since T⩽Sh(P,n), and T is simple, we have T∩V=1. Hence T embeds into Sh(P,n)/V, which can be identified with a subgroup of GL(e+f,p). By (8) we must have p=q. Thus we have a coprime representation of T, and so by [15, Theorem 5.3.9] we have
[TABLE]
Suppose first that d⩾3. If q⩾3 then
[TABLE]
and so pe⩽2e+2f. Recall that e>f; hence pe<4e and since k⩾5 this implies that p=2 and k=pe=8. Now, since q⩾3 and d⩾3 we obtain from (8) that
[TABLE]
a contradiction. If q=2 then by (8) we have
[TABLE]
also a contradiction. Hence d=2, and by (8) we have pe=q+1, so that pe=(q−1)+2⩽2(e+f)+2⩽4e. Since
k⩾5, this implies that e⩾2, and hence (p,e)∈{(2,3),(2,4),(3,2)}. This gives k=pe=8,16,9. Since k=q+1, and q is a prime power, we have (k,q)∈{(8,7),(9,8)}. Thus either k=8, n=2 or 4, and T=PSL(2,7), or k=9, n=3, and T=PSL(2,8). In these cases, we use Magma to verify that Sh(T,n) contains Alt(kn), and since Sh(T,n)⩽Sh(P,n), this contradicts Sh(P,n) being affine. This completes the proof.
∎
5.1. The case where P is affine and Sh(P,n) is almost simple
It follows from Lemma 5.3 that the 2-transitive group Sh(P,n) is almost simple whenever P is 2-transitive and almost simple. If P is affine, then by Proposition 3.4 the group Sh(P,n) is affine whenever the power case holds, that is, whenever k and n are powers of the same prime. We next consider the case where P is 2-transitive and affine but the power case does not hold, and determine which 2-transitive almost simple group could arise as Sh(P,n). Let us fix the following notation for the remainder of this section:
2⩽n<k and k=pe for some prime p;
P is a 2-transitive affine group of degree k;
G:=Sh(P,n) and T:=soc(G) is a finite nonabelian simple group.
We observe that kn is not prime, and so
[TABLE]
and since P is 2-transitive, ∣P∣ is divisible by pe(pe−1). Further, by Lemma 5.1, G is 2-transitive.
The classification of 2-transitive almost simple groups is complete and the list of possible groups can be found in [4, Table 7.4]; it consists of seven infinite families, which are listed in Table 2, and several sporadic examples which we can deal with immediately.
Lemma 5.4**.**
If G is almost simple, then T must appear in Table 2.
Proof.
If G is almost simple and T=soc(G) does not appear in Table 2 then G must appear in one of rows 8–18 of [4, Table 7.4]. Since k=pe, n<k, and kn is the degree of the group Sh(P,n), clearly kn is not prime. In each case, there is a unique choice for k which in turn defines n, and we are then able to use Magma [3] to calculate Sh(P,n). We obtain a contradiction in each case. For example, if Sh(P,n)=Co3 with kn=276 then k=23, n=12 and P is an affine group of degree k. The computer calculation shows that Sh(P,n) contains Alt(276), a contradiction.
∎
In Lemmas 5.6–5.9 we consider the groups in rows 1–6 of Table 2 and eliminate all but the short list given in Table 3 below. These remaining groups are treated in Theorem 5.10. First we show that Pρ intersects T nontrivially.
Lemma 5.5**.**
If one of the rows of Table 2 holds for G=Sh(P,n) and T=soc(G), then Pρ∩T⩾soc(Pρ)=Cpe.
Proof.
This is trivially satisfied if row 1 or 2 of the table holds, since in this case G=T.
Suppose for the sake of contradiction that Pρ∩Tsoc(Pρ). Since soc(P) is the unique minimal normal subgroup of P and P≅Pρ, we then have Pρ∩T=1. Hence Pρ≅PρT/T⩽G/T, and in particular ∣G/T∣⩾pe(pe−1). Since pe⩾3, the 2-transitive group P is not cyclic, and so G/T is not cyclic. This rules out rows 3, 4, and 7. Consider next row 6. Now G/T=Cr.Cs with r dividing (d,q−1) and s dividing f. Since the 2-transitive group P is nonabelian and isomorphic to a subgroup of G/T, we must have that r⩾3, and thus d⩾3. Hence q2⩽qd−1<m<k2=p2e. On the other hand, we have k=pe>2, and since q=q0f we have f⩽2q, so
[TABLE]
which gives q2<p2e<q2, a contradiction.
It remains to deal with row 5, and, since T is nonabelian simple, we must have q⩾3.
Since m=q3+1⩾28 it follows from (9) that pe⩾6. Then, again using f⩽q/2 and (9),
[TABLE]
which has no solutions with q⩾3. Thus Pρ∩T⩾soc(Pρ).
∎
Lemma 5.6**.**
If G=Sh(P,n)=Sp(2d,2)=T as in row 1 or 2 of Table 2, then row 1 or 2 of Table 3 holds.
Proof.
Let G=Sh(P,n) and suppose for a contradiction that G=Sp(2d,2) for some integer d⩾3. Then kn=2^{d-1}\big{(}2^{d}+\epsilon\big{)} where ϵ∈{1,−1}. In particular, since k>n and k is a prime power, we have that k divides 2d+ϵ. By (9), 2^{d-1}\big{(}2^{d}+\epsilon\big{)}<p^{2e}. If p^{e}=k\leqslant\big{(}2^{d}+\epsilon\big{)}/3 then the above yields \big{(}2^{d}+\epsilon\big{)}^{2}/9>2^{d-1}\big{(}2^{d}+\epsilon\big{)}, so 2d+ϵ>9⋅2d−1=2d+7⋅2d−1. Since d⩾3, this gives a contradiction. Hence p^{e}=k>\big{(}2^{d}+\epsilon\big{)}/3, and since k divides 2d+ϵ we have that
[TABLE]
If d=3 we get the two cases given in rows 1 and 2 of Table 3, so we now assume d⩾4.
If ϵ=−1 then pe=2d−1. In particular, d=6 so by Lemma 2.8, 2d−1 has a primitive prime divisor, r say. Since r divides 2d−1=pe, we have r=p. Let c be a prime divisor of d, then 2c−1 divides 2d−1=pe, and so p must divide 2c−1. Since p is a primitive prime divisor of 2d−1, it follows that c=d. Hence d is prime, and since d⩾4, d is an odd prime.
If ϵ=1 then pe=2d+1.
By Lemma 2.8 and since d⩾4, 22d−1 has a primitive prime divisor, r say, and r does not divide 2d−1 by the definition of a primitive prime divisor. Thus r must divide 2d+1=pe, and again we have r=p. Hence, for both values of ϵ, p is a primitive prime divisor of 22d−1.
We claim that k=p and \big{|}N_{G}(\mathrm{soc}(P^{\rho}))/C_{G}(\mathrm{soc}(P^{\rho}))\big{|} divides 2d. If ϵ=1 then since p is a primitive prime divisor of 22d−1, soc(Pρ) is contained in a Singer cycle group Z22d−1 of GL(2d,2) and NGL(2d,2)(soc(Pρ))≅Z22d−1.2d (see [12, Kapitel II, 7.3]). Thus k=p and the claim is proved in this case. If ϵ=−1 then since d is odd, by considering orders, we see that the stabiliser GU=2(2d).GL(d,2) of a totally isotropic subspace U of dimension d contains a Sylow p-subgroup of G. Hence a Sylow p-subgroup of G is isomorphic to a Sylow p-subgroup of GL(d,2). Since p is a primitive prime divisor of 2d−1, the Sylow p-subgroups of GL(d,2), G, and therefore soc(Pρ), are cyclic. Hence k=p. Since soc(Pρ) preserves the decomposition U⊕U∗ of the underlying vector space, and this decomposition is unique for soc(Pρ), NG(soc(Pρ)) also preserves this decomposition. Thus NG(soc(Pρ))=NGL(d,2).2(soc(Pρ)). The same argument as above now gives that ∣NG(soc(Pρ))/CG(soc(Pρ))∣ divides 2d, and hence the claim is proved for both cases.
It follows that p−1⩽2d. This gives
[TABLE]
and so 2d⩾2d−1. Since we have d⩾4, this implies that d=4. Since 2d−1⩽2d+ϵ=k=p⩽2d+1, we get a contradiction if d=4. This completes the proof.
∎
Lemma 5.7**.**
If T=Sz(q) as in row 3 of Table 2, then G=T.3=Aut(T) and row 3 of Table 3 holds.
Proof.
Let q=2f⩾8 with f odd. Let r=2(f+1)/2 so r2=2q. Then
[TABLE]
Since (q+r+1,q−r+1)=1, pe divides one of these factors. If pe divides q−r+1 then q2+1>p2e, a contradiction.
Thus pe divides q+r+1, and we get a similar contradiction if pe⩽(q+r+1)/3 since 3(q−r+1)>q+r+1 for all f⩾3.
It follows (since q+r+1 is odd) that pe=q+r+1, and n=q−r+1. Now all subgroups of T=Sz(q) of order q+r+1 are cyclic [23, Section 4]
and hence, by Lemma 5.5, we must have e=1 and AGL(1,p)≅Pρ⩽NAut(T)(soc(Pρ)). By [23], ∣NAut(T)(Cp)∣ divides 4fp, and hence p−1=q+r divides 4f. The only solution is f=3, T=Sz(8), in which case we do have AGL(1,13)⩽Aut(T) [6, p.28].
∎
Lemma 5.8**.**
If T=Ree(q) or PSU(3,q) with q⩾3 as in row 4 or 5 of Table 2, then G=Aut(T) and row 4 of
Table 3 holds.
Proof.
Here
[TABLE]
and (q+1,q2−q+1)=(q+1,3). If pe divides q+1, then n⩾q2−q+1⩾q+1⩾pe, which is a contradiction.
Thus either (a) p=3 and pe divides q2−q+1, or (b) p=3 and 3 divides (q+1,q2−q+1).
Consider case (a). Here pe divides q6−1. Let c be minimal such that p divides qc−1. Then c divides 6. If c=1 then p divides (q2−q+1,q−1)=1, a contradiction. If c=2 then p divides (q2−q+1,q2−1)=(q−2,3), which contradicts the fact that p=3. If c=3 then p divides (q3−1,q3+1)=(q+1,2), so p=2, but then p divides q−1, contradicting c=3. Thus c=6 and p is a primitive prime divisor of q6−1. Suppose that T=PSU(3,q). Now a Sylow p-subgroup of GU(3,q) is contained in a Singer cycle subgroup of GL(3,q2), and is therefore cyclic. Suppose now that T=Ree(q). The p-part of ∣T∣ is equal to the p-part of q2−q+1. Since Ree(q) contains cyclic subgroups of order q2−q+1 (see [14]), the Sylow p-subgroups of Ree(q) are cyclic. Thus in both cases the Sylow p-subgroups of T are cyclic, and hence e=1 and AGL(1,p)≅Pρ⩽NAut(T)(soc(Pρ)). This means that p−1 divides ∣NAut(T)(soc(Pρ))/CAut(T)(soc(Pρ))∣=6f (see [2, Table 8.5] if T=PSU(3,q) and [14, Theorem C] if T=Ree(q)). Since np=q3+1 we have q+1⩽n<p⩽6f+1. The only values of q=q0f satisfying q<6f are q∈{4,5,8,16}. The case q=4 gives row 4 of Table 3. For q=5, 8, 16 we have that p−1 divides 6f=6, 18, 24, and p divides q2−q+1=21, 57, 241 respectively. Since p=3 (and 241 is prime) this implies that (q,p)=(5,7) or (8,19). These give a contradiction since in both cases n=(q3+1)/p>p.
Now consider case (b) where p=3 divides (q+1,q2−q+1). Thus q≡2(mod3), and since q>2 we have q⩾5. Also, since 32e>q3+1⩾126, we have e⩾3. Now modulo 9, q≡2,5 or 8, and in each of these cases q2−q+1≡3(mod9). Therefore 9 does not divide q2−q+1 and hence 3e−1⩾9 divides q+1. Thus q+1⩾9 and
[TABLE]
and hence q=8. Then pe=27, n=19. However Aut(T) does not contain a 2-transitive group of degree 27 since its order is not divisible by 13.
∎
Lemma 5.9**.**
If T=PSL(d,q) as in row 6 of Table 2, then one of rows 5–15 of Table 3 holds.
Proof.
In this case m=(qd−1)/(q−1). Recall from (9) that k divides m and m<k2. So k⩽mp<k2, and pe and p both divide qd−1. Let c be the smallest positive integer such that p divides qc−1 (note that c=1 if p=2). Then c divides d by Lemma 2.9; let ℓ=d/c. By Lemma 5.5 we have Cpe=soc(Pρ)⩽T=PSL(d,q). By [15, Lemma 5.5.2] we have
[TABLE]
Claim 1. If p=2 then d is even. If p=2 and d is odd then by Lemma 2.9 (1),
[TABLE]
Since k⩽m2 we have k=2e⩽1, a contradiction. Therefore d is even whenever p=2.
Let r,s∈Z such that pr=(qc−1)p and ps=ℓp=(d/c)p. Note that r⩾1 since p divides qc−1. Also, for p=2, let t∈Z such that 2t=(q+1)2. Then by Lemma 2.9 we have
[TABLE]
Now
[TABLE]
and since m<k2=p2e it follows that
[TABLE]
Claim 2. If p=2 then c>1. If p=2 and c=1 then, mp=ps, and since k⩽mp we have e⩽s using (12), and thus r(ℓ−1)<2s using (13). So ps=ℓp⩽ℓ<(2s+r)/r, which implies that s=0. Hence
r<2s/(ps−1)⩽1 since 2s⩽3s−1⩽ps−1 for all s⩾1 and p⩾3. Thus r<1, a contradiction.
Therefore c>1 whenever p=2.
By Claims 1 and 2 we only need to consider the cases where p=2 and d is even, and where p=2 and c>1.
Case 1. Assume that p=2. Recall that in this case c=1, so d=ℓ.
Since 3⩽k=2e we must have e⩾2, and using (12) and k⩽mp we have e⩽s+t−1.
Suppose first that d=2. Then by (11) we have e=2 and so k=4. Thus
[TABLE]
so q=7 or 11. If q=7 then k=4, n=2, and T=PSL(2,7), as in row 6 of Table 3. If q=11 then k=4, n=3, and T=PSL(2,11), as in row 8 of Table 3.
Suppose now that d⩾4 (recall that d is even). Then, using Lemma 2.9 and (12), we can write
[TABLE]
Since m<k2=22e⩽22(s+t−1), this gives us m<(q+1)2d2/4 and hence
[TABLE]
Hence d⩽5, and therefore d=4. So 2⩽e⩽4 by (11) and k∈{4,8,16}. From the fact that m<k2 and k divides m we get that k=8, n=5, and T=PSL(4,3), as in row 14 of Table 3.
Case 2. Finally, assume that p=2. Then c>1 by Claim 2, so mp=pr+s by (12). Since pe=k⩽mp we have e⩽r+s, and using (13), r(ℓ−1)<2(r+s). Now since r⩾1, we have 2(r+s)⩽2r(1+s). Hence ℓ−1<2(1+s), so that
[TABLE]
Since ps=ℓp⩽ℓ, we have ps<3+2s.
Therefore either s=0 or (p,s)=(3,1).
Subcase 2.1. Suppose that s=0. By Claim 2 and (12) we have k⩽mp=pr=(qc−1)p and since c>1, p∤q−1, so k⩽(qc−1)p⩽q−1qc−1. Now if ℓ⩾2, we have
[TABLE]
Hence k>qc, contradicting k⩽qc−1. Thus ℓ=1, and so d=c. Hence p is a primitive prime divisor of qd−1, and so the Sylow p-subgroups of T are cyclic (see Remark 2.10). Hence k=p. Now by (12) we have pr=mp<k2=p2. Thus r=1.
Since p is a primitive prime divisor of qd−1, soc(Pρ)≅Cp is contained in a Singer cycle subgroup of T. Now by [12, Kapitel II, 7.3], ∣NG(soc(Pρ))/CG(soc(Pρ))∣ divides df where q=q0f, q0 a prime, so we find that p−1=∣P/soc(P)∣ divides df.
In particular, p−1⩽df, so p⩽df+1. Thus
[TABLE]
The above restrictions give bounds on d and f. We first use the condition (df+1)2>2(d−1)f to determine possible values for d,f, then for each of these we find the odd primes p such that p−1 divides df and p2>2(d−1)f. We list the possibilities in Table 4.
In particular, p∈{3,5,7,11,13,17}. Note k=p and n<k. Then p=3 implies that n=2 and hence m=6. The only solution to m=(qd−1)/(q−1)=6 is given by d=2, f=1 and q=5. Hence T=PSL(2,5) as in row 5 of Table 3. If p=5, then n∈{2,3,4} and m=kn∈{10,15,20}. From Table 4 we have d∈{2,4}. If d=4, then the only solution is q=2, T=PSL(4,2) and n=3, as in row 13 of Table 3. For d=2, we have m=q+1, and hence q+1∈{10,15,20}. If q=19, then f=1 and p−1=4 does not divide df=2, a contradiction. If q=9, then n=2 and T=PSL(2,9) as in row 7 of Table 3.
For p∈{7,11,13,17}, using the bound p2>q0(d−1)f, the possible values for q0 and m informed by the possible values of p and the pair (d,f) are given in Table 5. If a case leads to a possibility, we give a reference to Table 3 , and otherwise an explanation of the contradiction that arises.
Subcase 2.2. Finally suppose that (p,s)=(3,1). Then ℓ<5 by (14); since ℓp=ℓ3=3s=3 we have ℓ=3 and d=3c. Now a Sylow 3-subgroup of GL(d,q)=GL(3c,q) is contained in the stabiliser of a direct sum decomposition of the underlying vector space into three c-dimensional subspaces. This stabiliser is isomorphic to GL(c,q)≀Sym(3), and since p=3 is a primitive prime divisor of qc−1 with (qc−1)3=3r, a Sylow 3-subgroup is isomorphic to C3r≀C3. Note that the elementary abelian subgroups of C3r≀C3 have order at most 33. Since c>1, we can choose an elementary abelian preimage of C3e≅soc(Pρ) in GL(d,q). Hence e⩽3 and k∈{3,9,27}. Since k2>m, if c⩾3, then m⩾29−1, but 29−1>272, a contradiction. Hence c=2 (since c>1) and d=3c=6. Now since p=3 divides qc−1, we have q=2 or q⩾5. The latter case contradicts m<k2, hence q=2. Hence T=PSL(6,2) and kn=63, which forces k=9, n=7, as in row 15 of Table 3.
∎
Theorem 5.10** (Theorem 1.8(3)).**
Suppose that 2⩽n<k and k=pe for some prime p. Let P be a 2-transitive affine group of degree k and let G=Sh(P,n). If n=pf for some integer f, then G is an affine group. Otherwise, Alt(kn)⩽G.
Proof.
If n=pf for an integer f, then Proposition 3.4 shows that G is affine. Thus we may assume n=pf for any f. Now Lemma 5.3 shows that G is 2-transitive, and since the degree of G is kn, which is not a prime power, G cannot be affine. Thus Burnside’s Theorem [4, Theorem 4.3] shows that G is almost simple. Lemma 5.4 shows that T:=soc(G) appears in Table 2. If T=Alt(kn) then by Lemmas 5.6, 5.7, 5.8 and 5.9, T, k and n must appear in Table 3. For each possible T and value of k and n, we obtain a contradiction using Magma. Hence T=Alt(kn) as required.
∎
6. Cascading shuffle groups
Throughout this section we assume that k=2e for some positive integer e and that n⩾2.
For each a∈[k]={0,1,…,2e−1}, write a in its base-2 expansion,
[TABLE]
with each ai∈{0,1}. For each s∈[e], let vs be the permutation on [k] defined by
[TABLE]
Clearly vs has order 2, and vr, vs commute for all r,s. Hence
[TABLE]
is an elementary abelian subgroup of Sym(k) of order 2e. We now extend these definitions and notation for arbitrary 2t⩽2e.
For each t=1,…,e, we can divide the cards in [2en] into 2t piles with 2e−tn cards in each pile.
We define Vt as in (16) with t,2t in place of e,k, namely, for s∈[t], we define vt,s on
a=∑r=0t−1ar2r∈[2t] by
[TABLE]
and then
[TABLE]
is elementary abelian of order 2t, and the corresponding shuffle group Gt is defined as
[TABLE]
Note that Gt⩽Sym(2en). Thus the e shuffle groups G1,…,Ge all act on the same set [2en]. Also G_{1}=\mathrm{Sh}\big{(}V_{1},2^{e-1}n\big{)}=\mathrm{Sh}\big{(}\mathrm{Sym}(2),2^{e-1}n\big{)}, and so G1 is known and is described by Theorem 1.1.
Let σt denote the shuffle in Gt defined as in (1) with 2t,2e−tn in place of k,n respectively. Let σ=σ1. We describe in Lemma 6.1 the relationship between σ and the shuffle
σt for an arbitrary t.
To describe the other generators for Gt as in Definition 2.1, we need to consider the analogue of the embedding ρ:Sym(k)→Sym(kn) defined in (2). Since this map is analogous to ρ, we denote it by ρt:Sym(2t)→Sym(2en) (so ρe=ρ), and define for each τ∈Sym(2t) the permutation ρt,τ∈Sym(2en) as follows (similarly to (2)).
[TABLE]
We show in Lemma 6.1 that the subgroup Vt is mapped by ρt to a subgroup of (Ve)ρ⩽Ge, where ρ=ρe.
Lemma 6.1**.**
Assume that k=2e for some positive integer e, and let t∈{1,…,e}. Then for
Gt the shuffle group as in (18) we have:
- (a)
The shuffle σt satisfies σt=σt;
2. (b)
for s∈[t], the image (vt,s)ρt=(ve−t+s)ρ, and so
(Vt)ρt=(⟨ve−t,…,ve−1⟩)ρ⩽(Ve)ρ;
3. (c)
for 1⩽s⩽e−1, ρvs−1σ=σρvs;
4. (d)
the elements xr:=(vr)ρ, for 0⩽r⩽e−1, are such that xrσ=xr+1 for r⩽e−2,
(Vt)ρt=⟨xe−t,…,xe−1⟩ for t⩽e, and ((Vt)ρt)σ>(Vt−1)ρt−1 for t>1.
Proof.
(a) By Lemma 2.2, σ fixes [math] and 2en−1, and iσ=2i(mod2en−1) for all other i\in\big{[}2^{e}n\big{]}. Likewise σt fixes [math] and 2en−1, and iσt=2ti(mod2en−1)=iσt for all other i\in\big{[}2^{e}n\big{]}. Therefore σt=σt.
(b) Let x∈[kn]. We may write x=an+b=a0(2e−tn)+b0 for unique a∈[2e],b∈[n],a0∈[2t],b0∈[2e−tn]. Note that
b0=b+b1n for (a unique) b1∈[2e−t], and a=a02e−t+b1. Write the base-2 expansions of a0 and b1 as
[TABLE]
and note that
[TABLE]
is the base-2 expansion of a=a02e−t+b1.
By (17), setting a0,r=a0,r if r=s, and a0,r=1−a0,r if r=s, we have
[TABLE]
and since s⩽t−1 and b1⩽2e−t, it follows from (15) that this expression is the image of a under the action of ve,e−t+s. Hence, by (19),
[TABLE]
We conclude that (vt,s)ρt=ρt,vt,s=ρve−t+s=(ve−t+s)ρ, and part (b) follows.
(c) Let 1⩽s⩽e−1, and let x=an+b∈[2en] with a∈[2e] and b∈[n]. We will show that the images of x under
ρvs−1σ and σρvs are equal. Write the base-2 expansion of a as
[TABLE]
and define
[TABLE]
By Lemma 2.2, σ fixes 2en−1, and iσ=2i(mod2en−1) for all other i\in\big{[}2^{e}n\big{]}.
We claim that, for x=2en−1,
[TABLE]
with ∑r=1e−1ar−12r+b1 the base-2 expansion of an element in [2e], and 2b−b1n+ae−1∈[n]. Note that 2an=(∑r=0e−1ar2r+1)n=ae−1+(∑r=1e−1ar−12r)n(mod2en−1), and hence equality holds in (20),
and since b1∈{0,1}, ∑r=1e−1ar−12r+b1 is the base-2 expansion of an element in [2e]. If b⩾n/2 then
0⩽2b−b1n+ae−1⩽2(n−1)−n+1=n−1; if b⩽(n−2)/2 then 0⩽2b−b1n+ae−1⩽(n−2)+1=n−1; and finally if b=(n−1)/2, then
(ae−1,b1) is either (1,1) or (0,0) and 2b−b1n+ae−1 is [math] or n−1 respectively. This proves the claim.
Since s⩾1, this implies that
[TABLE]
Also, provided that x(vs−1)ρ=2en−1, we may apply (15) and (20) to obtain
[TABLE]
On the other hand, if x(vs−1)ρ=2en−1, then x=(∑0⩽r⩽e−1,r=s−12r)n+(n−1), b=n−1,b1=1,ae−1=1, and by (21),
[TABLE]
It remains to consider actions on x=2en−1=(∑r=0e−12r)n+(n−1). Here we have
[TABLE]
while, using the fact that x(vs−1)ρ=2en−1, we have
[TABLE]
This completes the proof of part (c). Part (d) follows immediately from parts (b) and (c).
∎
By Lemma 6.1(b) we have the following subgroup chain in Ge:
[TABLE]
We now consider how the subgroups G1, …, Ge are related.
Lemma 6.2**.**
For 1⩽t⩽e we have
[TABLE]
*Moreover, Gt⩽G1, and if t<e then Gt normalises Gt+1.
*
Proof.
By definition, G=Sh(Vt,n)=⟨(Vt)ρt,σt⟩. By Lemma 6.1 part (a) we have σt=σt and by part (c) we have (Vt)ρ=⟨xe−t,…,xe−1⟩. This gives the first part of the lemma. For the next assertion, we may assume e⩾2.
By Lemma 6.1(d), for 1⩽i⩽t we have
[TABLE]
and clearly σt∈G1, so the inclusion Gt⩽G1 follows. Finally we check that Gt normalises Gt+1.
Since Gt+1 contains xe−t,…,xe−1, we only need to check that σt normalises Gt+1.
Now σt centralises σt+1 and, by Lemma 6.1(d),
[TABLE]
For 1⩽i⩽t, we use Lemma 6.1(d) to write xe−i=(xe−i−1)σ, and then we have
[TABLE]
which completes the proof.
∎
We now prove Theorem 1.9 which we restate for convenience. We use notation from Theorem 1.1.
Theorem 1.9**.**
Suppose that n is not a power of 2 and k=2e with e⩾2. The following hold:
- (1)
if (k,n)=(4,3) then G1=C26⋊Sym(5) and G2=C25⋊Sym(5);
2. (2)
if (k,n)=(4,6) then G1=G2 and G1≅C211⋊M12;
3. (3)
if (k,n)=(8,3) then G1=G2=G3 and G1≅C211⋊M12;
4. (4)
if k=4 and n⩾5 is odd then G1=C2≀Sym(2n) and G2=ker(sgn);
5. (5)
in all other cases, G1=G2=⋯=Ge and G1=ker(sgn)∩ker(sgn).
Proof.
The structure of G1=Sh(Sym(2),2e−1n) is given in Theorem 1.1. Note that since e⩾2, 2e−1n is even. Now, since n is not a power of 2, G1 appears in row two, four, six or seven of Table 1. First we deal with the exceptional cases.
The exceptional case in row six is when 2e−1n=6. Here 2e=4, 2en=12, and G1=Sh(Sym(2),6)≅26⋊Sym(5). By Magma, Sh(V4,3) has index two in G1, and G2≅25⋊Sym(5).
The exceptional case in row seven is when 2e−1n=12, so (2e,n)=(4,6) or (8,3), and G1=211⋊M12, where M12 is one of Mathieu’s nonabelian simple groups.
By Lemma 6.2, G2 is a normal subgroup of G1. Since σ2 does not centralise ⟨xe−2,xe−1⟩, G2 is nonabelian, and hence G2=G1. Similarly, if e=3, then G3 is a nonabelian normal subgroup of G2=G1, so G1=G2=G3.
Suppose now that 2e−1n=6 and 2e−1n≡2(mod4) (as in row four of Table 1). Then 2e=4, n is odd, and G1=B2n=22n.Sym(2n)≅C2≀Sym(2n). By Lemma 6.2, G2 is a normal subgroup of G1, and as above, G2 is nonabelian. The proper nonabelian normal subgroups of G1 all contain the derived subgroup of G1, namely [G1,G1]∼22n−1.Alt(2n). Hence G2 contains [G1,G1] and so has index one, two or four in G1. By Corollary 2.4(3) we have that G2=Sh(V4,n)⩽Alt(4n), and so G2 is contained in ker(sgn) which has index two in B2n. Note that the unique index four subgroup of G1 projects to Alt(2n) in the quotient G1/O2(G1)≅Sym(2n). To determine the index of G2, we consult [8, Table III] - and note that, unfortunately, 2n corresponds to the the integer n in [8, Table III] (because they are investigating Sh(Sym(2),n) and we are investigating G1=Sh(Sym(2),2n)). We see that σ (there denoted I) projects to an odd permutation in G1/O2(G1) (witnessed in the table in column 2(mod4) by sgn(g)=−1 for g=I) and that the element there denoted O which is equal to xe−1σ projects to an even permutation (witnessed in the table in column 2(mod4) by sgn(g)=1 for g=O). Thus sgn(xe−1)=−1 and so xe−1 also projects to an odd permutation in G1/O2(G1), and since xe−1 lies in G2, we have that G2 projects to Sym(2n). Hence G2/O2(G2)≅(G2O2(G1))/O2(G1)≅Sym(2n) and so G2 has index exactly two in G1 and we have G2=ker(sgn).
Now consider the case that 2e−1n≡0(mod4) and n is not a power of two (as in row two of Table 1). From Table 1 we have G1=22e−1n−1.Alt(2e−1n). Since 2e−1n is even, we see that G1 has exactly two proper non-trivial normal subgroups, both are abelian and have order 2 and 22e−1n−1 respectively. By Lemma 6.2, G2 is a normal subgroup of G1. Since G2 is nonabelian, we have G2=G1. If e⩾3, then by Lemma 6.2, G3⩽G1, and since G1=G2, we have G3⩽G2. Then, again by Lemma 6.2, G3 is a normal subgroup of G2=G1, and since G3 is nonabelian, we conclude that G3=G2=G1. Continuing in this fashion, we have G1=G2=⋯=Ge. This completes the proof of the theorem.
∎
We are now ready to prove Corollary 1.10 which we also restate for convenience.
Corollary 1.10**.**
Suppose that n is not a power of 2 and that k=2e for some e⩾2. Then Sh(Sym(k),n) is the alternating group or the symmetric group if n is even or odd, respectively.
Proof.
Set G=Sh(Sym(k),n) and Ge=Sh(V2e,n). Note that the restrictions on e and n imply that 2en⩾12. The structure of Ge is given by Theorem 1.9. For (k,n)=(4,3), (4,6) or (8,3), we verify that the result holds using Magma. Thus we may assume one of case (4) or (5) of Theorem 1.9 holds. In particular, we have that Ge contains the derived subgroup L=ker(sgn)∩ker(sgn)=22e−1n−1.Alt(2e−1n) of G1. Note that L is a transitive imprimitive subgroup of Sym(2en) preserving a partition with 2e−1n parts of size 2.
Suppose that Alt(2en)G=G and let M be a maximal subgroup of Alt(2en)G containing G. Since M contains G, we have that M contains L. Now L contains elements of cycle type 52 and so, since we have dealt with the case 2en=12 above, applying [24, Theorem 13.10] with q=2 and p=5 yields that M is imprimitive. On the other hand, Theorem 1.6 implies that G is primitive, a contradiction. Hence Alt(2en)G=G and so G contains Alt(2en). Since e⩾2, k≡0(mod4), so Corollary 2.4 shows that G⩽Alt(2en) if and only if n is even.
∎