The rational cuspidal divisor class group of X0(N)
Hwajong Yoo
College of Liberal Studies and Research Institute of Mathematics, Seoul National University, Seoul 08826, South Korea
[email protected]
Abstract.
For any positive integer N, we completely determine the structure of the rational cuspidal divisor class group of X0(N), which is conjecturally equal to the rational torsion subgroup of J0(N).
More specifically, for a given prime ℓ, we construct a rational cuspidal divisor Zℓ(d) for any non-trivial divisor d of N. Also, we compute the order of the linear equivalence class of Zℓ(d) and show that the ℓ-primary subgroup of the rational cuspidal divisor class group of X0(N) is isomorphic to the direct sum of the cyclic subgroups generated by the linear equivalence classes of Zℓ(d).
Key words and phrases:
Rational torsion subgroup, Rational cuspidal subgroup, Rational cuspidal divisor class group
2020 Mathematics Subject Classification:
Primary 11G16, 11G18, 14G05
Contents
- 1 Introduction
- 2 The cusps of X0(N)
- 3 The order of a rational cuspidal divisor
- 4 Motivational Examples
- 5 Strategy for the computation
- 6 The structure of C(N)
1. Introduction
Let N be a positive integer and let Γ0(N) be the congruence subgroup of SL2(Z) consisting of matrices that are upper-triangular modulo N. The complete modular curve X0(N)C is the union of the affine modular curve Y0(N)C=Γ0(N)\\cmcalH and a finite set of cusps, where \cmcalH is the complex upper half plane and Γ0(N) acts on \cmcalH by linear fractional transformation. The curve X0(N)C has a canonical nonsingular projective model X0(N) defined over Q (cf. [27, Ch. 6]), and in this model the set of cusps is invariant under the action of Gal(Q/Q), the absolute Galois group of Q.
Let J0(N):=Pic0(X0(N)) be the Jacobian variety of X0(N) and J0(N)(Q)tors its rational torsion subgroup. We would like to understand the group J0(N)(Q)tors for any positive integer N, but there is no systematic way for doing it yet. On the other hand, if N is a prime, Mazur proved the following, which was known as Ogg’s conjecture [15, Th. 1].
Theorem 1.1** (Mazur).**
Let N≥5 be a prime number, and let n=numerator(12N−1).
The rational torsion subgroup J0(N)(Q)tors is a cyclic group of order n, generated by the linear equivalence class of the difference of the two cusps (0)−(∞).
Let CN be the cuspidal subgroup of J0(N), which is defined as a subgroup of J0(N)(Q) generated by the linear equivalence classes of the differences of cusps. By the theorem of Manin [14, Cor. 3.6] and Drinfeld [7], we have CN(Q)⊆J0(N)(Q)tors, where CN(Q) is the group of the rational points on CN, called the rational cuspidal subgroup of J0(N). As a natural generalization of Mazur’s theorem, we expect the following.
Conjecture 1.2** (Generalized Ogg’s conjecture).**
For any positive integer N, we have
[TABLE]
Let Divcusp0(X0(N)) be the group of the degree [math] cuspidal divisors111By a cuspidal divisor, we mean a divisor on X0(N) supported only on cusps. on X0(N).
By definition, there is an exact sequence:
[TABLE]
where UN is the group of the divisors of modular units222By a modular unit on X0(N), we mean a meromorphic function on X0(N)C which does not have zeros and poles on Y0(N)C..
In fact, Gal(Q/Q) acts naturally on the objects in the exact sequence and (1.1) is an exact sequence of Gal(Q/Q)-modules. Taking Galois cohomology we get
[TABLE]
where
[TABLE]
is the group of the degree [math] rational cuspidal divisors333By a rational cuspidal divisor, we mean a cuspidal divisor fixed by the action of Gal(Q/Q). on X0(N).
The image of π is called the rational cuspidal divisor class group of X0(N), denoted by C(N). In other words, C(N) is a subgroup of J0(N)(Q) generated by the linear equivalence classes of the degree [math] rational cuspidal divisors on X0(N). Ken Ribet asked whether the map π is surjective, or more generally H1(Gal(Q/Q),UN)=0.444For a discussion of a similar problem, see [22] and [31, Th. 4.13].
Motivated by his question and the result of Toshikazu Takagi [31, Th. 1.1], we propose the following.
Conjecture 1.3**.**
For any positive integer N, we have
[TABLE]
Thus, it is worth understanding the structure of the group C(N) for any positive integer N. Although the group C(N) is a very explicit object, not much is known about
its precise structure due to a lack of efficient tools. To the author’s best knowledge, the structure of C(N) for a composite integer N (which is large enough) has been computed for the following cases:
- (1)
N is the product of two distinct primes by Chua and Ling [3].
2. (2)
N is a power of an odd prime p by Lorenzini [13] and Ling [12].
3. (3)
N is a power of 2 by Rouse and Webb [26, Th. 10].
4. (4)
N is squarefree by Takagi [30, Th. 6.1].555In fact, Takagi computed the precise order of the group C(N), but he did not determine the structure of the 2-primary subgroup of C(N), which is a motivation of this paper.
Remark 1.4*.*
There are some partial results for Conjectures 1.2 and 1.3, e.g. [1, 12, 13, 19, 21, 23, 24, 25, 32, 35].
For a thorough discussion and a new result of the conjectures above, see [37].
1.1. Main result
In this paper, we completely determine the structure of the ℓ-primary subgroup of the group C(N) for any positive integer N and any prime ℓ. Throughout this paper, we denote by A[ℓ∞] the ℓ-primary subgroup of a finite abelian group A.
For any non-trivial divisor666By a non-trivial divisor of N, we mean a positive divisor of N different from 1. d of N, we try to find a degree [math] rational cuspidal divisor Dd such that
[TABLE]
where \cmcalDN0 is the set of all non-trivial divisors of N and Dd denotes the linear equivalence class of Dd.
Although it is possible when N is a prime power,
it seems already difficult if N is the product of two primes. Nonetheless, we almost solve this problem by leaving the “squarefree part” aside. Let \cmcalDNsf be the set of all non-trivial squarefree divisors of N and let \cmcalDNnsf be the set of all non-squarefree divisors of N.
Theorem 1.5**.**
Let N be a positive integer.
For any non-trivial divisor d of N, there is a degree [math] rational cuspidal divisor Z(d) such that
[TABLE]
Also, the order of Z(d) is n(N,d), which is defined in Section 1.3.
Let C(N)sf:=⟨Z(d):d∈\cmcalDNsf⟩ be the “squarefree part” of the group C(N).
Since it seems difficult to directly find the decomposition of C(N)sf into cyclic groups, we deal with the decomposition of its ℓ-primary subgroup instead.
Theorem 1.6**.**
Let N be a positive integer and let ℓ be any given prime.
For any non-trivial squarefree divisor d of N, there is a degree [math] rational cuspidal divisor Y2(d) such that
[TABLE]
Also, the order of Y2(d) is N(N,d), which is defined in Section 1.3.
Thus, by Theorems 1.5 and 1.6, we easily have the following.
Theorem 1.7**.**
Let N be a positive integer and let ℓ be any given prime. For any d∈\cmcalDN0, there is a rational cuspidal divisor777
Let a(d) and b(d) be the prime-to-ℓ parts of n(N,d) and N(N,d), respectively. Then we define Zℓ(d):=a(d)⋅Z(d) if d is not squarefree, and Zℓ(d)=b(d)⋅Y2(d) otherwise. Zℓ(d) such that
[TABLE]
As an application, we can determine the structure of the cuspidal group CN when N=4M or 8M for M odd squarefree because two groups C(N) and CN are equal.
1.2. Construction of the divisors Z(d) and Y2(d)
Before describing our method for computing the group C(N), we review previous methods and point out some difficulties in their generalization.
For a divisor d of N, there is a rational cuspidal divisor (Pd) on X0(N) whose degree is equal to φ(z), where z=gcd(d,N/d) and φ(n) is the Euler’s totient function. By Lemma 2.19, the group Divcusp0(X0(N))(Q) is generated by
[TABLE]
A starting point to understand the group C(N) is to compute the order888By the order of a degree [math] rational cuspidal divisor on X0(N), we mean the order of its linear equivalence class in J0(N), which is always finite by the theorem of Manin and Drinfeld. of Cd.
This can be easily done by Ligozat’s method if N is either a prime power [12] or the product of two primes [3]. However, as you can see in [11, Th. 3.2.16], the formula for the order of CN looks very complicated in general. So we need a better understanding of the computation of the order of Cd for any d∈\cmcalDN0, which is done in Section 3.
After that, to determine the structure of C(N), it suffices to find all relations999We say that there is a relation among rational cuspidal divisors Cd if there are integers ad such that ∑ad⋅Cd=0∈J0(N) and ad⋅Cd=0 for some d. among Cd.
As noticed on [12, pg. 31], it seems difficult even in a simple case where N is a power of 2, and the main problem is that there is essentially only one method for finding relations among Cd, which is just a rephrasing of the definition:
[TABLE]
Although the order of X can be computed by Ligozat’s method in principle once ad are given, it is very hard to find such integers ad. Thus, it seems necessary to develop a new method for computing the group C(N).
From now on, we illustrate our strategy for constructing the divisors Z(d) and Y2(d). Before proceeding, we fix some notations, which will be used throughout the whole paper.
Notation 1.8*.*
For k=1 or 2, let \cmcalSk(N)Q be the Q-vector space of dimension σ0(N), indexed by the divisors of N, and let \cmcalSk(N) be the Z-lattice of \cmcalSk(N)Q consisting of integral vectors.
In other words,
[TABLE]
where e(N)d is the unit vector in \cmcalSk(N) whose d-th entry is 1 and all other entries are zero. Also, let
[TABLE]
Let Divcusp(X0(N))(Q) be the group of the rational cuspidal divisors on X0(N). Then by Lemma 2.19, we have a tautological isomorphism as abelian groups101010The restriction of ΦN to Divcusp0(X0(N))(Q) induces an isomorphism with \cmcalS2(N)0.
[TABLE]
sending (Pd) to e(N)d. Let Υ(N)=(Υ(N)δd)∈Mn×n(Z) be a square matrix of size n=σ0(N) (indexed by the divisors of N), defined in Section 3.2. We regard this matrix as a linear map from \cmcalS2(N)Q to \cmcalS1(N)Q.
We use a capital Roman letter for a rational cuspidal divisor on X0(N) and the corresponding capital bold Roman letter for its image in \cmcalS2(N) by ΦN.
For example, Dd and Dd, Bp(r,f) and Bp(r,f), Ap(r,f) and Ap(r,f), Z(d) and Z(d), Yi(d) and Yi(d), respectively.
Step 1: First, we elaborate Ligozat’s method and provide a simple algorithm for computing the order of a degree [math] rational cuspidal divisor as follows:
For a rational cuspidal divisor
[TABLE]
we compute an integral vector V(C)=∑δ∣NV(C)δ⋅e(N)δ∈\cmcalS1(N) defined as
[TABLE]
Let GCD(C) be the greatest common divisor of the entries of V(C) and let
[TABLE]
Although this computation is very easy, the vector V(C) plays a crucial role throughout the whole paper. For each prime p, let
[TABLE]
where the sum runs over the divisors of N whose p-adic valuations are odd. Let
[TABLE]
Finally, let κ(N)=N∏p∣N(p−p−1).
Then the order of C is equal to
[TABLE]
As an application, we simplify Ligozat’s formula and compute the order of Cd for any non-trivial divisor d of N. More specifically, we prove that
[TABLE]
where g(N,d) and h(N,d) are defined in Section 3.6. Note that in most cases we have g(N,d)=h(N,d)=1, and so the order of Cd is 24κ(N).
Step 2: Since finding relations among Cd is quite difficult, we
try to find a criterion for proving “linear independence” among rational cuspidal divisors instead. As a result, we have the following.
Theorem 1.9**.**
Let Ci∈Divcusp0(X0(N))(Q) for all 1≤i≤k. Suppose that there is a divisor δ of N such that
[TABLE]
If h(Ck)=1, then we have ⟨Ci:1≤i≤k−1⟩∩⟨Ck⟩=0, or equivalently
[TABLE]
Applying this criterion successively, we can easily deduce the following:
For rational cuspidal divisors Dd on X0(N), we consider a square matrix
[TABLE]
indexed by the non-trivial divisors of N. If M is lower-unipotent111111A square matrix is lower-unipotent if it is lower-triangular and all its diagonal entries are 1. (with respect to suitable orderings on \cmcalDN0), and h(Dd)=1 for all but one that corresponds to the first row, then we have
[TABLE]
Additionally, if the divisors Dd generate Divcusp0(X0(N))(Q), or equivalently
[TABLE]
then we easily have
[TABLE]
Although the required assumptions are pretty strong, we can verify them in many cases, and this strategy is quite useful in our computation.
Step 3: We first apply our strategy when N=pr is a prime power.
To do so, we have to find a rational cuspidal divisor Dd such that most of the entries of V(Dd) are zeros, which can be constructed from lower levels (cf. Proposition 5.4). Thus, we first find “nice” vectors Bp(r,f)∈\cmcalS1(N) for any 1≤f≤r so that the matrix
[TABLE]
is lower-unipotent with respect to suitable orderings on {1,2,…,r}, and then compute Υ(pr)−1×Bp(r,f).
By removing the denominators of the entries, we obtain “nice” vectors Bp(r,f)∈\cmcalS2(N)0 such that
Υ(pr)×Bp(r,f) is a scalar multiple of Bp(r,f). Furthermore, we prove that the vectors Bp(r,f) (integrally) generate \cmcalS2(pr)0. If p is odd, then we have h(Bp(r,f))=1 for any f≥2, and so
[TABLE]
However, if p=2, then the arguments above break down since h(B2(r,f))=2 for some f≥2. Nonetheless, we find another one B2(r,f) and show that
[TABLE]
(Our proof relies on two facts: one is that the genus of X0(16) is zero, and the other is that the group C(2r) is a 2-group.)
Step 4: We then apply our strategy for any positive integer N.
For simplicity, let N=Mpr with gcd(M,p)=1. (Here, p denotes a prime as above.)
As already mentioned, we need a rational cuspidal divisor Dd such that most of the entries of V(Dd) are zeros, which can be constructed using tensor product.121212By the Chinese remainder theorem, we have {\cmcal{S}}_{2}(N)_{\mathbf{Q}}\simeq{\cmcal{S}}_{2}(M)_{\mathbf{Q}}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\cmcal{S}}_{2}(p^{r})_{\mathbf{Q}} (Remark 2.20). In general, the dimension of \cmcalS2(N)0 is larger than the product of the dimensions of \cmcalS2(M)0 and \cmcalS2(pr)0, and so we need more vectors, which are not of degree [math]. Motivated by the discussion in Section 4.3.1 (Remark 4.2), we construct vectors Ap(r,0) and Ap(r,1) in \cmcalS2(pr). Note that Ap(r,0) is constructed when we regard a divisor C in level M as one in level Mpr, and Ap(r,1) is obtained by applying the degeneracy map from level M to level Mpr, which may be regarded as a vector (in level pr) “coming from level 1”.
By letting Ap(r,f):=Bp(r,f) for any 2≤f≤r, we can prove that the vectors Ap(r,f) (integrally) generate \cmcalS2(pr).
Using these vectors, we now define a vector Z1(d) for any non-trivial divisor d of N as follows.
Definition 1.10**.**
Let N=∏i=1tpiri and d=∏i=1tpifi. Then we define a vector Z1(d)∈\cmcalS2(N)0 as
[TABLE]
where m is the smallest positive integer such that fm=1.
If N is odd, we can prove Theorem 1.5 as in Step 3. On the other hand, if N is divisible by 32, there are some problematic vectors that we cannot apply our strategy as in the case of level 2r. Using the vectors B2(r,f), we then define a vector Z(d) as follows:
[TABLE]
where \cmcalTu:=∅ if either u=0 or ru≤4, and otherwise
[TABLE]
(Here, u denotes the index such that pu=2.)
Unfortunately, we cannot prove Theorem 1.5 directly when N is divisible by 32. So we prove a partial result first (Theorem 6.23) and finish the proof in Section 6.7.
Remark 1.11*.*
In Section 4.5, we fix an ordering of the prime divisors of N (using Assumption 1.14 below) and define Z(d).
On the other hand, here and in Theorem 1.5, we do not choose a specific ordering of the prime divisors of N in the definition of Z(d).
This does not cause any problem because
- (1)
for any d∈\cmcalDNnsf, the definition of Z(d) does not depend on the ordering of the prime divisors of N, and
2. (2)
the squarefree part C(N)sf does not depend on the ordering of the prime divisors of N (Remark 6.32).
Step 5: For any given prime ℓ, we try to understand the group C(N)sf[ℓ∞]. Let N=p1r1p2r2 be the product of two prime powers. Then we construct the following vector in \cmcalS2(N), which is not defined by tensors131313For its definition, see Section 3.4.:
[TABLE]
where γi=piri−1(pi+1). Using these vectors, we define a vector Y0(d) as follows.
Definition 1.12**.**
Let N=∏i=1tpiri be a positive integer. For a given prime ℓ,
by appropriately ordering the prime divisors of N, we make Assumption 1.14 below. Then for a non-trivial squarefree divisor d=∏i=1tpifi of N, we define a vector Y0(d) in \cmcalS2(N)0 as
[TABLE]
where m is as above and n is the smallest integer such that n>m and fn=0.
By its construction (and Assumption 1.14), it is not difficult to show that
[TABLE]
Also, the matrix M0:=(V(Y0(d))δ)dδ is lower-triangular (with respect to suitable orderings on \cmcalDNsf). Moreover, if N is odd, then the diagonal entries of M0 are ℓ-adic units. Thus, by an ℓ-adic variant of Theorem 1.9, we prove that
[TABLE]
Step 6: However, some of the previous arguments break down if N is even. As in the case of level 2r, we know exactly where the problems occur. So by replacing problematic elements by new ones (Remark 4.11), we construct a rational cuspidal divisor Y1(d) and show that M0:=(V(Y1(d))δ)dδ is lower-triangular and all its diagonal entries are ℓ-adic units. In this case, it is not obvious that
[TABLE]
which is proved in Section 6.5. Hence for an odd prime ℓ, we prove that
[TABLE]
But still, we have a problem if ℓ=2. Thus, we finally construct a rational cuspidal divisor Y2(d) and for any prime ℓ, we prove that
[TABLE]
Remark 1.13*.*
Using our criteria for linear independence, we may easily guess which divisors are linearly independent. However, they frequently fail to generate the whole group Divcusp0(X0(N))(Q).
The main achievement of this paper is that we actually succeed in finding such (ℓ-adic) generators that satisfy strong assumptions in our criteria. Thus, we obtain the decomposition of the ℓ-primary subgroup of C(N) for any positive integer N and any prime ℓ.
1.3. Notation and Convention
In order to avoid excessive repetition, we adhere to some conventions throughout the whole paper.
p, pi and ℓ : prime numbers (unless otherwise mentioned).
r, ri, f and fi : non-negative integers, which are exponents of primes
(unless otherwise mentioned).
N : a positive integer (unless otherwise mentioned).
rad(N) : the radical of N, the largest squarefree divisor of N, i.e., rad(N):=∏p∣Np.
valp(N) : the (normalized) p-adic valuation of N, i.e., N is divisible by pvalp(N) but not by pvalp(N)+1.
κ(N):=N∏p∣N(p−p−1)=rad(N)N∏p∣N(p2−1).
φ(N):=N∏p∣N(1−p−1)=rad(N)N∏p∣N(p−1).
\cmcalDN : the set of all (positive) divisors of N.
σ0(N):=#\cmcalDN : the number of all divisors of N.
\cmcalDN0:=\cmcalDN∖{1} : the set of all non-trivial divisors of N.
\cmcalDNnsf : the set of all non-squarefree divisors of N.
\cmcalDNsf:=\cmcalDN0∖\cmcalDNnsf : the set of all non-trivial squarefree divisors of N.
If we write N=∏i=1tpiri for some t≥1 and ri≥1, then we use the following.
Ω(t):={(f1,…,ft)∈Zt:0≤fi≤ri for all i and fi=0 for some i}.
Δ(t):={(f1,…,ft)∈Zt:0≤fi≤1 for all i and fi=0 for some i}.
□(t):={(f1,…,ft)∈Ω(t):fi≥2 for some i}=Ω(t)∖Δ(t).
pI:=∏i=1tpifi for any I=(f1,…,ft)∈Ω(t).141414By definition, \cmcalDN0={pI:I∈Ω(t)}, \cmcalDNsf={pI:I∈Δ(t)} and \cmcalDNnsf={pI:I∈□(t)}.
If either u=0 or ru≤4, then let \cmcalTu:=∅. Otherwise, let
[TABLE]
For an element I=(f1,…,ft)∈Δ(t), let
m(I) : the smallest positive integer m such that fm=1.
n(I) : the smallest integer n such that n>m(I) and fn=0.
k(I) : the smallest integer k such that k>n(I) and fk=0.
Here, we set n(I):=t+1 (resp. k(I):=t+1) if fi=1 for all i>m(I) (resp. i>n(I)).
For any integer 1≤k≤t, let
A(k):=(f1,…,ft) such that fi=0 for all i<k and fj=1 for all j≥k.
E(k):=(f1,…,ft) such that fk=0 and fi=1 for all i=k.
F(k):=(f1,…,ft) such that fk=1 and fi=0 for all i=k.
Also, for a given integer 1≤u≤t and any integer 1≤k≤t different from u, let
Eu(k):=(f1,…,ft) such that fk=fu=0 and fi=1 for all i=k,u.
Fu(k):=(f1,…,ft) such that fk=fu=1 and fi=0 for all i=k,u.
Now, we define some subsets of Δ(t). Let
[TABLE]
If u≤1 then we set \cmcalHu=\cmcalHu1:=∅, and for any 2≤u≤t, we set
\cmcalHu:={(f1,…,ft)∈Δ(t):n(I)=u and k(I)≤t}.
\cmcalHu1:={(f1,…,ft)∈Δ(t):n(I)=u and k(I)=t+1}.
Also, if u=0 then we set \cmcalFu=\cmcalFu1=∅, and for any 1≤u≤t, we set
[TABLE]
where
[TABLE]
Furthermore, for 0≤u≤t, we set
[TABLE]
The definitions of n(N,d) and N(N,d), which both depend on the ordering of the prime divisors of N, are a bit complicated. We first assume the following.
Assumption 1.14*.*
Let N=∏i=1tpiri be the prime factorization of N. For a given prime ℓ, by appropriately ordering the prime divisors of N, we assume that
[TABLE]
Let u be the smallest positive integer such that pu=2 if N is even, and u=0 otherwise. Also, let s=0 if ℓ is odd, and s=u if ℓ=2. We further assume that
[TABLE]
Next, we define the following.
Definition 1.15**.**
For a positive integer r, let
[TABLE]
where j=[2r+1−f]. Also, let
[TABLE]
Then, we define n(N,d) for any d∈\cmcalDN0 as follows.
Definition 1.16**.**
For any I=(f1,…,ft)∈Ω(t), let
[TABLE]
where m=m(I). Also, let \cmcalH(N,pI):=2 if one of the following holds, and \cmcalH(N,pI):=1 otherwise.
- (1)
I=A(1).
2. (2)
u≥1 and I=E(u).
3. (3)
u≥1, 3≤ru≤4, fu=3 and fi=1 for all i=u.
4. (4)
u≥1, ru≥5, fu=ru+1−gcd(2,ru) and fi=1 for all i=u.
Furthermore, let
[TABLE]
Lastly, we define N(N,d) for any d∈\cmcalDNsf as follows.
Definition 1.17**.**
For any I=(f1,…,ft)∈Δ(t) with m=m(I), n=n(I) and k=k(I), let
[TABLE]
where x=max(m,u) and y=max(1,3−s). Also, let
[TABLE]
Furthermore, let
[TABLE]
2. The cusps of X0(N)
In this section, we review the results about the cusps of X0(N), which are well-known to the experts. Although there is no new result in this section, we provide detailed proofs as elementary and self-contained as possible for the convenience of the readers.
In this section (except Section 2.5), we only consider the modular curves over C (not over Q) and regard them as compact Riemann surfaces (given by explicit charts). By explicit and concrete methods, we obtain various results on the cusps without further digression on algebraic theory.151515For more general discussion on the modular curves over C, see Chapters 2 and 3 of [6].
For the algebraic description of the cusps (using generalized elliptic curves), see [5], [4] or [2].
As usual, let Y0(N)C:=Γ0(N)\\cmcalH, where \cmcalH={z∈C:Im z>0} and Γ0(N)⊂SL2(Z) acts on \cmcalH by linear fractional transformations. Also, let X0(N)C denote the classical (analytic) modular curve, the “canonical” compactification of Y0(N)C. The cusps of X0(N) are the points added for the compactification, which can be naturally identified with the equivalence classes of P1(Q) modulo Γ0(N), i.e.,
[TABLE]
where Γ0(N) acts on P1(Q) by linear fractional transformation.
In Sections 2.2 and 2.3, we study the degeneracy maps and the Atkin–Lehner operators on modular curves. To investigate their properties, it is often useful to consider them as holomorphic maps161616They also have “moduli interpretations”, so there exist corresponding algebraic morphisms. But we do not discuss “moduli interpretations” here as they are not used. For such discussions, see [16, Sec. 13] and [21, Sec. 1]. between compact Riemann surfaces. More specifically, let A be a positive integer and B its (positive) divisor. Let
[TABLE]
be a matrix satisfying Γ:=γΓ0(A)γ−1⊂Γ0(B). Then since
[TABLE]
we have natural maps
[TABLE]
The composition of the two maps naturally extends to a holomorphic map from X0(A)C to X0(B)C, denoted by Fγ. Note that for any p,q∈Z with gcd(p,q)=1, we have
[TABLE]
where g=gcd(ap+bq,cp+dq), and therefore
[TABLE]
Note that g is a divisor of det(γ)=ad−bc by Lemma 2.13 below.
Remark 2.1*.*
The results about the Atkin–Lehner operators and the Hecke operators are not used in this paper, but we include them for the sake of the readers.
2.1. Representatives of the cusps
Let
[TABLE]
and we define an equivalence relation on (Z2)′ by
[TABLE]
We denote by [ab]N (or simply [ab] if there is no confusion) an equivalence class of (ab)∈(Z2)′. If we write the notation [ab], we always assume that a and b are relatively prime integers.
A cusp of X0(N) can be regarded as an equivalence class in (Z2)′/∼ (as in [28, Sec. 1.3]). Thus, we simply denote a cusp of X0(N) by [ab].
For a (positive) divisor d of N, we say that a cusp of X0(N) is of level d if it is equivalent to (xd) for some (xd)∈(Z2)′. Thus, a cusp of level d is written as [xd] for some integer x relatively prime to d.
Theorem 2.2**.**
For any (ab)∈(Z2)′, we have [ab]=[xd] for some integer x, where d=gcd(b,N). Also, for two divisors d and d′ of N, we have
[TABLE]
As a corollary, we have the following.
Corollary 2.3**.**
The set of the cusps of X0(N) can be written as
[TABLE]
Remark 2.4*.*
Since the map (Z/NZ)×→(Z/dZ)× is surjective for any divisor d of N, it is often useful to take the set of the cusps of X0(N) by
[TABLE]
To begin with, we show three types of equivalences of cusps.
Lemma 2.5**.**
We have
[TABLE]
[TABLE]
[TABLE]
Here, j can be any integer, k can be any integer satisfying gcd(a,b+kN)=1 and y can be any integer satisfying gcd(y,abN)=1.
Proof.
The first assertion easily follows because (10j1)∈Γ0(N). To prove the second assertion, we find v,w∈Z such that
av+b(b+kN)w=k, which is possible because gcd(a,b(b+kN))=1. Now, we consider the following matrix:
[TABLE]
where Γ(N) is the principal congruence subgroup of level N, which is defined as the kernel of the natural homomorphism SL2(Z)→SL2(Z/NZ) induced by the reduction modulo N.
By direct computation, we have γ(ab)=(ab+kN).171717More generally, Γ(N)(ab)=Γ(N)(a′b′) if and only if (ab)≡±(a′b′)(modN).
For the last one, we can find v,w∈Z such that γ=(yNvw)∈Γ0(N) because gcd(y,N)=1. Since wy=1+Nv≡1(modN), we have
[TABLE]
as desired. This completes the proof.
∎
Proof of Theorem 2.2.
Let d=gcd(b,N). Also, let b′=b/d and N′=N/d.
To prove the first assertion, we first suppose that gcd(b′,N)=1. Then we have
[TABLE]
Suppose next that gcd(b′,N)=1. Since gcd(b′,N′)=1, there is an integer181818This can be proved directly by taking k as the product of all prime divisors of ad not dividing b′, or by Dirichlet’s theorem on arithmetic progression. k such that gcd(b′+kN′,ad)=1.
Also, since gcd(b′+kN′,N′)=gcd(b′,N′)=1, we have gcd(b′+kN′,aN)=1.
Finally, since gcd(b+kN,a)=1, we have
[TABLE]
This completes the proof of the first assertion.
Next, we prove the second assertion. For simplicity, let z=gcd(d,N/d).
For any γ=(rNvuw)∈Γ0(N) and (ab)∈(Z2)′, we have gcd(Nva+wb,N)=gcd(wb,N)=gcd(b,N). Thus, any two equivalent cusps have the same level. Now, we claim that [xd]=[x′d] if and only if x≡x′(modz).
Suppose first that [xd]=[x′d], i.e., there is a matrix γ=(rNvuw)∈Γ0(N) such that γ(xd)=(x′d).
Then we have d=Nvx+dw and rw−Nuv=1. Thus, we have w≡1(modN′) and rw≡1(modN). This implies that r≡1(modN′). Since z is a divisor of N′, we also have r≡1(modz), and so x′=rx+du≡x(modz), as wanted.
Conversely, suppose that x=x′+nz for some n∈Z. Since gcd(x,d)=1, we have gcd(x2N′,d)=z, and so there are v,w∈Z such that vd+wx2N′=nz.
Note that x(1−wxN′)=x′+vd and gcd(xx′,d)=1. Thus, we have
[TABLE]
Since gcd(1−wxN′,xN′)=1, we have gcd(1−wxN′,xN)=1 and so
[TABLE]
This completes the proof.
∎
It is often useful to fix a choice of representatives of the cusps of X0(N) (which is neither important nor harmful). For a divisor d of N, we take a finite subset R(N,d) of Z satisfying the following conditions:
The number of elements of R(N,d) is φ(z), where z=gcd(d,N/d).
Any elements of R(N,d) are relatively prime to d.
The elements of R(N,d) are all distinct modulo z.
Then by Corollary 2.3, a cusp of X0(N) of level d can be written as [xd]N for some x∈R(N,d), and so the set of cusps of X0(N) is equal to
[TABLE]
There is another notation for the cusps of X0(N), which is useful to figure out the actions of the Atkin–Lehner operators and the Hecke operators. Let \cmcalS(N)Q:=Divcusp(X0(N))⊗ZQ, where Divcusp(X0(N)) is the group of cuspidal divisors of X0(N).
In other words,
[TABLE]
Lemma 2.6**.**
Let N=Mpr with gcd(M,p)=1. Then there is a canonical isomorphism
[TABLE]
sending a vector [xd]N to a tensor \left[\begin{smallmatrix}x\\
d^{\prime}\end{smallmatrix}\right]^{M}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\left[\begin{smallmatrix}x\\
p^{f}\end{smallmatrix}\right]^{p^{r}}, where d′=gcd(d,M) and pf=gcd(d,pr).
Proof.
The existence of the map ι is obvious (and canonical).
Suppose that ι([xd]N)=ι([yδ]N), i.e.,
[TABLE]
Then by definition, we have [xd′]M=[yδ′]M and [xpf]pr=[ypf′]pr. By Theorem 2.2, we have d′=δ′ and f=f′, and so d=δ. Also, we have x≡y(modz′) and x≡y(modpm(f)), where z′=gcd(d′,M/d′) and m(f)=min(f,r−f). Since gcd(M,p)=1, we have x≡y(modz), where z=z′⋅pm(f), which is equal to gcd(d,N/d). Thus, we have [xd]N=[yδ]N and hence the map ι is injective.
By the Chinese remainder theorem, the map ι is surjective. Indeed, if [ad′]M and [bpf]pr are cusps of X0(M) and X0(pr), respectively, then we can find an integer x such that x≡a(modd′) and x≡b(modpf) because gcd(d′,pf)=1. By its construction, we have gcd(x,d′pf)=1, and so
\iota(\left[\begin{smallmatrix}x\\
d^{\prime}p^{f}\end{smallmatrix}\right]^{N})=\left[\begin{smallmatrix}x\\
{d^{\prime}}\end{smallmatrix}\right]^{M}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\left[\begin{smallmatrix}x\\
p^{f}\end{smallmatrix}\right]^{p^{r}}, as claimed.
∎
From now on, we identify a cusp [xd′pf]N with \left[\begin{smallmatrix}x\\
{d^{\prime}}\end{smallmatrix}\right]^{M}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\left[\begin{smallmatrix}x\\
p^{f}\end{smallmatrix}\right]^{p^{r}} if we write N=Mpr with gcd(M,p)=1. (Here, we allow the case of r=0, in which case we have \left[\begin{smallmatrix}x\\
{d^{\prime}}\end{smallmatrix}\right]^{M}=\left[\begin{smallmatrix}x\\
{d^{\prime}}\end{smallmatrix}\right]^{M}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\left[\begin{smallmatrix}1\\
1\end{smallmatrix}\right]^{1}.)
2.2. Degeneracy maps
Let N=Mpr with gcd(M,p)=1. Also, let d′ be a divisor of M and 0≤f≤r+1. Let αp(N) and βp(N) be two degeneracy maps from X0(Np)C to X0(N)C defined by
[TABLE]
respectively. As already mentioned at the beginning of this section, if we take A=Np and B=N, then αp(N) (resp. βp(N)) is the map Fγ for γ=(1001) (resp. γ=(p001)).191919We can easily check that γΓ0(Np)γ−1⊂Γ0(N), or see Definition 2.11 below.
Thus, by (2.1) we have
[TABLE]
and
[TABLE]
where g=gcd(px,d′pf) is a divisor of p. Equivalently, we have the following.
Lemma 2.7**.**
We have
[TABLE]
and
[TABLE]
Also, we have the following.
Lemma 2.8**.**
The map αp(N) is ramified at a cusp [xd′pf]Np if and only if 0≤f≤r/2. Also, the map βp(N) is ramified at a cusp [xd′pf]Np if and only if r/2+1≤f≤r+1. In particular, the ramification indices of the maps αp(N) and βp(N) depend only on f, neither on x nor on d′. Furthermore, the ramification indices of αp(N) and βp(N) at a cusp are either 1 or p.
This is [36, Lem. 2.1]. Here, we provide a more direct proof using the theory of compact Riemann surfaces. Before proceeding, we compute the width of a cusp.
Definition 2.9**.**
The width of a cusp [ad] of level d of X0(N) is defined as the smallest positive integer n such that
(10n1)∈γ−1Γ0(N)γ,
where γ∈SL2(Z) satisfying γ(10)=(ad).
Lemma 2.10**.**
The width of a cusp [ad] of level d of X0(N) is d⋅gcd(d,N/d)N.
Proof.
Since gcd(a,d)=1, we can find integers b and c such that ac−bd=1. We take γ=(adbc)∈SL2(Z) so that γ(10)=(ad). Since
[TABLE]
we must have −d2n≡0(modN). Note that gcd(d2,N)=d⋅gcd(d,N/d), and hence the result follows.
∎
Proof of Lemma 2.8.
We recall the definition of the ramification index of a holomorphic function between compact Riemann surfaces. Let X and Y be compact Riemann surfaces, and
let ϕ:X→Y be a non-constant holomorphic function. Also,
let q and q′ denote the local parameters of τ∈X and ϕ(τ)∈Y, respectively.
If we write ϕ(q)=∑n≥1a(n)⋅(q′)n, then the ramification index of ϕ at a point τ is defined as the smallest positive integer n such that a(n)=0.
Now, we prove the lemma. Let ϕ=αp(N) or βp(N).
Also, let h (resp. h′) be the width of a cusp c (resp. ϕ(c)).
By the discussion in [6, Sec. 2.4], the local parameter of a cusp c (resp. ϕ(c)) of X0(N) is equal to e2πiτ/h (resp. e2πiτ/h′). Since
[TABLE]
the ramification index of αp(N) (resp. βp(N)) at a cusp c is hh′ (resp. hph′).202020The ramification index of αp(N) is computed on [6, pg. 67] or [29, pg. 538].
By Lemma 2.10, the width of a cusp c=[xd′pf]Np is d′⋅gcd(d′,M/d′)M×pr+1−f−m(f), where m(f)=min(f,r+1−f). So the result easily follows by Lemma 2.7.
∎
More generally, we can define various degeneracy maps as follows.
We use the same notation as at the beginning of the section.
For γ=(n001)∈M2(Z), we have
[TABLE]
Thus, if we take n as a divisor of A/B, then we have γΓ0(A)γ−1⊂Γ0(B), and hence the map Fγ:X0(A)C→X0(B)C is well-defined.
Definition 2.11**.**
We denote by π1(A,B) (resp. π2(A,B)) the map Fγ from X0(A) to X0(B) induced by γ=(1001) (resp. γ=(A/B001)). By definition, for any integer r≥1 we have
[TABLE]
Furthermore, if we take γ=(p001) when A/B=p2, then we obtain a map
[TABLE]
denoted by π12(N), which is equal to
[TABLE]
2.3. Atkin–Lehner operators
As above, let N=Mpr with gcd(M,p)=1. Also, let d′ be a divisor of M and 0≤f≤r. In this subsection, we further assume that p is a divisor of N, i.e., r≥1. Consider any matrix of the form:
[TABLE]
One of the properties of such matrices is that they normalize the group Γ0(N), so if we take A=B=N and γ=Wpr at the beginning of the section, then the map Fγ is well-defined as an endomorphism of X0(N)C, which is called the Atkin–Lehner operator with respect to p, denoted by wp. Note that this operator does not depend on the choice of a matrix Wpr because all such matrices are equivalent modulo Γ0(N), i.e.,
for any two such matrices W and W′ there is a matrix U∈Γ0(N) such that W=U×W′.
Lemma 2.12**.**
Let c=[xd′pf]N be a cusp of X0(N). Then we have
[TABLE]
Proof.
During the proof, we take x∈Z so that gcd(x,d′p)=1 (Remark 2.4). By (2.1), we have
[TABLE]
where u=m1xpr−f+m2d′, v=m3x(M/d′)+m4pf and g=gcd(upf,vd′pr).
Since the operator wp does not depend on the choice of a matrix Wpr, we may choose m3=d′ and so v=Mx+m4pf. Indeed, such a matrix Wpr exists because we can always find integers m1,m2 and m4 such that m1m4pr−m2d′M=1 by our assumption gcd(d′M,p)=1.
Since m4pf is relatively prime to M, we have gcd(v,M)=gcd(Mx+m4pf,M)=1. Also, since Mx is relatively prime to p, we have gcd(v,pr)=gcd(Mx+m4pf,pr)=1. Thus, we have gcd(v,N)=1.
Since g is a divisor of det(Wpr)=pr, and since p does not divide u, we have g=pf. Therefore we obtain gcd(u,v)=1. Since gcd(v,N)=1 as well, we have
[TABLE]
Note that u≡m1xpr−f(modd′) and v≡m4pf(modd′). Note also that u≡m2d′(modpm(f)) and v≡Mx(modpm(f)), where m(f)=min(f,r−f)=m(r−f).
Therefore by the determinant condition m1m4pr−m2d′M=1, we have
[TABLE]
This completes the proof.
∎
Lemma 2.13**.**
Let M=(acbd)∈M2(Z) and (xy)∈(Z2)′. Then the greatest common divisor of ax+by and cx+dy is a divisor of the determinant of M.
Proof.
Since gcd(x,y)=1, there are integers u and v such that ux+vy=1. So we have
[TABLE]
which proves the result.
∎
2.4. Hecke operators
The degeneracy maps induce maps between the divisor groups.
Using them, we define the p-th Hecke operator Tp by the composition
[TABLE]
We also denote by Tp the endomorphism of J0(N) induced by the restriction of the map Tp to Div0(X0(N)), which we also call the p-th Hecke operator.
Lemma 2.14**.**
As above, let N=Mpr with gcd(M,p)=1. Also, let d′ be a divisor of M and 0≤f≤r. For a prime p, let p∗∈Z be chosen so that pp∗≡1(modM) and gcd(p∗,p)=1.
- (1)
Suppose that r=0. Then we have
[TABLE]
2. (2)
Suppose that r≥1. Then we have
[TABLE]
Proof.
During the proof, we frequently use Lemma 2.7.
First, suppose that r=0. By direct computation, we have
[TABLE]
which implies the first assertion. (Note that \left[\begin{smallmatrix}x\\
d^{\prime}\end{smallmatrix}\right]^{M}=\left[\begin{smallmatrix}x\\
d^{\prime}\end{smallmatrix}\right]^{M}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\left[\begin{smallmatrix}1\\
1\end{smallmatrix}\right]^{1}.)
Next, suppose that r≥1. If 0≤f≤r/2, then we have
[TABLE]
and so we have the result. If r/2<f≤r−1, then we
have gcd(pf,pr+i−f)=pr+i−f for i=0 or 1. Thus, we have
[TABLE]
If f>(r+1)/2, then f−1≥r/2 and so gcd(pf−1,pr−(f−1))=pr−f+1. Thus, x+ipr−f (for 0≤i≤p−1) are all distinct modulo gcd(pf−1,pr−(f−1)), and hence
the result follows.
If f=(r+1)/2, then we have gcd(pf−1,pr−(f−1))=pf−1=pr−f, and therefore
[TABLE]
Lastly, since [ipr]pr=[1pr]pr for any 1≤i≤p−1, we have
[TABLE]
This completes the proof.
∎
Lemma 2.15**.**
As above, let N=Mpr with gcd(M,p)=1. For any cuspidal divisor D on X0(N), we have
[TABLE]
Proof.
Let c=\left[\begin{smallmatrix}x\\
{d^{\prime}}p^{f}\end{smallmatrix}\right]^{N}=\left[\begin{smallmatrix}x\\
d^{\prime}\end{smallmatrix}\right]^{M}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\left[\begin{smallmatrix}x\\
p^{f}\end{smallmatrix}\right]^{p^{r}} be a cusp of X0(N), where x is an integer relatively prime to N, d′ is a divisor of M and 0≤f≤r. We claim that the formula holds for D=c, which easily proves the result by linearity.
First, suppose that r=1. Then we have \beta_{p}(N/p)_{*}(c)=\left[\begin{smallmatrix}px^{\prime}\\
{d^{\prime}}\end{smallmatrix}\right]^{M}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\left[\begin{smallmatrix}1\\
1\end{smallmatrix}\right]^{1}, where x′=(p∗)fx.
Thus, by Lemma 2.7 we have
[TABLE]
By Lemmas 2.12 and 2.14, if f=0 then we have w_{p}(c)=\left[\begin{smallmatrix}x\\
d^{\prime}\end{smallmatrix}\right]^{M}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\left[\begin{smallmatrix}1\\
p\end{smallmatrix}\right]^{p} and
T_{p}(c)=p\cdot\left[\begin{smallmatrix}px\\
d^{\prime}\end{smallmatrix}\right]^{M}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\left[\begin{smallmatrix}1\\
1\end{smallmatrix}\right]^{p}.
Also, if f=1 then we have w_{p}(c)=\left[\begin{smallmatrix}x\\
d^{\prime}\end{smallmatrix}\right]^{M}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\left[\begin{smallmatrix}1\\
1\end{smallmatrix}\right]^{p} and
[TABLE]
Thus, the claim follows.
Next, suppose that r≥2. If f=0, then we have
[TABLE]
If 1≤f≤r, then \beta_{p}(N/p)_{*}(c)=\left[\begin{smallmatrix}x\\
d^{\prime}\end{smallmatrix}\right]^{M}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\left[\begin{smallmatrix}x\\
p^{f-1}\end{smallmatrix}\right]^{p^{r-1}}.
Thus, the result follows by Lemma 2.7 and 2.14.
∎
Remark 2.16*.*
In general, as an endomorphism on J0(N) we have
[TABLE]
(cf. [36, (2.7)]). The computation above verifies this formula for cuspidal divisors.
2.5. Rational cuspidal divisor group
In [28, Th. 1.3.1], Glenn Stevens computed the action of Gal(Q/Q) on the cusps of the modular curve XΓ for a congruence subgroup Γ containing Γ(N). As a corollary, we have the following.
Theorem 2.17**.**
A cusp [xd]N of level d is defined over Q(μz), where z=gcd(d,N/d) and the action of Gal(Q(μz)/Q) on the set of all cusps of level d is simply transitive.
Proof.
For simplicity, let G=Gal(Q(μN)/Q) and H=Gal(Q(μN)/Q(μz)). Also, let Xd be the set of all cusps of X0(N) of level d.
First, any cusps in Xd are defined over Q(μN) by Theorem 1.3.1(a) of loc. cit.
Next, for any k∈(Z/NZ)× let τk be an element in G sending ζN to ζNk, where ζN is a primitive N-th root of unity. By Theorem 1.3.1(b) of loc. cit., for any cusp [xd]∈Xd, we have τk([xd])=[xk∗d], where k∗∈Z is chosen so that kk∗≡1(modN) and gcd(k∗,x)=1. Since [xk∗d]=(3)[k∗xd] by Lemma 2.5, there is an action of G on the set Xd.
Then, for any cusp [x′d]∈Xd, we can find k∈Z such that x′k≡x(modd) and gcd(k,N)=1.
Since kk∗≡1(modN), we have k∗x≡kk∗x′≡x′(modd). Thus, we have
[xd]τk=[x′d] and hence the action of G on Xd is transitive.
Finally, note that [xd]τk=[xd] if and only if k∗x≡x(modz), or equivalently k≡1(modz). Since H is equal to {τk∈G:k∈(Z/NZ)× with k≡1(modz)}, the action of G on Xd factors through G/H≃Gal(Q(μz)/Q). Therefore a cusp [xd]∈Xd is defined over Q(μz) and the action of Gal(Q(μz)/Q) on Xd is simply transitive, as claimed.
∎
Definition 2.18**.**
Let (P(N)d) denote the divisor on X0(N) defined as the sum of all cusps of level d (each with multiplicity one), i.e.,
[TABLE]
where R(N,d) is defined in Section 2.1.
Also, let
[TABLE]
If there is no confusion, we simply write (Pd) and Cd.
Lemma 2.19**.**
We have
[TABLE]
and
[TABLE]
Proof.
By Theorem 2.17, (Pd) is a single orbit of Gal(Q/Q). Thus, the first assertion follows. Since the degree of (Pd) is equal to the number of the cusps of X0(N) of level d, which is φ(gcd(d,N/d)), we have
[TABLE]
Since
[TABLE]
the second assertion follows.
∎
Remark 2.20*.*
As already introduced in Notation 1.8, there is a tautological isomorphism
[TABLE]
sending (Pd) to e(N)d.
As above, let N=Mpr with gcd(M,p)=1. Also, let d′ be a divisor of M.
Using the identification \left[\begin{smallmatrix}x\\
{d^{\prime}}p^{f}\end{smallmatrix}\right]^{N}=\left[\begin{smallmatrix}x\\
d^{\prime}\end{smallmatrix}\right]^{M}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\left[\begin{smallmatrix}x\\
p^{f}\end{smallmatrix}\right]^{p^{r}} and an isomorphism
(Z/NZ)×≃(Z/MZ)××(Z/prZ)×, we easily have (P(N)_{{d^{\prime}}p^{f}})=(P(M)_{d^{\prime}})\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}(P(p^{r})_{p^{f}}). Thus, we also identify \cmcalS2(N)Q with {\cmcal{S}}_{2}(M)_{\mathbf{Q}}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\cmcal{S}}_{2}(p^{r})_{\mathbf{Q}} by letting {\bf e}(N)_{{d^{\prime}}p^{f}}={\bf e}(M)_{d^{\prime}}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\bf e}(p^{r})_{p^{f}}.
Similarly, if N=∏i=1tpiri is the prime factorization, then we identify \cmcalSk(N)Q with \operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}_{i=1}^{t}{\cmcal{S}}_{k}(p_{i}^{r_{i}})_{\mathbf{Q}} for both k=1 and k=2.
2.6. The actions of various operators on Divcusp(X0(N))(Q)
As above, let N=Mpr with gcd(M,p)=1. Also, let d′ be a divisor of M and
[TABLE]
Note that since p does not divide M we have
[TABLE]
where p∗ is an integer such that pp∗≡1(modM) and gcd(p∗,p)=1.
Lemma 2.21**.**
For any 0≤f≤r+1, we have
[TABLE]
and
[TABLE]
where
[TABLE]
and
[TABLE]
Proof.
For simplicity, let D=(P(Np)d′pf). Let k=min(f,r+1−f). Then we have
[TABLE]
Suppose first that f=r+1. Then S(k)={1}, and by (2.2) we have
[TABLE]
Suppose next that f≤r. Then we have
[TABLE]
If 0≤f≤r/2, then we have min(f,r−f)=k and (P(pr)pf)=∑y∈S(k)[ypf]pr. If f=r≥1, then #S(k)=p−1, and
we have [ypr]pr=[1pr]pr=(P(pr)pr) for any y∈S(k). If r/2<f≤r−1, then min(f,r−f)=k−1 and we have
[TABLE]
Thus, we obtain the result for αp(N)∗(D).
The proof for βp(N)∗ is similar and we leave the details to the readers.
∎
Lemma 2.22**.**
For any 0≤f≤r, we have
[TABLE]
and
[TABLE]
where
[TABLE]
and
[TABLE]
Proof.
The result follows by Lemmas 2.7 and 2.8.
More specifically, suppose that f=r=0. Then we have
[TABLE]
By (2.2), the result follows. If f=r≥1, then
[TABLE]
Also, if 0≤f≤r−1, then
[TABLE]
If 0≤f≤r/2, then min(f,r+1−f)=f and hence
[TABLE]
If r/2<f≤r, then min(f,r+1−f)=r+1−f=min(f,r−f)+1 and therefore
[TABLE]
Thus, we obtain the result for αp(N)∗(D).
The proof for βp(N)∗ is similar and we leave the details to the readers.
∎
Lemma 2.23**.**
If r=0 then we have
[TABLE]
Suppose that r≥1. Then for any 0≤f≤r, we have
[TABLE]
and
[TABLE]
where
[TABLE]
Proof.
For simplicity, let D=(P(N)d′pf). Then we have
[TABLE]
where k=min(f,r−f). Thus, by Lemma 2.12 we have
[TABLE]
Also, by Lemma 2.14 and (2.2), we have the following: If f=r=0, then
[TABLE]
Suppose that r≥1. If f=0, then
[TABLE]
If f=r≥1, then we have
[TABLE]
If r=1, then [ipr−1]pr=[11]p=(P(p)1) for any 1≤i≤p−1, and so the formula holds. If r≥2, then (P(pr)pr−1)=∑i=1p−1[ipr−1]pr and so the result follows.
Now, suppose that 1≤f≤r−1. If 1≤f≤(r+1)/2, then
[TABLE]
By direct computation, we have
[TABLE]
Thus, the result follows. If (r+1)/2<f≤r−1, then we have
[TABLE]
because any element of S(r−f+1) can be uniquely written as y+ipr−f for some y∈S(k)=S(r−f) and 0≤i≤p−1. This completes the proof.
∎
The following will be used later.
Lemma 2.24**.**
For any r≥0, we have
[TABLE]
Proof.
First, we have
[TABLE]
Next, suppose that the above formula holds for r≥1, i.e.,
[TABLE]
Then by Lemma 2.22, we have
[TABLE]
By induction we obtain the result.
∎
Remark 2.25*.*
Suppose that M is a prime not divisible by p. As above, for any D∈Divcusp(X0(M))(Q), we easily have
[TABLE]
Lemma 2.26**.**
We have
[TABLE]
If r≥1, then we have
[TABLE]
Therefore the map p1×π12(pr)∗ is well-defined as
a linear map from the group of rational cuspidal divisors on X0(pr) to that on X0(pr+2).
Proof.
We may prove the theorem by direct computation using
[TABLE]
But instead, we note that
[TABLE]
and therefore
[TABLE]
First, suppose that r=f=0. Then αp(1)∗(P(1)1)=p⋅(P(p)1)+(P(p)p). Thus, we have
[TABLE]
Next, suppose that r≥1. Suppose further that 0≤f≤r/2. Then we have
[TABLE]
Thus, we have
[TABLE]
If (r+1)/2≤f≤r, then we have βp(pr)∗(P(pr)pf)=p⋅(P(pr+1)pf+1), and hence
[TABLE]
Lastly, since the coefficient of (P(pr+2)pk) in π12(pr)∗(P(pr)pf) is divisible by p for any f and k, the last assertion follows.
∎
3. The order of a rational cuspidal divisor
For a degree [math] divisor D on X0(N), let D denote the linear equivalence class of D in J0(N).
By the order of D, we mean the order of D in J0(N), i.e., the smallest positive integer n such that n⋅D is equal to the divisor of a meromorphic function on X0(N). (If there does not exist such an integer, then we say that the order is infinite.)
In this section, we develop a method for computing the order of a degree [math] rational cuspidal divisor on X0(N), which is a slight elaboration of Ligozat’s method (Section 3.3). As an application, we compute the order of Cd for any non-trivial divisor d of N (Sections 3.5 and 3.6).
Before proceeding, we discuss a way to compute the order of a degree [math] cuspidal divisor on X0(N), which may not be rational.
Let D be a degree [math] cuspidal divisor on X0(N). By Manin [14] and Drinfeld [7], the order of D is finite. In other words, there is a modular function212121By a modular function on X0(N), we mean a meromorphic function on \cmcalH∗ invariant under the action of Γ0(N). on X0(N) whose divisor is an integral multiple of D.
Such a function has no zeros and poles on \cmcalH, and hence is called a modular unit.
Modular units for the modular curves X(N) or X1(N) have been studied by various mathematicians (most notably Kubert and Lang [10]), but those for the modular curve X0(N) have not much studied before (unless N is squarefree). In principle, if we know a precise description222222Since modular units on X0(N) are also modular units on X(N), they can be written in terms of Siegel’s units. However, we need a precise description of which Siegel’s units are invariant under the action of a more larger group Γ0(N). For some progress on modular units on X0(N), see [8]. of all the modular units on X0(N), then we can compute the order of any degree [math] cuspidal divisor on X0(N).
However, until now, there is no systematic way to compute the order of a non-rational cuspidal divisor except a method using modular symbols.
Now, let D be a degree [math] rational cuspidal divisor on X0(N). If n is the order of D, then there is a modular function F on X0(N) such that
[TABLE]
Such a function F has the following properties:
- (1)
It has no zeros and poles on \cmcalH.
2. (2)
Its order of vanishing at a cusp [xd]N of level d does not depend on x.
In the 1950s, Morris Newman constructed such functions using the Dedekind eta function [17, 18]. Now, they are called eta quotients (or eta products, depending on author’s preference, cf. [9, pg. 31]).
In fact, he found a sufficient condition when an eta quotient is a modular function on X0(N). Also, in the early 1970s Andrew Ogg proved a necessary and sufficient condition for eta quotients to be modular functions on X0(N) when N is either a prime or the product of two primes, and he computed the order of a degree [math] (rational) cuspidal divisors [19, 20].
Lastly, in 1975 Gerard Ligozat proved a necessary and sufficient condition for an eta quotient to be a modular function on X0(N) for any positive integer N (Proposition 3.5). As an application, he computed the order of the divisor (0)−(∞) on X0(N) for any positive integer N [11, Th. 3.2.16].
In 1997, Toshikazu Takagi proved that all modular units on X0(N) are eta quotients (up to constant) when N is squarefree [30, Th. 3.3]. We remark that this result can be obtained by the work of Ligozat. More precisely, Ligozat’s result is enough to prove that any modular unit on X0(N) such that its order of vanishing at a cusp [xd]N of level d does not depend on x is an eta quotient up to constant (Theorem 3.6). Since all cusps of X0(N) are defined over Q when N is of the form 2rM, where M is odd squarefree and r≤3, all modular units on X0(N) for such an N are indeed eta quotients (up to constant).
3.1. Eta quotients
Let η:\cmcalH→C be the Dedekind eta function defined by
[TABLE]
It is well-known that for any γ=(acbd)∈SL2(Z) with c≥0, we have
[TABLE]
where ϵ(a,b,c,d)—see [11, Sec. 3.1] for its definition—is a certain 24-th root of unity. One way to ignore this root of unity is to take its 24-th power. The function Δ(τ):=η(τ)24 is then invariant under the action of SL2(Z), and so it is a modular form of weight 12 for SL2(Z).
As mentioned at the beginning of Section 2, for any divisor δ of N, we have the map Fγ:X0(N)→X0(1), where γ=(δ001). Thus, the function
[TABLE]
is a modular form of weight 12 for Γ0(N), and so the ratio of such modular forms may be used to construct a modular function on X0(N).
Likewise, we define
[TABLE]
and consider the following.
Definition 3.1**.**
A function g:\cmcalH→C is called an eta quotient of level N if it is of the form g=∏δ∣Nηδrδ for some rδ∈Z.
As in Notation 1.8, we consider the Q-vector space \cmcalS1(N)Q and for any r=∑δ∣Nrδ⋅e(N)δ∈\cmcalS1(N)Q, we define a generalized eta quotient of level N by
[TABLE]
which is regarded as a power series in q with rational coefficients (after multiplying suitable rational power of q if necessary). Our interest is to understand when such a product is a modular function on X0(N).
Lemma 3.2**.**
Let d and δ be two divisors of N. The order of vanishing of Δδ at a cusp [xd]N of level d is
[TABLE]
Proof.
Let γ=(xdab)∈SL2(Z) so that γ∞=[xd]. Then the order of Δδ at a cusp [xd]N is the smallest power of qh:=q1/h=e2πiτ/h in the Puiseux expansion of (dτ+b)−12Δδ(γτ), where h is the width of a cusp [xd]N. (For example, see [6, Sec. 3.2].) By Lemma 2.10, we have h=d⋅gcd(d,N/d)N.
Let g=gcd(d,δ), and write d=gd1 and δ=gδ1 with gcd(d1,δ1)=1.
Since gcd(xδ1,d1)=1, there are integers m and n such that xδ1n−d1m=1.
By direct computation, we have
[TABLE]
Since
[TABLE]
and (xδ1d1mn)∈SL2(Z), we have
[TABLE]
Thus, the Puiseux expansion of (dτ+b)−12Δδ(γτ) is
[TABLE]
and therefore the order of vanishing of Δδ at a cusp [xd]N is
[TABLE]
This completes the proof.
∎
Definition 3.3**.**
For a positive integer N, let
[TABLE]
be a square matrix of size σ0(N), indexed by the divisors of N. We regard this matrix as a linear map from \cmcalS1(N)Q to \cmcalS2(N)Q.
By Lemma 3.2, we have the following, which is [11, Prop. 3.2.8].
Lemma 3.4**.**
Let r=∑δ∣Nrδ⋅e(N)δ∈\cmcalS1(N)Q. Then we have
[TABLE]
If gr is a modular function on X0(N), then its order of vanishing at any cusp is an integer and the degree of its divisor is zero. Thus, we have
Λ(N)×r∈\cmcalS2(N)0. It turns out that such properties are almost enough for an eta quotient to be a modular function.
Proposition 3.5** (Ligozat).**
Let r=∑δ∣Nrδ⋅e(N)δ∈\cmcalS1(N)Q and
gr=∏δ∣Nηδrδ. Then gr is a modular function on X0(N) if and only if all the following conditions are satisfied:
- (0)
all rδ are rational integers, i.e., r∈\cmcalS1(N).
2. (1)
∑δ∣Nrδ⋅δ≡0(mod24).
3. (2)
∑δ∣Nrδ⋅(N/δ)≡0(mod24).
4. (3)
∑δ∣Nrδ=0.
5. (4)
∏δ∣Nδrδ* is the square of a rational number.*
Proof.
The proof can be obtained from [11, Sec. 3.2]. For the sake of the readers, we explain this proof in detail.
First, for any γ=(aNcbd)∈Γ0(N) and rδ∈Z, we have
[TABLE]
where ε(γ) is a certain root of unity depending on γ and r.
Indeed, by the same idea used in (3.1) and (3.2), we obtain this formula, which will not be done here.
In this case, the argument is much simpler because we may use the equality (δ001)(aNcbd)=(aNcδ−1bδd)(δ001) instead of (3.1).
See page 29 of op. cit. for more detail.
Suppose that r satisfies all the conditions above.
To prove that gr is a modular function on X0(N), it suffices to show that ε(γ)=1 for any γ∈Γ0(N).
In fact, by the argument on [18, pg. 374], it suffices to show that ε(γ)=1 for the matrices γ=(aNcbd)∈Γ0(N) satisfying gcd(a,6)=1, a>0 and c>0. By direct computation, we easily have
ε(γ)=1 for such γ. (For more detail, see [17].)
Conversely, suppose that gr is a modular function on X0(N). Then the order of vanishing of gr at a cusp must be an integer. By Lemma 3.4, for any divisor d of N, we get 241∑δ∣NaN(d,δ)⋅rδ∈Z. By direct computation, aN(N,δ)=δ and aN(1,δ)=N/δ, so conditions (1) and (2) are satisfied. Also, since the degree of div(gr) is zero, condition (3) is fulfilled.
Furthermore, by the transformation property of gr, we must have ∏δ∣N(aδ)rδ=1 for any a>0 with gcd(a,6N)=1. In particular, for any prime p not dividing 6N, we have (px)=1, where x=∏δ∣Nδrδ. This only holds when x is a square (cf. Lemme on [11, pg. 32]), and hence condition (4) is satisfied.
Thus, it suffices to show that rδ∈Z for all divisors δ of N.
Note that since ∑δ∣Nrδ=0, gr is a power series in q with rational coefficients.
As on [11, pg. 39], we easily have rδ∈Z for all δ if gr∈Z[[q]] (cf. [26, Lem. 19]).
Instead, we follow the argument on [20, pg. 458]. Since the order of vanishing of Δ at a cusp [xd]N of level d is d⋅gcd(d,N/d)N by Lemma 3.2, a function g′=gr⋅Δk vanishes at all cusps for a sufficiently large integer k.
Thus, g′ is a cusp form of weight 12k for Γ0(N) with rational Fourier coefficients.
Since these coefficients have bounded denominators by [27, Th. 3.52], there is a non-zero integer b such that g′′=b⋅g′∈Z[[q]]. Since g′′ does not vanish on \cmcalH, we can write g′′=c∏δ∣Nηδsδ for some c∈Z and sδ∈Z by [26, Th. 7]. Since g′′=0, we have c=0 and hence rδ∈Z.232323For a generalization of such an argument, see Theorem 4.2 of [10, Ch. 4] and
the proof of the claim (d) on [32, pg. 421]. (Note that r1=s1−12k and rδ=sδ for any δ∈\cmcalDN0.)
∎
Theorem 3.6**.**
Suppose that F is a modular unit on X0(N) such that its order of vanishing at a cusp [xd]N of level d does not depend on x. Then there is a constant ϵ∈C× and an eta quotient gr of level N such that F=ϵ⋅gr.
Proof.
This basically follows from the fact that the matrix Λ(N) is invertible ([11, Lem. 3.2.9] or Lemma 3.7 below).
By our assumption, we can write
[TABLE]
Let r=∑δ∣Nrδ⋅e(N)δ=Λ(N)−1×(∑d∣Nad⋅e(N)d).
Note that a priori we only have rδ∈Q, and so gr might not be an eta quotient.
Since
[TABLE]
we have div(gr)=∑d∣Nad⋅(Pd)=div(F). Thus, there is a constant ϵ∈C× such that F=ϵ⋅gr. Since F is a modular function on X0(N), so is gr. Therefore we have rδ∈Z by Proposition 3.5, and so gr is indeed an eta quotient of level N.
∎
3.2. The matrix Υ(N)
For a prime p and a positive integer r, we define a tridiagonal matrix
Υ(pr) (indexed by the divisors of pr) by
[TABLE]
where m(f)=min(f,r−f). In other words, we have
[TABLE]
If we write N=∏i=1tpiri, then we define a matrix Υ(N) (indexed by the divisors of N) by
[TABLE]
In other words, if δ=∏i=1tpiei and d=∏i=1tpifi, then we have
[TABLE]
From now on, we regard Υ(N) as a linear map from \cmcalS2(N)Q to \cmcalS1(N)Q by our identifications {\cmcal{S}}_{k}(N)_{\mathbf{Q}}=\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}_{i=1}^{t}{\cmcal{S}}_{k}(p_{i}^{r_{i}})_{\mathbf{Q}} (cf. Remark 2.20).
Lemma 3.7**.**
For any positive integer N>1, we have
[TABLE]
where Idn is the identity matrix of size n. In particular, Λ(N) is invertible.
Proof.
For any integer N, let Λ(N)′=24×Λ(N). It suffices to prove that
[TABLE]
First, suppose that N=pr is a prime power. Then by direct computation (or by [12, Prop. 3]), the result follows.
Next, let N=Mpr with gcd(M,p)=1 and r≥1. Suppose that
[TABLE]
Let d and δ be two divisors of M, and 0≤f,g≤r.
By direct computation, we have
[TABLE]
and so \Lambda(N)^{\prime}=\Lambda(M)^{\prime}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\Lambda(p^{r})^{\prime}. Since \Upsilon(N)=\Upsilon(M)\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\Upsilon(p^{r}) by definition, we have
[TABLE]
Similarly, we have Λ(N)′×Υ(N)=κ(N)×Idσ0(N).
By induction, the result follows.
∎
Lemma 3.8**.**
For a divisor d of N, let
[TABLE]
be the d-th column of the matrix Υ(N) and let z=gcd(d,N/d). Then we have the following.
- (1)
We have
[TABLE]
where a(p)=2 if p divides z, and a(p)=1 otherwise.
2. (2)
We have
[TABLE]
3. (3)
We have
[TABLE]
4. (4)
We have
[TABLE]
Proof.
Let N=∏i=1tpiri and d=∏i=1tpifi. Since {\bf e}(N)_{d}=\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}_{i=1}^{t}{\bf e}(p_{i}^{r_{i}})_{p_{i}^{f_{i}}}, we have
[TABLE]
Since the constant function, the identity function, the reciprocal function and the greatest common divisor function are multiplicative, both sides are multiplicative. Thus, it suffices to check the formulas when N=pr with r≥1. By direct computation, we have
[TABLE]
This completes the proof.
∎
Remark 3.9*.*
The second and the third equalities easily follow from Lemma 3.7 because the N-th (resp. first) row of 24×Λ(N) is ∑d∣Nd⋅e(N)d (resp. ∑d∣N(N/d)⋅e(N)d).
3.3. Algorithm for computing the order
In this subsection, we elaborate Ligozat’s method and provide a simple algorithm for computing the order of a degree [math] rational cuspidal divisor C on X0(N). We first construct a generalized eta quotient g(r(C)) as follows:
[TABLE]
where \cmcalE is the set of generalized eta quotients of level N,
[TABLE]
The following is well-known (cf. [12, pg. 36]).
Proposition 3.10**.**
For a rational cuspidal divisor C on X0(N), the order of C is the smallest positive integer n such that g(n⋅r(C)) is a modular function on X0(N), or equivalently n⋅r(C) satisfies all the conditions in Proposition 3.5.
Proof.
Let k be the order of C. Then by the definition of k, there is a modular function F on X0(N) such that k⋅C=div(F). Since the divisor of g(r(C)) is C by Lemma 3.4, the divisors of F and g(k⋅r(C)) are equal. Therefore there is a constant ϵ∈C× such that F=ϵ⋅g(k⋅r(C)), and so g(k⋅r(C)) is also a modular function on X0(N).
By the minimal property of n, we have n≤k.
Conversely, by the definition of n, g(n⋅r(C)) is a modular function on X0(N). Thus, by the minimal property of k, we have k≤n because the divisor of g(n⋅r(C)) is n⋅C. This completes the proof.
∎
The following is crucial in our method.
Corollary 3.11**.**
Let X∈Divcusp0(X0(N))(Q) be a degree [math] rational cuspidal divisor on X0(N). For a prime p, let
[TABLE]
where the sum runs over the divisors of N whose p-adic valuations are odd. Then the following are equivalent.
- (1)
The order of X is 1, or equivalently X=0∈J0(N).
2. (2)
r(X)∈\cmcalS1(N)* and r\mathchar45Pwp(X)∈2Z for all primes p.*
Proof.
Let X=∑d∣Nad⋅(Pd)∈Divcusp0(X0(N))(Q). Note that conditions (1), (2) and (3) for r(X) follow from aN∈Z, a1∈Z and the degree of X is [math], respectively. (For instance, see the proof of Proposition 3.5 above.) Thus, conditions (1), (2) and (3) for n⋅r(X) are always satisfied for any integer n≥1.
Note also that the normalized p-adic valuation of the product in condition (4) is ∑δ∣Nvalp(δ)⋅r(X)δ. Since
[TABLE]
the result easily follows by Proposition 3.10.
∎
In principle, we can compute the order of any rational cuspidal divisor by Proposition 3.10 above. Since Υ(N) is a scalar multiple of Λ(N)−1 and is an integral matrix, it is easy to compute various invariants in the following.
Definition 3.12**.**
For C=∑d∣Nad⋅(Pd)∈Divcusp(X0(N))(Q), we define
[TABLE]
In other words, for any divisor δ of N, we have
[TABLE]
Let GCD(C):=gcd(V(C)δ:δ∣N) be the greatest common divisor of the entries of V(C)
and let
[TABLE]
Furthermore, let
[TABLE]
where the sum runs over the divisors of N whose p-adic valuations are odd. Finally, let
[TABLE]
The main theorem of this section is the following, which gives an easy algorithm for computing the order of any degree [math] rational cuspidal divisor.
Theorem 3.13**.**
For any C∈Divcusp0(X0(N))(Q), the order of C is
[TABLE]
In particular, if the genus of X0(N) is positive, then the order of C divides 12κ(N).
Remark 3.14*.*
Since κ(N)=N∏p∣N(p−p−1) is multiplicative and p2−1 is divisible by 24 for any prime p≥5, it is easy to see that κ(N) is divisible by 24 if and only if N∈{1,2,3,4,8}.
Proof of Theorem 3.13.
First, suppose that the genus of X0(N) is [math], in which case
[TABLE]
Then we can easily verify the formula, which we leave to the readers. (In fact, the order of any degree [math] rational cuspidal divisor is 1.) So we assume that the genus of X0(N) is positive, in which case 24κ(N)∈Z by the remark above.
Next, let C∈Divcusp0(X0(N))(Q). Since V(C)∈\cmcalS1(N), it is easy to check that 2V(C) satisfies conditions (0) and (4). Note that 2V(C)=12κ(N)⋅r(C).
Since conditions (1), (2) and (3) for n⋅r(C) are always satisfied for any integer n≥1 as in Corollary 3.11, the order of C is a divisor of 12κ(N) by Proposition 3.10.
Accordingly, let m be the largest divisor of 12k(N) such that
[TABLE]
satisfies conditions (0) and (4), where k=m2⋅GCD(C). Then the order of C is 12mκ(N).
Since the greatest common divisor of the entries of V(C) is 1 and r(m)∈\cmcalS1(N), we have k∈Z. Also, since r(m)=k⋅V(C), for any prime p we have
[TABLE]
Thus, k⋅Pwp(C)∈2Z for all primes p by condition (4), and hence we must have either k∈2Z or Pwp(C)∈2Z. So by definition, if h(C)=1 then there is no condition on k, and if h(C)=2 then there is a prime p such that Pwp(C)∈2Z, in which case k must be even.
Therefore k must be divisible by h(C) in both cases, and so m is a divisor of h(C)2⋅GCD(C). Since m is given as a divisor of 12κ(N), we have
[TABLE]
This implies 12m=gcd(κ(N),24⋅GCD(C)⋅h(C)−1), and so the result follows.
∎
3.4. Example I: The divisors defined by tensors
Let C∈Divcusp0(X0(N))(Q) be a degree [math] rational cuspidal divisor on X0(N). We say that C is defined by tensors if there are non-trivial divisors N1 and N2 of N such that N=N1N2, gcd(N1,N2)=1 and \Phi_{N}(C)=V_{1}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}V_{2} for some Vi∈\cmcalS2(Ni).
Let Ci=ΦNi−1(Vi) so that Ci∈Divcusp(X0(Ni))(Q). Since the degree of C is the product of the degrees of C1 and C2, we further assume that the degree of C1 is zero.
The main result of this section is the following.
Theorem 3.15**.**
Suppose that C is defined by tensors and written as above. Then we have
[TABLE]
Also, we have GCD(C)=GCD(C1)⋅GCD(C2) and
[TABLE]
Proof.
By definition, we have
[TABLE]
Also, by the definition of the tensor product, the greatest common divisor of the entries of v⊗w is the product of the greatest common divisors of the entries of v and w. Indeed, if we let g (resp. h) be the greatest common divisor of the entries of v (resp. w), then
all the entries of v⊗w are divisible by gh and hence the greatest common divisor of the entries of v⊗w is a multiple of gh. Conversely, since there are integers ai and bj such that g=∑iaivi and h=∑jbjwj, where v=(vi) and w=(wj),
we have
[TABLE]
and so gh is divisible by the greatest common divisor of the entries of v⊗w. Thus, the first assertion follows.
Next, let p be a prime divisor of N1. For a divisor δ of N, let δi=gcd(δ,Ni). Then by definition, we have
[TABLE]
Similarly, if p divides N2, then we have Pwp(C)=Pwp(C2)⋅∑δ1∣N1V(C1)δ1. Since the degree of C1 is zero, the sum of the entries of V(C1) is zero and hence the result follows.
∎
Remark 3.16*.*
In the previous papers [34, 36], the author computed the orders of certain rational cuspidal divisors, which are eigenvectors under the action of the Hecke operators. The secret to our success of the computations is that the rational cuspidal divisors in our consideration are defined by tensors, and hence we could use an inductive method based on Theorem 3.15.
The author did not know the work of Yazdani [33] at the time of writing of the papers [34, 36]. After reading [33], the author decided to follow Yazdani’s idea, which enormously simplifies our previous notation.
3.5. Example II: The order of the divisor CN
In this subsection, we compute the order of CN for any positive integer N.
Theorem 3.17**.**
Let N be an integer greater than 1. Then we have
[TABLE]
where g(N) and h(N) are defined as follows:
- (1)
g(1):=0.
2. (2)
g(N)=gcd(N+(−1)t−1,pi2−1:1≤i≤t)* if N=∏i=1tpi is squarefree.*
3. (3)
g(N):=gcd(p,g(M))* if N=Mp2 with gcd(M,p)=1 and M squarefree.*
4. (4)
g(N):=1* otherwise.*
Also, h(N)=2 if one of the following holds, and h(N)=1 otherwise242424This is a variant of h in [34, Th. 1.3] and of h(M,N,D) in [36, Th. 4.3]..
- (1)
N=p* for a prime p.*
2. (2)
N=2r* for an odd integer r.*
3. (3)
N=pq* for two distinct odd primes p and q such that val2(p−1)=val2(q−1) and val2(p+1)=val2(q+1).*
4. (4)
N=4p* for a prime p congruent 1 modulo 4.*
By Theorem 3.13, we have the following.
Theorem 3.18**.**
Let N be a positive integer, and let n(N) be the order of CN.
If N=pr is a prime power, then we have
[TABLE]
If N is not a prime power, then we have the following.
- (1)
For an odd prime q, we have n(2q)=8⋅gcd(3,q+1)q2−1.
If N=pq for two distinct odd primes p and q, then
[TABLE]
2. (2)
If N=∏i=1tpi be a squarefree integer with t≥3, then
[TABLE]
3. (3)
If N=Mp2 for a squarefree integer M not divisible by a prime p, then
[TABLE]
4. (4)
If N is not of the form considered above, i.e., either N is divisible by p3 or by p2q2, then n(N)=24κ(N).
Proof of Theorems 3.17 and 3.18.
First, let N=pr be a prime power. If r=1, then we have
[TABLE]
Thus, GCD(CN)=p+1 and h(CN)=2 because Pwp(CN)=−1 is odd. Therefore the order of CN is the numerator of 12p−1.
If r=2, then we have
[TABLE]
Thus, GCD(CN)=p and h(CN)=1 because Pwp(CN)=0∈2Z. Suppose that r≥3.
Then we have
[TABLE]
where the dots denote zero entries. Thus, GCD(CN)=1 and
[TABLE]
Therefore h(CN)=2 if and only if p=2 and r∈2Z.
If p is odd then pr−1(p2−1) is divisible by 24 and hence the result follows.
From now on, we assume that N is divisible by at least two primes. For simplicity, let
[TABLE]
- (1)
Let N=pq, and assume that q is an odd prime. Then we have
[TABLE]
Thus, we have
[TABLE]
and
[TABLE]
If p=2, then GCD(CN)=gcd(3,q+1), which is odd. Since q is odd, Powerp(CN) and Powerq(CN) are both even, and hence h(CN)=1.
Suppose that p is odd. If val2(p−1)=val2(q−1) and val2(p+1)=val2(q+1), then we have val2(GCD(CN))=val2((p−1)(q+1)) (cf. Lemma 3.20 below). Thus, we have h(CN)=2 in this case.
Suppose that either val2(p−1)=val2(q−1) or val2(p+1)=val2(q+1).
Then we have val2(pq−1)=val2(p−q).252525For instance, if val2(p−1)>val2(q−1), then we have
val2(pq−1)=val2(q(p−1)+(q−1))=val2(q−1)=val2(p−q).
Therefore we have
[TABLE]
Similarly, we have val2((p+1)(q−1))>val2(pq−1) and so h(CN)=1. By Lemma 3.20 below, we can simplify the formula and thus we have
[TABLE]
2. (2)
Let N=∏i=1tpi with t≥3. As above, for a divisor d of N, let r(d) be the d-th column vector of the matrix Υ(N). By direct computation, we have
[TABLE]
where μ(d) is the Möbius function. We claim that
[TABLE]
Note that
[TABLE]
Thus, GCD(CN) divides pi2−1, and so g(N).
Conversely, since pi2≡1(modg(N)), we have δ2≡1(modg(N)) for any divisor δ of N. Therefore we have
[TABLE]
Since g(N) is relatively prime to N, it divides V(CN)δ for any divisor δ of N, and hence GCD(CN). This proves the claim.
Next, let d be a divisor of N, and p=pi for some i. Let d′=gcd(d,M), where M=N/p. Then we have
[TABLE]
where ϵ=valp(d). Since M is squarefree, by Lemma 3.8(1) we have
[TABLE]
Thus, we have h(CN)=1 by Lemma 3.21 below. Since (p2−1)(q2−1) is always divisible by 24 and pi2−1≡0(modg(N)), we have
24⋅g(N)κ(N)∈Z, and therefore we have
[TABLE]
3. (3)
Let N=Mp2 for a prime p not dividing M. Let r′(d) be the d-th column vector of Υ(M).
Then by direct computation, we have
[TABLE]
Since r′(1)rad(M)=(−1)k, where k is the number of prime divisors of M, GCD(CN) divides p. Since V(CM)=r′(1)−r′(M), we have
[TABLE]
If M is squarefree, we have GCD(CM)=g(M) by the result above
and hence GCD(CN)=gcd(p,g(M)).
Next, we have
[TABLE]
and for a prime divisor ℓ of M
[TABLE]
Since Powerℓ(CM)=GCD(CM)⋅Pwℓ(CM), we have h(CN)=1 if p is odd. Thus, the result follows when p is odd.
Lastly, let p=2 and assume that M is odd and squarefree. Then by definition, g(M) is even, and therefore we have GCD(CN)=2. As discussed above, we have Powerℓ(CN)=Powerℓ(CM). If h(CM)=1, then Powerℓ(CN) is divisible by 4 and so h(CN)=1. Suppose that h(CM)=2.
Since M is odd and squarefree, Powerℓ(CM) is divisible by 4 unless M is a prime congruent to 1 modulo 4, in which case we obtain h(CN)=2 (cf. Remark 3.19 below). Thus, the result follows.
4. (4)
If N is not a prime power, we have 24κ(N)∈Z (Remark 3.14) and so it suffices to show that GCD(CN)=1 and h(CN)=1.
Suppose first that N is divisible by p2q2. We may assume that p is an odd prime.
Then by taking M=N/p2 (resp. M=N/q2) in (3) above, we obtain that GCD(CN) divides p (resp. q). Therefore GCD(CN)=1. Also, since p is odd, we have Powerℓ(CN)∈2Z for any prime divisor ℓ of M=N/p2. Thus, we have h(CN)=1 as well.
Suppose next that N=Mpr with gcd(M,p)=1 and r≥3.
Since we assume that N is not a prime power, we have M>1.
If we use the same notation as in (3), then we have
[TABLE]
where O is the zero matrix of suitable size.
Since r′(1)rad(M)=(−1)k, we have GCD(CN)=1. Also,
as above we have
[TABLE]
where ϵ=1 if r is even, and ϵ=−p if r is odd. Since N is divisible by an odd prime, we have Powerp(CN)∈2Z. Also, we have
[TABLE]
and hence Powerℓ(CN)∈2Z if p is odd. Furthermore, if p=2 and M>1, then we have
[TABLE]
and hence h(CN)=1. Indeed, since M is odd we have the following:
If M is squarefree, then GCD(CM)=g(M)∈2Z.
If M is exactly divisible by ℓ2, then Powerℓ(CM)∈2Z as in (3).
If M is divisible by ℓ3, then Powerℓ(CM)∈2Z as above.
This completes the proof.
∎
Remark 3.19*.*
Let M be an odd and squarefree integer. By definition, g(M) is even. Suppose that h(M)=2. Then either M is a prime or the product of two primes satisfying certain conditions above. In the latter case, we can prove that g(M) is divisible by 8 (cf. Lemma 3.20). Thus, g(M) is divisible by 4 unless M is a prime congruent to 1 modulo 4, in which case g(M)=M+1≡2(mod4).
We finish this section by proving two lemmas used above.
Lemma 3.20**.**
Let p and q be two distinct odd primes. Then we have
[TABLE]
where a=0 if val2(p−1)=val2(q−1) and val2(p+1)=val2(q+1), and a=−1 otherwise.
Proof.
Let g=gcd(pq−1,p−q). Then gcd(g,p)=gcd(g,q)=1 because pq≡1(modg). This directly implies the first equality because
[TABLE]
Next, let G=gcd(pq−1,p2−1,q2−1) and H=gcd(p−1,q−1)⋅gcd(p+1,q+1).
If ℓ is an odd prime, then we easily have that valℓ(G)=valℓ(H) because ℓ cannot divide both p−1 and p+1.
Finally, we compute b=val2(G)−val2(H). Let r=val2(p−1) and s=val2(q−1), which are both at least 1.
Without loss of generality we may assume that r≥s≥1. Note that val2(p2−1)≥r+1 and val2(q2−1)≥s+1.
If r>s, then we have
[TABLE]
Therefore val2(G)=s. Since s≥1, we have r≥2 and hence val2(p+1)=1. So we have val2(H)=s+1 and b=−1.
If r=s≥2, then val2(p+1)=val2(q+1)=1, and so val2(p2−1)=val2(q2−1)=s+1. Also, we have
[TABLE]
and therefore val2(G)=s+1. Since val2(H)=s+1, we have b=0.
Assume that r=s=1. Let x=val2(p+1) and y=val2(q+1), which are both at least 2. If x>y, then similarly as above, we have val2(G)=y and val(H)=y+1, and so b=−1. If x=y≥2, then we have
[TABLE]
Therefore val2(G)=val2(H)=x+1 and hence b=0.
This completes the proof.
∎
Lemma 3.21**.**
Let N=∏i=1tpi be a squarefree integer with t≥3. Let
[TABLE]
Then 2⋅g(N) divides si(N) for all 1≤i≤t.
Proof.
It suffices to show that for any prime ℓ we have
[TABLE]
Since t≥3, si(N) is always even and hence the inequality above always holds if g(N) is not divisible by ℓ. Thus, we may assume that valℓ(g(N))≥1.
First, suppose that ℓ is odd.
Since either valℓ(pj−1)=0 or valℓ(pj+1)=0, we have
[TABLE]
Therefore we have valℓ(g(N))≤valℓ(si(N)) unless
[TABLE]
in which case we have valℓ(pj+1)≥1 for all j=i because valℓ(g(N))≥1. Thus, we have
[TABLE]
which is a contradiction to the assumption that valℓ(g(N))≥1 because ℓ is odd.
Next, let ℓ=2. Note that if N is even then g(N) is odd by definition.
Thus, N is odd because we assume that g(N) is divisible by ℓ=2, and hence all pi are odd.
So we have val2(si(N))≥t. Let g=val2(g(N)).
If g≤t−1, then there is nothing to prove, so we further assume that g≥t, which is at least 3 by our assumption.
Since val2(pj2−1)=val2(pj−1)+val2(pj+1)≥g, and either val2(pi−1)=1 or val2(pi+1)=1, we have val2(pi−1)≥g−1≥2 or val2(pi+1)≥g−1≥2.
Thus, we have val2(si(N))≥g+1 unless
[TABLE]
in which case we have
val2(pi−1)≥2 and val2(pj+1)≥2 for all j=i. Thus, we have
[TABLE]
This implies that g=1, which is a contradiction. This completes the proof.
∎
3.6. Example III: The order of the divisor Cd
In this subsection, we compute the order of Cd for any positive integer N and a non-trivial divisor d of N with d=N. By Theorem 3.13, it suffices to prove the following.
Theorem 3.22**.**
Let N be a positive integer and let d be a non-trivial divisor of N. Then we have
[TABLE]
where g(N,d) and h(N,d) are defined as follows: First, we set
[TABLE]
Next, suppose that d=N, and let z=gcd(d,N/d). Then we set
- (1)
g(N,d):=g(d)* if z=1.*
2. (2)
g(N,d):=gcd(p,g(d/p))* if z=p is a prime and valp(d)=1.*
3. (3)
g(N,d):=rad(z)z* otherwise.*
Also, h(N,d):=2 if one of the following holds, and h(N,d):=1 otherwise.
- (1)
N=2r* for r≥2 and d=2.*
2. (2)
N=2r* for r≥2 and d=2f for f∈2Z.*
3. (3)
N=2rp* for r≥2 and d=2p, where p≡1(mod4) is a prime.*
4. (4)
N=2rd* for r≥1 and d=p, where p is an odd prime.*
5. (5)
N=2rd* for r≥1 and d=pq, where p and q are two distinct odd primes such that val2(p−1)=val2(q−1) and val2(p+1)=val2(q+1).*
Proof.
By Theorem 3.17, the result follows for d=N. Thus, we henceforth assume that d=N. During the proof, let O denote the zero matrix of suitable size.
First, let N=pr with r≥2. If d=p, then we have
[TABLE]
Thus, GCD(Cd)=p and Pwp(Cd)=−p−1. Therefore h(Cd)=1 if p is odd, and h(Cd)=2 if p=2.
Suppose that d=p2 (so r≥3). Then we have
[TABLE]
Thus, GCD(Cd)=pm(2)−1=rad(z)z and Pwp(Cd)=p+1. Therefore h(Cd)=2 if p=2, and h(Cd)=1 if p is odd.
Suppose that d=pf with 3≤f≤r−1. Then we have
[TABLE]
because φ(pm(f))=pm(f)−1(p−1). Thus, GCD(Cd)=pm(f)−1 and
[TABLE]
Therefore h(Cd)=2 if p=2 and f is even, and h(Cd)=1 otherwise. This proves the result for the case where N is a prime power.
Next, let N=Mpr with gcd(M,p)=1 and r≥1. We assume that M>1, so there is a prime divisor of M different from p. For a divisor d of N, let d′=gcd(M,d) and pf=gcd(d,pr), so that d=d′pf.
Also, let z′=gcd(d′,M/d′). Note that
[TABLE]
and {\bf e}(N)_{d}={\bf e}(M)_{d^{\prime}}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\bf e}(p^{r})_{p^{f}}. For a divisor k of M, let r′(k):=Υ(M)×e(M)k be the k-th column vector of the matrix Υ(M). Then we have
[TABLE]
Here, we allow d′=1, in which case V(C(M)d′) is the zero vector. From now on, let ℓ denote a prime divisor of M.
By Lemma 3.8(4), the greatest common divisor of the entries of r(d′) is rad(z′)z′. Since φ(z′)=rad(z′)z′∏ℓ∣z′(ℓ−1),
we easily have
[TABLE]
Also, since the degree of C(M)d′ is zero, we have
[TABLE]
Furthermore, we have
[TABLE]
This will be used below without further mention.
By direct computation, we have
[TABLE]
Similarly, if f=0 (resp. f=r), then
[TABLE]
and if 1≤f≤r−1, then
[TABLE]
Using this computation, we have the following.
Case 1. Assume that f=0 and r≥1. Then m(f)=0 and d=d′. We have
[TABLE]
Thus, we have
[TABLE]
Next, by definition we have
[TABLE]
and
[TABLE]
Thus, we have
[TABLE]
Case 2. Assume that f=1 and r≥2. Then m(f)=1 and we have
[TABLE]
Note first that by Lemma 3.8(4), p does not divide the greatest common divisor of the entries of r′(d′), and so p2 does not divide GCD(Cd).
Since φ(z)=φ(z′)(p−1) and V(C(M)d′)=φ(z′)⋅r′(1)−r′(d′), we have
[TABLE]
Thus, p divides GCD(Cd) if and only if p divides GCD(C(M)d′).
Moreover, by Lemma 3.8(4) and (3.3) we have
[TABLE]
Next, since the sum of the entries of V(C(M)d′) is zero, by (3.4) we have
[TABLE]
which is equal to −rad(z′)z′×p(p+1)∏ℓ∣M(ℓ−1)a(ℓ) by Lemma 3.8(1).
Lastly, as above we have
[TABLE]
Thus, h(Cd)=2 if and only if all the following hold.
p=2.
val2(α)≤1.
h(C(M)d′)=2.
Case 3. Assume that f=2 and r≥3. We have
[TABLE]
By Lemma 3.8(4), all entries of pm(2)−1⋅r′(d′) are divisible by rad(z)z. Since φ(z) is also divisible by rad(z)z, all entries of V(Cd) are divisible by rad(z)z. By comparing the entries of pm(2)−1(p2+1)⋅r′(d′) and pm(2)⋅r′(d′), we have
[TABLE]
Next, since the sum of the entries of V(C(M)d′) is zero, as above we have
[TABLE]
Lastly, as above we have
[TABLE]
Thus, h(Cd)=2 if and only if all the following hold.
p=2.
α is odd.
h(C(M)d′)=2.
Case 4. Assume that 3≤f≤r−1. We have
[TABLE]
As above, we have
[TABLE]
Also, if f is odd (resp. even) then the computation of Powerq(Cd) for any prime q is the same as Case 2 (resp. Case 3) above, and we have
[TABLE]
and
[TABLE]
Thus, h(Cd)=2 if and only if all the following hold.
p=2.
α is odd.
h(C(M)d′)=2.
Now, we are ready to prove our theorem when N is divisible by at least two primes.
Firstly, suppose that z=1. Then by Case (1), we have
[TABLE]
Thus, h(Cd)=2 if and only if N/d is a power of 2 and h(C(d)d)=h(d)=2.
Since z=1, if N/d is a power of 2, then d must be odd. By definition, we have h(d)=1 unless d is squarefree. Thus, h(Cd)=2 if and only if N=2rd with d odd squarefree and h(d)=2. (These are the cases (4) and (5) in the definition of h(N,d).)
Secondly, suppose that z is a power of a prime p. If we write N=Mpr with gcd(M,p)=1, then d=d′pf for a divisor d′ of M and 1≤f≤r−1. Since z′=gcd(d′,M/d′)=1, we have GCD(C(M)d′)=GCD(C(d′)d′)=g(d′). In the discussion below, we use the same notation as in Cases (1)–(4).
Assume that f≥2. Then we are in Cases (3) and (4), and so we easily have GCD(Cd)=rad(z)z. Also, we always have h(Cd)=1. Indeed, if p=2, then we have either α∈2Z or h(C(M)d′)=1.
More specifically, let p=2 (and so M is odd). If d′=M, then there is an odd prime ℓ dividing M/d′ because M is odd. Since z′=1, by Case (1) we have h(C(M)d′)=1. If d′=M, then GCD(C(M)d′)=g(M)=α. Since M is odd, either h(C(M)d′)=h(M)=1 or g(M)=α∈2Z (cf. Remark 3.19).
Assume that f=1. Then we are in Case (2). Since d′=d/p and z′=1, we have GCD(C(M)d′)=GCD(C(d′)d′)=g(d′)=g(d/p) by Case (1). As above, we have
[TABLE]
Next, we compute h(Cd). By the discussion above, h(Cd)=2 if and only if p=2, α≡0(mod4), and h(C(M)d′)=2. Let p=2. If d′=M then we have h(C(M)d′)=1 as above.
Thus, we further assume that d′=M. Then h(C(M)d′)=h(M)=2 and g(M)≡0(mod4) if and only if M is a prime congruent to 1 modulo 4 (cf. Remark 3.19).
Finally, suppose that z is divisible by two distinct primes p and q. As above, let N=Aprqs with gcd(A,pq)=1.
If either valp(d)≥2 or valq(d)≥2, then as discussed above we have
[TABLE]
Thus, we further assume that d=d′pq, where d′=gcd(A,d). Note that rad(z)z=rad(z′)z′, where z′=gcd(d′,A/d′).
As in (3.3), we have
[TABLE]
If we take M=Aqs then by Case (2) we have β=gcd(p,α), which is a divisor of p.
By swapping the roles of p and q, we obtain that β is also a divisor of q. Therefore we have β=1. Since either p or q is odd, we also have h(Cd)=1 by Case (2).
This completes the proof.
∎
4. Motivational Examples
We hope to find rational cuspidal divisors Z(d) on X0(N) such that
[TABLE]
It turns out that it is indeed possible if N is a prime power. On the other hand, if N is divisible by at least two primes, then the author does not know how to find such divisors except a few easy cases. Instead, for any given prime ℓ, we try to find rational cuspidal divisors Zℓ(d) such that
[TABLE]
Indeed, we can do more as follows: we construct rational cuspidal divisors Z(d) such that
[TABLE]
where C(N)sf:=⟨Z(d):d∈\cmcalDNsf⟩. Also, for any d∈\cmcalDNsf, we find a rational cuspidal divisor Y2(d) such that
[TABLE]
The purpose of this section is to explain our initial ideas for the computation of C(N) in great detail and to define rational cuspidal divisors Z(d) and Y2(d). Also, we try to give a motivation behind the definition. We will not prove our claims since we will do in the proceeding sections.
From now on, a vector in \cmcalSk(pr) is written as (a0,…,ar) so that its pi-th entry is ai, i.e.,
[TABLE]
4.1. Case of level pr
In this subsection, we explain our construction of Bp(r,f) for any prime p and r≥2.
For simplicity, let (Pk):=(P(pr)pk) and let
[TABLE]
where m(f):=min(f,r−f). To simplify our computation, we assume that p≥7.
4.1.1. r=2
Since the group Divcusp0(X0(p2))(Q) is generated by C2(f), it suffices to compute the intersection
[TABLE]
Suppose that there is a relation between C2(f), i.e., ∑f=12af⋅C2(f)=0. Then the order of the divisor X=∑f=12af⋅C2(f) is 1, and so by Corollary 3.11 we have
[TABLE]
Let nf be the order of the divisor C2(f) and let bf=nfaf. (Note that we can always take bf∈[0,1).) By direct computation, we have
[TABLE]
Thus, we have GCD(C2(f))=p and h(C2(f))=1, and so by Theorem 3.13 we have n1=n2=24p2−1>1. Also, by Lemma 3.7 we have
[TABLE]
Thus, we have
[TABLE]
If we take b1=b2=p+12, then r(X)=(2,−2,0)∈\cmcalS1(N) and r\mathchar45Pwp(X)=−2.
Thus, there is a relation between C2(f), and
this argument illustrates a way to find a relation among rational cuspidal divisors.
Indeed, this is informed by the fact that the greatest common divisor of the entries of
[TABLE]
is p+1. Now, we may replace C2(1) by C2(1)+C2(2), and let
[TABLE]
Note that
[TABLE]
Thus, if there is a relation as above, say a1⋅Bp(2,1)+a2⋅Bp(2,2)=0, then
[TABLE]
for some integer c. Thus, we get b2=0 and hence b1=0 as well, which is a contradiction. Therefore we have
[TABLE]
Since it is obvious that Divcusp0(X0(p2))(Q) is generated by Bp(2,f), we obtain
[TABLE]
Observation*.*
Note that Bp(2,1)=C2(1)+C2(2) is equal to αp(p)∗(C1(1)). Thus, we may guess that “nice” rational cuspidal divisors can be obtained from lower levels by the degeneracy maps. Also, in the proof of (4.2), we easily have bi=0 because there is a divisor δ of N such that
[TABLE]
(Also, we crucially use the fact that h(B2(2,2))=1.)
As a generalization, we will obtain a simple criterion for “linear independence” in Section 5.1. Moreover, it turns out that if D is a rational cuspidal divisor constructed from lower level using two degeneracy maps αp(N)∗ and βp(N)∗, then most of the entries of V(D) are zeros (cf. Proposition 5.4). Thus, it is easy to apply our criterion with such divisors.
4.1.2. r=3
Since the group Divcusp0(X0(p3))(Q) is generated by C3(f), we need to compute the intersections
[TABLE]
Instead, as above we replace C3(1) by βp(p2)∗(C2(2))=−C3(1)+p⋅C3(3). Also, we replace C3(2) by αp(p2)∗(B2(2,1)).
Thus, let
[TABLE]
Then it is easy to see that Divcusp0(X0(p3))(Q) is generated by Bp(3,f). Also, since
[TABLE]
and h(Bp(3,2))=h(Bp(3,3))=1, we can prove that there is no relation among them. Indeed, you can see V(Bp(3,f))p2=0,0 or 1 for f=1,2 or 3, respectively. So as above we obtain
[TABLE]
Therefore we have
[TABLE]
Also, if you check the p3-th entries of V(Bp(3,1)) and V(Bp(3,2)), then we obtain
[TABLE]
Thus, we finally have
[TABLE]
4.1.3. r=4
Now, we continue as above without hesitation. Let
[TABLE]
Next, we take Bp′(4,2):=αp(p3)∗(Bp(3,3))=p(P0)−(P3)−(P4).
Then by direct computation, we have
[TABLE]
The generation part (the group Divcusp0(X0(p4))(Q) is generated by these divisors) is easy.
On the other hand, there may exist a relation among Bp′(4,2), Bp(4,3) and Bp(4,4). Thus, we replace Bp′(4,2) by
[TABLE]
so that the p2-th entry of V(X) is zero. Since all the coefficients of X are divisible by p
(cf. Lemma 2.26), the generation part is not achieved. Therefore we replace X by Y=p1X, which does not change the vector V(X), and then we can prove the generation part.
Now, by comparing the p2-th entries we can conclude that
[TABLE]
However, there is a relation between Y and Bp(4,4). Indeed, since the greatest common divisor of the entries of
[TABLE]
is p, we can find a new divisor
[TABLE]
so that V(Bp(4,2)) is parallel to V. Now, we can prove that Divcusp0(X0(p4))(Q) is generated by Bp(4,f). Also, we have
[TABLE]
Thus, by comparing the p2-th, p3-th and p4-th entries successively, we obtain
[TABLE]
4.1.4. r=5
As above, we take
[TABLE]
Then we can show that Divcusp0(X0(p5))(Q) is generated by Bp(5,f). Also, we have
[TABLE]
Thus, by comparing the p3-th, p2-th, p4-th and p5-th entries successively, we obtain
[TABLE]
4.1.5. r=6
As in the case of r=4, we can take
[TABLE]
Note that Bp(6,2) is equal to p1αp(p5)∗(Bp(5,2))+p4⋅Bp(6,6). Then we have
[TABLE]
As above, by comparing the p3-th, p4-th, p2-th, p5-th and p6-th entries successively, we obtain
[TABLE]
4.1.6. r≥7
As above, we define
[TABLE]
Also, if f=r−2a≥3, then we set
[TABLE]
Furthermore, if f=r+1−2a≥3, then we set
[TABLE]
Finally, we set Bp(r,2):=βp(pr−1)∗(Bp(r−1,2)) if r is odd, and otherwise
[TABLE]
Then we will prove later that
[TABLE]
4.2. Case of level 2r
In this subsection, we explain our construction of B2(r,f) for any r≥5.
Let N=2r for some r≥5. As above, we hope to prove that
[TABLE]
However, the arguments above (about linear independence) break down because h(B2(r,f))=2 for some f≥2. Indeed, if r≥6 is even, then we can prove that ⟨C1⟩∩⟨C2⟩≃Z/2Z, where Ci=B2(r,r−i). Note that since
[TABLE]
the order of Ci is 2r−4 by Theorem 3.13. If we let X=C1+C2, then we have
[TABLE]
Since Pw2(X)=0, the order of X is 2r−5, which is one half of the order of Ci.
Also, since ⟨C1,C2⟩=⟨X,C1⟩, there is a relation between Ci. Moreover, we can prove that there is no relation between X and C1, which proves the claim.
So we need another idea for finding new generators to remove a possible non-trivial intersection resulting from this phenomenon.262626This seems to be a reason why Ling could not find all possible relations among Cd. One may guess if we find new generators D with Pw2(D)∈2Z or even better Pw2(D)=0, then such a phenomenon will not occur.
In our method, this simple idea will be crucial.
For simplicity, let Df:=C(2r)2f.
4.2.1. r=5
By direct computation, we have
[TABLE]
We can compute the orders of D1−2D5, D2−2D5, D3 and D4+D5, which are all 1. Thus, we have
[TABLE]
4.2.2. r=6
By direct computation, we have
[TABLE]
We can show that there is a relation between D1 and D2. Indeed, such a relation can be obtained from level 16 because the genus of X0(16) is zero (cf. Lemma 6.22).
This is very useful because we can ignore some divisors from the set of generators and our computation heavily relies on the number of generators. As a result, we can ignore the divisors D2 and D5 (and D3 in this case because D3=0) from our computation. In other words, we have
[TABLE]
As above, we can find a relation between D1 and D4, and so we replace the divisor D4 by D1+D4. Then we can prove that
[TABLE]
Since h(D1+D4)=1, by comparing the 22-th entries of V(D1+D4) and V(D6) we further have
[TABLE]
4.2.3. r=7
By direct computation, we have
[TABLE]
Again, as above D2 and D6 can be removed from the set of generators. Next, for any 3≤f≤5, we compute the intersection ⟨D1⟩∩⟨Df⟩, which is [math] if f is odd, and Z/2Z otherwise. Thus, for even f we replace Df with Ef:=Df+af⋅D1 for (some suitable af) so that the order of Ef is one half of that of Df. (This process is similar to the construction of X at the beginning of the section.)
Then we may insist that there is no relation among D1,D3,E4,D5 and D7. However, since the computation seems highly complicated, for odd f we replace Df by Ef:=Df−af⋅D1 (for some suitable af) so that Pw2(Ef)=0. Also, we replace D1 by E6:=D1−2Dr so that Pw2(E6)=0. Finally, let E7=D7. Then we have
[TABLE]
This implies that
[TABLE]
By direct computation, we have
[TABLE]
Suppose that there is a relation among {Ef:3≤f≤6}. As above, we can find bf∈[0,1) so that the order of X=∑f=36bf⋅Ef is 1. Since
r(X)=∑f=36bf⋅V(Ef)∈\cmcalS1(27),
from the 22-th and 26-th entries we have b4∈Z, and hence b4=0. From the 23-th entry, we have −5b3∈Z, and so b3=0. (Note that the denominator of bf is a power of 2.) Also, from the 25-th entry we have −5b5∈Z, and therefore b5=0. Finally, we have b6=0 as well, which is a contradiction. Thus, we have
[TABLE]
4.2.4. r≥8
By direct computation, we have
[TABLE]
where m(f)=min(f,r−f) and the dots denote zero entries.
As discussed above, for any 3≤f≤r−2, we replace Df by
[TABLE]
If r is odd, then we further replace D1 by D1−2Dr. Let
[TABLE]
so that Pw2(Er)∈2Z but Pw2(Ef)=0 for all 3≤f≤r−1. Then we have
[TABLE]
Therefore it suffices to show that there is no relation among {Ef:3≤f≤r−1}. It turns out that we can easily prove (as in the case of r=7) that
[TABLE]
Remark 4.1*.*
For an odd integer f with 3≤f≤r−2, the order of Ef is the same as that of Df. Thus, there is no relation among
[TABLE]
It might be a good exercise to prove this directly.
4.3. Case of odd squarefree level
Let N=∏i=1tpi be an odd squarefree integer with t≥2. In this subsection, we discuss our construction of Y0(d) on X0(N) for any non-trivial divisors d of N.
In this and next subsections, since all cusps of X0(N) are defined over Q, we simply write Pd for the rational cuspidal divisor (P(N)d). We use two orderings
≺ and ⊲ on \cmcalDN0=\cmcalDNsf (which are defined in Section 6.1), and write
[TABLE]
so that di≺dj (resp. δi⊲δj) if and only if i<j. (Here, m=#\cmcalDN0=2t−1.) For simplicity, we take δ0:=1, and write a vector in \cmcalS1(N) as (a0,…,am) so that its δi-th entry is ai. Note that the ordering ⊲ in this subsection is given by the colexicographic order on Δ(t), i.e., ∏i=1tpiai⊲∏i=1tpibi if and only if there is an index h such that ah=0, bh=1 and ai=bi for all i>h. Thus we have
[TABLE]
4.3.1. t=2
For simplicity, let p=p1 and q=p2 be two odd primes. In this subsection, we explain why we fail to find a basis for C(pq) and how to construct the divisors Y0(d) for d=p, q or pq.
First, the group Divcusp0(X0(pq))(Q) is generated by Cp, Cq and Cpq. As in Section 4.1, we may replace Cpq by
[TABLE]
Then by direct computation, we have
[TABLE]
Let V:=V(Cp)−V(Cq)=(q−p,1−q,p−1,0). Then the greatest common divisor of the entries of V is gcd(p−1,q−1), and so there may exist a relation between Cp and Cq. Let Y=g((q+1)Cp−(p+1)Cq), where g=gcd(p+1,q+1)−1. Then V(Y) is parallel to V, and so the divisors X,Y,Cp (or X,Y,Cq) are “good” to apply our criteria for linear independence. On the other hand, Divcusp0(X0(pq))(Q) is not generated by them unless g(p+1)=1 (or g(q+1)=1). Even in this “simple” case, it is very difficult to find “good” generators without further assumption on p and q. Thus, we study the ℓ-primary subgroup of C(pq) instead.
Now, let ℓ be a given prime. By changing the role of p and q if necessary, we may assume that
[TABLE]
This is always possible because p and q are both odd. Under this assumption, let
[TABLE]
and we take
[TABLE]
Since g(q+1) is an ℓ-adic unit, Divcusp0(X0(pq))(Q) is ℓ-adically generated by Ei, which means that
[TABLE]
By direct computation, we have
[TABLE]
where h=gcd(p−1,q−1)−1. Since h(p−1) is an ℓ-adic unit, the matrix
[TABLE]
is lower ℓ-unipotent, which means that M0 is a lower-triangular matrix whose diagonal entries are ℓ-adic units. This is enough to conclude that for odd ℓ, we have
[TABLE]
Moreover, since p is odd, we have h(E3)=1 and hence
[TABLE]
Also, since Pwq(E2)∈ℓZ (and ℓ=2) and Pwq(E1)=0, we finally get
[TABLE]
Remark 4.2*.*
As you may see in (4.3), the vectors V(C) are written as tensor products. Indeed, even more is true: we have
[TABLE]
i.e., X, Cp and Cq are all defined by tensors. (However, D(p,q) is not defined by tensors.) In Section 4.1, we did not define Bp(1,1) but it seems natural to do by
[TABLE]
which is a “unique” degree [math] divisor of level p. By the consideration above, it is natural to define
[TABLE]
which is not of degree [math] though. Note that if you imagine a cuspidal divisor of level 1, which is ∞∈X0(1), then we have Ap(1,1)=αp(1)∗(∞).
4.3.2. t=3
As above, we first fix a prime ℓ, and want to find rational cuspidal divisors Ei such that the matrix
[TABLE]
is lower ℓ-unipotent. Note that Divcusp0(X0(pqr))(Q) is generated by Cp1, Cp2, Cp3, Cp1p2, Cp1p3, Cp2p3 and Cp1p2p3. As in (4.4), we assume that
[TABLE]
and
[TABLE]
Under this assumption, we keep Cp3 unchanged because it is the one with the highest order. As above, we can find a relation between Cp1 and Cp2.
So we replace Cp1 by Y=D(p1,p2) above. Similarly, we replace Cp2 by Y1=D(p2,p3).
(Here, D(pi,pj) is a divisor in level pipj but we regard it as a divisor in level p1p2p3.)
If we use the tensor notation, it turns out that
[TABLE]
Similarly, we consider the vector {\mathbf{D}}(p_{1},p_{3})\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{A}}_{p_{2}}(1,0). By considering all possible combinations of tensor products, and by computing the vectors V(D) for such divisors D defined by tensors (using Theorem 3.15), we find natural candidates for Ei which make the matrix M0 lower ℓ-unipotent as follows.
[TABLE]
Here, ∗ denotes an arbitrary integer and ⋆ denotes an ℓ-adic unit.
Now, we can apply our criteria (for any prime ℓ) and obtain
[TABLE]
4.3.3. t≥4
As above, for a given prime ℓ we assume that
[TABLE]
and
[TABLE]
As above, for any I=(f1,…,ft)∈Δ(t), we construct a rational cuspidal divisor Y0(pI), or equivalently a vector Y0(pI)∈\cmcalS2(N) by
[TABLE]
where m=m(I) and n=n(I). (For notation, see Section 1.3.)
If we take Ei=Y0(di), then we can prove that Divcusp0(X0(N))(Q) is ℓ-adically generated by Ei using our assumption (4.5). Also, we can show the matrix
[TABLE]
is lower ℓ-unipotent. The property that M0 is lower-triangular is obtained by the definition of our orderings ≺ and ⊲. Also, the property that the diagonal entries of M0 are ℓ-adic units is deduced by our assumption (4.6).
If ℓ is odd, they are enough to conclude that
[TABLE]
Even when ℓ=2, since pi are odd, we can prove the above isomorphism.
Remark 4.3*.*
For any i, if we write Cdi=∑a(j)⋅Y0(dj) with a(j)∈Zℓ,
then a(i) is an ℓ-adic unit, which is the reason behind the definition of Y0(di).
4.4. Case of even squarefree level
Let N=∏i=1tpi be an even squarefree integer with t≥2. Also, let u be an integer such that pu=2.
We use the same notation as in the previous section.
However, the orderings ≺ and ⊲ are different from the previous ones since u≥1. For instance, the ordering ⊲ in this subsection is given by a twisted colexicographic order on Δ(t). More precisely, we define q1=pu, qi=pi+1 if i<u, and qj=pj if j>u. Then the ordering ⊲ is the colexicographic order given by qi, i.e., ∏i=1tqiai⊲∏i=1tqibi if and only if there is an index h such that ah=0, bh=1 and ai=bi for all i>h. Thus we have
[TABLE]
4.4.1. t=2
Let N=2p. As in Section 4.3.1, we replace the divisor C2p by
[TABLE]
which is of order 1. (So we can ignore it.) Also, we use the divisor
[TABLE]
depending on the ℓ-adic valuation of p+1, where g=gcd(3,p+1)−1.
Case 1: Suppose that valℓ(3)≥valℓ(p+1). Then we have
[TABLE]
By direct computation as in Section 4.3.1, we have
[TABLE]
If ℓ is odd, then we have
[TABLE]
But there is a problem272727In fact, this is not a problem in this simple case because ℓ cannot be 2. However, we consider this case as if ℓ=2 since a similar problem occurs when val2(N) is large enough. if ℓ=2: Pwp(Cp)=−Pwp(D(2,p))=−1. To fix this, we retreat our modification, and consider the divisor C2 instead. Then we have Pwp(C2)=0 and hence we can prove that
[TABLE]
Case 2: Suppose that valℓ(p+1)>valℓ(3).
Then we have
[TABLE]
By direct computation, we have
[TABLE]
Since p is odd, h(C2)=1. Thus, for any prime ℓ we have
[TABLE]
4.4.2. t=3
Let N=2pq. Fix a prime ℓ and assume that
[TABLE]
As above, we can replace the divisor C2pq by
[TABLE]
where d′=pq/d. Since the order of U2 is 1, we ignore it.
Let γ=gcd(p−1,q−1)p−1∈ℓZ.
Case 1: Suppose that valℓ(3)≥valℓ(p+1).
We take p1=2, p2=p and p3=q.
Also, we take the divisors Ei as in Section 4.3.2.
Since our ordering ⊲ is the same as the previous one, the matrix M0 is lower ℓ-unipotent. Thus, we obtain the result for an odd prime ℓ.
Now, suppose that ℓ=2. Then a problem occurs as in Case 1 of the previous subsection. So we retreat our modification for the divisor E2, i.e., replace E2 by {\mathbf{X}}_{1}={\mathbf{B}}_{p_{1}}(1,1)\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{A}}_{p_{2}}(1,0)\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{A}}_{p_{3}}(1,1).
As in Section 4.3.2, we have
[TABLE]
Still, we have a problem because Pwp3(E4) and Pwp3(E5) are both odd. Thus, we replace E4 by {\mathbf{X}}_{2}={\mathbf{B}}_{p_{1}}(1,1)\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{D}}(p_{2},p_{3}) so that Pwp3(E4)=0.
Then we can prove that
[TABLE]
Case 2: Suppose that282828So ℓ=3, but we also consider the case ℓ=2 for better understanding of the problem. valℓ(p+1)>valℓ(3)≥valℓ(q+1).
We take p1=p, p2=2 and p3=q.
Also, we take the divisors Ei as in Section 4.3.2.
Then the matrix M0 is not lower ℓ-unipotent, and so a modification is necessary. A problematic element is either E5 or E6. We replace E6 by {\mathbf{W}}={\mathbf{D}}(p_{1},p_{3})\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{A}}_{p_{2}}(1,0)
and swap the role of E5 and E6. Then we have
[TABLE]
Unfortunately, we have Pwp3(E5)=Pwp3(E4)=γ∈ℓZ, so our argument breaks down if ℓ=2. We replace E4 by {\mathbf{X}}_{3}={\mathbf{D}}(p_{1},p_{3})\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{B}}_{p_{2}}(1,1) so that Pwp3(E4)=0. Then for any prime ℓ, we can prove
[TABLE]
Case 3: Suppose that valℓ(q+1)>valℓ(3).
We take p1=p, p2=2 and p3=q.
Also, we take the divisors Ei as in Section 4.3.2.
As above, we replace E3 by {\mathbf{U}}_{3}={\mathbf{A}}_{p_{1}}(1,0)\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{A}}_{p_{2}}(1,1)\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{B}}_{p_{3}}(1,1).
By swapping the role of E2 (resp. E5) and E4 (resp. E6), we have
[TABLE]
As we can see, the matrix M0 is lower ℓ-unipotent.
However, as above we have Pwp2(E5)=Pwp2(E4)=γ∈2Z. Thus, we replace E4 by {\mathbf{X}}_{4}={\mathbf{D}}(p_{1},p_{2})\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{B}}_{p_{3}}(1,1)
so that Pwp2(E4)=0. Then we can prove
[TABLE]
Remark 4.4*.*
In Case 2, if we write p as pI for some I∈Δ(3), then we have I=(1,0,0) in our ordering of the prime divisors of N. Note that a new vector W for such an I is constructed as if positions of p and 2 are changed.
More precisely, if we rename p1=2 and p2=p then our (previous) construction for p=pJ with J=(0,1,0)∈Δ(t) is exactly W.292929A similar idea is used for the definition of Ui. For instance, if we swap the role of p2 and p3 in Case 3, then our (previous) construction for p2p3 is exactly U3.
After this modification, a problematic one is exactly the one obtained by replacing the vector A2(1,0) in W by A2(1,1). Since
A2(1,1)=−B2(1,1)+2A2(1,0), we may replace A2(1,1) by B2(1,1) and obtain a new vector X3. A similar idea is used for the definition of Xi.
4.4.3. t≥4
As in Section 4.3.3, we hope to assume that
[TABLE]
and
[TABLE]
If ℓ=2, then we cannot make both assumptions together because N is even. Since it is not difficult to make the matrix M0 lower ℓ-unipotent, and since it seems difficult to prove that Divcusp0(X0(N))(Q) is ℓ-adically generated by Ei, we keep the first assumption (4.7), and modify the second assumption (4.8) by
[TABLE]
Now, we try to construct divisors Ei so that the matrix M0 is lower ℓ-unipotent. Let di=pI for some I=(f1,…,ft)∈Δ(t). (For notation, see Section 1.3.) As the case of Ui, if I∈\cmcalE and m(I)<u, then we replace Ei by
[TABLE]
Then there are some problematic elements, which are exactly those for I∈\cmcalHu, and
as the case of W we replace Ei by
[TABLE]
Then we can indeed prove that the matrix M0 is lower ℓ-unipotent. So Ei are the ones we are looking for if ℓ is odd. On the other hand, if ℓ=2, then we cannot apply our criteria for linear independence, so we will replace some problematic elements as follows.
Case 1: u=s≥2. In this case, problematic elements are exactly those for I∈\cmcalFs′, i.e., I=Es(n) for some n∈\cmcalIs. By direct computation, if di=pEs(n) for some n∈\cmcalIs, then we have
[TABLE]
Also, we have {\mathbf{E}}_{i-1}=\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}_{i=2,\,i\neq n}^{t}{\mathbf{A}}_{p_{i}}(1,1)\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{D}}(p_{1},p_{n}) and
[TABLE]
Thus, we replace Ei−1 by
[TABLE]
so that Pwpn(Ei−1)=0 (cf. the cases of X3 and X4).
Case 2: u=s=1. As in Case 1, we replace Ei−1 by the following
[TABLE]
so that Pwpn(Ei−1)=0 (cf. the case of X2). Still, there is a problematic element, which is exactly E3. Motivated by X1, we replace E2 by
[TABLE]
After such modifications, we can finally prove that for any prime ℓ
[TABLE]
(In fact, ⟨E1⟩ is not necessary as it is trivial.)
4.5. The definition of Z(d) and Y2(d)
Let N=∏i=1tpiri be the prime factorization of N.
Motivated by the previous study (cf. (4.7) and (4.9)), we make the following assumption.
Assumption 4.5* (Assumption 1.14).*
For a given prime ℓ, by appropriately ordering the prime divisors of N, we assume the following.
[TABLE]
If N is odd, we set u=0, and we define u as the smallest positive integer such that pu=2 if N is even. If ℓ is odd, we set s=0, and if ℓ=2 then we set s=u. We further assume that
[TABLE]
In this subsection, we define rational cuspidal divisors Z(d) on X0(N) for any non-trivial divisors d of N.
To do so, we first define various vectors in \cmcalS2(pr) for a prime p and an integer r≥1.
Definition 4.6**.**
Let α(i) and β(i) be the maps from \cmcalS2(pr−i) to \cmcalS2(pr) defined by the degeneracy maps π1(pr,pr−i)∗ and π2(pr,pr−i)∗, respectively. For instance, α(i) is the composition of the maps:
[TABLE]
Also, let γ be the map from \cmcalS2(pr−2) to \cmcalS2(pr) defined by the degeneracy map p1×π12(pr−2)∗ (which is well-defined by Lemma 2.26).
For any 0≤f≤r, we define a vector Ap(r,f) in \cmcalS2(pr) as follows:
[TABLE]
Let r≥2. We then define
[TABLE]
Also, for any 1≤f≤r−1, we define
[TABLE]
Finally, for any r≥1 and 1≤f≤r, we define a vector Bp(r,f) in \cmcalS2(pr)0 by
[TABLE]
Remark 4.7*.*
By definition, we easily have Ap(r,1)=α(r)(1).
Let M>1 be an integer relatively prime to p. For simplicity, let
[TABLE]
Then by Lemma 2.22 and Remark 2.25, for any divisor C∈Divcusp(X0(M))(Q) we easily have
[TABLE]
For p=2 and r≥5, we define another vector in \cmcalS2(2r) as follows.
Definition 4.8**.**
For any 3≤f≤r−2, we define
[TABLE]
where m(f)=min(f,r−f) and Ok is the zero vector of size k. Also, we define
[TABLE]
Now, we define vectors Z(d) for any non-trivial divisors d of N. Let d=pI for some I=(f1,…,ft)∈Ω(t). First, we construct {\mathbf{Z}}^{\prime}({\mathfrak{p}}_{I}):=\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}_{i=1}^{t}{\mathbf{A}}_{p_{i}}(r_{i},f_{i}).
Then it is not of degree [math] if I∈Δ(t). Thus, we replace them and define
[TABLE]
where m=m(I). Now, it seems like we are almost done, but as in Section 4.2 we have to replace some, so finally we define
[TABLE]
Remark 4.9*.*
Note that we easily have
[TABLE]
Thus, if we set
[TABLE]
where ϵ=21+(−1)r,
then we have
[TABLE]
In other words, the vectors B2(r,f) are just rearrangements of Ek. Note that
the definition of B2(r,f) is designed so that GCD(B2(r,f))=GCD(B2(r,f)) for any 3≤f≤r, which can be checked directly. Thus, by Theorem 3.15 we have
[TABLE]
Next, we define rational cuspidal divisors Y2(d) for any non-trivial squarefree divisors d of N.
Definition 4.10**.**
For any 1≤i<j≤t, let
[TABLE]
where γi:=piri−1(pi+1). For I=(f1,…,ft)∈Δ(t) with m=m(I), n=n(I) and k=k(I), we define Y2(pI)∈\cmcalS2(N)0 as follows.
[TABLE]
where x=max(m,u) and y=max(1,3−s).
Remark 4.11*.*
As we have already seen in Section 4.3.3, the definition of Y0(pI) is simple. In fact, the divisors Y2(pI) are constructed in three steps as in Section 4.4.3. First, let
[TABLE]
Next, we replace some problematic elements when u≥1, and let
[TABLE]
Finally, when N is even and ℓ=2, i.e., s=u≥1,
we get Y2(pI) as above by replacing the elements Y1(pI) for I∈\cmcalFs∪\cmcalGs. (If s=0, then \cmcalFs=\cmcalGs=∅, and so we have Y1(pI)=Y2(pI).)
5. Strategy for the computation
In this section, we sketch our idea for computing the group C(N) and provide necessary tools for later use.
5.1. Criteria for linear independence
Throughout this section, let Ci∈Divcusp0(X0(N))(Q) for any 1≤i≤k. Inspired by our investigation in section 4, we have the following.
Theorem 5.1**.**
Suppose that there is a divisor δ of N such that
[TABLE]
Suppose further that either h(Ck)=1 or there is a prime p such that
[TABLE]
Then we have
[TABLE]
Theorem 5.2**.**
Suppose that there is a divisor δ of N such that
[TABLE]
If ℓ is odd, then we have
[TABLE]
Theorem 5.3**.**
Suppose that one of the following holds.
- (1)
h(Ck)=1* and there is a divisor δ of N such that*
[TABLE]
2. (2)
There is a prime p such that
[TABLE]
Then we have
[TABLE]
We finish this section by proving all the theorems above.
Proof.
Suppose that there are integers ai such that
[TABLE]
Let X=∑i=1kai⋅Ci∈Divcusp0(X0(N))(Q). Then by Corollary 3.11, we have
[TABLE]
To prove Theorem 5.1, it suffices to show that ak⋅Ck=0.
Therefore by Corollary 3.11, it is enough to prove that
[TABLE]
Note that for any 1≤i<k, we have r(ai⋅Ci)δ=0 because we assume that V(Ci)δ=0. Thus, we have
[TABLE]
Since ∣V(Ck)δ∣=1, this implies that r(ak⋅Ck)∈\cmcalS1(N). Indeed, for any divisor d of N, we have
[TABLE]
Thus, we have GCD(ak⋅Ck)=∣r(ak⋅Ck)δ∣ and so
[TABLE]
This implies that for any prime q, we have
[TABLE]
Suppose first that h(Ck)=1, i.e., Pwq(Ck)∈2Z for any prime q. Then by (5.2), we have r\mathchar45Pwq(ak⋅Ck)∈2Z.
Suppose next that there is a prime divisor p of N such that Pwp(Ck)∈2Z and Pwp(Ci)=0 for all 1≤i<k.
Since r\mathchar45Pwp(X)∈2Z and r\mathchar45Pwp(Ci)=0 for all 1≤i<k, we have
[TABLE]
Therefore by (5.2), we have r(ak⋅Ck)δ∈2Z since Pwp(Ck)∈2Z. Again by (5.2), we easily obtain r\mathchar45Pwq(ak⋅Ck)∈2Z.
This completes the proof of Theorem 5.1.
Next, we prove Theorem 5.2.
As above, it suffices to show that there is an integer a not divisible by ℓ such that
[TABLE]
As above, we have
[TABLE]
In particular, we have valℓ(r(ak⋅Ck)δ)≥0.
Since V(Ck)δ∈Zℓ×, by the same argument as in (5.1),
we have
valℓ(r(ak⋅Ck)d)≥0 for any divisor d of N. Thus, there is an integer b∈Zℓ× such that r(b⋅ak⋅Ck)∈\cmcalS1(N). Since ℓ is odd, we can take a=2b∈Zℓ×. Indeed, since r(b⋅ak⋅Ck)∈\cmcalS1(N), we also have
r(2b⋅ak⋅Ck)∈\cmcalS1(N) and r\mathchar45Pwq(2b⋅ak⋅Ck)=2⋅r\mathchar45Pwq(b⋅ak⋅Ck)∈2Z for any primes q, as desired.
This completes the proof of Theorem 5.2.
Finally, we prove Theorem 5.3. Again, it suffices to show that there is an odd integer a such that
[TABLE]
Suppose first that h(Ck)=1 and there is a divisor δ of N such that V(Ck)δ is odd and V(Ci)δ=0 for all i<k. As above, there is an odd integer b such that r(b⋅ak⋅Ck)∈\cmcalS1(N). Also, similarly as (5.2), we have r\mathchar45Pwq(b⋅ak⋅Ck)∈2Z for any prime q because Pwq(Ck)∈2Z. Thus, we can take a=b.
Suppose next that there is a prime p such that Pwp(Ck) is odd and Pwp(Ci)=0 for all i<k. Since r(ak⋅Ck)=b⋅V(Ck) for some b∈Q×, we have r\mathchar45Pwp(ak⋅Ck)=b⋅Pwp(Ck). Since we have
[TABLE]
and Pwp(Ck) is odd, we have val2(b)≥1. Now, we take a as the denominator of b.
Since val2(b)≥1, a is odd and ab∈2Z. Thus, we have r(a⋅ak⋅Ck)=ab⋅V(Ck)∈\cmcalS1(N). Also, since r\mathchar45Pwq(a⋅ak⋅Ck)=ab⋅Pwq(Ck) for any prime q, we have r\mathchar45Pwq(a⋅ak⋅Ck)∈2Z, as desired.
This completes the proof of Theorem 5.3.
∎
5.2. First attempt
Using suitable orderings ≺ and ⊲ on \cmcalDN0, we write
[TABLE]
so that di≺dj (resp. δi⊲δj) if and only if i<j, where m=#\cmcalDNsf and n=#\cmcalDN0. (Thus, for any non-trivial squarefree divisor a of N and for any non-squarefree divisor b of N, we have a≺b and a⊲b.)
- (1)
Find a rational cuspidal divisor Di such that the images Di span the group C(N), or more generally
[TABLE]
2. (2)
Show that the matrix M:=(∣V(Di)δj∣)1≤i,j≤n is lower unipotent, i.e.,
M is lower-triangular and ∣V(Di)δi∣=1 for all i.
3. (3)
Prove that h(Di)=1 for all i=1.
Then by successively applying Theorem 5.1, we easily have
[TABLE]
Although our requirements are quite strong, this strategy indeed works when N is an odd prime power (Section 6.2).
5.3. Second attempt
As above, we define two orderings ≺ and ⊲ on \cmcalDN0. But since some arguments break down in general, we modify our previous strategy a little bit.
- (1)
Find a rational cuspidal divisor Di such that the images Di span the group C(N), or more generally
[TABLE]
2. (2)
Show that the matrix M
is of the form
[TABLE]
where U is a lower unipotent matrix of size n−m, and O is the m×(n−m) zero matrix.
3. (3)
Prove that h(Di)=1 for any m<i≤n.
We then have
[TABLE]
where C(N)sf=⟨Di:1≤i≤m⟩. Indeed, the equality follows from (1), and the isomorphism follows by successively applying Theorem 5.1 as we have (2) and (3).
Now, we fix a prime ℓ and compute the ℓ-primary subgroup of C(N)sf.
- (4)
Find a rational cuspidal divisor Ei such that
[TABLE]
2. (5)
Show that the matrix M0=(V(Ei)δj)1≤i,j≤m is lower ℓ-unipotent, i.e., M0 is lower-triangular and V(Ei)δi∈Zℓ× for all i.
If ℓ is odd, then by successively applying Theorem 5.2, we have
[TABLE]
If ℓ=2, then we need one more step.
- (6)
Prove that for any 1<i≤m, either h(Ei)=1 or there is a prime p such that
[TABLE]
Then by successively applying Theorem 5.3, we have
[TABLE]
By taking Di=Z(di) and Ei=Y0(di), our strategy works if N is odd.
5.4. Third attempt
Let N=∏i=1tpiri be the prime factorization of N.
We fix a prime ℓ and make Assumption 4.5.
We assume that N is even, i.e., u≥1. For simplicity, let r=ru.
In the previous section, we in fact verify (1) and (2) for Di=Z1(di).303030Note that Z(d)=Z1(d) for any d∈\cmcalDN0 if N is odd. So if we take Di=Z1(di), then (1) and (2) can be verified without any change, but (3) is not fulfilled in general. Problematic elements are some of Z1(di) with m<i<m+r as we have already seen in Section 4.2. Thus, we ignore them at the moment, and by mimicking the strategy in the previous section, we obtain313131Note that Z1(di)=Z(di) for any i≥m+r.
[TABLE]
Next, we deal with the group C(N)sf[ℓ∞]. If we take Ei=Y0(di) as in the previous section, then (4) works without any change because (4.10) guarantees that (5.3) holds.
However, (5) works only when u=1. Note that the vector D(piri,pjrj) is designed so that V(D(piri,pjrj))pj∈Zℓ× for any i<j under the assumption that valℓ(pi−1)≤valℓ(pj−1).
Since pu−1=1, this property may not hold if j=u. This is why we take a twisted colexicographic order for the ordering ⊲ on \cmcalDNsf with the cost that the matrix M0 is not lower-triangular any more. With this new ordering ⊲, we try to make the matrix M0 lower ℓ-unipotent. Fortunately, if we take Ei=Y1(di), then the matrix M0 is lower ℓ-unipotent. If ℓ is odd, by successively applying Theorem 5.2, we have
[TABLE]
Suppose that ℓ=2. We hope to verify (6) for Ei=Y1(di). If u=s≥1, then ps−1=1 is odd, and so some arguments break down. As above, we replace some problematic elements and finally take Ei=Y2(di). Although the matrix M0 is not lower-triangular any more, we can successively apply Theorem 5.3 and obtain that
[TABLE]
as desired.
Lastly, we hope to prove
[TABLE]
which will be done in Section 6.7.
This completes the computation of C(N).
5.5. Degeneracy maps revisited
Let N=Mpr with gcd(M,p)=1. For simplicity, let α:=αp(N)∗ (resp. β:=βp(N)∗) be the degeneracy map
from Div(X0(N)) to Div(X0(Np)).
Proposition 5.4**.**
Let C∈Divcusp(X0(N))(Q). If r=0, then we have
[TABLE]
If r≥1, then for any divisor d of M we have
[TABLE]
and
[TABLE]
Proof.
First, let r=0. Then we have \Phi_{Np}(\alpha(C))=\Phi_{N}(C)\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}\left(\begin{smallmatrix}p\\
1\end{smallmatrix}\right) by Lemma 2.22. Thus, we have
[TABLE]
as desired. Similarly, we obtain the formula for V(β(C)).
Next, let r≥2. Let ΦN(C)=∑d∣M,0≤f≤ra(dpf)⋅e(N)dpf. Also, for each 0≤f≤r, let Af=∑d∣Ma(dpf)⋅e(M)d∈\cmcalS2(M) and write ΦN(C)=(A0,A1,…,Ar−1,Ar)t, i.e., the pf-th block vector is Af. If we write
[TABLE]
(so that the pf-th block vector is Af′), then by Lemma 2.22 we have Ar+1′=Ar and Af′=af⋅Af for any 0≤f≤r, where af=p if f≤[r/2], and af=1 otherwise.
For simplicity, let
[TABLE]
and
[TABLE]
To prove the result for V(α(C)), we must show that Fr+1=0 and Ff=pEf for any 0≤f≤r. First, we easily have Fr+1=0 because the last row of Υ(pr+1) is (…,−p,p), where the dots denote zero entries. Next, let Bf=Υ(M)×Af for any 0≤f≤r. Then we have
[TABLE]
where m(f)=min(f,r−f). Also, we have
[TABLE]
where n(f)=min(f,r+1−f). Thus, we easily have Ff=pEf for f=0 or r.
Finally, suppose that 1≤f≤r−1. Then we have
[TABLE]
and
[TABLE]
Note that if i≤[r/2], then ai=p and m(i)=n(i)=i. If i>[r/2], then ai=1 and n(i)=r+1−i=m(i)+1. Thus, we have pn(i)ai=pm(i)+1 for all 1≤i≤r, and so we have Ff=pEf for any 1≤f≤r−1. This completes the proof of V(α(C)). Similarly, we obtain the result for V(β(C)).
Lastly, let r=1. Since the result easily follows by the same argument as in the case of r≥2, we leave the details to the readers.
∎
6. The structure of C(N)
In this section, we compute the structure of the rational cuspidal divisor class group C(N) of X0(N) based on the strategy in the previous section. More precisely, we prove the following, which is a combination of Theorems 1.5 and 1.6.
Theorem 6.1**.**
Let N be a positive integer. Then we have
[TABLE]
Also, for any prime ℓ, we have
[TABLE]
Furthermore, the orders of Z(d) and Y2(d) are n(N,d) and N(N,d), respectively.
Throughout this section, let N=∏i=1tpiri be the prime factorization of N,
[TABLE]
As already mentioned, we write
[TABLE]
6.1. The orderings ≺ and ⊲
As already mentioned, we define two orderings ≺ and ⊲ on \cmcalDN0, and write
[TABLE]
so that di≺dj (resp. δi⊲δj) if and only if i<j. For simplicity, we set δ0=1. Since any divisor of N can be written as pI for some I=(f1,…,ft)∈Ω(t), we define two orderings on Ω(t) instead.
Motivated by the idea in Section 4.1, we first define the following.
Definition 6.2**.**
Let r be a positive integer. On the set {0,1,…,r}, we define the orderings ≺r and ⊲r as follows.
- (1)
If r=1, then we define
[TABLE]
2. (2)
If r=2, then we define
[TABLE]
3. (3)
If r=3, then we define
[TABLE]
4. (4)
If r≥4, then we define
[TABLE]
where k=[(r+1)/2].
We define a bijection ιr on {0,1,…,r} so that i≺j if and only if ιr(i)⊲ιr(j). For instance, if r is large enough, then we set ιr(1)=0,
ιr(0)=1, ιr(2)=r, ιr(r)=r−1, …, ιr(5)=k−(−1)r, ιr(4)=k+(−1)r and ιr(3)=k.
Next, we define the orderings ≺ and ⊲ on □(t) as follows.
Definition 6.3**.**
Let I=(a1,…,at) and J=(b1,…,bt) be two elements in □(t).
We write I≺J (resp. I⊲J) if and only if one of the following holds.
- (1)
au≺rubu (resp. au⊲rubu) and aj=bj for all j different from u.
2. (2)
There is an index k=u such that
ak≺rkbk (resp. ak⊲rkbk) and
aj=bj for all j>k different from u.
Then, we define the orderings ≺ and ⊲ on Δ(t) as follows.
Definition 6.4**.**
Let I=(a1,…,at) and J=(b1,…,bt) be two elements in Δ(t).
We write I⊲J if and only if one of the following holds.
- (1)
au=0, bu=1 and aj=bj for all j different from u.
2. (2)
There is an index k=u such that ak=0, bk=1 and
aj=bj for all j>k different from u.
Also, we write I≺J if and only if ι(I)⊲ι(J), where the map ι is defined below.
Finally, we define the orderings ≺ and ⊲ on Ω(t) as follows.
Definition 6.5**.**
For two elements I and J in Ω(t), we write I≺J (resp. I⊲J) if and only if one of the following holds.
- (1)
I∈Δ(t) and J∈□(t).
2. (2)
I,J∈□(t) and I≺J (resp. I⊲J).
3. (3)
I,J∈Δ(t) and I≺J (resp. I⊲J).
As mentioned above, we define a map ι:Δ(t)→Δ(t) as follows.
Definition 6.6**.**
For any I=(a1,…,at)∈Δ(t) with m=m(I), n=n(I) and x=max(m,u), we set ι(I):=(b1,…,bt), where
- (1)
bi=1−ai for all i=x and bx=1 if I∈\cmcalE,
2. (2)
bm=1, bu=0 and bi=1−ai for all i=m,u if I∈\cmcalHu1,
3. (3)
bi=1−ai for all i if I∈\cmcalE∪\cmcalHu1.
Remark 6.7*.*
Let I=(a1,…,at)∈Δ(t) with m=m(I), n=n(I) and x=max(m,u). Also, let ι(I)=(b1,…,bt). By definition, we have the following.
- (1)’
If I∈\cmcalE and x=m, then bi=1 for all i≤m, and bj=0 for all j>m.
2. (1)′
If I∈\cmcalE and x=u>m, then bi=1 for all i<m and i=u, and bj=0 for all j≥m different from u.
3. (2)’
If I∈\cmcalHu1, then bi=1 for all i≤m, and bi=0 for all i>m.
4. (3)’
If I∈\cmcalE∪\cmcalHu1, then bi=1−ai for all i. In particular, bm=1−am=0 and bn=1−an=1.
In cases (1) and (2), there seems no difference on bi. However, we have m≥u in case (1), but m<u in case (2) as we have m<n by definition.
Lemma 6.8**.**
The map ι is a bijection on Δ(t) for any t.
Proof.
As above, let I=(a1,…,at)∈Δ(t) with m=m(I), n=n(I) and x=max(m,u). Also, let ι(I)=(b1,…,bt).
We first claim that ι(I)∈Δ(t). By definition, it is easy to see that bi∈{0,1}, so it suffices to prove that bi=1 for some i.
In case (1) (resp. (1)′, (2), and (3)) in Remark 6.7, we have bm=1 (resp. bu=1, bm=1, and bn=1). Therefore the claim follows.
Next, we prove that the map ι is bijective. Since Δ(t) is a finite set, it suffices to show that ι is injective. Let J=(e1,…,et)∈Δ(t) with m′=m(J), n′=n(J) and x′=max(m′,u).
Also, let ι(J)=(f1,…,ft). If I and J are both in the same case in Remark 6.7, then it is easy to prove that I=J whenever ι(I)=ι(J). So it suffices to prove that bi=fi for some i in the following cases.
- (1)’
Assume that I∈\cmcalE and x=m. Then we have u≤m.
- (a)
Suppose that J∈\cmcalE and x′=u>m′. Since m′<u≤m, we have bm′=1.
By definition, we have em′=1. Since m′=x′, we have fm′=1−em′=0.
Thus, we have bm′=fm′.
2. (b)
Suppose that J∈\cmcalHu1. Then we have m′<n′=u≤m.
Since fj=0 for all i>m′, we have fm=0. Since bm=1, we have bm=fm.
3. (c)
If J∈\cmcalE∪\cmcalHu1, then fm′=0 and fn′=1 with m′<n′.
If m′≤m, then bm′=1. If m′>m then n′>m′>m and so bn′=0. Thus, we have either bm′=fm′ or bn′=fn′.
2. (1)′
Assume that I∈\cmcalE and x=u>m.
- (a)
If J∈\cmcalHu1, then we have fu=0. Since bu=1, we have bu=fu.
2. (b)
If J∈\cmcalE∪\cmcalHu1, then fm′=0 and fn′=1 with m′<n′.
If n′=u then either bm′=fm′ or bn′=fn′ as in case (1)-(c) above.
Suppose that n′=u. Since J∈\cmcalHu1, we have k′≤t, and so fk′=1−ek′=1.
As above, we have either bm′=fm′ (if m′<m) or bk′=fk′ (otherwise).
3. (2)’
Assume that I∈\cmcalHu1. If J∈\cmcalE∪\cmcalHu1, then we have fm′=0 and fn′=1.
As in case (1)-(c) above, we have either bm′=fm′ or bn′=fn′.
This completes the proof.
∎
Remark 6.9*.*
We define the map ι□:□(t)→□(t) by
[TABLE]
Also, we define the map ιΩ:Ω(t)→Ω(t) by ιΩ=ι□ if I∈□(t), and ιΩ=ι otherwise.
Then by definition, the map ιΩ is an order-preserving bijection from (Ω(t),≺) to (Ω(t),⊲). In other words, we have ιΩ(di)=δi for any i.
Remark 6.10*.*
Suppose that N is divisible by 4 and let r=ru≥2. For any 2≤f≤r, let If:=(a1,…,at)∈□(t) such that au=f and ai=1 for all i=u. Then by definition, dm+1=2⋅rad(N)=pI2. Also, we have
[TABLE]
which is equal to \cmcalTu if r≥5. Furthermore, if i≥m+r, then there is an index h=u such that valph(di)≥2.
The following will be used later.
Lemma 6.11**.**
Suppose that t≥2 and s≥1. If n∈\cmcalIs, then we have
[TABLE]
Also, we have
[TABLE]
where ϵ=1 if s<n, and ϵ=0 if n<s.
Proof.
Let I=Es(n) and J=E(n). By definition, n(I)=min(n,s)≤t and k(I)=max(n,s)≤t. Therefore I∈\cmcalE∪\cmcalHu1. Also, since n(J)=n≤t and n=s, we have J∈\cmcalE∪\cmcalHu1. Thus, we have ι(I)=Fs(n) and ι(J)=F(n) by definition.
Now, consider a well-ordered set (Δ(t),⊲).
By definition, the largest element less than Fs(n) is F(n). Furthermore, for any K=(a1,…at)∈Δ(t), K is smaller than F(n)
(or equivalently, K⊲F(n)) if and only if ai=0 for all i≥n different from s. Thus, the number of such elements is exactly 2n−1 if n<s, and 2n−1−1 otherwise. Therefore the result follows.
∎
Lemma 6.12**.**
Suppose that t≥2. Then we have
[TABLE]
Also, we have
[TABLE]
Furthermore, we have
[TABLE]
Proof.
By definition, it is easy to verify the formula for δi. Also, by the definition of the map ι we can easily verify that ι(di)=δi for 1≤i≤3.
∎
6.2. Case of t=1 and u=0
In this subsection, we prove the following.
Theorem 6.13**.**
For an odd prime p and r≥1, we have
[TABLE]
Also, the order of Bp(r,f) is
[TABLE]
To begin with, we prove the following.
Proposition 6.14**.**
For any r≥1, we have
[TABLE]
Proof.
By the result in Section 2.6, we can explicitly compute Ap(r,f). Indeed, we have Ap(r,1)pk=pmax(r−2k,0) by Lemma 2.24 (and Remark 4.7). Also, by Lemmas 2.22 and 2.26, we have the following. If r≥3 is odd, then we have
[TABLE]
where Kp(j)=∑i=0jp2i. Also, if r≥2 is even, then
[TABLE]
Moreover, by Lemma 2.22 we have the following.
If 3≤f=r−2a≤r, then
[TABLE]
and if 3≤f=r+1−2a≤r, then
[TABLE]
Using this description, we first prove that \cmcalS2(pr)⊂⟨Ap(r,f):0≤f≤r⟩. For simplicity, let ek:=e(pr)pk.
Then it suffices to show that for any 0≤k≤r, there are integers a(k,f) such that
[TABLE]
This is obvious for k=0 because Ap(r,0)=e0. Since Ap(r,r)=e0−er, the claim follows for k=r. By direct computation, we easily find a(k,f) for small r, so we assume that r≥5. Since Ap(r,r−1)=e0+e1−p⋅er, the claim follows for k=1. Also, since Ap(r,r−2)=p⋅e0−er−1−er, the result follows for k=r−1. Doing this successively, we can find a(k,f) for any k different from m or m+1, where m=[2r−1].
Note that Ap(r,2)pm=1 and Ap(r,2)pm+1=0. Thus, we obtain the result for k=m using the vectors Ap(r,f) with f=1. Also, since Ap(r,1)pm+1=1, the result for k=m+1 follows. Since the other inclusion is obvious, we obtain \cmcalS2(pr)=⟨Ap(r,f):0≤f≤r⟩.
Next, we prove that \cmcalS2(pr)0⊂⟨Bp(r,f):1≤f≤r⟩. By Lemma 2.19, the group \cmcalS2(pr)0 is generated by
[TABLE]
Since ek=∑f=0ra(k,f)⋅Ap(r,f), we easily obtain
[TABLE]
As the other inclusion is obvious, this completes the proof.
∎
Next, we define vectors Ap(r,f) and Bp(r,f) in \cmcalS1(pr).
Definition 6.15**.**
For any 0≤f≤r, we define a vector Ap(r,f) in \cmcalS1(pr) by
[TABLE]
where Oa=(0,…,0) is the zero vector of size a.
Also, for any 1≤f≤r, we set
[TABLE]
Then, the following is obvious from our construction.
Lemma 6.16**.**
For any 0≤f≤r, we have
[TABLE]
where
[TABLE]
Also, for any 1≤f≤r we have
[TABLE]
Proof.
If r≤2, we can easily verify the formulas by direct computation. For r≥3, by definition it is obvious that
[TABLE]
So by applying Proposition 5.4, we can easily prove the formulas at least for f=2.
Suppose the formula for f=2 holds for r−1, i.e.,
[TABLE]
If r−1 is even, then by Proposition 5.4 we have
[TABLE]
If r−1 is odd, then we have
[TABLE]
Also, we have
[TABLE]
Thus, by definition we have
[TABLE]
By induction, the formula for f=2 holds for any r≥3.
∎
Remark 6.17*.*
For any prime p and an integer r≥1, we have
[TABLE]
This is the reason behind the definition of \cmcalGp(r,f). Also, we have
[TABLE]
By definition, it is easy to see that the greatest common divisor of the entries of Ap(r,f) (resp. Bp(r,f)) is 1. Thus, we have
[TABLE]
Also, we can easily prove the following.
Lemma 6.18**.**
For any 0≤f≤r, we have
[TABLE]
Proof.
It is easy to check for small r, so suppose that r is large enough. Then by definition, we have
Ap(r,0)p=−1 and Ap(r,0)pk=0 for all 1<k≤r.
Ap(r,1)1=1 and Ap(r,1)pk=0 for all 0<k≤r.
Ap(r,2)pr=−1 and Ap(r,2)pk=0 for all 1<k<r.
Ap(r,r)pr−1=1 and Ap(r,r)pk=0 for all 1<k<r−1.
Ap(r,r+1−2a)p1+a=−1 and Ap(r,r+1−2a)pk=0 for all 1+a<k<r−1.
Ap(r,r−2a)pr−1−a=1 and Ap(r,r−2a)pk=0 for all 1<k<r−1−a.
In the last two items, a is any positive integer satisfying r+1−2a≥3 and r−2a≥3, respectively.
Thus, the result follows by the definition of the map ιr.
∎
As a corollary, the following is obvious.
Corollary 6.19**.**
For each 1≤i≤r with di=pfi, let Di=Bp(r,fi).
Then the matrix M=(∣V(Di)δj∣)1≤i,j≤r is lower unipotent, or equivalently for any 1≤i≤r, we have
[TABLE]
Finally, the following is easy to verify from the previous discussion.
Lemma 6.20**.**
For any 1≤f≤r, we have
[TABLE]
Proof of Theorem 6.13.
As above, for each 1≤i≤r with di=pfi, we take Di=Z(di)=Z(p(fi))=Bp(r,fi). For example, D1=Z(p) and Dr=Z(p3) (resp. Z(p2)) if r≥3 (resp. r=2).
To prove the first assertion, we follow the strategy in Section 5.2.
By Proposition 6.14 and Corollary 6.19, (1) and (2) are fulfilled, respectively.
Since p is odd, we have h(Di)=1 for any 2≤i≤r by Lemma 6.20, and therefore (3) is satisfied. This completes the proof of the first assertion.
The second assertion follows by Lemmas 6.16 and 6.20, and Theorem 3.13.
∎
6.3. Case of t=u=1
In this subsection, we prove the following.
Theorem 6.21**.**
If r≤4, then we have C(2r)=0. Suppose that r≥5. Then we have
[TABLE]
Also, the order of B2(r,f) is
[TABLE]
As in Section 4.2, let Df=C(2r)2f.
Proof.
Suppose that r≤4. Since the genus of X0(2r) is zero, we have C(2r)=0. Thus, we assume that r≥5.
We first claim that323232In contrast to Proposition 6.14, the vectors Ek cannot generate \cmcalS2(2r)0 because the number of Ek is smaller than r, the rank of \cmcalS2(2r)0. So we need at least two relations among Df.
[TABLE]
Note that the second equality follows by Remark 4.9.
Note also that the group Divcusp0(X0(2r))(Q) is generated by Df for any 1≤f≤r by Lemma 2.19.
Thus, by Lemma 6.22 below, it suffices to show that for any 1≤f≤r different from 2 and r−1, there are integers a(f,k) such that Df=∑k=3ra(f,k)⋅Ek. This is obvious by definition. Indeed, we have Dr=B2(r,r) and
[TABLE]
Since {B2(r,r−1),B2(r,r)}={Er−1,Er}, the result for f=1 and f=r follows. Also, for any 3≤f≤r−2, we have
[TABLE]
This completes the proof of the claim.
Next, we claim that there is no relation among Ek.
To begin with, we compute GCD(Ek), V(Ek) and Pw2(Ek).
If r=5, we can easily compute them and have C(32)≃Z/4Z (cf. Section 4.2.1)333333By direct computation, B2(5,3)=B2(5,4)=0 and ⟨B2(5,5)⟩≃Z/4Z..
So we assume that r≥6. By direct computation, we have the following.
[TABLE]
Since Pw2(Er)=−3 and Pw2(Ek)=0 for all 3≤k<r, by Theorem 5.3 we have343434Note that the orders of Ek are powers of 2, and so we can apply Theorem 5.3.
[TABLE]
Suppose that there are integers ak such that
[TABLE]
For simplicity, let nk be the order of Ek. Since h(Ek)=1 for any 3≤k≤r−1, by Theorem 3.13 we have
[TABLE]
Thus, as in (4.1) we have
[TABLE]
Let X=∑k=3r−1ak⋅Ek. Since X=0, by Corollary 3.11 we have
[TABLE]
Since (6.1) does not change if we replace ak by ak−c⋅nk for any c∈Z, we may assume that 0≤ak<nk.
For simplicity, let bk=nkak∈[0,1).
As in Section 4.2.3, we have
[TABLE]
For an integer c≥1, let M(c) denote the set of rational numbers between [math] and 1 whose denominators are exactly 2c when reduced to lowest terms, i.e.,
[TABLE]
Since nk is a power of 2, we have bk=0 or bk∈M(c) for some c≥1.
Suppose that br−1=0. Since xr∈Z, we have br−1∈M(1). By the condition for xr−1, we have br−2∈M(2). Also, by the condition for xr−2 we have br−3∈M(3). Similarly, we can deduce that br−k∈M(k) for any 3≤k≤r−1 by the conditions for xj with 4≤j≤r−3. On the other hand, we then have x3=−5b3+2b4∈M(r−3), which is a contradiction to xj∈Z. Thus, we have br−1=0.
Next, suppose that bs=0 for some 3≤s≤r−2 and bi=0 for all s<i≤r−1.
By the condition for xs+1, we have bs∈M(1). Similarly as above, we obtain bk∈M(s−k+1) by the conditions for xj with 4≤j≤r−2.
In particular, we have b4∈M(s−3) and b3∈M(s−2). As above, we then have x3∈M(s−2), which is a contradiction to xj∈Z. Thus, we have bi=0 for all 3≤i≤r−1.
This completes the proof of the claim.
Finally, by (6.2) and the table above, we obtain the result for the order. This completes the proof.
∎
Lemma 6.22**.**
For an integer r≥5, we have
[TABLE]
and
[TABLE]
Proof.
Note that the genus of X0(16) is zero, and so J0(16)=0. Thus, any pullbacks of the divisors on X0(16) by the degeneracy maps become trivial in J0(2r). In particular, we have
[TABLE]
By direct computation using Lemma 2.22, we have
[TABLE]
and
[TABLE]
This completes the proof.
∎
6.4. First reduction
From now on, we assume that t≥2. In this subsection, we prove the following.
Theorem 6.23**.**
We have
[TABLE]
where r=1 if u=0 and r=ru if u≥1.
First, we claim the following.
Proposition 6.24**.**
For any t≥2, we have
[TABLE]
Proof.
We prove this by induction on t.
Since the claim holds for t=1 by Proposition 6.14, we suppose that
[TABLE]
Here, we use the notation Z1(M,d′) to emphasize that they are of level M (and to distinguish them from Z1(d′), which are of level N). In other words, for any J=(f1,…,ft−1)∈Ω(t−1), we have
[TABLE]
To prove the assertion, it suffices to show that \cmcalS2(N)0⊂⟨Z1(d):d∈\cmcalDN0⟩ since the other inclusion is obvious. By Lemma 2.19, it suffices to show that
for any non-trivial divisor δ of N, there are integers a(d) such that
[TABLE]
For simplicity, let p=pt and r=rt. Also, let f=valp(δ) and δ′=gcd(δ,M).
If δ′=1, then by (6.3) there are integers b(d′) such that
[TABLE]
Also by Proposition 6.14, we have e(pr)pf=∑k=0rc(k)⋅Ap(r,k) for some c(k)∈Z.
Furthermore, φ(pmin(f,r−f))⋅e(pr)1−e(pr)pf=∑k=1r−c(k)⋅Bp(r,k).
Now, we prove that for any δ∈\cmcalDN0, we have fδ∈⟨Z1(d):d∈\cmcalDN0⟩.
Suppose first that f=0, and so δ′=1.
Then since {\mathbf{Z}}^{1}(d)={\mathbf{Z}}^{1}(M,d)\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{A}}_{p}(r,0) for any d∈\cmcalDM0, we have
[TABLE]
Next, suppose that f≥1 and δ′=1. Let
[TABLE]
Since {\bf e}(M)_{1}=\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}_{i=1}^{t-1}{\mathbf{A}}_{p_{i}}(r_{i},0), we have
[TABLE]
Finally, suppose that δ′=1 and f≥1. Let {\mathbf{X}}_{2}:={\bf f}(M)_{\delta^{\prime}}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\bf e}(p^{r})_{p^{f}}. Since {\mathbf{Z}}^{1}(M,d^{\prime})\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{A}}_{p}(r,k)={\mathbf{Z}}^{1}(d^{\prime}p^{k}) for any d′∈\cmcalDM0, we have
[TABLE]
Since fδ=φ(gcd(δ′,M/δ′))⋅X1+X2, the result follows by induction.
∎
Next, we show the following.
Proposition 6.25**.**
For any non-squarefree divisor di of N, we have
[TABLE]
Proof.
It suffices to show that
- (1)
for any squarefree divisor d of N and any non-squarefree divisor δ of N, we have
V(Z1(d))δ=0, and
2. (2)
for any non-squarefree divisors di of N, we have
∣V(Z1(di))δi∣=1 and V(Z1(di))δj=0 for all j>i.
Let I=(f1,…,ft)∈Ω(t) and J=(a1,…,at)∈□(t). By definition, there is an index h such that ah≥2.
First, suppose that I∈Δ(t) with m=m(I). Then by Theorem 3.15, we have
[TABLE]
Since fh∈{0,1}, Aph(rh,fh)phah=Bph(rh,1)phah=0 by Lemma 6.18, and so
[TABLE]
Next, suppose that I∈□(t). Let pI=di and δj=pJ. If we write δi=pK for some K=(b1,…,bt), then we have bi=ιri(fi) (cf. Remark 6.9).
Since ∣Api(ri,fi)pibi∣=1 by Lemma 6.18, we have
[TABLE]
Also, if i<j then K⊲J by definition, and hence there is an index h such that bh⊲rhah. Since Aph(rh,fh)phah=0 by Lemma 6.18, we have
[TABLE]
This completes the proof.
∎
Finally, we prove the following.
Proposition 6.26**.**
Let I=(f1,…,ft)∈Ω(t). Then we have
[TABLE]
where m=m(I) and gpi(ri,fi) is defined in Lemma 6.16.
Furthermore, we have h(Z(pI))=2 if and only if one of the following holds.
- (1)
I=A(1).
2. (2)
u≥1* and I=E(u).*
3. (3)
u≥1, 3≤ru≤4, fu=3 and fi=1 for all i=u.
4. (4)
u≥1, ru≥5, fu=ru+1−gcd(2,ru) and fi=1 for all i=u.
Proof.
First, let I=(f1,…,ft)∈□(t). By definition, there is an index h such that fh≥2.
Note that by Theorem 3.15 and Lemma 6.16, we have
[TABLE]
As already mentioned in Remark 4.9, we have GCD(Z1(pI))=GCD(Z(pI)) and so the first assertion follows in this case.
Note that the summation of all entries of Api(ri,fi) is (pi−1)1−fi if fi∈{0,1}, and [math] otherwise.
Thus, by Theorem 3.15 we have Pwpn(Z1(pI))=0 unless n=h and fi≤1 for all i=h.
Moreover, by Lemma 6.20 we have Pwph(Z1(pI))∈2Z if and only if h=u≥1, fu≥3, ru−[2ru−fu+1] is odd, and fi=1 for all i=u.
Thus, we obtain the result for I∈\cmcalTu as Z1(pI)=Z(pI).
Suppose further that I∈\cmcalTu, i.e., h=u≥1, ru≥5, fu≥3 and fi=1 for all i=u. Note that for any r≥5 (and 3≤f≤r), Pw2(B2(r,f))∈2Z if and only if f=r+1−gcd(2,r). Therefore the result follows in this case.
Next, let I=(f1,…,ft)∈Δ(t) with m=m(I). Similarly as above, we have
[TABLE]
Also, since the summation of all entries of Bp(r,1) is zero, we have Pwpn(Z(pI))=0 unless n=m. Furthermore, we have
Pwpm(Z(pI))=−∏i=1,i=mt(pi−1)1−fi.
Thus, Pwpm(Z(pI))∈2Z if and only if one of the following holds.
fi=1 for all i, i.e., I=A(1).
fu=0 and fi=1 otherwise if u≥1, i.e., I=E(u).
This completes the proof.
∎
Corollary 6.27**.**
For any d∈\cmcalDN0, we have
[TABLE]
Proof.
Note that κ(N)=∏i=1tκ(piri). So by Remark 6.17, we have the first equality. Also, the second equality follows from Proposition 6.26 (and the definition of \cmcalH(N,d)).
∎
Combining all the results above, we now prove Theorem 6.23.
By Proposition 6.24, it suffices to show that
[TABLE]
First, suppose that N is odd, i.e., u=0. Then Z1(d)=Z(d) for any d∈\cmcalDN0 and so by Proposition 6.26, we have h(Z1(d))=1 for any d∈\cmcalDNnsf. Hence by Proposition 6.25, we can successively apply Theorem 5.1 for Ci=Z1(di) and obtain
[TABLE]
Next, suppose that N is even, i.e., u≥1. Let r=ru≥1 and
let dj=pJ for some J=(f1,…,ft)∈Ω(t).
Suppose that m+r≤j≤n. Then there is an index h=u such that fh≥2 and J∈\cmcalTu (cf. Remark 6.10). Thus, we have Z1(dj)=Z(dj) and so h(Z1(dj))=h(Z(dj))=1 by Proposition 6.26.
Similarly as above, we obtain
[TABLE]
This completes the proof. ∎
Remark 6.28*.*
When N is divisible by 32, one may try to find a “better” replacement of B2(r,f) than B2(r,f), which can be used to directly prove
[TABLE]
Then it seems necessary to find a replacement D∈Divcusp0(X0(2r))(Q) of B2(r,r) with Pw2(D)∈2Z. But then the order of D is at most 2r−4 by Theorem 3.13, whereas that of B2(r,r) is 2r−3 if r≥5 is odd. Thus, it seems impossible to find such a divisor, and unfortunately we only have an indirect proof of the first assertion of Theorem 6.1, which will be presented in Section 6.7.
On the other hand, if N is not divisible by 32, then (6.4) is obviously true because the order of Z1(dj) is 1 for any m<j<m+r.
6.5. Generation
As above, we assume that t≥2. In this subsection, we prove the following.
Theorem 6.29**.**
For each i=0, 1 or 2, we have
[TABLE]
For simplicity, we use the following abbreviated notation.
Notation 6.30*.*
For any 1≤i<j≤t, let
Aifi:=Api(ri,fi),Bi:=Bpi(ri,1)andD(i,j):=D(piri,pjrj).
Yi(I):=Yi(pI)andZ(I):=Z(pI).
γi:=piri−1(pi+1)andGji:=gcd(γi,γj).
Uji=γj⋅(Gji)−1∈Zℓ×andWji=γi⋅(Gji)−1∈Z.
For any I=(f1,…,ft)∈Δ(t) and a subset S of {1,…,t}, let
[TABLE]
and
[TABLE]
For instance, for any I=(f1,…,ft)∈Δ(t) with m(I)=m and n(I)=n≤t, we can write
[TABLE]
For any 1≤k≤t, let
[TABLE]
and for any I=(f1,…,ft)∈Δ(t)k, let
[TABLE]
Also, let Yi:=⟨Yi(pI):I∈Δ(t)⟩, Z0:=⟨Z(pI):I∈Δ(t)⟩ and
[TABLE]
Furthermore, let
[TABLE]
From now on, we regard Yi and Zi as subsets of Yℓi and Zℓi, respectively.
We will prove the following by which one can easily deduce the theorem.
- (A)
Z0⊂Yℓ0.
2. (B)
Y0⊂Yℓ1.
3. (C)
Y1⊂Yℓ2.
4. (D)
Y2⊂Z1.
5. (E)
Z1=Z0.
Note that for each i, we have Bi=γi⋅Ai0−Ai1. Also, we have
[TABLE]
Furthermore, we have
[TABLE]
These equalities are frequently used below without further mention.
Proof of A.
Let I∈Δ(t) with m(I)=m and n(I)=n. It suffices to show that
Z(I)∈Yℓ0. We prove it by (backward) induction on m.
First, if m=t then I∈\cmcalE and hence Z(I)=Y0(I)∈Y0.
Next, assume that m<t and Z(J)∈Yℓ0 for any J∈Δ(t) with m(J)≥m+1.
If n=t+1, then I∈\cmcalE and hence Z(I)=Y0(I)∈Y0. Assume further that n≤t. Let J=I{m,n} and let
[TABLE]
Note that since m(J)≥m+1, we have Z(J)∈Yℓ0 by induction hypothesis.
If n=m+1, then we have X=Z(J)∈Yℓ0. Suppose that n>m+1.
Since {\mathbf{A}}_{m+1}^{1}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{B}}_{n}={\mathbf{B}}_{m+1}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{A}}_{n}^{1}-G^{m+1}_{n}\cdot{\mathbf{D}}(m+1,n), we then have
[TABLE]
where S={m,m+1,n}.
Therefore we always have X∈Yℓ0, and so
[TABLE]
Since Unm∈Zℓ×, we have Z(I)∈Yℓ0 and the result follows by induction.
∎
Remark 6.31*.*
By modifying the induction hypothesis, we can prove that X is written as a Zℓ-linear combination of Y0(J) for J∈Δ(t) satisfying m(J)≥m+1.
Thus, for each I∈Δ(t) with m=m(I), we can write
[TABLE]
where a(J)=0 unless J=I or m(J)≥m+1. Also, we have a(I)∈Zℓ×. This is the reason behind the definition of Y0(I).
Proof of B.
Let I∈Δ(t) with m=m(I), n=n(I) and k=k(I). As above, it suffices to show that
Y0(I)∈Yℓ1.
First, let I∈\cmcalE∪\cmcalHu. Then Y0(I)=Y1(I)∈Y1.
Next, let I∈\cmcalE, i.e., I=A(m). If m≥u then Y0(I)=Y1(I)∈Y1. Suppose that m<u. By definition, we have {\mathbf{Y}}^{1}(I)={\mathbf{A}}(I)^{\{m,u\}}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{A}}_{m}^{1}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{B}}_{u},
[TABLE]
Thus, we have
[TABLE]
Since U^{m}_{u}\cdot{\mathbf{B}}_{m}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{A}}_{u}^{0}={\mathbf{D}}(m,u)+W^{m}_{u}\cdot{\mathbf{A}}_{m}^{0}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{B}}_{u}, we have
[TABLE]
Since Uum∈Zℓ×, we have Y0(I)∈Yℓ1, as desired.
Lastly, let I=(f1,…,ft)∈\cmcalHu, i.e., m<n=u<k≤t. For simplicity,
for any J∈\cmcalHu with m(J)=m′ and k(J)=k′, let
[TABLE]
We prove that Y0(I)∈Yℓ1 by (backward) induction on m, or more generally
[TABLE]
Step 1. Assume that m=u−1.
(1) Let J=I{m,u}. Then we have m(J)=u and n(J)=k. Thus, we have
[TABLE]
Since I∈\cmcalHu, we have
[TABLE]
Note that (Ukm)−1⋅Uum⋅Wkm⋅Uku=Wum⋅Wku=γm⋅γu⋅(Gum⋅Gku)−1, and therefore
[TABLE]
(2) Note that Y0(J)=Y1(J) for any J∈Δ(t) with m(J)≥u. Therefore by Remark 6.31, for any J∈Δ(t) with m(J)≥u, we have
[TABLE]
Let J=I{m,k}. Then we have m(J)=u+1 and so Z(J)∈Yℓ1 by (6.5).
If k=u+1, then we have U0,0(I)=Z(J)∈Yℓ1.
If k>u+1, then for S={m,u,u+1,k} we have
[TABLE]
(3) Let S={m,u,k}. Since m=u−1, we have m(IS)=u. Hence by (6.5), we have
[TABLE]
and therefore
[TABLE]
(4) Let S={m,u,k} and let {\mathbf{V}}:={\mathbf{A}}(I)^{S}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{B}}_{m}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{A}}_{u}^{0}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{A}}_{k}^{1}. Then we have
Uum⋅V=Y0(I{k})+Wum⋅Z(IS) because
U^{m}_{u}\cdot{\mathbf{B}}_{m}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{A}}_{u}^{0}={\mathbf{D}}(m,u)+W^{m}_{u}\cdot{\mathbf{A}}_{m}^{0}\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigotimes}}}}{\mathbf{B}}_{u}.
If I{k}∈\cmcalHu, then Y0(I{k})∈Yℓ1 by (1) because m(I{k})=m=u−1.
If I{k}∈\cmcalHu, then we have I{k}∈\cmcalHu1 since n(I{k})=u.
Thus, Y0(I{k})∈Yℓ1 by the previous cases.
Also, by (3) we have Z(IS)∈Yℓ1. Therefore we always have V∈Yℓ1. Hence, we have
[TABLE]
This completes the proof of the claim for m=u−1.
Step 2. Assume that m≤u−2. For any J∈\cmcalHu with m′=m(J)≥m+1, assume further that
[TABLE]
Since I{m}∈\cmcalHu with m(I{m})=m+1 and k(I{m})=k, we have
[TABLE]
(1) Let S={m,m+1,u} and T={m,m+1,u,k}, and let
[TABLE]
Let J=I{m,u}. Since m(J)=m+1 and n(J)=k, we have
[TABLE]
Since Ukm+1∈Zℓ×, we have V1∈Yℓ1, and therefore
[TABLE]
Note that
[TABLE]
Thus, we have
[TABLE]
(2) Note that U{0,0}(I)=U{1,0}(I{m})∈Y1.
(3) Let S={m,m+1,u,k}, and let
[TABLE]
If fi=1 for all i>k, then V4=Y0(A(m+1))∈Yℓ1 by the result above.
Suppose that fk′=0 for some k<k′≤t. We take k′ as small as possible, i.e., fj=1 for all k<j<k′. Let J=I{m,k}. Then we have m(J)=m+1, n(J)=u and k(J)=k′, and so we have J∈\cmcalHu. By induction hypothesis we have
Y0(J)∈Yℓ1 and U0,1(J)∈Yℓ1. Note that
[TABLE]
Thus, we have V4∈Yℓ1 because Uk′m+1∈Zℓ×. Note also that
[TABLE]
Therefore we have
[TABLE]
(4) For simplicity, let J=I{k} and S={m,m+1,u}. As for V2 and V3, let
[TABLE]
Suppose that J∈\cmcalHu. Since m(J)=m, by the same argument as in (1), we can easily prove that
[TABLE]
Thus, we have
[TABLE]
Suppose that J∈\cmcalHu, i.e., fi=1 for all i>k. Then we have Y0(J)=Y1(J). Since Y0(A(m+1))∈Yℓ1, we have
[TABLE]
Thus, whether J∈\cmcalHu or not, we always have V6∈Yℓ1 as Uum∈Zℓ×.
Therefore we have
[TABLE]
By induction, the result follows.
∎
Proof of C.
It suffices to show that Y1(I)∈Yℓ2 for any I∈\cmcalFs∪\cmcalGs.
If s=0 then \cmcalFs=\cmcalGs=∅ and the claim vacuously holds.
Suppose that s=1. Then we have
[TABLE]
Let I=E(2). Then we have
[TABLE]
Let I=E(n) for some 3≤n≤t. Also, let J=I{1}, K=I{1,2}, S={1,2,n}, and let
[TABLE]
Finally, let
[TABLE]
Note that
[TABLE]
Thus, it suffices to show that W1,0∈Yℓ2 and V1∈Yℓ2. More generally,
we claim that Vi∈Yℓ2 and Wϵ1,ϵ2∈Yℓ2 for all i and ϵj.
Indeed, we have
[TABLE]
If n=3 then V2=Y2(A(3))∈Y2. If n>3 then we have
[TABLE]
Also, since Un2∈Zℓ× and
[TABLE]
we have V3∈Yℓ2.
Next, we have W1,1=Y2(A(1))∈Y2. Also, since U21∈Zℓ× and
[TABLE]
we have W0,0∈Yℓ2. Furthermore, we have W0,1∈Yℓ2 because U21∈Zℓ× and
[TABLE]
Finally, let
[TABLE]
Then since Un2∈Zℓ× and
[TABLE]
we have V4∈Yℓ2. Thus, we have
[TABLE]
This completes the proof for s=1.
Suppose that s≥2. Then \cmcalGs=∅ and
[TABLE]
For some n∈\cmcalIs, let I=E(n), J=Es(n) and S={1,n,s}.
Note that if n>s then J∈\cmcalHu. Thus, whether n<s or not, we always have
[TABLE]
Therefore we have
[TABLE]
This completes the proof.
∎
Proof of D.
The claim is obvious by the equalities at the beginning of the section, so we leave the details to the readers.
∎
Proof of E.
Since Z0⊂Z1 is obvious, we only prove Z1⊂Z0.
Let I=(f1,…,ft)∈Δ(t)k with m=m(I). We prove that Xk(I)∈Z0 by (backward) induction on m. By definition, we have m≤k.
If m=k then Xk(I)=Z(I)∈Z0.
Next, suppose that m<k and Xk(J)∈Z0 for any J with m(J)≥m+1. Since m(I{m})≥m+1 and I{m}∈Δ(t)k, we have Xk(I{m})∈Z0 by induction hypothesis. Thus, we have
[TABLE]
By induction the result follows.
∎
Remark 6.32*.*
Let N=∏i=1tpiri=∏j=1tqjsj be two prime factorizations of N, namely, pi and qj are rearrangements of each other.
As above, let u be the index such that qu=2. (If N is odd, then we set u=0.)
For any J=(f1,…,ft)∈Δ(t), we define
[TABLE]
where qJ=∏j=1tqjfj. Then by the same argument as in the proof of E, we have
[TABLE]
Thus, we have
[TABLE]
In other words, the definition of C(N)sf does not depend on the ordering of the prime divisors of N.
6.6. Linear independence
As above, we assume that t≥2. In this subsection, we prove the following.
Theorem 6.33**.**
We have
[TABLE]
For simplicity, we use the following abbreviated notation.
Notation 6.34*.*
For any 1≤i<j≤t, let
Aifi:=Api(ri,fi)andBi:=Bpi(ri,1).
γi:=piri−1(pi+1)andGji:=gcd(γi,γj).
gji:=γi⋅γj⋅gcd(pi−1,pj−1)⋅(Gji)−1.
uji:=gcd(pi−1,pj−1)−1⋅(pi−1) if j=uanduui:=−1.
wji:=gcd(pi−1,pj−1)−1⋅(1−pj) if j=uandwui:=pi−1.
D(i,j):=V(D(piri,pjrj))∈\cmcalS1(piripjrj).
For each I∈Δ(t), let Yi(I):=Yi(pI).
For each J∈Δ(t) and V∈\cmcalS1(N), let VJ:=VpJ.
Note that we have uji∈Zℓ× by Assumption 4.5. Since
[TABLE]
by Lemma 6.16 we have
[TABLE]
Thus, we have
[TABLE]
in particular we have D(i,j)pipj=0. Also, if j=u (resp. j=u), then D(i,j)pj=uji∈Zℓ× (resp. D(i,j)pi=uji∈Zℓ×).
Note that uji is odd if ℓ=2. However, uji might be even if ℓ is odd. In that case, wji is odd because uji and wji are relatively prime. Therefore either uji or wji is odd.
To begin with, we prove the matrix M0=(V(Y1(di))δj) is lower ℓ-unipotent.
Proposition 6.35**.**
For any I∈Δ(t), we have
[TABLE]
Proof.
As above, it suffices to show that for any I∈Δ(t), we have
[TABLE]
Let I=(f1,…,ft)∈Δ(t) with m=m(I), n=n(I) and k=k(I). Also, let
ι(I)=(a1,…,at) and J=(b1,…,bt)∈Δ(t).
Assume that ι(I)⊲J. Then by definition, there is an index h such that bh=1 and ah=0.
We further assume
- (1)
ai=bi for all i>h different from u if h=u, and
2. (2)
ai=bi for all i different from u if h=u.
Then by Theorem 3.15, we can easily compute V(Y1(I)), and so we can prove the claim as in the proof of Proposition 6.25. More precisely, we proceed as follows: Let K=(c1,…,ct)∈Δ(t).
- (1)
Assume that I∈\cmcalE, and let x=max(m,u). Then we have
[TABLE]
Note that
[TABLE]
Therefore we have ∣V(Y1(I))ι(I)∣=1. Also, since I∈\cmcalE we have ax=1, and hence h=x.
By the definition of ι, we have fh=1−ah=1. Since fh=bh=1 and (Ah1)ph=0, we have V(Y1(I))J=0, as claimed.
2. (2)
Assume that I∈\cmcalHu. Then we have
[TABLE]
Note that we have n=u and so k=u. Thus, we have
[TABLE]
Since I∈\cmcalE∪\cmcalHu1, we have ai=1−fi for all i.
Therefore we have
[TABLE]
Since ak=1−fk=1, we have h=k.
If h=m, then V(Y1(I))J=0 because (Ah1)ph=0 as above. If h=m, then bh=bm=1 and ai=bi for all i>m different from u, in particular ak=bk=1. Thus, we have V(Y1(I))J=0 as D(m,k)pmpk=0 and bm=bk=1.
3. (3)
Assume that I∈\cmcalE∪\cmcalHu. Then we have
[TABLE]
- (a)
Suppose that I∈\cmcalHu1, i.e., m<n=u and fi=1 for all i>u. By definition, am=1, au=0 and ai=1−fi for all i=m,u.
Since D(m,u)pm=uum=−1, we have ∣V(Y1(I))ι(I)∣=1. Also, since am=1 we have h=m. If h=u, then we have V(Y1(I))J=0 as above. If h=u then bh=bu=1 and ai=bi for all i different from u, and so am=bm=1.
Thus, we have V(Y1(I))J=0 as D(m,u)pmpu=0 and bm=bu=1.
2. (b)
Suppose that I∈\cmcalHu1. Since I∈\cmcalHu∪\cmcalHu1, we have n=u. Also, since I∈\cmcalE∪\cmcalHu1, we have ai=1−fi for all i.
Therefore we have V(Y1(I))ι(I)=±unm∈Zℓ×. Note that fh=1−ah=1, and so h=n.
As above, if h=m then we have fh=bh=1, and so we have V(Y1(I))J=0 as (Ah1)ph=0. Also, if h=m then bh=bm=1 and bn=an=1−fn=1 because n>m and n=u.
Thus, we have V(Y1(I))J=0 as D(m,n)pmpn=0 and bm=bn=1.
This completes the proof.
∎
Next, we compute GCD(Y1(I)) and h(Y1(I)).
Proposition 6.36**.**
Let I=(f1,…,ft)∈Δ(t) with m=m(I), n=n(I) and k=k(I).
Also, let x=max(m,u).
Then we have
[TABLE]
Also, h(Y1(I))=2 if and only if I∈\cmcalFu1∪\cmcalGu1∪{A(1)}.
Proof.
The first assertion easily follows by
the same argument as in the proof of Proposition 6.26.
To prove the second assertion, we note that the summation of all entries of Bi (resp. D(i,j)) is zero. Thus, we have Pwph(Y1(I))=0
unless
- (1)
I∈\cmcalE and h=x.
2. (2)
I∈\cmcalHu and either h=m or k.
3. (3)
I∈\cmcalE∪\cmcalHu and either h=m or n.
Since either uji or wji is odd, and since pi−1 is even unless i=u, we easily have the following.
- (1)
For I∈\cmcalE, h(Y1(I))=2 if and only if one of the following holds.
- (a)
m=1, i.e., I=A(1).
2. (b)
u=1 and m=2, i.e., I=A(2)=E(1).
2. (2)
For I∈\cmcalHu (and so u≥2), h(Y1(I))=2 if and only if m=1 and fj=1 for all j>k, i.e., I=Eu(k) for some k>u.
3. (3)
For I∈\cmcalE∪\cmcalHu, h(Y1(I))=2 if and only if one of the following holds.
- (a)
m=1 and fj=1 for all j>n, i.e., I=E(n) for some n≥2. (Here, n might be equal to u.)
2. (b)
m=1, n<u, fu=0 and fj=1 for all j>n different from u, i.e., I=Eu(n) for some 2≤n<u.
3. (c)
u=1, m=2 and fj=1 for all j>n, i.e., I=Eu(n) for some 3≤n≤t.
Thus, the second assertion follows by the definition of \cmcalFu1 and \cmcalGu1.
∎
From now on, let
[TABLE]
Lemma 6.37**.**
Let ℓ=2. Then for any I∈Δ(t)∖H, we have
[TABLE]
Proof.
Since ℓ=2, we have s=u. Also, since Y2(I)=Y1(I) unless I∈\cmcalFs∪\cmcalGs, by Proposition 6.35 it suffices to prove the assertion for I∈\cmcalFs∪\cmcalGs. If s=0, there is nothing to prove and so we assume that s≥1.
For simplicity, let F={2n−ϵ:n∈\cmcalIs}, where ϵ=1 if s<n and [math] otherwise. Also, let G={2} if s=1, and G=∅ otherwise. Then by Lemma 6.11, I∈\cmcalFs if and only if pI=di for some i∈F. Also, by Lemma 6.12 I∈\cmcalGs if and only if pI=di for some i∈G.
Now, we claim the following. For any i∈F∪G, we have
- (1)
di+1=pK for some K∈H,
2. (2)
V(Y2(di))δi=−V(Y2(di))δi+1∈2Z, and
3. (3)
V(Y2(di))δj=0 for any j>i+1.
Indeed, if i∈F, i.e., i=2n−ϵ for some n∈\cmcalIs, then by Lemma 6.11 we have K=Es(n)∈H, δi=pF(n) and δi+1=pFs(n). Also by Theorem 3.15, we have
[TABLE]
Therefore we have V(Y2(di))F(n)=−V(Y2(di))Fs(n)=uny∈2Z. Suppose that j>i+1 and δj=pJ for some J=(f1,…,ft)∈Δ(t). Since δj+1=pFs(n), we have Fs(n)⊲J, and so fh=1 for some h>n different from s. Thus, we have V(Y2(di))J=0 as (Ah1)ph=0.
If i∈G, then s=1 and i=2. By Lemma 6.12, we have K=A(2)=E(1)∈H, δ2=p2 and δ3=p1p2.
Similarly as above, we have
[TABLE]
and so V(Y2(d2))p2=−V(Y2(d2))p1p2=−1. Also, as above V(Y2(d2))δj=0 for all j>3. Therefore the claim follows.
Now, the assertion for I∈\cmcalFs∪\cmcalGs easily follows by the claim. Indeed, if I∈\cmcalFs∪\cmcalGs, then di=pI for some i∈F∪G and so V(Y2(I))ι(I)∈2Z. Suppose that J≺I. By definition, we have ι(J)⊲ι(I). If J∈\cmcalFs∪\cmcalGs, then Y2(J)=Y1(J) and so V(Y2(J))ι(I)=0 by Proposition 6.35. If J∈\cmcalFs∪\cmcalGs with dj=pJ, then we have j∈F∪G.
By the claim, dj+1=pK for some K∈H. Since I∈H, we have j+1=i. Also, since J≺I, we have j<i and so j+1<i. Thus, V(Y2(J))ι(I)=0 by the claim.
This completes the proof.
∎
Lemma 6.38**.**
Let I∈\cmcalFs∪\cmcalGs. Then we have
[TABLE]
Also, we have h(Y2(I))=1.
Proof.
By Theorem 3.15, the first assertion easily follows. For the second assertion, we note that the summation of all entries of Bs (resp. D(y,n)) is zero.
Thus, if I∈\cmcalFs, then Pwph(Y2(I))=0 for any h. Also, if I∈\cmcalGs, then Pwp1(Y2(I))=1−p2 and Pwph(Y2(I))=0 for all h≥2. Since p2 is odd, the second assertion follows.
∎
Corollary 6.39**.**
For any d∈\cmcalDNsf, we have
[TABLE]
Proof.
Note that κ(piripjrj)=gji×\cmcalG(piri,pjrj).
Thus, by the same argument as Corollary 6.27, the assertion follows from Proposition 6.36 and Lemma 6.38.
∎
Lastly, we prove the following.
Lemma 6.40**.**
Let ℓ=2. Then for any I∈H′, there is an index 1≤h≤t different from u such that
[TABLE]
Proof.
Since ℓ=2, we have s=u. For simplicity,
let I=(f1,…,fn)∈Δ(t) and J=(a1,…,at)∈Δ(t). Assume that I∈H′ and J≺I.
First, suppose that s=0. Then I=E(n) for some 2≤n≤t. By definition, we have ι(I)=F(n). In this case, we can take h=n. (Since u=0, we have h=u.) Indeed, as above we have
Pwpn(Y2(I))=Pwpn(Y1(I))=un1∈2Z. Also, we claim that
V(Y2(J))δ=0 for any δ∈\cmcalDNsf divisible by pn, or equivalently
[TABLE]
which clearly implies that Pwpn(Y2(J))=0. Since cn=1, by definition F(n)⊲K or F(n)=K. Also, since ι(J)⊲ι(I)=F(n), we have ι(J)⊲K. Thus, the claim follows by Proposition 6.35 as Y2(J)=Y1(J). (Note that \cmcalFs∪\cmcalGs=∅.)
Next, suppose that s≥1. Then we have H′={E(s),Es(n):n∈\cmcalIs}.
Suppose first that I=E(s). Then dk=pE(s) by Lemma 6.12, where k=max(2,4−s). Hence by direct computation, we can take h=max(1,3−s). (Thus, we have h=u.) Suppose next that I=Es(n) for some n∈\cmcalIs.
Then we can take h=n. (Since n∈\cmcalIs, we have h=u.) Indeed, whether n<s or not353535If n>s, then we have n(I)=s and k(I)=n, i.e., I∈\cmcalHs., we have
[TABLE]
where y=max(1,3−s). Thus, we have Pwpn(Y2(I))=uny∈2Z as above. Also, if J∈\cmcalFs∪\cmcalGs, then we have Pwpn(Y2(I))=0 as n∈\cmcalIs. Furthermore, if J∈\cmcalFs∪\cmcalGs, then we have ι(J)⊲F(n). Thus, we have ι(J)⊲K for any K=(c1,…,ct)∈Δ(t) with cn=1.
Since Y2(J)=Y1(J), by Proposition 6.35 we have V(Y2(J))K=0, and so Pwpn(Y2(J))=0, as desired.
This completes the proof.
∎
Combining all the results above, we now prove Theorem 6.33.
By Theorem 6.29, it suffices to show that
[TABLE]
Suppose first that ℓ is odd. Then we have Y1(I)=Y2(I). Thus, the assertion follows by successively applying Theorem 5.2 thanks to Proposition 6.35.
Suppose next that ℓ=2, and so s=u. By Theorem 5.3, it suffices to show the following. For any I∈Δ(t)∖{A(1)}, one of the following holds363636Note that A(1) is the smallest element in (Δ(t),≺) by Lemma 6.12..
- (1)
h(Y2(I))=1, V(Y2(I))ι(I)∈2Z and V(Y2(J))ι(I)=0 for all J≺I.
2. (2)
There is an index 1≤h≤t such that
[TABLE]
Note that (1) follows by Lemma 6.37 because h(Y2(I))=1 if and only if I∈H (Corollary 6.39).
And (2) follows by Lemma 6.40. This completes the proof. ∎
6.7. Proof of Theorem 6.1
In this subsection, we finish the proof of Theorem 6.1.
Proof of Theorem 6.1.
By Theorems 6.13 and 6.21, it suffices to prove the theorem for t≥2.
First, by Theorem 3.13 and Corollary 6.27 (resp. 6.39), the order of Z(d) (resp. Y2(d)) is n(N,d) (resp. N(N,d)). Thus, it suffices to show the first assertion.
For simplicity, we use the same notation as in the previous section.
Also, let r=1 if u=0 and r=ru if u≥1.
Since Z1(di)=Z(di) for any 1≤i≤m, by Theorems 6.23 and 6.33, it suffices to show that
[TABLE]
Since the claim vacuously holds for r=1, we assume that r≥2.
For any 2≤f≤r, let If:=(f1,…,ft) with fu=f and fi=1 for all i=u.
Then by Remark 6.10, we have
[TABLE]
Thus, it is enough to show that
[TABLE]
For simplicity, let
[TABLE]
Then by Remark 4.7, we have Z1(If)=π1∗(B2(r,f)).
So if r≤4, then we have B2(r,f)=0 because the genus of X0(2r) is zero. Thus, we have Z(If)=Z1(If)=0 for any 2≤f≤r, and so (6.6) obviously holds.
Accordingly, we assume r≥5. By direct computation, the order of Z(dm+1)=Z(I2) is numerator(2422−1)=1, i.e., Z(dm+1)=0, and so (6.6) is equivalent to
[TABLE]
We prove (6.7) by the following two claims:
- (1)
C(N)sf∩⟨Z1(If):3≤f≤r⟩=0.
2. (2)
\left\langle\overline{Z^{1}(I_{f})}:3\leq f\leq r\right\rangle\simeq\operatorname*{\text{\raisebox{1.72218pt}{\scalebox{0.6}{\bigoplus}}}}_{f=3}^{r}\left\langle\overline{Z(I_{f})}\right\rangle.
For simplicity, let G:=⟨Z1(If):3≤f≤r⟩.
Then we have
[TABLE]
Note that G is a 2-group as C(2r) is. Thus, we have
[TABLE]
By Theorem 6.33, we have
[TABLE]
Hence it suffices to show that ⟨Y2(I)⟩[2∞]∩G=0 for any I∈Δ(t). If I=A(1), then the claim follows because the order of Y2(A(1)) is 1.373737The divisor Y2(A(1)) comes from level 2. Indeed, we have Y2(A(1))=π1(N,2)∗(0−∞).
If I∈Δ(t)∖{A(1)}, then as in Proposition 6.25, we can prove that V(Z1(If))δ=0
for any δ divisible by ph for some h=u. Thus, we obtain ⟨Y2(I)⟩[2∞]∩G=0 by Theorem 5.3 because of the following:
- (1)
Suppose that I∈Δ(t)∖H.
Since I=A(1), pι(I) is divisible by ph for some h=u, and thus we have V(Z1(If))ι(I)=0. As already discussed, we have V(Y2(I))ι(I)∈2Z.
2. (2)
Suppose that I∈H′. Then by Lemma 6.40, Pwph(Y2(I))∈2Z for some h=u. Also, we have Pwph(Z1(If))=0 since V(Z1(If))δ=0 for any δ divisible by ph.
This completes the proof of the first claim.
Next, we prove the second claim.
Note that by Theorem 6.21, we have
[TABLE]
Note also that since π1∗ is injective (cf. [36, Rem. 2.7]), we have
[TABLE]
Finally, since \cmcalTu={If:3≤f≤r}, we have
[TABLE]
and so Z(If)=π1∗(B2(r,f)) by Remark 4.7.
This completes the proof.
∎
Remark 6.41*.*
Unfortunately, we cannot use Theorem 5.3 to prove
[TABLE]
In fact, if there were a rational cuspidal divisor X satisfying V(X)=V(Y2(I)) and the order of X is even, then
we could prove that
[TABLE]
Thus, it is crucial that the order of Y2(I) is 1 (or at least odd).
6.8. Acknowledgments
I thank Ken Ribet for generously sharing his idea and encouragement. I also thank Myungjun Yu for comments and corrections to an earlier version of this manuscript.
This work was supported by National Research Foundation of Korea(NRF) grant funded by the Korea government(MSIT) (No. 2019R1C1C1007169 and
No. 2020R1A5A1016126).