Minimal modularity lifting for non-regular symplectic representations
Frank Calegari, David Geraghty

TL;DR
This paper proves a minimal modularity lifting theorem for Galois representations linked to genus two Siegel modular forms that are limits of discrete series, advancing understanding in automorphic forms and Galois theory.
Contribution
It introduces a minimal modularity lifting theorem specifically for Galois representations associated with certain Siegel modular forms, a novel result in the field.
Findings
Established a minimal modularity lifting theorem for specific Galois representations.
Connected Galois representations to Siegel modular forms that are limits of discrete series.
Enhanced the theoretical framework for automorphic forms and Galois representations.
Abstract
In this paper, we prove a minimal modularity lifting theorem for Galois representations (conjecturally) associated to Siegel modular forms of genus two which are holomorphic limits of discrete series at infinity.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Minimal Modularity Lifting
For non-regular symplectic representations
Frank Calegari and David Geraghty
2010 Mathematics Subject Classification:
11F33, 11F80.
The first author was supported in part by NSF Career Grant DMS-0846285 and NSF Grant DMS-1404620 and NSF Grant DMS-1701703. The second author was supported in part by NSF Grants DMS-1200304 and DMS-1128155.
Contents
1. Introduction
In this paper, we prove a minimal modularity lifting theorem for Galois representations (conjecturally) associated to Siegel modular forms for the group such that is a holomorphic limit of discrete series. An example of what we can prove with these methods is the following:
Theorem 1.1**.**
Let be a continuous irreducible representation satisfying the following conditions:
- (1)
* is ordinary with Hodge–Tate weights for some integer satisfying .* 2. (2)
If and are the unit root eigenvalues of Frobenius on , then
[TABLE] 3. (3)
The image of contains . 4. (4)
For a prime , the image of inertia at is unipotent, and the image of any generator of tame inertia has the same number of Jordan blocks mod as it does in characteristic zero. 5. (5)
* is modular of level and weight .*
Then is modular, that is, there exists a cuspidal Siegel modular Hecke eigenform of weight such that
[TABLE]
where is the spinor -function of .
We deduce Theorem 1.1 from our main result, which we now state. (We shall refer to § 4 and § 6.4 for precise details concerning ramification behaviour, level subgroups and the exact definition of minimal deformations.) Let denote the -adic cyclotomic character. Let be the ring of integers of a finite extension of . Let be a continuous irreducible representation whose similitude character on inertia at is the mod- reduction of . Suppose that contains an unramified subspace of dimension two on which acts by the scalars and respectively, where
[TABLE]
Suppose further that has big image (explicitly, satisfies Assumption 4.1) and that for a prime is either unramified or is one of the types listed in Assumption 4.3. Let denote the (open) Siegel modular variety of level over , where is determined by as in § 5, and let denote the coherent sheaf on whose complex sections define Siegel modular forms of weight for some integer . Let denote the subring of endomorphisms of
[TABLE]
(where is a certain ordinary projection, see section 6.4) generated by Hecke operators at primes not dividing . Let denote the universal minimal deformation ring of (see Definition 4.6 for more details).
Theorem 1.2**.**
Suppose that there exists a maximal ideal of and a corresponding representation which is isomorphic to . Let denote the universal minimal ordinary deformation ring of . Suppose that . Then there is an isomorphism , and moreover, the module is free as a -module.
The proof follows the strategy of [CG18]. The main ingredients are showing that there exists a map from to (see Theorem 6.17) and proving that the cohomology of the subcanonical extension of to a smooth toroidal compactification of vanishes outside degrees [math] and (see Theorem 5.1 — in the case of classical modular curves this step was trivial).
1.1. Comparison with previous methods
The first modularity theorems which applied to non-regular motives were the results of Buzzard–Taylor and Buzzard [BT99, Buz03] on two dimensional odd Artin representations . The idea of these papers can roughly be described as follows. Using known cases of Serre’s conjecture, one deduces that is modular, where is the representation associated to some -adic realization of for some . Using modularity theorems in regular weight, one then proves that a big Hecke algebra is modular. Specializing to weight one, one deduces the existence of an overconvergent eigenform corresponding to . Under a non-degeneracy assumption on ( is -distinguished), one constructs (using companion forms) a second Hida family which specializes to a second eigenform . Using the geometric properties of , one shows that and converge deeply into the supersingular locus. The Fourier coefficients of and for are determined by . One then constructs a suitable linear combination which converges over the entire modular curve, and is thus classical by rigid GAGA. The formal rigid geometry employed by these papers have been generalized by various authors, in particular by Kassaei [Kas06a]. One may well ask whether this approach can be applied to Siegel modular forms of weight — work of Tilouine and his collaborators has made great progress in this direction. The modularity lifting result for (regular weight) Hida families has been established in many cases by Genestier–Tilouine [GT05] (see also Pilloni [Pil12a]). Significant progress has also been made in the theory of canonical subgroups and the geometry of Siegel modular varieties. One difficulty, however, is that the Fourier expansions of Siegel modular forms are not determined by the Hecke eigenvalues. This is a difficulty which must be overcome in such an approach. (Various classicality results for overconvergent forms can be established without using -expansions, see for example [Kas06b, PS17], but these results only apply to forms of sufficiently non-critical slope.) The difficulty of dealing with -expansions manifests itself for our approach also — we are forced to prove (“by hand”) various properties of Fourier expansions of Hecke eigenforms in § 8.2.
1.2. Abelian Surfaces
It would be desirable to weaken the assumption in the main theorem to , since the case includes the representations associated to the Tate modules of abelian surfaces. The only point in our arguments in which we use the fact that is to deduce that for the subcanonical extension of to a smooth toroidal compactification of . If this vanishing holds for then our theorem would also apply to these cases. On the other hand, one does not expect vanishing here, because one expects that singular Siegel modular forms should contribute cohomology in these degrees. However, we need only the weaker result that the image of in is zero after localization at a sufficiently non-Eisenstein maximal ideal . We expect this to always be true for , although we were not able to prove it. On the other hand, using the ideas of Khare–Thorne [KT17], one can dispense with proving this under the very strong supplementary hypothesis that there exists a characteristic zero form of weight which gives rise to . In particular, by using the arguments of the proof of Theorem 6.29 of ibid, one should be able to prove the analogue of Theorem 1.1 in weight assuming the existence of an auxiliary Siegel modular form of the same level also of weight with .
1.3. Recent Developments
(Added: January, 2019) Very recently, there have been a number of developments related to the main theme of this paper, in particular, in the preprints [Pil17] and [BCGP18], the latter which establishes the potential modularity of abelian surfaces over totally real fields. The introduction to [BCGP18] explains a number of innovations which made those results possible, so we shall confine ourselves here to only a few salient remarks. The first is that the vanishing conjecture for localized at mentioned in §1.2 remains unresolved, and the methods of [BCGP18] blend the techniques of this paper (and [CG18]) with arguments from [Pil17]. A second point is that the paper [Pil17] develops a conceptual method to define (normalized) Hecke operators at , and in particular establishes the action of these operators on higher cohomology (which is essential for the main results of [Pil17] and [BCGP18]). In this paper, it suffices to construct the action of these Hecke operators on which is significantly easier. The methods we use in §8.4 to do this are admittedly disagreeable, relying as they do on arguments using -expansions. Thus the reader is encouraged to consult [Pil17, §7] and [BCGP18, §4.5] for a more geometric construction of these operators. An analysis of the normalization factors for Hecke operators required in [Pil17] also sheds some light on another phenomenological feature of this paper which readers may find surprising: On the Galois side, there is essentially no difference (in the ordinary setting) between working in the (irregular) weight for and working in the (irregular) weight . On the other hand, the Hecke operators at (particularly ) behave quite differently in weight . In our context, this arises most noticeably in §8.5 (via Lemma 8.12), which one should compare to [Pil17, §11.1] (warning: the convention of that paper is that Pilloni’s is our and vice versa, and the spherical version of the operator in [Pil17] is equal in weight up to translation by a multiple of to the operator we call ). Finally, the paper [BCGP18] develops a geometric version of the doubling argument (see §5 of ibid.) This provides a much more robust explanation (in a slightly different setting) for what in this paper occupies most of §8 and consists of a sequence of tricky and not entirely intuitive series of manipulations with -expansions. (Note that the geometric doubling argument of [BCGP18] is only written for weight but the method applies in principle to the weights which we consider in this paper.) Finally, the very observant reader will notice that the doubling argument of [BCGP18] applies in weight to the space of ordinary forms at Klingen level, whereas in this paper we essentially prove (in the same weight) a tripling result at spherical level. Neither of these results immediately imply the other. The “extra” copy of the space of forms can be interpreted as giving rise to a space of non-ordinary forms of weight . See Remark 8.18 for further discussion on this point, which we also discuss in a different context below.
It is natural to ask whether one should expect any genuine difficulties in modifying the geometric doubling argument of [BCGP18] to the setting of this paper. We now offer some speculative remarks to address this point (using notation from [BCGP18]). Let be a smooth admissible irreducible unramified representation of (over ) which is not trivial. (For example, could be the local constituent of an automorphic representation associated to a classical modular form.) Let and let denote the Iwahori subgroup of . The classical theory of oldforms is a reflection of the fact that
[TABLE]
and the characteristic zero version of doubling is the statement that the span of the spherical vector under the operator is all of . The integral version of this statement is false in general. For example, given a classical ordinary modular eigenform of weight , the span of under is simply , because in these weights. However, some version of this result does hold in weight , and it is this property which is leveraged to prove local-global compatibility results in [CG18]. Let us now replace by , and let and denote the Klingen and Iwahori subgroups respectively of (denoted elsewhere in this paper by and respectively.) Now (for the of interest) we will have
[TABLE]
The factor here may be interpreted as the order of the Weyl group of . More prosaically, the oldforms in correspond to a choice of eigenvalues and for the Hecke operators and respectively, whereas the oldforms in correspond to a choice of eigenvalues and for the Hecke operators and . When one passes from to for weight one modular forms or to for weight Siegel modular forms, the property of of being ordinary turns out to be automatically preserved on the corresponding space of old forms. However, this is not a priori true when passing from to , and so one would have to see in any geometric version of this argument a way of dealing with the non-ordinary forms.
1.4. Results of Arthur
In Section 7.2, we make use of the results of [Art04], which sketches how the results of [Art13] on orthogonal and symplectic groups can be extended to the general symplectic group . At the time of the initial submission of this paper, these results of Arthur are conditional on the stabilization of the twisted trace formula. (We direct the reader to [GT18] for the most up to date status of these results for .)
1.5. Acknowledgements
We would like to thank George Boxer for some very helpful comments related to the proofs of Theorems 8.10 and 8.11. We would also like to thank Olivier Taïbi for answering some technical questions arising in §7.2. We would also like to acknowledge useful conversations with Kevin Buzzard, Ching-Li Chai, Matthew Emerton, Toby Gee, Michael Harris, Kai-Wen Lan, Vincent Pilloni, and Jack Thorne. We also like thank many of the participants of the Bellairs workshop in number theory in 2014, where an earlier version of this paper was discussed. Finally, we thank the referees, whose detailed comments very much helped to improve this manuscript.
2. Notation
We fix a prime and let be the ring of integers of a finite extension of with residue field . We let denote the category of complete local Noetherian -algebras with residue field isomorphic to (via the structural homomorphism ).
We let
[TABLE]
denote the cyclotomic character. The Hodge–Tate weight of is .
If is a finite extension of for some prime . We let denote the Artin map, normalized so that uniformizers correspond to geometric Frobenius elements. If is an element of some ring , then we define the character
[TABLE]
to be the unramified character which takes the geometric Frobenius element to , when this character is well defined.
2.0.1. The group
Let , where
[TABLE]
The group is the subgroup consisting of elements with . We let be the Borel subgroup consisting of upper triangular matrices. The Lie algebras of and are denoted and while those of and are denoted and . Let denote the Siegel parabolic, that is, the stabilizer of the plane spanned by the first two standard basis vectors. Let denote the Klingen parabolic, which is the stabilizer of the line spanned by the first standard basis vector. We denote the Levi subgroup of (resp. ) by (resp. ). We have .
Let denote the diagonal torus in and its character group. We identify with the lattice by associating to the character
[TABLE]
We identify the cocharacter group with by associating the triple with the cocharacter:
[TABLE]
The natural pairing on is then: .
The positive roots of with respect to the Borel are given by , , and . Of these, and are the simple roots. We let denote the half-sum of the positive roots. The coroots are: , , and . The intersection is a Borel subgroup of . The corresponding positive root is .
Definition 2.1**.**
We define the set to be the set of weights which are dominant with respect to . Explicitly
[TABLE]
Similarly, we define the set of weights which are dominant with respect to . Explicitly, this is:
[TABLE]
Note that the natural action of on the plane spanned by the first two (resp. the last two) standard basis vectors is the irreducible representation of highest weight (resp. ).
We let denote the Weyl group of and we define and similarly. Let denote the generators for the Weyl group given in [HT13, §2]. We fix a set of Kostant representatives for by setting , , and . Note that each has length . We let act on by . Then we have:
[TABLE]
The longest element of which we denote by acts via .
Note that the collection of representatives is precisely the set of such that . We let denote the closed dominant Weyl chamber. In other words, . For , we define the chambers .
2.0.2. The group
Let
[TABLE]
be the homomorphism sending to the matrix
[TABLE]
where
[TABLE]
Let denote the centralizer of in (acting by conjugation). Then since , we see that where is the maximal compact subgroup of given by the fixed points of the Cartan involution . The similitude character restricts to a surjective map and whose kernel is the connected component of the identity. Then we have explicitly,
[TABLE]
The map:
[TABLE]
induces an isomorphism between and . We let denote the preimage of the diagonal compact torus in and let . Let , and so on. Then we have
[TABLE]
We use subscripts to denote complexifications of Lie algebras and Lie groups; thus and denote the complexifications of and . Then and the surjective map sends to
[TABLE]
Thus its kernel is . We define the lattice to be the subspace consisting of differentials of (complex analytic) characters of . Equivalently, is the subset of consisting of differentials of characters of which factor through the multiplication map . We fix an isomorphism
[TABLE]
by letting correspond to the linear form
[TABLE]
on . This extends by linearity to an isomorphism .
Let be the subspace where acts via . Then each is isotropic and we have an orthogonal direct sum . Let denote the stabilizer of . Consider the Hodge decomposition
[TABLE]
where is the subspace on which acts via . Then we have and we let , . We also let denote the subgroup of generated by . Then we have
[TABLE]
Moreover, is the Levi component of and is its unipotent radical. Let
[TABLE]
Then are a basis of and are a basis of . With respect to the basis of , an element
[TABLE]
acts via
[TABLE]
Note that the Cayley transform conjugates the Siegel parabolic to . Let denote the root system defined by the adjoint action of on . The compact roots are those appearing in , while the non-compact roots are those appearing in . We choose a system of positive roots in such a way that the set of positive non-compact roots coincides with the roots in . (We do this in order to be consistent with the conventions of [BHR94, §2.4].) We are then forced to take to be the set of roots appearing in where is the Borel subgroup of lower triangular matrices. With respect to the identification of as a subset of given above, we then have:
[TABLE]
This can be seen easily from the fact that .
Definition 2.2**.**
We let denote the set of which are dominant with respect the system of positive roots . In other words, .
This set parameterizes the irreducible complex analytic representations of . For , we let denote the corresponding irreducible representation of highest weight .
We note that natural representation of on (resp. ) is the irreducible representation of highest weight (resp. ). Note also that the similitude character has weight .
3. Some Commutative Algebra
We recall here some formalism from [CG18] for proving modularity lifting results in contexts where the Hecke algebra has “co-dimension ” over the ring of diamond operators. The notion of “balanced” below plays the role of “codimension one” for the non-regular group rings .
3.1. Balanced Modules
Let be a Noetherian local ring with residue field and let be a finitely generated -module.
Definition 3.1**.**
We define the defect of to be
[TABLE]
Let
[TABLE]
be a (possibly infinite) resolution of by finite free -modules. Assume that the image of in is contained in for each . (Such resolutions always exist and are often called ‘minimal’.) Let denote the rank of . Tensoring the resolution over with we see that and hence that .
Definition 3.2**.**
We say that is balanced if .
If is balanced, then we see that it admits a presentation
[TABLE]
with .
3.2. Patching
We recall the abstract Taylor–Wiles style patching result from [CG18].
Proposition 3.3**.**
Suppose that
- (1)
* is an object of and is a finite -module which is also finite over ;* 2. (2)
* is an integer, and for each integer , ;* 3. (3)
; 4. (4)
for each , is a surjection in and is an -module
and that for each the following conditions are satisfied
- (a)
the image of in is contained in the image of , and moreover, the image of the augmentation ideal of in is contained in the image of ; 2. (b)
there is an isomorphism of -modules, where acts on via and ; 3. (c)
* is finite and balanced over (see Definition 3.2).*
Then is a free -module.
Proof.
This is Prop. 2.3 of [CG18]. ∎
4. Deformations of Galois representations
Let
[TABLE]
be a continuous, odd, absolutely irreducible Galois representation with similitude character of the form where . Let us suppose that there exist and in such that
[TABLE]
and moreover . Let denote the set of primes of away from at which is ramified.
The group admits a -dimensional adjoint representation on its Lie algebra . Let denote the composition of with this representation. For , the representation admits a decomposition , where is the similitude character of .
We make the following further assumptions on :
Assumption 4.1** (Big Image).**
The restriction of to satisfies the following conditions, cf. §5.7 of [Pil12a]:
- H1:
The field does not contain , 2. H2:
For any , there exists an element such that has four distinct eigenvalues and such that the action of on each irreducible representation of over contains as an eigenvalue. 3. H3:
Neither the image of nor the image of admits a quotient of degree .
If this assumption holds, we say that has big image, although condition (H1) depends on more than the group-theoretic image of or even .
Assumption 4.2** (Neatness).**
There exists a with such that the ratio of any two eigenvalues of is not equal to .
This condition is imposed to avoid dealing with stacks. If , any surjective representation whose similitude character is a power of will be neat. By assumption, the image contains an element which is scalar with eigenvalue . If , then the similitude character would also equal . But the similitude character of the scalar matrix is .
Assumption 4.3** (Ramification).**
If , then is one of the following types:
- (1)
**U3 **: has unipotent image, and is conjugate to the group generated by , where
[TABLE] 2. (2)
**U2 **: has unipotent image, and is conjugate to the group generated by , where
[TABLE] 3. (3)
**U1 **: has unipotent image, and is conjugate to the group generated by , where
[TABLE] 4. (4)
P*: is a direct sum of characters, and has the form*
[TABLE]
for some non-trivial character of . Both the plane of invariants under and the plane on which acts by are isotropic. Moreover is prime to . 5. (5)
H*: is absolutely irreducible and is prime to .*
Remark 4.4**.**
*Since we are assuming that the similitude character of is a power of the cyclotomic character, it turns out that can never be of type P. We expect that our arguments can also be adapted to deal with representations with more general (odd) similitude characters, but we made this assumption to simplify some of the arguments involving -expansions (in particular, to avoid various Nebentypus characters).
Note that non-trivial unipotent representations are not direct sums, so a prime is either of type U, P, or H, but never simultaneously any two of these types. Moreover, is of type U2 or U3 if and only if is generated by an element where is nilpotent of rank , or respectively.
Let denote a finite set of primes of disjoint from . We assume that for each the following hold:
- •
,
- •
is a direct sum of four pairwise distinct characters. Label these characters as , and such that the planes and are isotropic and .
(By abuse of notation, we sometimes use to denote the product of primes in .) For objects in , a deformation of to is a -conjugacy class of continuous lifts of . We will often refer to the deformation containing a lift simply by .
Remark 4.5**.**
*When deforming Galois representations over , we could work either with fixed or varying similitude character — both give rise to deformation problems with . We make the (somewhat arbitrary) choice to work with deformations with fixed similitude character in this paper, because it is the “correct” approach for general totally real fields — for totally real fields other than , the invariant increases (by ) when deforming the similitude character.
Definition 4.6**.**
We say that a deformation of is minimal outside if it satisfies the following properties:
- (1)
The similitude character is equal to . 2. (2)
If is a prime of , then is unramified. 3. (3)
If is of type U1, U2 or U3, then has unipotent image and its image is topologically generated by an element where is nilpotent of rank , or respectively. 4. (4)
If is of type P, then . 5. (5)
If , then where each is an isotropic plane in and lifts while lifts . Moreover, acts by scalars (via some character) on and by scalars via the inverse of this character on . 6. (6)
The representation has the following shape at :
[TABLE]
where and are unramified characters lifting and respectively, and is an unramified character which is trivial modulo the maximal ideal.
If is empty, we will refer to such deformations simply as being minimal. If satisfies conditions (2)–(4), then we say is weakly minimal outside .
Remark 4.7**.**
The local condition at is equivalent to asking that is ordinary (of fixed weight). When it is also equivalent to being finite flat. This is because, for unramified characters and , the group in this category is trivial, and the group is the same whether it is computed in the category of finite flat group schemes or as modules, as long as . The latter condition follows (for all the relevant extensions) from the assumption .**
The functor that associates to each object of the set of deformations of to which are minimal outside is represented by a complete Noetherian local -algebra . This follows from the proof of Theorem 2.41 of [DDT97]. If , we will sometimes denote by .
Let denote the Selmer group defined as the kernel of the map
[TABLE]
where runs over all primes of and:
- •
If , then .
- •
If , then is isomorphic to the subspace of
[TABLE]
consisting of elements with . (Note that each summand is a copy of .) We let denote the subspace corresponding to elements with and and (equivalently, ) unramified.
- •
If , then we define as follows: let be the subspace of matrices whose non-zero entries appear in the upper right block. We define and .
Let denote the corresponding dual Selmer group.
Lemma 4.8**.**
We have .
Proof.
The subspace is precisely set of elements mapping to the subspace . We have as a -module and hence is 2-dimensional since . The condition implies that and hence . It follows that . Thus
[TABLE]
We have and . The Euler characteristic formula implies that . Thus
[TABLE]
Finally, the condition on and implies that . It follows that and . This concludes the proof. ∎
Proposition 4.9**.**
The reduced tangent space of has dimension
[TABLE]
Proof.
The argument is very similar to that of Corollary 2.43 of [DDT97]. The reduced tangent space has dimension . By Theorem 2.18 of loc. cit. this is equal to
[TABLE]
where runs over all finite places of . The second term is equal to [math] and the third term vanishes (by the absolute irreducibility of and the fact that ). Now, we have:
- •
for ;
- •
for ;
- •
for (by [GT05, Prop. 10.4.1]); and
- •
.
This concludes the proof. ∎
The next result (on the existence of Taylor-Wiles primes) follows from the previous proposition and the proof of [Pil12a, Prop. 5.6].
Proposition 4.10**.**
Let and recall that we are supposing satisfies Assumption 4.1. Then and for any integer we can find a set of primes of such that
- (1)
. 2. (2)
* for each .* 3. (3)
For each , is unramified at and has four pairwise distinct eigenvalues. 4. (4)
.
In particular, the reduced tangent space of has dimension and is a quotient of a power series ring over in variables.
Example 4.11** (Examples of representations with big image).**
Suppose that .
- (1)
Let be an imaginary quadratic field not contained in . Let
[TABLE]
is a representation with determinant for some integer such that the images of and for any complex conjugation both contain are have totally disjoint fixed fields over . Then the representation
[TABLE]
preserves a symplectic form and has big image. 2. (2)
Suppose the image of is . Then has big image.
Proof.
The second claim follows immediately for by [Pil12a], Prop 5.8. For the first claim, it is an easy consequence of the fact that is perfect for that holds, and similarly, assuming that , that holds. Hence it suffices to find an element in the image with distinct eigenvalues and with as an eigenvalue for every irreducible constituent of . We first compute the representation . Note that the dual of and can be identified with and respectively. Over , we have an identification
[TABLE]
and over , we have an identification
[TABLE]
where is the Asai representation. Over for any , the character is trivial, and hence the image of under our assumptions is the group . Since and are always eigenvalues of any element acting on , it suffices to find an element which has distinct eigenvalues under and has an eigenvalue in . To be more precise, since we haven’t been careful about distinguishing the Asai representation from its quadratic twist, we shall find an element with eigenvalues both and . One can explicitly realize the Asai representation as follows. Let be the standard representation of over , and let be the representation of the exterior product . The element acts on via . The Asai representation is determined uniquely by the action of a fixed lift of complex conjugation , which acts on by the formula .
Consider the elements such that, with respect to some chosen basis ,
[TABLE]
Then acts on via the matrix
[TABLE]
with eigenvalues and . On the other hand, the action of this element via the Asai representation (and basis , , , ) is
[TABLE]
with eigenvalues , , and . The four eigenvalues are distinct as long as , or equivalently if . One can now choose and in . ∎
Remark 4.12**.**
*Suppose that is an imaginary quadratic field, and suppose that is an elliptic curve which neither has CM nor is isogenous (over ) to its Galois conjugate . We claim that Example 4.11 applies to the mod representations associated to the dual of for sufficiently large . The representations in this case are the duals of the representations associated to the abelian surface . By [Ser72], the Galois representations associated to the duals of and have images and determinants for all sufficiently large . Let and denote the corresponding extensions, so and are both isomorphic to , and and are both isomorphic to . By the simplicity of for , the only non-trivial quotients of are and . This implies that if is strictly larger than , then then either , or . In either case, the projective representations associated to and both factor through . Since all automorphisms of are inner, this implies that projective representations of and are isomorphic, and hence for some character which (by comparing determinants) is at most quadratic. Assume is sufficiently large so that has good reduction at all primes above and moreover that is unramified in . Then and are both finite flat at , which forces to be unramified at all primes above . But this implies that is unramified outside primes dividing the conductor and of and respectively. There are only finitely many such quadratic characters by class field theory. Hence, if there are infinitely primes for which the assumptions of Example 4.11 do not occur, then there exists a fixed character with and isomorphisms for infinitely many . Such an isomorphism (for a single ) implies that for all pairs of conjugate primes and of good reduction for , and hence, given infinitely many such , one deduces the equality . If is the (at most) quadratic extension in which splits, this implies (by Cebotarev) that the Tate modules (for any fixed prime) of and are isomorphic, and hence (by Faltings [Fal83]) that and are isogenous over .
5. Siegel threefolds
5.1. Level Structure
Recall that there are two conjugacy classes of maximal parabolic subgroups of represented by the Siegel parabolic which is block upper triangular with Levi
[TABLE]
and the Klingen parabolic which is block upper triangular with Levi
[TABLE]
These both contain the Borel subgroup . For each prime , these give rise to parahoric subgroups , , and of , namely, the inverse image of the corresponding parabolic subgroups over . (The group is called the Iwahori subgroup.) The Klingen parahoric subgroup contains a normal subgroup with (via projection onto ). For each prime , we also have the Paramodular group , which is the stabilizer in of , and is the intersection
[TABLE]
for values .
5.2. Cohomology of Siegel -folds
Let and be finite sets of primes of which are disjoint from each other and do not contain . By a slight abuse of notation, we will sometimes denote the product of the primes in by the same symbol . For each , let equal one of , , , , or the full congruence subgroup of level . For , we let and we define . For , we let and . Let for .
We assume that the subgroup is neat. (This will be the case if contains a prime where is the full congruence subgroup of level .) We let (resp. ) denote the Siegel moduli space of level (resp. ). This scheme classifies principally polarized abelian varieties together with a -level structure (resp. -level structure). (See [Pil12b, §4.1].) In each case we denote the universal abelian variety by .
If denotes one of the above spaces, we can choose a toroidal compactification of . The abelian scheme then extends to a semi-abelian scheme and the sheaf is a locally free -module of rank 2. For integers , we let . We also denote by , so, for example, is a line bundle. If is an -module, we will let denote the sheaf . The coherent cohomology groups are independent of the choice of toroidal compactification (see [Lan13, Lemma 7.1.1.4] and the proof of [Lan13, Lemma 7.1.1.5]). The Koecher principle states that there is an isomorphism
[TABLE]
We may therefore pass freely between the open variety and the (any) smooth projective toroidal compactification without comment when dealing with .
We choose toroidal compactifications and so that the natural map extends to a map . As explained in § 4.1.2 of [Pil12b], the universal subgroup over extends to . We then define the toroidal compactification . The resulting map is then finite étale with Galois group .
5.3. Vanishing results
Let denote one of the toroidal compactifications defined in the previous section. We first record some consequences of a vanishing theorem of Lan and Suh.
Theorem 5.1**.**
- (1)
Suppose that and . Then
[TABLE]
for . 2. (2)
Suppose that and . Then
[TABLE]
for . 3. (3)
Suppose that and . Then
[TABLE]
for .
Proof.
This follows from [LS13, Cor. 7.24] after unwinding definitions. We take the group scheme (in the notation of [LS13]) to be our . The groups correspond to the Siegel Levi and parabolic: . The set of dominant weights (resp. ) is our (resp. ) from Definition 2.1.
In this paragraph, we show that the subset as defined in [LS12, Defn. 6.3] corresponds to the set of those such that . As an intermediate step, we first show that corresponds to those such that:
- •
for ;
- •
.
To see this, we note the following: to lie in , by definition, the element must satisfy and must also lie in . The definition of in Definition 3.2 of [LS12] boils down to (the set in our case consists of the single embedding and the norm is defined near the beginning of §2.5). The dimension is defined in Definition 3.9 of [LS12] to be which is 3 in our case. Next, the set is defined in Definition 3.2 to consist of those for which . Finally, the set is defined in Definition 2.29 to consist of all dominant which satisfy the first condition above. This establishes the intermediate step. Now, if , then the largest of the is . Thus, we see that if and only if and .
The set , by Definition 2.29 of [LS12], is . By Lemma 7.2, Definition 7.14 (which is vacuous in our case) and Proposition 7.15 of [LS12], a weight lies in and is positive parallel if and only if .
If , then a pair of vector bundles , for is defined in [LS13]. Indeed determines an algebraic representation of over with highest weight (namely where is the standard representation of ) and the corresponding bundles are then defined by [LS13, Defn. 4.12]. We claim that
[TABLE]
(We note that the parameter does not change the underlying vector bundle, but does change the Hecke action on cohomology by a power of the similitude character.) Let , let denote the standard representation of and the subspace spanned by the first two standard basis vectors. Then is the standard representation of the -factor of and is the representation of corresponding to . The representation thus corresponds to . By [LS12, Example 1.22], we have (in the notation of that paper) . However, by definition, and we have . It follows that . We deduce that , as required.
With these preliminaries out of the way, we now apply [LS13, Cor. 7.24]. We take . (The condition that when boils down to in our case.) We take a positive parallel weight. We therefore have , and .
We now apply part 2 of [LS13, Cor. 7.24] successively with taken to equal each of the elements from Section 2. Note that each has length . If we take , then (ignoring the third component):
[TABLE]
Thus . Then [LS13, Cor. 7.24] implies
[TABLE]
for each . Taking and gives the first part of our proposition.
Similarly, if , then:
[TABLE]
Hence
[TABLE]
for . This gives the second part of the proposition.
Finally, we take , then:
[TABLE]
Hence
[TABLE]
for . This gives the last part of the proposition. ∎
It is interesting to compare the above vanishing result in characteristic with the following characteristic 0 vanishing results due to Blasius–Harris–Ramakrishnan, Mirković, Williams and Schmid. We have an identification
[TABLE]
where is the open compact subgroup used to define and is the compact-mod-center subgroup defined in Section 2.0.2. To any finite dimensional -representation of , there is an associated vector bundle on which is defined in [BHR94, Defn. 1.3.2]. This bundle has extensions to . In [BHR94], the bundle is denoted . We have . For each , we define:
[TABLE]
Let and denote the direct limit of and respectively over all levels . Let denote the corresponding limit of (including both an overline and a tilde in the notation was too cumbersome, hopefully no confusion will result).
Let denote the space of automorphic forms on which are square integrable modulo the centre . Let denote the space of cusp forms. For a representation of as above and , we define:
[TABLE]
Then we have the following result of Harris:
Theorem 5.2**.**
There are canonical maps, forming a commutative diagram:
[TABLE]
Moreover:
- (1)
The composition is injective for all , and is an isomorphism for . 2. (2)
The image of in contains .
Proof.
This follows from [Har90, Theorem 2.7 & Prop. 3.2.2]. ∎
For , we then define to be the image of the space in . Thus we have
[TABLE]
For , the space is semisimple as a -representation and we decompose:
[TABLE]
We let denote the subspace
[TABLE]
where the sum is over all those such that is essentially tempered. We define by replacing with in the definition of . We then define
[TABLE]
to be the image of . We may also define analogous spaces
[TABLE]
by applying -invariants to the constructions above, where is the level of .
Suppose now that is the irreducible representation of of highest weight , with respect to the system of positive weights fixed in § 2. We first of all observe that the bundle does not depend on . Indeed, let be the irreducible representation of highest weight . Consider the -equivariant bundles and on defined in [BHR94, §1.3]. (The superscripted ∨’s do not refer to dual bundles here.) Then by the definition of , it suffices to show that as -equivariant bundles. We have that , so we may take the underlying space of to be and the action to be for all . Then the map
[TABLE]
gives the required isomorphism . (Note however that the Hecke action on the cohomology of will depend on – changing the value of introduces a corresponding twist by a power of the similitude character in the Hecke action.)
For a dominant weight, we let denote the vector bundle associated to the irreducible -representation . We would like to compare these bundles to the bundles introduced in the proof of Theorem 5.1.
Definition 5.3**.**
Let . We let denote the canonical extension in the notation of the proof of Theorem 5.1, and we let .
We saw above that, as vector bundles over , we have:
[TABLE]
though the Hecke action on the cohomology of will depend on .
Lemma 5.4**.**
Let . Then, over , we have:
[TABLE]
compatibly with Hecke actions on cohomology.
Proof.
It suffices to prove the isomorphism over . Consider the short exact sequence:
[TABLE]
and the Poincaré duality pairing
[TABLE]
(See [LS12, §1.2]).
Expressed in terms of the functor of Lan–Suh, the short exact sequence becomes:
[TABLE]
and the bundle becomes . (See [LS12, Example 1.22].)
Similarly, over the short exact sequence becomes
[TABLE]
and is identified with . This follows from [Mil90, Example III.2.4]: if we take the point to be in the notation of Section 2.0.2, then the isotropic subspace corresponds to and corresponds to . As remarked at the end of Section 2.0.2, we have , and the similitude character corresponds to . Note also that the notation of [Mil90] refers to de Rham homology (see §I.3).
It follows that, over , we have and . Thus,
[TABLE]
This is compatible with Hecke action on cohomology since all isomorphisms respect the equivariant constructions. ∎
The Weyl chambers are defined in Section 2.0.1. We have
[TABLE]
Theorem 5.5**.**
Let . Then:
[TABLE]
for all such that
[TABLE]
Proof.
In Section 2.0.2, we identified with . Under the resulting identification of with , the chambers for above correspond to Weyl chambers in . Let , regarded as an element of . Then we’ve seen above that
[TABLE]
Suppose that
[TABLE]
Then by Theorem 5.2, there is some in such that
[TABLE]
By a theorem of Mirković [Har90, Theorem 3.5], is a discrete series or limit of discrete series. Hence, using the Harish-Chandra parameterization, we may write for some Weyl Chamber and a weight . By [BHR94, Theorem 3.2.1], it follows that:
[TABLE]
and
[TABLE]
where is the system of positive roots determined by the chamber . For , we have . Hence we must have and . However, , so . We have, have:
[TABLE]
Thus, we deduce that lies in . This is equivalent to the condition in statement of the theorem. ∎
We also record the following:
Theorem 5.6**.**
Let , let , and let , regarded as an element of . Suppose that in contributes to .
- (1)
The infinitesimal character of is given under the Harish-Chandra isomorphism by:
[TABLE] 2. (2)
Let denote the transfer of to . Then the infinitesimal character of is given under the Harish-Chandra isomorphism by where:
[TABLE] 3. (3)
If furthermore, is tempered, then is a discrete series or limit of discrete series representation, and is given under the Harish-Chandra parameterization by:
[TABLE]
Proof.
For the first part, we have that
[TABLE]
It follows from [BHR94, Theorem 3.2.1] that the infinitesimal character of is equal to . The second part can be inferred from [Sor10, §2.1.2]. The last part is due to Mirković and was established in the proof of Theorem 5.5. ∎
Definition 5.7**.**
A weight such that lies in the interior of a unique Weyl chamber is said to be a discrete series weight or a regular weight. If lies in the intersection of exactly two of Weyl chambers , we say it is a limit of discrete series weight or a non-regular weight.
From the explicit description of the Weyl chambers above, we see that the limit of discrete series weights thus come in 3 families:
[TABLE]
Note that for the corresponding families of vector bundles , the first and third are interchanged under the Serre duality map while the second family is stable under this operation. (Up to interchanging the canonical and subcanonical extensions, of course.) The preceding theorem implies that for all , we have:
[TABLE]
(Technically, we should normalize the Hecke action on the cohomology of before we adjoin the subscripts or . See Section 5.5 below.) From the result of Lan and Suh, we deduce the following characteristic analogue of these vanishing results for limit of discrete series weights.
Corollary 5.8**.**
- (1)
For , we have
[TABLE]
for . 2. (2)
For , we have
[TABLE]
Proof.
The vanishing results for the subcanonical extensions follow directly from Theorem 5.1. The fact that
[TABLE]
in the second part then follows from Serre duality since:
[TABLE]
∎
5.4. Torsion Classes
It seems natural to ask whether one can (explicitly or otherwise) construct classes in which do not lift to characteristic zero. Let us recall what happens for classical modular forms of weight one.
Suppose that denotes (for this paragraph) the classical modular curve. A non-Eisenstein Hecke eigenclass in gives rise to an irreducible Galois representation . Suppose that the image of contains for some . Such a representation cannot be the mod- reduction of a representation with image isomorphic to some subgroup of , and thus by [DS74], the corresponding mod- class does not lift to characteristic zero. (Explicit examples were first found by Mestre for and .) A slightly different example can be given as follows. Suppose that . Consider a non-Eisenstein Hecke eigenclass in which is new of level . Then the restriction of to is rank two unipotent. Such a class cannot lift to characteristic zero at minimal level, because otherwise (by [DS74] again) the corresponding representation would simultaneously have finite image and yet would be unipotent and hence infinite. Note that (unlike in the first example) it may well be possible to lift to characteristic zero at some non-minimal level. Examples of the second kind have a natural analogue in the Siegel context.
Suppose that has type U3 at . If is any minimal lift of , the image of under will be rank three unipotent. This will also be true for the restriction of to any finite extension of . Yet, by a theorem of Grothendieck ([Gro72], Exp.9) the image of inertia of a semistable abelian variety is rank two unipotent, i.e., satisfies . If follows that cannot contribute to a motive associated to an abelian variety. Conjecturally, Siegel modular eigenforms of weight should be associated to abelian varieties of dimension equipped with an injection for some totally real field of degree . This suggests that such representations do not admit minimal lifts to characteristic zero when . It would be interesting to produce an explicit example of such a modular representation. Recall that there is an exceptional isomorphism coming from identifying the Galois group of over with either the symmetries of the -torsion points on the universal abelian surface or the action of on the (generically) Weierstrass points [BFvdG08]. The unipotent element such that has conjugacy class (this class is preserved by the exotic automorphism of ). In particular, if is a sextic field with Galois closure containing and acting irreducibly on , and is an odd prime such that , then should give rise to such a representation. Here is an explicit example coming from a slight variation of this argument. Suppose that is the abelian surface corresponding to the Jacobian of the curve:
[TABLE]
which has good reduction outside . The representation has image , and the image of inertia at is conjugate to . Hence should give rise to a mod- torsion class with trivial level structure outside , and the following level structure at these primes:
- (1)
Iwahori level structure at , 2. (2)
Paramodular level structure at and .
Note that this conjectural torsion class does conjecturally lift to characteristic zero at some level since one expects that is modular. (The conductor of is .)
Common to both examples is the non-existence of automorphic representations (associated to either classical modular forms of weight or Siegel modular forms of weight ) such that is the Steinberg representation. For classical modular forms, the non-existence of such follows from a consideration of the corresponding Galois representations, an argument which does not obviously generalize to the Siegel case (since one does not know how to attach an abelian variety to such a form). However, following argument (due to Kevin Buzzard) generalizes nicely:
Theorem 5.9**.**
If is a cuspidal automorphic representation associated to a Siegel modular form of weight , then is not the Steinberg representation for any .
Proof.
In weights with , the corresponding Frobenius eigenvalues of the Weil–Deligne representation associated to a Steinberg representation are
[TABLE]
where . Moreover, the corresponding eigenvalue of is . In particular, if , then and the corresponding eigenvalue of is , contradicting the integrality of Hecke eigenvalues (which is a consequence of the integrality of the -expansion). ∎
5.5. Hecke operators
For simplicity, we denote the schemes and of § 5.2 by and respectively. Let denote an -module.
Let be a rational prime. We define matrices
[TABLE]
and regard them as elements of . If (resp. ) we will consider the Hecke operators (resp. ) acting on each of the spaces
[TABLE]
as in [SU06, §1.1.6] or [Til12, §8]. We also denote by . The definition of Hecke operators given in [SU06] or [Til12] applies when or when is invertible on . The remaining cases when requires more care. In Lemma 8.8 below we show that and exist as operators in cohomological degree over .
Similarly, if , we have operators on . As in § 5.2, the map is Galois with Galois group . This gives rise to an action of on . For each , we denote the corresponding operator on by .
Finally, we shall also exploit Hecke operators of a slightly different flavour, which we denote by and respectively. In the context of this paper, they may be considered formal operators on -expansions. (They can also be interpreted more classically as Hecke operators with level structure at .) Their key property is that the operators and act by and for large enough weights, including plus any non-trivial multiple of for . Their explicit definition in given in Lemmas 8.3 and 8.4.
Remark 5.10**.**
We note that our definition of the Hecke action is the ‘natural’ one twisted by (see [SU06, 1.1.6a]). We saw in the proof of Theorem 5.1, that for the natural action, there is an isomorphism , and hence over , an isomorphism . Under our normalization of the Hecke action on , we therefore have and, over , where we take:
[TABLE]
Remark 5.11**.**
In view of the previous remark, we will identify the set with the subset of . Thus it makes sense to speak of .
Remark 5.12**.**
Let and let . For , we can similarly define a Hecke operator associated to on the cohomology of : this operator acts as . Now, suppose that in contributes to
[TABLE]
where . It follows that the central character of is given by:
[TABLE]
Furthermore, by Proposition 5.6, the transfer of to has infinitesimal character where
[TABLE]
We now introduce some Hecke algebras. We note that in the following definition, we work over rather than .
Definition 5.13**.**
Let with .
- (1)
The anaemic Hecke algebra
[TABLE]
is the -algebra generated by the operators for . 2. (2)
Similarly, we let be the algebra generated over by the operators for and for . When , we have and we denote this algebra by . 3. (3)
Finally, denotes the -algebra generated by the operators and . (The existence of these operators is established in Lemma 8.8.) If , then we denote by .
Note that the algebras preserve the subspace
[TABLE]
We will also need to consider ordinary Hecke algebras. Let denote the ordinary idempotent associated to the Hecke operators and . (We will only consider this operator in contexts where the direct limit makes sense.) We define:
[TABLE]
for or . We thus have:
[TABLE]
for such .
Definition 5.14**.**
Let with . We define the ordinary Hecke algebras (resp. , ) to be the image of (resp. , ) in
[TABLE]
6. Galois representations associated to modular forms
As in Section 5.2, let and be finite sets of primes of which are disjoint and do not contain . We allow the possibility that . We let and be open compact subgroups of as in Section 5.2, and we let and be the corresponding Siegel threefolds, defined over .
6.1. The Hasse invariant
We begin with a definition.
Definition 6.1**.**
Let be the Hasse invariant and let be a lift of , for some which we fix for the rest of this section.
The existence of such a lift follows from the Koecher principle and the ampleness of on the minimal compactification of .
Lemma 6.2**.**
Let with . Then:
- (1)
Multiplication by defines an injection:
[TABLE]
which is equivariant for the Hecke operators for each and the operators for . 2. (2)
If , then this map is also equivariant for the operators and .
Proof.
It is well-known that multiplication by is injective and commutes with Hecke operators away from . We may thus assume that . It is shown in [Pil12b, §A.3] and [Til12, Lemme 8.7] that multiplication by commutes with the operators and . Since , [Til12, Lemme 8.5] implies that and . It follows that and also commute with . ∎
Suppose that with . By the proof of [Pil12b, Théorème 6.2], there exists an integer as in the following definition.
Definition 6.3**.**
Let be an integer such that for all , , and , the cohomology group
[TABLE]
vanishes.
Note that for such , the maps
[TABLE]
are both surjective. The same is true over and .
Lemma 6.4**.**
Let with and let . There exists an integer such that, if we set , then:
- (1)
, and 2. (2)
multiplication by defines an injection
[TABLE]
which is equivariant for the Hecke operators for each and the operators for each .
Proof.
The second property holds as long as (see [Gol14, Theorem 6.2.1]), so it suffices to take equal to any integer greater than and divisible by . ∎
Let with . Recall that the Hecke algebras
[TABLE]
were defined in Definition 5.13.
Remark 6.5**.**
For with and each , we have
[TABLE]
Let (resp. ) denote the annihilator of the former space in (resp. ). If and are as in Lemma 6.4, then multiplication by induces a surjective map:
[TABLE]
where . In particular, any maximal ideal of pulls back under this map to a maximal ideal of which we will also denote by .
Similarly, Lemma 6.2 induces a map
[TABLE]
where and, if , this extends to a map
[TABLE]
6.2. Preliminaries on Galois representations
We now turn our attention to Galois representations.
Proposition 6.6**.**
Let and let . There is a continuous character
[TABLE]
such that:
- (1)
* is crystalline with Hodge–Tate weight ;* 2. (2)
for all , is unramified at and .
In particular,
[TABLE]
for some finite order character .
Proof.
This follows from the proof of [Tay91, Proposition 4], noting that we have twisted the Hecke action by (see Remark 5.12). ∎
Definition 6.7**.**
For a prime , we introduce the Hecke polynomial:
[TABLE]
If a modular form is an eigenform for a collection of Hecke operators , we denote by the map such that for each . In particular, if is an eigenform for the operators at , then we can specialize the polynomial at to get .
Proposition 6.8**.**
Let with . Let and . Let
[TABLE]
be a cuspidal eigenform for the operators for all and . Then there is a continuous semisimple representation
[TABLE]
defined over a finite extension such that:
- (1)
The similitude character is given by
[TABLE] 2. (2)
* is unramified at primes , and at such primes, the characteristic polynomial of is given by:*
[TABLE] 3. (3)
The restriction is crystalline with Hodge–Tate weights . If, in addition, is an eigenvalue of the Hecke operators at , then the characteristic polynomial of on is . 4. (4)
Suppose is ordinary in the sense that it is an eigenform for and with eigenvalues being -adic units. Then has distinct eigenvalues with -adic valuations , respectively. Furthermore, is conjugate in to a representation of the form
[TABLE] 5. (5)
If is absolutely irreducible, then it satisfies local-global compatibility at all primes.
Proof.
The existence of follows from the work of Taylor, Laumon and Weissauer. Some of the finer properties are due to Urban, Genestier–Tilouine, Gan–Takeda, Sorensen and Mok. Fix an embedding and let be an cuspidal automorphic representation of which contributes to the -part of under the isomorphism of the first part of Theorem 5.2 (with , as in Remark 5.10).
We take be the representation of [Mok14, Theorem 3.5] associated to . When is simple, generic in the terminology of [Mok14], the representation can be conjugated to take values in , by the main theorem of [BC09]. In the remaining cases, the representation is reducible and can easily be seen to be symplectic. The usual Baire category argument implies that can be defined over a finite extension of . Thus in all cases, we may take . Parts (1)– (5) follow from the statement of Theorem [Mok14, Theorem 3.5]. ∎
Lemma 6.9**.**
Let with and let be a maximal ideal of . Then there is a continuous semisimple representation
[TABLE]
such that for each , the restriction is unramified and has characteristic polynomial .
If is absolutely irreducible, then the representation preserves a symplectic pairing and hence, after conjugation, we have a representation:
[TABLE]
Proof.
Choose an integer as in Lemma 6.4 with taken to equal and let . Let be an eigenform for . Let be the Galois representation associated to by Proposition 6.8 and take to be the semisimplification of a reduction of to characteristic . The resulting representation is defined over the algebraic closure of , but by the argument of [CHT08, Prop. 3.4.2], we see that after conjugation, it may be defined over .
For the last part: let the transfer to (given by [Art04]) of the automorphic representation generated by . Then descends to an automorphic representation of a unitary group over . The family of -adic Galois representations associated to is the same as that associated to . Thus, [BC11, Theorem 1.2] and the fact that is absolutely irreducible implies that is symplectic. The same is then true of (by absolute irreducibility). ∎
Remark 6.10**.**
By the same argument, the previous result holds if we replace by or .
Definition 6.11**.**
We say that is non-Eisenstein if the representation is absolutely irreducible.
6.3. Galois representations in cohomological weights
Let be a representation as in Section 4. By Assumption 4.2 and by Cebotarev, there exist infinitely many primes such that no pair of eigenvalues of have ratio and . Choose any such which is disjoint to and all primes of bad reduction of . We take and a possibly empty set of primes disjoint from . We define a compact open subgroup of as follows:
- (1)
If or is unramified at and , then . 2. (2)
If is of type U3, then , where is the Iwahori subgroup. 3. (3)
If is of type U2, then , where is the Klingen parahoric. 4. (4)
If is of type U1, then , where is the paramodular group at . 5. (5)
If is of type P, then (and is prime to ). 6. (6)
If is of type H, then is the full congruence subgroup of level . 7. (7)
If , then is the full congruence subgroup of level .
We then let and as in Section 5.2.
Let with be a regular weight and let be a maximal ideal of (the ordinary Hecke algebra with ) with residue field . Then pulls back to an ideal of which in turn pushes forward to an ideal of . We denote both of these ideals by , in a slight abuse of notation. The ideal is maximal but need not be maximal – there may be multiple maximal ideals of that contain it. We make the following assumption:
Assumption 6.12**.**
Let , and be as above. Then:
- (1)
We have . In particular, since is absolutely irreducible, is non-Eisenstein. 2. (2)
For each , and is a direct sum of four pairwise distinct characters with Frobenius eigenvalues . We assume the eigenvalues have been labeled so that the plane is isotropic, and hence .
We let be any maximal ideal which contains . The representations , and are all isomorphic.
We now turn to the prime . Let be the elements associated to at the beginning of Section 4. For or , we define:
- •
to be the subspace of given by the image of the idempotent , where and are lifts of and to .
- •
(resp. , ) to be the image of (resp. , ) in
[TABLE]
We also make the analogous definitions with and swapping roles.
Theorem 6.13**.**
Let , and be as above, and suppose that Assumption 6.12 holds. Let and . Then there exists a continuous representation
[TABLE]
lifting and such that:
- (1)
The similitude character is given by:
[TABLE]
where is a finite order character unramified at which is trivial modulo . 2. (2)
For each prime , is unramified at and has characteristic polynomial . 3. (3)
There are units satisfying
[TABLE]
and such that:
- (a)
We have and ; 2. (b)
* is conjugate in to a representation of the form:*
[TABLE] 4. (4)
After twisting by the unique square-root of which is trivial modulo , the deformation of satisfies properties (2)– (5) of Definition 4.6.
Remark 6.14**.**
We expect that, under the given assumptions, the Hecke rings in question are torsion free. However, we avoid having to prove this by passing to sufficiently high weight.**
Proof.
As in Remark 6.5, denotes the annihilator of in . Since , it suffices to construct, for each , a representation satisfying the conditions of the theorem. We thus fix an . Choose an integer as in Lemma 6.4 and let . By Lemma 6.4 and Lemma 6.2 (2), multiplication by restricts to a map:
[TABLE]
This in turns gives rise to a surjective map . Thus it suffices to prove the result in weight .
Since , we have that
[TABLE]
and hence we may regard as acting faithfully on both
[TABLE]
Thus we have
[TABLE]
where the are a finite collection of finite extensions of , one for each minimal prime of . Each such minimal prime corresponds to an eigenform for . The eigenform has an associated Galois representation for some finite extension , by Proposition 6.8. After conjugation, we may assume that each reduces to . By the argument of the proof of [CHT08, 3.4.4], using [GG12, Lemma 7.1.1] in place of [CHT08, 2.1.12], we see that the representation descends to a representation . It follows from Proposition 6.8 that satisfies properties (1)–(3) of the theorem. For part (3), note that factors as
[TABLE]
for units . We also have and in (by definition of the idempotent ). Since , we deduce that and .
To show that satisfies properties (2)– (5) of Definition 4.6, it suffices to show that each does so. In fact, property (2) has already been established with the exception of the prime . If , then (by our assumptions) as a -module contains no subquotient isomorphic to , and so . Since , it follows that consists entirely of unramified classes. In particular, all lifts of are automatically unramified at . Since is non-Eisenstein, it follows from Proposition 6.8(5) that satisfies local-global compatibility at all primes. Thus we may apply the results of [Sor10, §4.5]. We now turn to property (3) of Definition 4.6. If is of type U3, then is unipotent and generated by a conjugate of . Since , [Sor10, Corollary 1] implies that is topologically generated by a conjugate of , or . The latter two cases are incompatible with the residual representation being of nilpotent rank 3. Similarly, if is of type U2, then and [Sor10, Corollary 1] implies that is topologically generated by a conjugate of or . The latter case is incompatible with the residual representation being of nilpotent rank 2. Finally, if is of type U1, then . It then suffices to note, following [Sor10, §4.5], that the corresponding representation is para-spherical, that is, has a non-zero fixed vector by a non-special maximal compact subgroup, namely itself. This establishes property (3). For property (4), suppose that is of type P. Then . It follows from [Sor10, Corollary 1] that has no invariants on the automorphic representation generated by (as otherwise would be unipotent, contradicting the assumption on at ). Thus acts through a non-trivial character on the space of invariants. By [Sor10, Corollary 3] all such characters have to lift the character . However, since is prime to , there is a unique such character, and the result follows from [Sor10, Corollary 3].
Finally, we turn to property (5) of Definition 4.6. Let , and recall that . Let be the automorphic representation generated by . Consider first the case where has non-trivial -invariants. Then is a subquotient of an unramified principal series. By part (2) of Assumption 6.12 and [GT05, Prop. 3.2.3], we see that is unramified. In this case, property (5) of Definition 4.6 certainly holds for . In the remaining case, where has no non-trivial -invariants, we see that acts through a non-trivial character on , and the required property holds by [Sor10, Corollary 3]. ∎
6.4. Galois representations in low weights
We let , , and be as in the previous section. Recall that in Section 4, we fixed two units associated to . We now let with denote a non-regular weight.
Definition 6.15**.**
We say that is Katz modular of weight if there exists a maximal ideal of such that:
- (1)
We have , and 2. (2)
There exists a form such that
[TABLE]
We now make the following assumption:
Assumption 6.16** (Residual Modularity).**
We assume:
- (1)
* is Katz modular of weight with associated maximal ideal and eigenform ,* 2. (2)
For each , and is a direct sum of four pairwise distinct characters with Frobenius eigenvalues . We assume the eigenvalues have been labeled so that the plane is isotropic, and hence .
We let be any maximal ideal of containing .
Let be the idempotent
[TABLE]
where and are lifts of and to , and define:
[TABLE]
The assumption that is Katz modular implies that this space is non-zero after localization at . We let denote the image of in
[TABLE]
Our main result in this section is the following.
Theorem 6.17**.**
Let , with and be as above and suppose that Assumption 6.16 holds. In addition, suppose that:
[TABLE]
Then there exists a representation
[TABLE]
which is a minimal deformation of outside .
Proof.
As in the proof of Theorem 6.13, it suffices to prove the existence of an appropriate representation for each . We thus fix an . By Theorem 8.13 below, there exists a power of such that we have injections:
[TABLE]
where . These in turns give rise to surjections:
[TABLE]
where . The first of these surjections together with Theorem 6.13 implies the existence of a representation satisfying all of the required properties, except for conditions (1) and (6) of Definition 4.6. However, we deduce from the existence of both surjections that the representation contains two distinct rank-1 unramified submodules (spanned by basis vectors) – one of which having Frobenius eigenvalue lifting , and the other having Frobenius eigenvalue lifting . By Nakayama’s Lemma, we deduce that contains an unramified rank-2 submodule of the form required by condition (6) of Definition 4.6. In order to obtain a representation that also satisfies condition (1) of Definition 4.6, we note that where is a finite order character of -power order which is unramified outside . Since is odd, we can find a square root of and twist by the inverse of this square root. The resulting representation now satisfies all required properties. ∎
7. Properties of cohomology groups
As in Section 5.2, let and be finite sets of primes of which are disjoint and do not contain . We allow the possibility that . We let and be open compact subgroups of as in Section 5.2, and we let and be the corresponding Siegel threefolds, The goal of this section is to prove Theorems 7.2 and 7.11 below.
7.1. Taylor–Wiles primes
Fix with . Let be a non-Eisenstein maximal ideal of . The ideal gives rise to ideals of and which we also denote by (see Section 6.3). We will need the following assumption (c.f. Assumptions 6.12 and 6.16):
Assumption 7.1**.**
For each , we have , and is a direct sum of four pairwise distinct characters with Frobenius eigenvalues . We assume the eigenvalues have been labeled so that the plane is isotropic, and hence .
For , we let be elements lifting , , . The point of the above assumption is to rule out the possibility of newforms at level :
Theorem 7.2**.**
Let and be as above, and suppose that Assumption 7.1 holds. Let denote the ideal of containing together with the elements and for each . Then is maximal and there is an isomorphism
[TABLE]
which is equivariant for the operators for each as well as for the operators and .
Here is the natural inclusion and is defined as follows. For , let denote the Hecke operator
[TABLE]
and let denote the idempotent
[TABLE]
Then the ’s commute with one another and denotes their product.
For compactness, we will make use the alternative notation , and . In sufficiently high weight, Theorem 7.2 is due to Genestier and Tilouine:
Theorem 7.3**.**
Suppose is such that and are 0 for all . Then the map
[TABLE]
is an isomorphism. An explicit inverse is given by the composition
[TABLE]
where (which is prime to ) and is the trace map associated to .
Proof.
By the assumption of cohomology vanishing, it suffices to prove both statements with replaced by . Indeed, if the map over is surjective, then so too is the map over . Furthermore, if is an inverse over , then the fact that its defined over implies immediately that it also gives an inverse over . The proof of the corresponding result over follows exactly as in the proof of [GT05, Proposition 11.1.2]. ∎
Using this result and the Hasse invariant , we can now establish Theorem 7.2 at the level of -torsion. (Recall that cohomology in degree 0 over can be identified with -torsion in degree 0 cohomology over .) Note that, for any weight , the cohomology vanishing assumption of the previous theorem holds in weight as long as (where is defined in Definition 6.3).
Lemma 7.4**.**
Let with . Choose an integer such that . Let . Then the following diagrams are co-cartesian:
[TABLE]
[TABLE]
In particular, the left hand vertical maps are mutually inverse isomorphisms.
Proof.
Note that the right hand vertical maps are mutually inverse isomorphisms by Theorem 7.3 and the choice of . The diagrams are commutative because commutes with all Hecke operators at the primes in (Lemma 6.2). Now, let and let . Note that can be recovered from via the formula . We need to show that is divisible by if and only if is divisible by . But this follows immediately by the commutativity of the diagrams above: if , then , and if , then . (Note that since and are smooth (and in particular irreducible) over , multiplication by is injective on .) ∎
We will need the analogous result for forms on the non-ordinary locus: let (resp. ) denote the non-ordinary locus of (resp. ).
Lemma 7.5**.**
Let with . Then the map
[TABLE]
is an isomorphism with inverse .
Proof.
We first show that the result is true in sufficiently high weight. More precisely: let . We let and . We have a commutative diagram:
[TABLE]
The choice of guarantees that the rows are short exact sequences. From the previous lemma, we deduce that the right hand vertical map is an isomorphism with inverse .
Now we imitate the proof of the previous lemma to deduce the result in smaller weights. For this we use the existence of the Hasse invariant
[TABLE]
Such a form was constructed in unpublished work of the second author with Goldring, but is also constructed in greater generality in [Box15] and [KG15]. In [Box15, Theorem B.2] (see also [Box15, Theorem 6.2.3]), it is shown that extends to the boundary, (by the normality of the -rank 1 locus) and that multiplication by is Hecke equivariant away from (see [Box15, Theorem 4.5.4(3)]). (It is also true, but not relevant here, that vanishes on the 1-dimensional Ekedahl–Oort stratum of to precise order : see the references in proof of Theorem 8.10 below for more discussion on this point.)
We choose an integer such that . Let . Then we have a commutative diagram:
[TABLE]
The right hand vertical map is an isomorphism with inverse by the first paragraph. The lemma now follows by the same argument as the previous lemma. ∎
We will also need the analogous result for first degree cohomology over :
Lemma 7.6**.**
Suppose where . Then the map
[TABLE]
is an isomorphism with inverse .
Proof.
If , then both sides of the map are zero, so we may assume that . Let , and let and . Consider the diagram with exact rows:
[TABLE]
The first three vertical maps are isomorphisms with inverse by the previous two lemmas. We deduce that the rightmost vertical map above is an isomorphism with inverse . This proves the lemma in weight . The general case then follows by a similar argument using a reverse induction on . ∎
We are finally in a position to prove Theorem 7.2 in the general case.
Proof of Theorem 7.2.
For each , let . We have a commutative diagram:
[TABLE]
The vertical maps on the ends are isomorphisms by Lemma 7.4 and Lemma 7.6. By induction on and the Five Lemma we deduce that the map
[TABLE]
is an isomorphism for all . This shows that the map of Theorem 7.2 is an isomorphism after passing to -torsion, for any . The result follows. ∎
7.2. The balanced property
In this section we assume that is a limit of discrete series weight, where . Let be a quotient of and let denote the corresponding sub-cover of . If is a vector bundle on , we define
[TABLE]
for all . Note that is the dualizing sheaf on .
We now take , so that . Here we use our bound to deduce that there is an equality as -modules. Thus, acts on . We fix a non-Eisenstein maximal ideal of . We will need the following assumption:
Assumption 7.7**.**
The space is trivial.
There is a slight abuse of notation here in that does not act on . The localization at refers to the localization at the corresponding maximal ideal of the polynomial ring over generated by the Hecke operators.
Remark 7.8**.**
We note that if , then the assumption above holds, even before localization at , by Theorem 5.1.
Lemma 7.9**.**
Suppose Assumption 7.7 holds. Then is -torsion free.
Proof.
The claim is equivalent to the divisibility of . Since is flat over , there is an exact sequence
[TABLE]
Taking cohomology, this reduces to the claim that vanishes. ∎
The following lemma uses only the assumption that is non-Eisenstein: it holds in all weights and in all prime to levels. We just state it in the case we need:
Lemma 7.10**.**
The map
[TABLE]
is an isomorphism for all .
Proof.
Let denote the boundary of . It suffices to show that the boundary cohomology
[TABLE]
vanishes for all . However, over the cohomology of the boundary is computed by the nerve spectral sequence:
[TABLE]
See [HZ01] (3.2.4). Here is a -parabolic of and is its parabolic rank. By [HZ01, Corollary 3.2.9], and freely using the notation of this paper. the space is the space of -invariants in:
[TABLE]
If is the Klingen parabolic, then and . If is the Siegel parabolic or the Borel subgroup, then is trivial and is the Levi component of (and hence is either or ). In all cases, is the canonical extension of an automorphic vector bundle on the Shimura variety and is a local system on associated to an algebraic representation of . See [HZ94, (3.6.1)] for the highest weight formulas. The functor is an intermediate induction defined in [HZ01, (3.2.8)].
Since each of the groups and are products of copies of and , we see that to any Hecke eigenclass in any , we can associate a compatible system of reducible -valued -adic representations of . Since the ideal is non-Eisenstein, it follows that , as required. ∎
We come to the main result of this section:
Theorem 7.11**.**
Let be a quotient of which is of -power order. As above, let with and let be a non-Eisenstein ideal of . Suppose that Assumption 7.7 holds. Then the -module
[TABLE]
is balanced in the sense of Definition 3.2.
Proof.
The argument proceeds exactly as in the proof of Prop. 3.8 of [CG18]. If we let and , then the defect is given by:
[TABLE]
where is the -rank of . Thus we need to show that .
Let . Applying Pontryagin duality to the Hochschild–Serre spectral sequence, we get a spectral sequence:
[TABLE]
This spectral sequence tells us that:
- (1)
, and 2. (2)
we have a short exact sequence
[TABLE]
To prove that , it follows from the second point that it is sufficient to show that is free of rank at most over . Lemma 7.9 tells us that this space is -torsion free. Passing to characteristic 0 and using the first point, we are therefore reduced to establishing the inequality:
[TABLE]
In other words, we need to show:
[TABLE]
By Lemma 7.10, we are reduced to showing that
[TABLE]
where denotes the interior cohomology (the image of in ).
As recalled in Theorem 5.2, the interior cohomology can be computed in terms of square integrable automorphic forms on . By Remark 5.10, the cohomology of agrees with that of where and . Theorem 5.2 then implies that:
[TABLE]
where denotes the multiplicity of in . Fix a degree and let be such that contributes to under the above inclusion (for some embedding ). Let denote the transfer of to under the Classification Theorem of [Art04]. Then, by Remark 5.12, the infinitesimal character of is . Let denote the central character of .
The representation falls into one of 6 classes (a)–(f) given in Section 5 of [Art04]. We show now that we can rule out all classes other than class (a). In cases (e) and (f), is an isobaric sum of idele class characters. In case (d), is of the form where is an idele class character and is a cuspidal automorphic representation of such that its central character satisfies . Considering the infinitesimal character of , we see that we must have and must correspond to a classical modular eigenform of weight 2. In case (c), there is a cuspidal automorphic representation of orthogonal type of such that . Being of orthogonal type means that is induced from a quadratic extension of . In case (b), where the are distinct cuspidal automorphic representations of with . Considering the infinitesimal character of and the fact that the have the same central character, it follows that are both associated to classical modular eigenforms of weight . Thus, in all cases (b) – (f), we can associate a compatible family of reducible -adic Galois representations to . This contradicts the fact that is non-Eisenstein.
The only remaining case is case (a) where is a cuspidal automorphic representation of that is -self dual. By Clozel’s Purity Lemma [Clo90, Lemme 4.9], is essentially tempered. (We thank Olivier Taïbi for pointing this out to us.) It follows that is also essentially tempered, since its -parameter is essentially bounded. Then by Theorem 5.6(3), is the limit of discrete series representation where . Furthermore, by a Theorem of Wallach [Mok14, Theorem 2.3], it follows that is cuspidal.
By the first part of Theorem 5.2 , the cuspidal cohomology maps injectively to the interior cohomology:
[TABLE]
where is the multiplicity of in . Thus, at this point, we can prove that the dimensions
[TABLE]
are equal for if we can establish:
- (1)
The spaces have the same dimension for . 2. (2)
The representation also lies in ; 3. (3)
The multiplicities , , and are all equal.
The first point follows from [Har90, Theorem 3.4] which says that both spaces are one dimensional. The second point follows from [Art04]. Indeed, since is essentially tempered, the local packet (where , in the notation of [Art04]) is in fact an L-packet by [Mok14, Theorem 2.1]. Furthermore, it consists of the pair of representations (see [Mok14, §3.1]). Since the group is trivial in Case (a) of [Art04], it then follows from part (ii) of the Classification Theorem that is also automorphic. Finally, for the third point, the theorem of Wallach quoted above implies that and are both cuspidal. Part (iii) of the Classification Theorem then implies that each of the multiplicities in point (3) is 1. We have thus shown that
[TABLE]
as required. ∎
8. -expansions of Siegel modular forms
As in Section 5.2, let and be finite sets of primes of which are disjoint and do not contain . We allow the possibility that . We let and be open compact subgroups of as in Section 5.2, and we let and be the corresponding Siegel threefolds, with open subspaces and , all defined over .
8.1. -expansions of Siegel modular forms
(For more background and details on the results quoted in this section, see § 3.1 of [Til06].) Recall that has good reduction at . Let be an -module (we will exclusively be interested in the case when either for some , or when ). Let be a weight and associate to the representation
[TABLE]
of over . Associated to , we also have the vector bundle . There is a -expansion map:
[TABLE]
Theorem 8.1**.**
The -expansion map is injective.
Proof.
This is a standard fact (see, for example, Prop. 3.2 of [Til06]). ∎
8.2. Explicit Formulae
Let be the product of the primes in and , so that has good reduction outside . Let be a -module and thus a -algebra. Any has a “-expansion”:
[TABLE]
where denotes the positive semi-definite matrices which take on -integral arguments for integral vectors, or equivalently,
[TABLE]
The set is naturally a subset of . The group acts on by the following formula:
[TABLE]
where the right hand side is multiplication. We may naturally extend the definition of for by setting for all not in . In any -expansion, the coefficients will also vanish unless the denominators occurring in are bounded by some fixed power of which depends only on the level structure. (Since our arguments in this section are all -adic, there is little harm in imagining that .) Let be the standard representation of over . The elements are elements of the representation , where, if has weight , then
[TABLE]
Let denote the corresponding representation. The representation extends to a homomorphism from to over which we denote by , where once more only depends on and (more relevantly) preserves integrality. We may write the -expansion of a form as
[TABLE]
where satisfies, for , the equality
[TABLE]
Here is the congruence subgroup of defined on p.807 of [Til06]; since we are working at spherical level at the group has level prime to . (It will do the reader little harm to pretend that is just .)
Remark 8.2**.**
We shall assume that either or . Since we are most interested in representations with similitude character is equal to , the oddness condition forces the congruence , and so if then . In cases (coming from Taylor–Wiles primes) where there is non-trivial Nebentypus character at the auxiliary primes , we may twist (at the cost of increasing the level at ) to force the Nebentypus character to be trivial. The only change this has is to make the -expansions below less unpleasant — the addition of a Nebentypus character only introduces a notational difficulty. We note, however, that with non-trivial Nebentypus character the case of weight is possible, but our arguments would not cover this case.**
8.3. Hecke Operators at
Since we will exclusively be interested in Hecke operators at , we drop the subscript from the notation. Similarly, we drop the subscript ,and so and are denoted and , whereas and are denoted and respectively. One has the following explicit description of the Hecke operator :
Lemma 8.3**.**
In weight there is an identity of formal operators , where , , and preserve formal integral -expansions, and such that the following identities hold:
[TABLE]
[TABLE]
Here denotes (any) set of representatives in for the left coset decomposition of
[TABLE]
Moreover, unless is a -integral binary quadratic form.
Note that the coset decomposition of for a congruence subgroup prime to is essentially the same as the coset decomposition of . These formulae are well known. See, for example, Prop 10.2 of [CvdG15]. To compare our formula with ibid, note that we have normalized the matrices in to be integral of determinant , and absorbed the action of the determinant into the coefficient (since we are concerned here with issues of -integrality). We have a similar description of which can be obtained by a laborious computation (following the arguments of §3.2 and §3.3 of [And87]:
Lemma 8.4**.**
In weight there is an identity of formal operators where , , and preserve formal integral -expansions, and the following identities hold:
[TABLE]
where is as in the description of in Lemma 8.3. If , then
[TABLE]
If , then
[TABLE]
For those wanting a more explicit description, note that in weight we have the possibly more familiar identities:
[TABLE]
[TABLE]
Note also that there is a formal identity .
Definition 8.5**.**
Let denote the formal operator on -expansions such that
[TABLE]
Explicitly, if , then times , where is the determinant of the quadratic form associated to , and is the Legendre symbol. If , then . In all cases, we see that .
Lemma 8.6**.**
Over , we have .
Proof.
We have if , but is a sum over terms of the form with . ∎
Definition 8.7**.**
A binary quadratic form is -primitive if it is not of the form for an -integral form .**
8.4. Hecke Operators on forms of in characteristic
Let .
Lemma 8.8**.**
There is an action of and on which commutes with the other Hecke operators and acts on -expansions via the above formula.
Proof.
The argument is very similar to Prop. 4.1 of [Gro90]. It suffices to prove the result with coefficients in . The natural approach to defining these operators is using correspondences, as for modular curves. There are two issues which arise. The first is that the projection maps from the Siegel modular varieties with appropriate parahoric level structures are not finite over . The second is that the definition involving correspondences is some power of times the actual Hecke operator of interest. A general approach to resolving these questions has been recently found by Pilloni [Pil12a], who constructs all the operators used in this paper. More importantly, his method also allows one to give an action of these operators on higher higher coherent cohomology as well. We use a more pedestrian approach. We can resolve the normalization issue by using the -expansion principle. The first issue is more subtle. The geometric maps involved are certainly proper; the failure of finiteness is thus a failure of quasi-finiteness. The source of quasi-finiteness arises from the fact that the kernel of Frobenius of an abelian surface could (for example) equal , which contains “too many” subgroup schemes of type . On the other hand, this issue does not arise over the ordinary locus nor over the larger almost ordinary locus consisting of abelian surfaces (those with rank ) where subgroup schemes such as cannot occur. This shows how to resolve the issue by the following ad hoc method: by Hartogs’ Lemma, it suffices to construct over the global sections of a subvariety whose complement has codimension . In particular, we may replace by the moduli space of almost ordinary abelian surfaces for which the corresponding maps are indeed finite. Implicit in this argument is a verification that the formulas above (in Lemmas 8.3 and 8.4) preserve integrality — for this is verified in Lemma 8.12 below. ∎
Note that this argument is not sufficient to construct these operators on
[TABLE]
however, we have no need to the consider the action of Hecke operators at on these spaces.
We shall also need to use various properties of theta operators. We begin by recalling their basic properties:
Proposition 8.9**.**
Let , let , and let .
- (1)
There is a map
[TABLE]
whose action on -expansions is given by
[TABLE] 2. (2)
There is a map
[TABLE]
whose action on -expansions is given by
[TABLE]
where is the natural -equivariant projection.
Proof.
The operator is defined in [Yam16, Prop 3.9], and the operator is defined in [Yam16, Prop 3.12]. ∎
(Some of these maps were also considered in previous unpublished work of Ghitza [Ghi]). The main results we need concerning these operators are given by the next two theorems.
Theorem 8.10**.**
Let and , and assume — so in particular and are admissible values of . Then the map
[TABLE]
is injective. In particular, if , we must have .
Proof.
We may immediately reduce to the case and . Suppose that lies in the kernel, so . After possibly replacing by , we may assume that is not divisible by the Hasse invariant. Following Theorem 4.7 of [Yam16], it suffices to show that is not zero on the superspecial locus if it is not divisible by the Hasse invariant. Hence has non-trivial specialization to the -rank strata. The supersingular locus on this strata is a Cartier divisor cut out by a section of for , so since (for ), the restriction of is non-zero on the supersingular locus. (That the supersingular locus is a Cartier divisor inside the -rank locus when was proved by Koblitz, see p.193 of [Kob75]. The exact order of vanishing can also be found in [vdG99], Theorem 2.4.) Finally, each irreducible component of the supersingular locus is a copy of with superspecial points on it. Moreover, the line bundle restricts to on each of these s. Hence the restriction to the superspecial points is injective as long has , which holds for . ∎
We also require a related result for non parallel weight.
Theorem 8.11**.**
Let . The map:
[TABLE]
is injective.
Proof.
It suffices to work over Suppose that , and that is non-zero after restriction to the superspecial locus. Then the result follows directly from Theorem 3.20 of [Yam16]. As stated, the result does not apply in weight , although the same argument works in this weight providing that one may assume (in the notation of ibid.) that , which can be achieved under the action of for , since the level of is prime to and so surjects on to . The corresponding representation of is irreducible, and thus for there exists an element which applied to has for any fixed choice of . Hence it remains to show that the restriction of to the superspecial locus is non-zero. Let , and denote the rank one strata (respectively, the supersingular locus, respectively, the superspecial locus) by , , and respectively. We are assuming that the restriction of to is nonzero. Suppose the restriction of to is zero. There is an exact sequence:
[TABLE]
where . If restricts to zero, we obtain a non-zero class in the first group. Yet there is also a sequence:
[TABLE]
The first term vanishes. To see that the final term vanishes, we use the fact that Serre duality shows that the last term is dual to
[TABLE]
which vanishes by Theorem 5.1. We now have to establish non-vanishing from to . The restriction of the Hodge bundle to any on is . Hence we need to show that no class in
[TABLE]
can vanish at points. This is valid as long as
[TABLE]
which holds provided .
∎
8.5. Relationship between Hecke eigenvalues and crystalline Frobenius
Suppose that is a cuspidal eigenform of weight of level prime to , and let be the associated Galois representation. One expects (and knows in regular weights, see Theorem 6.13) that is crystalline at and that crystalline Frobenius has eigenvalues which are the roots of the following polynomial:
[TABLE]
where is the eigenvalue of and is the eigenvalue of . We may write the eigenvalues of this polynomial as follows:
[TABLE]
where and have non-negative -adic valuation. That means that the coefficient of crystalline Frobenius should have characteristic polynomial:
[TABLE]
On the other hand, we know that the coefficient of should be:
[TABLE]
where the operator is defined by this formula. In particular, the eigenvalues of this operator () should all be integral.
Lemma 8.12**.**
Let with . If , there is a congruence of operators on formal -expansions:
[TABLE]
In particular, if is an ordinary form of regular weight with crystalline eigenvalues as above, the eigenvalue of is . If , there is a congruence
[TABLE]
Proof.
The operator acts by a scalar which is equal to . Note that
[TABLE]
Thus we can ignore the term above. We have
[TABLE]
and we are done. ∎
8.6. The Main Theorem on -expansions
Our main theorem is as follows (we use the notation of §6.4).
Theorem 8.13**.**
Let for some . Assume that is as in Assumption 6.16. Assume, moreover, that
[TABLE]
Let denote the corresponding ideal of the Hecke algebra away from . Let denotes a non-trivial power of the Hasse invariant of weight . Then the composite map:
[TABLE]
is injective, where denotes the projection onto the summand where and (equivalently ) are nilpotent.
Note that, by symmetry, the same result holds with replaced by . Before beginning the proof of this theorem, we first prove a much easier analogue for :
Lemma 8.14**.**
Let denote the modular curve, and let be a modular representation of level and weight one over such that has eigenvalues and . Let denote the corresponding ideal of the Hecke algebra away from . Assume that
[TABLE]
If denotes a suitable power of the Hasse invariant of weight , then the composite map:
[TABLE]
is injective, where denotes the projection onto the quotient of homology where is nilpotent.
In both results, all of the corresponding maps are equivariant with respect to Hecke operators away from . It suffices to show that the image of the -socle maps injectively, and hence we may work with coefficients over a finite field of characteristic .
Proof of Lemma 8.14.
Let and . The map is certainly injective, as can be seen by the -expansion principle (the map is the identity on -expansions). Let denote the action of on . Then satisfies the polynomial on the image of , and so lies inside the ordinary subspace of , and so inside , where is the factor of on which is nilpotent. We have operators and defined by the formulae
[TABLE]
and in weight , whereas in higher weight. The projection operator:
[TABLE]
is given by for some integer . Suppose that satisfies . We have the identity , and we may reduce to the case that . We are assuming that . Let us write
[TABLE]
Note that is invertible on . Since also lies in , we deduce that lies in . Yet , and so , and moreover . It follows that
[TABLE]
If , then the latter expression is non-zero, since applying gives and . On the other hand, is deeper in the filtration of given by
[TABLE]
and hence, replacing by sufficiently many times, we may assume that , that , and that . We are thus left with a form such that:
[TABLE]
We may now achieve a contradiction based purely on a computation with formal -expansions. For example, the identity is impossible as soon as either or is a cusp form, simply by considering the exponent of the smallest coefficient. Alternatively, a non-formal argument using properties of modular forms would be to note that , and then use the fact that has no kernel in low weight (by [Kat77]). ∎
A different proof of this theorem is given in [CG18]; the point is that the proof given here avoids any geometry. The proof below is somewhat in this spirit — using some elementary reductions, we arrive, given an element of , and a form which is simultaneously acted upon by a collection of formal operators in a very constrained way. The identities we get are not quite enough to deduce that as formal -expansions, however, they are enough to produce forms of low weight inside the kernel of various theta operators, which will be enough to produce a contraction by Theorems 8.11 and 8.10. No doubt (see §1.3) there will be better geometric replacements for this argument, so we apologize in advance for the somewhat messy approach that we present here.
As in the proof above, let use write:
[TABLE]
The map is certainly injective, as can be seen by the -expansion principle (the map is the identity on -expansions). By abuse of notation, we view under this map. Since , the operator acts invertibly on . Depending on the weight , the operator acts on either as or as .
Lemma 8.15**.**
Assume that and are as in Theorem 8.13. Suppose that with . Then , and is a subspace of the submodule of on which is invertible. If , then acts on , the map is injective, and is a subspace of the submodule of on which is invertible.
Proof.
In the first case, by assumption we know that is nilpotent, and so induces an isomorphism of . On the other hand, the operator acts via the formal operator . In weight , the corresponding operator also acts via , and so we deduce that acts on and acts nilpotently. Yet only acts invertibly on the ordinary part of , as can be seen by lifting to characteristic zero. Now let us consider the case of weight . We have
[TABLE]
Now acts in weight by , so certainly . Since acts by on , there is a commutative diagram as follows:
[TABLE]
where (by Lemma 8.6) we use the fact that . Since the left hand side is an isomorphism, it follows that , and hence that acts invertibly on , and as in the previous argument it follows that and hence is invertible on this space.
Hence it suffices to show that for any . Suppose that . Then . Since , we have . Yet then (again by Lemma 8.6) we have , and then , and so is an eigenvector of with eigenvalue satisfying . Yet the only generalized eigenvalue of is , and by assumption . ∎
(Note that this is the point in this paper which uses the assumption rather than the weaker claim which is sufficient for arguments on the Galois side.)
Lemma 8.16**.**
The operator acts nilpotently on .
Proof.
This follows by lifting to characteristic zero and noting that the only possible unit crystalline eigenvalues of Frobenius of a lift of are or modulo . ∎
Lemma 8.17**.**
Suppose that the composite is not injective.
- (1)
If with , there exists a nonzero form such that
[TABLE] 2. (2)
If , there exists a nonzero form with and such that:
[TABLE]
Proof.
First note that , and that , so . Assume that with . Note that commutes with . Hence, after replacing by for sufficiently large , we may assume that . The assumption implies that . Clearly also, and so . Yet , so we have
[TABLE]
(There can be no component in because is invertible on that space.) Write , so , or
[TABLE]
We infer that
[TABLE]
We claim that if , then the last expression is non-zero. This is because acts invertibly on , and applying we get
[TABLE]
and has a smaller nilpotence level than , and . In particular, replacing by , we may find more elements in which also lie in the kernel of , and reduce to the case where and . However, in this case, we also see that , and the required equalities follow.
Now suppose that . Let us write as , and so for some . Since formally commutes with , we also get
[TABLE]
so preserves the property of lying in the kernel of . But
[TABLE]
because . Hence, if lies in the kernel of , then so does
[TABLE]
Hence we may repeatedly replace by , and thus replace by a form such that and . Now, as above, we may write
[TABLE]
We are assuming that , and so
[TABLE]
Thus we deduce that and . We once more would like to use that implies that . However, we no longer know (or expect) that it is ordinary. However, since and , we certainly deduce that
[TABLE]
for some in the kernel of . Are arguments are similar to those used above. We write , so , or
[TABLE]
This implies that
[TABLE]
The first term lies in a space where is nilpotent, but it has a smaller nilpotence level than by construction. Moreover, if it is equal to zero, then
[TABLE]
where has yet a higher level of nilpotence. In particular, this can equal zero only if either or . Since we are explicitly forbidding the former, we may assume, by induction, that is a -eigenvector, and so
[TABLE]
This implies that , and thus (from the injectivity of in Lemma 8.15) that , or that is a -eigenform. The required identities follow immediately upon writing where is a -eigenform, , and . ∎
At this point, to prove Theorem 8.13, it suffices to show that there are no Siegel modular forms which satisfy the above identities. For example, in weights with , we would like to show that there is no form which is an eigenform for both and . We now examine what constraints these identities place on the Fourier coefficients of .
Remark 8.18** (Tripling).**
*A theme of [CG18], following previous work of Wiese [Wie14], was to prove that certain Galois representations were ordinary in two different ways by doubling, that is, mapping the form of low weight to forms of heigh weight in two different ways. This is also our argument in weights for . However, in weight , we see some new phenomena. When we pass to weight , we see not only the the space of low weight forms has been doubled, but rather tripled, with the image generating (under the map ) is mapped to the kernel of . What this must mean is that, in weight , any ordinary Galois representation coming from weight should have a non-ordinary lift in weight . This phenomena doesn’t happen for , since forms of weight which are ordinary modulo are ordinary in characteristic zero by (boundary cases of) Fontaine–Laffaille theory. For , however, the Hodge–Tate weights in weight are , which are well beyond the Fontaine–Laffaille range. One can also ask what is the exact relationship between tripling argument here in weight and the doubling version of [BCGP18] at Klingen level. For our purposes, this would require proving that there exists a (Hecke equivariant away from ) injection from from our space of forms at spherical level to a space of ordinary forms (with respect to the operator denoted in [BCGP18]) at Klingen level also in weight . While this should certainly be true, we have not attempted to prove it.
8.7. Binary quadratic forms
Definition 8.19**.**
We define a set with multiplicities of equivalence classes of -integral binary quadratic forms as follows. For each (with as defined in Lemma 8.3), we add to if and only if there exists a such that . In particular, contributes a class if and only if is -integral.**
An easy lemma shows that only depends on . A binary quadratic form defines a section of on , the latter of which is in natural bijection to (recall that is the coset space of in ). We see that is -integral if any only if the corresponding quadratic form has a zero at the corresponding point in . In particular, is empty if does not represent zero. Moreover, the cardinality of is given by the number of zeros of , and is thus equal to [math], , or if is -primitive. (If is not -primitive, then and has cardinality ).
The definition of is motivated by the following observation: There is an identity
[TABLE]
where is some (any) element in such that for .
Lemma 8.20**.**
If , then .
Proof.
Replacing by for some , we may assume that where
[TABLE]
Yet then , and . ∎
Let denote the discriminant of .
Lemma 8.21**.**
Suppose that is -primitive. Let . Then either:
- (1)
, and is empty. 2. (2)
, and has exactly one element. 3. (3)
, and has exactly two elements.
Proof.
This follows from the fact that a -primitive form has exactly [math], , or solutions in , depending on whether is , [math], or respectively. Note that (in the final case) may consist of the same class with multiplicity two. This happens, for example, if and the class number of is one. ∎
In light of Lemma 8.17, to prove Theorem 8.13, it suffices to prove the following.
Theorem 8.22**.**
Suppose that is a Siegel modular -expansion of weight in characteristic , where .
- (1)
Let with , and suppose that and for some with , then . 2. (2)
Let , and suppose that , where , , and for some with . Then .
Proof.
We first prove that that there exists a with . In particular, in weight , we may also assume that , and thus have the equalities:
[TABLE]
In fact, we may assume these equalities hold in both cases, since we are assuming such an equality holds in the case of non-parallel weight. If , then, since , we have . Hence, if , there exists a -primitive form with . Without loss of generality, assume that is a -primitive form of minimal discriminant with . By Lemma 8.21, consists of a single class . It follows that
[TABLE]
If is not -primitive, then for some , and then , contradicting the minimality of (note that and have the same discriminant). Hence is also -primitive. Yet then consists of a single element, which must be by Lemma 8.20. Yet then it follows that
[TABLE]
Here we use that , and thus is the identity in weight and zero in higher weight. If we are done, and if , we are done since .
Remark 8.23**.**
As an alternative to this argument, one could use an analogue of Theorem 8.10 to show that the kernel of is trivial in low weight (but this would require formulating and then proving such a theorem for non-parallel weight).
We may therefore assume that for some of discriminant prime to .
8.8. The case .
Let us now assume that . The coefficient is equal to , where is the discriminant of . Hence, since , we deduce that, if , that
[TABLE]
Assuming that , we deduce that . It follows that the only with have satisfying . In particular, the form
[TABLE]
lies in the kernel of . Yet this implies that trivial by Theorem 8.10. But this implies that , and this contradicts the injectivity of in Lemma 8.15.
8.9. The case with
We may assume that , where is -primitive and is non-zero. If , then , contradicting the non-vanishing of and the identity . Hence we may assume that . The action of on binary quadratic forms of discriminant has a finite orbit which may be identified with a ray class group. The assumption on implies that has exactly two zeros in . For either of the zeros (say ), we may consider the corresponding quadratic form
[TABLE]
where is a representative of an element in corresponding to . The class of in the class group does not depend on the choice of representative of . The quadratic form also has two roots. We claim that, for one of those roots, there is a choice of representative for the element in such that
[TABLE]
Indeed, if , then the corresponding identity is trivially satisfied. We may view the process of applying dynamically as follows: The coefficient corresponding to a quadratic form of discriminant with of is given by a sum for a pair of quadratic forms and also of the same discriminant. The ray class group corresponding to is partitioned by this process into a finite number of cyclic orbits, on which this operation takes a binary quadratic form to its two nearest neighbours (if the orbit has fewer than two elements, this pair of neighbours may have multiplicity). Let us now consider the coefficient . This consists of two pairs of two terms coming from the neighboring quadratic forms and respectively. From the above, for each neighbour , there will be a term of the form
[TABLE]
where the identity requires the assumption that . Hence will also be a sum of two terms coming from the quadratic forms of distance away from inside its cyclic orbit. Let us consider one orbit of size . Then, we also see, modifying by an element of if necessary, that
[TABLE]
where has . Cycling the other way, we deduce the following:
Lemma 8.24**.**
Suppose that is a formal Siegel modular form of weight which is an eigenform of with eigenvalue . Suppose that has discriminant with . Then there exists an integer such that
[TABLE]
where
[TABLE]
We now make a small recap: At the beginning of the of the proof of Theorem 8.22, we proved that we could assume that had a non-zero coefficient where has non-zero discriminant modulo . If , then , which (with ) would imply that . Hence we may assume there is a non-zero coefficient with (which we exploit below) and use the following proposition to reach the final contradiction.
Proposition 8.25**.**
Suppose that is a formal Siegel modular form of weight modulo which is an eigenform of with eigenvalue such that , and suppose that . Suppose that has a non-zero coefficient where . Then .
Proof.
The map is induced from the contraction map
[TABLE]
(this is well defined integrally as long as ). In particular, we have the identity
[TABLE]
where denotes the contraction map. We claim that for any , where . Once we have this, we deduce that , and since , we have and .
While there is probably an easy coordinate free way to prove the required claim, it is also simple enough to do the computation explicitly by writing everything out in terms of bases. Let us write down a standard basis for and a standard basis for . To be explicit, we choose bases such that a form
[TABLE]
gives rise to the element , and gives rise to . With respect to this choice, the contraction map on (up to scalar) corresponds to sending and to and to , and sending all other monomials to zero. As a consistency check, note that
[TABLE]
Similarly, the contraction mapping on for satisfies
[TABLE]
The formula continues to hold if we replace by and by some invertible . In particular, we may replace by any integral conjugate. We consider two cases.
- (1)
has a non-zero eigenvalue mod . In this case (by Hensel’s Lemma), the matrix has an eigenvalue over , and a second eigenvalue which has valuation . In particular, after a change of basis, we may write
[TABLE]
The conditions and imply that (multiply out and consider the bottom right entry), and thus that . But now the image of on is generated by , and so the image of is given by . But this forces the contraction after tensoring with to be zero over , because the only monomial which contracts with non-trivially with is . 2. (2)
is nilpotent modulo . If is trivial modulo there is nothing to prove. On the other hand, if
[TABLE]
then once again the image of is generated by , and the conditions and imply once more that (multiply out as above but now consider the top left entry), and the proof proceeds as in the previous case.
This completes the proof of the proposition. ∎
Combining Prop. 8.25 with Lemma 8.17 and Theorem 8.11, we obtain a contradiction, and this completes the proof of Theorem 8.13.
∎
9. Modularity Lifting
The following theorem is the main result of this paper.
Theorem 9.1**.**
Let be a continuous, odd, absolutely irreducible Galois representation. Suppose that where . Suppose that the following hold:
- (1)
There exist units and in such that
[TABLE]
and moreover . 2. (2)
Let denote the set of primes of away from at which is ramified. Then for each , the restriction falls into one of the cases of Assumption 4.3. 3. (3)
Big Image* The restriction has big image in the sense of Assumption 4.1.* 4. (4)
The representation is Katz modular of weight in the sense of Definition 6.15. 5. (5)
Neatness* satisfies Assumption 4.2.*
We now introduce some notation: let be the compact open subgroup defined as in the beginning of Section 6.3. Let , and for any set of primes disjoint from , let . Let the Hecke algebras and be as in Definition 5.13. The assumption that is Katz modular implies that there is a maximal ideal of associated to . The pullback of to is also denoted . We further assume:
- (6)
If satisfies Assumption 6.12 (2), then
[TABLE]
Let be the universal deformation ring classifying minimal deformations of in the sense of Definition 4.6 (with taken to be empty). Then the map
[TABLE]
which classifies the minimal deformation of Theorem 6.17 (with taken to be empty), is an isomorphism. Furthermore, the space
[TABLE]
is a free module.
Note that, for , the hypothesis 6 holds by Theorem 5.1.
Proof.
To prove the theorem, we apply Proposition 3.3, as follows:
- (1)
Take and . 2. (2)
Let and the sets be as in Proposition 4.10. 3. (3)
The ring is the power series ring . 4. (4)
For each , we define a surjection as follows: Let denote the universal deformation ring classifying deformations of which are minimal outside , in the sense of Definition 4.6. Choose any surjection (possible by Proposition 4.10) and let be the composite of this surjection with the natural map .
We define the module as follows: let be the unique quotient of which is isomorphic to , and let be as in Section 7.2. Let be the ideal of Theorem 7.2 when is taken to be . We then take
[TABLE]
and we regard it as an -module via the surjection chosen above, and the classifying map associated to the deformation of Theorem 6.17. The -module structure on is given by choosing an identification .
We need to check that, given these definitions, the conditions of Proposition 3.3 hold.
- (a)
The image of in is contained in the image of because under the Galois representation of Theorem 6.17, the image of an element , for a prime in , is conjugate to a matrix of the form where . This follows from [Sor10, Corollary 3]. 2. (b)
We have
[TABLE]
Combining this with the isomorphism of Theorem 7.2, we obtain an isomorphism:
[TABLE] 3. (c)
Finally, is finite and balanced over by Theorem 7.11.
We can thus apply Proposition 3.3, and we deduce that is a finite free -module. Since the action of on factors through , the conclusions of Theorem 9.1 follow immediately. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[And 87] Anatolij N. Andrianov, Quadratic forms and Hecke operators , Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 286, Springer-Verlag, Berlin, 1987. MR 884891
- 2[Art 04] James Arthur, Automorphic representations of G Sp ( 4 ) G Sp 4 {\rm G Sp(4)} , Contributions to automorphic forms, geometry, and number theory, Johns Hopkins Univ. Press, Baltimore, MD, 2004, pp. 65–81. MR 2058604
- 3[Art 13] by same author, The endoscopic classification of representations: Orthogonal and symplectic groups , American Mathematical Society Colloquium Publications, vol. 61, American Mathematical Society, Providence, RI, 2013, Orthogonal and symplectic groups. MR 3135650
- 4[BC 09] Joël Bellaïche and Gaëtan Chenevier, Families of Galois representations and Selmer groups , Astérisque (2009), no. 324, xii+314. MR 2656025
- 5[BC 11] by same author, The sign of Galois representations attached to automorphic forms for unitary groups , Compos. Math. 147 (2011), no. 5, 1337–1352. MR 2834723
- 6[BCGP 18] George Boxer, Frank Calegari, Toby Gee, and Vincent Pilloni, Abelian surfaces over totally real fields are potentially modular , preprint, 2018.
- 7[B Fvd G 08] Jonas Bergström, Carel Faber, and Gerard van der Geer, Siegel modular forms of genus 2 and level 2: cohomological computations and conjectures , Int. Math. Res. Not. IMRN (2008), Art. ID rnn 100, 20. MR 2439544 (2009 f:11054)
- 8[BHR 94] Don Blasius, Michael Harris, and Dinakar Ramakrishnan, Coherent cohomology, limits of discrete series, and Galois conjugation , Duke Math. J. 73 (1994), no. 3, 647–685. MR 1262930 (95b:11054)
