Minimal modularity lifting for non-regular symplectic representations

Frank Calegari; David Geraghty

arXiv:1907.08691·math.NT·December 16, 2020


TL;DR

This paper proves a minimal modularity lifting theorem for Galois representations linked to genus two Siegel modular forms that are limits of discrete series, advancing understanding in automorphic forms and Galois theory.

Contribution

It proves a minimal modularity lifting theorem for Galois representations associated with genus two Siegel modular forms whose component at infinity is a holomorphic limit of discrete series.

Findings

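(1) Established a minimal modularity lifting theorem for ordinary symplectic Galois representations with Hodge–Tate weights $[0,0,j-1,j-1]$, where $p-1>j\geq 4$.

(2) Connected these Galois representations to Siegel modular forms of weight $(j,2)$ that are holomorphic limits of discrete series.

(3) Showed that the minimal deformation ring is isomorphic to the localized Hecke algebra, and that the corresponding ordinary cohomology module is free over it.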

Abstract

In this paper, we prove a minimal modularity lifting theorem for Galois representations (conjecturally) associated to Siegel modular forms of genus two which are holomorphic limits of discrete series at infinity.

Equations

$$(\alpha^{2}-1)(\beta^{2}-1)(\alpha-\beta)(\alpha^{2}\beta^{2}-1)\not\equiv 0\mod p.$$

$$L(r,s)=L(F,s),$$

$$(\alpha^{2}-1)(\beta^{2}-1)(\alpha-\beta)(\alpha^{2}\beta^{2}-1)\neq 0.$$

$$eH^{0}(Y_{1}(N),\omega(j,2)\otimes K/\mathcal{O})\simeq\varinjlim eH^{0}(Y_{1}(N),\omega(j,2)\otimes\mathcal{O}/\varpi^{n})$$

$$\dim\pi_{p}^{\mathrm{Iw}}=2=2\cdot\dim\pi_{p}^{\mathrm{Sph}}$$

$$\dim\Pi_{p}^{\mathrm{Iw}}=8=2\cdot 4=2\dim\Pi_{p}^{\mathrm{Kli}}=8\dim\Pi_{p}^{\mathrm{Sph}}.$$

$$\epsilon:G_{\mathbf{Q}}\to\mathbf{Z}_{p}^{\times}$$

$$\lambda(\gamma):G_{L}\longrightarrow R^{\times}$$

$$J:=\begin{pmatrix}0&0&0&-1\\0&0&-1&0\\0&1&0&0\\1&0&0&0\end{pmatrix}.$$

$$\operatorname{diag}(t_{1},t_{2},\nu t_{2}^{-1},\nu t_{1}^{-1})\mapsto t_{1}^{a}t_{2}^{b}\nu^{c}.$$

$$t\mapsto\operatorname{diag}(t^{\alpha},t^{\beta},t^{\gamma-\beta},t^{\gamma-\alpha}).$$

$$X^{*}(T)_{G}^{+}=\{(a,b;c)\in X^{*}(T):a\geq b\geq 0\}.$$

$$X^{*}(T)_{M}^{+}=\{(a,b;c)\in X^{*}(T):a\geq b\}.$$

$$w_{1}(a,b;c)$$

$$w_{2}(a,b;c)$$

$$w_{3}(a,b;c)$$

$$h:\mathrm{Res}_{\mathbf{C}/\mathbf{R}}(\mathbf{G}_{m})(\mathbf{R})=\mathbf{C}^{\times}\to G(\mathbf{R})=\mathrm{GSp}_{4}(\mathbf{R})$$

$$\begin{pmatrix}xI_{2}&-yS\\yS&xI_{2}\end{pmatrix}$$

$$S:=\begin{pmatrix}0&1\\1&0\end{pmatrix}.$$

$$K_{\infty,1}=\left\{\begin{pmatrix}SAS&-BS\\SB&A\end{pmatrix}\in G(\mathbf{R}):A^{t}A+B^{t}B=I_{2},\ A^{t}B=B^{t}A\right\}.$$

$$K_{\infty,1}$$

$$\begin{pmatrix}SAS&-BS\\SB&A\end{pmatrix}$$

$$\mathfrak{h}=\left\{h(t_{1},t_{2};z):=\begin{pmatrix}z&0&0&-t_{1}\\0&z&-t_{2}&0\\0&t_{2}&z&0\\t_{1}&0&0&z\end{pmatrix}:t_{1},t_{2},z\in\mathbf{R}\right\}.$$

$$\exp(z)\begin{pmatrix}\cos t_{1}&0&0&-\sin t_{1}\\0&\cos t_{2}&-\sin t_{2}&0\\0&\sin t_{2}&\cos t_{2}&0\\\sin t_{1}&0&0&\cos t_{1}\end{pmatrix}.$$

$$\{(a,b;c)\in\mathbf{Z}^{3}:a+b\equiv c\mod 2\}\stackrel{\sim}{\rightarrow}X^{*}(H_{\mathbf{C}})$$

$$h(t_{1},t_{2};z)\mapsto at_{1}i+bt_{2}i+cz$$

$$\mathfrak{g}_{\mathbf{C}}=\mathfrak{g}^{0,0}\oplus\mathfrak{g}^{-1,1}\oplus\mathfrak{g}^{1,-1}$$

$$Q^{-}=K^{h}_{\mathbf{C}}P^{-}\quad\text{and}\quad\operatorname{Lie}Q^{-}=\mathfrak{k}^{h}_{\mathbf{C}}\oplus\mathfrak{p}^{-}.$$

$$f_{1}=\begin{pmatrix}1\\0\\0\\-i\end{pmatrix},\quad f_{2}=\begin{pmatrix}0\\1\\-i\\0\end{pmatrix},\quad f_{3}=\begin{pmatrix}0\\-i\\1\\0\end{pmatrix},\quad f_{4}=\begin{pmatrix}-i\\0\\0\\1\end{pmatrix}\in\mathbf{C}^{4}.$$

$$k=\begin{pmatrix}SAS&-BS\\SB&A\end{pmatrix}\in K_{\infty,1}$$

$$C^{-1}kC=\begin{pmatrix}SAS-iSBS&0\\0&A+iB\end{pmatrix}\quad\text{where}\quad C:=\begin{pmatrix}I_{2}&-iS\\-iS&I_{2}\end{pmatrix}.$$

$$\Phi_{c}^{+}$$


Full text

Minimal Modularity Lifting

For non-regular symplectic representations

Frank Calegari and David Geraghty

2010 Mathematics Subject Classification:

11F33, 11F80.

The first author was supported in part by NSF Career Grant DMS-0846285 and NSF Grant DMS-1404620 and NSF Grant DMS-1701703. The second author was supported in part by NSF Grants DMS-1200304 and DMS-1128155.

1 Introduction
1.1 Comparison with previous methods
1.2 Abelian Surfaces
1.3 Recent Developments
1.4 Results of Arthur
1.5 Acknowledgements
2 Notation
2.0.1 The group $\mathrm{GSp}_{4}$
2.0.2 The group $\mathrm{GSp}_{4}(\mathbf{R})$
3 Some Commutative Algebra
3.1 Balanced Modules
3.2 Patching
4 Deformations of Galois representations
5 Siegel threefolds
5.1 Level Structure
5.2 Cohomology of Siegel $3$-folds
5.3 Vanishing results
5.4 Torsion Classes
5.5 Hecke operators
6 Galois representations associated to modular forms
6.1 The Hasse invariant
6.2 Preliminaries on Galois representations
6.3 Galois representations in cohomological weights
6.4 Galois representations in low weights
7 Properties of cohomology groups
7.1 Taylor–Wiles primes
7.2 The balanced property
8 $q$-expansions of Siegel modular forms
8.1 $q$-expansions of Siegel modular forms
8.2 Explicit Formulae
8.3 Hecke Operators at $p$
8.4 Hecke Operators on forms in characteristic $p$
8.5 Relationship between Hecke eigenvalues and crystalline Frobenius
8.6 The Main Theorem on $q$-expansions
8.7 Binary quadratic forms
8.8 The case $\sigma=(2,2)$
8.9 The case $\sigma=(j,2)$ with $j\geq 4$
9 Modularity Lifting

1. Introduction

In this paper, we prove a minimal modularity lifting theorem for Galois representations (conjecturally) associated to Siegel modular forms $\pi$ for the group $\mathrm{GSp}(4)/\mathbf{Q}$ such that $\pi_{\infty}$ is a holomorphic limit of discrete series. An example of what we can prove with these methods is the following:

Theorem 1.1.

Let $r:G_{\mathbf{Q}}\rightarrow\mathrm{GSp}_{4}(\overline{\mathbf{Q}}_{p})$ be a continuous irreducible representation satisfying the following conditions:

(1)

$r|G_{\mathbf{Q}_{p}}$ is ordinary with Hodge–Tate weights $[0,0,j-1,j-1]$ for some integer $j$ satisfying $p-1>j\geq 4$.

(2)

If $\alpha$ and $\beta$ are the unit root eigenvalues of Frobenius on $D_{\mathrm{cris}}(V)$, then

$$(\alpha^{2}-1)(\beta^{2}-1)(\alpha-\beta)(\alpha^{2}\beta^{2}-1)\not\equiv 0\mod p.$$

(3)

The image of $\overline{r}|G_{\mathbf{Q}(\zeta_{p})}$ contains $\mathrm{Sp}_{4}(\mathbf{F}_{p})$.

(4)

For a prime $x\neq p$, the image of inertia at $x$ is unipotent, and the image of any generator of tame inertia has the same number of Jordan blocks mod $p$ as it does in characteristic zero.

(5)

$\overline{r}$ is modular of level $N(\overline{r})$ and weight $(j,2)$.

Then $r$ is modular, that is, there exists a cuspidal Siegel modular Hecke eigenform $F$ of weight $(j,2)$ such that

$$L(r,s)=L(F,s),$$

where $L(F,s)$ is the spinor $L$ -function of $F$ .
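
Condition (2) of Theorem 1.1 unpacks into the four requirements $\alpha\not\equiv\pm 1$, $\beta\not\equiv\pm 1$, $\alpha\not\equiv\beta$ and $\alpha\beta\not\equiv\pm 1$ modulo $p$. The following minimal sketch tests this for integer representatives of the reductions of $\alpha$ and $\beta$; it assumes, purely for illustration, that both reductions lie in $\mathbf{F}_{p}$, and the function name and the sample values are hypothetical rather than taken from the paper.

```python
def satisfies_condition_2(alpha: int, beta: int, p: int) -> bool:
    """Return True if (a^2-1)(b^2-1)(a-b)(a^2 b^2-1) is nonzero mod p.

    alpha and beta are integer representatives of residues mod p
    (an illustrative simplification: in the theorem they are p-adic units).
    """
    product = (
        (alpha**2 - 1)
        * (beta**2 - 1)
        * (alpha - beta)
        * (alpha**2 * beta**2 - 1)
    )
    return product % p != 0


# Hypothetical sample values, chosen only for illustration.
print(satisfies_condition_2(2, 3, 11))  # alpha*beta = 6, and 6^2 = 3 mod 11
print(satisfies_condition_2(2, 3, 7))   # alpha*beta = -1 mod 7: last factor vanishes
print(satisfies_condition_2(1, 3, 11))  # alpha = 1: first factor vanishes
```

Only the first pair passes: in the second, $\alpha\beta\equiv-1$ kills the factor $\alpha^{2}\beta^{2}-1$, and in the third $\alpha\equiv 1$ kills $\alpha^{2}-1$.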

We deduce Theorem 1.1 from our main result, which we now state. (We shall refer to § 4 and § 6.4 for precise details concerning ramification behaviour, level subgroups and the exact definition of minimal deformations.) Let $\epsilon$ denote the $p$ -adic cyclotomic character. Let $\mathcal{O}$ be the ring of integers of a finite extension $K$ of $\mathbf{Q}_{p}$ . Let $\displaystyle{\overline{r}:G_{\mathbf{Q}}\rightarrow\mathrm{GSp}_{4}(k)}$ be a continuous irreducible representation whose similitude character $\nu(\overline{r})$ on inertia at $p$ is the mod- $p$ reduction of $\epsilon^{1-j}$ . Suppose that $\overline{r}|G_{\mathbf{Q}_{p}}$ contains an unramified subspace of dimension two on which $\mathrm{Frob}_{p}$ acts by the scalars $\alpha$ and $\beta$ respectively, where

$$(\alpha^{2}-1)(\beta^{2}-1)(\alpha-\beta)(\alpha^{2}\beta^{2}-1)\neq 0.$$

Suppose further that $\overline{r}$ has big image (explicitly, satisfies Assumption 4.1) and that $\overline{r}|G_{\mathbf{Q}_{x}}$ for a prime $x\neq p$ is either unramified or is one of the types listed in Assumption 4.3. Let $Y_{1}(N)$ denote the (open) Siegel modular variety of level $N=N(\overline{r})$ over $\mathrm{Spec}(\mathcal{O})$ , where $N$ is determined by $\overline{r}$ as in § 5, and let $\omega(j,2)$ denote the coherent sheaf on $Y_{1}(N)$ whose complex sections define Siegel modular forms of weight $(j,2)$ for some integer $p-1>j\geq 4$ . Let $\mathbf{T}$ denote the subring of endomorphisms of

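$$eH^{0}(Y_{1}(N),\omega(j,2)\otimes K/\mathcal{O})\simeq\varinjlim eH^{0}(Y_{1}(N),\omega(j,2)\otimes\mathcal{O}/\varpi^{n})$$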

(where $e=e_{\alpha,\beta}$ is a certain ordinary projection, see section 6.4) generated by Hecke operators at primes not dividing $Np$ . Let $R^{\mathrm{min}}$ denote the universal minimal deformation ring of $\overline{r}$ (see Definition 4.6 for more details).

Theorem 1.2.

Suppose that there exists a maximal ideal $\mathfrak{m}$ of $\mathbf{T}$ and a corresponding representation $\overline{r}_{\mathfrak{m}}:G_{\mathbf{Q}}\rightarrow\mathrm{GSp}_{4}(k)$ which is isomorphic to $\overline{r}$. Let $R^{\mathrm{min}}$ denote the universal minimal ordinary deformation ring of $\overline{r}$. Suppose that $p-1>j\geq 4$. Then there is an isomorphism $R^{\mathrm{min}}\stackrel{\sim}{\rightarrow}\mathbf{T}_{\mathfrak{m}}$, and moreover, $eH^{0}(Y_{1}(N),\omega(j,2)\otimes K/\mathcal{O})_{\mathfrak{m}}^{\vee}$ is free as a $\mathbf{T}_{\mathfrak{m}}$-module.

The proof follows the strategy of [CG18]. The main ingredients are showing that there exists a map from $R^{\mathrm{min}}$ to $\mathbf{T}_{\mathfrak{m}}$ (see Theorem 6.17) and proving that the cohomology of the subcanonical extension $\omega(j,2)^{\mathrm{sub}}$ of $\omega(j,2)$ to a smooth toroidal compactification $X_{1}(N)$ of $Y_{1}(N)$ vanishes outside degrees $0$ and $1$ (see Theorem 5.1; in the case of classical modular curves this step was trivial).

1.1. Comparison with previous methods

The first modularity theorems which applied to non-regular motives were the results of Buzzard–Taylor and Buzzard [BT99, Buz03] on two dimensional odd Artin representations $V$. The idea of these papers can roughly be described as follows. Using known cases of Serre’s conjecture, one deduces that $\overline{\rho}$ is modular, where $\rho$ is the representation associated to some $p$-adic realization of $V$ for some $p$. Using modularity theorems in regular weight, one then proves that a big Hecke algebra is modular. Specializing to weight one, one deduces the existence of an overconvergent eigenform $f$ corresponding to $V$. Under a non-degeneracy assumption on $V$ ($\overline{\rho}$ is $p$-distinguished), one constructs (using companion forms) a second Hida family which specializes to a second eigenform $g$. Using the geometric properties of $U$, one shows that $f$ and $g$ converge deeply into the supersingular locus. The Fourier coefficients $a_{n}$ of $f$ and $g$ for $(n,p)=1$ are determined by $V$. One then constructs a suitable linear combination $h=(\alpha f-\beta g)/(\alpha-\beta)$ which converges over the entire modular curve, and is thus classical by rigid GAGA. The formal rigid geometry employed by these papers has been generalized by various authors, in particular by Kassaei [Kas06a]. One may well ask whether this approach can be applied to Siegel modular forms of weight $(2,2)$; work of Tilouine and his collaborators has made great progress in this direction. The modularity lifting result for (regular weight) Hida families has been established in many cases by Genestier–Tilouine [GT05] (see also Pilloni [Pil12a]). Significant progress has also been made in the theory of canonical subgroups and the geometry of Siegel modular varieties. One difficulty which must be overcome in such an approach, however, is that the Fourier expansions of Siegel modular forms are not determined by the Hecke eigenvalues. (Various classicality results for overconvergent forms can be established without using $q$-expansions, see for example [Kas06b, PS17], but these results only apply to forms of sufficiently non-critical slope.) The difficulty of dealing with $q$-expansions manifests itself for our approach also: we are forced to prove (“by hand”) various properties of Fourier expansions of Hecke eigenforms in § 8.2.

1.2. Abelian Surfaces

It would be desirable to weaken the assumption $j\geq 4$ in the main theorem to $j\geq 2$, since the case $j=2$ includes the representations associated to the Tate modules of abelian surfaces. The only point in our arguments in which we use the fact that $j\geq 4$ is to deduce that $H^{2}(X_{1}(N),\omega(j,2)^{\mathrm{sub}})=0$ for the subcanonical extension $\omega(j,2)^{\mathrm{sub}}$ of $\omega(j,2)$ to a smooth toroidal compactification $X_{1}(N)$ of $Y_{1}(N)$. If this vanishing holds for $j=2$ then our theorem would also apply to these cases. On the other hand, one does not expect vanishing here, because one expects that singular Siegel modular forms should contribute cohomology in these degrees. However, we need only the weaker result that the image of $H^{2}(X_{1}(N),\omega(j,2)^{\mathrm{can}})$ in $H^{2}(X_{1}(N),\omega(j,2)^{\mathrm{sub}})$ is zero after localization at a sufficiently non-Eisenstein maximal ideal $\mathfrak{m}$. We expect this to always be true for $j=2$, although we were not able to prove it. On the other hand, using the ideas of Khare–Thorne [KT17], one can dispense with proving this under the very strong supplementary hypothesis that there exists a characteristic zero form of level $N=N(\overline{r})$ which gives rise to $\overline{r}$. In particular, by using the arguments of the proof of Theorem 6.29 of ibid., one should be able to prove the analogue of Theorem 1.1 in weight $(2,2)$ assuming the existence of an auxiliary Siegel modular form $G$ of the same level, also of weight $(2,2)$, with $\overline{r}_{G}=\overline{r}$.

1.3. Recent Developments

(Added: January, 2019) Very recently, there have been a number of developments related to the main theme of this paper, in particular in the preprints [Pil17] and [BCGP18], the latter of which establishes the potential modularity of abelian surfaces over totally real fields. The introduction to [BCGP18] explains a number of innovations which made those results possible, so we shall confine ourselves here to only a few salient remarks. The first is that the vanishing conjecture for $H^{2}$ localized at $\mathfrak{m}$ mentioned in §1.2 remains unresolved, and the methods of [BCGP18] blend the techniques of this paper (and [CG18]) with arguments from [Pil17]. A second point is that the paper [Pil17] develops a conceptual method to define (normalized) Hecke operators at $p$, and in particular establishes the action of these operators on higher cohomology (which is essential for the main results of [Pil17] and [BCGP18]). In this paper, it suffices to construct the action of these Hecke operators on $H^{0}$, which is significantly easier. The methods we use in §8.4 to do this are admittedly disagreeable, relying as they do on arguments using $q$-expansions. Thus the reader is encouraged to consult [Pil17, §7] and [BCGP18, §4.5] for a more geometric construction of these operators. An analysis of the normalization factors for Hecke operators required in [Pil17] also sheds some light on another phenomenological feature of this paper which readers may find surprising: on the Galois side, there is essentially no difference (in the ordinary setting) between working in the (irregular) weight $(j,2)$ for $j>2$ and working in the (irregular) weight $(2,2)$. On the other hand, the Hecke operators at $p$ (particularly $T_{p,2}$) behave quite differently in weight $(2,2)$. In our context, this arises most noticeably in §8.5 (via Lemma 8.12), which one should compare to [Pil17, §11.1] (warning: the convention of that paper is that Pilloni’s $T_{p,1}$ is our $T_{p,2}$ and vice versa, and the spherical version of the operator $T$ in [Pil17] is equal in weight $(2,2)$, up to translation by a multiple of $T_{p,0}$, to the operator we call $Q_{2}$). Third, the paper [BCGP18] develops a geometric version of the doubling argument (see §5 of ibid.). This provides a much more robust explanation (in a slightly different setting) for what in this paper occupies most of §8 and consists of a tricky and not entirely intuitive series of manipulations with $q$-expansions. (Note that the geometric doubling argument of [BCGP18] is only written for weight $(2,2)$ but the method applies in principle to the weights $(j,2)$ which we consider in this paper.) Finally, the very observant reader will notice that the doubling argument of [BCGP18] applies in weight $(2,2)$ to the space of ordinary forms at Klingen level, whereas in this paper we essentially prove (in the same weight) a tripling result at spherical level. Neither of these results immediately implies the other. The “extra” copy of the space of forms can be interpreted as giving rise to a space of non-ordinary forms of weight $(p+1,p+1)$. See Remark 8.18 for further discussion on this point, which we also discuss in a different context below.

It is natural to ask whether one should expect any genuine difficulties in modifying the geometric doubling argument of [BCGP18] to the setting of this paper. We now offer some speculative remarks to address this point (using notation from [BCGP18]). Let $\pi_{p}$ be a smooth admissible irreducible unramified representation of $\mathrm{GL}_{2}(\mathbf{Q}_{p})$ (over $\mathbf{C}$ ) which is not trivial. (For example, $\pi_{p}$ could be the local constituent of an automorphic representation $\pi$ associated to a classical modular form.) Let $\mathrm{Sph}=\mathrm{GL}_{2}(\mathbf{Z}_{p})$ and let $\mathrm{Iw}$ denote the Iwahori subgroup of $\mathrm{Sph}$ . The classical theory of oldforms is a reflection of the fact that

$$\dim\pi_{p}^{\mathrm{Iw}}=2=2\cdot\dim\pi_{p}^{\mathrm{Sph}}$$

and the characteristic zero version of doubling is the statement that the span of the spherical vector $v$ under the operator $U_{p}$ is all of $\pi^{\mathrm{Iw}}_{p}$ . The integral version of this statement is false in general. For example, given a classical ordinary modular eigenform $f$ of weight $k\geq 2$ , the span of $f\mod p$ under $U_{p}$ is simply $f$ , because $T_{p}=U_{p}\mod p$ in these weights. However, some version of this result does hold in weight $k=1$ , and it is this property which is leveraged to prove local-global compatibility results in [CG18]. Let us now replace $\mathrm{GL}_{2}(\mathbf{Q}_{p})$ by $\mathrm{GSp}_{4}(\mathbf{Q}_{p})$ , and let $\mathrm{Kli}$ and $\mathrm{Iw}$ denote the Klingen and Iwahori subgroups respectively of $\mathrm{Sph}=\mathrm{GSp}_{4}(\mathbf{Z}_{p})$ (denoted elsewhere in this paper by $\Pi$ and $I$ respectively.) Now (for the $\Pi_{p}$ of interest) we will have

$$\dim\Pi_{p}^{\mathrm{Iw}}=8=2\cdot 4=2\dim\Pi_{p}^{\mathrm{Kli}}=8\dim\Pi_{p}^{\mathrm{Sph}}.$$

The factor $8$ here may be interpreted as the order of the Weyl group of $\mathrm{GSp}(4)$. More prosaically, the oldforms in $\Pi^{\mathrm{Iw}}_{p}$ correspond to a choice of eigenvalues $\alpha$ and $\alpha\beta$ for the Hecke operators $U_{\mathrm{Iw}(p),1}$ and $U_{\mathrm{Iw}(p),2}$ respectively, whereas the oldforms in $\Pi^{\mathrm{Kli}}_{p}$ correspond to a choice of eigenvalues $\alpha+\beta$ and $\alpha\beta$ for the Hecke operators $U_{\mathrm{Kli}(p),1}$ and $U_{\mathrm{Kli}(p),2}=U_{\mathrm{Iw}(p),2}$. When one passes from $\pi^{\mathrm{Sph}}_{p}$ to $\pi^{\mathrm{Iw}}_{p}$ for weight one modular forms or $\Pi^{\mathrm{Kli}}_{p}$ to $\Pi^{\mathrm{Iw}}_{p}$ for weight $(2,2)$ Siegel modular forms, the property of being ordinary turns out to be automatically preserved on the corresponding space of old forms. However, this is not a priori true when passing from $\Pi^{\mathrm{Sph}}_{p}$ to $\Pi^{\mathrm{Iw}}_{p}$, and so one would have to see in any geometric version of this argument a way of dealing with the non-ordinary forms.
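
The contrast between weight $k\geq 2$ and weight $k=1$ in the $\mathrm{GL}_{2}$ discussion above can be made explicit on $q$-expansions. The following is a sketch under classical assumptions that are not spelled out in the text: $f$ is an eigenform of level prime to $p$ with $T_{p}$-eigenvalue $a_{p}$ and nebentypus $\chi$, the operator $V_{p}$ sends $f(q)$ to $f(q^{p})$, and the normalizations are the classical ones, which may differ from those used later in the paper.

```latex
\begin{aligned}
T_{p}f &= U_{p}f+\chi(p)\,p^{k-1}\,V_{p}f, \qquad U_{p}V_{p}f=f,\\[2pt]
% matrix of U_p on the span of f and V_p f (columns = images of f and V_p f)
U_{p}\big|_{\langle f,\,V_{p}f\rangle}
  &= \begin{pmatrix} a_{p} & 1\\ -\chi(p)\,p^{k-1} & 0\end{pmatrix},
\qquad \det(X-U_{p}) = X^{2}-a_{p}X+\chi(p)\,p^{k-1},\\[2pt]
k\geq 2:\quad & U_{p}f\equiv a_{p}f \bmod p
  \quad\text{(the span of $f$ under $U_{p}$ is a line),}\\
k=1:\quad & U_{p}f = a_{p}f-\chi(p)\,V_{p}f
  \quad\text{($f$ and $U_{p}f$ span a plane).}
\end{aligned}
```

In weight one the term $\chi(p)V_{p}f$ survives reduction modulo $p$, which is the mechanism behind the remark that a version of doubling persists integrally for $k=1$.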

1.4. Results of Arthur

In Section 7.2, we make use of the results of [Art04], which sketches how the results of [Art13] on orthogonal and symplectic groups can be extended to the general symplectic group $\mathrm{GSp}_{4}$ . At the time of the initial submission of this paper, these results of Arthur are conditional on the stabilization of the twisted trace formula. (We direct the reader to [GT18] for the most up to date status of these results for $\mathrm{GSp}_{4}$ .)

1.5. Acknowledgements

We would like to thank George Boxer for some very helpful comments related to the proofs of Theorems 8.10 and 8.11. We would also like to thank Olivier Taïbi for answering some technical questions arising in §7.2. We would also like to acknowledge useful conversations with Kevin Buzzard, Ching-Li Chai, Matthew Emerton, Toby Gee, Michael Harris, Kai-Wen Lan, Vincent Pilloni, and Jack Thorne. We would also like to thank many of the participants of the Bellairs workshop in number theory in 2014, where an earlier version of this paper was discussed. Finally, we thank the referees, whose detailed comments very much helped to improve this manuscript.

2. Notation

We fix a prime $p$ and let $\mathcal{O}$ be the ring of integers of a finite extension $K$ of $\mathbf{Q}_{p}$ with residue field $k$ . We let $\mathcal{C}_{\mathcal{O}}$ denote the category of complete local Noetherian $\mathcal{O}$ -algebras $R$ with residue field isomorphic to $k$ (via the structural homomorphism $\mathcal{O}\to R$ ).

We let

$$\epsilon:G_{\mathbf{Q}}\to\mathbf{Z}_{p}^{\times}$$

denote the cyclotomic character. The Hodge–Tate weight of $\epsilon|G_{\mathbf{Q}_{p}}$ is $-1$ .

If $L$ is a finite extension of $\mathbf{Q}_{l}$ for some prime $l$, we let $\mathrm{Art}_{L}:L^{\times}\to W_{L}^{\mathrm{ab}}$ denote the Artin map, normalized so that uniformizers correspond to geometric Frobenius elements. If $\gamma$ is an element of some ring $R$, then we define the character

$$\lambda(\gamma):G_{L}\longrightarrow R^{\times}$$

to be the unramified character which takes the geometric Frobenius element $\mathrm{Frob}_{L}$ to $\gamma$ , when this character is well defined.

2.0.1. The group $\mathrm{GSp}_{4}$

Let $G=\mathrm{GSp}_{4}=\{M\in\mathrm{GL}_{4}:M^{t}JM=\nu\cdot J\text{\ for some\ }\nu\in\mathrm{GL}_{1}\}$ , where

$$J:=\begin{pmatrix}0&0&0&-1\\0&0&-1&0\\0&1&0&0\\1&0&0&0\end{pmatrix}.$$

The group $\mathrm{Sp}_{4}$ is the subgroup consisting of elements with $\nu=1$ . We let $B\subset G$ be the Borel subgroup consisting of upper triangular matrices. The Lie algebras of $G$ and $B$ are denoted $\mathfrak{g}$ and $\mathfrak{b}$ while those of $\mathrm{Sp}_{4}$ and $B\cap\mathrm{Sp}_{4}$ are denoted $\mathfrak{g}^{0}$ and $\mathfrak{b}^{0}$ . Let $P\subset G$ denote the Siegel parabolic, that is, the stabilizer of the plane spanned by the first two standard basis vectors. Let $\Pi\subset G$ denote the Klingen parabolic, which is the stabilizer of the line spanned by the first standard basis vector. We denote the Levi subgroup of $P$ (resp. $\Pi$ ) by $M=M_{P}$ (resp. $M_{\Pi}$ ). We have $M\cong\mathrm{GL}_{2}\times\mathrm{GL}_{1}$ .
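
The defining condition $M^{t}JM=\nu\cdot J$ can be checked mechanically. The sketch below takes $J$ to be the anti-diagonal matrix with rows $(0,0,0,-1)$, $(0,0,-1,0)$, $(0,1,0,0)$, $(1,0,0,0)$ and uses exact rational arithmetic; the helper names and the sample matrices are hypothetical and serve only as an illustration.

```python
from fractions import Fraction

# J as in the definition of GSp_4 (anti-diagonal, with signs -1, -1, 1, 1).
J = [
    [0, 0, 0, -1],
    [0, 0, -1, 0],
    [0, 1, 0, 0],
    [1, 0, 0, 0],
]


def matmul(X, Y):
    """Product of two 4x4 matrices given as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def transpose(X):
    return [list(column) for column in zip(*X)]


def diagonal(entries):
    return [[entries[i] if i == j else 0 for j in range(4)] for i in range(4)]


def similitude(M):
    """Return nu if M^t J M == nu * J for a nonzero scalar nu, else None."""
    form = matmul(matmul(transpose(M), J), M)
    nu = -form[0][3]  # J[0][3] == -1, so this entry of nu * J equals -nu
    scaled = [[nu * J[i][j] for j in range(4)] for i in range(4)]
    return nu if (nu != 0 and form == scaled) else None


# Hypothetical sample values t1 = 2, t2 = 3, nu = 5.
t1, t2, nu = Fraction(2), Fraction(3), Fraction(5)
torus_element = diagonal([t1, t2, nu / t2, nu / t1])

print(similitude(J))                       # J itself has similitude factor 1
print(similitude(torus_element))           # recovers nu
print(similitude(diagonal([1, 2, 3, 4])))  # fails the condition, so None
```

The diagonal matrices that pass are exactly those of the shape $\operatorname{diag}(t_{1},t_{2},\nu t_{2}^{-1},\nu t_{1}^{-1})$, which is the parametrization of the torus $T$ used below.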

Let $T$ denote the diagonal torus in $\mathrm{GSp}_{4}$ and $X^{*}(T)$ its character group. We identify $X^{*}(T)$ with the lattice $\mathbf{Z}^{3}$ by associating to $(a,b;c)$ the character

$$\operatorname{diag}(t_{1},t_{2},\nu t_{2}^{-1},\nu t_{1}^{-1})\mapsto t_{1}^{a}t_{2}^{b}\nu^{c}.$$

We identify the cocharacter group $X_{*}(T)$ with $\mathbf{Z}^{3}$ by associating the triple $(\alpha,\beta;\gamma)$ with the cocharacter:

$$t\mapsto\operatorname{diag}(t^{\alpha},t^{\beta},t^{\gamma-\beta},t^{\gamma-\alpha}).$$

The natural pairing on $X^{*}(T)\times X_{*}(T)$ is then: $\langle(a,b;c),(\alpha,\beta;\gamma)\rangle=a\alpha+b\beta+c\gamma$.
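
As a sanity check on these identifications, composing the character $(a,b;c)$, read as $\operatorname{diag}(t_{1},t_{2},\nu t_{2}^{-1},\nu t_{1}^{-1})\mapsto t_{1}^{a}t_{2}^{b}\nu^{c}$, with the cocharacter $(\alpha,\beta;\gamma)$, read as $t\mapsto\operatorname{diag}(t^{\alpha},t^{\beta},t^{\gamma-\beta},t^{\gamma-\alpha})$, should give $t\mapsto t^{a\alpha+b\beta+c\gamma}$. The following sketch verifies this at $t=2$ for a few hypothetical sample weights; the function names are illustrative.

```python
from fractions import Fraction


def cocharacter(alpha, beta, gamma, t):
    """Diagonal entries of the image of t under the cocharacter (alpha, beta; gamma)."""
    t = Fraction(t)
    return (t**alpha, t**beta, t**(gamma - beta), t**(gamma - alpha))


def character(a, b, c, diag):
    """Evaluate the character (a, b; c) on diag(t1, t2, nu/t2, nu/t1)."""
    t1, t2, d3, d4 = diag
    nu = t1 * d4  # similitude factor; it must also equal t2 * d3
    assert nu == t2 * d3
    return t1**a * t2**b * nu**c


def pairing(weight, coweight):
    (a, b, c), (alpha, beta, gamma) = weight, coweight
    return a * alpha + b * beta + c * gamma


# Hypothetical sample (weight, coweight) pairs.
samples = [
    ((1, -1, 0), (1, -1, 0)),
    ((3, 1, -2), (2, -1, 5)),
    ((0, 2, -1), (0, 1, 0)),
]
for weight, coweight in samples:
    value = character(*weight, cocharacter(*coweight, 2))
    print(weight, coweight, value == Fraction(2) ** pairing(weight, coweight))
```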

The positive roots of $G$ with respect to the Borel $B$ are given by $\alpha_{1}:=(1,-1;0)$ , $\alpha_{2}:=(0,2;-1)$ , $\alpha_{3}=(1,1;-1)$ and $\alpha_{4}=(2,0;-1)$ . Of these, $\alpha_{1}$ and $\alpha_{2}$ are the simple roots. We let $\rho=(2,1;-3/2)$ denote the half-sum of the positive roots. The coroots are: $\alpha_{1}^{\vee}=(1,-1;0)$ , $\alpha_{2}^{\vee}=(0,1;0)$ , $\alpha_{3}^{\vee}=(1,1;0)$ and $\alpha_{4}^{\vee}=(1,0;0)$ . The intersection $B\cap M$ is a Borel subgroup of $M$ . The corresponding positive root is $\alpha_{1}$ .
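
The root data listed above can be cross-checked directly. This short sketch recomputes $\rho$ as the half-sum of the positive roots, confirms $\langle\alpha_{i},\alpha_{i}^{\vee}\rangle=2$ for each $i$, and checks that $\alpha_{3}=\alpha_{1}+\alpha_{2}$ and $\alpha_{4}=2\alpha_{1}+\alpha_{2}$; triples $(a,b;c)$ are stored as Python tuples, and the variable names are illustrative.

```python
from fractions import Fraction

# Positive roots alpha_1..alpha_4 and their coroots, as triples (a, b; c).
POSITIVE_ROOTS = [(1, -1, 0), (0, 2, -1), (1, 1, -1), (2, 0, -1)]
COROOTS = [(1, -1, 0), (0, 1, 0), (1, 1, 0), (1, 0, 0)]


def pair(weight, coweight):
    """The pairing a*alpha + b*beta + c*gamma."""
    return sum(x * y for x, y in zip(weight, coweight))


def add(u, v):
    return tuple(x + y for x, y in zip(u, v))


# Half-sum of the positive roots.
rho = tuple(Fraction(sum(root[k] for root in POSITIVE_ROOTS), 2) for k in range(3))
print(rho)

# Each root pairs to 2 with its own coroot.
print([pair(root, coroot) for root, coroot in zip(POSITIVE_ROOTS, COROOTS)])

# The non-simple positive roots in terms of the simple roots.
a1, a2, a3, a4 = POSITIVE_ROOTS
print(add(a1, a2) == a3, add(add(a1, a1), a2) == a4)

# rho pairs to 1 with each simple coroot.
print(pair(rho, COROOTS[0]), pair(rho, COROOTS[1]))
```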

Definition 2.1.

We define the set $X^{*}(T)^{+}_{G}$ to be the set $\{\lambda\in X^{*}(T):\langle\lambda,\alpha_{i}^{\vee}\rangle\geq 0\text{\ }\forall i\}$ of weights which are dominant with respect to $B$ . Explicitly

$$X^{*}(T)_{G}^{+}=\{(a,b;c)\in X^{*}(T):a\geq b\geq 0\}.$$

Similarly, we define the set of weights $X^{*}(T)^{+}_{M}:=\{\lambda\in X^{*}(T):\langle\lambda,\alpha_{1}^{\vee}\rangle\geq 0\}$ which are dominant with respect to $B\cap M$. Explicitly, this is:

$$X^{*}(T)_{M}^{+}=\{(a,b;c)\in X^{*}(T):a\geq b\}.$$

Note that the natural action of $M$ on the plane spanned by the first two (resp. the last two) standard basis vectors is the irreducible representation of highest weight $(1,0;0)$ (resp. $(0,-1;1)$ ).

We let $W_{G}=N_{G}(T)/T$ denote the Weyl group of $G$ and we define $W_{M}$ and $W_{M_{\Pi}}$ similarly. Let $s_{0},s_{1}$ denote the generators for the Weyl group $W_{G}$ given in [HT13, §2]. We fix a set of Kostant representatives $W^{M}=\{\widetilde{w}_{0},\widetilde{w}_{1},\widetilde{w}_{2},\widetilde{w}_{3}\}$ for $W_{M}\backslash W_{G}$ by setting $\widetilde{w}_{0}=1$ , $\widetilde{w}_{1}=s_{1}$ , $\widetilde{w}_{2}=s_{1}s_{0}$ and $\widetilde{w}_{3}=s_{1}s_{0}s_{1}$ . Note that each $\widetilde{w}_{i}$ has length $i$ . We let $w\in W_{G}$ act on $X^{*}(T)$ by $(w\lambda)(t)=\lambda(w^{-1}tw)$ . Then we have:

[TABLE]

The longest element of $W_{M}$, which we denote by $w_{0}$, acts via $w_{0}(a,b;c)=(b,a;c)$.

Note that the collection of representatives $W^{M}$ is precisely the set of $w\in W_{G}$ such that $w(X^{*}(T)^{+}_{G})\subset X^{*}(T)^{+}_{M}$ . We let $C_{0}\subset X^{*}(T)_{\mathbf{R}}:=X^{*}(T)\otimes_{\mathbf{Z}}\mathbf{R}$ denote the closed dominant Weyl chamber. In other words, $C_{0}=\{(a,b;c)\in\mathbf{R}^{3}:a\geq b\geq 0\}$ . For $i=1,2,3$ , we define the chambers $C_{i}:=\widetilde{w}_{i}(C_{0})$ .
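
The count behind this characterization can be reproduced by brute force. The sketch below generates the Weyl group from the two simple reflections $\lambda\mapsto\lambda-\langle\lambda,\alpha_{i}^{\vee}\rangle\alpha_{i}$ (using the roots and coroots of this section), and then selects the elements that send the generators $(1,0;0)$, $(1,1;0)$, $(0,0;\pm 1)$ of the dominant cone into $\{a\geq b\}$. It does not attempt to match the specific labelling $\widetilde{w}_{i}$ in terms of the generators of [HT13]; the function names are illustrative.

```python
# (simple root, simple coroot) pairs as triples (a, b; c).
SIMPLE = [
    ((1, -1, 0), (1, -1, 0)),  # alpha_1 and its coroot
    ((0, 2, -1), (0, 1, 0)),   # alpha_2 and its coroot
]
BASIS = ((1, 0, 0), (0, 1, 0), (0, 0, 1))


def pair(weight, coweight):
    return sum(x * y for x, y in zip(weight, coweight))


def reflect(weight, root, coroot):
    """Simple reflection: weight - <weight, coroot> * root."""
    n = pair(weight, coroot)
    return tuple(x - n * r for x, r in zip(weight, root))


def weyl_group():
    """All elements, each stored as the triple of images of the basis weights."""
    seen = {BASIS}
    frontier = [BASIS]
    while frontier:
        element = frontier.pop()
        for root, coroot in SIMPLE:
            # Compose with a simple reflection by reflecting the basis images.
            new = tuple(reflect(image, root, coroot) for image in element)
            if new not in seen:
                seen.add(new)
                frontier.append(new)
    return seen


def act(element, weight):
    """Apply a Weyl group element to a weight by linearity."""
    return tuple(sum(weight[i] * element[i][k] for i in range(3)) for k in range(3))


# Generators of the cone of G-dominant weights {a >= b >= 0}.
CONE_GENERATORS = [(1, 0, 0), (1, 1, 0), (0, 0, 1), (0, 0, -1)]


def is_M_dominant(weight):
    return weight[0] >= weight[1]


W = weyl_group()
kostant = [w for w in W if all(is_M_dominant(act(w, g)) for g in CONE_GENERATORS)]
print(len(W), len(kostant))
```

The two counts are $8$ and $4$, matching $|W_{G}|=8$ and the four representatives in $W^{M}$ for $W_{M}\backslash W_{G}$.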

2.0.2. The group $\mathrm{GSp}_{4}(\mathbf{R})$

Let

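$$h:\mathrm{Res}_{\mathbf{C}/\mathbf{R}}(\mathbf{G}_{m})(\mathbf{R})=\mathbf{C}^{\times}\to G(\mathbf{R})=\mathrm{GSp}_{4}(\mathbf{R})$$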

be the homomorphism sending $x+iy$ to the matrix

$$\begin{pmatrix}xI_{2}&-yS\\yS&xI_{2}\end{pmatrix}$$

where

$$S:=\begin{pmatrix}0&1\\1&0\end{pmatrix}.$$
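
The basic properties of $h$ can be confirmed numerically. The sketch below writes $h(x+iy)$ as the $4\times 4$ matrix with blocks $xI_{2}$, $-yS$, $yS$, $xI_{2}$, where $S$ is the $2\times 2$ matrix swapping the two coordinates, and takes $J$ to be the anti-diagonal matrix with rows $(0,0,0,-1)$, $(0,0,-1,0)$, $(0,1,0,0)$, $(1,0,0,0)$. It checks that $h(i)=J$, that $h$ is multiplicative, and that $h(x+iy)$ has similitude factor $x^{2}+y^{2}$; the sample values are hypothetical.

```python
J = [
    [0, 0, 0, -1],
    [0, 0, -1, 0],
    [0, 1, 0, 0],
    [1, 0, 0, 0],
]


def h(x, y):
    """The matrix [[x*I2, -y*S], [y*S, x*I2]] attached to x + iy."""
    return [
        [x, 0, 0, -y],
        [0, x, -y, 0],
        [0, y, x, 0],
        [y, 0, 0, x],
    ]


def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def transpose(X):
    return [list(column) for column in zip(*X)]


def scale(c, X):
    return [[c * entry for entry in row] for row in X]


# h(i) = J.
print(h(0, 1) == J)

# Multiplicativity: (x1 + i y1)(x2 + i y2) = (x1 x2 - y1 y2) + i (x1 y2 + x2 y1).
x1, y1, x2, y2 = 2, 3, 5, -1  # hypothetical sample values
print(matmul(h(x1, y1), h(x2, y2)) == h(x1 * x2 - y1 * y2, x1 * y2 + x2 * y1))

# Similitude factor: h(z)^t J h(z) = |z|^2 * J.
M = h(x1, y1)
print(matmul(matmul(transpose(M), J), M) == scale(x1**2 + y1**2, J))
```

All three checks follow from $h(x+iy)=xI_{4}+yJ$ together with $J^{2}=-I_{4}$ and $J^{t}=-J$.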

Let $K^{h}$ denote the centralizer of $h$ in $G(\mathbf{R})$ (acting by conjugation). Then since $h(i)=J$, we see that $K^{h}=\mathbf{R}^{\times}K_{\infty}$ where $K_{\infty}$ is the maximal compact subgroup of $G(\mathbf{R})$ given by the fixed points of the Cartan involution $g\mapsto(g^{t})^{-1}$. The similitude character restricts to a surjective map $\nu:K_{\infty}\to\{\pm 1\}$ whose kernel $K_{\infty,1}$ is the connected component of the identity. Then we have explicitly,

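$$K_{\infty,1}=\left\{\begin{pmatrix}SAS&-BS\\SB&A\end{pmatrix}\in G(\mathbf{R}):A^{t}A+B^{t}B=I_{2},\ A^{t}B=B^{t}A\right\}.$$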

The map:

[TABLE]

induces an isomorphism between $K_{\infty,1}$ and $U(2)$ . We let $H_{1}\subset K_{\infty,1}$ denote the preimage of the diagonal compact torus in $U(2)$ and let $H:=\mathbf{R}^{\times}_{>0}H_{1}\subset K^{h}$ . Let $\mathfrak{h}=\operatorname{\mathrm{Lie}}H$ , $\mathfrak{k}^{h}=\operatorname{\mathrm{Lie}}K^{h}$ and so on. Then we have

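$$\mathfrak{h}=\left\{h(t_{1},t_{2};z):=\begin{pmatrix}z&0&0&-t_{1}\\0&z&-t_{2}&0\\0&t_{2}&z&0\\t_{1}&0&0&z\end{pmatrix}:t_{1},t_{2},z\in\mathbf{R}\right\}.$$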

We use subscripts to denote complexifications of Lie algebras and Lie groups; thus $H_{\mathbf{C}}$ and $\mathfrak{h}_{\mathbf{C}}$ denote the complexifications of $H$ and $\mathfrak{h}$. Then $\mathfrak{h}_{\mathbf{C}}=\operatorname{\mathrm{Lie}}H_{\mathbf{C}}=\{h(t_{1},t_{2};z):t_{1},t_{2},z\in\mathbf{C}\}$ and the surjective map $\exp:\mathfrak{h}_{\mathbf{C}}\to H_{\mathbf{C}}$ sends $h(t_{1},t_{2};z)$ to

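$$\exp(z)\begin{pmatrix}\cos t_{1}&0&0&-\sin t_{1}\\0&\cos t_{2}&-\sin t_{2}&0\\0&\sin t_{2}&\cos t_{2}&0\\\sin t_{1}&0&0&\cos t_{1}\end{pmatrix}.$$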

Thus its kernel is $\{h(t_{1},t_{2};z):t_{1},t_{2}\in 2\pi\mathbf{Z},z\in 2\pi i\mathbf{Z}\}$ . We define the lattice $X^{*}(H_{\mathbf{C}})\subset\mathfrak{h}_{\mathbf{C}}^{*}$ to be the subspace consisting of differentials of (complex analytic) characters of $H_{\mathbf{C}}$ . Equivalently, $X^{*}(H_{\mathbf{C}})$ is the subset of $X^{*}(\mathbf{C}^{\times}\times H_{1,\mathbf{C}})=\{\lambda\in\mathfrak{h}_{\mathbf{C}}^{*}:\lambda(\ker(\exp:\mathfrak{h}_{\mathbf{C}}\to H_{\mathbf{C}}))\subset 2\pi i\mathbf{Z}\}$ consisting of differentials of characters of $\mathbf{C}^{\times}\times K_{1,\mathbf{C}}$ which factor through the multiplication map $\mathbf{C}^{\times}\times H_{1,\mathbf{C}}\to H_{\mathbf{C}}$ . We fix an isomorphism

$$\{(a,b;c)\in\mathbf{Z}^{3}:a+b\equiv c\mod 2\}\stackrel{\sim}{\rightarrow}X^{*}(H_{\mathbf{C}})$$

by letting $(a,b;c)$ correspond to the linear form

$$h(t_{1},t_{2};z)\mapsto at_{1}i+bt_{2}i+cz$$

on $\mathfrak{h}_{\mathbf{C}}$ . This extends by linearity to an isomorphism $\mathbf{C}^{3}\to\mathfrak{h}_{\mathbf{C}}^{*}$ .
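
The parity condition $a+b\equiv c\mod 2$ can be traced to the kernel of the multiplication map $\mathbf{C}^{\times}\times H_{1,\mathbf{C}}\to H_{\mathbf{C}}$. The following is a sketch, assuming that this kernel consists of the two pairs $(1,I_{4})$ and $(-1,-I_{4})$; the symbols $\chi$ and $R(t_{1},t_{2})$ are introduced here only for illustration.

```latex
% Assumptions (illustrative notation, not the paper's):
%   R(t_1,t_2) := \exp(h(t_1,t_2;0)) is the matrix with entries \cos t_i, \pm\sin t_i;
%   \chi is a character of C^x \times H_{1,C} whose differential is (a,b;c).
\begin{aligned}
\chi\bigl(e^{z},\,R(t_{1},t_{2})\bigr) &= e^{cz}\,e^{i(at_{1}+bt_{2})},
\qquad R(\pi,\pi) = -I_{4},\\[2pt]
% (-1,-I_4) multiplies to the identity of H_C, so \chi factors through
% the multiplication map exactly when it is trivial on this pair:
\chi(-1,\,-I_{4}) &= e^{c\pi i}\,e^{i(a+b)\pi} = (-1)^{a+b+c} = 1
\iff a+b\equiv c \bmod 2 .
\end{aligned}
```

For instance, the similitude character has weight $(0,0;2)$ and satisfies the condition, whereas $(0,0;1)$ does not correspond to a character of $H_{\mathbf{C}}$.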

Let $V^{\pm}\subset\mathbf{C}^{4}$ be the subspace where $h(i)$ acts via $\pm i$ . Then each $V^{\pm}$ is isotropic and we have an orthogonal direct sum $\mathbf{C}^{4}=V^{-}\oplus V^{+}$ . Let $Q^{-}\subset G(\mathbf{C})$ denote the stabilizer of $V^{-}$ . Consider the Hodge decomposition

$$\mathfrak{g}_{\mathbf{C}}=\mathfrak{g}^{0,0}\oplus\mathfrak{g}^{-1,1}\oplus\mathfrak{g}^{1,-1}$$

where $\mathfrak{g}^{p,q}$ is the subspace on which $h(z)$ acts via $z^{-p}\overline{z}^{-q}$ . Then we have $\mathfrak{g}^{0,0}=\mathfrak{k}_{\mathbf{C}}^{h}$ and we let $\mathfrak{p}^{+}=\mathfrak{g}^{-1,1}$ , $\mathfrak{p}^{-}=\mathfrak{g}^{1,-1}$ . We also let $P^{\pm}$ denote the subgroup of $G(\mathbf{C})$ generated by $\exp(\mathfrak{p}^{\pm})$ . Then we have

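$$Q^{-}=K^{h}_{\mathbf{C}}P^{-}\quad\text{and}\quad\operatorname{Lie}Q^{-}=\mathfrak{k}^{h}_{\mathbf{C}}\oplus\mathfrak{p}^{-}.$$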

Moreover, $K^{h}_{\mathbf{C}}$ is the Levi component of $Q^{-}$ and $P^{-}$ is its unipotent radical. Let

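$$f_{1}=\begin{pmatrix}1\\0\\0\\-i\end{pmatrix},\quad f_{2}=\begin{pmatrix}0\\1\\-i\\0\end{pmatrix},\quad f_{3}=\begin{pmatrix}0\\-i\\1\\0\end{pmatrix},\quad f_{4}=\begin{pmatrix}-i\\0\\0\\1\end{pmatrix}\in\mathbf{C}^{4}.$$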

Then $f_{1},f_{2}$ are a basis of $V^{-}$ and $f_{3},f_{4}$ are a basis of $V^{+}$ . With respect to the basis $f_{1},\dots,f_{4}$ of $\mathbf{C}^{4}$ , an element

$$k=\begin{pmatrix}SAS&-BS\\SB&A\end{pmatrix}\in K_{\infty,1}$$

acts via

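$$C^{-1}kC=\begin{pmatrix}SAS-iSBS&0\\0&A+iB\end{pmatrix}\quad\text{where}\quad C:=\begin{pmatrix}I_{2}&-iS\\-iS&I_{2}\end{pmatrix}.$$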

Note that the Cayley transform $C$ conjugates the Siegel parabolic $P(\mathbf{C})$ to $Q^{-}$ . Let $\Phi\subset X^{*}(H_{\mathbf{C}})$ denote the root system defined by the adjoint action of $H_{\mathbf{C}}$ on $\mathfrak{g}_{\mathbf{C}}$ . The compact roots $\Phi_{c}$ are those appearing in $\mathfrak{k}_{\mathbf{C}}^{h}$ , while the non-compact roots $\Phi_{n}$ are those appearing in $\mathfrak{p}^{+}\oplus\mathfrak{p}^{-}$ . We choose a system of positive roots $\Phi^{+}$ in such a way that the set of positive non-compact roots $\Phi^{+}_{n}=\Phi^{+}\cap\Phi_{n}$ coincides with the roots in $\mathfrak{p}^{+}$ . (We do this in order to be consistent with the conventions of [BHR94, §2.4].) We are then forced to take $\Phi^{+}$ to be the set of roots appearing in $C(\operatorname{\mathrm{Lie}}\overline{B})C^{-1}$ where $\overline{B}\subset G$ is the Borel subgroup of lower triangular matrices. With respect to the identification of $X^{*}(H_{\mathbf{C}})$ as a subset of $\mathbf{Z}^{3}$ given above, we then have:

[TABLE]

This can be seen easily from the fact that $C^{-1}h(t_{1},t_{2};0)C=\operatorname{diag}(-it_{1},-it_{2},it_{2},it_{1})$ .

Definition 2.2.

We let $X^{*}(H_{\mathbf{C}})^{+}_{K^{h}_{\mathbf{C}}}$ denote the set of weights which are dominant with respect to the system of positive roots $\Phi^{+}_{c}$. In other words, $X^{*}(H_{\mathbf{C}})^{+}_{K^{h}_{\mathbf{C}}}=\{(a,b;c)\in X^{*}(H_{\mathbf{C}}):a\geq b\}$.

This set parameterizes the irreducible complex analytic representations of $K_{\mathbf{C}}^{h}$ . For $\mu\in X^{*}(H_{\mathbf{C}})^{+}_{K^{h}_{\mathbf{C}}}$ , we let $V_{\mu}$ denote the corresponding irreducible representation of highest weight $\mu$ .

We note that the natural representation of $K^{h}_{\mathbf{C}}$ on $V^{-}$ (resp. $V^{+}$) is the irreducible representation of highest weight $(0,-1;1)$ (resp. $(1,0;1)$). Note also that the similitude character $\nu:H_{\mathbf{C}}\to\mathbf{C}^{\times}$ has weight $(0,0;2)$.

3. Some Commutative Algebra

We recall here some formalism from [CG18] for proving modularity lifting results in contexts where the Hecke algebra has “co-dimension $1$ ” over the ring of diamond operators. The notion of “balanced” below plays the role of “codimension one” for the non-regular group rings $S_{N}:=\mathcal{O}[(\mathbf{Z}/p^{N}\mathbf{Z})^{q}]$ .

3.1. Balanced Modules

Let $S$ be a Noetherian local ring with residue field $k$ and let $M$ be a finitely generated $S$ -module.

Definition 3.1.

We define the defect $d_{S}(M)$ of $M$ to be

[TABLE]

Let

[TABLE]

be a (possibly infinite) resolution of $M$ by finite free $S$ -modules. Assume that the image of $P_{i}$ in $P_{i-1}$ is contained in $\mathfrak{m}_{S}P_{i-1}$ for each $i\geq 1$ . (Such resolutions always exist and are often called ‘minimal’.) Let $r_{i}$ denote the rank of $P_{i}$ . Tensoring the resolution over $S$ with $k$ we see that $P_{i}/\mathfrak{m}_{S}P_{i}\cong\mathrm{Tor}^{i}_{S}(M,k)$ and hence that $r_{i}=\dim_{k}\mathrm{Tor}^{i}_{S}(M,k)$ .

Definition 3.2.

We say that $M$ is balanced if $d_{S}(M)\geq 0$ .

If $M$ is balanced, then we see that it admits a presentation

[TABLE]

with $d=\dim_{k}M/\mathfrak{m}_{S}M$ .
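The displayed formula for the defect is not reproduced above. The following sketch assumes, consistently with Definition 3.2 and the presentation just given, that $d_{S}(M)=r_{0}-r_{1}$ in terms of the ranks of a minimal resolution. As an illustrative example (our choice, not taken from the source), take $S=\mathcal{O}[\Delta]$ with $\Delta=\langle\sigma\rangle$ cyclic of order $p^{N}$ and $M=\mathcal{O}$ with trivial $\Delta$ -action:

```latex
\begin{gather*}
\cdots \xrightarrow{\;\sigma-1\;} S \xrightarrow{\;\mathrm{N}_{\Delta}\;} S
 \xrightarrow{\;\sigma-1\;} S \longrightarrow \mathcal{O} \longrightarrow 0,
\qquad \mathrm{N}_{\Delta}=\sum_{j=0}^{p^{N}-1}\sigma^{j},\\
\sigma-1\in\mathfrak{m}_{S},\quad \mathrm{N}_{\Delta}\in\mathfrak{m}_{S}
\;\Longrightarrow\; r_{i}=1 \text{ for all } i\geq 0,\qquad
d_{S}(M)=r_{0}-r_{1}=0 .
\end{gather*}
```

Under the assumed formula, $\mathcal{O}$ is therefore balanced over $S$ , with presentation $S\to S\to\mathcal{O}\to 0$ and $d=1$ .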

3.2. Patching

We recall the abstract Taylor–Wiles style patching result from [CG18].

Proposition 3.3.

Suppose that

(1)

$R$ is an object of $\mathcal{C}_{\mathcal{O}}$ and $H$ is a finite $R$ -module which is also finite over $\mathcal{O}$ ;

(2)

$q\geq 1$ is an integer, and for each integer $N\geq 1$ , $S_{N}:=\mathcal{O}[(\mathbf{Z}/p^{N}\mathbf{Z})^{q}]$ ;

(3)

$R_{\infty}:=\mathcal{O}[[x_{1},\dots,x_{q-1}]]$ ;

(4)

for each $N\geq 1$ , $\phi_{N}:R_{\infty}\twoheadrightarrow R$ is a surjection in $\mathcal{C}_{\mathcal{O}}$ and $H_{N}$ is an $R_{\infty}\otimes_{\mathcal{O}}S_{N}$ -module

and that for each $N\geq 1$ the following conditions are satisfied

(a)

the image of $S_{N}$ in $\mathrm{End}_{\mathcal{O}}(H_{N})$ is contained in the image of $R_{\infty}$ , and moreover, the image of the augmentation ideal of $S_{N}$ in $\mathrm{End}_{\mathcal{O}}(H_{N})$ is contained in the image of $\ker(\phi_{N})$ ;

(b)

there is an isomorphism $\psi_{N}:(H_{N})_{\Delta_{N}}\stackrel{{\scriptstyle\sim}}{{\rightarrow}}H$ of $R_{\infty}$ -modules, where $R_{\infty}$ acts on $H$ via $\phi_{N}$ and $\Delta_{N}=(\mathbf{Z}/p^{N}\mathbf{Z})^{q}$ ;

(c)

$H_{N}$ is finite and balanced over $S_{N}$ (see Definition 3.2).

Then $H$ is a free $R$ -module.

Proof.

This is Prop. 2.3 of [CG18]. ∎
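The numerology behind the phrase “co-dimension $1$ ” can be made explicit. The following count is only a sketch; it uses the standard identification of the inverse limit of the rings $S_{N}$ with a power series ring, and the symbols $S_{\infty}$ and $T_{i}$ are our notation rather than the source's:

```latex
\begin{align*}
S_{\infty}:=\varprojlim_{N}S_{N}=\mathcal{O}[[(\mathbf{Z}_{p})^{q}]]
 \cong\mathcal{O}[[T_{1},\dots,T_{q}]],
 &\qquad \dim S_{\infty}=q+1,\\
R_{\infty}=\mathcal{O}[[x_{1},\dots,x_{q-1}]],
 &\qquad \dim R_{\infty}=q,\\
\dim S_{\infty}-\dim R_{\infty} &= 1 .
\end{align*}
```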

4. Deformations of Galois representations

Let

[TABLE]

be a continuous, odd, absolutely irreducible Galois representation with similitude character of the form $\nu(\overline{r})=\overline{\epsilon}^{-(a-1)}$ where $a\geq 2$ . Let us suppose that there exist $\alpha$ and $\beta$ in $k$ such that

[TABLE]

and moreover $(\alpha^{2}-1)(\beta^{2}-1)(\alpha^{2}\beta^{2}-1)(\alpha-\beta)\neq 0$ . Let $S(\overline{r})$ denote the set of primes of $\mathbf{Q}$ away from $p$ at which $\overline{r}$ is ramified.

The group $\mathrm{GSp}_{4}$ admits an $11$ -dimensional adjoint representation on its Lie algebra $\mathfrak{g}$ . Let $\mathrm{ad}(\overline{r})$ denote the composition of $\overline{r}$ with this representation. For $p>2$ , the representation $\mathrm{ad}(\overline{r})$ admits a decomposition $\mathrm{ad}(\overline{r})=\mathrm{ad}^{0}(\overline{r})\oplus\nu$ , where $\nu$ is the similitude character of $\overline{r}$ .

We make the following further assumptions on $\overline{r}$ :

Assumption 4.1 (Big Image).

The restriction of $\overline{r}$ to $G_{\mathbf{Q}(\zeta_{p})}$ satisfies the following conditions, cf. §5.7 of [Pil12a]:

H1:

The field $\mathbf{Q}(\mathrm{ad}^{0}(\overline{r}))$ does not contain $\zeta_{p}$ .

H2:

For any $m$ , there exists an element $\sigma\in G_{\mathbf{Q}(\zeta_{p^{m}})}$ such that $\overline{r}(\sigma)$ has four distinct eigenvalues and such that the action of $\sigma$ on each irreducible representation of $\mathrm{ad}^{0}(\overline{r})$ over $G_{\mathbf{Q}(\zeta_{p^{m}})}$ contains $1$ as an eigenvalue.

H3:

Neither the image $\Gamma$ of $\mathrm{ad}^{0}(\overline{r})$ nor the image of $\mathrm{ad}^{0}(\overline{r})(1)$ admits a quotient of degree $p$ .

If this assumption holds, we say that $\overline{r}$ has big image, although condition (H1) depends on more than the group-theoretic image of $\overline{r}$ or even $\overline{r}|_{G_{\mathbf{Q}(\zeta_{p})}}$ .

Assumption 4.2 (Neatness).

There exists a $\sigma\in G_{\mathbf{Q}}$ with $\epsilon(\sigma)=q\not\equiv 1\mod p$ such that the ratio of any two eigenvalues of $\overline{r}(\sigma)$ is not equal to $q\mod p$ .

This condition is imposed to avoid dealing with stacks. If $p\geq 5$ , any surjective representation $\overline{r}:G_{\mathbf{Q}}\rightarrow\mathrm{GSp}_{4}(\mathbf{F}_{p})$ whose similitude character is a power of $\overline{\epsilon}$ will be neat. Indeed, since $p\geq 5$ and $\overline{r}$ is surjective, the image contains an element $\overline{r}(\sigma)$ which is scalar with eigenvalue $\lambda\neq\pm 1$ ; the ratio of any two of its eigenvalues is $1$ , so it suffices to check that $q=\epsilon(\sigma)\not\equiv 1\mod p$ . If $q\equiv 1\mod p$ , then the similitude character of $\overline{r}(\sigma)$ would also equal $1$ . But the similitude character of the scalar matrix $\lambda$ is $\lambda^{2}\not\equiv 1\mod p$ .
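For instance, with the illustrative choices $p=5$ and $\lambda=2$ (ours, not the source's), the argument reads as follows:

```latex
\begin{align*}
p=5,\quad \overline{r}(\sigma)=2\cdot I_{4}:\qquad
&\nu(\overline{r}(\sigma))=2^{2}=4\not\equiv 1 \bmod 5,\\
&\text{hence } q=\epsilon(\sigma)\not\equiv 1 \bmod 5,\\
&\text{every ratio of eigenvalues equals } 2/2=1\not\equiv q \bmod 5 .
\end{align*}
```

The middle step uses that the similitude character is a power of $\overline{\epsilon}$ , so $q\equiv 1$ would force the similitude of $\overline{r}(\sigma)$ to be $1$ .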

Assumption 4.3 (Ramification).

If $x\in S(\overline{r})$ , then $\overline{r}|G_{x}$ is one of the following types:

(1)

U3: $\overline{r}|I_{x}$ has unipotent image, and $\overline{r}|I_{x}$ is conjugate to the group generated by $\exp(N_{3})$ , where

[TABLE]

(2)

U2: $\overline{r}|I_{x}$ has unipotent image, and $\overline{r}|I_{x}$ is conjugate to the group generated by $\exp(N_{2})$ , where

[TABLE]

(3)

U1: $\overline{r}|I_{x}$ has unipotent image, and $\overline{r}|I_{x}$ is conjugate to the group generated by $\exp(N_{1})$ , where

[TABLE]

(4)

P: $\overline{r}|G_{x}$ is a direct sum of characters, and $\overline{r}|I_{x}$ has the form

[TABLE]

for some non-trivial character $\chi_{x}$ of $I_{x}$ . Both the plane of invariants under $I_{x}$ and the plane on which $I_{x}$ acts by $\chi_{x}$ are isotropic. Moreover $x-1$ is prime to $p$ .

(5)

H: $\overline{r}|I_{x}$ is absolutely irreducible and $x^{4}-1$ is prime to $p$ .

Remark 4.4.

Since we are assuming that the similitude character of $\overline{r}$ is a power of the cyclotomic character, it turns out that $\overline{r}|I_{x}$ can never be of type P. We expect that our arguments can also be adapted to deal with representations $\overline{r}$ with more general (odd) similitude characters, but we made this assumption to simplify some of the arguments involving $q$ -expansions (in particular, to avoid various Nebentypus characters).

Note that non-trivial unipotent representations are not direct sums, so a prime $x\in S(\overline{r})$ is either of type U, P, or H, but never simultaneously any two of these types. Moreover, $x$ is of type U2 or U3 if and only if $\overline{r}(I_{x})$ is generated by an element $\exp(N)$ where $N$ is nilpotent of rank $2$ , or $3$ respectively.

Let $Q$ denote a finite set of primes of $\mathbf{Q}$ disjoint from $S(\overline{r})\cup\{p\}$ . We assume that for each $x\in Q$ the following hold:

•

$x\equiv 1\mod p$ ,

•

$\overline{r}|G_{x}$ is a direct sum of four pairwise distinct characters. Label these characters as $\lambda(\alpha_{x}),\lambda(\beta_{x}),\lambda(\gamma_{x})$ , and $\lambda(\delta_{x})$ such that the planes $\lambda(\alpha_{x})\oplus\lambda(\beta_{x})$ and $\lambda(\gamma_{x})\oplus\lambda(\delta_{x})$ are isotropic and $\alpha_{x}\delta_{x}=\beta_{x}\gamma_{x}=\nu(\overline{r})(\mathrm{Frob}_{x})$ .

(By abuse of notation, we sometimes use $Q$ to denote the product of primes in $Q$ .) For objects $R$ in $\mathcal{C}_{\mathcal{O}}$ , a deformation of $\overline{r}$ to $R$ is a $\ker(\mathrm{GSp}_{4}(R)\to\mathrm{GSp}_{4}(k))$ -conjugacy class of continuous lifts $r:G_{\mathbf{Q}}\to\mathrm{GSp}_{4}(R)$ of $\overline{r}$ . We will often refer to the deformation containing a lift $r$ simply by $r$ .

Remark 4.5.

When deforming Galois representations over $\mathbf{Q}$ , we could work either with fixed or varying similitude character; both give rise to deformation problems with $l_{0}=1$ . We make the (somewhat arbitrary) choice to work with deformations with fixed similitude character in this paper, because it is the “correct” approach for general totally real fields: for totally real fields other than $\mathbf{Q}$ , the invariant $l_{0}$ increases (by $[F:\mathbf{Q}]-1$ ) when deforming the similitude character.

Definition 4.6.

We say that a deformation $r:G_{\mathbf{Q}}\to\mathrm{GSp}_{4}(R)$ of $\overline{r}$ is minimal outside $Q$ if it satisfies the following properties:

(1)

The similitude character $\nu(r)$ is equal to $\epsilon^{-(a-1)}$ .

(2)

If $x\not\in Q\cup S(\overline{r})\cup\{p\}$ is a prime of $\mathbf{Q}$ , then $r|G_{x}$ is unramified.

(3)

If $x\in S(\overline{r})$ is of type U1, U2 or U3, then $r|I_{x}$ has unipotent image and its image is topologically generated by an element $\exp(N)$ where $N$ is nilpotent of rank $1$ , $2$ or $3$ respectively.

(4)

If $x\in S(\overline{r})$ is of type P, then $r(I_{x})\stackrel{{\scriptstyle\sim}}{{\rightarrow}}\overline{r}(I_{x})$ .

(5)

If $x\in Q$ , then $r|G_{x}\cong V_{1}\oplus V_{2}$ where each $V_{i}$ is an isotropic plane in $R^{4}$ and $V_{1}$ lifts $\lambda(\alpha_{x})\oplus\lambda(\beta_{x})$ while $V_{2}$ lifts $\lambda(\gamma_{x})\oplus\lambda(\delta_{x})$ . Moreover, $I_{x}$ acts by scalars (via some character) on $V_{1}$ and by scalars via the inverse of this character on $V_{2}$ .

(6)

The representation $r$ has the following shape at $p$ :

[TABLE]

where $\chi_{\alpha}$ and $\chi_{\beta}$ are unramified characters lifting $\lambda(\alpha)$ and $\lambda(\beta)$ respectively, and $\psi$ is an unramified character which is trivial modulo the maximal ideal.

If $Q$ is empty, we will refer to such deformations simply as being minimal. If $r$ satisfies conditions (2)–(4), then we say $r$ is weakly minimal outside $Q$ .

Remark 4.7.

The local condition at $p$ is equivalent to asking that $r$ is ordinary (of fixed weight). When $a=2$ it is also equivalent to being finite flat. This is because, for unramified characters ${\psi_{1}}$ and $\psi_{2}$ , the group $\mathrm{Ext}^{1}({\psi_{1}},\psi_{2})$ in this category is trivial, and the group $\mathrm{Ext}^{1}(\epsilon{\psi_{1}},\psi_{2})$ is the same whether it is computed in the category of finite flat group schemes or as $G_{p}$ -modules, as long as ${\psi_{1}}{\psi_{2}}^{-1}\not\equiv 1\mod p$ . The latter condition follows (for all the relevant extensions) from the assumption $(\alpha^{2}-1)(\beta^{2}-1)(\alpha^{2}\beta^{2}-1)(\alpha-\beta)\neq 0$ , which in particular gives $\alpha\beta\neq 1$ .

The functor that associates to each object $R$ of $\mathcal{C}_{\mathcal{O}}$ the set of deformations of $\overline{r}$ to $R$ which are minimal outside $Q$ is represented by a complete Noetherian local $\mathcal{O}$ -algebra $R_{Q}$ . This follows from the proof of Theorem 2.41 of [DDT97]. If $Q=\emptyset$ , we will sometimes denote $R_{Q}$ by $R^{\min}$ .

Let $H^{1}_{Q}(\mathbf{Q},\mathrm{ad}^{0}(\overline{r}))$ denote the Selmer group defined as the kernel of the map

[TABLE]

where $x$ runs over all primes of $\mathbf{Q}$ and:

•

If $x\not\in Q\cup\{p\}$ , then $L_{Q,x}=H^{1}(G_{x}/I_{x},(\mathrm{ad}^{0}(\overline{r}))^{I_{x}})$ .

•

If $x\in Q$ , then $H^{1}(G_{x},\mathrm{ad}^{0}(\overline{r}))$ is isomorphic to the subspace of

[TABLE]

consisting of elements $(c_{1},c_{2},d_{2},d_{1})$ with $c_{1}+d_{1}=c_{2}+d_{2}$ . (Note that each summand is a copy of $\mathrm{Hom}_{\operatorname{cts}}(G_{x},k)$ .) We let $L_{Q,x}$ denote the subspace corresponding to elements $(c_{1},c_{2},d_{2},d_{1})$ with $c_{1}-c_{2}$ and $d_{1}-d_{2}$ and $c_{1}+d_{1}$ (equivalently, $c_{2}+d_{2}$ ) unramified.

•

If $x=p$ , then we define $L_{Q,p}=L_{p}$ as follows: let $\mathfrak{u}\subset\mathfrak{b}^{0}$ be the subspace of matrices whose non-zero entries appear in the upper right $2\times 2$ block. We define $L^{\prime}_{p}=\ker(H^{1}(G_{p},\mathfrak{b}^{0})\to H^{1}(I_{p},\mathfrak{b}^{0}/\mathfrak{u}))$ and $L_{p}=L_{Q,p}=\operatorname{Im}(L^{\prime}_{p}\to H^{1}(G_{p},\mathfrak{g}^{0}))$ .

Let $H^{1}_{Q}(\mathbf{Q},\mathrm{ad}^{0}(\overline{r}(1)))$ denote the corresponding dual Selmer group.

Lemma 4.8.

We have $\dim_{k}L_{p}-\dim_{k}H^{0}(G_{p},\mathrm{ad}^{0}(\overline{r}))=3$ .

Proof.

The subspace $L^{\prime}_{p}\subset H^{1}(G_{p},\mathfrak{b}^{0})$ is precisely the set of elements mapping to the subspace $H^{1}(G_{p}/I_{p},(\mathfrak{b}^{0}/\mathfrak{u})^{I_{p}})\subset H^{1}(G_{p},\mathfrak{b}^{0}/\mathfrak{u})$ . We have $\mathfrak{b}^{0}/\mathfrak{u}\cong 1\oplus 1\oplus\lambda(\beta)\lambda(\alpha)^{-1}$ as a $k[G_{p}]$ -module and hence $H^{1}(G_{p}/I_{p},(\mathfrak{b}^{0}/\mathfrak{u})^{I_{p}})$ is 2-dimensional since $\alpha\neq\beta$ . The condition $(\alpha^{2}-1)(\beta^{2}-1)(\alpha^{2}\beta^{2}-1)(\alpha-\beta)\neq 0$ implies that $h^{2}(G_{p},\mathfrak{u})=0$ and hence $H^{1}(G_{p},\mathfrak{b}^{0})\twoheadrightarrow H^{1}(G_{p},\mathfrak{b}^{0}/\mathfrak{u})$ . It follows that $\dim_{k}L^{\prime}_{p}=2+h^{1}(G_{p},\mathfrak{b}^{0})-h^{1}(G_{p},\mathfrak{b}^{0}/\mathfrak{u})$ . Thus

[TABLE]

We have $h^{0}(G_{p},\mathfrak{u})=0$ and $h^{0}(G_{p},\mathfrak{b}^{0}/\mathfrak{u})=2$ . The Euler characteristic formula implies that $h^{1}(G_{p},\mathfrak{u})=3$ . Thus

[TABLE]

Finally, the condition on $\alpha$ and $\beta$ implies that $h^{0}(G_{p},\mathfrak{g}^{0}/\mathfrak{b}^{0})=0$ . It follows that $h^{0}(G_{p},\mathfrak{b}^{0})=h^{0}(G_{p},\mathfrak{g}^{0})$ and $L^{\prime}_{p}\stackrel{{\scriptstyle\sim}}{{\rightarrow}}L_{p}$ . This concludes the proof. ∎
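The displayed computations in this proof are not reproduced above. One way to assemble the count from the quantities stated in the proof is sketched here; the first identity is our reconstruction, obtained from the long exact cohomology sequence of $0\to\mathfrak{u}\to\mathfrak{b}^{0}\to\mathfrak{b}^{0}/\mathfrak{u}\to 0$ together with $h^{2}(G_{p},\mathfrak{u})=0$ , and need not match the authors' display term by term. All cohomology groups are those of $G_{p}$ :

```latex
\begin{align*}
h^{1}(\mathfrak{b}^{0})-h^{1}(\mathfrak{b}^{0}/\mathfrak{u})
 &= h^{1}(\mathfrak{u})-h^{0}(\mathfrak{u})
    +h^{0}(\mathfrak{b}^{0})-h^{0}(\mathfrak{b}^{0}/\mathfrak{u}),\\
\dim_{k}L'_{p}-h^{0}(\mathfrak{b}^{0})
 &= 2+h^{1}(\mathfrak{u})-h^{0}(\mathfrak{u})-h^{0}(\mathfrak{b}^{0}/\mathfrak{u})
  = 2+3-0-2 = 3 .
\end{align*}
```

Combined with $h^{0}(G_{p},\mathfrak{b}^{0})=h^{0}(G_{p},\mathfrak{g}^{0})$ and $L^{\prime}_{p}\cong L_{p}$ , this is the value $3$ asserted in Lemma 4.8.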

Proposition 4.9.

The reduced tangent space $\mathrm{Hom}(R_{Q}/\mathfrak{m}_{\mathcal{O}},k[\epsilon]/\epsilon^{2})$ of $R_{Q}$ has dimension

[TABLE]

Proof.

The argument is very similar to that of Corollary 2.43 of [DDT97]. The reduced tangent space has dimension $\dim_{k}H^{1}_{Q}(\mathbf{Q},\mathrm{ad}^{0}(\overline{r}))$ . By Theorem 2.18 of loc. cit. this is equal to

[TABLE]

where $x$ runs over all finite places of $\mathbf{Q}$ . The second term is equal to [math] and the third term vanishes (by the absolute irreducibility of $\overline{r}$ and the fact that $\overline{r}\not\cong\overline{r}\otimes\epsilon$ ). Now, we have:

•

$\dim_{k}L_{Q,x}-\dim_{k}H^{0}(\mathbf{Q}_{x},\mathrm{ad}^{0}(\overline{r}))=0$ for $x\not\in Q\cup\{p\}$ ;

•

$\dim_{k}L_{Q,x}-\dim_{k}H^{0}(\mathbf{Q}_{x},\mathrm{ad}^{0}(\overline{r}))=3$ for $x=p$ ;

•

$\dim_{k}L_{Q,x}-\dim_{k}H^{0}(\mathbf{Q}_{x},\mathrm{ad}^{0}(\overline{r}))=1$ for $x\in Q$ (by [GT05, Prop. 10.4.1]); and

•

$\dim_{k}H^{0}(G_{\infty},\mathrm{ad}^{0}(\overline{r}))=4$ .

This concludes the proof. ∎
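The displayed formulas in the statement and proof are not reproduced above. As a sketch, the local contributions listed in the proof can be assembled as follows, assuming that the global $H^{0}$ terms contribute nothing in total (an assumption on our part, though it is consistent with the dimension $q-1$ stated in Proposition 4.10):

```latex
\begin{align*}
\dim_{k}H^{1}_{Q}(\mathbf{Q},\mathrm{ad}^{0}(\overline{r}))
 &= \dim_{k}H^{1}_{Q}(\mathbf{Q},\mathrm{ad}^{0}(\overline{r}(1)))
   + \underbrace{3}_{x=p} + \underbrace{\#Q}_{x\in Q} - \underbrace{4}_{\infty}\\
 &= \dim_{k}H^{1}_{Q}(\mathbf{Q},\mathrm{ad}^{0}(\overline{r}(1))) + \#Q - 1 .
\end{align*}
```

In particular, when the dual Selmer group vanishes and $\#Q=q$ , the right-hand side is $q-1$ .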

The next result (on the existence of Taylor–Wiles primes) follows from the previous proposition and the proof of [Pil12a, Prop. 5.6].

Proposition 4.10.

Let $q=\dim_{k}H^{1}_{\emptyset}(\mathbf{Q},\mathrm{ad}^{0}(\overline{r}(1)))$ and recall that we are supposing $\overline{r}$ satisfies Assumption 4.1. Then $q\geq 1$ and for any integer $N\geq 1$ we can find a set $Q_{N}$ of primes of $\mathbf{Q}$ such that

(1)

$\#Q_{N}=q$ .

(2)

$x\equiv 1\mod p^{N}$ for each $x\in Q_{N}$ .

(3)

For each $x\in Q_{N}$ , $\overline{r}$ is unramified at $x$ and $\overline{r}(\mathrm{Frob}_{x})$ has four pairwise distinct eigenvalues.

(4)

$H^{1}_{Q_{N}}(\mathbf{Q},\mathrm{ad}^{0}(\overline{r}(1)))=(0)$ .

In particular, the reduced tangent space of $R_{Q_{N}}$ has dimension $q-1$ and $R_{Q_{N}}$ is a quotient of a power series ring over $\mathcal{O}$ in $q-1$ variables.

Example 4.11 (Examples of representations with big image).

Suppose that $p\geq 5$ .

(1)

Let $K/\mathbf{Q}$ be an imaginary quadratic field not contained in $\mathbf{Q}(\zeta_{p})$ . Suppose that

[TABLE]

is a representation with determinant $\epsilon^{1-k}$ for some integer $k$ such that the images of $\overline{\rho}$ and $\overline{\rho}^{c}$ for any complex conjugation $c\in\mathrm{Gal}(\overline{\mathbf{Q}}/\mathbf{Q})$ both contain $\mathrm{SL}_{2}(\mathbf{F}_{p})$ and have totally disjoint fixed fields over $K(\zeta_{p})$ . Then the representation

[TABLE]

preserves a symplectic form and has big image. 2. (2)

Suppose the image of $\overline{r}$ is $\mathrm{GSp}_{4}(\mathbf{F}_{p})$ . Then $\overline{r}$ has big image.

Proof.

The second claim follows immediately for $p\geq 5$ by [Pil12a, Prop. 5.8]. For the first claim, it is an easy consequence of the fact that $\mathrm{SL}_{2}(\mathbf{F}_{p})$ is perfect for $p\geq 5$ that (H3) holds, and similarly, assuming that $K\not\subset\mathbf{Q}(\zeta_{p})$ , that (H1) holds. Hence it suffices to find an element in the image with distinct eigenvalues and with $1$ as an eigenvalue for every irreducible constituent of $\mathrm{ad}^{0}(\overline{r})$ . We first compute the representation $\mathrm{ad}^{0}(\overline{r})$ . Note that the duals of $\overline{\rho}$ and $\overline{\rho}^{c}$ can be identified with $\overline{\rho}\otimes\epsilon^{k-1}$ and $\overline{\rho}^{c}\otimes\epsilon^{k-1}$ respectively. Over $K$ , we have an identification

[TABLE]

and over $\mathbf{Q}$ , we have an identification

[TABLE]

where $\mathrm{As}$ is the Asai representation. Over $\mathbf{Q}(\zeta_{p^{m}})$ for any $m$ , the character $\epsilon^{k-1}$ is trivial, and hence the image of $\overline{r}|_{G_{\mathbf{Q}(\zeta_{p^{m}})}}$ under our assumptions is the group $\mathrm{SL}_{2}(\mathbf{F}_{p})^{2}\rtimes\mathbf{Z}/2\mathbf{Z}$ . Since $1$ and $-1$ are always eigenvalues of any element acting on $\mathrm{Ind}^{\mathbf{Q}}_{K}\mathrm{ad}^{0}(\overline{\rho})$ , it suffices to find an element $\sigma\in\mathrm{SL}_{2}(\mathbf{F}_{p})^{2}\rtimes\mathbf{Z}/2\mathbf{Z}$ which has distinct eigenvalues under $\overline{r}$ and has an eigenvalue $1$ in $\mathrm{As}(\overline{\rho})$ . To be more precise, since we haven’t been careful about distinguishing the Asai representation from its quadratic twist, we shall find an element with eigenvalues both $1$ and $-1$ . One can explicitly realize the Asai representation as follows. Let $V$ be the standard representation of $\mathrm{SL}_{2}(\mathbf{F}_{p})$ over $\mathbf{F}_{p}$ , and let $V\otimes V$ be the representation of the exterior product $\mathrm{SL}_{2}(\mathbf{F}_{p})\times\mathrm{SL}_{2}(\mathbf{F}_{p})$ . The element $(g,h)$ acts on $v\otimes w$ via $(g,h)(v\otimes w)=(gv\otimes hw)$ . The Asai representation is determined uniquely by the action of a fixed lift of complex conjugation $c\in\mathrm{Gal}(\overline{\mathbf{Q}}/\mathbf{Q})$ , which acts on $V\otimes V$ by the formula $c(v\otimes w)=w\otimes v$ .

Consider the elements $g,h\in\mathrm{SL}_{2}(\mathbf{F}_{p})$ such that, with respect to some chosen basis $V=\{u,v\}$ ,

[TABLE]

Then $c\cdot(g,h)$ acts on $\overline{r}$ via the matrix

[TABLE]

with eigenvalues $\pm(xy)^{1/2}$ and $\pm(xy)^{-1/2}$ . On the other hand, the action of this element via the Asai representation (and basis $u\otimes u$ , $v\otimes v$ , $u\otimes v$ , $v\otimes u$ ) is

[TABLE]

with eigenvalues $xy$ , $(xy)^{-1}$ , and $\pm 1$ . The four eigenvalues are distinct as long as $\pm(xy)^{1/2}\neq\pm(xy)^{-1/2}$ , or equivalently if $(xy)^{2}\neq 1$ . One can now choose $x=2$ and $y=1$ in $\mathbf{F}^{\times}_{p}$ . ∎
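The matrices in this proof are not reproduced above. As a sketch, assume (hypothetically) that $g$ and $h$ are the diagonal elements $g=\operatorname{diag}(x,x^{-1})$ and $h=\operatorname{diag}(y,y^{-1})$ in the basis $\{u,v\}$ ; this choice is consistent with the stated eigenvalues but is our guess. Using the formulas $(g,h)(v\otimes w)=gv\otimes hw$ and $c(v\otimes w)=w\otimes v$ from the proof, one finds:

```latex
\begin{align*}
c\,(g,h)\,(u\otimes u) &= xy\,(u\otimes u), &
c\,(g,h)\,(v\otimes v) &= (xy)^{-1}(v\otimes v),\\
c\,(g,h)\,(u\otimes v) &= xy^{-1}\,(v\otimes u), &
c\,(g,h)\,(v\otimes u) &= x^{-1}y\,(u\otimes v).
\end{align*}
```

On the span of $u\otimes v$ and $v\otimes u$ the element has trace $0$ and determinant $-1$ , which gives the eigenvalues $\pm 1$ . With $x=2$ and $y=1$ we get $(xy)^{2}=4$ , and $4\not\equiv 1\mod p$ for every $p\geq 5$ because $4-1=3$ .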

Remark 4.12.

Suppose that $K$ is an imaginary quadratic field, and suppose that $E/K$ is an elliptic curve which neither has CM nor is isogenous (over $\overline{K}$ ) to its Galois conjugate $E^{c}/K$ . We claim that Example 4.11 applies to the mod $p$ representations $\overline{\rho}:G_{K}\rightarrow\mathrm{GL}_{2}(\mathbf{F}_{p})$ associated to the dual of $E[p]$ for sufficiently large $p$ . The representations $\overline{r}$ in this case are the duals of the representations $A[p]$ associated to the abelian surface $A=\mathrm{Res}^{\mathbf{Q}}_{K}(E)$ . By [Ser72], the Galois representations $\overline{\rho}_{p},\overline{\rho}^{c}_{p}:G_{K}\rightarrow\mathrm{GL}_{2}(\mathbf{F}_{p})$ associated to the duals of $E[p]$ and $E^{c}[p]$ have images $\mathrm{GL}_{2}(\mathbf{F}_{p})$ and determinants $\epsilon^{1-2}$ for all sufficiently large $p\geq 5$ . Let $F/K$ and $F^{c}$ denote the corresponding extensions, so $\mathrm{Gal}(F/K)$ and $\mathrm{Gal}(F^{c}/K)$ are both isomorphic to $\mathrm{GL}_{2}(\mathbf{F}_{p})$ , and $\mathrm{Gal}(F/K(\zeta_{p}))$ and $\mathrm{Gal}(F^{c}/K(\zeta_{p}))$ are both isomorphic to $\mathrm{SL}_{2}(\mathbf{F}_{p})$ . By the simplicity of $\mathrm{PSL}_{2}(\mathbf{F}_{p})$ for $p\geq 5$ , the only non-trivial quotients of $\mathrm{SL}_{2}(\mathbf{F}_{p})$ are $\mathrm{PSL}_{2}(\mathbf{F}_{p})$ and $\mathrm{SL}_{2}(\mathbf{F}_{p})$ . This implies that if $H:=F\cap F^{c}\supseteq K(\zeta_{p})$ is strictly larger than $K(\zeta_{p})$ , then either $\mathrm{Gal}(H/K)=\mathrm{GL}_{2}(\mathbf{F}_{p})$ , or $\mathrm{Gal}(H/K)=\mathrm{GL}_{2}(\mathbf{F}_{p})/\pm I$ . In either case, the projective representations associated to $\overline{\rho}_{p}$ and $\overline{\rho}^{c}_{p}$ both factor through $\mathrm{Gal}(H/K)$ .
Since all automorphisms of $\mathrm{PGL}_{2}(\mathbf{F}_{p})$ are inner, this implies that the projective representations associated to $\overline{\rho}_{p}$ and $\overline{\rho}^{c}_{p}$ are isomorphic, and hence $\overline{\rho}_{p}\simeq\overline{\rho}^{c}_{p}\otimes\chi_{p}$ for some character $\chi_{p}$ which (by comparing determinants) is at most quadratic. Assume $p$ is sufficiently large so that $E$ has good reduction at all primes above $p$ and moreover that $p$ is unramified in $K$ . Then $\overline{\rho}_{p}$ and $\overline{\rho}^{c}_{p}$ are both finite flat at $v|p$ , which forces $\chi_{p}$ to be unramified at all primes above $p$ . But this implies that $\chi_{p}$ is unramified outside primes dividing the conductor $N$ and $N^{c}$ of $E$ and $E^{c}$ respectively. There are only finitely many such quadratic characters by class field theory. Hence, if there are infinitely many primes $p$ for which the assumptions of Example 4.11 do not hold, then there exists a fixed character $\chi$ with $\chi^{2}=1$ and isomorphisms $\overline{\rho}_{p}\simeq\overline{\rho}^{c}_{p}\otimes\chi$ for infinitely many $p$ . Such an isomorphism (for a single $p$ ) implies that $a_{v}=\chi(v)a_{v^{c}}\mod p$ for all pairs of conjugate primes $v$ and $v^{c}$ of good reduction for $E$ , and hence, given infinitely many such $p$ , one deduces the equality $a_{v}=\chi(v)a_{v^{c}}$ . If $L/K$ is the (at most) quadratic extension in which $\chi$ splits, this implies (by Cebotarev) that the Tate modules (for any fixed prime) of $E$ and $E^{c}$ are isomorphic, and hence (by Faltings [Fal83]) that $E$ and $E^{c}$ are isogenous over $L$ .

5. Siegel threefolds

5.1. Level Structure

Recall that there are two conjugacy classes of maximal parabolic subgroups of $\mathrm{GSp}_{4}$ represented by the Siegel parabolic $P$ which is block upper triangular with Levi

[TABLE]

and the Klingen parabolic $\Pi$ which is block upper triangular with Levi

[TABLE]

These both contain the Borel subgroup $B$ . For each prime $x$ , these give rise to parahoric subgroups $P(x)$ , $\Pi(x)$ , and $I(x)$ of $\mathrm{GSp}_{4}(\mathbf{Z}_{x})$ , namely, the inverse image of the corresponding parabolic subgroups over $\mathbf{F}_{x}$ . (The group $I(x)$ is called the Iwahori subgroup.) The Klingen parahoric subgroup contains a normal subgroup $\Pi(x)^{+}$ with $\Pi(x)/\Pi(x)^{+}\simeq(\mathbf{Z}/x\mathbf{Z})^{\times}$ (via projection onto $\lambda\bmod x$ ). For each prime $x$ , we also have the Paramodular group $K(x)$ , which is the stabilizer in $\mathrm{GSp}_{4}(\mathbf{Q}_{x})$ of $\mathbf{Z}_{x}\oplus\mathbf{Z}_{x}\oplus\mathbf{Z}_{x}\oplus x\mathbf{Z}_{x}$ , and is the intersection

[TABLE]

for values $*\in\mathbf{Z}_{x}$ .

5.2. Cohomology of Siegel $3$ -folds

Let $S$ and $Q$ be finite sets of primes of $\mathbf{Q}$ which are disjoint from each other and do not contain $p$ . By a slight abuse of notation, we will sometimes denote the product of the primes in $Q$ by the same symbol $Q$ . For each $x\in S$ , let $K_{x}\subset\mathrm{GSp}_{4}(\mathbf{Z}_{x})$ equal one of $P(x)$ , $\Pi(x)$ , $K(x)$ , $\Pi(x)^{+}$ , $I(x)$ or the full congruence subgroup of level $x$ . For $x\not\in S$ , we let $K_{x}=\mathrm{GSp}_{4}(\mathbf{Z}_{x})$ and we define $K:=\prod_{x}K_{x}\subset\mathrm{GSp}_{4}(\mathbb{A}^{\infty})$ . For $x\in Q$ , we let $K_{x,0}=\Pi(x)$ and $K_{x,1}=\Pi(x)^{+}$ . Let $K_{i}(Q)=\prod_{x\not\in Q}K_{x}\times\prod_{x\in Q}K_{x,i}$ for $i=0,1$ .

We assume that the subgroup $K$ is neat. (This will be the case if $S$ contains a prime $x\geq 3$ where $K_{x}$ is the full congruence subgroup of level $x$ .) We let $Y_{K}\to\mathrm{Spec}(\mathcal{O})$ (resp. $Y_{K_{i}(Q)}\to\mathrm{Spec}(\mathcal{O})$ ) denote the Siegel moduli space of level $K$ (resp. $K_{i}(Q)$ ). This scheme classifies principally polarized abelian varieties together with a $K$ -level structure (resp. $K_{i}(Q)$ -level structure). (See [Pil12b, §4.1].) In each case we denote the universal abelian variety by $\mathcal{A}$ .

If $Y$ denotes one of the above spaces, we can choose a toroidal compactification $X\to\mathrm{Spec}(\mathcal{O})$ of $Y$ . The abelian scheme $\mathcal{A}$ then extends to a semi-abelian scheme $\pi:\mathcal{A}\to X$ and the sheaf $\mathcal{E}:=\pi_{*}\Omega^{1}_{\mathcal{A}/X}$ is a locally free $\mathcal{O}_{X}$ -module of rank 2. For integers $a\geq b$ , we let $\omega(a,b):=\mathrm{Sym}^{a-b}\mathcal{E}\otimes\det^{b}\mathcal{E}$ . We also denote $\det\mathcal{E}$ by $\omega$ , so, for example, $\omega(a,a)=\omega^{a}$ is a line bundle. If $M$ is an $\mathcal{O}$ -module, we will let $\omega(a,b)_{M}$ denote the sheaf $\omega(a,b)\otimes_{\mathcal{O}}M$ . The coherent cohomology groups $H^{i}(X,\omega(a,b)_{M})$ are independent of the choice of toroidal compactification $X$ (see [Lan13, Lemma 7.1.1.4] and the proof of [Lan13, Lemma 7.1.1.5]). The Koecher principle states that there is an isomorphism

[TABLE]

We may therefore pass freely between the open variety $Y$ and the (any) smooth projective toroidal compactification $X$ without comment when dealing with $H^{0}$ .
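To fix ideas about the sheaves $\omega(a,b)$ , here are a few instances that follow directly from the definition $\omega(a,b)=\mathrm{Sym}^{a-b}\mathcal{E}\otimes\det^{b}\mathcal{E}$ with $\mathcal{E}$ of rank 2 (the specific weights are chosen only for illustration):

```latex
\begin{align*}
\operatorname{rank}\omega(a,b) &= \operatorname{rank}\mathrm{Sym}^{a-b}\mathcal{E} = a-b+1,\\
\omega(1,0) &= \mathcal{E}, \qquad \omega(a,a)=(\det\mathcal{E})^{a}=\omega^{a},\\
\omega(2,2) &= \omega^{2}\ (\text{rank } 1), \qquad
\omega(3,2)=\mathcal{E}\otimes\omega^{2}\ (\text{rank } 2).
\end{align*}
```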

We choose toroidal compactifications $X_{K}$ and $X_{K_{0}(Q)}$ so that the natural map $Y_{K_{0}(Q)}\to Y_{K}$ extends to a map $X_{K_{0}(Q)}\to X_{K}$ . As explained in § 4.1.2 of [Pil12b], the universal subgroup $H\subset\mathcal{A}[Q]$ over $Y_{K_{0}(Q)}$ extends to $X_{K_{0}(Q)}$ . We then define the toroidal compactification $X_{K_{1}(Q)}=\mathrm{Isom}_{X_{K_{0}(Q)}}(\mathbf{Z}/Q,H)$ . The resulting map $X_{K_{1}(Q)}\to X_{K_{0}(Q)}$ is then finite étale with Galois group $\Delta_{Q}:=(\mathbf{Z}/Q)^{\times}$ .

5.3. Vanishing results

Let $X$ denote one of the toroidal compactifications defined in the previous section. We first record some consequences of a vanishing theorem of Lan and Suh.

Theorem 5.1.

(1)

Suppose that $a\geq 3$ and $2\leq a-b\leq p-2$ . Then

[TABLE]

for $i>2$ .

(2)

Suppose that $a+b\geq 6$ and $2\leq a-b\leq p-2$ . Then

[TABLE]

for $i>1$ .

(3)

Suppose that $b\geq 4$ and $0\leq a-b\leq p-4$ . Then

[TABLE]

for $i>0$ .

Proof.

This follows from [LS13, Cor. 7.24] after unwinding definitions. We take the group scheme $\mathrm{G}_{1}/R_{1}$ (in the notation of [LS13]) to be our $G/\mathcal{O}$ . The groups $\mathrm{M}_{1}\subset\mathrm{P}_{1}\subset\mathrm{G}_{1}$ correspond to the Siegel Levi and parabolic: $M\subset P\subset G$ . The set of dominant weights $X_{\mathrm{G}_{1}}^{+}$ (resp. $X_{\mathrm{M}_{1}}^{+}$ ) is our $X^{*}(T)^{+}_{G}$ (resp. $X^{*}(T)^{+}_{M}$ ) from Definition 2.1.

In this paragraph, we show that the subset $X_{\mathrm{G}_{1}}^{+,<_{\mathrm{re}}p}\subset X_{\mathrm{G}_{1}}^{+}$ as defined in [LS12, Defn. 6.3] corresponds to the set of those $\mu=(a,b;c)\in X^{*}(T)_{G}^{+}$ such that $a+b<p-3$ . As an intermediate step, we first show that $X_{\mathrm{G}_{1}}^{+,<_{\mathrm{re}}p}$ corresponds to those $\mu=(a,b;c)\in X^{*}(T)_{G}^{+}$ such that:

•

$\langle\mu+\rho,\pm\alpha_{i}^{\vee}\rangle\leq p$ for $i=1,\dots,4$ ;

•

$a+b+3<p$ .

To see this, we note the following: to lie in $X_{\mathrm{G}_{1}}^{+,<_{\mathrm{re}}p}$ , by definition, the element $\mu$ must satisfy $|\mu|_{L}+d<p$ and must also lie in $X_{\mathrm{G}_{1}}^{+,<_{\mathrm{W}}p}$ . The definition of $|\mu|_{L}$ in Definition 3.2 of [LS12] boils down to $|\mu|_{L}=a+b$ (the set $\Upsilon$ in our case consists of the single embedding $\mathbf{Z}\hookrightarrow\mathcal{O}$ and the norm $|\mu|=a+b$ is defined near the beginning of §2.5). The dimension $d$ is defined in Definition 3.9 of [LS12] to be $\dim_{\mathcal{O}}(X)$ which is 3 in our case. Next, the set $X_{\mathrm{G}_{1}}^{+,<_{\mathrm{W}}p}$ is defined in Definition 3.2 to consist of those $\mu\in X_{\mathrm{G}_{1}}^{+,<p}$ for which $|\mu|_{L}<p$ . Finally, the set $X_{\mathrm{G}_{1}}^{+,<p}$ is defined in Definition 2.29 to consist of all dominant $\mu\in X^{+}_{\mathrm{G}_{1}}$ which satisfy the first condition above. This establishes the intermediate step. Now, if $\mu\in X^{*}(T)^{+}_{G}$ , then the largest of the $\langle\mu+\rho,\pm\alpha_{i}^{\vee}\rangle$ is $\langle\mu+\rho,\alpha_{3}^{\vee}\rangle=a+b+3$ . Thus, we see that $\mu\in X_{\mathrm{G}_{1}}^{+,<_{\mathrm{re}}p}$ if and only if $a\geq b\geq 0$ and $a+b<p-3$ .
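The reduction in the last step can be spelled out as follows. This only restates the inequalities above, using $|\mu|_{L}=a+b$ and $d=3$ as in the text:

```latex
\begin{align*}
|\mu|_{L}+d<p &\iff a+b+3<p \iff a+b<p-3,\\
\max_{i}\,\langle\mu+\rho,\pm\alpha_{i}^{\vee}\rangle = a+b+3 < p
 &\;\Longrightarrow\; \langle\mu+\rho,\pm\alpha_{i}^{\vee}\rangle\leq p
 \ \text{ for all } i .
\end{align*}
```

For example, with $p=11$ (an illustrative value) the condition reads $a+b\leq 7$ , so $(a,b)=(4,3)$ is allowed while $(5,3)$ is not.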

The set $X_{\mathrm{M}_{1}}^{+,<p}$ , by Definition 2.29 of [LS12], is $\{\mu\in X^{*}(T)^{+}_{M}:\langle\mu+\rho,\alpha_{1}^{\vee}\rangle\leq p\}=\{(a,b;c)\in X^{*}(T)^{+}_{M}:(a+2)-(b+1)\leq p\}$ . By Lemma 7.2, Definition 7.14 (which is vacuous in our case) and Proposition 7.15 of [LS12], a weight $\mu=(a,b;c)$ lies in $X_{\mathrm{M}_{1}}^{+,<p}$ and is positive parallel if and only if $a=b>0$ .

If $\mu=(a,b;c)\in X^{*}(T)^{+}_{M}$ , then a pair of vector bundles ${\mathcal{W}}_{\mu}^{*}$ , for $*\in\{\mathrm{can},\mathrm{sub}\}$ is defined in [LS13]. Indeed $\mu$ determines an algebraic representation of $M\cong\mathrm{GL}_{2}\times\mathrm{GL}_{1}$ over $\mathcal{O}$ with highest weight $(a,b;c)$ (namely $(\mathrm{Sym}^{a-b}S_{2}\otimes\det^{b}S_{2})\otimes S_{1}^{\otimes c}$ where $S_{i}$ is the standard representation of $\mathrm{GL}_{i}$ ) and the corresponding bundles are then defined by [LS13, Defn. 4.12]. We claim that

[TABLE]

(We note that the parameter $c$ does not change the underlying vector bundle, but does change the Hecke action on cohomology by a power of the similitude character.) Let $\mu=(0,-1;1)$ , let $L$ denote the standard representation of $G$ and $L_{0}^{\vee}(1)\subset L$ the subspace spanned by the first two standard basis vectors. Then $L_{0}^{\vee}(1)$ is the standard representation of the $\mathrm{GL}_{2}$ -factor of $M$ and is the representation of $M$ corresponding to $(1,0;0)$ . The representation $L_{0}=(L_{0}^{\vee}(1))^{\vee}(1)$ thus corresponds to $\mu=(0,-1;1)$ . By [LS12, Example 1.22], we have (in the notation of that paper) $\mathcal{E}_{\mathrm{M}_{1}}(L_{0})=\operatorname{\mathrm{Lie}}_{\mathcal{A}/Y}$ . However, $\mathcal{W}_{\mu}=\mathcal{E}_{\mathrm{M}_{1}}(L_{0})$ by definition, and we have $\operatorname{\mathrm{Lie}}_{\mathcal{A}/Y}=\mathcal{E}^{\vee}=\omega(0,-1)$ . It follows that $\mathcal{W}_{(0,-1;1)}^{\mathrm{can}}\cong\omega(0,-1)$ . We deduce that $\omega(a,b)=(\mathrm{Sym}^{a-b}\otimes\det{}^{b})(\omega(1,0))=\mathcal{W}_{(a,b;-a-b)}$ , as required.

With these preliminaries out of the way, we now apply [LS13, Cor. 7.24]. We take $\mu=(\alpha,\beta;\gamma)\in X_{\mathrm{G}_{1}}^{+,<_{\mathrm{re}}p}$ . (The condition that $\max(2,r_{\tau})<p$ when $\tau=\tau\circ c$ boils down to $2<p$ in our case.) We take $\nu=(t,t;0)$ a positive parallel weight. We therefore have $t>0$ , $\alpha\geq\beta\geq 0$ and $\alpha+\beta<p-3$ .

We now apply part 2 of [LS13, Cor. 7.24] successively with $w\in W^{M_{1}}$ taken to equal each of the elements $\widetilde{w}_{1},\widetilde{w}_{2},\widetilde{w}_{3}$ from Section 2. Note that each $\widetilde{w}_{i}$ has length $i$ . If we take $w=\widetilde{w}_{1}$ , then (ignoring the third component):

[TABLE]

Thus $({\mathcal{W}}^{\vee}_{\widetilde{w}_{1}\cdot\mu-\nu})^{\mathrm{sub}}=\omega(\beta+2+t,-\alpha+t)(-\infty)$ . Then [LS13, Cor. 7.24] implies

[TABLE]

for each $i>2$ . Taking $a=\beta+2+t$ and $b=-\alpha+t$ gives the first part of our proposition.

Similarly, if $w=\widetilde{w}_{2}$ , then:

[TABLE]

Hence

[TABLE]

for $i>1$ . This gives the second part of the proposition.

Finally, if we take $w=\widetilde{w}_{3}$ , then:

[TABLE]

Hence

[TABLE]

for $i>0$ . This gives the last part of the proposition. ∎

It is interesting to compare the above vanishing result in characteristic $p$ with the following characteristic 0 vanishing results due to Blasius–Harris–Ramakrishnan, Mirković, Williams and Schmid. We have an identification

[TABLE]

where $U\subset G(\mathbb{A}^{\infty})$ is the open compact subgroup used to define $Y$ and $K^{h}$ is the compact-mod-center subgroup defined in Section 2.0.2. To any finite dimensional $\mathbf{C}$ -representation $(\sigma,V_{\sigma})$ of $K^{h}_{\mathbf{C}}$ , there is an associated vector bundle $\mathcal{V}_{\sigma}$ on $Y(\mathbf{C})$ which is defined in [BHR94, Defn. 1.3.2]. This bundle has extensions $\mathcal{V}_{\sigma}^{\mathrm{sub}}\subset\mathcal{V}_{\sigma}^{\mathrm{can}}$ to $X(\mathbf{C})$ . In [BHR94], the bundle $\mathcal{V}_{\sigma}^{\mathrm{can}}$ is denoted $\widetilde{\mathcal{V}}_{\sigma}$ . We have $\mathcal{V}^{\mathrm{sub}}_{\sigma}=\mathcal{V}_{\sigma}^{\mathrm{can}}(-\infty)$ . For each $i\geq 0$ , we define:

[TABLE]

Let $\widetilde{H}^{i}(\mathcal{V}_{\sigma}^{\mathrm{sub}})$ and $\widetilde{H}^{i}(\mathcal{V}_{\sigma}^{\mathrm{can}})$ denote the direct limit of $H^{i}(X(\mathbf{C}),\mathcal{V}_{\sigma}^{\mathrm{sub}})$ and $H^{i}(X(\mathbf{C}),\mathcal{V}_{\sigma}^{\mathrm{can}})$ respectively over all levels $K$ . Let $\overline{H}^{i}(\mathcal{V}_{\sigma})$ denote the corresponding limit of $\overline{H}^{i}(X(\mathbf{C}),\mathcal{V}_{\sigma})$ (including both an overline and a tilde in the notation was too cumbersome; hopefully no confusion will result).

Let $\mathcal{A}_{(2)}(G)$ denote the space of automorphic forms on $G(\mathbf{Q})\backslash G(\mathbb{A})$ which are square integrable modulo the centre $Z_{G}(\mathbb{A})$ . Let $\mathcal{A}_{0}(G)\subset\mathcal{A}_{(2)}(G)$ denote the space of cusp forms. For $(\sigma,V_{\sigma})$ a representation of $K^{h}_{\mathbf{C}}$ as above and $i\geq 0$ , we define:

[TABLE]

Then we have the following result of Harris:

Theorem 5.2.

There are canonical maps, forming a commutative diagram:

[TABLE]

Moreover:

(1)

The composition $\mathcal{H}^{i}_{\mathrm{cusp},\sigma}\to\overline{H}^{i}(\mathcal{V}_{\sigma})$ is injective for all $i$ , and is an isomorphism for $i=0,3$ .

(2)

The image of $\mathcal{H}^{i}_{(2),\sigma}$ in $\widetilde{H}^{i}(\mathcal{V}_{\sigma}^{\mathrm{can}})$ contains $\overline{H}^{i}(\mathcal{V}_{\sigma})$ .

Proof.

This follows from [Har90, Theorem 2.7 & Prop. 3.2.2]. ∎

For $*\in\{\mathrm{cusp},(2)\}$ , we then define $\widetilde{H}^{i}(\mathcal{V}_{\sigma}^{\mathrm{can}})_{*}$ to be the image of the space $\mathcal{H}^{i}_{*,\sigma}$ in $\widetilde{H}^{i}(\mathcal{V}_{\sigma}^{\mathrm{can}})$ . Thus we have

[TABLE]

For $*\in\{\mathrm{cusp},(2)\}$ , the space $\mathcal{A}_{*}(G)$ is semisimple as a $G(\mathbb{A})$ -representation and we decompose:

[TABLE]

We let $\mathcal{A}_{*}(G)_{\mathrm{temp}}$ denote the subspace

[TABLE]

where the sum is over all those $\pi$ such that $\pi_{\infty}$ is essentially tempered. We define $\mathcal{H}^{i}_{*,\sigma,\mathrm{temp}}\subset\mathcal{H}^{i}_{*,\sigma}$ by replacing $\mathcal{A}_{*}(G)$ with $\mathcal{A}_{*}(G)_{\mathrm{temp}}$ in the definition of $\mathcal{H}^{i}_{*,\sigma}$ . We then define

[TABLE]

to be the image of $\mathcal{H}^{i}_{*,\sigma,\mathrm{temp}}\to\widetilde{H}^{i}(\mathcal{V}_{\sigma}^{\mathrm{can}})$ . We may also define analogous spaces

[TABLE]

by applying $K$ -invariants to the constructions above, where $K$ is the level of $X(\mathbf{C})$ .

Suppose now that $(\sigma,V_{\sigma})$ is the irreducible representation of $K^{h}_{\mathbf{C}}$ of highest weight $\mu=(a,b;c)\in X^{*}(H_{\mathbf{C}})$ , with respect to the system of positive weights fixed in § 2. We first of all observe that the bundle $\mathcal{V}_{\sigma}$ does not depend on $c$ . Indeed, let $(\tau,V_{\tau})$ be the irreducible representation of highest weight $(a,b;c+2)$ . Consider the $G(\mathbf{Q})$ -equivariant bundles $\mathcal{V}_{\sigma}^{\vee}=G(\mathbf{C})\times_{Q^{-}}V_{\sigma}$ and $\mathcal{V}_{\tau}^{\vee}=G(\mathbf{C})\times_{Q^{-}}V_{\tau}$ on $G(\mathbf{C})/Q^{-}$ defined in [BHR94, §1.3]. (The superscripted ∨’s do not refer to dual bundles here.) Then by the definition of $\mathcal{V}_{\sigma}$ , it suffices to show that $\mathcal{V}_{\sigma}^{\vee}\stackrel{{\scriptstyle\sim}}{{\rightarrow}}\mathcal{V}_{\tau}^{\vee}$ as $G(\mathbf{Q})$ -equivariant bundles. We have that $\tau=\sigma\otimes\nu$ , so we may take the underlying space of $\tau$ to be $V_{\tau}=V_{\sigma}$ and the action to be $\tau(g)=\nu(g)\sigma(g)\in\mathrm{End}(V_{\sigma})$ for all $g\in K^{h}_{\mathbf{C}}$ . Then the map

[TABLE]

gives the required isomorphism $\mathcal{V}_{\sigma}^{\vee}\stackrel{{\scriptstyle\sim}}{{\rightarrow}}\mathcal{V}_{\tau}^{\vee}$ . (Note however that the Hecke action on the cohomology of $\mathcal{V}_{\sigma}$ will depend on $c$ – changing the value of $c$ introduces a corresponding twist by a power of the similitude character in the Hecke action.)

For $\mu\in X^{*}(H_{\mathbf{C}})^{+}_{K^{h}_{\mathbf{C}}}$ a dominant weight, we let $\mathcal{V}_{\mu}$ denote the vector bundle associated to the irreducible $K^{h}_{\mathbf{C}}$ -representation $W_{\mu}$ . We would like to compare these bundles to the bundles introduced in the proof of Theorem 5.1.

Definition 5.3.

Let $\mu=(a,b;c)\in X^{*}(T)^{+}_{M}$ . We let $\mathcal{W}_{\mu}$ denote the canonical extension $\mathcal{W}_{\mu}^{\mathrm{can}}$ in the notation of the proof of Theorem 5.1, and we let $\mathcal{W}_{\mu}^{\mathrm{sub}}=\mathcal{W}_{\mu}(-\infty)$ .

We saw above that, as vector bundles over $X$ , we have:

[TABLE]

though the Hecke action on the cohomology of $\mathcal{W}_{\mu}$ will depend on $c$ .

Lemma 5.4.

Let $\mu=(a,b;c)\in X^{*}(T)^{+}_{M}$ . Then, over $X(\mathbf{C})$ , we have:

[TABLE]

compatibly with Hecke actions on cohomology.

Proof.

It suffices to prove the isomorphism over $Y$ . Consider the short exact sequence:

[TABLE]

and the Poincaré duality pairing

[TABLE]

(See [LS12, §1.2]).

Expressed in terms of the functor $\mathcal{W}_{\mu}$ of Lan–Suh, the short exact sequence becomes:

[TABLE]

and the bundle $\mathcal{O}_{Y}(1)$ becomes $\mathcal{W}_{(0,0;1)}$ . (See [LS12, Example 1.22].)

Similarly, over $Y(\mathbf{C})$ the short exact sequence becomes

[TABLE]

and $\mathcal{O}_{Y}(1)$ is identified with $\mathcal{V}_{(0,0;2)}$ . This follows from [Mil90, Example III.2.4]: if we take the point $o\in\check{X}$ to be $h(i)=J$ in the notation of Section 2.0.2, then the isotropic subspace corresponds to $V^{-}$ and $V/W$ corresponds to $V^{+}$ . As remarked at the end of Section 2.0.2, we have $V^{-}=W_{(0,-1;1)}$ , $V^{+}=W_{(1,0;1)}$ and the similitude character corresponds to $W_{(0,0;2)}$ . Note also that the notation $\mathcal{H}_{\mathrm{dR}}(\mathcal{A})$ of [Mil90] refers to de Rham homology (see §I.3).

It follows that, over $Y(\mathbf{C})$ , we have $\mathcal{W}_{(0,0;1)}=\mathcal{V}_{(0,0;2)}$ and $\mathcal{W}_{(1,0;0)}=\mathcal{V}_{(0,-1;1)}$ . Thus,

[TABLE]

This is compatible with Hecke action on cohomology since all isomorphisms respect the equivariant constructions. ∎

The Weyl chambers $C_{0},\dots,C_{4}\subset X^{*}(T)\otimes_{\mathbf{Z}}\mathbf{R}\cong\mathbf{R}^{3}$ are defined in Section 2.0.1. We have

[TABLE]

Theorem 5.5.

Let $\mu=(a,b;c)\in X^{*}(T)^{+}_{M}$ . Then:

[TABLE]

for all $0\leq i\leq 3$ such that

[TABLE]

Proof.

In Section 2.0.2, we identified $X^{*}(H_{\mathbf{C}})$ with $\mathbf{Z}^{3}$ . Under the resulting identification of $X^{*}(H_{\mathbf{C}})\otimes_{\mathbf{Z}}\mathbf{R}$ with $\mathbf{R}^{3}$ , the chambers $C_{i}$ for $X^{*}(T)^{+}_{M}\otimes_{\mathbf{Z}}\mathbf{R}$ above correspond to Weyl chambers in $X^{*}(H_{\mathbf{C}})\otimes_{\mathbf{Z}}\mathbf{R}$ . Let $\sigma=(-b,-a;a+b+2c)$ , regarded as an element of $X^{*}(H_{\mathbf{C}})$ . Then we have seen above that

[TABLE]

Suppose that

[TABLE]

Then by Theorem 5.2, there is some $\pi=\pi^{\infty}\otimes\pi_{\infty}$ in $\mathcal{A}_{(2)}(G)_{\mathrm{temp}}$ such that

[TABLE]

By a theorem of Mirković [Har90, Theorem 3.5], $\pi_{\infty}$ is a discrete series or limit of discrete series. Hence, using the Harish-Chandra parameterization, we may write $\pi_{\infty}=\pi(\lambda,C)^{*}=\pi(-w_{0}(\lambda),-w_{0}(C))$ for some Weyl chamber $C\in\{C_{0},\dots,C_{3}\}$ and a weight $\lambda\in C\cap\left(X^{*}(H_{\mathbf{C}})+\rho\right)$ . By [BHR94, Theorem 3.2.1], it follows that:

[TABLE]

and

[TABLE]

where $\Phi(C)^{+}$ is the system of positive roots determined by the chamber $C$ . For $j=0,\dots,3$ , we have $\#\left(\Phi(C_{j})^{+}\cap\Phi_{n}^{+}\right)=3-j$ . Hence we must have $C=C_{3-i}$ and $\lambda\in C_{3-i}$ . However, $C_{3-i}=-w_{0}(C_{i})$ , so $-w_{0}(\lambda)\in C_{i}$ . We have:

[TABLE]

Thus, we deduce that $-w_{0}(\lambda)=(a-1,b-2;a+b+2c)$ lies in $C_{i}$ . This is equivalent to the condition in the statement of the theorem. ∎

We also record the following:

Theorem 5.6.

Let $\mu=(a,b;c)\in X^{*}(T)^{+}_{M}$ , let $w=-(a+b+2c)$ , and let $\sigma=(-b,-a;a+b+2c)=(-b,-a;-w)$ , regarded as an element of $X^{*}(H_{\mathbf{C}})$ . Suppose that $\pi=\pi^{\infty}\otimes\pi_{\infty}$ in $\mathcal{A}_{(2)}(G)$ contributes to $H^{i}(X(\mathbf{C}),\mathcal{W}_{\mu})_{(2)}\cong H^{i}(X(\mathbf{C}),\mathcal{V}_{\sigma})_{(2)}$ .

(1)

The infinitesimal character of $\pi_{\infty}$ is given under the Harish-Chandra isomorphism by:

[TABLE]

(2)

Let $\widetilde{\pi}_{\infty}$ denote the transfer of $\pi_{\infty}$ to $\mathrm{GL}_{4}(\mathbf{R})$ . Then the infinitesimal character of $\widetilde{\pi}_{\infty}$ is given under the Harish-Chandra isomorphism by $\chi_{\tau}$ where:

[TABLE]

(3)

If furthermore, $\pi_{\infty}$ is tempered, then $\pi_{\infty}$ is a discrete series or limit of discrete series representation, and is given under the Harish-Chandra parameterization by:

[TABLE]

Proof.

For the first part, we have that

[TABLE]

It follows from [BHR94, Theorem 3.2.1] that the infinitesimal character of $\pi_{\infty}$ is equal to $\chi_{((-\sigma-\rho)|_{\mathrm{Sp}_{4}(\mathbf{R})};-w)}$ . The second part can be inferred from [Sor10, §2.1.2]. The last part is due to Mirković and was established in the proof of Theorem 5.5. ∎

Definition 5.7.

A weight $\mu=(a,b;c)\in X^{*}(T)^{+}_{M}$ such that $(a-1,b-2;c)$ lies in the interior of a unique Weyl chamber $C_{i}$ is said to be a discrete series weight or a regular weight. If $\mu-w_{0}(\rho)$ lies in the intersection of exactly two of the Weyl chambers $C_{i}$ , we say it is a limit of discrete series weight or a non-regular weight.

From the explicit description of the Weyl chambers $C_{i}$ above, we see that the limit of discrete series weights thus come in 3 families:

[TABLE]

Note that for the corresponding families of vector bundles $\mathcal{W}_{\mu}=\omega(a,b)$ , the first and third are interchanged under the Serre duality map $\omega(a,b)\mapsto\omega(a,b)^{{\vee}}\otimes\det\Omega^{1}_{X}\cong\omega(3-b,3-a)(-\infty)$ while the second family is stable under this operation. (Up to interchanging the canonical and subcanonical extensions, of course.) The preceding theorem implies that for all $a\geq 2$ , we have:

[TABLE]

(Technically, we should normalize the Hecke action on the cohomology of $\omega(a,b)$ before we adjoin the subscripts $(2)$ or $\mathrm{temp}$ . See Section 5.5 below.) From the result of Lan and Suh, we deduce the following characteristic $p$ analogue of these vanishing results for limit of discrete series weights.

Corollary 5.8.

(1)

For $4\leq a\leq p$ , we have

[TABLE]

for $i=2,3$ .

(2)

For $3\leq a\leq(p+1)/2$ , we have

[TABLE]

Proof.

The vanishing results for the subcanonical extensions $\omega(*,*)(-\infty)$ follow directly from Theorem 5.1. The fact that

[TABLE]

in the second part then follows from Serre duality since:

[TABLE]

∎

5.4. Torsion Classes

It seems natural to ask whether one can (explicitly or otherwise) construct classes in $H^{0}(X,\omega(2,2))$ which do not lift to characteristic zero. Let us recall what happens for classical modular forms of weight one.

Suppose that $X_{1}(N)$ denotes (for this paragraph) the classical modular curve. A non-Eisenstein Hecke eigenclass in $H^{0}(X_{1}(N),\omega_{k})$ gives rise to an irreducible Galois representation $\overline{r}:G_{\mathbf{Q}}\rightarrow\mathrm{GL}_{2}(k)$ . Suppose that the image of $\overline{r}$ contains $\mathrm{SL}_{2}(k^{\prime})$ for some $\#k^{\prime}>5$ . Such a representation cannot be the mod- $p$ reduction of a representation with image isomorphic to some finite subgroup of $\mathrm{GL}_{2}(\mathbf{C})$ , and thus by [DS74], the corresponding mod- $p$ class does not lift to characteristic zero. (Explicit examples were first found by Mestre for $\#k=8$ and $N=1429$ .) A slightly different example can be given as follows. Suppose that $\Gamma=\Gamma_{1}(N)\cap\Gamma_{0}(x)$ . Consider a non-Eisenstein Hecke eigenclass in $H^{1}(X(\Gamma),\omega_{k})$ which is new of level $x$ . Then the restriction of $\overline{r}$ to $I_{x}$ is rank two unipotent. Such a class cannot lift to characteristic zero at minimal level, because otherwise (by [DS74] again) the corresponding representation $\rho$ would simultaneously have finite image and yet $\rho|I_{x}$ would be unipotent and hence infinite. Note that (unlike in the first example) it may well be possible to lift $\overline{r}$ to characteristic zero at some non-minimal level. Examples of the second kind have a natural analogue in the Siegel context.

Suppose that $\overline{r}$ has type U3 at $x$ . If $r$ is any minimal lift of $\overline{r}$ , the image of $I_{x}$ under $r$ will be rank three unipotent. This will also be true for the restriction of $r$ to any finite extension of $\mathbf{Q}_{x}$ . Yet, by a theorem of Grothendieck ([Gro72], Exp.9) the image of inertia of a semistable abelian variety is rank two unipotent, i.e., satisfies $(\sigma-1)^{2}=0$ . It follows that $r$ cannot contribute to a motive associated to an abelian variety. Conjecturally, Siegel modular eigenforms of weight $(2,2)$ should be associated to abelian varieties $M/\mathbf{Q}$ of dimension $2n$ equipped with an injection $E\rightarrow\mathrm{End}_{\mathbf{Q}}(M)\otimes\mathbf{Q}$ for some totally real field $E$ of degree $n$ . This suggests that such representations $\overline{r}$ do not admit minimal lifts to characteristic zero when $\sigma=(2,2)$ . It would be interesting to produce an explicit example of such a modular representation. Recall that there is an exceptional isomorphism $S_{6}\simeq\mathrm{GSp}_{4}(\mathbf{F}_{2})$ coming from identifying the Galois group of $\mathcal{A}_{2}[2]$ over $\mathcal{A}_{2}$ with either the symmetries of the $2$ -torsion points on the universal abelian surface or the action of $S_{6}$ on the (generically) $6$ Weierstrass points [BFvdG08]. The unipotent element $\sigma\in\mathrm{GSp}_{4}(\mathbf{F}_{2})$ such that $(\sigma-1)^{2}\neq 0$ has conjugacy class $(1,2,3,4)(5,6)\in S_{6}$ (this class is preserved by the exotic automorphism of $S_{6}$ ). In particular, if $K/\mathbf{Q}$ is a sextic field with Galois closure $G\subset S_{6}$ containing $(1,2,3,4)(5,6)$ and acting irreducibly on $\mathbf{F}^{4}_{2}$ , and $p$ is an odd prime such that $p=\mathfrak{p}^{4}\mathfrak{q}^{2}$ , then $\overline{r}:\mathrm{Gal}(K/\mathbf{Q})\simeq\mathrm{GSp}_{4}(\mathbf{F}_{2})$ should give rise to such a representation. Here is an explicit example coming from a slight variation of this argument.
Suppose that $A$ is the abelian surface corresponding to the Jacobian of the curve:

[TABLE]

which has good reduction outside $3\cdot 5\cdot 19$ . The representation $\overline{r}:G_{\mathbf{Q}}\rightarrow\mathrm{GSp}_{4}(\mathbf{F}_{2})$ has image $S_{5}\subset S_{6}$ , and the image of inertia at $5$ is conjugate to $(1,2,3,4)(5,6)$ . Hence $\overline{r}$ should give rise to a mod- $2$ torsion class with trivial level structure outside $3\cdot 5\cdot 19$ , and the following level structure at these primes:

(1)

Iwahori level structure at $p=5$ ,

(2)

Paramodular level structure at $p=3$ and $19$ .

Note that this conjectural torsion class should nonetheless lift to characteristic zero at some level, since one expects that $A$ is modular. (The conductor of $A$ is $3\cdot 5^{3}\cdot 19$ .)
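The claim that the class $(1,2,3,4)(5,6)$ satisfies $(\sigma-1)^{2}\neq 0$ can be checked by a short computation. The Python sketch below assumes the standard model of the $4$ -dimensional $\mathbf{F}_{2}$ -representation of $S_{6}$ : even-weight vectors in $\mathbf{F}_{2}^{6}$ modulo the all-ones vector, with $S_{6}$ permuting coordinates. This choice of model and all function names are illustrative assumptions, not taken from the paper. The sketch returns the smallest $k$ with $(\sigma-1)^{k}=0$ on this quotient.

```python
from itertools import product

N_POINTS = 6
ZERO = (0,) * N_POINTS
ALL_ONES = (1,) * N_POINTS  # spans the line we quotient by


def cycles_to_map(cycles):
    """Turn 1-indexed disjoint cycles into a 0-indexed map i -> sigma(i)."""
    mapping = list(range(N_POINTS))
    for cycle in cycles:
        for pos, point in enumerate(cycle):
            mapping[point - 1] = cycle[(pos + 1) % len(cycle)] - 1
    return mapping


def act(mapping, v):
    """Permute coordinates: the basis vector e_i is sent to e_{sigma(i)}."""
    out = [0] * N_POINTS
    for i, bit in enumerate(v):
        out[mapping[i]] = bit
    return tuple(out)


def sigma_minus_one(mapping, v):
    """Apply sigma - 1 (equal to sigma + 1 in characteristic 2)."""
    return tuple((x + y) % 2 for x, y in zip(act(mapping, v), v))


def nilpotency_index(cycles):
    """Smallest k with (sigma - 1)^k = 0 on even-weight vectors mod all-ones."""
    mapping = cycles_to_map(cycles)
    vectors = [v for v in product((0, 1), repeat=N_POINTS) if sum(v) % 2 == 0]
    k = 0
    # A vector is zero in the quotient exactly when it is 0 or the all-ones vector.
    while any(v not in (ZERO, ALL_ONES) for v in vectors):
        if k >= 4:  # the quotient is 4-dimensional, so a nilpotent map dies by step 4
            raise ValueError("sigma - 1 is not nilpotent: sigma is not unipotent")
        vectors = [sigma_minus_one(mapping, v) for v in vectors]
        k += 1
    return k


print(nilpotency_index([(1, 2, 3, 4), (5, 6)]))  # index for the class (1,2,3,4)(5,6)
print(nilpotency_index([(1, 2)]))                # index for a transposition
```

In this model the class $(1,2,3,4)(5,6)$ has index larger than $2$ , so $(\sigma-1)^{2}\neq 0$ , whereas any involution has index at most $2$ because $(\sigma-1)^{2}=\sigma^{2}-1$ in characteristic $2$ .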

Common to both examples is the non-existence of automorphic representations $\pi$ (associated to either classical modular forms of weight $1$ or Siegel modular forms of weight $(2,2)$ ) such that $\pi_{x}$ is the Steinberg representation. For classical modular forms, the non-existence of such $\pi$ follows from a consideration of the corresponding Galois representations, an argument which does not obviously generalize to the Siegel case (since one does not know how to attach an abelian variety to such a form). However, the following argument (due to Kevin Buzzard) generalizes nicely:

Theorem 5.9.

If $\pi$ is a cuspidal automorphic representation associated to a Siegel modular form of weight $(2,2)$ , then $\pi_{x}$ is not the Steinberg representation for any prime $x$ .

Proof.

In weights $(j,k)$ with $j\geq k\geq 2$ , the corresponding Frobenius eigenvalues of the Weil–Deligne representation associated to a Steinberg representation $\pi_{x}$ are

[TABLE]

where $w=j+k-3$ . Moreover, the corresponding eigenvalue of $U_{x,1}$ is $x^{(w-3)/2}$ . In particular, if $j=k=2$ , then $w=1$ and the corresponding eigenvalue of $U_{x,1}$ is $x^{-1}$ , contradicting the integrality of Hecke eigenvalues (which is a consequence of the integrality of the $q$ -expansion). ∎
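The exponent bookkeeping in this proof is easy to check by machine. The following Python sketch (function names are hypothetical and not from the paper) evaluates $w=j+k-3$ and the exponent $(w-3)/2$ exactly, and treats the eigenvalue of $U_{x,1}$ as the rational number $x^{(w-3)/2}$ whenever that exponent is an integer.

```python
from fractions import Fraction


def steinberg_u1_exponent(j: int, k: int) -> Fraction:
    """Exponent e with U_{x,1} acting by x**e on a Steinberg pi_x in weight (j, k)."""
    if not (j >= k >= 2):
        raise ValueError("expected j >= k >= 2")
    w = j + k - 3              # the weight w used in the proof
    return Fraction(w - 3, 2)  # the exponent (w - 3)/2 from the proof


def u1_eigenvalue(x: int, j: int, k: int) -> Fraction:
    """x**((w-3)/2) as an exact rational; only handled when the exponent is an integer."""
    e = steinberg_u1_exponent(j, k)
    if e.denominator != 1:
        raise ValueError("half-integral exponent: eigenvalue is not rational")
    return Fraction(x) ** int(e)


def is_integral(value: Fraction) -> bool:
    """True when the rational number is an ordinary integer."""
    return value.denominator == 1


print(steinberg_u1_exponent(2, 2))        # exponent in weight (2, 2)
print(is_integral(u1_eigenvalue(5, 2, 2)))  # integrality at x = 5 in weight (2, 2)
```

In weight $(2,2)$ the exponent is $-1$ , so the would-be eigenvalue $x^{-1}$ fails the integrality test, which is exactly the contradiction used in the proof.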

5.5. Hecke operators

For simplicity, we denote the schemes $X_{K}$ and $X_{K_{i}(Q)}$ of § 5.2 by $X$ and $X_{i}(Q)$ respectively. Let $M$ denote an $\mathcal{O}$ -module.

Let $x$ be a rational prime. We define matrices

[TABLE]

and regard them as elements of $\mathrm{GSp}_{4}(\mathbf{Q}_{x})$ . If $x\not\in S$ (resp. $x\not\in S\cup Q$ ) we will consider the Hecke operators $T_{x,i}=[K\beta_{x,i}K]$ (resp. $T_{x,i}=[K_{i}(Q)\beta_{x,i}K_{i}(Q)]$ ) acting on each of the spaces

[TABLE]

as in [SU06, §1.1.6] or [Til12, §8]. We also denote $T_{x,0}$ by $S_{x}$ . The definition of Hecke operators given in [SU06] or [Til12] applies when $x\neq p$ or when $p$ is invertible on $M$ . The remaining case, when $x=p$ , requires more care. In Lemma 8.8 below we show that $T_{p,1}$ and $Q_{p,2}:=(pT_{p,2}+(p+p^{3})S_{p})p^{2-b}$ exist as operators in cohomological degree $n=0$ over $M=K/\mathcal{O}$ .

Similarly, if $x\in Q$ , we have operators $U_{x,i}=[K_{i}(Q)\beta_{x,i}K_{i}(Q)]$ on $H^{n}(X_{i}(Q),\omega(a,b)_{M})$ . As in § 5.2, the map $X_{1}(Q)\to X_{0}(Q)$ is Galois with Galois group $\Delta_{Q}:=\prod_{x\in Q}(\mathbf{Z}/x)^{\times}$ . This gives rise to an action of $\Delta_{Q}$ on $H^{n}(X_{1}(Q),\omega(a,b)_{M})$ . For each $u\in\Delta_{Q}$ , we denote the corresponding operator on $H^{n}(X_{1}(Q),\omega(a,b)_{M})$ by $\langle u\rangle$ .

Finally, we shall also exploit Hecke operators of a slightly different flavour, which we denote by $U_{p,1}$ and $U_{p,2}$ respectively. In the context of this paper, they may be considered formal operators on $q$ -expansions. (They can also be interpreted more classically as Hecke operators with level structure at $p$ .) Their key property is that the operators $T_{p,1}$ and $T_{p,2}/p^{k+j-6}$ act by $U_{p,1}$ and $U_{p,2}$ for large enough weights, including $(j,k)$ plus any non-trivial multiple of $(p-1,p-1)$ for $j\geq k\geq 2$ . Their explicit definition is given in Lemmas 8.3 and 8.4.

Remark 5.10.

We note that our definition of the Hecke action is the ‘natural’ one twisted by $\nu^{-3}$ (see [SU06, 1.1.6a]). We saw in the proof of Theorem 5.1, that for the natural action, there is an isomorphism $\omega(a,b)\cong\mathcal{W}_{(a,b;-a-b)}$ , and hence over $\mathbf{C}$ , an isomorphism $\omega(a,b)\cong\mathcal{V}_{(-b,-a;-a-b)}$ . Under our normalization of the Hecke action on $\omega(a,b)$ , we therefore have $\omega(a,b)\cong\mathcal{W}_{\mu}$ and, over $\mathbf{C}$ , $\omega(a,b)\cong\mathcal{V}_{\sigma}$ where we take:

[TABLE]

Remark 5.11.

In view of the previous remark, we will identify the set $\mathbf{Z}^{2,+}:=\{(a,b)\in\mathbf{Z}^{2}:a\geq b\}$ with the subset $(a,b;3-a-b)$ of $X^{*}(T)^{+}_{M}$ . Thus it makes sense to speak of $\mu=(a,b)\in X^{*}(T)^{+}_{M}$ .

Remark 5.12.

Let $\mu=(a,b)\in X^{*}(T)^{+}_{M}$ and let $w=a+b-6$ . For $x\in\mathbf{Z}$ , we can similarly define a Hecke operator associated to $[K\operatorname{diag}(x,x,x,x)K]$ on the cohomology of $\omega(a,b)$ : this operator acts as $x^{w}=x^{a+b-6}$ . Now, suppose that $\pi=\pi^{\infty}\otimes\pi_{\infty}$ in $\mathcal{A}_{(2)}(G)$ contributes to

[TABLE]

where $\sigma=(-b,-a;-w)$ . It follows that the central character of $\pi_{\infty}$ is given by:

[TABLE]

Furthermore, by Theorem 5.6, the transfer of $\pi_{\infty}$ to $\mathrm{GL}_{4}(\mathbf{R})$ has infinitesimal character $\chi_{\tau}$ where

[TABLE]

We now introduce some Hecke algebras. We note that in the following definition, we work over $K/\mathcal{O}$ rather than $\mathcal{O}$ .

Definition 5.13.

Let $\mu=(a,b)\in X^{*}(T)^{+}_{M}$ with $a\geq b\geq 2$ .

(1)

The anaemic Hecke algebra

[TABLE]

is the $\mathcal{O}$ -algebra generated by the operators $T_{x,i}$ for $x\not\in S\cup Q\cup\{p\}$ .

(2)

Similarly, we let $\mathbf{T}_{\mu}(Q)$ be the algebra generated over $\mathbf{T}^{\mathrm{an}}_{\mu}(Q)$ by the operators $U_{x,i}$ for $x\in Q$ and $\langle u\rangle$ for $u\in\Delta_{Q}$ . When $Q=\emptyset$ , we have $\mathbf{T}^{\mathrm{an}}_{\mu}(\emptyset)=\mathbf{T}_{\mu}(\emptyset)$ and we denote this algebra by $\mathbf{T}_{\mu}$ .

(3)

Finally, $\widetilde{\mathbf{T}}_{\mu}(Q)$ denotes the $\mathbf{T}_{\mu}(Q)$ -algebra generated by the operators $T_{p,1}$ and $Q_{p,2}=(pT_{p,2}+(p+p^{3})S_{p})p^{2-b}$ . (The existence of these operators is established in Lemma 8.8.) If $Q=\emptyset$ , then we denote $\widetilde{\mathbf{T}}_{\mu}(\emptyset)$ by $\widetilde{\mathbf{T}}_{\mu}$ .

Note that the algebras $\mathbf{T}^{\mathrm{an}}_{\mu}(Q)\subset\mathbf{T}_{\mu}(Q)\subset\widetilde{\mathbf{T}}_{\mu}(Q)$ preserve the subspace

[TABLE]

We will also need to consider ordinary Hecke algebras. Let $e=\varinjlim_{n}(T_{p,1}Q_{p,2})^{n!}$ denote the ordinary idempotent associated to the Hecke operators $T_{p,1}$ and $Q_{p,2}$ . (We will only consider this operator in contexts where the direct limit makes sense.) We define:

[TABLE]

for $M=\mathcal{O},\mathcal{O}/\varpi^{m}$ or $M=K/\mathcal{O}$ . We thus have:

[TABLE]

for such $M$ .
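To illustrate why the limit defining $e$ stabilises, here is a minimal Python sketch in the simplest possible setting: a scalar (or diagonal) operator acting on $\mathbf{Z}/p^{m}$ . This toy model is an assumption made for illustration; the actual operator $T_{p,1}Q_{p,2}$ acts on cohomology groups and need not be diagonalisable. The sketch uses the fact that $n!$ is eventually divisible by the order $p^{m-1}(p-1)$ of $(\mathbf{Z}/p^{m})^{\times}$ and eventually exceeds $m$ .

```python
from math import factorial


def ordinary_idempotent_scalar(u: int, p: int, m: int) -> int:
    """Stable value of u**(n!) modulo p**m for all sufficiently large n.

    Taking n = p**m is enough: n! is then divisible by p**(m-1) * (p-1),
    the order of the unit group, and n! >= m, which kills multiples of p.
    """
    modulus = p ** m
    return pow(u, factorial(modulus), modulus)


def ordinary_projector_diagonal(eigenvalues, p: int, m: int):
    """Apply the idempotent to a diagonal operator, one eigenvalue at a time."""
    return [ordinary_idempotent_scalar(u, p, m) for u in eigenvalues]


# Units survive (value 1); eigenvalues divisible by p are cut away (value 0).
print(ordinary_projector_diagonal([2, 10, 24, 0], p=5, m=2))  # → [1, 0, 1, 0]
```

In this model $e$ is the projector onto the part where the operator acts by a $p$ -adic unit, which is the sense in which $e$ picks out the ordinary part.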

Definition 5.14.

Let $\mu=(a,b)$ with $a\geq b\geq 2$ . We define the ordinary Hecke algebras $\mathbf{T}^{\mathrm{an}}_{\mu}(Q)^{\mathrm{ord}}$ (resp. $\mathbf{T}_{\mu}(Q)^{\mathrm{ord}}$ , $\widetilde{\mathbf{T}}_{\mu}(Q)^{\mathrm{ord}}$ ) to be the image of $\mathbf{T}^{\mathrm{an}}_{\mu}(Q)$ (resp. $\mathbf{T}_{\mu}(Q)$ , $\widetilde{\mathbf{T}}_{\mu}(Q)$ ) in

[TABLE]

6. Galois representations associated to modular forms

As in Section 5.2, let $S$ and $Q$ be finite sets of primes of $\mathbf{Q}$ which are disjoint and do not contain $p$ . We allow the possibility that $Q=\emptyset$ . We let $K$ and $K_{i}(Q)$ be open compact subgroups of $\mathrm{GSp}_{4}(\mathbb{A}^{\infty})$ as in Section 5.2, and we let $X=X_{K}$ and $X_{i}(Q)=X_{K_{i}(Q)}$ be the corresponding Siegel threefolds, defined over $\mathcal{O}$ .

6.1. The Hasse invariant

We begin with a definition.

Definition 6.1.

Let $h\in H^{0}(X,\omega^{p-1}_{k})$ be the Hasse invariant and let $A\in H^{0}(X,\omega^{r(p-1)})$ be a lift of $h^{r}$ , for some $r>0$ which we fix for the rest of this section.

The existence of such a lift $A$ follows from the Koecher principle and the ampleness of $\omega$ on the minimal compactification of $X$ .

Lemma 6.2.

Let $\mu=(a,b)\in X^{*}(T)_{M}^{+}$ with $a\geq b\geq 2$ . Then:

(1)

Multiplication by $h$ defines an injection:

[TABLE]

which is equivariant for the Hecke operators $T_{x,i}$ for each $x\not\in S\cup Q\cup\{p\}$ and the operators $U_{x,i}$ for $x\in Q$ .

(2)

If $b\geq 3$ , then this map is also equivariant for the operators $T_{p,1}$ and $Q_{p,2}$ .

Proof.

It is well-known that multiplication by $h$ is injective and commutes with Hecke operators away from $p$ . We may thus assume that $b\geq 3$ . It is shown in [Pil12b, §A.3] and [Til12, Lemme 8.7] that multiplication by $h$ commutes with the operators $U_{p,1}$ and $U_{p,2}$ . Since $b\geq 3$ , [Til12, Lemme 8.5] implies that $T_{p,1}\equiv U_{p,1}\mod p$ and $p^{3-b}T_{p,2}\equiv U_{p,2}\mod p$ . It follows that $T_{p,1}$ and $Q_{p,2}=p^{3-b}T_{p,2}+(1+p^{2})p^{3-b}S_{p}$ also commute with $h$ . ∎
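The expression for $Q_{p,2}$ used in this proof is a rearrangement of the defining formula $Q_{p,2}=(pT_{p,2}+(p+p^{3})S_{p})p^{2-b}$ . As a sanity check, the following Python sketch compares the two forms with $T_{p,2}$ and $S_{p}$ replaced by arbitrary rational scalars. The scalar stand-ins and function names are assumptions for illustration only; the real objects are operators.

```python
from fractions import Fraction


def q_p2_definition(p: int, b: int, t2: Fraction, s: Fraction) -> Fraction:
    """(p*T_{p,2} + (p + p^3)*S_p) * p^(2-b), with scalars in place of operators."""
    return (p * t2 + (p + p ** 3) * s) * Fraction(p) ** (2 - b)


def q_p2_lemma_form(p: int, b: int, t2: Fraction, s: Fraction) -> Fraction:
    """p^(3-b)*T_{p,2} + (1 + p^2)*p^(3-b)*S_p, the form used in the proof."""
    scale = Fraction(p) ** (3 - b)
    return scale * t2 + (1 + p ** 2) * scale * s


def forms_agree(primes=(3, 5, 7), weights=range(2, 7)) -> bool:
    """Compare both forms on a small grid of sample values."""
    samples = [Fraction(0), Fraction(1), Fraction(-2), Fraction(7, 3)]
    return all(
        q_p2_definition(p, b, t2, s) == q_p2_lemma_form(p, b, t2, s)
        for p in primes
        for b in weights
        for t2 in samples
        for s in samples
    )


print(forms_agree())  # → True
```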

Suppose that $\mu=(a,b)\in X^{*}(T)_{M}^{+}$ with $a\geq b\geq 2$ . By the proof of [Pil12b, Théorème 6.2], there exists an integer $N(\mu)$ as in the following definition.

Definition 6.3.

Let $N(\mu)$ be an integer such that for all $t\geq N(\mu)$ , $i>0$ , and $Z\in\{X,X_{0}(Q),X_{1}(Q)\}$ , the cohomology group

[TABLE]

vanishes.

Note that for such $t\geq N(\mu)$ , the maps

[TABLE]

are both surjective. The same is true over $X_{0}(Q)$ and $X_{1}(Q)$ .

Lemma 6.4.

Let $\mu=(a,b)\in X^{*}(T)_{M}^{+}$ with $a\geq b\geq 2$ and let $m>0$ . There exists an integer $s>0$ such that, if we set $t=rs(p-1)$ , then:

(1)

$t\geq N(\mu)$ , and

(2)

multiplication by $A^{s}$ defines an injection

[TABLE]

which is equivariant for the Hecke operators $T_{x,i}$ for each $x\not\in S\cup Q\cup\{p\}$ and the operators $U_{x,i}$ for each $x\in Q$ .

Proof.

The second property holds as long as $p^{m-1}|s$ (see [Gol14, Theorem 6.2.1]), so it suffices to take $s$ equal to any integer greater than $N(\mu)/r(p-1)$ and divisible by $p^{m-1}$ . ∎
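The choice of $s$ in this proof is completely explicit once $N(\mu)$ is known. A minimal Python sketch, assuming $N(\mu)$ is supplied as an integer (the paper only asserts its existence) and using a hypothetical function name, picks the smallest positive multiple of $p^{m-1}$ that is strictly greater than $N(\mu)/r(p-1)$ :

```python
def choose_s(n_mu: int, r: int, p: int, m: int) -> int:
    """Smallest s > 0 with p**(m-1) | s and r*s*(p-1) > n_mu.

    n_mu is a stand-in for N(mu); r, p, m are as in Lemma 6.4.
    """
    if r <= 0 or p < 2 or m <= 0:
        raise ValueError("expected r > 0, p >= 2 and m > 0")
    step = p ** (m - 1)          # s must be a multiple of this
    unit = r * (p - 1) * step    # growth of t = r*s*(p-1) per multiple of step
    k = max(1, n_mu // unit + 1)  # first multiple that strictly exceeds the bound
    return k * step


s = choose_s(n_mu=100, r=1, p=5, m=2)
t = 1 * s * (5 - 1)
print(s, t)  # → 30 120
```

Any larger multiple of $p^{m-1}$ works equally well; minimality is not needed for the lemma.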

Let $\mu=(a,b)\in X^{*}(T)_{M}^{+}$ with $a\geq b\geq 2$ . Recall that the Hecke algebras

[TABLE]

were defined in Definition 5.13.

Remark 6.5.

For $\mu=(a,b)$ with $a\geq b\geq 2$ and each $m>0$ , we have

[TABLE]

Let $I_{\mu,m}$ (resp. $\widetilde{I}_{\mu,m}$ ) denote the annihilator of the former space in $\mathbf{T}_{\mu}(Q)$ (resp. $\widetilde{\mathbf{T}}_{\mu}(Q)$ ). If $s$ and $t$ are as in Lemma 6.4, then multiplication by $A^{s}$ induces a surjective map:

[TABLE]

where $\mu^{\prime}=\mu+(t,t)$ . In particular, any maximal ideal $\mathfrak{m}$ of $\mathbf{T}_{\mu}(Q)$ pulls back under this map to a maximal ideal of $\mathbf{T}_{\mu^{\prime}}(Q)$ which we will also denote by $\mathfrak{m}$ .

Similarly, Lemma 6.2 induces a map

[TABLE]

where $\mu^{\prime}=\mu+(p-1,p-1)$ and, if $b\geq 3$ , this extends to a map

[TABLE]

6.2. Preliminaries on Galois representations

We now turn our attention to Galois representations.

Proposition 6.6.

Let $\mu=(a,b)\in X^{*}(T)_{M}^{+}$ and let $w=a+b-6$ . There is a continuous character

[TABLE]

such that:

(1)

$\chi_{\mu}|G_{\mathbf{Q}_{p}}$ is crystalline with Hodge–Tate weight $w$ ;

(2)

for all $x\not\in S\cup Q\cup\{p\}$ , $\chi_{\mu}$ is unramified at $x$ and $\chi_{\mu}(\mathrm{Frob}_{x})=S_{x}$ .

In particular,

[TABLE]

for some finite order character $\chi_{\mu,0}:G_{\mathbf{Q}}\to\widetilde{\mathbf{T}}_{\mu}(Q)^{\times}$ .

Proof.

This follows from the proof of [Tay91, Proposition 4], noting that we have twisted the Hecke action by $\nu^{-3}$ (see Remark 5.12). ∎

Definition 6.7.

For a prime $x$ , we introduce the Hecke polynomial:

[TABLE]

If a modular form $f$ is an eigenform for a collection of Hecke operators $T$ , we denote by $\lambda_{f}$ the map such that $Tf=\lambda_{f}(T)f$ for each $T$ . In particular, if $f$ is an eigenform for the operators $T_{x,i}$ at $x$ , then we can specialize the polynomial $Q_{x}(T)$ at $f$ to get $\lambda_{f}(Q_{x}(T))$ .

Proposition 6.8.

Let $\mu=(a,b)\in X^{*}(T)_{M}^{+}$ with $a\geq b\geq 3$ . Let $w=a+b-6$ and $\mathbf{w}=w+3=a+b-3$ . Let

[TABLE]

be a cuspidal eigenform for the operators $T_{x,i}$ for all $x\not\in Q\cup S$ and $i=0,1,2$ . Then there is a continuous semisimple representation

[TABLE]

defined over a finite extension $K^{\prime}/K$ such that:

(1)

The similitude character $\nu\circ r_{f}$ is given by

[TABLE]

(2)

$r_{f}$ is unramified at primes $x\not\in Q\cup S\cup\{p\}$ , and at such primes, the characteristic polynomial of $r_{f}(\mathrm{Frob}_{x})$ is given by:

[TABLE]

(3)

The restriction $r_{f}|G_{\mathbf{Q}_{p}}$ is crystalline with Hodge–Tate weights $\mathbf{w},(a-1),(b-2),0$ . If, in addition, $f$ is an eigenform for the Hecke operators at $p$ , then the characteristic polynomial of $\Phi$ on $D_{\mathrm{cris}}(r_{f}|G_{\mathbf{Q}_{p}})$ is $\lambda_{f}(Q_{p}(X))$ .

(4)

Suppose $f$ is ordinary in the sense that it is an eigenform for $T_{p,1}$ and $Q_{p,2}$ with eigenvalues being $p$ -adic units. Then $Q_{p}(X)$ has distinct roots $\alpha_{p},\beta_{p},\gamma_{p},\delta_{p}$ with $p$ -adic valuations $0,b-2,a-1,\mathbf{w}$ , respectively. Furthermore, $r_{f}|G_{\mathbf{Q}_{p}}$ is conjugate in $\mathrm{GSp}_{4}(K^{\prime})$ to a representation of the form

[TABLE]

(5)

If $r_{f}$ is absolutely irreducible, then it satisfies local-global compatibility at all primes.

Proof.

The existence of $r_{f}$ follows from the work of Taylor, Laumon and Weissauer. Some of the finer properties are due to Urban, Genestier–Tilouine, Gan–Takeda, Sorensen and Mok. Fix an embedding $\imath:K\hookrightarrow\mathbf{C}$ and let $\pi$ be a cuspidal automorphic representation of $\mathrm{GSp}_{4}(\mathbb{A}_{\mathbf{Q}})$ which contributes to the $f$ -part of $H^{0}(X_{1}(Q),\omega(a,b)(-\infty)_{\mathbf{C}})$ under the isomorphism of the first part of Theorem 5.2 (with $\sigma=(-b,-a;6-a-b)$ , as in Remark 5.10).

We take $r_{f}:G_{\mathbf{Q}}\to\mathrm{GL}_{4}(\overline{K})$ to be the representation $R_{p}$ of [Mok14, Theorem 3.5] associated to $\pi$ . When $\pi$ is simple, generic in the terminology of [Mok14], the representation can be conjugated to take values in $\mathrm{GSp}_{4}(\overline{K})$ , by the main theorem of [BC09]. In the remaining cases, the representation $R_{p}$ is reducible and can easily be seen to be symplectic. The usual Baire category argument implies that $r_{f}$ can be defined over a finite extension of $K$ . Thus in all cases, we may take $r_{f}:G_{\mathbf{Q}}\to\mathrm{GSp}_{4}(K^{\prime})$ . Parts (1)–(5) follow from the statement of [Mok14, Theorem 3.5]. ∎
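The Hodge–Tate weights in part (3) of Proposition 6.8 can be tabulated directly from $(a,b)$ . The Python sketch below (hypothetical function names) lists them in the order $0,b-2,a-1,\mathbf{w}$ , checks that they pair up to $\mathbf{w}=a+b-3$ as one expects for a symplectic representation with similitude weight $\mathbf{w}$ , and tests whether they are distinct. Evaluating the same formula at $b=2$ is an extrapolation beyond the hypothesis $b\geq 3$ of the proposition and is included only to illustrate how non-regularity would show up.

```python
def hodge_tate_weights(a: int, b: int):
    """Weights 0, b-2, a-1, a+b-3 in increasing order, for a >= b >= 2."""
    if not (a >= b >= 2):
        raise ValueError("expected a >= b >= 2")
    w_bold = a + b - 3  # the bold w of Proposition 6.8
    return [0, b - 2, a - 1, w_bold]


def pairs_to_similitude_weight(a: int, b: int) -> bool:
    """Check that outer and inner weights both sum to a + b - 3."""
    w0, w1, w2, w3 = hodge_tate_weights(a, b)
    return w0 + w3 == w1 + w2 == a + b - 3


def is_regular(a: int, b: int) -> bool:
    """True when the four weights are pairwise distinct."""
    return len(set(hodge_tate_weights(a, b))) == 4


print(hodge_tate_weights(3, 3), is_regular(3, 3))  # → [0, 1, 2, 3] True
print(hodge_tate_weights(2, 2), is_regular(2, 2))  # → [0, 0, 1, 1] False
```

For $a\geq b\geq 2$ the weights are distinct exactly when $b\geq 3$ , which matches the hypothesis under which part (4) asserts distinct $p$ -adic valuations.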

Lemma 6.9.

Let $\mu=(a,b)\in X^{*}(T)_{M}^{+}$ with $a\geq b\geq 2$ and let $\mathfrak{m}$ be a maximal ideal of $\mathbf{T}^{\mathrm{an}}_{\mu}(Q)$ . Then there is a continuous semisimple representation

[TABLE]

such that for each $x\not\in S\cup Q\cup\{p\}$ , the restriction $\overline{r}_{\mathfrak{m}}|G_{\mathbf{Q}_{x}}$ is unramified and $\overline{r}_{\mathfrak{m}}(\mathrm{Frob}_{x})$ has characteristic polynomial $Q_{x}(X)$ .

If $\overline{r}_{\mathfrak{m}}$ is absolutely irreducible, then the representation $\overline{r}_{\mathfrak{m}}$ preserves a symplectic pairing and hence, after conjugation, we have a representation:

[TABLE]

Proof.

Choose an integer $s$ as in Lemma 6.4 with $m$ taken to equal $1$ and let $t=rs(p-1)$ . Let $f\in H^{0}(X_{1}(Q),\omega(a+t,b+t)(-\infty))\otimes\overline{K}$ be an eigenform for $\widetilde{\mathbf{T}}_{\mu^{\prime}}(Q)_{\mathfrak{m}}$ . Let $r_{f}$ be the Galois representation associated to $f$ by Proposition 6.8 and take $\overline{r}_{\mathfrak{m}}$ to be the semisimplification of a reduction of $r_{f}$ to characteristic $p$ . The resulting representation is defined over the algebraic closure of $\mathbf{T}^{\mathrm{an}}_{\mu}(Q)/\mathfrak{m}$ , but by the argument of [CHT08, Prop. 3.4.2], we see that after conjugation, it may be defined over $\mathbf{T}^{\mathrm{an}}_{\mu}(Q)/\mathfrak{m}$ .

For the last part: let $\pi$ be the transfer to $\mathrm{GL}_{4}$ (given by [Art04]) of the automorphic representation generated by $f$. Then $\pi$ descends to an automorphic representation $\Pi$ of a unitary group over $\mathbf{Q}$. The family of $\ell$-adic Galois representations associated to $\Pi$ is the same as that associated to $f$. Thus, [BC11, Theorem 1.2] and the fact that $\overline{r}_{\mathfrak{m}}$ is absolutely irreducible imply that $r_{f}$ is symplectic. The same is then true of $\overline{r}_{\mathfrak{m}}$ (by absolute irreducibility). ∎

Remark 6.10.

By the same argument, the previous result holds if we replace $\mathbf{T}^{\mathrm{an}}_{\mu}(Q)$ by $\mathbf{T}_{\mu}(Q)$ or $\widetilde{\mathbf{T}}_{\mu}(Q)$ .

Definition 6.11.

We say that $\mathfrak{m}$ is non-Eisenstein if the representation $\overline{r}_{\mathfrak{m}}$ is absolutely irreducible.

6.3. Galois representations in cohomological weights

Let $\overline{r}:G_{\mathbf{Q}}\to\mathrm{GSp}_{4}(k)$ be a representation as in Section 4. By Assumption 4.2 and by Cebotarev, there exist infinitely many primes $q$ such that no pair of eigenvalues of $\overline{r}(\mathrm{Frob}_{q})$ have ratio $q\mod p$ and $q\not\equiv 1\mod p$. Choose any such $q$ which is distinct from $p$ and from all primes of bad reduction of $\overline{r}$. We take $S=S(\overline{r})\cup\{q\}$ and $Q$ a possibly empty set of primes disjoint from $S\cup\{p\}$. We define a compact open subgroup $K=\prod_{x}K_{x}$ of $\mathrm{GSp}_{4}(\mathbb{A}^{\infty})$ as follows:

(1)

If $x=p$ or $\overline{r}$ is unramified at $x$ and $x\neq q$, then $K_{x}=\mathrm{GSp}_{4}(\mathbf{Z}_{x})$.

(2)

If $x$ is of type U3, then $K_{x}=I(x)$, where $I(x)$ is the Iwahori subgroup.

(3)

If $x$ is of type U2, then $K_{x}=\Pi(x)$, where $\Pi(x)$ is the Klingen parahoric.

(4)

If $x$ is of type U1, then $K_{x}=K(x)$, where $K(x)$ is the paramodular group at $x$.

(5)

If $x$ is of type P, then $K_{x}=\Pi(x)^{+}$ (and $x-1$ is prime to $p$).

(6)

If $x$ is of type H, then $K_{x}$ is the full congruence subgroup of level $x$.

(7)

If $x=q$, then $K_{x}$ is the full congruence subgroup of level $x$.

We then let $X=X_{K}$ and $X_{i}(Q)=X_{K_{i}(Q)}$ as in Section 5.2.
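The two conditions imposed on the auxiliary prime $q$ are what make $H^{0}(\mathbf{Q}_{q},\mathrm{ad}^{0}(\overline{r})(1))$ vanish, a fact used in the proof of Theorem 6.13. The following is a minimal sketch of that eigenvalue count. The names $\lambda_{1},\dots,\lambda_{4}$ for the eigenvalues of $\overline{r}(\mathrm{Frob}_{q})$ over $\overline{k}$ are our own notation, and the normalization of Frobenius is left unspecified, which is harmless since the conditions are symmetric under $q\mapsto q^{-1}$.

```latex
% Requires amsmath. Notation (ours, not the paper's): \lambda_1,...,\lambda_4 are
% the eigenvalues of \overline{r}(\mathrm{Frob}_q) in an algebraic closure of k.
\begin{align*}
\text{eigenvalues of } \mathrm{Frob}_q \text{ on } \mathrm{ad}(\overline{r})(1)
  &\;=\; \{\, q\,\lambda_i\lambda_j^{-1} \;:\; 1\le i,j\le 4 \,\}, \\
q\,\lambda_i\lambda_j^{-1} = 1 \ \text{ with } i\neq j
  &\iff \lambda_j/\lambda_i = q \ \text{ in } \overline{k}, \\
q\,\lambda_i\lambda_i^{-1} = 1
  &\iff q \equiv 1 \bmod p .
\end{align*}
% Both possibilities are excluded by the choice of q. Hence 1 is not an
% eigenvalue of Frob_q on ad(\overline{r})(1), nor on its submodule
% ad^0(\overline{r})(1), so this unramified module has no subquotient
% isomorphic to k and H^0(\mathbf{Q}_q, \mathrm{ad}^0(\overline{r})(1)) = 0.
```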

Let $\mu=(a,b)\in X^{*}(T)_{M}^{+}$ with $a\geq b\geq 3$ be a regular weight and let $\mathfrak{m}_{\emptyset}$ be a maximal ideal of $\mathbf{T}_{\mu}^{\mathrm{ord}}$ (the ordinary Hecke algebra with $Q=\emptyset$ ) with residue field $k$ . Then $\mathfrak{m}_{\emptyset}$ pulls back to an ideal of $\mathbf{T}^{\mathrm{an}}_{\mu}(Q)^{\mathrm{ord}}$ which in turn pushes forward to an ideal of $\mathbf{T}_{\mu}(Q)^{\mathrm{ord}}$ . We denote both of these ideals by $\mathfrak{m}_{\emptyset}$ , in a slight abuse of notation. The ideal $\mathfrak{m}_{\emptyset}\subset\mathbf{T}^{\mathrm{an}}_{\mu}(Q)^{\mathrm{ord}}$ is maximal but $\mathfrak{m}_{\emptyset}\subset\mathbf{T}_{\mu}(Q)^{\mathrm{ord}}$ need not be maximal – there may be multiple maximal ideals $\mathfrak{m}$ of $\mathbf{T}_{\mu}(Q)^{\mathrm{ord}}$ that contain it. We make the following assumption:

Assumption 6.12.

Let $\overline{r}$ , $\mu$ and $\mathfrak{m}_{\emptyset}$ be as above. Then:

(1)

We have $\overline{r}_{\mathfrak{m}_{\emptyset}}\cong\overline{r}$. In particular, since $\overline{r}$ is absolutely irreducible, $\mathfrak{m}_{\emptyset}$ is non-Eisenstein.

(2)

For each $x\in Q$, $x\equiv 1\mod p$ and $\overline{r}|G_{x}$ is a direct sum of four pairwise distinct characters with Frobenius eigenvalues $\alpha_{x},\beta_{x},\gamma_{x},\delta_{x}$. We assume the eigenvalues have been labeled so that the plane $\lambda(\alpha_{x})\oplus\lambda(\beta_{x})$ is isotropic, and hence $\alpha_{x}\delta_{x}=\beta_{x}\gamma_{x}$.

We let $\mathfrak{m}\subset\mathbf{T}_{\mu}(Q)^{\mathrm{ord}}$ be any maximal ideal which contains $\mathfrak{m}_{\emptyset}$ . The representations $\overline{r}_{\mathfrak{m}}$ , $\overline{r}_{\mathfrak{m}_{\emptyset}}$ and $\overline{r}$ are all isomorphic.
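The relation $\alpha_{x}\delta_{x}=\beta_{x}\gamma_{x}$ in Assumption 6.12(2) can be checked directly from the similitude property. The sketch below uses our own notation: $v_{\lambda}$ is an eigenvector of $g=\overline{r}(\mathrm{Frob}_{x})$ with eigenvalue $\lambda$, $\langle\,,\rangle$ is the symplectic pairing, and $\nu=\nu(g)$ is the similitude factor. We also assume that the labels $\gamma_{x},\delta_{x}$ are chosen so that $\delta_{x}$ is the eigenvalue paired with $\alpha_{x}$.

```latex
% Requires amsmath. Subscripts x are dropped; v_\lambda, \nu are our notation.
\begin{align*}
\nu\,\langle v_\lambda, v_{\lambda'}\rangle
  \;=\; \langle g v_\lambda, g v_{\lambda'}\rangle
  \;=\; \lambda\lambda'\,\langle v_\lambda, v_{\lambda'}\rangle
  \quad\Longrightarrow\quad
  \bigl(\langle v_\lambda, v_{\lambda'}\rangle \neq 0 \ \Rightarrow\ \lambda\lambda' = \nu\bigr).
\end{align*}
% 1. The pairing is alternating and the plane <v_\alpha, v_\beta> is isotropic,
%    so v_\alpha is orthogonal to v_\alpha and v_\beta. By non-degeneracy it
%    pairs non-trivially with v_\gamma or v_\delta, and not with both, since
%    that would force \gamma = \nu/\alpha = \delta.
% 2. The same holds for v_\beta, and its partner differs from that of v_\alpha,
%    since a common partner \lambda' would force \alpha = \nu/\lambda' = \beta.
% 3. Naming the partner of \alpha as \delta gives
%    \alpha\delta = \nu = \beta\gamma.
```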

We now turn to the prime $p$ . Let $\alpha,\beta\in k^{\times}$ be the elements associated to $\overline{r}|G_{\mathbf{Q}_{p}}$ at the beginning of Section 4. For $M=\mathcal{O},\mathcal{O}/\varpi^{m}$ or $K/\mathcal{O}$ , we define:

•

$H^{0}(X_{1}(Q),\omega(a,b)(-\infty)_{M})^{\beta}$ to be the subspace of $H^{0}(X_{1}(Q),\omega(a,b)(-\infty)_{M})$ given by the image of the idempotent $e_{\beta}=\varinjlim_{n}((T_{p,1}-\tilde{\beta})(Q_{p,2}-\tilde{\alpha}\tilde{\beta}))^{n!}$ , where $\tilde{\alpha}$ and $\tilde{\beta}$ are lifts of $\alpha$ and $\beta$ to $\mathcal{O}$ .

•

$\mathbf{T}^{\mathrm{an}}_{\mu}(Q)^{\beta}$ (resp. $\mathbf{T}_{\mu}(Q)^{\beta}$ , $\widetilde{\mathbf{T}}_{\mu}(Q)^{\beta}$ ) to be the image of $\mathbf{T}^{\mathrm{an}}_{\mu}(Q)$ (resp. $\mathbf{T}_{\mu}(Q)$ , $\widetilde{\mathbf{T}}_{\mu}(Q)$ ) in

[TABLE]

We also make the analogous definitions with $\alpha$ and $\beta$ swapping roles.

Theorem 6.13.

Let $\mu=(a,b)$ , $\mathfrak{m}_{\emptyset}$ and $\mathfrak{m}$ be as above, and suppose that Assumption 6.12 holds. Let $w=a+b-6$ and $\mathbf{w}=w+3=a+b-3$ . Then there exists a continuous representation

[TABLE]

lifting $\overline{r}_{\mathfrak{m}}=\overline{r}$ and such that:

(1)

The similitude character $\nu\circ r$ is given by:

[TABLE]

where $\chi_{\mu,0}$ is a finite order character unramified at $p$ which is trivial modulo $\mathfrak{m}$.

(2)

For each prime $x\not\in S\cup Q\cup\{p\}$, $r$ is unramified at $x$ and $r(\mathrm{Frob}_{x})$ has characteristic polynomial $Q_{x}(X)$.

(3)

There are units $d_{p,1},\dots,d_{p,4}\in\mathbf{T}_{\mu}(Q)_{\mathfrak{m}}^{\beta}$ satisfying

[TABLE]

and such that:

(a)

We have $d_{p,1}\mod\mathfrak{m}=\beta$ and $d_{p,2}\mod\mathfrak{m}=\alpha$;

(b)

$r|G_{\mathbf{Q}_{p}}$ is conjugate in $\mathrm{GSp}_{4}$ to a representation of the form:

[TABLE]

(4)

After twisting by the unique square-root of $\chi_{\mu,0}$ which is trivial modulo $\mathfrak{m}$, the deformation $r$ of $\overline{r}$ satisfies properties (2)–(5) of Definition 4.6.

Remark 6.14.

We expect that, under the given assumptions, the Hecke rings in question are torsion free. However, we avoid having to prove this by passing to sufficiently high weight.

Proof.

As in Remark 6.5, $I_{\mu,m}$ denotes the annihilator of $H^{0}(X_{1}(Q),{\omega(a,b)(-\infty)}_{\mathcal{O}/\varpi^{m}})$ in $\mathbf{T}_{\mu}(Q)$ . Since $\mathbf{T}_{\mu}(Q)_{\mathfrak{m}}=\varprojlim_{m}\mathbf{T}_{\mu}(Q)_{\mathfrak{m}}/I_{\mu,m}$ , it suffices to construct, for each $m>0$ , a representation $r_{m}:G_{\mathbf{Q}}\to\mathrm{GSp}_{4}(\mathbf{T}_{\mu}(Q)_{\mathfrak{m}}^{\beta}/I_{\mu,m})$ satisfying the conditions of the theorem. We thus fix an $m>0$ . Choose an integer $s>0$ as in Lemma 6.4 and let $t=rs(p-1)$ . By Lemma 6.4 and Lemma 6.2 (2), multiplication by $A^{s}$ restricts to a map:

[TABLE]

This in turn gives rise to a surjective map $\mathbf{T}_{\mu^{\prime}}(Q)_{\mathfrak{m}}^{\beta}\twoheadrightarrow\mathbf{T}_{\mu}(Q)_{\mathfrak{m}}^{\beta}/I_{\mu,m}$. Thus it suffices to prove the result in weight $\mu^{\prime}:=(a^{\prime},b^{\prime}):=(a+t,b+t)$.

Since $t\geq N(\mu)$ , we have that

[TABLE]

and hence we may regard $\mathbf{T}_{\mu^{\prime}}(Q)$ as acting faithfully on both

[TABLE]

Thus we have

[TABLE]

where the $K_{i}$ are a finite collection of finite extensions of $K$, one for each minimal prime $\wp_{i}$ of $\mathbf{T}_{\mu^{\prime}}(Q)_{\mathfrak{m}}^{\beta}$. Each such minimal prime corresponds to an eigenform $f_{i}$ for $\mathbf{T}_{\mu^{\prime}}(Q)_{\mathfrak{m}}^{\beta}$. The eigenform $f_{i}$ has an associated Galois representation $r_{f_{i}}:G_{\mathbf{Q}}\to\mathrm{GSp}_{4}(\mathcal{O}_{K^{\prime}_{i}})$ for some finite extension $K^{\prime}_{i}/K_{i}$, by Proposition 6.8. After conjugation, we may assume that each $r_{f_{i}}$ reduces to $\overline{r}$. By the argument of the proof of [CHT08, 3.4.4], using [GG12, Lemma 7.1.1] in place of [CHT08, 2.1.12], we see that the representation $\prod_{i}r_{f_{i}}$ descends to a representation $r:G_{\mathbf{Q}}\to\mathrm{GSp}_{4}(\mathbf{T}_{\mu^{\prime}}(Q)_{\mathfrak{m}}^{\beta})$. It follows from Proposition 6.8 that $r$ satisfies properties (1)–(3) of the theorem. For part (3), note that $Q_{p}(X)\in\mathbf{T}_{\mu^{\prime}}(Q)_{\mathfrak{m}}^{\beta}$ factors as

[TABLE]

for units $d_{p,i}\in\mathbf{T}_{\mu^{\prime}}(Q)_{\mathfrak{m}}^{\beta}$ . We also have $T_{p,1}\equiv\beta\mod\mathfrak{m}$ and $Q_{p,2}\equiv\alpha\beta\mod\mathfrak{m}$ in $\mathbf{T}_{\mu^{\prime}}(Q)_{\mathfrak{m}}^{\beta}$ (by definition of the idempotent $e_{\beta}$ ). Since $Q_{p}(X)=X^{4}-T_{p,1}X^{3}+p^{b-2}Q_{p,2}X^{2}-\dots$ , we deduce that $d_{p,1}\equiv\beta\mod\mathfrak{m}$ and $d_{p,2}\equiv\alpha\mod\mathfrak{m}$ .

To show that $r$ satisfies properties (2)–(5) of Definition 4.6, it suffices to show that each $r_{f_{i}}$ does so. In fact, property (2) has already been established with the exception of the prime $x=q$. If $x=q$, then (by our assumptions) $\mathrm{ad}^{0}(\overline{r})(1)$ as a $G_{\mathbf{Q}_{q}}$-module contains no subquotient isomorphic to $k$, and so $H^{2}(\mathbf{Q}_{q},\mathrm{ad}^{0}(\overline{r}))\simeq H^{0}(\mathbf{Q}_{q},\mathrm{ad}^{0}(\overline{r})(1))^{*}=0$. Since $q\neq p$, it follows that $H^{1}(\mathbf{Q}_{q},\mathrm{ad}^{0}(\overline{r}))$ consists entirely of unramified classes. In particular, all lifts of $\overline{r}$ are automatically unramified at $q$. Since $\mathfrak{m}$ is non-Eisenstein, it follows from Proposition 6.8(5) that $r_{f_{i}}$ satisfies local-global compatibility at all primes. Thus we may apply the results of [Sor10, §4.5].

We now turn to property (3) of Definition 4.6. If $x\in S(\overline{r})$ is of type U3, then $\overline{r}(I_{x})$ is unipotent and generated by a conjugate of $\exp(N_{3})$. Since $K_{x}=I(x)$, [Sor10, Corollary 1] implies that $r_{f_{i}}(I_{x})$ is topologically generated by a conjugate of $\exp(N_{3})$, $\exp(N_{2})$ or $\exp(N_{1})$. The latter two cases are incompatible with the residual representation being of nilpotent rank 3. Similarly, if $x\in S(\overline{r})$ is of type U2, then $K_{x}=\Pi(x)$ and [Sor10, Corollary 1] implies that $r_{f_{i}}(I_{x})$ is topologically generated by a conjugate of $\exp(N_{2})$ or $\exp(N_{1})$. The latter case is incompatible with the residual representation being of nilpotent rank 2. Finally, if $x\in S(\overline{r})$ is of type U1, then $K_{x}=K(x)$. It then suffices to note, following [Sor10, §4.5], that the corresponding representation $\pi_{x}$ is para-spherical, that is, has a non-zero fixed vector by a non-special maximal compact subgroup, namely $K(x)$ itself. This establishes property (3).

For property (4), suppose that $x\in S(\overline{r})$ is of type P. Then $K_{x}=\Pi(x)^{+}$. It follows from [Sor10, Corollary 1] that $\Pi(x)$ has no invariants on the automorphic representation generated by $f_{i}$ (as otherwise $r_{f_{i}}|I_{x}$ would be unipotent, contradicting the assumption on $\overline{r}$ at $x$). Thus $\Pi(x)/\Pi(x)^{+}$ acts through a non-trivial character on the space of $\Pi(x)^{+}$ invariants. By [Sor10, Corollary 3] all such characters have to lift the character $\nu\circ\overline{r}|I_{x}$. However, since $x-1$ is prime to $p$, there is a unique such character, and the result follows from [Sor10, Corollary 3].

Finally, we turn to property (5) of Definition 4.6. Let $x\in Q$, and recall that $K_{x}=\Pi(x)^{+}$. Let $\pi$ be the automorphic representation generated by $f_{i}$. Consider first the case where $\pi_{x}$ has non-trivial $\Pi(x)$-invariants. Then $\pi_{x}$ is a subquotient of an unramified principal series. By part (2) of Assumption 6.12 and [GT05, Prop. 3.2.3], we see that $\pi_{x}$ is unramified. In this case, property (5) of Definition 4.6 certainly holds for $r_{f_{i}}$. In the remaining case, where $\pi_{x}$ has no non-trivial $\Pi(x)$-invariants, we see that $\Pi(x)/\Pi(x)^{+}$ acts through a non-trivial character on $\pi_{x}^{\Pi(x)^{+}}$, and the required property holds by [Sor10, Corollary 3]. ∎
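The step in the proof of Theorem 6.13 at the prime $q$, passing from $H^{2}=0$ to the statement that every class in $H^{1}(\mathbf{Q}_{q},\mathrm{ad}^{0}(\overline{r}))$ is unramified, is a dimension count. The sketch below writes $M=\mathrm{ad}^{0}(\overline{r})$, $h^{i}=\dim_{k}H^{i}(\mathbf{Q}_{q},M)$ and $H^{1}_{\mathrm{ur}}$ for the subspace of unramified classes; this shorthand is ours. It uses only the local Euler characteristic formula for a prime $q\neq p$.

```latex
% Requires amsmath. M, h^i and H^1_ur are our shorthand, not the paper's.
\begin{align*}
h^0 - h^1 + h^2 &= 0
  && \text{(local Euler characteristic, } q \neq p\text{)}, \\
h^2 &= 0
  && \text{(local duality and the choice of } q\text{)}, \\
\dim_k H^1_{\mathrm{ur}}
  = \dim_k H^1\bigl(G_{\mathbf{Q}_q}/I_q,\, M^{I_q}\bigr) &= h^0
  && \text{(cohomology of } \widehat{\mathbf{Z}} \text{ on a finite module)}, \\
\Longrightarrow\quad h^1 = h^0 &= \dim_k H^1_{\mathrm{ur}},
  && \text{so } H^1(\mathbf{Q}_q, M) = H^1_{\mathrm{ur}} .
\end{align*}
```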

6.4. Galois representations in low weights

We let $\overline{r}:G_{\mathbf{Q}}\to\mathrm{GSp}_{4}(k)$ , $S=S(\overline{r})$ , $Q$ and $K\subset\mathrm{GSp}_{4}(\mathbb{A}^{\infty})$ be as in the previous section. Recall that in Section 4, we fixed two units $\alpha,\beta\in k^{\times}$ associated to $\overline{r}|G_{\mathbf{Q}_{p}}$ . We now let $\sigma=(a,2)\in X^{*}(T)_{M}^{+}$ with $a\geq 2$ denote a non-regular weight.

Definition 6.15.

We say that $\overline{r}$ is Katz modular of weight $\sigma$ if there exists a maximal ideal $\mathfrak{m}_{\emptyset}$ of $\mathbf{T}_{\sigma}$ such that:

(1)

We have $\overline{r}_{\mathfrak{m}_{\emptyset}}\cong\overline{r}$, and

(2)

There exists a form $\eta\in H^{0}(X,\omega(a,2)_{K/\mathcal{O}})[\mathfrak{m}_{\emptyset}]$ such that

[TABLE]

We now make the following assumption:

Assumption 6.16 (Residual Modularity).

We assume:

(1)

$\overline{r}$ is Katz modular of weight $\sigma$ with associated maximal ideal $\mathfrak{m}_{\emptyset}$ and eigenform $\eta$,

(2)

For each $x\in Q$, $x\equiv 1\mod p$ and $\overline{r}|G_{x}$ is a direct sum of four pairwise distinct characters with Frobenius eigenvalues $\alpha_{x},\beta_{x},\gamma_{x},\delta_{x}$. We assume the eigenvalues have been labeled so that the plane $\lambda(\alpha_{x})\oplus\lambda(\beta_{x})$ is isotropic, and hence $\alpha_{x}\delta_{x}=\beta_{x}\gamma_{x}$.

We let $\mathfrak{m}$ be any maximal ideal of $\mathbf{T}_{\sigma}(Q)$ containing $\mathfrak{m}_{\emptyset}$ .

Let $e_{\alpha,\beta}$ be the idempotent

[TABLE]

where $\tilde{\alpha}$ and $\tilde{\beta}$ are lifts of $\alpha$ and $\beta$ to $\mathcal{O}$ , and define:

[TABLE]

The assumption that $\overline{r}$ is Katz modular implies that this space is non-zero after localization at $\mathfrak{m}$ . We let $\mathbf{T}_{\sigma}(Q)^{\alpha,\beta}$ denote the image of $\mathbf{T}_{\sigma}(Q)$ in

[TABLE]

Our main result in this section is the following.

Theorem 6.17.

Let $\overline{r}$ , $\sigma=(a,2)$ with $p-1>a$ and $\mathfrak{m}$ be as above and suppose that Assumption 6.16 holds. In addition, suppose that:

[TABLE]

Then there exists a representation

[TABLE]

which is a minimal deformation of $\overline{r}$ outside $Q$ .

Proof.

As in the proof of Theorem 6.13, it suffices to prove the existence of an appropriate representation $r_{m}:G_{\mathbf{Q}}\to\mathrm{GSp}_{4}(\mathbf{T}_{\sigma}(Q)_{\mathfrak{m}}^{\alpha,\beta}/I_{\sigma,m})$ for each $m>0$ . We thus fix an $m>0$ . By Theorem 8.13 below, there exists a power $A^{s}$ of $A$ such that we have injections:

[TABLE]

where $t=rp^{m-1}(p-1)$. These in turn give rise to surjections:

[TABLE]

where $\mu^{\prime}=\mu+(t,t)$. The first of these surjections together with Theorem 6.13 implies the existence of a representation $r^{\prime}_{m}$ satisfying all of the required properties, except for conditions (1) and (6) of Definition 4.6. However, we deduce from the existence of both surjections that the representation $r^{\prime}_{m}|G_{p}$ contains two distinct rank-1 unramified submodules (spanned by basis vectors), one having Frobenius eigenvalue lifting $\alpha$ and the other having Frobenius eigenvalue lifting $\beta$. By Nakayama’s Lemma, we deduce that $r_{m}^{\prime}$ contains an unramified rank-2 submodule of the form required by condition (6) of Definition 4.6. In order to obtain a representation that also satisfies condition (1) of Definition 4.6, we note that $\nu(r^{\prime}_{m})=\chi\epsilon^{-(a-1)}\chi_{Q}$ where $\chi_{Q}$ is a finite order character of $p$-power order which is unramified outside $Q$. Since $p$ is odd, we can find a square root of $\chi_{Q}$ and twist $r^{\prime}_{m}$ by the inverse of this square root. The resulting representation $r_{m}$ now satisfies all required properties. ∎
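The final twisting step in the proof of Theorem 6.17 can be made explicit. In the sketch below, $p^{n}$ denotes the order of $\chi_{Q}$ and $\psi$ the chosen square root; both names are ours. The only input is that a scalar $c\in\mathrm{GSp}_{4}$ has similitude factor $c^{2}$.

```latex
% Requires amsmath. p^n (the order of \chi_Q) and \psi are our notation.
\begin{align*}
\psi &:= \chi_Q^{(p^n+1)/2}
  && \text{(the exponent is an integer because } p \text{ is odd)}, \\
\psi^2 &= \chi_Q^{\,p^n+1} = \chi_Q
  && \text{(since } \chi_Q^{\,p^n} = 1\text{)}, \\
\nu\bigl(r'_m \otimes \psi^{-1}\bigr)
  &= \nu(r'_m)\,\psi^{-2}
   = \chi\,\epsilon^{-(a-1)}\chi_Q\cdot\chi_Q^{-1}
   = \chi\,\epsilon^{-(a-1)} .
\end{align*}
% \psi has p-power order, so it is trivial modulo the maximal ideal and the
% twist does not change the residual representation \overline{r}.
```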

7. Properties of cohomology groups

As in Section 5.2, let $S$ and $Q$ be finite sets of primes of $\mathbf{Q}$ which are disjoint and do not contain $p$. We allow the possibility that $Q=\emptyset$. We let $K$ and $K_{i}(Q)$ be open compact subgroups of $\mathrm{GSp}_{4}(\mathbb{A}^{\infty})$ as in Section 5.2, and we let $X=X_{K}$ and $X_{i}(Q)=X_{K_{i}(Q)}$ be the corresponding Siegel threefolds. The goal of this section is to prove Theorems 7.2 and 7.11 below.

7.1. Taylor–Wiles primes

Fix $\mu=(a,b)\in X^{*}(T)_{M}^{+}$ with $a\geq b\geq 2$. Let $\mathfrak{m}_{\emptyset}$ be a non-Eisenstein maximal ideal of $\mathbf{T}_{\mu}$. The ideal $\mathfrak{m}_{\emptyset}$ gives rise to ideals of $\mathbf{T}^{\mathrm{an}}_{\mu}(Q)$ and $\mathbf{T}_{\mu}(Q)$ which we also denote by $\mathfrak{m}_{\emptyset}$ (see Section 6.3). We will need the following assumption (cf. Assumptions 6.12 and 6.16):

Assumption 7.1.

For each $x\in Q$ , we have $x\equiv 1\mod p$ , and $\overline{r}_{\mathfrak{m}_{\emptyset}}|G_{x}$ is a direct sum of four pairwise distinct characters with Frobenius eigenvalues $\alpha_{x},\beta_{x},\gamma_{x},\delta_{x}$ . We assume the eigenvalues have been labeled so that the plane $\lambda(\alpha_{x})\oplus\lambda(\beta_{x})$ is isotropic, and hence $\alpha_{x}\delta_{x}=\beta_{x}\gamma_{x}$ .

For $x\in Q$ , we let $\alpha_{x}^{\prime},\beta_{x}^{\prime},\gamma_{x}^{\prime},\delta_{x}^{\prime}\in\mathcal{O}^{\times}$ be elements lifting $\alpha_{x}$ , $\beta_{x}$ , $\gamma_{x},\delta_{x}\in k^{\times}$ . The point of the above assumption is to rule out the possibility of newforms at level $K_{0}(Q)$ :

Theorem 7.2.

Let $\mu$ and $\mathfrak{m}_{\emptyset}$ be as above, and suppose that Assumption 7.1 holds. Let $\mathfrak{m}$ denote the ideal of $\mathbf{T}_{\mu}(Q)$ containing $\mathfrak{m}_{\emptyset}$ together with the elements $xU_{x,2}-\alpha^{\prime}_{x}\beta^{\prime}_{x}$ and $U_{x,1}-\alpha^{\prime}_{x}-\beta^{\prime}_{x}$ for each $x\in Q$ . Then $\mathfrak{m}$ is maximal and there is an isomorphism

[TABLE]

which is equivariant for the operators $T_{x,i}$ for each $x\not\in S\cup Q\cup\{p\}$ as well as for the operators $T_{p,1}$ and $Q_{p,2}$ .

Here $i$ is the natural inclusion and $\mathrm{pr}_{Q}$ is defined as follows. For $x\in Q$ , let $R_{x}$ denote the Hecke operator

[TABLE]

and let $\mathrm{pr}_{x}$ denote the idempotent

[TABLE]

Then the $\mathrm{pr}_{x}$ ’s commute with one another and $\mathrm{pr}_{Q}$ denotes their product.

For compactness, we will make use of the alternative notation $\mathcal{W}_{\mu}=\omega(a,b)$, and $\mathcal{W}_{\mu}^{\mathrm{sub}}=\omega(a,b)(-\infty)$. In sufficiently high weight, Theorem 7.2 is due to Genestier and Tilouine:

Theorem 7.3.

Suppose $\mu=(a,b)$ is such that $H^{i}(X,\mathcal{W}^{\mathrm{sub}}_{\mu,k})$ and $H^{i}(X_{0}(Q),\mathcal{W}^{\mathrm{sub}}_{\mu,k})$ are 0 for all $i>0$ . Then the map

[TABLE]

is an isomorphism. An explicit inverse is given by the composition

[TABLE]

where $d_{Q}=\prod_{x\in Q}[\mathrm{GSp}_{4}(\mathbf{Z}_{x}):\Pi(x)]$ (which is prime to $p$ ) and $\mathrm{tr}$ is the trace map associated to $X_{0}(Q)\to X$ .

Proof.

By the assumption of cohomology vanishing, it suffices to prove both statements with $K/\mathcal{O}$ replaced by $K$. Indeed, if the map over $K$ is surjective, then so too is the map over $K/\mathcal{O}$. Furthermore, if $d_{Q}^{-1}\mathrm{tr}$ is an inverse over $K$, then the fact that it is defined over $\mathcal{O}$ implies immediately that it also gives an inverse over $K/\mathcal{O}$. The proof of the corresponding result over $K$ follows exactly as in the proof of [GT05, Proposition 11.1.2]. ∎
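The claim in Theorem 7.3 that $d_{Q}$ is prime to $p$ can be verified by a point count, given that each $x\in Q$ satisfies $x\equiv 1\mod p$ (Assumption 7.1) and that $p$ is odd. The sketch assumes, as a labeled hypothesis, that $\Pi(x)$ is the preimage in $\mathrm{GSp}_{4}(\mathbf{Z}_{x})$ of the Klingen parabolic $P(\mathbf{F}_{x})$, taken here to be the stabilizer of a line in $\mathbf{F}_{x}^{4}$; the symbol $P$ is ours.

```latex
% Requires amsmath. Assumption (ours): \Pi(x) is the preimage of the Klingen
% parabolic P(\mathbf{F}_x), the stabiliser of a line in \mathbf{F}_x^4.
\begin{align*}
[\mathrm{GSp}_4(\mathbf{Z}_x) : \Pi(x)]
  &= \#\bigl(\mathrm{GSp}_4(\mathbf{F}_x)/P(\mathbf{F}_x)\bigr)
   = \#\,\mathbf{P}^3(\mathbf{F}_x)
   && \text{(transitivity on lines)} \\
  &= 1 + x + x^2 + x^3 \\
  &\equiv 4 \bmod p
   && \text{(since } x \equiv 1 \bmod p\text{)} .
\end{align*}
% As p is odd, 4 is a unit modulo p, so every factor of d_Q is prime to p,
% and hence so is d_Q.
```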

Using this result and the Hasse invariant $h\in H^{0}(X,\omega^{p-1}_{k})$ , we can now establish Theorem 7.2 at the level of $\varpi$ -torsion. (Recall that cohomology in degree 0 over $k$ can be identified with $\varpi$ -torsion in degree 0 cohomology over $K/\mathcal{O}$ .) Note that, for any weight $\mu$ , the cohomology vanishing assumption of the previous theorem holds in weight $\mu+(t,t)$ as long as $t\geq N(\mu)$ (where $N(\mu)$ is defined in Definition 6.3).

Lemma 7.4.

Let $\mu=(a,b)$ with $a\geq b\geq 2$ . Choose an integer $t$ such that $(p-1)t\geq N(\mu)$ . Let $\mu^{\prime}=(a^{\prime},b^{\prime})=(a+t(p-1),b+t(p-1))$ . Then the following diagrams are co-cartesian:

[TABLE]

In particular, the left hand vertical maps are mutually inverse isomorphisms.

Proof.

Note that the right hand vertical maps are mutually inverse isomorphisms by Theorem 7.3 and the choice of $t$ . The diagrams are commutative because $h$ commutes with all Hecke operators at the primes in $Q$ (Lemma 6.2). Now, let $f\in H^{0}(X,\mathcal{W}^{\mathrm{sub}}_{\mu^{\prime},k})_{\mathfrak{m}_{\emptyset}}$ and let $F=\mathrm{pr}_{Q}(f)\in H^{0}(X_{0}(Q),\mathcal{W}^{\mathrm{sub}}_{\mu^{\prime},k})_{\mathfrak{m}}$ . Note that $f$ can be recovered from $F$ via the formula $f=d_{Q}^{-1}\mathrm{tr}(F)$ . We need to show that $f$ is divisible by $h^{t}$ if and only if $F$ is divisible by $h^{t}$ . But this follows immediately by the commutativity of the diagrams above: if $f=h^{t}g$ , then $F=h^{t}\mathrm{pr}_{Q}(g)$ , and if $F=h^{t}g$ , then $f=h^{t}d_{Q}^{-1}\mathrm{tr}(g)$ . (Note that since $X_{0}(Q)$ and $X$ are smooth (and in particular irreducible) over $k$ , multiplication by $h$ is injective on $H^{0}$ .) ∎

We will need the analogous result for forms on the non-ordinary locus: let $S$ (resp. $S_{0}(Q)$ ) denote the non-ordinary locus of $X_{k}$ (resp. $X_{0}(Q)_{k}$ ).

Lemma 7.5.

Let $\mu=(a,b)$ with $a\geq b\geq 2$ . Then the map

[TABLE]

is an isomorphism with inverse $d_{Q}^{-1}\mathrm{tr}$ .

Proof.

We first show that the result is true in sufficiently high weight. More precisely: let $t\geq N(\mu)+1$ . We let $\mu^{\prime}=(a+(t-1)(p-1),b+(t-1)(p-1))$ and $\mu^{\prime\prime}=(a+t(p-1),b+t(p-1))$ . We have a commutative diagram:

[TABLE]

The choice of $t$ guarantees that the rows are short exact sequences. From the previous lemma, we deduce that the right hand vertical map is an isomorphism with inverse $d_{Q}^{-1}\mathrm{tr}$ .

Now we imitate the proof of the previous lemma to deduce the result in smaller weights. For this we use the existence of the Hasse invariant

[TABLE]

Such a form was constructed in unpublished work of the second author with Goldring, but is also constructed in greater generality in [Box15] and [KG15]. In [Box15, Theorem B.2] (see also [Box15, Theorem 6.2.3]), it is shown that $\tilde{h}$ extends to the boundary (by the normality of the $p$-rank 1 locus) and that multiplication by $\tilde{h}$ is Hecke equivariant away from $p$ (see [Box15, Theorem 4.5.4(3)]). (It is also true, but not relevant here, that $\tilde{h}$ vanishes on the 1-dimensional Ekedahl–Oort stratum of $S$ to precise order $2$: see the references in the proof of Theorem 8.10 below for more discussion on this point.)

We choose an integer $s$ such that $t:=s(p+1)\geq N(\mu)+1$ . Let $\mu^{\prime\prime}=\mu+s(p^{2}-1,p^{2}-1)=\mu+t(p-1,p-1)$ . Then we have a commutative diagram:

[TABLE]

The right hand vertical map is an isomorphism with inverse $d_{Q}^{-1}\mathrm{tr}$ by the first paragraph. The lemma now follows by the same argument as the previous lemma. ∎

We will also need the analogous result for first degree cohomology over $k$ :

Lemma 7.6.

Suppose $\mu=(a,b)$ where $a\geq b\geq 2$ . Then the map

[TABLE]

is an isomorphism with inverse $d_{Q}^{-1}\mathrm{tr}$ .

Proof.

If $N(\mu)=0$ , then both sides of the map are zero, so we may assume that $N(\mu)>0$ . Let $t\geq N(\mu)$ , and let $\mu^{\prime}=(a+(t-1)(p-1),b+(t-1)(p-1))$ and $\mu^{\prime\prime}=(a+t(p-1),b+t(p-1))$ . Consider the diagram with exact rows:

[TABLE]

The first three vertical maps are isomorphisms with inverse $d_{Q}^{-1}\mathrm{tr}$ by the previous two lemmas. We deduce that the rightmost vertical map above is an isomorphism with inverse $d_{Q}^{-1}\mathrm{tr}$ . This proves the lemma in weight $\mu^{\prime}$ . The general case then follows by a similar argument using a reverse induction on $t$ . ∎

We are finally in a position to prove Theorem 7.2 in the general case.

Proof of Theorem 7.2.

For each $n\geq 1$ , let $\mathcal{O}_{n}:=\mathcal{O}/\varpi^{n}$ . We have a commutative diagram:

[TABLE]

The vertical maps on the ends are isomorphisms by Lemma 7.4 and Lemma 7.6. By induction on $n$ and the Five Lemma we deduce that the map

[TABLE]

is an isomorphism for all $n$ . This shows that the map of Theorem 7.2 is an isomorphism after passing to $\varpi^{n}$ -torsion, for any $n$ . The result follows. ∎
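For orientation, here is one possible shape of the inductive step in the proof of Theorem 7.2. It is a sketch only: the paper's own diagram is not reproduced here and may be arranged differently. The abbreviations $H^{i}(R)$ and $H^{i}_{0}(R)$, for the localized cohomology of $\mathcal{W}^{\mathrm{sub}}_{\mu}$ with coefficients in $R$ on $X$ and on $X_{0}(Q)$ respectively, and the name $\phi_{R}$ for the comparison map are our own.

```latex
% A sketch under our own abbreviations:
%   H^i(R)   := H^i(X,      \mathcal{W}^{sub}_{\mu,R})_{\mathfrak{m}_\emptyset},
%   H^i_0(R) := H^i(X_0(Q), \mathcal{W}^{sub}_{\mu,R})_{\mathfrak{m}},
%   \phi_R   := the comparison map of Theorem 7.2 with coefficients in R.
% Rows: long exact sequences attached to
%   0 -> k --(\varpi^{n-1})--> \mathcal{O}_n -> \mathcal{O}_{n-1} -> 0.
\[
\begin{array}{ccccccccc}
0 & \to & H^0(k) & \to & H^0(\mathcal{O}_n) & \to & H^0(\mathcal{O}_{n-1}) & \to & H^1(k) \\
  &     & \downarrow{\scriptstyle\phi_k} & & \downarrow{\scriptstyle\phi_{\mathcal{O}_n}}
  &     & \downarrow{\scriptstyle\phi_{\mathcal{O}_{n-1}}} & & \downarrow{\scriptstyle\phi_k} \\
0 & \to & H^0_0(k) & \to & H^0_0(\mathcal{O}_n) & \to & H^0_0(\mathcal{O}_{n-1}) & \to & H^1_0(k)
\end{array}
\]
% The outer vertical maps are isomorphisms by Lemma 7.4 (degree 0) and
% Lemma 7.6 (degree 1); \phi_{\mathcal{O}_{n-1}} is an isomorphism by induction.
% The Five Lemma then gives that \phi_{\mathcal{O}_n} is an isomorphism.
```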

7.2. The balanced property

In this section we assume that $\mu=(a,2)$ is a limit of discrete series weight, where $p>a-2$ . Let $\Delta$ be a quotient of $\Delta_{Q}:=\prod_{x\in Q}(\mathbf{Z}/x)^{\times}$ and let $X_{\Delta}(Q)\to X_{0}(Q)$ denote the corresponding sub-cover of $X_{1}(Q)\to X_{0}(Q)$ . If $\mathcal{L}$ is a vector bundle on $X_{\Delta}(Q)$ , we define

[TABLE]

for all $i$ . Note that $\omega^{3}(-\infty)$ is the dualizing sheaf on $X_{\Delta}(Q)$ .

We now take $\mathcal{L}=\omega(1,3-a)$ , so that $\omega^{3}\otimes\mathcal{L}^{\vee}(-\infty)\cong\omega(a,2)(-\infty)$ . Here we use our bound $p>a-2$ to deduce that there is an equality $(\mathrm{Sym}^{a-2})^{\vee}\simeq\mathrm{Sym}^{a-2}\otimes\det^{2-a}$ as $\mathcal{O}$ -modules. Thus, $\mathbf{T}_{\mu}(Q)$ acts on $H_{0}(X_{\Delta}(Q),\omega(1,3-a))$ . We fix a non-Eisenstein maximal ideal $\mathfrak{m}$ of $\mathbf{T}_{\mu}(Q)$ . We will need the following assumption:

Assumption 7.7.

The space $H^{2}(X_{\Delta}(Q),\omega(a,2)(-\infty)_{k})_{\mathfrak{m}}$ is trivial.

There is a slight abuse of notation here in that $\mathbf{T}_{\mu}(Q)$ does not act on $H^{2}(X_{\Delta}(Q),\omega(a,2)_{k})$ . The localization at $\mathfrak{m}$ refers to the localization at the corresponding maximal ideal of the polynomial ring over $\mathcal{O}$ generated by the Hecke operators.
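The role of the bound $p>a-2$ in the identification $(\mathrm{Sym}^{a-2})^{\vee}\simeq\mathrm{Sym}^{a-2}\otimes\det^{2-a}$ can be seen from the following chain of isomorphisms. This is a sketch in our own notation: $V$ is a free $\mathcal{O}$-module of rank 2 with its $\mathrm{GL}_{2}$-action, $n=a-2$, and $\Gamma^{n}$ denotes the $n$-th divided power.

```latex
% Requires amsmath. V, n = a-2 and \Gamma^n (divided powers) are our notation.
\begin{align*}
V^\vee &\simeq V\otimes{\det}^{-1}
  && \text{(the pairing } V\times V\to\wedge^2 V \text{ is perfect in rank 2)}, \\
(\mathrm{Sym}^n V)^\vee &\simeq \Gamma^n(V^\vee)
  && \text{(valid over any base)}, \\
\Gamma^n(V^\vee) &\simeq \mathrm{Sym}^n(V^\vee)
  && \text{(valid when } n! \in \mathcal{O}^\times \text{, i.e. when } p > n = a-2\text{)}, \\
\mathrm{Sym}^n(V^\vee) &\simeq \mathrm{Sym}^n\bigl(V\otimes{\det}^{-1}\bigr)
   \simeq \mathrm{Sym}^n V\otimes{\det}^{-n}
   = \mathrm{Sym}^{a-2}V\otimes{\det}^{2-a} .
\end{align*}
```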

Remark 7.8.

We note that if $p\geq a\geq 4$ , then the assumption above holds, even before localization at $\mathfrak{m}$ , by Theorem 5.1.

Lemma 7.9.

Suppose Assumption 7.7 holds. Then $H_{1}(X_{\Delta}(Q),\omega(1,3-a))_{\mathfrak{m}}$ is $p$ -torsion free.

Proof.

The claim is equivalent to the divisibility of $H^{1}(X_{\Delta}(Q),\omega(a,2)(-\infty)_{K/\mathcal{O}})_{\mathfrak{m}}$ . Since $X_{\Delta}(Q)$ is flat over $\mathcal{O}$ , there is an exact sequence

[TABLE]

Taking cohomology, this reduces to the claim that $H^{2}(X_{\Delta}(Q),{{\omega(a,2)(-\infty)}}_{k})_{\mathfrak{m}}$ vanishes. ∎
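The reduction in the proof of Lemma 7.9 can be spelled out as follows. This is a sketch under two assumptions that we label explicitly: that the exact sequence in question is the one induced by $k\simeq\varpi^{-1}\mathcal{O}/\mathcal{O}\subset K/\mathcal{O}$, and that $H_{1}$ is the Pontryagin dual of the displayed $H^{1}$ with $K/\mathcal{O}$-coefficients. The shorthand $\mathcal{F}=\omega(a,2)(-\infty)$ is ours.

```latex
% Requires amsmath. \mathcal{F} := \omega(a,2)(-\infty) is our shorthand; the
% exact sequence and the duality used here are assumptions of this sketch.
\begin{align*}
&0 \to \mathcal{F}_k \to \mathcal{F}_{K/\mathcal{O}}
   \xrightarrow{\ \varpi\ } \mathcal{F}_{K/\mathcal{O}} \to 0, \\
&H^1(X_\Delta(Q),\mathcal{F}_{K/\mathcal{O}})_{\mathfrak{m}}
   \xrightarrow{\ \varpi\ }
 H^1(X_\Delta(Q),\mathcal{F}_{K/\mathcal{O}})_{\mathfrak{m}}
   \to H^2(X_\Delta(Q),\mathcal{F}_k)_{\mathfrak{m}} = 0 .
\end{align*}
% So \varpi is surjective on H^1 with K/O-coefficients, i.e. that module is
% divisible. Multiplication by \varpi is then injective on its Pontryagin
% dual, which is therefore p-torsion free.
```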

The following lemma uses only the assumption that $\mathfrak{m}$ is non-Eisenstein: it holds in all weights and in all prime to $p$ levels. We just state it in the case we need:

Lemma 7.10.

The map

[TABLE]

is an isomorphism for all $i$ .

Proof.

Let $\partial X$ denote the boundary of $X_{0}(Q)$ . It suffices to show that the boundary cohomology

[TABLE]

vanishes for all $i$ . However, over $\mathbf{C}$ the cohomology of the boundary is computed by the nerve spectral sequence:

[TABLE]

See [HZ01, (3.2.4)]. Here $R$ is a $\mathbf{Q}$-parabolic of $G$ and $r(R)$ is its parabolic rank. By [HZ01, Corollary 3.2.9], and freely using the notation of that paper, the space $E_{1}(R)^{r,s}$ is the space of $K$-invariants in:

[TABLE]

If $R=\Pi$ is the Klingen parabolic, then $G_{h,R}=\mathrm{GSp}_{2}=\mathrm{GL}_{2}$ and $G_{\ell,R}=\mathrm{GL}_{1}$ . If $R$ is the Siegel parabolic or the Borel subgroup, then $G_{h,R}$ is trivial and $G_{\ell,R}=L_{R}$ is the Levi component of $R$ (and hence is either $\mathrm{GL}_{2}\times\mathrm{GL}_{1}$ or $\mathrm{GL}_{1}^{3}$ ). In all cases, $\mathcal{V}_{\lambda(h,w)}$ is the canonical extension of an automorphic vector bundle on the Shimura variety $X(G_{h})$ and $\widetilde{\mathbf{V}}_{\lambda(\ell,w)}$ is a local system on $X(G_{\ell})$ associated to an algebraic representation of $G_{\ell}$ . See [HZ94, (3.6.1)] for the highest weight formulas. The functor $I_{R}$ is an intermediate induction defined in [HZ01, (3.2.8)].

Since each of the groups $G_{h}$ and $G_{\ell}$ is a product of copies of $\mathrm{GL}_{2}$ and $\mathrm{GL}_{1}$, we see that to any Hecke eigenclass in any $H^{i}(\partial X,\omega(a,2)_{\mathbf{C}})$, we can associate a compatible system of reducible $\mathrm{GSp}_{4}$-valued $l$-adic representations of $G_{\mathbf{Q}}$. Since the ideal $\mathfrak{m}$ is non-Eisenstein, it follows that $H^{i}(\partial X,\omega(a,2))_{\mathfrak{m}}\otimes K=0$, as required. ∎

We come to the main result of this section:

Theorem 7.11.

Let $\Delta$ be a quotient of $\Delta_{Q}$ which is of $p$ -power order. As above, let $\mu=(a,2)$ with $p-2>a$ and let $\mathfrak{m}$ be a non-Eisenstein ideal of $\mathbf{T}_{\mu}(Q)$ . Suppose that Assumption 7.7 holds. Then the $\mathcal{O}[\Delta]$ -module

[TABLE]

is balanced in the sense of Definition 3.2.

Proof.

The argument proceeds exactly as in the proof of Prop. 3.8 of [CG18]. If we let $M=H_{0}(X_{\Delta}(Q),\omega(1,3-a))_{\mathfrak{m}}$ and $S=\mathcal{O}[\Delta]$ , then the defect $d_{S}(M)$ is given by:

[TABLE]

where $r$ is the $\mathcal{O}$ -rank of $M_{\Delta}$ . Thus we need to show that $r\geq\dim_{k}\mathrm{Tor}_{1}^{S}(M,\mathcal{O})/\varpi$ .

Let $\mathcal{L}=\omega(1,3-a)$ . Applying Pontryagin duality to the Hochschild–Serre spectral sequence, we get a spectral sequence:

[TABLE]

This spectral sequence tells us that:

(1)

$M_{\Delta}\xrightarrow{\sim}H_{0}(X_{0}(Q),\mathcal{L})_{\mathfrak{m}}$, and

(2)

we have a short exact sequence

[TABLE]

To prove that $d_{S}(M)\geq 0$ , it follows from the second point that it is sufficient to show that $H_{1}(X_{0}(Q),\mathcal{L})_{\mathfrak{m}}$ is free of rank at most $r$ over $\mathcal{O}$ . Lemma 7.9 tells us that this space is $p$ -torsion free. Passing to characteristic 0 and using the first point, we are therefore reduced to establishing the inequality:

[TABLE]

In other words, we need to show:

[TABLE]

By Lemma 7.10, we are reduced to showing that

[TABLE]

where $\overline{H}^{i}$ denotes the interior cohomology (the image of $H^{i}(\omega(a,b)(-\infty))$ in $H^{i}(\omega(a,b))$ ).

As recalled in Theorem 5.2, the interior cohomology can be computed in terms of square integrable automorphic forms on $G$ . By Remark 5.10, the cohomology of $\omega(a,b)$ agrees with that of $\mathcal{W}_{\mu}\cong\mathcal{V}_{\sigma}$ where $\mu=(a,b;3-a-b)=(a,2;1-a)$ and $\sigma=(-2,-a;4-a)$ . Theorem 5.2 then implies that:

[TABLE]

where $m_{(2)}(\pi)$ denotes the multiplicity of $\pi$ in $\mathcal{A}_{(2)}(G)$. Fix a degree $i\in\{0,1\}$ and let $\pi\in\mathcal{A}_{(2)}(G)$ be such that $\pi$ contributes to $H^{i}(X_{0}(Q),\omega(a,2))_{\mathfrak{m}}\otimes\mathbf{C}$ under the above inclusion (for some embedding $K\hookrightarrow\mathbf{C}$). Let $\widetilde{\pi}$ denote the transfer of $\pi$ to $\mathrm{GL}_{4}(\mathbb{A})$ under the Classification Theorem of [Art04]. Then, by Remark 5.12, the infinitesimal character of $\widetilde{\pi}_{\infty}$ is $\chi_{(0,0,-(a-1),-(a-1))+3/2(1,1,1,1)}$. Let $\chi_{\pi}$ denote the central character of $\pi$.

The representation $\widetilde{\pi}$ falls into one of 6 classes (a)–(f) given in Section 5 of [Art04]. We show now that we can rule out all classes other than class (a). In cases (e) and (f), $\widetilde{\pi}$ is an isobaric sum of idele class characters. In case (d), $\widetilde{\pi}$ is of the form $\lambda|\cdot|^{1/2}\boxplus\lambda|\cdot|^{-1/2}\boxplus\mu$ where $\lambda$ is an idele class character and $\mu$ is a cuspidal automorphic representation of $\mathrm{GL}_{2}(\mathbb{A})$ such that its central character $\chi_{\mu}$ satisfies $\chi_{\mu}=\lambda^{2}=\chi_{\pi}$. Considering the infinitesimal character of $\widetilde{\pi}_{\infty}$, we see that we must have $a=2$ and $\mu$ must correspond to a classical modular eigenform of weight 2. In case (c), there is a cuspidal automorphic representation $\mu$ of orthogonal type of $\mathrm{GL}_{2}(\mathbb{A})$ such that $\widetilde{\pi}=\mu|\cdot|^{1/2}\boxplus\mu|\cdot|^{-1/2}$. Being of orthogonal type means that $\mu$ is induced from a quadratic extension of $\mathbf{Q}$. In case (b), $\widetilde{\pi}=\mu_{1}\boxplus\mu_{2}$ where the $\mu_{i}$ are distinct cuspidal automorphic representations of $\mathrm{GL}_{2}(\mathbb{A})$ with $\chi_{\mu_{1}}=\chi_{\mu_{2}}=\chi_{\pi}$. Considering the infinitesimal character of $\widetilde{\pi}_{\infty}$ and the fact that the $\mu_{i}$ have the same central character, it follows that $\mu_{i}$ are both associated to classical modular eigenforms of weight $a$. Thus, in all cases (b)–(f), we can associate a compatible family of reducible $l$-adic Galois representations to $\widetilde{\pi}$. This contradicts the fact that $\mathfrak{m}$ is non-Eisenstein.
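The weight computations in cases (d) and (b) amount to bookkeeping with the entries of the infinitesimal character of $\widetilde{\pi}_{\infty}$. The sketch below ignores the common shift by $3/2$, which does not affect differences or comparisons of sums. It uses the standard fact that a classical eigenform of weight $k$ contributes two entries differing by $k-1$.

```latex
% Requires amsmath. Entries of the infinitesimal character, up to a common shift.
\begin{align*}
\text{entries of } \widetilde{\pi}_\infty
  &: \ \{\,0,\ 0,\ -(a-1),\ -(a-1)\,\}, \qquad a \ge 2 . \\[4pt]
\text{Case (d):}\quad
  & \lambda|\cdot|^{1/2} \text{ and } \lambda|\cdot|^{-1/2}
    \text{ contribute two entries differing by exactly } 1 . \\
  & \text{The differences available are } 0 \text{ and } a-1,
    \text{ so } a-1 = 1, \text{ i.e. } a = 2 . \\
  & \text{The remaining entries } \{0,-1\} \text{ belong to } \mu,
    \text{ which therefore has weight } 2 . \\[4pt]
\text{Case (b):}\quad
  & \chi_{\mu_1} = \chi_{\mu_2} \text{ forces the two entries of } \mu_1
    \text{ and of } \mu_2 \text{ to have the same sum} . \\
  & \{0,0\}\sqcup\{-(a-1),-(a-1)\} \text{ has sums } 0 \neq -2(a-1),
    \text{ so it is excluded} . \\
  & \text{Hence each } \mu_i \text{ has entries } \{0,-(a-1)\},
    \text{ differing by } a-1, \text{ i.e. weight } a .
\end{align*}
```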

The only remaining case is case (a) where $\widetilde{\pi}$ is a cuspidal automorphic representation of $\mathrm{GL}_{4}(\mathbb{A})$ that is $\chi_{\pi}$ -self dual. By Clozel’s Purity Lemma [Clo90, Lemme 4.9], $\widetilde{\pi}_{\infty}$ is essentially tempered. (We thank Olivier Taïbi for pointing this out to us.) It follows that $\pi_{\infty}$ is also essentially tempered, since its $L$ -parameter is essentially bounded. Then by Theorem 5.6(3), $\pi_{\infty}$ is the limit of discrete series representation $\pi(\lambda,C_{i})$ where $\lambda=(a-1,0;4-a)$ . Furthermore, by a Theorem of Wallach [Mok14, Theorem 2.3], it follows that $\pi$ is cuspidal.

By the first part of Theorem 5.2 , the cuspidal cohomology $\mathcal{H}^{i}_{\mathrm{cusp},\sigma}$ maps injectively to the interior cohomology:

[TABLE]

where $m_{0}(\pi)$ is the multiplicity of $\pi$ in $\mathcal{A}_{0}(G)$ . Thus, at this point, we can prove that the dimensions

[TABLE]

are equal for $j=0,1$ if we can establish:

(1)

The spaces $H^{j}(\operatorname{\mathrm{Lie}}P^{-},K^{h};\pi(\lambda,C_{j})\otimes V_{\sigma})$ have the same dimension for $j=0,1$ ;

(2)

The representation $\pi^{\prime}=\pi^{\infty}\otimes\pi(\lambda,C_{1-i})$ also lies in $\mathcal{A}_{(2)}(G)$ ;

(3)

The multiplicities $m_{0}(\pi)$ , $m_{(2)}(\pi)$ , $m_{0}(\pi^{\prime})$ and $m_{(2)}(\pi^{\prime})$ are all equal.

The first point follows from [Har90, Theorem 3.4] which says that both spaces are one dimensional. The second point follows from [Art04]. Indeed, since $\pi(\lambda,C_{i})$ is essentially tempered, the local packet $\Pi_{\psi_{\infty}}$ (where $\psi=\widetilde{\pi}\boxplus 1$ , in the notation of [Art04]) is in fact an L-packet by [Mok14, Theorem 2.1]. Furthermore, it consists of the pair of representations $\{\pi(\lambda,C_{0}),\pi(\lambda,C_{1})\}$ (see [Mok14, §3.1]). Since the group $\mathcal{S}_{\psi}$ is trivial in Case (a) of [Art04], it then follows from part (ii) of the Classification Theorem that $\pi^{\prime}$ is also automorphic. Finally, for the third point, the theorem of Wallach quoted above implies that $\pi$ and $\pi^{\prime}$ are both cuspidal. Part (iii) of the Classification Theorem then implies that each of the multiplicities in point (3) is 1. We have thus shown that

[TABLE]

as required. ∎

8. $q$ -expansions of Siegel modular forms

As in Section 5.2, let $S$ and $Q$ be finite sets of primes of $\mathbf{Q}$ which are disjoint and do not contain $p$ . We allow the possibility that $Q=\emptyset$ . We let $K$ and $K_{i}(Q)$ be open compact subgroups of $\mathrm{GSp}_{4}(\mathbb{A}^{\infty})$ as in Section 5.2, and we let $X=X_{K}$ and $X_{i}(Q)=X_{K_{i}(Q)}$ be the corresponding Siegel threefolds, with open subspaces $Y$ and $Y_{i}(Q)$ , all defined over $\mathcal{O}$ .

8.1. $q$ -expansions of Siegel modular forms

(For more background and details on the results quoted in this section, see § 3.1 of [Til06].) Recall that $Y_{1}(Q)$ has good reduction at $p$ . Let $R$ be an $\mathcal{O}$ -module (we will exclusively be interested in the case when either $R=\mathcal{O}/\varpi^{n}$ for some $n$ , or when $R=K/\mathcal{O}$ ). Let $\sigma=(j,k)\in X^{*}(T)_{M}^{+}$ be a weight and associate to $\sigma$ the representation

[TABLE]

of $\mathrm{GL}_{2}$ over $R$ . Associated to $\sigma$ , we also have the vector bundle $\mathcal{W}_{\sigma}=\omega(j,k)$ . There is a $q$ -expansion map:

[TABLE]

Theorem 8.1.

The $q$ -expansion map is injective.

Proof.

This is a standard fact (see, for example, Prop. 3.2 of [Til06]). ∎

8.2. Explicit Formulae

Let $L$ be the product of the primes in $S$ and $Q$ , so that $X_{1}(Q)$ has good reduction outside $L$ . Let $R$ be a $\mathbf{Z}_{p}$ -module and thus a $\mathbf{Z}[1/L]$ -algebra. Any $F\in H^{0}(X_{1}(Q),\mathcal{W}_{\sigma,R})$ has a “ $q$ -expansion”:

[TABLE]

where $\mathcal{X}$ denotes the $2\times 2$ positive semi-definite matrices which take on $\mathbf{Z}[1/L]$ -integral arguments for integral vectors, or equivalently,

[TABLE]

The set $\mathcal{X}$ is naturally a subset of $M_{2}(\mathbf{Q})$ . The group $\mathrm{GL}_{2}(\mathbf{Q})$ acts on $M_{2}(\mathbf{Q})$ by the following formula:

[TABLE]

where the right hand side is multiplication. We may naturally extend the definition of $a(F,Q)$ for $Q\in M_{2}(\mathbf{Q})$ by setting $a(F,Q)=0$ for all $Q$ not in $\mathcal{X}$ . In any $q$ -expansion, the coefficients $a(F,Q)$ will also vanish unless the denominators occurring in $Q$ are bounded by some fixed power of $L$ which depends only on the level structure. (Since our arguments in this section are all $p$ -adic, there is little harm in imagining that $L=1$ .) Let $V=\mathcal{O}^{2}$ be the standard representation of $\mathrm{SL}_{2}(\mathbf{Z})$ over $\mathcal{O}$ . The elements $a(F,Q)$ are elements of the representation $U$ , where, if $\sigma$ has weight $(j,k)$ , then

[TABLE]

Let $\rho:\mathrm{SL}_{2}(\mathbf{Z})\rightarrow\mathrm{GL}(U)$ denote the corresponding representation. The representation $\rho$ extends to a homomorphism from $M_{2}(\mathbf{Z})$ to $\mathrm{End}(U)$ over $R$ which we denote by $\rho$ , where once more $\rho$ only depends on $j-k$ and (more relevantly) preserves integrality. We may write the $q$ -expansion of a form $F$ as

[TABLE]

where $a_{F}(n,r,m)=a(F,Q)$ satisfies, for $M\in\overline{\Gamma}\subset\mathrm{SL}_{2}(\mathbf{Z})$ , the equality

[TABLE]

Here $\overline{\Gamma}$ is the congruence subgroup of $\mathrm{SL}_{2}(\mathbf{Z})$ defined on p.807 of [Til06]; since we are working at spherical level at $p$ the group $\overline{\Gamma}$ has level prime to $p$ . (It will do the reader little harm to pretend that $\overline{\Gamma}$ is just $\mathrm{SL}_{2}(\mathbf{Z})$ .)

Remark 8.2.

We shall assume that either $j\geq 4$ or $j=k=2$ . Since we are most interested in representations whose similitude character $\nu$ is equal to $\epsilon^{j+k-3}$ , the oddness condition forces the congruence $j\equiv k\mod 2$ , and so if $j>k\geq 2$ then $j\geq 4$ . In cases (coming from Taylor–Wiles primes) where there is non-trivial Nebentypus character at the auxiliary primes $q|Q$ , we may twist (at the cost of increasing the level at $Q$ ) to force the Nebentypus character to be trivial. The only effect of this is to make the $q$ -expansions below less unpleasant: the addition of a Nebentypus character only introduces a notational difficulty. We note, however, that with non-trivial Nebentypus character the case of weight $(j,k)=(3,2)$ is possible, but our arguments would not cover this case.

8.3. Hecke Operators at $p$

Since we will exclusively be interested in Hecke operators at $p$ , we drop the subscript $p$ from the notation. Similarly, we drop the subscript $1$ , and so $T_{p,1}$ and $U_{p,1}$ are denoted $T$ and $U$ , whereas $T_{p,2}$ and $U_{p,2}$ are denoted $T_{2}$ and $U_{2}$ respectively. One has the following explicit description of the Hecke operator $T$ :

Lemma 8.3.

In weight $\sigma=(j,k)$ there is an identity of formal operators $T=U+p^{k-2}Z+p^{k+j-3}V$ , where $U$ , $Z$ , and $V$ preserve formal integral $q$ -expansions, and such that the following identities hold:

[TABLE]

Here $\mathcal{S}$ denotes (any) set of representatives in $M_{2}(\mathbf{Z})$ for the left coset decomposition of

[TABLE]

Moreover, $a(F,S^{-1}Q)=0$ unless $S^{-1}Q$ is a $p$ -integral binary quadratic form.

Note that the coset decomposition of $\overline{\Gamma}\left(\begin{matrix}p&0\\ 0&1\end{matrix}\right)\overline{\Gamma}$ for a congruence subgroup $\overline{\Gamma}$ prime to $p$ is essentially the same as the coset decomposition of $\mathrm{SL}_{2}(\mathbf{Z})\left(\begin{matrix}p&0\\ 0&1\end{matrix}\right)\mathrm{SL}_{2}(\mathbf{Z})$ . These formulae are well known. See, for example, Prop 10.2 of [CvdG15]. To compare our formula with ibid, note that we have normalized the matrices in $\mathcal{S}$ to be integral of determinant $p$ , and absorbed the action of the determinant into the coefficient (since we are concerned here with issues of $p$ -integrality). We have a similar description of $T_{2}$ which can be obtained by a laborious computation (following the arguments of §3.2 and §3.3 of [And87]):

Lemma 8.4.

In weight $\sigma=(j,k)$ there is an identity of formal operators $T_{2}=p^{k+j-6}U_{2}+p^{k-3}Z_{2}+p^{2k+j-6}V_{2},$ where $U_{2}$ , $Z_{2}$ , and $V_{2}$ preserve formal integral $q$ -expansions, and the following identities hold:

[TABLE]

where $\mathcal{S}$ is as in the description of $Z$ in Lemma 8.3. If $Q\not\equiv 0\mod p$ , then

[TABLE]

If $Q\equiv 0\mod p$ , then

[TABLE]

For those wanting a more explicit description, note that in weight $(k,k)$ we have the possibly more familiar identities:

[TABLE]

Note also that there is a formal identity $Z_{2}=UZ$ .

Definition 8.5.

Let $X_{2}$ denote the formal operator on $q$ -expansions such that

[TABLE]

Explicitly, if $Q\not\equiv 0\mod p$ , then $a(X_{2}F,Q)=a(F,Q)$ times $(D/p)$ , where $D$ is the determinant of the quadratic form associated to $Q$ , and $(D/p)$ is the Legendre symbol. If $Q\equiv 0\mod p$ , then $a(X_{2}F,Q)=pa(F,Q)$ . In all cases, we see that $a(X_{2}F,Q)=(D/p)a(F,Q)\mod p$ .
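The mod $p$ description of $X_{2}$ can be tried out on a toy coefficient table. The following is a minimal sketch, assuming scalar-valued coefficients (as in parallel weight) indexed by triples $(n,r,m)$ standing for the form $nx^{2}+rxy+my^{2}$ , and assuming that $D$ is the determinant of the matrix with rows $(n,r/2)$ and $(r/2,m)$ . The names `legendre` and `x2_mod_p` are hypothetical and do not come from the paper.

```python
def legendre(d, p):
    """Legendre symbol (d/p) for an odd prime p, with values in {-1, 0, 1}."""
    d %= p
    if d == 0:
        return 0
    return 1 if pow(d, (p - 1) // 2, p) == 1 else -1


def x2_mod_p(coeffs, p):
    """Apply the mod p shape of X_2: a(F, Q) -> (D/p) * a(F, Q) mod p.

    coeffs maps (n, r, m) to an integer coefficient. The quantity 4nm - r^2 is
    four times the determinant of [[n, r/2], [r/2, m]], so (assumption about
    normalization) it has the same Legendre symbol as the determinant.
    """
    out = {}
    for (n, r, m), a in coeffs.items():
        out[(n, r, m)] = (legendre(4 * n * m - r * r, p) * a) % p
    return out


p = 5
# Toy data only: these coefficients are not those of any actual Siegel form.
F = {(1, 0, 1): 3, (1, 1, 1): 2, (1, 0, 5): 4, (5, 5, 5): 1, (2, 1, 3): 1}
once = x2_mod_p(F, p)
thrice = x2_mod_p(x2_mod_p(once, p), p)
print(once)
print(thrice == once)
```

Because $(D/p)^{3}=(D/p)$ , applying the map three times agrees with applying it once; this is the shape of the identity $X^{3}_{2}F=X_{2}F$ modulo $p$ that is used in the proof of Lemma 8.15.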

Lemma 8.6.

Over $k=\mathcal{O}/\varpi$ , we have $Z_{2}X_{2}=0$ .

Proof.

We have $a(X_{2}F,Q)=0$ if $\det(Q)\equiv 0\mod p$ , but $a(Z_{2}F,Q)$ is a sum over terms of the form $a(F,R)$ with $\det(R)\equiv 0\mod p$ . ∎

Definition 8.7.

A binary quadratic form $Q$ is $p$ -primitive if it is not of the form $pR$ for a $p$ -integral form $R$ .

8.4. Hecke Operators on forms in characteristic $p$

Let $Q_{2}=(p\cdot T_{2}+(p+p^{3})S)p^{2-k}$ .

Lemma 8.8.

There is an action of $T$ and $Q_{2}$ on $H^{0}(X_{1}(Q),{\omega(j,k)}_{K/\mathcal{O}})$ which commutes with the other Hecke operators and acts on $q$ -expansions via the above formula.

Proof.

The argument is very similar to Prop. 4.1 of [Gro90]. It suffices to prove the result with coefficients in $\mathcal{O}/\varpi^{m}$ . The natural approach to defining these operators is using correspondences, as for modular curves. There are two issues which arise. The first is that the projection maps from the Siegel modular varieties with appropriate parahoric level structures are not finite over $X$ . The second is that the definition involving correspondences is some power of $p$ times the actual Hecke operator of interest. A general approach to resolving these questions has been recently found by Pilloni [Pil12a], who constructs all the operators used in this paper. More importantly, his method also allows one to give an action of these operators on higher coherent cohomology as well. We use a more pedestrian approach. We can resolve the normalization issue by using the $q$ -expansion principle. The first issue is more subtle. The geometric maps involved are certainly proper; the failure of finiteness is thus a failure of quasi-finiteness. The failure of quasi-finiteness arises from the fact that the kernel of Frobenius of an abelian surface $A$ could (for example) equal $\alpha_{p}\times\alpha_{p}$ , which contains “too many” subgroup schemes of type $\alpha_{p}$ . On the other hand, this issue does not arise over the ordinary locus nor over the larger almost ordinary locus consisting of abelian surfaces with $p$ -rank $\geq 1$ , where subgroup schemes such as $\alpha_{p}\times\alpha_{p}$ cannot occur. This shows how to resolve the issue by the following ad hoc method: by Hartogs’ Lemma, it suffices to construct $T$ over the global sections of a subvariety $X^{\prime}\subset X$ whose complement has codimension $\geq 2$ . In particular, we may replace $X$ by the moduli space of almost ordinary abelian surfaces for which the corresponding maps are indeed finite.
Implicit in this argument is a verification that the formulas above (in Lemmas 8.3 and 8.4) preserve integrality — for $Q_{2}$ this is verified in Lemma 8.12 below. ∎

Note that this argument is not sufficient to construct these operators on

[TABLE]

however, we have no need to consider the action of Hecke operators at $p$ on these spaces.

We shall also need to use various properties of theta operators. We begin by recalling their basic properties:

Proposition 8.9.

Let $p>3$ , let $j-2\geq k\geq 2$ , and let $p-2>j-k$ .

(1)

There is a map

[TABLE]

whose action on $q$ -expansions is given by

[TABLE]

(2)

There is a map

[TABLE]

whose action on $q$ -expansions is given by

[TABLE]

where $\mathrm{con}:\mathrm{Sym}^{j-k}\otimes\mathrm{Sym}^{2}\rightarrow\mathrm{Sym}^{j-k-2}$ is the natural $\mathrm{SL}_{2}(\mathbf{Z})$ -equivariant projection.

Proof.

The operator $\Theta$ is defined in [Yam16, Prop 3.9], and the operator $\theta_{1}$ is defined in [Yam16, Prop 3.12]. ∎

(Some of these maps were also considered in previous unpublished work of Ghitza [Ghi].) The main results we need concerning these operators are given by the next two theorems.

Theorem 8.10.

Let $p>3$ and $p+1\geq k$ , and assume $p\nmid k(2k-1)$ — so in particular $k=2$ and $k=p+1$ are admissible values of $k$ . Then the map

[TABLE]

is injective. In particular, if $\Theta F=0$ , we must have $F=0$ .

Proof.

We may immediately reduce to the case $m=1$ and $\mathcal{O}/\varpi=k$ . Suppose that $F$ lies in the kernel, so $\Theta F=0$ . After possibly replacing $(k,k)$ by $(k-(p-1),k-(p-1))$ , we may assume that $F$ is not divisible by the Hasse invariant. Following Theorem 4.7 of [Yam16], it suffices to show that $F$ is not zero on the superspecial locus if it is not divisible by the Hasse invariant. Since $F$ is not divisible by the Hasse invariant, it has non-trivial specialization to the $p$ -rank $1$ stratum. The supersingular locus on this stratum is a Cartier divisor cut out by a section of $\omega^{(p^{2}-1)/2}$ for $p>2$ , so since $2k<p^{2}-1$ (for $p>3$ ), the restriction of $F$ is non-zero on the supersingular locus. (That the supersingular locus is a Cartier divisor inside the $p$ -rank $1$ locus when $p>2$ was proved by Koblitz, see p.193 of [Kob75]. The exact order of vanishing can also be found in [vdG99], Theorem 2.4.) Finally, each irreducible component of the supersingular locus is a copy of $\mathbf{P}^{1}$ with $p^{2}+1$ superspecial points on it. Moreover, the line bundle $\omega$ restricts to $\mathcal{O}(p-1)$ on each of these $\mathbf{P}^{1}$ s. Hence the restriction to the superspecial points is injective as long as $k(p-1)\leq p^{2}+1$ , which holds for $k\leq p+1$ . ∎
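The numerical conditions invoked in this proof are elementary and can be checked mechanically. The following is a small sketch (the function name `theorem_8_10_checks` is hypothetical) that tests, for a given prime $p$ and all weights $2\leq k\leq p+1$ , the degree bound $k(p-1)\leq p^{2}+1$ and the bound $2k<p^{2}-1$ , and confirms that $k=2$ and $k=p+1$ satisfy $p\nmid k(2k-1)$ .

```python
def theorem_8_10_checks(p):
    """Return (degree_ok, divisor_ok, endpoints_ok) for a prime p."""
    ks = range(2, p + 2)  # weights k with 2 <= k <= p + 1
    # Degree of O(k(p-1)) on P^1 against the p^2 + 1 superspecial points.
    degree_ok = all(k * (p - 1) <= p * p + 1 for k in ks)
    # Weight k against the section of omega^((p^2-1)/2) cutting out the divisor.
    divisor_ok = all(2 * k < p * p - 1 for k in ks)
    # Admissibility of the two endpoint weights k = 2 and k = p + 1.
    endpoints_ok = all(k * (2 * k - 1) % p != 0 for k in (2, p + 1))
    return degree_ok, divisor_ok, endpoints_ok


for p in (5, 7, 11, 13):
    print(p, theorem_8_10_checks(p))
```

For $p=3$ the second bound fails at $k=4$ , which matches the hypothesis $p>3$ in the statement.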

We also require a related result for non parallel weight.

Theorem 8.11.

Let $p-1>j\geq 4$ . The map:

[TABLE]

is injective.

Proof.

It suffices to work over $k=\mathcal{O}/\varpi.$ Suppose that $\theta_{1}F=0$ , and that $F$ is non-zero after restriction to the superspecial locus. Then the result follows directly from Theorem 3.20 of [Yam16]. As stated, the result does not apply in weight $(6,2)$ , although the same argument works in this weight providing that one may assume (in the notation of ibid.) that $F_{2}|_{X}\neq 0$ , which can be achieved under the action of $\overline{\Gamma}\subset\mathrm{SL}_{2}(\mathbf{Z})$ for $j<p-1$ , since the level of $\overline{\Gamma}$ is prime to $p$ and so $\overline{\Gamma}$ surjects onto $\mathrm{SL}_{2}(\mathbf{F}_{p})$ . The corresponding representation of $\mathrm{SL}_{2}(\mathbf{F}_{p})$ is irreducible, and thus there exists an element which applied to $F$ has $F_{i}|_{X}\neq 0$ for any fixed choice of $i$ . Hence it remains to show that the restriction of $F$ to the superspecial locus is non-zero. Let $X=X_{1}(Q)$ , and denote the rank one stratum, the supersingular locus, and the superspecial locus by $Y$ , $Z$ , and $S$ respectively. We are assuming that the restriction of $F$ to $Y$ is nonzero. Suppose the restriction of $F$ to $Z$ is zero. There is an exact sequence:

[TABLE]

where $m=(p^{2}-1)/2$ . If $F$ restricts to zero, we obtain a non-zero class in the first group. Yet there is also a sequence:

[TABLE]

The first term vanishes. To see that the final term vanishes, we use the fact that Serre duality shows that the last term is dual to

[TABLE]

which vanishes by Theorem 5.1. We now have to establish non-vanishing from $Z$ to $S$ . The restriction of the Hodge bundle to any $\mathbf{P}^{1}$ on $Z$ is $\mathcal{O}(-1)\oplus\mathcal{O}(p)$ . Hence we need to show that no class in

[TABLE]

can vanish at $p^{2}+1$ points. This is valid as long as

[TABLE]

which holds provided $j\leq p$ .

∎

8.5. Relationship between Hecke eigenvalues and crystalline Frobenius

Suppose that $F$ is a cuspidal eigenform of weight $\sigma=(j,k)$ of level prime to $p$ , and let $r:G_{\mathbf{Q}}\rightarrow\mathrm{GSp}_{4}(\overline{\mathbf{Q}}_{p})$ be the associated Galois representation. One expects (and knows in regular weights, see Theorem 6.13) that $r$ is crystalline at $p$ and that crystalline Frobenius has eigenvalues which are the roots of the following polynomial:

[TABLE]

where $\lambda$ is the eigenvalue of $T$ and $\mu$ is the eigenvalue of $T_{2}$ . We may write the eigenvalues of this polynomial as follows:

[TABLE]

where $\alpha$ and $\beta$ have non-negative $p$ -adic valuation. That means that the coefficient of crystalline Frobenius should have characteristic polynomial:

[TABLE]

On the other hand, we know that the coefficient of $X^{2}$ should be:

[TABLE]

where the operator $Q_{2}$ is defined by this formula. In particular, the eigenvalues of this operator ( $Q_{2}$ ) should all be integral.

Lemma 8.12.

Let $\sigma=(j,k)$ with $j\geq k\geq 2$ . If $(j,k)\neq(2,2)$ , there is a congruence of operators on formal $q$ -expansions:

[TABLE]

In particular, if $F$ is an ordinary form of regular weight $\sigma$ with crystalline eigenvalues as above, the eigenvalue of $Z_{2}$ is $\alpha\beta\mod p$ . If $\sigma=(2,2)$ , there is a congruence

[TABLE]

Proof.

The operator $S$ acts by a scalar which is equal to $p^{j+k-6}$ . Note that

[TABLE]

Thus we can ignore the $p^{3}S$ term above. We have

[TABLE]

and we are done. ∎
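The powers of $p$ in this proof can be tracked by substituting the decomposition of $T_{2}$ from Lemma 8.4 and the scalar $S=p^{j+k-6}$ into $Q_{2}=(p\cdot T_{2}+(p+p^{3})S)p^{2-k}$ . The sketch below only does this exponent bookkeeping; it does not verify the congruences themselves, and the name `q2_exponents` is hypothetical.

```python
def q2_exponents(j, k):
    """Exponent of p on each term of Q_2 = (p*T_2 + (p + p^3)*S) * p^(2-k).

    Uses T_2 = p^(k+j-6) U_2 + p^(k-3) Z_2 + p^(2k+j-6) V_2 (Lemma 8.4)
    and the fact that S acts by the scalar p^(j+k-6).
    """
    shift = 2 - k
    s = j + k - 6
    return {
        "U2": 1 + shift + (k + j - 6),      # equals j - 3
        "Z2": 1 + shift + (k - 3),          # equals 0
        "V2": 1 + shift + (2 * k + j - 6),  # equals j + k - 3
        "pS": 1 + shift + s,                # equals j - 3
        "p3S": 3 + shift + s,               # equals j - 1
    }


print(q2_exponents(4, 2))
print(q2_exponents(2, 2))
```

The $Z_{2}$ term always carries exponent $0$ . In weight $(j,2)$ with $j\geq 4$ every other exponent is at least $1$ , while in weight $(2,2)$ the $U_{2}$ and $pS$ terms both carry $p^{-1}$ , which is consistent with a separate congruence being needed for $\sigma=(2,2)$ .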

8.6. The Main Theorem on $q$ -expansions

Our main theorem is as follows (we use the notation of §6.4).

Theorem 8.13.

Let $\sigma=(j,2)$ for some $p-1>j\geq 2$ . Assume that $\overline{r}$ is as in Assumption 6.16. Assume, moreover, that

[TABLE]

Let $\mathfrak{m}$ denote the corresponding ideal of the Hecke algebra away from $p$ . Let $A$ denote a non-trivial power of the Hasse invariant of weight $k$ . Then the composite map:

[TABLE]

is injective, where $\pi_{\beta}$ denotes the projection onto the summand where $U-\beta$ and $Q_{2}-\alpha\beta$ (equivalently $Z_{2}-\alpha\beta$ ) are nilpotent.

Note that, by symmetry, the same result holds with $\beta$ replaced by $\alpha$ . Before beginning the proof of this theorem, we first prove a much easier analogue for $\mathrm{GL}(2)$ :

Lemma 8.14.

Let $X_{1}(N)$ denote the modular curve, and let $\overline{\rho}:G_{\mathbf{Q}}\rightarrow\mathrm{GL}_{2}(\overline{\mathbf{F}}_{p})$ be a modular representation of level $N$ and weight one over $\mathbf{F}_{p}$ such that $\overline{\rho}(\mathrm{Frob}_{p})$ has eigenvalues $\alpha$ and $\beta$ . Let $\mathfrak{m}$ denote the corresponding ideal of the Hecke algebra away from $p$ . Assume that

[TABLE]

If $A$ denotes a suitable power of the Hasse invariant of weight $k$ , then the composite map:

[TABLE]

is injective, where $\pi_{\beta}$ denotes the projection onto the quotient of homology where $U-\beta$ is nilpotent.

In both results, all of the corresponding maps are equivariant with respect to Hecke operators away from $p$ . It suffices to show that the image of the $\mathbf{T}$ -socle maps injectively, and hence we may work with coefficients over a finite field $k=\mathcal{O}/\varpi$ of characteristic $p$ .

Proof of Lemma 8.14.

Let $M=H^{0}(X_{1}(N),\omega_{\mathcal{O}/\varpi^{m}})_{\mathfrak{m}}$ and $N=H^{0}(X_{1}(N),\omega^{k+1}_{\mathcal{O}/\varpi^{m}})_{\mathfrak{m}}$ . The map $M\rightarrow N$ is certainly injective, as can be seen by the $q$ -expansion principle (the map is the identity on $q$ -expansions). Let $U$ denote the action of $T$ on $N$ . Then $U$ satisfies the polynomial $U^{2}-TU+\langle p\rangle=0$ on the image of $M$ , and so $M$ lies inside the ordinary subspace of $N$ , and so inside $N_{\alpha}\oplus N_{\beta}$ , where $N_{\gamma}$ is the factor of $N$ on which $(U-\gamma)$ is nilpotent. We have operators $U$ and $V$ defined by the formulae

[TABLE]

and $T=U+\langle p\rangle V$ in weight $1$ , whereas $T=U$ in higher weight. The projection operator:

[TABLE]

is given by $\pi_{\beta}=(U-\alpha)^{m}$ for some integer $m$ . Suppose that $F\in M$ satisfies $\pi_{\beta}(F)=0$ . We have the identity $UVF=F$ , and we may reduce to the case that $\langle p\rangle F=\alpha\beta F$ . We are assuming that $F=F_{\alpha}\in N_{\alpha}$ . Let us write

[TABLE]

Note that $U$ is invertible on $N_{\alpha}$ . Since $TF$ also lies in $N_{\alpha}\oplus N_{\beta}$ , we deduce that $VF$ lies in $N_{\alpha}\oplus N_{\beta}$ . Yet $UVF=F\in N_{\alpha}$ , and so $VF\in N_{\alpha}$ , and moreover $\langle p\rangle VF=\alpha\beta U^{-1}F_{\alpha}$ . It follows that

[TABLE]

If $G_{\alpha}\neq 0$ , then the latter expression is non-zero, since applying $U$ gives $UG_{\alpha}-\beta G_{\alpha}$ and $\beta\neq\alpha$ . On the other hand, $G_{\alpha}$ is deeper in the filtration of $N_{\alpha}$ given by

[TABLE]

and hence, replacing $F$ by $(T-\alpha-\beta)F$ sufficiently many times, we may assume that $G_{\alpha}=0$ , that $UF_{\alpha}=\alpha F_{\alpha}$ , and that $(T-\alpha-\beta)F_{\alpha}=0$ . We are thus left with a form $F$ such that:

[TABLE]

We may now achieve a contradiction based purely on a computation with formal $q$ -expansions. For example, the identity $VF=\beta F$ is impossible as soon as either $\beta\neq 1$ or $F$ is a cusp form, simply by considering the exponent of the smallest coefficient. Alternatively, a non-formal argument using properties of modular forms would be to note that $\theta VF=0$ , and then use the fact that $\theta$ has no kernel in low weight (by [Kat77]). ∎
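The formal manipulations in this proof can be illustrated on truncated $q$ -expansions. The following is a minimal sketch, assuming the standard formulas $a_{n}(UF)=a_{pn}(F)$ and $a_{pn}(VF)=a_{n}(F)$ on expansions stored as dictionaries from exponents to coefficients. The Python helper names, including `smallest_exponent`, are illustrative only.

```python
def U(f, p):
    """a_n(UF) = a_{pn}(F): keep exponents divisible by p and divide them by p."""
    return {n // p: a for n, a in f.items() if n % p == 0}


def V(f, p):
    """a_{pn}(VF) = a_n(F): substitute q -> q^p."""
    return {p * n: a for n, a in f.items()}


def smallest_exponent(f):
    """Smallest exponent with a non-zero coefficient, or None when F = 0."""
    support = [n for n, a in f.items() if a != 0]
    return min(support) if support else None


p = 5
# Toy cuspidal expansion (no constant term); not an actual modular form.
F = {1: 1, 2: 3, 5: 4, 7: 2}
print(U(V(F, p), p) == F)
print(smallest_exponent(F), smallest_exponent(V(F, p)))
```

The first line printed reflects the identity $UVF=F$ . The second shows that $V$ multiplies the smallest exponent by $p$ , so for a non-zero expansion without constant term $VF$ cannot be a non-zero multiple of $F$ .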

A different proof of this theorem is given in [CG18]; the point is that the proof given here avoids any geometry. The proof below is somewhat in this spirit: using some elementary reductions, we arrive, given an element of $\ker(\pi_{\beta})$ , at a form $F$ which is simultaneously acted upon by a collection of formal operators in a very constrained way. The identities we get are not quite enough to deduce that $F=0$ as formal $q$ -expansions; however, they are enough to produce forms of low weight inside the kernel of various theta operators, which will be enough to produce a contradiction by Theorems 8.11 and 8.10. No doubt (see §1.3) there will be better geometric replacements for this argument, so we apologize in advance for the somewhat messy approach that we present here.

As in the proof above, let us write:

[TABLE]

The map $M\rightarrow N$ is certainly injective, as can be seen by the $q$ -expansion principle (the map is the identity on $q$ -expansions). By abuse of notation, we view $M\subset N$ under this map. Since $\alpha\beta\neq 0$ , the operator $Q_{2}$ acts invertibly on $M$ . Depending on the weight $\sigma$ , the operator $Q_{2}$ acts on $M$ either as $Z_{2}$ or as $Z_{2}+X_{2}$ .

Lemma 8.15.

Assume that $\alpha$ and $\beta$ are as in Theorem 8.13. Suppose that $\sigma=(j,2)$ with $j>2$ . Then $M=Q_{2}M=Z_{2}M$ , and $M$ is a subspace of the submodule of $N$ on which $U$ is invertible. If $\sigma=(2,2)$ , then $Z_{2}$ acts on $N$ , the map $M\rightarrow Z_{2}M$ is injective, and $Z_{2}M\subset N$ is a subspace of the submodule of $N$ on which $U$ is invertible.

Proof.

In the first case, by assumption we know that $Q_{2}-\alpha\beta$ is nilpotent, and so $Q_{2}$ induces an isomorphism of $M$ . On the other hand, the operator $Q_{2}$ acts via the formal operator $Z_{2}$ . In weight $\tau=(j+k,2+k)$ , the corresponding operator $Q_{2}$ also acts via $Z_{2}$ , and so we deduce that $Q_{2}-\alpha\beta$ acts on $M\subset N$ and acts nilpotently. Yet $Q_{2}$ only acts invertibly on the ordinary part of $N$ , as can be seen by lifting to characteristic zero. Now let us consider the case of weight $\sigma=(2,2)$ . We have

[TABLE]

Now $Q_{2}$ acts on $N$ by $Z_{2}$ , so certainly $Z_{2}M\subset N$ . Since $Q_{2}$ acts by $Z_{2}+X_{2}$ on $M$ , there is a commutative diagram as follows:

[TABLE]

where (by Lemma 8.6) we use the fact that $Z_{2}X_{2}=0$ . Since the left hand side is an isomorphism, it follows that $Z^{2}_{2}M=Z_{2}M$ , and hence that $Z_{2}$ acts invertibly on $Z_{2}M$ , and as in the previous argument it follows that $Z_{2}$ and hence $U$ is invertible on this space.

Hence it suffices to show that $Z_{2}F\neq 0$ for any $F\in M$ . Suppose that $Z_{2}F=0$ . Then $Q_{2}F=Z_{2}F+X_{2}F=X_{2}F$ . Since $Q_{2}F\in M$ , we have $X_{2}F\in M$ . Yet then (again by Lemma 8.6) we have $Q^{2}_{2}F=(Z_{2}+X_{2})X_{2}F=X^{2}_{2}F$ , and then $Q^{3}_{2}F=X^{3}_{2}F=X_{2}F$ , and so $Q_{2}F=X_{2}F\neq 0$ is an eigenvector of $Q_{2}$ with eigenvalue $\lambda$ satisfying $\lambda^{2}=1$ . Yet the only generalized eigenvalue of $Q_{2}$ is $\alpha\beta$ , and by assumption $(\alpha\beta)^{2}\neq 1$ . ∎

(Note that this is the point in this paper which uses the assumption $(\alpha\beta)^{2}\neq 1$ rather than the weaker claim $\alpha\beta\neq 1$ which is sufficient for arguments on the Galois side.)

Lemma 8.16.

The operator $U(U-\alpha)(U-\beta)$ acts nilpotently on $N$ .

Proof.

This follows by lifting to characteristic zero and noting that the only possible unit crystalline eigenvalues of Frobenius of a lift of $\overline{r}$ are $\alpha$ or $\beta$ modulo $\mathfrak{m}$ . ∎

Lemma 8.17.

Suppose that the composite $\pi_{\beta}:Z_{2}M\rightarrow N_{\beta}$ is not injective.

(1)

If $\sigma=(j,2)$ with $j>2$ , there exists a nonzero form $F=F_{\alpha}\in M\cap N_{\alpha}$ such that

[TABLE]

(2)

If $\sigma=(2,2)$ , there exists a nonzero form $F=F_{\alpha}+F_{0}$ with $F_{\alpha}\in N_{\alpha}$ and $F_{0}\in N_{0}$ such that:

[TABLE]

Proof.

First note that $TF=(U+Z)F\in M$ , and that $UF\in N$ , so $ZF\in N$ . Assume that $\sigma=(j,2)$ with $j>2$ . Note that $Z_{2}$ commutes with $U$ . Hence, after replacing $F\in\ker(\pi_{\beta})$ by $(Z_{2}-\alpha\beta)^{m}F=(Q_{2}-\alpha\beta)^{m}F$ for sufficiently large $m$ , we may assume that $Z_{2}F=\alpha\beta F$ . The assumption $\pi_{\beta}(F)=0$ implies that $F=F_{\alpha}\in N_{\alpha}$ . Clearly $UF\in N_{\alpha}$ also, and so $ZF=TF-UF\in N_{\alpha}\oplus N_{\beta}$ . Yet $Z_{2}=UZ$ , so we have

[TABLE]

(There can be no component in $N_{\beta}$ because $U$ is invertible on that space.) Write $(U-\alpha)F_{\alpha}=G_{\alpha}$ , so $UF_{\alpha}-G_{\alpha}=\alpha F_{\alpha}$ , or

[TABLE]

We infer that

[TABLE]

We claim that if $G_{\alpha}\neq 0$ , then the last expression is non-zero. This is because $U$ acts invertibly on $N_{\alpha}$ , and applying $U$ we get

[TABLE]

and $(U-\alpha)G_{\alpha}$ has a smaller nilpotence level than $G_{\alpha}$ , and $(\alpha-\beta)\neq 0$ . In particular, replacing $F$ by $(T-\alpha-\beta)F$ , we may find more elements in $M$ which also lie in the kernel of $\pi_{\beta}$ , and reduce to the case where $UF_{\alpha}=\alpha F_{\alpha}$ and $Z_{2}F_{\alpha}=UZF_{\alpha}=\alpha\beta F_{\alpha}$ . However, in this case, we also see that $ZF_{\alpha}=\beta F_{\alpha}$ , and the required equalities follow.

Now suppose that $\sigma=(2,2)$ . Let us write $\pi_{\beta}:Z_{2}M\subset N_{\alpha}\oplus N_{\beta}\rightarrow N_{\beta}$ as $(U-\alpha)^{m}$ , and so $(U-\alpha)^{m}Z_{2}F=0$ for some $F\neq 0$ . Since $Z_{2}$ formally commutes with $U$ , we also get

[TABLE]

so $Z_{2}$ preserves the property of $Z_{2}F$ lying in the kernel of $\pi_{\beta}$ . But

[TABLE]

because $Z_{2}X_{2}=0$ . Hence, if $Z_{2}F$ lies in the kernel of $\pi_{\beta}$ , then so does

[TABLE]

Hence we may repeatedly replace $F$ by $(Q_{2}-\alpha\beta)F=(Z_{2}+X_{2}-\alpha\beta)F$ , and thus replace $F$ by a form such that $Q_{2}F=\alpha\beta F$ and $Z_{2}F\in N_{\alpha}$ . Now, as above, we may write

[TABLE]

We are assuming that $Q_{2}F=\alpha\beta F$ , and so

[TABLE]

Thus we deduce that $X_{2}F=(0,0,\alpha\beta F_{0})$ and $Z_{2}F=(\alpha\beta F_{\alpha},0,0)$ . We once more would like to use that $T=U+Z$ implies that $ZF\in N$ . However, we no longer know (or expect) that $ZF$ is ordinary. Nevertheless, since $Z_{2}=UZ$ and $ZF\in N$ , we certainly deduce that

[TABLE]

for some $G_{0}$ in the kernel of $U$ . Our arguments are similar to those used above. We write $(U-\alpha)F_{\alpha}=G_{\alpha}$ , so $UF_{\alpha}-G_{\alpha}=\alpha F_{\alpha}$ , or

[TABLE]

This implies that

[TABLE]

The first term lies in a space where $(U-\alpha)$ is nilpotent, but it has a smaller nilpotence level than $F_{\alpha}$ by construction. Moreover, if it is equal to zero, then

[TABLE]

where $(U-\alpha)G_{\alpha}=H_{\alpha}$ has yet a higher level of nilpotence. In particular, this can equal zero only if either $\alpha=\beta$ or $G_{\alpha}=0$ . Since we are explicitly forbidding the former, we may assume, by induction, that $F_{\alpha}\neq 0$ is a $U$ -eigenvector, and so

[TABLE]

This implies that $Z_{2}(T-\alpha-\beta)F=0$ , and thus (from the injectivity of $Z_{2}$ in Lemma 8.15) that $(T-\alpha-\beta)F=0$ , or that $F$ is a $T$ -eigenform. The required identities follow immediately upon writing $F=F_{\alpha}+F_{0}$ where $F$ is a $T$ -eigenform, $UF_{\alpha}=\alpha F_{\alpha}$ , and $X_{2}F=\alpha\beta F_{0}$ . ∎

At this point, to prove Theorem 8.13, it suffices to show that there are no Siegel modular forms which satisfy the above identities. For example, in weights $\sigma=(j,2)$ with $j>2$ , we would like to show that there is no form $F$ which is an eigenform for both $T$ and $U$ . We now examine what constraints these identities place on the Fourier coefficients of $F$ .

Remark 8.18 (Tripling).

A theme of [CG18], following previous work of Wiese [Wie14], was to prove that certain Galois representations were ordinary in two different ways by doubling, that is, mapping the form of low weight to forms of high weight in two different ways. This is also our argument in weights $(j,2)$ for $j\geq 4$ . However, in weight $(2,2)$ , we see a new phenomenon. When we pass to weight $(p+1,p+1)$ , we see that the space of low weight forms has been not merely doubled but tripled, with the image under the map $X_{2}$ lying in the kernel of $Z_{2}$ . What this must mean is that, in weight $(p+1,p+1)$ , any ordinary Galois representation coming from weight $(2,2)$ should have a non-ordinary lift in weight $(p+1,p+1)$ . This phenomenon does not happen for $\mathrm{GL}(2)$ , since forms of weight $p$ which are ordinary modulo $p$ are ordinary in characteristic zero by (boundary cases of) Fontaine–Laffaille theory. For $\mathrm{GSp}(4)$ , however, the Hodge–Tate weights in weight $(p+1,p+1)$ are $[0,p-1,p,2p-1]$ , which are well beyond the Fontaine–Laffaille range. One can also ask what is the exact relationship between the tripling argument here in weight $(2,2)$ and the doubling version of [BCGP18] at Klingen level. For our purposes, this would require proving that there exists a (Hecke equivariant away from $p$ ) injection from our space of forms $M$ at spherical level to a space of ordinary forms (with respect to the operator denoted $U_{\mathrm{Kli},2}$ in [BCGP18]) at Klingen level also in weight $(2,2)$ . While this should certainly be true, we have not attempted to prove it.
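The comparison with the Fontaine–Laffaille range can be made concrete. The sketch below assumes that the Hodge–Tate weights in weight $(j,k)$ are $[0,k-2,j-1,j+k-3]$ (a formula chosen here because it reproduces the list $[0,p-1,p,2p-1]$ quoted above for $(p+1,p+1)$ ), and it takes the Fontaine–Laffaille range to mean that the weights lie in an interval of length at most $p-2$ ; conventions differ at the boundary. Both function names are hypothetical.

```python
def hodge_tate_weights(j, k):
    """Assumed Hodge-Tate weights in weight (j, k) with j >= k >= 2."""
    return [0, k - 2, j - 1, j + k - 3]


def in_fontaine_laffaille_range(weights, p):
    """Assumed convention: the weights span an interval of length <= p - 2."""
    return max(weights) - min(weights) <= p - 2


p = 7
low = hodge_tate_weights(2, 2)
high = hodge_tate_weights(p + 1, p + 1)
print(low, in_fontaine_laffaille_range(low, p))
print(high, in_fontaine_laffaille_range(high, p))
```

With $p=7$ the weight $(2,2)$ list has spread $1$ , while the weight $(8,8)$ list has spread $2p-1=13$ , far outside an interval of length $p-2=5$ .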

8.7. Binary quadratic forms

Definition 8.19.

We define a set with multiplicities $\mathcal{F}(Q)$ of equivalence classes of $p$ -integral binary quadratic forms as follows. For each $M\in\mathcal{S}$ (with $\mathcal{S}$ as defined in Lemma 8.3), we add $[P]$ to $\mathcal{F}(Q)$ if and only if there exists a $P\in[P]$ such that $Q=M.P$ . In particular, $M$ contributes a class $[P]$ if and only if $[M^{-1}.Q]$ is $p$ -integral.

An easy lemma shows that $\mathcal{F}(Q)$ only depends on $[Q]$. A binary quadratic form defines a section of $\mathcal{O}(2)$ on $\mathbf{P}^{1}(\mathbf{F}_{p})$, the latter of which is in natural bijection with $\mathcal{S}$ (recall that $\mathcal{S}$ is the coset space of $\operatorname{diag}(1,p)$ in $\overline{\Gamma}\subset\mathrm{SL}_{2}(\mathbf{Z})$). We see that $M^{-1}.Q$ is $p$-integral if and only if the corresponding quadratic form has a zero at the corresponding point in $\mathbf{P}^{1}(\mathbf{F}_{p})$. In particular, $\mathcal{F}(Q)$ is empty if $Q$ does not represent zero. Moreover, the cardinality of $\mathcal{F}(Q)$ is given by the number of zeros of $Q$, and is thus equal to $0$, $1$, or $2$ if $Q$ is $p$-primitive. (If $Q$ is not $p$-primitive, then $Q\equiv 0\mod p$ and $\mathcal{F}(Q)$ has cardinality $p+1$.)

The definition of $\mathcal{F}(Q)$ is motivated by the following observation: There is an identity

[TABLE]

where $P\in[P]$ is some (any) element in $[P]$ such that $M_{P}.P=Q$ for $M_{P}\in\mathcal{S}$ .

Lemma 8.20.

If $[P]\in\mathcal{F}([Q])$ , then $[Q]\in\mathcal{F}([P])$ .

Proof.

Replacing $Q$ by $g.Q$ for some $g\in\overline{\Gamma}\subset\mathrm{SL}_{2}(\mathbf{Z})$ , we may assume that $Q=M.P$ where

[TABLE]

Yet then $pM^{-1}.Q=M^{-1}.Q=P$ , and $pM^{-1}\in\mathcal{S}$ . ∎

Let $d(Q)$ denote the discriminant of $Q$ .

Lemma 8.21.

Suppose that $Q$ is $p$ -primitive. Let $D=d(Q)$ . Then either:

(1) $(D/p)=-1$, and $\mathcal{F}([Q])$ is empty.

(2) $(D/p)=0$, and $\mathcal{F}([Q])$ has exactly one element.

(3) $(D/p)=+1$, and $\mathcal{F}([Q])$ has exactly two elements.

Proof.

This follows from the fact that a $p$-primitive form $Q$ has exactly $0$, $1$, or $2$ solutions in $\mathbf{P}^{1}(\mathbf{F}_{p})$, depending on whether $(D/p)$ is $-1$, $0$, or $1$ respectively. Note that (in the final case) $\mathcal{F}([Q])$ may consist of the same class with multiplicity two. This happens, for example, if $(D/p)=1$ and the class number of $D$ is one. ∎
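The trichotomy in Lemma 8.21 can be checked numerically. The following is a minimal sketch, assuming $p$ is an odd prime, that $Q=mx^{2}+rxy+ny^{2}$, and that the discriminant is normalized as $D=r^{2}-4mn$ (the text may use a different normalization of $d(Q)$). The function names are illustrative and do not come from the text. It counts the zeros of $Q$ on $\mathbf{P}^{1}(\mathbf{F}_{p})$ by brute force and compares the count with $1+(D/p)$.

```python
def legendre(d, p):
    """Legendre symbol (d/p) for an odd prime p, returned as -1, 0 or 1."""
    d %= p
    if d == 0:
        return 0
    return 1 if pow(d, (p - 1) // 2, p) == 1 else -1


def projective_points(p):
    """Representatives [x : y] of the p + 1 points of P^1(F_p)."""
    return [(x, 1) for x in range(p)] + [(1, 0)]


def count_zeros(m, r, n, p):
    """Number of zeros in P^1(F_p) of Q(x, y) = m x^2 + r x y + n y^2."""
    return sum(
        1
        for (x, y) in projective_points(p)
        if (m * x * x + r * x * y + n * y * y) % p == 0
    )


def check_trichotomy(p, bound=6):
    """Compare the zero count with 1 + (D/p) for all forms with small coefficients.

    Assumption: D = r^2 - 4 m n. Forms with all coefficients divisible by p
    (not p-primitive) are expected to vanish at all p + 1 points instead.
    """
    for m in range(-bound, bound + 1):
        for r in range(-bound, bound + 1):
            for n in range(-bound, bound + 1):
                zeros = count_zeros(m, r, n, p)
                if m % p == 0 and r % p == 0 and n % p == 0:
                    if zeros != p + 1:
                        return False
                    continue
                disc = r * r - 4 * m * n
                if zeros != 1 + legendre(disc, p):
                    return False
    return True
```

For instance, `count_zeros(1, 2, 1, 5)` looks at $(x+y)^{2}$, whose discriminant is zero, and finds a single zero, matching case (2) of the lemma.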

In light of Lemma 8.17, to prove Theorem 8.13, it suffices to prove the following.

Theorem 8.22.

Suppose that $F=\sum a(F,Q)q^{Q}$ is a Siegel modular $q$ -expansion of weight $\sigma=(j,2)$ in characteristic $p$ , where $p-1>j$ .

(1) Let $\sigma=(j,2)$ with $j\geq 4$, and suppose that $UF=\alpha F$ and $ZF=\beta F$ for some $\alpha,\beta$ with $\alpha\beta(\beta^{2}-1)\neq 0$. Then $F=0$.

(2) Let $\sigma=(2,2)$, and suppose that $F=F_{\alpha}+F_{0}$, where $UF_{\alpha}=\alpha F_{\alpha}$, $X_{2}F=\alpha\beta F_{0}$, and $ZF=\beta F+\alpha F_{0}$ for some $\alpha,\beta$ with $\alpha\beta(\beta^{2}-1)(\alpha^{2}\beta^{2}-1)\neq 0$. Then $F=0$.

Proof.

We first prove that there exists a $Q$ with $\det(Q)\not\equiv 0\mod p$. In particular, in weight $(2,2)$, we may also assume that $F_{0}=(\alpha\beta)^{-1}X_{2}F=0$, and thus have the equalities:

[TABLE]

In fact, we may assume these equalities hold in both cases, since we are assuming such an equality holds in the case of non-parallel weight. If $a(F,pP)\neq 0$ , then, since $a(F,pP)=a(UF,P)=\alpha\cdot a(F,P)$ , we have $a(F,P)\neq 0$ . Hence, if $F\neq 0$ , there exists a $p$ -primitive form $Q$ with $a(F,Q)\neq 0$ . Without loss of generality, assume that $Q$ is a $p$ -primitive form of minimal discriminant with $a(F,Q)\neq 0$ . By Lemma 8.21, $\mathcal{F}(Q)$ consists of a single class $[P]$ . It follows that

[TABLE]

If $P$ is not $p$ -primitive, then $P=pR$ for some $R$ , and then $a(F,R)\neq 0$ , contradicting the minimality of $Q$ (note that $P$ and $Q$ have the same discriminant). Hence $P$ is also $p$ -primitive. Yet then $\mathcal{F}(P)$ consists of a single element, which must be $[Q]$ by Lemma 8.20. Yet then it follows that

[TABLE]

Here we use that $P=M_{Q}.Q=M_{Q}.M_{P}.P$ , and thus $\rho(M_{Q}.M_{P})=\rho(p\cdot I)$ is the identity in weight $(2,2)$ and zero in higher weight. If $j>2$ we are done, and if $\sigma=(2,2)$ , we are done since $\beta^{2}-1\neq 0$ .

Remark 8.23.

As an alternative to this argument, one could use an analogue of Theorem 8.10 to show that the kernel of $\Theta$ is trivial in low weight (but this would require formulating and then proving such a theorem for non-parallel weight).

We may therefore assume that $a(F,Q)\neq 0$ for some $Q$ of discriminant $D$ prime to $p$ .

8.8. The case $\sigma=(2,2)$ .

Let us now assume that $\sigma=(2,2)$. The coefficient $a(X_{2}F,Q)$ is equal to $(D/p)a(F,R)$, where $D=D_{Q}$ is the discriminant of $Q$. Hence, since $ZF=\beta F+\beta^{-1}X_{2}F$, we deduce that, if $(D/p)=-1$, then

[TABLE]

Assuming that $\beta^{2}\neq 1$ , we deduce that $a(F,Q)=0$ . It follows that the only $Q$ with $a(F,Q)\neq 0$ have $D=\det(Q)$ satisfying $(D/p)=0,1$ . In particular, the form

[TABLE]

lies in the kernel of $\Theta$. Yet this implies that $F-X_{2}F$ is trivial by Theorem 8.10. But this implies that $Z_{2}F=Z_{2}X_{2}F=0$, and this contradicts the injectivity of $Z_{2}:M\rightarrow N$ in Lemma 8.15.

8.9. The case $\sigma=(j,2)$ with $j\geq 4$

We may assume that $a(F,Q)\neq 0$, where $Q$ is $p$-primitive and $D=d(Q)$ is non-zero. If $(D/p)=-1$, then $a(ZF,Q)=0$, contradicting the non-vanishing of $a(F,Q)$ and the identity $ZF=\beta F$. Hence we may assume that $(D/p)=1$. The action of $\overline{\Gamma}\subset\mathrm{SL}_{2}(\mathbf{Z})$ on binary quadratic forms of discriminant $D$ has a finite orbit which may be identified with a ray class group. The assumption on $D$ implies that $Q$ has exactly two zeros in $\mathbf{P}^{1}(\mathbf{F}_{p})$. For either of the zeros (say $\xi$), we may consider the corresponding quadratic form

[TABLE]

where $M$ is a representative of an element in $\mathcal{S}$ corresponding to $\xi$ . The class of $P$ in the class group does not depend on the choice of representative of $M$ . The quadratic form $P$ also has two roots. We claim that, for one of those roots, there is a choice of representative $N$ for the element in $\mathcal{S}$ such that

[TABLE]

Indeed, if $N=pM^{-1}$ , then the corresponding identity is trivially satisfied. We may view the process of applying $Z$ dynamically as follows: The coefficient corresponding to a quadratic form $Q$ of discriminant $D$ with $(D/p)=+1$ of $ZF$ is given by a sum $\rho(M_{P})a(F,P)+\rho(M_{R})a(F,R)$ for a pair of quadratic forms $P$ and $R$ also of the same discriminant. The ray class group corresponding to $Q$ is partitioned by this process $Q\rightarrow\{P,R\}$ into a finite number of cyclic orbits, on which this operation takes a binary quadratic form to its two nearest neighbours (if the orbit has fewer than two elements, this pair of neighbours may have multiplicity). Let us now consider the coefficient $a(Z^{2}F,Q)$ . This consists of two pairs of two terms coming from the neighboring quadratic forms $P$ and $R$ respectively. From the above, for each neighbour $P$ , there will be a term of the form

[TABLE]

where the identity $\rho(MN)=0$ requires the assumption that $j>k$ . Hence $a(Z^{2}F,Q)$ will also be a sum of two terms coming from the quadratic forms of distance $2$ away from $Q$ inside its cyclic orbit. Let us consider one orbit of size $s$ . Then, we also see, modifying $M_{s}$ by an element of $\overline{\Gamma}$ if necessary, that

[TABLE]

where $A=M_{s}M_{s-1}\ldots M_{1}\in M_{2}(\mathbf{Z})$ has $\det(A)=p^{s}$ . Cycling the other way, we deduce the following:

Lemma 8.24.

Suppose that $F$ is a formal Siegel modular form of weight $(j,2)$ which is an eigenform of $Z$ with eigenvalue $\beta$ . Suppose that $Q$ has discriminant $D$ with $(D/p)=1$ . Then there exists an integer $s>0$ such that

[TABLE]

where

[TABLE]

We now make a small recap: At the beginning of the proof of Theorem 8.22, we proved that we could assume that $F$ had a non-zero coefficient $a(F,Q)$ where $Q$ has non-zero discriminant modulo $p$. If $(D/p)=-1$, then $a(ZF,Q)=0$, which (with $ZF=\beta F$) would imply that $F=0$. Hence we may assume there is a non-zero coefficient with $(D/p)=+1$ (which we exploit below) and use the following proposition to reach the final contradiction.

Proposition 8.25.

Suppose that $F$ is a formal Siegel modular form of weight $(j,2)$ modulo $p$ which is an eigenform of $Z$ with eigenvalue $\beta$ such that $\beta\neq 0$ , and suppose that $p>j-2$ . Suppose that $F$ has a non-zero coefficient $a(F,Q)$ where $(D/p)=1$ . Then $\theta_{1}F=0$ .

Proof.

The map $\theta_{1}$ is induced from the contraction map

[TABLE]

(this is well defined integrally as long as $p>j-2$ ). In particular, we have the identity

[TABLE]

where $\mathrm{con}$ denotes the contraction map. We claim that $\mathrm{con}(\mathrm{Sym}^{j-2}[A]x\otimes Q^{\vee})=0$ for any $x\in\mathrm{Sym}^{j-2}V$ , where $V=k^{2}$ . Once we have this, we deduce that $\beta^{s}a(\theta_{1}F,Q)=a(\theta_{1}Z^{s}F,Q)=0$ , and since $\beta\neq 0$ , we have $a(\theta_{1}F,Q)=0$ and $\theta_{1}F=0$ .

While there is probably an easy coordinate free way to prove the required claim, it is also simple enough to do the computation explicitly by writing everything out in terms of bases. Let us write down a standard basis $\{f_{1},f_{2}\}$ for $V$ and a standard basis $\{e_{1},e_{2}\}$ for $V^{\vee}$ . To be explicit, we choose bases such that a form

[TABLE]

gives rise to the element $mf^{2}_{1}+rf_{1}f_{2}+nf^{2}_{2}$ , and $Q^{\vee}$ gives rise to $me^{2}_{1}+re_{1}e_{2}+ne^{2}_{2}$ . With respect to this choice, the contraction map on $\mathrm{Sym}^{2}\otimes\mathrm{Sym}^{2}$ (up to scalar) corresponds to sending $e^{2}_{1}f^{2}_{2}$ and $e^{2}_{2}f^{2}_{1}$ to $-2$ and $e_{1}f_{1}e_{2}f_{2}$ to $1$ , and sending all other monomials to zero. As a consistency check, note that

[TABLE]
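The pairing rule just described is easy to experiment with. The following is a minimal sketch in Python; the function name `contraction` and the encoding of quadratics as coefficient triples are illustrative assumptions, not notation from the text. It applies the stated rule on $\mathrm{Sym}^{2}\otimes\mathrm{Sym}^{2}$ and confirms two consequences of that rule: pairing $Q$ with $Q^{\vee}$ returns $r^{2}-4mn$, and $f_{1}^{2}$ pairs to zero with any dual form that has no $e_{2}^{2}$ term.

```python
def contraction(sym2_v, sym2_dual):
    """Apply the stated pairing rule (up to scalar) on Sym^2 (x) Sym^2.

    sym2_v    = (a, b, c) encodes a*f1^2 + b*f1*f2 + c*f2^2
    sym2_dual = (x, y, z) encodes x*e1^2 + y*e1*e2 + z*e2^2
    Rule taken from the text: e1^2 f2^2 -> -2, e2^2 f1^2 -> -2,
    e1 f1 e2 f2 -> 1, and every other monomial -> 0.
    """
    a, b, c = sym2_v
    x, y, z = sym2_dual
    return -2 * x * c - 2 * z * a + y * b


def pair_form_with_dual(m, r, n):
    """Pair Q = (m, r, n) with Q^vee = (m, r, n), using the bases in the text."""
    return contraction((m, r, n), (m, r, n))
```

With this encoding, `pair_form_with_dual(m, r, n)` evaluates to `r*r - 4*m*n`, and `contraction((1, 0, 0), (m, r, 0))` is zero for every `m` and `r`, since $f_{1}^{2}$ only meets the $e_{2}^{2}$ coefficient.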

Similarly, the contraction mapping on $\mathrm{Sym}^{j-2}\otimes\mathrm{Sym}^{2}$ for $p>j-2$ satisfies

[TABLE]

The formula $Q=AQA^{t}\det(A)^{-1}$ continues to hold if we replace $Q$ by $M.Q=MQM^{t}$ and $A$ by $MAM^{-1}$ for some invertible $M$. In particular, we may replace $A$ by any integral conjugate. We consider two cases.

(1) $A$ has a non-zero eigenvalue mod $p$. In this case (by Hensel’s Lemma), the matrix $A$ has an eigenvalue over $\mathbf{Z}_{p}$, and a second eigenvalue which has valuation $s$. In particular, after a change of basis, we may write

[TABLE]

The conditions $AQA^{t}=\det(A)Q$ and $\det(A)=p^{s}$ imply that $n\equiv 0\mod p^{s}$ (multiply out and consider the bottom right entry), and thus that $Q^{\vee}=me^{2}_{1}+re_{1}e_{2}\mod p$. But now the image of $A$ on $k$ is generated by $f_{1}$, and so the image of $\mathrm{Sym}^{j-2}[A]x$ is given by $f^{j-2}_{1}$. But this forces the contraction after tensoring with $Q^{\vee}$ to be zero over $k$, because the only monomial which $f^{j-2}_{1}$ contracts non-trivially with is $e^{2}_{2}$.

(2) $A$ is nilpotent modulo $p$. If $A$ is trivial modulo $p$ there is nothing to prove. On the other hand, if

[TABLE]

then once again the image of $A$ is generated by $f$ , and the conditions $AQA^{t}=\det(A)Q$ and $\det(A)=p^{s}$ imply once more that $n\equiv 0\mod p$ (multiply out as above but now consider the top left entry), and the proof proceeds as in the previous case.

This completes the proof of the proposition. ∎

Combining Prop. 8.25 with Lemma 8.17 and Theorem 8.11, we obtain a contradiction, and this completes the proof of Theorem 8.13.

∎

9. Modularity Lifting

The following theorem is the main result of this paper.

Theorem 9.1.

Let $\overline{r}:G_{\mathbf{Q}}\to\mathrm{GSp}_{4}(k)$ be a continuous, odd, absolutely irreducible Galois representation. Suppose that $\nu(\overline{r})=\epsilon^{-(a-1)}$ where $p-1>a\geq 2$ . Suppose that the following hold:

(1) There exist units $\alpha$ and $\beta$ in $k$ such that

[TABLE]

and moreover $(\alpha^{2}-1)(\beta^{2}-1)(\alpha^{2}\beta^{2}-1)(\alpha-\beta)\neq 0$.

(2) Let $S(\overline{r})$ denote the set of primes of $\mathbf{Q}$ away from $p$ at which $\overline{r}$ is ramified. Then for each $x\in S(\overline{r})$, the restriction $\overline{r}|G_{x}$ falls into one of the cases of Assumption 4.3.

(3) (Big Image) The restriction $\overline{r}|G_{\mathbf{Q}(\zeta_{p})}$ has big image in the sense of Assumption 4.1.

(4) The representation $\overline{r}$ is Katz modular of weight $\sigma:=(a,2)\in X^{*}(T)^{+}_{M}$ in the sense of Definition 6.15.

(5) (Neatness) $\overline{r}$ satisfies Assumption 4.2.

We now introduce some notation: let $K\subset\mathrm{GSp}_{4}(\mathbb{A}^{\infty})$ be the compact open subgroup defined as in the beginning of Section 6.3. Let $X=X_{K}$ , and for any set of primes $Q$ disjoint from $S(\overline{r})\cup\{p\}$ , let $X_{i}(Q)=X_{K_{i}(Q)}$ . Let the Hecke algebras $\mathbf{T}_{\sigma}$ and $\mathbf{T}^{\mathrm{an}}_{\sigma}(Q)$ be as in Definition 5.13. The assumption that $\overline{r}$ is Katz modular implies that there is a maximal ideal $\mathfrak{m}_{\emptyset}$ of $\mathbf{T}_{\sigma}$ associated to $\overline{r}$ . The pullback of $\mathfrak{m}_{\emptyset}$ to $\mathbf{T}^{\mathrm{an}}_{\sigma}(Q)$ is also denoted $\mathfrak{m}_{\emptyset}$ . We further assume:

(6) If $Q$ satisfies Assumption 6.12 (2), then

[TABLE]

Let $R^{\min}$ be the universal deformation ring classifying minimal deformations of $\overline{r}$ in the sense of Definition 4.6 (with $Q$ taken to be empty). Then the map

[TABLE]

which classifies the minimal deformation of Theorem 6.17 (with $Q$ taken to be empty), is an isomorphism. Furthermore, the space

[TABLE]

is a free $\mathbf{T}_{\sigma,\mathfrak{m}_{\emptyset}}^{\alpha,\beta}$ module.

Note that, for $p\geq a\geq 4$, hypothesis (6) holds by Theorem 5.1.
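The non-degeneracy condition on $\alpha$ and $\beta$ in hypothesis (1) of Theorem 9.1 is a finite check. The following is a minimal sketch under the simplifying assumption that $k=\mathbf{F}_{p}$ (in general $k$ is only a finite field of characteristic $p$); the function name is illustrative. It tests whether $\alpha,\beta$ are units with $(\alpha^{2}-1)(\beta^{2}-1)(\alpha^{2}\beta^{2}-1)(\alpha-\beta)\neq 0$.

```python
def satisfies_nondegeneracy(alpha, beta, p):
    """Check the condition on (alpha, beta) from hypothesis (1), assuming k = F_p.

    Returns True when alpha and beta are units mod p and
    (alpha^2 - 1)(beta^2 - 1)(alpha^2 beta^2 - 1)(alpha - beta) != 0 mod p.
    """
    a, b = alpha % p, beta % p
    if a == 0 or b == 0:
        # alpha and beta are required to be units
        return False
    product = (a * a - 1) * (b * b - 1) * (a * a * b * b - 1) * (a - b)
    return product % p != 0
```

For example, with $p=11$ the pair $(\alpha,\beta)=(2,3)$ passes, while with $p=7$ the pair $(2,4)$ fails because $\alpha\beta\equiv 1$ and hence $\alpha^{2}\beta^{2}-1\equiv 0$.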

Proof.

To prove the theorem, we apply Proposition 3.3, as follows:

(1) Take $R=R^{\min}$ and $H=H^{0}(X,\omega(a,2)(-\infty)_{K/\mathcal{O}})^{\alpha,\beta,\vee}_{\mathfrak{m}_{\emptyset}}$.

(2) Let $q$ and the sets $Q_{N}$ be as in Proposition 4.10.

(3) The ring $R_{\infty}$ is the power series ring $\mathcal{O}[[x_{1},\dots,x_{q-1}]]$.

(4) For each $N\geq 1$, we define a surjection $R_{\infty}\twoheadrightarrow R$ as follows: Let $R_{Q_{N}}$ denote the universal deformation ring classifying deformations of $\overline{r}$ which are minimal outside $Q_{N}$, in the sense of Definition 4.6. Choose any surjection $R_{\infty}\twoheadrightarrow R_{Q_{N}}$ (possible by Proposition 4.10) and let $R_{\infty}\twoheadrightarrow R$ be the composite of this surjection with the natural map $R_{Q_{N}}\twoheadrightarrow R^{\min}$.

We define the module $H_{N}$ as follows: let $\Delta$ be the unique quotient of $\Delta_{Q_{N}}=\prod_{x\in Q_{N}}(\mathbf{Z}/x)^{\times}$ which is isomorphic to $(\mathbf{Z}/p^{N}\mathbf{Z})^{q}$ , and let $X_{\Delta}(Q_{N})\to X_{0}(Q_{N})$ be as in Section 7.2. Let $\mathfrak{m}_{N}$ be the ideal $\mathfrak{m}\subset\mathbf{T}_{\sigma}(Q_{N})$ of Theorem 7.2 when $Q$ is taken to be $Q_{N}$ . We then take

[TABLE]

and we regard it as an $R_{\infty}$-module via the surjection $R_{\infty}\twoheadrightarrow R_{Q_{N}}$ chosen above, and the classifying map $R_{Q_{N}}\twoheadrightarrow\mathbf{T}_{\sigma}(Q_{N})_{\mathfrak{m}_{N}}^{\alpha,\beta}$ associated to the deformation $r_{Q_{N}}$ of Theorem 6.17. The $S_{N}$-module structure on $H_{N}$ is given by choosing an identification $\Delta\cong(\mathbf{Z}/p^{N}\mathbf{Z})^{q}$.
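The quotient $\Delta$ can be made concrete one prime at a time. The following is a minimal sketch, assuming that each $x\in Q_{N}$ is a prime with $x\equiv 1\mod p^{N}$ (the usual condition on such auxiliary primes; the precise conditions are those of Proposition 4.10, which is not restated here). The function name is illustrative. Since $(\mathbf{Z}/x)^{\times}$ is cyclic of order $x-1$, the map $u\mapsto u^{(x-1)/p^{N}}$ has image of order exactly $p^{N}$, and that image is isomorphic to the unique quotient of $(\mathbf{Z}/x)^{\times}$ of order $p^{N}$.

```python
def p_power_quotient_image(x, p, N):
    """Image of u -> u^((x-1)/p^N) on (Z/x)^*, for a prime x = 1 mod p^N.

    Assumption (not from the text): x is prime and x = 1 mod p^N.
    The image is the subgroup of order p^N, which is isomorphic to the
    unique quotient of the cyclic group (Z/x)^* of that order.
    """
    if (x - 1) % (p ** N) != 0:
        raise ValueError("expected x congruent to 1 modulo p^N")
    exponent = (x - 1) // (p ** N)
    return {pow(u, exponent, x) for u in range(1, x)}


def delta_order(primes, p, N):
    """Order of the product of the p^N-quotients over a list of such primes."""
    order = 1
    for x in primes:
        order *= len(p_power_quotient_image(x, p, N))
    return order
```

For instance, with $p=5$, $N=1$ and the hypothetical set $\{11,31\}$, each factor contributes a cyclic group of order $5$, so the product has order $25=p^{Nq}$ with $q=2$.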

We need to check that, given these definitions, the conditions of Proposition 3.3 hold.

(a) The image of $S_{N}$ in $\mathrm{End}_{\mathcal{O}}(H_{N})$ is contained in the image of $R_{\infty}$ because under the Galois representation $r_{Q_{N}}$ of Theorem 6.17, the image of an element $\sigma\in I_{x}$, for $x$ a prime in $Q_{N}$, is conjugate to a matrix of the form $\operatorname{diag}(1,1,\langle u\rangle,\langle u\rangle)$ where $\mathrm{Art}_{x}(u)=\sigma$. This follows from [Sor10, Corollary 3].

(b) We have

[TABLE]

Combining this with the isomorphism of Theorem 7.2, we obtain an isomorphism:

[TABLE]

(c) Finally, $H_{N}$ is finite and balanced over $S_{N}$ by Theorem 7.11.

We can thus apply Proposition 3.3, and we deduce that $H$ is a finite free $R$ -module. Since the action of $R$ on $H$ factors through $\mathbf{T}_{\sigma,\mathfrak{m}_{\emptyset}}^{\alpha,\beta}$ , the conclusions of Theorem 9.1 follow immediately. ∎

Bibliography


[And87] Anatolij N. Andrianov, Quadratic forms and Hecke operators, Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 286, Springer-Verlag, Berlin, 1987. MR 884891
[Art04] James Arthur, Automorphic representations of $\mathrm{GSp}(4)$, Contributions to automorphic forms, geometry, and number theory, Johns Hopkins Univ. Press, Baltimore, MD, 2004, pp. 65–81. MR 2058604
[Art13] James Arthur, The endoscopic classification of representations: Orthogonal and symplectic groups, American Mathematical Society Colloquium Publications, vol. 61, American Mathematical Society, Providence, RI, 2013. MR 3135650
[BC09] Joël Bellaïche and Gaëtan Chenevier, Families of Galois representations and Selmer groups, Astérisque (2009), no. 324, xii+314. MR 2656025
[BC11] Joël Bellaïche and Gaëtan Chenevier, The sign of Galois representations attached to automorphic forms for unitary groups, Compos. Math. 147 (2011), no. 5, 1337–1352. MR 2834723
[BCGP18] George Boxer, Frank Calegari, Toby Gee, and Vincent Pilloni, Abelian surfaces over totally real fields are potentially modular, preprint, 2018.
[BFvdG08] Jonas Bergström, Carel Faber, and Gerard van der Geer, Siegel modular forms of genus 2 and level 2: cohomological computations and conjectures, Int. Math. Res. Not. IMRN (2008), Art. ID rnn100, 20. MR 2439544 (2009f:11054)
[BHR94] Don Blasius, Michael Harris, and Dinakar Ramakrishnan, Coherent cohomology, limits of discrete series, and Galois conjugation, Duke Math. J. 73 (1994), no. 3, 647–685. MR 1262930 (95b:11054)
