This paper extends Bella"iche's work on 2-dimensional pseudorepresentations, providing a conceptual interpretation of the image of such representations and applying the results to various arithmetic contexts involving modular forms.
Contribution
It enlarges Bella"iche's ring, interprets it via conjugate self-twists, and connects it to the adjoint trace ring, offering a more natural framework for big-image results.
Findings
01
The new ring $B$ is optimal among congruence subgroups in the image.
02
The results recover and extend big-image theorems for Galois representations.
03
The approach applies to elliptic, Hilbert, and Bianchi modular forms, and p-adic families.
Abstract
Bella\"iche has recently applied Pink-Lie theory to prove that, under mild conditions, the image of a continuous 2-dimensional pseudorepresentation ρ of a profinite group on a local pro-p domain A contains a nontrivial congruence subgroup of SL2(B) for a certain subring B of A. We enlarge Bella\"iche's ring and give this new B a conceptual interpretation in terms of conjugate self-twists of ρ, symmetries that naturally constrain its image. As a corollary, this new B is optimal among congruence subgroups contained in the image. We also interpret the new B vis-a-vis the adjoint trace ring of ρ, which we show is a more natural ring for these questions in general. Finally, we use our purely algebraic result to recover and extend a variety of arithmetic big-image results for GL2 Galois representations arising from elliptic, Hilbert, and Bianchi…
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Full text
Big images of two-dimensional pseudorepresentations
Bellaïche has recently applied Pink-Lie theory to prove that, under mild conditions, the image of a continuous 2-dimensional pseudorepresentation ρ of a profinite group on a local pro-p domain A contains a nontrivial congruence subgroup of SL2(B) for a certain subring B of A. We enlarge Bellaïche’s ring and give this new B a conceptual interpretation both in terms of conjugate self-twists of ρ, symmetries that constrain its image, and in terms of the adjoint trace ring of ρ, which we show is both more natural and the optimal ring for these questions in general. Finally, we use our purely algebraic result to recover and extend a variety of arithmetic big-image results for GL2 Galois representations arising from elliptic, Hilbert, and Bianchi modular forms and p-adic Hida or Coleman families of elliptic and Hilbert modular forms.
Key words and phrases:
Representations of profinite groups, images of p-adic Galois representations, pseudorepresentations
2010 Mathematics Subject Classification:
11F11, 11F80, 11F85
Let ρ:Π→GL2(A) be a continuous representation of a profinite group Π on a local pro-p domain A. How big or small can the image of ρ be? Arithmetic versions of this question, with ρ the p-adic Galois representation attached to a cuspidal modular eigenform — or more often a compatible family of such ρ considered adelically — have been considered since the 1960s, when Serre proved an adelic open-image result for the Galois representation on the Tate module of a non-CM
elliptic curve [Ser68]. Serre’s result was
adapted
by Ribet [Rib85](i)(i)(i)Ribet’s work appeared in a series of papers starting in 1975; [Rib85] is a convenient reference. and then by Momose [Mom81]
to the more
delicate setting of modular forms, where certain symmetries naturally bound the size of the image. Recently Nekovář [Nek12] generalized their work to Hilbert modular forms.
In the 1990s Pink began a purely algebraic study of this kind of question by characterizing closed pro-p subgroups of SL2(A) in terms of associated “Pink-Lie” algebras. Pink’s investigations fueled further exploration of arithmetic big-image questions, this time for Galois representations attached to p-adic Hida families of modular forms,
by Hida [Hid15] and then, accounting for Ribet–Momose-type symmetries, by Lang (second author here) [Lan16].
Simultaneously, Bellaïche began adapting Pink-Lie theory to the (pseudo)representation setting, obtaining abstract big-image
results, but in a form that was difficult to compare to the symmetry formulations of Ribet, Momose, and Lang.
In the present work we finally unite the two approaches, refining Bellaïche-Pink-Lie theory to relate p-adic big-image results to natural symmetry bounds in an abstract algebraic setting. We thereby recover the p-adic big-image results of Ribet, Momose, Nekovář, and Lang, improving the latter, under mild conditions on
ρ.
We also obtain the first big-image results for Galois representations attached to p-adic Coleman families of modular forms (rather than for associated rigid-analytic Lie algebras, as in Conti(first author here)–Iovita–Tilouine [CIT16]), p-adic families of Hilbert modular forms, and Bianchi modular forms. Along the way we propose shifting our
perspective
towards
formulating big-image results in terms of
rings of definition of
adjoint representations rather than in terms of rings fixed by symmetries.
We emphasize that our results require absolutely no arithmetic input and are provably optimal.
We explain how our results apply to a wide variety of modular Galois representations, and we anticipate that this framework can yield
even more arithmetic fruit, from understanding even Galois representations to relating reducibility/dihedrality ideals and automorphic congruence modules. Finally we hope that the algebraic nature of our results might portend similar phenomena in higher dimension.
Acknowledgements
The authors would like to thank Joël Bellaïche for helpful discussions about his work. We also thank Gebhard Böckle, Keith Conrad, Shaunak Deo, Mladen Dimitrov, Haruzo Hida, David Loeffler, Robert Pollack, and Jacques Tilouine for useful comments. Thanks to David Rohrlich for help with Clifford theory.
Finally, we thank the anonymous referee, whose detailed comments improved both the content and the exposition of this article, not once but twice.
All three authors express gratitude to the Max Planck Institute for Mathematics for creating space for us to get together; without the hospitality of the MPIM, this collaboration would not have begun.
The first author was supported by the DFG Forschergruppe 1920, the second and third authors by the National Science Foundation through awards DMS-1604148 (second author) and DMS-1703834 (third author). This article underwent extensive revisions under soft covid lockdown conditions. The third author thanks her husband and especially her nanny for months of childcare support.
Let p be an odd prime, A a local pro-p domain with maximal ideal m and (finite) residue field F:=A/m, and Π a profinite group. Let ρ:Π→GL2(A) be a continuous representation(ii)(ii)(ii)In fact we consider 2-dimensional pseudorepresentations, but we stick with representations for the introduction.
with the property that the residual representation ρ:=ρmodm is semisimple and multiplicity free: either absolutely irreducible, or a sum of two distinct characters to F×.
Roughly, the objective is to show that the image of ρ is as big as possible.
Note that if ρ, or its restriction to an index-2 subgroup of Π, is reducible, then the image of ρ is both well understood and not big. Similarly, one cannot expect a big-image result when the image of ρ is up to twist isomorphic to that of ρ, as happens when ρ arises from a modular form of weight one.
Let us call these three kinds of representations a priori small. The a priori small representations are exactly those that are not strongly absolutely irreducible.
Suppose now that ρ is not a priori small.
We cannot expect ρ to be surjective: even its determinant need not be surjective. Nor can we expect the image of ρ to contain all of SL2(A), unless the image of ρ contains all of SL2(F). Following ideas of Hida, we settle on the notion of fullness. If B is any ring and b⊆B is any nonzero ideal, the subgroup of SL2(B) given by the kernel of reduction modulo b is a congruence subgroup of SL2(B) (of levelb):
[TABLE]
If the image of ρ, up to conjugation, contains such a congruence subgroup, we say that ρ is B-full. A key part of the big-image game is the search for an optimal fullness ring — or rather for an optimal equivalence class of fullness peers, rings that each contain an ideal of the other.
Historically, the constraints on the fullness rings of a representation ρ have been described in terms of certain symmetries of ρ. If σ is an automorphism of A and η is a character of Π, the pair (σ,η)
is a conjugate self-twist of ρ if applying the automorphism gives the same representation as twisting by the character: σρ≅η⊗ρ.
If ρ has a
nontrivial conjugate self-twist (σ,η),
then
ρ cannot be A-full: indeed, the equation
[TABLE]
means that
the trace of ρ(g) is an eigenvector for σ viewed as a linear map over the σ-invariant scalars.
But the trace of a congruence subgroup of A is not so constrained. Accordingly, the known arithmetic big-image results — of Ribet and Momose, of Nekovář, of Lang, described in Section 1.2 below — have all established fullness with respect to AΣρ, the subring of A fixed by the conjugate self-twists of ρ.
Stepping outside the constraints of the arithmetic setting, however, reveals shortcomings of the AΣρ perspective: AΣρ may simply be too big for ρ to be AΣρ-full. For one thing, A itself may not see all the conjugate self-twists of ρ, as in Example 4.3, a failure of normality. Enlarging A may still not suffice if A has inseparable elements, as in Example 4.10, or worse yet, transcendental ones. The limitations are exactly those of
Galois theory: there may not be enough automorphisms to carve down deep enough to a fullness ring with conjugate self-twists alone.
Rather than carving down from above, we propose building a fullness ring from below. Let A0 be the adjoint trace ring of ρ,
the closed subring of A topologically generated
by the elements (trρ(g))2/detρ(g) for g∈Π,
the traces of the adjoint representation.
It is easy to see that this generating set is both twist-invariant and fixed by all conjugate self-twists. Thus A0 acts as a base ring for the conjugate self-twist automorphisms, and the question of whether A or its extensions have enough conjugate self-twists turns into the usual one of Galois theory: are there enough automorphisms to isolate the base? On the other hand, A0 is a potential fullness ring free from the limitations of AΣρ outlined above. Moreover, in all arithmetic settings where we recover existing fullness results, we show that A0 and AΣρ are always fullness peers, so that A0- and AΣρ-fullness are equivalent. Our first theorem shows that A0 is the optimal fullness ring in all cases.
Theorem A** (Optimality theorem. See Theorem 5.3).**
If ρ is B-full for some ring B, then a fullness peer of B is contained in A0.
Accompanying the A0-optimality theorem, we present the main result of this paper: A0-fullness.
We assume that the pro-p part of Π is topologically finitely generated: for more on this p-finiteness condition of Mazur, always satisfied by
characteristic-zero
local Galois groups and global Galois groups with ramification restricted to a finite set of places,
see Definition 2.6.
Theorem B*′*** (Main fullness theorem, preliminary version. See B and Theorem 10.1).**
Suppose that ρ satisfies a mild condition. If ρ is not a priori small, then ρ is A0-full.
For a more precise formulation and a detailed discussion of the mild condition, see B in Section 1.3 below. For now we merely note that the most limiting condition for applications is a regularity assumption: we require that the image of ρ contain a matrix whose eigenvalue ratio differs from ±1 but is contained in residue field of A0. For the most general version of our theorem, which is formulated for images of pseudorepresentations in the sense of Chenevier [Che14] as studied by Bellaïche [Bel19], see Theorem 10.1.
The last stand-alone result of this paper is a refinement of B*′* in the case where the residual image of ρ is large. Let E be the residue field of A0; here we assume that #E≥7.
If
Imρ⊇SL2(E) then Imρ contains SL2(A0) up to conjugation.
1.2. History
We now survey the history of big-image results,
both arithmetic and algebraic, using the terminology introduced above,
to situate B*′* in context. In all of the theorems stated in Section 1.2, A0-fullness is equivalent to AΣρ-fullness.
1.2.1. Classical modular forms
The big-image line of inquiry began in the late 1960s, when Serre showed that if ρ comes from the p-adic Tate module (including for p=2) of a non-CM elliptic curve over a number field F, so that Π=Gal(F/F) and A=A0=Zp, then ρ is Zp-full [Ser68, Theorem IV.2.2].(iii)(iii)(iii)Serre’s result is better known as an open-image theorem; and in fact he shows much more: the image of all the p-adic Tate modules for all p at once is open adelically.
In the 80s, Ribet and Momose generalized Serre’s theorem to elliptic modular forms. Let f be a cuspidal non-CM eigenform of weight at least 2. Given a prime p and an embedding ιp:Q↪Qp, one can associate to f a 2-dimensional Galois representation ρ=ριp of Π=Gal(Q/Q) over the ring of integers A of a finite extension Qp.
For all but finitely many primes p, the representation ρ is AΣρ-full.(iv)(iv)(iv)Like Serre, Ribet and Momose prove stronger adelic big-image results.
See Section 12.1 for more details.
More recently, Nekovář generalized Theorem 1.1 to representations coming from Hilbert modular forms, in which case Π is the absolute Galois group of a totally real number field and A is still a finite extension of Zp [Nek12, Appendices B.3–B.6].
Our main theorem (B*′*) recovers the at-p statements of both the Ribet–Momose and the Nekovář results, under the assumption that residual representation satisfies our regularity condition. See Section 12.1 and Section 12.2 for details.
1.2.2. Families of p-adic modular forms
Although we have stated the work of Serre, Ribet, Momose, and Nekovář for a fixed prime p to better fit our p-adic framework, all of these theorems are actually adelic open-image results proved using geometric methods. Much work has been done to generalize such theorems to groups other than GL2, but that is not the direction that interests us. Rather, we are interested in fixing p and deforming representations p-adically, which necessitates a completely different approach. There has been some progress in this direction in special cases. Recall that we are assuming throughout that p=2.
First we suppose that ρ arises from a non-CM cuspidal Hida family. In this case Π=Gal(Q/Q) and A is a finite domain over Λ:=Zp[[X]].
When A is a constant extension of Λ and the image of ρ contains SL2(Fp), Boston [MW86, Proposition 3] and Fischman [Fis02, Theorem 4.8] show that the image of ρ contains SL2(AΣρ), hence ρ is AΣρ-full.(v)(v)(v)All the works in Section 1.2.2 consider only conjugate self-twists that fix Λ; see Section 12.4 for details.
More recently, Hida proved that if ρ is locally-at-p multiplicity free then ρ is Λ-full [Hid15, Theorem I], but his work did not relate Λ to A0 or conjugate self-twists of ρ. Lang
then improved Hida’s result from Λ-fullness to AΣρ-fullness under the assumption that ρ is absolutely irreducible, proving the following result.
If ρ arises from a non-CM cuspidal Hida family, and ρ is absolutely irreducible and satisfies additional multiplicity-freeness conditions locally at p then ρ is AΣρ-full.
The case when ρ arises from a Coleman family was studied by Conti–Iovita–Tilouine [CIT16]. In this case we again have Π=Gal(Q/Q) and A is a domain finite over Λ. In [CIT16, Theorem 6.2] it is proved that, under hypotheses similar to those in Theorem 1.2, a certain rigid analytic Lie algebra attached to Imρ contains that
of a congruence subgroup of AΣρ.
This strongly suggests that ρ should be AΣρ-full, though this statement does not follow from [CIT16].
1.2.3. Abstract p-adic representations
Both Hida [Hid15] and Lang [Lan16] rely in a key way on results of Pink [Pin93] classifying, for odd p, pro-p subgroups of SL2(A) in terms of a correspondence with purely algebraically defined “Pink-Lie algebras”. The analogous role in [CIT16] is played by rigid-analytic Lie theory, whence
the different form of the conclusion in that case. Although the conclusions of the big-image theorems in all of [Hid15, Lan16, CIT16] are stated in terms of pure algebra
— a feature that is most clear in the fullness results of [Hid15] and [Lan16] — nonetheless all of their proofs are
arithmetic in nature: they rely on special information about the restriction of ρ to the local Galois group at p, and they
use the
results of Ribet and Momose as input.
In contrast, Bellaïche in [Bel19] studies the image of ρ:Π→GL2(A) in a purely algebraic way. More precisely, he applies Pink’s theory from [Pin93] to images of 2-dimensional (pseudo)representations with constant determinant, that is, with detρ equal to the Teichmüller lift of detρ. Bellaïche’s main application is to density results for mod-p modular forms, but along the way he also proves the following theorem, under the same p-finiteness assumption on Π (Definition 2.6).
Suppose that the image of ρ contains an element with eigenvalues in Fp× whose ratio is not ±1. If ρ has constant determinant and is not a priori small, then there is a subring Bρ(Fp) of A such that ρ is Bρ(Fp)-full.
Bellaïche’s ring Bρ(Fp) is defined as the subring of A topologically generated by a Zp-module I1(ρ) created out of the Pink-Lie algebra of Imρ. See Theorem 2.23 and the discussion following it for the definition of the generating set I1(ρ) and the ring Bρ(Fp).
Unfortunately, as Bellaïche himself notes, it is not straightforward to relate the ring Bρ(Fp) to the rings A0 or AΣρ from previous results. Indeed, he gives no conceptual interpretation of Bρ(Fp) at all.
The goal of the present work is to refine the definition of Bρ(Fp) by enlarging scalars and then give it a conceptual interpretation. Under mild assumptions we thus recover, and in the case of p-adic families improve, the results mentioned above in a uniform and purely algebraic way. We point out that prior to Bellaïche’s work, Hida’s work was the only fullness result when ρ is reducible and ρ comes from a p-adic family of modular forms. In the case of Coleman families, a true fullness result was not previously known. Additionally, we obtain first results in other GL2-contexts, including Galois representations attached to Bianchi modular modular forms and to p-adic families of Hilbert modular forms. See Section 12 for all the details.
1.3. Main theorem
We now state our main fullness theorem in more detail. Recall that A is a local pro-p domain with maximal ideal m and residue field F and Π is a p-finite profinite group. Let ρ:Π→GL2(A) be a representation with mod-m reduction ρ. Let A0 be the adjoint trace ring of ρ and E its residue field, so that E⊆F⊆Fp. We say that ρ is regular if there is some g0∈Π such that ρ(g0) has eigenvalues λ0,μ0∈F× with λ0μ0−1∈E×∖{±1}. See 2.20 for an analysis of this condition.
For the notion of goodness, see Definition 8.10.
Theorem B** (Main fullness theorem; see also Theorem 10.1).**
Assume that ρ is regular. If the projective image of ρ is isomorphic to S4, assume that ρ is good.
If ρ is not a priori small, then ρ is A0-full.
In fact we prove something slightly more general in that we can replace ρ by a pseudodeformation (t,d):Π→A of ρ: see Theorem 10.1 for a precise statement.
Recall that B is provably optimal, in the sense that, if ρ is B-full for some subring B of A, then a fullness peer of B is contained in A0 (see A or Theorem 5.3).
Let us point out some features of the statement of B. First, the group Π can be quite general. For that reason, representations coming from Hilbert modular forms and their p-adic families are no more difficult than representations coming from elliptic modular forms or their p-adic families. Similarly, since Π need not be the absolute Galois group of a number field, the notion of oddness does not play a role in the paper. In particular, B applies to deformations of ρ when ρ is an even Galois representation.
The proof of B proceeds via Bellaïche-Pink-Lie theory with various refinements and improvements. As in Bellaïche [Bel19], we linearize the pro-p normal core of the image of a constant-determinant ρ by considering its Pink-Lie algebra L. But rather than building up an A0-structure on L directly and interpreting this for Imρ, we proceed by showing fullness in turn for a sequence of fullness peer rings, first Bρ(E), and then AΣρ, and finally A0, which then proves A0-fullness for all the twists of ρ. More precisely, the argument proceeds in steps as follows.
(1)
We show that fullness rings and adjoint trace rings are unchanged under twisting by a character (Corollaries 3.12 and 4.8), so that we may assume that ρ has constant determinant.
2. (2)
For constant-determinant ρ, we show that A0 and AΣρ are fullness peers (Corollary 4.21).
Therefore it suffices to prove AΣρ-fullness in the constant-determinant case.
3. (3)
Here Bρ(E) is the W(E)-algebra generated by the same Zp-module I1(ρ) as Bellaïche’s Bρ(Fp) — see discussion following Theorem 1.3 above. Although a small improvement, this is crucial for the next
step, and it allows our regularity hypothesis to be weaker than that of Bellaïche.
4. (4)
We show that for ρ with constant determinant, Bρ(E) and AΣρ are fullness peers (Corollary 9.15), so that ρ is AΣρ-full.
More precisely, we show that Bρ(E) is always contained in AΣρ, and the discrepancy between them is explained on the residue field by a failure of conjugate self-twists of ρ to lift to ρ. The residue field of Bρ(E) is E=FΣρ, the fixed field of residual twists (Corollary 4.13), whereas the residue field of AΣρ is FΣρ, the field fixed by those residual twists that lift to twists of ρ. We show both that AΣρ=Bρ(FΣρ) and that the elements of FΣρ “missing” in Bρ(E) show up in its fraction field. This is the most delicate step of the argument.
We do not show that non-constant-determinant ρ are AΣρ-full: Examples 4.3 and 4.10 show that such a result is impossible in general, even under favorable regularity conditions. The key point is that AΣρ depends on ρ itself and may chance after twisting, whereas A0 depends only on the projective representation of ρ.
Nor do we show that A0=AΣρ for constant-determinant ρ — they are merely fullness peers: see Example 4.22.
The circuitous nature of our argument may well be merely a matter of our historical bias: the known arithmetic big-image results are formulated in terms of conjugate self-twists, so our original motivation was to relate Bellaïche’s Bρ(Fp) to AΣρ. The advantages of A0 revealed themselves only later. A more direct argument for proving B is the subject of our current investigation.
1.4. Structure of the paper
This article is informally organized into parts as follows.
Background:Section 2. We review known material: pseudorepresentations, generalized matrix algebras, Pink-Lie theory, and Bellaïche’s recent
results from [Bel19]. We also introduce our notion of regularity.
Our philosophy: In Sections 3 to 5 we present and justify our approach: our goal is to show that ρ is full with respect to the adjoint trace ring A0, which is both the optimal fullness result and equivalent to the historically familiar AΣρ-fullness of known applications.
–
Section 3 discusses the basic properties of fullness and fullness peer rings.
We prove that fullness is twisting-invariant (Corollary 3.12).
–
Section 4 studies the relationship between A0 and conjugate self-twists, paying particular attention to constant-determinant, and nearly so, settings where conjugate self-twists carve out the fraction field of A0 (Corollary 4.20), so that A0 and AΣρ are fullness peers.
–
Section 5 proves two related optimality results: every fullness ring has a fullness peer contained in A0 (Corollary 5.2) and is also fixed by all conjugate self-twists (Theorem 5.4).
Technical results:Sections 6 to 9 are the technical heart of the paper. We prove that constant-determinant ρ are Bρ(E)-full, which implies AΣρ-fullness, which in turn guarantees A0-fullness.
–
In Section 6 we show that the Bellaïche-Pink-Lie algebra L attached to ρ, a priori only a Zp-module, is in fact a W(E)-module. Under certain regularity assumptions, ρ is therefore Bρ(E)-full. This completes Step (3) of the proof.
–
In Section 7 we study residual conjugate self-twists and their lifting properties via the universal deformation ring — how does Σρ compare to Σρ?
–
Section 8 explores how the regularity of ρ imposes structure on residual conjugate self-twists
in preparation for Section 9. We also introduce the goodness constraint.
–
Section 9 is the technical heart of the technical heart of the paper. Its main goal is to prove that Bρ(E) has the same field of fractions as AΣρ to show that they are fullness peers (Corollary 9.15), completing Step (4) of the proof. This turns out to be intimately related to the lifting properties of conjugate self-twists of ρ to ρ explored in Section 7.
Interpretation and applications:Sections 10 to 12 interpret our results for general ρ and explain how to apply them to various modular form contexts in detail.
Section 11 is independent of the main thrust of the paper; it gives an improvement on previous very-big-image results, showing that the image of ρ contains SL2(A0) if it does so residually.
–
Section 12 explains in detail how and to what extent B recovers and refines known big-image results about Galois representations arising from modular forms and their p-adic families.
We also apply B to obtain new results for Galois representations attached to Bianchi modular forms (Section 12.3), Coleman families of elliptic modular forms (Section 12.5), and p-adic families of Hilbert modular forms (Section 12.6).
Appendix:Appendix A houses a variety of lemmas on representation theory and commutative algebra for which we failed to find convenient references in the literature, in particular the statement that semisimple representations with isomorphic adjoints are isomorphic up to twist (Theorem A.10). No claims to originality here.
1.5. Leitfaden suggestions
We propose different levels of interaction with this paper for different readers. Those who are merely interested in results and applications should read the present Section 1 and skip to Section 12. Those also curious about our methods should additionally
skim Section 2, and then read Sections 3, 4 and 5 and Section 10.
Readers who have the stomach for the technical weeds should begin the same way, but then also brave Sections 6 to 9. Finally, any of the previous types of readers who are interested in very-big-image results (Imρ contains an SL2 if it does so residually) should peek at Section 11.
1.6. Notation
We establish some notation and conventions. All rings are unital. Given any ring R (not necessarily commutative), we will let R× denote the multiplicative group of invertible elements in R. For brevity, we call a commutative ring B containing a subring C an extension of C, finite if B is module-finite as a C-algebra.
If B is a domain, then Q(B) denotes its field of fractions. If F is a field, then F (respectively, Fsep) denotes a fixed algebraic (respectively, separable) closure.
If C⊆B is an extension of rings, write Aut(B) for the group of ring automorphisms of B and Aut(B/C) for the subgroup fixing C pointwise. For Σ⊆Aut(B), write BΣ for the subring pointwise fixed by every σ∈Σ. If σ∈Aut(B) and f:X→B is any set map, write σf:X→B for the map x↦σ(f(x)).
For any integer n, let ζn denote a primitive nth root of unity. Given a finite field F of size q a power of a prime p, its ring of Witt vectors, isomorphic to Zp[ζq−1], will be denoted by W(F). Let s:F×→W(F)× be the Teichmüller lift. We extend it to s:F→W(F) by defining s(0):=0. If A is a W(F)-algebra, we use W(F) to denote the image of W(F) in A under the structure map. In particular, s may be viewed as being A-valued by composing with the structure map.
Throughout the paper we fix a prime p=2. The ring A will always denote a local pro-p commutative ring with maximal ideal m and residue field F, which is then automatically a finite extension of Fp. A closed subring of such an A is also automatically local and pro-p.(vi)(vi)(vi)Let B be closed subring of A. Its ideal mB:=B∩m is maximal since FB:=B/mB is a subring of F. Given any α∈A one can check that the sequence {αpn}n converges and that its limit is s(α), the Teichmüller lift of the image of α in F. Since B is closed, it thus contains both s(FB×) and inverses of elements of 1+mB. Therefore every element of B−mB is invertible, and B is local. And as a closed subgroup of a pro-p group, B is automatically pro-p. Since 2 is invertible in A, we can always take square roots of elements x∈1+m via the formula
[TABLE]
In particular, when we write x, we always choose the root congruent to 1 modulo m. In general, the profinite topology on A is coarser than the m-adic topology, but if such an A is noetherian, then the profinite topology coincides with the m-adic topology, so that A is a complete local noetherian ring [dSL97, Proposition 2.4]. In this case, every finite A-module is equipped with its natural m-adic A-module topology, which is compatible on submodules by the Artin-Rees lemma — see [AM69, Theorem 10.11].
In particular, every ideal in A is closed. By the Cohen structure theorem [Eis95, Theorem 7.7], if A is noetherian then it is a quotient of W(F)[[x1,…,xn]] for some n.
If A is additionally a domain then A enjoys the so-called N2 (or sometimes “japanese”) property: the integral closure of A in a finite extension of its field of fractions is finite over A [Mat70, Chapter 12, proof of Corollary 2], and hence a pro-p local noetherian domain in its own right. Note that in this case there need not be a topology on Q(A) under which Q(A) is a topological field containing A as a closed subring: since A is compact, the existence of any such topology would mean that Q(A) is a locally compact field, so that Q(A) is a finite extension of Fp, Qp, or Fp((X)) [RV91, Theorem 4.12]. But our A are more general.
If M is a subset of a W(F)-module N, then we will write W(F)M for the W(F)-linear span of M in N. When R is a topological ring and S a subring of R, we say that S is topologically generated by a set X if S is the smallest closed subring of R containing X. Similarly, we can talk about an additive subgroup or a W(F)-algebra topologically generated by a set.
Finally, Π always denotes a p-finite profinite group (Definition 2.6). If ρ:Π→GL2(F) is a representation over a finite field F, we can compose ρ with the natural projection P:GL2(F)→PGL2(F). We shall refer to the image of Π under the composition P∘ρ as the projective image of ρ. It is well known that the projective image of ρ is cyclic, dihedral, or isomorphic to A4,S4,A5 or one of PSL2(F′) or PGL2(F′) for some subfield F′ of F ([Dic58, Chapter XII] or see [Bel19, Section 3.1]). If Pρ(Π)≅A4 (respectively, S4,A5), we say that ρ is tetrahedral (respectively, octahedral, icosahedral). If ρ is tetrahedral, octahedral, or icosahedral, then we say that ρ is exceptional. If Pρ(Π) contains PSL2(Fp) and ρ is not exceptional, then we say that ρ is large. Be warned that there are exceptional isomorphisms PSL2(F3)≅A4,PGL2(F3)≅S4,PSL2(F5)≅A5.
2. Bellaïche-Pink-Lie theory
In this section we introduce the basic objects of study — 2-dimensional pseudorepresentations and their associated realizations over generalized matrix algebras — along with the primary tools we use to study them: Pink-Lie algebras and Bellaïche’s structure theorem (Theorem 2.23). The main reference for everything in this section is Bellaïche’s paper [Bel19], which we refer to for most proofs. The only exception is that our definition of regularity (Definition 2.19) is weaker than that of Bellaïche.
2.1. Pseudorepresentations
In this section we summarize the definitions and notation related to two-dimensional pseudorepresentations, algebraic gadgets introduced by Chenevier in [Che14] (where they are called “determinants”) to mimic the behavior of trace and determinant functions of true 2-dimensional representation of groups. We follow [Che14, Example 1.8] and [Bel19, 2.1.1] for our definitions.
2.1.1. Abstract pseudorepresentations
Definition 2.1**.**
A (2-dimensional) pseudorepresentation of a group G over a commutative ring B is a pair of functions t:G→B and d:Π→B× such that
(1)
t(1)=2;
2. (2)
d(gh)=d(g)d(h) for all g,h∈G;
3. (3)
t(gh)=t(hg) for all g,h∈G;
4. (4)
t(gh)+d(h)t(gh−1)=t(g)t(h) for all g,h∈G.
If G is a topological group and B is a topological ring, we say that a pseudorepresentation (t,d):G→B is continuous if t and d are continuous maps.
One can verify that if ρ:G→GL2(B) is a (continuous) representation, then (trρ,detρ) is a (continuous) 2-dimensional pseudorepresentation. Conversely, if B is an algebraically closed field, then every pseudorepresentation (t,d):G→B is carried by a unique semisimple representation G→GL2(B). If 2 is invertible in B, then a 2-dimensional pseudorepresentation (t,d) is determined by t alone: setting h=g in (3) above yields d(g)=2t(g)2−t(g2). In this way, pseudorepresentations generalize earlier work of Wiles [Wil90], Taylor [Tay91], and Rouquier [Rou96] on pseudocharacters, which mimic representations by keeping track only of a trace function.
If (t,d):Π→B is a (continuous) pseudorepresentation and χ:Π→B× is a (continuous) character, then (χt,χ2d) is also a (continuous) pseudorepresentation, called the twist of (t,d) by χ.
We say (t,d) is reducible if t=χ1+χ2 with χi:Π→B× characters. Otherwise (t,d) is irreducible. We say (t,d) is dihedral if it is irreducible and there is a nontrivial character η:Π→B× such that (ηt,η2d)=(t,d).
The kernel of a pseudorepresentation (t,d):G→B is
[TABLE]
This is a normal subgroup of G, closed if (t,d) is continuous. Moreover, (t,d) factors through the quotient Π/ker(t,d).
2.1.2. Pseudorepresentations of profinite groups over pro-p rings
We henceforth assume that all pseudorepresentations are continuous if both the group and the ring have topologies. Recall that A is a local pro-p commutative ring with maximal ideal m and residue field F and Π is a profinite group.
If (t,d):Π→A is a pseudorepresentation, its residual pseudorepresentation (tˉ,dˉ):Π→F is obtained by composing (t,d) with reduction modulo m. Because the Brauer group of F is trivial, the semisimple representation ρ carrying (tˉ,dˉ) is always realizable over F (see also Lemma A.12).
Definition 2.2**.**
A pseudorepresentation (t,d):Π→A has constant determinant if d is the Teichmüller lift of its reduction modulo m: that is, d=s(dˉ). Since A×≅s(F×)×1+m, we see that d is always the product of s(dˉ) and a pro-p character d1:Π→1+m. The twist of (t,d) by d1−1/2 is the unique constant-determinant pseudorepresentation with the same ρ: this is the constant determinant twist of (t,d).
If (t,d):Π→A is a pseudorepresentation, we call the subring of A topologically generated by t(Π) the trace ring of (t,d). Note that the residue field of the trace ring is the trace ring of (tˉ,dˉ) or ρ. Sometimes we need the residue field F of A to be a quadratic extension of the trace ring of (tˉ,dˉ). Thus if (t,d):Π→A is a pseudorepresentation and F is the residue field of A, we define the trace algebra of (t,d) to be the W(F)-subalgebra of A topologically generated by t(Π). Thus the residue field does not change when restricting the codomain of a pseudorepresentation to its trace algebra. Both the trace ring and the trace algebra are pro-p local rings (Footnote (vi)).
Definition 2.3**.**
We call a pseudorepresentation (t,d):Π→A is a priori small if it is reducible or dihedral, or if the kernel of its constant-determinant twist is equal to the kernel of ρ.
If A is a domain it turns out that the a priori small notion coincides with a certain weak kind of reducibility. Recall that a representation ρ:G→GL2(F) over a field F is strongly (absolutely) irreducible if ρ∣H is (absolutely) irreducible for any finite-index subgroup H of G.
Proposition 2.4**.**
Let (t,d):Π→A be a pseudorepresentation to a local pro-p domain A with field of fractions K, and let ρ:Π→GL2(K) be the semisimple representation carrying (t,d).
The following are equivalent:
(1)
(t,d)* is a priori small;*
2. (2)
ρ* is reducible, dihedral, or its constant-determinant twist has finite image;*
3. (3)
ρ* is not strongly irreducible.*
Note that if any twist of ρ:Π→GL2(K) has finite image, then the constant-determinant twist does; equivalently, the image of the projective representation Pρ:Π→PGL2(K) is finite.
Proof.
Since all the notions in question are twist-invariant, we may replace (t,d) and ρ by constant-determinant twists. Clearly (1) ⟹ (2) ⟹ (3).
To see that (3) ⟹ (2), we follow [Rib75, Theorem 2.3]. Suppose ρ∣H reducible for some finite-index subgroup H of Π. Up to replacing H with its normal core (i.e., the intersection of all conjugates, which is still of finite index in Π), we may assume that H is normal in Π, so that by Clifford’s theorem [Cra19, Theorem 7.1.1], ρ∣H is semisimple, and hence has abelian image. If ρ(H) is not contained in the center of GL2(K) then H contains a semisimple element h with distinct ρ-eigenvalues, and ρ(H) is contained in the maximal torus centralizing ρ(h). Moreover, since H is normal in Π, all of ρ(Π) is contained in the normalizer of ρ(h), so that ρ(H) has index 1 or 2 in ρ(Π), and ρ is either reducible or dihedral. On the other hand, if ρ∣H is scalar, then since its trace is A-valued, ρ∣H=α⊕α for some character α:H→A×; since ρ has constant determinant, α2=s(α2), so that ρ∣H takes values in the finite set of prime-to-p roots of unity in A, whence ρ has finite image.
For (2) ⟹ (1) it suffices to consider (t,d) residually exceptional or large, in which case by Rouquier-Nyssen ρ is the base change of a representation ρA:Π→GL2(A) with kerρA=ker(t,d) (see also Proposition 2.11 and 2.12). We show that if ρ has finite image, then reduction modulo the pro-p subgroup 1+M2(m) induces an isomorphism ρ(Π)≅ρ(Π). If A=F there is nothing to show, so we can assume that A is infinite. (Indeed, if A has characteristic zero, then A contains Zp; otherwise, A is a local F-algebra with residue field F, and any such ring that is also finite over F is equal to F.) We claim that the projective image of ρ is isomorphic to that of ρ: If A has characteristic zero, then the finite subgroups of GL2(K) are the same as those of GL2(C) [Ser72, Proposition 16], hence the projective image of ρ is isomorphic to A4,S4 or A5. And if A has characteristic p, then the finite subgroups of GL2(A) are all defined over F, because the eigenvalues of any finite-order element are roots of unity. In any case, the kernel of the reduction map ρA(Π)→ρ(Π) is pro-p, so that it can be seen on the map PρA(Π)→Pρ(Π). But none of A4, S4, A5, PSL2(F′), or PGL2(F′) have normal subgroups of p-power order for p>2.
∎
2.1.3. Pseudodeformations
Fix a continuous semisimple representation ρ:Π→GL2(F).
Definition 2.5**.**
We say that a pseudorepresentation (t,d):Π→A is a pseudodeformation of ρ if (t,d)≡(trρ,detρ)modm.
Let C be the category of local pro-p commutative rings with residue field F, which have a natural W(F)-algebra structure, and with morphisms being local continuous W(F)-algebra homomorphisms. We are interested in the deformation functors
[TABLE]
These functors are always representable. In order for the representing ring to be noetherian, we need to impose a finiteness condition on Π due to Mazur, which we now recall.
Definition 2.6**.**
[Maz89, §1.1]
A profinite group Πsatisfies the p-finiteness condition or is p-finite if, for every open subgroup Π0 of Π, the set Hom(Π0,Fp) is finite.
It is well known that F is represented by a pro-p local noetherianW(F)-algebra A~ whenever Π is a p-finite profinite group. See, for example, [Che14, Proposition 3.3] or [Böc13, Proposition 2.3.1]. In particular, the trace algebra of any pseudorepresentation of a p-finite profinite group on a local pro-p ring is a quotient of A~ and hence noetherian. Let (tuniv,duniv):Π→A~ be the universal pseudodeformation of ρ. It is easy to see that the constant-determinant condition is a deformation condition. Indeed, let a be the ideal of A~ topologically generated by {duniv(g)−s(detρ(g)):g∈Π}. Then A:=A~/a represents G. In particular, A is also a pro-p local noetherian W(F)-algebra with residue field F. We use (T,d):Π→A to denote the universal constant-determinant pseudodeformation.
Definition 2.7**.**
If F′ is a subfield of F, then we say that a 2-dimensional semisimple representation ρ is multiplicity free over F′ if either ρ is absolutely irreducible or ρ≅χ1⊕χ2 such that χ1,χ2:Π→F′× are distinct characters.
The following notion of admissibility, introduced by Bellaïche, plays a central role in [Bel19].
Definition 2.8**.**
[Bel19, Section 5.2]
A tuple (Π,ρ,t,d) is an admissible pseudodeformation over A if the following conditions are satisfied:
(1)
Π is a p-finite profinite group;
2. (2)
ρ:Π→GL2(F) is a continuous representation that is multiplicity free over F;
3. (3)
(t,d):Π→A is a continuous pseudodeformation of ρ;
4. (4)
d(g)∈s(F×) for all g∈Π, that is, (t,d) has constant determinant;
5. (5)
A is the trace algebra of (t,d).
A local pro-pA accepting an admissible pseudodeformation is a complete noetherian local ring.
2.2. GMAs and (t,d)-representations
It is natural to ask when a given pseudodeformation (t,d):Π→A arises as the trace and determinant of an actual representation ρ:Π→GL2(A). This has been studied in great generality; see the introduction of Chenevier’s paper [Che14] for a thorough history. Bellaïche and Chenevier [BC09, Section 1.4] have shown that, under the residual multiplicity-free assumption, (t,d) always comes from a representation if one allows something more general than matrix algebras for the target. In Section 2.2 we summarize Bellaïche’s [Bel19, Section 2], where he specializes his work with Chenevier to the 2-dimensional setting. All proofs that can be found in Bellaïche’s work are omitted.
Definition 2.9**.**
A generalized matrix algebra (GMA) over a commutative ring A is given by a tuple of data (A,B,C,m), where B and C are A-modules, m:B⊗AC→A is a morphism of A-modules satisfying
[TABLE]
Given such data, define the A-module R:=A⊕B⊕C⊕A=(ACBA) and give R a multiplicative structure via
[TABLE]
so that R has the structure of an A-algebra via the ring homomorphism a↦(a00a)∈R. We refer to the GMA given by (A,B,C,m) simply by R.
A morphism of GMAs(A,B,C,m)→(A′,B′,C′,m′) (with associated A-algebras R and R′) is a triple (fA,fB,fC) consisting of a ring morphism fA:A→A′ and two A′-module morphisms fB:B⊗A,fAA′→B′, fC:C⊗A,fAA′→C′ such that fA∘m=m′∘(fB⊗fC). The data (fA,fB,fC) defines in a natural way an A-algebra morphism ψ:R→R′; we say that ψ is associated with (fA,fB,fC).
If A is a topological ring and B,C are topological A-modules, then R inherits a natural topology, and we call R a topological GMA if m is continuous. We say that R is faithful if m is nondegenerate as a pairing of A-modules. As with matrix algebras, we have the notion of a trace and determinant on a GMA R given by \operatorname{tr}\bigl{(}\begin{smallmatrix}a&b\\
c&d\end{smallmatrix}\bigr{)}=a+d and \det\bigl{(}\begin{smallmatrix}a&b\\
c&d\end{smallmatrix}\bigr{)}=ad-m(b\otimes c).
The following lemma shows that when A is a domain, faithful GMAs can be embedded into a matrix algebra over the field of fractions of A.
Lemma 2.10**.**
[Bel19, Lemmas 2.2.2, 2.2.3]**
Assume that A is a domain with field of fractions K and that R=(ACBA) is a faithful GMA over A. Then there exist embeddings of A-modules B,C↪K such that (identifying B,C with their images in K), m:B⊗AC→A is induced by multiplication in K. In particular, if BC=0, then R⊗AK is isomorphic over K as a GMA to M2(K).
We recall the following result of Bellaïche, which explains that any residually multiplicity-free pseudorepresentation can be realized as the trace of a GMA-valued representation.
Proposition 2.11**.**
[Bel19, Proposition 2.4.2]**
Let ρ:Π→GL2(F) be multiplicity free over F. Let (t,d):Π→A be a pseudodeformation of ρ.
(1)
There exists a faithful GMA R over A and a morphism of groups ρ:Π→R× such that trρ=t,detρ=d, and Aρ(Π)=R.
2. (2)
If (ρ,R) and (ρ′,R′) are as in (1), then there is a unique isomorphism of A-algebras Ψ:R→R′ such that Ψ∘ρ=ρ′.
3. (3)
If g0∈Π such that ρ(g0) has distinct eigenvalues λ0,μ0∈F×, then there exists (ρ,R) as in (1) such that ρ(g0) is diagonal and ρ(g0)≡(λ000μ0)modm.
4. (4)
If g0∈Π and (ρ,R),(ρ′,R′) are as in (3), then the unique isomorphism of A-algebras Ψ:R→R′ such that Ψ∘ρ=ρ′ is associated with an isomorphism of GMAs.
5. (5)
If ρ is irreducible and (ρ,R) is as in (1), then R=(A,B,C,m,R) is isomorphic to M2(A) as a GMA over A. If ρ is reducible, then BC⊆m.
6. (6)
If (ρ,R) are as in (1), then kerρ=ker(t,d).
7. (7)
Assume that A is noetherian and Π is p-finite. If (t,d) is continuous, then for (ρ,R) as in (1), R is of finite type as an A-module. If R is given its unique topology as an A-algebra, then ρ is continuous.
Remark 2.12*.*
When ρ is absolutely irreducible, Proposition 2.11 allows us to identify the GMA R with the matrix algebra M2(A). We follow Bellaïche in always implicitly making such an identification. In particular, in the dihedral case, elements of B and C are viewed as elements of A.
∎
Following Bellaïche, we make the following definitions.
Definition 2.13**.**
[Bel19, Definition 2.4.3]
A representation ρ:Π→R× satisfying condition (1) in Proposition 2.11 is called a (t,d)-representation. If in addition ρ satisfies condition (3), then we say that ρ is adapted to (g0,λ0,μ0).
In fact, it is often useful to have the following strengthening of Proposition 2.11(3).
Lemma 2.14**.**
Let ρ:Π→GL2(F) be multiplicity free over F and λ0=μ0∈F× be the eigenvalues of an element in Imρ. Let (t,d):Π→A be a pseudodeformation of ρ. Then there exists g0∈Π and a (t,d)-representation ρ adapted to (g0,λ0,μ0) such that ρ(g0)=(s(λ0)00s(μ0)).
Proof.
Let g0′∈Π be any element such that ρ(g0′) has eigenvalues λ0,μ0. Then Proposition 2.11(3) guarantees the existence of a (t,d)-representation ρ:Π→R× adapted to (g0′,λ0,μ0). By [Bel19, Theorem 6.2.1], it follows that (s(λ0)00s(μ0))∈Imρ. Let g0 be any element in ρ−1(s(λ0)00s(μ0)). Then ρ is a (t,d)-representation adapted to (g0,λ0,μ0) and ρ(g0)=(s(λ0)00s(μ0)).
∎
2.3. Pink-Lie algebras
In Section 2.3 we recall Pink’s theory relating pro-p subgroups of SL2(A) to closed Lie subalgebras of sl2(A) [Pin93]. In fact, we use Bellaïche’s generalization to GMAs [Bel19, Section 4].
Recall that A is a local pro-p ring with p=2. The assumption that p=2 is critical for Pink’s theory. We denote by m the maximal ideal of A. Fix a compact topological GMA R=(ACBA) over A. (The compactness condition is satisfied, for instance, when R is finite as an A-module.) Write
[TABLE]
Let radR be the Jacobson radical of R, and R1:=1+radR. We let SR1:=SR×∩R1, which is a closed normal pro-p subgroup of R×. See [Bel19, Remark 4.2.1] for an explicit description of these objects. We mention here that in the case when BC=A there is by [Bel19, Lemma 2.2.1] an isomorphism of GMAs R≅M2(A) that we can use to identify radR with mM2(A) and R/radR with M2(F), while if BC⊂m then radR=(mCBm)
and R/radR=(F00F).
Given any subset S of R, we write
[TABLE]
Then (radR)0 has a Lie algebra structure with bracket given by [r1,r2]:=r1r2−r2r1.
For any topological group G and closed subgroup H of G, write (G,H) for the smallest closed subgroup of G containing {g−1h−1gh:g∈G,h∈H}. Fix a closed subgroup Γ⊆SR1. Recall that the lower central series of Γ is defined by Γ1:=Γ and define Γn+1:=(Γ,Γn). We describe how Pink associates a filtration of Lie algebras to Γ [Pin93, Section 2].
Define a function
[TABLE]
where (trr)/2 is regarded as a scalar via the structure morphism A→R. Let L(Γ)=L1(Γ) be the (additive) subgroup of (radR)0 topologically generated by Θ(Γ). For n≥2, define Ln(Γ) recursively as the subgroup of (radR)0 topologically generated by the set
[TABLE]
Although the Ln(Γ) are a priori only subgroups of (radR)0, Pink shows that they are closed under Lie brackets and form a descending filtration, as summarized in the following proposition, which is due to Pink when R=M2(A) [Pin93, Proposition 3.1, Proposition 2.3] and to Bellaïche in the GMA case [Bel19, Proposition 4.7.1].
Proposition 2.15**.**
For all n≥1, we have Ln+1(Γ)⊆Ln(Γ). In particular, each Ln(Γ) is a Lie subalgebra of (radR)0.
We emphasize that, a priori, each Ln(Γ) is just a Zp-module, even if the ring A is very large. The point of Section 6 is to prove that, under mild conditions, Ln(Γ) is in fact an algebra over an (in general) much larger ring.
Conversely, given a closed Lie subalgebra L of (radR)0, define H(L):=Θ−1(L)∩SR1. Let Hn:=H(Ln(Γ)). A priori, H(L) is only a subset of SR1. However, we have the following theorem of Pink [Pin93, Proposition 2.4, Theorem 2.7], which was generalized to GMAs by Bellaïche [Bel19, Theorem 4.7.3].
Theorem 2.16**.**
We have that Hn is a pro-p subgroup of SR1. Furthermore, Γ is a normal subgroup of H1, and H1/Γ is abelian. For n≥2, we have Hn=Γn.
Remark 2.17*.*
Pink’s construction satisfies the following two important properties.
(1)
It is functorial with respect to surjective ring homomorphisms. Namely, let a be a closed ideal of A and φ:R→R/aR the natural projection. Then for all n≥1 we have
[TABLE]
2. (2)
Pink’s Lie algebra Ln(Γ) is closed under conjugation by the normalizer of Γ in R×. This follows easily from the definitions since Θ is invariant under conjugation.∎
See Lemma 3.9 for an example calculating Ln(Γ) when Γ is a congruence subgroup.
2.4. Decomposability and regularity
In order to prove fullness theorems, it is useful to be able to decompose Pink’s Lie algebra according to its entries. In Section 2.4 we define this precisely and then define regularity, which will turn out to ensure that the Lie algebras of the representations we work with are decomposable.
Definition 2.18**.**
[Bel19, Section 4.9]
Let R be a GMA over A and L a closed subspace of (radR)0. We say that L is decomposable if
[TABLE]
We say that L is strongly decomposable if L is decomposable and
[TABLE]
If Ln(Γ)⊆R=(ACBA) is decomposable, we write
[TABLE]
Eventually, L will be a Pink-Lie algebra associated to some admissible pseudodeformation of ρ:Π→GL2(F). Regularity is a condition on ρ that will allow us to decompose L, as we will see in Section 6.
Let E be the subfield of F generated by {(trρ(g))2/detρ(g):g∈Π}; equivalently, E is generated by the traces of adρ.
If λg,μg are the eigenvalues of ρ(g), then we see that
(trρ(g))2/detρ(g)=λgμg−1+λg−1μg+2. Hence E is generated over Fp by the set
[TABLE]
In particular, g will not contribute to E if the multiplicative order of λgμg−1 is strictly less than 5. Using this reasoning, it is straightforward to calculate E when ρ exceptional. Namely, if ρ is tetrahedral or octahedral, then E=Fp. If ρ is icosahedral, then E=Fp if p=5 and E=Fp(ζ5+ζ5−1)=Fp(5) otherwise.
Definition 2.19**.**
Let ρ:Π→GL2(F) be a semisimple representation. We say that ρ is regular if there exists g0∈Π such that ρ(g0) has eigenvalues λ0,μ0∈Fp× satisfying λ0μ0−1∈E×∖{±1}. We call g0 a regular element for ρ. If in addition λ0,μ0∈E×, then we say that ρ is strongly regular.
Definition 2.19 is weaker than Bellaïche’s definition of regularity [Bel19, Definition 7.2.1], where the eigenvalues λ0 and μ0 are required in addition to belong to Fp. Examining (2), we see that the only way ρ can fail to be regular is if, for every matrix in Imρ with eigenvalues λ,μ, either λμ−1=±1 or the unique quadratic extension of E is E(λμ−1).
Remark 2.20*.*
Let us analyze regularity depending on the projective image of ρ. With notation as in Definition 2.19, note that the order of λ0μ0−1 in E× corresponds to the order of ρ(g0) in the projective image of ρ.
(1)
If ρ is large, then ρ is regular. Indeed, Pρ(Π) contains PSL2(E) up to conjugation. Since ρ is not exceptional, E× contains an element x such that x2=±1. Then the image of ρ contains, up to conjugation, a scalar multiple of \bigl{(}\begin{smallmatrix}x&0\\
0&x^{-1}\end{smallmatrix}\bigr{)}, which satisfies the regularity property.
2. (2)
If ρ is tetrahedral and p>3, then a regular element must map to a 3-cycle in the projective image of ρ, since the other elements of A4 have order at most 2. Thus in this case regularity is equivalent to ζ3∈E=Fp, which is equivalent to p≡2mod3. By a similar argument we see that if ρ is octahedral and p>3, then regularity is equivalent to one of ζ3 or ζ4 being in E=Fp, which is equivalent to p≡11mod12. If ρ is icosahedral and p=5, then regularity is equivalent to one of ζ3 or ζ5 being in E=Fp(5), which is equivalent to p≡14mod15.
3. (3)
If ρ≅ε⊕δ, then ρ is regular if and only if εδ−1 takes values in E× (Lemma 8.1).
4. (4)
If ρ=IndΠ0Πχ is dihedral, then elements in Π∖Π0 have projective order 2, and so any regular element must lie in Π0. Furthermore, elements in Π∖Π0 have trace 0, and so the field E associated to ρ is the same as the field E associated to ρ∣Π0. Hence we are reduced to the previous case when ρ is reducible.
5. (5)
If the projective image of ρ is isomorphic to Z/2Z or (Z/2Z)2, or if E=F3, then ρ is never regular. In particular, if p=3 and ρ is tetrahedral or octahedral, then ρ is not regular. If p=5 and ρ is icosahedral, then Pρ(Π) is conjugate to PSL2(F5). Thus E=F5 and any potential regular element has eigenvalues in (F5×)2={±1}, so ρ is not regular in this case.∎
2.5. Bellaïche’s results
The purpose of this section is to state Bellaïche’s main results that form the basis for our work in this paper.
We state them in slightly less generality than [Bel19, Section 6]. As before, A denotes a local pro-p ring with maximal ideal m and residue field F. In particular, A is naturally a topological W(F)-algebra.
Let R be a faithful GMA over A. Recall the description of R/radR that we gave in the beginning of Section 2.3. We define s:R/radR→R by
[TABLE]
Note that in the latter case, we have a priori that b=c=0.
Let us fix an admissible pseudodeformation (Π,ρ,t,d) over A. If p=3, let us assume that ρ is not tetrahedral. By Proposition 2.11, there exists a (t,d)-representation ρ:Π→R×. Given such a (t,d)-representation, write G=Gρ:=ρ(Π) and Γ=Γρ:=G∩SR1. Furthermore, let G denote the image of G modulo radR. (Note that the image of G under an embedding R/radR→GL2(F) is a conjugate of ρ(Π).) We will write Ln(ρ):=Ln(Γρ) and analogously for In(ρ),∇n(ρ),Bn(ρ),Cn(ρ).
Bellaïche chooses his (t,d)-representations very carefully in order to give a nice description of their Pink-Lie algebras. How this is done depends upon the projective image of ρ. Since ρ is multiplicity free over F, we can let λ0=μ0∈Fp× be the eigenvalues of a matrix x0∈Imρ chosen such that the following conditions are satisfied:
•
if ρ is large, then (λ0μ0−1)2=1 and λ0,μ0∈Fp×;
•
if p=3 and ρ is octahedral, then λ0μ0−1 is a primitive fourth root of unity;
•
if p=5 and ρ is icosahedral, then λ0μ0−1 is a primitive third root of unity;
•
if ρ is exceptional and does not belong to one of the previous to scenarios, then λ0μ0−1 is a primitive third, fourth, or fifth root of unity;
•
otherwise, the multiplicative order of λ0μ0−1 is equal to the maximal order of an element in the projective image of ρ.
Definition 2.21**.**
Suppose (Π,ρ,t,d) is an admissible pseudodeformation. We say that a (t,d)-representation ρ is well adapted if
(1)
ρ is adapted to an element g0 such that ρ(g0)=(s(λ0)00s(μ0)), where λ0,μ0 satisfy the relevant property listed above;
2. (2)
if the projective image of ρ is dihedral and nonabelian, then G contains a matrix of the form (0cb0) with bc−1∈Fp× and s(G)⊆Imρ.
Bellaïche shows that well-adapted (t,d)-representations always exist, provided that one is willing to replace F by a quadratic extension in the dihedral case [Bel19, Proposition 6.3.2, Lemma 6.8.2].
Define Fq as in the table below. We will see in Lemma 8.1 that if ρ is regular and reducible or dihedral, then Fq can be taken to be E. If ρ is not projectively cyclic or dihedral, then Fq⊆E by definition. (In the A5 case, this follows from the calculation that E=Fp(5) prior to Definition 2.19.)
Remark 2.22*.*
Our definition of Fq differs from that of Bellaïche when ρ is exceptional. If ρ is tetrahedral, then Bellaïche defines Fq=Fp(ζ3). If ρ is octahedral, he defines Fq to be Fp(ζ3) if the ratio λ0μ0−1 chosen prior to Definition 2.21 has order 3 and Fp(ζ4) if that ratio has order 4. If ρ is icosahedral, then he defines Fq=Fp(ζ5). The key property that Bellaïche needs is that ρ can be conjugated so that its image lies in ZGL2(Fq) and \bigl{(}\begin{smallmatrix}\lambda_{0}&0\\
0&\mu_{0}\end{smallmatrix}\bigr{)}\in\operatorname{Im}\mkern 1.5mu\overline{\mkern-1.5mu\rho\mkern-1.0mu}\mkern 1.0mu, where Z is the group of scalar matrices in F (cf. [Bel19, Lemma 6.8.5]). This change of definition will be justified in Lemma 8.12.
∎
The following theorem summarizes Bellaïche’s results describing the structure of W(Fq)L1(ρ) from [Bel19, Section 6]. We recall from 2.12 that in the dihedral case, elements in B and C can be viewed as elements of A.
Theorem 2.23** (Bellaïche).**
Let (Π,ρ,t,d) be an admissible pseudodeformation such that the projective image of ρ is not isomorphic to Z/2Z nor (Z/2Z)2. Then every well-adapted (t,d)-representation ρ:Π→R× with R=(ACBA) has the following properties:
(1)
L1(ρ)* is decomposable;*
2. (2)
the ring A is equal to
[TABLE]
3. (3)
W(F)C1(ρ)=C* and W(F)B1(ρ)=B;*
4. (4)
up to possibly replacing ρ with its conjugate by a certain matrix (100a) with a∈A× when ρ is exceptional or large, W(Fq)L1(ρ) is equal to
[TABLE]
Furthermore
(i)
(W(Fq)I1(ρ))3⊆W(Fq)I1(ρ);
2. (ii)
if ρ is not reducible, then W(Fq)C1(ρ)=W(Fq)B1(ρ);
3. (iii)
if ρ is exceptional or large, then W(Fq)B1(ρ)=W(Fq)I1(ρ) and (W(Fq)I1(ρ))2⊂W(Fq)I1(ρ).
For a subfield F′ of F, we shall often refer to the W(F′)-subalgebra of A generated by I1(ρ). We denote it by Bρ(F′), which is simply equal to W(F′)+W(F′)I1(ρ)+W(F′)I1(ρ)2 whenever Fq⊆F′. When ρ is not reducible or dihedral, we have Bρ(F′)=W(F′)+W(F′)I1(ρ) by Theorem 2.23(ii, iii). In particular, Theorem 2.23 says that A=Bρ(F) unless ρ is projectively dihedral, in which case A=Bρ(F)+W(F)B1(ρ).
Bellaïche uses Theorem 2.23 to deduce that, under certain hypotheses, the representation ρ is Bρ(Fp)-full. See Theorem 1.3 or [Bel19, Theorem 7.2.3] for a precise statement of his result. The first step in our main theorem is to improve this to Bρ(E)-fullness in Section 6.
But first we discuss fullness, conjugate self-twists, and the connections between them in detail.
3. Fullness
In this section we explore the notion of fullness, our measure of the size of the image of a continuous (pseudo)representation on a noetherian local pro-p ring A. Here we additionally assume that A is a domain, with field of fractions K.
3.1. Fullness for (pseudo)representations
Let B be any ring.
For any nonzero B-ideal b, let
[TABLE]
be the congruence subgroup of SL2(B) of levelb.
Definition 3.1**.**
Let G be a subgroup of GL2(K). For a subring B of K we say that G is B-full if there exists a nonzero B-ideal b and x∈GL2(K) such that
[TABLE]
A GL2(K)-valued representation is B-full if its image is B-full. If (t,d):Π→A is a pseudorepresentation, we say (t,d) is B-full if there exists a (t,d)-representation ρ:Π→R× such that ι∘ρ is B-full, where ι is an embedding of R into M2(K). Such an ι
exists by Lemma 2.10; note that by replacing ι by a conjugate embedding we may insist that ΓB(b)⊂GL2(K) is contained in ι(R×) on the nose.
We will say B is a (t,d)-fullness ring if (t,d) is B-full.
The notion of fullness, which has appeared in earlier incarnations in [Hid15, last introductory paragraph] and [Lan16, Definition 2.2], is analogous to Bellaïche’s notion of “congruence large-image” [Bel19, Definition 7.2.1].
We now show that
fullness is well defined for pseudorepresentations and gives compatible notions for representations and pseudorepresentations.
Lemma 3.2**.**
Let (t,d):Π→A be a pseudorepresentation and ρ:Π→GL2(K) a representation whose trace takes values in A. Let B be any subring of K.
(1)
If there exists a (t,d)-representation that is B-full, then every (t,d)-representation is B-full.
2. (2)
The representation ρ is B-full if and only if its pseudorepresentation (trρ,detρ) is B-full.
Proof.
To prove (1), let ρ:Π→R× and ρ′:Π→R′× be two (t,d)-representations. We just have to verify that the A-algebra isomorphism Ψ:R→R′ such that ρ′=Ψ∘ρ from Proposition 2.11 is given by conjugation by an element of GL2(K). Consider Ψ⊗1:R⊗AK→R′⊗AK and recall that R⊗AK≅M2(K)≅R′⊗AK by Lemma 2.10. By the Skolem-Noether theorem, it follows that Ψ⊗1 (and hence Ψ) is conjugation by an element of GL2(K).
For (2), let (t,d)=(trρ,detρ), which is a pseudorepresentation over A by assumption. Let r:Π→R× be a (t,d)-representation, and embed R into GL2(K) by Lemma 2.10, thus viewing r as valued in GL2(K). Note that fullness of ρ (respectively, r) implies that ρ (respectively, r) is absolutely irreducible since this is true for the inclusion representation of a congruence subgroup in GL2(K). As (trρ,detρ)=(t,d)=(trr,detr) and either ρ or r is absolutely irreducible, by the Brauer-Nesbitt Theorem it follows that ρ and r are conjugate by a matrix in GL2(K). Since fullness is defined up to conjugation in GL2(K), it follows that ρ is B-full if and only if r, and hence (t,d), is B-full.
∎
The next two propositions suggest that we can restrict our attention to fullness rings that are closed subrings of the trace algebra. These ideas are made precise Proposition 3.4 and Corollary 3.7 below. Fix a continuous pseudorepresentation (t,d):Π→A.
Proposition 3.3**.**
If (t,d) is B-full for some subring B of K, then the Z-linear span of t(Π) contains a nonzero B-ideal. In particular, the trace algebra At of t contains a nonzero B-ideal.
Proof.
Let 0=b be an B-ideal such that ΓB(b) is contained in the image of some (t,d)-representation. We claim that pairwise products of elements of b are all in the set {t(g)−2:g∈Π}, so that b2 is contained in the trace algebra At. Indeed, an element of ΓB(b) is of the form \bigl{(}\begin{smallmatrix}1+a&b\\
c&1+d\end{smallmatrix}\bigr{)} with a,b,c,d∈b such that a+d+ad−bc=0. In particular, for any b,c∈b, taking d=bc and a=0 shows that
[TABLE]
Since 2=t(1)∈t(Π) and elements of the form bc generate b2, we see that the Z-span of t(Π) contains b2, which is nonzero since it is the square of a nonzero ideal of a domain.
∎
Proposition 3.4**.**
Let (t,d):Π→A be a continuous pseudorepresentation.
If (t,d) is B-full for a subring B of A, then (t,d) is also full for the closure B of B in A. Conversely, suppose that (t,d) is B-full.
If the image of a (t,d)-representation contains a congruence subgroup of SL2(B) whose level is the closure b of an ideal b of B, then (t,d) is also B-full.
Proof.
We show that for a closed subgroup G of the unit group R× of a faithful
GMA R over A equipped with an embedding ι:R↪M2(K), if ι(G) contains ΓB(b) for some nonzero ideal b of B, then ι(G) also contains ΓB(b) for the closure b of b. Since A is noetherian, and both R and M2(A) are finite A-algebras, the preimage S\coloneqq\iota^{-1}\big{(}M_{2}(A)\big{)} is a finite, hence closed,
A-subalgebra of R. Moreover, the induced map ι∣S:S→M2(A) is a homeomorphism onto its image by the compatibility of topologies on finite A-modules (see p. 1.6).
In particular, any closed subset of R containing \iota^{-1}\big{(}\Gamma_{B}(\mathfrak{b})\big{)} will also contain its closure \iota^{-1}\big{(}\Gamma_{\mkern 1.5mu\overline{\mkern-1.5muB\mkern-1.0mu}\mkern 1.0mu}(\mkern 1.5mu\overline{\mkern-1.5mu\mathfrak{b}\mkern-1.0mu}\mkern 1.0mu)\big{)}. The converse claim is clear since b is nonzero only if b is.
∎
3.2. Fullness peers
A pseudorepresentation may be B-full for more than one choice of ring B, even if B is a closed subring of A. For example, any pseudorepresentation that happens to be full for Zp2=W(Fp2) is also full for the order Zp+pZp2⊂Zp2.
We say two subrings B1,B2 of K are fullness peers if every nonzero ideal of B1 contains a nonzero ideal of B2 and vice versa, in which case B1-fullness is equivalent to B2-fullness. One easily checks that fullness peerage is an equivalence relation on subrings of K. The next lemma gives a criterion for establishing when nested domains are fullness peers.
Lemma 3.5**.**
Let B1⊆B2 be domains. The following conditions are equivalent:
(1)
B1* contains a nonzero ideal of B2;*
2. (2)
there exists y∈B1∖{0} such that yB2⊆B1;
3. (3)
B1* and B2 are fullness peers.*
These equivalent conditions imply that B2 and B1 have the same field of fractions.
If moreover B1 is noetherian, then conditions (1,2,3) are equivalent to:
(4)
Q(B2)=Q(B1)* and B2 is a finite B1-algebra.*
Proof.
For (1) implies (2), take y to be any nonzero element of the nonzero ideal of B2 contained in A1. If (2) holds, then an arbitrary nonzero ideal b of A1 contains (yB2)b, which is a nonzero ideal of B2, implying (3). Clearly (3) implies (1).
To see that Q(B2)=Q(B1) under any of (1,2,3), note that any x∈B2 can be written as (yx)/y∈Q(B1) with y as in (2).
For the rest of the proof, assume that B1 is noetherian. Suppose first that any of (1,2,3) holds. If J is a non-zero ideal of B2 contained in B1, then J is a finitely generated B1-module. By replacing J with a smaller B2-ideal, we can assume that J is principal in B2, that is, J=bB2 for some b∈B1. Now choose a finite set of generators {bx1,…,bxn} of bB2 as an B1-module, with x1,…,xn in B2. Then, for every y in B1, by is a linear combination ∑iaibxi for some ai∈B1, which means that y=∑iaixi, so the set {x1,…,xn} generates B2 as an B1-module.
Conversely, suppose that (4) is satisfied. Let x1,…,xn be generators for B2 as an B1-module. Write xi=bi1/bi2 with bij∈B1∖{0}. Set b=∏i=1nbi2∈B1∖{0}. Then bxi∈B1 for all i, and it follows that bB2⊆A1, proving (2).
∎
Question 3.6**.**
Note that Zp[[X]] and its
non-noetherian subring Zp+pZp[[X]] and are fullness peers even though the extension is not finite. Could profiniteness
substitute for finiteness in (4) above? That is, if B2 is a local noetherian pro-p domain, and B1⊂B2 is a closed subring with the same field of fractions, are B1 and B2 necessarily fullness peers?
∎
Corollary 3.7**.**
Let (t,d):Π→A be a pseudorepresentation. If (t,d) is B-full for some subring B of K, then B∩A is a fullness peer of B.
Proof.
By Proposition 3.3, the trace algebra of (t,d), and hence A, contains a nonzero ideal of B. Therefore so does B∩A, and by Lemma 3.5, B∩A⊆B is an extension of fullness peers.
∎
Corollary 3.7 together with Proposition 3.4
allows us to restrict our attention to fullness for closed subrings of A, though we continue to point out features of the general case for completeness.
Corollary 3.8**.**
Let B be a complete
local noetherian domain. Then all the extensions of B contained in the normalization Bnorm are fullness peers of B.
Proof.
Since Bnorm is the integral closure of B in its field of fractions, Bnorm is finite over B by the N2 property (see p. 1.6), hence a noetherian B-module. Therefore any ring C with B⊆C⊆Bnorm
is finite over B. Fullness peerage then follows from Lemma 3.5.
∎
3.3. Fullness and twisting
A key property of fullness, shown in Corollary 3.12, is that it does not change when we twist a pseudorepresentation by a character. This will allow us to do all of our technical work in the setting of constant determinant pseudorepresentations where we have Bellaïche’s Theorem 2.23 available. The proof of the twist invariance of fullness relies on a calculation of the Pink-Lie algebras of a congruence subgroup.
Let B be a local pro-p domain. For a closed nonzero B-ideal b, define
Let b be a closed ideal of B. Then ΓB(b) is a closed pro-p subgroup of GL2(B) and Ln(ΓB(b))=sl2(bn).
Proof.
For x=(1+acb1+d)∈ΓB(b) one has Θ(x)=(2a−dcb2d−a)∈sl2(b), so L1(ΓB(b))⊆sl2(b). In particular, for any b,c∈b, we have Θ(10b1)=(00b0) and Θ(1c01)=(0c00). For a∈b we have (1+2a2a−2a1−2a)∈ΓB(b), and so
[TABLE]
It follows that sl2(b) is contained in the additive subgroup generated by Θ(ΓB(b)). Since sl2(b) is closed in sl2(B), it follows that sl2(b)=L1(ΓB(b)).
It is straightforward to calculate by induction on n that the subgroup topologically generated by
[TABLE]
is sl2(bn+1). That is, Ln(ΓB(b))=sl2(bn) for all n≥1.
∎
Corollary 3.10**.**
Let b be a closed B-ideal different from B. Then \big{(}\Gamma_{B}(\mathfrak{b}),\Gamma_{B}(\mathfrak{b})\big{)}=\Gamma_{B}(\mathfrak{b}^{2}). If #B>3, this also holds for b=B.
Clearly ΓB(b2)⊆Θ−1(sl2(b2))∩ΓBA(m). We compute Θ−1(acb−a)∩ΓB(m) for (acb−a)∈sl2(b2). If (αγβδ)∈Θ−1(acb−a)∩ΓB(m) then we must have β=b,γ=c,α−δ=2a, and 1=αδ−βγ. From this one calculates that α=a±1+a2+bc and δ=−a±1+a2+bc. But only one of these possibilities has α≡1≡δmodm and thus is in ΓB(m). That is, there is a unique element in Θ−1(acb−a)∩ΓB(m). It follows that Θ−1(sl2(b2))∩ΓB(m)=ΓB(b2), as desired.
We now prove that \big{(}\operatorname{{SL}}_{2}(B),\operatorname{{SL}}_{2}(A)\big{)}=\operatorname{{SL}}_{2}(A) when #B>3. By the first statement in the corollary, we know that \Gamma_{B}(\mathfrak{m}^{2})\subseteq\big{(}\operatorname{{SL}}_{2}(B),\operatorname{{SL}}_{2}(B)\big{)}, so we may assume that m2=0. Furthermore, the residual image of \big{(}\operatorname{{SL}}_{2}(B),\operatorname{{SL}}_{2}(B)\big{)} is \big{(}\operatorname{{SL}}_{2}({\mathbb{B}}),\operatorname{{SL}}_{2}({\mathbb{B}})\big{)}, which is equal to SL2(B). Therefore, it suffices to show that \left(\begin{smallmatrix}1+a&b\\
c&1-a\end{smallmatrix}\right)\in\big{(}\operatorname{{SL}}_{2}(B),\operatorname{{SL}}_{2}(B)\big{)} for any with a,b,c∈m. Since m2=0, we can decompose
[TABLE]
Let x∈B× such that x2≡1modm, which exists since #B>3. Note that for any β,γ∈m we have
[TABLE]
and
[TABLE]
It follows that \Gamma_{B}(\mathfrak{m})\subseteq\big{(}\operatorname{{SL}}_{2}(B),\operatorname{{SL}}_{2}(B)\big{)} and hence that SL2(B) is its own topological derived subgroup.
∎
Having calculated the derived subgroup of a congruence subgroup, we can now prove that fullness for closed subrings of A is inherited by restrictions to finite-index and coabelian subgroups.
Proposition 3.11**.**
Suppose that the pseudorepresentation (t,d):Π→A is B-full for a subring B of K. Let Π0 be a closed normal subgroup of Π so that Π/Π0 is abelian. Then (t∣Π0,d∣Π0) is also B-full. The same is true if Π0 is a closed finite-index subgroup of Π so long as B is not finite.
Proof.
First, assume that B is a closed subring of A.
Let ρ:Π→R× be a (t,d)-representation such that ρ(Π) contains ΓB(b) for some nonzero B-ideal b.
Write G:=ρ(Π) and let G0:=ρ(Π0). If Π0 is coabelian, then G/G0 is abelian, so that G0 contains the derived subgroup [G,G]. In particular, G0 contains [ΓB(b),ΓB(b)], which is ΓB(b2) by Corollary 3.10. Since A is a domain, b2 is nonzero if b is.
Suppose alternatively that Π0 is finite index in Π and B is not finite. Replacing b by mA∩b (that is,
if
b=B, we replace b by the maximal ideal mA∩B of B, nonzero by the assumption on B), we note that ΓB(b) is contained in ρ(Γ) for Γ:=kerρ⊆Π. Let Γ0 be the normal core
of Γ∩Π0 inside Γ, so that Γ0 is a finite-index normal subgroup of Γ contained in Π0. Since Γ is pro-p, and hence pro-solvable, and Γ/Γ0 is finite, Γ0 must contain the nth derived subgroup of Γ for some n≥1. Therefore ρ(Γ0), and hence ρ(Π0), contains the nth closed derived subgroup of ΓB(b), namely ΓB(b2n) (Corollary 3.10). This last is again a nontrivial congruence subgroup of SL2(B).
For arbitrary B, Corollary 3.7 allows us to assume that B is a subring of A. If (t,d) is B-full of level b for some ideal b of B, then the argument above and the first part of Proposition 3.4 tell us that (t∣Π0,d∣Π0) is B-full of level b2n for some n≥1. This ideal is the closure of the ideal b2n of B, so we are done by the second part of Proposition 3.4.
∎
We conclude that fullness is unchanged under twisting.
Corollary 3.12**.**
Let (t,d):Π→A be a pseudorepresentation and χ:Π→A× a continuous character. If (t,d) is B-full for some subring B of K, then (χt,χ2d) is also B-full.
Having established the terms of the investigation — finding congruence subgroups for fullness rings (Definition 3.1) contained in images of GMA-valued representations (Proposition 2.11) — in this section we search for optimal (fullness peerage equivalence classes of) fullness rings. Our first stop is the ring fixed by the conjugate self-twists of (t,d) (Definition 4.1 below), symmetries that naturally limit its image. Although this fixed-by-twist-automorphisms subring is a fullness ring for the historical big-image results that serve as our inspiration, one cannot expect fullness with respect to the ring fixed by conjugate self-twists in the general setting.
Indeed, there may not be enough automorphisms to carve down to a fullness ring as illustrated in Examples 4.3 and 4.10, reflecting the limits of Galois theory.
Instead of trying to cut out a fullness ring from above, we build one from below by considering the trace ring A0 of the adjoint pseudorepresentation. This adjoint trace ring (Definition 4.6 below) does not change when twisting (t,d) by a character, and moreover is morally expected to be pointwise fixed by all conjugate self-twists. Because of topological considerations, we do not actually show that A0 is fixed by all conjugate self-twists (Corollary 4.24) until after we prove our main fullness result, so this idea is merely a guiding principle — except for (t,d) whose determinant is A0-constant (Definition 4.14), a condition expected to be satisfied by all intended applications.
The main result of this section is Corollary 4.20:
if (t,d) has A0-constant–determinant, then A0 and the ring fixed by conjugate self-twists are fullness peers. We crucially use this fullness peerage result when deriving our A0-fullness results for certain constant-determinant pseudorepresentations satisfying our mild conditions (Corollary 9.16), which we then propagate to all such pseudorepresentations using the twist-invariance of A0 (Theorem 10.1).
4.1. Conjugate self-twists
Recall that A is a local pro-p noetherian ring and Π is a p-finite profinite group. Fix a continuous pseudorepresentation (t,d):Π→A with trace algebra At.
Definition 4.1**.**
If A is a domain, let B be a domain extending it; otherwise let B=A.
A (B-valued) conjugate self-twist of (t,d) is a pair (σ,η), where σ is an automorphism of B as a Zp-algebra and η:Π→B× is a character. We also consider Σt(B/C), the conjugate self-twists whose automorphisms Σt(B/C) fix a subring C of B.
The set of all B-valued conjugate self-twists of a pseudorepresentation forms a group Σt(B), with composition law (σ1,η1)∘(σ2,η2)=(σ1σ2,η1σ1η2)
and inverse (σ,η)−1=(σ−1,σ−1η−1).
Forgetting the character is therefore is a group homomorphism Σt(B)→AutZp(B) whose image we denote Σt(B). If A is a domain and ρ:Π→GL2(A) is a semisimple representation, we use the notation Σρ(B):=Σtrρ(B) and Σρ(B):=Σtrρ(B).
The kernel of the forget-the-character map are the dihedral conjugate self-twists Σtdi:
[TABLE]
If (1,η) is a nontrivial dihedral conjugate self-twist, then one can check that H:=kerη is an index-2 subgroup of G, that (t∣H,d∣H) is a sum of two characters, and that (t,d) is carried by the induction of either of them.
In particular η is quadratic so that Σtdi does not depend on B. Moreover, when B is a field, H, and hence η, is uniquely defined by (t,d) unless the projective image of the semisimple ρ carrying (t,d) is the Klein-4 group, in which case there are three possibilities for H. In other words, when B is a field, Σtdi is abelian dihedral if ρ is projectively dihedral, cyclic of order 2 if ρ is reducible of projective order 2, and trivial in all other cases. See Lemmas A.6 and A.7, and the proof of Lemma 7.1 for details.
Proposition 4.2**.**
If χ:Π→A× is a continuous character and B is an extension of A, then
[TABLE]
Proof.
The map (σ,η)↦(σ,σχχ−1η) realizes the isomorphism.
∎
If a conjugate self-twist (σ,η) happens to be A-valued, then the automorphism σ is automatically continuous: since A is local, algebraic automorphisms automatically send the maximal ideal to itself, and since A is noetherian, the maximal ideal defines the topology.
It turns out that an A-valued conjugate self-twist character η must also be continuous — see Proposition 4.4(3) below. But in general we cannot expect that all conjugate self-twists can be restricted to ones defined over A, as seen in Example 4.3 below illustrating a failure of normality in the sense of Galois theory.
Example 4.3**.**
For any odd prime p, let Π be the subgroup of GL2(Zp[pp]) generated by GL2(Zp) and the scalar matrix 1+pp.
Let ρ:Π→GL2(Zp[pp]) be the inclusion, a representation with trace algebra A=Zp[pp]. Note that Σρ(A) is trivial, since A has no automorphisms — but that’s not the whole story. Let B=Zp[pp,ζp] and consider the automorphism σ in Aut(B) sending pp to ζppp, together with the character η:Π→B× with kernel GL2(Zp) sending 1+pp to (1+ζppp)(1+pp)−1. Then (σ,η) is a B-valued conjugate self-twist of ρ.
One can show that ρ is not A-full (trρ does not span an ideal of A; see Proposition 3.3). On the other hand, ρ is visibly Zp-full, corresponding to the fact that Zp is the ring fixed by Σρ(B).
∎
Because it is not immediately clear that we can demand that relevant extensions of A be endowed with a sensible topology (if A is a domain we already cannot expect a topology on Q(A): see p. 1.6), there is no way to require conjugate self-twists to be continuous. On the other hand, since trace algebras are topologically generated rings, they only behave well under conjugate self-twists satisfying continuity conditions. As a result, much of this section is devoted to finding settings where conjugate self-twists are A-valued and hence continuous. This will eventually allow us to show that all conjugate self-twists of constant-determinant pseudorepresentations satisfying the mild conditions of B are continuous: see
Corollary 4.24.
Write Atalg for the W(F)-subalgebra of A algebraically generated by t(Π), so that the trace algebra At is the closure of Atalg in A.
Proposition 4.4**.**
Let (σ,η) be a B-valued conjugate self-twist.
(1)
If (t,d) has constant determinant, then η is W(F)-valued and finite order.
2. (2)
If η is Atalg-valued (for example, if η is W(F)-valued), then σ restricts to an automorphism of Atalg, and there is an At-valued conjugate self-twist (σ′,η) satisfying σ′∣Atalg=σ∣Atalg.
3. (3)
If η is A-valued, then η is continuous.
Proof.
(1)
We follow [Mom81, 1.5]. Recall that d=s(d) has finite order by the assumption.
Since
σd=η2d
we have
η2=σdd−1. Since d is finite order, σd must be a power of d. We now claim that σdd−1 has a power of d as a square root, so that η is differs from a power of d by at most a quadratic character and hence takes values in W(F).
Indeed, if d has odd order, then d itself has a power-of-d square root, so that any power of d has the same property. And if d has even order, then since σ preserves orders, σd must be an odd power of d; which means that σdd−1 is an even power of d and hence has a power-of-d square root.
2. (2)
Given that η and t are both Atalg-valued,
σt=ηt is Atalg-valued as well. Since B is either a domain or a local ring and σ is a Zp-algebra homomorphism, it follows that σ permutes W(F), and hence it permutes Atalg as well. Moreover, the action of σ is continuous on Atalg in the topology from At: with malg:=m∩Atalg, we have σ(malg)⊆η(Π)malg⊆malg.
Therefore σ∣Atalg extends uniquely to a continuous automorphism σ′ of the closure At of Atalg. Finally, since σ and σ′ agree on t(Π), the pair (σ′,η) is still a conjugate self-twist, this time At-valued.
3. (3)
First, we claim that ker(t,d)⊆kerη, so that η factors through G:=Π/ker(t,d). Indeed, for g∈ker(t,d) we have in particular t(g)=2 so that η(g)=σt(g)/t(g)=1.
Since ker(t,d) is closed, G=Π/ker(t,d) is still p-finite profinite, and we check continuity of η as a character on G. Moreover Γ:=kerρˉ/ker(t,d)⊆G is a finite-index pro-p subgroup of G [Che14, Lemma 3.8].
By p-finiteness, Γ is topologically finitely generated, so that any finite-index subgroup of Γ is open in Γ [Ser97, §4.2 exercise 6(d)], and hence in G. Now use the fact that A×≅F××(1+m) to write η=η(p)ηp, where ηp:G→1+m is a pro-p character and η(p):G→F× has prime-to-p order.
Then η(p) is continuous because its kernel contains the open pro-p subgroup Γ.
And ηp is continuous because the preimage U⊆Γ of any open subgroup of 1+m along ηp∣Γ is finite index in Γ, hence open in G. Therefore η is continuous.∎
Corollary 4.5**.**
If (t,d) has constant determinant, then any conjugate self-twist (σ,η) of (t,d)
has η continuous, finite-order, and W(F)-valued. Moreover,
there exists an automorphism σ′ of At agreeing with σ on Atalg so that (σ′,η) is an At-valued conjugate self-twist of (t,d).
A posteriori after our main fullness results, we will deduce that, at least for most constant-determinant (t,d) that are not a priori small, σ restricts to an automorphism of At, necessarily continuous, and consequently σ′=σ∣At: see Theorems 10.1 and 5.4.
4.2. The adjoint trace ring
Quite generally, if V is a 2-dimensional vector space over a field F, then the adjoint action of matrices m∈GL(V) on EndF(V)0 — that is, the conjugation action on the vector-space of trace-zero endomorphisms — has trace (trm)2/detm and factors through PGL(V) since scalars act trivially.
By analogy, we define the adjoint-trace elements and the adjoint-trace ring attached to (t,d).
Definition 4.6**.**
The adjoint trace ring of (t,d), denoted A0,t or simply A0, is the closed subring of A topologically generated by the adjoint-trace elementstradt(g):=t(g)2/d(g) for g∈Π.
Proposition 4.7**.**
The adjoint trace ring A0 of any pseudorepresentation of a p-finite profinite group is a local noetherian pro-p ring. In particular, it is N2 (see p. 1.6).
Proof.
By definition, A0 is a closed subring of the local pro-p ring A, so that A0 is also local and pro-p. Moreover, by construction A0 is the trace ring of a pseudodeformation of ad0ρ, where ρ is the semisimple residual representation carrying (t,d). Since Π is p-finite, the universal deformation ring of the pseudorepresentation associated to ad0ρ is noetherian, hence so is its quotient A0.
∎
Adjoint-trace elements don’t change when \big{(}t(g),d(g)\big{)} is replaced by \big{(}\alpha t(g),\alpha^{2}d(g)\big{)} for nonzero scalars α. This has two consequences. First, the adjoint-trace ring is unchanged under twisting.
Proposition 4.8**.**
If χ:Π→A is a continuous character, then A0,t=A0,χt.
Proof.
The adjoint-trace elements of (t,d) and of (χt,χ2t) are the same: d(g)t(g)2=χ2(g)d(g)(χ(g)t(g))2.
∎
To state the second, we define A0,talg⊆A0,t as the subring generated algebraically rather than topologically by the values of t2/d, so that by definition A0,t is the closure of A0,talg in A.
Proposition 4.9**.**
If (σ,η) is an arbitrary conjugate self-twist of (t,d), then σ fixes A0,talg pointwise.
Proof.
The adjoint-trace elements are fixed by any σ. Indeed, for g∈Π,
[TABLE]
In spite of Proposition 4.9, there may not be enough automorphisms to cut out A0 or A0alg, as the following example illustrates.
Example 4.10**.**
Let G=GL2(Fp[[x]]) and let \rho:G\to\operatorname{GL}_{2}\big{(}\mathbb{F}_{p}\llbracket x\rrbracket\big{)} be the inclusion representation. Define χ:G→Fp[[x1/p]]× as follows: for g∈G let dg:=detg∈Fp[[x]], and set χ(g):=dg(x1/p). Then χ is a character to A:=Fp[[x1/p]], and we consider the representation \rho\otimes\chi:G\to\operatorname{GL}_{2}\big{(}A). Since ρ has no nontrivial conjugate self-twists — or, said more precisely, for any extension B of A0:=Fp[[x]], we have Σρ(B/A0)≅Aut(B/A0), each appearing with trivial character — the conjugate self-twists of ρ⊗χ defined over an extension B of A are all of the form (σ,σχ/χ) for σ∈Aut(B/A0) (Proposition 4.2). Since any automorphism that fixes A0 also fixes A, the ring fixed by all conjugate self-twists of ρ⊗χ is A itself. But in this case it’s clear by inspection that A is not a fullness ring (see, for example Proposition 3.3). On the other hand A0 is a fullness ring — the image of ρ⊗χ contains SL2(A0).
∎
In the next section we show that, working over fields rather than rings, one can always cut out the analogue of A0 with conjugate self-twists provided the usual conditions from Galois theory hold.
4.3. Interlude: the theory over abstract fields
In this subsection we switch gears to a topology-free setting, where we show that under Galois-theoretically favorable conditions, the adjoint trace field (defined below) is exactly the fixed field of all conjugate self-twists. This result allows us to give an interpretation of the residue field of A0 as the fixed field of residual conjugate self-twists, and is both inspiration and key in deducing an analogous result for (a generalization of) constant-determinant pseudorepresentations in our pro-p setting in the next subsection — a lodestar and a tool.
Let F be an arbitrary field whose characteristic is not 2, G an abstract group, and (t,d):G→F a pseudorepresentation. For an extension L of F, define an L-valued conjugate self-twist (σ,η) of (t,d) as in Definition 4.1, but where σ is simply a field automorphism rather than a morphism of Zp-algebras. Write Σt(L) for the set of L-valued conjugate self-twists of (t,d), and Σt(L) for the group of automorphisms appearing in Σt(L). Define the adjoint-trace field of (t,d) to be the subfield F0:=F0,t generated by the adjoint-trace elements t(g)2/d(g) over all g∈G. Then as in Proposition 4.9, the adjoint trace field F0,t is fixed by all conjugate self-twist automorphisms, so that F0,t⊆LΣt(L). The following theorem gives conditions that guarantee reverse containment.
Theorem 4.11**.**
Let L be a separably closed extension of F, and E⊆L an extension of F0,t. Then
[TABLE]
In particular, Σt(Fsep)=Aut(Fsep/F0,t).
Proof.
To show that Aut(L/E)⊆Σt(L/E), we start with σ∈Aut(L/E) and produce a twist character. Let ρ be a semisimple representation over a finite extension F′ of F carrying (t,d). Since σ fixes the adjoint trace algebra F0⊆E, we have tradσρ=tradρ. Moreover, adρ is semisimple if ρ is. By Brauer-Nesbitt, therefore, the multiplicities of irreducible representations of G inside adρ and inside ad(σρ) are equal in (the prime field of) F. If F has characteristic [math] or characteristic p≥5, then we can already conclude that adρ≅ad(σρ). If F has characteristic 3, then we split off a trivial character acting on the center using adρ=1⊕ad0ρ and need only eliminate the case that ad0ρ≅ϕ⊕3 and ad0σρ≅σϕ⊕3 for some character ϕ with ϕ=σϕ. But since at least one of the eigenvalues of ad0ρ(g) is 1 for any g∈G (see Lemma A.4), this is impossible. Finally, Theorem A.10 says that, since adρ≅adσρ as representations over L, there is a character η:G→L× such that σρ≅η⊗ρ. Therefore (σ,η) is a conjugate self-twist!
The second statement is a special case of the first since every conjugate self-twist fixes F0,t.
∎
Corollary 4.12**.**
(1)
If F/F0,t is a separable extension, then (Fsep)Σt(Fsep)=F0,t.
2. (2)
Suppose that L is a separably closed extension of F, and E⊆L is an extension of F0,t.
If L/E is separable, then LΣt(L/E)=E.
Both statements follow from Theorem 4.11. The conditions of Corollary 4.12(2) are nontrivially satisfied, for example, if F0,t=Q, F=Q(32), E=Qp and L=Qp. Or see Theorem 10.3(4).
In particular Corollary 4.12(1) for F=F finite applies in our pro-p setting. In this case, every pseudorepresentation (t,d):Π→F is carried by a unique semisimple ρ:Π→GL2(F). Write E for the adjoint trace field F0,ρ.
Corollary 4.13**.**
E=FΣρ(F)=FΣρ(F)**
Proof.
The first equality is Corollary 4.12(1). The second follows from Corollary 4.5, which implies that every conjugate self-twist of ρ restricts to one defined over F.
∎
Our next goal is to limn a setting where the pro-p analogue of the extension F/F0,t is Galois, so that we can obtain analogues of Theorem 4.11 and Corollary 4.12(1).
4.4. A0-constant determinant and simple conjugate self-twists
We now return to our pro-p setting and consider a restriction on the determinant of (t,d) that shares many properties with the constant-determinant case, but is mild enough to be expected to be satisfied by all our arithmetic applications. This constraint allows us to restrict our attention to conjugate self-twists valued in the trace algebra.
Definition 4.14**.**
We say that (t,d) has A0-constant determinant if d is the product of an A0-valued character and a character of finite prime-to-p order. Said another way, d is A0-constant if its pro-p part d1 takes values in A0. If (t,d) has A0-constant determinant, we call the conjugate self-twists in Σt(At)simple. For brevity we write Σt and Σt for these simple conjugate self-twists in Sections 7 through 9.
By automatic continuity, all automorphisms in Σt(At) fix A0, so that Σt(At)=Σt(At/A0). The following lemma shows that twist characters are also continuous.
Lemma 4.15**.**
Let (t,d):Π→A be an A0-constant-determinant pseudorepresentation, and let (t′,d′) be its constant-determinant twist obtained by twisting off the pro-p part d1 of d. Then:
(1)
At=At′**
2. (2)
Σt(At)=Σt′(At′)**
3. (3)
If (σ,η) in Σt(At) is a simple conjugate self-twist, then η is W(F)×-valued and continuous.
Proof.
First note that (t,d) and (t′,d′) have the same adjoint trace algebra A0, since the latter is twist invariant (Proposition 4.8). Write χ for d1−1/2, a continuous A0-valued character. Note that A0⊂At∩At′. Since χ and χ−1 are both valued in A0⊂At∩At′, we can move back and forth using t′=χt and t=χ−1t. That is At′=Aχt⊆At and At=Aχ−1t′⊆At′. Part (1) follows. For (2), because χ is fixed by σ, the map Σt(At)→Σχt′(At) from Proposition 4.2 sending (σ,η) to (σ,σχχ−1η) is the identity. For (3), use (2) and Corollary 4.5, or redo Proposition 4.4(1) directly and use Proposition 4.4(3).
∎
4.5. K/K0 as a Galois extension
Finally we assume that A is a (local pro-p) domain and fix an A0-constant-determinant pseudorepresentation (t,d):Π→A. By replacing A by At, we may assume that A is the trace algebra of (t,d); let K be the fraction field of A. Recall that ρ is the semisimple representation Π→GL2(F) carrying (tˉ,d).
Let A0 be the adjoint trace ring of (t,d) with fraction field K0; write E for the adjoint trace ring of ρ. Here F is the residue field of A and E is the residue field of A0.
Let Γ:=kerρ⊂Π. This is a finite-index normal subgroup of Π, and we can restrict (t,d) to Γ to obtain (t∣Γ,d1).
Lemma 4.16**.**
We have t(Γ)⊂A0.
Proof.
For γ∈Γ, we have t(γ)=2+m for some m∈m. The corresponding adjoint-trace element is t(γ)2/d1(γ)=d1(γ)−1(4+4m+m2)∈A0. Since d1(γ)−1∈A0 by assumption, so is 4+4m+m2. But any closed subring containing 4+4m+m2 contains 2+m=t(γ) as well since 2 is invertible.
∎
Proposition 4.17**.**
A* is a finite A0-algebra.*
Proof.
The subgroup Γ:=kerρˉ is a finite-index normal subgroup of Π. Moreover, At∣Γ⊆A0 (Lemma 4.16). Thus K0 contains the trace field of t∣Γ, so that by Lemma A.12 there is an at-most quadratic extension L0 over K0 over which we can define a representation carrying (t∣Γ,d1). By the proof of Proposition A.13, there is a finite extension L over L0 containing t(Π). In fact, letting Aalg be the W(F)A0-submodule of A (algebraically, not topologically) generated by t(Π), we have Aalg⊆L. Note that A is the closure of Aalg.
We claim that Aalg is integral over A0. For every g∈Π, the element t(g) is a square root of t(g)2, which is in A0[d]⊆W(F)A0. Since Aalg is generated by the t(g) over W(F)A0, the former is integral over the latter. The claim follows.
Finally, let B be the integral closure of A0 in L. Since A0 is noetherian and N2 (Proposition 4.7), B is a finite, hence noetherian, A0-algebra. Because B contains Aalg by integrality, the latter is also finite over A0. But this means that Aalg is a sum of finitely many compact submodules of A, so compact itself, hence closed. In other words, Aalg=A. Therefore A is finite over A0.
∎
Corollary 4.18**.**
A* is a multiquadratic extension of W(F)A0.
More precisely, there is an integer n prime to p, elements a1,…,ar of A0, and integers k1,…,kr modulo n so that for ζ=ζn a generator of s(F×) we have*
[TABLE]
Proof.
Let n be the order of F×, so that ζ generates W(F) and A0[ζ]=W(F)A0.
Since A0 contains
[TABLE]
for every g∈Π and k(g) depending on g, we see that A is topologically generated over W(F)A0 by t(g)=ζk(g)a(g). By Proposition 4.17 and compactness only finitely many of these are needed.
∎
Theorem 4.19**.**
The extension K over K0 is finite abelian, with
[TABLE]
We give two different arguments both fundamentally rooted in Theorem 4.11.
Since K=K0(ζ,ζk1a1,…,ζkrar) by Corollary 4.18 (and using the same notation), it is clear that Aut(K/K0)=Aut(A/A0). To see that K/K0 is Galois, note that it is a subextension of K0(ζ2n,a1,…,ar)/K0, which is a compositum of abelian extensions and hence abelian as well. Here we use the fact that the characteristic of (the prime field of) K is either zero or an odd prime p, so that K0(ζ2n)/K0 for n prime to p and the K0(ai)/K0 are all separable and hence abelian. Therefore K/K0 is abelian with Gal(K/K0) a subgroup of (Z/2Z)r×(Z/nZ)×.
To see that Aut(A/A0)=Σt(A), start with σ∈Aut(A/A0) and proceed as in the proof of Theorem 4.11 to produce a character η:Π→K× with (σ,η) a conjugate self-twist of (t,d). By Lemma 4.15, η is A-valued, so that (σ,η) is in Σt(A).
∎
As in the first proof, it is clear that K is separable over K0. First suppose that (t,d) has constant determinant. To show that K/K0 is normal, take an embedding σ of K into K over K0. Proceed as in the proof of Theorem 4.11 to get a character η:Π→K× so that (σ,η) is a K-valued conjugate self-twist. Use Corollary 4.5
to find σ′ so that (σ′,η) is an A-valued conjugate self-twist. Note that σ′∣W(F)A0=σ∣W(F)A0 since both σ and σ′ fix A0 and act the same way on W(F) by construction. Therefore σ′σ−1 is an embedding of K that fixes Q(W(F)A0)=K0(ζ). Since K/K0(ζ) is a multiquadratic extension, it is Galois, so σ′σ−1 sends K to itself. But σ′ also sends K to itself by construction, so σ must be an automorphism of K. In fact, using that σ′σ−1 fixes K0 and t(Π), we see that σ∣A=σ′ and thus Gal(K/K0)=Σt(A). The fact that Σt(A) is abelian in the constant-determinant case follows from Corollary 7.2 and the diagram following Corollary 7.4.
In general when (t,d) has A0-constant determinant, the result follows from the constant determinant-case and Lemma 4.15.
∎
Corollary 4.20**.**
We have K0=KΣt(A). Moreover, AΣt(A) is a finite extension of A0 with the same field of fractions and the same normalization.
Proof.
The first statement follows from Theorem 4.19. Since A0⊆AΣt(A)⊆KΣt(A)=K0, the field of fractions of AΣt(A) is K0. Finally, since A is integral over A0, so is AΣt(A).
∎
Corollaries 4.20 and 3.8 imply the following fullness comparison result for two key subrings of A.
Corollary 4.21**.**
The rings A0 and AΣt(A), as well as their normalizations, are all fullness peers.
The example below shows that proper containment A0⊊AΣt(A) is possible: A0 and AΣt(A) need not have the same residue field.
Example 4.22**.**
Define
[TABLE]
so that in each set above a and b are in Fp2 satisfying N(a)=N(b). Let G⊂GL2(Fp2[[X]]) be the set of matrices whose determinants are in Fp2, and let G⊂G be the set of invertible matrices that are residually in G. Finally, let Π:=G and set ρ:Π→GL2(Fp2[[X]]) to be the inclusion map, a constant-determinant representation. Then the trace algebra A of ρ is A=Fp2[[X]] — indeed, fix a generator β of Fp2×, so that N(β)=βp+1=1. Then for every a∈Fp2×, G contains both
[TABLE]
so that A contains both trga=2+aX and trha=a−ap. A similar computation shows that the adjoint trace ring is A0=Fp+XFp2[[X]]: on one hand, A0 contains
[TABLE]
for every a∈Fp2×, and on the other hand every element of A0 is residually in Fp, reflecting the fact that ρ admits a conjugate self-twist and illustrating Corollary 4.13. Since ρ itself has no conjugate self-twist — any automorphism of A appearing in a conjugate self-twist must fix A0 and hence extends to the identity on K0, which contains A — we have AΣρ(A)=A.
∎
We close with a technical observation necessary for our a posteriori justification for focusing on A-valued conjugate self-twists for A0-constant–determinant pseudorepresentations.
Proposition 4.23**.**
Let (σ,η) be any conjugate self-twist of an A0-constant–determinant pseudorepresenation (t,d). Its trace algebra A is σ-stable if and only if σ fixes A0 pointwise. Both of these conditions are satisified if (t,d) is A0-full.
Proof.
If σ restricts to an automorphism of A, then σ∣A is automatically continuous, so that σ fixing A0alg pointwise per Proposition 4.9 is enough to conclude that σ fixes all of A0. Conversely, if A0 is fixed by σ, then since A is finite over A0 in this setting (Proposition 4.17), Aalg is dense in A, and A is noetherian, we can express A=a1A0+⋯+anA0 for ai∈Aalg, so that σ(A)⊆A.
If (t,d) is A0-full, then Aalg contains some nonzero A0-ideal b0 (Proposition 3.3). The expression A=a1A0+⋯+anA0 for ai∈Aalg implies that b0A⊆Aalg, whence Aalg and A are fullness peers, and in particular have the same field of fractions (Lemma 3.5). Since A⊆Q(Aalg), the values of σ on A are entirely determined by the restriction of σ to Aalg. Therefore the action of σ on A coincides with the action on A by the continuous extension of σ∣Aalg guaranteed by Corollary 4.5. Thus σ restricts to an automorphism of A, as claimed.
∎
Corollary 4.24**.**
Let (t,d):Π→A be any pseudorepresentation. If (t,d) is full for its adjoint trace ring A0, then every arbitrary conjugate self-twist of (t,d) fixes A0 pointwise.
Proof.
Let (σ,η) be a conjugate self-twist of (t,d), and let (t′,d′)
be its constant-determinant twist. Then (σ,η′) is a conjugate self-twist of (t′,d′) for some character η′ (Proposition 4.2). Since neither fullness nor A0 is changed under twisting (Corollaries 3.12 and 4.8), the constant-determinant twist (t′,d′) is still A0-full, and hence by Proposition 4.23σ fixes A0 pointwise.
∎
Combined with our main A0-fullness result (Theorem 10.1), Corollary 4.24 allows us to conclude a posteriori that under mild conditions on non–a-priori-small (t,d), all conjugate self-twists of (t,d) fix all of A0, resolving the topological concerns we have danced around in Sections 4.1 and 4.2. Of course one continues to hope for a less circuitous argument that works for all (t,d).
5. Optimality
Armed with the notions of conjugate self-twists and the adjoint trace ring from Section 4, we prove two related optimality results. We show that the adjoint trace ring of a pseudorepresentation (t,d) contains a fullness peer of any fullness ring for (t,d) (Theorem 5.3). In this way we establish the adjoint trace ring as the optimal fullness ring up to fullness peerage. We also show that every fullness ring is fixed by all conjugate self-twists (Theorem 5.4), generalizing Corollary 4.24.
As usual, A is a local noetherian pro-p domain carrying a continuous pseudorepresentation (t,d):Π→A of a p-finite profinite group Π. We assume that A is a domain with field of fractions K. Also let A0 and K0 be the adjoint trace ring of (t,d) (Definition 4.1) and its fraction field, respectively. Recall that (t,d) is B-full for any subring B of K if there exists a (t,d)-representation ρ:Π→R× and an embedding R↪M2(K) such that via this embedding Imρ⊇ΓB(b) for some nonzero ideal b of B (Definition 3.1).
We first show that fullness rings are fixed by conjugate self-twist automorphisms whose characters are continuous, or at least nearly so.
Proposition 5.1**.**
Suppose (t,d) is B-full for some subring B of A. Let (σ,η) be a conjugate self-twist of (t,d) valued in any domain extending A. If kerη contains a subgroup H, closed and normal in Π, such that Π/H is abelian, then σ fixes B pointwise.
In particular, if (σ,η) is A-valued, then σ always fixes B pointwise.
Proof.
By Proposition 3.4 we may replace B by its closure in A. On one hand, since H⊆kerη, we have that t(H) is fixed by σ. On the other hand, by Proposition 3.11, (t∣H,d∣H) is still B-full, so that by Proposition 3.3 applied to (t∣H,d∣H):H→A, the Z-span of t(H) contains some nonzero B-ideal b. Therefore every element of b is fixed by σ. Since any element of B is a ratio of elements of b (if b=0 is in b, then x=bxb for any x∈B), every element of B is fixed by σ.
If (σ,η) is A-valued, then η is continuous (Proposition 4.4(3)) so that we can take H=kerη.
∎
Corollary 5.2**.**
If (t,d) is B-full for some subring B of K, then B⊆K0.
Our main optimality theorem is an improvement on Corollary 5.2: any fullness ring is not only contained in K0, it in fact has a fullness peer subring contained in A0.
Theorem 5.3**.**
Suppose (t,d) is B-full for some subring B of K. Then B∩A0 is a fullness peer of B contained in A0.
Proof.
First suppose that B is finite. Then B is a finite field; its only nonzero ideal is itself, and the only congruence subgroup of SL2(B) is SL2(B) itself. In this case, B has no proper subring fullness peers and we show directly that B is contained in A0. Indeed, K, and hence A, is now an Fp-algebra, and it follows that ρ is also B-full. Now Proposition 5.1 applied to ρ tells us that B is fixed by all conjugate self-twist automorphisms of ρ: in other words, B⊆E (Corollary 4.13), which is the residue field of, and hence here contained in, A0.
Now assume that B is not finite. Use Corollary 3.12 to replace (t,d) by its constant-determinant twist and let At⊆A be its trace algebra. Replace B by its fullness peer B∩At (Corollaries 3.12 and 4.8). Let Γ=kerρ, a finite-index closed subgroup of Π. By Proposition 3.11, the restriction (t∣Γ,d∣Γ) is B-full. By Proposition 3.3, A0, which contains the trace algebra of Γ (Lemma 4.16), contains a nonzero ideal of B. Therefore so does B∩A0, making B∩A0⊆B an extension of fullness peers by Lemma 3.5.
∎
We do not know whether we can sharpen Theorem 5.3 to conclude that any closed fullness ring B contained in A0 with Q(B)=K0 must in fact be a fullness peer of A0. This would follow from an affirmative answer to 3.6.
We close with an observation that fullness rings are always fixed by conjugate self-twist automorphisms. Corollary 4.24 has already established this for A0; Theorem 5.4 below is a generalization.
Theorem 5.4**.**
Let (t,d):Π→A be a pseudorepresentation. If (t,d) is B-full for some subring B of
K, then B is fixed by any conjugate self-twists valued in any
domain extending A containing B.
Note that any automorphism valued in a domain E containing A extends uniquely to an automorphism of Q(E), which contains K and hence B.
Proof.
Let (t′,d′) be the constant determinant twist associated to (t,d), obtained by twisting off the pro-p part d1 of d. As in Proposition 4.2, if (σ,η) is an E-valued conjugate self-twists (t,d) for some domain E extending A,
then \big{(}\sigma,\ \sigma(d_{1}^{-1/2})d_{1}^{1/2}\eta\big{)} is a E-valued conjugate self-twist of (t′,d′).
By Corollary 4.5, χ=σ(d1−1/2)d11/2η is continuous and W(F)-valued.
Note that kerd1=kerd11/2=kerσ(d1−1/2) is closed since d1 is also continuous. Thus kerd1∩kerχ is a closed subgroup of kerη. Moreover, since the order of χ is prime to p and d1 is pro-p we get that
[TABLE]
is abelian. Thus we can take H=kerd1∩kerχ in Proposition 5.1 to deduce that B∩A
is fixed by σ. As B∩A and B are fullness peers, σ thus fixes an ideal of B, hence all of B.
∎
6. Fullness for Bρ(E)
Throughout this section we fix a local pro-p ring A, not necessarily a domain, with residue field F. Fix an admissible pseudodeformation (Π,ρ,t,d) over A. Let A0 be the adjoint trace ring of (t,d) and E the residue field of A0.
6.1. L2(ρ) is a W(E)-module
Recall that, a priori, Pink’s construction only gives Lie algebras that are Zp-modules. The goal of Section 6.1 is to show that if ρ is a (t,d)-representation, then in fact its associated Lie algebras are modules over W(E) (Proposition 6.2). Although this is a minor improvement on Zp (indeed, it is no improvement at all if W(E)=Zp), it is an essential input for proving the results of Section 9.
We assume throughout Section 6.1 that the eigenvalues of ρ(g) are in F× for all g∈Π. This requires at most replacing F by its unique quadratic extension.
Let λ=μ∈F× be the eigenvalues of a matrix in Imρ. By Lemma 2.14, there is a (t,d)-representation ρλ,μ:Π→Rλ,μ× and gλ,μ∈Π such that
[TABLE]
Recall that Gρλ,μ:=Imρλ,μ,Γρλ,μ:=Gρλ,μ∩SRλ,μ1, and Ln(ρλ,μ):=Ln(Γρλ,μ). Since λ=μ, the Lie algebra L1(ρλ,μ) is decomposable [Bel19, Corollary 6.2.2]. Note that although the Teichmüller map s is not additive, it is easy to check that W(Fp(λμ−1+λ−1μ))=Zp[s(λ)s(μ)−1+s(λ)−1s(μ)], a fact that we make frequent use of in Lemma 6.1 below.
Lemma 6.1**.**
With the notation introduced at the beginning of Section 2.5, we have
(1)
∇1(ρλ,μ),B1(ρλ,μ),C1(ρλ,μ),* and L2(ρλ,μ) are W(Fp(λμ−1+λ−1μ))-modules;*
2. (2)
if the projective image of ρ contains PSL2(Fp) and p≥7, then I1(ρλ,μ) is a W(Fp(λμ−1+λ−1μ))-module; after possibly replacing ρλ,μ with its conjugate by a certain (100a) with a∈A×, one has that L1(ρλ,μ) is a W(Fp(λμ−1+λ−1μ))-module.
Proof.
Note that
[TABLE]
Furthermore, from their definitions on page 2.18, it’s clear that B1(ρλ,μ) and C1(ρλ,μ) inherit whatever structure ∇1(ρλ,μ) has.
Therefore it suffices to show that ∇1(ρλ,μ) is a W(Fp(λμ−1+λ−1μ))-module.
To prove that
[TABLE]
recall that L1(ρλ,μ) is closed under conjugation by Gρλ,μ (in fact, by any element in the normalizer of Γρλ,μ). In particular, it is closed under conjugation by (s(λ)00s(μ)) and (s(λ)−100s(μ)−1). Using this, a short matrix calculation shows that if \bigl{(}\begin{smallmatrix}0&b\\
c&0\end{smallmatrix}\bigr{)}\in\nabla_{1}(\rho_{\lambda,\mu}) then
Finally, if the projective image of ρ contains PSL2(Fp) and p≥7, then by Theorem 2.23, up to replacing ρλ,μ with its conjugate by a certain (100a) with a∈A×, we have
[TABLE]
and thus B1(ρλ,μ)=I1(ρλ,μ)=C1(ρλ,μ). (Note that conjugation by (100a) with a∈A× does not change I1(ρλ,μ).) In particular, L1(ρλ,μ) is strongly decomposable. By the second statement of the lemma, we see that I1(ρλ,μ) is a W(Fp(λμ−1+λ−1μ))-module. The above description of L1(ρλ,μ) shows that it is also a W(Fp(λμ−1+λ−1μ))-module.
∎
Proposition 6.2**.**
Let ρ:Π→R× be a (t,d)-representation. Then Ln(ρ) is a W(E)-module for all n≥2. If the projective image of ρ contains PSL2(Fp) and p≥7, then L1(ρ) is a W(E)-module.
Proof.
By the generating set for E given in (2) in Section 2.4, it suffices to show that Ln(ρ) is closed under multiplication by s(λ)s(μ)−1+s(λ)−1s(μ) for all λ,μ∈Fp× that are distinct eigenvalues of an element in Imρ. Fix such λ,μ. Let ρλ,μ:Π→Rλ,μ× be the (t,d)-representation over A described prior to Lemma 6.1. Let us assume furthermore that, in the case when ρ is not projectively cyclic or dihedral, that we have already replaced ρλ,μ by its relevant diagonal conjugate so that the description of W(Fq)L1(ρλ,μ) from Theorem 2.23 applies to ρλ,μ.
Since ρ:Π→R× and ρλ,μ:Π→Rλ,μ× are both (t,d)-representations over A, it follows from Proposition 2.11 that there is a unique A-algebra isomorphism Ψ:Rλ,μ→R such that ρ=Ψ∘ρλ,μ. We claim that this implies that Ln(ρ)=Ψ(Ln(λ,μ)) for all n≥1. If this is true, then L2(ρ) is closed under multiplication by s(λ)s(μ)−1+s(λ)−1s(μ)∈A since L2(ρλ,μ) is by Lemma 6.1 and Ψ is an A-algebra homomorphism. Since L2(ρ) is a W(E)-module, it follows immediately from the definition that Ln(ρ) is a W(E)-module for all n≥2. Furthermore, the argument in this paragraph applies to L1(ρ) under the assumption that the projective image of ρ contains PSL2(Fp) for p≥7.
To see that Ln(ρ)=Ψ(Ln(ρλ,μ)), note that Gρ=GΨ∘ρλ,μ=Ψ(Gρλ,μ). Since Ψ is an algebra morphism, it follows that Ψ(radRλ,μ)=radR. Furthermore, since ρ and ρλ,μ are both (t,d)-representations, it follows that Ψ preserves determinants. Therefore Ψ(SRλ,μ1)⊃SR1. Since Ψ is a continuous algebra homomorphism, it follows directly from the definition of Θ that Ψ(L1(ρλ,μ))=L1(ρ) and hence Ψ(Ln(ρλ,μ))=Ln(ρ) for all n≥1.
∎
6.2. L2(ρ) is a Bρ(E)-module
In Section 6.2 we use Bellaïche’s work to show that, for any well-adapted (t,d)-representation ρ, Ln(ρ) is a module over a ring comparable to A. This is the key input into Corollary 6.6, which is our improvement on Bellaïche’s fullness theorem.
Proposition 6.3**.**
Let ρ be a (t,d)-representation adapted to (g0,λ0,μ0). Then
(1)
L2(ρ)* is a module over W(E)[I1(ρ)2]:=W(E)+W(E)I1(ρ)2;*
2. (2)
if n≥1 and Ln(ρ) is strongly decomposable, then Ln+1(ρ) is a module over Bρ(E);
3. (3)
if the projective image of ρ contains PSL2(Fp) for p≥7, then up to replacing ρ with its conjugate by some (100a) with a∈A×, L1(ρ) is a module over Bρ(E).
Proof.
Since ρ is adapted to (g0,λ0,μ0), it follows that L1(ρ) is decomposable [Bel19, Corollary 6.2.2]. Note that [I_{1}(\rho)\bigl{(}\begin{smallmatrix}1&0\\
0&-1\end{smallmatrix}\bigr{)},\nabla_{1}(\rho)]\subset\nabla_{1}(\rho) since L1(ρ) is a Lie algebra. That is, for all a∈I1(ρ) and \bigl{(}\begin{smallmatrix}0&b\\
c&0\end{smallmatrix}\bigr{)}\in\nabla_{1}(\rho), we have 2a\bigl{(}\begin{smallmatrix}0&b\\
-c&0\end{smallmatrix}\bigr{)}\in\nabla_{1}(\rho). To prove the first statement, we can apply this fact a second time to α∈I1(ρ) and 2a\bigl{(}\begin{smallmatrix}0&b\\
-c&0\end{smallmatrix}\bigr{)} to see that 4a\alpha\bigl{(}\begin{smallmatrix}0&b\\
c&0\end{smallmatrix}\bigr{)}\in\nabla_{1}(\rho). Therefore ∇1(ρ) is closed under multiplication by I1(ρ)2. Since
[TABLE]
we see that L2(ρ) is closed under multiplication by I1(ρ)2.
For the second statement, if Ln(ρ) is strongly decomposable, then we can write
[TABLE]
By calculating [(100−1),(0010)] and [(100−1),(0100)], we find that I1(ρ)Bn(ρ)=Bn+1(ρ)⊂Bn(ρ) and I1(ρ)Cn(ρ)=Cn+1(ρ)⊂Cn(ρ). Therefore Bn(ρ),Cn(ρ) are closed under multiplication by I1(ρ). Since
[TABLE]
it follows that Ln+1(ρ) is closed under multiplication by I1(ρ).
It would be nice to remove the assumption that L1(ρ) is strongly decomposable and still conclude that L2(ρ) is a Bρ(E)-module, but we do not see a way to do this.
∎
6.3. Regularity implies Bρ(E)-fullness
The goal of Section 6.3 is to establish a slightly stronger version of [Bel19, Theorem 7.2.3], which is Bellaïche’s Theorem 1.3 of the introduction. We do so in Corollary 6.6 below. Our result is different from that of Bellaïche mainly in that we can weaken his definition of regularity and enlarge his ring Bρ(Fp) to Bρ(E).
Throughout Section 6.3 the ring A will be a local pro-p domain with residue field F and field of fractions K. We fix an admissible pseudodeformation (Π,ρ,t,d) over A throughout this section. If ρ is a (t,d)-representation that is adapted to some (g0,λ0,μ0), then L1(ρ) is decomposable by [Bel19, Corollary 6.2.2]. Thus I1(ρ) is defined. We write K1 for the field of fractions of Bρ(E).
Proposition 6.5**.**
Assume that ρ is regular. Let ρ:Π→R× be a (t,d)-representation adapted to (g0,λ0,μ0) for a regular element g0 such that ρ(g0)=(s(λ0)00s(μ0)). If B1(ρ),C1(ρ)=0, then ρ is Bρ(E)-full.
Proof.
It is easy to see that the eigenvalues of ρ(g0)=(s(λ0)00s(μ0)) acting on Ln(ρ) by conjugation are 1,s(λ0)s(μ0−1),s(λ0−1)s(μ0), which are distinct elements of W(E)× since g0 is a regular element. Since Ln(ρ) is a W(E)-module for n≥2 by Proposition 6.2, it follows that Ln(ρ) is the direct sum of the eigenspaces for the conjugation action of ρ(g0). Thus, Ln(ρ) is strongly decomposable for n≥2. By Proposition 6.3, it follows that Ln(ρ) is an Bρ(E)-module for n≥3.
Since A is a domain, we may view R inside of M2(K) by Lemma 2.10. Note that if B1(ρ),C1(ρ)=0, then since In(ρ),Bn(ρ),Cn(ρ)⊂K, it follows that In(ρ),Bn(ρ), and Cn(ρ) are nonzero for all n≥1. In particular, I3(ρ),B3(ρ),C3(ρ) are nonzero Bρ(E)-modules.
Define
[TABLE]
Then R1 is a faithful GMA over Bρ(E). By the proof of [Bel19, Lemma 2.2.2], if 0=b0∈B3(ρ) and x=\bigl{(}\begin{smallmatrix}1&0\\
0&b_{0}\end{smallmatrix}\bigr{)}, it follows that xR1x−1⊆GL2(K1). Thus, by replacing ρ with xρx−1, which is still a (t,d)-representation adapted to (g0,λ0,μ0) that sends g0 to (s(λ0)00s(μ0)), we may assume that B3(ρ),C3(ρ)⊆K1. (Note that I1(ρ)=I1(xρx−1).)
Note that any nonzero Bρ(E)-submodule of K1 contains a nonzero element of Bρ(E) and thus contains a non-zero Bρ(E)-ideal. Therefore there exists a nonzero Bρ(E)-ideal b contained in I3(ρ)∩B3(ρ)∩C3(ρ). Hence sl2(b)⊆L3(ρ). Using Theorem 2.16 we deduce that ΓBρ(E)(b)⊂Imρ and ρ is Bρ(E)-full.
∎
Corollary 6.6**.**
Assume that ρ is regular. Let ρ be a well-adapted (t,d)-representation adapted to (g0,λ0,μ0) for a regular element g0 such that \rho(g_{0})=\bigl{(}\begin{smallmatrix}s(\lambda_{0})&0\\
0&s(\mu_{0})\end{smallmatrix}\bigr{)}. If (t,d) is not a priori small, then (t,d) is Bρ(E)-full.
Proof.
By Proposition 6.5 it suffices to show that B1(ρ),C1(ρ)=0. We do this by analyzing the different possibilities for ρ.
By Lemma A.5, we see that either ρ is reducible, dihedral, or ad0ρ is irreducible. Assume first that we are in the last case. The group Γ is equipped with a decreasing normal filtration
[TABLE]
whose quotients Γn/Γn+1 have the structure of Fp-vector spaces with an action of G by conjugation. Moreover, the map γ↦γ−1 gives a G-equivariant embedding Γn/Γn+1↪sl2(mn/mn+1); write Vn for its image. The assumption that (t,d) is not a priori small implies that Γ is nontrivial, so that Γn/Γn+1, and hence Vn, is nontrivial for some n≥1. To see that B1(ρ) (respectively, C1(ρ)) is nonzero, it suffices to show that this Vn contains an element whose upper right (respectively, lower left) entry is nonzero. This can be checked on the F-span of Vn. Choosing an F-basis x1,…,xd of mn/mn+1 gives a G-equivariant splitting sl2(mn/mn+1)=⊕i=1dsl2(F)xi. Since Vn is nontrivial, there is some i such that the projection W of FVn to sl2(F)xi is nonzero. Then W is a stable subspace of sl2(F)xi, which is simple since ad0ρ is irreducible. Thus W=sl2(F)xi and W, and hence FVn, contains an element whose upper right (respectively, lower left) entry is nonzero.(vii)(vii)(vii)Together with the observation that Bn(ρ),Cn(ρ)=0 for all n if it is true for n=1 from Proposition 6.5, the argument here gives an independent proof of the (2) ⟹ (1) implication in Proposition 2.4.
Now suppose that ρ is reducible. Since ρ is well adapted by assumption, it follows that ρ is adapted to (g0,λ0,μ0), where ρ(g0) generates the projective image of ρ. In particular, ρ is automatically adapted to a regular element. Suppose for contradiction that C1(ρ)=0 (respectively, B1(ρ)=0). Then Γρ is contained in the upper (respectively, lower) triangular matrices. By [Bel19, Theorem 6.2.1], we know that s(G)⊂Gρ since ρ is well adapted. Thus Gρ=s(G)Γρ. But then Gρ is contained in the upper (respectively, lower) triangular matrices, and hence ρ is reducible. Therefore t is the sum of two continuous characters Π→A×, which contradicts our assumption that it is not a priori small. Thus B1(ρ),C1(ρ)=0 if ρ is reducible.
Finally suppose that ρ is dihedral. By Lemma A.7 there is a unique subgroup Π0 of index 2 in Π such that ρ≅IndΠ0Πχ for some character χ:Π0→F×. Applying the reducible case to ρ∣Π0, we see that either ρ∣Π0 is reducible or B1(ρ∣Π0),C1(ρ∣Π0)=0. The first possibility is not allowed by hypothesis, so we must have B1(ρ∣Π0),C1(ρ∣Π0)=0. But B1(ρ∣Π0)⊆B1(ρ) and C1(ρ∣Π0)⊆C1(ρ), which proves the desired result when ρ is dihedral.
∎
7. Lifting residual conjugate self-twists
In Section 7 we study the (At-valued) conjugate self-twists of constant-determinant pseudorepresentations. In particular, we show in Section 7.2 that they are all controlled by those of ρ, and all residual conjugate self-twists lift to the universal constant-determinant pseudodeformation ring. Having shown in Section 7.1 that the group of residual conjugate self-twists is finite and abelian, we deduce the same for any constant determinant pseudorepresentation. Finally, in Section 7.3 we study special phenomena that arise when ρ is dihedral and hence there may be conjugate self-twists that are residually dihedral.
Throughout Section 7 we only consider simple conjugate self-twists. Thus we write Σt=Σt(At) and Σρ for Σρ(F); similarly for Σt and Σρ.
7.1. Residual conjugate self-twists
Fix a semisimple representation ρ:Π→GL2(F), and recall that P:GL2(F)→PGL2(F) denotes the natural projection. For Section 7.1 only, we do not require ρ to be residually multiplicity-free.
We begin by studying Σρdi and use that to show that Σρ is finite and abelian.
Lemma 7.1**.**
**
(1)
If ImPρ is not dihedral or cyclic of order 2, then Σρdi is trivial.
2. (2)
If ImPρ is either a nonabelian dihedral group or has order 2, then Σρdi has order 2.
3. (3)
If ImPρ is isomorphic to (Z/2Z)2, then Σρdi is isomorphic to (Z/2Z)2.
Proof.
We claim that if ρ is irreducible, then the following sets are in bijection:
(a)
Σρdi∖{(1,1)};
2. (b)
subgroups Π0 of Π such that [Π:Π0]=2 and ρ(Π0) is abelian;
3. (c)
subgroups H of Pρ(Π) such that [Pρ(Π):H]=2 and H is abelian.
Indeed, the maps between them can be described as follows. Given (1,η)∈Σρdi∖{(1,1)}, let Π0:=kerη. The fact that [Π:Π0]=2 follows from Lemma A.7, and Lemma A.6 shows that ρ(Π0) is abelian. Conversely, given Π0 as in (b), let ηΠ0:Π→Π/Π0≅{±1} be the natural projection. Note that ρ∣Π0 is reducible since ρ(Π0) is abelian. Let χ:Π0→F× be a constituent of ρ∣Π0. Then ρ≅IndΠ0Πχ by Frobenius reciprocity since ρ is irreducible. Thus (1,ηΠ0)∈Σρdi∖{(1,1)} by Lemma A.7.
Given Π0 as in (b), let H:=Pρ(Π0). Given H as in (c), let Π0:=Pρ−1(H). It is clear that [Π:Π0]=2. That ρ(Π0) is abelian follows from the fact that H is abelian and scalar matrices commute with everything.
When ρ is irreducible, the lemma now follows from counting subgroups as in (c) in each of the possible projective images of ρ. (The fact that elements in Σρdi have order at most 2 follows from the fact that detρ=η2detρ and so η2=1.)
Finally, suppose that ρ=ε⊕δ. If (1,η)∈Σρdi and η is nontrivial, then we must have ηε=δ and ηδ=ε. Thus
[TABLE]
which implies that εδ−1 has order 2. But the projective image of ρ is isomorphic to the image of εδ−1. Thus Σρdi is trivial unless the projective image of ρ has order 2, in which case there is one nontrivial element.
∎
Corollary 7.2**.**
The group Σρ is finite and abelian.
Proof.
Since F is a finite field, there are only finitely many automorphisms of F. Any two elements in Σρ with the same automorphism differ by an element of Σρdi, which is finite by Lemma 7.1.
To see that Σρ is abelian, fix a generator σ of the cyclic group Σρ=Gal(F/E). Let η be a character such that (σ,η)∈Σρ. Then Σρ is generated by {(σ,ηη′):(1,η′)∈Σρdi}. Since η′ is at most quadratic by Lemma 7.1, the action of σ on η′ is trivial and hence one easily checks that any two of the generators commute.
∎
7.2. Lifting conjugate self-twists
Let Π be a profinite group satisfying the p-finiteness condition. Fix a multiplicity-free representation ρ:Π→GL2(F). Recall from Section 2.1 that there is a local pro-p noetherian W(F)-algebra A with maximal ideal mA and residue field F and a pseudodeformation (T,d):Π→A that is universal among all constant-determinant pseudodeformations of ρ. The purpose of Section 7.2 is to show that every conjugate self-twist of ρ, and in fact of every constant-determinant pseudodeformation of ρ, can be lifted to a conjugate self-twist of (T,d) (see Proposition 7.3 and Corollary 7.4 below).
Since we are working only with constant-determinant pseudodeformations, we shall identify any F-valued character η with the W(F)-valued character s(η). Furthermore, we will consider η as being valued in any W(F)-algebra via the structure map. If σ is an automorphism of F, we write W(σ) the automorphism of W(F) induced by σ.
We introduce some notation that will be used in the proof of Proposition 7.3. For any W(F)-algebra A, let Aσ:=A⊗W(F),W(σ)W(F), where W(F) is considered as a W(F)-algebra via W(σ). We can equip Aσ with two different W(F)-algebra structures by letting W(F) act either on the first or second factor of the tensor product. In what follows, we refer to these actions respectively as the first or second W(F)-algebra structure on Aσ. Let ι(σ,A):A→Aσ be the natural map given by ι(σ,A)(a)=a⊗1. It is an isomorphism of rings with inverse given by ι(σ−1,Aσ) since (Aσ)σ−1 can be naturally identified with A as a W(F)-algebra. Furthermore, ι(σ,A) is a morphism of W(F)-algebras with respect to the first structure on Aσ. Note that if we view Aσ with respect to its second W(F)-algebra structure, its residue field is F⊗F,σF, which is identified with F via x⊗y↦σ(x)y. The proof of the following proposition is a more streamlined treatment of the arguments in [Lan16, Section 2].
Proposition 7.3**.**
Let (σ,η)∈Σρ. Then there is an automorphism σ~ of A such that (σ~,η)∈ΣT and σ~ induces σ modulo mA. Furthermore, for any w in the image of W(F) in A, we have σ~(w)=W(σ)(w).
Note that any such σ~ is necessarily unique, because it is determined by the character η.
Proof.
Note that ηT:Π→A is the universal constant-determinant pseudodeformation of η⊗ρ≅σρ. We claim that, considering Aσ−1 with its second W(F)-algebra structure, ι(σ−1,A)∘(ηT) is a constant-determinant pseudodeformation of ρ. Indeed, reducing ι(σ−1,A)∘(ηT) modulo the maximal of ideal of Aσ−1 gives
[TABLE]
which is identified with trρ under the identification of F⊗F,σ−1F with F discussed prior to the proposition.
By universality, there is a unique W(F)-algebra homomorphism α:A→Aσ−1, where Aσ−1 is given its second W(F)-algebra structure, such that
[TABLE]
Since ι(σ,Aσ−1) is the inverse of ι(σ−1,A), we have that
[TABLE]
Define σ~:=ι(σ,Aσ−1)∘α, which is a ring endomorphism of A. The relation (4) implies that σ~ is an automorphism of A since the image of T topologically generates A as a W(F)-module [Bel19, Proposition 5.3.3] and η takes values in W(F). The relation (4) also shows that (σ~,η)∈ΣT.
Finally, let w∈W(F). Since α is a W(F)-algebra homomorphism with respect to the second W(F)-algebra structure on Aσ−1, we have that
[TABLE]
For the rest of Section 7.2, fix a local pro-pW(F)-algebra A with residue field F and a constant-determinant pseudodeformation (t,d):Π→A of ρ. Assume that A is the W(F)-algebra generated by t(Π). Let αt:A→A be the unique W(F)-algebra homomorphism such that α∘T=t given by universality. The following corollary shows that conjugate self-twists of (t,d) also lift to conjugate self twists of (T,d).
Corollary 7.4**.**
Given (σ,η)∈Σt, there is a unique (σ~,η)∈ΣT such that αt∘σ~=σ∘αt.
Proof.
Let σ denote the automorphism of F induced by σ. Let σ~ be the automorphism of A given by Proposition 7.3 lifting σ to A. Then we just have to show that αt∘σ~=σ∘αt. Note that σ acts by W(σ) on the image of W(F) in A, so σ−1∘α∘σ~ is a W(F)-algebra homomorphism. Thus by universality, it suffices to show that t=σ−1∘αt∘σ~∘T. Since η takes values in W(F) and αt is a W(F)-algebra homomorphism, we have that
[TABLE]
We end Section 7.2 with some observations about the consequences of Proposition 7.3 and Corollary 7.4. They give the following commutative diagram with exact rows.
[TABLE]
We write
[TABLE]
for the composition of the vertical maps on the right in the above diagram. It is induced by the composition β~t:Σt→Σρ of the middle maps, which reflects the fact that every conjugate self-twist of (t,d) induces a conjugate self-twist of ρ. Combining Corollary 7.4 with Corollary 7.2, we see that Σt is a finite abelian group for any constant-determinant pseudodeformation (t,d) of ρ.
In this paper, we will only be concerned with pseudodeformations (t,d) of ρ that are not a priori small. Under this assumption, if t=trρ then Σtdi=1 and ΣTdi=1 by Lemma 7.1(1). In particular, Σt=Σt and ΣT=ΣT, so (except for ρ) a conjugate self-twist (σ,η) is determined uniquely by the automorphism σ.
7.3. The dihedral case
As usual, let Π be a p-finite profinite group, A a local pro-p ring and (t,d):Π→A a constant determinant pseudorepresentation with trace algebra A. Throughout this section we assume that its associated residual representation ρ is dihedral with nonabelian projective image. We also fix any well-adapted (t,d)-representation ρ, which can be taken to be valued in GL2(A) since ρ is absolutely irreducible Proposition 2.11(5). This case requires special care for two related reasons. First, it is the only case when A is not generated simply by I1(ρ) as a W(F)-algebra; one also needs to include B1(ρ) in the generating set by Theorem 2.23. (As explained in 2.12, it makes sense to view B1(ρ) as a subset of A in this case.) Second, this is the only case when Σρdi is nontrivial and hence kerβt can be nontrivial. As we will see, a nontrivial element in kerβt necessarily behaves quite differently from all other conjugate self-twists, because its action cannot be seen on the residue field. In Proposition 7.5 we explain how a nontrivial element in kerβt interacts with I1(ρ) and B1(ρ). In contrast, when kerβt=1 we show that Bρ(F) has the same fraction field as A=Bρ(F)+W(F)B1(ρ).
We use τ to refer to the nontrivial element of kerβt, should it exist. For ε∈{+,−}, let
[TABLE]
Proposition 7.5**.**
Assume that ρ is projectively dihedral and nonabelian. Suppose there exists 1=τ∈kerβt. If ρ:Π→GL2(A) is a well-adapted (t,d)-representation, then A+=Bρ(F) and A−=W(F)B1(ρ).
Proof.
Since A=A+⊕A− and A=Bρ(F)+W(F)B1(ρ) by Bellaïche’s Theorem 2.23, it suffices to show that τ acts trivially on I1(ρ) and by −1 on B1(ρ). Let η:Π→{±1} be the unique quadratic character such that ρ≅ρ⊗η. (It is unique by Lemma 7.1 since the projective image of ρ is not isomorphic to (Z/2Z)2). Since τ∈kerβt, it follows that η must be the character such that (τ,η)∈Σt.
We first prove that I1(ρ) is fixed by τ. As usual, let Γ:=Imρ∩ΓA(m). Recall that by definition I1(ρ) is the Zp-module topologically generated by {α−δ:(1+αcb1+δ)∈Γ}. Let g∈kerη. Since ρ is well adapted, we can write
[TABLE]
for some γ∈Γ and λ,μ∈F×. Write γ=(1+αcb1+δ) with α−δ=2a, (acb−a)∈L1(ρ) and 0=α+δ+αδ−bc. Then we have
[TABLE]
Since g∈kerη, it follows that
[TABLE]
since τ acts trivially on W(F). Thus, we obtain
[TABLE]
for all λ,μ such that (λ00μ)∈Imρ. As the projective image of ρ is not isomorphic to Z/2Z or (Z/2Z)2, it follows that when g varies λμ−1 takes at least two distinct values in F×. Thus it follows that α−τα=0=τδ−δ. Since I1(ρ) is generated by α−δ with α,δ as above, it follows that I1(ρ) is fixed by τ.
The proof that τ acts by −1 on B1(ρ) is similar. Namely, recall that B1(ρ) is topologically generated by {b,c∈A:(1+αcb1+δ)∈Γ}. Let g∈Π∖kerη. Again since ρ is well adapted, we can write
[TABLE]
for some γ∈Γ,λ,μ∈F×. As above, write γ=(1+αcb1+δ). Then we have
[TABLE]
Since g∈kerη, it follows that
[TABLE]
Thus
[TABLE]
for all (0μλ0)∈Imρ. Once again, since the projective image of ρ is not isomorphic to Z/2Z or (Z/2Z)2, it follows that λμ−1 takes at least two distinct values in F×. Therefore b+τb=0=c+τc. Since B1(ρ) is generated by such b and c, it follows that τ acts on B1(ρ) by −1.
∎
In particular, we can always apply Proposition 7.5 to the universal pseudorepresentation (T,d):Π→A of ρ since ΣT≅Σρ and thus kerβT is always nontrivial in the dihedral case whenever a nondihedral deformation exists. Fix a well-adapted (T,d)-representation ρuniv:Π→GL2(A). By universality, we have a W(F)-algebra homomorphism αt:A→A such that t=αt∘T. Let ρ=ρt be the well-adapted (t,d)-representation obtained by composing ρuniv with the map GL2(A)→GL2(A) induced by αt. Since the Pink-Lie algebra is functorial with respect to surjective ring homomorphisms, we see that I1(ρuniv) (respectively, B1(ρuniv)) surjects onto I1(ρ) (respectively, B1(ρ)).
For any subfield F′⊆F, let A′=Bρuniv(F′)+W(F′)B1(ρuniv). We have A′=(A′)+⊕(A′)− with (A′)+=Bρuniv(F′) and (A′)−=W(F′)B1(ρuniv) by Proposition 7.5.
Proposition 7.6**.**
Suppose that A is a local pro-p domain and (t,d):Π→A is a constant-determinant pseudodeformation of a dihedral ρ such that kerβt=1. Let ρ be a well-adapted (t,d)-representation obtained from a universal one as described above. Then for any subfield F′⊆F:
(1)
A′* and Bρ(F′) are noetherian rings;*
2. (2)
W(F′)B1(ρ)* and hence Bρ(F′)+W(F′)B1(ρ) are noetherian Bρ(F′)-modules;*
3. (3)
Bρ(F′)+W(F′)B1(ρ)* and Bρ(F′) have the same field of fractions.*
Proof.
For (1) note that Bρ(F′) is the image of (A′)+ under αt, which is noetherian if A′ is by Lemma A.16. To see that A′ is noetherian when F′=F, we have A′=A=Bρuniv(F)+W(F)B1(ρuniv) by Theorem 2.23, and hence A is noetherian by the p-finiteness of Π. Note that A is finite and integral over Bρuniv(F′)+W(F′)B1(ρuniv) since this is true of W(F) over W(F′). Then
A′=Bρuniv(F′)+W(F′)B1(ρuniv) is noetherian by [Eak86, Theorem 2].
The statements in (2) follow from the corresponding statements for ρuniv, which in turn follow from Proposition A.18 since A′ is noetherian.
As Bρ(F′) is the image of (A′)+ under αt while W(F′)B1(ρ) is he image of (A′)− and A′ is noetherian, (3) follows from Proposition A.19.
∎
8. Regularity and residual conjugate self-twists
In this section we prepare the groundwork for Section 9 by studying conjugate self-twists of ρ, particularly how they interact with regularity (Definition 2.19). In particular, ρ:Π→GL2(F) is a semisimple regular representation throughout this section, and after Section 8.1 it is always absolutely irreducible. We only consider simple conjugate self-twists in this section and thus write Σρ for Σρ(F) and similarly for Σρ. In Section 8.1 we see that when ρ is regular we may assume that it has no conjugate self-twists. We consider the restriction of ρ to the kernels of twist characters in Section 8.2, followed by some technical lemmas about the extension F/E in Section 8.3. In Section 8.4 we define the condition of goodness when ρ is octahedral, which weighs on our main theorem. Section 8.4 culminates in Proposition 8.13 where we choose the basis of ρ that will be used throughout Section 9.
Recall that ρ is regular if Imρ contains an element with eigenvalues λ0,μ0∈F× such that λ0μ0−1∈E×∖{±1}.
8.1. Reducible regular representations
We show that if ρ is reducible and regular, then one can eliminate the conjugate self-twists of ρ by twisting ρ by a character, making the proof of our main theorem especially easy in that case (cf. Section 9.1).
Lemma 8.1**.**
Suppose that ρ=ε⊕δ and ρ is regular. If (σ,η)∈Σρ, then σε=ηε and σδ=ηδ. In particular, εδ−1 takes values in E.
Proof.
It suffices to show that if ρ is regular then we cannot have σε=ηδ and σδ=ηε. If this were true, then we would have σεδ−1=η=σδε−1, which implies that
[TABLE]
Since ρ is regular, there is some g∈Π such that ε(g)δ(g)−1∈E∖{±1}. As E is fixed by σ by Proposition 4.9, it follows from (5) that ε(g)δ(g)−1=±1, a contradiction.
The last sentence in the statement of the lemma follows from the fact that, for any σ∈Σρ=Gal(F/E), we have σεε−1=η=σδδ−1 and hence εδ−1 is fixed by Gal(F/E).
∎
Corollary 8.2**.**
Suppose ρ=ε⊕δ and ρ is regular. Then ρ′:=ρ⊗δ−1 has no conjugate self-twists.
Proof.
Since ρ is regular, its projective image cannot have order 2. Therefore it suffices to show that Σρ′ is trivial by Lemma 7.1. Let F′ be the extension of Fp generated by the trace of ρ′. Then Σρ′=Gal(F′/E), so it suffices to show that F′⊆E. But F′ is generated by the values of εδ−1, which takes values in E by Lemma 8.1.
∎
8.2. Kernels of twist characters and regularity
In this section we introduce the subgroup of Π given by intersecting the kernels of all twist characters. It is often useful to restrict to this subgroup because doing so kills the conjugate self-twists and but retains fullness. In this section we study how this restriction affect the residual representation and regularity, first in the exceptional/large image case, then when ρ is dihedral. Define
[TABLE]
Remark 8.3*.*
The quotient Π/Π0(ρ) is abelian. Indeed, Π0(ρ) is the kernel of the natural map diagonal map of Π to ∏(σ,η)∈ΣρΠ/kerη,
so Π/Π0(ρ) can be embedded into an abelian group.
∎
As we will now see, it is easiest to control ρ∣Π0(ρ) when the order of detρ is a power of 2, which is an assumption we are forced to make in Section 9. We may always twist ρ by a character to assume that the order of detρ is a power of 2:
Lemma 8.4**.**
Let d:Π→F× be a character. Then there is a character χ:Π→F× such that the order of dχ2 is a power of 2.
Proof.
The odd-order part of d has a square root ψ. Take χ=ψ−1.
∎
Lemma 8.5**.**
Assume that ρ is exceptional or large. If the order of detρ is a power of 2, then ρ∣Π0(ρ) is absolutely irreducible.
Proof.
If (σ,η)∈Σρ, then η2 is equal to a power of detρ and hence the order of η is a power of 2. Thus [Π:Π0(ρ)] is a power of 2, and Π/Π0(ρ) is abelian by 8.3.
By hypothesis, the projective image of ρ is isomorphic to one of A4,S4,A5, PSL2(E),PGL2(E). None of A4,A5,PSL2(E) has a subgroup of 2-power index with abelian quotient. Both S4 and PGL2(E) have a unique proper 2-power index subgroup with abelian quotient, namely A4 and PSL2(E), respectively. Therefore the possible projective images of ρ∣Π0(ρ) are the same as for ρ, so ρ∣Π0(ρ) is absolutely irreducible.
∎
Proposition 8.6**.**
Assume that ρ is regular dihedral, say ρ=IndΠ0Πχ. Then ρ∣Π0(ρ) is multiplicity free over E. Furthermore, given g∈Π0, we have g∈Π0(ρ) if and only if χ(g)∈E×.
Proof.
Since ρ is regular, it follows from Lemma A.7 that there is a unique subgroup Π0 of Π of index 2 such that ρ≅IndΠ0Πχ for some character χ:Π0→F×. For any h∈Π, define χh:Π0→F× by χh(g):=χ(h−1gh). The character χh only depends on the class of h in Π/Π0. Fix an element c∈Π∖Π0. Fix a generator σ∈Σρ=Gal(F/E), and choose η such that (σ,η)∈Σρ. (Note that there are two choices for η, and they differ by the character η0:Π↠Π/Π0≅{±1}.) Then Π0(ρ)=kerη0∩kerη since σ generates Σρ. Therefore Π0(ρ)=kerη∣Π0.
Note that any regular element for ρ must be in Π0 since elements in Π∖Π0 have projective order 2. By applying Lemma 8.1 to ρ∣Π0, we find that σχ=ηχ and σχc=ηχc. Hence η∣Π0=σχχ−1, so g∈Π0(ρ) if and only if χ(g)∈E×. In particular, kerχ⊆Π0(ρ). Furthermore, using the fact that σχχ−1=η∣Π0=σχc(χc)−1, we find that the character χ/χc takes values in E×.
We know that ρ∣Π0(ρ) is multiplicity free over E if and only if there is some g∈Π0(ρ) such that χ(g)=χc(g). If kerχ=kerχc, then we can choose g∈kerχ∖kerχc. Then χ(g)=1=χc(g) and g∈Π0(ρ) by the previous paragraph. Therefore we may assume that kerχ=kerχc.
Let n denote the order of χ. Since kerχ=kerχc, we have that χc=χa for some a∈(Z/nZ)×. Note that χc2=χ since c2∈Π0. Therefore
[TABLE]
Fix g0∈Π0 such that ρ(g0) generates the projective image of ρ(Π0). We will show that h:=g0a−1 is in Π0(ρ) and χ(h)∈E× with χ(h)=χc(h). First we calculate, using the fact that χa2=χ,
[TABLE]
Hence h∈kerχc=kerχ⊆Π0(ρ). On the other hand,
[TABLE]
We saw in the second paragraph that χ/χc is an E-valued character. Furthermore, χc(g0)/χ(g0)=1 since g0 was chosen has a generator of the projective image of ρ(Π0), which is isomorphic to the image of χ/χc.
∎
Corollary 8.7**.**
Assume that ρ is regular and dihedral and that the order of detρ is a power of 2. Let σ be a generator of Σρ and η:Π→F× a character such that (σ,η)∈Σρ. Then either Σρ is trivial or ρ∣kerη is absolutely irreducible.
Proof.
Write ρ=IndΠ0Πχ and fix c∈Π∖Π0. We shall make frequent use of Lemma A.7 in this proof without referencing it every time. We saw in the proof of Proposition 8.6 that Π0(ρ)=Π0∩kerη. Thus Proposition 8.6 implies that χ∣Π0∩kerη=χc∣Π0∩kerη.
If kerη=Π0∩kerη, then [kerη:Π0∩kerη]=2 since [Π:Π0]=2. Thus ρ∣kerη≅IndΠ0∩kerηkerηχ∣Π0∩kerη. Since χ∣Π0∩kerη=χc∣Π0∩kerη it follows that ρ∣kerη is irreducible.
If kerη=Π0∩kerη then Π0⊇kerη and Π/kerη is a cyclic group whose order is a power of 2 since η2 is a power of detρ. If Π0=kerη, then there is a subgroup kerη⊆Π′⊂Π0 such that [Π0:Π′]=2. Note that χ∣Π′=χc∣Π′ since χ∣kerη=χc∣kerη. Then ρ∣Π0≅IndΠ′Π0χ∣Π′ is irreducible, a contradiction since ρ≅IndΠ0Πχ. Thus we must have Π0=kerη. Therefore ρ≅ρ⊗η and so σ, and hence Σρ, is trivial.
∎
8.3. F/E when detρ is a power of 2
In Section 9 we will assume that the order of detρ is a power of 2. A large part of the reason for that assumption is that it guarantees that [F:E] can be taken to be a power of 2 as well, as the next lemma shows. We need this in an induction argument in Section 9. Given any F-valued function f and any subfield F′ of F, let us write F′(f) for the subfield of F generated over F′ by the values of f.
Lemma 8.8**.**
Assume that the order of detρ is a power of 2. Then the degree of Fp(trρ) over E is a power of 2.
Proof.
Let d:=detρ. Since the order of d is a power of 2, the degree of E(d) over E is a power of 2. But, for an arbitrary g′∈Π, the extension E(trρ(g′)) is at most quadratic over E(d) because trρ(g′) satisfies
[TABLE]
The field Fp(trρ) is obtained from E(d) by adding finitely many values of trρ.
∎
In Section 9 we will be interested in gradings coming from conjugate self-twists. To be able to apply Lemma A.22 in those situations, we now verify one of the hypotheses.
Lemma 8.9**.**
Assume that both the order of detρ and [F:E] are powers of 2. If n=#Σρ, then F contains a primitive nth root of unity. In particular, condition (∗) from Section A.4 is satisfied.
Proof.
Let d:=detρ, and write 2s for the order of d. We have that E(d) contains a primitive (2s)th root of unity. If [F:E(d)]=2r, then F contains a primitive (2r+s)th root of unity. On the other hand,
[TABLE]
Since d has order 2s, it follows that [E(d):E] divides 2s−1. Thus n divides 2r+s−1, and so F contains a primitive nth root of unity.
∎
8.4. A good basis for ρ
We need to carefully choose a basis for ρ that has many good properties and will allow us to choose a good (t,d)-representation in Section 9.2. In this section we explain how to find this basis when ρ is exceptional or large. Let us first define an extra condition on octahedral representations.
Definition 8.10**.**
We say a regular octahedral representation ρ is good if at least one of the following properties is satisfied:
(1)
p≡1mod3;
2. (2)
ρ is strongly regular;
3. (3)
there is a regular element g0∈Π such that g02∈Π0(ρ).
We shall need to know that if ρ is good, then twisting away the odd part of the determinant of ρ gives a representation that is also good.
Lemma 8.11**.**
Let ρ:Π→GL2(F) be a good representation. Let χ:Π→F× be the unique odd-order character such that the order of χ2detρ is a power of 2. Then ρ⊗χ is good.
Proof.
First note that twisting by any character does not change the projective image, so ρ⊗χ is octahedral. Regularity is also invariant under twisting. The claim is clear if p≡1mod3, so we assume that p≡2mod3. The regularity assumption then implies that ζ4∈Fp by 2.20. As in the proof of Lemma 8.4, decompose detρ=d1d2, where di:Π→F× are characters such that the order of d1 is odd and the order of d2 is a power of 2.
First suppose that ρ is strongly regular. Then there is a matrix g0∈Π such that ρ(g0) has eigenvalues λ0,μ0∈E× such that λ0μ0−1=ζ4. We have λ0μ0=detρ(g0)=d1(g0)d2(g0). Note that any σ∈Gal(F/E) fixes λ0μ0 since λ0,μ0∈E. Therefore σ(d1(g0)d2(g0))=d1(g0)d2(g0). But since d1(g0) is an odd order root of unity and d2(g0) is a 2-power order root of unity, it follows that σ must fix both d1(g0) and d2(g0). Write a for the order of d1. Then χ=d1−(a+1)/2 by the proof of Lemma 8.4. In particular, χ(g0)∈E×. Thus the eigenvalues χ(g0)λ0 and χ(g0)μ0 of (ρ⊗χ)(g0) are in E. Thus g0 is a strongly regular element for ρ⊗χ, as desired.
Finally, suppose that there is a regular element g0∈Π such that g02∈Π0(ρ). Let σ be a generator for Gal(F/E) and let η:Π→F× such that (σ,η)∈Σρ. Then Π0(ρ)=kerη and Π0(ρ⊗χ)=kerσχχ−1η. Since g02∈Π0(ρ) and σdetρ=η2detρ, it follows that detρ(g0)∈E. But detρ(g0)=d1(g0)d2(g0), and since d1(g0) is an odd order root of unity and d2(g0) has 2-power order, it follows that both d1(g0) and d2(g0) are in E. Therefore χ(g0)=d1−(a+1)/2(g0)∈E. Thus g02∈kerσχχ−1η=Π0(ρ⊗χ).
∎
Finally we describe the basis of ρ that we shall work with in Section 9. Let Z denote the group of scalar matrices in GL2(F). The following lemma justifies our definition of Fq in Section 2.5 for exceptional representations.
Lemma 8.12**.**
Up to conjugation, the image of ρ is contained in ZGL2(E). If Fq is an extension of E and λ0,μ0∈Fp× are eigenvalues of a matrix in the image of ρ such that λ0μ0−1∈Fq, then we may further conjugate ρ to assume that \bigl{(}\begin{smallmatrix}\lambda_{0}&0\\
0&\mu_{0}\end{smallmatrix}\bigr{)}\in\operatorname{Im}\mkern 1.5mu\overline{\mkern-1.5mu\rho\mkern-1.0mu}\mkern 1.0mu and Imρ⊆ZGL2(Fq).
Proof.
By Corollary 4.13, E=FΣρ. First we show that ρ can be conjugated to land in ZGL2(E). Let σ∈Gal(F/E) be a generator and η a character such that (σ,η)∈Σρ. Then there is some x∈GL2(F) such that for all g∈Π, we have σρ(g)=x−1η(g)ρ(g)x. By a theorem of Serge Lang [Lan56, Corollary to Theorem 1], it follows that there is some y∈GL2(F) such that x=σyy−1. Thus σ(y−1ρ(g)y)=η(g)(y−1ρ(g)y). Replacing ρ by its conjugate by y, we have that the projective image of ρ is fixed by Gal(F/E), and hence the image of ρ lands in ZGL2(E), as desired.
If Fq,λ0,μ0 are as in the statement of the lemma, then ρ can be further conjugated such that \bigl{(}\begin{smallmatrix}\lambda_{0}&0\\
0&\mu_{0}\end{smallmatrix}\bigr{)}\in\operatorname{Im}\mkern 1.5mu\overline{\mkern-1.5mu\rho\mkern-1.0mu}\mkern 1.0mu while preserving the property that the image of ρ is in ZGL2(Fq).
∎
Proposition 8.13**.**
Let ρ:Π→GL2(F) be regular and either exceptional or large. If ρ is octahedral, assume further that ρ is good. Assume that the order of detρ is a power of 2. Then there is a regular element g0∈Π and a basis for ρ such that the following are simultaneously true:
(1)
Imρ⊆ZGL2(E);
2. (2)
\mkern 1.5mu\overline{\mkern-1.5mu\rho\mkern-1.0mu}\mkern 1.0mu(g_{0})=\bigl{(}\begin{smallmatrix}\lambda_{0}&0\\
0&\mu_{0}\end{smallmatrix}\bigr{)}* for some λ0,μ0∈F;*
3. (3)
if p≥7 and ρ is large, then λ0,μ0∈Fp×;
4. (4)
there is a positive integer n such that g0n∈Π0(ρ) and ρ(g0n) is not scalar.
Proof.
By Lemma 8.12 we can always conjugate ρ so that Imρ⊆ZGL2(E). If g0∈Π is a regular element and λ0 and μ0 are the eigenvalues of ρ(g0), then we may assume further that \mkern 1.5mu\overline{\mkern-1.5mu\rho\mkern-1.0mu}\mkern 1.0mu(g_{0})=\bigl{(}\begin{smallmatrix}\lambda_{0}&0\\
0&\mu_{0}\end{smallmatrix}\bigr{)}.
If ρ is large, then up to conjugation, Imρ⊇SL2(Fp). Indeed, up to conjugation we may assume that the projective image of ρ contains PSL2(E). Therefore there is some λ∈F× such that \lambda\bigl{(}\begin{smallmatrix}1&1\\
0&1\end{smallmatrix}\bigr{)}\in\operatorname{Im}\mkern 1.5mu\overline{\mkern-1.5mu\rho\mkern-1.0mu}\mkern 1.0mu. Note that the nth power of this matrix is \lambda^{n}\bigl{(}\begin{smallmatrix}1&n\\
0&1\end{smallmatrix}\bigr{)}. Since λ∈F×, its order m is prime to p. Therefore we can write 1=am+bp≡ammodp for some a,b∈Z. Thus
[TABLE]
Similarly, \bigl{(}\begin{smallmatrix}1&0\\
1&1\end{smallmatrix}\bigr{)}\in\operatorname{Im}\mkern 1.5mu\overline{\mkern-1.5mu\rho\mkern-1.0mu}\mkern 1.0mu. Since \bigl{(}\begin{smallmatrix}1&1\\
0&1\end{smallmatrix}\bigr{)} and \bigl{(}\begin{smallmatrix}1&0\\
1&1\end{smallmatrix}\bigr{)} generate SL2(Fp), it follows that SL2(Fp)⊆Imρ.
If p≥7, then we can choose α∈Fp× such that α2=±1. Then any g0∈Π such that ρ(g0) has eigenvalues α,α−1 satisfies the first three conditions. Note that Pρ(g0)∈PSL2(E). Recall that Π0(ρ) is a normal subgroup of 2-power index in Π since the order of detρ is a power of 2. Furthermore, Π/Π0(ρ) is abelian. Therefore Pρ(Π0(ρ)) is either PGL2(E) or PSL2(E). In either case, we can find g0∈Π0(ρ) such that ρ(g0) has eigenvalues α,α−1. Thus all of the properties of the proposition are satisfied for this choice of g0.
Next suppose that ρ is either tetrahedral or icosahedral. Once again, Pρ(Π0(ρ)) is a normal subgroup of Pρ(Π) with 2-power index and abelian quotient. Since Pρ(Π) is isomorphic to one of A4 or A5, it follows that Pρ(Π0(ρ))=Pρ(Π). In particular, one can choose the regular element g0 to be in Π0(ρ), and the resulting representation satisfies all of the desired conditions.
Finally, suppose that ρ is octahedral and good. If p≡1mod3 then any g0∈Π such that Pρ(g0) has order 3 is a regular element. Since Pρ(Π0(ρ)) is a normal subgroup of Pρ(Π) with 2-power index and abelian quotient, it follows that Π0(ρ) contains an element g0 such that Pρ(g0) has order 3. Such a g0 satisfies all of the necessary conditions.
Next suppose that p≡2mod3 and that ρ is strongly regular. Let g0∈Π be a strongly regular element. Then \mkern 1.5mu\overline{\mkern-1.5mu\rho\mkern-1.0mu}\mkern 1.0mu(g_{0})=\bigl{(}\begin{smallmatrix}\lambda\zeta_{4}&0\\
0&\lambda\end{smallmatrix}\bigr{)} for some λ∈E×. (Note that ζ4∈Fp since ρ is regular and p≡2mod3.) We claim that g0∈Π0(ρ). Indeed, let σ be a generator of Gal(F/E) and η a character such that (σ,η)∈Σρ. Then Π0(ρ)=kerη. Since λ,ζ4∈E× we have
[TABLE]
As ζ4+1=0 it follows that η(g0)=1, and so g0∈Π0(ρ), as claimed. Therefore g0 satisfies all of the necessary conditions.
Finally suppose that p≡2mod3 and there is a regular element g0∈Π such that g02∈Π0(ρ). Note that Pρ(g0) has order 4 since p≡1mod3. Therefore Pρ(g02) is nontrivial, so ρ(g02) is not scalar. Therefore g0 satisfies all of the conditions of the proposition.
∎
Note that the g0 chosen in Proposition 8.13 satisfies all of the conditions prior to Definition 2.21. In particular, if (t,d):Π→A is any admissible pseudodeformation of ρ, then any (t,d)-representation that is adapted to the element g0 from Proposition 8.13 is well adapted.
9. Fullness peers: Bρ(E) and AΣρ
Throughout Section 9 we fix a local pro-p domain A and an admissible pseudodeformation (Π,ρ,t,d) over A. We only consider A-valued conjugate self-twists throughout this section and thus write Σt for Σt(A) and similarly for Σt. The goal of Section 9 is to prove that (t,d) is A0-full whenever (t,d) is not a priori small and regularity is satisfied. In view of Corollary 4.21 and Corollary 6.6, it suffices to show that AΣt and Bρ(E) are fullness peers for some well chosen (t,d)-representation ρ. Let us point out an easy case when this is possible. If ρ has no conjugate self-twists, then E=F by Corollary 4.13 and Σt=1 by the diagram following Corollary 7.4. Furthermore, the assumption that Σρ=1 implies that ρ is not dihedral and so A=Bρ(F) by Theorem 2.23. Therefore we have
[TABLE]
In general, the proof that (t,d) is AΣt-full, hence A0-full, is structured as follows. The case when ρ is reducible is easily done in Proposition 9.1, so from Section 9.2 onwards we always assume that ρ is irreducible. In light of Corollary 6.6, the strategy is to prove that, under certain conditions on ρ and a good choice of a (t,d)-representation ρ, the two rings Bρ(E) and AΣt have the same fields of fractions and AΣt is finitely generated as a Bρ(E)-module. This is done in Corollary 9.15, although key parts of it are proved in Corollary 9.9 and Proposition 9.14. Lemma 3.5 then implies that AΣt and Bρ(E) are fullness peers. In Corollary 9.16, we combine Corollary 6.6, which established Bρ(E)-fullness, with Corollary 9.15 to show that (t,d) is AΣt-full under mild assumptions on ρ. Since AΣt and A0 are fullness peers in the constant-determinant setting, we conclude that our admissible pseudodeformation (t,d) is A0-full.
Let us now establish some assumptions on our fixed residual representation ρ:Π→GL2(F). Assume that ρ is regular and, after Section 9.1, absolutely irreducible. Whenever ρ is absolutely irreducible, assume further that detρ is a power of 2, which can always be achieved by twisting ρ by a character by Lemma 8.4. Furthermore, the twisting operation does not change the field E by Proposition 4.8.
Assume that [F:E] is a power of 2, which is possible by Lemma 8.8. Note that we do not require F to be the trace ring of ρ since one may need to make a quadratic extension of the trace ring in order to make representations well-adapted in the dihedral case.
9.1. The reducible case
When ρ is reducible, we can use Corollary 6.6 to show that (t,d) is AΣt- and A0-full.
Proposition 9.1**.**
Suppose that ρ=ε⊕δ and that ρ is regular. If (t,d) is not a priori small then (t,d) is AΣt- and A0-full.
Proof.
Let (t′,d′)=(s(δ−1)t,s(δ−1)2d), which is a pseudodeformation of r:=ρ⊗δ−1. Let At′ be the trace ring of (t′,d′); its residue field E since r has no conjugate self-twists by Corollary 8.2. Then (Π,r,t′,d′) is an admissible pseudorepresentation over At′. Note that (t′,d′) is not a priori small since (t,d) is not. By Corollary 6.6, there is a well-adapted (t′,d′)-representation r such that (t′,d′) is Br(E)-full. But At′=Br(E) by Theorem 2.23. Since r has no conjugate self-twists by Corollary 8.2, it follows that Σt′ is trivial. Thus At′Σt′=At′=Br(E).
By Corollary 3.12 it follows that (t,d) is At′Σt′-full. We know that AΣt and At′Σt′ have the same fields of fractions, namely K0, by Corollary 4.20. Furthermore A is obtained by adjoining the values of s(δ) and W(F) to At′. Therefore A, and hence AΣt, is finitely generated over A′, so (t,d) is AΣt-full by Lemma 3.5. Finally, A0 and AΣt are fullness peers in this setting by Corollary 4.21.
∎
9.2. Choosing a good (t,d)-representation
Throughout Section 9.2 through Section 9.4 we fix an absolutely irreducible regular representation ρ:Π→GL2(F) such that the order of detρ is a power of 2. We assume that [F:E] is a power of 2 by Lemma 8.8. If ρ is octahedral, we assume further that ρ is good. Furthermore, we fix a good basis for ρ as follows. If ρ is exceptional or large, choose a basis and a regular element g0∈Π such that Proposition 8.13 holds. If ρ=IndΠ0Πχ is dihedral, assume that ρ(Π0) is diagonal and Imρ contains a matrix \bigl{(}\begin{smallmatrix}0&b\\
c&0\end{smallmatrix}\bigr{)} such that bc−1∈Fp, which is possible by [Bel19, Proposition 6.3.2].
Recall that (T,d):Π→A is the universal constant-determinant pseudorepresentation. Part of our arguments will require appealing to a universal (T,d)-representation. This requires choosing a good (T,d)-representation ρuniv and also choosing our (t,d)-representation to be compatible with ρuniv. In particular, we want I1(ρuniv) to be fixed by all conjugate self-twists of (T,d). In Section 9.2 we make these choices and compatibilities precise. Since we need Propositions 9.2 and 9.3 for the universal ring A, in Section 9.2 we do not require A to be a domain, only a local pro-p ring.
Fix a generator σ1 of Σρ=Gal(F/E). We want to choose a character η1:Π→F× such that (σ1,η1)∈Σρ. There is a unique choice for η1 when ρ is not dihedral. If ρ is dihedral and Σt=1, choose η1 to be the trivial character. Recall from the end of Section 7.2 that βt:Σt→Σρ is given by reducing automorphisms of A modulo m. If ρ is dihedral and kerβt=1 but Σt=1, then there is a unique complement to Σρdi in Σρ that contains β~t(Σt). Choose η1 such that (σ1,η1) generates that complement. Otherwise, when ρ is dihedral, we may take η1 to be either of the two characters such that (σ1,η1)∈Σρ. Recall from (6) in Section 7.1 that Π0(ρ) is the intersection of the kernels of all twist characters of ρ. Define
[TABLE]
Let A1 be the subring of A topologically generated by t(Π1). Note that ρ∣Π1 is absolutely irreducible by Lemma 8.5 and Corollary 8.7.
Proposition 9.2**.**
There exists a well-adapted (t,d)-representation ρ:Π→GL2(A) such that ρ∣Π1 takes values in GL2(A1) and such that ρ is adapted to a regular element.
Proof.
With the exception of the well-adaptedness statement, the proof is well known since ρ∣Π1 is absolutely irreducible. Indeed, a theorem of Rouquier [Rou96, Theorem 5.1] and Nyssen [Nys96] tells us that there are representations ρ:Π→GL2(A) and ρ1:Π1→GL2(A1) such that trρ=t and trρ1=t∣Π1. By a theorem of Carayol and Serre, ρ∣Π1 and ρ1 are conjugate by a matrix in GL2(A) [Car94, Théorème 1].
For the well-adaptedness statement, let us first assume that ρ is not dihedral. Choose ρ adapted to g0 and ρ1 adapted to g0n with g0 and n as in Proposition 8.13. Then the matrix M∈GL2(A) such that M−1ρ∣Π1M=ρ1 commutes with \rho(g_{0}^{n})=\bigl{(}\begin{smallmatrix}s(\lambda_{0}^{n})&0\\
0&s(\mu_{0}^{n})\end{smallmatrix}\bigr{)}=\rho_{1}(g_{0}^{n}). Since λ0n=μ0n by Proposition 8.13, it follows that M must be diagonal. In particular, M commutes with ρ(g0). Hence M−1ρM is still adapted to g0 and satisfies the properties in the statement of the proposition.
The idea is similar when ρ is dihedral, except we can no longer assume that ρ1 is adapted to the g0∈Π0 such that ρ(g0) generates the unique index-2 subgroup of the projective image of ρ, because g0 may not be in Π1. Let ρ be a well-adapted (t,d)-representation, say adapted to g0 with \rho(g_{0})=\bigl{(}\begin{smallmatrix}s(\lambda_{0})&0\\
0&s(\mu_{0})\end{smallmatrix}\bigr{)}.
Since ρ is regular, it follows that ρ∣Π0(ρ) is multiplicity free over E by Proposition 8.6. Therefore, since ρ is well adapted, the image of ρ contains a matrix of the form \bigl{(}\begin{smallmatrix}s(\lambda)&0\\
0&s(\mu)\end{smallmatrix}\bigr{)} with λ=μ and λ,μ∈E×. Let h∈Π such that \rho(h)=\bigl{(}\begin{smallmatrix}s(\lambda)&0\\
0&s(\mu)\end{smallmatrix}\bigr{)}.
We claim that h∈Π1. It suffices to prove that h∈Π0(ρ) since Π0(ρ)=Π0∩kerη1⊂kerη1=Π1. Note that h∈Π0 as ρ(h) is diagonal. By Proposition 8.6, h∈Π0(ρ) if and only if the eigenvalues of ρ(h) are in E×. But the eigenvalues λ,μ of ρ(h) were chosen to be in E×. Therefore h∈Π1.
By [Bel19, Proposition 2.4.2] we may assume that ρ1 in the first paragraph of this proof is adapted to h. Therefore the matrix M∈GL2(A) such that M−1ρ∣Π1M=ρ1 commutes with \rho(h)=\bigl{(}\begin{smallmatrix}s(\lambda)&0\\
0&s(\mu)\end{smallmatrix}\bigr{)}=\rho_{1}(h). Since λ=μ, it follows that M is a diagonal matrix. Note that the second property in Definition 2.21 is unchanged by conjugation by a diagonal matrix. Therefore M−1ρM is still well adapted and satisfies the statement of the proposition.
∎
We recall that when ρ is dihedral, we may view elements in B as elements of A (2.12).
Corollary 9.3**.**
There exists a well-adapted (t,d)-representation ρ:Π→R× such that I1(ρ)⊆AΣt. If ρ is dihedral and σ∈Σt such that σ and kerβt generate Σt, then we may assume furthermore that B1(ρ) is pointwise fixed by σ.
Proof.
Let ρ be the (t,d)-representation from Proposition 9.2. Since the order of detρ is a power of 2, it follows that [Π:Π0(ρ)] is a power of 2. Since Γ is pro-p and p=2, it follows that Γ⊆ρ(Π0(ρ))⊆GL2(A1). Therefore L1(ρ)⊆sl2(A1), and so I1(ρ),B1(ρ)⊆A1.
Let (σ,η)∈Σt such that Π1⊆kerη. Then for all g∈Π1 we have
[TABLE]
and thus A1 is contained in the subring of A fixed by σ.
If kerβt=1, then every (σ,η)∈Σt satisfies Π1⊆kerη by definition of Π1. Thus if kerβt=1, then A1⊆AΣt, and hence I1(ρ),B1(ρ)⊆AΣt.
Now suppose that ρ is dihedral and kerβt=1. Then half of the elements (σ,η)∈Σt satisfy kerη⊆kerη1=Π1, namely all those in the preimage under βt of the subgroup generated by (σ1,η1) in Σρ. This proves the statement about B1(ρ) in the dihedral case. To see that I1(ρ) is fixed by all conjugate self-twists, it remains to show that I1(ρ) is fixed by the nontrivial element in kerβt. This follows from Proposition 7.5.
∎
In light of Corollary 9.3, let us fix a well-adapted (T,d)-representation ρuniv:Π→GL2(A) such that I1(ρuniv)⊆AΣT. Assume furthermore in the case when the projective image of ρ is not dihedral that we have conjugated ρuniv by the relevant diagonal element so that Theorem 2.23 applies to ρuniv, and thus to any quotient of ρuniv. Recall from the commutative diagram following Corollary 7.4 that we have a natural reduction map βT:ΣT→Σρ that sends an automorphism σ of A to the automorphism it induces on F=A/mA. In the case when ρ is dihedral, we need to choose a complement to kerβT in ΣT, whose generator we will denote by ν.
We choose ν such that (ν,η1)∈ΣT, where η1 is the character fixed prior to Proposition 9.2. By Corollary 9.3, we may and do assume that B1(ρuniv) is fixed by ν.
The universal property of (A,(T,d)) gives a surjective W(F)-algebra homomorphism αt:A→A. Let ρt:=αt∘ρuniv:Π→GL2(A). It is a (t,d)-representation such that I1(ρt)⊆AΣt by the diagram following Corollary 7.4. Furthermore, if ρ is dihedral and kerβt=1, then B1(ρt)⊆AΣt as well. By the functoriality of Pink-Lie algebras with respect to quotient maps, we have that αt(I1(ρuniv))=I1(ρt) and αt(B1(ρuniv))=B1(ρt). All of our theorems below will be specifically for this well-chosen representation ρt. To ease notation, write ρ=ρt.
By Proposition 7.5 and the fact that B1(ρ)⊆AΣt if ρ is dihedral and kerβt=1, it follows that
[TABLE]
We therefore define
[TABLE]
We claim that J⊂m is a multiplicatively closed W(E)-module by Theorem 2.23. The key is to note that, since ρ is regular and ρ is well adapted, it follows that Bellaïche’s field Fq from p. 2.5 is contained in E. Therefore it follows from Theorem 2.23 that (W(E)I1(ρ))3⊆W(E)I1(ρ) and W(E)I1(ρ)B1(ρ)⊆W(E)B1(ρ) and (W(E)B1(ρ))2⊆W(E)I1(ρ), which proves that J is multiplicatively closed. Define
[TABLE]
We have A=A unless 1=kerβt, in which case A=A+ by Proposition 7.5.
Remark 9.4*.*
The rings W(E)+J and AΣt differ only in their constants, W(E) versus W(Fβt(Σt)). Furthermore, W(E)+J is often equal to Bρ(E), and the goal of this section is to relate Bρ(E) with AΣt. Assume for a moment that W(E)+J=Bρ(E). Then the difference between Bρ(E) and AΣt is entirely governed by understanding which elements of Σρ lift to elements in Σt under βt. In particular, when there are elements in Σρ that do not lift to Σt, we will be interested in writing the extra elements in W(Fβt(Σt)) as quotients of elements in J to show that Q(Bρ(E))=Q(AΣt).
∎
9.3. Lifting conjugate self-twists to A
In Section 9.3 we study a condition on J, called smallness (Definition 9.6), that dictates which conjugate self-twists of ρ lift to conjugate self-twists of (t,d). This study culminates in Theorem 9.8. As a consequence, we prove in Corollary 9.9 that under such a smallness assumption, AΣt=Bρ(E). The reader is advised that, with the exception of the motivational remark following Definition 9.6, the assumption that A is a domain is never used in Section 9.3.
Throughout Section 9.3, fix a subgroup Σ⊆Σρ, and let F′:=FΣ. Write W:=W(F) and W′:=W(F′). For an arbitrary ring R and a finite group X of ring automorphisms of R, for any φ∈Hom(X,R×), we write
[TABLE]
As explained at the beginning of Section 9, we assume that [F:E] is a power of 2. By Lemma 8.9 we may apply Lemma A.22 to conclude that F=⊕φ∈Σ∗Fφ, where Σ∗:=Hom(Σ,F×). Note that since Σ=Gal(W/W′), it follows that this decomposition lifts to W. More precisely, viewing elements of Σ as automorphisms of W and elements of Σ∗ as valued in W× by composing with the Teichmüller map, we can define Wφ for each φ∈Σ∗. Then Lemma A.22 gives W=⊕φ∈Σ∗Wφ.
For all φ∈Σ, define
[TABLE]
where WφJ:={∑iαiji∣αi∈Wφ,ji∈J}. Since A=W+WJ it follows immediately that A=∑φ∈Σ∗A(φ). We will be interested in understanding when this sum is direct, because in that case we will show that it is possible to find lifts of elements of Σ in Σt. If a is an ideal of A and φ∈Σ∗, let a(φ):=A(φ)∩a and let (A/a)(φ)⊂A/a be the image of A(φ) under the natural projection A→A/a.
Lemma 9.5**.**
The following are equivalent:
(1)
A=⊕φ∈Σ∗A(φ).
2. (2)
For every A-ideal a such that a=⊕φ∈Σ∗a(φ), we have A/a=⊕φ∈Σ∗(A/a)(φ). Furthermore, there exists at least one such ideal a.
3. (3)
There exists an A-ideal a such that a=⊕φ∈Σ∗a(φ) and A/a=⊕φ∈Σ∗(A/a)(φ).
Proof.
First we show that (1) implies (2). We can take a=0 for the existence statement in (2). Now suppose that a is an A-ideal such that a=⊕φ∈Σ∗a(φ). If ∑φ∈Σ∗aφ=0∈A/a with each aφ∈(A/a)(φ), then letting aφ∈A(φ) be a lift of aφ, we see that ∑φ∈Σ∗aφ∈a=⊕φ∈Σ∗a(φ). Thus, there are αφ∈a(φ) such that ∑φ∈Σ∗aφ=∑φ∈Σ∗αφ. Since A=⊕φ∈Σ∗A(φ), it follows that aφ=αφ for all φ∈Σ∗. Thus aφ=0∈A/a for all φ∈Σ∗ and hence A/a=⊕φ∈Σ∗(A/a)(φ).
The fact that (2) implies (3) is trivial.
To see that (3) implies (1), suppose that a is an A-ideal such that a=⊕φ∈Σ∗a(φ) and A/a=⊕φ∈Σ∗(A/a)(φ). For each φ∈Σ∗ fix a set Sφ⊂A(φ) of representatives of (A/a)(φ) such that 0∈Sφ. Suppose that ∑φ∈Σ∗aφ=0 with aφ∈A(φ). Then there is a unique way to write each aφ as
[TABLE]
with sφ∈Sφ and αφ∈a(φ). Modulo a, we see that
[TABLE]
Since A/a=⊕φ∈Σ∗(A/a)(φ), it follows that sφ=0 for all φ. As 0∈Sφ, it follows that sφ=0 for all φ∈Σ∗. Therefore aφ=αφ∈a(φ). As a=⊕φ∈Σ∗a(φ), it follows that each aφ=0. Thus A=⊕φ∈Σ∗A(φ).
∎
Definition 9.6**.**
Let L2⊂L1 be subfields of F. We say that J is small with respect toL1/L2 if
[TABLE]
where the map is multiplication inside A. Otherwise, we say that J is big with respect toL1/L2.
To motivate Definition 9.6, recall from 9.4 that we need to be able to write elements of W(Fβt(Σt)) as quotients of elements in J whenever Fβt(Σt)=E. Suppose that L2=E,L1=Fβt(Σt), and [L1:L2]=2. Write L1=L2(α). Then W(L1)=W(L2)⊕s(α)W(L2) and so
[TABLE]
If J is big with respect to L1/L2, then we can find x,y∈W(L2)J∖{0} such that x+s(α)y=0. Thus s(α)=x/y, and hence W(L1) is in the field of fractions of any domain containing W(L2)J. In contrast, the following proposition shows that when J is small with respect to F/F′, elements of Σ can be lifted to automorphisms of A.
Proposition 9.7**.**
If J is small with respect to F/F′, then A=⊕φ∈Σ∗A(φ). In this case, every σ∈Σ can be lifted to an automorphism σ of A such that σ acts trivially on J, and a lift with this property is unique.
Proof.
Note that a:=WJ is an A-ideal since A=W+WJ and J is multiplicatively closed as discussed prior to 9.4. The assumption that J is small with respect to F/F′ implies that WJ=⊕φ∈Σ∗WφJ. Indeed,
[TABLE]
and the image of Wφ⊗W′W′J is exactly WφJ. Since WJ=∑φ∈Σ∗WφJ, it follows that the multiplication map is an isomorphism and thus WJ is graded by Σ∗. Note that a(φ)=WφJ, so a=⊕φ∈Σ∗a(φ).
By Lemma 9.5, for the first statement of the proposition it suffices to show that A/WJ=⊕φ∈Σ∗(A/WJ)(φ). Note that
[TABLE]
and W∩WJ is a closed W-submodule of pW since J⊆mA. Thus we have W∩WJ=pnW and A/WJ≅W/pnW for some 1≤n≤∞, where p∞W:={∞}. Since W is graded by Σ∗, it follows from Lemma 9.5 that W/pnW is graded by Σ∗ as well. Therefore A=⊕φ∈Σ∗A(φ).
For the second statement, we let σ act by W(σ) on W and trivially on J. The only question is to verify that this is well defined. Since A=⊕φ∈Σ∗A(φ), it suffices to show that σ is well defined on each A(φ). That is, we must show
[TABLE]
where αi∈Wφ,ji∈J. Since J is small with respect to F/F′, ∑i=1nαiji=0 implies that ∑i=1nαi⊗ji=0∈W⊗W′W′J. Since αi∈Wφ, we know that W(σ)(αi)=s(φ(σ))αi for all i. Hence
[TABLE]
Therefore ∑i=1nW(σ)(αi)ji=0, since it is the image of s(φ(σ))∑i=1nαi⊗ji under W⊗W′W′J→WJ.
∎
Now that we have lifted elements of Σ to automorphisms of A under the smallness assumption, we would like to verify that the lifts are conjugate self-twists of (t,d) when A=A and that they come from conjugate self-twists when A=A+. (Recall that A+ is only defined when kerβt is nontrivial; see Proposition 7.5.)
Theorem 9.8**.**
If J is small with respect to F/F′ then Σ is contained in the image of βt:Σt→Σρ. Furthermore, every lift of σ∈Σ to Σt acts trivially on J.
Proof.
Fix σ∈Σ. By Proposition 9.7, there is a unique σ∈AutA that acts as W(σ) on W and fixes J. If kerβt=1, then A=A. If kerβt=1, then A=A+ and we need to extend σ to A=A+⊕A−. We do this by declaring that σ fixes A−; we will still denote the automorphism of A by σ. We already know that σ acts trivially on J, so it is enough to prove that σ∈Σt.
Our strategy is to show that σ comes from an appropriate element of ΣT. More precisely, we claim that there is some (σ~,η)∈ΣT such that σ∘αt=αt∘σ~, where αt:A→A is the W-algebra homomorphism given by universality. If this is true, then for all g∈Π we have
[TABLE]
since αt is a W-algebra homomorphism and η(g)∈W. Thus σ∈Σt.
First suppose that ρ is not dihedral. Then there is a unique lift σ~ of σ in ΣT by Lemma 7.1 and Proposition 7.3. By Proposition 7.3, we know that σ~ acts as W(σ) on the image of W in A. Furthermore, σ~ acts trivially on I1(ρuniv) by our fixed choice of ρuniv after Corollary 9.3. Since ρ is not dihedral, it follows that A=W+WJ(ρuniv). By the construction of ρuniv and ρt, we have αt(I1(ρuniv))=I1(ρ) and thus αt(J(ρuniv))=J(ρ)=J. Recall that σ acts trivially on J. Both σ~ and σ act on W by W(σ). Thus for any ∑i=1naixi with ai∈W,xi∈J(ρuniv)∪{1}, we have
[TABLE]
If ρ is dihedral, then there are two lifts of σ in ΣT by Lemma 7.1 and Proposition 7.3. One acts on A− by +1 and the other acts by −1 by Proposition 7.5 and since we chose ρuniv such that B1(ρuniv) is fixed by ν, which generates a complement of kerβT. Let σ~∈ΣT be the lift of σ that is in ⟨ν⟩. Thus σ~ acts trivially on J(ρuniv) and B1(ρuniv). Then an argument similar to that in the previous paragraph shows that σ∘αt=αt∘σ~.
∎
Corollary 9.9**.**
If J is small with respect to F/E then AΣt=W(E)+J. Suppose furthermore that either ρ is not dihedral or kerβt=1. Then AΣt=Bρ(E).
Proof.
By Theorem 9.8 applied to Σ=Σt, the map βt is a surjection and Σt acts trivially on J. If kerβt=1, then A=A=W+WJ, so AΣt=W(E)+J.
If kerβt=1, then A=W+WI1(ρ)+WI1(ρ)2+WB1(ρ) and J=W(E)I1(ρ)+W(E)I1(ρ)2. Note that AΣt⊆A+=W+WJ since the nontrivial element in kerβt acts by −1 on B1(ρ) by Proposition 7.5. As above, we have that
[TABLE]
The last sentence in the statement of the corollary follows from the definition of J.
∎
Remark 9.10*.*
Note that none of the arguments in Section 9.3 require that A is a domain. In particular, when J is small with respect to F/E and either ρ is not dihedral or kerβt=1, Corollary 9.9 gives a conceptual interpretation of the ring Bρ(E).
∎
9.4. When J is big with respect to F/E
Corollary 9.9 requires the assumption that J is small with respect to F/E. We do not always expect this to be true. The purpose of Section 9.4 is to show that AΣt and Bρ(E) have the same fraction field and AΣt is a finite type Bρ(E)-module even without the smallness assumption. This is done in Corollary 9.15, although the two key inputs to that theorem are Proposition 9.13 and Proposition 9.14. Then we can apply Lemma 3.5 and Corollary 6.6 to conclude that ρt, and thus (t,d), is AΣt-full in Corollary 9.16.
The discussion following Definition 9.6 shows why one may expect to get Q(AΣt)=Q(Bρ(E)) when smallness fails and [Fβt(Σt):E]=2. Unfortunately, the assumption that [Fβt(Σt):E]=2 is rather critical to that argument. This is the primary reason we insist that [F:E] be a power of 2 throughout this section. It allows us to split up the extension Fβt(Σt)/E into a series of quadratic extensions, and thus we can apply the argument following Definition 9.6 inductively. This is the essential idea of the argument; we now prepare some notation to formalize it.
Write [Fβt(Σt):E]=2n for some n≥0.
For integers 0≤i≤n, let Ei be the unique extension of E of degree 2i. In particular, E0=E and En=Fβt(Σt), and [Ei:Ei−1]=2 for all 1≤i≤n. For 0≤i≤n, let Wi denote the image of W(Ei) in A. Define
[TABLE]
In particular, A0=W(E)+J and An=W(Fβt(Σt))+W(Fβt(Σt))J. Since A is a domain, so are all of the Ai, and we write Q(Ai) for the field of fractions of Ai.
In the case when A=A+, there is a 2-to-1 group homomorphism Σt→AutA+ given by restricting elements of Σt to A+. Let Σt(A) denote the image of this map when A=A+, and otherwise (that is, whenever kerβt=1) let Σt(A)=Σt. In either case we can identify Σt(A) with a subgroup of Σρ via βt, and we have En=Fβt(Σt(A)). We write Σt(A)∗:=Hom(Σt(A),A×).
We begin with two preliminary lemmas about the relationship between smallness and the Ei.
Lemma 9.11**.**
We have that J is small with respect to F/En; that is, ker(W⊗WnWnJ→WJ)=0.
Proof.
Recall that WJ is an A-ideal that is stable under the action of Σt(A) since Σt fixes J by the construction of ρt. By Lemma 8.9, we can apply Lemma A.22 with X=Σt(A). Therefore
[TABLE]
where (WJ)φ:={x∈WJ:σx=φ(σ)x,∀σ∈Σt(A)}. Recall that WφJ was defined prior to Lemma 9.5. We claim that
[TABLE]
Clearly (WJ)φ⊇WφJ since Σt(A) acts trivially on J. On the other hand,
[TABLE]
so we must have equality.
For each φ∈Σt(A)∗, choose xφ∈Fφ∖{0}. Then {s(xφ):φ∈Σt(A)∗} is a Wn-basis for W. Thus we have
[TABLE]
If x∈ker(W⊗WnWnJ→WJ), then we can write
[TABLE]
for some yφ∈WnJ. Then we have
[TABLE]
and s(xφ)yφ∈WφJ. Since WJ=⊕φ∈Σt(A)∗WφJ, it follows that each s(xφ)yφ=0. As A is a domain and s(xφ)=0, it follows that yφ=0 for all φ∈Σt(A)∗.
∎
Lemma 9.12**.**
We have
[TABLE]
Proof.
Let K:=ker(Wn⊗Wn−1Wn−1J→WnJ). We have an exact sequence of Wn-modules
[TABLE]
Since W is free over Wn, tensoring with W over Wn gives an exact sequence
[TABLE]
We can identify the last nonzero term in this sequence with WJ by Lemma 9.11.
Thus W⊗WnK=ker(W⊗Wn−1Wn−1J→WJ).
∎
Proposition 9.13**.**
If J is big with respect to F/E then Q(An)=Q(An−1).
Proof.
We claim that J is big with respect to En/En−1. Indeed, if J were small with respect to En/En−1, then J would be small with respect to F/En−1 by Lemma 9.12. Therefore we could apply Theorem 9.8 with Σ=Gal(F/En−1), which implies that En⊆En−1, a contradiction.
Let {1,α} be an En−1-basis for En. Then {1,s(α)} is a Wn−1-basis for Wn and so
[TABLE]
Since J is big with respect to En/En−1, there exist x,y∈Wn−1J∖{0} such that
[TABLE]
Thus, s(α)=−x/y∈Q(An−1). It follows that Wn⊂Q(An−1) and hence Q(An)=Q(An−1).
∎
Finally, we descend from Q(An) to Q(A0) by induction on n.
Proposition 9.14**.**
For all 2≤k≤n, if Q(Ak)=Q(Ak−1) then Q(Ak−1)=Q(Ak−2). In particular, if J is big with respect to F/E, then Q(AΣt)=Q(W(E)+J).
Proof.
Note that for any k≥1 we have Q(Ak)=Q(Ak−1) if and only if Wk⊆Q(Ak−1). Assume that Q(Ak)=Q(Ak−1) for some k, 2≤k≤n. Choose α∈Ek−2,β∈Ek−1 such that Ek−1=Ek−2(α) and Ek=Ek−1(β). Define α:=s(α) and β:=s(β), so Wk−1=Wk−2(α) and Wk=Wk−1(β). It suffices to show that α∈Q(Ak−2).
Since Q(Ak)=Q(Ak−1), we can write β=x/y with x,y∈Ak−1∖{0}. By multiplying x and y by any nonzero element of Wk−1J, we may assume that x,y∈Wk−1J∖{0}.
Note that we can write y=i1+αi2 with i1,i2∈Wk−2J. If i1−αi2=0, then by multiplying x and y by i1−αi2, we may assume that y∈Wk−2J∖{0}. If i1−αi2=0 and y∈/Wk−2J, then we must have i2=0 since y=0 and α=i1/i2∈Q(Ak−2), as desired. We assume henceforth that y∈Wk−2J.
Write x=a+bα for some a,b∈Wk−2J. Then we have yβ=a+bα and thus
[TABLE]
Since β∈Wk−1, we may write β=e+fα for some e,f∈Wk−2. Note that f≡0modp since [Ek:Ek−1]=2 and Ek=Ek−1(β). Substituting this into equation (7), we see that
[TABLE]
Note that y2f−2ab∈Wk−2J since all of y,f,a,b∈Wk−2J. If y2f−2ab=0, then we can conclude that α∈Q(Ak−2) as desired.
Henceforth, assume that y2f=2ab. Then we also have y2e=a2+αb2. Thus 2f−1eab=a2+αb2. Note that a,b=0 since 2ab=y2f and we know y,f=0. Then we have
[TABLE]
Therefore ba is a root of t2−2f−1et+α∈Wk−2[t]. The discriminant of this polynomial is 4(f−2e2−α). We claim that Wk−1⊇Wk−2(f−2e2−α). Note that since β=s(β) and s is multiplicative we have that e2−f2α=s(NEk−1/Ek−2(β)), and therefore e2−f2α=s(NEk−1/Ek−2(β))∈s(Ek−1×). Therefore ba∈Wk−1. Thus we can write β=y/bx/b=y/bba+α, and so
[TABLE]
Note that ba+α=0 since y=0. It follows that (ba+α)β−1 generates Wk over Wk−1 since ba+α∈Wk−1 and β generates Wk over Wk−1. Thus (ba+α)β−1=by∈Q(Ak−2) and so Wk⊂Q(Ak−2). Therefore Q(Ak)=Q(Ak−2).
For the second statement of the proposition, note that by Proposition 9.13 we have Q(An)=Q(A0) for all 0≤k≤n. We have A0=W(E)+J by definition. Since Σt acts trivially on J, it follows that
[TABLE]
Corollary 9.15**.**
We have
(1)
AΣt⊇Bρ(E);
2. (2)
AΣt* is a finitely generated Bρ(E)-module;*
3. (3)
AΣt* has the same field of fractions as Bρ(E).*
In particular, AΣt and Bρ(E) are fullness peers.
For (2), if either ρˉ is not dihedral or kerβt=1, then AΣt=Bρ(Fβt(Σt)), which is finitely generated over Bρ(E) since W(F)Σt is finitely generated over W(E). In the case when ρˉ is dihedral and kerβt=1, recall that Bρ(E) is a noetherian ring (Proposition 7.6(1)) and A=Bρ(F)+W(F)B1(ρ) is a noetherian Bρ(E)-module by Proposition 7.6(2) and the fact that W(F) is noetherian over W(E). Thus the Bρ(E)-submodule AΣt of A is necessarily noetherian and hence finitely generated.
The third point has largely been established already. When J is small with respect to F/E, it follows from Corollary 9.9. When J is big with respect to F/E and either ρ is not dihedral or kerβt=1, this follows from Proposition 9.14 since in those cases W(E)+J=Bρ(E). Finally, when ρ is dihedral, kerβt=1, and J is big with respect to F/E we have
We now have the following corollary, which summarizes the most general theorem we have for images of admissible pseudodeformations with 2-power determinant.
Corollary 9.16**.**
Let ρ:Π→GL2(F) be a regular representation such that the order of detρ is a power of 2. If ρ is octahedral, assume furthermore that ρ is good. Let A be a domain and (t,d):Π→A an admissible pseudodeformation of ρ. If (t,d) is not a priori small, then (t,d) is AΣt-full, hence A0-full.
In Section 10 we draw conclusions from Corollary 9.16 that are useful in applications and when comparing our work with previous results in the literature. Although the constant determinant assumption is important to be able to use Bellaïche’s work in Section 9, in practice one rarely works in a constant-determinant setting. Here we give the most general fullness result we can prove; in particular, we remove the constant-determinant assumption present in Corollary 9.16. We then recast our main theorem and other highlights of the theory of fullness in the language of representation theory rather than pseudorepresentations.
To ensure that our main result can be read independent of much of the rest of the paper, we briefly recall our notation and terminology. Let p be an odd prime, A a local pro-p domain with maximal ideal m and residue field F, and Π a p-finite profinite group (Definition 2.6). We are interested in a continuous pseudodeformation (t,d):Π→A of a semisimple representation ρ:Π→GL2(F). Unlike much of the paper, in Section 10 we never require A to be the trace algebra of (t,d). We say (t,d) is not a priori small if it is not reducible, dihedral, or equal to a twist of its Teichmüller lift (Definition 2.3), and it is A0-full if the image of some representation carrying (t,d) contains, up to conjugation, an A0-congruence subgroup (Definition 3.1).
Recall that A0 is the adjoint trace ring of (t,d) (Definition 4.6); its residue field is the trace field of ρ and is denoted E. We say that ρ is regular if its image contains a matrix whose eigenvalue ratio is in E×∖{±1}: see Definition 2.19 and 2.20 immediately following. When ρ is octahedral (that is, projective image S4), see Definition 8.10 for the definition of goodness.
Theorem 10.1**.**
Let p>2 be prime, A a local pro-p domain with residue field F, and Π be a p-finite profinite group. Let ρ:Π→GL2(F) be a regular semisimple representation that is good if ρ is octahedral. If (t,d):Π→A is a pseudodeformation of ρ that is not a priori small, then (t,d) is A0-full.
Proof.
Let χ:Π→A× be a character such that (t′,d′):=(χt,χ2d) is a constant-determinant pseudorepresentation, and write ρ′:=χ⊗ρ, where χ:Π→F× is the reduction of χ modulo m. Assume that χ is chosen such that ρ′ has no conjugate self-twists if ρ is reducible and the order of detρ′ is a power of 2 if ρ is absolutely irreducible. This is possible by Corollary 8.2 in the reducible case and Lemma 8.4 in the absolutely irreducible case. Furthermore, note that if ρ is octahedral and good, then so is ρ′ by Lemma 8.11. Let A′ be the subring of A topologically generated by t′(Π). We have seen in Proposition 9.1 and Corollary 9.16 that if ρ′ is regular (and under the further assumption that ρ is good when ρ is octahedral) and (t′,d′) is not a priori small, then (t′,d′) is A0-full. This is sufficient by Corollary 3.12.
∎
Remark 10.2*.*
Let (t,d) be as in Theorem 10.1.
Then its constant-determinant twist (t′,d′):Π→A satisfies the conditions of Corollary 9.16, so that it is At′Σt′(At′)-full. By Corollary 3.12, (t,d) itself is also At′Σt′(At′)-full. However it does not follow that that (t,d) is AtΣt(A)-full.
Indeed, (t,d) may be affected by the pathologies pointed out in Examples 4.3 and 4.10, or even worse, the pro-p-part of d may be transcendental over A0. Such obstacles are not faced by well-behaved A0.
∎
We end this section by recasting all our headline results, including our main result (Theorem 10.1) in the language of representation theory, convenient for comparing applications to results in the literature in Section 12.
Building on the notation recalled above, let \rho:\Pi\to\operatorname{GL}_{2}\big{(}Q(A)\big{)} be a representation whose trace lands in A and is continuous, and let (t,d)=(trρ,detρ). Recall that ρ is not a priori small if ρ is strongly absolutely irreducible (Proposition 2.4). Write Aρ for the trace algebra of ρ — the W(F)-subalgebra of A topologically generated by t(Π) — and K for its field of fractions. Let K0 denote the field of fractions of A0, the adjoint trace ring of ρ. There is a unique semisimple representation ρ whose trace is equal to the reduction of t modulo m.
Finally, recall that we say that the determinant of ρ is A0-constant if the pro-p part of detρ is A0-valued. In this case K/K0 is Galois (Theorem 4.19).
Theorem 10.3**.**
Let ρ be not a priori small. Suppose ρ is regular, and good if octahedral.
(1)
A0-fullness:* ρ is A0-full.*
2. (2)
Optimality:* If ρ is A′-full for a subring A′⊆A, then A0 contains a fullness peer of A′.*
3. (3)
All CSTs fix A0:* For any extension B of A we have Σρ(B)=Σρ(B/A0).*
If further K is a separable extension of K0, then:
(4)
CSTs carve out K0:* (Ksep)Σρ(Ksep)=K0.*
If further still ρ has A0-constant determinant, then:
(5)
All CSTs are simple:* Every Ksep-valued conjugate self-twist restricts to a simple Aρ-valued one: Σρ(Ksep)↠Σρ(Aρ). Moreover,
KΣρ(Aρ)=K0.*
2. (6)
CST-invariants fullness:* ρ is AρΣρ(Aρ)-full.*
Proof.
Recall that ρ is A′-full if and only if (trρ,detρ) is A′-full by Lemma 3.2(2). Thus the first statement follows from Theorem 10.1 and the second from Theorem 5.3.
The third statement follows from (1) by Corollary 4.24. The fourth statement follows from Corollary 4.12(2) with L=Ksep, E=K0, F=K and F0 the (nontopological) adjoint trace field of (t,d) viewed as valued in K. For the fifth statement, use Corollary 4.24 to obtain the restriction map Σt(Ksep/A0)→Σt(Aρ), surjective since Ksep is normal over K. Alternatively, K is Galois over K0 with Σt(Aρ)=Gal(K/K0) by Theorem 4.19 and then Σt(Ksep)=Gal(Ksep/K0) by Theorem 4.11. The last statement follows from (1) and Corollary 4.21. ∎
11. Residually large representations
Here we show that by imposing stronger conditions on the residual image when ρ is large, we obtain a more precise description of the image of ρ. That is, we assume that ρ:Π→GL2(A) is a continuous representation such that Imρ⊇SL2(E) and ρ is large.
Under this assumption we have a more precise understanding of the image of ρ than simply fullness. Unlike our main fullness result, in this section A is any local pro-p ring; it need not be a domain.
Historically, this is the case that has been studied the most, starting with the work of Boston in [MW86, Appendix]. Boston shows that if A is a complete local noetherian ring and H is a closed subgroup of SL2(A) that projects onto SL2(A/m2), then H=SL2(A) [MW86, Appendix Proposition 2]. Bellaïche has pointed out that Boston’s result follows from his work [Bel19, Remark 6.8.4]. As an application of the description of the image found in Proposition 11.1, we show in Theorem 11.3 that one can replace the hypothesis that H projects onto SL2(A/m2) with the hypothesis that H projects onto SL2(A/m) and A is the trace ring of H to obtain the same conclusion. Theorems of this form have been obtained in special cases, for instance for the Galois representation attached to the mod-p Hecke algebra [Amo21, Theorem, Introduction]. Note that assuming H projects onto SL2(A/m), the hypothesis about A being the trace algebra can always be arranged while Boston’s hypothesis about projecting onto SL2(A/m2) may fail. Moreover, our description of the image in Proposition 11.1 does not require that the residual image contain all of SL2(F), rather only SL2(E), and is thus more general that the setting of Boston’s work.
Let ρ:Π→GL2(A) be a continuous representation such that Imρ⊇SL2(E). Note that such a ρ is large if and only if E=F3,F5. Let χ:Π→A× be the character described in the proof of Theorem 10.1, and let r:=χ⊗ρ. Assume that ρ is conjugated in such a way that Theorem 2.23 applies to r. Recall that Br(E)=W(E)[I1(r)].
Proposition 11.1**.**
Assume #E≥7. Then
(1)
Imr⊇SL2(Br(E))* as a finite index subgroup;*
2. (2)
Imρ⊇SL2(Br(E));
3. (3)
Br(E)* is the largest subring B of A for which Imρ⊇SL2(B).*
Proof.
For ease of notation, write
mr for the maximal ideal of Br(E).
Since Imr⊇SL2(E), it follows that Imr⊇SL2(W(E)) by [Man15, Main Theorem]. In particular, p∈I1(r) and so mr=I1(r) by Theorem 2.23. Let H be the subgroup of G:=Imr generated by Γ=Γ(r) and SL2(W(E)). Then H is a finite index subgroup of G since Γ is.
We claim that H=SL2(Br(E)). Indeed, note that Γ=ΓBr(E)(mr) by [Bel19, Corollary 6.8.3] and the fact that mr=I1(r). In particular, this shows that H⊆SL2(Br(E)). In fact, H is a subgroup of SL2(Br(E)) such that H/Γ=SL2(E)=SL2(Br(E))/Γ. Thus we must have equality.
Now suppose that Imr⊇SL2(B) for some subring B of A. Without loss of generality, we may assume B is closed, hence local. Then Γ⊇ΓB(mB), which implies that I1(r)⊇mB. On the other hand, if Imr⊇SL2(B) then Imr⊇SL2(B/mB). By definition of E, we know that E is the largest subfield of F such that Imr⊇SL2(E). Thus we must have B/mB⊆E. It follows that B⊆Br(E).
As for ρ, note that there is a character χ~:Imρ→A× such that Imr={xχ~(x):x∈Imρ}. Now χ~ must be trivial on SL2(B) for any ring B whose residue field has more than three elements by Corollary 3.10. Therefore Imρ and Imr contain the same copies of SL2.
∎
Corollary 11.2**.**
If #E≥7 and Imρ⊇SL2(E), then Imρ contains SL2(A0) up to conjugation.
Proof.
By Proposition 11.1 it suffices to show that Br(E)⊇A0, and this only needs to be shown when r=ρ is the constant-determinant universal representation. Without loss of generality we may twist to assume that the order of detρ is a power of 2. By Theorem 2.23 we have A=Br(F) since ρ is large. By Corollary 9.3 we may conjugate ρ to assume that I1(ρ) is fixed by all (A-valued) conjugate-self twists of ρ, and by Corollary 7.4 all conjugate-self twists of ρ lift to conjugate self-twists of ρ since ρ is universal. Therefore
[TABLE]
Theorem 11.3**.**
Let A be a pro-p local noetherian ring with p=2 and residue field F. Let H be a closed subgroup of SL2(A) that projects onto SL2(F). If A is the trace ring of H, that is, A is topologically generated by trH, then H=SL2(A).
Proof.
Let ρ:H→SL2(A) be the natural inclusion. Note that ρ has constant determinant. Since A is the trace ring of ρ, Theorem 2.23 implies that A=Bρ(F). Thus by Proposition 11.1, it suffices to show that E=F. Since Imρ=SL2(F), it follows that E is the subfield of F generated by the squares of traces of SL2(F). A straightforward matrix calculation shows that E=F.
∎
Remark 11.4*.*
In the preprint [AB], Aryas-de-Reina and Böckle prove a large image result for a residually full representation Π→G(A), where G is an adjoint group and A is the ring of definition of the representation. It seems possible to recover Theorem 11.3 by applying their result to the projective representation Pρ:Π→PGL2(A) attached to ρ and using the fact that the ring of definition of Pρ is the ring fixed by the conjugate self-twists of ρ.
∎
12. Applications to Galois representations
In this section we specialize Theorem 10.1 to some arithmetic settings, more specifically to representations coming from elliptic, Hilbert, and Bianchi cuspidal eigenforms (Section 12.1 through Section 12.3) and cuspidal p-adic families of elliptic and Hilbert eigenforms (Section 12.4 through Section 12.6). We explain how to recover, and in some cases improve, the results already present in the literature.
In particular, since our methods are entirely agnostic about the group Π, they reveal that many of the classical big-image results are fundamentally algebraic in nature: they do not rely on the arithmetic input, such as local information at the places where a Galois representation is ramified, that went into the original proof.
12.1. Classical modular eigenforms
Let f be a non-CM cuspidal modular eigenform of some level and some weight k≥2 defined over a number field K. Fix an algebraic closure Q of Q and let GQ:=Gal(Q/Q). For any prime p of K lying over a rational prime p, let Kp be the completion of K at p and Op its ring of integers. A construction of Deligne attaches to this data an irreducible continuous representation ρf,p:GQ→GL2(Kp), unramified almost everywhere and hence factoring through a p-finite extension, whose traces of Frobenius elements at unramified primes
correspond to Hecke eigenvalues of f.
Because GQ is compact we may view ρf,p as taking values in GL2(Op).
The following result about the image of ρf,p was proved by Ribet and Momose in the 1980s, generalizing an earlier theorem of Serre about Tate modules of elliptic curves. Let Kp,0 be the subfield of Kp fixed by all the generalized conjugate self-twists of ρf,p, and Op,0 its ring of integers.
Theorem 12.1** (Ribet, Momose at p: first version [Rib85, Mom81]).**
For all but finitely many primes p of K, the representation ρf,p is Op,0-full.
To show the extent to which our work recovers the result of this theorem at p, we first make more explicit Ribet and Momose’s condition on p.
Let K0 be the subfield of K fixed by the conjugate self-twists of f; the p-adic field Kp,0 defined above is the completion of K0 at the prime under p. Let H⊆GQ be the intersection of the kernels of all the conjugate self-twist characters of f:
[TABLE]
Then H is a finite-index normal subgroup of GQ.
Because all the conjugate self-twist characters of ρf,p are trivial on H, the trace of ρf,p∣H lands in Kp,0.
As described just after Definition A.11, there is therefore a Kp,0-quaternion algebra Dp splitting over Kp with ρf,p(H)⊂Dp×⊂GL2(Kp).
Ribet and Momose describe a global analogue to this picture. They define a global K0-quaternion algebra D split over K with D(Kp,0)≅Dp: that is, for each prime p, the restriction to H of ρf,p can be viewed as taking values in D(Kp,0)×. By compactness again we may view ρf,p(H) as a subgroup of the units of a maximal order OD,p of D(Kp,0). Ribet and Momose’s adelic open-image theorems say that the image of ρf,p always contains an open subgroup of the norm-1 units of OD,p×, and for all but finitely many p it contains all of those norm-1 units.
In particular, if D(Kp,0) is split, then up to conjugation OD,p×=GL2(Op,0); and we can therefore make Theorem 12.1 more precise.
Theorem 12.2** (Ribet, Momose at p: second version [Rib85, Mom81]).**
If D(Kp,0) is split, then the representation ρf,p is Op,0-full.
Remark 12.3*.*
In fact, the statement in Theorem 12.2 is an if-and-only-if. Indeed, no element of D(Kp,0) can have distinct GL2-eigenvalues in Kp,0. A matrix g with eigenvalues α,β satisfies (g−α)(g−β)=0. If α,β are in the center of a division algebra containing g, and at the same time are eigenvalues of g in any matrix setting, then the Cayley-Hamilton equation (g−α)(g−β)=0 means that either g=α or g=β. Thus no embedding of D(Kp,0) into GL2(Kp) can contain any congruence subgroup of
SL2(Op,0), or even of SL2(Zp). The same is true for all of ρ(Π): see Theorem 4.19 and Proposition 3.11 or Proposition 12.5 below. In other words, no nonsplit ρf,p can ever satisfy our present definition of fullness. As we’ve defined it, fullness is fundamentally a GL2 property;
a fitting notion for more general algebraic groups generalizing Ribet and Momose’s openness beyond Krull dimension 1 is outside the scope of this investigation.
∎
We now show that our results recover Ribet and Momose’s theorem at p in most cases. Let F be the residue field of Op, a finite extension of Fp, and let ρ:GQ→GL2(F) be the semisimplification ρf,p modulo the maximal ideal of Op.
Recall that ρ is regular if its image contains a matrix whose eigenvalue ratio
is not ±1 but is contained in the trace algebra E of adρ (which is a subfield of the residue field of Op,0): see Definition 2.19 and 2.20 immediately following.
If ρ is octahedral (that is, projective image S4), see Definition 8.10 for the notion of goodness.
Theorem 12.4** (Our results recovering Ribet and Momose at p).**
Assume that p is odd and that ρ is regular; if ρ is octahedral, assume further that ρ is good. Then ρf,p is Op,0-full.
Proof.
Since f is cuspidal, non-CM, and has weight k≥2, its associated representation ρf,p is strongly absolutely irreducible ([Rib77, Proposition 4.4]) and hence not a priori small. By Theorem 10.3, ρf,p is A0-full, where as usual A0⊆Op is the adjoint trace ring. By Corollary 4.21, A0 and Op,0 are fullness peers, and ρf,p is Op,0-full.
∎
The regularity assumption in Theorem 12.4 a posteriori forces D(Kp,0) to split (12.3). We can also see that a nonsplit D(Kp,0) means an irregular ρ directly:
Proposition 12.5**.**
If D(Kp,0) is a division algebra, then ρ is reducible and not regular.
Proof.
We first show that ρ∣H is reducible and not regular. Let L/Kp,0 be the unique quadratic unramified extension, π is a uniformizer of either, and σ the nontrivial element of Gal(L/Kp,0). Note that we do not assume that L is a subfield of Kp. Write ℓ, k for the residue fields of L, Kp,0, respectively; then [ℓ:k]=2 and E⊆k.
By compactness ρ(H) can be viewed as a subgroup of
[TABLE]
the maximal order of D(Kp,0) as viewed inside GL2(L),
which gives a representation ρ′ of H over L. By inspection, it is clear that its residual representation ρ′ is, up to semisimplification, a sum of two characters to ℓ conjugate over k. This means that the eigenvalue ratio r of any element in ρ′(H)
is in the form r=a#k−1 for some a∈ℓ×. Such an element is in k if and only if r#k−1=1; in other words, if and only if
[TABLE]
But this last is only possible if r=±1. In other words, ρ′ is residually neither absolutely irreducible nor regular. Since ρ′ is isomorphic to ρ∣H over Qp, the same is true for ρ∣H as well.
We now follow [Nek12, B.4.8(1)] to claim that the same is true for ρ on all of Π. Change notation to let L be any subextension of Kp that is quadratic over Kp,0 and hence splits D(Kp,0), with σ a generator of Gal(L/Kp,0) and π a uniformizer of Kp,0 that is not a norm from L. Since H is normal in Π and ρ(H) spans D(Kp,0) over Kp,0, the image of ρ, up to conjugation, will be contained in the normalizer of the subgroup Q={(ασ(β)πβσ(α)):(α,β)∈L2−{(0,0)}}, isomorphic to D(Kp,0)×, in GL2(Kp). One can show that this normalizer is just Kp×Q(viii)(viii)(viii)Certainly, Kp×Q normalizes. Conversely, if (acbd)∈GL2(Kp) normalizes (01π0), then the off-diagonal relation gives us σ((πa2−π2c2)δ−1)=(πd2−b2)δ−1,
where δ=ad−bc; if it normalizes (0−tπt0), where t∈L with σ(t)=−t, then the off-diagonal gives
σ((πa2+π2c2)δ−1)=(πd2+b2)δ−1;
if it normalizes (t0−t) then the off-diagonal is
σ(a(πc)δ−1)=dbδ−1.
Combining the first two, we obtain σ(a2δ−1)=d2δ−1 and σ((πc)2δ−1)=b2δ−1; adding the third gives us, for example if a is invertible, σ(πc/a)=b/d. By considering the diagonal relations, we also get σ(b/a)=πc/d and σ(a/d)=d/a. In other words any normalizing element with a nonzero entry in the upper left looks like (acuaσ(πc/a)ua), with a∈Kp× arbitrary, c∈aL, and u:=d/a∈L of norm 1. From Hilbert 90, any norm-1u is σ(α)/α for some α∈L; letting x:=a/α and β:=cα/a puts our matrix in the desired form x(αβπσ(β)σ(α))∈Kp×Q.
Or see [Nek12, B.1.6] for a more conceptual argument.
, so that passing from H to Π does not affect the projective image of ρ. Therefore ρ on all of Π remains reducible and not regular.
∎
In other words, the regularity assumption in Theorem 12.4 eliminates the division algebra case.
One might hope for a converse, so that our methods could recover all of Theorem 12.2. But alas this is not so: there are certainly cases where D(Kp,0) is a matrix algebra but regularity is not satisfied, so that our methods do not apply. In addition to p=2, we do not conclude fullness if F=F3, even if f has no conjugate self-twists and hence D is globally split, as is the case for a non-CM elliptic curve over Q. If the image of ρ is too small to accommodate the regularity assumption, our methods cannot handle it.
12.2. Hilbert modular eigenforms
Everything in Section 12.1 has been generalized to Hilbert modular forms. In particular, our results recover, in much the same manner and to much the same extent, the big-image results of Nekovář generalizing Ribet and Momose’s work over Q. We summarize the situation very briefly.
Let F be a totally real field and f a non-CM cuspidal Hilbert modular eigenform over F all of whose weights are at least 2. Fix an algebraic closure F of F, and let GF:=Gal(F/F). Fix a prime p and an embedding ιp:Q↪Qp. Let ρf,ιp:GF→GL2(Qp) be the Galois representation attached to f in the usual way, which we may view as having coefficients in the ring of integers O of some finite extension of Qp. Let O0⊆O be the ring of integers of the fixed field of all the conjugate self-twists of ρf,ιp. Like Ribet and Momose, Nekovář constructs a division algebra D over the fixed field K0 of Σf and proves an adelic open-image result, which implies O0-fullness when D splits.
For all but finitely many ιp, the representation ρf,ιp is O0-full.
Our results depend on hypotheses on the reduction ρf,ιp of ρf,ιp modulo the maximal ideal of O.
Theorem 12.7** (Our results recovering Nekovář at ιp).**
Suppose that p is odd and ρf,ιp is regular; if ρf,ιp is octahedral suppose further that it is good. Then ρf,ιp is O0-full.
The proof is analogous to that of Theorem 12.4. In particular, the fact that the weight of f is at least 2 at each infinite place means that ρf,ιp has distinct Hodge-Tate weights, which implies that no twist of it has finite image (see Proposition 2.4 for context). Or apply [CEG, Lemma 3.2.12].
12.3. Bianchi modular forms and generalizations
Unlike Hilbert modular forms, which are automorphic forms on GL2 over totally real fields, Galois representations associated to automorphic forms of GL2 over CM fields have only been constructed relatively recently, and even then only under some technical assumptions. We briefly summarize how our results can be applied to that context.
Let E be a CM field with maximal totally real subfield F. Fix an algebraic closure E, and let GE:=Gal(E/E). Let π be a cuspidal automorphic representation of GL2(AE), where AE denotes the adeles of E; when E is imaginary quadratic, π is called a Bianchi modular form. Assume that π is of cohomological type with central character ω. Following Mok [Mok14], assume moreover that ω arises from an algebraic idele class character ω~ on AF× via the norm map and that ω~=⨂vω~v such that ω~v(−1) takes the same value for all archimedean places of F. (When F=Q, this is simply the condition that ω is invariant under complex conjugation.) Suppose there is no nontrivial quadratic character δ of E such that π≅π⊗δ; this is analogous to the non-CM assumption present in Section 12.1 and Section 12.2.
For each rational prime p and fixed embedding ιp:Q↪Qp, associated to π there is a continuous irreducible representation ρπ,ιp:GE→GL2(Qp), which we may view as having coefficients in the ring of integers O of some finite extension of Qp. In this generality, the existence of ρπ,ιp is due to Mok [Mok14], who generalized the construction of Taylor in the imaginary quadratic case [Tay94]. Mok also shows, building on work of Berger and Harcos in the imaginary quadratic case [BH07], that ρπ,ιp is unramified outside a finite set of places and hence factors through a p-finite group Π.
Let O0 be the ring of integers of the fixed field of all the conjugate self-twists of ρπ,ιp.
Theorem 12.8**.**
Suppose p is odd and ρπ,ιp is regular and good if octahedral. Then ρπ,ιp is O0-full.
The proof is analogous to that of Theorem 12.4 and Theorem 12.7. The Hodge-Tate weights of ρπ,ιp are distinct by [Mok14, Theorem 5.17].
To our knowledge, Theorem 12.8 is the most general fullness result in the literature in this context, though Taylor proves the weaker theorem that the image of ρπ,ιp is Zariski dense in the imaginary quadratic case [Tay94, Corollary 2]. Our Theorem 12.8 may be well known to experts.
Remark 12.9*.*
In contrast to the case of elliptic or Hilbert modular forms that can be p-adically interpolated in families with dense classical points and thus have associated “big” Galois pseudorepresentations (see Sections 12.4 to 12.6), a p-adic family of Bianchi modular forms often has only finitely many classical points [CM09, Theorem 8.9], [Ser19, Theorem 1.1]. Therefore no Galois pseudorepresentations have been attached to Bianchi families by conventional methods.
∎
12.4. Hida p-adic families of modular forms
In this section, we explore the extent to which our methods recover known big-image results for Galois representations attached to ordinary p-adic families of modular forms, often called Hida families.
Fix p>2, and let A be the ring corresponding to a primitive non-CM irreducible component of Hida’s cuspidal shallow Hecke algebra parametrizing p-ordinary cuspforms of some fixed tame level and weight k such that k−1≡imodp−1 for some fixed 0≤i≤p−1 (for i=0 this is the ring I′ on [Lan16, p. 158]).
Then A is a finite extension of the Iwasawa algebra Λ:=Zp[[1+pZp]]≅Zp[[T]], which parametrizes the corresponding component of weight space. Let F be the residue field of A and K the fraction field of A.
Let (t,d):GQ→A be the pseudorepresentation obtained from gluing together those attached to the classical cuspforms in the family, all of which have the same semisimplifed residual representation ρ:GQ→GL2(F). Let εp:GQ→Zp× be the p-adic cyclotomic character and ⟨⋅⟩:Zp×→1+pZp the projection onto the pro-p part of Zp×. The weight character κ:GQ→Λ× is given by κ(g)=(1+T)⟨εp(g)⟩. Let χ be the tame Dirichlet character associated to the family. The determinant of ρ is given by
[TABLE]
Both ρ and (t,d) factor through Π, the Galois group of the maximal extension of Q unramified outside p and the level, a p-finite profinite group.
Let Πp⊂Π be a decomposition group at p and Ip⊂Πp its inertia subgroup. The ordinary condition guarantees that there exists a (t,d)-representation ρ, which we view as GL2(K)-valued by Lemma 2.10, with ρ∣Πp=(ϵ0∗δ), where δ an unramified character and ϵ coincides with κ on wild inertia and therefore surjects onto (1+T)1+pZp [Hid12, Theorem 4.3.2].
The image of ρ has been studied by Boston [MW86, Appendix], Fischman [Fis02], Hida [Hid15], and Lang [Lan16]. The latter two are the more recent and most general results, so we focus there.
Theorem 12.10** (Λ-fullness for Hida families, Hida [Hid15]).**
If ρ restricted to Πp is multiplicity free and ρ is realizable by a representation over A then ρ is Λ-full.
Hida’s Λ-fullness strongly suggested that every conjugate self-twist of (t,d) should fix Λ (see Theorem 5.4 here for a proof of this fact). Following Hida, Lang analyzes how the image of ρ is constrained by Σt(A/Λ), the conjugate self-twists that fix Λ pointwise.
To state her result, we let HΛ⊆Π be the intersection of all the kerη for (σ,η) in Σt(A/Λ).
Theorem 12.11** (Big image for Hida families, [Lan16, Theorem 2.4]).**
Suppose that F=F3, that ρ is absolutely irreducible, and that
there is an element in ρ(HΛ∩Πp) whose eigenvalue ratio is in E×∖{1}.(ix)(ix)(ix)There is a small error in [Lan16], which we correct here. Theorem 2.4 as stated loc. cit. requires merely that ρ restricted to HΛ∩Πp be multiplicity free, but in fact the result relies on the stronger regularity condition given here. Indeed, on [Lan16, p. 174] the definition of L[λ] only makes sense if one knows L is closed under multiplication by λ, where λ is an adjoint eigenvalue of the regular element.
Then ρ is AΣρ(A/Λ)-full.
A posteriori Lang’s fullness result by itself justifies considering only those conjugate self-twists that fix Λ, even without Hida’s Λ-fullness: see Theorem 5.4. Our work both recovers virtually all of Lang’s result (exception: [Lan16] is able to handle some cases where Pρ is the Klein-4 group) and extends it to include residually reducible ρ.
Let A0 be the adjoint trace ring of ρ; see Definition 4.6. For the notion of A0-constant determinant, see Definition 4.14; for good octahedral ρ see Definition 8.10.
If ρ is regular and good if octahedral, then ρ is A0-full.
2. (2)
If Πp contains a regular element for ρ, then A0 contains Λ.
Consequently, if Πp contains a regular element for ρ and ρ is further good if octahedral, then
(3)
ρ* is Λ-full;*
2. (4)
ρ* is AΣρ(A)-full;*
3. (5)
every conjugate self-twist of ρ fixes Λ, so that Σρ(A)=Σρ(A/Λ) and ρ is AΣρ(A/Λ)-full.
Proof.
For (1), the representation ρ is not reducible since the Hida family is cuspidal, and it is not dihedral since the Hida family is not CM. The fact that ρ(Π)≅ρ(Π) follows from the fact that a Hida family has classical specializations of weight at least 2. Therefore we know that ρ is A0-full by Theorem 10.3.
For (2), let d1:Π→A× be the pro-p part of d=detρ, and let ρ′=d1−1/2⊗ρ with (t′,d′)=(trρ′,detρ′) the constant-determinant (pseudo)representation of ρ. Note that ρ′∣Πp is still upper triangular since ρ∣Πp is. Let g0∈Πp be a regular element with residual eigenvalues λ0,μ0, and let r be a (t′,d′)-representation adapted to (g0,λ0,μ0). By the proof of [Bel19, Theorem 6.2.1], we see that, up to replacing g0 with the limit of a sequence of its powers, we may assume that r(g0)=(s(λ0)00s(μ0)).
Viewing both ρ′ and r as GL2(K)-valued by Lemma 2.10, we see that they are isomorphic since they have the same trace and are irreducible. In particular, ρ′(Πp) contains an element with eigenvalues s(λ0),s(μ0), which (up to swapping λ0 and μ0) is necessarily of the form M:=(s(λ0)0∗s(μ0)).
On the other hand, using the description of ϵ and δ above, we see that
ρ′(Πp) contains J:=((1+T)1/20∗(1+T)−1/2).
We compute adjoint-trace elements. Both a:=detM(trM)2=s(λ0)s(μ0)(s(λ0)+s(μ0))2=2+s(μ0)s(λ0)+s(λ0)s(μ0)
and
[TABLE]
are in A0 by construction.
The last expression shows that b is in W(E)[[T]], since λ0μ0−1∈E by the regularity assumption. Moreover, the T-coefficient of b is 2+2s(μ0)s(λ0)−a=s(μ0)s(λ0)−s(λ0)s(μ0), which is in W(E)× since λ0μ0−1=±1. It follows that the closed W(E)-algebra generated by b in A0 is all of W(E)[[T]]. In other words Λ⊆W(E)[[T]]⊆A0, as claimed.
For (3), combine (1) and (2).
For (4), from (2) and the expression for d in (9) (t,d) has A0-constant determinant. Now use Corollary 4.21.
For (5), combine (3), Theorem 5.4, and (4).
∎
Remark 12.13*.*
The regularity-on-Πp hypothesis in Theorem 12.12(2) can easily be check in terms of the data of the tame Nebentypus character χ and the mod-p eigenvalue ap of Up. Indeed, since δ sends Frobenius to Up, to verify regularity we need to check whether ϵδ−1 takes on a value in E×∖{±1}. Writing χ∣Πp=χunrχtame with χunr unramified and χtame a character on μp−1 by local class field theory, we see that the tame part of ϵδ−1 is χtameεpi and the unramified part is χunrδ−2. Note that the tame part necessarily takes values in Fp×, so if χtameεpi has order greater than 2, then the regularity-on-Πp hypothesis is automatically satisfied. Otherwise, one must look to the unramified part and check whether some power of χunr(p)ap−2 lies in E×∖{±1}.
∎
Remark 12.14*.*
Do all conjugate self-twists of p-adic families fix weight space? In an abstract algebraic setting, given a representation of a profinite group Π over a ring A that is finite over Λ=Zp[[T]], we cannot expect to prove that ρ is Λ-full — equivalently, that every conjugate self-twist of ρ fixes Λ — because it is simply not true.
On the other hand, one intuitively expects Λ, which parametrizes weight space, to be preserved by any conjugate self-twist of a p-adic family. Indeed, if (σ,χ) is a conjugate self-twist that doesn’t fix Λ, then modulo any prime ideal of Λ, the character χ will relate pairs of eigenforms of different p-adic weights — an implausible scenario. Therefore, although our A0-fullness result in Theorem 12.12(1) is purely algebraic, in order to recover the full strength of [Lan16], and to match our intuition of how conjugate self-twists in p-adic families behave, we necessarily need to use some modular-form-theoretic input. For Hida families the ordinary condition suffices: see the proof of Theorem 12.12 above, especially part (2).
We have not extended this result to Coleman families, which is a drawback on our big-image result in Theorem 12.15. Ultimately one hopes to formalize the geometric intuition alluded to above.
∎
12.5. Coleman p-adic families of classical modular forms
We now relax the ordinary assumption present in Section 12.4 and derive the consequences of our main theorem in the context of pseudorepresentations arising from the Coleman-Mazur eigencurve, comparing with known results in the literature.
Let X be a cuspidal irreducible component of the p-adic Coleman-Mazur eigencurve of some fixed tame level ([Buz07], [CM98]); having dealt with the ordinary case in Section 12.4, we assume that X is nonordinary. Let A be the ring of analytic functions on X bounded by 1. It is a compact Zp-algebra (hence pro-p) since X is nested [BC09, Lemma 7.2.11(ii), Corollary 7.2.12]. In fact, the map from X to weight space endows A with a Λ-algebra structure. As usual, A is a local domain since X is irreducible.
As in the Hida family setting, one obtains a 2-dimensional pseudorepresentation (t,d):GQ→A by gluing together those attached to classical cuspforms in the family, all of which have the same semisimplified residual representation ρ:GQ→GL2(F). Then (t,d) is unramified outside of a finite set of primes, namely p and the tame level, and thus factors through a p-finite quotient Π of GQ. Unlike the theorems presented in Section 12.1 through Section 12.4, there are no previous fullness results known for (t,d) (though see 12.16 below), so we proceed directly to a statement of our result in this setting.
Theorem 12.15**.**
If ρ is regular and good if octahedral, then (t,d) is A0-full.
Proof.
By Theorem 10.1 it suffices to show that (t,d) is not a priori small. It is not reducible since it is cuspidal. Any (t,d)-representation does not have finite image since X admits classical specializations of weight at least 2. Finally, since X is nonordinary its CM points are isolated and hence X necessarily admits a classical non-CM positive slope specialization: see [CIT16, Corollary 3.6]. Thus (t,d) is not dihedral.
∎
Remark 12.16*.*
Besides Bellaïche’s work [Bel19], the only previous work on images of Galois representations of finite slope p-adic families of modular forms was done in [CIT16]. We briefly compare Theorem 12.15 to their main result [CIT16, Theorem 1.3].
•
Setup: Rather than working with A as above, they restrict to an irreducible component I∘ of what they call the “adapted slope ≤h Hecke algebra” — essentially a bounded-slope piece of X as above: see [CIT16, §3.1].
Note that one can replace A above by I∘ and retain the veracity of Theorem 12.15.
•
Assumptions on ρ: In [CIT16, Theorem 1.3] the authors assume that ρ is absolutely irreducible, even when restricted to the intersection of the kernels of twist characters. Moreover, their regularity assumption is stronger than ours in that it requires the mod-p eigenvalues of the regular element to be in Fp× rather than requiring their ratio to lie in E×.
•
Conclusion: As mentioned above, [CIT16, Theorem 1.3] is not a true fullness result. Rather, it shows “rigid-Lie fullness” — a certain rigid analytic Lie algebra attached to the image of a (t,d)-representation contains the rigid analytic Lie algebra of a congruence subgroup. While highly suggestive, one does not know how to recover an actual congruence subgroup in the image from this result. Following [Lan16], Conti and his coauthors show rigid-Lie fullness with respect to the ring fixed by Σt(I∘/Λ).∎
Remark 12.17*.*
Although the determinant d has a form similar to (9) — the universal character κ times a finite-order character — in this setting we have not proved that d is A0-constant: we do not know whether the image of Λ is contained in A0. See 12.14 for why one expects this to be true nonetheless.
∎
12.6. p-Adic families of Hilbert modular forms
Since our methods are agnostic about the group Π, one can proceed with a similar analysis in the context of p-adic families of Hilbert modular forms, which we briefly outline here. We believe these are the first big image results in this context.
As in Section 12.2, fix a totally real field F. Let X be a cuspidal irreducible component of a p-adic eigenvariety interpolating classical Hilbert modular forms over F of a fixed tame level; there are several possible constructions, for instance [Urb11] or [AIP16]. Let A be the ring of analytic functions on X bounded by 1, which is again a pro-p local domain. Gluing together the pseudorepresentations attached to the classical cusp forms parametrized by X, all of which have the same semisimplified residual representation ρ:GF→GL2(F), yields a 2-dimensional pseudorepresentation (t,d):GF→A. It is unramified outside the tame level and p and hence factors through a p-finite quotient Π of GF.
Theorem 12.18**.**
Suppose that ρ is regular and good if octahedral. If X admits a non-CM classical specialization, then (t,d) is A0-full.
Proof.
By Theorem 10.1 it suffices to check that (t,d) is not a priori small, which follows from the fact that X admits classical specializations that are cuspidal, not CM, and whose weights are at least 2.
∎
Remark 12.19*.*
As in Section 12.5, we do not know whether (t,d) has A0-constant determinant in this case and hence lack an AtΣt-fullness result. As in 12.14, we expect the image in A of the ring ΛF of analytic functions on weight space to be contained in A0. Note that in this case, ΛF is a power series ring over Zp; the number of variables depends on the totally real field F.
∎
Appendix A Algebraic sundries
A.1. Representations with isomorphic adjoint differ by a character
Throughout Section A.1, let G be a group and F a separably closed field of odd characteristic. All representations are assumed to be finite-dimensional.
Let sln(F) denote the F-vector space of n×n-matrices of trace 0 and ad0:GLn(F)→GLn2−1(F) the representation obtained by letting GLn(F) act on sln(F) by conjugation. The primary goal of this section is to prove that if ρ1,ρ2:G→GL2(F) are semisimple representations such that ad0ρ1≅ad0ρ2, then ρ1≅ρ2⊗η for some character η:G→F×.
This is done in Theorem A.10. The easier case when the ρi are not dihedral is treated first in Section A.1.1. Section A.1.2 is an analysis of dihedral representations that allows us to conclude Theorem A.10 in full generality.
The results of this section are probably well known to experts, but we give proofs for lack of a reference in the generality we need. We were guided by Venkatarama’s answer to MathOverflow question 297746. In the nondihedral case, this result can be found in [KMP00, Lemma 2.9]. When the representations ρ1 and ρ2 arise from classical modular forms, the result can be found in [DK00, Appendix].
A.1.1. The nondihedral case
Given a representation ρ:G→GLn(F), we write ρ∗ for its dual representation. That is, if V is the representation space of ρ, then V∗:=Hom(V,F) is the representation space of ρ∗ with G-action given by (gφ)(v):=gφ(g−1v). In terms of matrices, if we fix a basis for V and take the dual basis for V∗, then ρ∗(g) is the inverse transpose of ρ(g).
If ρ is 2-dimensional, then an explicit calculation shows that ρ∗≅ρ⊗Λ2ρ∗, where Λ2 denotes the second exterior power of ρ. (The conjugating matrix can be taken to be (01−10).) We have that
[TABLE]
In particular, ad0ρ is self dual. Furthermore,
[TABLE]
and so ad0ρ≅Sym2ρ⊗Λ2ρ∗=Sym2ρ⊗detρ−1.
The following lemma is essentially a version of Schur’s lemma that will be useful in what follows.
Lemma A.1**.**
If ρ:G→GLn(F) is a semisimple representation such that ad0ρ contains a copy of the trivial representation, then ρ is reducible.
Proof.
Let V be the F-vector space on which G acts via ρ. Then EndV is the representation space for 1⊕ad0ρ, where 1 is the trivial representation, which corresponds to scalar endomorphisms of V. If ad0ρ contains a copy of the trivial representation, then there is a nonscalar φ∈EndV that commutes with the action of G. By Schur’s lemma, ρ must be reducible.
∎
Lemma A.2**.**
Let ρ1,ρ2:G→GL2(F) be semisimple reducible representations.
If ad0ρ1≅ad0ρ2, then there exists a character η:G→F× such that ρ1≅η⊗ρ2.
Proof.
By assumption there exist for i=1,2 characters λi,μi:G→F× such that ρi≅λi⊕μi.
It is straightforward to calculate
[TABLE]
Thus, up to switching λ2 and μ2, we must have λ1μ1−1=λ2μ2−1.
Let η=μ1μ2−1. Then
[TABLE]
Lemma A.3**.**
Let ρ1,ρ2:G→GL2(F) be semisimple representations such that both ad0ρi are irreducible.
If ad0ρ1≅ad0ρ2, then there exists a character η:G→F× such that ρ1≅η⊗ρ2.
Proof.
We begin by showing that ρ1⊗ρ2 must be reducible (which does not make use of the assumption that ad0ρi is irreducible). Indeed, by Lemma A.1 if ρ1⊗ρ2 were irreducible then its endomorphism ring would contain a single copy of the trivial representation. But
[TABLE]
and ad0ρ1⊗(ad0ρ1)∗≅End(ad0ρ1) contains a copy of the trivial representation, a contradiction.
Next we show that ρ1⊗ρ2 cannot be the sum of two 2-dimensional representations. Indeed, suppose that ρ1⊗ρ2≅r1⊕r2, where r1,r2:G→GL2(F) are representations. Take the second exterior product on both sides. We have
[TABLE]
and
[TABLE]
Since ad0ρi≅Sym2ρi⊗Λ2ρi∗, we have Sym2ρ1⊗Λ2ρ2≅Sym2ρ2⊗Λ2ρ1. But if
[TABLE]
then this contradicts irreducibility of ad0ρi. Thus ρ1⊗ρ2 must contain a 1-dimensional representation; call it χ. Then we claim that ρ2≅ρ1∗⊗χ≅ρ1⊗detρ1−1⊗χ, and so ρ1 and ρ2 differ by a twist.
To see that ρ2≅ρ1∗⊗χ, recall that ρ1⊗ρ2≅Hom(ρ1∗,ρ2). Thus having a 1-dimensional G-stable subspace corresponds to a nonzero linear map φ:ρ1∗→ρ2 such that gφ=λ(g)φ for some λ(g)∈F× for all g∈G. Define f:ρ1∗→ρ2⊗χ−1 by v↦φ(v)⊗e, where e is a basis for the 1-dimensional vector space on which G acts by χ. Note that f=0 since φ=0. It is straightforward to check that f(gv)=gf(v) for all g∈G. Therefore Hom(ρ1∗,ρ2⊗χ−1)=0. Since ρ1∗ and ρ2⊗χ−1 are irreducible, it follows that they must be isomorphic.
∎
The following observation can be checked easily via a direct calculation on 2×2 matrices.
Lemma A.4**.**
For any g∈GL2(F) with (not necessarily distinct) eigenvalues λ,μ, the eigenvalues of ad0g are 1,λμ−1,λ−1μ. In particular, we have
[TABLE]
A.1.2. The dihedral case
In Section A.1.2 we assume for simplicity that the characteristic of F is not equal to 2. The goal of Section A.1.2 is to remove the assumption that both ρi are reducible or both ad0ρi are irreducible from Lemmas A.2 and A.3. We begin with a lemma that shows that, in light of Lemmas A.2 and A.3, we only need to consider the case when both ρ1 and ρ2 are dihedral representations.
Lemma A.5**.**
If ρ:G→GL2(F) is irreducible but ad0ρ is reducible, then ρ is dihedral.
Proof.
If ad0ρ is reducible, then so is Sym2ρ and Sym2ρ∗ since ad0ρ≅Sym2ρ⊗detρ−1. But Sym2ρ∗ can be identified with the action of G on the F-vector space of quadratic forms on F2. Thus, there is a quadratic form Q on which G acts by a scalar. Since F is separably closed and charF=2, all quadratic forms are equivalent. In particular, we may assume that Q(x,y)=xy. But one checks immediately that the only matrices that preserve Q up to scalars are diagonal and antidiagonal. Thus ρ must be dihedral.
∎
The rest of this section is devoted to an analysis of dihedral representations.
Lemma A.6**.**
Assume that ρ:G→GL2(F) is a semisimple representation. If ρ≅η⊗ρ for some nontrivial character η:G→F×, then the image of ρ∣kerη is abelian.
Proof.
This argument essentially comes from [Rib77, Proposition 4.4].
Note that detρ=η2detρ and so η2=1. Set H:=kerη. Thus [G:H]=2 since η is nontrivial. By assumption, there is a matrix M∈GL2(F) such that Mρ(g)M−1=η(g)ρ(g) for all g∈G. In particular, ρ(H) is contained in the commutant of M.
We claim that M is semisimple. It suffices to show that M has distinct eigenvalues. Up to
a change of basis for ρ, we may assume that M is upper triangular, say M=\bigl{(}\begin{smallmatrix}a&b\\
0&c\end{smallmatrix}\bigr{)}. The eigenvalues of M acting on M2(F) by conjugation are 1,1,ac−1,a−1c by Lemma A.4. Note that for any g∈G∖H, we have
[TABLE]
Thus −1=ac−1, which implies that a=c and thus M has distinct eigenvalues, as claimed. Therefore M is semisimple and so its commutant, and hence ρ(H), is abelian.
∎
If H is a subgroup of G of index 2, then we use c to denote a fixed element in G∖H. For a character χ:H→F× and g∈G, we write χg:H→F× for the character defined by χg(h):=χ(g−1hg). It is not difficult to check that χg depends only on the coset of g in G/H. Set χ−:=χ/χc. We will write ηH:G→G/H≅{±1} for the canonical projection map. With this notation, we recall an explicit description of IndHGχ. Namely, IndHGχ is isomorphic to the representation
[TABLE]
Using Frobenius reciprocity it is easy to see that IndHGχ is irreducible if and only if χ=χc.
Lemma A.7**.**
(1)
If ρ=IndHGχ for a character χ:H→F× and [G:H]=2, then ρ≅ρ⊗ηH.
3. (2)
Conversely, if ρ:G→GL2(F) is a dihedral representation, then there is a subgroup H of G of index 2 and a character χ:H→F× such that ρ≅IndHGχ and χ=χc.
4. (3)
Furthermore, H as in (2) is unique unless χ2=(χc)2.
5. (4)
If χ2=(χc)2 then there are exactly three index 2 subgroups Hi of G for i=1,2,3 for which there exist characters χi:Hi→F× such that ρ≅IndHiGχi.
Proof.
For the first point, note that χ is a constituent of (ρ⊗ηH)∣H=ρ∣H. By Frobenius reciprocity and dimension counting, it follows that IndHGχ≅ρ⊗ηH.
If ρ is dihedral, then there is a nontrivial character η:G→F× such that trρ=ηtrρ and detρ=η2detρ. In particular, η2=1 and so η is a quadratic character. Let H:=kerη. Then H is a subgroup of G of index 2 and ρ∣H is reducible by Lemma A.6. Let χ:H→F× be one of the constituents of ρ∣H. By Frobenius reciprocity, IndHGχ is a constituent of ρ and we deduce equality for dimension reasons. Thus we have ρ∣H=χ⊕χc. Since ρ is irreducible by the definition of being dihedral, it follows by Frobenius reciprocity that χ=χc. This finishes the proof of the second point.
For the third point, suppose that ρ=IndH′Gχ′ for some character χ′:H′→F× and [G:H′]=2. Let c′∈G∖H′. Then by restricting to H we have χ⊕χc=(ηH′)∣H⋅χ⊕(ηH′)∣H⋅χc. Thus we either have χ=(ηH′)∣H⋅χ or χ=(ηH′)∣H⋅χc. In the first case, we see that H=kerηH′=H′. In the second case we conclude that χ2=(χc)2 since ηH′ is quadratic.
Finally, suppose that χ2=(χc)2. Then H0:=ker(χ/χc) is a subgroup of index 2 in H. We claim that H0 is normal in G. Recall that χc is independent of the choice of c∈G∖H. If h∈H0 and g∈G∖H then
[TABLE]
Furthermore, the above calculation shows that the class of c generates a subgroup of G/H0 of order 2 distinct from H. Thus G/H0 is isomorphic to (Z/2Z)2. We claim that if H′ is any of the three subgroups of G of index 2 containing H0, then there is a character χ′:H′→F× such that ρ≅IndH′Gχ′. By Frobenius reciprocity, it suffices to show that ρ∣H′ is reducible. Since ρ∣H0=χ∣H0⊕χc∣H0, it follows from Frobenius reciprocity that ρ∣H′=IndH0H′χ∣H0. But χ∣H0=χc∣H0 and so it follows (again by Frobenius reciprocity) that ρ∣H′ is reducible.
∎
Combining the following lemma with Frobenius reciprocity, we see that the irreducibility of IndHGχ is related to the question of whether the character χ:H→F× extends to a character of G.
Lemma A.8**.**
Let H be a subgroup of G of index 2 and χ:H→F× a character. Then there exists an extension of χ to a character G→F× if and only if χ=χc. If χ extends to a character of G, then there are exactly two different extensions, and they differ by ηH.
Proof.
If such an L and extension of χ exist, then certainly χ=χc. On the other hand, since c2∈H, we know that χ(c2) is well defined. Since F is algebraically closed, we may choose a square root r of χ(c2) in F. Define a new character χ~:G→L× by
[TABLE]
To see that χ~ is a character, it suffices to verify that it is multiplicative. That is, one must check that χ~(h)χ~(ch′)=χ~(hch′) and χ~(ch)χ~(ch′)=χ~(chch′) for h,h′∈H. It is easy to see by direct computation that these are satisfied if χ=χc.
∎
Lemma A.9**.**
Let ρ=IndHGχ be a dihedral representation. Then ad0ρ≅ηH⊕IndHGχ−. If ad0ρ is the sum of three characters, then χ2=(χc)2 and ad0ρ≅ηH1⊕ηH2⊕ηH3, where the Hi are the index 2 subgroups of G given in Lemma A.7.
Proof.
The first claim is an explicit calculation.
Let e1:=(100−1),e2:=(0010),e3:=(0100). Assume that ρ is given by (10). Then with respect to the basis e1,e2,e3 we see that
[TABLE]
We observe that ηH appears in the upper left corner. Furthermore, (χ−)c=(χ−)−1. Therefore the lower right 2×2-matrix in ad0ρ is isomorphic to IndHGχ− by (10). Thus ad0ρ≅ηH⊕IndHGχ−.
If ad0ρ is the sum of three characters, then IndHGχ− is reducible and thus χ−=(χ−)c. That is, χ2=(χc)2. By Lemma A.7, it follows that there are exactly three subgroups Hi of G of index 2 for which ρ≅IndHiGχi. By the above calculation, each ηHi must be a constituent of ad0ρ. By counting dimensions, we find that ad0ρ≅ηH1⊕ηH2⊕ηH3.
∎
Theorem A.10**.**
Let F be a field whose characteristic is not 2. Let ρ1,ρ2:G→GL2(F) be semisimple representations. If ad0ρ1≅ad0ρ2 then there is a character η:G→L× such that ρ1≅η⊗ρ2.
Proof.
The case when either of ρ1 or ρ2 is not dihedral is settled by Lemmas A.2, A.3 and A.5. Therefore we may assume that both ρ1 and ρ2 are dihedral. By Lemma A.9 there are index-2 subgroups Hi of G and characters χi:Hi→F× such that ρi≅IndHiGχi. Note that the set of possible such Hi can be read off from ad0ρi since ηHi is a constituent of ad0ρi by Lemma A.9 and Hi=kerηHi. In particular, since ad0ρ1≅ad0ρ2, we may assume that H:=H1=H2. By Lemma A.9 we have IndHGχ1−≅IndHGχ2−.
By restricting to H it follows that χ1−⊕(χ1−)c≅χ2−⊕(χ2−)c, and so up to replacing χ2 with χ2c (which is okay since IndHGχ2≅IndHGχ2c), it follows that χ1−=χ2−. That is, χ1χ2−1=(χ1χ2−1)c. By Lemma A.8 there is a character η:G→L× such that η∣H=χ1χ2−1. We claim that ρ1≅η⊗ρ2. Indeed, this is true upon restriction to H since
[TABLE]
Therefore ρ1≅η⊗ρ2 by Frobenius reciprocity since ρ1 is irreducible and thus χ1=χ1c.
∎
A.2. Trace field extensions
In this brief subsection, let G be an abstract group, F an abstract field, and (t,d):G→F a (2-dimensional) pseudorepresentation.
Definition A.11**.**
The trace field of (t,d) is the subfield of F generated by the image t(G) over the prime subfield of F. A pseudorepresentation (t,d):G→F is realizable over an extension L of F if there exists a semisimple representation ρ:G→GL2(L) that carries(t,d) — that is, with t=trρ and d=detρ.
Lemma A.12**.**
Let (t,d):G→F is a pseudorepresentation with trace field F. If the characteristic of F is not 2, then (t,d) realizable over an at-most quadratic extension of F.
Proof.
To start with, (t,d) is always realizable by a semisimple representation V over F [Che14, Theorem 2.12]. If we suppose that V is irreducible, then the image of the associated F-algebra map ρ:F[G]→EndF(V) surjects onto the full matrix algebra EndF(V) after extending scalars to F, and is therefore a quaternion algebra D over F. There are now two possibilities.
Either D is split, in which case D×≅GL2(F) is a realization of (t,d) over F. Or D is an F-division algebra, in which case ρ∣G:G→D× carries (t,d), in the sense that ρ(g) has reduced trace t(g) and reduced norm d(g), and any quadratic extension L of F that splits D carries a realization of (t,d) as an irreducible representation G→GL2(L).
On the other hand, suppose (t,d) splits into a sum of two characters χ,χ′:G→F×. The image of χ is a subgroup of F× whose every element is contained in an at-most-quadratic extension of F. Suppose that α=χ(a) and β=χ(b) for a,b∈G generate different quadratic extensions of F. Then on one hand, αβ=χ(ab) must generate the third quadratic subextension of F(α,β). But on the other hand, we claim that χ′(ab)=χ′(a)χ′(b) is equal to αβ: indeed, let cα be the generator of \operatorname{Gal}\big{(}F(\alpha,\beta)/F(\beta)\big{)} viewed as an element of \operatorname{Gal}\big{(}F(\alpha,\beta)/F\big{)}, so that χ′(a)=cα(α); define cβ similarly. Then cαcβ generates \operatorname{Gal}\big{(}F(\alpha,\beta)/F(\alpha\beta)\big{)} and hence fixes αβ. Therefore χ(ab)+χ′(ab)=2αβ, which is not in F(x)(x)(x)The constraint on the characteristic is necessary: consider G=Z2 and F=F2(x,y), with (t,d) the pseudocharacter corresponding to the scalar 2-dimensional representation (1,0) to x and (0,1) to y. — a contradiction.
∎
The following proposition is true in any dimension, but we state it here for dimension 2.
Proposition A.13**.**
Let (t,d):G→F be a pseudorepresentation and H⊆G a finite-index normal subgroup. Then the trace field of (t,d) is a finite extension of the trace field of (t∣H,d∣H).
Proof.
Replace F by the trace field of (t,d), and let E⊆F be the trace field of (t∣H,d∣H).
Let ρ be a semisimple representation carrying (t,d) over an extension of F. Note that F is algebraic over E: every g∈G satisfies g[G:H]∈H, so that every eigenvalue of ρ(g), and hence t(g), is algebraic over the finite extension of E containing the eigenvalues of g[G:H]. Since F is contained in the field generated by all these eigenvalues, F is algebraic over E, and in particular ρ is realizable over an algebraic extension of E.
By Clifford’s theorem [Cra19, Theorem 7.1.1], ρ∣H is still semisimple, so it is carried by a representation V over a finite extension E′ of E. Since F/E is algebraic, VE=V⊗E′E carries all of ρ.
Let g1,…,gn be coset representatives for H in G; write each ρ(gi) as a matrix in a fixed E′-basis of V extended to VE. Let M be the subfield of E generated over E′ by the matrix coefficients of all the ρ(gi). Then M is a finite extension of E′, and hence of E, containing the values of trρ. Therefore F/E is finite.
∎
A.3. Rings with involution
Throughout Section A.3, let A be a commutative noetherian ring equipped with an involution ∗. Note that we will need to apply the results in this section to the universal constant-determinant pseudodeformation ring A, so we cannot assume that A is a domain. Let Aε={a∈A:a∗=εa} for ε∈{+,−}. We will assume throughout that ∗ is not the identity on A so that A−=0. It is easy to see that A+ is a subring of A and A− is an A+-module. The following results have been adapted from [Lan75] and [CL77], where they are presented in the context when A may be noncommutative.
Definition A.14**.**
We say that an A-ideal a is a ∗-ideal if a∗=a. We say that A is ∗-prime if whenever a and b are ∗-ideals such that ab=0 then either a=0 or b=0.
Lemma A.15**.**
If A is ∗-prime then A is reduced.
Proof.
Let 0=a∈A be nilpotent. Then there is a smallest integer n>1 such that an=0. Let a=aA and b=an−1A. Note that a=0 and b=0 by the minimality of n. If a and b are ∗-ideals then we have reached a contradiction since ab=anA=0. In particular, if a+a∗=0 then a∗=−a∈aA and so a,b are ∗-ideals.
If a+a∗=0, then a+a∗ is still nilpotent since A is commutative. By replacing a with a+a∗ in the above argument, we find that a and b are ∗-ideals and thus we reach a contradiction.
∎
Lemma A.16**.**
If A is a noetherian commutative ring with 2∈A×, then A+ is a noetherian ring.
Proof.
The following argument comes from [CL77, Lemma]. Let I1⊆I2⊆⋯ be an ascending chain of ideals in A+. Then I1A⊆I2A⊆⋯ is an ascending chain of ideals in A. Since A is noetherian, there is some n such that InA=ImA for all m≥n.
Fix m≥n and a∈Im⊆A+. Since a∈ImA=InA we may write
[TABLE]
with bi∈In and xi∈A. Applying the involution ∗ yields
[TABLE]
Thus
[TABLE]
Since xi+xi∗∈A+ and 2∈A× it follows that a=21∑ibi(xi+xi∗)∈In. In particular, Im=In.
∎
We would like to show that A is finitely generated as an A+-module, which is equivalent to A being a noetherian A+-module since A+ is a noetherian ring by Lemma A.16. The following lemma follows the proof of [Lan75, Lemma 6].
Lemma A.17**.**
If there is an element d∈A− that is not a zero divisor in A, then A is noetherian as an A+-module.
Proof.
Since d is not a zero divisor, it follows that A is isomorphic to dA as an A+-module. On the other hand, for any a∈A we can write
[TABLE]
Thus dA is a submodule of the finitely generated A+-module A++dA+. Since A+ is noetherian by Lemma A.16, it follows that dA, and hence A, is a finitely generated (and hence noetherian) A+-module.
∎
Proposition A.18**.**
If A is a commutative noetherian ring with 2∈A×, then A is a noetherian A+-module.
Proof.
This proof combines elements of the proofs of [CL77, Theorem] and [Lan75, Theorem 7].
Suppose not. Let a0 be the largest ∗-ideal of A such that A/a0 is not a noetherian A+-module, which exists since A is a noetherian ring and is not noetherian as an A+-module. Thus, by replacing A with A/a0, we may assume that A/a is a noetherian A+-module for any ∗-ideal a=0.
We claim that, under this assumption, A is reduced. It suffices to show that A is ∗-prime by Lemma A.15. Suppose that a and b are nonzero ∗-ideals of A such that ab=0. Note that we can view a as an A/b-module since ab=0. We know that a is noetherian as an A/b-module since a is noetherian as an A-module. Furthermore, A/b is a noetherian A+-module since b=0. Thus a is noetherian as an A+-module. We also know that A/a is a noetherian A+-module since a=0. Therefore A is a noetherian A+-module, a contradiction. Thus A is ∗-prime and hence reduced.
Since A is a noetherian ring, it has only finitely many minimal prime ideals; call them p1,…,pn. Since A is reduced, we have that
[TABLE]
Note that n=1 corresponds to the case when A is a domain, and in that case we have already seen that A is a noetherian A+-module by Lemma A.17. Thus we assume henceforth that n>1 and thus each pi=0.
If pi∗∩pi=0, then pi∩pi∗ is a ∗-ideal and so A/(pi∩pi∗) is a noetherian A+-module. If every pi satisfies pi∩pi∗=0 then we can view A as a subring of
[TABLE]
which is noetherian as an A+-module. In particular, A is a noetherian A+-module, a contradiction, which proves the proposition.
Suppose there is some k such that pk∩pk∗=0. It is easy to check that pk∗ is another minimal prime ideal of A. We claim that n=2 in this case. Indeed, if p is any minimal prime ideal of A, then we have pkpk∗⊆pk∩pk∗=0 and thus
pkpk∗=0∈p.
Thus p=pk or p=pk∗.
Let us write p=pk henceforth. We can embed A into A/p×A/p∗ by identifying a∈A with (a+p,a+p∗). Note that A+∩p=0 since if a∈A+∩p then a=a∗∈p∗∩p=0. Similarly, A+∩p∗=0. In particular, A+ injects into A/p and is therefore a domain.
Note that by Lemma A.17, we may assume that every element of A− is a zero divisor in A. However, both A/p and A/p∗ are domains, so the only zero divisors in A/p×A/p∗ are elements of the form (a+p,p∗) or (p,a+p∗). Recall that (A−)2⊆A+. In particular, if (a+p,p∗)∈A−, then (a2+p,p∗)∈A+. That is, there is some a+∈A+ such that a+−a2∈p and a+∈p∗. But we have already seen that A+∩p∗=0. Similarly, any (p,a+p∗)∈A− must be trivial. In other words, A−=0, a contradiction. Therefore A must be noetherian as an A+-module.
∎
Given any ideal a of A, we define aε:=a∩Aε. We call a a graded ideal if a=a+⊕a−.
Proposition A.19**.**
Let A be a commutative local noetherian ring such that A and A+ have the same residue field. Assume that 2∈A×. If A′ is the quotient of A by a nongraded prime ideal, then A′ has the same field of fractions as the image of A+ in A′.
Proof.
Write f:A→A′ for the quotient map. It suffices to show that every element of f(A−) can be written as a quotient of elements in f(A+). Since the prime ideal p=kerf is assumed to be nongraded, it follows that there is some a∈p such that, if we decompose a=a++a− with a+∈A+ and a−∈A−, then neither a+ nor a− is in p. It follows that f(a−)=−f(a+), and so f(a−)∈f(A+). Note that f(a−)=0 since a−∈p. For any x∈A− we have that xa−∈A+ since (A−)2⊆A+. Thus f(x)=f(xa−)/f(a−)∈Q(f(A+)), as desired.
∎
A.4. Automorphisms and gradings
We recall how ring automorphisms give rise to gradings.
Let A be a complete local ring and X a finite abelian subgroup of the group of ring automorphisms of A. We write μn(A):={a∈A×:an=1}.
Given a character φ:X→A×, we define
[TABLE]
The following lemma is standard, so we leave the proof to the reader.
Lemma A.20**.**
Let F be a finite field of characteristic p and A a pro-p local ring with residue field F. If p∤n, then μn(A)=s(μn(F)).
Assume the following:
(∗) for every positive integer n, if X contains an element of order n, then #μn(A)=n.
Then one has #X=#Hom(X,A×). (It is easily checked when X is cyclic, and then for general X one applies the structure theorem of finite abelian groups.)
Corollary A.21**.**
Assume (∗). If p∤#X, then for any 1=σ∈X we have
[TABLE]
Proof.
First suppose that X is cyclic of order n and σ is a generator for X. Then Hom(X,A×) is cyclic, generated by any φ0 such that φ0(σ) is a primitive nth root of unity. Let H:=⟨φ0k⟩ be a nontrivial subgroup of Hom(X,A×). Then by Lemma A.20 we have
[TABLE]
Now we allow X to be any finite abelian group such that p∤#X and σ any nontrivial element of X. Then we have an exact sequence
[TABLE]
Thus ∑φ∈Hom(X,A×)φ(σ) is an integral multiple of
∑φ∈Hφ(σ),
where H is the image of Hom(X,A×) in Hom(⟨σ⟩,A×). This sum is [math] by the first paragraph.
∎
Lemma A.22**.**
Let A and X be as above. Assume that #X∈A× and that condition (∗) holds. Then A admits a grading given by A=⨁φ∈Hom(X,A×)Aφ.
Furthermore, for any Z[1/#X][X]-submodule M⊆A, letting Mφ:=M∩Aφ, there is a decomposition
[TABLE]
Proof.
For φ∈Hom(X,A×), define
eφ:=#X1∑σ∈Xφ(σ)σ−1∈Z[1/#X][X].
A straightforward computation shows that {eφ:φ∈Hom(X,A×)} is an orthogonal system of idempotents in Z[1/#X][X]. (Note that Corollary A.21 is needed to show that ∑φeφ=1.) There is a natural ring homomorphism
Z[1/#X][X]→EndA;
pushing forward the eφ to EndA gives the result.
∎
Bibliography51
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[AB] Sara Aryas-de-Reina and Gebhard Böckle. Deformation rings and images of Galois representations. Ar Xiv: https://arxiv.org/abs/2107.03114 .
2[AIP 16] Fabrizio Andreatta, Adrian Iovita, and Vincent Pilloni. On overconvergent Hilbert modular cusp forms. Astérisque , (382):163–193, 2016.
3[AM 69] M. F. Atiyah and I. G. Macdonald. Introduction to commutative algebra . Addison-Wesley Publishing Co., Reading, Mass.-London-Don Mills, Ont., 1969.
4[Amo 21] Laia Amorós. Images of Galois representations in mod p 𝑝 p Hecke algebras. International Journal of Number Theory , 17(5):1265–1285, 2021.
5[BC 09] Joël Bellaïche and Gaëtan Chenevier. Families of Galois representations and Selmer groups. Astérisque , 324:xii+314, 2009.
6[Bel 19] Joël Bellaïche. Image of pseudo-representations and coefficients of modular forms modulo p 𝑝 p . Adv. Math. , 353:647–721, 2019.
7[BH 07] Tobias Berger and Gergely Harcos. l 𝑙 l -Adic representations associated to modular forms over imaginary quadratic fields. Int. Math. Res. Not. IMRN , (23):Art. ID rnm 113, 16, 2007.
8[Böc 13] Gebhard Böckle. Deforming Galois representations. In Elliptic curves, Hilbert modular forms and Galois deformations , Adv. Courses Math. CRM Barcelona, pages 21–115. Birkhäuser/Springer, Basel, 2013.