Moduli spaces and geometric invariant theory: old and new perspectives

Victoria Hoskins

arXiv:2302.14499·math.AG·March 1, 2023

Moduli spaces and geometric invariant theory: old and new perspectives

Victoria Hoskins

PDF

Open Access

TL;DR

This paper reviews classical and recent advances in constructing moduli spaces via geometric invariant theory, highlighting extensions to non-reductive groups and stacks that enable new moduli space constructions.

Contribution

It introduces two new developments extending geometric invariant theory to non-reductive groups and stacks, broadening the scope of moduli space construction.

Findings

01

Extended GIT to non-reductive groups

02

Applied GIT to stacks for new moduli spaces

03

Surveyed classical and recent progress in the field

Abstract

Many moduli spaces are constructed as quotients of group actions; this paper surveys the classical theory, as well as recent progress and applications. We review geometric invariant theory for reductive groups and how it is used to construct moduli spaces, and explain two new developments extending this theory to non-reductive groups and to stacks, which enable the construction of new moduli spaces.

Equations230

\begin{array}[]{ccl}\mathcal{F}\sim^{\prime}_{S}\mathcal{G}&\iff&\mathcal{F}\cong\mathcal{G}\\ \mathcal{F}\sim_{S}\mathcal{G}&\iff&\mathcal{F}\cong\mathcal{G}\otimes\pi_{S}^{*}\mathcal{L}\mathrm{\>\>for\>\>a\>\>line\>\>bundle\>\>}\mathcal{L}\rightarrow S,\end{array}

\begin{array}[]{ccl}\mathcal{F}\sim^{\prime}_{S}\mathcal{G}&\iff&\mathcal{F}\cong\mathcal{G}\\ \mathcal{F}\sim_{S}\mathcal{G}&\iff&\mathcal{F}\cong\mathcal{G}\otimes\pi_{S}^{*}\mathcal{L}\mathrm{\>\>for\>\>a\>\>line\>\>bundle\>\>}\mathcal{L}\rightarrow S,\end{array}

M (S) := {families over S} / \sim_{S} and M (f : T \to S) = f^{*} : M (S) \to M (T) .

M (S) := {families over S} / \sim_{S} and M (f : T \to S) = f^{*} : M (S) \to M (T) .

{f \in F (S)} ⟶ {(f_{i} \in F (S_{i}))_{i} : f_{i} ∣_{S_{i} \cap S_{j}} = f_{j} ∣_{S_{j} \cap S_{i}} for all i, j}

{f \in F (S)} ⟶ {(f_{i} \in F (S_{i}))_{i} : f_{i} ∣_{S_{i} \cap S_{j}} = f_{j} ∣_{S_{j} \cap S_{i}} for all i, j}

\mathcal{F}_{s}=\left\{\begin{array}[]{ll}\mathcal{O}_{\mathbb{P}^{1}}^{\oplus 2}&s\neq 0\\ \mathcal{O}_{\mathbb{P}^{1}}(1)\oplus\mathcal{O}_{\mathbb{P}^{1}}(-1)&s=0.\end{array}\right.

\mathcal{F}_{s}=\left\{\begin{array}[]{ll}\mathcal{O}_{\mathbb{P}^{1}}^{\oplus 2}&s\neq 0\\ \mathcal{O}_{\mathbb{P}^{1}}(1)\oplus\mathcal{O}_{\mathbb{P}^{1}}(-1)&s=0.\end{array}\right.

Ext^{1} (O_{P^{1}} (1), O_{P^{1}} (- 1)) ≅ H^{1} (P^{1}, O_{P^{1}} (- 2)) ≅ H^{0} (P^{1}, O_{P^{1}})^{*} ≅ k .

Ext^{1} (O_{P^{1}} (1), O_{P^{1}} (- 1)) ≅ H^{1} (P^{1}, O_{P^{1}} (- 2)) ≅ H^{0} (P^{1}, O_{P^{1}})^{*} ≅ k .

S_{n} := {s \in S : dim H^{0} (P^{1}, F_{s}) \geq n},

S_{n} := {s \in S : dim H^{0} (P^{1}, F_{s}) \geq n},

O_{C}^{\oplus χ} ≅ H^{0} (E) \otimes O_{C} ↠ ev E

O_{C}^{\oplus χ} ≅ H^{0} (E) \otimes O_{C} ↠ ev E

m^{*} (t) = t \otimes 1 + 1 \otimes t and i^{*} (t) = - t .

m^{*} (t) = t \otimes 1 + 1 \otimes t and i^{*} (t) = - t .

m^{*} (t) = t \otimes t and i^{*} (t) = t^{- 1} .

m^{*} (t) = t \otimes t and i^{*} (t) = t^{- 1} .

m^{*} (x_{ij}) = k = 1 \sum n x_{ik} \otimes x_{k j} and i^{*} (x_{ij}) = (x_{ij})_{ij}^{- 1}

m^{*} (x_{ij}) = k = 1 \sum n x_{ik} \otimes x_{k j} and i^{*} (x_{ij}) = (x_{ij})_{ij}^{- 1}

m^{*} (t^{n} - 1) = t^{n} \otimes t^{n} - 1 \otimes 1 = (t^{n} - 1) \otimes t^{n} + 1 \otimes (t^{n} - 1) \in I \otimes R + R \otimes I

m^{*} (t^{n} - 1) = t^{n} \otimes t^{n} - 1 \otimes 1 = (t^{n} - 1) \otimes t^{n} + 1 \otimes (t^{n} - 1) \in I \otimes R + R \otimes I

Spec k \times X \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces

Spec k \times X \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces

G \times X \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces

G \times X \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces \ignorespaces

(g \cdot f) (x) = f (g^{- 1} \cdot x)

(g \cdot f) (x) = f (g^{- 1} \cdot x)

dim G = dim G_{x} + dim G \cdot x

dim G = dim G_{x} + dim G \cdot x

{natural transformations η : M \to Hom (-, M)} ⟷ {G -invariant morphisms f : S \to M}

{natural transformations η : M \to Hom (-, M)} ⟷ {G -invariant morphisms f : S \to M}

F_{h_{i} (u)} \sim (h_{i}^{*} F)_{u} \sim G_{u} \sim (h_{j}^{*} F)_{u} \sim F_{h_{j} (u)}

F_{h_{i} (u)} \sim (h_{i}^{*} F)_{u} \sim G_{u} \sim (h_{j}^{*} F)_{u} \sim F_{h_{j} (u)}

O (X)^{G} := {f \in O (X) : g \cdot f = f for all g \in G} .

O (X)^{G} := {f \in O (X) : g \cdot f = f for all g \in G} .

O (X)^{H} ≅ (O (X) \otimes O (G / H))^{G} .

O (X)^{H} ≅ (O (X) \otimes O (G / H))^{G} .

c\mapsto\left(\begin{array}[]{cc}1&c\\ 0&1\end{array}\right).

c\mapsto\left(\begin{array}[]{cc}1&c\\ 0&1\end{array}\right).

α_{p} (R) := {c \in G_{a} (R) : c^{p} = 0} .

α_{p} (R) := {c \in G_{a} (R) : c^{p} = 0} .

V = χ \in X^{*} (T) ⨁ V_{χ} where V_{χ} = {v \in V : t \cdot v = χ (t) v for all t \in T} .

V = χ \in X^{*} (T) ⨁ V_{χ} where V_{χ} = {v \in V : t \cdot v = χ (t) v for all t \in T} .

linearly reductive ⟹ geometrically reductive ⟺ reductive

linearly reductive ⟹ geometrically reductive ⟺ reductive

f (W_{1}) = 0 and f (W_{2}) = 1.

f (W_{1}) = 0 and f (W_{2}) = 1.

W_{n} = W_{n}^{G} \oplus W_{n}^{'}

W_{n} = W_{n}^{G} \oplus W_{n}^{'}

l_{a} : W_{n} \to W_{m} .

l_{a} : W_{n} \to W_{m} .

ab = l_{a} (b) = l_{a} (b^{G}) + l_{a} (b^{'}) = a b^{G} + a b^{'} \in W_{m}^{G} \oplus W_{m}^{'} .

ab = l_{a} (b) = l_{a} (b^{G}) + l_{a} (b^{'}) = a b^{G} + a b^{'} \in W_{m}^{G} \oplus W_{m}^{'} .

A^{n} \times^{G_{a}} SL_{2} ≅ A^{n} \times SL_{2} / G_{a} .

A^{n} \times^{G_{a}} SL_{2} ≅ A^{n} \times SL_{2} / G_{a} .

O (A^{n})^{G_{a}} ≅ O (A^{n} \times^{G_{a}} SL_{2})^{SL_{2}} ≅ O (A^{n} \times SL_{2} / G_{a})^{SL_{2}} ≅ O (A^{n + 2})^{SL_{2}}

O (A^{n})^{G_{a}} ≅ O (A^{n} \times^{G_{a}} SL_{2})^{SL_{2}} ≅ O (A^{n} \times SL_{2} / G_{a})^{SL_{2}} ≅ O (A^{n + 2})^{SL_{2}}

k [tr, det] \subset O (M_{2 \times 2})^{GL_{2}} .

k [tr, det] \subset O (M_{2 \times 2})^{GL_{2}} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Algebra and Geometry · Algebraic Geometry and Number Theory · Geometric and Algebraic Topology

Full text

Moduli spaces and geometric invariant theory:

old and new perspectives

Victoria Hoskins

For Peter Newstead on his 80th Birthday

Abstract.

Many moduli spaces are constructed as quotients of group actions; this paper surveys the classical theory, as well as recent progress and applications. We review geometric invariant theory for reductive groups and how it is used to construct moduli spaces, and explain two new developments extending this theory to non-reductive groups and to stacks, which enable the construction of new moduli spaces.

Introduction

Mumford’s geometric invariant theory (GIT) [74] for reductive groups provides a method for constructing quotients of reductive group actions in algebraic geometry. For a reductive group acting on a projective scheme, GIT provides an open semistable set admitting a categorical quotient which is projective and constructed from the invariant ring; furthermore, the semistable set can be explicitly described using (torus) weights via the Hilbert–Mumford criterion, rather than in terms of non-vanishing invariants.

Whilst reductive GIT has been successfully employed to construct numerous moduli spaces, it has some limitations. First, it only provides moduli spaces of semistable objects. Second, it only applies to reductive group actions. Third, it only applies in the situation where the moduli problem is presented in terms of a group action. Two recent developments aim to overcome some of these issues: GIT for non-reductive groups and stacky generalisations of GIT.

One of the first challenges for non-reductive group actions is the possibility of non-finitely generated invariant rings; the best-known example is Nagata’s counterexample [76] to Hilbert’s 14th Problem. However, even when non-reductive invariant rings are finitely generated, the corresponding ‘GIT quotient’ is not well-behaved (for example, the quotient map might not even be surjective and its image may only be constructible, see $\S$ 5.1). Although it is possible to construct geometric quotients of open subsets [34, 83, 95], these open subsets are typically hard to describe explicitly. However, recent work on GIT [15, 14] for non-reductive groups with graded unipotent radical (e.g. $\mathbb{G}_{a}\rtimes\mathbb{G}_{m}$ or parabolic subgroups) has enabled the construction of projective quotients of certain stable sets, which admit explicit Hilbert–Mumford type descriptions; the price to pay for obtaining these explicit projective non-reductive GIT quotients is that one must impose certain stabiliser assumptions. One of the goals of this survey is to explain the origins, assumptions and results of non-reductive GIT in as simple a context as possible to make them accessible to a broad audience, as well as to highlight some exciting applications.

We also outline another significant development that extends ideas of (reductive) GIT to stacks, as pioneered by Alper, Halpern–Leistner and Heinloth [2, 48, 7, 52]. Alper’s notion of good and adequate moduli spaces of stacks enables GIT-free constructions of moduli spaces. This is even more tangible following the recent existence criteria of Alper–Halpern-Leistner–Heinloth [7], which equates the existence of moduli spaces to two simple valuative criteria and has been applied to various moduli problems [4, 5, 12, 13].

However, adequate and good moduli spaces are locally modelled on reductive GIT and require closed points to have reductive stabiliser groups. Thus, in some senses these two recent developments are orthogonal to each other and ideally there should eventually be an extension of non-reductive GIT to stacks.

Acknowledgements

I am indebted to both Peter Newstead and Frances Kirwan, as I learned the basics of reductive GIT from Peter Newstead’s Tata lecture notes [79] and discussions with Frances Kirwan. I am very grateful to Greg Bérczi, Dominic Bunnett, Eloise Hamilton, Josh Jackson and Frances Kirwan for numerous conversations on non-reductive GIT. I would also like to thank the organisers of VBAC 2022 for soliciting this paper in honour of Peter Newstead.

Conventions

Throughout we will assume $k$ is an algebraically closed field and all schemes are assumed to be finite type $k$ -schemes, unless otherwise stated. By a point, we will mean a $k$ -point (or equivalently, a closed point).

1. Moduli problems and group actions

We start with an example-driven introduction to moduli problems in $\S$ 1.1 and describe the relation to group actions in $\S$ 1.2. Finally in $\S$ 1.3, we give some basic definitions on algebraic groups, actions and quotients, which lays the foundations for GIT in $\S$ 2.

1.1. Moduli functors and spaces

Naively, a moduli problem is a collection $\mathcal{A}$ of objects with an equivalence relation $\sim$ on $\mathcal{A}$ and we would like to give the set of equivalence classes $\mathcal{A}/\sim$ the structure of a scheme that encodes how objects vary continuously in ‘families’.

Example 1.1.

(1)

Let $\mathcal{A}$ be the set of $r$ -dimensional linear subspaces of an $n$ -dimensional $k$ -vector space and $\sim$ be equality. 2. (2)

Let $\mathcal{A}$ be the set of finite-dimensional $k$ -vector spaces with an endomorphism and $\sim$ be vector space isomorphisms commuting with the endomorphism. 3. (3)

Let $\mathcal{A}$ be the set of $n\times n$ matrices over $k$ and $\sim$ be the equivalence relation given by similarity of matrices. 4. (4)

Let $\mathcal{A}$ to be the set of hypersurfaces of degree $d$ in $\mathbb{P}^{n}$ and $\sim$ be the relation given by projective change of coordinates. 5. (5)

Let $\mathcal{A}$ be the collection of smooth projective curves of fixed genus and $\sim$ be the relation given by isomorphism. 6. (6)

Let $\mathcal{A}$ be the collection of vector bundles on a fixed scheme $X$ and $\sim$ be the relation given by isomorphisms of vector bundles.

Often there is a natural notion of families of objects over a scheme $S$ and an extension of $\sim$ to families over $S$ , such that we can pullback families by morphisms $T\rightarrow S$ compatibly with the notion of equivalence.

Example 1.2.

(1)

A family over $S$ of $r$ -dimensional linear subspaces of an $n$ -dimensional vector space is a rank $r$ vector subbundle $\mathcal{V}\subset\mathcal{O}_{S}^{\oplus n}$ . 2. (2)

A family over $S$ of vector spaces with endomorphisms is a vector bundle $\mathcal{V}$ over $S$ with an endomorphism $\Phi:\mathcal{V}\rightarrow\mathcal{V}$ .

The next example shows there might be several ways to extend $(\mathcal{A},\sim)$ to families over $S$ .

Example 1.3.

For vector bundles on a fixed scheme $X$ up to isomorphism, the natural notion for a family over $S$ is a vector bundle $\mathcal{F}$ over $X\times S$ over $S$ , but there are at least two natural equivalence relations:

[TABLE]

where $\pi_{S}:X\times S\rightarrow S$ . Since $\mathcal{L}\rightarrow S$ is locally trivial, there is a cover $S_{i}$ of $S$ such that $\mathcal{F}|_{X\times S_{i}}\cong\mathcal{G}|_{X\times S_{i}}$ . Hence $\sim_{S}$ can be thought of a Zariski local version of $\sim^{\prime}_{S}$ .

We can now give a more precise definition of a moduli problem using families.

Definition 1.4 (Moduli problem and moduli functor).

A moduli problem consists of

(1)

for each scheme $S$ , a collection $\mathcal{A}_{S}$ of families over $S$ with an equivalence relation $\sim_{S}$ , 2. (2)

for each morphism $T\rightarrow S$ of schemes, a pullback map $f^{*}:\mathcal{A}_{S}\rightarrow\mathcal{A}_{T}$

such that

(i)

for $f:T\rightarrow S$ and equivalent families $\mathcal{F}\sim_{S}\mathcal{G}$ over $S$ , we have $f^{*}\mathcal{F}\sim_{T}f^{*}\mathcal{G}$ ; 2. (ii)

for any family $\mathcal{F}$ over $S$ , we have $\text{Id}_{S}^{*}\mathcal{F}=\mathcal{F}$ ; 3. (iii)

for any morphisms $f:T\rightarrow S$ and $g:S\rightarrow R$ , and a family $\mathcal{F}$ over $R$ , we have an equivalence $(g\circ f)^{*}\mathcal{F}\sim_{T}f^{*}g^{*}\mathcal{F}$ .

This gives rise to a moduli functor $\mathcal{M}:\mathrm{Sch}^{\mathrm{op}}\rightarrow\mathrm{Set}$ where

[TABLE]

Notation: For a family $\mathcal{F}$ over $S$ and a point $s:\operatorname{Spec}k\rightarrow S$ , we write $\mathcal{F}_{s}:=s^{*}\mathcal{F}$ to denote the corresponding family over $\operatorname{Spec}k$ . We write $(\mathcal{A},\sim):=(\mathcal{A}_{\operatorname{Spec}k},\sim_{\operatorname{Spec}k})$ .

In particular, a moduli functor is a presheaf on the category $\mathrm{Sch}$ of schemes. Recall that the Yoneda Lemma gives an embedding of the category of schemes into the category of presheaves; more precisely there is a fully faithful functor $h:\mathrm{Sch}\rightarrow\mathrm{PSh}(\mathrm{Sch})$ which on objects sends a scheme $X$ to its functor of points $\operatorname{Hom}(-,X):\mathrm{Sch}^{\mathrm{op}}\rightarrow\mathrm{Set}$ . A presheaf is called representable if it is in the essential image of the Yoneda embedding.

A moduli functor being representable is the ideal situation and leads to the notion of a fine moduli space, but if that fails, one can instead ask for a universal natural transformation from $\mathcal{M}$ to the functor of points of a scheme, which leads to the notion of a coarse moduli space.

Definition 1.5 (Fine and coarse moduli spaces).

Let $\mathcal{M}:\mathrm{Sch}\rightarrow\mathrm{Set}$ be a moduli functor.

i)

A scheme $M$ is a fine moduli space for $\mathcal{M}$ if it represents $\mathcal{M}$ ; that is, there is a natural isomorphism $\mathcal{M}\rightarrow\operatorname{Hom}(-,M)$ . In this case, $\operatorname{Id}_{M}\in\operatorname{Hom}(M,M)$ corresponds to an element of $\mathcal{M}(M)$ called the universal family $\mathcal{U}$ , which is a family over $M$ up to the notion of equivalence. 2. ii)

A coarse moduli space for $\mathcal{M}$ is a scheme $M$ with a natural transformation of functors $\eta:\mathcal{M}\rightarrow h_{M}$ which is universal (for any natural transformation $\nu:\mathcal{M}\rightarrow\operatorname{Hom}(-,N)$ to the functor of points of a scheme, there exists a unique morphism $f:M\rightarrow N$ such that $\nu=f_{*}\circ\eta$ ) such that $\eta_{\operatorname{Spec}k}:\mathcal{M}(\operatorname{Spec}k)\rightarrow h_{M}(\operatorname{Spec}k)$ is bijective.

Example 1.6.

For 1-dimensional subspaces of $k^{n}$ , a fine moduli space is given by $\mathbb{P}^{n}$ , with the tautological line bundle $\mathcal{O}_{\mathbb{P}^{n}}(-1)\subset\mathcal{O}^{\oplus n}_{\mathbb{P}^{n}}$ giving a universal family (see [51, II Theorem 7.1]).

Remark 1.7.

(1)

If a fine or coarse moduli space exists, then it is unique up to unique isomorphism. 2. (2)

If a fine moduli space exists, then the universal family $\mathcal{U}$ over $M$ describes all other families in the following sense: for any scheme $S$ , we have that a family $\mathcal{F}\in\mathcal{M}(S)$ is equivalent to a morphism $f:S\rightarrow M$ with $f^{*}\mathcal{U}\sim_{S}\mathcal{F}$ . 3. (3)

Since $\operatorname{Hom}(-,M)$ is a sheaf in the Zariski toplogy (that is, for every scheme $S$ and Zariski cover $\{S_{i}\}$ of $S$ , the natural map

[TABLE]

is a bijection), for a moduli functor $\mathcal{M}$ to admit a fine moduli space it must be a sheaf in the Zariski topology.

Exercise 1.8.

Show that the equivalence relation $\sim^{\prime}_{S}$ on families over $S$ of vector bundles on a fixed scheme defines a moduli functor that is not representable. (Hint: Show it fails to be a Zariski sheaf by considering the other equivalence relation $\sim_{S}$ for families of vector bundles).

Unfortunately, there may be moduli problems which do not admit even a coarse moduli space.

Exercise 1.9.

Let $\mathcal{M}$ be a moduli functor with the jump phenomenon; that is, there is a family $\mathcal{F}$ over $\mathbb{A}^{1}$ such that $\mathcal{F}_{s}\sim\mathcal{F}_{1}$ for all $s\neq 0$ and $\mathcal{F}_{0}\nsim\mathcal{F}_{1}$ . Show that there is no coarse moduli space for $\mathcal{M}$ by showing for any natural transformation $\eta:\mathcal{M}\rightarrow\operatorname{Hom}(-,M)$ , the morphism $\eta_{\mathbb{A}^{1}}(\mathcal{F}):\mathbb{A}^{1}\rightarrow M$ is constant.

Example 1.10.

Moduli of rank 2 degree 0 vector bundles on $\mathbb{P}^{1}$ exhibit the jump phenomenon: there is a family $\mathcal{F}$ of rank 2 degree 0 vector bundles over $\mathbb{A}^{1}$ such that

[TABLE]

Indeed this family is constructed using the isomorphisms

[TABLE]

Another reason for a coarse moduli space to fail to exist is if the moduli problem is unbounded: there does not exist a family $\mathcal{F}$ over a scheme $S$ (of finite type over $k$ ) such that for any object $E$ (i.e family over $k$ ), we have $E\sim\mathcal{F}_{s}$ for some (possibly non-unique) $s\in S$ .

Exercise 1.11.

Show the moduli problem of rank 2 degree 0 vector bundles on $\mathbb{P}^{1}$ is unbounded: suppose there exists a family $\mathcal{F}$ over $S$ such that for any such vector bundle $E$ we have $\mathcal{F}_{s}\sim E$ for some $s\in S$ , then show that $S$ cannot be Noetherian by considering the subschemes

[TABLE]

which are closed by the semi-continuity theorem.

A further obstruction to the existence of a coarse moduli space is that there may be non-trivial families which are fibrewise trivial; this may happen when there are non-trivial automorphisms.

One possible solution for dealing with some of these issues is to instead work with a moduli stack; in this case, one can then look for a coarse or good moduli space for this stack as in $\S$ 4. However, often by imposing a notion of stability on objects, which can be moduli-theoretic or arising from the GIT construction, one obtains a much better-behaved moduli problem for which one can construct moduli spaces.

1.2. Construction of moduli spaces using group actions

Many moduli spaces are constructed as quotients of group actions via the following strategy (after fixing any discrete invariants and restricting to a bounded class of objects):

(1)

Find an overparametrisation: find a parameter scheme $X$ with a family $\mathcal{F}$ such that any other family can be locally obtained by pullback from $\mathcal{F}$ (possibly non-uniquely). 2. (2)

Find a group action describing the symmetries: find a group $G$ acting on $X$ such that the orbits correspond to the equivalence classes. 3. (3)

Take a quotient (in the category of schemes if possible).

The third step is typically performed using Geometric Invariant Theory (GIT).

Let us give some examples, before describing algebraic group actions in $\S$ 1.3 in more detail.

Example 1.12.

(1)

1-dimensional vector subspaces in $V=k^{n}$ can be parametrised by fixing a basis vector in $X=V\setminus\{0\}$ . Since two basis vectors are related by scalar multiplication, $\mathbb{G}_{m}$ acting on $X$ describes the symmetries and $\mathbb{P}^{n-1}=X/\mathbb{G}_{m}$ is the fine moduli space. 2. (2)

An $r$ -dimensional subspace in $V=k^{n}$ can be parametrised by choosing a basis, which gives an element in $\operatorname{Mat}_{r\times n}$ of rank $r$ and the choice of basis is controlled by the action of $\mathrm{GL}_{r}$ by left multiplication. In Exercise 3.14, we will see that the open locus of rank $r$ matrices is a GIT semistable locus that admits a quotient, namely a Grassmannian. 3. (3)

A projective hypersurface of degree $d$ in $\mathbb{P}^{n}$ is given by the vanishing locus $\{F=0\}$ of a degree $d$ homogeneous polynomial in $n+1$ variables. Since $\lambda F$ defines the same hypersurface, one can consider $X=\mathbb{P}(k[x_{0},\dots,x_{n}]_{d})$ and the action of $G=\mathrm{PGL}_{n}=\operatorname{Aut}(\mathbb{P}^{n})$ describes these hypersurfaces up to change of coordinates. 4. (4)

For vector bundles of rank $n$ and degree $d$ on a smooth projective curve $C$ , provided $d$ is sufficiently large, any semistable111There is a natural notion of semistability involving verifying an inequality of slopes for all subbundles, which turns out to be related to a corresponding GIT notion of semistability. vector bundle $E$ can be parametrised as a quotient of a fixed vector bundle (see [79, Lemma 5.2]): for $E$ semistable of sufficiently large degree, the evaluation map is surjective and $E$ has vanishing higher cohomology, so by choosing a basis of global sections we obtain a quotient

[TABLE]

where $\chi=d+n(1-g)$ is the Euler characteristic of $E$ . Consequently, a Quot scheme parametrising quotients of $\mathcal{O}_{C}^{\oplus\chi}$ with fixed invariants gives an overparametrisation and the action of $\mathrm{GL}_{\chi}$ describes the symmetries.

1.3. Algebraic groups, actions and quotients

Here we focus on the essential notions that we will need, and refer to [20, 22, 71] for more detailed expositions.

Definition 1.13.

An algebraic group over $k$ is a a group object in the category of $k$ -schemes (that is, a $k$ -scheme $G$ with identity element $e:\operatorname{Spec}k\rightarrow G$ , group operation $m:G\times G\rightarrow G$ and inversion $i:G\rightarrow G$ given by morphisms of schemes such that the usual group axioms are stated as commutativity of certain diagrams). We say $G$ is an affine algebraic group if the underlying scheme $G$ is affine.

Remark 1.14.

The $k$ -algebra $\mathcal{O}(G)$ of regular functions on $G$ is a Hopf algebra with comultiplication $m^{*}:\mathcal{O}(G)\rightarrow\mathcal{O}(G)\otimes\mathcal{O}(G)$ , coinversion $i^{*}:\mathcal{O}(G)\rightarrow\mathcal{O}(G)$ and counit $e^{*}:\mathcal{O}(G)\rightarrow k$ and dualised commutative diagrams. In fact, there is a one-one correspondence between finitely generated Hopf algebras over $k$ and affine algebraic groups over $k$ [71, II Theorem 5.1].

As the following example demonstrates, many familiar groups are affine algebraic groups.

Example 1.15.

(1)

The additive group $\mathbb{G}_{a}=\operatorname{Spec}k[t]$ over $k$ is the algebraic group whose underlying scheme is the affine line $\mathbb{A}^{1}$ over $k$ and whose group operation is given by addition:

[TABLE]

For a $k$ -algebra $R$ , we have $\mathbb{G}_{a}(R)=(R,+)$ . 2. (2)

The multiplicative group $\mathbb{G}_{m}=\operatorname{Spec}k[t,t^{-1}]$ over $k$ is the algebraic group whose underlying variety is the $\mathbb{A}^{1}-\{0\}$ and whose group operation is given by multiplication:

[TABLE]

For a $k$ -algebra $R$ , we have $\mathbb{G}_{m}(R)=(R^{\times},\cdot)$ . 3. (3)

The general linear group $\mathrm{GL}_{n}$ over $k$ is an open subvariety of $\mathbb{A}^{n^{2}}$ cut out by the non-vanishing of the determinant. It is an affine variety with coordinate ring $k[x_{ij}:1\leq i,j\leq n]_{\det(x_{ij})}$ . The co-group operations are defined by:

[TABLE]

where $(x_{ij})^{-1}_{ij}$ is the regular function on $\mathrm{GL}_{n}$ given by taking the $(i,j)$ -th entry of the inverse of a matrix. 4. (4)

For a finite group $G$ , the group algebra $k[G]$ is a Hopf algebra and determines an affine algebraic group $\underline{G}_{k}:=\operatorname{Spec}(k[G])$ , whose $k$ -points are identified with elements of $G$ . 5. (5)

For $n\geq 1$ , the group of $n$ th roots of unity is $\mu_{n}:=\operatorname{Spec}k[t,t^{-1}]/(t^{n}-1)\subset\mathbb{G}_{m}$ . Write $I$ for the ideal $(t^{n}-1)$ of $R:=k[t,t^{-1}]$ . Then

[TABLE]

which implies that $\mu_{n}$ is an algebraic subgroup of $\mathbb{G}_{m}$ . If $n$ is different from $\mathrm{char}(k)$ , the polynomial $X^{n}-1$ is separable and there are $n$ distinct roots in $k$ . Then the choice of a primitive $n$ th root of unity in $k$ determines an isomorphism $\mu_{n}\simeq\underline{\mathbb{Z}/n\mathbb{Z}}_{\>k}$ . However, if $n=\mathrm{char}(k)$ , then $X^{n}-1=(X-1)^{n}$ in $k[X]$ , which implies that the scheme $\mu_{n}$ is non-reduced (with $1$ as the only closed point).

A linear algebraic group is a closed subgroup of $\mathrm{GL}_{n}$ ; hence, any linear algebraic group is an affine algebraic group. The converse statement is also true: any affine algebraic group is a linear algebraic group (see Remark 1.19).

Definition 1.16.

An (algebraic) action of an affine algebraic group $G$ on a scheme $X$ is a morphism of schemes $\sigma:G\times X\rightarrow X$ such that the following diagrams commute

[TABLE]

A subscheme $Z\subset X$ is called $G$ -invariant if it is preserved by the action; that is, $\sigma(G\times Z)\subset Z$ .

A morphism $f:X\rightarrow Y$ between schemes with actions $\sigma_{X}:G\times X\rightarrow X$ and $\sigma_{Y}:G\times Y\rightarrow Y$ is $G$ -equivariant if the following diagram commutes

[TABLE]

If $Y$ is given the trivial action $\sigma_{Y}=\pi_{Y}:G\times Y\rightarrow Y$ , then we say $f:X\rightarrow Y$ is $G$ -invariant.

Remark 1.17.

For an action $\sigma:G\times X\rightarrow X$ , there is an induced action of $G$ on the ring of regular functions $\mathcal{O}(X)$ given by

[TABLE]

for $g\in G$ , $f\in\mathcal{O}(X)$ and $x\in X$ .

By the following lemma, any action of an affine algebraic group $G$ on an affine scheme $X$ gives a $G$ -action on the $k$ -algebra $\mathcal{O}(X)$ which is rational; that is, every $f\in\mathcal{O}(X)$ is contained in a finite dimensional $G$ -invariant linear subspace of $\mathcal{O}(X)$ .

Lemma 1.18.

For an affine algebraic group $G$ acting on an affine scheme $X$ , any finite dimensional vector subspace of $\mathcal{O}(X)$ is contained in a finite dimensional $G$ -invariant vector subspace.

Proof.

Let $\sigma^{*}:\mathcal{O}(X)\rightarrow\mathcal{O}(G)\otimes\mathcal{O}(X)$ denote the coaction. Let $W=\mathrm{Span}_{k}(f_{1},\dots,f_{n})\subset\mathcal{O}(X)$ and write $\sigma^{*}(f_{i})=\sum_{j=1}^{n_{i}}h_{ij}\otimes f_{ij}$ with $h_{ij}\in\mathcal{O}(G)$ and $g_{ij}\in\mathcal{O}(X)$ . The vector space spanned by $f_{ij}$ is a $G$ -invariant finite-dimensional subspace containing $W$ , as $g\cdot f_{i}=\sum_{j}h_{ij}(g)f_{ij}$ . ∎

Remark 1.19.

By applying this to the action of $G$ on itself by left multiplication, if we let $W$ be a vector space spanned by a finite choice of algebra generators for $\mathcal{O}(G)$ , then $W$ is contained in a finite dimensional $G$ -invariant vector subspace $V\subset\mathcal{O}(X)$ . One can prove that there is an embedding $G\rightarrow\mathrm{GL}(V)$ to show any affine algebraic group over $k$ is a linear algebraic group.

We can define orbits and stabilisers in this setting; the latter has a scheme structure by definition, and we shall soon see the former can also be equipped with a scheme structure.

Definition 1.20 (Orbits and stabilisers).

For an action $\sigma:G\times X\rightarrow X$ of an affine algebraic group $G$ on a scheme $X$ and for a $k$ -point $x\in X$ , we define

(1)

the orbit $G\cdot x$ of $x$ to be the (set-theoretic) image of $\sigma_{x}=\sigma(-,x):G(k)\rightarrow X(k)$ given by $g\mapsto g\cdot x$ ; 2. (2)

the stabiliser $G_{x}$ of $x$ to be the fibre product of $\sigma_{x}:G\rightarrow X$ and $x:\operatorname{Spec}k\rightarrow X$ .

The stabiliser $G_{x}$ of $x$ is a closed subscheme of $G$ (as it is the preimage of a closed subscheme of $X$ under $\sigma_{x}:G\rightarrow X$ ) and a subgroup of $G$ . By Chevalley’s Theorem [51, II Exercise 3.19], as the image of a morphism of schemes, the orbit is a priori only a constructible subset of $X$ . However, we claim it is a locally closed subset and so can be equipped with the structure of a reduced locally closed subscheme of $X$ . Indeed, $G\cdot x$ is open in its closure: since the orbit is constructible, there is a dense open subset $U$ of $\overline{G\cdot x}$ with $U\subset G\cdot x$ and, as $G$ acts transitively on the orbit, every point in the orbit is contained in a $G$ -translate of $U$ .

The boundary of an orbit is a union of orbits of strictly smaller dimension, and so in particular every orbit closure contains a closed orbit (of minimal dimension). There is also an orbit stabiliser theorem:

[TABLE]

as $\sigma_{x}:G\rightarrow G\cdot x$ is flat (by transitivity of the $G$ -action, we can deduce this from generic flatness) and so we can apply the dimension formula for fibres of a flat morphism [51, Proposition III.9.5].

In general, for an action $\sigma:G\times X\rightarrow X$ , the set of orbits $X/G$ is not a scheme. Instead, we ask for a universal quotient in the category of schemes.

Definition 1.21 (Categorical quotient).

For an action of an affine algebraic group $G$ on scheme $X$ , a categorical quotient is a $G$ -invariant morphism $\varphi:X\rightarrow Y$ of schemes which is universal (that is, every other $G$ -invariant morphism $f:X\rightarrow Z$ factors uniquely through $\varphi$ so that there exists a unique morphism $h:Y\rightarrow Z$ such that $f=h\circ\varphi$ ). If the preimage of each $k$ -point in $Y$ is a single orbit, then we say $\varphi$ is an orbit space.

If a categorical quotient exists, then it is unique up to unique isomorphism. In cases where a categorical quotient does not exist, one may want to enlarge the category of schemes to algebraic spaces or even algebraic stacks.

A categorical quotient is constant on orbits and orbit closures. Hence, a categorical quotient is an orbit space only if the action of $G$ on $X$ is closed; that is, all the orbits $G\cdot x$ are closed.

Example 1.22.

(1)

For $\mathbb{G}_{m}$ acting on $\mathbb{A}^{n}$ by scalar multiplication $t\cdot(a_{1},\dots,a_{n})=(ta_{1},\dots,ta_{n})$ , there are two types of orbits:

•

punctured lines through the origin,

•

the origin (closed of dimension [math])

Every orbit contains the origin in its closure. As any $\mathbb{G}_{m}$ -invariant function on $\mathbb{A}^{n}$ is constant on orbits and their closures, it must be constant and so factors via the structure map $\pi:\mathbb{A}^{n}\rightarrow\operatorname{Spec}k$ . Hence, the structure map is a categorical quotient. 2. (2)

For the action of $\mathbb{G}_{m}$ on $\mathbb{A}^{2}$ by $t\cdot(x,y)=(tx,t^{-1}y)$ , the orbits are

•

conics $\{(x,y):xy=\alpha\}$ for $\alpha\in\mathbb{A}^{1}\setminus\{0\}$ (closed of dimension $1$ ),

•

the punctured $x$ -axis,

•

the punctured $y$ -axis,

•

the origin (closed of dimension [math]).

The punctured axes both contain the origin in their orbit closures. We will see that the categorical quotient for this action is $\mathbb{A}^{2}\rightarrow\mathbb{A}^{1}$ given by $(x,y)\mapsto xy$ .

We see the sort of problems that may occur when we have non-closed orbits. In the first example, our geometric intuition tells us that we would ideally like to remove the origin and then take the quotient of $\mathbb{G}_{m}$ acting on $\mathbb{A}^{n}\setminus\{0\}$ to obtain the projective space $\mathbb{P}^{n-1}=(\mathbb{A}^{n}\setminus\{0\})/\mathbb{G}_{m}$ , which is an orbit space for this action. We will return to this example and see that by introducing a non-trivial notion of GIT semistability, we can remove the origin (see Example 2.20).

There is the following stronger notion of quotient that arises in GIT [87, Definition 1.5].

Definition 1.23 (Good quotient).

A morphism $\varphi:X\rightarrow Y$ is a good quotient for an action of $G$ on $X$ if

i)

$\varphi$ is $G$ -invariant, surjective and affine; 2. ii)

The map $\mathcal{O}_{Y}\rightarrow\varphi_{*}\mathcal{O}_{X}^{G}$ is an isomorphism. 3. iii)

If $W_{1}$ and $W_{2}$ are disjoint $G$ -invariant closed subschemes, then $\varphi(W_{1})$ and $\varphi(W_{2})$ are disjoint closed subschemes.

If moreover the preimage of each point is a single orbit, then we say $\varphi$ is a geometric quotient.

Remark 1.24.

(1)

The definition of a good quotient is local in the target, which enables the construction of good quotients via gluing. 2. (2)

The last two conditions imply that $\varphi$ is surjective: the second property shows that $\varphi$ is dominant (i.e. the image of $\varphi$ is dense in $Y$ ) and the third condition shows that the image of $\varphi$ is closed. Furthermore it implies that for each $y\in Y$ , the preimage $\varphi^{-1}(y)$ contains a unique closed orbit, whose stabiliser is reductive (see Definition 2.3 and [68], as the quotient of the reductive group $G$ by the subgroup $G_{x}$ is affine if and only if $G_{x}$ is reductive). In particular, if all orbits are closed, then $\varphi$ is a geometric quotient. 3. (3)

The third condition also enables us to determine when two orbit closures meet: we have $\overline{G\cdot x_{1}}\cap\overline{G\cdot x_{2}}\neq\phi$ if and only if $\varphi(x_{1})=\varphi(x_{2})$ . 4. (4)

Any good quotient is a categorical quotient; see [79, Proposition 3.11].

Let us relate the construction of moduli spaces with categorical quotients. For a moduli problem $\mathcal{M}$ , a family $\mathcal{F}$ over a scheme $S$ has the local universal property if for any other family $\mathcal{G}$ over a scheme $T$ and for any $k$ -point $t\in T$ , there exists a neighbourhood $U$ of $t$ in $T$ and a morphism $f:U\rightarrow S$ such that $\mathcal{G}|_{U}\sim_{U}f^{*}\mathcal{F}$ .

Proposition 1.25.

[79, Proposition 2.13]* Let $\mathcal{M}$ be a moduli problem for which there exists a family $\mathcal{F}$ over $S$ with the local universal property. Suppose that there is an algebraic group $G$ acting on $S$ such that two $k$ -points $s,t$ lie in the same $G$ -orbit if and only if $\mathcal{F}_{t}\sim\mathcal{F}_{s}$ . Then*

(1)

any coarse moduli space is a categorical quotient of the $G$ -action on $S$ ; 2. (2)

a categorical quotient of the $G$ -action on $S$ is a coarse moduli space if and only if it is an orbit space.

Proof.

For any scheme $M$ , we claim that there is a bijective correspondence

[TABLE]

given by $\eta\mapsto\eta_{S}(\mathcal{F})$ , which is $G$ -invariant by our assumptions about the $G$ -action on $S$ . Conversely, given a $G$ -invariant morphism $f:S\rightarrow M$ , we define $\eta:\mathcal{M}\rightarrow\operatorname{Hom}(-,M)$ to associate to a family $\mathcal{G}$ over $T$ , a morphism $\eta_{T}(\mathcal{G}):T\rightarrow M$ glued together locally by using the local universal property of $\mathcal{F}$ over $S$ . More precisely, we can cover $T$ by open subsets $U_{i}$ such that there is a morphism $h_{i}:U_{i}\rightarrow S$ and $h_{i}^{*}\mathcal{F}\sim_{U_{i}}\mathcal{G}|_{U_{i}}$ . For $u\in U_{i}\cap U_{j}$ , we have

[TABLE]

and so by assumption $h_{i}(u)$ and $h_{j}(u)$ lie in the same $G$ -orbit. Since $f$ is $G$ -invariant, the compositions $f\circ h_{i}:U_{i}\rightarrow M$ glue to a morphism $\eta_{T}(\mathcal{G}):T\rightarrow M$ .

Hence, if $(M,\eta:\mathcal{M}\rightarrow h_{M})$ is a coarse moduli space, then $\eta_{S}(\mathcal{F}):S\rightarrow M$ is $G$ -invariant and the universal $G$ -invariant morphism from $S$ , which proves statement a). Furthermore, the $G$ -invariant morphism $\eta_{S}(\mathcal{F}):S\rightarrow M$ is an orbit space if and only if $\eta_{\operatorname{Spec}k}$ is bijective, which proves statement b). ∎

2. Mumford’s reductive geometric invariant theory

The origins of GIT go back to 19th century invariant theory and a question of Hilbert on the finite generation of invariant rings. We begin with Hilbert’s 14th problem in $\S$ 2.1 and describe other techniques for constructing quotients in $\S$ 2.2. We give various definitions and examples related to reductive and unipotent groups in $\S$ 2.3, and prove finite generation results for invariant rings in $\S$ 2.4. We describe Mumford’s GIT [74] for affine schemes in $\S$ 2.5, for projective schemes in $\S$ 2.6 and in general in $\S$ 2.7, and state a twisted affine version in $\S$ 2.8. Beyond Mumford’s book [74], the notes of Newstead [79] and Thomas [92] provide excellent introductions to GIT.

2.1. Hilbert’s 14th Problem

Consider an action $\sigma:G\times X\rightarrow X$ of an affine algebraic group on an affine scheme. The coaction determines a linear representation $G\rightarrow\mathrm{GL}(\mathcal{O}(X))$ . Any $G$ -invariant morphism $\phi:X\rightarrow Z$ induces a homomorphism $\phi^{*}:\mathcal{O}(Z)\rightarrow\mathcal{O}(X)$ whose image is contained in the ring of $G$ -invariant functions

[TABLE]

Consequently one can ask if the inclusion of the ring of $G$ -invariant functions corresponds to a morphism of affine schemes.

Question 2.1 (Hilbert’s 14th Problem).

Is $\mathcal{O}(X)^{G}$ a finitely generated $k$ -algebra?

Hilbert showed that for the general linear group over the complex numbers, the answer was yes. However, in general $\mathcal{O}(X)^{G}$ is not finitely generated, due to a counterexample of Nagata constructed using an action of a product of additive groups [75, 76]; see [35] for a survey of counterexamples to Hilbert’s 14th problem. Fortunately, Nagata also showed that the answer is yes for a large number of groups, namely reductive groups and in this case Mumford showed that taking the spectrum of the inclusion $\mathcal{O}(X)^{G}\subset\mathcal{O}(X)$ gives a categorical quotient of the action. For non-reductive groups, we will see in $\S$ 5.1 that even when $\mathcal{O}(X)^{G}$ is finitely generated, taking the spectrum of the inclusion of invariants does not always yield a categorical quotient.

2.2. Constructions of quotients by affine algebraic groups

Before turning to Mumford’s GIT for reductive groups, let us give a brief summary of some important results about the construction of quotients of affine algebraic groups in general.

For a free action of an affine algebraic group on a scheme, there is a geometric quotient in the category of algebraic spaces by results of Artin [8] and Kollár [65]. For a finite group, it is easy to see that a free action induces an étale equivalence relation and any quotient of a scheme by an étale equivalence relation is an algebraic space. There are examples of free actions whose geometric quotient is not a scheme: an example of Hironaka gives an action of a finite group whose geometric quotient is not a scheme (see [51, Appendix B, Example 3.4.1]) and an example of Derksen gives an action of the additive group $\mathbb{G}_{a}$ whose geometric quotient is not a scheme (see [29, Example 18]),

Rosenlicht [83] showed that for a connected affine algebraic group $G$ acting on an irreducible variety $X$ , there is a dense open subset which admits a geometric quotient. Unfortunately, this set is non-explicit, as his proof involves showing that the field $k(X)^{G}$ of invariant rational functions is finitely generated (see [43, $\S$ 19 Appendix Lemma 1]).

Remark 2.2 (Transfer Principle).

Let $H<G$ be a closed subgroup of an affine algebraic group $G$ , then $H$ acts on $G$ by left multiplication and this action has a geometric quotient $G/H$ and the quotient map $G\rightarrow G/H$ is étale locally trivial (see [20, p181] and [22, Theorem 1.16]).

For $H<G$ as above, suppose there is an action of $H$ on $X$ ; then for the diagonal action of $H$ on $G\times X$ by $h\cdot(g,x)=(gh^{-1},hz)$ , there is a geometric quotient $G\times^{H}X$ ; for details on this construction, see [28, III $\S$ 4]. The action of $G$ on itself by left multiplication induces a $G$ -action on $G\times^{H}X$ such that there is a bijective correspondence between $H$ -orbits in $X$ and $G$ -orbits in $G\times^{H}X$ . A scheme is a geometric $H$ -quotient of $X$ if and only if it is a geometric $G$ -quotient of $G\times^{H}X$ . Furthermore, if $X$ is affine and the $H$ -action on $X$ extends to $G$ , then we have the following transfer principle (due to Roberts, see [43, $\S$ 9])

[TABLE]

Note $G/H$ may not be affine, so $\mathcal{O}(G/H)$ is not necessarily finitely generated (see Remark 2.10).

2.3. Reductive groups

Let us fix some definitions for unipotent and reductive groups; we note that there are alternative equivalent formulations (for example, see [26, 71]).

Definition 2.3 (Unipotent and reductive groups).

An affine algebraic $k$ -group $G$ is

i)

unipotent if it is isomorphic to a subgroup of a standard unipotent group $\mathbb{U}_{n}\subset\mathrm{GL}_{n}$ consisting of upper triangular matrices with diagonal entries equal to 1. 2. ii)

reductive if it is smooth and every connected unipotent normal subgroup is trivial (this second condition is often phrased as asking for the unipotent radical to be trivial). 3. iii)

geometrically reductive if for every finite dimensional linear representation $\rho:G\rightarrow\mathrm{GL}(V)$ and every non-zero $G$ -invariant point $v\in V$ , there is a non-constant $G$ -invariant homogeneous polynomial $f\in\mathcal{O}(V)$ such that $f(v)\neq 0$ . 4. iv)

linearly reductive if for every finite dimensional linear representation $\rho:G\rightarrow\mathrm{GL}(V)$ and every non-zero $G$ -invariant point $v\in V$ , there is a non-constant $G$ -invariant linear polynomial $f\in\mathcal{O}(V)$ such that $f(v)\neq 0$ .

Remark 2.4.

(1)

$G$ is unipotent if and only if every finite dimensional linear representation $\rho:G\rightarrow\mathrm{GL}(V)$ has a non-zero fixed point. 2. (2)

G is linearly reductive if and only if every finite dimensional linear representation $\rho:G\rightarrow\mathrm{GL}(V)$ is completely reducible (that is, $\rho$ decomposes as a direct sum of irreducible representations) or equivalenty if taking $G$ -invariants on finite dimensional linear $G$ -representations is exact.

Example 2.5.

(1)

The additive group $\mathbb{G}_{a}$ is unipotent, as we have an embedding $\mathbb{G}_{a}\hookrightarrow\mathbb{U}_{2}$ given by

[TABLE] 2. (2)

In characteristic $p$ , there is a finite subgroup $\alpha_{p}\subset\mathbb{G}_{a}$ where we define the functor of points of $\alpha_{p}$ by associating to a $k$ -algebra $R$ ,

[TABLE]

This is represented by the scheme $\operatorname{Spec}k[t]/(t^{p})$ and so $\alpha_{p}$ is a unipotent group which is not smooth. 3. (3)

The multiplicative group $\mathbb{G}_{m}$ or any algebraic torus $T=\mathbb{G}_{m}^{r}$ is linearly reductive: as any linear representation $\rho:T\rightarrow\mathrm{GL}(V)$ admits a weight decomposition

[TABLE]

Exercise 2.6.

Prove that any finite group of order not divisible by the characteristic of $k$ is linearly reductive. (Hint: consider averaging over the group.)

For smooth affine algebraic group schemes over $k$ , we have

[TABLE]

and all three notions coincide in characteristic zero. The first implication is immediate from the definitions and, in characteristic zero, the opposite implication goes back to Weyl and uses the representation theory of compact Lie groups (this is known as Weyl’s unitary trick). The equivalence between reductive and geometrically reductive for smooth affine group schemes was conjectured by Mumford after Nagata proved that every geometrically reductive group is reductive [76]; the opposite implication was proved by Haboush [45].

Let us state an important property of geometrically reductive group actions.

Lemma 2.7 (Geometrically reductive group actions separate closed orbits, [79, Lemma 3.3]).

Let $G$ be a geometrically reductive group acting on an affine scheme $X$ . If $W_{1}$ and $W_{2}$ are disjoint $G$ -invariant closed subsets of $X$ , then there is an invariant function $f\in\mathcal{O}(X)^{G}$ which separates these sets, i.e.

[TABLE]

2.4. Finitely generated rings of invariants

Recall that for an action of an affine algebraic group $G$ on an affine scheme $X$ , the associated action on the coordinate ring $\mathcal{O}(X)$ is rational.

Theorem 2.8 (Nagata, [76]).

Let $G$ be a geometrically reductive group acting rationally on a finitely generated $k$ -algebra $A$ . Then the $G$ -invariant subalgebra $A^{G}$ is finitely generated.

We will outline the proof of this theorem in the significantly easier case when $G$ is linearly reductive; see [79, Theorem 3.4] for the full proof. In this case, one can construct a Reynolds operator, which is a projection $R:A\twoheadrightarrow A^{G}$ onto the $G$ -invariants that satisfies $R(ab)=aR(b)$ for all $a\in A^{G}$ and $b\in A$ . If $G$ is finite, $R$ can be viewed as averaging over the group. Using the Reynolds operator, one can show that $A^{G}$ is Noetherian and then prove it is finitely generated. This approach is similar to Hilbert’s proof that over the complex numbers the ring of invariants for $\mathrm{GL}_{n}$ is finitely generated.

Proof of Theorem 2.8 (for linearly reductive groups).

Since $A$ is a finitely generated $k$ -algebra, it has a countable basis as a $k$ -vector space; thus $A$ can be written as an increasing union of finite dimensional vector spaces. By applying Lemma 1.18 to these vector spaces, we can write $A$ as an increasing union of finite dimensional $G$ -invariant vector spaces $W_{n}$ over $n\in\mathbb{N}$ .

Our assumption that $G$ is linearly reductive implies that the finite dimensional $G$ -representation $W_{n}$ is completely reducible. In particular, we can write $W_{n}$ as a sum of $G$ -representations

[TABLE]

and obtain a projection $R_{n}:W_{n}\twoheadrightarrow W_{n}^{G}$ , which together induce a projection $R:A\rightarrow A^{G}$ .

To show that this projection is a Reynolds operator, we need to show $R(ab)=aR(b)$ for all $a\in A^{G}$ and $b\in A$ . For this take $n$ , so $a,b\in W_{n}$ and pick $m\geq n$ such that left multiplication $l_{a}:A\rightarrow A$ restricts to a homomorphism of $G$ -representations

[TABLE]

As above, we write $W_{n}=W_{n}^{G}\oplus W_{n}^{\prime}$ . Since $a\in A^{G}$ , we have $l_{a}(W_{n}^{G})\subset W_{m}^{G}$ and by Schur’s Lemma, the image of each irreducible representation appearing in $W_{n}^{\prime}$ is either zero or isomorphic to that irreducible representation, thus $l_{a}(W_{n}^{\prime})\subset W_{m}^{\prime}$ . If we write $b=b^{G}+b^{\prime}\in W_{n}^{G}\oplus W_{n}^{\prime}$ , then

[TABLE]

Hence, $R(ab)=ab^{G}=aR(b)$ as required.

For any ideal $I\subset A^{G}$ , we have $I\subset IA\cap A^{G}$ , and using the Reynolds operator, one can show the opposite inclusion. Hence $I=IA\cap A^{G}$ and from this we deduce $A^{G}$ is Noetherian: any increasing chain of ideals $I_{n}$ in $A^{G}$ must stabilise, as the corresponding chain of ideals $I_{n}A$ stabilises due to $A$ being Noetherian.

By choosing generators for the $k$ -algebra $A$ , we can realise it as a quotient of a polynomial ring with linear $G$ -action $\operatorname{Sym}^{*}(V)\twoheadrightarrow A$ . Since any $G$ -equivariant homomorphism of algebras commutes with their Reynolds operators, we obtain a surjection $\operatorname{Sym}^{*}(V)^{G}\twoheadrightarrow A^{G}$ and so to show $A^{G}$ is finitely generated, it suffices to show $\operatorname{Sym}^{*}(V)^{G}$ is finitely generated. Thus we may assume $A=\operatorname{Sym}^{*}(V)$ is a polynomial ring with linear action $G\rightarrow\mathrm{GL}(V)$ . Since $A^{G}$ is Noetherian, the ideal $A_{+}^{G}:=\oplus_{n>0}\operatorname{Sym}^{n}(V)^{G}$ is finitely generated and the generators of this ideal are generators of $A^{G}$ as a $k$ -algebra. ∎

Popov [80] proved a converse to Nagata’s theorem: for any non-reductive group $G$ there is an affine scheme $X$ such that $\mathcal{O}(X)^{G}$ is not finitely generated.

In some simple situations, the ring of invariants for a non-reductive group is finitely generated; however, the corresponding morphism of schemes may fail to be a good quotient (see $\S$ 5.1).

Theorem 2.9 (Weitzenböck [94]).

Assume that the characteristic of $k$ is zero, then any linear $\mathbb{G}_{a}$ -action on $\mathbb{A}^{n}$ extends to $\mathrm{SL}_{2}$ . In this case, the invariant ring $\mathcal{O}(\mathbb{A}^{n})^{\mathbb{G}_{a}}$ is finitely generated.

Proof (after Seshadri [85]).

The first statement follows by putting the associated locally nilpotent derivation (see $\S$ 5.3) in Jordan normal form, so that each length $n$ Jordan block corresponds to the standard $\mathrm{SL}_{2}$ -representation $\operatorname{Sym}^{n-1}(k^{2})$ ; for details, see [43, Lemma 10.2].

Assuming that the linear $\mathbb{G}_{a}$ -action extends to $\mathrm{SL}_{2}$ , let us prove that the ring of invariants is finitely generated. By the transfer principle (Remark 2.2), the ring of $\mathbb{G}_{a}$ -invariants on $\mathbb{A}^{n}$ is isomorphic to the ring of $\mathrm{SL}_{2}$ -invariants on $\mathbb{A}^{n}\times^{\mathbb{G}_{a}}\mathrm{SL}_{2}$ .

For $\mathbb{G}_{a}$ -acting on $\mathrm{SL}_{2}$ by left multiplication, the bottom row is invariant and thus the map $\mathrm{SL}_{2}\rightarrow\mathbb{A}^{2}\setminus\{0\}$ given by sending a matrix $A\in\mathrm{SL}_{2}$ to its bottom row $(a_{21},a_{22})$ is $\mathbb{G}_{a}$ -invariant. In fact, this map is an orbit space and $\mathrm{SL}_{2}/\mathbb{G}_{a}\cong\mathbb{A}^{2}\setminus\{0\}$ ; one way to see this is to note that $\mathbb{G}_{a}$ is the $\mathrm{SL}_{2}$ -stabiliser of $(1,0)\in\mathbb{A}^{2}$ and its orbit $\mathrm{SL}_{2}\cdot(1,0)=\mathbb{A}^{2}\setminus\{0\}$ is isomorphic to $\mathrm{SL}_{2}/\mathbb{G}_{a}$ . Since the $\mathbb{G}_{a}$ -action on $\mathbb{A}^{n}$ extends to $\mathrm{SL}_{2}$ , we have an $\mathrm{SL}_{2}$ -equivariant isomorphism

[TABLE]

This is only quasi-affine (so its coordinate ring may not be finitely generated), but as $\{0\}\subset\mathbb{A}^{2}$ has codimension 2, any regular function extends from $\mathbb{A}^{2}\setminus\{0\}$ to $\mathbb{A}^{2}$ by Hartogs’ lemma. Hence,

[TABLE]

is finitely generated. ∎

Remark 2.10.

The second part of the above proof for $H=\mathbb{G}_{a}<G=\mathrm{SL}_{2}$ can be extended to any Grosshans subgroup, which is a closed subgroup $H<G$ of a reductive group such that $G/H$ is quasi-affine and $\mathcal{O}(G/H)=\mathcal{O}(G)^{H}$ is finitely generated (see [43] for a detailed treatment). Grosshans [41] shows $\mathcal{O}(G)^{H}$ is finitely generated if and only if $G/H$ can be emdedded in an affine variety with complement of codimension $2$ , and proves that unipotent radicals of parabolic subgroups in a reductive group are Grosshans subgroups [42]. For a Grosshans subgroup $H<G$ , the same proof shows that if a $H$ -action an an affine scheme $X$ extends to $G$ (which is not immediate as in the case of Weitzenböck’s Theorem), then $\mathcal{O}(X)^{H}$ is finitely generated.

The proof shows that for a non-reductive group, even if the ring of invariants is finitely generated, taking its spectrum does not necessarily provide a categorical quotient: for $\mathbb{G}_{a}$ -acting on $\mathrm{SL}_{2}$ , we have $\mathcal{O}(\mathrm{SL}_{2})^{\mathbb{G}_{a}}=k[x_{21},x_{22}]$ , but the induced map $\mathrm{SL}_{2}\rightarrow\mathbb{A}^{2}$ is not surjective, so $\mathbb{A}^{2}$ is not the categorical quotient. In fact, even worse, the image may only be a constructible subset (see $\S$ 5.1). In the next subsection we will see that when $G$ is reductive, taking the spectrum of the ring of invariants does give a categorical quotient.

2.5. Affine geometric invariant theory for reductive groups

Let $G$ be a reductive group acting on an affine scheme $X$ . There is an induced action of $G$ on the coordinate ring $\mathcal{O}(X)$ and the ring of invariants $\mathcal{O}(X)^{G}$ is a finitely generated $k$ -algebra by Nagata’s Theorem.

Definition 2.11 (Affine GIT quotient).

For an action of a reductive group $G$ on an affine scheme $X$ , the affine GIT quotient is the morphism $\varphi:X\rightarrow X/\!/G:=\operatorname{Spec}\mathcal{O}(X)^{G}$ of affine schemes associated to the inclusion $\varphi^{*}:\mathcal{O}(X)^{G}\hookrightarrow\mathcal{O}(X)$ .

The double slash notation $X/\!/G$ used for the GIT quotient is a reminder that this quotient is not necessarily an orbit space and so it may identify some orbits. In nice cases, the GIT quotient is an orbit space and in this case we shall write $X/G$ .

Theorem 2.12 (Mumford, [74, Theorem 1.1]).

For a reductive group $G$ acting on an affine scheme $X$ , the affine GIT quotient $X\rightarrow X/\!/G$ is a good quotient and thus categorical quotient.

We will not include the proof of this result, but we note that the proof that the affine GIT quotient is good uses several properties of reductive group actions beyond simply the finite generation of the invariant ring: for a geometrically reductive group, invariant functions can be used to separate closed orbits (see Lemma 2.7) and, for linearly reductive groups, the proof can be simplified by using the fact that taking invariants is exact.

The affine GIT quotient restricts to a geometric quotient on an open stable subset $X^{s}\subset X$ .

Definition 2.13.

A point $x\in X$ is stable if its orbit is closed in $X$ and $\dim G_{x}=0$ (or equivalently, $\dim G\cdot x=\dim G$ ). We let $X^{s}$ denote the set of stable points.

For $x\in X$ , we note that $x$ is stable if and only if $\sigma_{x}:G\rightarrow X$ is proper. Indeed if $\sigma_{x}$ is proper, then its image $G\cdot x$ is closed and the fibres, being both affine and proper, must be finite. Conversely if $x$ is stable, then $\sigma_{x}:G\rightarrow G\cdot x$ has finite fibres and one can show it is finite.

Example 2.14.

For the $\mathbb{G}_{m}$ -action on $\mathbb{A}^{2}$ by $t\cdot(x,y)=(tx,t^{-1}y)$ , we have $\mathcal{O}(\mathbb{A}^{2})^{\mathbb{G}_{m}}=k[xy]$ with affine GIT quotient $\varphi:\mathbb{A}^{2}\rightarrow\mathbb{A}^{1}$ is given by $(x,y)\mapsto xy$ . It is not a geometric quotient, as the three orbits consisting of the punctured axes and the origin are all identified. The stable locus is the complement of $xy=0$ , which admits a geometric quotient $\mathbb{A}^{1}\setminus\{0\}$ .

If we remove the origin, the affine line with a double origin is a geometric quotient of $\mathbb{A}^{2}\setminus\{0\}$ . In this case, we obtain a non-separated quotient as a categorical quotient of a separated scheme.

.

Example 2.15.

Consider $G=\mathrm{GL}_{2}$ acting by conjugation on the space $\operatorname{Mat}_{2\times 2}$ of $2\times 2$ matrices with $k$ -coefficients. The trace and determinant (which are the coefficients of the characteristic polynomial) are invariant functions, and so

[TABLE]

We will soon see this is in fact an equality.

First, we describe the orbits using the theory of Jordan normal forms. As any orbit contains a matrix in Jordan normal form, there are three types of orbits:

•

Matrices with distinct eigenvalues $\alpha,\beta$ and Jordan normal form

[TABLE]

These are closed 2 dimensional orbits, with 2 dimensional stabiliser (diagonal matrices).

•

Matrices with repeated eigenvalue and Jordan normal form with one block

[TABLE]

These orbits are also 2 dimensional but are not closed: for example

[TABLE]

•

Matrices with repeated eigenvalue and Jordan normal form with two blocks

[TABLE]

The stabiliser of such a matrix is $\mathrm{GL}_{2}$ and its orbit is a point, which is closed.

Every orbit closure of the second type contains an orbit of the third type.

Let us show that $\mathcal{O}(\operatorname{Mat}_{2\times 2})^{\mathrm{GL}_{2}}=k[\operatorname{tr},\det]$ . Since any orbit closure contains a diagonal matrix, any invariant function is completely determined by its values on the diagonal matrices and is invariant under permuting the diagonal entries. Hence

[TABLE]

by the theory of (elementary) symmetric polynomials.

The affine GIT quotient is $\varphi=(\mathrm{tr},\det):\operatorname{Mat}_{2\times 2}\rightarrow\operatorname{Mat}_{2\times 2}/\!/\mathrm{GL}_{2}=\mathbb{A}^{2}$ . Since scalar multiples of the identity fix every point, there are no stable points for this action; however, the restriction to the locus of matrices with distinct eigenvalues is a geometric quotient.

Exercise 2.16.

Show the GIT quotient of $\mathrm{GL}_{n}$ acting on $\operatorname{Mat}_{n\times n}$ by conjugation is $\mathbb{A}^{n}$ .

Newstead constructs moduli spaces of cyclic endomorphisms of vector spaces [79, Chapter 2].

2.6. Projective geometric invariant theory

Suppose that a reductive group $G$ acts on a projective scheme $X\subset\mathbb{P}^{n}$ linearly (i.e. by a representation $G\rightarrow\mathrm{GL}_{n+1}$ ). The homogeneous coordinate ring of $X$ is the graded ring

[TABLE]

where $\mathcal{O}(1)$ denotes the pullback of $\mathcal{O}_{\mathbb{P}^{n}}(1)$ to $X$ . By Nagata’s theorem, $R(X)^{G}$ is finitely generated. The inclusion $R(X)^{G}\hookrightarrow R(X)$ determines a rational map of projective schemes

[TABLE]

whose indeterminacy locus is the closed subscheme of $X$ defined by the homogeneous ideal $R(X)_{+}^{G}:=\oplus_{r>0}R(X)_{r}^{G}$ . The domain of definition of this map is the GIT semistable locus.

Definition 2.17.

Let $G$ be a reductive group acting linearly on a projective scheme $X\subset\mathbb{P}^{n}$ .

i)

We say $x\in X$ is semistable if there exists a $G$ -invariant homogeneous function $f\in R(X)^{G}_{r}$ for some $r>0$ such that $f(x)\neq 0$ . We write $X^{ss}$ for the open set in $X$ of semistable points; this is the domain of definition of (1). 2. ii)

We say $x\in X$ is stable222Usually stability is defined by asking for $\dim G_{x}=0$ and for the existence of $f\in R(X)^{G}_{r}$ for some $r>0$ non-vanishing at $x$ such that the $G$ -action on $X_{f}$ is closed; however, this is equivalent to the stated definition. if its orbit is closed in $X^{ss}$ and its stabiliser is zero dimensional. We write $X^{s}$ for the open set in $X$ of stable points. 3. iii)

The restriction of the rational map (1) to the semistable locus $X^{ss}\rightarrow X/\!/G:=\operatorname{Proj}R(X)^{G}$ is called the projective GIT quotient, which is projective over $k$ .

Rather confusingly, a point is called unstable if it is not semistable; this terminology is now standard and there is not much we can do to change it! We refer to points which are semistable but not stable, as strictly semistable. If there are several groups acting on $X$ , we clarify which group we mean by talking about $G$ -(semi)stability.

By definition, $X^{ss}$ is open as it is the domain of definition of the rational map (1). To see that $X^{s}$ is open, we use the equivalent formulation and note it is the intersection of two opens: the set of points with zero dimensional stabiliser is open as $x\mapsto\dim G_{x}$ is upper semi-continuous and the union of $X_{f}$ for $f\in R(X)^{G}_{+}$ on which the action on $X_{f}$ is closed is open.

Theorem 2.18 (Mumford, see [79, Theorem 3.14]).

For a reductive group $G$ acting linearly on a projective scheme $X\subset\mathbb{P}^{n}$ , the projective GIT quotient $\varphi:X^{ss}\rightarrow X/\!/G$ is a projective and good quotient, which restricts to a quasi-projective and geometric quotient of $X^{s}$ .

This result can be proved by gluing together affine GIT quotients: for $f\in R(X)_{+}^{G}$ , the non-vanishing locus $X_{f}$ is affine with affine GIT quotient $X_{f}\rightarrow X_{f}/\!/G$ and we can write $X^{ss}$ as the union of these open affines $X_{f}$ , so $X/\!/G$ is covered by the open affines $X_{f}/\!/G$ .

We have $\varphi(x)=\varphi(y)$ if and only if the orbit closures of $x$ and $y$ meet in $X^{ss}$ . Furthermore, the preimage of any point in $X/\!/G$ contains a unique closed orbit (of minimal dimension in this preimage), whose stabiliser is reductive (see Remark 1.24).

Remark 2.19.

It is important to note that the semistable set and the GIT quotient both depend on the $G$ -equivariant embedding $X\hookrightarrow\mathbb{P}^{n}$ , as the homogeneous coordinate ring depends on this embedding (or equivalently on the line bundle $\mathcal{O}(1)$ pulled back from $\mathbb{P}^{n}$ ).

Alternatively, rather than fixing a linear $G$ -equivariant projective embedding of $X$ , one can instead fix an ample $G$ -equivariant line bundle $\mathcal{L}$ on $X$ , which is often called an ample $G$ -linearisation: $\mathcal{L}=(L,\Phi)$ is an ample invertible sheaf $L$ on $X$ together with a $G$ -equivariant structure given by an isomorphism $\Phi:\sigma^{*}L\rightarrow\pi_{2}^{*}L$ , where $\sigma,\pi_{2}:G\times X\rightarrow X$ denote the action and second projection, which satisfies a cocycle condition $\pi_{23}^{*}\Phi\circ(\operatorname{Id}_{G}\times\sigma)^{*}\Phi=(m\times\operatorname{Id}_{G})^{*}\Phi$ on $G\times G\times X$ . In terms of the associated geometric line bundle, which by abuse of notation we shall also call $L$ , this is equivalent to a $G$ -action on $L$ commuting with the projection $L\rightarrow X$ such that the action on the fibres $L_{g\cdot x}\rightarrow L_{x}$ is linear.

Given an ample $G$ -equivariant line bundle $\mathcal{L}$ on $X$ , we obtain a graded ring with a $G$ -action

[TABLE]

such that the inclusion of invariants induces a rational map whose domain of definition is the semistable set and whose codomain is the GIT quotient (both with respect to $\mathcal{L}$ )

[TABLE]

Since replacing $\mathcal{L}$ with a positive power just has the effect of changing the grading on this ring333By construction, the projective GIT quotient comes with a line bundle and this regrading does not change the GIT quotient but does change this line bundle., we can assume $\mathcal{L}$ is very ample and then we obtain a linear $G$ -equivariant embedding $X\hookrightarrow\mathbb{P}(V)$ where $V:=H^{0}(X,L)^{*}$ , which recovers the above setting of a linear action.

The effect of changing $\mathcal{L}$ is called variation of GIT and can be described in terms of certain birational transformations known as VGIT flips [31, 91]. Furthermore, the space of $G$ -linearisations admits a wall and chamber decomposition describing how semistability varies: in chambers, semistability coincides with stability, but semistability changes on crossing a wall.

2.7. General GIT quotients

More generally, given a scheme $X$ with a $G$ -linearisation $\mathcal{L}$ , Mumford defines a GIT quotient using invariant sections of positive powers of $L$ whose non-vanishing locus is affine (so that one can take affine GIT quotients and glue them). This produces a good quotient of a ‘semistable locus’ ([74, Definition 1.7]), which in this situation is defined to be the set of points $x\in X$ such that there exists $\sigma\in H^{0}(X,L^{\oplus r})^{G}$ for $r>0$ with $\sigma(x)\neq 0$ and such that $X_{\sigma}$ is affine444If $X$ is projective and $\mathcal{L}$ is ample, then this non-vanishing locus is always affine.. The semistable set and quotient obtained in this way are both quasi-projective (see [79, Theorem 3.21]).

Let us remark that in this survey we have assumed that we are working over an algebraically closed field $k$ . The assumption that $k$ is algebraically closed can be dropped, but one has to be careful about rationality questions and work with geometric points for certain statements (for example, the Hilbert–Mumford criterion). Moreover, Seshadri [88] extended GIT to work relative to a base scheme $S$ with mild assumptions on $S$ .

2.8. Affine GIT linearised by a character

As a special case of $\S$ 2.7, consider a linear action of $G$ on an affine scheme $X\subset\mathbb{A}^{n}$ ; then the structure sheaf $\mathcal{O}_{X}$ is naturally equipped with a $G$ -equivariant structure, where if we view this as a geometric line bundle $X\times\mathbb{A}^{1}$ , the $G$ -action on $\mathbb{A}^{1}$ is trivial. In this case, the GIT quotient with respect to this ample $G$ -linearisation $\mathcal{O}_{X}$ is just the affine GIT quotient, as

[TABLE]

with trivial $G$ -action on $z$ and this ring is graded by the degree of $z$ , thus

[TABLE]

This linearisation can be modified by using a character $\rho:G\rightarrow\mathbb{G}_{m}$ to obtain a linearisation $\mathcal{O}_{\rho}$ which is given by $G$ acting linearly on the geometric line bundle $X\times\mathbb{A}^{1}$ by the given action on $X$ and acting via multiplication with $\rho$ on $\mathbb{A}^{1}$ . The outcome of applying GIT in this situation of twisting the linearisation by a character was described by King [62] and results in an open subset $X^{\rho-ss}$ of $\rho$ -semistable points and a GIT quotient

[TABLE]

In this case, the $G$ -invariant sections of $(\mathcal{O}_{\rho})^{\otimes r}\cong\mathcal{O}_{\rho^{r}}$ are $f\in\mathcal{O}(X)$ with $f(g\cdot x)=\rho^{r}(g)f(x)$ for all $g\in G$ and $x\in X$ , which we refer to as $\rho$ -semi-invariant functions of weight $r$ . By definition, $x$ is $\rho$ -semistable if there exists a $\rho$ -semi-invariant function of weight $r>0$ which is non-vanishing at $x$ . Furthermore, $X/\!/_{\!\!\rho}G$ is projective over the spectrum of the [math]th-graded piece which is just the affine GIT quotient $X/\!/G=\operatorname{Spec}\mathcal{O}(X)^{G}$ .

Example 2.20.

For $\mathbb{G}_{m}$ acting on $\mathbb{A}^{n}$ by scalar multiplication linearised by $\mathcal{O}_{\rho}$ for $\rho:\mathbb{G}_{m}\rightarrow\mathbb{G}_{m}$ given by $t\rightarrow t$ , the coordinate functions are $\rho$ -semi-invariant functions of weight $1$ and these generate the ring of invariants. Consequently, we obtain the GIT quotient

[TABLE]

3. Semistability and instability in reductive GIT

Since the reductive GIT quotient only provides a quotient of an open semistable locus, this naturally leads to two questions: can we describe the semistable points and what can we say about unstable (i.e. not semistable) points? For actions on projective (over affine) schemes, the first question is tackled by the Hilbert–Mumford criterion for semistability described in $\S$ 3.1. In moduli problems with a natural notion of subobjects, the Hilbert–Mumford criterion often gives a clean moduli-theoretic interpretation of GIT semistability. We state some application of reductive GIT to moduli in $\S$ 3.2. We then turn to the second question in $\S$ 3.3 and describe how work of Kempf [61], Hesselink [53], Kirwan [63] and Ness [78] gives a stratification of the unstable locus, with a largely combinatorial flavour; we survey some applications of these stratifications and discuss the question of construction quotients of unstable strata, where naturally non-reductive groups (namely, parabolic subgroups, representing an instability flag) appear.

3.1. Semistability and the Hilbert–Mumford criterion

By definition, semistability in reductive GIT is given in terms of the existence of a non-vanishing invariant section. From this definition, it is extremely challenging to determine semistability, as it is essentially equivalent to computing invariant rings, which is a notoriously challenging problem. Fortunately, in certain situations (projective GIT, affine GIT linearised by a character or more generally a projective over affine set-up), the Hilbert–Mumford criterion reduces semistability to checking semistability for $\mathbb{G}_{m}$ -actions, which in turn can be combinatorially described using the weights of the action. More precisely, a $G$ -semistable point is semistable for any subgroup, and thus in particular, for any $\mathbb{G}_{m}$ contained in $G$ ; the Hilbert-Mumford criterion gives a converse to this statement.

For simplicity, throughout this section, we assume we have a linear representation $G\rightarrow\mathrm{GL}(V)$ of a reductive group $G$ and consider the associated linear action on $X=\mathbb{P}(V)$ . We will describe the semistable points in this setting. For a closed subscheme $Y\subset X$ with a linear $G$ -action, we have $Y^{ss}=Y\times_{X}X^{ss}$ and so it suffices to understand semistability on the ambient projective space. For an ample $G$ -linearisation $\mathcal{L}$ on $X$ , using a power of $\mathcal{L}$ puts us in this linear setting.

We will see several different versions of the Hilbert–Mumford criterion, which make it possible to determine semistability in practice. The first, and weakest, version is a topological criterion.

Proposition 3.1 (Topological Hilbert–Mumford criterion, [74, Proposition 2.2]).

For a linear action of a reductive group $G$ on $\mathbb{P}(V)$ , the following statements hold for $x=[v]\in\mathbb{P}(V)$ .

i)

$x$ * is semistable if and only if $0\notin\overline{G\cdot v}$ ;* 2. ii)

$x$ * is stable if and only if $\dim G_{v}=0$ and $G\cdot v$ is closed in $V$ .*

Proof.

We will just give the proof of the first statement. By definition $x=[v]$ is semistable if and only if there is a $G$ -invariant homogeneous polynomial $f\in R(X)^{G}$ which is non-zero at $x$ . Since $f$ is $G$ -invariant it is constant on orbit closures, and so $f$ separates the closed schemes $\overline{G\cdot v}$ and [math], which shows these closed subschemes are disjoint. Conversely, if the closed $G$ -invariant schemes $\overline{G\cdot v}$ and [math] in $V$ are disjoint, then as $G$ is geometrically reductive, there exists a $G$ -invariant polynomial $f\in\mathcal{O}(V)^{G}$ separating these subsets

[TABLE]

by Lemma 2.7. By considering the decomposition of $f=\sum_{i}f_{i}$ into ( $G$ -invariant) homogeneous pieces, we see there is a $G$ -invariant homogeneous piece $f_{i}$ which is non-vanishing at $x$ . ∎

Definition 3.2.

For a linear action of a torus $T=\mathbb{G}_{m}^{n}$ on $\mathbb{P}(V)$ , consider the associated weight decomposition $V=\oplus_{\chi\in X^{*}(T)}V_{\chi}$ . We refer to the support of this decomposition as the $T$ -weights on $\mathbb{P}(V)$ . For $x=[v]\in\mathbb{P}(V)$ , we write $v=\sum v_{\chi}$ and define the $T$ -weight set of this point to be

[TABLE]

For a $\mathbb{G}_{m}$ -action on a separated scheme, we will often use the following notation.

Notation 3.3.

If a morphism $f:\mathbb{G}_{m}\rightarrow S$ , with $S$ separated, extends to $\tilde{f}:\mathbb{A}^{1}\rightarrow S$ , then this extension is unique and we write $\lim_{t\rightarrow 0}f(t):=\tilde{f}(0)$ . Similarly if $f$ extends to $\mathbb{P}^{1}$ , we write $\lim_{t\rightarrow\infty}f(t):=\tilde{f}(\infty)$ .

We can now give a combinatorial description of (semi)stability for a $\mathbb{G}_{m}$ -action in terms of whether or not the origin lies in (the interior of) the convex hull of $\mathbb{G}_{m}$ -weights.

Proposition 3.4 (Hilbert–Mumford for $\mathbb{G}_{m}$ -actions).

For a linear action of $\mathbb{G}_{m}$ on $\mathbb{P}(V)$ and $x\in\mathbb{P}(V)$ , the following statements hold:

i)

$x$ * is $\mathbb{G}_{m}$ -semistable if and only if $0\in\operatorname{conv}(\operatorname{wt}_{\mathbb{G}_{m}}(x))$ .* 2. ii)

$x$ * is $\mathbb{G}_{m}$ -stable if and only if $0\in\operatorname{Int}(\operatorname{conv}(\operatorname{wt}_{\mathbb{G}_{m}}(x)))$ .*

Proof.

We again just prove the statement for semistability. By the topological Hilbert–Mumford criterion, we have that $x=[v]\in\mathbb{P}(V)$ is $\mathbb{G}_{m}$ -semistable if and only if $0\notin\overline{\mathbb{G}_{m}\cdot v}$ . Any point in the boundary of this orbit closure is either

[TABLE]

Moreover, we have $\lim_{t\rightarrow 0}t\cdot v=0$ if and only if $\operatorname{wt}_{\mathbb{G}_{m}}(v)\subset\mathbb{Z}_{>0}$ (and similarly $\lim_{t\rightarrow\infty}t\cdot v=0$ if and only if $\operatorname{wt}_{\mathbb{G}_{m}}(v)\subset\mathbb{Z}_{<0}$ ). Hence $x=[v]\in\mathbb{P}(V)$ is $\mathbb{G}_{m}$ -semistable if and only if there exists $r_{0}\leq 0$ and $r_{\infty}\geq\infty$ in $\operatorname{wt}_{\mathbb{G}_{m}}(v)$ , or equivalently $0\in\operatorname{conv}(\operatorname{wt}_{\mathbb{G}_{m}}(x))$ . ∎

Example 3.5.

The linear action of $\mathbb{G}_{m}$ on $X=\mathbb{P}^{n}$ by

[TABLE]

has weights $\pm 1$ . Hence, for a point $x$ to be (semi)stable it needs both these weights, which means its first coordinate $x_{0}$ must be non-zero and at least one of the other coordinates $x_{i}$ for $i>0$ must be non-zero. One can also see this by directly proving that

[TABLE]

In particular, $X^{ss}\cong\mathbb{A}^{n}\setminus\{0\}$ and $\mathbb{P}^{n}/\!/\mathbb{G}_{m}=\mathbb{P}^{n-1}$ is a geometric $\mathbb{G}_{m}$ -quotient.

The Hilbert–Mumford criterion will ultimately be a numerical criterion that phrases semistability in terms of the weights of 1-parameter subgroups (1-PS), which are non-trivial group homomorphisms $\lambda:\mathbb{G}_{m}\rightarrow G$ .

Definition 3.6 (Hilbert–Mumford weight).

For a linear action of a reductive group $G$ on $\mathbb{P}(V)$ , we define the Hilbert-Mumford weight of $x=[v]$ at a 1-parameter subgroup $\lambda:\mathbb{G}_{m}\rightarrow G$ to be

[TABLE]

Let us note some useful properties of the Hilbert–Mumford weight.

Exercise 3.7.

Show that the Hilbert–Mumford weight of $x=[v]$ has the following properties.

(1)

$\mu(x,\lambda)$ is the unique integer $\mu$ such that $\lim_{t\to 0}t^{\mu}\lambda(t)\cdot v$ exists and is non-zero. 2. (2)

$\mu(x,\lambda)=\mu(x_{0},\lambda)$ where $x_{0}=\lim_{t\to 0}\lambda(t)\cdot x$ (and this limit exists as $X$ is projective). 3. (3)

$\mu(x,\lambda)\leq 0\iff\lim_{t\to 0}\lambda(t)\cdot v$ exists, with equality if and only if $\lim_{t\to 0}\lambda(t)\cdot v\neq 0$ . 4. (4)

$\mu(g\cdot x,g\lambda g^{-1})=\mu(x,\lambda)$ for all $g\in G$ . 5. (5)

$\mu(x,\lambda^{n})=n\mu(x,\lambda)$ for a positive integer $n$ .

For a linear $\mathbb{G}_{m}$ -action on $\mathbb{P}(V)$ , we see that for the 1-PS given by $\lambda(t)=t$ , we have

[TABLE]

and

[TABLE]

Hence $x$ is semistable if $\mu(x,-)\geq 0$ for $\lambda$ and $\lambda^{-1}$ . Furthermore, $x$ is stable if and only if this inequality is strict for both 1-PSs. This is precisely the numerical version of the Hilbert–Mumford criterion that we now can state.

Theorem 3.8 (Hilbert–Mumford criterion, [74, Theorem 2.1]).

For a reductive group $G$ acting linearly on a projective scheme $X\subset\mathbb{P}^{n}$ , the following statements hold for $x\in X$ .

i)

$x$ * is semistable if and only if $\mu(x,\lambda)\geq 0$ for all 1-PS $\lambda:\mathbb{G}_{m}\rightarrow G$ ,* 2. ii)

$x$ * is stable if and only if $\mu(x,\lambda)>0$ for all 1-PS $\lambda:\mathbb{G}_{m}\rightarrow G$ .*

Note that it suffices to check these inequalities for primitive 1-PSs (i.e. 1-PSs which are not positive powers of another 1-PS); see Exercise 3.7.

The full proof of the Hilbert–Mumford criterion is beyond the scope of this survey, but following the topological version (Proposition 3.1), it suffices to show that the reductive group $G$ has enough 1-PSs to detect if the origin is contained in the closure of orbits of linear actions $G\rightarrow\mathrm{GL}(V)$ , which is precisely the following result (see [74, p53] and [61, Theorem 1.4]), whose proof involves the Cartan-Iwahori decomposition for the reductive group $G$ .

Theorem 3.9 (Fundamental Theorem of GIT).

Let $G$ be a reductive group acting on an affine space $V$ . If $v\in V$ and $0\in\overline{G\cdot v}$ , then there is a 1-PS $\lambda$ of $G$ such that $\lim_{t\to 0}\lambda(t)\cdot v=0$ .

Remark 3.10 (Hilbert–Mumford weight for a linearised action).

In the case of a linear $G$ -action on a projective scheme $X\subset\mathbb{P}^{n}$ , the Hilbert–Mumford weight for $x\in X$ defined above depends on the choice of $G$ -representation $G\rightarrow\mathrm{GL}_{n+1}$ (as the weights depend on this representation).

In general, for a $G$ -linearisation $\mathcal{L}=(L,\Phi)$ on a projective $G$ -scheme $X$ , we consider the $\lambda(\mathbb{G}_{m})$ -fixed point $x_{0}=\lim_{t\rightarrow 0}t\cdot x$ . The linearisation $\Phi$ induces a $\mathbb{G}_{m}$ -representation on the fibre of $L$ over $x_{0}$

[TABLE]

of weight $r$ (that is $\lambda(t)^{-1}$ acts on this fibre by $t\mapsto t^{r}$ ). Then the Hilbert–Mumford weight (with respect to $\mathcal{L}$ ) is defined to be minus the weight on this fibre

[TABLE]

In this linearised situation, the Hilbert–Mumford criterion says $x\in X$ is semistable (with respect to $\mathcal{L}$ ) if and only if $\mu^{\mathcal{L}}(x,\lambda)\geq 0$ for all 1-PS $\lambda:\mathbb{G}_{m}\rightarrow G$ .

If $X\subset\mathbb{P}^{n}$ and $\mathcal{L}=\mathcal{O}(1)$ is the pullback of $\mathcal{O}_{\mathbb{P}^{n}}(1)$ , then these two definitions coincide: we have $\mu^{\mathcal{O}(1)}(x,\lambda)=\mu(x,\lambda)$ by [74, Proposition 2.3].

As any 1-PS can be conjugated to lie in a fixed maximal torus $T<G$ , one can phrase $G$ -semistability of a point in terms of $T$ -semistability of all $G$ -translates of that point by Exercise 3.7 above. Then $T$ -semistability can be stated combinatorially using the torus weights analogous to Proposition 3.4 above. This gives a combinatorial Hilbert–Mumford criterion.

Proposition 3.11 (Torus weights version of Hilbert–Mumford criterion, [30, $\S$ 9.4]).

For a reductive group $G$ acting on a projective scheme $X\subset\mathbb{P}^{n}$ linearly, fix a maximal torus $T<G$ . For $x\in X$ , the following statements hold.

i)

$x$ * is $G$ -(semi)stable if and only if $g\cdot x$ is $T$ -(semi)stable for all $g\in G$ .* 2. ii)

$x$ * is $T$ -semistable if and only if $0\in\operatorname{conv}(\operatorname{wt}_{T}(x))$ .* 3. iii)

$x$ * is $T$ -stable if and only if $0\in\operatorname{Int}(\operatorname{conv}(\operatorname{wt}_{T}(x)))$ .*

By the first statement, the $G$ -semistable set is the $G$ -sweep of the $T$ -semistable set:

[TABLE]

Exercise 3.12 (Semistability for binary forms).

Consider the action of $\mathrm{SL}_{2}$ on the space of degree $d$ binary forms $\mathbb{P}^{d}=\mathbb{P}(k[x,y]_{d})$ . For $p_{F}\in\mathbb{P}^{d}$ corresponding to $F(x,y)\in k[x,y]_{d}$ , show

i)

$F$ is semistable if and only if all roots of $F$ have multiplicity less than or equal to $d/2$ ; 2. ii)

$F$ is stable if and only if all roots of $F$ have multiplicity strictly less than $d/2$ .

Remark 3.13 (Hilbert–Mumford criterion for action on affine scheme twisted by a character).

For a linear action of a reductive group $G$ on an affine scheme $X\subset\mathbb{A}^{n}$ linearised via $\mathcal{O}_{\rho}$ for a character $\rho:G\rightarrow\mathbb{G}_{m}$ (see $\S$ 2.8), King proved a topological Hilbert–Mumford criterion [62, Lemma 2.2], by using the total space of the dual linearisation to replace the affine cone, and obtained the following numerical Hilbert–Mumford criterion [62, Proposition 2.5]:

(1)

$x$ is $\rho$ -semistable if and only if $\langle\rho,\lambda\rangle\geq 0$ for all 1-PS $\lambda:\mathbb{G}_{m}\rightarrow G$ such that $\lim_{t\rightarrow 0}\lambda(t)\cdot x$ exists, 2. (2)

$x$ is $\rho$ -stable if and only if $\langle\rho,\lambda\rangle>0$ for all 1-PS $\lambda:\mathbb{G}_{m}\rightarrow G$ such that $\lim_{t\rightarrow 0}\lambda(t)\cdot x$ exists,

where $\langle\rho,\lambda\rangle=r$ if $\rho\circ\lambda(t)=t^{r}$ , i.e. this is the natural pairing between characters and cocharacters. Using the abstract definition of the Hilbert–Mumford weight in terms of the weight of the action on the fibre over the limit point of the $\mathbb{G}_{m}$ -action (see Remark 3.10), we see that if $x_{0}=\lim_{t\rightarrow 0}\lambda(t)\cdot x$ exists, then $\mu^{\mathcal{O}_{\rho}}(x,\lambda)=\langle\rho,\lambda\rangle.$

Exercise 3.14.

Using Remark 3.13, show that for $\mathrm{GL}_{2}$ acting on $\operatorname{Mat}_{2\times n}$ by left multiplication the (semi)stable locus for the character $\rho=\det$ is the matrices of maximal rank (namely rank $2$ ), and so the GIT quotient is the Grassmannian $\operatorname{Gr}(2,n)$ .

We note that there is a Hilbert–Mumford criterion in the more general setting of a projective over affine variety with an action of a linearly reductive group [44].

3.2. A brief survey of applications of reductive GIT to moduli

The notion of moduli functor is heavily influenced by Grothendieck’s approach to algebraic geometry. Furthermore, Grothendieck proved that the Hilbert and Quot functors are representable by projective schemes; these are fine moduli spaces and provide parameter spaces in the GIT constructions of moduli of smooth projective curves and moduli of vector bundles on curves.

The first truly interesting application of GIT was Mumford’s construction of moduli spaces of curves [74, Chapter 5]. Mumford constructed a coarse moduli space $M_{g}$ for smooth projective curves of genus $g\geq 2$ by using a power of the canonical bundle to give a projective embedding $C\hookrightarrow\mathbb{P}^{N}$ and constructing $M_{g}$ as a quotient of a suitable Chow variety parametrising pluricanonical curves. Gieseker [38] provided an alternative GIT construction of $M_{g}$ and its Deligne–Mumford compactification $\overline{M_{g}}$ via stable curves as a quotient of the $\mathrm{PGL}_{N+1}$ -action on a suitable Hilbert scheme with a linearisation given by embedding in a Grassmannian associated to a sufficiently large choice of $m$ . Although there is a direct proof that smooth curves are (asymptotically) GIT stable, the proof that stable curves are (asymptotically) GIT stable is indirect (see [72, $\S$ 3.1]). Gieseker’s Hilbert scheme construction is now the prevalent perspective, which has been generalised to give GIT constructions of moduli spaces of pointed stable curves and stable maps and their (birational) geometry is studied using VGIT (see [67, 72]).

The other influential and successful application of GIT was the construction of moduli spaces of vector bundles (of fixed rank and degree) on a fixed smooth projective curve $C$ . One of the first ideas to construct vector bundle moduli spaces over $k=\mathbb{C}$ was to use unitary representations of the fundamental group $\pi_{1}(C)$ and led to the Narasimhan–Seshadri Theorem [77] relating irreducible representations with stable vector bundles considered by Mumford [73], where Mumford’s notion of stability came from the Hilbert–Mumford criterion in GIT and involves verifying an inequality of slopes (the ratio of the degree and the rank) for all subbundles. For moduli problems with a natural notion of subobjects, the study of 1-PSs in GIT often corresponds to filtrations by subobjects and stability can be phrased as an inequality for all subobjects. The GIT construction of moduli spaces of (semi)stable vector bundles was given by Seshadri [86] (see [79, Chapter 5]), and was later generalised by Simpson [89] to construct moduli spaces of sheaves (and Higgs sheaves) on higher dimensional schemes as GIT quotients of Quot schemes. Quot schemes appear as semistable vector bundles can be parametrised as quotients of a fixed vector bundle as mentioned in Example 1.12(4). This construction has been generalised to construct various bundle moduli spaces [84].

Mumford also applied GIT to construct moduli spaces of projective hypersurfaces $X\subset\mathbb{P}^{n}$ of degree $d$ by taking a quotient of $\mathrm{PGL}_{n+1}$ acting on $\mathbb{P}(k[x_{1},\dots,x_{n}]_{d})$ as in Example 1.12 (3). He showed smooth hypersurfaces are GIT stable if $n\geq 2$ and $d\geq 3$ (see [74, Chapter 4.2]).

King [62] developed GIT for a linear action on an affine space with respect to a character (see $\S$ 2.8) to construct reasonable moduli spaces of semistable representations of a quiver, where semistability depends on a stability parameter; the Hilbert–Mumford criterion gives a moduli-theoretic interpretation of semistability as an inequality holding for all subrepresentations.

3.3. Instability

In this section, we continue to suppose that we have a reductive group $G$ acting on a projective scheme $X\subset\mathbb{P}^{n}$ linearly. Since the GIT quotient provides a categorical quotient of the semistable locus $X^{ss}$ , it is natural to ask what can be said about the unstable points (i.e. not semistable points)

[TABLE]

By the Hilbert–Mumford criterion, if a point is unstable, then it has a negative Hilbert–Mumford weight for some 1-PS. Starting from this observation, Kempf [61] associated to an unstable orbit a conjugacy class of 1-PSs which are ‘most responsible’ for its instability, in the sense that they minimise a ‘normalised Hilbert–Mumford weight’. Hesselink then used Kempf’s work to statify the unstable locus [53]. This stratification was described more explicitly and, when $k=\mathbb{C}$ , compared with a Morse stratification associated to the norm square of the moment map for the action of a maximal compact subgroup by Kirwan [63] and Ness [78].

Let us start by describing how to fix a conjugation invariant norm on 1-PSs of $G$ .

Definition 3.15.

A conjugation invariant norm on 1-PSs of a reductive group $G$ is given by fixing a maximal torus $T<G$ and a Weyl-invariant integral-valued bilinear form on the 1-PSs $X_{*}(T)$ of $T$ with associated norm $||-||$ . For any 1-PS $\lambda:\mathbb{G}_{m}\rightarrow G$ , there exists $g\in G$ such that $g\lambda g^{-1}\in X_{*}(T)$ and we define

[TABLE]

which is independent of the choice of $g$ due to the Weyl invariance.

Over the complex numbers, such a norm can be constructed by fixing a Weyl invariant inner product on the Lie algebra $\mathfrak{t}$ of $T$ , which gives an identification $\mathfrak{t}\cong\mathfrak{t}^{*}$ .

Example 3.16.

If $G=\mathrm{GL}_{n}$ and $T$ is the diagonal maximal torus, then the Euclidean norm on $\mathbb{R}^{n}\cong X_{*}(T)_{\mathbb{R}}$ is invariant under the Weyl group $S_{n}$ .

Using the norm $||-||$ , we define a normalised Hilbert–Mumford weight and state Kempf’s notion [61] of an adapted 1-PS for an unstable point.

Definition 3.17 (Normalised Hilbert–Mumford weight and adapted 1-PS).

For a reductive group $G$ acting on a projective scheme $X\subset\mathbb{P}^{n}$ linearly and a fixed conjugation invariant norm on 1-PSs of $G$ , we define the normalised Hilbert–Mumford weight of $x\in X$ at a 1-PS $\lambda$ to be $\mu(x,\lambda)/||\lambda||$ and we define the minimum normalised Hilbert–Mumford weight of $x$ to be

[TABLE]

If $x$ is unstable, a primitive 1-PS is said to be adapted to $x$ if it acheives this minimum and we write $\Lambda_{x}$ for the set of primitive 1-PSs adapted to $x$ .

Let us collect Kempf’s results on adapted 1-PSs in the following theorem.

Theorem 3.18 (Kempf, [61]).

Let $G$ be a reductive group acting on a projective scheme $X\subset\mathbb{P}^{n}$ linearly and fix a conjugation invariant norm on 1-PSs of $G$ . Then for an unstable point $x\in X$ , we have $\Lambda_{x}\neq\emptyset$ and there is a parabolic subgroup $P_{x}<G$ with the following properties.

i)

For any $\lambda\in\Lambda_{x}$ , we have $P_{x}=P_{\lambda}$ (see Definition 3.19 below). 2. ii)

Any two 1-PSs in $\Lambda_{x}$ are conjugate by an element of $P_{x}$ . 3. iii)

If $T<G$ is a maximal torus with $T<P_{x}$ , then $\Lambda_{x}\cap X_{*}(T)$ is a single Weyl orbit. 4. iv)

We have $g\Lambda_{x}g^{-1}=\Lambda_{g\cdot x}$ for all $g\in G$ . 5. v)

If $\lambda\in\Lambda_{x}$ , then $\lambda\in\Lambda_{x_{0}}$ and also $M(x)=M(x_{0})$ where $x_{0}:=\lim_{t\rightarrow 0}\lambda(t)\cdot x$ .

Although we will not give the details on the proof of this theorem, the existence of an adapted 1-PS boils down to the fact that one can work in a maximal torus (by translating using the $G$ -action) and then the normalised Hilbert–Mumford weight for 1-PS in a given maximal torus can be determined from subsets of the torus weights of the action, which is a finite set (see also Remark 3.25). The last two properties follow from Exercise 3.7.

Definition 3.19 (Parabolic and Levi group associated to a 1-PS).

For a 1-PS $\lambda:\mathbb{G}_{m}\rightarrow G$ of a reductive group $G$ , we define

[TABLE]

then $P_{\lambda}=U_{\lambda}\rtimes L_{\lambda}$ is a parabolic subgroup with Levi subgroup $L_{\lambda}$ and unipotent radical $U_{\lambda}$ and $q_{\lambda}:P_{\lambda}\rightarrow L_{\lambda}$ is a retraction onto the Levi.

Hesselink [53] stratified the unstable locus by pairs $\beta=([\lambda],m)$ consisting of a conjugacy class of an adapted 1-PS $[\lambda]$ and a minimum normalised Hilbert–Mumford weight $m$ . Before, we give the concrete construction of this stratification, we provide a summary of its properties and give an overview of various applications. As is customary, we include the semistable set as the lowest stratum in this stratification.

Theorem 3.20 (Kempf [61], Hesselink [53], Kirwan [63], Ness [78]).

For a reductive group $G$ acting linearly on a projective scheme $X\subset\mathbb{P}(V)$ and a conjugation invariant norm on 1-PSs of $G$ , there is a finite instability stratification

[TABLE]

into locally closed subschemes with a partial ordered index set $\mathcal{B}$ with the following properties.

i)

The lowest stratum is indexed by $\beta=0$ and we have $S_{0}=X^{ss}$ . 2. ii)

The closure of any stratum is contained in the union of higher strata: $\overline{S_{\beta}}\subset\bigsqcup_{\gamma\geq\beta}S_{\gamma}$ . 3. iii)

$\mathcal{B}$ * is determined combinatorially from the weights on $V$ of a maximal torus $T<G$ .* 4. iv)

The strata $S_{\beta}$ can be determined from simpler limit sets, which are GIT semistable loci for smaller reductive group actions with a twisted linearisation.

We will soon make the last two statements more precise. First, let us state some applications of these instability (or Hesselink–Kempf–Kirwan–Ness) stratifications.

Remark 3.21.

When $X$ is smooth, these stratifications have been used in the following ways:

(1)

By Kirwan [63], over $k=\mathbb{C}$ compute the $G$ -equivariant rational Betti numbers of $X^{ss}$ (which coincides with the rational Betti numbers of $X/\!/G$ when $X^{s}=X^{ss}$ ) by showing the Gysin long exact sequences in equivariant cohomology for this stratification split. Hence, there is a surjection known as the Kirwan map

[TABLE]

with explicit kernel. 2. (2)

By Dolgachev–Hu [31] and Thaddeus [91], to describe the birational transformations between GIT quotients given by varying the linearisation. 3. (3)

By Halpern-Leistner [47] and Ballard–Favero–Katzarkov [10], to construct semi-orthogonal decompositions in the derived category of a (stacky) GIT quotient. 4. (4)

By Halpern-Leistner [48], as inspiration to formulate an abstract notion of a $\Theta$ -stratification on a stack.

Remark 3.22.

Over $k=\mathbb{C}$ , there is a close relationship between GIT quotients and symplectic reductions, which are quotients in symplectic geometry. A smooth projective variety $X\subset\mathbb{P}^{n}_{\mathbb{C}}$ is Kähler and thus has a symplectic form (inherited from the Fubini-Study form on $\mathbb{P}^{n}_{\mathbb{C}}$ ). For a representation $G\rightarrow\mathrm{GL}_{n+1}$ , there is a maximal compact subgroup $K<G$ which acts by unitary transformations, and $K$ preserves the Kähler form. Moreover, there is a moment map $\mu:X\rightarrow\mathfrak{k}^{*}$ to the co-Lie algebra of $K$ such that the GIT quotient is homeomorphic to the symplectic reduction (the quotient of the zero level set of the moment map by $K$ , see [69]):

[TABLE]

via the Kempf–Ness Theorem [60] (see also [92]). More precisely, $x\in X$ is semistable if and only if its $G$ -orbit closure meets $\mu^{-1}(0)$ (and if this intersection is non-empty, it consists of a unique $K$ -orbit). Furthermore, by work of Kirwan [63] and Ness [78], the GIT instability stratification with respect to a conjugation invariant norm coincides with the Morse stratification associated to the norm square of the moment map $||\mu||^{2}:X\rightarrow\mathbb{R}$ (see also [74, Chapter 8]).

We now turn to the (first set-theoretic) construction of the unstable strata in the situation of a reductive group $G$ acting linearly on a projective scheme $X\subset\mathbb{P}^{n}$ with a fixed conjugation invariant norm $||-||$ on 1-PSs on $G$ .

Definition 3.23 (Unstable strata, blades and limit sets).

For $\beta=([\lambda],m)\in X_{*}(G)/G\times\mathbb{R}_{<0}$ , we define the associated unstable stratum

[TABLE]

If we fix a representative $\lambda$ of $[\lambda]$ , then we define limit sets $Z_{\beta}^{ss}$ and blades $Y_{\beta}^{ss}$ as follows

[TABLE]

where $p_{\lambda}(x):=\lim_{t\rightarrow 0}\lambda(t)\cdot x$ . We define the index set $\mathcal{B}=\{0\}\sqcup\{\beta:S_{\beta}\neq\emptyset\}$ , which turns out to be finite (see Remark 3.25 below).

Let us also introduce a closed subscheme $Z_{\beta}$ of the $\lambda$ -fixed locus on which the normalised Hilbert–Mumford weight of $\beta$ is $m$ and its attracting set $Y_{\beta}$ under the flow by $\lambda(\mathbb{G}_{m})$ as $t\rightarrow 0$ :

[TABLE]

Note that $Z_{\beta}^{ss}\subset Z_{\beta}$ and $Y_{\beta}^{ss}\subset Y_{\beta}$ ; we will soon see these are open subsets and thus we can give these sets a scheme structure. Furthermore, these schemes all depend on the chosen representative $\lambda$ of $[\lambda]$ . Recall that $Y_{\beta}^{(ss)}$ and $Z_{\beta}^{(ss)}$ depend on a choice of 1-PS $\lambda\in[\lambda]$ as well as $m$ , whereas $P_{\lambda}$ only depends on $\lambda$ (and not $m$ ). Note that in [63], $P_{\lambda}$ is denoted by $P_{\beta}$ .

Since the notion of adapted 1-PS depends on the choice of norm, the unstable strata also depend on this choice of norm, but $S_{0}=X^{ss}$ does not.

Proposition 3.24 (Kirwan, [63, $\S$ 12]).

Assume that $X\subset\mathbb{P}^{n}$ is smooth. For an unstable index $\beta=([\lambda],m)\neq 0\in\mathcal{B}$ , the stratum $S_{\beta}$ can be described as follows.

[TABLE]

Furthermore, the blades and limit sets can be described as follows.

i)

The retraction $p_{\beta}:Y_{\beta}\rightarrow Z_{\beta}$ is a Zariski locally trivial affine space fibration555This follows by work of Białynicki-Birula [19] describing the decomposition of a smooth projective variety with a $\mathbb{G}_{m}$ -action by taking the flow as $t\rightarrow 0$ . Here it is crucial that $X$ is smooth for the fibres to be affine spaces.. 2. ii)

The scheme $Y_{\beta}$ is preserved by the $P_{\lambda}$ -action and $Z_{\beta}$ is preserved by the $L_{\lambda}$ -action. Moreover, $p_{\beta}$ is equivariant with respect to the retraction $q_{\lambda}:P_{\lambda}\rightarrow L_{\lambda}$ . 3. iii)

For $p_{\beta}:Y_{\beta}\rightarrow Z_{\beta}$ , we have $p_{\beta}^{-1}(Z_{\beta}^{ss})=Y_{\beta}^{ss}$ . 4. iv)

$Z_{\beta}^{ss}$ * is the GIT semistable set for the action of the reductive Levi subgroup $L_{\lambda}$ on $Z_{\beta}$ with respect to a canonical linearisation $\mathcal{L}_{\beta}$ obtained by twisting by a rational multiple666Via $||-||$ , we can identify characters and co-characters, so we twist by the rational character $\chi$ corresponding to the rational 1-PS $\frac{-m}{||\lambda||}\lambda$ , so that $\mu^{\mathcal{L}_{\beta}}(x,\lambda)=\mu(x,\lambda)+\langle\chi,\lambda\rangle=0$ for $x\in X^{\lambda}$ to ‘cancel’ the effect of $\lambda$ . of a character corresponding to $\lambda$ .*

Remark 3.25.

For $G$ acting linearly on $X=\mathbb{P}(V)$ with a fixed choice of norm $||-||$ , we can compute the index set $\mathcal{B}$ of the instability stratification in terms of the finitely many weights of the action of a maximal torus $T<G$ . For any subset of the $T$ -weights whose convex hull does not contain the origin (that is, this is a weight set of an unstable point), we let $\lambda$ be the primitive 1-PS of $T$ corresponding under $||-||$ to the ray in the $X_{*}(T)$ through the closest point to [math] in the convex hull of this weight set and define $m$ to be the minimum normalised Hilbert–Mumford weight of any point with this weight set, then $\beta=([\lambda],m)\in\mathcal{B}$ (see [63, Lemma 12.6]). In particular, $\mathcal{B}$ is finite as there are only finitely many $T$ -weights.

Exercise 3.26 (Instability stratification for binary forms).

Consider the action of $\mathrm{SL}_{2}$ on $\mathbb{P}^{d}=\mathbb{P}(k[x,y]_{d})$ as in Exercise 3.12 and show for each integer $\frac{d}{2}<r\leq d$ , there is an unstable stratum corresponding to binary forms $F(x,y)$ with a root of exactly multiplicity $r$ .

In the situation of a reductive group acting linearly on an affine space linearised by a character, using King’s Hilbert–Mumford criterion [62, Proposition 2.5] (see also Remark 3.13), one can construct an instability stratification [54]. For the action of $\mathrm{GL}_{r}$ on $\operatorname{Mat}_{r\times n}$ by left multiplication generalising Exercise 3.14, the (semi)stable locus for the character $\rho=\det$ is the matrices of maximal rank and the instability stratification is given by the rank (see [55, Example 2.14]).

Given an instability stratification (2), we can ask if there is a categorical $G$ -quotient of an unstable strata $S_{\beta}$ , or equivalently via the isomorphism (3), a categorical $P_{\lambda}$ -quotient of $Y_{\beta}^{ss}$ .

Proposition 3.27 (Categorical quotients of unstable strata, [57, Lemma 3.1]).

The composition $Y_{\beta}^{ss}\stackrel{{\scriptstyle p_{\beta}}}{{\longrightarrow}}Z_{\beta}^{ss}\stackrel{{\scriptstyle\pi}}{{\longrightarrow}}Z_{\beta}/\!/_{\mathcal{L}_{\beta}}L_{\lambda}$ of $p_{\beta}$ with the reductive GIT quotient $\pi$ is a categorical $P_{\lambda}$ -quotient.

Proof.

This composition is $P_{\lambda}$ -invariant, as $p_{\beta}$ is $q_{\lambda}$ -equivariant and $\pi$ is $L_{\lambda}$ -invariant. Given a $P_{\lambda}$ -invariant morphism $f:Y_{\beta}^{ss}\rightarrow S$ , its restriction $f|$ to $Z_{\beta}^{ss}$ is $L_{\lambda}$ -invariant and since $f$ is constant on orbit closures, we have $f=f|\circ\pi$ . Then by the universal property of $\pi$ , we see that $f|$ (and thus also $f$ ) factors uniquely via $Z_{\beta}/\!/_{\mathcal{L}_{\beta}}L_{\lambda}$ . ∎

Furthermore, by [57, Lemma 3.1] we have

[TABLE]

which is finitely generated, and the categorical quotient coincides with the projective spectrum of the invariants. However, this categorical quotient factors via the retraction $p_{\beta}$ and so identifies every $x$ with $p_{\beta}(x)=\lim_{t\rightarrow 0}\lambda(t)\cdot x$ . Thus this categorical quotient is far from being an orbit space. Since the closed subscheme $Z_{\beta}^{ss}$ causes these identifications, we would like to further twist the linearisation to make $Z_{\beta}^{ss}$ become unstable and prevent these unwanted identifications. It is at this point where we see that the non-reductive action of $P_{\lambda}$ on ${Y^{ss}_{\beta}}$ is preferable to the reductive action of $G$ on $S_{\beta}$ : the parabolic subgroup has more characters which can be used to twist the given ample linearisation; in $\S$ 6.3, we explain how to apply non-reductive GIT.

In fact, since linearisations give line bundles on quotient stacks, we have a line bundle

[TABLE]

but whilst the corresponding $P_{\lambda}$ -equivariant line bundle on $Y_{\beta}^{ss}$ is ample, the corresponding $G$ -equivariant line bundle on $S_{\beta}$ is not ample (see [57, Remark 9]) and so is not suitable for working with from the perspective of GIT.

4. Generalisations of reductive GIT to stacks

In $\S$ 4.1 we describe different types of moduli spaces for stacks and in $\S$ 4.2, we state a recent existence criterion [7] for stacks to admit a good moduli space.

Throughout this section, for simplicity, we will assume that our algebraically closed field $k$ is of characteristic [math] to avoid the distinction between linearly reductive and geometrically reductive groups in positive characteristic, which in turn leads to a distinction between good moduli spaces and adequate moduli spaces for stacks. We will assume all stacks are noetherian algebraic stacks over $k$ ; however, everything in this section also extends to a relative setting. For a detailed introduction to algebraic spaces and stacks with a focus on moduli, see [1, $\S$ 3].

4.1. Moduli spaces for stacks

Associated to an action $G\times X\rightarrow X$ , there is a quotient stack $[X/G]$ whose points (and residual gerbes) describe the orbits (and stabilisers) of the action. A categorical quotient of this action is equivalent to a universal map from $[X/G]$ to a scheme. One can naturally ask if an arbitrary stack has a universal map to a scheme (or possibly an algebraic space, as was necessary in $\S$ 2.2). If additionally, as in the definition of a coarse moduli space for a moduli functor, we ask for a bijection on $k$ -points, this leads to the following notion.

Definition 4.1.

A coarse moduli space (CMS) for a stack $\mathfrak{X}$ is a map $\mathfrak{X}\rightarrow X$ to an algebraic space which is initial for maps from $\mathfrak{X}$ to algebraic spaces and such that the induced map $|\mathfrak{X}(k)|\rightarrow X(k)$ is bijective.

Keel and Mori [59] studied quotients of groupoids in the category of algebraic spaces and proved the existence of a categorical quotient under the assumption of finite stabilisers; let us state a stacky reformulation as in [25].

Theorem 4.2 (Keel–Mori Theorem).

An algebraic stack $\mathfrak{X}$ over $k$ with finite inertia stack admits a coarse moduli space $\pi:\mathfrak{X}\rightarrow X$ with the following properties

i)

$\mathcal{O}_{X}\rightarrow\pi_{*}\mathcal{O}_{\mathfrak{X}}$ * is an isomorphism;* 2. ii)

If $\mathfrak{X}$ is separated (resp. of finite type) over $k$ , then $X$ is separated (resp. of finite type) over $k$ ; 3. iii)

$\pi$ * is a proper universal homeomorphism;* 4. iv)

Any flat base change of $\pi$ is also a coarse moduli space.

In particular, the Keel–Mori Theorem applies to separated Deligne–Mumford stacks. Requiring $\mathfrak{X}\rightarrow X$ to induce a bijection on closed points is very strong: as soon as there are non-closed orbits this condition fails. Fortunately Alper [2] adapted the reductive GIT notion of good quotient to the setting of stacks as follows.

Definition 4.3.

A good moduli space (GMS) for a stack $\mathfrak{X}$ is a quasi-compact quasi-separated morphism $f:\mathfrak{X}\rightarrow X$ to an algebraic space such that

(1)

the pushforward map $f_{*}:\mathcal{Q}Coh(\mathfrak{X})\rightarrow\mathcal{Q}Coh(X)$ on quasi-coherent sheaves is exact, 2. (2)

the natural map $\mathcal{O}_{X}\rightarrow f_{*}\mathcal{O}_{\mathfrak{X}}$ is an isomorphism.

The following example shows this theory applies to quotients of linearly reductive groups.

Example 4.4 (Alper, [2, Example 12.9]).

For the classifying stack $BG=[\operatorname{Spec}k/G]$ of an affine algebraic group, the map $\pi:BG\rightarrow\operatorname{Spec}k$ satisfies the second property in the definition of a good moduli space. Then $\pi_{*}:\mathcal{V}ect^{G}_{k}\rightarrow\mathcal{V}ect_{k}$ is given by taking $G$ -invariants of a linear representation of $G$ . Hence, the first condition holds if and only if $G$ is linearly reductive.

Subsequently, Alper later adapted his theory to include geometrically reductive groups in positive and mixed characteristic by giving a notion of an adequate moduli space [3], which weakens the first property in the definition of good moduli spaces.

Example 4.5 (Alper, [2, Theorem 13.6]).

For a linearly reductive group $G$ acting on an affine scheme $X$ , the quotient stack admits a good moduli space $[X/G]\rightarrow X/\!/G$ given by the affine GIT quotient. More generally, for a linearly reductive group acting on a scheme $Y$ with respect to an ample linearisation $\mathcal{L}$ , the morphism $[Y^{ss}(\mathcal{L})/G]\rightarrow Y/\!/_{\!\mathcal{L}}G$ is a good moduli space.

Let us note some important properties of good moduli spaces.

Remark 4.6.

If $f:\mathfrak{X}\rightarrow X$ is a good moduli space, then it has the following properties.

(1)

The morphism $f$ is initial among maps from $\mathfrak{X}$ to algebraic spaces [2, Theorem 6.6] and is surjective and universally closed [2, Theorem 4.16 (i) and (ii)]. 2. (2)

For every point $x\in X$ , there is a unique closed point $x_{0}$ in $f^{-1}(x)$ and the automorphism group of $x_{0}$ is linearly reductive [2, Theorem 9.1 and Proposition 12.14]. 3. (3)

The morphism $f$ induces a bijection between closed points in $\mathfrak{X}$ and closed points in $X$ [2, Theorem 4.16 (iv)]. 4. (4)

If $\mathfrak{X}$ is of finite type over $k$ , then so is $X$ [2, Theorem 4.16 (xi)]. 5. (5)

Any base change of $f$ along a morphism of quasi-separated algebraic spaces is also a good moduli space [2, Proposition 4.7].

4.2. Stability and existence criteria

Halpern-Leistner [46] and Heinloth [52] studied how ideas in reductive GIT, such as the Hilbert–Mumford criterion, can be applied to stacks. The role of 1-PSs and their limits can be replaced by the stack Theta:

[TABLE]

over $\operatorname{Spec}\mathbb{Z}$ , for the $\mathbb{G}_{m}$ -action on $\mathbb{A}^{1}=\operatorname{Spec}k[x]$ by scalar multiplication. This stack plays a prominent role in the work of Halpern-Leistner [46] and led to a notion of $\Theta$ -stability for stacks and a generalisation of GIT instability stratifications to $\Theta$ -stratifications of stacks [48].

In this section, we will state a recent existence theorem of Alper, Halpern-Leistner and Heinloth [7] which gives necessary and sufficient conditions for a stack to admit a good moduli space. These conditions are valuative criteria known as $\Theta$ -reductivity and S-completeness.

Definition 4.7 (Valuative criteria for stacks).

A noetherian algebraic stack $\mathfrak{X}$ is said to be

i)

$\Theta$ -reductive if for any DVR $R$ , any morphism $\Theta_{R}\setminus\{0\}\rightarrow\mathfrak{X}$ extends uniquely to $\Theta_{R}$ , where $\Theta_{R}=\Theta\times_{\mathbb{Z}}\operatorname{Spec}R$ and $0\in\Theta_{R}$ denotes the unique closed point. 2. ii)

S-complete if for any DVR $R$ , any morphism $\overline{\mathrm{ST}}_{R}\setminus\{0\}\rightarrow\mathfrak{X}$ extends uniquely to

[TABLE]

for a uniformiser $\pi$ and $\mathbb{G}_{m}$ -action with weights $+1,-1$ on $s,t$ .

The stack $\overline{\mathrm{ST}}_{R}$ originates from work of Heinloth [52] and naturally generalises Example 2.14 where $\mathbb{G}_{m}$ acts on $\mathbb{A}^{2}$ with weights $+1,-1$ . Recall that after removing the origin, $\mathbb{A}^{2}\setminus\{0\}$ has non-separated geometric quotient given by the affine line with two origins. S-completeness should be thought of as a stacky valuative criterion for separatedness.

Remark 4.8.

Assume that $\mathfrak{M}$ is a moduli stack for objects in an abelian category as in [7, $\S$ 7]. Then a morphism $\Theta_{k}\rightarrow\mathfrak{M}$ is a (weighted) filtration on a family of $\mathfrak{M}$ over $k$ such that the associated graded lies in $\mathfrak{M}$ . Let $R$ be a DVR with fraction field $K$ and residue field $k=R/(\pi)$ , where $\pi$ denotes a uniformiser. In this case, we can interpret the above conditions as follows.

(1)

A morphism $\Theta_{R}\setminus\{0\}\rightarrow\mathfrak{M}$ is given by a family of $\mathfrak{M}$ over the DVR $R$ together with a filtration on the generic fibre $K$ (whose associated graded object lies in $\mathfrak{M}$ ). This extends uniquely to $\Theta_{R}$ if and only if the filtration on the generic fibre extends uniquely to the special fibre (again with the associated graded object lying in $\mathfrak{M}$ ). 2. (2)

A morphism $\overline{\mathrm{ST}}_{R}\setminus\{0\}\rightarrow\mathfrak{M}$ is equivalent to two families of $\mathfrak{M}$ over $R$ whose generic fibres over $K$ are equivalent. This extends uniquely to $\overline{\mathrm{ST}}_{R}$ if and only if the special fibres have filtrations whose associated graded objects are isomorphic777For the stack of semistable vector bundles on a curve, this asks for the bundles on the special fibre to be S-equivalent. in $\mathfrak{M}$ .

This can be seen from looking at the following diagrams appearing in [1, $\S$ 6.7.2]

[TABLE]

where the left side of the diagram corresponds to the open immersion $\Theta_{R}\setminus\{0\}\hookrightarrow\Theta_{R}$ and all morphisms on the left are open immersions, and the right side represents the closed immersion $\{0\}\hookrightarrow\Theta_{R}$ and all morphisms on the right are closed immersions.

There is a similar diagram for $\overline{\mathrm{ST}}_{R}$ :

[TABLE]

We can now state the existence theorem of Alper–Halpern-Leistner–Heinloth [7].

Theorem 4.9 (Existence criterion for GMS, [7, Theorem A]).

Let $\mathfrak{X}$ be an algebraic stack of finite type over $k$ of characteristic zero, with affine diagonal. Then $\mathfrak{X}$ admits a separated good moduli space $X$ if and only if $\mathfrak{X}$ is $\Theta$ -reductive and S-complete. Moreover, $X$ is proper if and only if $\mathfrak{X}$ satisfies the existence part of the valuative criterion for properness.

In characteristic zero, the existence criterion is much nicer due to the existence of étale local quotient presentations around closed points with reductive stabiliser due to Alper–Hall–Rydh [6], which is a stacky generalisation of Luna’s étale slice theorem [68]. Showing these conditions are necessary for $\mathfrak{X}$ to have a good moduli space is relatively formal. To show these conditions suffice to construct a good moduli space, one first uses S-completeness to show closed points have reductive stabilisers and then one glues the affine GIT quotients associated to the étale local quotient presentations, which uses both $\Theta$ -reductivity and S-completeness.

In positive characteristic there is also an existence theorem for adequate moduli spaces, but as an input it requires the existence of étale local quotient presentations, which is part of the local reductivity assumption in [7, Theorem A].

With this existence criterion in hand, one can construct moduli spaces without GIT as follows.

(1)

Interpret the moduli problem as an algebraic stack $\mathfrak{M}$ . 2. (2)

Apply an existence theorem to obtain a (proper) good moduli space $\mathfrak{M}\rightarrow M$ .

However, this only yields a proper good moduli space in cases where GIT would provide a projective moduli space. Consequently, one further step is required for this strategy:

(3)

Find an ample line bundle on $M$ to show it is projective.

This approach has been implemented for moduli of smooth projective curves in [24], where the second step uses the Keel–Mori Theorem and the third step follows Kollar’s proof of projectivity using the determinant of a relative pluricanonical sheaf for the universal family; and for moduli of vector bundles on a smooth projective curve in characteristic zero in [4], where the second step uses Theorem 4.9 and the third step follows Faltings’ proof of projectivity using a determinantal line bundle constructed from the universal family. For moduli of representations of an acyclic quiver (in arbitrary characteristic), a new moduli-theoretic proof of projectivity was given in [13], where again a determinantal line bundle is used. Of course, in all these cases, reductive GIT could be applied to produce the moduli space without this extra work! However, this intrinsic moduli-theoretic approach to projectivity does give new insights: the line bundle obtained in the final step is of inherent interest to the moduli problem and the techniques give new effective bounds for global generation of determinantal line bundles (for example, see [13, Theorem B]).

The advantage of this approach is that one does not need a quotient presentation of the stack; for example, for moduli of Bridgeland semistable objects in a derived category without a quotient presentation, this approach can be applied provided the stack is algebraic as in [93, Theorem 1.3]. There have been impressive applications to moduli of K-polystable Fano varieties [5] and moduli of torsors for a Bruhat–Tits group scheme over a curve [7, Theorem 8.1].

Since the theory of good (and adequate) moduli spaces is based on GIT for reductive groups, there are still many interesting moduli problems which do not admit good (or adequate) moduli spaces, such as moduli stacks of weighted projective hypersurfaces, where non-reductive groups naturally appear (see $\S$ 6.2).

5. Non-reductive geometric invariant theory

In this section, we will describe recent progress on non-reductive GIT (in characteristic zero) due to Bérczi, Doran, Hawes and Kirwan [15, 14] concerning non-reductive groups with graded unipotent radicals. Before we describe their approach, we start by illustrating the issues in constructing non-reductive quotients in $\S$ 5.1 and give a survey of some previous contributions in $\S$ 5.2. We then proceed to explain the relationship between $\mathbb{G}_{a}$ -actions and locally nilpotent derivations in $\S$ 5.3, as well as their interaction with $\mathbb{G}_{m}$ -actions in $\S$ 5.4. In $\S$ 5.5 we introduce graded unipotent groups and state the main results of [15, 14] in $\S$ 5.6, before giving details on some of the proofs in $\S$ 5.7. Throughout this section, we assume $k$ is of characteristic zero.

5.1. Examples of bad behaviour of additive actions

The ring of invariants being non-finitely generated for a non-reductive group is not the only issue that arises when trying to construct quotients of non-reductive group actions. Even when the ring of invariants is finitely generated (for example, see Theorem 2.9), there can be further issues:

(1)

The quotient morphism given by the inclusion of invariants may fail to be surjective (and in general, its image is only a constructible subset), so will not be a good quotient. 2. (2)

There may not be enough invariants to separate disjoint closed orbits in contrast to the case for geometrically reductive groups (Lemma 2.7). 3. (3)

Invariants may not extend to the ambient space.

Let us give some concrete examples of linear $\mathbb{G}_{a}$ -actions that demonstrate these issues.

Example 5.1.

The following examples are all built from powers (or symmetric powers) of the standard representation of the additive group on $\mathbb{A}^{2}$ given by $\mathbb{G}_{a}<\mathrm{SL}_{2}$ (as the upper triangular unipotent radical) acting on $V=\mathbb{A}^{2}$ by left multiplication; thus Theorem 2.9 applies.

(1)

For $\mathbb{G}_{a}$ acting on $V\times V=\mathbb{A}^{4}$ via $u\cdot(x_{1},x_{2},x_{3},x_{4})=(x_{1}+ux_{2},x_{2},x_{3}+ux_{4},x_{4})$ ,

[TABLE]

and the (invariant theoretic) quotient map $\pi:\mathbb{A}^{4}\rightarrow\mathbb{A}^{3}\cong\operatorname{Spec}\mathcal{O}(V\times V)^{\mathbb{G}_{a}}$ only has constructible image, as it misses the punctured line $\{(0,0,\eta):\eta\neq 0\}$ . 2. (2)

For $\mathbb{G}_{a}$ acting on $\operatorname{Sym}^{2}(V)=\mathbb{A}^{3}$ via $u\cdot(x_{1},x_{2},x_{3})=(x_{1}+2ux_{2}+u^{2}x_{3},x_{2}+ux_{3},x_{3})$ ,

[TABLE]

and these invariants do not separate the closed orbits $\mathbb{G}_{a}\cdot(1,1,0)=\{(\eta,1,0)\}$ and $\mathbb{G}_{a}\cdot(1,-1,0)=\{(\eta,-1,0)\}$ . 3. (3)

For $\mathbb{G}_{a}$ acting on $\operatorname{Sym}^{2}(V)=\mathbb{A}^{3}$ as in (2), the $\mathbb{G}_{a}$ -invariant closed subscheme $X=\{x_{3}=0\}$ has an invariant function $x_{2}\in\mathcal{O}(X)^{\mathbb{G}_{a}}$ which does not extend to $\mathcal{O}(\mathbb{A}^{3})^{\mathbb{G}_{a}}$ .

In particular, just trying to get rings of invariants to be finitely generated will not suffice to generalise the nice properties of reductive GIT.

Even in the case of free $\mathbb{G}_{a}$ -actions, there are examples which do not admit a geometric quotient (for example, see [29, Example 18] which is proved via a geometrical argument).

5.2. Short historical note on non-reductive group actions

Let us summarise some of the contributions towards the development of non-reductive GIT, before turning to [15, 14].

As mentioned in Remarks 2.2 and 2.10, the transfer principle was used by Grosshans [41] to prove finite generation in certain cases. Grosshans proved the unipotent radical $U$ of a parabolic subgroup in a reductive group $G$ is a Grosshans group and thus if a $U$ -action on an affine variety extends to $G$ , then the ring of $U$ -invariant functions is finitely generated [42]; his proof uses a grading by weights of a maximal torus (see [22, Theorem 2.7]), which will also play an important role in non-reductive GIT.

Fauntleroy defines global intrinsic notions of semistability for a connected unipotent group action on a quasi-affine normal variety in terms of properties of invariant sections and showed if the stabilisers are finite that the open semistable set admits a categorical quotient [33, Theorem 5]. In subsequent work [34], he more generally defined a notion of properly stable points for a linearised action of a connected linear algebraic group on a normal projective variety and shows this locus has a quasi-projective geometric quotient. Fauntleroy combined ideas from reductive GIT with Seshadri covers, which Seshadri used to show actions of connected linear algebraic groups on a normal variety with finite stabiliser groups admit geometric quotients up to a finite equivariant replacement [87].

For non-reductive actions on an affine scheme, Winkelmann [95] showed there is a rational quotient map to a quasi-affine variety, whose coordinate ring is the ring of invariants (and is not necessarily finitely generated). In fact, he showed the study of coordinate rings of quasi-affine varieties corresponds to the study of rings of invariants for $\mathbb{G}_{a}$ -actions on affine varieties.

Alternatively, one can ignore the issue of whether or not rings of invariants are finitely generated and work in the category of all schemes (not necessarily of finite type) over $k$ ; Greuel and Pfister take this approach in [40] to define a notion of stability for unipotent group actions and show there is a geometric quotient in the category of varieties. Their motivation came from the study of singularities, where non-reductive translation actions appear.

We note that additive group actions also arise as translation actions in affine geometry and have led to progress on classical questions on affine spaces such as the cancellation problem, the existence of exotic affine spaces and the Jacobian conjecture (for example, see [66]).

Doran and Kirwan [32] give various notions of (semi)stability for non-reductive GIT using properties of invariants and by transferring the problem to a reductive GIT setting using a notion of ‘fine reductive envelope’ and they obtain various types of quotients of these (semi)stable sets.

The recent progress on non-reductive GIT [15], which we explain below, uses a multiplicative group to grade the unipotent radical; this enables the construction of geometric unipotent quotients, as well as providing a natural projective completion and an explicit Hilbert–Mumford type description of stability. To trace back the origin of this grading multiplicative group, we begin with the correspondence between $\mathbb{G}_{a}$ -actions and locally nilpotent derivations, and will see these multiplicative group actions naturally appear when there is a slice of the $\mathbb{G}_{a}$ -action.

5.3. Actions of the additive group

Since $k$ is of characteristic zero, we can utilise the dictionary between additive group actions and locally nilpotent derivations (e.g. see [36]).

For an action $\sigma:\mathbb{G}_{a}\times X\rightarrow X$ on an affine scheme, the coaction

[TABLE]

can be used to define a derivation $D_{\sigma}:\mathcal{O}(X)\rightarrow\mathcal{O}(X)$ given by $D_{\sigma}(f):=\frac{\partial}{\partial t}(\sigma^{*}(f))|_{t=0}$ satisfying the Leibniz rule $D_{\sigma}(fg)=fD_{\sigma}(g)+D_{\sigma}(f)g$ . This derivation is locally nilpotent (i.e. for any $f\in\mathcal{O}(X)$ , there is $n\in\mathbb{N}$ such that $D_{\sigma}^{n}(f)=0$ ), as one can inductively show

[TABLE]

and since the left side is a polynomial in $t$ , we must have $D_{\sigma}^{n}(f)=0$ for all $n$ sufficiently large.

Conversely, given a locally nilpotent derivation $D:A\rightarrow A$ on the $k$ -algebra $A=\mathcal{O}(X)$ of functions on an affine scheme, we can exponentiate to construct a coaction

[TABLE]

which is well-defined, as $D$ is locally nilpotent.

These constructions are inverse to each other and we collect some other results relating these geometric and algebraic points of view.

Proposition 5.2.

For an affine variety $X$ with coordinate ring $A=\mathcal{O}(X)$ , there is a bijective correspondence

[TABLE]

For an action $\sigma$ corresponding to a locally nilpotent derivation $D$ , the following statements hold.

i)

The ring of invariants is the kernel of the derivation: $\mathcal{O}(X)^{\mathbb{G}_{a}}=\ker(D)$ , 2. ii)

$x\in X$ * is $\mathbb{G}_{a}$ -fixed if and only if $D(A)\subset\mathfrak{m}_{x}$ .* 3. iii)

If $D$ has a slice (i.e. there exists $s\in A=\mathcal{O}(X)$ such that $D(s)=1$ ), then $\mathcal{O}(X)^{\mathbb{G}_{a}}$ is a finitely generated $k$ -algebra, the subscheme $S:=\{s=0\}\subset X$ is a geometric slice of the $\mathbb{G}_{a}$ -action and $X\rightarrow\operatorname{Spec}\mathcal{O}(X)^{\mathbb{G}_{a}}$ is a trivial principal $\mathbb{G}_{a}$ -bundle.

Proof.

Let us just give some details on the third statement, as this will be important in what follows. Suppose that $s\in\mathcal{O}(X)$ is a slice for $D$ . Then define a $k$ -algebra homomorphism

[TABLE]

such that $\operatorname{Im}(\Phi)=\ker(D)$ . In particular, $\mathcal{O}(X)^{\mathbb{G}_{a}}=\ker(D)=\operatorname{Im}(\Phi)$ is a finitely generated $k$ -algebra: the images under $\Phi$ of generators for the algebra $A=\mathcal{O}(X)$ are generators of $\mathcal{O}(X)^{\mathbb{G}_{a}}$ .

By induction, any $f\in\mathcal{O}(X)$ is a polynomial in $s$ with coefficient in $\operatorname{Im}(\Phi)=\mathcal{O}(X)^{\mathbb{G}_{a}}$ :

[TABLE]

and thus $\mathcal{O}(X)=\mathcal{O}(X)^{\mathbb{G}_{a}}[s]$ and $\mathcal{O}(X)^{\mathbb{G}_{a}}=\mathcal{O}(X)/(s)$ . Moreover $S=\{s=0\}\subset X$ is isomorphic to $\operatorname{Spec}\mathcal{O}(X)^{\mathbb{G}_{a}}$ . We claim that $S$ is a geometric slice; that is $\mathbb{G}_{a}\times S\rightarrow X$ given by $(u,x)\rightarrow\sigma(u,x)$ is an isomorphism. Indeed, by considering the following commutative diagrams:

[TABLE]

one can construct the inverse. ∎

Note that if there is a slice, then the coordinate ring is a polynomial ring in a single variable with coefficients in the ring of invariants. In particular, the coordinate ring is naturally graded by $\mathbb{N}$ (the degree of this polynomial) and thus this gives a $\mathbb{G}_{m}$ -action. This naturally leads to considering actions of semi-direct products of $\mathbb{G}_{a}$ and $\mathbb{G}_{m}$ .

5.4. Semi-direct products of additive and multiplicative groups

For an affine scheme $X$ , a $\mathbb{G}_{m}$ -action on $X$ is equivalent to a $\mathbb{Z}$ -grading of the $k$ -algebra $\mathcal{O}(X)$ . The limit under the $\mathbb{G}_{m}$ -action as $t\rightarrow 0$ exists for all $x\in X$ if and only if $\mathbb{G}_{m}$ acts with non-negative weights on $X$ , which is if and only if the grading on $\mathcal{O}(X)$ is supported in non-positive degrees.

Definition 5.3.

A semi-direct product of $\mathbb{G}_{a}$ and $\mathbb{G}_{m}$ is given by specifying a group homomorphism $\varphi:\mathbb{G}_{m}\rightarrow\operatorname{Aut}(\mathbb{G}_{a})$ and defining the semi-direct product $\mathbb{G}_{a}\rtimes_{\varphi}\mathbb{G}_{m}$ to have underlying set $\mathbb{G}_{a}\times\mathbb{G}_{m}$ with group operation

[TABLE]

For $n\in\mathbb{Z}$ , let us define $\varphi$ by $t\mapsto t^{-n}$ and write $\mathbb{G}_{a}\rtimes_{n}\mathbb{G}_{m}$ for the associated semi-direct product, where $tut^{-1}=t^{n}u$ .

Example 5.4.

The upper triangular Borel $B<\mathrm{SL}_{2}$ is isomorphic to a semi-direct product $\mathbb{G}_{a}\rtimes_{2}\mathbb{G}_{m}$ , via $(u,t)\mapsto\left(\begin{smallmatrix}t&ut\\ 0&t^{-1}\end{smallmatrix}\right).$

Remark 5.5.

For an action of a semi-direct product $\mathbb{G}_{a}\rtimes_{n}\mathbb{G}_{m}$ on an affine scheme, the locally nilpotent derivation $D$ associated to the $\mathbb{G}_{a}$ -action is homogeneous of degree $n$ with respect to the grading $\mathcal{O}(X)=\bigoplus_{r\in\mathbb{Z}}\mathcal{O}(X)_{r}$ determined by the $\mathbb{G}_{m}$ -action; that is, $D(\mathcal{O}(X)_{r})\subset\mathcal{O}(X)_{r+n}$ .

The following result is the key proposition which enables the inductive construction of quotients of unipotent group actions in the presence of an appropriate $\mathbb{G}_{m}$ -action grading the unipotent action. This is a modification of [14, Lemma 7.3].

Proposition 5.6 (Key Proposition).

For an affine $\mathbb{G}_{a}$ -scheme $X$ with locally nilpotent derivation $D:\mathcal{O}(X)\rightarrow\mathcal{O}(X)$ , the following statements are equivalent:

(1)

$D$ * has a slice (i.e. there exists $s\in\mathcal{O}(X)$ with $D(s)=1$ ),* 2. (2)

The $\mathbb{G}_{a}$ -action extends to $\mathbb{G}_{a}\rtimes_{n}\mathbb{G}_{m}$ for some $n>0$ such that

(a)

$\lim_{t\rightarrow 0}t\cdot x$ * exists for all $x\in X$ and* 2. (b)

$\operatorname{Stab}_{\mathbb{G}_{a}}(z)=\{e\}$ * for all $z\in Z:=\{\lim_{t\rightarrow 0}t\cdot x\}$ .*

In particular, if (2) holds, then there is a trivial $U$ -quotient $X\mapsto\mathcal{O}(X)^{\mathbb{G}_{a}}$ .

Proof.

Suppose $D$ has a slice $s$ ; then $\mathcal{O}(X)=\mathcal{O}(X)^{\mathbb{G}_{a}}[s]$ as in the proof of Proposition 5.2. Since $\mathcal{O}(X)$ is a polynomial ring in $s$ , it is naturally graded, and we choose a $\mathbb{Z}_{\leq 0}$ -grading given by $\mathcal{O}(X)_{-r}=\mathcal{O}(X)^{\mathbb{G}_{a}}s^{r}$ so that $\lim_{t\rightarrow 0}t\cdot x$ exists for all $x\in X$ . Since $D(s)=1$ , we see that $D$ is homogeneous of degree $1$ and thus there is an action of $\mathbb{G}_{a}\rtimes_{1}\mathbb{G}_{m}$ . Since $X\rightarrow\operatorname{Spec}\mathcal{O}(X)^{\mathbb{G}_{a}}$ is a trivial $\mathbb{G}_{a}$ -bundle (see Proposition 5.2), all $\mathbb{G}_{a}$ -stabilisers are trivial.

Let us outline the converse direction (see [14, Lemma 7.3] for the full proof). Given $n>0$ and a $\mathbb{G}_{a}\rtimes_{n}\mathbb{G}_{m}$ -action on $X$ such that $\lim_{t\rightarrow 0}t\cdot x$ exists for all $x\in X$ and $\operatorname{Stab}_{\mathbb{G}_{a}}(z)=\{e\}$ for all $z\in Z$ , we note that $D$ has homogeneous degree $n$ with respect to the grading $\mathcal{O}(X)=\oplus_{r}\mathcal{O}(X)_{r}$ given by the $\mathbb{G}_{m}$ -action. By the assumption that $\lim_{t\rightarrow 0}t\cdot x$ exists for all $x\in X$ , this grading is supported in non-positive degrees. We have

[TABLE]

and we will show that $D(\mathcal{O}(X)_{-n})=\mathcal{O}(X)_{0}$ , so there exists $s\in\mathcal{O}(X)_{-n}$ with $D(s)=1$ . Let

[TABLE]

which is $\mathbb{G}_{a}$ -stable by (4) and also $\mathbb{G}_{m}$ -stable, as $\mathcal{O}(X)_{<0}$ is a sum of $\mathbb{G}_{m}$ -weight spaces and the $\mathbb{G}_{m}$ -action on $D(\mathcal{O}(X)_{-n})\subset\mathcal{O}(X)_{0}$ is trivial. Then it suffices to show that $I$ is an ideal (Claim 1) and moreover $I=\mathcal{O}(X)$ (Claim 2). Indeed, as $I=\mathcal{O}(X)$ we must have $D(\mathcal{O}(X)_{-n})=\mathcal{O}(X)_{0}$ and so $D(s)=1$ for some $s\in\mathcal{O}(X)_{-n}$ .

To prove Claim 1, we need to show for $f\in I$ and $h\in\mathcal{O}(X)$ that $hf\in I$ ; for this we can write $h=\sum h_{r}$ with respect to the $\mathbb{G}_{m}$ -grading and it suffices to show $h_{r}f\in I$ for all $r$ . We can also write $f=D(p_{-n})+\sum_{r<0}f_{r}$ by (5). The only non-trivial case is for $h=h_{0}$ and $f=D(p_{-n})$ ; however, by the Leibniz rule $D(h_{0}p_{-n})=h_{0}D(p_{-n})$ and so $h_{0}f=D(h_{0}p_{-n})\in I$ .

To prove Claim 2, we argue by contradiction. If $I\subsetneq\mathcal{O}(X)$ , then it is contained in a maximal ideal $\mathfrak{m}_{x}$ . Since $I$ is $\mathbb{G}_{a}\rtimes_{n}\mathbb{G}_{m}$ -stable, so is $\mathfrak{m}_{x}$ and so it corresponds to a $\mathbb{G}_{a}\rtimes_{n}\mathbb{G}_{m}$ -fixed point $x\in Z$ . However, this contradicts the assumption that $\operatorname{Stab}_{\mathbb{G}_{a}}(z)=\{e\}$ for all $z\in Z$ . ∎

Note that there is a choice of sign here. One could consider actions of $\mathbb{G}_{a}\rtimes_{n}\mathbb{G}_{m}$ for $n<0$ such that $\lim_{t\rightarrow\infty}t\cdot x$ exists for all $x\in X$ and obtain an analogous result.

Remark 5.7.

For a linear representation $\mathbb{G}_{a}\rtimes\mathbb{G}_{m}\rightarrow\mathrm{GL}(V)$ , we have $V_{\max}\subset V^{\mathbb{G}_{a}}$ , where $V_{\max}$ is the weight space for the maximal $\mathbb{G}_{m}$ -weight.

Example 5.8.

Consider the upper triangular Borel $\mathbb{G}_{a}\rtimes_{2}\mathbb{G}_{m}\cong B<\mathrm{SL}_{2}$ acting on $V=\mathbb{A}^{2}$ by left multiplication. We have

[TABLE]

and $V_{\max}=V_{+1}\subset V^{\mathbb{G}_{a}}$ .

5.5. Graded unipotent groups

Generalising Proposition 5.6, we will consider actions of unipotent groups which are graded by a $\mathbb{G}_{m}$ -action in the following sense.

Definition 5.9.

A graded unipotent group is a semi-direct product $\widehat{U}:=U\rtimes\mathbb{G}_{m}$ of a unipotent group with a multiplicative group such that the conjugation of $\mathbb{G}_{m}$ on the Lie algebra of $U$ has strictly positive weights.

Example 5.10.

For $n>0$ , the group $\mathbb{G}_{a}\rtimes_{n}\mathbb{G}_{m}$ is a graded unipotent group. In particular, the upper triangular Borel $B<\mathrm{SL}_{2}$ is a graded unipotent group.

Proposition 5.11.

Let $X$ be an affine scheme with an action of a graded unipotent group $\widehat{U}:=U\rtimes\mathbb{G}_{m}$ such that $\lim_{t\rightarrow 0}t\cdot x$ exists for all $x\in X$ and $\operatorname{Stab}_{U}(z)=\{e\}$ for all $z\in Z:=\{\lim_{t\rightarrow 0}t\cdot x\}$ , then $\mathcal{O}(X)^{U}$ is finitely generated and $X\rightarrow\operatorname{Spec}\mathcal{O}(X)^{U}$ is a trivial $U$ -quotient.

Proof.

The idea is to iteratively apply Proposition 5.6 as in the proof of [14, Proposition 7.4]. By lifting a filtration on the Lie algebra via the exponential map, we obtain normal subgroups

[TABLE]

whose successive quotients are copies of $\mathbb{G}_{a}$ on which $\mathbb{G}_{m}$ acts by conjugation with strictly positive weights. Assume that we have constructed a quotient $q_{j}:X\rightarrow X_{j}:=\operatorname{Spec}\mathcal{O}(X)^{U_{j}}$ which is a trivial $U_{j}$ -quotient. The base case for $j=1$ is given by Proposition 5.6. For the inductive step, we claim that we can apply Proposition 5.6 to the $\mathbb{G}_{a}\cong U_{j+1}/U_{j}$ action on $X_{j}$ to show there is a trivial $U_{j+1}/U_{j}$ -quotient $X_{j}\rightarrow\mathcal{O}(X_{j})^{U_{j+1}/U_{j}}$ . Then the composition

[TABLE]

is a principal $U_{j}$ -bundle (see [17, Proposition 4.7]), which is trivial as the base is affine by [9, Theorem 3.12].

To complete the proof, we must show that for the $\mathbb{G}_{a}\cong U_{j+1}/U_{j}$ action on $X_{j}$ graded by $\mathbb{G}_{m}<\widehat{U}$ all limits $\lim_{t\rightarrow 0}t\cdot x_{j}$ exists for $x_{j}\in X_{j}$ and $\operatorname{Stab}_{U_{j+1}/U_{j}}(x_{j})=\{e\}$ for all $x_{j}\in X_{j}^{\mathbb{G}_{m}}$ in order to be able to apply Proposition 5.6. Since $q_{j}$ is $\mathbb{G}_{m}$ -equivariant, we have

[TABLE]

Thus the set $Z_{j}$ of such limits in $X_{j}$ is contained in $q_{j}(Z)$ . For a point $q_{j}(z)\in Z_{j}$ , suppose that $uU_{j}\in\operatorname{Stab}_{U_{j+1}/U_{j}}(q_{j}(z))$ ; that is, there exists $u^{\prime}\in U_{j}$ such that $u^{\prime}z=uz$ . However, by assumption $\operatorname{Stab}_{U}(z)=\{e\}$ and so we have $u=u^{\prime}\in U_{j}$ , which means $\operatorname{Stab}_{U_{j+1}/U_{j}}(q_{j}(z))$ is trivial and we can apply Proposition 5.6 to $X_{j}$ as claimed. ∎

Remark 5.12.

For a representation $\rho:\widehat{U}\rightarrow\mathrm{GL}(V)$ , note that $V_{\max}\subset V^{U}$ .

In particular, rather than trying to find $U$ -invariant sections, we can use $\mathbb{G}_{m}$ -maximal sections to construct a non-reductive GIT quotient as in the next subsection.

5.6. Statement of the key results in non-reductive GIT

Instead of working with a linearised action on a projective scheme, for simplicity we will assume that we have a linear action on $X=\mathbb{P}(V)$ as in $\S$ 2.6. As before, in the case of a (very ample) linearisation $\mathcal{L}$ on $X$ , we get a projective embedding $X\hookrightarrow\mathbb{P}(V)$ where $V=H^{0}(X,\mathcal{L})^{*}$ has a linear action.

This concerns actions of affine algebraic groups, whose unipotent radical is graded by a $\mathbb{G}_{m}$ .

Definition 5.13.

Let $G=U\rtimes R$ be an affine algebraic group with unipotent radical $U$ and reductive Levi factor $R$ . A 1-PS $\lambda:\mathbb{G}_{m}\rightarrow Z(R)$ is said to grade $U$ if its conjugation action on $\operatorname{Lie}U$ has strictly positive weights. In this case, we say $G$ has graded unipotent radical (by $\lambda$ ).

This is what is called an internal grading in [14], where there is also a more general notion of an external grading; however, we will stick to the simpler internal point of view, as this suffices in all examples and applications we consider.

There could be several central 1-PSs in $Z(R)$ that grade $U$ , but we will assume we have fixed a grading $\mathbb{G}_{m}$ and write $\widehat{U}=U\times\mathbb{G}_{m}$ . Different grading 1-PSs gives rise to different quotients, so can be thought of as an additional choice to the linearisation (see Remark 5.17 below).

Example 5.14.

A parabolic subgroup $P$ of $\mathrm{GL}_{n}$ (or more generally any reductive group) has graded unipotent radical: write $P=P_{\lambda}$ for a 1-PS $\lambda$ , then $P_{\lambda}=U_{\lambda}\rtimes L_{\lambda}$ , with $\lambda$ being central in $L_{\lambda}$ and grading $U_{\lambda}$ .

Definition 5.15 (Minimal weight space and attracting set).

For a multiplicative group $\mathbb{G}_{m}$ acting linearly on $X=\mathbb{P}(V)$ , we define the minimal weight space $Z_{\min}$ and minimal attracting set $X_{\min}$ as follows:

[TABLE]

where $\omega_{\min}=\omega_{0}<\omega_{1}<\dots<\omega_{n}$ are the $\mathbb{G}_{m}$ -weights on $V$ and $V_{\min}=V_{\omega_{\min}}$ .

In [15], the minimal attracting set $X_{\min}$ is denoted $X_{\min}^{0}$ , but we have simplified the notation. For the associated Białynicki-Birula stratification [19] (flowing as $t\rightarrow 0$ ), the variety $X_{\min}$ is the open stratum and $p:X_{\min}\rightarrow Z_{\min}$ is a Zariski locally trivial affine space fibration.

In [14], there are two types of assumptions needed for the construction of non-reductive GIT quotients: the first (Definition 5.16) concerns the positioning of the weights for the linearised action of the grading $\mathbb{G}_{m}$ , which selects a particular VGIT chamber for $\mathbb{G}_{m}$ and can be achieved by twisting the linearisation by a (rational) character $\chi:\widehat{U}\rightarrow\mathbb{G}_{m}$ , and the second (Assumption $[\widehat{U}]_{0}$ in Definition 5.18) requires certain unipotent stabiliser groups to be trivial, which is referred to as semistability coincides with stability in [14].

Definition 5.16 (Adapted linearisation).

Let $G=U\rtimes R$ be a group with unipotent radical graded by $\mathbb{G}_{m}<Z(R)$ acting linearly on $X=\mathbb{P}(V)$ . We say this linearised action is adapted if the $\mathbb{G}_{m}$ -weights on $V$ satisfy

[TABLE]

Remark 5.17.

The assumption that the linearised action is adapted fixes a particular VGIT chamber for the $\mathbb{G}_{m}$ -action, where (semi)stability is given by

[TABLE]

Since twisting the linearisation by a character $\chi:\widehat{U}\rightarrow\mathbb{G}_{m}$ shifts the weights by $-\chi$ (where we identify characters of $\widehat{U}$ with integers $\mathbb{Z}$ such that $1\in\mathbb{Z}$ corresponds to a character $\widehat{U}\rightarrow\mathbb{G}_{m}$ with kernel $U$ ), if we are prepared to modify the linearisation we can always arrange for this condition to hold. Note that this shift of weights does not change the minimal weight space $Z_{\min}$ and attracting set $X_{\min}$ .

Definition 5.18 (Stabiliser assumptions).

Let $G=U\rtimes R$ be a group with unipotent radical graded by $\mathbb{G}_{m}<Z(R)$ acting linearly on $X=\mathbb{P}(V)$ .

(1)

We say the unipotent stabiliser assumption holds if

[TABLE] 2. (2)

We say the reductive stabiliser assumption holds if

[TABLE]

where as $\mathbb{G}_{m}$ is central in $R$ , there is an induced $R$ -action on the $\mathbb{G}_{m}$ -fixed variety $Z_{\min}$ and we let $Z_{\min}^{\overline{R}-ss}$ denote the GIT semistable locus for $\overline{R}:=R/\mathbb{G}_{m}$ .

Remark 5.19.

The unipotent stabiliser assumptions are crucial to apply Proposition 5.11, whereas the reductive stabiliser assumptions are used to more readily obtain an explicit Hilbert–Mumford type description of the locus we obtain a quotient of. The reductive stabiliser assumption implies semistability coincides with stability for the $\overline{R}$ -action on $Z_{\min}$ .

If these stabiliser assumptions fail, then one would like to perform a sequence of equivariant blow-ups to arrange for this to hold on the blow-up (similar to Kirwan’s partial desingularisation procedure [64]) and then construct a quotient of the original scheme using the quotient of the blow-up; however, this procedure is much more complicated in the non-reductive setting described in [14, $\S$ 9]. Furthermore, if there are generically positive dimensional unipotent stabilisers, blowing up would only result in constant dimensional stabilisers (rather than trivial stabilisers), and so instead one must filter $U$ by normal subgroups whose stabilisers are constant (assuming this can be done via blow-ups) and then proceed as in [14, Remark 7.1] and [81].

The strategy is to first use the grading $\mathbb{G}_{m}$ to obtain a projective quotient by $\widehat{U}=U\rtimes\mathbb{G}_{m}$ and then take a reductive GIT quotient by the residual group $\overline{R}=R/\mathbb{G}_{m}$ . Therefore, we first state the result for quotients by graded unipotent groups $\widehat{U}=U\rtimes\mathbb{G}_{m}$ .

Theorem 5.20 (The $\widehat{U}$ -Theorem, [15, Theorem 2.16]).

Let $\widehat{U}=U\rtimes\mathbb{G}_{m}$ be a graded unipotent group acting linearly on $X=\mathbb{P}(V)$ . If the linearised action is adapted and the unipotent stabiliser assumption $[\widehat{U}]_{0}$ hold, then we have the following statements.

i)

There is a geometric $U$ -quotient $q_{U}:X_{\min}\rightarrow X_{\min}/U$ such that $X_{\min}/U$ is a quasi-projective variety. 2. ii)

There is a geometric $\widehat{U}$ -quotient $q_{\widehat{U}}:X_{\min}\setminus UZ_{\min}\rightarrow X/\!/\widehat{U}:=(X_{\min}\setminus UZ_{\min})/\widehat{U}$ such that $X/\!/\widehat{U}$ is a projective variety. 3. iii)

If the linearised action is well-adapted888See Definition 5.21 below, which can be achieved by further twisting by a rational character., then the ring of $\widehat{U}$ -invariant sections (for an appropriate power) of the linearisation is finitely generated and taking the Proj construction gives the above geometric $\widehat{U}$ -quotient $q_{\widehat{U}}$ .

Let us outline the structure of the proof.

(1)

Since $X_{\min}=\bigcup_{\sigma\in H^{0}(X,\mathcal{O}(1))_{\max}}X_{\sigma}$ , the $U$ -quotient is constructed by applying Proposition 5.11 to each affine $\widehat{U}$ -variety $X_{\sigma}$ for $\sigma\in H^{0}(X,\mathcal{O}(1))_{\max}$ (see Proposition 5.26). 2. (2)

Construct a $\mathbb{G}_{m}$ -equivariant embedding $X_{\min}/U\hookrightarrow\mathbb{P}(W)$ with $W:=(H^{0}(X,\mathcal{O}(r))^{U})^{*}$ for some $r>0$ , to show $X_{\min}/U$ is quasi-projective (see Proposition 5.27). 3. (3)

By appropriately twisting the original linearisation (to make it well-adapted), the induced $\mathbb{G}_{m}$ -action on $\mathbb{P}(W)$ is adapted and $\mathbb{P}(W)^{\mathbb{G}_{m}-(s)s}=\mathbb{P}(W)_{\min}\setminus\mathbb{P}(W_{\min})$ ; thus

[TABLE]

has a geometric $\widehat{U}$ -quotient, as a closed subvariety of $\mathbb{P}(W)/\!/\mathbb{G}_{m}$ (see Proposition 5.29). 4. (4)

To prove that the ring of $\widehat{U}$ -invariant sections is finitely generated, one shows that $q_{\widehat{U}}$ coincides with an enveloping quotient (see [16, Definition 3.1.6]) and as the enveloping quotient is projective, the ring of $\widehat{U}$ -invariant sections for an appropriately divisible power of the well-adapted linearisation is finitely generated by [16, Corollary 3.1.21].

The first two steps give the proof of Theorem 5.20 i), whereas (3) and (4) give statements ii) and iii) respectively; more details on the proofs of Steps (1) - (3) are given in $\S$ 5.7 below.

In (1), it is crucial that the $U$ -action is graded by $\mathbb{G}_{m}$ , that the unipotent stabiliser assumption holds and that the action is adapted in order to apply Proposition 5.11. The grading $\mathbb{G}_{m}$ -action is also used in (2) and (3) to firstly show $X_{\min}/U$ is quasi-projective and then obtain a projective quotient of $X_{\min}\setminus UZ_{\min}$ as a closed subvariety of $\mathbb{P}(W)/\!/\mathbb{G}_{m}$ . In (3), in order for the induced $\mathbb{G}_{m}$ -linearisation on $\mathbb{P}(W)$ to be adapted, we need the minimal weight $\omega_{\min}$ for the original linearised action to be negative but very small (see the proof of Proposition 5.29); this leads to the following notion.

Definition 5.21 (Well-adapted linearisation).

Let $G=U\rtimes R$ be a group with unipotent radical graded by $\mathbb{G}_{m}<Z(R)$ acting linearly on $X=\mathbb{P}(V)$ . We say this linearised action is well-adapted if there is $0<\epsilon<\!<1$ such that the $\mathbb{G}_{m}$ -weights on $V$ satisfy

[TABLE]

Let us present a simple example for matrices up to conjugation extending Example 2.15.

Example 5.22 (To appear in upcoming joint work with E. Hamilton and J. Jackson).

Consider the upper triangular Borel subgroup $\widehat{U}=\mathbb{G}_{a}\rtimes_{2}\mathbb{G}_{m}<\mathrm{SL}_{2}$ acting by conjugation on $\operatorname{Mat}_{2\times 2}$ . To apply Theorem 5.20, we consider the projective embedding $\operatorname{Mat}_{2\times 2}\hookrightarrow X:=\mathbb{P}(\operatorname{Mat}_{2\times 2}\oplus k)$ , with trivial action on $k$ . The minimal weight space is 1-dimensional and spanned by the elementary matrix $E_{21}$ , so $Z_{\min}=\{*\}=\mathbb{P}(kE_{21})$ is contained at infinity (i.e. in $\mathbb{P}(\operatorname{Mat}_{2\times 2})$ ), and $X_{\min}=\{[A:z]:a_{21}\neq 0\}$ . The unique point in $Z_{\min}$ has trivial $\mathbb{G}_{a}$ -stabiliser; thus $[\widehat{U}]_{0}$ holds. By twisting the linearisation to make it adapted, we obtain a geometric $\widehat{U}$ -quotient

[TABLE]

As $UZ_{\min}$ is contained at infinity, $\operatorname{Mat}_{2\times 2}\cap X^{\widehat{U}-s}=\operatorname{Mat}_{2\times 2}\cap X_{\min}=(\operatorname{Mat}_{2\times 2})_{a_{21}}$ and we obtain a geometric $\widehat{U}$ -quotient of matrices whose bottom left entry is non-zero

[TABLE]

Suppose that $G=U\rtimes R$ acts on $X$ ; then the projective geometric $\widehat{U}$ -quotient $X/\!/\widehat{U}$ of Theorem 5.20 has a residual action of the reductive group $\overline{R}=R/\mathbb{G}_{m}$ . There is a quotient

[TABLE]

where the second morphism is the reductive GIT quotient; this gives a projective and good $G$ -quotient of the open set $q_{\widehat{U}}^{-1}((X/\!/\widehat{U})^{\overline{R}-ss})$ . The key challenge is to determine this preimage in terms of the original action on $X$ . This is easiest when $[\overline{R}]_{0}$ holds (i.e. semistability coincides with stability for the $\overline{R}$ -action on $Z_{\min}$ ); in this case, we define the following semistable sets.

Definition 5.23 (Non-reductive stable set).

Let $G=U\rtimes R$ be a group with unipotent radical graded by $\mathbb{G}_{m}<Z(R)$ acting linearly on $X=\mathbb{P}(V)$ . If the linearisation is well-adapted and both $[\widehat{U}]_{0}$ and $[\overline{R}]_{0}$ hold, then we define the $G$ -stable set by

[TABLE]

Remark 5.24.

Since $[\widehat{U}]_{0}$ holds for the linear action of the graded unipotent group $\widehat{U}$ on $X=\mathbb{P}(V)$ , the $U$ -sweep of $Z_{\min}$ is a closed subvariety of $X_{\min}$ by [14, Lemma 5.4]. Consequently, the $\widehat{U}$ -stable locus $X^{\widehat{U}-s}=X_{\min}\setminus UZ_{\min}$ is open in $X$ , and so is the $G$ -stable locus.

The next result is a special case of [14, Theorem 2.20] as stated in [56, Theorem 2.28], where the reductive stabiliser assumption $[\overline{R}]_{0}$ is used to describe the preimage of the reductive (semi)stable locus by using the reductive Hilbert-Mumford criterion and comparing the torus weight sets of points $x\in X_{\min}$ and their images under $q_{U}$ . Although some torus weights are lost on applying $q_{U}$ , all torus weights corresponding to maximal grading $\mathbb{G}_{m}$ -weights survive, and if $x\in UZ_{\min}$ then it has at least one non-minimal weight; these observations together with the assumption that semistability coincides with stability for $\overline{R}$ on $Z_{\min}$ are central in the proof.

Theorem 5.25 (Construction of non-reductive GIT quotients, [17, Theorem 4.28]).

Let $G=U\rtimes R$ be a group with unipotent radical graded by $\mathbb{G}_{m}<Z(R)$ acting linearly on $X=\mathbb{P}(V)$ . If the linearisation is well-adapted and both $[\widehat{U}]_{0}$ and $[\overline{R}]_{0}$ hold, then there is a projective and geometric $G$ -quotient

[TABLE]

which coincides with the Proj construction associated to the (finitely generated) invariant ring.

5.7. Overview of the proof

We provide some details on the proof of Theorem 5.20 i) and ii). The structure of the proof is as follows.

(1)

Using Proposition 5.11, construct a geometric $U$ -quotient of $X_{\min}$ (see Proposition 5.26). 2. (2)

Construct $X_{\min}/U\hookrightarrow\mathbb{P}(W)$ to show $X_{\min}/U$ is quasi-projective (see Proposition 5.27). 3. (3)

Inside $\mathbb{P}(W)/\!/\mathbb{G}_{m}$ , construct a geometric $\widehat{U}$ -quotient of $X^{\widehat{U}-s}$ (see Proposition 5.29).

The first step relies on the Key Proposition (Proposition 5.6) about $\mathbb{G}_{a}$ -slices via $\mathbb{G}_{m}$ -gradings.

Proposition 5.26.

Let $\widehat{U}=U\rtimes\mathbb{G}_{m}$ be a graded unipotent group acting linearly on $X=\mathbb{P}(V)$ . If the linearised action is adapted and $[\widehat{U}]_{0}$ holds, then there is a geometric $U$ -quotient of $X_{\min}$ .

Proof.

Since $X_{\min}$ is the union of the open affine varieties $X_{\sigma}$ over ${\sigma\in H^{0}(X,\mathcal{O}(1))_{\max}}$ , we will construct a geometric $U$ -quotient by gluing trivial quotients $X_{\sigma}\rightarrow X_{\sigma}/U=\operatorname{Spec}\mathcal{O}(X_{\sigma})^{U}$ which are constructed by applying Proposition 5.11 to each affine $\widehat{U}$ -variety $X_{\sigma}$ for $\sigma\in H^{0}(X,\mathcal{O}(1))_{\max}$ .

In fact, by choosing a basis of $V$ consisting of $\mathbb{G}_{m}$ -weight vectors, which gives an identification $X\cong\mathbb{P}^{n}$ , it suffices to construct these trivial quotients in the case where $\sigma=x_{i}\in H^{0}(X,\mathcal{O}(1))_{\max}$ is a coordinate function. Then $X_{\sigma}=\mathbb{P}(V)_{x_{i}}\cong\mathbb{A}^{n}$ has coordinates $x_{j}/x_{i}$ . Since $x_{i}\in H^{0}(X,\mathcal{O}(1))_{\max}$ , its $\mathbb{G}_{m}$ -weight is $-\omega_{\min}$ (recall that $\omega_{\min}$ is the minimal weight in $V$ and $V=H^{0}(X,\mathcal{O}(1))^{*}$ ). Hence the weights of the $\mathbb{G}_{m}$ -action on $X_{\sigma}=\mathbb{P}(V)_{x_{i}}\cong\mathbb{A}^{n}$ are of the form $\omega_{j}-\omega_{\min}\geq 0$ , where this inequality holds due to the linearised action being adapted. In particular, the flow under $\mathbb{G}_{m}$ as $t\rightarrow 0$ exists for all points in $X_{\sigma}$ . Since $Z_{\sigma}\subset Z_{\min}$ , the unipotent stabiliser assumption $[\widehat{U}]_{0}$ implies that the corresponding stabiliser assumption in Proposition 5.11 holds; hence we obtain the claimed trivial $U$ -quotient $X_{\sigma}\rightarrow X_{\sigma}/U=\operatorname{Spec}\mathcal{O}(X_{\sigma})^{U}$ . ∎

The quotient obtained from this gluing construction is a priori just an abstract scheme, but the next result shows it is in fact quasi-projective. This result is [14, Lemma 7.6].

Proposition 5.27.

Let $\widehat{U}$ be a graded unipotent group acting linearly on $X=\mathbb{P}(V)$ such that the linearised action is adapted and $[\widehat{U}]_{0}$ holds. There exists a positive integer $r$ and an embedding

[TABLE]

where $W:=(H^{0}(X,\mathcal{O}(r))^{U})^{*}$ , the first morphism is a closed immersion and the second morphism is the open inclusion of the minimal attracting set for the induced $\mathbb{G}_{m}$ -action on $W$ .

Proof.

Fix a basis $\sigma_{1},\dots,\sigma_{l}$ of $H^{0}(X,\mathcal{O}(1))_{\max}$ such that $\mathcal{O}(X_{\sigma_{i}})$ is finitely generated (see the proof of Proposition 5.26 above). Then there exists a positive integer $r$ , such that for $1\leq i\leq l$ , we have that $R(X,\mathcal{O}(1))^{U}_{(\sigma_{i}^{r})}$ is generated by $\{\frac{f}{\sigma_{i}^{r}}:f\in H^{0}(X,\mathcal{O}(r))^{U}\}$ . Let $\Sigma_{i}\in H^{0}(\mathbb{P}(W),\mathcal{O}(1))\cong H^{0}(X,\mathcal{O}(r))$ correspond to $\sigma_{i}^{r}$ ; then $\operatorname{Sym}(W^{*})_{\Sigma_{i}}\rightarrow\mathcal{O}(X)_{\sigma_{i}}$ is surjective.

The inclusion $H^{0}(X,\mathcal{O}(r))^{U}\hookrightarrow H^{0}(X,\mathcal{O}(r))$ induces a rational map $\phi:X\dashrightarrow\mathbb{P}(W)$ , which is well-defined on $X_{\min}$ , as maximal sections are $U$ -invariant, and $\phi|_{X_{\min}}=\overline{\phi}\circ q_{U}$ for

[TABLE]

Since $\operatorname{Spec}\mathcal{O}(X_{\sigma_{i}})^{U}$ cover $X_{\min}/U$ , we see that $\overline{\phi}$ factors via

[TABLE]

For a partition $\underline{k}=(k_{1},\dots,k_{l})$ of $r$ , the section $\sigma^{\underline{k}}:=\prod_{i=1}^{l}\sigma_{i}^{k_{i}}\in H^{0}(X,\mathcal{O}(r))_{\max}$ corresponds to $\Sigma_{\underline{k}}\in H^{0}(\mathbb{P}(W),\mathcal{O}(1))_{\max}$ .

Since being a closed immersion is a local property on the target, $\overline{\phi}:X_{\min}/U\rightarrow\mathbb{P}(W)_{\min}$ is a closed immersion if $\overline{\phi}_{\underline{k}}:\operatorname{Spec}\mathcal{O}(X_{\sigma^{\underline{k}}})^{U}\rightarrow\mathbb{P}(W)_{\Sigma_{\underline{k}}}$ is a closed immersion for each partition $\underline{k}$ , or equivalently $\operatorname{Sym}(W^{*})_{\Sigma_{\underline{k}}}\rightarrow\mathcal{O}(X)_{\sigma^{\underline{k}}}$ is surjective. This last statement is deduced from the fact that $\operatorname{Sym}(W^{*})_{\Sigma_{i}}\rightarrow\mathcal{O}(X)_{\sigma_{i}}$ is surjective for each $i$ by the choice of $r$ . ∎

The next two results are described in the discussion after Lemma 7.7 in [14].

Lemma 5.28.

For an adapted linear action of a graded unipotent group $\widehat{U}$ on $X=\mathbb{P}(V)$ , assume $[\widehat{U}]_{0}$ holds; thus there is a geometric quotient $q_{U}:X_{\min}\rightarrow X_{\min}/U$ , which is locally closed in $\mathbb{P}(W)$ by Proposition 5.29. Let $x\in X_{\min}$ ; then $q_{U}\in\mathbb{P}(W_{\min})$ if and only if $x\in UZ_{\min}$ .

Proof.

For $q_{U}(x)\in X_{\min}/U\hookrightarrow\mathbb{P}(W)_{\min}$ , we have that $q_{U}\in\mathbb{P}(W_{\min})$ if and only if

[TABLE]

or, as $q_{U}$ is a geometric quotient, equivalently $U\cdot x=U\cdot p(x)$ , i.e. $x\in UZ_{\min}$ . ∎

Proposition 5.29.

For an adapted linear action of a graded unipotent group $\widehat{U}$ on $X=\mathbb{P}(V)$ , assume $[\widehat{U}]_{0}$ holds. There is a well-adapted rational twist of the $\widehat{U}$ -linearisation on $X$ such that the induced $\mathbb{G}_{m}$ -linearisation on $\mathbb{P}(W)$ is adapted and

[TABLE]

Furthermore, the preimage under $q_{U}$ of the $\mathbb{G}_{m}$ -stable locus of the closure of $X_{\min}/U$ in $\mathbb{P}(W)$

[TABLE]

admits a projective geometric $\widehat{U}$ -quotient $X_{\min}\setminus UZ_{\min}\rightarrow(\overline{X_{\min}/U})/\!/\mathbb{G}_{m}$ .

Proof.

Recall that $\omega_{\min}$ is the minimal weight on $V=H^{0}(X,\mathcal{O}(1))^{*}$ and so $r\omega_{\min}$ is the minimal weight on $W:=(H^{0}(X,\mathcal{O}(r))^{U})^{*}$ . Pick $\epsilon>0$ so that $r\epsilon<1$ . Let $\chi:\widehat{U}\rightarrow\mathbb{G}_{m}$ be the rational character corresponding to $\omega_{\min}+\epsilon\in\mathbb{Q}$ . Then the $\mathbb{G}_{m}$ -weights $\alpha_{j}$ on $\mathcal{O}_{\mathbb{P}(W)}(1)^{r\chi}$ satisfy

[TABLE]

so the induced $\mathbb{G}_{m}$ -linearisation $\mathcal{O}_{\mathbb{P}(W)}(1)$ is adapted and $\mathbb{P}(W)^{\mathbb{G}_{m}-(s)s}=\mathbb{P}(W)_{\min}\setminus\mathbb{P}(W_{\min})$ . This linearisation on $\mathbb{P}(W)$ is induced from the well-adapted twisted linearisation $\mathcal{O}_{X}(1)^{\chi}$ . The final claim follows from Lemma 5.28, as $\overline{X_{\min}/U}^{\mathbb{G}_{m}-(s)s}=(X_{\min}/U)\setminus((X_{\min}/U)\cap\mathbb{P}(W_{\min}))$ . ∎

The proof of the finite generation of an appropriate power of the linearisation is given in the discussion proceeding Corollary 7.10 in [14].

6. Recent applications of non-reductive GIT

In this section we will give an overview of some recent applications of non-reductive GIT.

6.1. Moduli of jets of map germs and hyperbolicity

Bérczi and Kirwan [18] used non-reductive GIT to construct and study compactifications of spaces of invariant jet differentials in order to prove polynomial versions of the Green–Griffiths–Lang conjecture and Kobayashi conjecture concerning hyperbolicity properties of generic smooth projective hypersurfaces. Let us outline these conjectures and the approach using non-reductive GIT.

A complex projective manifold $X$ is Brody hyperbolic if every holomorphic map $f:\mathbb{C}\rightarrow X$ is constant. For example, in dimension $1$ , a curve is hyperbolic if and only if $g\geq 2$ . Hyperbolic varieties are interesting from the point of view of complex geometry and also for their conjectural Diophantine properties (Lang conjectured that if a projective variety defined over $\mathbb{Q}$ is hyperbolic, then $X(\mathbb{Q})$ is finite).

The Kobayashi conjecture predicts that a very general hypersurface $X\subset\mathbb{P}^{n+1}$ of sufficiently large degree $d_{n}$ is Brody hyperbolic. Green, Griffiths and Lang conjectured that every projective algebraic variety $X$ of general type is weakly hyperbolic; that is, there exists a proper subvariety $Y\subsetneq X$ such that the image of every holomorphic map $f:\mathbb{C}\rightarrow X$ is contained in $Y$ . These conjecture are related by a recent result of Riedl and Yang [82]: if the Green–Griffiths–Lang conjecture holds for projective hypersurfaces of dimension $n$ and degree at least $d_{n}$ , then the Kobayashi conjecture is true for projective hypersurfaces of dimension $n$ with degree at least $d_{2n-1}$ . The strategy for approaching these conjectures goes back to work of Demailly [27] and Siu [90], which involves studying invariant jet differentials; here non-reductive group actions naturally arise as reparametrisation groups.

For a smooth projective complex variety $X$ of dimension $n$ , the bundle of $J_{k}X\rightarrow X$ of $k$ -jet germs in $X$ has fibre over $p\in X$ is given by germs of holomorphic maps $f:(\mathbb{C},0)\rightarrow(X,p)$ for fixed local coordinates at $p$ up to the equivalence relation given by equality of the first $k$ -derivatives at [math]; thus the fibres can be represented by truncated Taylor expansions or equivalently $k$ -tuples of vectors in $\mathbb{C}^{n}$ given by the first $k$ -derivatives. The transition functions are polynomial, but not linear, so $J_{k}X\rightarrow X$ is not a vector bundle. The group $\mathrm{Diff}_{k}$ of regular $k$ -jets of maps $(\mathbb{C},0)\rightarrow(\mathbb{C},0)$ acts fibrewise on $J_{k}X\rightarrow X$ by reparametrisations; $\mathrm{Diff}_{k}$ is naturally an upper triangular subgroup of $\mathrm{GL}_{k}$ and fortunately is a graded unipotent group

[TABLE]

where $\dim U_{k}=k-1$ . Green and Griffiths studied algebraic differential operators, which are polynomial functions on $J_{k}X$ , and constructed a sheaf of algebraic differential operators of order $k$ of fixed weighted degree (with respect to the $\mathbb{C}^{*}$ -weights). Demailly considered a subbundle of jet differentials invariant under reparametrisations from $U_{k}$ . A key tool to finding invariant jet differentials is to produce a projective completion of the fibrewise quotient of $\mathrm{Diff}_{k}$ acting on the jet bundle $J_{k}X\rightarrow X$ . The projective completion given by Bérczi and Kirwan [18] uses non-reductive GIT, where a blow-up at $Z_{\min}$ , which is just a point, is needed for the unipotent stabiliser assumption to hold. They then use intersection theory for non-reductive GIT quotients (see [17] and $\S$ 6.4 below) to prove a polynomial version of the Green–Griffiths–Lang conjecture.

Theorem 6.1 (Polynomial Green–Griffiths–Lang Theorem of Bérczi–Kirwan [18]).

A generic smooth projective hypersurface of dimension $n$ and degree $d\geq 32n^{4}$ is weakly hyperbolic.

By work of Riedl–Yang [82], this gives a polynomial Kobayashi theorem [18, Theorem 1.4] for generic smooth projective hypersurfaces of dimension $n$ and degree $d\geq 32(2n-1)^{4}$ .

6.2. Moduli spaces of hypersurfaces in weighted projective orbifolds

One classical application of reductive GIT is to construct moduli spaces of projective hypersurfaces as the GIT quotient of the $\mathrm{PGL}_{n+1}$ -action on $\mathbb{P}(k[x_{0},\dots,x_{n}]_{d})$ ; this gives compactifications of moduli spaces of smooth hypersurfaces, as Mumford proved that any smooth hypersurface $X\subset\mathbb{P}^{n}$ of degree $d\geq 3$ is GIT stable when $n>1$ . In general determining precisely which other hypersurfaces are (semi)stable is challenging, even with the Hilbert–Mumford criterion in hand.

The advent of non-reductive GIT enables this to be extended to hypersurfaces in weighted projective spaces and more general projective toric varieties, whose automorphism groups are non-reductive affine algebraic groups and are explicitly described by the work of Cox as quotients of the graded automorphism group of the Cox ring.

Example 6.2.

The weighted projective plane $\mathbb{P}(1,1,2)$ has automorphism group given by

[TABLE]

where the unipotent group appears from automorphisms of the form $z\mapsto z+ax^{2}+bxy+cy^{2}$ .

Fortunately the automorphism groups of weighted projective spaces have graded unipotent radicals; see [15, Lemma 4.1] and also [23].

Bunnett [23] studied the application of non-reductive GIT to moduli of weighted projective hypersurfaces and more generally hypersurfaces in toric orbifolds. In a well-formed weighted projective space $\mathbb{P}(a_{0},\dots,a_{n})$ he proves [23, Theorem 5.18] that any quasi-smooth hypersurface (see [11, $\S$ 3]) of degree $d\geq 2+\max\{a_{0},\dots,a_{n}\}$ is stable (in the sense of non-reductive GIT) provided the unipotent stabiliser assumption holds, so that no blow-ups are required.

In the case of a well-formed weighted projective space $\mathbb{P}(a_{0},\dots,a_{n})$ , any quasi-smooth hypersurface of degree $d\geq 2+\max\{a_{0},\dots,a_{n}\}$ has finitely many automorphisms coming from the ambient automorphisms of $\mathbb{P}(a_{0},\dots,a_{n})$ by [23, Theorem 3.13]. Consequently, the Keel–Mori Theorem gives the existence of a coarse moduli spaces as an algebraic space. However, non-reductive GIT gives the construction of a quasi-projective moduli space.

For a toric variety $X$ , there is an $A$ -discriminant (see [37]) for hypersurfaces of class $\alpha$ , which vanishes on non-quasi-smooth hypersurfaces (in contrast to the case of projective hypersurfaces, the converse is not necessarily true, as the $A$ -discriminant only checks for singularities in the sweep under $G=\operatorname{Aut}_{\alpha}(X)$ of the torus $T\subset X$ , see [23, Remark 4.11]). The $A$ -discriminant of [37] is interpreted as an invariant section of a twisted linearisation in [23, Corollary 4.12].

Let us explain the non-reductive GIT set-up for hypersurfaces in $\mathbb{P}(a_{0},\dots,a_{n})$ of degree $d$ . Assume that hypersurfaces of degree $d$ are Cartier divisors (i.e. the lowest common multiple of the weights divides $d$ ). Consider the non-reductive group $G=\operatorname{Aut}(\mathbb{P}(a_{0},\dots,a_{n}))$ acting on the space $X=\mathbb{P}(k[x_{0},\dots,x_{n}]_{d})$ of weighted degree $d$ homogeneous polynomials. Bunnett proves that quasi-smooth hypersurfaces are contained in the $\widehat{U}$ -stable set $X_{\min}\setminus UZ_{\min}$ (under the unipotent stabiliser assumption). Using the non-reductive GIT Hilbert–Mumford criterion of [14], he shows that quasi-smooth hypersurfaces are stable for the action of $G$ assuming that $d$ is a Cartier degree with $d\geq 2+\max\{a_{0},\dots,a_{n}\}$ and the unipotent stabiliser assumption holds. Furthermore, if the weighted projective space has only two weights, then the unipotent stabiliser assumption holds (see [23, Proposition 5.9]).

Bunnett obtains the best results for Cartier hypersurfaces in a rational cone $\mathbb{P}(1,\dots,1,r)$ of degree $d\geq r+2$ (see [23, Theorem 5.20]): he explicitly describes the quasi-smooth locus as the non-vanishing locus of a section (which is obtained by multiplying the $A$ -discriminant with a variable) and constructs a $U$ -quotient of this open affine variety using Proposition 5.11, where the necessary unipotent stabiliser assumption is easily verified. He then directly obtains a geometric quotient of the locus of quasi-smooth projective hypersurfaces, which is a projective over affine variety, because it is constructed as a reductive GIT quotient of the affine $U$ -quotient twisted by a character as in $\S$ 2.8 rather than using the more complicated methods of [14].

6.3. Moduli of unstable objects

Recall from $\S$ 3.3 that associated to a linear action of a reductive group $G$ on $\mathbb{P}(V)$ and a choice of norm, there is an instability stratification

[TABLE]

where $S_{\beta}\cong G\times^{P_{\lambda}}Y_{\beta}^{ss}$ for a parabolic subgroup $P_{\lambda}<G$ . A categorical $P_{\lambda}$ -quotient of $Y_{\beta}^{ss}$ , or equivalently a categorical $G$ -quotient of $S_{\beta}$ , is given by Proposition 3.27; however, as explained after this proposition, this is far from being an orbit space as it factors via the retraction $p_{\beta}:Y_{\beta}^{ss}\rightarrow Z_{\beta}^{ss}$ sending a point to its flow under $\lambda$ as $t\rightarrow 0$ . Instead, we would like to apply non-reductive GIT to the action of $P_{\lambda}$ on the closure $\overline{Y_{\beta}}$ of $Y_{\beta}\subset X$ , where we can twist the linearisation by (a rational multiple of) a character corresponding to $\lambda$ to make it well-adapted. Fortunately, the non-reductive notion of stability precisely picks out the locus we would like and removes (the $P_{\lambda}$ -sweep) of the limit set $Z_{\beta}^{ss}$ ; see Theorem 6.3 below.

For the parabolic group $P_{\lambda}=U_{\lambda}\rtimes L_{\lambda}$ acting on the blade closure $X=\overline{Y_{\beta}}$ of an unstable stratum $S_{\beta}$ as in (6) above, there is a twisted rational linearisation $\mathcal{L}_{\beta(1+\epsilon)}$ which is well-adapted. Furthermore, we have that in the non-reductive GIT notation the map $p:X_{\min}\rightarrow Z_{\min}$ coincides with the retraction $p_{\beta}:Y_{\beta}\rightarrow Z_{\beta}$ appearing in the description of the unstable strata. Furthermore, $Z_{\beta}^{ss}$ is defined to be the semistable locus for $L_{\lambda}$ with respect to $\mathcal{L}_{\beta}$ , or equivalently for $\overline{L_{\lambda}}:=L_{\lambda}/\lambda(\mathbb{G}_{m})$ as $\lambda(\mathbb{G}_{m})$ acts trivially, which coincides with the semistable locus in $Z_{\min}$ appearing in the definition of the non-reductive stable locus.

Theorem 6.3 (Non-reductive GIT quotients of unstable strata, [56, Theorem 1.1]).

For the parabolic group $P_{\lambda}=U_{\lambda}\rtimes L_{\lambda}$ graded by $\lambda$ acting on the blade closure $X:=\overline{Y_{\beta}}$ of an unstable stratum $S_{\beta}$ as in (6) with the well-adapted linearisation $\mathcal{L}_{\beta(1+\epsilon)}$ , the following statements hold.

i)

If $[\widehat{U}]_{0}$ holds, then there is a projective geometric $\widehat{U}_{\lambda}$ -quotient

[TABLE]

and by taking a reductive GIT quotient by $\overline{L_{\lambda}}$ one obtains a projective categorical $P_{\lambda}$ -quotient of an open subset of the $\widehat{U}$ -stable locus. 2. ii)

If both $[\widehat{U}]_{0}$ and $[\overline{R}]_{0}$ hold, then there is a projective geometric $P_{\lambda}$ -quotient

[TABLE]

Moreover, the ring of invariant sections if finitely generated and $q_{P_{\lambda}}$ coincides with the Proj construction for this invariant ring.

We would like to apply this theorem to moduli of objects in an abelian category, where there are moduli-theoretic instability filtrations, such as the Harder–Narasimhan (HN) filtrations for vector bundles [50]; for example, moduli of sheaves on projective schemes or moduli of quiver representations. In these examples, the GIT instability stratification has been compared with the moduli-theoretic Harder–Narasimhan stratification [39, 57, 54, 55, 96] and this suggests moduli of objects of fixed HN type should be constructed as non-reductive GIT quotients.

Unfortunately the stabiliser assumptions in Theorem 6.3 are quite restrictive and so it is only possible in quite limited situations. For example, for vector bundles (or Higgs bundles) on a smooth projective curve of fixed HN type, the reductive stabiliser assumption $[\overline{R}]_{0}$ only holds for coprime HN types of length $2$ (i.e. the HN filtration has two terms and the invariants for the successive quotients are coprime, so that semistability coincides with stability) and even in this case, the unipotent stabiliser assumption rarely holds and so blow-ups are needed (see [56, $\S$ 3.2.1] for a detailed discussion). In this length 2 coprime case, the non-reductive GIT quotient picks out non-split HN filtrations of length $2$ whose automorphism groups have a fixed dimension; see [21, 58] for the case of vector bundles and [49] for the case of Higgs bundles. To rectify the failure of the reductive stabiliser assumption $[\overline{R}]_{0}$ , one can alternatively perform a quotient in stages, using different 1-PSs in the centre of $L_{\lambda}$ to grade different subgroups of the unipotent radical as in [56]; this results in a natural notion of stability for sheaves of a fixed HN type, but again the unipotent stabiliser assumption is rarely satisfied, so a blow-up procedure would be required.

6.4. Interactions with symplectic geometry and cohomological descriptions

GIT quotients for complex reductive groups are closely related to symplectic quotients for a maximal compact group (see Remark 3.22); the close relationship between the reductive GIT instability stratification and a Morse-theoretic stratification for the norm square of the moment map was used in [63] to describe the rational Betti numbers of reductive GIT quotients.

Fortunately, for non-reductive groups with internally graded unipotent radicals, this close relationship with symplectic geometry has been extended by work of Bérczi and Kirwan [17], and applied to compute cohomology of non-reductive GIT quotients.

For a reductive group $G$ acting on a smooth complex projective variety $Y$ , to construct a moment map one fixes a maximal compact subgroup $K<G$ and a symplectic form $\omega$ invariant under the $K$ -action. The moment map for this maximal compact and symplectic form is a $K$ -invariant map $\mu_{K,\omega}:Y\rightarrow\mathfrak{K}^{*}:=\operatorname{Lie}(K)^{*}$ with the moment map property (that it lifts the infinitesimal action via the correspondence between vector fields and forms given by $\omega$ ). However, any other maximal compact subgroup is of the form $g^{-1}Kg$ and $g^{*}\omega$ is invariant under the $g^{-1}Kg$ -action with moment map $\mu_{g^{-1}Kg,g^{*}\omega}=\mathrm{Ad}^{*}_{g^{-1}}\circ\mu_{K,\omega}\circ g$ . Therefore rather than defining a moment map $\mu_{K,\omega}:Y\rightarrow\mathfrak{K}^{*}$ , Bérczi and Kirwan instead fix a $G$ -equivariant Kähler structure $\Omega$ (namely a $G$ -orbit in the space of pairs $(K,\omega)$ of maximal compact subgroups of $G$ and Kähler forms on $Y$ which are invariant under this maximal compact) and define an $\Omega$ -moment map to be a smooth $G$ -equivariant map

[TABLE]

such that $m_{G,Y,\Omega}(K,\omega,-)=\iota_{K}\circ\mu_{K,\omega}:Y\rightarrow\mathfrak{K}^{*}\hookrightarrow\mathfrak{g}^{*}$ is a moment map for the $K$ -action on $(Y,\omega)$ , where as $\mathfrak{g}=\mathfrak{K}\otimes\mathbb{C}$ , we have a canonical embedding $\iota_{K}:\mathfrak{K}^{*}\hookrightarrow\mathfrak{g}^{*}$ .

Let us explain how Bérczi and Kirwan define moment maps for a smooth complex projective variety with an action of a graded unipotent group $\widehat{U}=U\rtimes\mathbb{C}^{*}$ . Assume that $\widehat{U}<G$ is a subgroup of a reductive group and that $X\subset Y$ is a submanifold of a compact Kähler manifold $Y$ with a $G$ -action on $Y$ that restricts to the given $\widehat{U}$ -action on $X$ (note that $X$ is not required to be invariant under the $G$ -action). As above, fix a $G$ -equivariant Kähler structure $\Omega$ on $Y$ and let $m_{G,Y,\Omega}:\Omega\times Y\rightarrow\mathfrak{g}^{*}$ be an $\Omega$ -moment map, they define $m_{\widehat{U},X,\Omega}:X\times\Omega\rightarrow\hat{\mathfrak{u}}^{*}$ by restricting the $\Omega$ -moment map to $X$ and composing with the restriction $\mathfrak{g}^{*}\rightarrow\hat{\mathfrak{u}}^{*}$

[TABLE]

Assuming the unipotent stabiliser assumption $[\widehat{U}]_{0}$ holds, Bérczi and Kirwan provide a moment map description of the non-reductive GIT quotient: for any $(K,\omega)\in\Omega$ , they show

[TABLE]

and that [math] is a regular value of $\mu_{(K,\omega)}^{\widehat{U}}$ and the inclusion of the zero level set of the moment map in the stable locus induces a diffeomorphism of orbifolds

[TABLE]

which can be viewed as a non-reductive Kempf-Ness Theorem. They extend this result to an action of $H=U\rtimes R$ with internally graded unipotent radical (see [17, Theorem 1.1]).

They apply this to compute Betti numbers of non-reductive GIT quotients. Assuming $[\widehat{U}]_{0}$ holds so that $X^{\widehat{U}-s}=X_{\min}\setminus UZ_{\min}$ , they show that the stratification $X_{\min}=X^{\widehat{U}-s}\sqcup UZ_{\min}$ is $\widehat{U}$ -equivariantly perfect and so the Poincaré series of $X/\!/\widehat{U}$ can be computed from that of $Z_{\min}$ . Similarly for $H=U\rtimes R$ as above, they show the Poincaré series of $X/\!/H$ can be computed from that of the reductive GIT quotient of $Z_{\min}$ by $\overline{R}:=R/\mathbb{G}_{m}$ , whose Poincaré series can in turn be computed as in [63] using the reductive GIT instability stratification of $\S$ 3.3.

Furthermore, they adapt methods of Martin [70] relating the rational cohomology of GIT quotients by reductive groups to that of GIT quotients for a maximal torus. Martin shows the intersection pairing for the reductive GIT quotient can be computed from that of the GIT quotient for the maximal torus via an integration formula, which can then be combined with torus localisation techniques. In the non-reductive case, this enables a description of the rational cohomology ring and a non-reductive integration formula [17, Theorems 1.4 and 1.5], which leads to a residue formula for the intersection pairing on the non-reductive GIT quotient.

These methods are used to prove the polynomial versions of the Green–Griffiths–Lang conjecture and Kobayashi conjecture described in $\S$ 6.1. They can also be applied in the future to describe the cohomology of new moduli spaces constructed via non-reductive GIT.

Bibliography96

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] J. Alper, Stacks and moduli , https://sites.math.washington.edu/~jarod/moduli.pdf .
2[2] J. Alper, Good moduli spaces for Artin stacks , Ann. Inst. Fourier (Grenoble) 63 (2013), no. 6, 2349–2402.
3[3] by same author, Adequate moduli spaces and geometrically reductive group schemes , Algebr. Geom. 1 (2014), no. 4, 489–531.
4[4] J. Alper, P. Belmans, D. Bragg, J. Liang, and T. Tajakka, Projectivity of the moduli space of vector bundles on a curve , Stacks Project Expository Collection (SPEC), London Math. Soc. Lecture Note Ser., vol. 480, Cambridge Univ. Press, Cambridge, 2022, pp. 90–125.
5[5] J. Alper, H. Blum, D. Halpern-Leistner, and C. Xu, Reductivity of the automorphism group of K 𝐾 K -polystable Fano varieties , Invent. Math. 222 (2020), no. 3, 995–1032.
6[6] J. Alper, J. Hall, and D. Rydh, The étale local structure of algebraic stacks , https://arxiv.org/abs/1912.06162 .
7[7] J. Alper, D. Halpern-Leistner, and J. Heinloth, Existence of moduli spaces for algebraic stacks , https://arxiv.org/abs/1812.01128 .
8[8] M. Artin, Versal deformations and algebraic stacks , Invent. Math. 27 (1974), 165–189.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Moduli spaces and geometric invariant theory:

Abstract.

Contents

Introduction

Acknowledgements

Conventions

1. Moduli problems and group actions

1.1. Moduli functors and spaces

Example 1.1**.**

Example 1.2**.**

Example 1.3**.**

Definition 1.4** (Moduli problem and moduli functor).**

Definition 1.5** (Fine and coarse moduli spaces).**

Example 1.6**.**

Remark 1.7**.**

Exercise 1.8**.**

Exercise 1.9**.**

Example 1.10**.**

Exercise 1.11**.**

1.2. Construction of moduli spaces using group actions

Example 1.12**.**

1.3. Algebraic groups, actions and quotients

Definition 1.13**.**

Remark 1.14**.**

Example 1.15**.**

Definition 1.16**.**

Remark 1.17**.**

Lemma 1.18**.**

Proof.

Remark 1.19**.**

Definition 1.20** (Orbits and stabilisers).**

Definition 1.21** (Categorical quotient).**

Example 1.22**.**

Definition 1.23** (Good quotient).**

Remark 1.24**.**

Proposition 1.25**.**

Proof.

2. Mumford’s reductive geometric invariant theory

2.1. Hilbert’s 14th Problem

Question 2.1** (Hilbert’s 14th Problem).**

2.2. Constructions of quotients by affine algebraic groups

Remark 2.2** (Transfer Principle).**

2.3. Reductive groups

Definition 2.3** (Unipotent and reductive groups).**

Remark 2.4**.**

Example 2.5**.**

Exercise 2.6**.**

Lemma 2.7** (Geometrically reductive group actions separate closed orbits, [79, Lemma 3.3]).**

2.4. Finitely generated rings of invariants

Theorem 2.8** (Nagata, [76]).**

Proof of Theorem 2.8 (for linearly reductive groups).

Theorem 2.9** (Weitzenböck [94]).**

Proof (after Seshadri [85]).

Remark 2.10**.**

2.5. Affine geometric invariant theory for reductive groups

Definition 2.11** (Affine GIT quotient).**

Theorem 2.12** (Mumford, [74, Theorem 1.1]).**

Definition 2.13**.**

Example 2.14**.**

Example 2.15**.**

Exercise 2.16**.**

2.6. Projective geometric invariant theory

Definition 2.17**.**

Theorem 2.18** (Mumford, see [79, Theorem 3.14]).**

Remark 2.19**.**

2.7. General GIT quotients

2.8. Affine GIT linearised by a character

Example 2.20**.**

3. Semistability and instability in reductive GIT

3.1. Semistability and the Hilbert–Mumford criterion

Proposition 3.1** (Topological Hilbert–Mumford criterion, [74, Proposition 2.2]).**

Proof.

Definition 3.2**.**

Example 1.1.

Example 1.2.

Example 1.3.

Definition 1.4 (Moduli problem and moduli functor).

Definition 1.5 (Fine and coarse moduli spaces).

Example 1.6.

Remark 1.7.

Exercise 1.8.

Exercise 1.9.

Example 1.10.

Exercise 1.11.

Example 1.12.

Definition 1.13.

Remark 1.14.

Example 1.15.

Definition 1.16.

Remark 1.17.

Lemma 1.18.

Remark 1.19.

Definition 1.20 (Orbits and stabilisers).

Definition 1.21 (Categorical quotient).

Example 1.22.

Definition 1.23 (Good quotient).

Remark 1.24.

Proposition 1.25.

Question 2.1 (Hilbert’s 14th Problem).

Remark 2.2 (Transfer Principle).

Definition 2.3 (Unipotent and reductive groups).

Remark 2.4.

Example 2.5.

Exercise 2.6.

Lemma 2.7 (Geometrically reductive group actions separate closed orbits, [79, Lemma 3.3]).

Theorem 2.8 (Nagata, [76]).

Theorem 2.9 (Weitzenböck [94]).

Remark 2.10.

Definition 2.11 (Affine GIT quotient).

Theorem 2.12 (Mumford, [74, Theorem 1.1]).

Definition 2.13.

Example 2.14.

Example 2.15.

Exercise 2.16.

Definition 2.17.

Theorem 2.18 (Mumford, see [79, Theorem 3.14]).

Remark 2.19.

Example 2.20.

Proposition 3.1 (Topological Hilbert–Mumford criterion, [74, Proposition 2.2]).

Definition 3.2.

Notation 3.3.

Proposition 3.4 (Hilbert–Mumford for $\mathbb{G}_{m}$ -actions).

Example 3.5.

Definition 3.6 (Hilbert–Mumford weight).

Exercise 3.7.

Theorem 3.8 (Hilbert–Mumford criterion, [74, Theorem 2.1]).

Theorem 3.9 (Fundamental Theorem of GIT).

Remark 3.10 (Hilbert–Mumford weight for a linearised action).

Proposition 3.11 (Torus weights version of Hilbert–Mumford criterion, [30, $\S$ 9.4]).

Exercise 3.12 (Semistability for binary forms).

Remark 3.13 (Hilbert–Mumford criterion for action on affine scheme twisted by a character).

Exercise 3.14.

Definition 3.15.

Example 3.16.

Definition 3.17 (Normalised Hilbert–Mumford weight and adapted 1-PS).

Theorem 3.18 (Kempf, [61]).

Definition 3.19 (Parabolic and Levi group associated to a 1-PS).

Theorem 3.20 (Kempf [61], Hesselink [53], Kirwan [63], Ness [78]).

Remark 3.21.

Remark 3.22.

Definition 3.23 (Unstable strata, blades and limit sets).

Proposition 3.24 (Kirwan, [63, $\S$ 12]).

Remark 3.25.

Exercise 3.26 (Instability stratification for binary forms).

Proposition 3.27 (Categorical quotients of unstable strata, [57, Lemma 3.1]).

Definition 4.1.

Theorem 4.2 (Keel–Mori Theorem).

Definition 4.3.

Example 4.4 (Alper, [2, Example 12.9]).

Example 4.5 (Alper, [2, Theorem 13.6]).

Remark 4.6.

Definition 4.7 (Valuative criteria for stacks).

Remark 4.8.