Moduli spaces and geometric invariant theory: old and new perspectives
Victoria Hoskins

TL;DR
This paper reviews classical and recent advances in constructing moduli spaces via geometric invariant theory, highlighting extensions to non-reductive groups and stacks that enable new moduli space constructions.
Contribution
It introduces two new developments extending geometric invariant theory to non-reductive groups and stacks, broadening the scope of moduli space construction.
Findings
Extended GIT to non-reductive groups
Applied GIT to stacks for new moduli spaces
Surveyed classical and recent progress in the field
Abstract
Many moduli spaces are constructed as quotients of group actions; this paper surveys the classical theory, as well as recent progress and applications. We review geometric invariant theory for reductive groups and how it is used to construct moduli spaces, and explain two new developments extending this theory to non-reductive groups and to stacks, which enable the construction of new moduli spaces.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Algebra and Geometry · Algebraic Geometry and Number Theory · Geometric and Algebraic Topology
Moduli spaces and geometric invariant theory:
old and new perspectives
Victoria Hoskins
For Peter Newstead on his 80th Birthday
Abstract.
Many moduli spaces are constructed as quotients of group actions; this paper surveys the classical theory, as well as recent progress and applications. We review geometric invariant theory for reductive groups and how it is used to construct moduli spaces, and explain two new developments extending this theory to non-reductive groups and to stacks, which enable the construction of new moduli spaces.
Contents
- 1 Moduli problems and group actions
- 2 Mumford’s reductive geometric invariant theory
- 3 Semistability and instability in reductive GIT
- 4 Generalisations of reductive GIT to stacks
- 5 Non-reductive geometric invariant theory
- 6 Recent applications of non-reductive GIT
Introduction
Mumford’s geometric invariant theory (GIT) [74] for reductive groups provides a method for constructing quotients of reductive group actions in algebraic geometry. For a reductive group acting on a projective scheme, GIT provides an open semistable set admitting a categorical quotient which is projective and constructed from the invariant ring; furthermore, the semistable set can be explicitly described using (torus) weights via the Hilbert–Mumford criterion, rather than in terms of non-vanishing invariants.
Whilst reductive GIT has been successfully employed to construct numerous moduli spaces, it has some limitations. First, it only provides moduli spaces of semistable objects. Second, it only applies to reductive group actions. Third, it only applies in the situation where the moduli problem is presented in terms of a group action. Two recent developments aim to overcome some of these issues: GIT for non-reductive groups and stacky generalisations of GIT.
One of the first challenges for non-reductive group actions is the possibility of non-finitely generated invariant rings; the best-known example is Nagata’s counterexample [76] to Hilbert’s 14th Problem. However, even when non-reductive invariant rings are finitely generated, the corresponding ‘GIT quotient’ is not well-behaved (for example, the quotient map might not even be surjective and its image may only be constructible, see 5.1). Although it is possible to construct geometric quotients of open subsets [34, 83, 95], these open subsets are typically hard to describe explicitly. However, recent work on GIT [15, 14] for non-reductive groups with graded unipotent radical (e.g. or parabolic subgroups) has enabled the construction of projective quotients of certain stable sets, which admit explicit Hilbert–Mumford type descriptions; the price to pay for obtaining these explicit projective non-reductive GIT quotients is that one must impose certain stabiliser assumptions. One of the goals of this survey is to explain the origins, assumptions and results of non-reductive GIT in as simple a context as possible to make them accessible to a broad audience, as well as to highlight some exciting applications.
We also outline another significant development that extends ideas of (reductive) GIT to stacks, as pioneered by Alper, Halpern–Leistner and Heinloth [2, 48, 7, 52]. Alper’s notion of good and adequate moduli spaces of stacks enables GIT-free constructions of moduli spaces. This is even more tangible following the recent existence criteria of Alper–Halpern-Leistner–Heinloth [7], which equates the existence of moduli spaces to two simple valuative criteria and has been applied to various moduli problems [4, 5, 12, 13].
However, adequate and good moduli spaces are locally modelled on reductive GIT and require closed points to have reductive stabiliser groups. Thus, in some senses these two recent developments are orthogonal to each other and ideally there should eventually be an extension of non-reductive GIT to stacks.
Acknowledgements
I am indebted to both Peter Newstead and Frances Kirwan, as I learned the basics of reductive GIT from Peter Newstead’s Tata lecture notes [79] and discussions with Frances Kirwan. I am very grateful to Greg Bérczi, Dominic Bunnett, Eloise Hamilton, Josh Jackson and Frances Kirwan for numerous conversations on non-reductive GIT. I would also like to thank the organisers of VBAC 2022 for soliciting this paper in honour of Peter Newstead.
Conventions
Throughout we will assume is an algebraically closed field and all schemes are assumed to be finite type -schemes, unless otherwise stated. By a point, we will mean a -point (or equivalently, a closed point).
1. Moduli problems and group actions
We start with an example-driven introduction to moduli problems in 1.1 and describe the relation to group actions in 1.2. Finally in 1.3, we give some basic definitions on algebraic groups, actions and quotients, which lays the foundations for GIT in 2.
1.1. Moduli functors and spaces
Naively, a moduli problem is a collection of objects with an equivalence relation on and we would like to give the set of equivalence classes the structure of a scheme that encodes how objects vary continuously in ‘families’.
Example 1.1**.**
- (1)
Let be the set of -dimensional linear subspaces of an -dimensional -vector space and be equality. 2. (2)
Let be the set of finite-dimensional -vector spaces with an endomorphism and be vector space isomorphisms commuting with the endomorphism. 3. (3)
Let be the set of matrices over and be the equivalence relation given by similarity of matrices. 4. (4)
Let to be the set of hypersurfaces of degree in and be the relation given by projective change of coordinates. 5. (5)
Let be the collection of smooth projective curves of fixed genus and be the relation given by isomorphism. 6. (6)
Let be the collection of vector bundles on a fixed scheme and be the relation given by isomorphisms of vector bundles.
Often there is a natural notion of families of objects over a scheme and an extension of to families over , such that we can pullback families by morphisms compatibly with the notion of equivalence.
Example 1.2**.**
- (1)
A family over of -dimensional linear subspaces of an -dimensional vector space is a rank vector subbundle . 2. (2)
A family over of vector spaces with endomorphisms is a vector bundle over with an endomorphism .
The next example shows there might be several ways to extend to families over .
Example 1.3**.**
For vector bundles on a fixed scheme up to isomorphism, the natural notion for a family over is a vector bundle over over , but there are at least two natural equivalence relations:
[TABLE]
where . Since is locally trivial, there is a cover of such that . Hence can be thought of a Zariski local version of .
We can now give a more precise definition of a moduli problem using families.
Definition 1.4** (Moduli problem and moduli functor).**
A moduli problem consists of
- (1)
for each scheme , a collection of families over with an equivalence relation , 2. (2)
for each morphism of schemes, a pullback map
such that
- (i)
for and equivalent families over , we have ; 2. (ii)
for any family over , we have ; 3. (iii)
for any morphisms and , and a family over , we have an equivalence .
This gives rise to a moduli functor where
[TABLE]
Notation: For a family over and a point , we write to denote the corresponding family over . We write .
In particular, a moduli functor is a presheaf on the category of schemes. Recall that the Yoneda Lemma gives an embedding of the category of schemes into the category of presheaves; more precisely there is a fully faithful functor which on objects sends a scheme to its functor of points . A presheaf is called representable if it is in the essential image of the Yoneda embedding.
A moduli functor being representable is the ideal situation and leads to the notion of a fine moduli space, but if that fails, one can instead ask for a universal natural transformation from to the functor of points of a scheme, which leads to the notion of a coarse moduli space.
Definition 1.5** (Fine and coarse moduli spaces).**
Let be a moduli functor.
- i)
A scheme is a fine moduli space for if it represents ; that is, there is a natural isomorphism . In this case, corresponds to an element of called the universal family , which is a family over up to the notion of equivalence. 2. ii)
A coarse moduli space for is a scheme with a natural transformation of functors which is universal (for any natural transformation to the functor of points of a scheme, there exists a unique morphism such that ) such that is bijective.
Example 1.6**.**
For 1-dimensional subspaces of , a fine moduli space is given by , with the tautological line bundle giving a universal family (see [51, II Theorem 7.1]).
Remark 1.7**.**
- (1)
If a fine or coarse moduli space exists, then it is unique up to unique isomorphism. 2. (2)
If a fine moduli space exists, then the universal family over describes all other families in the following sense: for any scheme , we have that a family is equivalent to a morphism with . 3. (3)
Since is a sheaf in the Zariski toplogy (that is, for every scheme and Zariski cover of , the natural map
[TABLE]
is a bijection), for a moduli functor to admit a fine moduli space it must be a sheaf in the Zariski topology.
Exercise 1.8**.**
Show that the equivalence relation on families over of vector bundles on a fixed scheme defines a moduli functor that is not representable. (Hint: Show it fails to be a Zariski sheaf by considering the other equivalence relation for families of vector bundles).
Unfortunately, there may be moduli problems which do not admit even a coarse moduli space.
Exercise 1.9**.**
Let be a moduli functor with the jump phenomenon; that is, there is a family over such that for all and . Show that there is no coarse moduli space for by showing for any natural transformation , the morphism is constant.
Example 1.10**.**
Moduli of rank 2 degree 0 vector bundles on exhibit the jump phenomenon: there is a family of rank 2 degree 0 vector bundles over such that
[TABLE]
Indeed this family is constructed using the isomorphisms
[TABLE]
Another reason for a coarse moduli space to fail to exist is if the moduli problem is unbounded: there does not exist a family over a scheme (of finite type over ) such that for any object (i.e family over ), we have for some (possibly non-unique) .
Exercise 1.11**.**
Show the moduli problem of rank 2 degree 0 vector bundles on is unbounded: suppose there exists a family over such that for any such vector bundle we have for some , then show that cannot be Noetherian by considering the subschemes
[TABLE]
which are closed by the semi-continuity theorem.
A further obstruction to the existence of a coarse moduli space is that there may be non-trivial families which are fibrewise trivial; this may happen when there are non-trivial automorphisms.
One possible solution for dealing with some of these issues is to instead work with a moduli stack; in this case, one can then look for a coarse or good moduli space for this stack as in 4. However, often by imposing a notion of stability on objects, which can be moduli-theoretic or arising from the GIT construction, one obtains a much better-behaved moduli problem for which one can construct moduli spaces.
1.2. Construction of moduli spaces using group actions
Many moduli spaces are constructed as quotients of group actions via the following strategy (after fixing any discrete invariants and restricting to a bounded class of objects):
- (1)
Find an overparametrisation: find a parameter scheme with a family such that any other family can be locally obtained by pullback from (possibly non-uniquely). 2. (2)
Find a group action describing the symmetries: find a group acting on such that the orbits correspond to the equivalence classes. 3. (3)
Take a quotient (in the category of schemes if possible).
The third step is typically performed using Geometric Invariant Theory (GIT).
Let us give some examples, before describing algebraic group actions in 1.3 in more detail.
Example 1.12**.**
- (1)
1-dimensional vector subspaces in can be parametrised by fixing a basis vector in . Since two basis vectors are related by scalar multiplication, acting on describes the symmetries and is the fine moduli space. 2. (2)
An -dimensional subspace in can be parametrised by choosing a basis, which gives an element in of rank and the choice of basis is controlled by the action of by left multiplication. In Exercise 3.14, we will see that the open locus of rank matrices is a GIT semistable locus that admits a quotient, namely a Grassmannian. 3. (3)
A projective hypersurface of degree in is given by the vanishing locus of a degree homogeneous polynomial in variables. Since defines the same hypersurface, one can consider and the action of describes these hypersurfaces up to change of coordinates. 4. (4)
For vector bundles of rank and degree on a smooth projective curve , provided is sufficiently large, any semistable111There is a natural notion of semistability involving verifying an inequality of slopes for all subbundles, which turns out to be related to a corresponding GIT notion of semistability. vector bundle can be parametrised as a quotient of a fixed vector bundle (see [79, Lemma 5.2]): for semistable of sufficiently large degree, the evaluation map is surjective and has vanishing higher cohomology, so by choosing a basis of global sections we obtain a quotient
[TABLE]
where is the Euler characteristic of . Consequently, a Quot scheme parametrising quotients of with fixed invariants gives an overparametrisation and the action of describes the symmetries.
1.3. Algebraic groups, actions and quotients
Here we focus on the essential notions that we will need, and refer to [20, 22, 71] for more detailed expositions.
Definition 1.13**.**
An algebraic group over is a a group object in the category of -schemes (that is, a -scheme with identity element , group operation and inversion given by morphisms of schemes such that the usual group axioms are stated as commutativity of certain diagrams). We say is an affine algebraic group if the underlying scheme is affine.
Remark 1.14**.**
The -algebra of regular functions on is a Hopf algebra with comultiplication , coinversion and counit and dualised commutative diagrams. In fact, there is a one-one correspondence between finitely generated Hopf algebras over and affine algebraic groups over [71, II Theorem 5.1].
As the following example demonstrates, many familiar groups are affine algebraic groups.
Example 1.15**.**
- (1)
The additive group over is the algebraic group whose underlying scheme is the affine line over and whose group operation is given by addition:
[TABLE]
For a -algebra , we have . 2. (2)
The multiplicative group over is the algebraic group whose underlying variety is the and whose group operation is given by multiplication:
[TABLE]
For a -algebra , we have . 3. (3)
The general linear group over is an open subvariety of cut out by the non-vanishing of the determinant. It is an affine variety with coordinate ring . The co-group operations are defined by:
[TABLE]
where is the regular function on given by taking the -th entry of the inverse of a matrix. 4. (4)
For a finite group , the group algebra is a Hopf algebra and determines an affine algebraic group , whose -points are identified with elements of . 5. (5)
For , the group of th roots of unity is . Write for the ideal of . Then
[TABLE]
which implies that is an algebraic subgroup of . If is different from , the polynomial is separable and there are distinct roots in . Then the choice of a primitive th root of unity in determines an isomorphism . However, if , then in , which implies that the scheme is non-reduced (with as the only closed point).
A linear algebraic group is a closed subgroup of ; hence, any linear algebraic group is an affine algebraic group. The converse statement is also true: any affine algebraic group is a linear algebraic group (see Remark 1.19).
Definition 1.16**.**
An (algebraic) action of an affine algebraic group on a scheme is a morphism of schemes such that the following diagrams commute
[TABLE]
A subscheme is called -invariant if it is preserved by the action; that is, .
A morphism between schemes with actions and is -equivariant if the following diagram commutes
[TABLE]
If is given the trivial action , then we say is -invariant.
Remark 1.17**.**
For an action , there is an induced action of on the ring of regular functions given by
[TABLE]
for , and .
By the following lemma, any action of an affine algebraic group on an affine scheme gives a -action on the -algebra which is rational; that is, every is contained in a finite dimensional -invariant linear subspace of .
Lemma 1.18**.**
For an affine algebraic group acting on an affine scheme , any finite dimensional vector subspace of is contained in a finite dimensional -invariant vector subspace.
Proof.
Let denote the coaction. Let and write with and . The vector space spanned by is a -invariant finite-dimensional subspace containing , as . ∎
Remark 1.19**.**
By applying this to the action of on itself by left multiplication, if we let be a vector space spanned by a finite choice of algebra generators for , then is contained in a finite dimensional -invariant vector subspace . One can prove that there is an embedding to show any affine algebraic group over is a linear algebraic group.
We can define orbits and stabilisers in this setting; the latter has a scheme structure by definition, and we shall soon see the former can also be equipped with a scheme structure.
Definition 1.20** (Orbits and stabilisers).**
For an action of an affine algebraic group on a scheme and for a -point , we define
- (1)
the orbit of to be the (set-theoretic) image of given by ; 2. (2)
the stabiliser of to be the fibre product of and .
The stabiliser of is a closed subscheme of (as it is the preimage of a closed subscheme of under ) and a subgroup of . By Chevalley’s Theorem [51, II Exercise 3.19], as the image of a morphism of schemes, the orbit is a priori only a constructible subset of . However, we claim it is a locally closed subset and so can be equipped with the structure of a reduced locally closed subscheme of . Indeed, is open in its closure: since the orbit is constructible, there is a dense open subset of with and, as acts transitively on the orbit, every point in the orbit is contained in a -translate of .
The boundary of an orbit is a union of orbits of strictly smaller dimension, and so in particular every orbit closure contains a closed orbit (of minimal dimension). There is also an orbit stabiliser theorem:
[TABLE]
as is flat (by transitivity of the -action, we can deduce this from generic flatness) and so we can apply the dimension formula for fibres of a flat morphism [51, Proposition III.9.5].
In general, for an action , the set of orbits is not a scheme. Instead, we ask for a universal quotient in the category of schemes.
Definition 1.21** (Categorical quotient).**
For an action of an affine algebraic group on scheme , a categorical quotient is a -invariant morphism of schemes which is universal (that is, every other -invariant morphism factors uniquely through so that there exists a unique morphism such that ). If the preimage of each -point in is a single orbit, then we say is an orbit space.
If a categorical quotient exists, then it is unique up to unique isomorphism. In cases where a categorical quotient does not exist, one may want to enlarge the category of schemes to algebraic spaces or even algebraic stacks.
A categorical quotient is constant on orbits and orbit closures. Hence, a categorical quotient is an orbit space only if the action of on is closed; that is, all the orbits are closed.
Example 1.22**.**
- (1)
For acting on by scalar multiplication , there are two types of orbits:
- •
punctured lines through the origin,
- •
the origin (closed of dimension [math])
Every orbit contains the origin in its closure. As any -invariant function on is constant on orbits and their closures, it must be constant and so factors via the structure map . Hence, the structure map is a categorical quotient. 2. (2)
For the action of on by , the orbits are
- •
conics for (closed of dimension ),
- •
the punctured -axis,
- •
the punctured -axis,
- •
the origin (closed of dimension [math]).
The punctured axes both contain the origin in their orbit closures. We will see that the categorical quotient for this action is given by .
We see the sort of problems that may occur when we have non-closed orbits. In the first example, our geometric intuition tells us that we would ideally like to remove the origin and then take the quotient of acting on to obtain the projective space , which is an orbit space for this action. We will return to this example and see that by introducing a non-trivial notion of GIT semistability, we can remove the origin (see Example 2.20).
There is the following stronger notion of quotient that arises in GIT [87, Definition 1.5].
Definition 1.23** (Good quotient).**
A morphism is a good quotient for an action of on if
- i)
is -invariant, surjective and affine; 2. ii)
The map is an isomorphism. 3. iii)
If and are disjoint -invariant closed subschemes, then and are disjoint closed subschemes.
If moreover the preimage of each point is a single orbit, then we say is a geometric quotient.
Remark 1.24**.**
- (1)
The definition of a good quotient is local in the target, which enables the construction of good quotients via gluing. 2. (2)
The last two conditions imply that is surjective: the second property shows that is dominant (i.e. the image of is dense in ) and the third condition shows that the image of is closed. Furthermore it implies that for each , the preimage contains a unique closed orbit, whose stabiliser is reductive (see Definition 2.3 and [68], as the quotient of the reductive group by the subgroup is affine if and only if is reductive). In particular, if all orbits are closed, then is a geometric quotient. 3. (3)
The third condition also enables us to determine when two orbit closures meet: we have if and only if . 4. (4)
Any good quotient is a categorical quotient; see [79, Proposition 3.11].
Let us relate the construction of moduli spaces with categorical quotients. For a moduli problem , a family over a scheme has the local universal property if for any other family over a scheme and for any -point , there exists a neighbourhood of in and a morphism such that .
Proposition 1.25**.**
[79, Proposition 2.13]* Let be a moduli problem for which there exists a family over with the local universal property. Suppose that there is an algebraic group acting on such that two -points lie in the same -orbit if and only if . Then*
- (1)
any coarse moduli space is a categorical quotient of the -action on ; 2. (2)
a categorical quotient of the -action on is a coarse moduli space if and only if it is an orbit space.
Proof.
For any scheme , we claim that there is a bijective correspondence
[TABLE]
given by , which is -invariant by our assumptions about the -action on . Conversely, given a -invariant morphism , we define to associate to a family over , a morphism glued together locally by using the local universal property of over . More precisely, we can cover by open subsets such that there is a morphism and . For , we have
[TABLE]
and so by assumption and lie in the same -orbit. Since is -invariant, the compositions glue to a morphism .
Hence, if is a coarse moduli space, then is -invariant and the universal -invariant morphism from , which proves statement a). Furthermore, the -invariant morphism is an orbit space if and only if is bijective, which proves statement b). ∎
2. Mumford’s reductive geometric invariant theory
The origins of GIT go back to 19th century invariant theory and a question of Hilbert on the finite generation of invariant rings. We begin with Hilbert’s 14th problem in 2.1 and describe other techniques for constructing quotients in 2.2. We give various definitions and examples related to reductive and unipotent groups in 2.3, and prove finite generation results for invariant rings in 2.4. We describe Mumford’s GIT [74] for affine schemes in 2.5, for projective schemes in 2.6 and in general in 2.7, and state a twisted affine version in 2.8. Beyond Mumford’s book [74], the notes of Newstead [79] and Thomas [92] provide excellent introductions to GIT.
2.1. Hilbert’s 14th Problem
Consider an action of an affine algebraic group on an affine scheme. The coaction determines a linear representation . Any -invariant morphism induces a homomorphism whose image is contained in the ring of -invariant functions
[TABLE]
Consequently one can ask if the inclusion of the ring of -invariant functions corresponds to a morphism of affine schemes.
Question 2.1** (Hilbert’s 14th Problem).**
Is a finitely generated -algebra?
Hilbert showed that for the general linear group over the complex numbers, the answer was yes. However, in general is not finitely generated, due to a counterexample of Nagata constructed using an action of a product of additive groups [75, 76]; see [35] for a survey of counterexamples to Hilbert’s 14th problem. Fortunately, Nagata also showed that the answer is yes for a large number of groups, namely reductive groups and in this case Mumford showed that taking the spectrum of the inclusion gives a categorical quotient of the action. For non-reductive groups, we will see in 5.1 that even when is finitely generated, taking the spectrum of the inclusion of invariants does not always yield a categorical quotient.
2.2. Constructions of quotients by affine algebraic groups
Before turning to Mumford’s GIT for reductive groups, let us give a brief summary of some important results about the construction of quotients of affine algebraic groups in general.
For a free action of an affine algebraic group on a scheme, there is a geometric quotient in the category of algebraic spaces by results of Artin [8] and Kollár [65]. For a finite group, it is easy to see that a free action induces an étale equivalence relation and any quotient of a scheme by an étale equivalence relation is an algebraic space. There are examples of free actions whose geometric quotient is not a scheme: an example of Hironaka gives an action of a finite group whose geometric quotient is not a scheme (see [51, Appendix B, Example 3.4.1]) and an example of Derksen gives an action of the additive group whose geometric quotient is not a scheme (see [29, Example 18]),
Rosenlicht [83] showed that for a connected affine algebraic group acting on an irreducible variety , there is a dense open subset which admits a geometric quotient. Unfortunately, this set is non-explicit, as his proof involves showing that the field of invariant rational functions is finitely generated (see [43, 19 Appendix Lemma 1]).
Remark 2.2** (Transfer Principle).**
Let be a closed subgroup of an affine algebraic group , then acts on by left multiplication and this action has a geometric quotient and the quotient map is étale locally trivial (see [20, p181] and [22, Theorem 1.16]).
For as above, suppose there is an action of on ; then for the diagonal action of on by , there is a geometric quotient ; for details on this construction, see [28, III 4]. The action of on itself by left multiplication induces a -action on such that there is a bijective correspondence between -orbits in and -orbits in . A scheme is a geometric -quotient of if and only if it is a geometric -quotient of . Furthermore, if is affine and the -action on extends to , then we have the following transfer principle (due to Roberts, see [43, 9])
[TABLE]
Note may not be affine, so is not necessarily finitely generated (see Remark 2.10).
2.3. Reductive groups
Let us fix some definitions for unipotent and reductive groups; we note that there are alternative equivalent formulations (for example, see [26, 71]).
Definition 2.3** (Unipotent and reductive groups).**
An affine algebraic -group is
- i)
unipotent if it is isomorphic to a subgroup of a standard unipotent group consisting of upper triangular matrices with diagonal entries equal to 1. 2. ii)
reductive if it is smooth and every connected unipotent normal subgroup is trivial (this second condition is often phrased as asking for the unipotent radical to be trivial). 3. iii)
geometrically reductive if for every finite dimensional linear representation and every non-zero -invariant point , there is a non-constant -invariant homogeneous polynomial such that . 4. iv)
linearly reductive if for every finite dimensional linear representation and every non-zero -invariant point , there is a non-constant -invariant linear polynomial such that .
Remark 2.4**.**
- (1)
is unipotent if and only if every finite dimensional linear representation has a non-zero fixed point. 2. (2)
G is linearly reductive if and only if every finite dimensional linear representation is completely reducible (that is, decomposes as a direct sum of irreducible representations) or equivalenty if taking -invariants on finite dimensional linear -representations is exact.
Example 2.5**.**
- (1)
The additive group is unipotent, as we have an embedding given by
[TABLE] 2. (2)
In characteristic , there is a finite subgroup where we define the functor of points of by associating to a -algebra ,
[TABLE]
This is represented by the scheme and so is a unipotent group which is not smooth. 3. (3)
The multiplicative group or any algebraic torus is linearly reductive: as any linear representation admits a weight decomposition
[TABLE]
Exercise 2.6**.**
Prove that any finite group of order not divisible by the characteristic of is linearly reductive. (Hint: consider averaging over the group.)
For smooth affine algebraic group schemes over , we have
[TABLE]
and all three notions coincide in characteristic zero. The first implication is immediate from the definitions and, in characteristic zero, the opposite implication goes back to Weyl and uses the representation theory of compact Lie groups (this is known as Weyl’s unitary trick). The equivalence between reductive and geometrically reductive for smooth affine group schemes was conjectured by Mumford after Nagata proved that every geometrically reductive group is reductive [76]; the opposite implication was proved by Haboush [45].
Let us state an important property of geometrically reductive group actions.
Lemma 2.7** (Geometrically reductive group actions separate closed orbits, [79, Lemma 3.3]).**
Let be a geometrically reductive group acting on an affine scheme . If and are disjoint -invariant closed subsets of , then there is an invariant function which separates these sets, i.e.
[TABLE]
2.4. Finitely generated rings of invariants
Recall that for an action of an affine algebraic group on an affine scheme , the associated action on the coordinate ring is rational.
Theorem 2.8** (Nagata, [76]).**
Let be a geometrically reductive group acting rationally on a finitely generated -algebra . Then the -invariant subalgebra is finitely generated.
We will outline the proof of this theorem in the significantly easier case when is linearly reductive; see [79, Theorem 3.4] for the full proof. In this case, one can construct a Reynolds operator, which is a projection onto the -invariants that satisfies for all and . If is finite, can be viewed as averaging over the group. Using the Reynolds operator, one can show that is Noetherian and then prove it is finitely generated. This approach is similar to Hilbert’s proof that over the complex numbers the ring of invariants for is finitely generated.
Proof of Theorem 2.8 (for linearly reductive groups).
Since is a finitely generated -algebra, it has a countable basis as a -vector space; thus can be written as an increasing union of finite dimensional vector spaces. By applying Lemma 1.18 to these vector spaces, we can write as an increasing union of finite dimensional -invariant vector spaces over .
Our assumption that is linearly reductive implies that the finite dimensional -representation is completely reducible. In particular, we can write as a sum of -representations
[TABLE]
and obtain a projection , which together induce a projection .
To show that this projection is a Reynolds operator, we need to show for all and . For this take , so and pick such that left multiplication restricts to a homomorphism of -representations
[TABLE]
As above, we write . Since , we have and by Schur’s Lemma, the image of each irreducible representation appearing in is either zero or isomorphic to that irreducible representation, thus . If we write , then
[TABLE]
Hence, as required.
For any ideal , we have , and using the Reynolds operator, one can show the opposite inclusion. Hence and from this we deduce is Noetherian: any increasing chain of ideals in must stabilise, as the corresponding chain of ideals stabilises due to being Noetherian.
By choosing generators for the -algebra , we can realise it as a quotient of a polynomial ring with linear -action . Since any -equivariant homomorphism of algebras commutes with their Reynolds operators, we obtain a surjection and so to show is finitely generated, it suffices to show is finitely generated. Thus we may assume is a polynomial ring with linear action . Since is Noetherian, the ideal is finitely generated and the generators of this ideal are generators of as a -algebra. ∎
Popov [80] proved a converse to Nagata’s theorem: for any non-reductive group there is an affine scheme such that is not finitely generated.
In some simple situations, the ring of invariants for a non-reductive group is finitely generated; however, the corresponding morphism of schemes may fail to be a good quotient (see 5.1).
Theorem 2.9** (Weitzenböck [94]).**
Assume that the characteristic of is zero, then any linear -action on extends to . In this case, the invariant ring is finitely generated.
Proof (after Seshadri [85]).
The first statement follows by putting the associated locally nilpotent derivation (see 5.3) in Jordan normal form, so that each length Jordan block corresponds to the standard -representation ; for details, see [43, Lemma 10.2].
Assuming that the linear -action extends to , let us prove that the ring of invariants is finitely generated. By the transfer principle (Remark 2.2), the ring of -invariants on is isomorphic to the ring of -invariants on .
For -acting on by left multiplication, the bottom row is invariant and thus the map given by sending a matrix to its bottom row is -invariant. In fact, this map is an orbit space and ; one way to see this is to note that is the -stabiliser of and its orbit is isomorphic to . Since the -action on extends to , we have an -equivariant isomorphism
[TABLE]
This is only quasi-affine (so its coordinate ring may not be finitely generated), but as has codimension 2, any regular function extends from to by Hartogs’ lemma. Hence,
[TABLE]
is finitely generated. ∎
Remark 2.10**.**
The second part of the above proof for can be extended to any Grosshans subgroup, which is a closed subgroup of a reductive group such that is quasi-affine and is finitely generated (see [43] for a detailed treatment). Grosshans [41] shows is finitely generated if and only if can be emdedded in an affine variety with complement of codimension , and proves that unipotent radicals of parabolic subgroups in a reductive group are Grosshans subgroups [42]. For a Grosshans subgroup , the same proof shows that if a -action an an affine scheme extends to (which is not immediate as in the case of Weitzenböck’s Theorem), then is finitely generated.
The proof shows that for a non-reductive group, even if the ring of invariants is finitely generated, taking its spectrum does not necessarily provide a categorical quotient: for -acting on , we have , but the induced map is not surjective, so is not the categorical quotient. In fact, even worse, the image may only be a constructible subset (see 5.1). In the next subsection we will see that when is reductive, taking the spectrum of the ring of invariants does give a categorical quotient.
2.5. Affine geometric invariant theory for reductive groups
Let be a reductive group acting on an affine scheme . There is an induced action of on the coordinate ring and the ring of invariants is a finitely generated -algebra by Nagata’s Theorem.
Definition 2.11** (Affine GIT quotient).**
For an action of a reductive group on an affine scheme , the affine GIT quotient is the morphism of affine schemes associated to the inclusion .
The double slash notation used for the GIT quotient is a reminder that this quotient is not necessarily an orbit space and so it may identify some orbits. In nice cases, the GIT quotient is an orbit space and in this case we shall write .
Theorem 2.12** (Mumford, [74, Theorem 1.1]).**
For a reductive group acting on an affine scheme , the affine GIT quotient is a good quotient and thus categorical quotient.
We will not include the proof of this result, but we note that the proof that the affine GIT quotient is good uses several properties of reductive group actions beyond simply the finite generation of the invariant ring: for a geometrically reductive group, invariant functions can be used to separate closed orbits (see Lemma 2.7) and, for linearly reductive groups, the proof can be simplified by using the fact that taking invariants is exact.
The affine GIT quotient restricts to a geometric quotient on an open stable subset .
Definition 2.13**.**
A point is stable if its orbit is closed in and (or equivalently, ). We let denote the set of stable points.
For , we note that is stable if and only if is proper. Indeed if is proper, then its image is closed and the fibres, being both affine and proper, must be finite. Conversely if is stable, then has finite fibres and one can show it is finite.
Example 2.14**.**
For the -action on by , we have with affine GIT quotient is given by . It is not a geometric quotient, as the three orbits consisting of the punctured axes and the origin are all identified. The stable locus is the complement of , which admits a geometric quotient .
If we remove the origin, the affine line with a double origin is a geometric quotient of . In this case, we obtain a non-separated quotient as a categorical quotient of a separated scheme.
.
Example 2.15**.**
Consider acting by conjugation on the space of matrices with -coefficients. The trace and determinant (which are the coefficients of the characteristic polynomial) are invariant functions, and so
[TABLE]
We will soon see this is in fact an equality.
First, we describe the orbits using the theory of Jordan normal forms. As any orbit contains a matrix in Jordan normal form, there are three types of orbits:
- •
Matrices with distinct eigenvalues and Jordan normal form
[TABLE]
These are closed 2 dimensional orbits, with 2 dimensional stabiliser (diagonal matrices).
- •
Matrices with repeated eigenvalue and Jordan normal form with one block
[TABLE]
These orbits are also 2 dimensional but are not closed: for example
[TABLE]
- •
Matrices with repeated eigenvalue and Jordan normal form with two blocks
[TABLE]
The stabiliser of such a matrix is and its orbit is a point, which is closed.
Every orbit closure of the second type contains an orbit of the third type.
Let us show that . Since any orbit closure contains a diagonal matrix, any invariant function is completely determined by its values on the diagonal matrices and is invariant under permuting the diagonal entries. Hence
[TABLE]
by the theory of (elementary) symmetric polynomials.
The affine GIT quotient is . Since scalar multiples of the identity fix every point, there are no stable points for this action; however, the restriction to the locus of matrices with distinct eigenvalues is a geometric quotient.
Exercise 2.16**.**
Show the GIT quotient of acting on by conjugation is .
Newstead constructs moduli spaces of cyclic endomorphisms of vector spaces [79, Chapter 2].
2.6. Projective geometric invariant theory
Suppose that a reductive group acts on a projective scheme linearly (i.e. by a representation ). The homogeneous coordinate ring of is the graded ring
[TABLE]
where denotes the pullback of to . By Nagata’s theorem, is finitely generated. The inclusion determines a rational map of projective schemes
[TABLE]
whose indeterminacy locus is the closed subscheme of defined by the homogeneous ideal . The domain of definition of this map is the GIT semistable locus.
Definition 2.17**.**
Let be a reductive group acting linearly on a projective scheme .
- i)
We say is semistable if there exists a -invariant homogeneous function for some such that . We write for the open set in of semistable points; this is the domain of definition of (1). 2. ii)
We say is stable222Usually stability is defined by asking for and for the existence of for some non-vanishing at such that the -action on is closed; however, this is equivalent to the stated definition. if its orbit is closed in and its stabiliser is zero dimensional. We write for the open set in of stable points. 3. iii)
The restriction of the rational map (1) to the semistable locus is called the projective GIT quotient, which is projective over .
Rather confusingly, a point is called unstable if it is not semistable; this terminology is now standard and there is not much we can do to change it! We refer to points which are semistable but not stable, as strictly semistable. If there are several groups acting on , we clarify which group we mean by talking about -(semi)stability.
By definition, is open as it is the domain of definition of the rational map (1). To see that is open, we use the equivalent formulation and note it is the intersection of two opens: the set of points with zero dimensional stabiliser is open as is upper semi-continuous and the union of for on which the action on is closed is open.
Theorem 2.18** (Mumford, see [79, Theorem 3.14]).**
For a reductive group acting linearly on a projective scheme , the projective GIT quotient is a projective and good quotient, which restricts to a quasi-projective and geometric quotient of .
This result can be proved by gluing together affine GIT quotients: for , the non-vanishing locus is affine with affine GIT quotient and we can write as the union of these open affines , so is covered by the open affines .
We have if and only if the orbit closures of and meet in . Furthermore, the preimage of any point in contains a unique closed orbit (of minimal dimension in this preimage), whose stabiliser is reductive (see Remark 1.24).
Remark 2.19**.**
It is important to note that the semistable set and the GIT quotient both depend on the -equivariant embedding , as the homogeneous coordinate ring depends on this embedding (or equivalently on the line bundle pulled back from ).
Alternatively, rather than fixing a linear -equivariant projective embedding of , one can instead fix an ample -equivariant line bundle on , which is often called an ample -linearisation: is an ample invertible sheaf on together with a -equivariant structure given by an isomorphism , where denote the action and second projection, which satisfies a cocycle condition on . In terms of the associated geometric line bundle, which by abuse of notation we shall also call , this is equivalent to a -action on commuting with the projection such that the action on the fibres is linear.
Given an ample -equivariant line bundle on , we obtain a graded ring with a -action
[TABLE]
such that the inclusion of invariants induces a rational map whose domain of definition is the semistable set and whose codomain is the GIT quotient (both with respect to )
[TABLE]
Since replacing with a positive power just has the effect of changing the grading on this ring333By construction, the projective GIT quotient comes with a line bundle and this regrading does not change the GIT quotient but does change this line bundle., we can assume is very ample and then we obtain a linear -equivariant embedding where , which recovers the above setting of a linear action.
The effect of changing is called variation of GIT and can be described in terms of certain birational transformations known as VGIT flips [31, 91]. Furthermore, the space of -linearisations admits a wall and chamber decomposition describing how semistability varies: in chambers, semistability coincides with stability, but semistability changes on crossing a wall.
2.7. General GIT quotients
More generally, given a scheme with a -linearisation , Mumford defines a GIT quotient using invariant sections of positive powers of whose non-vanishing locus is affine (so that one can take affine GIT quotients and glue them). This produces a good quotient of a ‘semistable locus’ ([74, Definition 1.7]), which in this situation is defined to be the set of points such that there exists for with and such that is affine444If is projective and is ample, then this non-vanishing locus is always affine.. The semistable set and quotient obtained in this way are both quasi-projective (see [79, Theorem 3.21]).
Let us remark that in this survey we have assumed that we are working over an algebraically closed field . The assumption that is algebraically closed can be dropped, but one has to be careful about rationality questions and work with geometric points for certain statements (for example, the Hilbert–Mumford criterion). Moreover, Seshadri [88] extended GIT to work relative to a base scheme with mild assumptions on .
2.8. Affine GIT linearised by a character
As a special case of 2.7, consider a linear action of on an affine scheme ; then the structure sheaf is naturally equipped with a -equivariant structure, where if we view this as a geometric line bundle , the -action on is trivial. In this case, the GIT quotient with respect to this ample -linearisation is just the affine GIT quotient, as
[TABLE]
with trivial -action on and this ring is graded by the degree of , thus
[TABLE]
This linearisation can be modified by using a character to obtain a linearisation which is given by acting linearly on the geometric line bundle by the given action on and acting via multiplication with on . The outcome of applying GIT in this situation of twisting the linearisation by a character was described by King [62] and results in an open subset of -semistable points and a GIT quotient
[TABLE]
In this case, the -invariant sections of are with for all and , which we refer to as -semi-invariant functions of weight . By definition, is -semistable if there exists a -semi-invariant function of weight which is non-vanishing at . Furthermore, is projective over the spectrum of the [math]th-graded piece which is just the affine GIT quotient .
Example 2.20**.**
For acting on by scalar multiplication linearised by for given by , the coordinate functions are -semi-invariant functions of weight and these generate the ring of invariants. Consequently, we obtain the GIT quotient
[TABLE]
3. Semistability and instability in reductive GIT
Since the reductive GIT quotient only provides a quotient of an open semistable locus, this naturally leads to two questions: can we describe the semistable points and what can we say about unstable (i.e. not semistable) points? For actions on projective (over affine) schemes, the first question is tackled by the Hilbert–Mumford criterion for semistability described in 3.1. In moduli problems with a natural notion of subobjects, the Hilbert–Mumford criterion often gives a clean moduli-theoretic interpretation of GIT semistability. We state some application of reductive GIT to moduli in 3.2. We then turn to the second question in 3.3 and describe how work of Kempf [61], Hesselink [53], Kirwan [63] and Ness [78] gives a stratification of the unstable locus, with a largely combinatorial flavour; we survey some applications of these stratifications and discuss the question of construction quotients of unstable strata, where naturally non-reductive groups (namely, parabolic subgroups, representing an instability flag) appear.
3.1. Semistability and the Hilbert–Mumford criterion
By definition, semistability in reductive GIT is given in terms of the existence of a non-vanishing invariant section. From this definition, it is extremely challenging to determine semistability, as it is essentially equivalent to computing invariant rings, which is a notoriously challenging problem. Fortunately, in certain situations (projective GIT, affine GIT linearised by a character or more generally a projective over affine set-up), the Hilbert–Mumford criterion reduces semistability to checking semistability for -actions, which in turn can be combinatorially described using the weights of the action. More precisely, a -semistable point is semistable for any subgroup, and thus in particular, for any contained in ; the Hilbert-Mumford criterion gives a converse to this statement.
For simplicity, throughout this section, we assume we have a linear representation of a reductive group and consider the associated linear action on . We will describe the semistable points in this setting. For a closed subscheme with a linear -action, we have and so it suffices to understand semistability on the ambient projective space. For an ample -linearisation on , using a power of puts us in this linear setting.
We will see several different versions of the Hilbert–Mumford criterion, which make it possible to determine semistability in practice. The first, and weakest, version is a topological criterion.
Proposition 3.1** (Topological Hilbert–Mumford criterion, [74, Proposition 2.2]).**
For a linear action of a reductive group on , the following statements hold for .
- i)
* is semistable if and only if ;* 2. ii)
* is stable if and only if and is closed in .*
Proof.
We will just give the proof of the first statement. By definition is semistable if and only if there is a -invariant homogeneous polynomial which is non-zero at . Since is -invariant it is constant on orbit closures, and so separates the closed schemes and [math], which shows these closed subschemes are disjoint. Conversely, if the closed -invariant schemes and [math] in are disjoint, then as is geometrically reductive, there exists a -invariant polynomial separating these subsets
[TABLE]
by Lemma 2.7. By considering the decomposition of into (-invariant) homogeneous pieces, we see there is a -invariant homogeneous piece which is non-vanishing at . ∎
Definition 3.2**.**
For a linear action of a torus on , consider the associated weight decomposition . We refer to the support of this decomposition as the -weights on . For , we write and define the -weight set of this point to be
[TABLE]
For a -action on a separated scheme, we will often use the following notation.
Notation 3.3**.**
If a morphism , with separated, extends to , then this extension is unique and we write . Similarly if extends to , we write .
We can now give a combinatorial description of (semi)stability for a -action in terms of whether or not the origin lies in (the interior of) the convex hull of -weights.
Proposition 3.4** (Hilbert–Mumford for -actions).**
For a linear action of on and , the following statements hold:
- i)
* is -semistable if and only if .* 2. ii)
* is -stable if and only if .*
Proof.
We again just prove the statement for semistability. By the topological Hilbert–Mumford criterion, we have that is -semistable if and only if . Any point in the boundary of this orbit closure is either
[TABLE]
Moreover, we have if and only if (and similarly if and only if ). Hence is -semistable if and only if there exists and in , or equivalently . ∎
Example 3.5**.**
The linear action of on by
[TABLE]
has weights . Hence, for a point to be (semi)stable it needs both these weights, which means its first coordinate must be non-zero and at least one of the other coordinates for must be non-zero. One can also see this by directly proving that
[TABLE]
In particular, and is a geometric -quotient.
The Hilbert–Mumford criterion will ultimately be a numerical criterion that phrases semistability in terms of the weights of 1-parameter subgroups (1-PS), which are non-trivial group homomorphisms .
Definition 3.6** (Hilbert–Mumford weight).**
For a linear action of a reductive group on , we define the Hilbert-Mumford weight of at a 1-parameter subgroup to be
[TABLE]
Let us note some useful properties of the Hilbert–Mumford weight.
Exercise 3.7**.**
Show that the Hilbert–Mumford weight of has the following properties.
- (1)
is the unique integer such that exists and is non-zero. 2. (2)
where (and this limit exists as is projective). 3. (3)
exists, with equality if and only if . 4. (4)
for all . 5. (5)
for a positive integer .
For a linear -action on , we see that for the 1-PS given by , we have
[TABLE]
and
[TABLE]
Hence is semistable if for and . Furthermore, is stable if and only if this inequality is strict for both 1-PSs. This is precisely the numerical version of the Hilbert–Mumford criterion that we now can state.
Theorem 3.8** (Hilbert–Mumford criterion, [74, Theorem 2.1]).**
For a reductive group acting linearly on a projective scheme , the following statements hold for .
- i)
* is semistable if and only if for all 1-PS ,* 2. ii)
* is stable if and only if for all 1-PS .*
Note that it suffices to check these inequalities for primitive 1-PSs (i.e. 1-PSs which are not positive powers of another 1-PS); see Exercise 3.7.
The full proof of the Hilbert–Mumford criterion is beyond the scope of this survey, but following the topological version (Proposition 3.1), it suffices to show that the reductive group has enough 1-PSs to detect if the origin is contained in the closure of orbits of linear actions , which is precisely the following result (see [74, p53] and [61, Theorem 1.4]), whose proof involves the Cartan-Iwahori decomposition for the reductive group .
Theorem 3.9** (Fundamental Theorem of GIT).**
Let be a reductive group acting on an affine space . If and , then there is a 1-PS of such that .
Remark 3.10** (Hilbert–Mumford weight for a linearised action).**
In the case of a linear -action on a projective scheme , the Hilbert–Mumford weight for defined above depends on the choice of -representation (as the weights depend on this representation).
In general, for a -linearisation on a projective -scheme , we consider the -fixed point . The linearisation induces a -representation on the fibre of over
[TABLE]
of weight (that is acts on this fibre by ). Then the Hilbert–Mumford weight (with respect to ) is defined to be minus the weight on this fibre
[TABLE]
In this linearised situation, the Hilbert–Mumford criterion says is semistable (with respect to ) if and only if for all 1-PS .
If and is the pullback of , then these two definitions coincide: we have by [74, Proposition 2.3].
As any 1-PS can be conjugated to lie in a fixed maximal torus , one can phrase -semistability of a point in terms of -semistability of all -translates of that point by Exercise 3.7 above. Then -semistability can be stated combinatorially using the torus weights analogous to Proposition 3.4 above. This gives a combinatorial Hilbert–Mumford criterion.
Proposition 3.11** (Torus weights version of Hilbert–Mumford criterion, [30, 9.4]).**
For a reductive group acting on a projective scheme linearly, fix a maximal torus . For , the following statements hold.
- i)
* is -(semi)stable if and only if is -(semi)stable for all .* 2. ii)
* is -semistable if and only if .* 3. iii)
* is -stable if and only if .*
By the first statement, the -semistable set is the -sweep of the -semistable set:
[TABLE]
Exercise 3.12** (Semistability for binary forms).**
Consider the action of on the space of degree binary forms . For corresponding to , show
- i)
is semistable if and only if all roots of have multiplicity less than or equal to ; 2. ii)
is stable if and only if all roots of have multiplicity strictly less than .
Remark 3.13** (Hilbert–Mumford criterion for action on affine scheme twisted by a character).**
For a linear action of a reductive group on an affine scheme linearised via for a character (see 2.8), King proved a topological Hilbert–Mumford criterion [62, Lemma 2.2], by using the total space of the dual linearisation to replace the affine cone, and obtained the following numerical Hilbert–Mumford criterion [62, Proposition 2.5]:
- (1)
is -semistable if and only if for all 1-PS such that exists, 2. (2)
is -stable if and only if for all 1-PS such that exists,
where if , i.e. this is the natural pairing between characters and cocharacters. Using the abstract definition of the Hilbert–Mumford weight in terms of the weight of the action on the fibre over the limit point of the -action (see Remark 3.10), we see that if exists, then
Exercise 3.14**.**
Using Remark 3.13, show that for acting on by left multiplication the (semi)stable locus for the character is the matrices of maximal rank (namely rank ), and so the GIT quotient is the Grassmannian .
We note that there is a Hilbert–Mumford criterion in the more general setting of a projective over affine variety with an action of a linearly reductive group [44].
3.2. A brief survey of applications of reductive GIT to moduli
The notion of moduli functor is heavily influenced by Grothendieck’s approach to algebraic geometry. Furthermore, Grothendieck proved that the Hilbert and Quot functors are representable by projective schemes; these are fine moduli spaces and provide parameter spaces in the GIT constructions of moduli of smooth projective curves and moduli of vector bundles on curves.
The first truly interesting application of GIT was Mumford’s construction of moduli spaces of curves [74, Chapter 5]. Mumford constructed a coarse moduli space for smooth projective curves of genus by using a power of the canonical bundle to give a projective embedding and constructing as a quotient of a suitable Chow variety parametrising pluricanonical curves. Gieseker [38] provided an alternative GIT construction of and its Deligne–Mumford compactification via stable curves as a quotient of the -action on a suitable Hilbert scheme with a linearisation given by embedding in a Grassmannian associated to a sufficiently large choice of . Although there is a direct proof that smooth curves are (asymptotically) GIT stable, the proof that stable curves are (asymptotically) GIT stable is indirect (see [72, 3.1]). Gieseker’s Hilbert scheme construction is now the prevalent perspective, which has been generalised to give GIT constructions of moduli spaces of pointed stable curves and stable maps and their (birational) geometry is studied using VGIT (see [67, 72]).
The other influential and successful application of GIT was the construction of moduli spaces of vector bundles (of fixed rank and degree) on a fixed smooth projective curve . One of the first ideas to construct vector bundle moduli spaces over was to use unitary representations of the fundamental group and led to the Narasimhan–Seshadri Theorem [77] relating irreducible representations with stable vector bundles considered by Mumford [73], where Mumford’s notion of stability came from the Hilbert–Mumford criterion in GIT and involves verifying an inequality of slopes (the ratio of the degree and the rank) for all subbundles. For moduli problems with a natural notion of subobjects, the study of 1-PSs in GIT often corresponds to filtrations by subobjects and stability can be phrased as an inequality for all subobjects. The GIT construction of moduli spaces of (semi)stable vector bundles was given by Seshadri [86] (see [79, Chapter 5]), and was later generalised by Simpson [89] to construct moduli spaces of sheaves (and Higgs sheaves) on higher dimensional schemes as GIT quotients of Quot schemes. Quot schemes appear as semistable vector bundles can be parametrised as quotients of a fixed vector bundle as mentioned in Example 1.12(4). This construction has been generalised to construct various bundle moduli spaces [84].
Mumford also applied GIT to construct moduli spaces of projective hypersurfaces of degree by taking a quotient of acting on as in Example 1.12 (3). He showed smooth hypersurfaces are GIT stable if and (see [74, Chapter 4.2]).
King [62] developed GIT for a linear action on an affine space with respect to a character (see 2.8) to construct reasonable moduli spaces of semistable representations of a quiver, where semistability depends on a stability parameter; the Hilbert–Mumford criterion gives a moduli-theoretic interpretation of semistability as an inequality holding for all subrepresentations.
3.3. Instability
In this section, we continue to suppose that we have a reductive group acting on a projective scheme linearly. Since the GIT quotient provides a categorical quotient of the semistable locus , it is natural to ask what can be said about the unstable points (i.e. not semistable points)
[TABLE]
By the Hilbert–Mumford criterion, if a point is unstable, then it has a negative Hilbert–Mumford weight for some 1-PS. Starting from this observation, Kempf [61] associated to an unstable orbit a conjugacy class of 1-PSs which are ‘most responsible’ for its instability, in the sense that they minimise a ‘normalised Hilbert–Mumford weight’. Hesselink then used Kempf’s work to statify the unstable locus [53]. This stratification was described more explicitly and, when , compared with a Morse stratification associated to the norm square of the moment map for the action of a maximal compact subgroup by Kirwan [63] and Ness [78].
Let us start by describing how to fix a conjugation invariant norm on 1-PSs of .
Definition 3.15**.**
A conjugation invariant norm on 1-PSs of a reductive group is given by fixing a maximal torus and a Weyl-invariant integral-valued bilinear form on the 1-PSs of with associated norm . For any 1-PS , there exists such that and we define
[TABLE]
which is independent of the choice of due to the Weyl invariance.
Over the complex numbers, such a norm can be constructed by fixing a Weyl invariant inner product on the Lie algebra of , which gives an identification .
Example 3.16**.**
If and is the diagonal maximal torus, then the Euclidean norm on is invariant under the Weyl group .
Using the norm , we define a normalised Hilbert–Mumford weight and state Kempf’s notion [61] of an adapted 1-PS for an unstable point.
Definition 3.17** (Normalised Hilbert–Mumford weight and adapted 1-PS).**
For a reductive group acting on a projective scheme linearly and a fixed conjugation invariant norm on 1-PSs of , we define the normalised Hilbert–Mumford weight of at a 1-PS to be and we define the minimum normalised Hilbert–Mumford weight of to be
[TABLE]
If is unstable, a primitive 1-PS is said to be adapted to if it acheives this minimum and we write for the set of primitive 1-PSs adapted to .
Let us collect Kempf’s results on adapted 1-PSs in the following theorem.
Theorem 3.18** (Kempf, [61]).**
Let be a reductive group acting on a projective scheme linearly and fix a conjugation invariant norm on 1-PSs of . Then for an unstable point , we have and there is a parabolic subgroup with the following properties.
- i)
For any , we have (see Definition 3.19 below). 2. ii)
Any two 1-PSs in are conjugate by an element of . 3. iii)
If is a maximal torus with , then is a single Weyl orbit. 4. iv)
We have for all . 5. v)
If , then and also where .
Although we will not give the details on the proof of this theorem, the existence of an adapted 1-PS boils down to the fact that one can work in a maximal torus (by translating using the -action) and then the normalised Hilbert–Mumford weight for 1-PS in a given maximal torus can be determined from subsets of the torus weights of the action, which is a finite set (see also Remark 3.25). The last two properties follow from Exercise 3.7.
Definition 3.19** (Parabolic and Levi group associated to a 1-PS).**
For a 1-PS of a reductive group , we define
[TABLE]
then is a parabolic subgroup with Levi subgroup and unipotent radical and is a retraction onto the Levi.
Hesselink [53] stratified the unstable locus by pairs consisting of a conjugacy class of an adapted 1-PS and a minimum normalised Hilbert–Mumford weight . Before, we give the concrete construction of this stratification, we provide a summary of its properties and give an overview of various applications. As is customary, we include the semistable set as the lowest stratum in this stratification.
Theorem 3.20** (Kempf [61], Hesselink [53], Kirwan [63], Ness [78]).**
For a reductive group acting linearly on a projective scheme and a conjugation invariant norm on 1-PSs of , there is a finite instability stratification
[TABLE]
into locally closed subschemes with a partial ordered index set with the following properties.
- i)
The lowest stratum is indexed by and we have . 2. ii)
The closure of any stratum is contained in the union of higher strata: . 3. iii)
* is determined combinatorially from the weights on of a maximal torus .* 4. iv)
The strata can be determined from simpler limit sets, which are GIT semistable loci for smaller reductive group actions with a twisted linearisation.
We will soon make the last two statements more precise. First, let us state some applications of these instability (or Hesselink–Kempf–Kirwan–Ness) stratifications.
Remark 3.21**.**
When is smooth, these stratifications have been used in the following ways:
- (1)
By Kirwan [63], over compute the -equivariant rational Betti numbers of (which coincides with the rational Betti numbers of when ) by showing the Gysin long exact sequences in equivariant cohomology for this stratification split. Hence, there is a surjection known as the Kirwan map
[TABLE]
with explicit kernel. 2. (2)
By Dolgachev–Hu [31] and Thaddeus [91], to describe the birational transformations between GIT quotients given by varying the linearisation. 3. (3)
By Halpern-Leistner [47] and Ballard–Favero–Katzarkov [10], to construct semi-orthogonal decompositions in the derived category of a (stacky) GIT quotient. 4. (4)
By Halpern-Leistner [48], as inspiration to formulate an abstract notion of a -stratification on a stack.
Remark 3.22**.**
Over , there is a close relationship between GIT quotients and symplectic reductions, which are quotients in symplectic geometry. A smooth projective variety is Kähler and thus has a symplectic form (inherited from the Fubini-Study form on ). For a representation , there is a maximal compact subgroup which acts by unitary transformations, and preserves the Kähler form. Moreover, there is a moment map to the co-Lie algebra of such that the GIT quotient is homeomorphic to the symplectic reduction (the quotient of the zero level set of the moment map by , see [69]):
[TABLE]
via the Kempf–Ness Theorem [60] (see also [92]). More precisely, is semistable if and only if its -orbit closure meets (and if this intersection is non-empty, it consists of a unique -orbit). Furthermore, by work of Kirwan [63] and Ness [78], the GIT instability stratification with respect to a conjugation invariant norm coincides with the Morse stratification associated to the norm square of the moment map (see also [74, Chapter 8]).
We now turn to the (first set-theoretic) construction of the unstable strata in the situation of a reductive group acting linearly on a projective scheme with a fixed conjugation invariant norm on 1-PSs on .
Definition 3.23** (Unstable strata, blades and limit sets).**
For , we define the associated unstable stratum
[TABLE]
If we fix a representative of , then we define limit sets and blades as follows
[TABLE]
where . We define the index set , which turns out to be finite (see Remark 3.25 below).
Let us also introduce a closed subscheme of the -fixed locus on which the normalised Hilbert–Mumford weight of is and its attracting set under the flow by as :
[TABLE]
Note that and ; we will soon see these are open subsets and thus we can give these sets a scheme structure. Furthermore, these schemes all depend on the chosen representative of . Recall that and depend on a choice of 1-PS as well as , whereas only depends on (and not ). Note that in [63], is denoted by .
Since the notion of adapted 1-PS depends on the choice of norm, the unstable strata also depend on this choice of norm, but does not.
Proposition 3.24** (Kirwan, [63, 12]).**
Assume that is smooth. For an unstable index , the stratum can be described as follows.
[TABLE]
Furthermore, the blades and limit sets can be described as follows.
- i)
The retraction is a Zariski locally trivial affine space fibration555This follows by work of Białynicki-Birula [19] describing the decomposition of a smooth projective variety with a -action by taking the flow as . Here it is crucial that is smooth for the fibres to be affine spaces.. 2. ii)
The scheme is preserved by the -action and is preserved by the -action. Moreover, is equivariant with respect to the retraction . 3. iii)
For , we have . 4. iv)
* is the GIT semistable set for the action of the reductive Levi subgroup on with respect to a canonical linearisation obtained by twisting by a rational multiple666Via , we can identify characters and co-characters, so we twist by the rational character corresponding to the rational 1-PS , so that for to ‘cancel’ the effect of . of a character corresponding to .*
Remark 3.25**.**
For acting linearly on with a fixed choice of norm , we can compute the index set of the instability stratification in terms of the finitely many weights of the action of a maximal torus . For any subset of the -weights whose convex hull does not contain the origin (that is, this is a weight set of an unstable point), we let be the primitive 1-PS of corresponding under to the ray in the through the closest point to [math] in the convex hull of this weight set and define to be the minimum normalised Hilbert–Mumford weight of any point with this weight set, then (see [63, Lemma 12.6]). In particular, is finite as there are only finitely many -weights.
Exercise 3.26** (Instability stratification for binary forms).**
Consider the action of on as in Exercise 3.12 and show for each integer , there is an unstable stratum corresponding to binary forms with a root of exactly multiplicity .
In the situation of a reductive group acting linearly on an affine space linearised by a character, using King’s Hilbert–Mumford criterion [62, Proposition 2.5] (see also Remark 3.13), one can construct an instability stratification [54]. For the action of on by left multiplication generalising Exercise 3.14, the (semi)stable locus for the character is the matrices of maximal rank and the instability stratification is given by the rank (see [55, Example 2.14]).
Given an instability stratification (2), we can ask if there is a categorical -quotient of an unstable strata , or equivalently via the isomorphism (3), a categorical -quotient of .
Proposition 3.27** (Categorical quotients of unstable strata, [57, Lemma 3.1]).**
The composition of with the reductive GIT quotient is a categorical -quotient.
Proof.
This composition is -invariant, as is -equivariant and is -invariant. Given a -invariant morphism , its restriction to is -invariant and since is constant on orbit closures, we have . Then by the universal property of , we see that (and thus also ) factors uniquely via . ∎
Furthermore, by [57, Lemma 3.1] we have
[TABLE]
which is finitely generated, and the categorical quotient coincides with the projective spectrum of the invariants. However, this categorical quotient factors via the retraction and so identifies every with . Thus this categorical quotient is far from being an orbit space. Since the closed subscheme causes these identifications, we would like to further twist the linearisation to make become unstable and prevent these unwanted identifications. It is at this point where we see that the non-reductive action of on is preferable to the reductive action of on : the parabolic subgroup has more characters which can be used to twist the given ample linearisation; in 6.3, we explain how to apply non-reductive GIT.
In fact, since linearisations give line bundles on quotient stacks, we have a line bundle
[TABLE]
but whilst the corresponding -equivariant line bundle on is ample, the corresponding -equivariant line bundle on is not ample (see [57, Remark 9]) and so is not suitable for working with from the perspective of GIT.
4. Generalisations of reductive GIT to stacks
In 4.1 we describe different types of moduli spaces for stacks and in 4.2, we state a recent existence criterion [7] for stacks to admit a good moduli space.
Throughout this section, for simplicity, we will assume that our algebraically closed field is of characteristic [math] to avoid the distinction between linearly reductive and geometrically reductive groups in positive characteristic, which in turn leads to a distinction between good moduli spaces and adequate moduli spaces for stacks. We will assume all stacks are noetherian algebraic stacks over ; however, everything in this section also extends to a relative setting. For a detailed introduction to algebraic spaces and stacks with a focus on moduli, see [1, 3].
4.1. Moduli spaces for stacks
Associated to an action , there is a quotient stack whose points (and residual gerbes) describe the orbits (and stabilisers) of the action. A categorical quotient of this action is equivalent to a universal map from to a scheme. One can naturally ask if an arbitrary stack has a universal map to a scheme (or possibly an algebraic space, as was necessary in 2.2). If additionally, as in the definition of a coarse moduli space for a moduli functor, we ask for a bijection on -points, this leads to the following notion.
Definition 4.1**.**
A coarse moduli space (CMS) for a stack is a map to an algebraic space which is initial for maps from to algebraic spaces and such that the induced map is bijective.
Keel and Mori [59] studied quotients of groupoids in the category of algebraic spaces and proved the existence of a categorical quotient under the assumption of finite stabilisers; let us state a stacky reformulation as in [25].
Theorem 4.2** (Keel–Mori Theorem).**
An algebraic stack over with finite inertia stack admits a coarse moduli space with the following properties
- i)
* is an isomorphism;* 2. ii)
If is separated (resp. of finite type) over , then is separated (resp. of finite type) over ; 3. iii)
* is a proper universal homeomorphism;* 4. iv)
Any flat base change of is also a coarse moduli space.
In particular, the Keel–Mori Theorem applies to separated Deligne–Mumford stacks. Requiring to induce a bijection on closed points is very strong: as soon as there are non-closed orbits this condition fails. Fortunately Alper [2] adapted the reductive GIT notion of good quotient to the setting of stacks as follows.
Definition 4.3**.**
A good moduli space (GMS) for a stack is a quasi-compact quasi-separated morphism to an algebraic space such that
- (1)
the pushforward map on quasi-coherent sheaves is exact, 2. (2)
the natural map is an isomorphism.
The following example shows this theory applies to quotients of linearly reductive groups.
Example 4.4** (Alper, [2, Example 12.9]).**
For the classifying stack of an affine algebraic group, the map satisfies the second property in the definition of a good moduli space. Then is given by taking -invariants of a linear representation of . Hence, the first condition holds if and only if is linearly reductive.
Subsequently, Alper later adapted his theory to include geometrically reductive groups in positive and mixed characteristic by giving a notion of an adequate moduli space [3], which weakens the first property in the definition of good moduli spaces.
Example 4.5** (Alper, [2, Theorem 13.6]).**
For a linearly reductive group acting on an affine scheme , the quotient stack admits a good moduli space given by the affine GIT quotient. More generally, for a linearly reductive group acting on a scheme with respect to an ample linearisation , the morphism is a good moduli space.
Let us note some important properties of good moduli spaces.
Remark 4.6**.**
If is a good moduli space, then it has the following properties.
- (1)
The morphism is initial among maps from to algebraic spaces [2, Theorem 6.6] and is surjective and universally closed [2, Theorem 4.16 (i) and (ii)]. 2. (2)
For every point , there is a unique closed point in and the automorphism group of is linearly reductive [2, Theorem 9.1 and Proposition 12.14]. 3. (3)
The morphism induces a bijection between closed points in and closed points in [2, Theorem 4.16 (iv)]. 4. (4)
If is of finite type over , then so is [2, Theorem 4.16 (xi)]. 5. (5)
Any base change of along a morphism of quasi-separated algebraic spaces is also a good moduli space [2, Proposition 4.7].
4.2. Stability and existence criteria
Halpern-Leistner [46] and Heinloth [52] studied how ideas in reductive GIT, such as the Hilbert–Mumford criterion, can be applied to stacks. The role of 1-PSs and their limits can be replaced by the stack Theta:
[TABLE]
over , for the -action on by scalar multiplication. This stack plays a prominent role in the work of Halpern-Leistner [46] and led to a notion of -stability for stacks and a generalisation of GIT instability stratifications to -stratifications of stacks [48].
In this section, we will state a recent existence theorem of Alper, Halpern-Leistner and Heinloth [7] which gives necessary and sufficient conditions for a stack to admit a good moduli space. These conditions are valuative criteria known as -reductivity and S-completeness.
Definition 4.7** (Valuative criteria for stacks).**
A noetherian algebraic stack is said to be
- i)
-reductive if for any DVR , any morphism extends uniquely to , where and denotes the unique closed point. 2. ii)
S-complete if for any DVR , any morphism extends uniquely to
[TABLE]
for a uniformiser and -action with weights on .
The stack originates from work of Heinloth [52] and naturally generalises Example 2.14 where acts on with weights . Recall that after removing the origin, has non-separated geometric quotient given by the affine line with two origins. S-completeness should be thought of as a stacky valuative criterion for separatedness.
Remark 4.8**.**
Assume that is a moduli stack for objects in an abelian category as in [7, 7]. Then a morphism is a (weighted) filtration on a family of over such that the associated graded lies in . Let be a DVR with fraction field and residue field , where denotes a uniformiser. In this case, we can interpret the above conditions as follows.
- (1)
A morphism is given by a family of over the DVR together with a filtration on the generic fibre (whose associated graded object lies in ). This extends uniquely to if and only if the filtration on the generic fibre extends uniquely to the special fibre (again with the associated graded object lying in ). 2. (2)
A morphism is equivalent to two families of over whose generic fibres over are equivalent. This extends uniquely to if and only if the special fibres have filtrations whose associated graded objects are isomorphic777For the stack of semistable vector bundles on a curve, this asks for the bundles on the special fibre to be S-equivalent. in .
This can be seen from looking at the following diagrams appearing in [1, 6.7.2]
[TABLE]
where the left side of the diagram corresponds to the open immersion and all morphisms on the left are open immersions, and the right side represents the closed immersion and all morphisms on the right are closed immersions.
There is a similar diagram for :
[TABLE]
We can now state the existence theorem of Alper–Halpern-Leistner–Heinloth [7].
Theorem 4.9** (Existence criterion for GMS, [7, Theorem A]).**
Let be an algebraic stack of finite type over of characteristic zero, with affine diagonal. Then admits a separated good moduli space if and only if is -reductive and S-complete. Moreover, is proper if and only if satisfies the existence part of the valuative criterion for properness.
In characteristic zero, the existence criterion is much nicer due to the existence of étale local quotient presentations around closed points with reductive stabiliser due to Alper–Hall–Rydh [6], which is a stacky generalisation of Luna’s étale slice theorem [68]. Showing these conditions are necessary for to have a good moduli space is relatively formal. To show these conditions suffice to construct a good moduli space, one first uses S-completeness to show closed points have reductive stabilisers and then one glues the affine GIT quotients associated to the étale local quotient presentations, which uses both -reductivity and S-completeness.
In positive characteristic there is also an existence theorem for adequate moduli spaces, but as an input it requires the existence of étale local quotient presentations, which is part of the local reductivity assumption in [7, Theorem A].
With this existence criterion in hand, one can construct moduli spaces without GIT as follows.
- (1)
Interpret the moduli problem as an algebraic stack . 2. (2)
Apply an existence theorem to obtain a (proper) good moduli space .
However, this only yields a proper good moduli space in cases where GIT would provide a projective moduli space. Consequently, one further step is required for this strategy:
- (3)
Find an ample line bundle on to show it is projective.
This approach has been implemented for moduli of smooth projective curves in [24], where the second step uses the Keel–Mori Theorem and the third step follows Kollar’s proof of projectivity using the determinant of a relative pluricanonical sheaf for the universal family; and for moduli of vector bundles on a smooth projective curve in characteristic zero in [4], where the second step uses Theorem 4.9 and the third step follows Faltings’ proof of projectivity using a determinantal line bundle constructed from the universal family. For moduli of representations of an acyclic quiver (in arbitrary characteristic), a new moduli-theoretic proof of projectivity was given in [13], where again a determinantal line bundle is used. Of course, in all these cases, reductive GIT could be applied to produce the moduli space without this extra work! However, this intrinsic moduli-theoretic approach to projectivity does give new insights: the line bundle obtained in the final step is of inherent interest to the moduli problem and the techniques give new effective bounds for global generation of determinantal line bundles (for example, see [13, Theorem B]).
The advantage of this approach is that one does not need a quotient presentation of the stack; for example, for moduli of Bridgeland semistable objects in a derived category without a quotient presentation, this approach can be applied provided the stack is algebraic as in [93, Theorem 1.3]. There have been impressive applications to moduli of K-polystable Fano varieties [5] and moduli of torsors for a Bruhat–Tits group scheme over a curve [7, Theorem 8.1].
Since the theory of good (and adequate) moduli spaces is based on GIT for reductive groups, there are still many interesting moduli problems which do not admit good (or adequate) moduli spaces, such as moduli stacks of weighted projective hypersurfaces, where non-reductive groups naturally appear (see 6.2).
5. Non-reductive geometric invariant theory
In this section, we will describe recent progress on non-reductive GIT (in characteristic zero) due to Bérczi, Doran, Hawes and Kirwan [15, 14] concerning non-reductive groups with graded unipotent radicals. Before we describe their approach, we start by illustrating the issues in constructing non-reductive quotients in 5.1 and give a survey of some previous contributions in 5.2. We then proceed to explain the relationship between -actions and locally nilpotent derivations in 5.3, as well as their interaction with -actions in 5.4. In 5.5 we introduce graded unipotent groups and state the main results of [15, 14] in 5.6, before giving details on some of the proofs in 5.7. Throughout this section, we assume is of characteristic zero.
5.1. Examples of bad behaviour of additive actions
The ring of invariants being non-finitely generated for a non-reductive group is not the only issue that arises when trying to construct quotients of non-reductive group actions. Even when the ring of invariants is finitely generated (for example, see Theorem 2.9), there can be further issues:
- (1)
The quotient morphism given by the inclusion of invariants may fail to be surjective (and in general, its image is only a constructible subset), so will not be a good quotient. 2. (2)
There may not be enough invariants to separate disjoint closed orbits in contrast to the case for geometrically reductive groups (Lemma 2.7). 3. (3)
Invariants may not extend to the ambient space.
Let us give some concrete examples of linear -actions that demonstrate these issues.
Example 5.1**.**
The following examples are all built from powers (or symmetric powers) of the standard representation of the additive group on given by (as the upper triangular unipotent radical) acting on by left multiplication; thus Theorem 2.9 applies.
- (1)
For acting on via ,
[TABLE]
and the (invariant theoretic) quotient map only has constructible image, as it misses the punctured line . 2. (2)
For acting on via ,
[TABLE]
and these invariants do not separate the closed orbits and . 3. (3)
For acting on as in (2), the -invariant closed subscheme has an invariant function which does not extend to .
In particular, just trying to get rings of invariants to be finitely generated will not suffice to generalise the nice properties of reductive GIT.
Even in the case of free -actions, there are examples which do not admit a geometric quotient (for example, see [29, Example 18] which is proved via a geometrical argument).
5.2. Short historical note on non-reductive group actions
Let us summarise some of the contributions towards the development of non-reductive GIT, before turning to [15, 14].
As mentioned in Remarks 2.2 and 2.10, the transfer principle was used by Grosshans [41] to prove finite generation in certain cases. Grosshans proved the unipotent radical of a parabolic subgroup in a reductive group is a Grosshans group and thus if a -action on an affine variety extends to , then the ring of -invariant functions is finitely generated [42]; his proof uses a grading by weights of a maximal torus (see [22, Theorem 2.7]), which will also play an important role in non-reductive GIT.
Fauntleroy defines global intrinsic notions of semistability for a connected unipotent group action on a quasi-affine normal variety in terms of properties of invariant sections and showed if the stabilisers are finite that the open semistable set admits a categorical quotient [33, Theorem 5]. In subsequent work [34], he more generally defined a notion of properly stable points for a linearised action of a connected linear algebraic group on a normal projective variety and shows this locus has a quasi-projective geometric quotient. Fauntleroy combined ideas from reductive GIT with Seshadri covers, which Seshadri used to show actions of connected linear algebraic groups on a normal variety with finite stabiliser groups admit geometric quotients up to a finite equivariant replacement [87].
For non-reductive actions on an affine scheme, Winkelmann [95] showed there is a rational quotient map to a quasi-affine variety, whose coordinate ring is the ring of invariants (and is not necessarily finitely generated). In fact, he showed the study of coordinate rings of quasi-affine varieties corresponds to the study of rings of invariants for -actions on affine varieties.
Alternatively, one can ignore the issue of whether or not rings of invariants are finitely generated and work in the category of all schemes (not necessarily of finite type) over ; Greuel and Pfister take this approach in [40] to define a notion of stability for unipotent group actions and show there is a geometric quotient in the category of varieties. Their motivation came from the study of singularities, where non-reductive translation actions appear.
We note that additive group actions also arise as translation actions in affine geometry and have led to progress on classical questions on affine spaces such as the cancellation problem, the existence of exotic affine spaces and the Jacobian conjecture (for example, see [66]).
Doran and Kirwan [32] give various notions of (semi)stability for non-reductive GIT using properties of invariants and by transferring the problem to a reductive GIT setting using a notion of ‘fine reductive envelope’ and they obtain various types of quotients of these (semi)stable sets.
The recent progress on non-reductive GIT [15], which we explain below, uses a multiplicative group to grade the unipotent radical; this enables the construction of geometric unipotent quotients, as well as providing a natural projective completion and an explicit Hilbert–Mumford type description of stability. To trace back the origin of this grading multiplicative group, we begin with the correspondence between -actions and locally nilpotent derivations, and will see these multiplicative group actions naturally appear when there is a slice of the -action.
5.3. Actions of the additive group
Since is of characteristic zero, we can utilise the dictionary between additive group actions and locally nilpotent derivations (e.g. see [36]).
For an action on an affine scheme, the coaction
[TABLE]
can be used to define a derivation given by satisfying the Leibniz rule . This derivation is locally nilpotent (i.e. for any , there is such that ), as one can inductively show
[TABLE]
and since the left side is a polynomial in , we must have for all sufficiently large.
Conversely, given a locally nilpotent derivation on the -algebra of functions on an affine scheme, we can exponentiate to construct a coaction
[TABLE]
which is well-defined, as is locally nilpotent.
These constructions are inverse to each other and we collect some other results relating these geometric and algebraic points of view.
Proposition 5.2**.**
For an affine variety with coordinate ring , there is a bijective correspondence
[TABLE]
For an action corresponding to a locally nilpotent derivation , the following statements hold.
- i)
The ring of invariants is the kernel of the derivation: , 2. ii)
* is -fixed if and only if .* 3. iii)
If has a slice (i.e. there exists such that ), then is a finitely generated -algebra, the subscheme is a geometric slice of the -action and is a trivial principal -bundle.
Proof.
Let us just give some details on the third statement, as this will be important in what follows. Suppose that is a slice for . Then define a -algebra homomorphism
[TABLE]
such that . In particular, is a finitely generated -algebra: the images under of generators for the algebra are generators of .
By induction, any is a polynomial in with coefficient in :
[TABLE]
and thus and . Moreover is isomorphic to . We claim that is a geometric slice; that is given by is an isomorphism. Indeed, by considering the following commutative diagrams:
[TABLE]
one can construct the inverse. ∎
Note that if there is a slice, then the coordinate ring is a polynomial ring in a single variable with coefficients in the ring of invariants. In particular, the coordinate ring is naturally graded by (the degree of this polynomial) and thus this gives a -action. This naturally leads to considering actions of semi-direct products of and .
5.4. Semi-direct products of additive and multiplicative groups
For an affine scheme , a -action on is equivalent to a -grading of the -algebra . The limit under the -action as exists for all if and only if acts with non-negative weights on , which is if and only if the grading on is supported in non-positive degrees.
Definition 5.3**.**
A semi-direct product of and is given by specifying a group homomorphism and defining the semi-direct product to have underlying set with group operation
[TABLE]
For , let us define by and write for the associated semi-direct product, where .
Example 5.4**.**
The upper triangular Borel is isomorphic to a semi-direct product , via
Remark 5.5**.**
For an action of a semi-direct product on an affine scheme, the locally nilpotent derivation associated to the -action is homogeneous of degree with respect to the grading determined by the -action; that is, .
The following result is the key proposition which enables the inductive construction of quotients of unipotent group actions in the presence of an appropriate -action grading the unipotent action. This is a modification of [14, Lemma 7.3].
Proposition 5.6** (Key Proposition).**
For an affine -scheme with locally nilpotent derivation , the following statements are equivalent:
- (1)
* has a slice (i.e. there exists with ),* 2. (2)
The -action extends to for some such that
- (a)
* exists for all and* 2. (b)
* for all .*
In particular, if (2) holds, then there is a trivial -quotient .
Proof.
Suppose has a slice ; then as in the proof of Proposition 5.2. Since is a polynomial ring in , it is naturally graded, and we choose a -grading given by so that exists for all . Since , we see that is homogeneous of degree and thus there is an action of . Since is a trivial -bundle (see Proposition 5.2), all -stabilisers are trivial.
Let us outline the converse direction (see [14, Lemma 7.3] for the full proof). Given and a -action on such that exists for all and for all , we note that has homogeneous degree with respect to the grading given by the -action. By the assumption that exists for all , this grading is supported in non-positive degrees. We have
[TABLE]
and we will show that , so there exists with . Let
[TABLE]
which is -stable by (4) and also -stable, as is a sum of -weight spaces and the -action on is trivial. Then it suffices to show that is an ideal (Claim 1) and moreover (Claim 2). Indeed, as we must have and so for some .
To prove Claim 1, we need to show for and that ; for this we can write with respect to the -grading and it suffices to show for all . We can also write by (5). The only non-trivial case is for and ; however, by the Leibniz rule and so .
To prove Claim 2, we argue by contradiction. If , then it is contained in a maximal ideal . Since is -stable, so is and so it corresponds to a -fixed point . However, this contradicts the assumption that for all . ∎
Note that there is a choice of sign here. One could consider actions of for such that exists for all and obtain an analogous result.
Remark 5.7**.**
For a linear representation , we have , where is the weight space for the maximal -weight.
Example 5.8**.**
Consider the upper triangular Borel acting on by left multiplication. We have
[TABLE]
and .
5.5. Graded unipotent groups
Generalising Proposition 5.6, we will consider actions of unipotent groups which are graded by a -action in the following sense.
Definition 5.9**.**
A graded unipotent group is a semi-direct product of a unipotent group with a multiplicative group such that the conjugation of on the Lie algebra of has strictly positive weights.
Example 5.10**.**
For , the group is a graded unipotent group. In particular, the upper triangular Borel is a graded unipotent group.
Proposition 5.11**.**
Let be an affine scheme with an action of a graded unipotent group such that exists for all and for all , then is finitely generated and is a trivial -quotient.
Proof.
The idea is to iteratively apply Proposition 5.6 as in the proof of [14, Proposition 7.4]. By lifting a filtration on the Lie algebra via the exponential map, we obtain normal subgroups
[TABLE]
whose successive quotients are copies of on which acts by conjugation with strictly positive weights. Assume that we have constructed a quotient which is a trivial -quotient. The base case for is given by Proposition 5.6. For the inductive step, we claim that we can apply Proposition 5.6 to the action on to show there is a trivial -quotient . Then the composition
[TABLE]
is a principal -bundle (see [17, Proposition 4.7]), which is trivial as the base is affine by [9, Theorem 3.12].
To complete the proof, we must show that for the action on graded by all limits exists for and for all in order to be able to apply Proposition 5.6. Since is -equivariant, we have
[TABLE]
Thus the set of such limits in is contained in . For a point , suppose that ; that is, there exists such that . However, by assumption and so we have , which means is trivial and we can apply Proposition 5.6 to as claimed. ∎
Remark 5.12**.**
For a representation , note that .
In particular, rather than trying to find -invariant sections, we can use -maximal sections to construct a non-reductive GIT quotient as in the next subsection.
5.6. Statement of the key results in non-reductive GIT
Instead of working with a linearised action on a projective scheme, for simplicity we will assume that we have a linear action on as in 2.6. As before, in the case of a (very ample) linearisation on , we get a projective embedding where has a linear action.
This concerns actions of affine algebraic groups, whose unipotent radical is graded by a .
Definition 5.13**.**
Let be an affine algebraic group with unipotent radical and reductive Levi factor . A 1-PS is said to grade if its conjugation action on has strictly positive weights. In this case, we say has graded unipotent radical (by ).
This is what is called an internal grading in [14], where there is also a more general notion of an external grading; however, we will stick to the simpler internal point of view, as this suffices in all examples and applications we consider.
There could be several central 1-PSs in that grade , but we will assume we have fixed a grading and write . Different grading 1-PSs gives rise to different quotients, so can be thought of as an additional choice to the linearisation (see Remark 5.17 below).
Example 5.14**.**
A parabolic subgroup of (or more generally any reductive group) has graded unipotent radical: write for a 1-PS , then , with being central in and grading .
Definition 5.15** (Minimal weight space and attracting set).**
For a multiplicative group acting linearly on , we define the minimal weight space and minimal attracting set as follows:
[TABLE]
where are the -weights on and .
In [15], the minimal attracting set is denoted , but we have simplified the notation. For the associated Białynicki-Birula stratification [19] (flowing as ), the variety is the open stratum and is a Zariski locally trivial affine space fibration.
In [14], there are two types of assumptions needed for the construction of non-reductive GIT quotients: the first (Definition 5.16) concerns the positioning of the weights for the linearised action of the grading , which selects a particular VGIT chamber for and can be achieved by twisting the linearisation by a (rational) character , and the second (Assumption in Definition 5.18) requires certain unipotent stabiliser groups to be trivial, which is referred to as semistability coincides with stability in [14].
Definition 5.16** (Adapted linearisation).**
Let be a group with unipotent radical graded by acting linearly on . We say this linearised action is adapted if the -weights on satisfy
[TABLE]
Remark 5.17**.**
The assumption that the linearised action is adapted fixes a particular VGIT chamber for the -action, where (semi)stability is given by
[TABLE]
Since twisting the linearisation by a character shifts the weights by (where we identify characters of with integers such that corresponds to a character with kernel ), if we are prepared to modify the linearisation we can always arrange for this condition to hold. Note that this shift of weights does not change the minimal weight space and attracting set .
Definition 5.18** (Stabiliser assumptions).**
Let be a group with unipotent radical graded by acting linearly on .
- (1)
We say the unipotent stabiliser assumption holds if
[TABLE] 2. (2)
We say the reductive stabiliser assumption holds if
[TABLE]
where as is central in , there is an induced -action on the -fixed variety and we let denote the GIT semistable locus for .
Remark 5.19**.**
The unipotent stabiliser assumptions are crucial to apply Proposition 5.11, whereas the reductive stabiliser assumptions are used to more readily obtain an explicit Hilbert–Mumford type description of the locus we obtain a quotient of. The reductive stabiliser assumption implies semistability coincides with stability for the -action on .
If these stabiliser assumptions fail, then one would like to perform a sequence of equivariant blow-ups to arrange for this to hold on the blow-up (similar to Kirwan’s partial desingularisation procedure [64]) and then construct a quotient of the original scheme using the quotient of the blow-up; however, this procedure is much more complicated in the non-reductive setting described in [14, 9]. Furthermore, if there are generically positive dimensional unipotent stabilisers, blowing up would only result in constant dimensional stabilisers (rather than trivial stabilisers), and so instead one must filter by normal subgroups whose stabilisers are constant (assuming this can be done via blow-ups) and then proceed as in [14, Remark 7.1] and [81].
The strategy is to first use the grading to obtain a projective quotient by and then take a reductive GIT quotient by the residual group . Therefore, we first state the result for quotients by graded unipotent groups .
Theorem 5.20** (The -Theorem, [15, Theorem 2.16]).**
Let be a graded unipotent group acting linearly on . If the linearised action is adapted and the unipotent stabiliser assumption hold, then we have the following statements.
- i)
There is a geometric -quotient such that is a quasi-projective variety. 2. ii)
There is a geometric -quotient such that is a projective variety. 3. iii)
If the linearised action is well-adapted888See Definition 5.21 below, which can be achieved by further twisting by a rational character., then the ring of -invariant sections (for an appropriate power) of the linearisation is finitely generated and taking the Proj construction gives the above geometric -quotient .
Let us outline the structure of the proof.
- (1)
Since , the -quotient is constructed by applying Proposition 5.11 to each affine -variety for (see Proposition 5.26). 2. (2)
Construct a -equivariant embedding with for some , to show is quasi-projective (see Proposition 5.27). 3. (3)
By appropriately twisting the original linearisation (to make it well-adapted), the induced -action on is adapted and ; thus
[TABLE]
has a geometric -quotient, as a closed subvariety of (see Proposition 5.29). 4. (4)
To prove that the ring of -invariant sections is finitely generated, one shows that coincides with an enveloping quotient (see [16, Definition 3.1.6]) and as the enveloping quotient is projective, the ring of -invariant sections for an appropriately divisible power of the well-adapted linearisation is finitely generated by [16, Corollary 3.1.21].
The first two steps give the proof of Theorem 5.20 i), whereas (3) and (4) give statements ii) and iii) respectively; more details on the proofs of Steps (1) - (3) are given in 5.7 below.
In (1), it is crucial that the -action is graded by , that the unipotent stabiliser assumption holds and that the action is adapted in order to apply Proposition 5.11. The grading -action is also used in (2) and (3) to firstly show is quasi-projective and then obtain a projective quotient of as a closed subvariety of . In (3), in order for the induced -linearisation on to be adapted, we need the minimal weight for the original linearised action to be negative but very small (see the proof of Proposition 5.29); this leads to the following notion.
Definition 5.21** (Well-adapted linearisation).**
Let be a group with unipotent radical graded by acting linearly on . We say this linearised action is well-adapted if there is such that the -weights on satisfy
[TABLE]
Let us present a simple example for matrices up to conjugation extending Example 2.15.
Example 5.22** (To appear in upcoming joint work with E. Hamilton and J. Jackson).**
Consider the upper triangular Borel subgroup acting by conjugation on . To apply Theorem 5.20, we consider the projective embedding , with trivial action on . The minimal weight space is 1-dimensional and spanned by the elementary matrix , so is contained at infinity (i.e. in ), and . The unique point in has trivial -stabiliser; thus holds. By twisting the linearisation to make it adapted, we obtain a geometric -quotient
[TABLE]
As is contained at infinity, and we obtain a geometric -quotient of matrices whose bottom left entry is non-zero
[TABLE]
Suppose that acts on ; then the projective geometric -quotient of Theorem 5.20 has a residual action of the reductive group . There is a quotient
[TABLE]
where the second morphism is the reductive GIT quotient; this gives a projective and good -quotient of the open set . The key challenge is to determine this preimage in terms of the original action on . This is easiest when holds (i.e. semistability coincides with stability for the -action on ); in this case, we define the following semistable sets.
Definition 5.23** (Non-reductive stable set).**
Let be a group with unipotent radical graded by acting linearly on . If the linearisation is well-adapted and both and hold, then we define the -stable set by
[TABLE]
Remark 5.24**.**
Since holds for the linear action of the graded unipotent group on , the -sweep of is a closed subvariety of by [14, Lemma 5.4]. Consequently, the -stable locus is open in , and so is the -stable locus.
The next result is a special case of [14, Theorem 2.20] as stated in [56, Theorem 2.28], where the reductive stabiliser assumption is used to describe the preimage of the reductive (semi)stable locus by using the reductive Hilbert-Mumford criterion and comparing the torus weight sets of points and their images under . Although some torus weights are lost on applying , all torus weights corresponding to maximal grading -weights survive, and if then it has at least one non-minimal weight; these observations together with the assumption that semistability coincides with stability for on are central in the proof.
Theorem 5.25** (Construction of non-reductive GIT quotients, [17, Theorem 4.28]).**
Let be a group with unipotent radical graded by acting linearly on . If the linearisation is well-adapted and both and hold, then there is a projective and geometric -quotient
[TABLE]
which coincides with the Proj construction associated to the (finitely generated) invariant ring.
5.7. Overview of the proof
We provide some details on the proof of Theorem 5.20 i) and ii). The structure of the proof is as follows.
- (1)
Using Proposition 5.11, construct a geometric -quotient of (see Proposition 5.26). 2. (2)
Construct to show is quasi-projective (see Proposition 5.27). 3. (3)
Inside , construct a geometric -quotient of (see Proposition 5.29).
The first step relies on the Key Proposition (Proposition 5.6) about -slices via -gradings.
Proposition 5.26**.**
Let be a graded unipotent group acting linearly on . If the linearised action is adapted and holds, then there is a geometric -quotient of .
Proof.
Since is the union of the open affine varieties over , we will construct a geometric -quotient by gluing trivial quotients which are constructed by applying Proposition 5.11 to each affine -variety for .
In fact, by choosing a basis of consisting of -weight vectors, which gives an identification , it suffices to construct these trivial quotients in the case where is a coordinate function. Then has coordinates . Since , its -weight is (recall that is the minimal weight in and ). Hence the weights of the -action on are of the form , where this inequality holds due to the linearised action being adapted. In particular, the flow under as exists for all points in . Since , the unipotent stabiliser assumption implies that the corresponding stabiliser assumption in Proposition 5.11 holds; hence we obtain the claimed trivial -quotient . ∎
The quotient obtained from this gluing construction is a priori just an abstract scheme, but the next result shows it is in fact quasi-projective. This result is [14, Lemma 7.6].
Proposition 5.27**.**
Let be a graded unipotent group acting linearly on such that the linearised action is adapted and holds. There exists a positive integer and an embedding
[TABLE]
where , the first morphism is a closed immersion and the second morphism is the open inclusion of the minimal attracting set for the induced -action on .
Proof.
Fix a basis of such that is finitely generated (see the proof of Proposition 5.26 above). Then there exists a positive integer , such that for , we have that is generated by . Let correspond to ; then is surjective.
The inclusion induces a rational map , which is well-defined on , as maximal sections are -invariant, and for
[TABLE]
Since cover , we see that factors via
[TABLE]
For a partition of , the section corresponds to .
Since being a closed immersion is a local property on the target, is a closed immersion if is a closed immersion for each partition , or equivalently is surjective. This last statement is deduced from the fact that is surjective for each by the choice of . ∎
The next two results are described in the discussion after Lemma 7.7 in [14].
Lemma 5.28**.**
For an adapted linear action of a graded unipotent group on , assume holds; thus there is a geometric quotient , which is locally closed in by Proposition 5.29. Let ; then if and only if .
Proof.
For , we have that if and only if
[TABLE]
or, as is a geometric quotient, equivalently , i.e. . ∎
Proposition 5.29**.**
For an adapted linear action of a graded unipotent group on , assume holds. There is a well-adapted rational twist of the -linearisation on such that the induced -linearisation on is adapted and
[TABLE]
Furthermore, the preimage under of the -stable locus of the closure of in
[TABLE]
admits a projective geometric -quotient .
Proof.
Recall that is the minimal weight on and so is the minimal weight on . Pick so that . Let be the rational character corresponding to . Then the -weights on satisfy
[TABLE]
so the induced -linearisation is adapted and . This linearisation on is induced from the well-adapted twisted linearisation . The final claim follows from Lemma 5.28, as . ∎
The proof of the finite generation of an appropriate power of the linearisation is given in the discussion proceeding Corollary 7.10 in [14].
6. Recent applications of non-reductive GIT
In this section we will give an overview of some recent applications of non-reductive GIT.
6.1. Moduli of jets of map germs and hyperbolicity
Bérczi and Kirwan [18] used non-reductive GIT to construct and study compactifications of spaces of invariant jet differentials in order to prove polynomial versions of the Green–Griffiths–Lang conjecture and Kobayashi conjecture concerning hyperbolicity properties of generic smooth projective hypersurfaces. Let us outline these conjectures and the approach using non-reductive GIT.
A complex projective manifold is Brody hyperbolic if every holomorphic map is constant. For example, in dimension , a curve is hyperbolic if and only if . Hyperbolic varieties are interesting from the point of view of complex geometry and also for their conjectural Diophantine properties (Lang conjectured that if a projective variety defined over is hyperbolic, then is finite).
The Kobayashi conjecture predicts that a very general hypersurface of sufficiently large degree is Brody hyperbolic. Green, Griffiths and Lang conjectured that every projective algebraic variety of general type is weakly hyperbolic; that is, there exists a proper subvariety such that the image of every holomorphic map is contained in . These conjecture are related by a recent result of Riedl and Yang [82]: if the Green–Griffiths–Lang conjecture holds for projective hypersurfaces of dimension and degree at least , then the Kobayashi conjecture is true for projective hypersurfaces of dimension with degree at least . The strategy for approaching these conjectures goes back to work of Demailly [27] and Siu [90], which involves studying invariant jet differentials; here non-reductive group actions naturally arise as reparametrisation groups.
For a smooth projective complex variety of dimension , the bundle of of -jet germs in has fibre over is given by germs of holomorphic maps for fixed local coordinates at up to the equivalence relation given by equality of the first -derivatives at [math]; thus the fibres can be represented by truncated Taylor expansions or equivalently -tuples of vectors in given by the first -derivatives. The transition functions are polynomial, but not linear, so is not a vector bundle. The group of regular -jets of maps acts fibrewise on by reparametrisations; is naturally an upper triangular subgroup of and fortunately is a graded unipotent group
[TABLE]
where . Green and Griffiths studied algebraic differential operators, which are polynomial functions on , and constructed a sheaf of algebraic differential operators of order of fixed weighted degree (with respect to the -weights). Demailly considered a subbundle of jet differentials invariant under reparametrisations from . A key tool to finding invariant jet differentials is to produce a projective completion of the fibrewise quotient of acting on the jet bundle . The projective completion given by Bérczi and Kirwan [18] uses non-reductive GIT, where a blow-up at , which is just a point, is needed for the unipotent stabiliser assumption to hold. They then use intersection theory for non-reductive GIT quotients (see [17] and 6.4 below) to prove a polynomial version of the Green–Griffiths–Lang conjecture.
Theorem 6.1** (Polynomial Green–Griffiths–Lang Theorem of Bérczi–Kirwan [18]).**
A generic smooth projective hypersurface of dimension and degree is weakly hyperbolic.
By work of Riedl–Yang [82], this gives a polynomial Kobayashi theorem [18, Theorem 1.4] for generic smooth projective hypersurfaces of dimension and degree .
6.2. Moduli spaces of hypersurfaces in weighted projective orbifolds
One classical application of reductive GIT is to construct moduli spaces of projective hypersurfaces as the GIT quotient of the -action on ; this gives compactifications of moduli spaces of smooth hypersurfaces, as Mumford proved that any smooth hypersurface of degree is GIT stable when . In general determining precisely which other hypersurfaces are (semi)stable is challenging, even with the Hilbert–Mumford criterion in hand.
The advent of non-reductive GIT enables this to be extended to hypersurfaces in weighted projective spaces and more general projective toric varieties, whose automorphism groups are non-reductive affine algebraic groups and are explicitly described by the work of Cox as quotients of the graded automorphism group of the Cox ring.
Example 6.2**.**
The weighted projective plane has automorphism group given by
[TABLE]
where the unipotent group appears from automorphisms of the form .
Fortunately the automorphism groups of weighted projective spaces have graded unipotent radicals; see [15, Lemma 4.1] and also [23].
Bunnett [23] studied the application of non-reductive GIT to moduli of weighted projective hypersurfaces and more generally hypersurfaces in toric orbifolds. In a well-formed weighted projective space he proves [23, Theorem 5.18] that any quasi-smooth hypersurface (see [11, 3]) of degree is stable (in the sense of non-reductive GIT) provided the unipotent stabiliser assumption holds, so that no blow-ups are required.
In the case of a well-formed weighted projective space , any quasi-smooth hypersurface of degree has finitely many automorphisms coming from the ambient automorphisms of by [23, Theorem 3.13]. Consequently, the Keel–Mori Theorem gives the existence of a coarse moduli spaces as an algebraic space. However, non-reductive GIT gives the construction of a quasi-projective moduli space.
For a toric variety , there is an -discriminant (see [37]) for hypersurfaces of class , which vanishes on non-quasi-smooth hypersurfaces (in contrast to the case of projective hypersurfaces, the converse is not necessarily true, as the -discriminant only checks for singularities in the sweep under of the torus , see [23, Remark 4.11]). The -discriminant of [37] is interpreted as an invariant section of a twisted linearisation in [23, Corollary 4.12].
Let us explain the non-reductive GIT set-up for hypersurfaces in of degree . Assume that hypersurfaces of degree are Cartier divisors (i.e. the lowest common multiple of the weights divides ). Consider the non-reductive group acting on the space of weighted degree homogeneous polynomials. Bunnett proves that quasi-smooth hypersurfaces are contained in the -stable set (under the unipotent stabiliser assumption). Using the non-reductive GIT Hilbert–Mumford criterion of [14], he shows that quasi-smooth hypersurfaces are stable for the action of assuming that is a Cartier degree with and the unipotent stabiliser assumption holds. Furthermore, if the weighted projective space has only two weights, then the unipotent stabiliser assumption holds (see [23, Proposition 5.9]).
Bunnett obtains the best results for Cartier hypersurfaces in a rational cone of degree (see [23, Theorem 5.20]): he explicitly describes the quasi-smooth locus as the non-vanishing locus of a section (which is obtained by multiplying the -discriminant with a variable) and constructs a -quotient of this open affine variety using Proposition 5.11, where the necessary unipotent stabiliser assumption is easily verified. He then directly obtains a geometric quotient of the locus of quasi-smooth projective hypersurfaces, which is a projective over affine variety, because it is constructed as a reductive GIT quotient of the affine -quotient twisted by a character as in 2.8 rather than using the more complicated methods of [14].
6.3. Moduli of unstable objects
Recall from 3.3 that associated to a linear action of a reductive group on and a choice of norm, there is an instability stratification
[TABLE]
where for a parabolic subgroup . A categorical -quotient of , or equivalently a categorical -quotient of , is given by Proposition 3.27; however, as explained after this proposition, this is far from being an orbit space as it factors via the retraction sending a point to its flow under as . Instead, we would like to apply non-reductive GIT to the action of on the closure of , where we can twist the linearisation by (a rational multiple of) a character corresponding to to make it well-adapted. Fortunately, the non-reductive notion of stability precisely picks out the locus we would like and removes (the -sweep) of the limit set ; see Theorem 6.3 below.
For the parabolic group acting on the blade closure of an unstable stratum as in (6) above, there is a twisted rational linearisation which is well-adapted. Furthermore, we have that in the non-reductive GIT notation the map coincides with the retraction appearing in the description of the unstable strata. Furthermore, is defined to be the semistable locus for with respect to , or equivalently for as acts trivially, which coincides with the semistable locus in appearing in the definition of the non-reductive stable locus.
Theorem 6.3** (Non-reductive GIT quotients of unstable strata, [56, Theorem 1.1]).**
For the parabolic group graded by acting on the blade closure of an unstable stratum as in (6) with the well-adapted linearisation , the following statements hold.
- i)
If holds, then there is a projective geometric -quotient
[TABLE]
and by taking a reductive GIT quotient by one obtains a projective categorical -quotient of an open subset of the -stable locus. 2. ii)
If both and hold, then there is a projective geometric -quotient
[TABLE]
Moreover, the ring of invariant sections if finitely generated and coincides with the Proj construction for this invariant ring.
We would like to apply this theorem to moduli of objects in an abelian category, where there are moduli-theoretic instability filtrations, such as the Harder–Narasimhan (HN) filtrations for vector bundles [50]; for example, moduli of sheaves on projective schemes or moduli of quiver representations. In these examples, the GIT instability stratification has been compared with the moduli-theoretic Harder–Narasimhan stratification [39, 57, 54, 55, 96] and this suggests moduli of objects of fixed HN type should be constructed as non-reductive GIT quotients.
Unfortunately the stabiliser assumptions in Theorem 6.3 are quite restrictive and so it is only possible in quite limited situations. For example, for vector bundles (or Higgs bundles) on a smooth projective curve of fixed HN type, the reductive stabiliser assumption only holds for coprime HN types of length (i.e. the HN filtration has two terms and the invariants for the successive quotients are coprime, so that semistability coincides with stability) and even in this case, the unipotent stabiliser assumption rarely holds and so blow-ups are needed (see [56, 3.2.1] for a detailed discussion). In this length 2 coprime case, the non-reductive GIT quotient picks out non-split HN filtrations of length whose automorphism groups have a fixed dimension; see [21, 58] for the case of vector bundles and [49] for the case of Higgs bundles. To rectify the failure of the reductive stabiliser assumption , one can alternatively perform a quotient in stages, using different 1-PSs in the centre of to grade different subgroups of the unipotent radical as in [56]; this results in a natural notion of stability for sheaves of a fixed HN type, but again the unipotent stabiliser assumption is rarely satisfied, so a blow-up procedure would be required.
6.4. Interactions with symplectic geometry and cohomological descriptions
GIT quotients for complex reductive groups are closely related to symplectic quotients for a maximal compact group (see Remark 3.22); the close relationship between the reductive GIT instability stratification and a Morse-theoretic stratification for the norm square of the moment map was used in [63] to describe the rational Betti numbers of reductive GIT quotients.
Fortunately, for non-reductive groups with internally graded unipotent radicals, this close relationship with symplectic geometry has been extended by work of Bérczi and Kirwan [17], and applied to compute cohomology of non-reductive GIT quotients.
For a reductive group acting on a smooth complex projective variety , to construct a moment map one fixes a maximal compact subgroup and a symplectic form invariant under the -action. The moment map for this maximal compact and symplectic form is a -invariant map with the moment map property (that it lifts the infinitesimal action via the correspondence between vector fields and forms given by ). However, any other maximal compact subgroup is of the form and is invariant under the -action with moment map . Therefore rather than defining a moment map , Bérczi and Kirwan instead fix a -equivariant Kähler structure (namely a -orbit in the space of pairs of maximal compact subgroups of and Kähler forms on which are invariant under this maximal compact) and define an -moment map to be a smooth -equivariant map
[TABLE]
such that is a moment map for the -action on , where as , we have a canonical embedding .
Let us explain how Bérczi and Kirwan define moment maps for a smooth complex projective variety with an action of a graded unipotent group . Assume that is a subgroup of a reductive group and that is a submanifold of a compact Kähler manifold with a -action on that restricts to the given -action on (note that is not required to be invariant under the -action). As above, fix a -equivariant Kähler structure on and let be an -moment map, they define by restricting the -moment map to and composing with the restriction
[TABLE]
Assuming the unipotent stabiliser assumption holds, Bérczi and Kirwan provide a moment map description of the non-reductive GIT quotient: for any , they show
[TABLE]
and that [math] is a regular value of and the inclusion of the zero level set of the moment map in the stable locus induces a diffeomorphism of orbifolds
[TABLE]
which can be viewed as a non-reductive Kempf-Ness Theorem. They extend this result to an action of with internally graded unipotent radical (see [17, Theorem 1.1]).
They apply this to compute Betti numbers of non-reductive GIT quotients. Assuming holds so that , they show that the stratification is -equivariantly perfect and so the Poincaré series of can be computed from that of . Similarly for as above, they show the Poincaré series of can be computed from that of the reductive GIT quotient of by , whose Poincaré series can in turn be computed as in [63] using the reductive GIT instability stratification of 3.3.
Furthermore, they adapt methods of Martin [70] relating the rational cohomology of GIT quotients by reductive groups to that of GIT quotients for a maximal torus. Martin shows the intersection pairing for the reductive GIT quotient can be computed from that of the GIT quotient for the maximal torus via an integration formula, which can then be combined with torus localisation techniques. In the non-reductive case, this enables a description of the rational cohomology ring and a non-reductive integration formula [17, Theorems 1.4 and 1.5], which leads to a residue formula for the intersection pairing on the non-reductive GIT quotient.
These methods are used to prove the polynomial versions of the Green–Griffiths–Lang conjecture and Kobayashi conjecture described in 6.1. They can also be applied in the future to describe the cohomology of new moduli spaces constructed via non-reductive GIT.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. Alper, Stacks and moduli , https://sites.math.washington.edu/~jarod/moduli.pdf .
- 2[2] J. Alper, Good moduli spaces for Artin stacks , Ann. Inst. Fourier (Grenoble) 63 (2013), no. 6, 2349–2402.
- 3[3] by same author, Adequate moduli spaces and geometrically reductive group schemes , Algebr. Geom. 1 (2014), no. 4, 489–531.
- 4[4] J. Alper, P. Belmans, D. Bragg, J. Liang, and T. Tajakka, Projectivity of the moduli space of vector bundles on a curve , Stacks Project Expository Collection (SPEC), London Math. Soc. Lecture Note Ser., vol. 480, Cambridge Univ. Press, Cambridge, 2022, pp. 90–125.
- 5[5] J. Alper, H. Blum, D. Halpern-Leistner, and C. Xu, Reductivity of the automorphism group of K 𝐾 K -polystable Fano varieties , Invent. Math. 222 (2020), no. 3, 995–1032.
- 6[6] J. Alper, J. Hall, and D. Rydh, The étale local structure of algebraic stacks , https://arxiv.org/abs/1912.06162 .
- 7[7] J. Alper, D. Halpern-Leistner, and J. Heinloth, Existence of moduli spaces for algebraic stacks , https://arxiv.org/abs/1812.01128 .
- 8[8] M. Artin, Versal deformations and algebraic stacks , Invent. Math. 27 (1974), 165–189.
