Involutive Tableaux, Characteristic Varieties, and Rank-one Varieties in the Geometric Study of PDEs
Abraham D. Smith

TL;DR
This paper explores advanced geometric concepts in PDEs, focusing on involutive tableaux, characteristic varieties, and rank-one varieties, providing a unified approach to key theorems in the field.
Contribution
It introduces a new geometric interpretation of PDEs using Grassmann bundles and enhances Guillemin normal form to analyze involutivity of tableaux.
Findings
Characterization of characteristic varieties and their incidence correspondence.
Development of an enhanced Guillemin normal form for involutivity.
Reinterpretation of PDE geometry through smooth sub-bundles of Grassmann bundles.
Abstract
This expository monograph cuts a short path from the common, elementary background in geometry (linear algebra, vector bundles, and algebraic ideals) to the most advanced theorems about involutive exterior differential systems: (1) The incidence correspondence of the characteristic variety, (2) Guillemin normal form and Quillen's thesis, (3) The Integrability of Characteristics by Guillemin, Quillen, Sternberg, and Gabber, and (4) Yang's Hyperbolicity Criterion. To do so, the geometric theory of PDEs is reinterpreted as the study of smooth sub-bundles of the Grassmann bundle, whereby the rank-1 variety is emphasized. The primary computational tool is an enhanced formulation of Guillemin normal form that is equivalent to involutivity of tableaux.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Involutive Tableaux, Characteristic Varieties, and Rank-one Varieties in the Geometric Study of PDEs
Abraham D. Smith
Department of Mathematics, Statistics and Computer Science
University of Wisconsin-Stout
Menomonie, Wisconsin 54751-2506, USA
Abstract.
This expository monograph cuts a short path from the common, elementary background in geometry (linear algebra, vector bundles, and algebraic ideals) to the most advanced theorems about involutive exterior differential systems: (1) The incidence correspondence of the characteristic variety, (2) Guillemin normal form and Quillen’s thesis, (3) The Integrability of Characteristics by Guillemin, Quillen, Sternberg, and Gabber, and (4) Yang’s Hyperbolicity Criterion. To do so, the geometric theory of PDEs is reinterpreted as the study of smooth sub-bundles of the Grassmann bundle, whereby the rank-1 variety is emphasized. The primary computational tool is an enhanced formulation of Guillemin normal form that is equivalent to involutivity of tableaux.
Key words and phrases:
characteristic variety, Guillemin normal form, eikonal system
2010 Mathematics Subject Classification:
Primary 58A15, Secondary 35A27, 35A30
Contents
0. Introduction and Overview
Given a system of partial differential equations [PDEs] over a manifold, does the system of PDEs have any local solutions to the Cauchy initial-value problem? That is, given initial conditions on a locally-defined hypersurface, can we produce a local solution that satisfies those initial conditions and also satisfies the PDEs? More generally, which initial hypersurfaces admit such solutions? Can we do this iteratively by solving a sequence of initial-value problems from dimension 0 to 1, 1 to 2, and so on to build solutions through any point?
These questions are the heart of exterior differential systems [EDS], a powerful specialist approach to the geometric study of PDEs. EDS typically present as ideals of exterior differential forms over a manifold.
Some deeper questions are: What is the shape of the family of local solutions obtained in this way? How can we determine whether two systems of PDEs are “the same” up to local coordinate transformations? Does the space of all PDEs (up to local coordinate transformation) have any meaningful shape or structure of its own?
These deeper questions are answered by analyzing the characteristic variety of an EDS. The original motivation for the characteristic variety is to see where the Cauchy initial-value problem becomes ambiguous. That is, given an initial condition for our PDEs on a local submanifold of dimension , when would the -dimensional solutions for that initial condition fail to be unique?
When analyzing the characteristic variety of various EDS, one discovers that the characteristic variety is an exquisitely subtle structure that reveals far more than originally anticipated. The characteristic variety dictates the internal geometry of the solutions of the original PDEs, while also controlling the parameter space of all such solutions. Under reasonable hypotheses, this means that EDS or PDEs can be understood up to coordinate equivalence as “parametrized families of solution manifolds with associated characteristic geometry.”
This is beautiful and important, but it has been a difficult topic for researchers to access, because the foundations of EDS have not yet entered the common curriculum of graduate students. Fluency with differential ideals remains a relatively rare skill, practiced in a handful of schools worldwide. Indeed, it is common for researchers first encountering the subject to become trapped in an endless cycle of translating systems from local jet coordinates to differential forms and back again, without gaining any new geometric insights and without using the most powerful theoretical ideas in EDS. In particular, it can take many years for researchers to appreciate the central role that the characteristic variety plays in uncovering geometric insights. However—despite the name—differential forms are not themselves the core idea behind exterior differential systems. Differential forms are merely a concise language. Rather, the core idea is to recognize that these questions are more geometric than analytic, and that ideals (that is, conditions defined by functions) and varieties (that is, shapes cut out by functions) must come into play. To describe families of solutions, we need the geometric language of bundles, schemes, and moduli.
Therefore, the goal of this monograph is to cut the shortest-possible expository path from the common curriculum in geometry (linear algebra, vector bundles, and algebraic ideals) to several key results regarding the characteristic variety. These key results are
- (i)
the incidence correspondence of the characteristic and rank-1 varieties, and its relationship to eigenspace decomposition, 2. (ii)
Guillemin normal form, its enhancements, and Quillen’s thesis, 3. (iii)
the Integrability of Characteristics (Guillemin, Quillen, Sternberg, Gabber), and 4. (iv)
Yang’s Hyperbolicity Criterion.
The required common curriculum is
- (i)
graduate-level linear algebra (short-exact sequences, dual spaces, the rank-nullity theorem, tensor products, generalized eigenspaces, as in Artin’s Algebra [Art91]), 2. (ii)
the fundamentals of smooth manifolds (tangent spaces, Sard’s theorem, bundles, as in Milnor’s Topology from the Differential Viewpoint [Mil97]), and 3. (iii)
basic algebraic geometry (projective space, ideal, variety, scheme, as in Harris’ Algebraic Geometry, a first course [Har92]).
To accomplish this, the subject of exterior differential systems is reinterpreted as the study of smooth sub-bundles of the Grassmann bundle over a smooth manifold. In doing so, the role of exterior differential forms becomes obscured, in favor of tableaux (vector spaces of homomorphisms) and symbols (varieties of endomorphisms). Specifically, Guillemin normal form for tableaux and symbols plays the central computational role, not differential forms. This is because most humans—and their computer algebra systems—are more comfortable with matrices than with exterior algebra. Exterior differential ideals are not introduced until absolutely needed. This is because many of the essential lemmas depend only on the geometry of the Grassmann bundle, which is the variety of the trivial exterior differential system. This reformulation allows elementary versions of those key results (in fact, almost all the lemmas are restatements of the rank-nullity theorem), and it becomes possible to outline how these results could be used to push the frontiers of the subject.
While the audience is assumed to have a general interest and cultural awareness of PDEs or EDS in some form, all the required definitions are provided when needed. Even so, it is wise always to have Bryant, Chern, Gardner, Goldschmidt, and Griffiths’s Exterior Differential Systems [BCG*+*90] and Ivey and Landsberg’s Cartan for Beginners [IL03] nearby. They are cited for comparison frequently. Another excellent reference is McKay’s Introduction to Exterior Differential Systems [McK18]. A note for EDS experts: the results in these pages can be found in numerous places in the literature in some form or other, and I have indicated my favorite sources throughout. The only innovation here is in presentation. Most notably, in contrast to the vast majority of expositions, the central topic is the characteristic variety, not the Cartan–Kähler theorem. This is because the key open question is “what does the family of all involutive PDEs look like?” not “how do I solve this particular involutive PDE?”
This monograph is organized into four parts, each containing several sections. Part I covers the structure of tableau (subspaces of a space of homomorphisms) and the differential geometry of the Grassmann variety. Because the results are elementary—almost trivially so—they provide a good foundation for building from the common curriculum to the central topic. Part II converts those elementary results to the language of bundles, PDEs and EDS. That language allows an enhanced version of Guillemin normal form that is equivalent to involutivity. Part III achieves the key purpose of this monograph, as a triumvirate is formed by the characteristic scheme, the rank-1 cone, and the mutual eigenvector problem for symbols. Part IV examines the integrability of the characteristic variety in various guises, and offers a general dogma (that the characteristic scheme knows all coordinate-invariant data about a system of PDEs) that suggests possible future developments in the theory of EDS.
This monograph was developed to support a series of lectures at the Institute of Mathematics at the Polish Academy of Sciences in September 2016, as part of a Workshop on the Geometry of Lagrangian Grassmannians and Nonlinear PDEs.
Part I Matrices and Subspaces
1. Tableaux and Symbols
Tableaux are very simple objects; every undergraduate encounters the example “ matrices form a vector space using the usual matrix operations,” and a tableau is any subspace of that vector space.
Given vector111When it becomes appropriate to do so, at (4.5) in Section 4, we switch from vector spaces to complex projective spaces for algebraic convenience. spaces and with and , a tableau is a linear subspace of . We use the notation and interchangeably.
Being a subspace, any tableau is the kernel of some linear map , called the symbol, whose range is written as . We have a short exact sequence of spaces:
[TABLE]
where is just notation for . Let and .
For example, let and , and consider the 5-dimensional tableau described in the standard bases by
[TABLE]
If is a matrix with entries , then the symbol defining consists of four conditions:
[TABLE]
1(a). Rank-one ideal
The fundamental theorem of linear algebra states that any homomorphism has a well-defined rank. Thus, for any tableau , we could ask how varies across . For our purposes, the most interesting222There is a good reason that the rank-1 case is most interesting: the varieties of higher-rank matrices are determined algebraically by the varieties of lower-rank matrices, so the geometry of across comes down to the rank-1 case. case is .
The space admits the rank-1 ideal, , which is irreducible and generated by all minors in any basis. This is a homogeneous ideal, so we may consider the rank-1 cone in vector space or the rank-1 variety in projective space. (The vertex of the rank-1 cone is the rank-0 matrix.)
For any , consider the ideal , which defines as the variety . The variety is the set of matrices in that are also rank-1; it is a linear section of the rank-1 cone defined by .
In the example (1.2), can be parametrized as matrices of the form
[TABLE]
which can be interpreted as the rational normal Veronese curve333 For more on Veronese curves and the more general Segre embeddings and determinantal varieties, see [Har92, Sha94].,
[TABLE]
Moreover, the projection of to is another rational normal curve,
[TABLE]
This toy example plays a crucial role in applications for hyperbolic and hydrodynamically integrable PDEs [FHK09, Smi09].
1(b). Generic Bases
We would like to find a “good” basis in which to express a tableau and study its properties.
First, an analogy. When studying a single homomorphism , or , there are various “good” bases of the domain and co-domain to express . A basis of is “generic” for if the first columns are independent. A basis of is “generic” for if the first rows of are independent in that basis. Among the generic bases, we can construct particularly “good” bases for writing . When is written in a “good” basis, we say it is in a “normal form,” and the normal form allows us readily to study properties of . For example:
- •
Use Gaussian elimination444Algorithmically, this is accomplished using improved Gram-Schmidt or Householder triangularization. See [TB97] for a discussion of stability of row-reduction. to place in reduced row-echelon form. Then, the rank, kernel, and image of are immediately apparent. The fundamental theorems in linear algebra depend on this normal form.
- •
Apply a polar/unitary decomposition to find the singular-value decomposition of . Then, the norm of and its action with respect to the Hermitian inner products of and are immediately apparent. Important theorems in metric geometry and multivariate statistics depend on this normal form.
- •
Solve a sequence of eigenvalue problems in the case to find Jordan normal form. Then, the eigenspace structure of , and the commutative algebra of matrices to which it belongs are immediately apparent. The theory of Lie groups and Lie algebras depends on this normal form.
Given a tableau with symbol , we are curious whether we can construct bases that are “good” simultaneously for all homomorphisms in the tableau. This situation is considerably more complicated than the situation of a single homomorphism, and it turns out that it is most important to focus on the symbol maps, but we arrive at a satisfying answer in Section 7. By the above analogy, it is convenient to establish a notion of “generic” bases formulated in terms of independence. This is done as follows.
In any bases of and , the tableau is a space of matrices only of whose entries are linearly independent. That is, in a given basis, we can consider the entries as elements of , just as we think of the components of vectors in as being linear functions on , using the dual basis of .
Across all bases of , there is a maximum number of independent entries that can occur in column 1; call that number . (In a measure-zero set of bases of , the number of actual independent entries in the first column may be less than .) Once those independent entries are accounted for, there is a maximum number of new independent entries that can occur in the second column. (In a measure-zero set of bases of that achieve in column 1, the number of actual independent entries in columns 1 and 2 may be less than .) Once those independent entries are accounted for, there is a maximum number of new independent entries that can occur in column 3. (In a measure-zero set of bases of that achieve in columns 1 and 2, the number of actual independent entries in columns 1, 2, and 3 may be less than .) Continuing in this way, we have as the number of new independent entries in the th column achieved for almost-all bases of . (In a measure-zero set of bases of that achieve in columns 1 through , the number of actual independent entries in columns 1 through may be less than .) Eventually, for such a basis, there is a column where we have reached , so there is some maximum column such that , where the last independent entry appears. So,
[TABLE]
The index is called the character of , and the number is called the Cartan integer of . The tuple gives the Cartan characters of . Note that , since otherwise the maximality condition would have been violated in an earlier column.
Permanently reserve the index ranges
[TABLE]
A basis555We follow the notation of differential geometry. This notation indicates an ordered basis of co-vectors, not a vector. Each is an element of . of is called generic if its Cartan characters achieve the lexicographical maximum value . As seen in the previous paragraph, almost all bases of are generic in this sense. Given a basis of , a basis666We follow the notation of differential geometry. This notation indicates an ordered basis of vectors, not a co-vector. Each is an element of . of is called generic if the first independent entries in column are independent.
Choose generic a basis for , and let be its dual basis for . Choose a generic basis for , and let be its dual basis for . An element of the tableau is written as
[TABLE]
and the upper-left entries for form a basis of .
Because the bases are generic, the symbol map can be written as
[TABLE]
It is implicit that if or or . That is, entries to the lower-right (unshaded) are written as linear combinations of the entries in the upper-left (shaded) using the coefficients , as in Figure 1.
Consider the example (1.3), which is not written in generic bases. If we exchange columns and rows , then it becomes generic with , seen here:
[TABLE]
Equation (1.10) becomes:
[TABLE]
One can take the dual perspective, wherein the symbol coefficients define a map from the upper-left independent entries to the lower-right entries. That is, consider the map
[TABLE]
defined by
[TABLE]
Equation (1.14) is the formal inclusion in the defining exact sequence (1.1). By fixing and , we obtain an endomorphism defined by the column relations of , as in Figure 2. We use the shorthand for , but note that this is not quite the same as because of the identity term in Equation (1.14). That is, for all .
For the example (1.11)–(1.12), the maps are:
[TABLE]
So, if and , the endomorphism is
[TABLE]
Using our generic basis for and its dual basis for , define decompositions and using our index convention (1.8) as follows:
[TABLE]
and
[TABLE]
The isomorphisms and depend on the basis; they are non-canonical but sometimes useful.
It is apparent from (1.14) that if ; that is if for all , so we usually consider only for .
Thus, in generic bases, we have a collection of endomorphisms of . For our purposes of constructing a normal form, a “good” basis is one which makes the endomorphisms as structurally similar as possible. Section 1(c) imposes additional conditions on the images of these endomorphisms for this purpose.
1(c). Endovolutive Tableaux
Suppose and are generic bases for . For any , define a decomposition by
[TABLE]
By (1.14), the map has support , and its image lies in .
More generally, for any , let and , where is the minimum index such that . (For general , we have .)
A tableau expressed in bases and is called endovolutive777The term endovolutive was coined in [Smi15], but the phenomenon was described previously in [BCG*+*90, Chapter IV§5], [Yan87], and it is certainly familiar to anyone who has manipulated tableaux of linear Pfaffian systems. if for all . That is, endovolutive means that is an endomorphism of , as in Figure 3.
Note that the example (1.15) is endovolutive because and and have non-zero entries only in the upper-left part.
In this way, when considering endovolutive tableaux, it useful to arrange the symbol endomorphisms as an array of matrices:
[TABLE]
Each “diagonal” entry is the matrix for which the non-zero upper-left part is an identity matrix, . Endovolutivity means that , which is the matrix in row of (1.20), is zero outside the upper-left part.
If a tableau is endovolutive in certain bases for and , then it is also endovolutive under any upper-triangular change-of-basis for . Under such a basis change, the columns of are linear combinations of the ones to their right, and the sub-matrices in (1.20) change by the corresponding conjugation. Endovolutivity is a property of the flag generated by the basis of .
1(d). Mutual Eigenvectors and Rank
For endovolutive bases, each is an endomorphism of a particular vector space, so it is sensible to consider an eigenvector problem for these maps: For any , let
[TABLE]
That is, we want to find the vectors that are preserved by but are annihilated by all for . More generally, let
[TABLE]
Equation (1.22) can be rewritten as a mutual eigenvector problem on the endomorphisms , …, :
[TABLE]
Alternatively, because , equation (1.22) says that is rank-1 when restricted to , so we can rewrite it as
[TABLE]
This space is the focus of [Gui68], and it plays an important part in our story. Unlike , its definition does not depend on the basis; its definition depends only on the splitting . Its dimension is an important invariant.
Lemma 1.25**.**
Suppose that the tableau admits endovolutive bases. For generic , .
Lemma 1.25 is the result of a quick rank computation using (1.22)–(1.23). See [Smi15].
Our “good” basis and normal form will be built on the requirement that the maps commute on certain combinations of these spaces (and thus the maps share generalized eigenspaces and Jordan-block normal form there). That is, we are aiming for something like the commutative subalgebras seen in [Ger61] and [GS00]. Endovolutivity allows surprisingly direct computation of the desired conditions. For more detail on endovolutivity, see [Smi15] and the references therein. We return to this topic in Section 5, but before that we must introduce the geometry of subspaces.
2. Grassmann and Universal Bundles
The Grassmann variety is the set of -planes in an -dimensional vector space . It is a smooth projective variety and a smooth manifold of dimension . An -plane is called an element.
2(a). Tangent and Arctangent
Depending on one’s favorite notation, there are several ways to see that the tangent space of at is .
First, for any , choose a basis for , and choose so as to complete a basis of the entire vector space . Any -plane near admits a basis that we may assume is related by a matrix in reduced column-echelon form:
[TABLE]
More succinctly, using the summation convention:
[TABLE]
That is, and are related by an matrix of rank whose image is determined uniquely by the submatrix . In this sense, is isomorphic to the space of matrices, which is isomorphic to . This is easy and computational, but this isomorphism is not canonical for an abstract vector space (without metric) because it depends on a choice of splitting by choosing the complementary basis .
Alternatively, to see and avoid splitting, we can use the dual888Recall that is canonically isomorphic to : if , then is well-defined for all . short-exact sequences
[TABLE]
Choose any basis of the annihilator space , and let be the corresponding dual basis of . Then, we may take the coefficients of
[TABLE]
as coordinates on ; that is, gives a basis of .
More abstractly, an explicit choice of bases for and for is unnecessary. Instead, we need only the abstract homomorphism , because the space999Note that is not well-defined in for any particular , but the span over all such is well-defined.
[TABLE]
is invariant under transformations on and as well as transformations on . That is, is the “graph” of over all .
As in Figure 4, the derivative map near is a multidimensional generalization of the tangent function, so the inverse map101010The map is analogous to exponential map from Riemannian geometry or Lie group representation theory, except that this description of does not make explicit use of a metric or group structure. is written
[TABLE]
The reader is encouraged to read [MS74, §5] and [KN63] and to search for the terms Plücker embedding and Stiefel manifold for more detail on this subject.
Remark 2.7*.*
Notice that any linear subspace of is a tableau in the sense of Section 1. In some sense, it is the only example, as arbitrary and could be studied by setting and . Moreover, any smooth submanifold with tangent space at gives as a tableau in . This observation is the heart of the entire subject of exterior differential systems, and it reappears forcefully in Section 4.
2(b). Polar pairs
The purpose of this subsection is to establish two results, Lemmas 2.12 and 2.15, that tie the algebraic geometry of intersecting subspaces to the differential geometry of the Grassmannian. These lemmas are used in Part III to demonstrate the correspondence between the characteristic variety (in the Cauchy problem of a system of PDEs) and the rank-1 variety (of the tableau of an EDS) in Lemma 6.4, thus providing the foundation of the geometric theory of PDEs.
Suppose that , and that they intersect along a hyperplane. That is, suppose and . We call the pair of -planes and a polar pair because they are both polar extensions111111This is a classical terminology that reappears in Section 6(a). of . For any , let
[TABLE]
We say is a polar pair of . This relationship is symmetric—hence the unqualified term polar pair—as if and only if , but this relationship is not an equivalence relation, as it fails both reflexivity and transitivity.
Within the image of , Lemma 2.9 ties the notion of polar pairs to lines in the tangent space ,
Lemma 2.9**.**
Suppose and for . Then if and only if .
Proof.
Suppose that . Let , so . Let be a basis for , and extend that basis to a basis for and to a basis for . Writing (2.2) in this case, it is apparent that only the th column of is nonzero. That is, the tangent homomorphism is rank-1. (It cannot be the degenerate rank-0 unless .)
Conversely, suppose that is rank-1. Let , which is a subspace of of dimension . Any line in is preserved by the map defined by the matrix in (2.2); hence, the subspace is also a subspace of . (It cannot be the degenerate case unless is rank-0.) ∎
The concept of polar pairs generalizes to co-dimensions other than . For any , let
[TABLE]
Because and , the set is nonempty if and only if , because . The definition is trivial and fairly useless for . Again, the -polar-pair relationship is symmetric but neither reflexive nor transitive for the interesting case .
One can see immediately that Lemma 2.9 generalizes by replacing with any rank to give Lemma 2.11.
Lemma 2.11**.**
Suppose and for . Then if and only if .
Next, we can generalize Lemma 2.11 to Lemma 2.12 by dropping the use of . That is, we can consider a -polar pair where lies outside the open image of . From an algebraic perspective, Lemma 2.12 can be seen as a Grassmannian version of the rank-nullity theorem. Phrased in other ways, it is popular true/false homework question in undergraduate linear algebra textbooks.
Lemma 2.12**.**
Fix and . The canonical maps and both have rank- images, yielding the incidence correspondence Figure 6.
Proof.
Let . Consider the short-exact sequences (2.3), and apply the rank-nullity theorem of those maps on , which has dimension . In the first short-exact sequence, has dimension as a subspace of . In the second short-exact sequence, the space has dimension as a subspace of . In both cases, and such subspace can be constructed this way. ∎
Now, reconsider the case in light of Lemma 2.12. Then each yields a hyperplane . The right image in Figure 6 is some line . The left image is some line . So, each yields a rank-1 projective homomorphism . Any element of could be obtained this way by appropriate choice of .
To see how this generalizes Lemma 2.9, let us write explicitly. Let
[TABLE]
be a basis for such that and for some . Then, for some . Because , it must be that
[TABLE]
Hence, each is a multiple of ; call it . (Note that for all if and only if , which contradicts our assumption .) We can use this fact to build a rank-1 homomorphism: Let be the basis of dual to . Let also denote the basis of induced by , so that also denotes the image of . Let . Then the induced homomorphism
[TABLE]
is rank-1. Each of and is well-defined up to scale, so is well-defined up to scale, yielding .
It may be that lies outside the open image of . How then do we interpret ? Is there any relationship between and ? From a differential geometric perspective, this is reminiscent of the failure of injectivity at large distances for the exponential map in Riemannian geometry. Lemma 2.15 shows that for any polar pair of , either lies in the curve or is the limit of the curve.
Lemma 2.15**.**
Suppose and . Then there is a continuous path in such that , , and for all . The rank-1 line induced by via from Lemma 2.12 is constant across . Moreover, for .
Proof.
Let . For some independent vectors , we may write and and define121212If preferred, one can reparametrize from a linear interpolation to a circular interpolation by replacing with and with for some angle . a curve from to in by
[TABLE]
Note that for all . It is apparent from (2.16) that is the line , which is constant versus . It is also apparent from (2.16) that is the line , which is constant versus . Hence, all such have the same representative rank-1 homomorphism, in Lemma 2.12.
It may be that lies outside the open image of . However, comparison of (2.16) and (2.2) implies that all lie inside the image of for all . So, the image contains an open set of where . ∎
Consider the example summarized in Figure 5, where
[TABLE]
Note that is outside the open image of because (2.2) breaks down as written in this basis. But, is the family obtained by rotating from toward about the axis through an angle , which varies from [math] to . For all , we have
[TABLE]
Thus, the line of rank-1 matrices in is written as in this basis. This line represented by every in a curve that converges to as . Indeed, up to a choice of basis, this is essentially the only example.
Overall, we have learned that any polar pair in is represented by a line of rank-1 matrices in the tangent space, and vice-versa. This is sufficient for our purposes, but those seeking a more detailed understanding of polar pairs are encouraged to investigate Schubert varieties—for example in [Rob12]—and the other outgrowths of Hilbert’s 15th problem.
2(c). The Tautological Bundle
Soon, we will consider algebraic equations defined on . To facilitate this, for any , we consider the complexified projective space and its subspace . For standard complex projective space, we write for . That is, , and .
If we consider all such spaces across all simultaneously, we obtain the tautological bundle131313 These are also called universal bundles or canonical bundles. They are analogous to the sheaves and , respectively, for varieties in projective space. over with fiber
[TABLE]
and its dual bundle over with fiber
[TABLE]
and its annihilator bundle over with fiber
[TABLE]
and its cokernel bundle over with fiber
[TABLE]
See Figure 7.
There is a dual pair of short exact sequences of projective bundles, analogous to (2.3).
[TABLE]
Hence, the complex projectivized tangent bundle is isomorphic (canonically) to . If we choose a splitting of these sequences, then we can use the dual bases to establish a (non-canonical) decomposition for any .
One can also consider the frame141414Some authors might flip the names of the frame and coframe bundles. I tend to choose this notation because the frame bundle is covariant with diffeomorphisms on the base space, and only contravariant objects get a “co-” prefix. The jargon for duality is always frustrating. bundle over associated to , whose fiber is all linear isomorphisms
[TABLE]
and the coframe bundle over associated to , whose fiber is all linear isomorphisms
[TABLE]
To write homogeneous complex-algebraic ideals on that vary across , one can choose any section of to give coordinates, and use the ring
[TABLE]
Part II PDEs on Manifolds
In this part, we build bundles whose fibers are the structures seen in Part I. This produces a satisfying language for describing a system of PDEs on a manifold in Section 4.
3. Bundles upon Bundles
If is a smooth manifold of dimension , then we can form the smooth bundle with fiber . Let denote the bundle projection.
Because (2.3) holds for at any , any local section of can be described by choosing its annihilator section of , and vice-versa. For every , the Grassmann variety has a tautological bundle with fiber , a dual bundle, and so on.
The total space is a manifold in its own right, so we may consider as a bundle over the manifold , which is itself a bundle over . In other words, we can reinterpret all of Section 2(c) in terms of bundles over by using denote the projective bundle over that has fiber at with . A complete description of some would be where , and , and . A complete description of some would be where , and , and . See Figure 8. The same bundle-wise constructions hold for , , , and from Section 2(c).
Extending (2.26) to write homogeneous complex-algebraic ideals on that vary across , one can choose any section of to give coordinates, and use the ring
[TABLE]
3(a). The Contact Ideal
For any , consider its annihilator subspace . There is a corresponding subspace , defined as
[TABLE]
as in Figure 9. If is a basis of , then we let for each to define a basis of .
In the exterior algebra , consider the ideal that is generated as . This is called the contact ideal, and it is the first example of an EDS as seen in Section 4. Note that, for any (local) section , the contact ideal satisfies the universal reproducing property
[TABLE]
Because this property is universal, the subbundle is a submodule defined globally across even if topology forces any particular section to be defined locally.
If one were to choose local coordinates for and local fiber coordinates for near a particular -plane , then is the ideal typically written as
[TABLE]
where the functions depend on in an open neighborhood of in .
After reading Section 3(b), compare this coordinate description to your favorite definition of jet space, . Also, compare the local fiber coordinates to the tangent coordinates from Section 2(a); when restricting to the fiber over a single basepoint , they are essentially identical. For some highly amusing applications of the contact system, see [Gro86].
3(b). Immersions and Frame Bundles
Fix an immersion with . For any with , the push-forward derivative has image , which is an -dimensional subspace of ; hence, . Define the map by
[TABLE]
and note that , so .
It is obvious from the definition that is also an immersion. Therefore, we can use it to pull-back the tautological bundle as defined in Sections 2(c) and 3. Let , which has fiber
[TABLE]
that is, is identified with via . See Figure 10.
The immersion is called the prolongation of the immersion .
Now, consider the contact forms forms from Section 3(a). For all and all , we have
[TABLE]
which ultimately gives the following lemma:
Lemma 3.8**.**
If is an immersion for , then . Conversely, if is an immersion for satisfying and such that the image is transverse to the fiber for all , then there is some immersion such that .
Moreover, recall that any manifold of dimension admits a projective frame bundle with fiber
[TABLE]
The total space admits a tautological151515In various references, this 1-form is called the canonical, the Hilbert, and the soldering 1-form. 1-form defined by as in Figure 11. It is characterized by its universal reproducing property: for any (local) section :
[TABLE]
or, more succinctly, .
Because this property is universal, the 1-form is defined globally across even if topology forces any particular 1-form to be defined locally.
For any local diffeomorphism , there is an induced (covariant) map on the frame bundles by . Using the universal property, it is easy to prove this lemma, which shows that diffeomorphisms are characterized by their preservation of the tautological form on the frame bundle:
Lemma 3.11**.**
If is a diffeomorphism, then . Conversely, if is -equivariant diffeomorphism such that , then there exists a unique diffeomorphism such that .
Combining the universal properties of the and , we obtain the following theorem telling us what information we can transfer from to an immersed submanifold:
Theorem 3.12**.**
If is a smooth immersion, then
- •
, and
- •
.
Conversely, if is a smooth immersion such that
- •
, and
- •
,
then there exists a smooth immersion such that .
That is, an immersed submanifold satisfies the contact ideal, which is generated differentially by some annihilator 1-forms spanning , and its frame bundle is equipped with tautological 1-forms spanning .
Remark 3.13*.*
Note the similarity between the universal property of the contact ideal on the Grassman bundle and the universal property of the tautological 1-form on the frame bundle. Exploitation of this interaction as in Theorem 3.12 has a long and interesting history.
For example, consider the study of a Lie pseudogroup acting on a manifold . One option is to differentiate the coordinates of repeatedly using the contact ideal until differential syzygies of the Lie pseudogroup action can be found in prolonged local coordinates, which are then converted to a coordinate-free description using the pseudogroup action. The other option is to work on the frame bundle of immediately, where any expression on the tautological 1-form is automatically invariant, then prolong as necessary to reveal the syzygies. The latter is used often when the Lie pseudogroup arises as equivalence of intrinsic -structures, and the former is used often when the Lie pseudogroup arises from an extrinsic action on some ambient coordinates. For more on these fascinating and interconnected ideas, I encourage you to read [Cle17], [Olv95], [Val13], and [Gar89]—and the collected works of E. Cartan.
4. Exterior Differential Systems
Let be a smooth manifold of finite dimension . An exterior differential system [EDS] on consists of an ideal in the total exterior algebra that is differentially closed and finitely generated. Differentially closed means that . Finitely generated means that in each degree , the -forms in the ideal, , form a finitely generated -module. We assume that ; otherwise, one would restrict to a subvariety of defined by those functions. A solution or integral manifold is an immersed manifold such that . Optionally, we sometimes specify an independence condition as an -form that is not allowed to vanish on solutions. When an EDS represents a system of PDEs in local coordinates , then , meaning that we seek solutions on which those coordinates are sensible, and is a function that gives the dependent variables in (those transverse to ) as functions of the independent variables in .
Remark 4.1*.*
Exterior differential systems are defined this way because the term “PDE” or “system of PDEs” is difficult to pin down with geometric precision. Colloquially, “system of PDEs” usually means a finite set of (hopefully, smooth) equations on some local jet space. In Section 2, we explored the geometry of the bundle ; recall that the contact system on provides a coordinate-invariant notion of jet space. So, a system of PDEs can be thought of as a collection of equations on jet . Hopefully, those equations are smooth and respect the bundle structure coming from the contact system (otherwise, derivatives misbehave). By virtue of the Plücker embedding , an EDS provides precisely the structure to write an ideal whose variety is a subvariety (in the bundle sense) of . By taking smooth subvarieties, we can apply Remark 2.7 and apply our knowledge of tableaux from Part I to study EDS. Even by this definition, an EDS could be rather wild; however, in many practical applications, it happens that is generated by a finite collection of smooth differential forms of homogeneous degree, so one obtains a smooth algebraic variety in local fiber coordinates of . See [McK18] for more examples, additional insight, and historical context.
4(a). Differential Ideals and Integral Elements
To be precise, an integral element of at is a linear subspace such that for all . That is, the -forms in provide a collection of functions that cut out a variety, . These functions vary smoothly in and are homogeneous in the fiber variables.
There is a maximal dimension for which is locally non-empty, which is the case of interest. If an independence condition is specified, we also require , which forces to lie in the open subset of for which that condition holds. (For example, in the case of the contact system, the condition holds in the same neighborhood where (3.4) makes sense.)
Because is finitely generated by smooth functions, Sard’s theorem guarantees an open, dense subset defined as the smooth subbundle of that is cut out smoothly by smooth functions.
Definition 4.2** (Kähler-ordinary).**
Integral elements in are called Kähler-ordinary.
A single connected component of is denoted . We allow ourselves to redefine so that is a smooth bundle.
Let denote the dimension of each fiber of the projection , so is the corresponding codimension of in . That is, the projective bundle is a tableau in the sense of Remark 2.7, as each fiber is a linear subspace of . Because is a smooth manifold, we have
Lemma 4.3**.**
* implies .*
That is, we have a well-defined vector bundle over .
Definition 4.4** (Kähler-regular).**
If is a Kähler-ordinary integral element and the Cartan characters of each tableau are constant in an open neighborhood of , then is called Kähler-regular.
That is, the Kähler-regular integral elements form a dense open subset of the Kähler-ordinary integral elements, which form a dense open subset of the whole variety of integral elements.
So that we may apply the results of Section 1 without treating the Cartan characters of as functions of , we redefine to be a single connected component of Kähler-regular integral elements, and we again allow ourselves to redefine so that is a smooth bundle.
Such is called the first prolongation of , though it is clear from the definition that there could be multiple first prolongations, depending on which components of are under consideration.
To generalize the notation and results of Part I to , define the restricted tautological bundles
[TABLE]
Warning! These are now complex projective bundles, not vector spaces as in Section 1! Sometimes, it is convenient to think of as being a complex projective bundle, too, in which case we consider it to be a subbundle of the projective bundle . Of course, the notation has been developed to be consistent regardless.
An integral manifold of is an immersion such that for all . (If an independence condition is specified, we require that , too.) When we are considering a particular Kähler-regular component as above, we say is an ordinary integral manifold provided that . All of the observations from Section 3(b) apply, but lies in the submanifold , and lies in the subbundle . The overall goal is to construct all ordinary integral manifolds of through the careful study of the geometry of a Kähler-regular first prolongation .
4(b). Prolongation and Spencer Cohomology
Suppose that is an ordinary integral manifold of . By Theorem 3.12, the 1-forms spanning must vanish for each . The tautological form on pulls back to a nondegenerate frame on , since is an immersion.
Therefore, if actually exists, we have
[TABLE]
However, working on the frame bundle of , these forms satisfy a more general equation, called Cartan’s structure equation:
[TABLE]
The derivative of must take this form, because and are semi-basic with respect to the bundle , whereas is vertical, so cannot involve a totally vertical 2-form. See discussion of connections and principal bundles in [KN63].
Let us now describe the meaning of each of the terms in (4.7), with respect to the ordinary integral manifold . Using the dual coframe for , we can see that lies in . (Hence, it is called the tableau term.) In particular, it must be that
[TABLE]
for some function that must satisfy , so . That is, the homomorphism lies in the fiber of over , as
[TABLE]
Moreover, the existence of an immersion requires that the torsion term can be removed in (4.7); otherwise, it cannot be that as required. That is, it must be possible to rewrite \pi^{a}_{i}\mathrel{\reflectbox{\mapsto}}\pi^{a}_{i}+Q^{a}_{i,j}\omega^{j} for such that any term is absorbed. Note that this absorption of torsion is an algebraic property of the tableau . In summary, we have Lemma 4.10.
Lemma 4.10**.**
Let denote the composition of skewing and inclusion , and write and :
[TABLE]
For any ordinary integral manifold , the homomorphism of (4.8) and (4.9) lies in , and the pullback of torsion is zero in .
Writing in a chosen coframe, it is easy to check that
[TABLE]
The case of equality is considered in Section 5.
The exterior differential system on generated as
[TABLE]
is called the (first) prolongation of , and we are back where we started at the beginning of Section 4. We can construct , and repeat the entire process for over that was used for over . Lemma 4.10 essentially says that is the tableau bundle . Thus, we can construct over and re-apply Lemma 4.10, and so on. By the definition of and (4.13), we have
Corollary 4.14**.**
Every ordinary integral manifold of is also an ordinary integral manifold of . However, the converse might fail, as the smooth connected locus of may be a strict subset of .
Overall, we achieve exact sequences that summarize the entire situation of the tangent spaces of an immersed ordinary integral manifold of , , , , …
[TABLE]
The cokernels , , …, are the Spencer cohomology of the tableau . Even outside the context of exterior differential systems, they are defined for formal tableaux via the exact sequences (4.15) as
[TABLE]
Spencer cohomology detects functional obstructions to the solution of the initial-value problem on in the form of torsion; this is explained nicely in [IL03, Section 5.6], and the reader is urged to read their presentation.
Spencer cohomology was a major focus of the formal study of partial differential equations and Lie pseudogroups in the mid-20th century; most notably, [Spe62, Qui64, SS65, GQS66, Gol67, Gar67, Gui68, GK68, GQS70]. As it happens, many of the major results of that era are easy to re-prove under our regularity assumptions on and using the endovolutive notation from Section 1, particularly when using the involutivity criteria in Section 5 that were detailed in [Smi15]. We demonstrate this in Parts III and IV.
5. Involutivity of Exterior Differential Systems
Definition 5.1** (Cartan’s test).**
A tableau is called involutive if equality holds in Equation (4.12),
[TABLE]
Definition 5.2**.**
A tableau is called formally integrable if for all .
Cartan’s test comes from the following consequence of the Cartan–Kähler theorem.161616See [BCG*+*90, Chapter III] or [IL03] for more background on the Cartan–Kähler theorem; it is not our focus here.
Theorem 5.3**.**
Suppose that is an analytic exterior differential differential system, that is a smooth sub-bundle, and that the tableau bundle of homomorphisms has constant171717That is, is Kähler-regular. Cartan characters over . If is involutive and formally integrable, then through any point in , there is an analytic ordinary integral manifold . Moreover, such are parametrized locally by constants, functions of 1 variable, functions of 2 variables, …, functions of variables.
Somewhat confusingly, the situation in Theorem 17 is called involutivity of ; that is, an EDS might fail to be involutive even if its tableau is involutive, because there may be nonzero torsion in , meaning that fails to be formally integrable. This means essentially that the ideal is being studied on the wrong manifold.
For a beautiful interpretation of Cartan’s test that is relevant to the later Sections of this course, read the introduction of [Yan87]. In summary, ordinary integral manifolds are constructed by decomposing the Cauchy problem into a sequence of steps, each of which is determined and has solutions using the Cauchy–Kowalevski theorem.
For fixed spaces and , involutivity is a closed algebraic condition on tableaux in . Because the conditions come from Cartan’s test, which involves , it is not surprising that these conditions are quadratic; however, writing down the precise ideal is a lengthy argument. Doing so was suggested in [BCG*+*90, Chapter IV§5] and accomplished for general tableaux in [Smi15] following the outline in [Yan87].
Theorem 5.4** (Involutivity Criteria).**
Suppose a tableau is given in generic bases as in (1.14). The tableau is involutive if and only if there exists a basis of such that
- (i)
* is endovolutive in that basis, and* 2. (ii)
* for all and and all .*
This theorem is our main computational tool in Part III.
5(a). Moduli of Involutive Tableaux
While it seems like a trivial (if lengthy) computation, consider carefully the meaning of Theorem 5.4: We can fix , , and Cartan characters and then write down an explicit ideal in coordinates whose variety is all of the involutive tableaux with those Cartan characters. Hence, we can use computer algebra systems such as Macaulay2, Magma, and Sage to decompose and analyze that ideal using Gröbner basis techniques. With enough computer memory, we can answer the question “What is the moduli of involutive tableaux?” By virtue of Theorem 17, this is fairly close to answering the question “What is the moduli of involutive PDEs?”
For example, fix and . For some coefficients in the ring , an endovolutive tableau must be of the form
[TABLE]
or in block form like (1.20),
[TABLE]
Involutivity is an affine quadratic ideal on generated by the last rows of and , so:
[TABLE]
The complete primary decomposition of this ideal reveals two components. The maximal component has dimension 12, and it is described by the fairly boring prime ideal . The other component has dimension 11 and its prime ideal is generated by 27 polynomials. See http://goo.gl/jGTnMU for how to compute this in SageMathCell.
Many of your favorite involutive second-order scalar PDEs in three independent variables live somewhere in this variety; see (1.15) and Section 6(c). Up to some notion of equivalence, this is essentially the moduli space of such equations. As seen in Part III, their characteristic varieties are obtained by combining with the rank-1 ideal on .
However, there is still some ambiguity to be resolved, as it may be that a given abstract tableau admits several endovolutive bases with apparently distinct coordinate descriptions.
5(b). Cauchy retractions
Before proceeding to Part III, it is worthwhile to mention Cauchy retractions, which are much simpler than—and quite distinct from—elements of the characteristic variety. To confuse matters, many references call these “Cauchy characteristics.” For any differentially closed ideal , the Cauchy retractions are the vectors that preserve ; that is, . Because is differentially closed, the annihilator bundle is the smallest Frobenius ideal in that contains . Then, for any integral manifold , the subspaces form an integrable distribution; that is, is Frobenius as well [Gar67].
Because is a Frobenius system—a system of ODEs—it is common to redefine so that it is free of Cauchy retractions before proceeding to study its integral manifolds. The distinction between and the characteristic variety is explored further in [Smi14].
Part III Characteristic and Rank-one Varieties
Thank you for taking the time to read the enormous amount of background in Parts I and II. We are ready to define and deconstruct a fascinating mathematical object that lies at the heart of PDE theory.
Here we stand: We have an exterior differential system on . Perhaps this EDS arose from a system of PDEs on and is equipped with an independence condition . The EDS yields a smooth Kähler-regular subbundle , where any is an integral element of the original EDS. As a manifold in its own right, is equipped with tautological bundles , , , and from (4.5). Moreover, is a subbundle of , so it is a tableau bundle. Its symbol gives a short-exact sequence of bundles,
[TABLE]
An integral manifold is an immersion such that for all . Let denote the map .
As you read this part, compare it to [IL03, Section 4.6] and [BCG*+*90, Chapter V]. The reader will note that we do not assume that is a linear Pfaffian system, nor do we build a prolonged EDS using the contact system. Instead we are working with the tautological bundles per Remark 3.13.
6. The Characteristic Variety
The original motivation for the characteristic variety is to see where the initial-value problem becomes ambiguous. That is, given an initial condition for our PDE on a local submanifold of dimension , when would the -dimensional solutions for that initial condition fail to be unique? We express this condition in terms of integral elements.
6(a). via Polar Extension
For an integral element , we consider its space181818The polar space is a vector space thanks to the assumption that is a finitely-generated -module, because that assumption implies that the polar equations over are a linear subspace of . of integral extensions, called the polar space,
[TABLE]
and the polar equations comprise its annihilator,
[TABLE]
The polar rank is . If , then admits no extensions. If , then admits a unique extension to some .
The case of interest is , meaning that admits many extensions, so the initial-value problem from to is ambiguous. For any , we can identify a hyperplane with via . Because where is the maximal dimension of integral elements of , the function cannot be positive on an open set of , so the case is a closed condition. Moreover, the function is the rank of a linear system of equations, so it defines a Zariski-closed projective algebraic variety. We choose to study that algebraic variety projectively over . Hence, the typical definition of the characteristic variety of is
[TABLE]
This initial definition is refined in Section 6(b) to produce a scheme. To study properly the ambiguity of the initial-value problem, we want to assign a multiplicity to each and decompose into irreducible components based on the structure of the space .
6(b). via Rank-one Incidence
For both computational and theoretical purposes, it would be convenient to tie the polar space to the geometry of the tableau of an extension of . The discussion of polar pairs in Section 2(b) links these two objects, to provide another interpretation of the initial-value problem that is much more convenient than (6.3).
Fix , and suppose that both and are integral extensions of for some . By the definition of , it must be that lies in , but we do not know whether lies in the particular maximal smooth component of that we call . However, the results of Section 2(b) guarantee that is detected by even if is not in , in the following way.
Lemma 6.4**.**
Fix , and suppose that both and are integral extensions of for some . Let be such that . Then , and there is an open 1-parameter family of integral extensions of near in that also represent .
Proof.
Because , Lemma 2.12 yields a particular line of rank-1 homomorphisms in representing . Because is a vector space191919Here we see again why it is helpful for an EDS to be finitely generated. such that and , the rank-1 projective homomorphism takes the form of for some .
By Lemma 2.15, there is a continuous 1-parameter family of other polar pairs of , with , converging to , all of which share the rank-1 projective homomorphism .
That is, as a line of rank-1 homomorphisms, is contained in , as a subspace of . Applying , this implies that for all . By the definition of , this means for all . But, the follow a continuous curve, and lies in the open subset . Therefore, all for an open set of sufficiently small . Differentiating, we see that the line is contained in the tangent space of the fiber of at , namely . ∎
On the other hand, for fixed and , there are various distinct corresponding to linearly independent . With Figure 6 in mind, it is easy to see that
[TABLE]
Recall the rank-1 ideal from Section 1. Here it applies to vector bundles. As a set, the rank-1 subvariety of the tableau is
[TABLE]
As a set, the characteristic variety is the projection of to . More precisely, is the scheme202020We must study along with its various components and multiplicities, so it is better to think of it as a scheme than as a simple-minded variety. defined by the characteristic ideal on that is obtained from the rank-1 ideal on in the following way: For any , define by . Note that by (6.5) and (6.6), but this does not account for multiplicity within itself. Then the scheme is the incidence correspondence212121 For more background on the utility of incidence correspondences in algebraic geometry, see the 2013 Columbia Eilenberg lecture series by Joe Harris, [Har13]. A YouTube link is in the bibliography. of for the symbol map . See Figure 12.
This interpretation is amazing. Suddenly, two completely elementary ideas from Section 1—tableaux of matrices and rank-1 matrices—come together to give a concise description of the most subtle structure in PDE theory.
However, the scheme components and multiplicities are still not obvious from Figure 12; they must be obtained by examining the degree of the equations defining . The powerful third interpretation in Section 7 provides this detail. But first an example.
6(c). Example: The Wave Equation
Consider the PDE . To do this, we consider the manifold with coordinates ,,, , , , , , , , , . Consider the exterior differential system generated by
[TABLE]
Let for , so the derivatives are computed as
[TABLE]
where , , , and .
Changing bases, this tableau is equivalent to an endovolutive one of the form
[TABLE]
Or in block form
[TABLE]
Note that the third row of both and are zero, so the tableau is involutive by Theorem 5.4.
The rank-1 condition is
[TABLE]
After a simple change of basis, this becomes the example (1.2) – (1.4), seen throughout the earlier sections.
7. Guillemin Normal Form and Eigenvalues
In this section, we reinterpret and as properties of the endomorphisms . This section is the key to all of the more advanced results that follow. Our main computation tool is the structure of an endovolutive tableau discussed in Section 1(c), where and and are now bundles over .
The incidence correspondence of Figure 12 is rephrased in Lemma 7.1.
Lemma 7.1**.**
If , , and , then
[TABLE]
In particular, is an eigenvector of for all .
Proof.
Fix generic bases and and , so that and and . Set , so for all , and this must satisfy the symbol relations (1.10). In particular, for . Therefore
[TABLE]
(Here we see the utility of including the first summand in Equation (1.14).) ∎
Recalling the decomposition (1.17) and (1.18), Lemma 7.4 provides a sort of converse of Lemma 7.1.
Lemma 7.4**.**
Suppose that is an endovolutive tableau. Fix and suppose that is an eigenvector of for every . Then there is a over such that , so .
Proof.
For each , let denote the eigenvalue corresponding to , so that . Because is linear in , so is . Then . Therefore, . In particular, the rank-1 condition implies that
[TABLE]
This is the same expression as in (1.22), so by comparing recursively over , we see that for all , so . ∎
Lemma 7.4 deserves a warning: There may be multiple over the same , for perhaps there are different eigenvectors admitting different sequences of eigenvalues , for , associated to the same . Moreover, it is not (yet) clear that a mutual eigenvector exists for every such .
But overall it is clear that there is some relationship between the eigenvalues of and the characteristic variety of an endovolutive tableau . This relationship is made precise for involutive tableau using a result from [Gui68].
Theorem 7.6** (Guillemin normal form).**
Suppose that is involutive. For every and , the restricted homomorphism is an endomorphism of . Moreover, for all ,
[TABLE]
Compare Theorem 7.7 to Lemma 4.1 in [Gui68] and Proposition 6.3 in Chapter VIII of [BCG*+*90]. Theorem 7.7 is known as Guillemin normal form because it implies that the family of homomorphisms can be placed in simultaneous Jordan normal form on . It is the “normal form” alluded to in Section 1(b). We defer the proof of Theorem 7.7 to Section 9 so we may first see its important consequences.
Corollary 7.8**.**
If is involutive, then for each , there exists some satisfying the hypotheses of Lemma 7.4. That is, the projection map is onto. In particular, if is nontrivial and involutive, then is nonempty.
Proof.
Because we are working over , the commutativity condition (7.7) guarantees that common eigenvectors exist for the commutative algebra . ∎
Lemma 7.9**.**
Suppose that is an involutive tableau. Then the map of projective varieties induced by is a finite branched cover. In particular, both and have affine fiber dimension .
Proof.
Fix . The set of over is nonempty by Corollary 7.8. If it were true that the set of projecting to a particular were infinite, then the parameter would take infinitely many values in some expression of the form
[TABLE]
But, the matrix can have at most eigenvalues. ∎
Here we arrive at an easy222222It is easy in the sense that we have the explicit polynomials of in hand, and they are recognizable as the familiar eigenvector equations. The reader should compare (7.14) to the descriptions provided in [BCG*+*90] and [IL03]. Both references defer their decomposition of to the abstract Grothendeick–Riemann–Roch theorem. Hence, neither reference indicates how to compute the scheme by hand for general tableaux. While details are given in [BCG*+*90] in the simple case of rectangular tableaux, a complete description is achieved here because of the normal form provided by Theorem 5.4. proof of the main theorem regarding the structure of .
Theorem 7.11**.**
If is involutive, then and .
Proof.
We work in endovolutive coordinates. From Lemma 7.9, we already know that .
Fix a generic point over . Let denote the fiber over in . To understand the scheme , we must determine the degree of the condition defining . Note that must be a subvariety of , and is a linear subspace of , so the degree of is the degree of some condition on .
By Lemma 7.1 and (6.6), the condition that is nontrivial is precisely the condition that
[TABLE]
Since we may restrict our attention to , the condition (7.12) for is automatic by (1.23). Hence, only these terms contribute to the non-linear part of the ideal:
[TABLE]
So, without coordinates, the defining equations of are
[TABLE]
For a particular , this is the characteristic polynomial of as an endomorphism of . By involutivity and Theorem 7.7, all for admit the same Jordan-block form, so they admit the same factorization type for their respective characteristic polynomials. That means it suffices to consider a single . By definition, the characteristic polynomial of has degree at generic . Therefore, follows from Lemma 1.25. ∎
Theorems 7.7 and 7.11 provide a powerful interpretation of the form of an involutive tableau seen in Theorem 5.4 and Figure 3; the first columns represent a projection of , as in Lemma 7.9, and the rank-1 incidence correspondence in Figure 12 is precisely the eigenvector condition on the appropriate subspaces. It is peculiar and interesting that these results were discovered in the opposite order historically, as explored in Section 9.
The proof of Theorem 7.11—in particular Equation (7.14)—gives a precise understanding of as a scheme. Specifically, the characteristic scheme (in the sense of PDE) is merely a scheme of characteristic equations (in the sense of linear algebra)! The components of correspond to the various Jordan blocks apparent in (7.14). The multiplicity of each component is the dimension of that generalized eigenspace. The sheets of the finite branched cover come from different generalized eigenspaces where the first eigenvalues match. See Section 8 for how to compute this.
8. Examples
8(a). Zero-dimensional examples
Consider some cases of involutive tableaux with .
[TABLE]
Or, in endovolutive block form:
[TABLE]
The characteristic ideal will have degree and projective dimension . That is, will be 4 points, counted with multiplicity. The involutivity condition is (all rows); that is, the matrices commute. Thus the matrices and must have compatible Jordan-block forms; they span a commutative algebra. In these examples, we will use colors to emphasize the distinct generalized eigenspaces.
One possibility is that the matrices are diagonal with distinct Jordan blocks:
[TABLE]
In this case, the rank-1 variety is
[TABLE]
Each point has multiplicity 1.
Another possibility is that they are diagonal, but there is an two-dimensional eigenspace.
[TABLE]
In this case, the rank-1 cone is
[TABLE]
One point has multiplicity 2; in particular, the fiber for \xi=[1:{\color[rgb]{0.82,.10,0.26}\definecolor[named]{pgfstrokecolor}{rgb}{0.82,.10,0.26}c_{1}}:{\color[rgb]{0.82,.10,0.26}\definecolor[named]{pgfstrokecolor}{rgb}{0.82,.10,0.26}d_{1}}] should be seen as a . This is reflected clearly in (7.14), because \xi=[\xi_{1}:\xi_{2}:\xi_{3}]=[1:{\color[rgb]{0.82,.10,0.26}\definecolor[named]{pgfstrokecolor}{rgb}{0.82,.10,0.26}c_{1}}:{\color[rgb]{0.82,.10,0.26}\definecolor[named]{pgfstrokecolor}{rgb}{0.82,.10,0.26}d_{1}}] is a root of degree 2 for any :
[TABLE]
Another possibility is that there is a block:
[TABLE]
In this case, the rank-1 cone is
[TABLE]
Note that the fiber over of over has dimension 1 in each case; however, the first point has multiplicity 2. We see that the dimension of the fiber is insufficient to measure the multiplicity of the scheme , because the incidence correspondence involves the ideal . We can see this because of the structure of the rank-1 matrices: the upper minors vanish if and only if , so the fiber for \xi=[1:{\color[rgb]{0.82,.10,0.26}\definecolor[named]{pgfstrokecolor}{rgb}{0.82,.10,0.26}c_{1}}:{\color[rgb]{0.82,.10,0.26}\definecolor[named]{pgfstrokecolor}{rgb}{0.82,.10,0.26}d_{1}}] should be seen as a of degree 2. This is reflected clearly in (7.14), because \xi=[\xi_{1}:\xi_{2}:\xi_{3}]=[1:{\color[rgb]{0.82,.10,0.26}\definecolor[named]{pgfstrokecolor}{rgb}{0.82,.10,0.26}c_{1}}:{\color[rgb]{0.82,.10,0.26}\definecolor[named]{pgfstrokecolor}{rgb}{0.82,.10,0.26}d_{1}}] is a root of degree 2 for any :
[TABLE]
Finally, consider the case where both types of multiplicity occur. For example,
[TABLE]
In this case, the rank-1 cone is
[TABLE]
The scheme structure of is apparent here. The point \xi=[1:{\color[rgb]{0.82,.10,0.26}\definecolor[named]{pgfstrokecolor}{rgb}{0.82,.10,0.26}c_{1}}:{\color[rgb]{0.82,.10,0.26}\definecolor[named]{pgfstrokecolor}{rgb}{0.82,.10,0.26}d_{1}}] appears in two components, which correspond to the factorization of
[TABLE]
From the perspective of , these components correspond to the rank-1 matrices
[TABLE]
The fiber should be seen as two components, a and a . Overall, this point has multiplicity 3.
Remark 8.15*.*
For readers interested in hydrodynamic integrability criteria, take a moment to compute the secant varieties and , , in each of these cases. The secant variety is all linear combinations of points from the given variety. One can consider both the embedded secant variety within and , respectively, as well as the Grassmannian secant variety within and , respectively Note that hyperbolic systems of conservation laws have and take the non-degenerate diagonal form of the first example, over .
8(b). One-dimensional examples
Consider an involutive tableau with .
[TABLE]
Or, in endovolutive block form,
[TABLE]
The characteristic ideal will have degree and projective dimension . That is, will be a single curve.
For the sake of concreteness, let us assume that the coefficients are:
[TABLE]
so that
[TABLE]
The rank-1 ideal is just . Write a generic element of as , like so:
[TABLE]
Thus, a generic element of is of the form with fiber .
Using (7.14), the characteristic scheme of is generated by , restricted to the space , which is 1-dimensional. Write for ; so we are trying to find over as in Lemma 7.4. The space is the space spanned by . Hence, the single linear sheet of the characteristic variety over is given by .
8(c). One-dimensional exercise
Now is the time go back and re-read the example (1.4) and see how it fits into Sections 6(c) and 5(a). The wave-equation example offers a single whose fiber is also a . By choosing appropriate coefficients, you should be able to produce examples with with various other components and multiplicities.
In principle, you can choose any Cartan characters, and choose coefficients subject to Theorem 5.4 to build examples in this way. See the Sage code at https://bitbucket.org/curieux/symbol_sage, which can generate and analyze any such example (given sufficient memory).
9. Results of Guillemin and Quillen
As in the analogy Section 1(b), normal forms often reveal shortcuts to other advanced ideas.
Guillemin’s proof of Theorem 7.7 made use of two results derived from Quillen’s thesis [Qui64]. In this section, we see how these results become easier using Theorem 5.4. (Note that Theorem 5.4 and Theorem 7.7 are not equivalent. Theorem 5.4 is strictly stronger; it is easy to construct endovolutive tableaux that satisfy the conclusion of (7.7) but are not involutive. See [Smi15].)
Recall the Spencer cohomology groups from Section 4(b). For any , wedging by gives a map . This induces a map on the quotient spaces, .
Theorem 9.1** (Quillen’s Exactness Theorem).**
Suppose is an involutive tableau, and that . Then the sequence of maps by ,
[TABLE]
is exact.
In [Qui64], this theorem is proven using enormous commutative diagrams. In our context, with Theorem 5.4 in hand, we can prove an easy version of Quillen’s result, in the form of Lemma 9.3. Lemma 9.3 is a consequence of Corollary 9.2, which for us is an easy corollary of Theorem 5.4. This corollary is called Theorem A in [Gui68], where it was proved using a large diagram chase using Quillen’s exactness theorem, Theorem 9.1.
Corollary 9.2** (Quillen, Guillemin).**
Consider the subspace for a generic basis of , as in (1.17). If is involutive, then is involutive, and the natural map between prolongations is bijective.
Proof.
The first part is an immediate consequence of Theorem 5.4, as the quadratic condition still holds if the range of indices is truncated at (or greater). In particular, the generators of are preserved.
The second part is similarly immediate, using the proof of Theorem 5.4 given in [Smi15]: the contact relation for gives coordinates to the prolongation , and the independent generators are precisely those with and . Since they involve no indices , these generators remain independent when the range of indices is truncated at . ∎
Now we come to our simplified version of Theorem 9.1. Compare Lemma 9.3 to the exact sequence in [Gui68].
Lemma 9.3**.**
Recall that is a complement to , so that as in (1.17) and (1.18). For involutive, the sequence
[TABLE]
is exact.
Proof.
This proof is just an explicit description of the maps in a basis and an application of Corollary 9.2. Let be a basis for such that is a basis for and is a basis for , using the index convention (1.8) from Section 1.
The sequence makes sense because we can split the Spencer sequence (4.15) as by identifying the space with , which is the space spanned by the unshaded entries in Figure 1. Using this identification, two elements and of are equivalent in if and only if for some , the shaded entries in Figure 1. In other words, the projection is defined by (1.10), and the projection is defined by the projection onto the orange generator components in Figure 1, those with .
Since for all , the inclusion is an inclusion . Hence, the inclusion is understood as
[TABLE]
An element of is written in as
[TABLE]
The image in is
[TABLE]
so is of the form
[TABLE]
Recall that . So, in if and only if there is some such that in . Looking at (9.7), it is apparent that such must have , as has no terms. By involutivity and Corollary 9.2, we consider the involutive tableau
[TABLE]
with prolongation
[TABLE]
Therefore, lies in the kernel of , so . Therefore, Corollary 9.2 tells us . That is, if and only if .
Therefore, if and only if and on these index ranges. This occurs if and only if , meaning . ∎
We are ready to prove Theorem 7.7. The structure of the proof is identical to the original proof in [Gui68].
Proof of Theorem 7.7.
Suppose that , so that for some with for all . First, we must show that the span of the columns of lies in .
Consider the element . Because , it must be that represents the same point in . So, we can compute
[TABLE]
By Corollary 9.2, there exists such that . That is, writing , we have for all , meaning for all . Therefore, for any , we have .
Now, mapping again, , so , which is already known to be symmetric in . Therefore,
[TABLE]
This is symmetric in , giving the commutativity condition (7.7) ∎
It is interesting to see the inversion of logic that happened here. In the original literature, the overall implications are
[TABLE]
But, the arguments here give the overall implications
[TABLE]
However, we can write a shorter proof of Theorem 7.7 that relies Theorem 5.4 more directly, avoiding the general results of Quillen. For motivation, consider the following trivial corollary of Theorem 5.4 that is obtained by setting .
Corollary 9.12**.**
Suppose an involutive tableau is given in a generic, endovolutive basis as in (1.14), so that Theorem 5.4 holds. Then is an endomorphism of such that for all ,
[TABLE]
Alternate Proof of Theorem 7.7.
Fix , and suppose that . We must verify that all maps preserve and that they commute. Note that the definition of in Equation 1.24 depends on the choice of subspace but not on its basis, so we may verify these conditions using any basis we like.
First a trivial case: if it happens that , then is a rescaling, and it is immediate that .
Otherwise, we have . Then we may choose a generic basis of in which . Moreover, we may use that basis to construct an endovolutive basis of . By Corollary 9.12, it suffices to prove in this basis that is preserved by every . Write , and examine (1.22) on a column of . For each , we must verify
[TABLE]
If , then this is immediate, since .
If , then we are verifying . Note that , since . Moreover, by Theorem 5.4, we have
[TABLE]
for . Therefore, lies in . On the other hand, note that the output of lies in by the construction of the maps from the reduced symbol in Section 1(c). Combining these, we see that lies in .
Hence, the space is preserved by for all . By Corollary 9.12, they commute. ∎
On the theoretical side, it would be interesting to see how many of the hard classical theorems in the subject can be re-proven with elementary techniques. Specifically, the proof of Lemma 9.3 suggests an elementary proof of Quillen’s exactness theorem. The other hard theorem is the integrability of the characteristic variety, and a proof of that theorem using Guillemin’s original formulation is the subject of [GQS70]. That result was applied immediately to study primitive Lie pseudogroups.
10. Prolongation
How does the characteristic scheme change under prolongation? The short answer is that it does not! This does not depend on endovolutivity or involutivity.
Recall that is a tableau within . An element of is . Using any bases for , we may write as , with the additional condition that from (4.9). Let denote the rank-1 elements of , and let denote its projection to , as in Section 6(b).
Theorem 10.1**.**
If , then for some . Conversely, if , then . In particular, as schemes.
Proof.
Suppose that for some and . That is, and , so , and for all .
Let be the minimum index such that . Then , so column of is a multiple—namely —of column for all . Therefore, is rank-1, and there is some with . The converse is immediate. ∎
Remark 10.2*.*
Theorem 10.1 is used sometimes as a method for computing the characteristic variety, as follows: Given a tableau whose entries might depend on , consider as a map ; that is, a map from to . For a general point in , this map has rank at least 1. Its rank falls to 0 if and only if . But, this method is inefficient. If you have in hand and want to compute minors of something, you would save ink by computing the minors of itself to find .
11. Characteristic Sheaf
For a single endomorphism, the characteristic polynomial and the Jordan block decomposition of generalized eigenspaces together reveal all of the information that is independent of coordinates.
The ultimate conclusion of the preceding sections is that, for an abstract tableau , the characteristic sheaf knows the dimensions , , , as well as all of the dimensions and relationships among the mutual eigenspaces of the various symbol maps. The rank-1 cone knows the algebraic relationships among the sequences of eigenvalues (which we call ), and it also knows on which subspaces the symbol maps commute and on which fail to commute. In summary, and together know everything about an abstract tableau that is independent of coordinates.232323We revealed this fact using special bases, but as with traditional Jordan normal form, there is an abstract structure independent of basis that is easiest to see by building an adapted basis. Moreover, they are invariant under prolongation!
If the abstract tableau is a smooth projective bundle, then this applies to involutive Kähler-regular exterior differential systems in the smooth category.
If this formal perspective is appealing, then one might as well dispense with tableaux, symbols, Grassmann bundles, and differential ideals, and instead study the sheaf directly, with modern algebraic tools such as [Eis05]. Consider as an ideal in
[TABLE]
and consider its free resolution. The Hilbert syzygy theorem states that there is a finite free resolution that is characterized by its Hilbert polynomial . Of course, Theorem 7.11 is reading the leading term of !
One might ask how the involutivity of can be detected as an algebraic property of . The answer is tied to Castelnuovo–Mumford regularity, which measures the growth of the Hilbert polynomial. This computation is equivalent to the Cartan characters in Cartan’s test!
While it is not necessarily a useful computational tool versus differential forms or tableaux, this perspective allows a broader view of the techniques in PDE analysis, and it suggests that future progress in the field will emphasize on invariant algebraic techniques.
For more on this perspective, see [Mal03], [BCG*+*90, Chapter VIII], and the notes by Mark Green from the 2013 conference New Directions in Exterior Differential Systems in Estes Park, Colorado, which are based on the perspective in [CGG09].
Part IV Eikonal Systems
In Part III, we studied the characteristic scheme defined over . In this part, we turn our attention to the characteristic scheme as pulled back to an integral manifold . This is where the meaning of as “directions with an ambiguous initial value problem” has clear implications for the internal structure of solutions of a differential equation, as the eikonal system yields intrinsic foliations of integral manifolds .
12. General Eikonal Systems
First, let us consider the general notion of “eikonal equations” of a projective variety, without specific regard to the characteristic variety.
Consider a smooth manifold of dimension . Here are three ways to produce a smooth local hypersurface .
- (i)
The implicit function theorem says that a smooth hypersurface is defined locally by a smooth function , where for all . 2. (ii)
By the Frobenius theorem, this is equivalent to having a local smooth section of such that , for then is a rescaling of some . 3. (iii)
We can also look at the Frobenius theorem from the perspective of Cartan–Kähler theory242424Although Theorem 17 applies as stated only in the analytic category, it can be extended to the smooth category in this case. This sort of extension is explored in Section 14., as in Theorem 17. To make a local function or a local section of , consider the jet space , which is isomorphic to the bundle . Jet space is an open neighborhood (or local linearization) of equipped with local coordinates and a contact system generated by and , as in Section 3(a). In these local coordinates, set the independence condition . Any -dimensional integral manifold of the exterior differential system corresponds to a function with , so we may take . It is easy to see that this exterior differential system has no torsion and has a Kähler-regular tableau with Cartan characters . That is, integral manifolds are parametrized by 1 function of variables (hardly a surprise).
Now, consider a projective subbundle , meaning it is defined smoothly by homogeneous functions in the local fiber variables of . We want a test that tells us whether there exist hypersurfaces for which everywhere. Specifically, we want a theorem like the following.
Theorem 12.1**.**
Suppose that the eikonal system (defined below) of is involutive. Then for any smooth point , there is a smooth hypersurface such that and such that lies in the smooth locus of for all .
Because the hypersurface and the 1-form are not chosen a priori, this condition is difficult to interpret using the above formulations (i) and (ii) of hypersurfaces; however, the third formulation on is well-suited to this theorem. Consider the inclusion . (Recall that indicates the affine de-projectivization of a projective variety, resulting in a cone.) The eikonal system of is the exterior differential system on ; that is, is generated by and and has independence condition . An integral manifold of corresponds to a hypersurface in whose tangent space in annihilated by a section of .
We do not prove involutivity of in any significant case here; it is typically extremely deep and difficult, and references are provided below. However, the situation in Theorem 12.1 has several interesting consequences and interpretations.
Corollary 12.2**.**
Suppose that the eikonal system of is involutive. Let denote the projective fiber dimension of . The hypersurfaces guaranteed by Theorem 12.1 depend on functions of 1 variable.
Proof.
Fix . We work locally252525In fact, we work microlocally in the bundle. Microlocally means that we are working over a contractible neighborhood of the base space with a local trivialization of the bundle, and also within a neighborhood in the fiber. near , so we may assume is open, connected, and simply connected, and that . Because is smooth with affine fiber dimension in , we may choose local coordinates on each fiber of near such that is defined by near .
For each , let denote the lines of 1-forms specified as
[TABLE]
nonzero in the slot, in these coordinates. By Theorem 12.1, there is a local hypersurface and a corresponding local function such that . Complete to a local coordinate system on , and let be the canonical Darboux coordinates (that is, roughly corresponding to ) on the fiber of . Note that by construction, so is defined by . (Note that the open neighborhood of around may have shrunk during this process, which is why this is microlocal.)
Therefore, the contact system on is generated in a neighborhood of by , which pulls back to as
[TABLE]
The corresponding tableau is the space of matrices with entries for , so its has . ∎
This is an interesting proof, using all three perspectives of hypersurfaces. The implicit function theorem on the fiber provides local coordinates on the base by involutivity. Then, the Frobenius theorem on the base produces contact coordinates on the fiber that are compatible with the original fiber coordinates. It is easy to adapt this proof to the following corollary, which is useful for constructing coordinates in some situations, as in [Smi14].
Corollary 12.3**.**
For any , let denote its linear span, which is itself a projective subbundle of . If is involutive, then is involutive.
We will now examine several interpretations of the eikonal system that tie together various branches of geometry. Compare Sections 12(a), 12(b), and 13 to [BCG*+*90, V§3(vi)].
12(a). as Lagrangian Geometry
The term in plays little role for the eikonal system . It is there merely to make obvious the relationship between the eikonal equations and hypersurfaces.
Instead, consider the symplectic manifold with symplectic 2-form , which is expressed in local coordinates as according to Darboux’s theorem. The Lagrangian Grassmannian is the bundle over whose fiber is all the Legendrian -planes
[TABLE]
Each fiber is isomorphic to the homogeneous space , which is the variety of -planes in on which . If we consider a plane for which , then on with . Hence, the non-vertical open neighborhood of is identified with the space of symmetric matrices, .
Suppose the de-projectivized affine subvariety is defined smoothly by homogeneous functions in the local fiber variables of . From this perspective, the eikonal system is measuring the intersection of with for all .
Corollary 12.5**.**
The eikonal system is involutive if and only if there are local coordinates of near in which the non-vertical open set in is described as the symmetric matrices that vanish outside the upper-left part.
Proof.
If the eikonal system is involutive, then we may construct coordinates as in Corollary 12.2 such that the de-projectivized affine variety is defined by for all , so is defined by for all . In such coordinates, the open neighborhood of the Lagrangian Grassmannian takes the block form
[TABLE]
using our index convention (1.8) from Section 1. The condition implies , so the lower blocks are zero. The matrix is symmetric, so the upper-right block is zero.
Conversely, suppose such coordinates exist. Then satisfies the closed 1-forms , and the dimensions match, so satisfies . Since the equations defining are homogeneous, it must be . Using these coordinates for and yields , as in Corollary 12.2, which is involutive with the correct Cartan characters and gives the desired hypersurfaces in Theorem 12.1. ∎
Compare this to Proposition 3.22 in [BCG*+*90, Chapter V]. For more symplectic and Lagrangian geometry, see [Bry93].
12(b). as Poisson Brackets
If describes the state of a physical system, a function is called an observable [SW86]. The Poisson bracket of observables is the operation given in local coordinates by
[TABLE]
The Poisson bracket plays a fundamental role in Hamiltonian mechanics and the relationship between symmetries and conservation laws in physics. This is because (12.7) is a Lie bracket on . (See [Bry93] for details.)
Suppose that is some subspace of , so that is a nonempty set of smooth observables that is closed under linear combinations. Suppose also that for all . Then, is a Lie subalgebra of with respect to the Poisson bracket.
Because is a projective variety in each fiber, the de-projectivized affine subvariety is defined smoothly by observables that take the form of homogeneous functions in the local fiber variables of . For convenience, let us make the additional assumption that the homogeneous functions are algebraic of degree in , so that is defined smoothly near for by a set of equations in multi-index form
[TABLE]
Corollary 12.9**.**
Let denote the module in generated by (12.8). The eikonal system is involutive if and only if . That is, is involutive if and only if the module is a Lie algebra with respect to the Poisson bracket.
A proof—which does not depend on the polynomial form (12.8)—can be derived from Corollary 12.5 along with the observation that the Poisson bracket can be defined in a coordinate-free way as the operator such that
[TABLE]
Equations of the form (12.8) appear in analysis as systems of homogeneous first-order PDEs on of the form
[TABLE]
A famous example is the characteristic equation for the wave equation of Section 6(c):
[TABLE]
This is generalized to any involutive EDS in Section 13.
13. Involutivity of the Characteristic Variety
We would like to apply the entire discussion from Section 12 to the case where is a characteristic variety, but first we must establish that is well-defined in .
Suppose that is a connected integral manifold of an involutive exterior differential system , and that lies in , a smooth and Kähler-regular component of , as in Section 4.
Fix , and suppose and . For , we can consider the pullback . In a basis of , we can write a representative as for coefficients . As a bundle over , we have . In this sense, we can pull back the characteristic variety—as a set—to .
More precisely, recall that has degree and affine fiber dimension , but it is a scheme defined by the characteristic sheaf . For any local section of the coframe bundle , we can write the characteristic sheaf as a homogeneous ideal in the module . At each , the coframe is just a complex basis of . Therefore, we obtain a basis for of the form . That is, in some neighborhood of , the section of is well-defined. Moreover the stalks of the sheaf can be pulled back, as is well-defined for any defined in a neighborhood of . Therefore, we can pull back both the coefficients and the coordinates to define the homogeneous ideal in . Let be the scheme defined by .
Now, the entire discussion from Section 12 applies where is any particular component of . We focus our attention on the maximal smooth locus of . We know additionally that takes the polynomial form (12.8) as derived from (7.14), so it has degree and fiber dimension at smooth points, as a complex projective variety.
Theorem 13.1** (Guillemin–Quillen–Sternberg).**
Suppose that is an ordinary integral manifold of an involutive exterior differential system with character and Cartan integer . The eikonal system of the smooth locus of the (complex) characteristic variety, , is involutive. At smooth points in , the characteristic hypersurfaces are parametrized by 1 function of variables.
Note that our definition of is the complex characteristic variety.262626Recall that, in the complex case, the distinction between elliptic and hyperbolic second-order PDEs does not occur, because there is only one nondegenerate signature. Theorem 13.1 is called the “integrability of characteristics.” Cartan demonstrated several examples of this phenomenon in [Car11]. The proof appears in [GQS70], where a major step is the application of Theorem 7.7. Hence, this result appears to rely in an essential way on all three facets of the characteristic variety seen in Part III.
The converse of Theorem 13.1 is not true; it is easy to write down non-involutive exterior differential systems for which is involutive.
However, in [Gab81], Ofer Gabber proved a more general form of Theorem 13.1 that was conjectured in [GQS70] and that removes practically all of the technical assumptions. Phrased as Theorem 13.2, Gabber’s theorem recalls the ideas of Section 12(b).
Theorem 13.2** (Gabber).**
Let be a filtered ring whose graded ring is a Noetherian commutative algebra over . Let be a -ideal that is finitely generated as an -module. Then
In our context, Gabber’s theorem applies to the case where , the ring of polynomials in local fiber variables of , filtered by degree. Then, is the ring of homogeneous polynomials, graded by degree, which admits a Poisson structure like (12.7). The -ideal is the characteristic sheaf , which by (7.14) is defined by homogeneous polynomials if the original exterior differential system is involutive. By Hilbert’s Nullstellensatz, the radical ideal defines the generic component . Thus, the conclusion invokes Corollary 12.9 to say that the eikonal system is involutive.
From the general discussion of eikonal systems surrounding Theorem 12.1, the interpretation of these theorems is apparent, in the form of Corollary 13.3.
Corollary 13.3**.**
Suppose that is an ordinary integral manifold of an involutive exterior differential system with character and Cartan integer . Then admits a local—possibly complex—coordinate system such that .
In [Smi14], the linear span of the characteristic variety, is studied in comparison to the Cauchy retraction space , where is the maximum Frobenius system within , as in Section 5(b).
Suppose that the affine fiber dimension of is and that the affine fiber dimension of is . These spaces are nested, so .
Corollary 13.4**.**
Suppose that is an ordinary integral manifold of an involutive exterior differential system with character and Cartan integer . Then admits a local—possibly complex—coordinate system such that , such that , and such that .
Corollary 13.4 is a simple result, but its proof relies on building a coframe of in which the nilpotent parts of the commuting symbol maps are identified clearly; that is, it depends in an essential way on Theorems 13.1 and 5.4. The key point is that it reinforces the following remark.
Remark 13.5* (General Dogma of the Characteristic Variety).*
An exterior differential system is a geometric object over , meaning that its key properties are coordinate-invariant. On each Kähler-regular component , knowing this geometry is equivalent to knowing the characteristic scheme and rank-1 variety over , which are prolongation-invariant. Moreover, the geometry of an EDS imposes a geometry on its solutions, , and this imposition is also dictated by the characteristic scheme and rank-1 variety. Therefore, exterior differential systems can be classified up to coordinate equivalence as “parametrized families of manifolds with associated characteristic geometry.”
Remark 13.5 is not a theorem; it is an attitude.
To make this remark robust for a general exterior differential system, the scheme separating into its components —each component smooth with its own fixed Cartan characters over some subvariety of —would have to be studied, and very little progress has been made at that level of abstraction. Nonetheless, whenever some property of PDEs is encountered, Remark 13.5 urges us to ask “is this property really invariant, or an artifact of my coordinates?” which is best answered by asking “can this property be reinterpreted using the characteristic scheme?” Sections 14 and 15 discuss progress of this type.
14. Yang’s Hyperbolicity Criterion
One of the great frustrations of the Cartan–Kähler theorem is that it relies on the Cauchy–Kowalevski theorem, so it applies only in the analytic category. One can see its dramatic failure in the smooth category in [Lew57]. However, this frustration has been escaped in some special cases by exploiting the structure272727If we take the broadest possible interpretation of Remark 13.5 to heart, then any possible escape from analyticity ought to arise from the structure of . However, the reader is cautioned again that a dogma is not a theorem. of . For example
- (i)
ODE systems. Suppose that is involutive over and that . Then , so the tableau is the trivial (irrelevant) subspace of . The prolonged system on is Frobenius, and is merely a copy of whose fiber is the unique element of an integrable distribution. That integrable distribution is the Cauchy retraction space of as in Section 5(b), so it must have been that . The flow-box theorem foliates by solutions in the smooth category. (Actually, in the Lipschitz category, by standard ODE theory!) If is a leaf of this foliation, then removing Cauchy retractions on the original exterior differential system yields the exterior differential system . 2. (ii)
Empty systems. Suppose that is involutive over and that with . Then, the tableau is the total space . Therefore, is an open domain in , so , and there is no condition whatsoever282828The most extreme and amusing exploitations of the flexibility of come from the homotopy principle [Gro86, EM02]. on integral manifolds ; however, the prolongation would have to satisfy the contact ideal, forcing some regularity on . We studied this EDS in Section 2.
A less trivial special case is presented in [Yan87], which is the subject of this section.292929As it happens, the attempt to understand [Yan87] in the context of [BCG*+*90, Chapter VIII] was the inspiration for computing the details shown in [Smi15] and the entire approach of these notes.
A tableau is called determined if and . That is, , so , and . Cartan’s test shows that a determined tableau is always involutive, so we may assume that is written in endovolutive form as in Theorem 5.4. The only nontrivial symbol endomorphisms in (1.20) are and for , like this:
[TABLE]
The quadratic involutivity condition is trivial, which is why Cartan’s test passes automatically.
Lemma 14.2**.**
Suppose is determined and written in endovolutive bases. Identify with , and use our endovolutive basis of for both. Then for any , the symbol map from Section 6(b) is
[TABLE]
Then
[TABLE]
and the characteristic ideal is generated by
[TABLE]
In particular, if and only if is an eigenvalue of .
Proof.
The first two equations are immediate from our block form. From Part III, we know that if and only if for all . Therefore, we compute in our endovolutive basis
[TABLE]
That is, . ∎
Corollary 14.7**.**
Consider a determined tableau as in Lemma 14.2. Fix an integral element . Suppose that is a real hyperplane in such that and . Then is an isomorphism.
Proof.
By Lemma 14.2, we have if and only if . ∎
Definition 14.8**.**
Suppose is a real hyperplane in corresponding to the real covector . The real hyperplane is called space-like if the following conditions hold.
- (i)
. 2. (ii)
For any , there is a real basis of in which is real and diagonal. 3. (iii)
The above choice of basis is a smooth function of .
A determined tableau is called determined hyperbolic if admits a (real) space-like hyperplane.
Here is a simple example using our notation from Lemma 14.2. Fix . To meet the first condition, suppose that is not in . Then , and [math] is not an eigenvalue of , which of course implies that is invertible. Say , so that . The second condition is that is diagonalizable using some change-of-basis . The third condition is that is continuous in the projective variable . Suppose moreover that we take our basis such that the basis-change at is . Then we have the condition that is a diagonal matrix, . This puts restrictions on the possible forms of these matrices. For example, and .
Definition 14.9**.**
A tableau is called hyperbolic if admits a flag given by a basis of such that each of the sequential initial value problems from to has a hyperbolic determined tableau.
Theorem 14.10** (Yang).**
Theorem 17 applies in the smooth category, if is hyperbolic.
The proof proceeds by replacing the Cauchy–Kowalevski initial-value problem with the Cauchy initial-value problem for determined first-order quasilinear hyperbolic PDEs. See [Yan87] and Appendix A of [Kam89] for more details.
Clearly the definition of hyperbolic depends on the geometry of and the symbol maps ; however, to the author’s knowledge no one has succeeded in writing down the explicit criteria on or or for general hyperbolicity. Hence, Yang’s condition is not yet available to computer algebra systems. If that can be accomplished, it means we can identify a subvariety of the moduli of involutive tableaux—as in Section 5(a)—that admit solutions in the smooth category.
One well-understood special case is when , so contains real points (with multiplicity). If the number of distinct points is sufficiently large (greater than ), then this is the situation for hyperbolic systems of conservation laws, as in [Tsa91]. The eikonal system is rigid, so each solution is foliated by characteristic hypersurfaces. Multiplicity corresponds to nilpotent pieces of the generalized eigenspaces of the symbol endomorphisms . See again Section 8.
15. Open Problems and Future Directions
Our perspective here has been simple-minded—focusing on matrices and their computable properties—to gain intuition of and as rapidly as possible. The articles [Smi15] and [Smi14] are founded on this perspective, but reveal additional detail in the structures discussed here. For more modern and sophisticated treatment, please see [Mal03], [KL07], and [CGG09]. Additionally, Chapters V–VIII of [BCG*+*90] contain significantly more results than we have summarized here.
To conclude, here are some interesting questions which—to the author’s present knowledge—are open subjects that represent the major theoretical gaps in the subject of exterior differential systems. They are worth serious consideration as research projects, and offer great opportunities for collaboration between analysts, differential geometers, algebraic geometers, and scientific programmers.
- (i)
Variety of involutive tableaux. For given , , and Cartan characters , what is the variety of involutive tableaux (with fixed coefficients)? Can we compute its dimension or degree or Hilbert polynomial? Section 5(a) demonstrates a first step toward understanding the variety of involutive tableaux, as Theorem 5.4 gives the ideal in certain bases. However, to answer the question completely, one would need to examine how the coefficients in (5.6) vary under arbitrary changes of basis in and . 2. (ii)
Special hyperbolic integrability criteria. Solution techniques (such as Lax pairs, inverse scattering, hydrodynamic reduction, and Bäcklund transformations) play a key rôle in the analysis of wave-like PDEs, especially those coming from physics and geometry. Given that these techniques are coordinate-invariant, Remark 13.5 suggests that they should all be expressible as algebraic conditions on . Expressing those conditions in an abstract way over and would allow more systematic geometric approach to many of the ad hoc methods in the analysis of PDEs.303030Indeed, the central theme of the conference for which these notes were prepared was to express Ferapontov’s notion of hydrodynamic integrability in terms of algebro-geometric structures in the Lagrangian Grassmannian. The notion of hydrodynamic integrability is tied completely to the secant variety of . 3. (iii)
Elliptic systems. Consider the classical results regarding elliptic regularity of quasilinear elliptic operators. This is another form of “special integrability criteria.” How far can the notion of elliptic regularity be extended to general exterior differential systems? Certainly the conditions of involutivity, , and are necessary, and one can directly translate the classical theorems to an EDS written specifically to describe a quasilinear second-order elliptic operator in local coordinates, but what other technical assumptions can be dropped? Some discussion appears in [BCG*+*90, Chapter X§3]. 4. (iv)
Moduli of involutive tableaux. Refining the first problem in light of the second and third problems, can we identify invariant sub-varieties of the variety of involutive tableaux? Dogma 13.5 indicates that we should be able to identify subvarieties, such as hyperbolic tableaux, elliptic tableaux, systems satisfying special integrability conditions, and so on. What does it mean when these sub-varieties intersect? Lewy showed that there are involutive PDEs with no solution in the smooth category [Lew57], which cannot happen in the analytic category. Where do the Lewy examples fall in this variety? Are there other subvarieties that have not been observed in classical equations? If there is any organizing geometry behind the “nearly impenetrable jungle” of involutive PDEs, this is where we should look. 5. (v)
Weakness of involutivity of characteristics. Note that Theorem 13.2 does not regard the involutivity of an exterior differential system in any direct way; the assumption of involutivity of enters Theorem 13.2 only because we know that is an ideal of homogeneous polynomials from (7.14). Thus, we expect that the condition “ is the characteristic scheme of an exterior differential system , and is involutive” is much weaker than “ is the characteristic scheme of an exterior differential system , and is involutive.” The gap between these two statements is extremely important to explore, as it goes to the heart of the question about how involutivity leads to solutions of the initial-value problem for a system of PDEs. To put this a different way, can we construct an embedded variety that is involutive, but for which there is no involutive exterior differential system for which is the characteristic variety? 6. (vi)
Global integrability of the characteristic variety. If is involutive, then the system is involutive on an ordinary integral manifold, . However, it is not clear whether is involutive as a bundle over itself in any reasonable way that considers all simultaneously. That is, consider the EDS on generated by for some section of . Under what circumstances is this involutive? Can Gabber’s theorem 13.2 be adopted to this case? This has theoretical implications for special integrability conditions (above), because it would allow one to count special solutions among all solutions from directly. Additionally, given its algebraic nature, can Gabber’s theorem provide solutions for certain types of PDEs with low regularity, bypassing the Lewy examples with various additional conditions? 7. (vii)
Prolongation theorems. Does prolongation always uncover solutions of an exterior differential system, if we remove the regularity assumptions on and consider the many components of the scheme ? As experts are well aware, this is has been the key open question in the subject for most of a century. (See [BCG*+*90, Chapter VI].) In the context of this monograph, the question is related to whether the block form of involutive tableau (1.20) and the involutivity conditions of Theorem 5.4 can be extended from to non-smooth points in ? Because of the interaction of Guillemin normal form and involutivity with Spencer cohomology as in Section 9, such an extension of the endovolutive block form could be helpful in an effort to construct (or prove the non-existence of) counterexamples. 8. (viii)
Representation theory of Lie pseudogroups. Lie pseudogroups are subgroups of the diffeomorphism pseudogroup whose trajectories are the solutions of involutive PDEs. See [Olv09]. Just as Jordan form (in the guise of the Levi decomposition) is the key first step toward understanding the representation of Lie groups, it is reasonable to expect that the endovolutive block-form (1.20) and Theorem 5.4 can serve as the foundation of a representation theory of Lie pseudogroups. Any results regarding the “moduli of involutive tableaux” can be applied to Lie pseudogroups with those tableaux. Indeed, the first application of Theorem 13.1 was the classification of the primitive Lie pseudogroups [GQS66].
Acknowledgments
Thanks to Robert L. Bryant, Niky Kamran, and Deane Yang for always humoring my interest in these details, to Ian Morrison for helping me appreciate the role of incidence correspondences in algebraic geometry, and to Giovanni Moreno for arranging this course and for encouraging me repeatedly to complete these notes. The punny photograph in Figure 5 is public domain from the U.S. Fish and Wildlife Service.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Art 91] M. Artin , Algebra , Pearson, 1991.
- 2[Bry 93] R. L. Bryant , An Introduction to Lie Groups and Symplectic Geometry, (lecture notes) (1993).
- 3[BCG + 90] R. L. Bryant , S.-S. Chern , R. B. Gardner , H. L. Goldschmidt , and P. A. Griffiths , Exterior Differential Systems , 1 ed., 18 , Springer-Verlag, 1990. Available at http://library.msri.org/books/Book 18/MSRI-v 18-Bryant-Chern-et-al.pdf .
- 4[CGG 09] J. Carlson , M. Green , and P. Griffiths , Variations of Hodge Structure Considered as an Exterior Differential System: Old and New Results, Symmetry, Integrability and Geometry: Methods and Applications (2009). Available at http://www.emis.de/journals/SIGMA/2009/087/ .
- 5[Car 11] É. Cartan , Sur les systèmes en involution d’équations aux dérivées partielles du second ordre à une fonction invonnue de trois variables indépendantes, Bulletin de la Société Mathématique de France 39 (1911), 352–443. Available at http://www.numdam.org/item?id=BSMF_1911__39__352_1 .
- 6[Cle 17] J. N. Clelland , From Frenet to Cartan: The Method of Moving Frames , American Mathematical Society, Graduate Texts in Mathematics, 2017.
- 7[Eis 05] D. Eisenbud , The geometry of syzygies , Graduate Texts in Mathematics 229 , Springer-Verlag, New York, 2005.
- 8[EM 02] Y. Eliashberg and N. Mishachev , Introduction to the h ℎ h -Principle , Graduate Studies in Mathematics 48 , American Mathematical Society, Providence, Rhode Island, jun 2002. Available at http://www.ams.org/gsm/048 .
