Involutive Tableaux, Characteristic Varieties, and Rank-one Varieties in the Geometric Study of PDEs

Abraham D. Smith

arXiv:1701.04930·math.DG·February 7, 2018

TL;DR

This paper explores advanced geometric concepts in PDEs, focusing on involutive tableaux, characteristic varieties, and rank-one varieties, providing a unified approach to key theorems in the field.

Contribution

It introduces a new geometric interpretation of PDEs using Grassmann bundles and enhances Guillemin normal form to analyze involutivity of tableaux.

Findings

01. Characterization of characteristic varieties and their incidence correspondence.

02. Development of an enhanced Guillemin normal form for involutivity.

03. Reinterpretation of PDE geometry through smooth sub-bundles of Grassmann bundles.

Abstract

This expository monograph cuts a short path from the common, elementary background in geometry (linear algebra, vector bundles, and algebraic ideals) to the most advanced theorems about involutive exterior differential systems: (1) The incidence correspondence of the characteristic variety, (2) Guillemin normal form and Quillen's thesis, (3) The Integrability of Characteristics by Guillemin, Quillen, Sternberg, and Gabber, and (4) Yang's Hyperbolicity Criterion. To do so, the geometric theory of PDEs is reinterpreted as the study of smooth sub-bundles of the Grassmann bundle, whereby the rank-1 variety is emphasized. The primary computational tool is an enhanced formulation of Guillemin normal form that is equivalent to involutivity of tableaux.

Equations (274)

$$0 \to A \to W \otimes V^{*} \overset{\sigma}{\to} H^{1}(A) \to 0,$$

$$\left\{ \begin{pmatrix} \alpha_{0} & \alpha_{1} & \alpha_{2} \\ \alpha_{1} & \alpha_{2} & \alpha_{3} \\ \alpha_{2} & \alpha_{3} & \alpha_{4} \end{pmatrix} : \alpha_{i} \in \mathbb{R} \right\}.$$

$$\begin{aligned} 0 &= \pi^{2}_{3} - \pi^{3}_{2}, \\ 0 &= \pi^{1}_{3} - \pi^{3}_{1}, \\ 0 &= \pi^{2}_{2} - \pi^{3}_{1}, \\ 0 &= \pi^{1}_{2} - \pi^{2}_{1}. \end{aligned}$$

$$\begin{pmatrix} \kappa^{4} & \kappa^{3}\tau & \kappa^{2}\tau^{2} \\ \kappa^{3}\tau & \kappa^{2}\tau^{2} & \kappa\tau^{3} \\ \kappa^{2}\tau^{2} & \kappa\tau^{3} & \tau^{4} \end{pmatrix} = \begin{pmatrix} \kappa^{2} \\ \kappa\tau \\ \tau^{2} \end{pmatrix} \otimes \begin{pmatrix} \kappa^{2} & \kappa\tau & \tau^{2} \end{pmatrix},$$

$$[\kappa^{4} : \kappa^{3}\tau : \kappa^{2}\tau^{2} : \kappa\tau^{3} : \tau^{4}] \cong \mathbb{P}^{1} \subset \mathbb{P}^{4} \cong \mathbb{P}A.$$

$$[\kappa^{2} : \kappa\tau : \tau^{2}] \cong \mathbb{P}^{1} \subset \mathbb{P}^{2} \cong \mathbb{P}V^{*}.$$

$$s = s_{1} + s_{2} + \dots + s_{\ell} + s_{\ell+1} + \dots + s_{n} = s_{1} + s_{2} + \dots + s_{\ell} + 0 + \dots + 0.$$

$$\begin{aligned} i, j &\in \{1, \dots, n\}, \\ \lambda, \mu &\in \{1, \dots, \ell\}, \\ \varrho, \varsigma &\in \{\ell+1, \dots, n\}, \text{ and} \\ a, b &\in \{1, \dots, r\}. \end{aligned}$$

$$\pi = \pi^{a}_{i} (z_{a} \otimes u^{i}) \in W \otimes V^{*},$$

$$\Big\{ 0 = \pi^{a}_{i} - B^{a,\lambda}_{i,b}\,\pi^{b}_{\lambda} ~:~ 1 \leq i \leq n,\ s_{i} < a \leq r \Big\}.$$

$$\left\{ \begin{pmatrix} \alpha_{2} & \alpha_{4} & \alpha_{3} \\ \alpha_{1} & \alpha_{3} & \alpha_{2} \\ \alpha_{0} & \alpha_{2} & \alpha_{1} \end{pmatrix} \right\} = \left\{ \begin{pmatrix} \pi^{1}_{1} & \pi^{1}_{2} & \pi^{2}_{2} \\ \pi^{2}_{1} & \pi^{2}_{2} & \pi^{1}_{1} \\ \pi^{3}_{1} & \pi^{1}_{1} & \pi^{2}_{1} \end{pmatrix} \right\}.$$

$$\begin{aligned} 0 &= \pi^{3}_{2} - 1\,\pi^{1}_{1} - 0\,\pi^{2}_{1} - 0\,\pi^{3}_{1} - 0\,\pi^{1}_{2} - 0\,\pi^{2}_{2}, \\ 0 &= \pi^{1}_{3} - 0\,\pi^{1}_{1} - 0\,\pi^{2}_{1} - 0\,\pi^{3}_{1} - 0\,\pi^{1}_{2} - 1\,\pi^{2}_{2}, \\ 0 &= \pi^{2}_{3} - 1\,\pi^{1}_{1} - 0\,\pi^{2}_{1} - 0\,\pi^{3}_{1} - 0\,\pi^{1}_{2} - 0\,\pi^{2}_{2}, \\ 0 &= \pi^{3}_{3} - 0\,\pi^{1}_{1} - 1\,\pi^{2}_{1} - 0\,\pi^{3}_{1} - 0\,\pi^{1}_{2} - 0\,\pi^{2}_{2}. \end{aligned}$$

$$\operatorname{B} \in V^{*} \otimes V \otimes W \otimes W^{*} \cong \operatorname{End}(V^{*}) \otimes \operatorname{End}(W)$$

$$\sum_{a \leq s_{i}} \delta^{\lambda}_{i} \delta^{a}_{b}\,(z_{a} \otimes z^{b}) \otimes (u^{i} \otimes u_{\lambda}) + \sum_{a > s_{i}} B^{a,\lambda}_{i,b}\,(z_{a} \otimes z^{b}) \otimes (u^{i} \otimes u_{\lambda}).$$

$$\operatorname{B}^{1}_{1} = \begin{pmatrix} 1&0&0 \\ 0&1&0 \\ 0&0&1 \end{pmatrix},\quad \operatorname{B}^{1}_{2} = \begin{pmatrix} 0&0&0 \\ 0&0&0 \\ 1&0&0 \end{pmatrix},\quad \operatorname{B}^{2}_{2} = \begin{pmatrix} 1&0&0 \\ 0&1&0 \\ 0&0&0 \end{pmatrix},\quad \operatorname{B}^{1}_{3} = \begin{pmatrix} 0&0&0 \\ 1&0&0 \\ 0&1&0 \end{pmatrix},\quad \operatorname{B}^{2}_{3} = \begin{pmatrix} 0&1&0 \\ 0&0&0 \\ 0&0&0 \end{pmatrix}.$$

$$\operatorname{B}(\varphi)(v) = \begin{pmatrix} \varphi_{1}v^{1} + \varphi_{2}v^{2} & \varphi_{2}v^{3} & 0 \\ \varphi_{1}v^{3} & \varphi_{1}v^{1} + \varphi_{2}v^{2} & 0 \\ \varphi_{1}v^{2} & \varphi_{1}v^{3} & \varphi_{1}v^{1} \end{pmatrix}.$$

$$\begin{aligned} V &= \langle u_{1}, \dots, u_{n} \rangle = \langle u_{i} \rangle, \\ U &= \langle u_{1}, \dots, u_{\ell} \rangle = \langle u_{\lambda} \rangle, \\ Y &= \langle u_{\ell+1}, \dots, u_{n} \rangle = \langle u_{\varrho} \rangle, \end{aligned}$$

$$\begin{aligned} V^{*} &= \langle u^{1}, \dots, u^{n} \rangle = \langle u^{i} \rangle, \\ U^{*} \cong Y^{\perp} &= \langle u^{1}, \dots, u^{\ell} \rangle = \langle u^{\lambda} \rangle, \\ Y^{*} \cong U^{\perp} &= \langle u^{\ell+1}, \dots, u^{n} \rangle = \langle u^{\varrho} \rangle. \end{aligned}$$

$$\begin{aligned} W &= \langle z_{1}, \dots, z_{r} \rangle = \langle z_{a} \rangle, \\ \operatorname{\mathbf{W}}^{-}_{i} &= \langle z_{1}, \dots, z_{s_{i}} \rangle, \\ \operatorname{\mathbf{W}}^{+}_{i} &= \langle z_{s_{i}+1}, \dots, z_{r} \rangle \end{aligned}$$

$$\begin{pmatrix} \operatorname{B}^{1}_{1} & \operatorname{B}^{1}_{2} & \operatorname{B}^{1}_{3} & \operatorname{B}^{1}_{4} & \cdots & \operatorname{B}^{1}_{\ell} & \cdots & \operatorname{B}^{1}_{n} \\ 0 & \operatorname{B}^{2}_{2} & \operatorname{B}^{2}_{3} & \operatorname{B}^{2}_{4} & \cdots & \operatorname{B}^{2}_{\ell} & \cdots & \operatorname{B}^{2}_{n} \\ 0 & 0 & \operatorname{B}^{3}_{3} & \operatorname{B}^{3}_{4} & \cdots & \operatorname{B}^{3}_{\ell} & \cdots & \operatorname{B}^{3}_{n} \\ 0 & 0 & 0 & \operatorname{B}^{4}_{4} & \cdots & \operatorname{B}^{4}_{\ell} & \cdots & \operatorname{B}^{4}_{n} \\ & & & & \ddots & & \operatorname{B}^{\lambda}_{i} & \vdots \\ 0 & 0 & 0 & 0 & 0 & \operatorname{B}^{\ell}_{\ell} & \cdots & \operatorname{B}^{\ell}_{n} \end{pmatrix}.$$

$$\operatorname{\mathbf{W}}^{1}(u^{\lambda}) = \{ w \in \operatorname{\mathbf{W}}^{-}_{\lambda} : \operatorname{B}^{\lambda}_{\mu} w = \delta^{\lambda}_{\mu} w,\ \forall \mu \leq \ell \}.$$

$$\operatorname{\mathbf{W}}^{1}(\varphi) = \Big\{ w \in \operatorname{\mathbf{W}}^{-}(\varphi) : \Big( \sum_{\lambda} \varphi_{\lambda} \operatorname{B}^{\lambda}_{\mu} - \varphi_{\mu} I \Big) w = 0,\ \forall \mu \leq \ell \Big\}.$$

$$\operatorname{\mathbf{W}}^{1}(\varphi) = \{ w \in \operatorname{\mathbf{W}}^{-}(\varphi) : \operatorname{B}(\varphi)(u_{\mu})\, w = \varphi_{\mu} w,\ \forall \mu \leq \ell \}.$$

$$\operatorname{\mathbf{W}}^{1}(\varphi) = \{ w \in \operatorname{\mathbf{W}}^{-}(\varphi) : w \otimes \varphi + J^{a}_{\varrho} (z_{a} \otimes u^{\varrho}) \in A_{e},\ \exists J \in W \otimes U^{\perp} \}.$$

$$\begin{pmatrix} \tilde{u}_{1} & \cdots & \tilde{u}_{n} \end{pmatrix} = \begin{pmatrix} u_{1} & \cdots & u_{n} & z_{1} & \cdots & z_{r} \end{pmatrix} \begin{pmatrix} 1 & \cdots & 0 \\ \vdots & \ddots & \vdots \\ 0 & \cdots & 1 \\ K^{1}_{1} & \cdots & K^{1}_{n} \\ \vdots & \ddots & \vdots \\ K^{r}_{1} & \cdots & K^{r}_{n} \end{pmatrix}.$$

$$\tilde{u}_{i} = u_{i} + z_{a} K^{a}_{i} = u_{j} \delta^{j}_{i} + z_{a} K^{a}_{i}.$$

$$0 \to e \to X \to X/e \to 0, \qquad 0 \to e^{\perp} \to X^{*} \to e^{*} \to 0.$$

$$K = z_{a} \otimes K^{a}_{i}(\tilde{e}) = z_{a} \otimes \theta^{a}(\tilde{u}_{i}) \in (X/e) \otimes e^{*}$$

$$\tilde{e} = \langle \tilde{u}_{i} \rangle = \langle u_{i} + K(u_{i}) \rangle = \langle v + K(v) : v \in e \rangle$$

$$\arctan_{e} : (X/e) \otimes e^{*} \to \operatorname{Gr}_{n}(X).$$

Full text

Involutive Tableaux, Characteristic Varieties, and Rank-one Varieties in the Geometric Study of PDEs

Abraham D. Smith

Department of Mathematics, Statistics and Computer Science

University of Wisconsin-Stout

Menomonie, Wisconsin 54751-2506, USA

Key words and phrases:

characteristic variety, Guillemin normal form, eikonal system

2010 Mathematics Subject Classification:

Primary 58A15, Secondary 35A27, 35A30

0 Introduction and Overview
I Matrices and Subspaces
1 Tableaux and Symbols
1(a) Rank-one ideal
1(b) Generic Bases
1(c) Endovolutive Tableaux
1(d) Mutual Eigenvectors and Rank
2 Grassmann and Universal Bundles
2(a) Tangent and Arctangent
2(b) Polar pairs
2(c) The Tautological Bundle
II PDEs on Manifolds
3 Bundles upon Bundles
3(a) The Contact Ideal
3(b) Immersions and Frame Bundles
4 Exterior Differential Systems
4(a) Differential Ideals and Integral Elements
4(b) Prolongation and Spencer Cohomology
5 Involutivity of Exterior Differential Systems
5(a) Moduli of Involutive Tableaux
5(b) Cauchy retractions
III Characteristic and Rank-one Varieties
6 The Characteristic Variety
6(a) via Polar Extension
6(b) via Rank-one Incidence
6(c) Example: The Wave Equation
7 Guillemin Normal Form and Eigenvalues
8 Examples
8(a) Zero-dimensional examples
8(b) One-dimensional examples
8(c) One-dimensional exercise
9 Results of Guillemin and Quillen
10 Prolongation
11 Characteristic Sheaf
IV Eikonal Systems
12 General Eikonal Systems
12(a) as Lagrangian Geometry
12(b) as Poisson Brackets
13 Involutivity of the Characteristic Variety
14 Yang’s Hyperbolicity Criterion
15 Open Problems and Future Directions

0. Introduction and Overview

Given a system of partial differential equations [PDEs] over a manifold, does the system of PDEs have any local solutions to the Cauchy initial-value problem? That is, given initial conditions on a locally-defined hypersurface, can we produce a local solution that satisfies those initial conditions and also satisfies the PDEs? More generally, which initial hypersurfaces admit such solutions? Can we do this iteratively by solving a sequence of initial-value problems from dimension 0 to 1, 1 to 2, and so on to build solutions through any point?

These questions are the heart of exterior differential systems [EDS], a powerful specialist approach to the geometric study of PDEs. EDS typically present as ideals of exterior differential forms over a manifold.

Some deeper questions are: What is the shape of the family of local solutions obtained in this way? How can we determine whether two systems of PDEs are “the same” up to local coordinate transformations? Does the space of all PDEs (up to local coordinate transformation) have any meaningful shape or structure of its own?

These deeper questions are answered by analyzing the characteristic variety of an EDS. The original motivation for the characteristic variety is to see where the Cauchy initial-value problem becomes ambiguous. That is, given an initial condition for our PDEs on a local submanifold of dimension $n{-}1$ , when would the $n$ -dimensional solutions for that initial condition fail to be unique?

When analyzing the characteristic variety of various EDS, one discovers that the characteristic variety is an exquisitely subtle structure that reveals far more than originally anticipated. The characteristic variety dictates the internal geometry of the solutions of the original PDEs, while also controlling the parameter space of all such solutions. Under reasonable hypotheses, this means that EDS or PDEs can be understood up to coordinate equivalence as “parametrized families of solution manifolds with associated characteristic geometry.”

This is beautiful and important, but it has been a difficult topic for researchers to access, because the foundations of EDS have not yet entered the common curriculum of graduate students. Fluency with differential ideals remains a relatively rare skill, practiced in a handful of schools worldwide. Indeed, it is common for researchers first encountering the subject to become trapped in an endless cycle of translating systems from local jet coordinates to differential forms and back again, without gaining any new geometric insights and without using the most powerful theoretical ideas in EDS. In particular, it can take many years for researchers to appreciate the central role that the characteristic variety plays in uncovering geometric insights. However—despite the name—differential forms are not themselves the core idea behind exterior differential systems. Differential forms are merely a concise language. Rather, the core idea is to recognize that these questions are more geometric than analytic, and that ideals (that is, conditions defined by functions) and varieties (that is, shapes cut out by functions) must come into play. To describe families of solutions, we need the geometric language of bundles, schemes, and moduli.

Therefore, the goal of this monograph is to cut the shortest-possible expository path from the common curriculum in geometry (linear algebra, vector bundles, and algebraic ideals) to several key results regarding the characteristic variety. These key results are

(i) the incidence correspondence of the characteristic and rank-1 varieties, and its relationship to eigenspace decomposition,

(ii) Guillemin normal form, its enhancements, and Quillen’s thesis,

(iii) the Integrability of Characteristics (Guillemin, Quillen, Sternberg, Gabber), and

(iv) Yang’s Hyperbolicity Criterion.

The required common curriculum is

(i) graduate-level linear algebra (short-exact sequences, dual spaces, the rank-nullity theorem, tensor products, generalized eigenspaces, as in Artin’s Algebra [Art91]),

(ii) the fundamentals of smooth manifolds (tangent spaces, Sard’s theorem, bundles, as in Milnor’s Topology from the Differential Viewpoint [Mil97]), and

(iii) basic algebraic geometry (projective space, ideal, variety, scheme, as in Harris’ Algebraic Geometry, a first course [Har92]).

To accomplish this, the subject of exterior differential systems is reinterpreted as the study of smooth sub-bundles of the Grassmann bundle over a smooth manifold. In doing so, the role of exterior differential forms becomes obscured, in favor of tableaux (vector spaces of homomorphisms) and symbols (varieties of endomorphisms). Specifically, Guillemin normal form for tableaux and symbols plays the central computational role, not differential forms. This is because most humans—and their computer algebra systems—are more comfortable with matrices than with exterior algebra. Exterior differential ideals are not introduced until absolutely needed. This is because many of the essential lemmas depend only on the geometry of the Grassmann bundle, which is the variety of the trivial exterior differential system. This reformulation allows elementary versions of those key results (in fact, almost all the lemmas are restatements of the rank-nullity theorem), and it becomes possible to outline how these results could be used to push the frontiers of the subject.

While the audience is assumed to have a general interest in, and cultural awareness of, PDEs or EDS in some form, all the required definitions are provided when needed. Even so, it is wise always to have Bryant, Chern, Gardner, Goldschmidt, and Griffiths’s Exterior Differential Systems [BCG+90] and Ivey and Landsberg’s Cartan for Beginners [IL03] nearby. They are cited for comparison frequently. Another excellent reference is McKay’s Introduction to Exterior Differential Systems [McK18]. A note for EDS experts: the results in these pages can be found in numerous places in the literature in some form or other, and I have indicated my favorite sources throughout. The only innovation here is in presentation. Most notably, in contrast to the vast majority of expositions, the central topic is the $C^{\infty}$ characteristic variety, not the $C^{\omega}$ Cartan–Kähler theorem. This is because the key open question is “what does the family of all involutive PDEs look like?” not “how do I solve this particular involutive PDE?”

This monograph is organized into four parts, each containing several sections. Part I covers the structure of tableaux (subspaces of a space of homomorphisms) and the differential geometry of the Grassmann variety. Because the results are elementary (almost trivially so), they provide a good foundation for building from the common curriculum to the central topic. Part II converts those elementary results to the language of bundles, PDEs and EDS. That language allows an enhanced version of Guillemin normal form that is equivalent to involutivity. Part III achieves the key purpose of this monograph, as a triumvirate is formed by the characteristic scheme, the rank-1 cone, and the mutual eigenvector problem for symbols. Part IV examines the integrability of the characteristic variety in various guises, and offers a general dogma (that the characteristic scheme knows all coordinate-invariant data about a system of PDEs) that suggests possible future developments in the theory of EDS.

This monograph was developed to support a series of lectures at the Institute of Mathematics at the Polish Academy of Sciences in September 2016, as part of a Workshop on the Geometry of Lagrangian Grassmannians and Nonlinear PDEs.

Part I Matrices and Subspaces

1. Tableaux and Symbols

Tableaux are very simple objects; every undergraduate encounters the example “ $r\times n$ matrices form a vector space using the usual matrix operations,” and a tableau is any subspace of that vector space.

Given vector$^{1}$ spaces $W$ and $V$ with $\dim W=r$ and $\dim V=n$, a tableau is a linear subspace $A\subset\operatorname{Hom}(V,W)$. We use the notation $W\otimes V^{*}$ and $\operatorname{Hom}(V,W)$ interchangeably.

$^{1}$ When it becomes appropriate to do so, at (4.5) in Section 4, we switch from vector spaces to complex projective spaces for algebraic convenience.

Being a subspace, any tableau $A$ is the kernel of some linear map $\sigma$ , called the symbol, whose range is written as $H^{1}(A)$ . We have a short exact sequence of spaces:

$$0 \to A \to W \otimes V^{*} \overset{\sigma}{\to} H^{1}(A) \to 0, \tag{1.1}$$

where $H^{1}(A)$ is just notation for $(W\otimes V^{*})/A$ . Let $\dim A=s$ and $\dim H^{1}(A)=t=nr-s$ .

For example, let $W=\mathbb{R}^{3}$ and $V=\mathbb{R}^{3}$ , and consider the 5-dimensional tableau $A\subset W\otimes V^{*}$ described in the standard bases by

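$$\left\{ \begin{pmatrix} \alpha_{0} & \alpha_{1} & \alpha_{2} \\ \alpha_{1} & \alpha_{2} & \alpha_{3} \\ \alpha_{2} & \alpha_{3} & \alpha_{4} \end{pmatrix} : \alpha_{i} \in \mathbb{R} \right\}. \tag{1.2}$$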

If $\pi\in W\otimes V^{*}$ is a $3\times 3$ matrix with entries $\pi^{a}_{i}$ , then the symbol $\sigma$ defining $A$ consists of four conditions:

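$$\begin{aligned} 0 &= \pi^{2}_{3} - \pi^{3}_{2}, \\ 0 &= \pi^{1}_{3} - \pi^{3}_{1}, \\ 0 &= \pi^{2}_{2} - \pi^{3}_{1}, \\ 0 &= \pi^{1}_{2} - \pi^{2}_{1}. \end{aligned} \tag{1.3}$$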
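As a sanity check on the dimension count $t=nr-s$, the four symbol conditions of this example can be encoded as a $4\times 9$ matrix acting on the flattened entries $\pi^{a}_{i}$. The following Python sketch uses only the standard library. The helper names `rank`, `idx`, `hankel`, and `satisfies_symbol` are illustrative assumptions, not notation from the text. It checks that the four conditions are independent, so that $t=4$ and $s=5$, and that each basis matrix of the example tableau lies in the kernel of $\sigma$.

```python
from fractions import Fraction

def rank(rows):
    """Rank of a matrix (given as a list of rows) by exact Gaussian elimination."""
    m = [[Fraction(x) for x in row] for row in rows]
    rk = 0
    for c in range(len(m[0])):
        pivot = next((k for k in range(rk, len(m)) if m[k][c] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for k in range(len(m)):
            if k != rk and m[k][c] != 0:
                f = m[k][c] / m[rk][c]
                m[k] = [x - f * y for x, y in zip(m[k], m[rk])]
        rk += 1
    return rk

n = r = 3  # dim V and dim W in the example

def idx(a, i):
    """Position of the entry pi^a_i (row a, column i, 1-based) in a row-major flattening."""
    return (a - 1) * n + (i - 1)

# Each tuple (a, i, b, j) encodes one symbol condition 0 = pi^a_i - pi^b_j.
conditions = [(2, 3, 3, 2), (1, 3, 3, 1), (2, 2, 3, 1), (1, 2, 2, 1)]

sigma = []
for a, i, b, j in conditions:
    row = [0] * (n * r)
    row[idx(a, i)] = 1
    row[idx(b, j)] = -1
    sigma.append(row)

t = rank(sigma)   # dim H^1(A)
s = n * r - t     # dim A

def hankel(alpha):
    """The matrix of the example tableau with parameters alpha_0, ..., alpha_4."""
    return [[alpha[a + i] for i in range(3)] for a in range(3)]

def satisfies_symbol(matrix):
    """True when all four symbol conditions vanish on the given 3x3 matrix."""
    flat = [x for row in matrix for x in row]
    return all(sum(c * x for c, x in zip(row, flat)) == 0 for row in sigma)

# Check the five basis matrices (one alpha_m set to 1, the others to 0).
basis_in_kernel = all(
    satisfies_symbol(hankel([1 if k == m else 0 for k in range(5)]))
    for m in range(5)
)
print(t, s, basis_in_kernel)
```

Exact rational arithmetic is used so that the rank is not affected by floating-point round-off.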

1(a). Rank-one ideal

The fundamental theorem of linear algebra states that any homomorphism $\pi\in W\otimes V^{*}$ has a well-defined rank. Thus, for any tableau $A\subset W\otimes V^{*}$, we could ask how $\operatorname{rank}(\pi)$ varies across $\pi\in A$. For our purposes, the most interesting$^{2}$ case is $\operatorname{rank}(\pi)=1$.

$^{2}$ There is a good reason that the rank-1 case is most interesting: the varieties of higher-rank matrices are determined algebraically by the varieties of lower-rank matrices, so the geometry of $\operatorname{rank}(\pi)$ across $\pi\in A$ comes down to the rank-1 case.

The space $W\otimes V^{*}$ admits the rank-1 ideal, $\mathscr{R}$ , which is irreducible and generated by all $2\times 2$ minors $\left\{0=\pi^{a}_{i}\pi^{b}_{j}-\pi^{a}_{j}\pi^{b}_{i}\right\}$ in any basis. This is a homogeneous ideal, so we may consider the rank-1 cone in vector space or the rank-1 variety in projective space. (The vertex of the rank-1 cone is the rank-0 matrix.)

For any $A$ , consider the ideal $A^{\perp}+\mathscr{R}$ , which defines $\operatorname{\mathscr{C}}\subset A$ as the variety $\operatorname{\mathscr{C}}=A\cap\operatorname{Var}(\mathscr{R})$ . The variety $\operatorname{\mathscr{C}}$ is the set of matrices in $A$ that are also rank-1; it is a linear section of the rank-1 cone defined by $\mathscr{R}$ .

In the example (1.2), $\operatorname{\mathscr{C}}$ can be parametrized as matrices of the form

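$$\begin{pmatrix} \kappa^{4} & \kappa^{3}\tau & \kappa^{2}\tau^{2} \\ \kappa^{3}\tau & \kappa^{2}\tau^{2} & \kappa\tau^{3} \\ \kappa^{2}\tau^{2} & \kappa\tau^{3} & \tau^{4} \end{pmatrix} = \begin{pmatrix} \kappa^{2} \\ \kappa\tau \\ \tau^{2} \end{pmatrix} \otimes \begin{pmatrix} \kappa^{2} & \kappa\tau & \tau^{2} \end{pmatrix}, \tag{1.4}$$

which can be interpreted as the rational normal Veronese curve$^{3}$,

$$[\kappa^{4} : \kappa^{3}\tau : \kappa^{2}\tau^{2} : \kappa\tau^{3} : \tau^{4}] \cong \mathbb{P}^{1} \subset \mathbb{P}^{4} \cong \mathbb{P}A. \tag{1.5}$$

$^{3}$ For more on Veronese curves and the more general Segre embeddings and determinantal varieties, see [Har92, Sha94].

Moreover, the projection of $\operatorname{\mathscr{C}}$ to $\mathbb{P}V^{*}$ is another rational normal curve,

$$[\kappa^{2} : \kappa\tau : \tau^{2}] \cong \mathbb{P}^{1} \subset \mathbb{P}^{2} \cong \mathbb{P}V^{*}. \tag{1.6}$$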

This toy example plays a crucial role in applications for hyperbolic and hydrodynamically integrable PDEs [FHK09, Smi09].
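The description of $\operatorname{\mathscr{C}}$ as the intersection of the tableau with the rank-1 cone can be spot-checked numerically. The sketch below is plain Python, and the function names are illustrative assumptions. It tests that outer products of $(\kappa^{2},\kappa\tau,\tau^{2})$ with itself satisfy both the $2\times 2$ minor equations generating $\mathscr{R}$ and the linear conditions defining the example tableau, and it exhibits one element of the tableau that is not rank-1. It samples a few integer points only, so it illustrates the parametrization rather than proving that it exhausts $\operatorname{\mathscr{C}}$.

```python
def minors_vanish(m):
    """True when every 2x2 minor of m is zero, i.e. m has rank at most 1."""
    rows, cols = len(m), len(m[0])
    return all(
        m[a][i] * m[b][j] - m[a][j] * m[b][i] == 0
        for a in range(rows) for b in range(a + 1, rows)
        for i in range(cols) for j in range(i + 1, cols)
    )

def in_tableau(m):
    """Membership in the example tableau: the entry depends only on row + column."""
    return all(
        m[a][i] == m[b][j]
        for a in range(3) for i in range(3)
        for b in range(3) for j in range(3)
        if a + i == b + j
    )

def rank_one_point(kappa, tau):
    """The outer product of (kappa^2, kappa*tau, tau^2) with itself."""
    col = [kappa * kappa, kappa * tau, tau * tau]
    return [[x * y for y in col] for x in col]

# A few sample parameter values (kappa, tau), chosen arbitrarily.
samples = [(1, 0), (0, 1), (1, 1), (2, -3), (5, 7)]
all_on_C = all(
    minors_vanish(rank_one_point(k, t)) and in_tableau(rank_one_point(k, t))
    for k, t in samples
)

# A tableau element that is not rank one: alpha = (1, 0, 0, 0, 1).
not_rank_one = [[1, 0, 0], [0, 0, 0], [0, 0, 1]]
print(all_on_C, in_tableau(not_rank_one), minors_vanish(not_rank_one))
```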

1(b). Generic Bases

We would like to find a “good” basis in which to express a tableau $A$ and study its properties.

First, an analogy. When studying a single homomorphism $F:\mathbb{C}^{n}\to\mathbb{C}^{r}$, or $F\in\mathbb{C}^{r}\otimes\mathbb{C}^{n*}$, there are various “good” bases of the domain and co-domain to express $F$. A basis of $\mathbb{C}^{n*}$ is “generic” for $F$ if the first $\operatorname{rank}(F)$ columns of $F$ are independent in that basis. A basis of $\mathbb{C}^{r}$ is “generic” for $F$ if the first $\operatorname{rank}(F)$ rows of $F$ are independent in that basis. Among the generic bases, we can construct particularly “good” bases for writing $F$. When $F$ is written in a “good” basis, we say it is in a “normal form,” and the normal form allows us readily to study properties of $F$. For example:

• Use Gaussian elimination$^{4}$ to place $F$ in reduced row-echelon form. Then, the rank, kernel, and image of $F$ are immediately apparent. The fundamental theorems in linear algebra depend on this normal form.

• Apply a polar/unitary decomposition to find the singular-value decomposition of $F$. Then, the norm of $F$ and its action with respect to the Hermitian inner products of $\mathbb{C}^{n}$ and $\mathbb{C}^{r}$ are immediately apparent. Important theorems in metric geometry and multivariate statistics depend on this normal form.

• Solve a sequence of eigenvalue problems in the case $n=r$ to find Jordan normal form. Then, the eigenspace structure of $F$, and the commutative algebra of matrices to which it belongs are immediately apparent. The theory of Lie groups and Lie algebras depends on this normal form.

$^{4}$ Algorithmically, this is accomplished using improved Gram-Schmidt or Householder triangularization. See [TB97] for a discussion of stability of row-reduction.

Given a tableau $A\subset W\otimes V^{*}$ with symbol $\sigma$ , we are curious whether we can construct bases that are “good” simultaneously for all homomorphisms in the tableau. This situation is considerably more complicated than the situation of a single homomorphism, and it turns out that it is most important to focus on the symbol maps, but we arrive at a satisfying answer in Section 7. By the above analogy, it is convenient to establish a notion of “generic” bases formulated in terms of independence. This is done as follows.

In any bases of $V^{*}$ and $W$ , the tableau $A$ is a space of $r\times n$ matrices only $s$ of whose entries are linearly independent. That is, in a given basis, we can consider the entries $\pi\mapsto\pi^{a}_{i}$ as elements of $A^{*}$ , just as we think of the components $v\mapsto v^{i}$ of vectors in $V$ as being linear functions on $v\in\mathbb{R}^{n}$ , using the dual basis of $V^{*}$ .

Across all bases of $V^{*}$ , there is a maximum number of independent entries that can occur in column 1; call that number $s_{1}$ . (In a measure-zero set of bases of $V^{*}$ , the number of actual independent entries in the first column may be less than $s_{1}$ .) Once those independent entries are accounted for, there is a maximum number $s_{2}$ of new independent entries that can occur in the second column. (In a measure-zero set of bases of $V^{*}$ that achieve $s_{1}$ in column 1, the number of actual independent entries in columns 1 and 2 may be less than $s_{1}+s_{2}$ .) Once those independent entries are accounted for, there is a maximum number $s_{3}$ of new independent entries that can occur in column 3. (In a measure-zero set of bases of $V^{*}$ that achieve $s_{1}+s_{2}$ in columns 1 and 2, the number of actual independent entries in columns 1, 2, and 3 may be less than $s_{1}+s_{2}+s_{3}$ .) Continuing in this way, we have $s_{i}$ as the number of new independent entries in the $i$ th column achieved for almost-all bases of $V^{*}$ . (In a measure-zero set of bases of $V^{*}$ that achieve $s_{1}+s_{2}+\cdots+s_{i-1}$ in columns 1 through $i{-}1$ , the number of actual independent entries in columns 1 through $i$ may be less than $s_{1}+\cdots+s_{i}$ .) Eventually, for such a basis, there is a column $\ell$ where we have reached $s_{1}+s_{2}+\cdots+s_{\ell}=s$ , so there is some maximum column $\ell\leq n$ such that $s_{\ell}>0$ , where the last independent entry appears. So,

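$$s = s_{1} + s_{2} + \dots + s_{\ell} + s_{\ell+1} + \dots + s_{n} = s_{1} + s_{2} + \dots + s_{\ell} + 0 + \dots + 0. \tag{1.7}$$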

The index $\ell$ is called the character of $A$ , and the number $s_{\ell}$ is called the Cartan integer of $A$ . The tuple $(s_{1},\ldots,s_{\ell})$ gives the Cartan characters of $A$ . Note that $s_{1}\geq s_{2}\geq\cdots\geq s_{\ell}$ , since otherwise the maximality condition would have been violated in an earlier column.

Permanently reserve the index ranges

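$$\begin{aligned} i, j &\in \{1, \dots, n\}, \\ \lambda, \mu &\in \{1, \dots, \ell\}, \\ \varrho, \varsigma &\in \{\ell+1, \dots, n\}, \text{ and} \\ a, b &\in \{1, \dots, r\}. \end{aligned} \tag{1.8}$$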

A basis$^{5}$ $(u^{i})=(u^{1},\ldots,u^{n})$ of $V^{*}$ is called generic if its Cartan characters achieve the lexicographical maximum value $(s_{1},s_{2},\ldots,s_{n})$. As seen in the previous paragraph, almost all bases of $V^{*}$ are generic in this sense. Given a basis $(u^{i})$ of $V^{*}$, a basis$^{6}$ $(z_{a})=(z_{1},\ldots,z_{r})$ of $W$ is called generic if the first $s_{i}$ entries in column $i$ are independent.

$^{5}$ We follow the notation of differential geometry. This notation indicates an ordered basis of co-vectors, not a vector. Each $u^{i}$ is an element of $V^{*}$.

$^{6}$ We follow the notation of differential geometry. This notation indicates an ordered basis of vectors, not a co-vector. Each $z_{a}$ is an element of $W$.

Choose a generic basis $(u^{i})=(u^{1},\ldots,u^{n})$ for $V^{*}$, and let $(u_{i})=(u_{1},\ldots,u_{n})$ be its dual basis for $V$. Choose a generic basis $(z_{a})=(z_{1},\ldots,z_{r})$ for $W$, and let $(z^{a})=(z^{1},\ldots,z^{r})$ be its dual basis for $W^{*}$. An element of the tableau is written as

$$\pi = \pi^{a}_{i} (z_{a} \otimes u^{i}) \in W \otimes V^{*}, \tag{1.9}$$

and the upper-left entries $\pi^{a}_{\lambda}$ for $a\leq s_{\lambda}$ form a basis of $A^{*}$ .

Because the bases are generic, the symbol map $\sigma$ can be written as

$$\Big\{ 0 = \pi^{a}_{i} - B^{a,\lambda}_{i,b}\,\pi^{b}_{\lambda} ~:~ 1 \leq i \leq n,\ s_{i} < a \leq r \Big\}. \tag{1.10}$$

It is implicit that $B^{a,\lambda}_{i,b}=0$ if $a\leq s_{i}$ or $b>s_{\lambda}$ or $i<\lambda$. That is, entries to the lower-right (unshaded) are written as linear combinations of the entries in the upper-left (shaded) using the coefficients $B^{a,\lambda}_{i,b}$, as in Figure 1.

Consider the example (1.3), which is not written in generic bases. If we exchange columns $2\leftrightarrow 3$ and rows $1\leftrightarrow 3$ , then it becomes generic with $(s_{1},s_{2},s_{3})=(3,2,0)$ , seen here:

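$$\left\{ \begin{pmatrix} \alpha_{2} & \alpha_{4} & \alpha_{3} \\ \alpha_{1} & \alpha_{3} & \alpha_{2} \\ \alpha_{0} & \alpha_{2} & \alpha_{1} \end{pmatrix} \right\} = \left\{ \begin{pmatrix} \pi^{1}_{1} & \pi^{1}_{2} & \pi^{2}_{2} \\ \pi^{2}_{1} & \pi^{2}_{2} & \pi^{1}_{1} \\ \pi^{3}_{1} & \pi^{1}_{1} & \pi^{2}_{1} \end{pmatrix} \right\}. \tag{1.11}$$

Equation (1.10) becomes:

$$\begin{aligned} 0 &= \pi^{3}_{2} - 1\,\pi^{1}_{1} - 0\,\pi^{2}_{1} - 0\,\pi^{3}_{1} - 0\,\pi^{1}_{2} - 0\,\pi^{2}_{2}, \\ 0 &= \pi^{1}_{3} - 0\,\pi^{1}_{1} - 0\,\pi^{2}_{1} - 0\,\pi^{3}_{1} - 0\,\pi^{1}_{2} - 1\,\pi^{2}_{2}, \\ 0 &= \pi^{2}_{3} - 1\,\pi^{1}_{1} - 0\,\pi^{2}_{1} - 0\,\pi^{3}_{1} - 0\,\pi^{1}_{2} - 0\,\pi^{2}_{2}, \\ 0 &= \pi^{3}_{3} - 0\,\pi^{1}_{1} - 1\,\pi^{2}_{1} - 0\,\pi^{3}_{1} - 0\,\pi^{1}_{2} - 0\,\pi^{2}_{2}. \end{aligned} \tag{1.12}$$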
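The column-by-column count of new independent entries can be carried out mechanically for this example. The Python sketch below uses only the standard library, and its function names are illustrative assumptions. It treats each matrix entry as a linear functional on the parameters $(\alpha_{0},\ldots,\alpha_{4})$ and records how much the rank grows as each column is added. It computes the counts for one specific ordering of the columns, not the maximum over all bases. Exchanging rows does not change these counts, so only the column order is varied.

```python
from fractions import Fraction

def rank(rows):
    """Rank of a matrix (given as a list of rows) by exact Gaussian elimination."""
    m = [[Fraction(x) for x in row] for row in rows]
    rk = 0
    for c in range(len(m[0])):
        pivot = next((k for k in range(rk, len(m)) if m[k][c] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for k in range(len(m)):
            if k != rk and m[k][c] != 0:
                f = m[k][c] / m[rk][c]
                m[k] = [x - f * y for x, y in zip(m[k], m[rk])]
        rk += 1
    return rk

def entry_functional(a, i):
    """Entry in row a, column i (0-based) of the original example matrix,
    as a coefficient vector in (alpha_0, ..., alpha_4); the entry is alpha_{a+i}."""
    v = [0] * 5
    v[a + i] = 1
    return v

def characters(column_order):
    """New independent entries contributed by each column, taken in the given order."""
    rows, chars, previous = [], [], 0
    for i in column_order:
        rows.extend(entry_functional(a, i) for a in range(3))
        current = rank(rows)
        chars.append(current - previous)
        previous = current
    return tuple(chars)

standard = characters([0, 1, 2])  # columns in the original order
swapped = characters([0, 2, 1])   # columns 2 and 3 exchanged
print(standard, swapped)
```

In the original column order the counts are $(3,1,1)$, while after exchanging columns 2 and 3 they are $(3,2,0)$. Since $s_{1}\leq r=3$ and the counts sum to $s=5$, the tuple $(3,2,0)$ is the lexicographic maximum.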

One can take the dual perspective, wherein the symbol coefficients $B^{a,\lambda}_{i,b}$ define a map from the upper-left independent entries to the lower-right entries. That is, consider the map

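$$\operatorname{B} \in V^{*} \otimes V \otimes W \otimes W^{*} \cong \operatorname{End}(V^{*}) \otimes \operatorname{End}(W) \tag{1.13}$$

defined by

$$\sum_{a \leq s_{i}} \delta^{\lambda}_{i} \delta^{a}_{b}\,(z_{a} \otimes z^{b}) \otimes (u^{i} \otimes u_{\lambda}) + \sum_{a > s_{i}} B^{a,\lambda}_{i,b}\,(z_{a} \otimes z^{b}) \otimes (u^{i} \otimes u_{\lambda}). \tag{1.14}$$

Equation (1.14) is the formal inclusion $A\to W\otimes V^{*}$ in the defining exact sequence (1.1). By fixing $\varphi\in V^{*}$ and $v\in V$, we obtain an endomorphism $\operatorname{B}(\varphi)(v):W\to W$ defined by the column relations of $(\pi^{a}_{i})$, as in Figure 2. We use the shorthand $\operatorname{B}^{\lambda}_{i}$ for $\operatorname{B}(u^{\lambda})(u_{i})$, but note that this is not quite the same as $B^{a,\lambda}_{i,b}z_{a}\otimes z^{b}$ because of the identity term in Equation (1.14). That is, $\operatorname{B}^{\lambda}_{\lambda}=\sum_{a\leq s_{\lambda}}\delta^{a}_{b}(z_{a}\otimes z^{b})$ for all $\lambda\leq\ell$.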

For the example (1.11)–(1.12), the maps $\operatorname{B}^{\lambda}_{i}:W\to W$ are:

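$$\operatorname{B}^{1}_{1} = \begin{pmatrix} 1&0&0 \\ 0&1&0 \\ 0&0&1 \end{pmatrix},\quad \operatorname{B}^{1}_{2} = \begin{pmatrix} 0&0&0 \\ 0&0&0 \\ 1&0&0 \end{pmatrix},\quad \operatorname{B}^{2}_{2} = \begin{pmatrix} 1&0&0 \\ 0&1&0 \\ 0&0&0 \end{pmatrix},\quad \operatorname{B}^{1}_{3} = \begin{pmatrix} 0&0&0 \\ 1&0&0 \\ 0&1&0 \end{pmatrix},\quad \operatorname{B}^{2}_{3} = \begin{pmatrix} 0&1&0 \\ 0&0&0 \\ 0&0&0 \end{pmatrix}. \tag{1.15}$$

So, if $\varphi=\varphi_{i}u^{i}\in V^{*}$ and $v=v^{j}u_{j}\in V$, the endomorphism $\operatorname{B}(\varphi)(v):W\to W$ is

$$\operatorname{B}(\varphi)(v) = \begin{pmatrix} \varphi_{1}v^{1} + \varphi_{2}v^{2} & \varphi_{2}v^{3} & 0 \\ \varphi_{1}v^{3} & \varphi_{1}v^{1} + \varphi_{2}v^{2} & 0 \\ \varphi_{1}v^{2} & \varphi_{1}v^{3} & \varphi_{1}v^{1} \end{pmatrix}. \tag{1.16}$$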
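The symbol endomorphisms of this example are small enough to check directly. In the Python sketch below, the five matrices are transcribed from (1.15), and the dictionary layout and function names are illustrative assumptions. It verifies two things on sample data: that each column $i$ of a tableau element equals $\sum_{\lambda}\operatorname{B}^{\lambda}_{i}$ applied to column $\lambda$, and that $\sum_{\lambda,i}\varphi_{\lambda}v^{i}\operatorname{B}^{\lambda}_{i}$ reproduces the entries of $\operatorname{B}(\varphi)(v)$ in (1.16).

```python
# Keys are (lambda, i); values are the 3x3 matrices B^lambda_i of the example.
B = {
    (1, 1): [[1, 0, 0], [0, 1, 0], [0, 0, 1]],
    (1, 2): [[0, 0, 0], [0, 0, 0], [1, 0, 0]],
    (2, 2): [[1, 0, 0], [0, 1, 0], [0, 0, 0]],
    (1, 3): [[0, 0, 0], [1, 0, 0], [0, 1, 0]],
    (2, 3): [[0, 1, 0], [0, 0, 0], [0, 0, 0]],
}

def matvec(m, x):
    """Apply a 3x3 matrix to a vector of length 3."""
    return [sum(m[a][b] * x[b] for b in range(3)) for a in range(3)]

def generic_element(alpha):
    """Element of the example tableau after exchanging columns 2,3 and rows 1,3."""
    a0, a1, a2, a3, a4 = alpha
    return [[a2, a4, a3], [a1, a3, a2], [a0, a2, a1]]

def column(pi, i):
    """Column i (1-based) of the matrix pi."""
    return [row[i - 1] for row in pi]

def rebuilt_column(pi, i):
    """Sum over lambda of B^lambda_i applied to column lambda of pi."""
    total = [0, 0, 0]
    for (lam, j), mat in B.items():
        if j == i:
            total = [x + y for x, y in zip(total, matvec(mat, column(pi, lam)))]
    return total

def B_of(phi, v):
    """The endomorphism B(phi)(v) = sum of phi_lambda * v^i * B^lambda_i."""
    out = [[0] * 3 for _ in range(3)]
    for (lam, i), mat in B.items():
        coeff = phi[lam - 1] * v[i - 1]
        for a in range(3):
            for b in range(3):
                out[a][b] += coeff * mat[a][b]
    return out

pi = generic_element([1, 2, 3, 4, 5])
columns_match = all(rebuilt_column(pi, i) == column(pi, i) for i in (1, 2, 3))

phi, v = (2, 3, 7), (5, 11, 13)  # arbitrary sample values
p1, p2 = phi[0], phi[1]
v1, v2, v3 = v
expected = [
    [p1 * v1 + p2 * v2, p2 * v3, 0],
    [p1 * v3, p1 * v1 + p2 * v2, 0],
    [p1 * v2, p1 * v3, p1 * v1],
]
print(columns_match, B_of(phi, v) == expected)
```

The component $\varphi_{3}$ never enters the result, because no matrix $\operatorname{B}^{3}_{i}$ appears when $s_{3}=0$.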

Using our generic basis $(u_{i})$ for $V$ and its dual basis $(u^{i})$ for $V^{*}$ , define decompositions $V=U\oplus Y$ and $V^{*}=Y^{\perp}\oplus U^{\perp}$ using our index convention (1.8) as follows:

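$$\begin{aligned} V &= \langle u_{1}, \dots, u_{n} \rangle = \langle u_{i} \rangle, \\ U &= \langle u_{1}, \dots, u_{\ell} \rangle = \langle u_{\lambda} \rangle, \\ Y &= \langle u_{\ell+1}, \dots, u_{n} \rangle = \langle u_{\varrho} \rangle, \end{aligned} \tag{1.17}$$

and

$$\begin{aligned} V^{*} &= \langle u^{1}, \dots, u^{n} \rangle = \langle u^{i} \rangle, \\ U^{*} \cong Y^{\perp} &= \langle u^{1}, \dots, u^{\ell} \rangle = \langle u^{\lambda} \rangle, \\ Y^{*} \cong U^{\perp} &= \langle u^{\ell+1}, \dots, u^{n} \rangle = \langle u^{\varrho} \rangle. \end{aligned} \tag{1.18}$$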

The isomorphisms $U^{*}\cong Y^{\perp}$ and $Y^{*}\cong U^{\perp}$ depend on the basis; they are non-canonical but sometimes useful.

It is apparent from (1.14) that $\operatorname{B}(\varphi)=\operatorname{B}(\tilde{\varphi})$ if $\varphi-\tilde{\varphi}\in U^{\perp}$; that is, if $\varphi_{\lambda}=\tilde{\varphi}_{\lambda}$ for all $\lambda\leq\ell$, so we usually consider $\operatorname{B}(\varphi)$ only for $\varphi\in Y^{\perp}$.

Thus, in generic bases, we have a collection $\operatorname{B}^{\lambda}_{i}$ of endomorphisms of $W$ . For our purposes of constructing a normal form, a “good” basis is one which makes the endomorphisms $\operatorname{B}^{\lambda}_{i}$ as structurally similar as possible. Section 1(c) imposes additional conditions on the images of these endomorphisms for this purpose.

1(c). Endovolutive Tableaux

Suppose $(u^{i})$ and $(z_{a})$ are generic bases for $A$ . For any $i$ , define a decomposition $W=\operatorname{\mathbf{W}}^{-}_{i}\oplus\operatorname{\mathbf{W}}^{+}_{i}$ by

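$$\begin{aligned} W &= \langle z_{1}, \dots, z_{r} \rangle = \langle z_{a} \rangle, \\ \operatorname{\mathbf{W}}^{-}_{i} &= \langle z_{1}, \dots, z_{s_{i}} \rangle, \\ \operatorname{\mathbf{W}}^{+}_{i} &= \langle z_{s_{i}+1}, \dots, z_{r} \rangle \end{aligned} \tag{1.19}$$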

By (1.14), the map $\operatorname{B}^{\lambda}_{i}:W\to W$ has support $\operatorname{\mathbf{W}}^{-}_{\lambda}\subset W$ , and its image lies in $\operatorname{\mathbf{W}}^{+}_{i}\subset W$ .

More generally, for any $\varphi\in V^{*}$ , let $\operatorname{\mathbf{W}}^{-}(\varphi)=\operatorname{\mathbf{W}}^{-}_{\underline{\lambda}}$ and $\operatorname{\mathbf{W}}^{+}(\varphi)=\operatorname{\mathbf{W}}^{+}_{\underline{\lambda}}$ , where $\underline{\lambda}$ is the minimum index such that $\varphi_{\underline{\lambda}}\neq 0$ . (For general $\varphi$ , we have $\dim\operatorname{\mathbf{W}}^{-}(\varphi)=s_{1}$ .)

A tableau $A$ expressed in bases $(u^{i})$ and $(z_{a})$ is called endovolutive if $B^{a,\lambda}_{i,b}=0$ for all $a>s_{\lambda}$ . That is, endovolutive means that $\operatorname{B}^{\lambda}_{i}$ is an endomorphism of $\operatorname{\mathbf{W}}^{-}_{\lambda}\subset W$ , as in Figure 3. (The term endovolutive was coined in [Smi15], but the phenomenon was described previously in [BCG+90, Chapter IV§5] and [Yan87], and it is certainly familiar to anyone who has manipulated tableaux of linear Pfaffian systems.)

Note that the example (1.15) is endovolutive because $s_{2}=2$ and $\operatorname{B}^{2}_{2}$ and $\operatorname{B}^{2}_{3}$ have non-zero entries only in the upper-left $2\times 2$ part.

In this way, when considering endovolutive tableaux, it is useful to arrange the symbol endomorphisms as an $\ell\times n$ array of $r\times r$ matrices:

[TABLE]

Each “diagonal” entry $\operatorname{B}^{\lambda}_{\lambda}$ is the $r\times r$ matrix for which the non-zero upper-left part is an $s_{\lambda}\times s_{\lambda}$ identity matrix, $I_{s_{\lambda}}$ . Endovolutivity means that $\operatorname{B}^{\lambda}_{i}$ , which is the $r\times r$ matrix in row $\lambda$ of (1.20), is zero outside the upper-left $s_{\lambda}\times s_{\lambda}$ part.
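The block condition just described is easy to test mechanically. The following Python sketch is only an illustration: the function name `is_endovolutive`, the dictionary layout keyed by $(\lambda,i)$ , and the sample matrices are assumptions made here, not data taken from the example (1.15).

```python
def is_endovolutive(B, s):
    """Return True if every block B[(lam, i)] vanishes outside its
    upper-left s[lam] x s[lam] corner (lam and i are 1-based labels)."""
    for (lam, _i), block in B.items():
        size = s[lam]
        for a, row in enumerate(block):
            for b, entry in enumerate(row):
                # Zero-based positions a, b lie outside the corner
                # exactly when a >= s_lam or b >= s_lam.
                if (a >= size or b >= size) and entry != 0:
                    return False
    return True


# Hypothetical data with r = 3, s_1 = 3, s_2 = 2 (chosen for illustration).
s = {1: 3, 2: 2}
good = {
    (2, 2): [[1, 0, 0], [0, 1, 0], [0, 0, 0]],  # I_{s_2} padded with zeros
    (2, 3): [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
}
bad = dict(good)
bad[(2, 3)] = [[0, 1, 0], [1, 0, 0], [5, 0, 0]]  # third row is nonzero

print(is_endovolutive(good, s))  # True
print(is_endovolutive(bad, s))   # False
```

The check tests both the rows $a>s_{\lambda}$ (the defining condition) and the columns $b>s_{\lambda}$ (the support condition), which together express "zero outside the upper-left $s_{\lambda}\times s_{\lambda}$ part."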

If a tableau is endovolutive in certain bases for $W$ and $V^{*}$ , then it is also endovolutive under any upper-triangular change of basis $u^{i}\mapsto g^{i}_{j}u^{j}$ . Under such a basis change, the columns of $(\pi^{a}_{i})$ are linear combinations of the ones to their right, and the sub-matrices in (1.20) change by the corresponding conjugation. Endovolutivity is a property of the flag generated by the basis of $V^{*}$ .

1(d). Mutual Eigenvectors and Rank

For endovolutive bases, each $\operatorname{B}^{\lambda}_{i}$ is an endomorphism of a particular vector space, so it is sensible to consider an eigenvector problem for these maps: For any $\lambda$ , let

[TABLE]

That is, we want to find the vectors that are preserved by $\operatorname{B}^{\lambda}_{\lambda}=I_{s_{\lambda}}$ but are annihilated by all $\operatorname{B}^{\lambda}_{\mu}$ for $\mu\neq\lambda$ . More generally, let

[TABLE]

Equation (1.22) can be rewritten as a mutual eigenvector problem on the $\ell$ endomorphisms $\operatorname{B}(\varphi)(u_{1})$ , …, $\operatorname{B}(\varphi)(u_{\ell})$ :

[TABLE]

Alternatively, because $B^{\mu}_{\mu}=I_{s_{\mu}}$ , equation (1.22) says that $\operatorname{B}(\varphi)(\cdot)w$ is rank-1 when restricted to $Y^{\perp}$ , so we can rewrite it as

[TABLE]

This space is the focus of [Gui68], and it plays an important part in our story. Unlike $\operatorname{\mathbf{W}}^{-}(\varphi)$ , its definition does not depend on the basis; its definition depends only on the splitting $V=U\oplus Y$ . Its dimension is an important invariant.

Lemma 1.25.

Suppose that the tableau $A$ admits endovolutive bases. For generic $\varphi$ , $\dim\operatorname{\mathbf{W}}^{1}(\varphi)=s_{\ell}$ .

Lemma 1.25 is the result of a quick rank computation using (1.22)–(1.23). See [Smi15].

Our “good” basis and normal form will be built on the requirement that the maps $\operatorname{B}^{\lambda}_{i}$ commute on certain combinations of these spaces (and thus the maps share generalized eigenspaces and Jordan-block normal form there). That is, we are aiming for something like the commutative subalgebras seen in [Ger61] and [GS00]. Endovolutivity allows surprisingly direct computation of the desired conditions. For more detail on endovolutivity, see [Smi15] and the references therein. We return to this topic in Section 5, but before that we must introduce the geometry of subspaces.

2. Grassmann and Universal Bundles

The Grassmann variety is the set $\operatorname{Gr}_{n}(X)$ of $n$ -planes in an $(n{+}r)$ -dimensional vector space $X$ . It is a smooth projective variety and a smooth manifold of dimension $nr$ . An $n$ -plane $e\in\operatorname{Gr}_{n}(X)$ is called an element.

2(a). Tangent and Arctangent

Depending on one’s favorite notation, there are several ways to see that the tangent space of $\operatorname{Gr}_{n}(X)$ at $e$ is $(X/e)\otimes e^{*}$ .

First, for any $e\in\operatorname{Gr}_{n}(X)$ , choose a basis $(u_{i})=(u_{1},\ldots,u_{n})$ for $e$ , and choose $(z_{a})=(z_{1},\ldots,z_{r})$ so as to complete a basis of the entire vector space $X$ . Any $n$ -plane $\tilde{e}$ near $e$ admits a basis $(\tilde{u}_{i})=(\tilde{u}_{1},\ldots,\tilde{u}_{n})$ that we may assume is related by a matrix in reduced column-echelon form:

[TABLE]

More succinctly, using the summation convention:

[TABLE]

That is, $(\tilde{u}_{i})$ and $(u_{i})$ are related by an $(n+r)\times n$ matrix of rank $n$ whose image $\left\langle\tilde{u}_{1},\ldots,\tilde{u}_{n}\right\rangle=\tilde{e}$ is determined uniquely by the $r\times n$ submatrix $(K^{a}_{i})$ . In this sense, $T_{e}\operatorname{Gr}_{n}(X)$ is isomorphic to the space of $r\times n$ matrices, which is isomorphic to $(X/e)\otimes e^{*}$ . This is easy and computational, but this isomorphism is not canonical for an abstract vector space (without metric) because it depends on a choice of splitting $X=e\oplus(X/e)$ by choosing the complementary basis $(z_{a})$ .

Alternatively, to see $T_{e}\operatorname{Gr}_{n}(X)=(X/e)\otimes e^{*}$ and avoid splitting, recall that $(X/e)^{*}$ is canonically isomorphic to $e^{\perp}$ : if $[v]=\{v+e\}\in X/e$ , then $\varphi([v])=\varphi(v)+0$ is well-defined for all $\varphi\in e^{\perp}$ . With this identification, we can use the dual short-exact sequences

[TABLE]

Choose any basis $(\theta^{a})=(\theta^{1},\ldots,\theta^{r})$ of the annihilator space $e^{\perp}=(X/e)^{*}$ , and let $(z_{a})=(z_{1},\ldots,z_{r})$ be the corresponding dual basis of $(X/e)$ . Then, we may take the coefficients $K^{a}_{i}$ of

[TABLE]

as $nr$ coordinates on $T_{e}\operatorname{Gr}_{n}(X)$ ; that is, $K^{a}_{i}$ gives a basis of $T^{*}_{e}\operatorname{Gr}_{n}(X)$ .

More abstractly, an explicit choice of bases $(u_{i})$ for $e$ and $(\theta^{a})$ for $e^{\perp}$ is unnecessary. Instead, we need only the abstract homomorphism $K\in(X/e)\otimes e^{*}$ , because the space (note that $v+K(v)$ is not well-defined in $X$ for any particular $v\in e$ , but the span over all such $v$ is well-defined)

[TABLE]

is invariant under $GL(n)$ transformations on $(u_{i})$ and $(\tilde{u}_{i})$ as well as $GL(r)$ transformations on $\theta$ . That is, $\tilde{e}$ is the “graph” of $v\mapsto v+K(v)$ over all $v\in e$ .

As in Figure 4, the derivative map $\operatorname{Gr}_{n}(X)\to(X/e)\otimes e^{*}$ near $e$ is a multidimensional generalization of the tangent function. The inverse map $\arctan_{e}$ is analogous to the exponential map $\exp_{p}:T_{p}M\to M$ from Riemannian geometry or Lie group representation theory, except that this description of $\arctan_{e}$ does not make explicit use of a metric or group structure. It is written

[TABLE]

The reader is encouraged to read [MS74, §5] and [KN63] and to search for the terms Plücker embedding and Stiefel manifold for more detail on this subject.

Remark 2.7.

Notice that any linear subspace of $(X/e)\otimes e^{*}$ is a tableau in the sense of Section 1. In some sense, it is the only example, as arbitrary $V$ and $W$ could be studied by setting $X=V\oplus W$ and $e=V+0$ . Moreover, any smooth submanifold $Z\subset\operatorname{Gr}_{n}(X)$ with tangent space $T_{e}Z\subset T_{e}\operatorname{Gr}_{n}(X)$ at $e\in Z$ gives $T_{e}Z$ as a tableau in $(X/e)\otimes e^{*}$ . This observation is the heart of the entire subject of exterior differential systems, and it reappears forcefully in Section 4.

2(b). Polar pairs

The purpose of this subsection is to establish two results, Lemmas 2.12 and 2.15, that tie the algebraic geometry of intersecting subspaces to the differential geometry of the Grassmannian. These lemmas are used in Part III to demonstrate the correspondence between the characteristic variety (in the Cauchy problem of a system of PDEs) and the rank-1 variety (of the tableau of an EDS) in Lemma 6.4, thus providing the foundation of the geometric theory of PDEs.

Suppose that $e,\tilde{e}\in\operatorname{Gr}_{n}(X)$ , and that they intersect along a hyperplane. That is, suppose $e^{\prime}=e\cap\tilde{e}$ and $\dim e^{\prime}=n-1$ . We call the pair of $n$ -planes $e$ and $\tilde{e}$ a polar pair because they are both polar extensions of $e^{\prime}$ (this is classical terminology that reappears in Section 6(a)). For any $e\in\operatorname{Gr}_{n}(X)$ , let

[TABLE]

We say $\tilde{e}\in\operatorname{Pol}_{1}(e)$ is a polar pair of $e$ . This relationship is symmetric—hence the unqualified term polar pair—as $\tilde{e}\in\operatorname{Pol}_{1}(e)$ if and only if $e\in\operatorname{Pol}_{1}(\tilde{e})$ , but this relationship is not an equivalence relation, as it fails both reflexivity and transitivity.

Within the image of $\arctan_{e}$ , Lemma 2.9 ties the notion of polar pairs to lines in the tangent space $T_{e}\operatorname{Gr}_{n}(X)$ .

Lemma 2.9.

Suppose $e\in\operatorname{Gr}_{n}(X)$ and $\tilde{e}=\arctan_{e}(K)$ for $K\in(X/e)\otimes e^{*}$ . Then $\operatorname{rank}(K)=1$ if and only if $\tilde{e}\in\operatorname{Pol}_{1}(e)$ .

Proof.

Suppose that $\tilde{e}\in\operatorname{Pol}_{1}(e)$ . Let $e^{\prime}=e\cap\tilde{e}$ , so $\dim e^{\prime}=n-1$ . Let $(u_{1},\ldots,u_{n-1})$ be a basis for $e^{\prime}$ , and extend that basis to a basis $(u_{1},\ldots,u_{n-1},v)$ for $e$ and to a basis $(u_{1},\ldots,u_{n-1},\tilde{v})$ for $\tilde{e}$ . Writing (2.2) in this case, it is apparent that only the $n$ th column of $\left(K^{a}_{i}\right)$ is nonzero. That is, the tangent homomorphism $K\in(X/e)\otimes e^{*}$ is rank-1. (It cannot be the degenerate rank-0 unless $e=\tilde{e}$ .)

Conversely, suppose that $K\in(X/e)\otimes e^{*}$ is rank-1. Let $e^{\prime}=\ker K\subset e$ , which is a subspace of $e$ of dimension $n{-}1$ . Any line in $e^{\prime}$ is preserved by the map $e\to X$ defined by the matrix in (2.2); hence, the subspace $e^{\prime}$ is also a subspace of $\tilde{e}$ . (It cannot be the degenerate case $e=\tilde{e}$ unless $K$ is rank-0.) ∎

The concept of polar pairs generalizes to co-dimensions $k$ other than $1$ . For any $e\in\operatorname{Gr}_{n}(X)$ , let

[TABLE]

Because $\dim X=n+r$ and $\dim e=n$ , the set $\operatorname{Pol}_{k}(e)$ with $0\leq k\leq n$ is nonempty if and only if $k\leq r$ , since $n+k=\dim(e+\tilde{e})\leq n+r$ . The definition is trivial and fairly useless for $k=0$ . Again, the $k$ -polar-pair relationship $\tilde{e}\in\operatorname{Pol}_{k}(e)$ is symmetric but neither reflexive nor transitive for the interesting case $0<k\leq r$ .

One can see immediately that Lemma 2.9 generalizes by replacing $1$ with any rank $k$ to give Lemma 2.11.

Lemma 2.11.

Suppose $e\in\operatorname{Gr}_{n}(X)$ and $\tilde{e}=\arctan_{e}(K)$ for $K\in(X/e)\otimes e^{*}$ . Then $\operatorname{rank}(K)=k$ if and only if $\tilde{e}\in\operatorname{Pol}_{k}(e)$ .
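Lemma 2.11 can be checked numerically in coordinates. The sketch below assumes a splitting in which $e$ is spanned by the first $n$ coordinate vectors of $X=\mathbb{R}^{n+r}$ and $\tilde{e}=\arctan_{e}(K)$ is the column span of the identity block stacked on the $r\times n$ matrix $K$ ; the helper names `rank`, `graph_plane`, and `intersection_dim` are hypothetical. It verifies that $\dim(e\cap\tilde{e})=n-\operatorname{rank}(K)$ using exact rational arithmetic.

```python
from fractions import Fraction


def rank(rows):
    """Rank of a matrix (given as a list of rows) by exact elimination."""
    m = [[Fraction(x) for x in row] for row in rows]
    ncols = len(m[0]) if m else 0
    rk = 0
    for col in range(ncols):
        pivot = next((i for i in range(rk, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for i in range(rk + 1, len(m)):
            factor = m[i][col] / m[rk][col]
            m[i] = [x - factor * y for x, y in zip(m[i], m[rk])]
        rk += 1
    return rk


def graph_plane(K):
    """(n+r) x n matrix whose columns span {v + K(v)}: identity over K."""
    n = len(K[0])
    identity = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
    return identity + [list(row) for row in K]


def intersection_dim(K):
    """dim(e ∩ ẽ), where e is the graph of 0 and ẽ is the graph of K."""
    r, n = len(K), len(K[0])
    e = graph_plane([[0] * n for _ in range(r)])
    e_tilde = graph_plane(K)
    joined = [row_e + row_t for row_e, row_t in zip(e, e_tilde)]
    # dim(e ∩ ẽ) = dim e + dim ẽ - dim(e + ẽ)
    return 2 * n - rank(joined)


K = [[1, 2, 3], [2, 4, 6]]          # r = 2, n = 3, rank 1
print(rank(K), intersection_dim(K))  # 1 2
```

In this sample, $K$ has rank $1$ and the two $3$ -planes meet in a $2$ -plane, so $\tilde{e}\in\operatorname{Pol}_{1}(e)$ as the lemma predicts.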

Next, we can generalize Lemma 2.11 to Lemma 2.12 by dropping the use of $\arctan$ . That is, we can consider a $k$ -polar pair $(e,\tilde{e})$ where $\tilde{e}$ lies outside the open image of $\arctan_{e}$ . From an algebraic perspective, Lemma 2.12 can be seen as a Grassmannian version of the rank-nullity theorem. Phrased in other ways, it is a popular true/false homework question in undergraduate linear algebra textbooks.

Lemma 2.12.

Fix $e\in\operatorname{Gr}_{n}(X)$ and $\tilde{e}\in\operatorname{Pol}_{k}(e)$ . The canonical maps $\tilde{e}\mapsto\tilde{e}/e$ and $\tilde{e}\mapsto(\tilde{e}\cap e)^{\perp}/e^{\perp}$ both have rank- $k$ images, yielding the incidence correspondence of Figure 6.

Proof.

Let $e^{\prime}=e\cap\tilde{e}$ . Consider the short-exact sequences (2.3), and apply the rank-nullity theorem to those maps on $e^{\prime}$ , which has dimension $n-k$ . In the first short-exact sequence, the image $\tilde{e}/e\cong\tilde{e}/e^{\prime}$ has dimension $k$ as a subspace of $X/e$ . In the second short-exact sequence, the space $\left(\tilde{e}\cap e\right)^{\perp}/e^{\perp}=(e^{\prime})^{\perp}/e^{\perp}$ has dimension $k$ as a subspace of $e^{*}=X^{*}/e^{\perp}$ . In both cases, any such subspace can be constructed this way. ∎

Now, reconsider the case $k=1$ in light of Lemma 2.12. Then each $\tilde{e}\in\operatorname{Pol}_{1}(e)$ yields a hyperplane $e^{\prime}=\tilde{e}\cap e$ . The right image $(e^{\prime})^{\perp}/e^{\perp}$ in Figure 6 is some line $[\xi]\in\mathbb{P}e^{*}$ . The left image $\tilde{e}/e$ is some line $[w]\in\mathbb{P}(X/e)$ . So, each $\tilde{e}\in\operatorname{Pol}_{1}(e)$ yields a rank-1 projective homomorphism $[w]\otimes[\xi]=[w\otimes\xi]\in\mathbb{P}\left((X/e)\otimes e^{*}\right)$ . Any rank-1 element of $\mathbb{P}\left((X/e)\otimes e^{*}\right)$ could be obtained this way by appropriate choice of $\tilde{e}$ .

To see how this generalizes Lemma 2.9, let us write $[w]\otimes[\xi]$ explicitly. Let

[TABLE]

be a basis for $X^{*}$ such that $e=\ker\{\theta^{1},\ldots,\theta^{r}\}$ and $e^{\prime}=\ker\{\theta^{1},\ldots,\theta^{r},\xi\}$ for some $\xi=\xi_{i}\omega^{i}$ . Then, $\tilde{e}=\ker\{\tilde{\theta}^{1},\ldots,\tilde{\theta}^{r}\}$ for some $\tilde{\theta}^{a}=J^{a}_{b}\theta^{b}+K^{a}_{i}\omega^{i}$ . Because $e^{\prime}\subset\tilde{e}$ , it must be that

[TABLE]

Hence, each $K^{a}_{i}\omega^{i}$ is a multiple of $\xi$ ; call it $w^{a}\xi$ . (Note that $w^{a}=0$ for all $a$ if and only if $\tilde{e}=e$ , which contradicts our assumption $\dim e^{\prime}=n-1$ .) We can use this fact to build a rank-1 homomorphism: Let $(z_{a})$ be the basis of $X/e$ dual to $(\theta^{a})$ . Let $(\omega^{i})$ also denote the basis of $e^{*}=X^{*}/e^{\perp}$ induced by $\omega^{i}\in X^{*}$ , so that $\xi\in e^{*}$ also denotes the image of $\xi\in X^{*}$ . Let $w=w^{a}z_{a}$ . Then the induced homomorphism

[TABLE]

is rank-1. Each of $w$ and $\xi$ is well-defined up to scale, so $K$ is well-defined up to scale, yielding $[K]\in\mathbb{P}\left((X/e)\otimes e^{*}\right)$ .
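To make the construction of $K=w\otimes\xi$ concrete, here is a small Python sketch under stated assumptions: $w$ and $\xi$ are given by integer components in the bases $(z_{a})$ and $(\omega^{i})$ , and the helper names `outer`, `is_rank_one`, and `apply` are hypothetical. It confirms that the outer product has rank 1 (nonzero, with all $2\times 2$ minors vanishing) and that it annihilates any vector in the hyperplane $\ker\xi$ .

```python
def outer(w, xi):
    """Matrix of w ⊗ xi: entry (a, i) is w^a * xi_i."""
    return [[wa * xi_i for xi_i in xi] for wa in w]


def is_rank_one(K):
    """True when K is nonzero and every 2x2 minor of K vanishes."""
    rows, cols = len(K), len(K[0])
    nonzero = any(x != 0 for row in K for x in row)
    minors_vanish = all(
        K[a][i] * K[b][j] == K[a][j] * K[b][i]
        for a in range(rows) for b in range(rows)
        for i in range(cols) for j in range(cols)
    )
    return nonzero and minors_vanish


def apply(K, v):
    """Matrix-vector product K v."""
    return [sum(k * x for k, x in zip(row, v)) for row in K]


w = [1, 2]          # components of w in (z_a), so r = 2
xi = [0, 3, -1]     # components of xi in (omega^i), so n = 3
K = outer(w, xi)

print(K)                    # [[0, 3, -1], [0, 6, -2]]
print(is_rank_one(K))       # True
print(apply(K, [1, 1, 3]))  # [0, 0], because xi(1, 1, 3) = 0
```

Rescaling $w$ or $\xi$ rescales $K$ by the product of the two factors, which is why only the projective class $[K]$ is determined.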

It may be that $\tilde{e}$ lies outside the open image of $\arctan_{e}$ . How then do we interpret $K$ ? Is there any relationship between $\tilde{e}$ and $\arctan_{e}(K)$ ? From a differential geometric perspective, this is reminiscent of the failure of injectivity at large distances for the exponential map in Riemannian geometry. Lemma 2.15 shows that for any polar pair $\tilde{e}$ of $e$ , either $\tilde{e}$ lies in the curve $\arctan_{e}([K])$ or is the limit of the curve.

Lemma 2.15.

Suppose $e\in\operatorname{Gr}_{n}(X)$ and $\tilde{e}\in\operatorname{Pol}_{1}(e)$ . Then there is a continuous path $\{e_{\tau}:0\leq\tau\leq 1\}$ in $\operatorname{Gr}_{n}(X)$ such that $e_{0}=e$ , $e_{1}=\tilde{e}$ , and $e_{\tau}\cap e=\tilde{e}\cap e=e_{\tau}\cap\tilde{e}$ for all $0<\tau<1$ . The rank-1 line $[K]$ induced by $e_{\tau}$ via Lemma 2.12 is constant across $0<\tau\leq 1$ . Moreover, $e_{\tau}\in\arctan_{e}([K])$ for $0\leq\tau<1$ .

Proof.

Let $e^{\prime}=e\cap\tilde{e}$ . For some independent vectors $v,w\in X$ , we may write $e=e^{\prime}+\left\langle v\right\rangle$ and $\tilde{e}=e^{\prime}+\left\langle w\right\rangle$ and define a curve from $e$ to $\tilde{e}$ in $\operatorname{Gr}_{n}(X)$ (if preferred, one can reparametrize from a linear interpolation to a circular interpolation by replacing $\tau$ with $\cos\vartheta$ and $1-\tau$ with $\sin\vartheta$ for some angle $0\leq\vartheta\leq\pi/2$ ) by

[TABLE]

Note that $e^{\prime}=e\cap e_{\tau}=\tilde{e}\cap e_{\tau}$ for all $0<\tau<1$ . It is apparent from (2.16) that $e_{\tau}/e$ is the line $[\tau w]=[w]$ , which is constant in $\tau$ . It is also apparent from (2.16) that $(e_{\tau}\cap e)^{\perp}/e^{\perp}=(e^{\prime})^{\perp}/e^{\perp}$ is the line $[\xi]$ , which is constant in $\tau$ . Hence, all such $e_{\tau}$ have the same representative rank-1 homomorphism, $[w\otimes\xi]=[K]$ in Lemma 2.12.

It may be that $\tilde{e}=e_{1}$ lies outside the open image of $\arctan_{e}$ . However, comparison of (2.16) and (2.2) implies that $e_{\tau}$ lies inside the image of $\arctan_{e}$ for all $\tau<1$ . So, the image $\arctan_{e}([w\otimes\xi])$ contains an open set of $\{e_{\tau}\}$ where $e_{\tau}\cap e=e^{\prime}$ . ∎

Consider the example summarized in Figure 5, where

[TABLE]

Note that $\tilde{e}$ is outside the open image of $\arctan_{e}$ because (2.2) breaks down as written in this basis. But, $e_{\tau}$ is the family obtained by rotating from $e$ toward $\tilde{e}$ about the axis $e^{\prime}$ through an angle $\arctan(\frac{\tau}{1-\tau})$ , which varies from $0$ to $\frac{\pi}{2}$ . For all $0\leq\tau<1$ , we have

[TABLE]

Thus, the line of rank-1 matrices $[K]$ in $(X/e)\otimes e^{*}$ is written as $[\begin{pmatrix}0&1\end{pmatrix}]$ in this basis. This line is represented by every $e_{\tau}$ in a curve that converges to $\tilde{e}$ as $\tau\to 1$ . Indeed, up to a choice of basis, this is essentially the only example.
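The same computation can be sketched in the smallest possible case, $n=r=1$ . The setup below is an assumption made for illustration and is not the example of Figure 5: take $X=\mathbb{R}^{2}$ with a basis $(u,z)$ that is orthonormal for the standard Euclidean metric (used only to measure the angle $\vartheta$ ), and let $u^{*}$ denote the dual of $u$ on $e$ .

```latex
\begin{aligned}
e &= \langle u\rangle, \qquad \tilde{e}=\langle z\rangle, \qquad e'=e\cap\tilde{e}=0,\\
e_{\tau} &= \big\langle (1-\tau)\,u+\tau\,z\big\rangle
          = \Big\langle u+\tfrac{\tau}{1-\tau}\,z\Big\rangle \qquad (0\le\tau<1),\\
K_{\tau} &= \tfrac{\tau}{1-\tau}\; z\otimes u^{*}, \qquad
[K_{\tau}] = [\,z\otimes u^{*}\,] \quad\text{for all } 0<\tau<1,\\
\vartheta(\tau) &= \arctan\!\Big(\tfrac{\tau}{1-\tau}\Big)\in\big[0,\tfrac{\pi}{2}\big), \qquad
\lim_{\tau\to 1}e_{\tau}=\langle z\rangle=\tilde{e}.
\end{aligned}
```

Here the coefficient $\frac{\tau}{1-\tau}$ is the ordinary tangent of the angle between $e$ and $e_{\tau}$ , so $\tilde{e}$ is reached only in the limit where the tangent is infinite, while the projective class $[K_{\tau}]$ never changes.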

Overall, we have learned that any $k=1$ polar pair in $\operatorname{Gr}_{n}(X)$ is represented by a line of rank-1 matrices in the tangent space, and vice-versa. This is sufficient for our purposes, but those seeking a more detailed understanding of polar pairs are encouraged to investigate Schubert varieties—for example in [Rob12]—and the other outgrowths of Hilbert’s 15th problem.

2(c). The Tautological Bundle

Soon, we will consider algebraic equations defined on $e^{*}$ . To facilitate this, for any $e\in\operatorname{Gr}_{n}(X)$ , we consider the complexified projective space $\mathbb{X}=\mathbb{P}X\otimes\mathbb{C}$ and its subspace $\mathbb{P}e\otimes\mathbb{C}$ . For standard complex projective space, we write $\mathbb{P}^{d}$ for $\mathbb{CP}^{d}=\mathbb{P}(\mathbb{C}^{d+1})$ . That is, $\mathbb{X}\cong\mathbb{P}^{n+r-1}$ , and $\mathbb{P}e\otimes\mathbb{C}\cong\mathbb{P}^{n-1}$ .

If we consider all such spaces across all $e$ simultaneously, we obtain the tautological bundle $\boldsymbol{\gamma}$ over $\operatorname{Gr}_{n}(X)$ (the bundles introduced here are also called universal bundles or canonical bundles; they are analogous to the sheaves $\mathscr{O}(-1)$ and $\mathscr{O}(1)$ , respectively, for varieties in projective space) with fiber

[TABLE]

and its dual bundle $\boldsymbol{\gamma}^{*}$ over $\operatorname{Gr}_{n}(X)$ with fiber

[TABLE]

and its annihilator bundle $\boldsymbol{\gamma}^{\perp}$ over $\operatorname{Gr}_{n}(X)$ with fiber

[TABLE]

and its cokernel bundle $\mathbb{X}/\boldsymbol{\gamma}$ over $\operatorname{Gr}_{n}(X)$ with fiber

[TABLE]

See Figure 7.

There is a dual pair of short exact sequences of projective bundles, analogous to (2.3).

[TABLE]

Hence, the complex projectivized tangent bundle $\mathbb{P}T\operatorname{Gr}_{n}(X)\otimes\mathbb{C}$ is isomorphic (canonically) to $(\mathbb{X}/\boldsymbol{\gamma})\otimes\boldsymbol{\gamma}^{*}$ . If we choose a splitting of these sequences, then we can use the dual bases to establish a (non-canonical) decomposition $\mathbb{P}X\otimes\mathbb{C}\cong\boldsymbol{\gamma}_{e}\oplus(\mathbb{X}/\boldsymbol{\gamma})_{e}$ for any $e$ .

One can also consider frame and coframe bundles. (Some authors might flip the names of the frame and coframe bundles. I tend to choose this notation because the frame bundle is covariant with diffeomorphisms on the base space, and only contravariant objects get a “co-” prefix. The jargon for duality is always frustrating.) Namely, one can consider the frame bundle $\mathcal{F}_{\boldsymbol{\gamma}}$ over $\operatorname{Gr}_{n}(X)$ associated to $\boldsymbol{\gamma}$ , whose fiber is all linear isomorphisms

[TABLE]

and the coframe bundle $\mathcal{F}_{\boldsymbol{\gamma}^{*}}$ over $\operatorname{Gr}_{n}(X)$ associated to $\boldsymbol{\gamma}^{*}$ , whose fiber is all linear isomorphisms

[TABLE]

To write homogeneous complex-algebraic ideals on $\boldsymbol{\gamma}^{*}_{e}$ that vary across $e\in\operatorname{Gr}_{n}(X)$ , one can choose any section $(u_{i})$ of $\mathcal{F}_{\boldsymbol{\gamma}^{*}}$ to give coordinates, and use the ring

[TABLE]

Part II PDEs on Manifolds

In this part, we build bundles whose fibers are the structures seen in Part I. This produces a satisfying language for describing a system of PDEs on a manifold in Section 4.

3. Bundles upon Bundles

If $M$ is a smooth manifold of dimension $m=n+r$ , then we can form the smooth bundle $\operatorname{Gr}_{n}(TM)$ with fiber $\operatorname{Gr}_{n}(T_{p}M)$ . Let $\varpi:\operatorname{Gr}_{n}(TM)\to M$ denote the bundle projection.

Because (2.3) holds for $X=T_{p}M$ at any $p\in M$ , any local section of $\operatorname{Gr}_{n}(TM)$ can be described by choosing its annihilator section of $\operatorname{Gr}_{r}(T^{*}M)$ , and vice-versa. For every $p\in M$ , the Grassmann variety $\operatorname{Gr}_{n}(T_{p}M)$ has a tautological bundle $\boldsymbol{\gamma}(p)$ with fiber $\boldsymbol{\gamma}_{e}(p)=\mathbb{P}e\otimes\mathbb{C}$ , a dual bundle, and so on.

The total space $\operatorname{Gr}_{n}(TM)$ is a manifold in its own right, so we may consider $\boldsymbol{\gamma}$ as a bundle over the manifold $\operatorname{Gr}_{n}(TM)$ , which is itself a bundle over $M$ . In other words, we can reinterpret all of Section 2(c) in terms of bundles over $\operatorname{Gr}_{n}(TM)$ by using $\mathbb{X}$ to denote the projective bundle over $\operatorname{Gr}_{n}(TM)$ that has fiber $\mathbb{X}_{e}=\mathbb{P}T_{p}M\otimes\mathbb{C}$ at $e$ with $\varpi(e)=p$ . A complete description of some $v\in\boldsymbol{\gamma}$ would be $(p,e,v)$ where $v\in\mathbb{P}e\otimes\mathbb{C}$ , and $e\in\operatorname{Gr}_{n}(T_{p}M)$ , and $p\in M$ . A complete description of some $\varphi\in\boldsymbol{\gamma}^{*}$ would be $(p,e,\varphi)$ where $\varphi\in\mathbb{P}e^{*}\otimes\mathbb{C}$ , and $e\in\operatorname{Gr}_{n}(T_{p}M)$ , and $p\in M$ . See Figure 8. The same bundle-wise constructions hold for $\boldsymbol{\gamma}^{\perp}$ , $(\mathbb{X}/\boldsymbol{\gamma})$ , $\mathcal{F}_{\boldsymbol{\gamma}}$ , and $\mathcal{F}_{\boldsymbol{\gamma}^{*}}$ from Section 2(c).

Extending (2.26) to write homogeneous complex-algebraic ideals on $\boldsymbol{\gamma}^{*}_{e}$ that vary across $e\in\operatorname{Gr}_{n}(TM)$ , one can choose any section $(u_{i})$ of $\mathcal{F}_{\boldsymbol{\gamma}^{*}}$ to give coordinates, and use the ring

[TABLE]

3(a). The Contact Ideal

For any $e\in\operatorname{Gr}_{n}(TM)$ , consider its annihilator subspace $e^{\perp}\subset T^{*}_{p}M$ . There is a corresponding subspace $J_{e}\subset T^{*}_{e}\operatorname{Gr}_{n}(TM)$ , defined as

[TABLE]

as in Figure 9. If $(z^{a})$ is a basis of $e^{\perp}$ , then we let $\theta^{a}=z^{a}\circ\varpi_{*}$ for each $a$ to define a basis $(\theta^{a})$ of $J_{e}$ .

In the exterior algebra $\Omega^{\bullet}\left(\operatorname{Gr}_{n}(TM)\right)$ , consider the ideal $\mathcal{J}$ that is generated as $\left\langle J,\mathrm{d}J\right\rangle=\left\langle\theta^{a},\mathrm{d}\theta^{a}\right\rangle$ . This is called the contact ideal, and it is the first example of an EDS as seen in Section 4. Note that, for any (local) section $\epsilon:M\to\operatorname{Gr}_{n}(TM)$ , the contact ideal satisfies the universal reproducing property

[TABLE]

Because this property is universal, the subbundle $J$ is a submodule defined globally across $\operatorname{Gr}_{n}(TM)$ even if topology forces any particular section $\epsilon$ to be defined locally.

If one were to choose local coordinates $(x^{i},y^{a})$ for $M$ and local fiber coordinates $(P^{a}_{i})$ for $\operatorname{Gr}_{n}(TM)$ near a particular $n$ -plane $e=\ker\{\mathrm{d}y^{a}\}$ , then $\mathcal{J}$ is the ideal typically written as

[TABLE]

where the functions $P^{a}_{i}$ depend on $\tilde{e}$ in an open neighborhood of $e$ in $\operatorname{Gr}_{n}(TM)$ .

After reading Section 3(b), compare this coordinate description to your favorite definition of jet space, $\mathbb{J}^{1}(\mathbb{R}^{n},\mathbb{R}^{r})$ . Also, compare the local fiber coordinates $P^{a}_{i}$ to the tangent coordinates $K^{a}_{i}$ from Section 2(a); when restricting to the fiber over a single basepoint $p\in M$ , they are essentially identical. For some highly amusing applications of the contact system, see [Gro86].

3(b). Immersions and Frame Bundles

Fix an immersion $\iota:N\to M$ with $\dim N=n$ . For any $x\in N$ with $\iota(x)=p$ , the push-forward derivative has image $\iota_{*}(T_{x}N)$ , which is an $n$ -dimensional subspace of $T_{p}M$ ; hence, $\iota_{*}(T_{x}N)\in\operatorname{Gr}_{n}(TM)$ . Define the map $\iota^{(1)}:N\to\operatorname{Gr}_{n}(TM)$ by

[TABLE]

and note that $\iota=\varpi\circ\iota^{(1)}$ , so $\iota_{*}=\varpi_{*}\circ\iota^{(1)}_{*}$ .

It is obvious from the definition that $\iota^{(1)}$ is also an immersion. Therefore, we can use it to pull-back the tautological bundle $\boldsymbol{\gamma}^{*}$ as defined in Sections 2(c) and 3. Let $\boldsymbol{\gamma}^{*}_{N}=\iota^{(1)*}\boldsymbol{\gamma}^{*}$ , which has fiber

[TABLE]

that is, $\boldsymbol{\gamma}^{*}_{N}$ is identified with $\mathbb{P}T^{*}N\otimes\mathbb{C}$ via $\iota_{*}$ . See Figure 10.

The immersion $\iota^{(1)}$ is called the prolongation of the immersion $\iota$ .

Now, consider the contact forms $(\theta^{a})=(z^{a}\circ\varpi_{*})$ from Section 3(a). For all $x\in N$ and all $v\in T_{x}N$ , we have

[TABLE]

which ultimately gives the following lemma:

Lemma 3.8.

If $\iota:N\to M$ is an immersion for $\dim N=n$ , then $\iota^{(1)*}(\mathcal{J})=0$ . Conversely, if $\iota^{\prime}:N\to\operatorname{Gr}_{n}(TM)$ is an immersion for $\dim N=n$ satisfying $\iota^{\prime*}(\mathcal{J})=0$ and such that the image $\iota^{\prime}_{*}(T_{x}N)$ is transverse to the fiber $\ker\varpi_{*}$ for all $x\in N$ , then there is some immersion $\iota:N\to M$ such that $\iota^{(1)}=\iota^{\prime}$ .
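A minimal worked instance of Lemma 3.8 may help. It assumes the simplest setting $n=r=1$ , with $M=\mathbb{R}^{2}$ in coordinates $(x,y)$ , a fiber coordinate $P$ on the open set of non-vertical lines in $\operatorname{Gr}_{1}(TM)$ , and the usual coordinate form $\theta=\mathrm{d}y-P\,\mathrm{d}x$ of the contact generator; the functions $f$ , $g$ , and $h$ are hypothetical smooth functions introduced only for this sketch.

```latex
\begin{aligned}
\iota(x) &= \big(x,\,f(x)\big), \qquad
\iota^{(1)}(x) = \big(x,\;f(x),\;P=f'(x)\big),\\
\theta &= \mathrm{d}y - P\,\mathrm{d}x, \qquad
\mathrm{d}\theta = -\,\mathrm{d}P\wedge\mathrm{d}x,\\
\iota^{(1)*}\theta &= f'(x)\,\mathrm{d}x - f'(x)\,\mathrm{d}x = 0, \qquad
\iota^{(1)*}\mathrm{d}\theta = -f''(x)\,\mathrm{d}x\wedge\mathrm{d}x = 0,\\
\iota'(x) &= \big(x,\;g(x),\;h(x)\big)
\;\Longrightarrow\;
\iota'^{*}\theta = \big(g'(x)-h(x)\big)\,\mathrm{d}x .
\end{aligned}
```

In the last line, the curve $\iota^{\prime}$ is transverse to the fiber because it is parametrized by $x$ , and $\iota^{\prime*}\theta=0$ forces $h=g^{\prime}$ . Hence $\iota^{\prime}$ is the prolongation of $x\mapsto(x,g(x))$ , which is the converse direction of the lemma in this special case.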

Moreover, recall that any manifold $N$ of dimension $n$ admits a projective frame bundle $\Pi:\mathcal{F}N\to N$ with fiber

[TABLE]

The total space $\mathcal{F}N$ admits a tautological 1-form $\omega:T\mathcal{F}N\to\mathbb{P}^{n-1}$ (called, in various references, the canonical, the Hilbert, or the soldering 1-form) defined by $\omega^{i}_{u}=u^{i}\circ\Pi_{*}$ as in Figure 11. It is characterized by its universal reproducing property. For any (local) section $\eta:N\to\mathcal{F}N$ ,

[TABLE]

or, more succinctly, $\eta^{*}(\omega)=\eta$ .

Because this property is universal, the 1-form $\omega$ is defined globally across $\mathcal{F}N$ even if topology forces any particular 1-form $\eta$ to be defined locally.

For any local diffeomorphism $f:N\to\tilde{N}$ , there is an induced (covariant) map on the frame bundles $f^{\dagger}:\mathcal{F}N\to\mathcal{F}\tilde{N}$ by $f^{\dagger}:(u^{i})\mapsto(u^{i})\circ(f_{*})^{-1}$ . Using the universal property, it is easy to prove this lemma, which shows that diffeomorphisms are characterized by their preservation of the tautological form on the frame bundle:

Lemma 3.11.

If $f:N\to\tilde{N}$ is a diffeomorphism, then $(f^{\dagger})^{*}(\tilde{\omega})=\omega$ . Conversely, if $F:\mathcal{F}N\to\mathcal{F}\tilde{N}$ is a $PGL(n)$ -equivariant diffeomorphism such that $F^{*}(\tilde{\omega})=\omega$ , then there exists a unique diffeomorphism $f:N\to\tilde{N}$ such that $f^{\dagger}=F$ .

Combining the universal properties of $\mathcal{J}$ and $\omega$ , we obtain the following theorem telling us what information we can transfer from $\operatorname{Gr}_{n}(TM)$ to an immersed submanifold:

Theorem 3.12.

If $\iota:N\to M$ is a smooth immersion, then

•

$\iota^{(1)*}(\mathcal{J})=0$ , and

•

$\mathcal{F}N=\iota^{(1)*}(\mathcal{F}_{\boldsymbol{\gamma}})$ .

Conversely, if $\iota^{\prime}:N\to\operatorname{Gr}_{n}(TM)$ is a smooth immersion such that

•

$\iota^{\prime*}(\mathcal{J})=0$ , and

•

$\mathcal{F}N=\iota^{\prime*}(\mathcal{F}_{\boldsymbol{\gamma}})$ ,

then there exists a smooth immersion $\iota:N\to M$ such that $\iota^{(1)}=\iota^{\prime}$ .

That is, an immersed submanifold satisfies the contact ideal, which is generated differentially by some annihilator 1-forms $(\theta^{a})$ spanning $\boldsymbol{\gamma}^{\perp}$ , and its frame bundle is equipped with tautological 1-forms $(\omega^{i})$ spanning $\boldsymbol{\gamma}^{*}$ .

Remark 3.13.

Note the similarity between the universal property of the contact ideal on the Grassmann bundle and the universal property of the tautological 1-form on the frame bundle. Exploitation of this interaction as in Theorem 3.12 has a long and interesting history.

For example, consider the study of a Lie pseudogroup acting on a manifold $M$ . One option is to differentiate the coordinates of $M$ repeatedly using the contact ideal until differential syzygies of the Lie pseudogroup action can be found in prolonged local coordinates, which are then converted to a coordinate-free description using the pseudogroup action. The other option is to work on the frame bundle of $M$ immediately, where any expression on the tautological 1-form is automatically invariant, then prolong as necessary to reveal the syzygies. The latter is used often when the Lie pseudogroup arises as equivalence of intrinsic $G$ -structures, and the former is used often when the Lie pseudogroup arises from an extrinsic action on some ambient coordinates. For more on these fascinating and interconnected ideas, I encourage you to read [Cle17], [Olv95], [Val13], and [Gar89]—and the collected works of E. Cartan.

4. Exterior Differential Systems

Let $M$ be a smooth manifold of finite dimension $m$ . An exterior differential system [EDS] on $M$ consists of an ideal $\mathcal{I}$ in the total exterior algebra $\Omega^{\bullet}(M)$ that is differentially closed and finitely generated. Differentially closed means that $\mathrm{d}\mathcal{I}\subset\mathcal{I}$ . Finitely generated means that in each degree $d$ , the $d$ -forms in the ideal, $\mathcal{I}_{d}=\mathcal{I}\cap\Omega^{d}(M)$ , form a finitely generated $C^{\infty}(M)$ -module. We assume that $\mathcal{I}_{0}=0$ ; otherwise, one would restrict to a subvariety of $M$ defined by those functions. A solution or integral manifold is an immersed manifold $\iota:N\to M$ such that $\iota^{*}(\mathcal{I})=0$ . Optionally, we sometimes specify an independence condition as an $n$ -form $\boldsymbol{\omega}\in\Omega^{n}(M)$ that is not allowed to vanish on solutions. When an EDS represents a system of PDEs in local coordinates $x^{1},\ldots,x^{n}$ , then $\boldsymbol{\omega}=\mathrm{d}x^{1}\wedge\cdots\wedge\mathrm{d}x^{n}$ , meaning that we seek solutions $N$ on which those coordinates are sensible, and $\iota:N\to M$ is a function that gives the dependent variables in $M$ (those transverse to $\iota(N)\subset M$ ) as functions of the independent variables in $N$ .

Remark 4.1.

Exterior differential systems are defined this way because the term “PDE” or “system of PDEs” is difficult to pin down with geometric precision. Colloquially, “system of PDEs” usually means a finite set of (hopefully, smooth) equations on some local jet space. In Section 3, we explored the geometry of the bundle $\operatorname{Gr}_{n}(TM)$ ; recall that the contact system $\mathcal{J}$ on $\operatorname{Gr}_{n}(TM)$ provides a coordinate-invariant notion of jet space. So, a system of PDEs can be thought of as a collection of equations on the jet space $\operatorname{Gr}_{n}(TM)$ . Hopefully, those equations are smooth and respect the bundle structure coming from the contact system (otherwise, derivatives misbehave). By virtue of the Plücker embedding $\operatorname{Gr}_{n}(TM)\to\mathbb{P}\wedge^{n}(TM)$ , an EDS provides precisely the structure to write an ideal whose variety is a subvariety (in the bundle sense) of $\operatorname{Gr}_{n}(TM)$ . By taking smooth subvarieties, we can apply Remark 2.7 and apply our knowledge of tableaux from Part I to study EDS. Even by this definition, an EDS could be rather wild; however, in many practical applications, it happens that $\mathcal{I}$ is generated by a finite collection of smooth differential forms of homogeneous degree, so one obtains a smooth algebraic variety in local fiber coordinates of $\operatorname{Gr}_{n}(TM)$ . See [McK18] for more examples, additional insight, and historical context.

4(a). Differential Ideals and Integral Elements

To be precise, an integral element of $\mathcal{I}$ at $p\in M$ is a linear subspace $e\subset T_{p}M$ such that $\varphi|_{e}=0$ for all $\varphi\in\mathcal{I}_{n}$ . That is, the $n$ -forms in $\mathcal{I}$ provide a collection of functions that cut out a variety, $\operatorname{Var}_{n}(\mathcal{I})\subset\operatorname{Gr}_{n}(TM)$ . These functions vary smoothly in $M$ and are homogeneous in the fiber variables.
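The definition can be tested by hand on a familiar example. The sketch below is our own illustration, not taken from the text: it assumes the contact form $\theta=\mathrm{d}z-p\,\mathrm{d}x-q\,\mathrm{d}y$ on $\mathbb{R}^{5}$ with coordinates $(x,y,z,p,q)$, stores forms by their coefficients at a single point, and uses hypothetical helper names. For the ideal generated by $\theta$ and $\mathrm{d}\theta$, a 2-plane is an integral element exactly when $\theta$ and $\mathrm{d}\theta$ both vanish on it.

```python
# Hypothetical illustration: forms are stored by their coefficients at one point.
# Coordinates on R^5 are ordered (x, y, z, p, q).

def one_form(theta, v):
    """Evaluate a 1-form (coefficient list) on a tangent vector."""
    return sum(t * c for t, c in zip(theta, v))

def two_form(omega, v1, v2):
    """Evaluate a 2-form, stored as a skew-symmetric matrix, on a pair of vectors."""
    return sum(omega[i][j] * v1[i] * v2[j]
               for i in range(len(v1)) for j in range(len(v2)))

def is_integral_plane(theta, dtheta, v1, v2):
    """The plane span(v1, v2) is integral when theta and d(theta) vanish on it."""
    return (one_form(theta, v1) == 0 and one_form(theta, v2) == 0
            and two_form(dtheta, v1, v2) == 0)

def contact_forms(p, q):
    """theta = dz - p dx - q dy and d(theta) = dx^dp + dy^dq at fiber values p, q."""
    theta = [-p, -q, 1, 0, 0]
    dtheta = [[0] * 5 for _ in range(5)]
    dtheta[0][3], dtheta[3][0] = 1, -1   # dx ^ dp
    dtheta[1][4], dtheta[4][1] = 1, -1   # dy ^ dq
    return theta, dtheta

def plane(p, q, r, s1, s2, t):
    """Plane with dz = p dx + q dy, dp = r dx + s2 dy, dq = s1 dx + t dy."""
    v1 = [1, 0, p, r, s1]   # lifts d/dx
    v2 = [0, 1, q, s2, t]   # lifts d/dy
    return v1, v2

theta, dtheta = contact_forms(p=2, q=3)
symmetric = is_integral_plane(theta, dtheta, *plane(2, 3, r=1, s1=5, s2=5, t=7))
asymmetric = is_integral_plane(theta, dtheta, *plane(2, 3, r=1, s1=5, s2=4, t=7))
print(symmetric, asymmetric)
```

In this example $\mathrm{d}\theta$ evaluates to $s_{2}-s_{1}$ on the plane, so the integral elements on which $\mathrm{d}x\wedge\mathrm{d}y\neq 0$ are exactly those with symmetric mixed second derivatives.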

There is a maximal dimension $n$ for which $\operatorname{Var}_{n}(\mathcal{I})$ is locally non-empty, which is the case of interest. If an independence condition $\boldsymbol{\omega}$ is specified, we also require $\boldsymbol{\omega}|_{e}\neq 0$ , which forces $\operatorname{Var}_{n}(\mathcal{I})$ to lie in the open subset of $\operatorname{Gr}_{n}(TM)$ for which that condition holds. (For example, in the case of the contact system, the condition $\boldsymbol{\omega}=\mathrm{d}x^{1}\wedge\cdots\wedge\mathrm{d}x^{n}\neq 0$ holds in the same neighborhood where (3.4) makes sense.)

Because $\mathcal{I}_{n}$ is finitely generated by smooth functions, Sard’s theorem guarantees an open, dense subset $\operatorname{Var}_{n}^{o}(\mathcal{I})\subset\operatorname{Var}_{n}(\mathcal{I})$ defined as the smooth subbundle of $\operatorname{Gr}_{n}(TM)$ that is cut out smoothly by smooth functions.

Definition 4.2 (Kähler-ordinary).

Integral elements in $\operatorname{Var}_{n}^{o}(\mathcal{I})$ are called Kähler-ordinary.

A single connected component of $\operatorname{Var}_{n}^{o}(\mathcal{I})$ is denoted $M^{(1)}$ . We allow ourselves to redefine $M$ so that $\varpi:M^{(1)}\to M$ is a smooth bundle.

Let $s$ denote the dimension of each fiber of the projection $M^{(1)}\to M$, so $t=nr-s$ is the corresponding codimension of $T_{e}M^{(1)}_{p}$ in $T_{e}\operatorname{Gr}_{n}(T_{p}M)$. That is, the projective bundle $A=\ker\varpi_{*}\subset TM^{(1)}\subset T\operatorname{Gr}_{n}(TM)$ is a tableau in the sense of Remark 2.7, as each fiber $A_{e}=T_{e}M^{(1)}_{p}$ is a linear subspace of $T_{e}\operatorname{Gr}_{n}(T_{p}M)$. Because $M^{(1)}$ is a smooth manifold, we have

Lemma 4.3.

$K\in A_{e}$ implies $\arctan_{e}(K)\in M^{(1)}$.

That is, we have a well-defined vector bundle $A=\ker\varpi_{*}\subset TM^{(1)}$ over $M^{(1)}$ .

Definition 4.4 (Kähler-regular).

If $e$ is a Kähler-ordinary integral element and the Cartan characters of each tableau $A$ are constant in an open neighborhood of $e$ , then $e$ is called Kähler-regular.

That is, the Kähler-regular integral elements form a dense open subset of the Kähler-ordinary integral elements, which form a dense open subset of the whole variety $\operatorname{Var}_{n}(\mathcal{I})$ of integral elements.

So that we may apply the results of Section 1 without treating the Cartan characters of $A_{e}$ as functions of $e$ , we redefine $M^{(1)}$ to be a single connected component of Kähler-regular integral elements, and we again allow ourselves to redefine $M$ so that $\varpi:M^{(1)}\to M$ is a smooth bundle.

Such $M^{(1)}$ is called the first prolongation of $(M,\mathcal{I})$ , though it is clear from the definition that there could be multiple first prolongations, depending on which components of $\operatorname{Var}_{n}(\mathcal{I})$ are under consideration.

To generalize the notation and results of Part I to $M^{(1)}$ , define the restricted tautological bundles

[TABLE]

Warning! These are now complex projective bundles, not vector spaces as in Section 1! Sometimes, it is convenient to think of $A=\ker\varpi_{*}\subset TM^{(1)}$ as being a complex projective bundle, too, in which case we consider it to be a subbundle of the projective bundle $W\otimes V^{*}$. Of course, the notation has been developed to be consistent regardless.

An integral manifold of $\mathcal{I}$ is an immersion $\iota:N\to M$ such that $\iota^{*}(\varphi)=0$ for all $\varphi\in\mathcal{I}$ . (If an independence condition $\boldsymbol{\omega}$ is specified, we require that $\iota^{*}(\boldsymbol{\omega})\neq 0$ , too.) When we are considering a particular Kähler-regular component $M^{(1)}\subset\operatorname{Var}_{n}(\mathcal{I})$ as above, we say $N$ is an ordinary integral manifold provided that $\iota_{*}(TN)\subset M^{(1)}$ . All of the observations from Section 3(b) apply, but $\iota^{(1)}(N)$ lies in the submanifold $M^{(1)}$ , and $\iota^{(1)}_{*}(TN)$ lies in the subbundle $A$ . The overall goal is to construct all ordinary integral manifolds of $(M,\mathcal{I})$ through the careful study of the geometry of a Kähler-regular first prolongation $M^{(1)}$ .

4(b). Prolongation and Spencer Cohomology

Suppose that $\iota:N\to M$ is an ordinary integral manifold of $\mathcal{I}$ . By Theorem 3.12, the 1-forms $\theta^{a}$ spanning $J_{e}$ must vanish for each $e\in\iota^{(1)}(N)$ . The tautological form $(\omega^{i})$ on $\mathcal{F}_{\boldsymbol{\gamma}}$ pulls back to a nondegenerate frame $(\eta^{i})$ on $N$ , since $\iota^{(1)}$ is an immersion.

Therefore, if $\iota^{(1)}:N\to M^{(1)}$ actually exists, we have

[TABLE]

However, working on the frame bundle of $M^{(1)}$ , these forms satisfy a more general equation, called Cartan’s structure equation:

[TABLE]

The derivative of $\theta^{a}$ must take this form, because $\theta^{a}$ and $\omega^{i}$ are semi-basic with respect to the bundle $\varpi:M^{(1)}\to M$ , whereas $\pi^{a}_{i}\in A$ is vertical, so $\mathrm{d}\theta^{a}$ cannot involve a totally vertical 2-form. See discussion of connections and principal bundles in [KN63].

Let us now describe the meaning of each of the terms in (4.7), with respect to the ordinary integral manifold $\iota:N\to M$ . Using the dual coframe $z_{a}\leftrightarrow\theta^{a}$ for $W\leftrightarrow V^{\perp}$ , we can see that $\pi=\pi^{a}_{i}(z_{a}\otimes\omega^{i})$ lies in $A$ . (Hence, it is called the tableau term.) In particular, it must be that

[TABLE]

for some function $P^{a}_{i,j}$ that must satisfy $P^{a}_{i,j}\eta^{i}\wedge\eta^{j}=0$ , so $P^{a}_{i,j}=P^{a}_{j,i}$ . That is, the homomorphism $P$ lies in the fiber of $W\otimes(V^{*}\otimes V^{*})$ over $e$ , as

[TABLE]

Moreover, the existence of an immersion $\iota^{(1)}:N\to M^{(1)}$ requires that the torsion term $w_{a}T^{a}_{i,j}\,\omega^{i}\wedge\omega^{j}$ can be removed in (4.7); otherwise, it cannot be that $\iota^{(1)*}\mathrm{d}\theta^{a}=0$ as required. That is, it must be possible to rewrite $\pi^{a}_{i}\mapsto\pi^{a}_{i}+Q^{a}_{i,j}\omega^{j}$ for $Q\in A\otimes V^{*}$ such that any $T^{a}_{i,j}$ term is absorbed. Note that this absorption of torsion is an algebraic property of the tableau $A$. In summary, we have Lemma 4.10.

Lemma 4.10.

Let $\delta:A\otimes V^{*}\to W\otimes\wedge^{2}V^{*}$ denote the composition of skewing $\otimes^{2}V^{*}\to\wedge^{2}V^{*}$ and inclusion $A\to W\otimes V^{*}$ , and write $A^{(1)}=\ker\delta$ and $H^{2}(A)=\operatorname{coker}\delta$ :

[TABLE]

For any ordinary integral manifold $N$ , the homomorphism $P$ of (4.8) and (4.9) lies in $A^{(1)}$ , and the pullback of torsion $T$ is zero in $H^{2}(A)$ .

Writing $\delta$ in a chosen coframe, it is easy to check that

[TABLE]

The case of equality is considered in Section 5.
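Because $\delta$ is a linear map between finite-dimensional spaces, the dimensions of $A^{(1)}=\ker\delta$ and $H^{2}(A)=\operatorname{coker}\delta$ can be computed by exact row reduction. The following is a minimal sketch under stated assumptions, not the author's code: it uses the $3\times 3$ wave-equation tableau whose relations are listed in Section 6(c), it assumes the given column order is already generic when reading off Cartan characters, it compares $\dim A^{(1)}$ with the usual Cartan bound $s_{1}+2s_{2}+\cdots+ns_{n}$ (which is presumably what (4.12) records), and all function names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations

def rank(rows):
    """Exact rank of a matrix, given as a list of equal-length rows."""
    m = [[Fraction(x) for x in row] for row in rows]
    rk = 0
    for c in range(len(m[0]) if m else 0):
        piv = next((i for i in range(rk, len(m)) if m[i][c] != 0), None)
        if piv is None:
            continue
        m[rk], m[piv] = m[piv], m[rk]
        for i in range(len(m)):
            if i != rk and m[i][c] != 0:
                f = m[i][c] / m[rk][c]
                m[i] = [x - f * y for x, y in zip(m[i], m[rk])]
        rk += 1
    return rk

def cartan_characters(basis, r, n):
    """Characters from ranks of the first k columns, ASSUMING a generic column order."""
    ranks = [0]
    for k in range(1, n + 1):
        ranks.append(rank([[K[a][i] for i in range(k) for a in range(r)]
                           for K in basis]))
    return [ranks[k] - ranks[k - 1] for k in range(1, n + 1)]

def delta_rank(basis, r, n):
    """Rank of delta on A (x) V*: Q^a_{i,j} -> Q^a_{i,k} - Q^a_{k,i} for i < k."""
    pairs = list(combinations(range(n), 2))
    images = []
    for K in basis:            # basis element of A
        for j in range(n):     # Q = K (x) u^j, so Q^a_{i,k} = K[a][i] when k == j
            images.append([(K[a][i] if k == j else 0) - (K[a][k] if i == j else 0)
                           for a in range(r) for (i, k) in pairs])
    return rank(images)

# Symmetric 3x3 matrices with pi^3_3 = pi^1_1 + pi^2_2 (relations of Section 6(c)).
WAVE = [
    [[1, 0, 0], [0, 0, 0], [0, 0, 1]],   # pi^1_1
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],   # pi^1_2
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],   # pi^1_3
    [[0, 0, 0], [0, 1, 0], [0, 0, 1]],   # pi^2_2
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],   # pi^2_3
]
r = n = 3
s = cartan_characters(WAVE, r, n)
dim_A = rank([[x for row in K for x in row] for K in WAVE])
dim_A1 = dim_A * n - delta_rank(WAVE, r, n)               # dim ker(delta)
dim_H2 = r * (n * (n - 1) // 2) - delta_rank(WAVE, r, n)  # dim coker(delta)
cartan_bound = sum((k + 1) * s_k for k, s_k in enumerate(s))
print(s, dim_A1, cartan_bound, dim_H2)
```

The formula for `dim_A1` relies on the listed matrices being linearly independent, which `dim_A` confirms. For a tableau whose natural coordinates are not generic, the columns would first need a change of basis of $V$ before the characters are read off.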

The exterior differential system $\mathcal{I}^{(1)}$ on $M^{(1)}$ generated as

[TABLE]

is called the (first) prolongation of $(M,\mathcal{I})$ , and we are back where we started at the beginning of Section 4. We can construct $M^{(2)}\subset\operatorname{Gr}_{n}(TM^{(1)})$ , and repeat the entire process for $E\in M^{(2)}$ over $e\in M^{(1)}$ that was used for $e\in M^{(1)}$ over $p\in M$ . Lemma 4.10 essentially says that $A^{(1)}$ is the tableau bundle $TM^{(2)}\subset T\operatorname{Gr}_{n}(TM^{(1)})$ . Thus, we can construct $M^{(3)}$ over $M^{(2)}$ and re-apply Lemma 4.10, and so on. By the definition of $M^{(1)}$ and (4.13), we have

Corollary 4.14.

Every ordinary integral manifold $N$ of $(M^{(1)},\mathcal{I}^{(1)})$ is also an ordinary integral manifold of $(M,\mathcal{I})$ . However, the converse might fail, as the smooth connected locus of $M^{(1)}$ may be a strict subset of $\operatorname{Var}_{n}(\mathcal{I})$ .

Overall, we achieve exact sequences that summarize the entire situation of the tangent spaces of an immersed ordinary integral manifold $N$ of $\mathcal{I}$ , $\mathcal{I}^{(1)}$ , $\mathcal{I}^{(2)}$ , $\mathcal{I}^{(3)}$ , …

[TABLE]

The cokernels $H^{1}(A)$ , $H^{2}(A)$ , …, $H^{n}(A)$ are the Spencer cohomology of the tableau $A$ . Even outside the context of exterior differential systems, they are defined for formal tableaux $A\subset W\otimes V^{*}$ via the exact sequences (4.15) as

[TABLE]

Spencer cohomology detects functional obstructions to the solution of the initial-value problem on $M^{(k)}$ in the form of torsion; this is explained nicely in [IL03, Section 5.6], and the reader is urged to read their presentation.

Spencer cohomology was a major focus of the formal study of partial differential equations and Lie pseudogroups in the mid-20th century; most notably, [Spe62, Qui64, SS65, GQS66, Gol67, Gar67, Gui68, GK68, GQS70]. As it happens, many of the major results of that era are easy to re-prove under our regularity assumptions on $M^{(1)}$ and using the endovolutive notation from Section 1, particularly when using the involutivity criteria in Section 5 that were detailed in [Smi15]. We demonstrate this in Parts III and IV.

5. Involutivity of Exterior Differential Systems

Definition 5.1 (Cartan’s test).

A tableau $A$ is called involutive if equality holds in Equation (4.12),

[TABLE]

Definition 5.2.

A tableau $A$ is called formally integrable if $H^{k}(A)=0$ for all $k\geq 2$ .

Cartan’s test comes from the following consequence of the Cartan–Kähler theorem. (See [BCG+90, Chapter III] or [IL03] for more background on the Cartan–Kähler theorem; it is not our focus here.)

Theorem 5.3.

Suppose that $(M,\mathcal{I})$ is an analytic exterior differential system, that $M^{(1)}$ is a smooth sub-bundle, and that the tableau bundle $A$ of $r\times n$ homomorphisms has constant Cartan characters $(s_{1},s_{2},\ldots,s_{\ell})$ over $M^{(1)}$ (that is, $M^{(1)}$ is Kähler-regular). If $A$ is involutive and formally integrable, then through any point in $M$, there is an analytic ordinary integral manifold $\iota:N\to M$. Moreover, such $N$ are parametrized locally by $r$ constants, $s_{1}$ functions of 1 variable, $s_{2}$ functions of 2 variables, …, $s_{\ell}$ functions of $\ell$ variables.

Somewhat confusingly, the situation in Theorem 5.3 is called involutivity of $(M,\mathcal{I})$; that is, an EDS might fail to be involutive even if its tableau is involutive, because there may be nonzero torsion in $H^{k}(A)$, meaning that $\mathcal{I}$ fails to be formally integrable. This means essentially that the ideal $\mathcal{I}$ is being studied on the wrong manifold.

For a beautiful interpretation of Cartan’s test that is relevant to the later Sections of this course, read the introduction of [Yan87]. In summary, ordinary integral manifolds are constructed by decomposing the Cauchy problem into a sequence of steps, each of which is determined and has solutions using the Cauchy–Kowalevski theorem.

For fixed spaces $W$ and $V^{*}$, involutivity is a closed algebraic condition on tableaux in $W\otimes V^{*}$. Because the conditions come from Cartan’s test, which involves $W\otimes\wedge^{2}V^{*}$, it is not surprising that these conditions are quadratic; however, writing down the precise ideal is a lengthy argument. Doing so was suggested in [BCG+90, Chapter IV, §5] and accomplished for general tableaux in [Smi15] following the outline in [Yan87].

Theorem 5.4 (Involutivity Criteria).

Suppose a tableau is given in generic bases as in (1.14). The tableau is involutive if and only if there exists a basis of $W$ such that

(i) $\operatorname{B}^{\lambda}_{i}$ is endovolutive in that basis, and

(ii) $\left(\operatorname{B}^{\lambda}_{l}\operatorname{B}^{\mu}_{k}-\operatorname{B}^{\lambda}_{k}\operatorname{B}^{\mu}_{l}\right)^{a}_{b}=0$ for all $\lambda<l<k$ and $\lambda\leq\mu<k$ and all $a>s_{l}$.

This theorem is our main computational tool in Part III.

5(a). Moduli of Involutive Tableaux

While it seems like a trivial (if lengthy) computation, consider carefully the meaning of Theorem 5.4: We can fix $r$, $n$, and Cartan characters $s_{1},\ldots,s_{n}$ and then write down an explicit ideal in coordinates whose variety is all of the involutive tableaux with those Cartan characters. Hence, we can use computer algebra systems such as Macaulay2, Magma, and Sage to decompose and analyze that ideal using Gröbner basis techniques. With enough computer memory, we can answer the question “What is the moduli of involutive tableaux?” By virtue of Theorem 5.3, this is fairly close to answering the question “What is the moduli of involutive PDEs?”

For example, fix $r=n=3$ and $(s_{1},s_{2},s_{3})=(3,2,0)$ . For some coefficients $x_{0},\ldots,x_{15}$ in the ring $S$ , an endovolutive tableau must be of the form

[TABLE]

or in block form like (1.20),

[TABLE]

Involutivity is an affine quadratic ideal $\mathscr{G}$ on $\mathbb{C}[x_{0},\ldots,x_{15}]$ generated by the last rows of $\operatorname{B}^{1}_{2}\operatorname{B}^{1}_{3}-\operatorname{B}^{1}_{3}\operatorname{B}^{1}_{2}$ and $\operatorname{B}^{1}_{2}\operatorname{B}^{2}_{3}-\operatorname{B}^{1}_{3}\operatorname{B}^{2}_{2}$, so:

[TABLE]

The complete primary decomposition of this ideal reveals two components. The maximal component has dimension 12, and it is described by the fairly boring prime ideal $\{x_{0},x_{1},x_{5},x_{8}\}$ . The other component has dimension 11 and its prime ideal is generated by 27 polynomials. See http://goo.gl/jGTnMU for how to compute this in SageMathCell.

Many of your favorite involutive second-order scalar PDEs in three independent variables live somewhere in this variety; see (1.15) and Section 6(c). Up to some notion of equivalence, this is essentially the moduli space of such equations. As seen in Part III, their characteristic varieties are obtained by combining $\mathscr{G}$ with the rank-1 ideal $\mathscr{R}$ on $\mathbb{C}[x_{0},\ldots,x_{15},a_{0},\ldots,a_{4}]$ .

However, there is still some ambiguity to be resolved, as it may be that a given abstract tableau admits several endovolutive bases with apparently distinct coordinate descriptions.

5(b). Cauchy retractions

Before proceeding to Part III, it is worthwhile to mention Cauchy retractions, which are much simpler than (and quite distinct from) elements of the characteristic variety. To confuse matters, many references call these “Cauchy characteristics.” For any differentially closed ideal $\mathcal{I}\subset\Omega^{\bullet}(M)$, the Cauchy retractions are the vectors that preserve $\mathcal{I}$; that is, $\mathfrak{g}=\{v\in TM:v\lrcorner\mathcal{I}\subset\mathcal{I}\}$. Because $\mathcal{I}$ is differentially closed, the annihilator bundle $\mathfrak{g}^{\perp}\subset T^{*}M$ is the smallest Frobenius ideal in $\Omega^{\bullet}(M)$ that contains $\mathcal{I}$. Then, for any integral manifold $\iota:N\to M$, the subspaces $\mathfrak{g}\cap\iota^{(1)}(N)$ form an integrable distribution; that is, $\mathfrak{g}^{\perp}_{N}$ is Frobenius as well [Gar67].
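As a small worked example of this definition (our own illustration, not taken from the references; the coordinates $(x,y,z,w)$ and the form $\theta$ are assumptions chosen for simplicity), consider the contact form of $\mathbb{R}^{3}$ placed on $\mathbb{R}^{4}$ with one superfluous coordinate $w$:

```latex
\begin{aligned}
&M=\mathbb{R}^{4}\ni(x,y,z,w),\qquad \theta=\mathrm{d}z-y\,\mathrm{d}x,\qquad
\mathrm{d}\theta=\mathrm{d}x\wedge\mathrm{d}y,\qquad
\mathcal{I}=\langle\theta,\ \mathrm{d}\theta\rangle,\\
&v=v^{x}\partial_{x}+v^{y}\partial_{y}+v^{z}\partial_{z}+v^{w}\partial_{w}:\\
&\qquad v\lrcorner\theta=v^{z}-y\,v^{x}\in\mathcal{I}_{0}=0
 \ \Longrightarrow\ v^{z}=y\,v^{x},\\
&\qquad v\lrcorner\mathrm{d}\theta=v^{x}\,\mathrm{d}y-v^{y}\,\mathrm{d}x\in\mathcal{I}_{1}=C^{\infty}(M)\,\theta
 \ \Longrightarrow\ v^{x}=v^{y}=0,\\
&\mathfrak{g}=\langle\partial_{w}\rangle,\qquad
\mathfrak{g}^{\perp}=\langle\mathrm{d}x,\ \mathrm{d}y,\ \mathrm{d}z\rangle\ni\theta .
\end{aligned}
```

Here $\mathfrak{g}^{\perp}$ is Frobenius because it is spanned by exact forms, and discarding the $w$-direction recovers the contact system on $\mathbb{R}^{3}$, which has no Cauchy retractions.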

Because $\mathfrak{g}^{\perp}$ is a Frobenius system—a system of ODEs—it is common to redefine $(M,\mathcal{I})$ so that it is free of Cauchy retractions before proceeding to study its integral manifolds. The distinction between $\mathfrak{g}^{\perp}$ and the characteristic variety $\Xi$ is explored further in [Smi14].

Part III Characteristic and Rank-one Varieties

Thank you for taking the time to read the enormous amount of background in Parts I and II. We are ready to define and deconstruct a fascinating mathematical object that lies at the heart of PDE theory.

Here we stand: We have an exterior differential system $\mathcal{I}$ on $M$ . Perhaps this EDS arose from a system of PDEs on $M$ and is equipped with an independence condition $\boldsymbol{\omega}$ . The EDS yields a smooth Kähler-regular subbundle $M^{(1)}\subset\operatorname{Gr}_{n}(TM)$ , where any $e\in M^{(1)}$ is an integral element of the original EDS. As a manifold in its own right, $M^{(1)}$ is equipped with tautological bundles $V$ , $V^{*}$ , $W$ , and $A$ from (4.5). Moreover, $A$ is a subbundle of $W\otimes V^{*}$ , so it is a tableau bundle. Its symbol $\sigma$ gives a short-exact sequence of bundles,

[TABLE]

An integral manifold is an immersion $\iota:N\to M$ such that $\iota_{*}(T_{x}N)\in M^{(1)}_{\iota(x)}$ for all $x\in N$ . Let $\iota^{(1)}:N\to M^{(1)}$ denote the map $x\mapsto e=\iota_{*}(T_{x}N)$ .

As you read this part, compare it to [IL03, Section 4.6] and [BCG+90, Chapter V]. The reader will note that we do not assume that $\mathcal{I}$ is a linear Pfaffian system, nor do we build a prolonged EDS $\mathcal{I}^{(1)}$ using the contact system. Instead we are working with the tautological bundles per Remark 3.13.

6. The Characteristic Variety

The original motivation for the characteristic variety is to see where the initial-value problem becomes ambiguous. That is, given an initial condition for our PDE on a local submanifold of dimension $n{-}1$ , when would the $n$ -dimensional solutions for that initial condition fail to be unique? We express this condition in terms of integral elements.

6(a). via Polar Extension

For an integral element $e^{\prime}\in\operatorname{Var}_{n-1}(\mathcal{I})$, we consider its space of integral extensions, called the polar space (it is a vector space thanks to the assumption that $\mathcal{I}_{n}$ is a finitely generated $C^{\infty}(M)$-module, because that assumption implies that the polar equations over $p\in M$ are a linear subspace of $T_{p}^{*}M$),

[TABLE]

and the polar equations comprise its annihilator,

[TABLE]

The polar rank is $r(e^{\prime})=\dim H(e^{\prime})-\dim e^{\prime}-1$ . If $r(e^{\prime})=-1$ , then $e^{\prime}$ admits no extensions. If $r(e^{\prime})=0$ , then $e^{\prime}$ admits a unique extension to some $e\in\operatorname{Var}_{n}(\mathcal{I})$ .

The case of interest is $r(e^{\prime})>0$ , meaning that $e^{\prime}$ admits many extensions, so the initial-value problem from $e^{\prime}$ to $e=e^{\prime}+\left\langle v\right\rangle$ is ambiguous. For any $e\in M^{(1)}$ , we can identify a hyperplane $e^{\prime}\in\operatorname{Gr}_{n-1}(e)$ with $\xi\in\mathbb{P}e^{*}$ via $e^{\prime}=\ker\xi$ . Because $e\in M^{(1)}\subset\operatorname{Gr}_{n}(TM)$ where $n$ is the maximal dimension of integral elements of $\mathcal{I}$ , the function $r$ cannot be positive on an open set of $\mathbb{P}e^{*}$ , so the case $r(e^{\prime})>0$ is a closed condition. Moreover, the function $r:\mathbb{P}e^{*}\to\mathbb{N}$ is the rank of a linear system of equations, so it defines a Zariski-closed projective algebraic variety. We choose to study that algebraic variety projectively over $\mathbb{C}$ . Hence, the typical definition of the characteristic variety of $e$ is

[TABLE]

This initial definition is refined in Section 6(b) to produce a scheme. To study properly the ambiguity of the initial-value problem, we want to assign a multiplicity to each $\xi\in\Xi_{e}$ and decompose $\Xi$ into irreducible components based on the structure of the space $H(\xi^{\perp})$ .

6(b). via Rank-one Incidence

For both computational and theoretical purposes, it would be convenient to tie the polar space $H(e^{\prime})$ to the geometry of the tableau $A_{e}$ of an extension $e$ of $e^{\prime}$ . The discussion of polar pairs in Section 2(b) links these two objects, to provide another interpretation of the initial-value problem that is much more convenient than (6.3).

Fix $e\in M^{(1)}$ , and suppose that both $e$ and $\tilde{e}$ are integral extensions of $e^{\prime}=\ker\xi$ for some $\xi\in e^{*}$ . By the definition of $H(e^{\prime})$ , it must be that $\tilde{e}$ lies in $\operatorname{Var}_{n}(\mathcal{I})\cap\operatorname{Pol}_{1}(e)$ , but we do not know whether $\tilde{e}$ lies in the particular maximal smooth component of $\operatorname{Var}_{n}(\mathcal{I})$ that we call $M^{(1)}$ . However, the results of Section 2(b) guarantee that $\tilde{e}$ is detected by $A_{e}$ even if $\tilde{e}$ is not in $M^{(1)}$ , in the following way.

Lemma 6.4.

Fix $e\in M^{(1)}$ , and suppose that both $e$ and $\tilde{e}$ are integral extensions of $e^{\prime}=\ker\xi$ for some $\xi\in e^{*}$ . Let $w$ be such that $\tilde{e}=e^{\prime}+\left\langle w\right\rangle$ . Then $w\otimes\xi\in A_{e}$ , and there is an open 1-parameter family of integral extensions of $e^{\prime}$ near $e$ in $M^{(1)}$ that also represent $[w\otimes\xi]$ .

Proof.

Because $\tilde{e}\in\operatorname{Pol}_{1}(e)$, Lemma 2.12 yields a particular line $[K]$ of rank-1 homomorphisms in $(T_{p}M/e)\otimes e^{*}$ representing $\tilde{e}$. Because $H(e^{\prime})$ is a vector space such that $w\in H(e^{\prime})$ and $w\not\in e$, the rank-1 projective homomorphism $[K]$ takes the form of $[w\otimes\xi]$ for some $w\in H(e^{\prime})/e$. (Here we see again why it is helpful for an EDS to be finitely generated.)

By Lemma 2.15, there is a continuous 1-parameter family of other polar pairs $e_{\tau}$ of $e$ , with $e_{\tau}\cap e=e^{\prime}$ , converging to $\tilde{e}$ , all of which share the rank-1 projective homomorphism $[w\otimes\xi]$ .

That is, as a line of rank-1 homomorphisms, $[w\otimes\xi]$ is contained in $(H(e^{\prime})/e)\otimes e^{*}$, as a subspace of $(T_{p}M/e)\otimes e^{*}$. Applying $\arctan_{e}$, this implies that $e_{\tau}\subset H(e^{\prime})$ for all $\tau$. By the definition of $H(e^{\prime})$, this means $e_{\tau}\in\operatorname{Var}_{n}(\mathcal{I})$ for all $\tau$. But, the $e_{\tau}$ follow a continuous curve, and $e_{0}=e$ lies in the open subset $M^{(1)}$. Therefore, all $e_{\tau}$ lie in $M^{(1)}$ for an open set of sufficiently small $\tau$. Differentiating, we see that the line $[w\otimes\xi]$ is contained in the tangent space of the fiber of $M^{(1)}$ at $e$, namely $A_{e}$. ∎

On the other hand, for fixed $e$ and $\xi$ , there are various distinct $\tilde{e}$ corresponding to linearly independent $w$ . With Figure 6 in mind, it is easy to see that

[TABLE]

Recall the rank-1 ideal $\mathscr{R}$ from Section 1. Here it applies to vector bundles. As a set, the rank-1 subvariety of the tableau is

[TABLE]

As a set, the characteristic variety $\Xi$ is the projection of $\operatorname{\mathscr{C}}$ to $V^{*}$. More precisely, $\Xi$ is the scheme defined by the characteristic ideal $\mathscr{M}$ on $V^{*}$ that is obtained from the rank-1 ideal $\mathscr{R}$ on $A\subset W\otimes V^{*}$ in the following way: For any $\xi\in V^{*}$, define $\sigma_{\xi}:W\to H^{1}$ by $\sigma_{\xi}(w)=\sigma(w\otimes\xi)$. (We must study $\Xi$ along with its various components and multiplicities, so it is better to think of it as a scheme than as a simple-minded variety.) Note that $\dim\ker\sigma_{\xi}=r(\xi^{\perp})$ by (6.5) and (6.6), but this does not account for multiplicity within $\operatorname{\mathscr{C}}$ itself. Then the scheme $\operatorname{\mathscr{C}}$ is the incidence correspondence of $\Xi$ for the symbol map $\sigma_{\xi}$. (For more background on the utility of incidence correspondences in algebraic geometry, see the 2013 Columbia Eilenberg lecture series by Joe Harris, [Har13]. A YouTube link is in the bibliography.) See Figure 12.

This interpretation is amazing. Suddenly, two completely elementary ideas from Section 1—tableaux of matrices and rank-1 matrices—come together to give a concise description of the most subtle structure in PDE theory.

However, the scheme components and multiplicities are still not obvious from Figure 12; they must be obtained by examining the degree of the equations defining $\ker\sigma_{\xi}$ . The powerful third interpretation in Section 7 provides this detail. But first an example.

6(c). Example: The Wave Equation

Consider the PDE $f_{11}+f_{22}=f_{33}$. To study it, we consider the manifold $M=\mathbb{R}^{3+1+3+5}\subset\mathbb{R}^{13}=\mathbb{J}^{2}(\mathbb{R}^{3},\mathbb{R})$ with coordinates $x^{1}$, $x^{2}$, $x^{3}$, $f$, $p_{1}$, $p_{2}$, $p_{3}$, $p_{11}$, $p_{12}$, $p_{13}$, $p_{22}$, $p_{23}$. Consider the exterior differential system generated by

[TABLE]

Let $\omega^{i}=\mathrm{d}x^{i}$ for $i=1,2,3$ , so the derivatives are computed as

[TABLE]

where $\pi^{1}_{2}=\pi^{2}_{1}$ , $\pi^{1}_{3}=\pi^{3}_{1}$ , $\pi^{2}_{3}=\pi^{3}_{2}$ , and $\pi^{3}_{3}=\pi^{1}_{1}+\pi^{2}_{2}$ .

Changing bases, this tableau is equivalent to an endovolutive one of the form

[TABLE]

Or in block form

[TABLE]

Note that the third rows of both $\operatorname{B}^{1}_{2}\operatorname{B}^{1}_{3}-\operatorname{B}^{1}_{3}\operatorname{B}^{1}_{2}$ and $\operatorname{B}^{1}_{2}\operatorname{B}^{2}_{3}-\operatorname{B}^{1}_{3}\operatorname{B}^{2}_{2}$ are zero, so the tableau is involutive by Theorem 5.4.

The rank-1 condition is

[TABLE]

After a simple change of basis, this becomes the example (1.2) – (1.4), seen throughout the earlier sections.
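The rank-one incidence for this example can also be checked numerically. The sketch below is an illustration under stated assumptions rather than the author's computation: it works in the original (symmetric) coordinates of this subsection instead of the endovolutive basis, it uses rational test covectors although $\Xi$ is properly studied over $\mathbb{C}$, and the names `kernel_dim_sigma` and `WAVE` are hypothetical. It computes $\dim\ker\sigma_{\xi}=\dim\{w : w\otimes\xi\in A\}$ as $\dim A + r - \dim(A + W\otimes\xi)$, which is positive exactly when $\xi$ is characteristic.

```python
from fractions import Fraction

def rank(rows):
    """Exact rank of a matrix, given as a list of equal-length rows."""
    m = [[Fraction(x) for x in row] for row in rows]
    rk = 0
    for c in range(len(m[0]) if m else 0):
        piv = next((i for i in range(rk, len(m)) if m[i][c] != 0), None)
        if piv is None:
            continue
        m[rk], m[piv] = m[piv], m[rk]
        for i in range(len(m)):
            if i != rk and m[i][c] != 0:
                f = m[i][c] / m[rk][c]
                m[i] = [x - f * y for x, y in zip(m[i], m[rk])]
        rk += 1
    return rk

def flatten(K):
    """Write an r x n matrix as a single row vector."""
    return [x for row in K for x in row]

def kernel_dim_sigma(basis, xi, r):
    """dim { w : w (x) xi in A } = dim A + r - rank(A + W (x) xi), for xi != 0."""
    rows = [flatten(K) for K in basis]
    dim_a = rank(rows)
    for a in range(r):
        # z_a (x) xi is the matrix with xi in row a and zeros elsewhere
        rows.append(flatten([list(xi) if b == a else [0] * len(xi)
                             for b in range(r)]))
    return dim_a + r - rank(rows)

# Symmetric 3x3 matrices with pi^3_3 = pi^1_1 + pi^2_2, as in the relations above.
WAVE = [
    [[1, 0, 0], [0, 0, 0], [0, 0, 1]],   # pi^1_1
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],   # pi^1_2
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],   # pi^1_3
    [[0, 0, 0], [0, 1, 0], [0, 0, 1]],   # pi^2_2
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],   # pi^2_3
]

for xi in [(3, 4, 5), (1, 0, 1), (1, 0, 0), (1, 1, 1)]:
    on_cone = xi[0] ** 2 + xi[1] ** 2 == xi[2] ** 2
    print(xi, kernel_dim_sigma(WAVE, xi, 3), on_cone)
```

In these coordinates a rank-one element $w\otimes\xi$ of the tableau must be symmetric, which forces $w$ proportional to $\xi$, and the remaining relation becomes $\xi_{3}^{2}=\xi_{1}^{2}+\xi_{2}^{2}$; the loop confirms that the kernel is one-dimensional exactly on that cone for the sampled covectors.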

7. Guillemin Normal Form and Eigenvalues

In this section, we reinterpret $\operatorname{\mathscr{C}}$ and $\Xi$ as properties of the endomorphisms $\operatorname{B}^{\lambda}_{i}$. This section is the key to all of the more advanced results that follow. Our main computational tool is the structure of an endovolutive tableau discussed in Section 1(c), where $W$ and $V$ and $A$ are now bundles over $M^{(1)}$.

The incidence correspondence of Figure 12 is rephrased in Lemma 7.1.

Lemma 7.1.

If $\xi\in\Xi$ , $v\in V$ , and $w\in\ker\sigma_{\xi}\subset W$ , then

[TABLE]

In particular, $w$ is an eigenvector of $\operatorname{B}(\xi)(v)$ for all $v$ .

Proof.

Fix generic bases $(u^{i})$ and $(z_{a})$ and $(u_{i})$ , so that $\xi=\xi_{i}u^{i}$ and $w=w^{a}z_{a}$ and $v=v^{i}u_{i}$ . Set $\pi=w\otimes\xi\in\operatorname{\mathscr{C}}\subset A$ , so $\pi^{a}_{i}=w^{a}\xi_{i}$ for all $a,i$ , and this $\pi$ must satisfy the symbol relations (1.10). In particular, $w^{a}\xi_{i}=B^{a,\lambda}_{i,b}w^{b}\xi_{\lambda}$ for $a>s_{i}$ . Therefore

[TABLE]

(Here we see the utility of including the first summand in Equation (1.14).) ∎

Recalling the decomposition (1.17) and (1.18), Lemma 7.4 provides a sort of converse of Lemma 7.1.

Lemma 7.4.

Suppose that $A$ is an endovolutive tableau. Fix $\varphi\in Y^{\perp}\cong U^{*}$ and suppose that $w\in\operatorname{\mathbf{W}}^{-}(\varphi)$ is an eigenvector of $\operatorname{B}(\varphi)(v)$ for every $v\in V$ . Then there is a $\xi\in\Xi$ over $\varphi\in Y^{\perp}$ such that $w\in\operatorname{\mathbf{W}}^{1}(\varphi)$ , so $w\otimes\xi\in A$ .

Proof.

For each $v\in V$ , let $\xi(v)$ denote the eigenvalue corresponding to $v$ , so that $\xi(v)w=\operatorname{B}(\varphi)(v)w$ . Because $\operatorname{B}(\varphi)(v)w$ is linear in $v$ , so is $\xi(v)$ . Then $\xi=\xi_{i}u^{i}\in V^{*}$ . Therefore, $\operatorname{B}(\varphi)(\cdot)w=w\otimes\xi$ . In particular, the rank-1 condition implies that

[TABLE]

This is the same expression as in (1.22), so by comparing recursively over $\mu=1,2,\ldots,\ell$ , we see that $\xi_{\lambda}=\varphi_{\lambda}$ for all $\lambda$ , so $w\in\operatorname{\mathbf{W}}^{1}(\varphi)\subset\operatorname{\mathbf{W}}^{-}(\varphi)$ . ∎

Lemma 7.4 deserves a warning: There may be multiple $\xi$ over the same $\varphi$ , for perhaps there are different eigenvectors $w\in\operatorname{\mathbf{W}}^{-}(\varphi)$ admitting different sequences of eigenvalues $\xi_{\varrho}$ , for $\varrho>\ell$ , associated to the same $\varphi$ . Moreover, it is not (yet) clear that a mutual eigenvector $w$ exists for every such $\varphi$ .

But overall it is clear that there is some relationship between the eigenvalues of $\operatorname{B}^{\lambda}_{i}$ and the characteristic variety of an endovolutive tableau $A$. This relationship is made precise for involutive tableaux using a result from [Gui68].

Theorem 7.6 (Guillemin normal form).

Suppose that $A$ is involutive. For every $\varphi\in Y^{\perp}$ and $v\in V$ , the restricted homomorphism $\operatorname{B}(\varphi)(v)|_{\operatorname{\mathbf{W}}^{1}(\varphi)}$ is an endomorphism of $\operatorname{\mathbf{W}}^{1}(\varphi)$ . Moreover, for all $v,\tilde{v}\in V$ ,

[TABLE]

Compare Theorem 7.6 to Lemma 4.1 in [Gui68] and Proposition 6.3 in Chapter VIII of [BCG+90]. Theorem 7.6 is known as Guillemin normal form because it implies that the family of homomorphisms $\operatorname{B}(\varphi)(\cdot)$ can be placed in simultaneous Jordan normal form on $\operatorname{\mathbf{W}}^{1}(\varphi)$. It is the “normal form” alluded to in Section 1(b). We defer the proof of Theorem 7.6 to Section 9 so we may first see its important consequences.

Corollary 7.8.

If $A$ is involutive, then for each $\varphi\in Y^{\perp}$ , there exists some $w$ satisfying the hypotheses of Lemma 7.4. That is, the projection map $\Xi\to Y^{\perp}$ is onto. In particular, if $A$ is nontrivial and involutive, then $\Xi$ is nonempty.

Proof.

Because we are working over $\mathbb{C}$ , the commutativity condition (7.7) guarantees that common eigenvectors exist for the commutative algebra $\{\operatorname{B}(\varphi)(v)~{}:~{}v\in V\}$ . ∎
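The step from commutativity to a common eigenvector is the standard linear-algebra argument. It is sketched here for two commuting endomorphisms, in notation of our own ($B_{1}$, $B_{2}$, $E_{\mu}$ are not symbols from the text); the general case follows by induction on a finite spanning set of the family.

```latex
\begin{aligned}
&B_{1}B_{2}=B_{2}B_{1},\qquad E_{\mu}=\ker(B_{1}-\mu I)\neq 0
 \quad(\text{some eigenvalue }\mu\text{ exists over }\mathbb{C}),\\
&w\in E_{\mu}\ \Longrightarrow\ B_{1}(B_{2}w)=B_{2}(B_{1}w)=\mu\,(B_{2}w)
 \ \Longrightarrow\ B_{2}w\in E_{\mu},\\
&\text{so } B_{2}|_{E_{\mu}}\in\operatorname{End}(E_{\mu})
 \text{ has an eigenvector } w_{0}\in E_{\mu}:\qquad
 B_{1}w_{0}=\mu\,w_{0},\quad B_{2}w_{0}=\nu\,w_{0}.
\end{aligned}
```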

Lemma 7.9.

Suppose that $A$ is an involutive tableau. Then the map of projective varieties induced by $\Xi\to Y^{\perp}$ is a finite branched cover. In particular, both $\hat{\Xi}$ and $Y^{\perp}$ have affine fiber dimension $\ell$ .

Proof.

Fix $\varphi\in Y^{\perp}$ . The set of $\xi$ over $\varphi$ is nonempty by Corollary 7.8. If it were true that the set of $\xi$ projecting to a particular $\varphi$ were infinite, then the parameter $\xi_{i}$ would take infinitely many values in some expression of the form

[TABLE]

But, the matrix $\sum_{\lambda}\varphi_{\lambda}\operatorname{B}^{\lambda}_{i}\in\operatorname{End}(\operatorname{\mathbf{W}}^{-}_{1})$ can have at most $s_{1}$ eigenvalues. ∎

Here we arrive at an easy proof of the main theorem regarding the structure of $\Xi$. (It is easy in the sense that we have the explicit polynomials of $\mathscr{M}$ in hand, and they are recognizable as the familiar eigenvector equations. The reader should compare (7.14) to the descriptions provided in [BCG+90] and [IL03]. Both references defer their decomposition of $\Xi$ to the abstract Grothendieck–Riemann–Roch theorem. Hence, neither reference indicates how to compute the scheme by hand for general tableaux. While details are given in [BCG+90] in the simple case of rectangular tableaux, a complete description is achieved here because of the normal form provided by Theorem 5.4.)

Theorem 7.11.

If $A$ is involutive, then $\dim\Xi=\ell-1$ and $\deg\Xi=s_{\ell}$ .

Proof.

We work in endovolutive coordinates. From Lemma 7.9, we already know that $\dim\Xi=\ell-1$ .

Fix a generic point $\xi\in\Xi$ over $\varphi\in Y^{\perp}$ . Let $\operatorname{\mathscr{C}}_{\xi}=(\ker\sigma_{\xi})\otimes\xi$ denote the fiber over $\xi$ in $\operatorname{\mathscr{C}}$ . To understand the scheme $\Xi$ , we must determine the degree of the condition defining $\operatorname{\mathscr{C}}_{\xi}$ . Note that $\operatorname{\mathscr{C}}_{\xi}$ must be a subvariety of $\operatorname{\mathbf{W}}^{1}(\varphi)\otimes\xi$ , and $\operatorname{\mathbf{W}}^{1}(\varphi)$ is a linear subspace of $W$ , so the degree of $\Xi$ is the degree of some condition on $\operatorname{\mathbf{W}}^{1}(\varphi)$ .

By Lemma 7.1 and (6.6), the condition that $\operatorname{\mathscr{C}}_{\xi}$ is nontrivial is precisely the condition that

[TABLE]

Since we may restrict our attention to $\operatorname{\mathbf{W}}^{1}(\varphi)\otimes\xi$ , the condition (7.12) for $i\leq\ell$ is automatic by (1.23). Hence, only these terms contribute to the non-linear part of the ideal:

[TABLE]

So, without coordinates, the defining equations of $\operatorname{\mathscr{C}}_{\xi}$ are

[TABLE]

For a particular $v$ , this is the characteristic polynomial of $\operatorname{B}(\xi)(v)$ as an endomorphism of $\operatorname{\mathbf{W}}^{1}(\varphi)$ . By involutivity and Theorem 7.7, all $\operatorname{B}(\xi)(v)$ for $v\in Y$ admit the same Jordan-block form, so they admit the same factorization type for their respective characteristic polynomials. That means it suffices to consider a single $v$ . By definition, the characteristic polynomial of $\operatorname{B}(\xi)(v)|_{\operatorname{\mathbf{W}}^{1}(\varphi)}$ has degree $\dim\operatorname{\mathbf{W}}^{1}(\varphi)$ at generic $\varphi$ . Therefore, $\deg\Xi=s_{\ell}$ follows from Lemma 1.25. ∎

Theorems 7.7 and 7.11 provide a powerful interpretation of the form of an involutive tableau seen in Theorem 5.4 and Figure 3; the first $\ell$ columns represent a projection of $\Xi$ , as in Lemma 7.9, and the rank-1 incidence correspondence in Figure 12 is precisely the eigenvector condition on the appropriate subspaces. It is peculiar and interesting that these results were discovered in the opposite order historically, as explored in Section 9.

The proof of Theorem 7.11—in particular Equation (7.14)—gives a precise understanding of $\Xi$ as a scheme. Specifically, the characteristic scheme (in the sense of PDE) is merely a scheme of characteristic equations (in the sense of linear algebra)! The components of $\Xi$ correspond to the various Jordan blocks apparent in (7.14). The multiplicity of each component is the dimension of that generalized eigenspace. The sheets of the finite branched cover $\Xi\to Y^{\perp}$ come from different generalized eigenspaces where the first $\ell$ eigenvalues match. See Section 8 for how to compute this.

8. Examples

8(a). Zero-dimensional examples

Consider some cases of involutive tableaux with $(s_{1},s_{2},s_{3})=(4,0,0)$ .

[TABLE]

Or, in endovolutive block form:

[TABLE]

The characteristic ideal $\mathcal{M}$ will have degree $s_{\ell}=4$ and projective dimension $\ell-1=0$ . That is, $\Xi$ will be 4 points, counted with multiplicity. The involutivity condition is $0=\operatorname{B}^{1}_{2}\operatorname{B}^{1}_{3}-\operatorname{B}^{1}_{3}\operatorname{B}^{1}_{2}$ (all rows); that is, the matrices commute. Thus the matrices $\operatorname{B}^{1}_{2}$ and $\operatorname{B}^{1}_{3}$ must have compatible Jordan-block forms; they span a commutative algebra. In these examples, we will use colors to emphasize the distinct generalized eigenspaces.

One possibility is that the matrices are diagonal with distinct Jordan blocks:

[TABLE]

In this case, the rank-1 variety is

[TABLE]

Each point $\xi\in\Xi$ has multiplicity 1.

Another possibility is that they are diagonal, but there is a two-dimensional eigenspace.

[TABLE]

In this case, the rank-1 cone is

[TABLE]

One point $\xi\in\Xi$ has multiplicity 2; in particular, the fiber $\ker\sigma_{\xi}$ for $\xi=[1:{\color[rgb]{0.82,.10,0.26}c_{1}}:{\color[rgb]{0.82,.10,0.26}d_{1}}]$ should be seen as a $\mathbb{P}^{1}$. This is reflected clearly in (7.14), because $\xi=[\xi_{1}:\xi_{2}:\xi_{3}]=[1:{\color[rgb]{0.82,.10,0.26}c_{1}}:{\color[rgb]{0.82,.10,0.26}d_{1}}]$ is a root of degree 2 for any $v$:

[TABLE]
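The two diagonal cases can be sketched in a few lines of Python. This is only an illustration: the numerical coefficients are hypothetical and not taken from the text, and the helper names `diag`, `commute`, and `joint_points` are ours. For diagonal $\operatorname{B}^{1}_{2}$ and $\operatorname{B}^{1}_{3}$ with diagonal entries $c_{i}$ and $d_{i}$, we assume the points of $\Xi$ are the joint eigenvalues $[1:c_{i}:d_{i}]$, counted with multiplicity.

```python
from collections import Counter


def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def commute(a, b):
    """Return True when a*b == b*a (the involutivity condition in this example)."""
    return matmul(a, b) == matmul(b, a)


def diag(entries):
    """Build a diagonal matrix from a list of diagonal entries."""
    n = len(entries)
    return [[entries[i] if i == j else 0 for j in range(n)] for i in range(n)]


def joint_points(c, d):
    """Count the points (1, c_i, d_i); the count is the multiplicity."""
    return Counter((1, ci, di) for ci, di in zip(c, d))


# Hypothetical coefficients: four distinct joint eigenvalues.
c, d = [2, 3, 5, 7], [1, 4, 6, 8]
assert commute(diag(c), diag(d))
points_distinct = joint_points(c, d)

# Hypothetical coefficients: the first joint eigenvalue is repeated,
# giving a two-dimensional eigenspace.
c_rep, d_rep = [2, 2, 5, 7], [1, 1, 6, 8]
assert commute(diag(c_rep), diag(d_rep))
points_repeated = joint_points(c_rep, d_rep)

print(points_distinct)
print(points_repeated)
```

In both cases the multiplicities sum to $s_{\ell}=4$, as the degree count predicts.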

Another possibility is that there is a $2\times 2$ block:

[TABLE]

In this case, the rank-1 cone is

[TABLE]

Note that the fiber of $\mathscr{C}$ over $\Xi$ has dimension 1 in each case; however, the first point has multiplicity 2. We see that the dimension of the fiber is insufficient to measure the multiplicity of the scheme $\Xi$, because the incidence correspondence involves the ideal $\mathscr{R}$. We can see this because of the structure of the rank-1 matrices: the upper $2\times 2$ minors vanish if and only if $\alpha_{2}\alpha_{2}=0$, so the fiber $\ker\sigma_{\xi}$ for $\xi=[1:{\color[rgb]{0.82,.10,0.26}c_{1}}:{\color[rgb]{0.82,.10,0.26}d_{1}}]$ should be seen as a $\mathbb{P}^{0}$ of degree 2. This is reflected clearly in (7.14), because $\xi=[\xi_{1}:\xi_{2}:\xi_{3}]=[1:{\color[rgb]{0.82,.10,0.26}c_{1}}:{\color[rgb]{0.82,.10,0.26}d_{1}}]$ is a root of degree 2 for any $v$:

[TABLE]
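The distinction between the dimension of the fiber and the multiplicity of the point is the usual distinction between geometric and algebraic multiplicity of an eigenvalue. The following sketch, with hypothetical numerical entries and helper names of our own choosing, compares a diagonal matrix having a repeated eigenvalue against a matrix with a $2\times 2$ Jordan block for the same eigenvalue. It uses exact rational arithmetic and computes the algebraic multiplicity as $n-\operatorname{rank}\,(M-cI)^{n}$.

```python
from fractions import Fraction


def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def rank(matrix):
    """Rank of a matrix (list of rows) by exact Gaussian elimination."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    r = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r


def shift(matrix, c):
    """Return matrix - c * identity."""
    n = len(matrix)
    return [[matrix[i][j] - (c if i == j else 0) for j in range(n)]
            for i in range(n)]


def geometric_multiplicity(matrix, c):
    """Dimension of the eigenspace of c."""
    return len(matrix) - rank(shift(matrix, c))


def algebraic_multiplicity(matrix, c):
    """Dimension of the generalized eigenspace of c."""
    n = len(matrix)
    power = shift(matrix, c)
    for _ in range(n - 1):
        power = matmul(power, shift(matrix, c))
    return n - rank(power)


# Hypothetical entries: eigenvalue 2 repeated, first diagonally, then in a Jordan block.
diagonal = [[2, 0, 0, 0], [0, 2, 0, 0], [0, 0, 5, 0], [0, 0, 0, 7]]
jordan = [[2, 1, 0, 0], [0, 2, 0, 0], [0, 0, 5, 0], [0, 0, 0, 7]]

for name, m in (("diagonal", diagonal), ("jordan", jordan)):
    print(name, geometric_multiplicity(m, 2), algebraic_multiplicity(m, 2))
```

Both matrices give algebraic multiplicity 2 for the repeated eigenvalue, but only the diagonal one has a two-dimensional eigenspace; this mirrors the contrast between the $\mathbb{P}^{1}$ fiber and the $\mathbb{P}^{0}$ of degree 2.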

Finally, consider the case where both types of multiplicity occur. For example,

[TABLE]

In this case, the rank-1 cone is

[TABLE]

The scheme structure of $\Xi$ is apparent here. The point $\xi=[1:{\color[rgb]{0.82,.10,0.26}c_{1}}:{\color[rgb]{0.82,.10,0.26}d_{1}}]$ appears in two components, which correspond to the factorization of

[TABLE]

From the perspective of $\operatorname{\mathscr{C}}$ , these components correspond to the rank-1 matrices

[TABLE]

The fiber should be seen as two components, a $\mathbb{P}^{1}$ and a $\mathbb{P}^{0}$ . Overall, this point has multiplicity 3.

*Remark 8.15**.*

For readers interested in hydrodynamic integrability criteria, take a moment to compute the secant varieties $\operatorname{Sec}_{k}(\operatorname{\mathscr{C}})$ and $\operatorname{Sec}_{k}(\Xi)$, $k=2,3$, in each of these cases. The secant variety is all linear combinations of $k$ points from the given variety. One can consider both the embedded secant variety within $A$ and $V^{*}$, respectively, as well as the Grassmannian secant variety within $\operatorname{Gr}_{k}(A)$ and $\operatorname{Gr}_{k}(V^{*})$, respectively. Note that hyperbolic systems of conservation laws have $s_{1}=n$ and take the non-degenerate diagonal form of the first example, over $\mathbb{R}$.

8(b). One-dimensional examples

Consider an involutive tableau with $(s_{1},s_{2},s_{3})=(2,1,0)$ .

[TABLE]

Or, in endovolutive block form,

[TABLE]

The characteristic ideal $\mathcal{M}$ will have degree $s_{\ell}=1$ and projective dimension $\ell-1=1$ . That is, $\Xi$ will be a single curve.

For the sake of concreteness, let us assume that the coefficients are:

[TABLE]

so that

[TABLE]

The rank-1 ideal is just $\alpha_{0}\alpha_{0}-9\alpha_{1}\alpha_{2}=0$ . Write a generic element of $\operatorname{\mathscr{C}}$ as $[\alpha_{0}:\alpha_{1}:\alpha_{2}]=[3\tau:1:\tau^{2}]$ , like so:

[TABLE]

Thus, a generic element of $\Xi$ is of the form $\xi=[3:\tau:15+9\tau]$ with fiber $\begin{bmatrix}3\tau\\ 1\end{bmatrix}$.

Using (7.14), the characteristic scheme of $\xi=[3:\xi_{2}:\xi_{3}]$ is generated by $0=\det\left(\xi_{1}v^{3}\operatorname{B}^{1}_{3}+\xi_{2}v^{3}\operatorname{B}^{2}_{3}-\xi_{3}v^{3}I_{2}\right)$ , restricted to the space $\operatorname{\mathbf{W}}^{1}(\xi)\subset W$ , which is 1-dimensional. Write $\tau$ for $\xi_{2}$ ; so we are trying to find $\xi=[3:\tau:\xi_{3}]$ over $\varphi=[3:\tau:0]$ as in Lemma 7.4. The space $\operatorname{\mathbf{W}}^{1}(\varphi)$ is the space spanned by $\begin{bmatrix}3\tau\\ 1\end{bmatrix}$ . Hence, the single linear sheet of the characteristic variety over $[3:\tau:0]$ is given by $[3:\tau:15+9\tau]$ .
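As a quick sanity check on this example, the parametrization $[\alpha_{0}:\alpha_{1}:\alpha_{2}]=[3\tau:1:\tau^{2}]$ can be substituted into the rank-1 equation $\alpha_{0}\alpha_{0}-9\alpha_{1}\alpha_{2}=0$. The sketch below does this with exact rational arithmetic at a handful of sample values of $\tau$; the function names and the sample values are our own choices.

```python
from fractions import Fraction


def rank_one_conic(a0, a1, a2):
    """Left-hand side of the rank-1 equation a0*a0 - 9*a1*a2 = 0."""
    return a0 * a0 - 9 * a1 * a2


def parametrize(tau):
    """The parametrization [3*tau : 1 : tau^2] of the rank-1 conic."""
    return (3 * tau, 1, tau * tau)


# Sample values of tau chosen only for illustration.
samples = [Fraction(k, 3) for k in range(-6, 7)]
residuals = [rank_one_conic(*parametrize(tau)) for tau in samples]
print(all(r == 0 for r in residuals))
```

The identity is $(3\tau)^{2}-9\cdot 1\cdot\tau^{2}=0$, so every residual vanishes.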

8(c). One-dimensional exercise

Now is the time to go back and re-read the example (1.4) and see how it fits into Sections 6(c) and 5(a). The wave-equation example offers a single $\mathbb{P}^{1}$ whose fiber is also a $\mathbb{P}^{1}$. By choosing appropriate coefficients, you should be able to produce examples with $(s_{1},s_{2},s_{3})=(3,2,0)$ with various other components and multiplicities.

In principle, you can choose any Cartan characters, and choose coefficients subject to Theorem 5.4 to build examples in this way. See the Sage code at https://bitbucket.org/curieux/symbol_sage, which can generate and analyze any such example (given sufficient memory).

9. Results of Guillemin and Quillen

As in the analogy of Section 1(b), normal forms often reveal shortcuts to other advanced ideas.

Guillemin’s proof of Theorem 7.7 made use of two results derived from Quillen’s thesis [Qui64]. In this section, we see how these results become easier using Theorem 5.4. (Note that Theorem 5.4 and Theorem 7.7 are not equivalent. Theorem 5.4 is strictly stronger; it is easy to construct endovolutive tableaux that satisfy the conclusion of (7.7) but are not involutive. See [Smi15].)

Recall the Spencer cohomology groups from Section 4(b). For any $\varphi\in V^{*}$ , wedging by $\varphi$ gives a map $W\otimes\wedge^{k}V^{*}\to W\otimes\wedge^{k+1}V^{*}$ . This induces a map on the quotient spaces, $H^{k}(A)\to H^{k+1}(A)$ .

Theorem 9.1 (Quillen’s Exactness Theorem).

Suppose $A$ is an involutive tableau, and that $\varphi\not\in\Xi_{A}$ . Then the sequence of maps by $\wedge\varphi$ ,

[TABLE]

is exact.

In [Qui64], this theorem is proven using enormous commutative diagrams. In our context, with Theorem 5.4 in hand, we can prove an easy version of Quillen’s result, in the form of Lemma 9.3. Lemma 9.3 is a consequence of Corollary 9.2, which for us is an easy corollary of Theorem 5.4. This corollary is called Theorem A in [Gui68], where it was proved using a large diagram chase using Quillen’s exactness theorem, Theorem 9.1.

Corollary 9.2 (Quillen, Guillemin).

Consider the subspace $U=\left\langle u_{1},\ldots,u_{\ell}\right\rangle\subset V$ for a generic basis $(u_{i})$ of $V$ , as in (1.17). If $A$ is involutive, then $A|_{U}$ is involutive, and the natural map between prolongations $A^{(1)}\to\left(A|_{U}\right)^{(1)}$ is bijective.

Proof.

The first part is an immediate consequence of Theorem 5.4, as the quadratic condition still holds if the range of indices $\lambda,\mu,i,j$ is truncated at $\ell$ (or greater). In particular, the generators $(\pi^{a}_{\lambda})_{a\leq s_{\lambda}}$ of $A$ are preserved.

The second part is similarly immediate, using the proof of Theorem 5.4 given in [Smi15]: the contact relation $\pi^{a}_{\mu}=Z^{a}_{\mu,i}u^{i}$ for $a\leq s_{\lambda}$ gives coordinates $Z^{a}_{\mu,i}$ to the prolongation $A^{(1)}\subset A\otimes V^{*}$ , and the $s_{1}+2s_{2}+\cdots+\ell s_{\ell}$ independent generators are precisely those $Z^{a}_{\mu,\lambda}$ with $a\leq s_{\mu}$ and $\lambda\leq\mu\leq\ell$ . Since they involve no indices $i>\ell$ , these generators remain independent when the range of indices is truncated at $\ell$ . ∎
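The count $s_{1}+2s_{2}+\cdots+\ell s_{\ell}$ of independent generators used in this proof is easy to tabulate for the Cartan characters appearing in Section 8. The following is a minimal sketch; the function name `prolongation_generator_count` is ours and is not part of any existing library.

```python
def prolongation_generator_count(characters):
    """Return s_1 + 2*s_2 + ... + n*s_n for Cartan characters (s_1, ..., s_n)."""
    return sum(k * s for k, s in enumerate(characters, start=1))


# Cartan characters that appear in the examples of Section 8.
for characters in [(4, 0, 0), (2, 1, 0), (3, 2, 0)]:
    print(characters, prolongation_generator_count(characters))
```

Trailing zeros in the list of characters do not change the count, which is consistent with truncating the range of indices at $\ell$.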

Now we come to our simplified version of Theorem 9.1. Compare Lemma 9.3 to the exact sequence $(3.4)_{2}$ in [Gui68].

Lemma 9.3.

Recall that $U^{\perp}$ is a complement to $Y^{\perp}\subset V^{*}$ , so that $V^{*}=Y^{\perp}\oplus U^{\perp}$ as in (1.17) and (1.18). For $A$ involutive, the sequence

[TABLE]

is exact.

Proof.

This proof is just an explicit description of the maps in a basis and an application of Corollary 9.2. Let $(u^{i})$ be a basis for $V^{*}$ such that $(u^{\lambda})$ is a basis for $Y^{\perp}$ and $(u^{\varrho})$ is a basis for $U^{\perp}$ , using the index convention (1.8) from Section 1.

The sequence makes sense because we can split the Spencer sequence (4.15) as $W\otimes V^{*}=A\oplus H^{1}$ by identifying the space $H^{1}$ with $\{\sum_{a>s_{i}}\pi^{a}_{i}(z_{a}\otimes u^{i})\}\subset W\otimes V^{*}$ , which is the space spanned by the unshaded entries in Figure 1. Using this identification, two elements $\sum_{a>s_{i}}\pi^{a}_{i}(z_{a}\otimes u^{i})$ and $\sum_{a>s_{i}}\hat{\pi}^{a}_{i}(z_{a}\otimes u^{i})$ of $W\otimes V^{*}$ are equivalent in $H^{1}$ if and only if $\pi^{a}_{i}-\hat{\pi}^{a}_{i}=\sum_{b\leq s_{\lambda}}B^{a,\lambda}_{i,b}z^{b}_{i}$ for some $\{z^{a}_{i}:a\leq s_{i}\}$ , the shaded entries in Figure 1. In other words, the projection $W\otimes V^{*}\to H^{1}$ is defined by (1.10), and the projection $W\otimes V^{*}\to A$ is defined by the projection onto the orange generator components in Figure 1, those $\pi^{a}_{\lambda}$ with $a\leq s_{\lambda}$ .

Since $s_{\varrho}=0$ for all $\varrho>\ell$ , the inclusion $W\otimes U^{\perp}\subset W\otimes V^{*}$ is an inclusion $W\otimes U^{\perp}\subset H^{1}$ . Hence, the inclusion is understood as

[TABLE]

An element of $H^{1}\otimes U^{\perp}$ is written in $W\otimes V^{*}\otimes U^{\perp}$ as

[TABLE]

The image $\delta(H^{1}\otimes U^{\perp})$ in $H^{2}$ is

[TABLE]

so $\delta P\in W\otimes\wedge^{2}V^{*}$ is of the form

[TABLE]

Recall that $H^{2}=\frac{W\otimes\wedge^{2}V^{*}}{\delta_{\sigma}(A\otimes V^{*})}$ . So, $\delta P\equiv 0$ in $H^{2}$ if and only if there is some $T\in A\otimes V^{*}$ such that $\delta_{\sigma}(T)=\delta(P)$ in $W\otimes\wedge^{2}V^{*}$ . Looking at (9.7), it is apparent that such $T$ must have $\delta_{\sigma}(T|_{U})=0$ , as $\delta(P)$ has no $Y^{\perp}\wedge Y^{\perp}$ terms. By involutivity and Corollary 9.2, we consider the involutive tableau

[TABLE]

with prolongation

[TABLE]

Therefore, $T|_{U}\in A|_{U}\otimes Y^{\perp}$ lies in the kernel of $\delta_{\sigma}|_{U}$ , so $T|_{U}\in\left(A|_{U}\right)^{(1)}$ . Therefore, Corollary 9.2 tells us $T\in A^{(1)}$ . That is, $\delta(P)\equiv 0\in H^{2}$ if and only if $\delta(P)=\delta_{\sigma}(T)=0$ .

Therefore, $\delta(P)\equiv 0\in H^{2}$ if and only if $P^{a}_{\lambda,\varsigma}=0$ and $P^{a}_{\varrho,\varsigma}=P^{a}_{\varsigma,\varrho}$ on these index ranges. This occurs if and only if $P=P^{a}_{\varrho,\varsigma}(z_{a}\otimes u^{\varrho}\otimes u^{\varsigma})$ , meaning $P\in W\otimes S^{2}U^{\perp}$ . ∎

We are ready to prove Theorem 7.7. The structure of the proof is identical to the original proof in [Gui68].

Proof of Theorem 7.7.

Suppose that $w\in\operatorname{\mathbf{W}}^{1}(\varphi)$ , so that $\pi=\operatorname{B}(\varphi)(\cdot)w=w\otimes\varphi+J$ for some $J\in W\otimes U^{\perp}$ with $J_{\varrho}=J^{a}_{\varrho}z_{a}\in\operatorname{\mathbf{W}}^{-}(\varphi)$ for all $\varrho$ . First, we must show that the span of the columns $J_{\varrho}$ of $J$ lies in $\operatorname{\mathbf{W}}^{1}(\varphi)$ .

Consider the element $-J\otimes\varphi=-J^{a}_{\varrho}\varphi_{\lambda}(z_{a}\otimes u^{\lambda}\otimes u^{\varrho})\in H^{1}\otimes U^{\perp}$ . Because $z\otimes\varphi+J\in A$ , it must be that $z\otimes\varphi\otimes\varphi\in W\otimes V^{*}\otimes V^{*}$ represents the same point in $H^{1}\otimes U^{\perp}$ . So, we can compute

[TABLE]

By Corollary 9.2, there exists $Q=Q^{a}_{\varrho,\varsigma}(z_{a}\otimes u^{\varsigma}\otimes u^{\varrho})\in W\otimes S^{2}U^{\perp}$ such that $-J\otimes\varphi-Q\in A\otimes U^{\perp}$ . That is, writing $Q_{\varrho}=Q^{a}_{\varrho,\varsigma}(z_{a}\otimes u^{\varsigma})\in W\otimes Y^{\perp}$ , we have $J_{\varrho}\otimes\varphi+Q_{\varrho}\in A$ for all $\varrho$ , meaning $J_{\varrho}\in\operatorname{\mathbf{W}}^{1}(\varphi)$ for all $\varrho$ . Therefore, for any $v\in V$ , we have $\operatorname{B}(\varphi)(v)z=\varphi(v)z+J(v)\in\operatorname{\mathbf{W}}^{1}(\varphi)$ .

Now, mapping again, $\operatorname{B}(\varphi)(\cdot)J_{\varrho}=J_{\varrho}\otimes\varphi+Q_{\varrho}$ , so $\operatorname{B}(\varphi)(u_{\varsigma})J_{\varrho}=Q_{\varrho,\varsigma}$ , which is already known to be symmetric in $\varrho,\varsigma$ . Therefore,

[TABLE]

This is symmetric in $v,\tilde{v}$, giving the commutativity condition (7.7). ∎

It is interesting to see the inversion of logic that happened here. In the original literature, the overall implications are

[TABLE]

But, the arguments here give the overall implications

[TABLE]

However, we can write a shorter proof of Theorem 7.7 that relies on Theorem 5.4 more directly, avoiding the general results of Quillen. For motivation, consider the following trivial corollary of Theorem 5.4 that is obtained by setting $\lambda=\mu$.

Corollary 9.12.

Suppose an involutive tableau is given in a generic, endovolutive basis as in (1.14), so that Theorem 5.4 holds. Then $\operatorname{B}(u^{\lambda})(v)$ is an endomorphism of $\operatorname{\mathbf{W}}^{-}(u^{\lambda})$ such that for all $v,\tilde{v}\in Y$ ,

[TABLE]

Alternate Proof of Theorem 7.7.

Fix $\varphi\in Y^{\perp}$ , and suppose that $w\in\operatorname{\mathbf{W}}^{1}(\varphi)$ . We must verify that all maps $\operatorname{B}(\varphi)(v)$ preserve $\operatorname{\mathbf{W}}^{1}(\varphi)$ and that they commute. Note that the definition of $\operatorname{\mathbf{W}}^{1}(\varphi)$ in Equation 1.24 depends on the choice of subspace $Y^{\perp}$ but not on its basis, so we may verify these conditions using any basis we like.

First a trivial case: if it happens that $\varphi\in\Xi\cap Y^{\perp}$ , then $\operatorname{B}(\varphi)(v)w=\varphi(v)w\in\operatorname{\mathbf{W}}^{1}(\varphi)$ is a rescaling, and it is immediate that $[\operatorname{B}(\varphi)(v),\operatorname{B}(\varphi)(\tilde{v})]=0$ .

Otherwise, we have $\varphi\not\in\Xi$ . Then we may choose a generic basis of $V^{*}$ in which $\varphi=u^{1}$ . Moreover, we may use that basis to construct an endovolutive basis of $W$ . By Corollary 9.12, it suffices to prove in this basis that $\operatorname{\mathbf{W}}^{1}(u^{1})$ is preserved by every $\operatorname{B}(u^{1})(v)$ . Write $\operatorname{B}(\varphi)(\cdot)w=w\otimes u^{1}+J$ , and examine (1.22) on a column $J_{\varrho}$ of $J$ . For each $\mu=1,\ldots,\ell$ , we must verify

[TABLE]

If $\mu=1$ , then this is immediate, since $\operatorname{B}^{1}_{1}=I_{s_{1}}$ .

If $\mu\neq 1$ , then we are verifying $0=(\operatorname{B}^{1}_{\mu}\operatorname{B}^{1}_{\varrho}-0)w$ . Note that $\operatorname{B}^{1}_{\mu}w=0$ , since $\operatorname{B}(\varphi)(\cdot)w=w\otimes\varphi+J=w\otimes u^{1}+J$ . Moreover, by Theorem 5.4, we have

[TABLE]

for $a>s_{\mu}$ . Therefore, $\operatorname{B}^{1}_{\mu}\operatorname{B}^{1}_{\varrho}$ lies in $\operatorname{\mathbf{W}}^{-}(\mu)$ . On the other hand, note that the output of $\operatorname{B}^{1}_{\mu}$ lies in $\operatorname{\mathbf{W}}^{+}_{\mu}$ by the construction of the maps $\operatorname{B}^{\lambda}_{\mu}$ from the reduced symbol in Section 1(c). Combining these, we see that $\operatorname{B}^{1}_{\mu}\operatorname{B}^{1}_{\varrho}w$ lies in $\operatorname{\mathbf{W}}^{-}_{\mu}\cap\operatorname{\mathbf{W}}^{+}_{\mu}=0$ .

Hence, the space $\operatorname{\mathbf{W}}^{1}(\varphi)$ is preserved by $\operatorname{B}(\varphi)(v)$ for all $v$ . By Corollary 9.12, they commute. ∎

On the theoretical side, it would be interesting to see how many of the hard classical theorems in the subject can be re-proven with elementary techniques. Specifically, the proof of Lemma 9.3 suggests an elementary proof of Quillen’s exactness theorem. The other hard theorem is the integrability of the characteristic variety, and a proof of that theorem using Guillemin’s original formulation is the subject of [GQS70]. That result was applied immediately to study primitive Lie pseudogroups.

10. Prolongation

How does the characteristic scheme change under prolongation? The short answer is that it does not! This does not depend on endovolutivity or involutivity.

Recall that $A^{(1)}$ is a tableau within $A\otimes V^{*}$ . An element of $A^{(1)}$ is $P\in A\otimes V^{*}$ . Using any bases for $V,W,A$ , we may write $P$ as $P^{a}_{i,j}z_{a}\otimes u^{i}\otimes u^{j}$ , with the additional condition that $P^{a}_{i,j}=P^{a}_{j,i}$ from (4.9). Let $\operatorname{\mathscr{C}}^{(1)}$ denote the rank-1 elements of $A^{(1)}$ , and let $\Xi^{(1)}$ denote its projection to $V^{*}$ , as in Section 6(b).

Theorem 10.1.

If $\pi\otimes\xi\in\operatorname{\mathscr{C}}^{(1)}$ , then $\pi=w\otimes\xi\in\operatorname{\mathscr{C}}$ for some $w\in\ker\sigma_{\xi}$ . Conversely, if $w\otimes\xi\in\operatorname{\mathscr{C}}$ , then $(w\otimes\xi)\otimes\xi\in\operatorname{\mathscr{C}}^{(1)}$ . In particular, $\Xi\cong\Xi^{(1)}$ as schemes.

Proof.

Suppose that $\pi\otimes\xi\in\operatorname{\mathscr{C}}^{(1)}$ for some $\pi\in A$ and $\xi\in V^{*}$ . That is, $P\in A^{(1)}$ and $P=\pi\otimes\xi$ , so $P^{a}_{i,j}=\pi^{a}_{i}\xi_{j}$ , and $\pi^{a}_{i}\xi_{j}=\pi^{a}_{j}\xi_{i}$ for all $a,i,j$ .

Let $\underline{\lambda}$ be the minimum index such that $\xi_{\underline{\lambda}}\neq 0$ . Then $\pi^{a}_{\underline{\lambda}}\xi_{i}=\pi^{a}_{i}\xi_{\underline{\lambda}}$ , so column $i$ of $(\pi^{a}_{i})$ is a multiple—namely $\xi_{i}/\xi_{\underline{\lambda}}$ —of column $\underline{\lambda}$ for all $i$ . Therefore, $(\pi^{a}_{i})$ is rank-1, and there is some $w$ with $\pi=w\otimes\xi$ . The converse is immediate. ∎
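The linear-algebra step of this proof can be sketched as follows. Given a matrix $(\pi^{a}_{i})$ and a nonzero $\xi$ satisfying $\pi^{a}_{i}\xi_{j}=\pi^{a}_{j}\xi_{i}$, the code recovers $w$ by dividing column $\underline{\lambda}$ by $\xi_{\underline{\lambda}}$. The function names and the sample data are hypothetical, and the sketch checks only the rank-1 factorization; it does not test membership in $A$ or in $\ker\sigma_{\xi}$.

```python
from fractions import Fraction


def satisfies_symmetry(pi, xi):
    """Check pi[a][i] * xi[j] == pi[a][j] * xi[i] for all a, i, j."""
    n = len(xi)
    return all(row[i] * xi[j] == row[j] * xi[i]
               for row in pi for i in range(n) for j in range(n))


def factor_rank_one(pi, xi):
    """Return w with pi = w (x) xi, using the first index where xi is nonzero."""
    lam = next(i for i, x in enumerate(xi) if x != 0)
    return [Fraction(row[lam]) / Fraction(xi[lam]) for row in pi]


def outer(w, xi):
    """The rank-1 matrix w (x) xi, stored as a list of rows indexed by a."""
    return [[wa * x for x in xi] for wa in w]


# Hypothetical data: xi has a leading zero, so the minimal index is not the first.
xi = [0, 2, -3]
pi = outer([1, 4], xi)

assert satisfies_symmetry(pi, xi)
w = factor_rank_one(pi, xi)
print(w, outer(w, xi) == pi)
```

A matrix that fails the symmetry condition, such as one with two independent columns, is rejected by `satisfies_symmetry` before any factorization is attempted.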

*Remark 10.2**.*

Theorem 10.1 is used sometimes as a method for computing the characteristic variety, as follows: Given a tableau $(\pi^{a}_{i})$ whose entries might depend on $e\in M^{(1)}$, consider $(\xi_{i})\mapsto(\pi^{a}_{i}\xi_{j}-\pi^{a}_{j}\xi_{i})$ as a map $V^{*}\to W\otimes\wedge^{2}V^{*}$; that is, a map from $\mathbb{C}^{n}$ to $\mathbb{C}^{r\binom{n}{2}}$. For a general point $\xi\in V^{*}$, this map has rank at least 1. Its rank falls to 0 if and only if $\xi\in\Xi$. But, this method is inefficient. If you have $(\pi^{a}_{i})$ in hand and want to compute $2\times 2$ minors of something, you would save ink by computing the $2\times 2$ minors of $(\pi^{a}_{i})$ itself to find $\operatorname{\mathscr{C}}$.

11. Characteristic Sheaf

For a single endomorphism, the characteristic polynomial and the Jordan block decomposition of generalized eigenspaces together reveal all of the information that is independent of coordinates.

The ultimate conclusion of the preceding sections is that, for an abstract tableau $A$, the characteristic sheaf $\mathscr{M}$ knows the dimensions $n$, $r$, $(s_{1},\ldots,s_{n})$, as well as all of the dimensions and relationships among the mutual eigenspaces of the various symbol maps. The rank-1 cone $\operatorname{\mathscr{C}}$ knows the algebraic relationships among the sequences of eigenvalues (which we call $\xi$), and it also knows on which subspaces the symbol maps commute and on which fail to commute. In summary, $\mathscr{M}$ and $\operatorname{\mathscr{C}}$ together know everything about an abstract tableau $A$ that is independent of coordinates. (We revealed this fact using special bases, but as with traditional Jordan normal form, there is an abstract structure independent of basis that is easiest to see by building an adapted basis.) Moreover, they are invariant under prolongation!

If the abstract tableau $A$ is a smooth projective bundle, then this applies to involutive Kähler-regular exterior differential systems in the smooth category.

If this formal perspective is appealing, then one might as well dispense with tableaux, symbols, Grassmann bundles, and differential ideals, and instead study the sheaf $\mathscr{M}$ directly, with modern algebraic tools such as [Eis05]. Consider $\mathscr{M}$ as an ideal in

[TABLE]

and consider its free resolution. The Hilbert syzygy theorem states that there is a finite free resolution that is characterized by its Hilbert polynomial $h_{\mathscr{M}}(d)$ . Of course, Theorem 7.11 is reading the leading term of $h_{\mathscr{M}}(d)$ !

One might ask how the involutivity of $A$ can be detected as an algebraic property of $\mathscr{M}$ . The answer is tied to Castelnuovo–Mumford regularity, which measures the growth of the Hilbert polynomial. This computation is equivalent to the Cartan characters in Cartan’s test!

While it is not necessarily a useful computational tool versus differential forms or tableaux, this perspective allows a broader view of the techniques in PDE analysis, and it suggests that future progress in the field will emphasize invariant algebraic techniques.

For more on this perspective, see [Mal03], [BCG+90, Chapter VIII], and the notes by Mark Green from the 2013 conference New Directions in Exterior Differential Systems in Estes Park, Colorado, which are based on the perspective in [CGG09].

Part IV Eikonal Systems

In Part III, we studied the characteristic scheme defined over $M^{(1)}\subset\operatorname{Gr}_{n}(TM)$ . In this part, we turn our attention to the characteristic scheme as pulled back to an integral manifold $\iota:N\to M$ . This is where the meaning of $\Xi$ as “directions with an ambiguous initial value problem” has clear implications for the internal structure of solutions of a differential equation, as the eikonal system yields intrinsic foliations of integral manifolds $N$ .

12. General Eikonal Systems

First, let us consider the general notion of “eikonal equations” of a projective variety, without specific regard to the characteristic variety.

Consider a smooth manifold $N$ of dimension $n$ . Here are three ways to produce a smooth local hypersurface $H\subset N$ .

(i) The implicit function theorem says that a smooth hypersurface $H\subset N$ is defined locally by a smooth function $f:N\to\mathbb{R}$, where $T_{x}H=\ker\mathrm{d}f$ for all $x\in H$.

(ii) By the Frobenius theorem, this is equivalent to having a local smooth section $\varphi$ of $T^{*}N=\Omega^{1}(N)$ such that $\mathrm{d}\varphi\equiv 0\mod\varphi$, for then $\varphi$ is a rescaling of some $\mathrm{d}f$.

(iii) We can also look at the Frobenius theorem from the perspective of Cartan–Kähler theory, as in Theorem 17. (Although Theorem 17 applies as stated only in the analytic category, it can be extended to the smooth category in this case. This sort of extension is explored in Section 14.) To make a local function $f:N\to\mathbb{R}$ or a local section $\varphi$ of $T^{*}N$, consider the jet space $\mathbb{J}^{1}(N,\mathbb{R})$, which is isomorphic to the bundle $T^{*}N\times\mathbb{R}$. Jet space is an open neighborhood (or local linearization) of $\operatorname{Gr}_{n}(N\times\mathbb{R})$ equipped with local coordinates $(x^{i},p_{i},y)=(x^{1},\ldots,x^{n},p_{1},\ldots,p_{n},y)$ and a contact system $\mathcal{J}$ generated by $\Upsilon=\mathrm{d}y-p_{i}\mathrm{d}x^{i}$ and $\mathrm{d}\Upsilon$, as in Section 3(a). In these local coordinates, set the independence condition $\boldsymbol{\omega}=\mathrm{d}x^{1}\wedge\cdots\wedge\mathrm{d}x^{n}\neq 0$. Any $n$-dimensional integral manifold of the exterior differential system $(T^{*}N\times\mathbb{R},\mathcal{J},\boldsymbol{\omega})$ corresponds to a function $y=f(x^{1},\ldots,x^{n})$ with $p_{i}=\frac{\partial f}{\partial x^{i}}$, so we may take $\varphi=\mathrm{d}f=\frac{\partial f}{\partial x^{i}}\mathrm{d}x^{i}$. It is easy to see that this exterior differential system has no torsion and has a Kähler-regular tableau with Cartan characters $s_{1}=s_{2}=\cdots=s_{n}=1$. That is, integral manifolds are parametrized by 1 function of $n$ variables (hardly a surprise).
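As a small worked case of the third formulation, take $n=2$ (a choice made here only for illustration). On the graph of $y=f(x^{1},x^{2})$ with $p_{i}=\frac{\partial f}{\partial x^{i}}$, both generators of the contact system pull back to zero; the second vanishes by the equality of mixed partial derivatives.

```latex
\begin{aligned}
\Upsilon
  &= \mathrm{d}y - p_{1}\,\mathrm{d}x^{1} - p_{2}\,\mathrm{d}x^{2}
  \;\longmapsto\;
  \mathrm{d}f - \frac{\partial f}{\partial x^{1}}\,\mathrm{d}x^{1}
             - \frac{\partial f}{\partial x^{2}}\,\mathrm{d}x^{2} = 0, \\
\mathrm{d}\Upsilon
  &= -\mathrm{d}p_{1}\wedge\mathrm{d}x^{1} - \mathrm{d}p_{2}\wedge\mathrm{d}x^{2}
  \;\longmapsto\;
  \left(
    \frac{\partial^{2} f}{\partial x^{2}\,\partial x^{1}}
    - \frac{\partial^{2} f}{\partial x^{1}\,\partial x^{2}}
  \right)\mathrm{d}x^{1}\wedge\mathrm{d}x^{2} = 0.
\end{aligned}
```

The second line uses $\mathrm{d}p_{i}=\frac{\partial^{2}f}{\partial x^{j}\,\partial x^{i}}\mathrm{d}x^{j}$ on the graph, so only the off-diagonal second derivatives survive the wedge product.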

Now, consider a projective subbundle $\Sigma_{N}\subset\mathbb{P}T^{*}N$ , meaning it is defined smoothly by homogeneous functions in the local fiber variables $(p_{i})$ of $T^{*}N$ . We want a test that tells us whether there exist hypersurfaces $H$ for which $\mathrm{d}f\in\Sigma_{N}$ everywhere. Specifically, we want a theorem like the following.

Theorem 12.1.

Suppose that the eikonal system (defined below) of $\Sigma_{N}$ is involutive. Then for any smooth point $[\varphi]\in(\Sigma_{N})_{x}$ , there is a smooth hypersurface $H\subset N$ such that $(T_{x}H)^{\perp}=[\varphi]$ and such that $(T_{\tilde{x}}H)^{\perp}$ lies in the smooth locus of $(\Sigma_{N})_{\tilde{x}}$ for all $\tilde{x}\in H$ .

Because the hypersurface $H$ and the 1-form $\varphi$ are not chosen a priori, this condition is difficult to interpret using the above formulations (i) and (ii) of hypersurfaces; however, the third formulation on $T^{*}N\times\mathbb{R}$ is well-suited to this theorem. Consider the inclusion $\psi:\hat{\Sigma}_{N}\times\mathbb{R}\to\mathbb{J}^{1}(N,\mathbb{R})$. (Recall that $\hat{\ }$ indicates the affine de-projectivization of a projective variety, resulting in a cone.) The eikonal system of $\Sigma_{N}$ is the exterior differential system $\operatorname{\mathscr{E}}(\Sigma_{N})=\psi^{*}(\mathcal{J})$ on $\hat{\Sigma}_{N}\times\mathbb{R}$; that is, $\operatorname{\mathscr{E}}(\Sigma_{N})$ is generated by $\psi^{*}(\Upsilon)$ and $\psi^{*}(\mathrm{d}\Upsilon)$ and has independence condition $\mathrm{d}x^{1}\wedge\cdots\wedge\mathrm{d}x^{n}\neq 0$. An integral manifold of $\operatorname{\mathscr{E}}(\Sigma_{N})$ corresponds to a hypersurface in $N$ whose tangent space is annihilated by a section of $\hat{\Sigma}_{N}$.

We do not prove involutivity of $\mathcal{E}(\Sigma_{N})$ in any significant case here; it is typically extremely deep and difficult, and references are provided below. However, the situation in Theorem 12.1 has several interesting consequences and interpretations.

Corollary 12.2.

Suppose that the eikonal system of $\Sigma_{N}$ is involutive. Let $\ell-1$ denote the projective fiber dimension of $\Sigma_{N}$ . The hypersurfaces guaranteed by Theorem 12.1 depend on $\ell$ functions of 1 variable.

Proof.

Fix $[\varphi]\in(\Sigma_{N})_{x}$. We work locally near $\varphi$, so we may assume $N$ is open, connected, and simply connected, and that $T^{*}N=N\times\mathbb{R}^{n}$. (In fact, we work microlocally in the bundle. Microlocally means that we are working over a contractible neighborhood of the base space with a local trivialization of the bundle, and also within a neighborhood in the fiber.) Because $\hat{\Sigma}_{N}$ is smooth with affine fiber dimension $\ell$ in $T^{*}N$, we may choose local coordinates $(q_{1},\ldots,q_{n})$ on each fiber of $T^{*}N$ near $\varphi$ such that $\hat{\Sigma}_{N}$ is defined by $q_{\ell+1}=\cdots=q_{n}=0$ near $\varphi$.

For each $\lambda=1,\ldots,\ell$ , let $\sigma^{\lambda}\in(\Sigma_{N})_{x}$ denote the lines of 1-forms specified as

[TABLE]

nonzero in the $\lambda$ slot, in these coordinates. By Theorem 12.1, there is a local hypersurface $H_{\lambda}\subset N$ and a corresponding local function $x^{\lambda}$ such that $\mathrm{d}x^{\lambda}\sim\sigma^{\lambda}$. Complete $x^{1},\ldots,x^{\ell}$ to a local coordinate system $(x^{i})$ on $N$, and let $p_{i}$ be the canonical Darboux coordinates (that is, roughly corresponding to $\frac{\partial y}{\partial x^{i}}$) on the fiber of $T^{*}N$. Note that $p_{i}(\mathrm{d}x^{\lambda})=\delta^{\lambda}_{i}$ by construction, so $\hat{\Sigma}_{N}$ is defined by $p_{\ell+1}=\cdots=p_{n}=0$. (Note that the open neighborhood of $T^{*}N$ around $\varphi$ may have shrunk during this process, which is why this is microlocal.)

Therefore, the contact system on $T^{*}N\times\mathbb{R}$ is generated in a neighborhood of $\varphi$ by $\Upsilon=\mathrm{d}y-p_{i}\mathrm{d}x^{i}$ , which pulls back to $\hat{\Sigma}_{N}\times\mathbb{R}$ as

[TABLE]

The corresponding tableau is the space of $1\times\ell$ matrices with entries $\mathrm{d}p_{\lambda}$ for $\lambda=1,\ldots,\ell$, so it has $s_{1}=s_{2}=\cdots=s_{\ell}=1$. ∎

This is an interesting proof, using all three perspectives of hypersurfaces. The implicit function theorem on the fiber provides local coordinates on the base by involutivity. Then, the Frobenius theorem on the base produces contact coordinates on the fiber that are compatible with the original fiber coordinates. It is easy to adapt this proof to the following corollary, which is useful for constructing coordinates in some situations, as in [Smi14].

Corollary 12.3.

For any $\Sigma_{N}$ , let $\left\langle\Sigma_{N}\right\rangle$ denote its linear span, which is itself a projective subbundle of $\mathbb{P}T^{*}N$ . If $\operatorname{\mathscr{E}}(\Sigma_{N})$ is involutive, then $\operatorname{\mathscr{E}}(\left\langle\Sigma_{N}\right\rangle)$ is involutive.

We will now examine several interpretations of the eikonal system that tie together various branches of geometry. Compare Sections 12(a), 12(b), and 13 to [BCG+90, V§3(vi)].

12(a). as Lagrangian Geometry

The $\mathbb{R}$ term in $T^{*}N\times\mathbb{R}$ plays little role for the eikonal system $\mathcal{E}(\Sigma_{N})$ . It is there merely to make obvious the relationship between the eikonal equations and hypersurfaces.

Instead, consider the symplectic manifold $T^{*}N$ with symplectic 2-form $\mathrm{d}\Upsilon$, which is expressed in local coordinates as $\mathrm{d}\Upsilon=-\mathrm{d}p_{i}\wedge\mathrm{d}x^{i}$ according to Darboux’s theorem. The Lagrangian Grassmannian $LG(N)$ is the bundle over $T^{*}N$ whose fiber is all the Lagrangian $n$-planes

[TABLE]

Each fiber is isomorphic to the homogeneous space $LG(n,2n)$, which is the variety of $n$-planes in $\mathbb{R}[x^{1},\ldots,x^{n},p_{1},\ldots,p_{n}]$ on which $\mathrm{d}p_{i}\wedge\mathrm{d}x^{i}=0$. If we consider a plane $e\in LG(n,2n)$ for which $\mathrm{d}x^{1}\wedge\cdots\wedge\mathrm{d}x^{n}\neq 0$, then $\mathrm{d}p_{i}=P_{i,j}(e)\mathrm{d}x^{j}$ on $e$ with $P_{i,j}=P_{j,i}$. Hence, the non-vertical open neighborhood of $LG(n,2n)$ is identified with the space of symmetric $n\times n$ matrices, $\operatorname{Sym}^{2}(\mathbb{R}^{n})$.
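The identification of non-vertical Lagrangian planes with symmetric matrices can be checked directly on small examples. The following is a minimal sketch in plain Python, assuming the plane is given as the graph of $\mathrm{d}p_{i}=P_{i,j}\mathrm{d}x^{j}$; the function names `omega`, `graph_basis`, and `is_lagrangian` are illustrative and are not notation from these notes.

```python
def omega(u, v, n):
    """Evaluate dp_i ^ dx^i on vectors u, v, each listed as (dx^1..dx^n, dp_1..dp_n)."""
    return sum(u[n + i] * v[i] - v[n + i] * u[i] for i in range(n))


def graph_basis(P):
    """Basis e_j = d/dx^j + P[i][j] d/dp_i of the n-plane on which dp_i = P[i][j] dx^j."""
    n = len(P)
    return [[1 if i == j else 0 for i in range(n)] + [P[i][j] for i in range(n)]
            for j in range(n)]


def is_lagrangian(P):
    """True when dp_i ^ dx^i vanishes on every pair of basis vectors of the graph of P."""
    n = len(P)
    basis = graph_basis(P)
    return all(omega(a, b, n) == 0 for a in basis for b in basis)


print(is_lagrangian([[1, 2], [2, 3]]))  # symmetric P
print(is_lagrangian([[1, 2], [0, 3]]))  # non-symmetric P
```

On the basis vectors $e_{a},e_{b}$ the 2-form evaluates to $P_{b,a}-P_{a,b}$, so the first call reports a Lagrangian plane and the second does not. The overall sign of the 2-form plays no role in this test.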

Suppose the de-projectivized affine subvariety $\hat{\Sigma}_{N}\subset T^{*}N$ is defined smoothly by homogeneous functions in the local fiber variables $(p_{i})$ of $T^{*}N$ . From this perspective, the eikonal system $\mathcal{E}(\Sigma_{N})$ is measuring the intersection of $\operatorname{Gr}_{n}(T_{\varphi}\Sigma_{N})$ with $LG_{\varphi}(N)$ for all $\varphi\in\Sigma_{N}$ .

Corollary 12.5.

The eikonal system $\mathcal{E}(\Sigma_{N})$ is involutive if and only if there are local coordinates of $T^{*}N$ near $\varphi\in\hat{\Sigma}_{N}$ in which the non-vertical open set in $\operatorname{Gr}_{n}(T\Sigma_{N})\cap LG(N)$ is described as the $n\times n$ symmetric matrices $P_{i,j}(e)$ that vanish outside the upper-left $\ell\times\ell$ part.

Proof.

If the eikonal system $\operatorname{\mathscr{E}}(\Sigma_{N})$ is involutive, then we may construct coordinates as in Corollary 12.2 such that the de-projectivized affine variety $\hat{\Sigma}_{N}$ is defined by $p_{\varrho}=0$ for all $\varrho>\ell$ , so $T_{\varphi}\hat{\Sigma}_{N}$ is defined by $\mathrm{d}p_{\varrho}=0$ for all $\varrho>\ell$ . In such coordinates, the open neighborhood of the Lagrangian Grassmannian takes the block form

[TABLE]

using our index convention (1.8) from Section 1. The condition $e\in T\Sigma_{N}$ implies $\mathrm{d}p_{\varrho}=0$ , so the lower blocks are zero. The matrix is symmetric, so the upper-right block is zero.

Conversely, suppose such coordinates exist. Then $T\hat{\Sigma}_{N}$ satisfies the closed 1-forms $\mathrm{d}p_{\varrho}=0$ , and the dimensions match, so $\Sigma_{N}$ satisfies $p_{\varrho}=\text{constant}$ . Since the equations defining $\Sigma_{N}$ are homogeneous, it must be $p_{\varrho}=0$ . Using these coordinates for $T^{*}N\times\mathbb{R}$ and $\mathcal{J}$ yields $\psi^{*}(\Upsilon)=\mathrm{d}y-p_{\lambda}\mathrm{d}x^{\lambda}$ , as in Corollary 12.2, which is involutive with the correct Cartan characters and gives the desired hypersurfaces in Theorem 12.1. ∎
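The block argument in the first half of the proof can be written out explicitly. This is a sketch with labels chosen here for illustration, assuming the index ranges $\lambda,\mu\leq\ell<\varrho,\varsigma$; the displayed matrix in the proof may use different labels.

```latex
% Assumed index ranges: \lambda,\mu \in \{1,\ldots,\ell\} and \varrho,\varsigma \in \{\ell+1,\ldots,n\}.
\begin{aligned}
(P_{i,j}) &=
\begin{pmatrix}
P_{\lambda,\mu} & P_{\lambda,\varsigma}\\
P_{\varrho,\mu} & P_{\varrho,\varsigma}
\end{pmatrix},
\\
\mathrm{d}p_{\varrho}\big|_{e}=P_{\varrho,j}(e)\,\mathrm{d}x^{j}=0
&\;\Longrightarrow\; P_{\varrho,\mu}=0,\quad P_{\varrho,\varsigma}=0,
\\
P_{i,j}=P_{j,i}
&\;\Longrightarrow\; P_{\lambda,\varsigma}=P_{\varsigma,\lambda}=0 .
\end{aligned}
```

Only the symmetric upper-left block $P_{\lambda,\mu}$ survives, which is the description in Corollary 12.5.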

Compare this to Proposition 3.22 in [BCG+90, Chapter V]. For more symplectic and Lagrangian geometry, see [Bry93].

12(b). as Poisson Brackets

If $T^{*}N$ describes the state of a physical system, a function $F:T^{*}N\to\mathbb{R}$ is called an observable [SW86]. The Poisson bracket of observables is the operation given in local coordinates by

[TABLE]

The Poisson bracket plays a fundamental role in Hamiltonian mechanics and the relationship between symmetries and conservation laws in physics. This is because (12.7) is a Lie bracket on $C^{\infty}(T^{*}N)$ . (See [Bry93] for details.)

Suppose that $O$ is some subspace of $C^{\infty}(T^{*}N)$ , so that $O$ is a nonempty set of smooth observables that is closed under linear combinations. Suppose also that $\boldsymbol{\{}F,G\boldsymbol{\}}\in O$ for all $F,G\in O$ . Then, $O$ is a Lie subalgebra of $C^{\infty}(T^{*}N)$ with respect to the Poisson bracket.

Because $\Sigma_{N}\subset\mathbb{P}T^{*}N$ is a projective variety in each fiber, the de-projectivized affine subvariety $\hat{\Sigma}_{N}\subset T^{*}N$ is defined smoothly by observables that take the form of homogeneous functions in the local fiber variables $(p_{i})$ of $T^{*}N$ . For convenience, let us make the additional assumption that the homogeneous functions are algebraic of degree $d$ in $(p_{i})$ , so that $\hat{\Sigma}_{N}$ is defined smoothly near $\varphi\in\hat{\Sigma}_{N}$ for $\varphi\neq 0$ by a set of equations in multi-index form

[TABLE]

Corollary 12.9.

Let $O$ denote the module in $S=C^{\infty}(N)[p_{1},\ldots,p_{n}]$ generated by (12.8). The eikonal system $\mathcal{E}(\Sigma_{N})$ is involutive if and only if $\boldsymbol{\{}O,O\boldsymbol{\}}\subset O$ . That is, $\mathcal{E}(\Sigma_{N})$ is involutive if and only if the module $O$ is a Lie algebra with respect to the Poisson bracket.

A proof—which does not depend on the polynomial form (12.8)—can be derived from Corollary 12.5 along with the observation that the Poisson bracket can be defined in a coordinate-free way as the operator such that

[TABLE]
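The closure condition of Corollary 12.9 can be tested on small polynomial examples. The sketch below is plain Python with polynomials stored as dictionaries from exponent tuples to coefficients. It assumes the sign convention $\{F,G\}=\frac{\partial F}{\partial p_{i}}\frac{\partial G}{\partial x^{i}}-\frac{\partial F}{\partial x^{i}}\frac{\partial G}{\partial p_{i}}$, which may differ from (12.7) by an overall sign; closure under the bracket does not depend on that sign. The helper names and the two sample generators are hypothetical.

```python
from collections import defaultdict

N = 3  # base dimension; exponent tuples are ordered (x^1, x^2, x^3, p_1, p_2, p_3)


def add(f, g, sign=1):
    """Return f + sign*g for polynomials stored as {exponent tuple: coefficient}."""
    h = defaultdict(int)
    for e, c in f.items():
        h[e] += c
    for e, c in g.items():
        h[e] += sign * c
    return {e: c for e, c in h.items() if c != 0}


def mul(f, g):
    """Return the product f*g."""
    h = defaultdict(int)
    for e1, c1 in f.items():
        for e2, c2 in g.items():
            h[tuple(a + b for a, b in zip(e1, e2))] += c1 * c2
    return {e: c for e, c in h.items() if c != 0}


def diff(f, k):
    """Partial derivative of f with respect to the k-th variable."""
    return {e[:k] + (e[k] - 1,) + e[k + 1:]: c * e[k]
            for e, c in f.items() if e[k] > 0}


def poisson(f, g, n=N):
    """{f, g} = sum_i (df/dp_i * dg/dx^i - df/dx^i * dg/dp_i); sign convention assumed."""
    total = {}
    for i in range(n):
        total = add(total, mul(diff(f, n + i), diff(g, i)))
        total = add(total, mul(diff(f, i), diff(g, n + i)), sign=-1)
    return total


p1 = {(0, 0, 0, 1, 0, 0): 1}
p2 = {(0, 0, 0, 0, 1, 0): 1}
p3 = {(0, 0, 0, 0, 0, 1): 1}
x1p3 = {(1, 0, 0, 0, 0, 1): 1}

F, G = p1, add(p2, x1p3)          # F = p_1, G = p_2 + x^1 p_3
bracket_closed = poisson(p2, p3)  # generators of the model case p_2 = p_3 = 0
bracket_open = poisson(F, G)      # generators whose bracket leaves the module
print(bracket_closed, bracket_open)
```

For the generators $p_{2},p_{3}$ the bracket is zero, so the module they generate is closed, in agreement with the normal form $p_{\varrho}=0$ of Corollary 12.5. For $F=p_{1}$ and $G=p_{2}+x^{1}p_{3}$ the bracket is $p_{3}$. Comparing the degree-one parts in $(p_{i})$ shows that $p_{3}$ is not a combination of $F$ and $G$, so by Corollary 12.9 the corresponding eikonal system would not be involutive.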

Equations of the form (12.8) appear in analysis as systems of homogeneous first-order PDEs on $u:\mathbb{R}^{n}\to\mathbb{R}$ of the form

[TABLE]

A famous example is the $n{-}\ell=1$ characteristic equation for the wave equation of Section 6(c):

[TABLE]

This is generalized to any involutive EDS in Section 13.

13. Involutivity of the Characteristic Variety

We would like to apply the entire discussion from Section 12 to the case where $\Sigma_{N}$ is a characteristic variety, but first we must establish that $\Xi$ is well-defined in $\mathbb{P}T^{*}N$ .

Suppose that $\iota:N\to M$ is a connected integral manifold of an involutive exterior differential system $(M,\mathcal{I})$ , and that $\iota^{(1)}(N)$ lies in $M^{(1)}$ , a smooth and Kähler-regular component of $\operatorname{Var}_{n}(\mathcal{I})$ , as in Section 4.

Fix $x\in N$ , and suppose $\iota(x)=p\in M$ and $\iota^{(1)}(x)=e\in M^{(1)}$ . For $\xi\in\Xi_{e}\subset V^{*}_{e}$ , we can consider the pullback $\iota^{(1)*}(\xi)\in\mathbb{P}T^{*}_{x}N\otimes\mathbb{C}$ . In a basis $(\eta^{i})$ of $T^{*}_{x}N$ , we can write a representative as $\xi=\xi_{i}\eta^{i}$ for coefficients $\xi_{i}\in\mathbb{C}$ . As a bundle over $N$ , we have $\iota^{(1)*}(\xi)=\xi_{i}\eta^{i}\in\mathbb{P}T^{*}N\otimes\mathbb{C}=\boldsymbol{\gamma}^{*}_{N}$ . In this sense, we can pull back the characteristic variety—as a set—to $N$ .

More precisely, recall that $\Xi$ has degree $s_{\ell}$ and affine fiber dimension $\ell$, but it is a scheme defined by the characteristic sheaf $\mathscr{M}$. For any local section $(u_{i})$ of the coframe bundle $\mathcal{F}_{\boldsymbol{\gamma}^{*}}\to M^{(1)}$, we can write the characteristic sheaf $\mathscr{M}$ as a homogeneous ideal in the module $C^{\infty}(M^{(1)})[u_{1},\ldots,u_{n}]$. At each $e=\iota^{(1)}(x)\in M^{(1)}$, the coframe $(u_{i})$ is just a complex basis of $e$. Therefore, we obtain a basis for $T_{x}N$ of the form $\eta_{i}=\left(\iota^{(1)}_{*}\right)^{-1}(u_{i})$. That is, in some neighborhood of $x$, the section $(\eta_{i})$ of $\mathcal{F}^{*}N$ is well-defined. Moreover, the stalks of the sheaf $C^{\infty}(M^{(1)})$ can be pulled back, as $\iota^{(1)*}(f)$ is well-defined for any $f$ defined in a neighborhood of $e$. Therefore, we can pull back both the coefficients and the coordinates to define the homogeneous ideal $\mathscr{M}_{N}$ in $C^{\infty}(N)[\eta_{1},\ldots,\eta_{n}]$. Let $\Xi_{N}\subset\mathbb{P}T^{*}N\otimes\mathbb{C}$ be the scheme defined by $\mathscr{M}_{N}$.

Now, the entire discussion from Section 12 applies where $\Sigma_{N}$ is any particular component of $\Xi_{N}$ . We focus our attention on the maximal smooth locus $\Xi_{N}^{o}$ of $\Xi_{N}$ . We know additionally that $\Xi_{N}$ takes the polynomial form (12.8) as derived from (7.14), so it has degree $s_{\ell}$ and fiber dimension $\ell-1$ at smooth points, as a complex projective variety.

Theorem 13.1 (Guillemin–Quillen–Sternberg).

Suppose that $N$ is an ordinary integral manifold of an involutive exterior differential system $\mathcal{I}$ with character $\ell$ and Cartan integer $s_{\ell}$ . The eikonal system of the smooth locus of the (complex) characteristic variety, $\operatorname{\mathscr{E}}(\Xi_{N}^{o})$ , is involutive. At smooth points in $\Xi_{N}$ , the characteristic hypersurfaces are parametrized by 1 function of $\ell$ variables.

Note that our definition of $\Xi_{N}$ is the complex characteristic variety. (Recall that, in the complex case, the distinction between elliptic and hyperbolic second-order PDEs does not occur, because there is only one nondegenerate signature.) Theorem 13.1 is called the “integrability of characteristics.” Cartan demonstrated several examples of this phenomenon in [Car11]. The proof appears in [GQS70], where a major step is the application of Theorem 7.7. Hence, this result appears to rely in an essential way on all three facets of the characteristic variety seen in Part III.

The converse of Theorem 13.1 is not true; it is easy to write down non-involutive exterior differential systems for which $\operatorname{\mathscr{E}}(\Xi_{N})$ is involutive.

However, in [Gab81], Ofer Gabber proved a more general form of Theorem 13.1 that was conjectured in [GQS70] and that removes practically all of the technical assumptions. Phrased as Theorem 13.2, Gabber’s theorem recalls the ideas of Section 12(b).

Theorem 13.2 (Gabber).

Let $S$ be a filtered ring whose graded ring $gr(S)$ is a Noetherian commutative algebra over $\mathbb{Q}$. Let $M$ be a $gr(S)$-ideal that is finitely generated as an $S$-module. Then $\boldsymbol{\{}\sqrt{M},\sqrt{M}\boldsymbol{\}}\subset\sqrt{M}$.

In our context, Gabber’s theorem applies to the case where $S=C^{\infty}(N)[p_{1},\ldots,p_{n}]$ , the ring of polynomials in local fiber variables of $T^{*}N$ , filtered by degree. Then, $gr(S)$ is the ring of homogeneous polynomials, graded by degree, which admits a Poisson structure like (12.7). The $gr(S)$ -ideal $M$ is the characteristic sheaf $\mathscr{M}_{N}$ , which by (7.14) is defined by homogeneous polynomials if the original exterior differential system is involutive. By Hilbert’s Nullstellensatz, the radical ideal $\sqrt{M}$ defines the generic component $\Xi_{N}^{o}$ . Thus, the conclusion $\boldsymbol{\{}\sqrt{M},\sqrt{M}\boldsymbol{\}}\subset\sqrt{M}$ invokes Corollary 12.9 to say that the eikonal system $\operatorname{\mathscr{E}}(\Xi_{N}^{o})$ is involutive.

From the general discussion of eikonal systems surrounding Theorem 12.1, the interpretation of these theorems is apparent, in the form of Corollary 13.3.

Corollary 13.3.

Suppose that $N$ is an ordinary integral manifold of an involutive exterior differential system $\mathcal{I}$ with character $\ell$ and Cartan integer $s_{\ell}$. Then $N$ admits a local (possibly complex) coordinate system $(x^{1},\ldots,x^{n})$ such that $\mathrm{d}x^{1},\ldots,\mathrm{d}x^{\ell}\in\Xi_{N}$.

In [Smi14], the linear span of the characteristic variety, $\left\langle\Xi_{N}\right\rangle$, is studied in comparison to the Cauchy retraction space $\mathfrak{g}^{\perp}_{N}=\iota^{*}(\mathfrak{g})$, where $\mathfrak{g}^{\perp}$ is the maximum Frobenius system within $\mathcal{I}$, as in Section 5(b).

Suppose that the affine fiber dimension of $\left\langle\Xi_{N}\right\rangle$ is $L$ and that the affine fiber dimension of $\mathfrak{g}^{\perp}_{N}$ is $\nu$ . These spaces are nested, so $\ell\leq L\leq\nu\leq n$ .

Corollary 13.4.

Suppose that $N$ is an ordinary integral manifold of an involutive exterior differential system $\mathcal{I}$ with character $\ell$ and Cartan integer $s_{\ell}$ . Then $N$ admits a local—possibly complex—coordinate system $(x^{1},\ldots,x^{n})$ such that $\mathrm{d}x^{1},\ldots,\mathrm{d}x^{\ell}\in\Xi_{N}$ , such that $\mathrm{d}x^{\ell+1},\ldots,\mathrm{d}x^{L}\in\left\langle\Xi_{N}\right\rangle$ , and such that $\mathrm{d}x^{L+1},\ldots,\mathrm{d}x^{\nu}\in\mathfrak{g}^{\perp}_{N}$ .

Corollary 13.4 is a simple result, but its proof relies on building a coframe of $N$ in which the nilpotent parts of the commuting symbol maps $\operatorname{B}^{\lambda}_{i}$ are identified clearly; that is, it depends in an essential way on Theorems 13.1 and 5.4. The key point is that it reinforces the following remark.

Remark 13.5 (General Dogma of the Characteristic Variety).

An exterior differential system $(M,\mathcal{I})$ is a geometric object over $M$ , meaning that its key properties are coordinate-invariant. On each Kähler-regular component $M^{(1)}$ , knowing this geometry is equivalent to knowing the characteristic scheme and rank-1 variety over $M^{(1)}$ , which are prolongation-invariant. Moreover, the geometry of an EDS imposes a geometry on its solutions, $\iota:N\to M$ , and this imposition is also dictated by the characteristic scheme and rank-1 variety. Therefore, exterior differential systems can be classified up to coordinate equivalence as “parametrized families of manifolds $N$ with associated characteristic geometry.”

Remark 13.5 is not a theorem; it is an attitude.

To make this remark robust for a general exterior differential system, the scheme separating $\operatorname{Var}_{n}(\mathcal{I})$ into its components $M^{(1)}$ —each component smooth with its own fixed Cartan characters over some subvariety of $M$ —would have to be studied, and very little progress has been made at that level of abstraction. Nonetheless, whenever some property of PDEs is encountered, Remark 13.5 urges us to ask “is this property really invariant, or an artifact of my coordinates?” which is best answered by asking “can this property be reinterpreted using the characteristic scheme?” Sections 14 and 15 discuss progress of this type.

14. Yang’s Hyperbolicity Criterion

One of the great frustrations of the Cartan–Kähler theorem is that it relies on the Cauchy–Kowalevski theorem, so it applies only in the analytic category. One can see its dramatic failure in the smooth category in [Lew57]. However, this frustration has been escaped in some special cases by exploiting the structure of $\Xi$. (If we take the broadest possible interpretation of Remark 13.5 to heart, then any possible escape from analyticity ought to arise from the structure of $\Xi$. However, the reader is cautioned again that a dogma is not a theorem.) For example:

(i)

ODE systems. Suppose that $(M,\mathcal{I})$ is involutive over $C^{\infty}$ and that $\Xi=\emptyset$. Then $\ell=0$, so the tableau $A$ is the trivial (irrelevant) subspace of $W\otimes V^{*}$. The prolonged system $\mathcal{I}^{(1)}$ on $M^{(1)}$ is Frobenius, and $M^{(1)}$ is merely a copy of $M$ whose fiber is the unique element of an integrable distribution. That integrable distribution is the Cauchy retraction space $\mathfrak{g}$ of $\mathcal{I}$ as in Section 5(b), so it must have been that $\mathcal{I}=\mathfrak{g}^{\perp}$. The flow-box theorem foliates $M$ by solutions in the smooth category. (Actually, in the Lipschitz category, by standard ODE theory!) If $N$ is a leaf of this foliation, then removing Cauchy retractions on the original exterior differential system $(M,\mathcal{I})$ yields the exterior differential system $(N,0)$.

(ii)

Empty systems. Suppose that $(M,\mathcal{I})$ is involutive over $C^{\infty}$ and that $\Xi=V^{*}$ with $(s_{1},s_{2},\ldots,s_{n})=(r,r,\ldots,r)$. Then, the tableau $A$ is the total space $W\otimes V^{*}$. Therefore, $M^{(1)}$ is an open domain in $\operatorname{Gr}_{n}(TM)$, so $\mathcal{I}=0$, and there is no condition whatsoever on integral manifolds $\iota:N\to M$; however, the prolongation $\iota^{(1)}:N\to M^{(1)}$ would have to satisfy the contact ideal, forcing some regularity on $N$. (The most extreme and amusing exploitations of the flexibility of $\operatorname{Gr}_{n}(TM)$ come from the homotopy principle [Gro86, EM02].) We studied this EDS in Section 2.

A less trivial special case is presented in [Yan87], which is the subject of this section. (As it happens, the attempt to understand [Yan87] in the context of [BCG+90, Chapter VIII] was the inspiration for computing the details shown in [Smi15] and the entire approach of these notes.)

A tableau $A\subset W\otimes V^{*}$ is called determined if $s_{1}=s_{2}=\cdots=s_{n-1}=r$ and $s_{n}=0$ . That is, $s=(n-1)r$ , so $t=r$ , and $H^{1}(A)\cong W$ . Cartan’s test shows that a determined tableau is always involutive, so we may assume that $A$ is written in endovolutive form as in Theorem 5.4. The only nontrivial symbol endomorphisms in (1.20) are $\operatorname{B}^{\lambda}_{\lambda}=I_{r\times r}$ and $\operatorname{B}^{\lambda}_{n}$ for $\lambda=1,\ldots,n-1$ , like this:

[TABLE]

The quadratic involutivity condition is trivial, which is why Cartan’s test passes automatically.
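The dimension counts for a determined tableau are simple arithmetic, sketched below in plain Python. The function name `determined_counts` is illustrative. The last value is the standard upper bound $s_{1}+2s_{2}+\cdots+ns_{n}$ on the dimension of the prolongation in Cartan's test, stated here as an assumption about the form of the test used in Section 5.

```python
def determined_counts(n, r):
    """Counts for a determined tableau in W (x) V^*, where dim V = n and dim W = r."""
    s = [r] * (n - 1) + [0]   # s_1 = ... = s_{n-1} = r and s_n = 0
    dim_A = sum(s)            # s = (n - 1) r
    t = n * r - dim_A         # dim H^1(A) = dim(W (x) V^*) - dim A
    # Bound s_1 + 2 s_2 + ... + n s_n from Cartan's test (assumed standard form).
    cartan_bound = sum(i * s_i for i, s_i in enumerate(s, start=1))
    return s, dim_A, t, cartan_bound


print(determined_counts(3, 2))
```

For $n=3$ and $r=2$ this gives characters $(2,2,0)$, $\dim A=4$, $t=2=\dim W$, and bound $6$, consistent with $H^{1}(A)\cong W$.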

Lemma 14.2.

Suppose $A$ is determined and written in endovolutive bases. Identify $H^{1}(A)$ with $W$ , and use our endovolutive basis of $W$ for both. Then for any $\varphi\in V^{*}$ , the symbol map $\sigma_{\varphi}:w\mapsto\sigma(w\otimes\varphi)$ from Section 6(b) is

[TABLE]

Then

[TABLE]

and the characteristic ideal $\mathscr{M}$ is generated by

[TABLE]

In particular, $\xi\in\Xi$ if and only if $\xi_{n}$ is an eigenvalue of $\xi_{\lambda}\operatorname{B}^{\lambda}_{n}$ .

Proof.

The first two equations are immediate from our block form. From Part III, we know that $w\otimes\xi\in A$ if and only if $\operatorname{B}(\xi)(v)w=\xi(v)w$ for all $v$ . Therefore, we compute in our endovolutive basis

[TABLE]

That is, $\xi_{n}w=\xi_{\lambda}\operatorname{B}^{\lambda}_{n}w$ . ∎
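The eigenvalue criterion of Lemma 14.2 is easy to test on small cases. The sketch below is plain Python and assumes the sign convention $\sigma_{\xi}=\xi_{\lambda}\operatorname{B}^{\lambda}_{n}-\xi_{n}I_{r}$, inferred from the example following Definition 14.8; an overall sign does not affect the vanishing of the determinant. The function names and the sample matrix are hypothetical.

```python
def det(M):
    """Determinant by cofactor expansion along the first row (adequate for small r)."""
    if len(M) == 1:
        return M[0][0]
    return sum((-1) ** j * a * det([row[:j] + row[j + 1:] for row in M[1:]])
               for j, a in enumerate(M[0]))


def symbol(xi, Bs):
    """Matrix xi_lambda B^lambda_n - xi_n I_r, where Bs = [B^1_n, ..., B^{n-1}_n]."""
    n, r = len(xi), len(Bs[0])
    return [[sum(xi[l] * Bs[l][a][b] for l in range(n - 1))
             - (xi[n - 1] if a == b else 0)
             for b in range(r)] for a in range(r)]


def is_characteristic(xi, Bs):
    """True when xi_n is an eigenvalue of xi_lambda B^lambda_n."""
    return det(symbol(xi, Bs)) == 0


# Hypothetical example with n = 2, r = 2: a single matrix B^1_2 with eigenvalues +1 and -1.
Bs = [[[0, 1], [1, 0]]]
for xi in [(1, 1), (1, -1), (1, 0), (0, 1)]:
    print(xi, is_characteristic(xi, Bs))
```

In this example the characteristic variety consists of the two real points $[1:1]$ and $[1:-1]$. The covectors $(1,0)$ and $(0,1)$ are not characteristic, so for them the symbol map is invertible.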

Corollary 14.7.

Consider a determined tableau as in Lemma 14.2. Fix an integral element $e$ . Suppose that $e^{\prime}$ is a real hyperplane in $e$ such that $(e^{\prime})^{\perp}\otimes\mathbb{C}=\varphi\in V^{*}$ and $\varphi\not\in\Xi$ . Then $\sigma_{\varphi}:W\to H^{1}(A)$ is an isomorphism.

Proof.

By Lemma 14.2, we have $\ker\sigma_{\varphi}\neq 0$ if and only if $\varphi\in\Xi$ . ∎

Definition 14.8.

Suppose $e^{\prime}$ is a real hyperplane in $e$ corresponding to the real covector $\varphi=(e^{\prime})^{\perp}\in\mathbb{P}e^{*}$ . The real hyperplane is called space-like if the following conditions hold.

(i)

$\varphi\otimes\mathbb{C}\not\in\Xi_{e}$.

(ii)

For any $\eta\in\mathbb{P}e^{*}$, there is a real basis of $W$ in which $(\sigma_{\varphi})^{-1}(\sigma_{\eta}):W\to W$ is real and diagonal.

(iii)

The above choice of basis is a smooth function of $[\eta]\in e^{*}/\varphi=(e^{\prime})^{*}$ .

A determined tableau $A\subset W\otimes V^{*}$ is called determined hyperbolic if $V$ admits a (real) space-like hyperplane.

Here is a simple example using our notation from Lemma 14.2. Fix $n=3$. To meet the first condition, suppose that $\varphi=1u^{1}+0u^{2}+0u^{3}$ is not in $\Xi$. Then $\sigma_{\varphi}=\operatorname{B}^{1}_{3}$, and $0$ is not an eigenvalue of $\sigma_{\varphi}$, which of course implies that $\sigma_{\varphi}=\operatorname{B}^{1}_{3}$ is invertible. Say $\eta=0u^{1}+1u^{2}+\tau u^{3}$, so that $\sigma_{\eta}=\operatorname{B}^{2}_{3}-\tau I_{r}$. The second condition is that $(\operatorname{B}^{1}_{3})^{-1}\left(\operatorname{B}^{2}_{3}-\tau I_{r}\right)$ is diagonalizable using some change-of-basis $g_{\tau}$. The third condition is that $g_{\tau}$ is continuous in the projective variable $\tau$. Suppose moreover that we take our basis such that the basis-change at $\tau=0$ is $g_{0}=I$. Then we have the condition that $(\operatorname{B}^{1}_{3})^{-1}\operatorname{B}^{2}_{3}$ is a diagonal matrix, $D$. This puts restrictions on the possible forms of these matrices. For example, $\ker\operatorname{B}^{2}_{3}=\ker D$ and $\operatorname{im}\operatorname{B}^{2}_{3}\subset\operatorname{im}\operatorname{B}^{1}_{3}$.
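The simplest instance of this example takes both matrices diagonal. The following sketch assumes $r=2$ and real constants $a_{1},a_{2},b_{1},b_{2}$ with $a_{1}a_{2}\neq 0$; these constants are hypothetical, and only covectors of the form $\eta=u^{2}+\tau u^{3}$ are considered, as in the paragraph above.

```latex
% Assumed data: r = 2, real constants a_1, a_2 \neq 0 and b_1, b_2.
\begin{aligned}
\operatorname{B}^{1}_{3} &= \begin{pmatrix} a_{1} & 0\\ 0 & a_{2}\end{pmatrix},
\qquad
\operatorname{B}^{2}_{3} = \begin{pmatrix} b_{1} & 0\\ 0 & b_{2}\end{pmatrix},
\\
(\operatorname{B}^{1}_{3})^{-1}\left(\operatorname{B}^{2}_{3}-\tau I_{2}\right)
&= \begin{pmatrix} (b_{1}-\tau)/a_{1} & 0\\ 0 & (b_{2}-\tau)/a_{2}\end{pmatrix},
\qquad g_{\tau}=I_{2}\ \text{for all real } \tau .
\end{aligned}
```

Here $a_{1}a_{2}\neq 0$ says that $0$ is not an eigenvalue of $\operatorname{B}^{1}_{3}$, the product is real and diagonal for every real $\tau$, and the change of basis is constant in $\tau$. In this case $D=\operatorname{diag}(b_{1}/a_{1},b_{2}/a_{2})$, so $\ker\operatorname{B}^{2}_{3}=\ker D$ and $\operatorname{im}\operatorname{B}^{2}_{3}\subset\operatorname{im}\operatorname{B}^{1}_{3}=W$.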

Definition 14.9.

A tableau $A\subset W\otimes V^{*}$ is called hyperbolic if $V$ admits a flag given by a basis $(u^{1},\ldots,u^{n})$ of $V^{*}$ such that each of the sequential initial value problems from $\left\langle u^{i},\ldots,u^{n}\right\rangle^{\perp}$ to $\left\langle u^{i+1},\ldots,u^{n}\right\rangle^{\perp}$ has a hyperbolic determined tableau.

Theorem 14.10 (Yang).

Theorem 17 applies in the smooth category, if $A$ is hyperbolic.

The proof proceeds by replacing the Cauchy–Kowalevski initial-value problem with the Cauchy initial-value problem for determined first-order quasilinear hyperbolic PDEs. See [Yan87] and Appendix A of [Kam89] for more details.

Clearly the definition of hyperbolic depends on the geometry of $\Xi$ and the symbol maps $\operatorname{B}^{\lambda}_{i}$ ; however, to the author’s knowledge no one has succeeded in writing down the explicit criteria on $\operatorname{B}^{\lambda}_{i}$ or $\operatorname{\mathscr{C}}$ or $\Xi$ for general hyperbolicity. Hence, Yang’s condition is not yet available to computer algebra systems. If that can be accomplished, it means we can identify a subvariety of the moduli of involutive tableaux—as in Section 5(a)—that admit solutions in the smooth category.

One well-understood special case is when $\ell=1$ , so $\Xi_{e}$ contains $s_{1}$ real points (with multiplicity). If the number of distinct points is sufficiently large (greater than $n$ ), then this is the situation for hyperbolic systems of conservation laws, as in [Tsa91]. The eikonal system is rigid, so each solution is foliated by $s_{1}$ characteristic hypersurfaces. Multiplicity corresponds to nilpotent pieces of the generalized eigenspaces of the symbol endomorphisms $\operatorname{B}^{1}_{i}$ . See again Section 8.

15. Open Problems and Future Directions

Our perspective here has been simple-minded, focusing on matrices and their computable properties, to gain intuition of $\Xi$ and $\operatorname{\mathscr{E}}(\Xi)$ as rapidly as possible. The articles [Smi15] and [Smi14] are founded on this perspective, but reveal additional detail in the structures discussed here. For more modern and sophisticated treatment, please see [Mal03], [KL07], and [CGG09]. Additionally, Chapters V–VIII of [BCG+90] contain significantly more results than we have summarized here.

To conclude, here are some interesting questions which—to the author’s present knowledge—are open subjects that represent the major theoretical gaps in the subject of exterior differential systems. They are worth serious consideration as research projects, and offer great opportunities for collaboration between analysts, differential geometers, algebraic geometers, and scientific programmers.

(i)

Variety of involutive tableaux. For given $r$, $n$, and Cartan characters $(s_{1},\ldots,s_{\ell})$, what is the variety of involutive tableaux (with fixed coefficients)? Can we compute its dimension or degree or Hilbert polynomial? Section 5(a) demonstrates a first step toward understanding the variety of involutive tableaux, as Theorem 5.4 gives the ideal in certain bases. However, to answer the question completely, one would need to examine how the coefficients in (5.6) vary under arbitrary changes of basis in $V^{*}$ and $W$.

(ii)

Special hyperbolic integrability criteria. Solution techniques (such as Lax pairs, inverse scattering, hydrodynamic reduction, and Bäcklund transformations) play a key rôle in the analysis of wave-like PDEs, especially those coming from physics and geometry. Given that these techniques are coordinate-invariant, Remark 13.5 suggests that they should all be expressible as algebraic conditions on $\mathscr{M}$. Expressing those conditions in an abstract way over $\Xi$ and $M^{(1)}$ would allow a more systematic geometric approach to many of the ad hoc methods in the analysis of PDEs. (Indeed, the central theme of the conference for which these notes were prepared was to express Ferapontov’s notion of hydrodynamic integrability in terms of algebro-geometric structures in the Lagrangian Grassmannian. The notion of hydrodynamic integrability is tied completely to the secant variety of $\operatorname{\mathscr{C}}$.)

(iii)

Elliptic systems. Consider the classical results regarding elliptic regularity of quasilinear elliptic operators. This is another form of “special integrability criteria.” How far can the notion of elliptic regularity be extended to general exterior differential systems? Certainly the conditions of involutivity, $\left\langle\Xi\right\rangle=V^{*}$, and $\Xi_{\mathbb{R}}=\emptyset$ are necessary, and one can directly translate the classical theorems to an EDS written specifically to describe a quasilinear second-order elliptic operator in local coordinates, but what other technical assumptions can be dropped? Some discussion appears in [BCG+90, Chapter X§3].

(iv)

Moduli of involutive tableaux. Refining the first problem in light of the second and third problems, can we identify invariant sub-varieties of the variety of involutive tableaux? Dogma 13.5 indicates that we should be able to identify subvarieties, such as hyperbolic tableaux, elliptic tableaux, systems satisfying special integrability conditions, and so on. What does it mean when these sub-varieties intersect? Lewy showed that there are involutive PDEs with no solution in the smooth category [Lew57], which cannot happen in the analytic category. Where do the Lewy examples fall in this variety? Are there other subvarieties that have not been observed in classical equations? If there is any organizing geometry behind the “nearly impenetrable jungle” of involutive PDEs, this is where we should look.

(v)

Weakness of involutivity of characteristics. Note that Theorem 13.2 does not regard the involutivity of an exterior differential system in any direct way; the assumption of involutivity of $\mathcal{I}$ enters Theorem 13.2 only because we know that $\mathscr{M}_{N}$ is an ideal of homogeneous polynomials from (7.14). Thus, we expect that the condition “$\Xi_{N}$ is the characteristic scheme of an exterior differential system $\mathcal{I}$, and $\operatorname{\mathscr{E}}(\Xi_{N})$ is involutive” is much weaker than “$\Xi_{N}$ is the characteristic scheme of an exterior differential system $\mathcal{I}$, and $\mathcal{I}$ is involutive.” The gap between these two statements is extremely important to explore, as it goes to the heart of the question about how involutivity leads to solutions of the initial-value problem for a system of PDEs. To put this a different way, can we construct an embedded variety $\Xi_{N}\subset\mathbb{P}T^{*}N$ that is involutive, but for which there is no involutive exterior differential system for which $\Xi$ is the characteristic variety?

(vi)

Global integrability of the characteristic variety. If $A$ is involutive, then the system $\operatorname{\mathscr{E}}(\Xi_{N}^{o})$ is involutive on an ordinary integral manifold, $N$. However, it is not clear whether $\Xi^{o}$ is involutive as a bundle over $M^{(1)}$ itself in any reasonable way that considers all $N$ simultaneously. That is, consider the EDS on $M^{(1)}$ generated by $\mathcal{I}^{(1)}+\left\langle\xi\right\rangle$ for some section $\xi$ of $\Xi^{o}\subset V^{*}\subset\mathbb{P}TM^{(1)}\otimes\mathbb{C}$. Under what circumstances is this involutive? Can Gabber’s Theorem 13.2 be adapted to this case? This has theoretical implications for special integrability conditions (above), because it would allow one to count special solutions among all solutions from $M^{(1)}$ directly. Additionally, given its algebraic nature, can Gabber’s theorem provide solutions for certain types of PDEs with low regularity, bypassing the Lewy examples with various additional conditions?

(vii)

Prolongation theorems. Does prolongation always uncover solutions of an exterior differential system, if we remove the regularity assumptions on $M^{(1)}$ and consider the many components of the scheme $\operatorname{Var}_{n}(\operatorname{Var}_{n}(\cdots(\mathcal{I})\cdots))$? As experts are well aware, this has been the key open question in the subject for most of a century. (See [BCG+90, Chapter VI].) In the context of this monograph, the question is related to whether the block form of involutive tableau (1.20) and the involutivity conditions of Theorem 5.4 can be extended from $M^{(1)}$ to non-smooth points in $\operatorname{Var}_{n}(\mathcal{I})$. Because of the interaction of Guillemin normal form and involutivity with Spencer cohomology as in Section 9, such an extension of the endovolutive block form could be helpful in an effort to construct (or prove the non-existence of) counterexamples.

(viii)

Representation theory of Lie pseudogroups. Lie pseudogroups are subgroups of the diffeomorphism pseudogroup whose trajectories are the solutions of involutive PDEs. See [Olv09]. Just as Jordan form (in the guise of the Levi decomposition) is the key first step toward understanding the representation of Lie groups, it is reasonable to expect that the endovolutive block-form (1.20) and Theorem 5.4 can serve as the foundation of a representation theory of Lie pseudogroups. Any results regarding the “moduli of involutive tableaux” can be applied to Lie pseudogroups with those tableaux. Indeed, the first application of Theorem 13.1 was the classification of the primitive Lie pseudogroups [GQS66].

Acknowledgments

Thanks to Robert L. Bryant, Niky Kamran, and Deane Yang for always humoring my interest in these details, to Ian Morrison for helping me appreciate the role of incidence correspondences in algebraic geometry, and to Giovanni Moreno for arranging this course and for encouraging me repeatedly to complete these notes. The punny photograph in Figure 5 is public domain from the U.S. Fish and Wildlife Service.

Bibliography


[Art91] M. Artin, Algebra, Pearson, 1991.
[Bry93] R. L. Bryant, An Introduction to Lie Groups and Symplectic Geometry (lecture notes), 1993.
[BCG+90] R. L. Bryant, S.-S. Chern, R. B. Gardner, H. L. Goldschmidt, and P. A. Griffiths, Exterior Differential Systems, 1 ed., 18, Springer-Verlag, 1990. Available at http://library.msri.org/books/Book18/MSRI-v18-Bryant-Chern-et-al.pdf.
[CGG09] J. Carlson, M. Green, and P. Griffiths, Variations of Hodge Structure Considered as an Exterior Differential System: Old and New Results, Symmetry, Integrability and Geometry: Methods and Applications (2009). Available at http://www.emis.de/journals/SIGMA/2009/087/.
[Car11] É. Cartan, Sur les systèmes en involution d’équations aux dérivées partielles du second ordre à une fonction inconnue de trois variables indépendantes, Bulletin de la Société Mathématique de France 39 (1911), 352–443. Available at http://www.numdam.org/item?id=BSMF_1911__39__352_1.
[Cle17] J. N. Clelland, From Frenet to Cartan: The Method of Moving Frames, American Mathematical Society, Graduate Texts in Mathematics, 2017.
[Eis05] D. Eisenbud, The geometry of syzygies, Graduate Texts in Mathematics 229, Springer-Verlag, New York, 2005.
[EM02] Y. Eliashberg and N. Mishachev, Introduction to the h-Principle, Graduate Studies in Mathematics 48, American Mathematical Society, Providence, Rhode Island, June 2002. Available at http://www.ams.org/gsm/048.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Involutive Tableaux, Characteristic Varieties, and Rank-one Varieties in the Geometric Study of PDEs

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

