Variational nonlinear WKB in the Eulerian frame

J. W. Burby; D. E. Ruiz

arXiv:1902.04221·math-ph·June 24, 2020

TL;DR

This paper establishes a variational principle for nonlinear WKB in the Eulerian frame, revealing circulation invariants and applying the framework to model high-frequency acoustic waves interacting with large-scale flows.

Contribution

It demonstrates that nonlinear WKB in the Eulerian frame is variational and identifies its symmetry group, which was previously unknown.

Findings

1. Variational principle for extended Eulerian WKB equations established.

2. Loops of relabeling transformations form a symmetry group.

3. Derived a variational model for high-frequency acoustic waves in flows.

Equations

$$F^{b}\big(\varphi^{a}(x),\partial_{\mu}\varphi^{a}(x),\partial^{2}_{\mu\nu}\varphi^{a}(x),\dots\big)=0,$$

$$\varphi^{a}(x)=\tilde{\varphi}^{a}(x,S(x)),$$

$$\tilde{\varphi}^{a}(x,\theta)$$

$$F^{b}\big(\tilde{\varphi}^{a}(x,\theta),\partial_{\mu}\tilde{\varphi}^{a}(x,\theta)+\partial_{\mu}S(x)\,\partial_{\theta}\tilde{\varphi}^{a}(x,\theta),\dots\big)=0.$$

$$\rho\,(\partial_{t}\bm{u}+\bm{u}\cdot\nabla\bm{u})=-c^{2}\nabla\rho$$

$$\partial_{t}\rho+\nabla\cdot(\rho\bm{u})=0,$$

$$\mathcal{A}_{\rho_{0}}(\bm{g})=\int_{t_{1}}^{t_{2}}\left[\int_{Q_{0}}\frac{1}{2}|\dot{\bm{g}}(\bm{x}_{0})|^{2}\rho_{0}(\bm{x}_{0})\,d\bm{x}_{0}-\int_{Q_{0}}c^{2}\rho_{0}(\bm{x}_{0})\ln\left(\frac{\rho_{0}(\bm{x}_{0})}{\det(\nabla_{0}\bm{g})(\bm{x}_{0})}\right)d\bm{x}_{0}\right]dt$$

$$\bm{u}(\bm{x})$$

$$\rho(\bm{x})$$

$$\tilde{\rho}\,(\partial_{t}\tilde{\bm{u}}+\tilde{\bm{u}}\cdot\nabla\tilde{\bm{u}}+\Omega\,\partial_{\theta}\tilde{\bm{u}})=-c^{2}\nabla\tilde{\rho}-c^{2}\nabla S\,\partial_{\theta}\tilde{\rho}$$

$$\partial_{t}\tilde{\rho}+\nabla\cdot(\tilde{\rho}\tilde{\bm{u}})+\partial_{\theta}(\Omega\tilde{\rho})=0$$

$$\Omega=\partial_{t}S+\tilde{\bm{u}}\cdot\nabla S,$$

$$\bm{g}(\bm{x}_{0},t)$$

$$\bm{\xi}(\overline{\bm{x}},t)$$

$$\overline{\bm{g}}_{t}\mapsto\overline{\bm{g}}_{t}\circ\overline{\bm{\eta}},$$

$$A_{U}(\varphi)=\int_{U}\mathcal{L}(x,\varphi(x),\partial\varphi(x))\,dx.$$

$$\frac{d}{d\epsilon}\bigg|_{0}A_{U}(\varphi+\epsilon\delta\varphi)=0$$

$$\frac{\partial\mathcal{L}}{\partial\varphi^{a}}(x,\varphi(x),\partial\varphi(x))=\frac{\partial}{\partial x^{\mu}}\left(\frac{\partial\mathcal{L}}{\partial v^{a}_{\mu}}(x,\varphi(x),\partial\varphi(x))\right).$$

$$\frac{\partial\mathcal{L}}{\partial\varphi^{a}}(j(x,\theta))=$$

$$j(x,\theta)=\big(x,\tilde{\varphi}(x,\theta),\partial\tilde{\varphi}(x,\theta)+\partial_{\theta}\tilde{\varphi}(x,\theta)\,\partial S(x)\big)$$

$$\partial_{\mu}\varphi^{a}(x)=\partial_{\mu}\tilde{\varphi}^{a}(x,S(x))+\partial_{\mu}S(x)\,\partial_{\theta}\tilde{\varphi}^{a}(x,S(x)).$$

$$\frac{\partial\mathcal{L}}{\partial\varphi^{a}}(j(x,S(x)))=\left(\frac{\partial}{\partial x^{\mu}}+\partial_{\mu}S(x)\frac{\partial}{\partial\theta}\right)\left(\frac{\partial\mathcal{L}}{\partial v^{a}_{\mu}}(j(x,\theta))\right)\bigg|_{\theta=S(x)},$$

$$A_{U}(\varphi)=\int_{U}\mathcal{L}(j(x,S(x)))\,dx.$$

$$A_{U_{i}}\approx\frac{1}{2\pi}\int_{0}^{2\pi}\int_{U_{i}}\mathcal{L}(j(x_{i},\theta))\,dx\,d\theta.$$

$$A_{U}(\varphi)$$

$$\equiv\tilde{A}_{U\times S^{1}}(\tilde{\varphi},S),$$

$$\frac{d}{d\epsilon}\bigg|_{0}\tilde{A}_{U\times S^{1}}(\tilde{\varphi}+\epsilon\delta\tilde{\varphi},S+\epsilon\delta S)\approx 0,$$

$$\tilde{\mathcal{L}}(X,\Phi,V)=\frac{1}{2\pi}\mathcal{L}(x,\tilde{\varphi},\tilde{v}+\zeta\kappa),$$

$$V=\begin{pmatrix}\tilde{v}&\zeta\\ \kappa&\alpha\end{pmatrix},\quad\tilde{v}\in M_{f\times m}(\mathbb{R}),\quad\zeta\in M_{f\times 1}(\mathbb{R}),\quad\kappa\in M_{1\times m}(\mathbb{R}),\quad\alpha\in\mathbb{R}.$$

$$J(X)=(X,\Phi(X),\partial\Phi(X))\in\tilde{M}\times\tilde{F}\times\tilde{D}.$$

$$\int_{U\times S^{1}}\left[\frac{\partial\tilde{\mathcal{L}}}{\partial\Phi^{\tilde{a}}}(J(X))-\frac{\partial}{\partial X^{\tilde{\mu}}}\left(\frac{\partial\tilde{\mathcal{L}}}{\partial V^{\tilde{a}}_{\tilde{\mu}}}(J(X))\right)\right]\delta\Phi^{\tilde{a}}\,dX=0,$$

$$\frac{\partial\tilde{\mathcal{L}}}{\partial\tilde{\varphi}^{a}}(J(X))=\frac{\partial}{\partial x^{\mu}}\left(\frac{\partial\tilde{\mathcal{L}}}{\partial\tilde{v}^{a}_{\mu}}(J(X))\right)+\frac{\partial}{\partial\theta}\left(\frac{\partial\tilde{\mathcal{L}}}{\partial\zeta^{a}}(J(X))\right).$$

Full text

J. W. Burby

Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA

Mathematical Sciences Research Institute, 17 Gauss Way, Berkeley, California 94720, USA

D. E. Ruiz

Sandia National Laboratories, P.O. Box 5800, Albuquerque, New Mexico 87185, USA

Abstract

Nonlinear WKB is a multiscale technique for studying locally-plane-wave solutions of nonlinear partial differential equations (PDE). Its application comprises two steps: (1) replacement of the original PDE with an extended system separating the large scales from the small, and (2) reduction of the extended system to its slow manifold. In the context of variational fluid theories with particle relabeling symmetry, nonlinear WKB in the mean Eulerian frame is known to possess a variational structure. This much has been demonstrated using, for instance, the theoretical apparatus known as the generalized Lagrangian mean. On the other hand, the variational structure of nonlinear WKB in the conventional Eulerian frame remains mysterious. By exhibiting a variational principle for the extended equations from step (1) above, we demonstrate that nonlinear WKB in the Eulerian frame is in fact variational. Remarkably, the variational principle for the extended system admits loops of relabeling transformations as a symmetry group. Noether’s theorem therefore implies that the extended Eulerian equations possess a family of circulation invariants parameterized by $S^{1}$ . As an illustrative example, we use our results to systematically deduce a variational model of high-frequency acoustic waves interacting with a larger-scale compressible isothermal flow.

I Introduction

Nonlinear WKB is a powerful tool for studying solutions of partial differential equations (PDE) whose local behavior about any point is well approximated by a plane wave. The method, which is a generalization of the usual WKB method for linear PDE, goes back at least to the mid-1960s, when it was used to study large-amplitude locally-plane-wave solutions of a variety of systems, including the Boussinesq equations [Whitham (1965a)] and the Korteweg-de Vries equation [Miura and Kruskal (1974)]. Generally speaking, given a (possibly nonlinear) PDE of the form

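$$F^{b}\big(\varphi^{a}(x),\partial_{\mu}\varphi^{a}(x),\partial^{2}_{\mu\nu}\varphi^{a}(x),\dots\big)=0, \tag{1}$$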

for the unknown multi-component field $\varphi^{a}$ , application of nonlinear WKB comprises two steps. First Eq. (1) is extended to a larger system of PDE using a procedure that we will refer to as “nonlinear WKB extension.” Next, scale separation present in the original system of PDE, either in $F^{b}$ or the initial conditions, is leveraged to identify slow solutions of the extended system. The power of this procedure comes from the fact that rapidly oscillating locally-plane-wave solutions $\varphi^{a}$ of Eq. (1) correspond to slowly-varying solutions of the extended system, which are easier to treat using asymptotic methods.

The nonlinear WKB extension procedure amounts to the following. First one introduces the nonlinear WKB ansatz

$$\varphi^{a}(x)=\tilde{\varphi}^{a}(x,S(x)), \tag{2}$$

where $\tilde{\varphi}^{a}(x,\theta)$ is $2\pi$ -periodic in the second argument, and $S(x)$ is referred to as a phase function. More explicitly, since $\tilde{\varphi}^{a}$ is periodic in the second argument, it can be written as a sum of Fourier harmonics in $S(x)$ ; that is,

$$\varphi^{a}(x)=\sum_{n\in\mathbb{Z}}\tilde{\varphi}^{a}_{n}(x)\,e^{inS(x)}. \tag{3}$$

Thus, the nonlinear WKB ansatz differs from the conventional WKB ansatz in that it contains all harmonics in $S$. The term “nonlinear” is appropriate here because the ansatz (2) can handle nonlinear terms appearing in the PDE (1) that produce harmonic coupling. The ansatz (2) is then substituted into Eq. (1) and the chain rule is applied to express $x$-derivatives of $\varphi$ in terms of $x$- and $\theta$-derivatives of $\tilde{\varphi}$ and $x$-derivatives of $S$. Finally, the argument $S(x)$ in $\tilde{\varphi}^{a}$ and each of its derivatives is replaced with an arbitrary angle $\theta$ in order to obtain the extended system

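$$F^{b}\big(\tilde{\varphi}^{a}(x,\theta),\partial_{\mu}\tilde{\varphi}^{a}(x,\theta)+\partial_{\mu}S(x)\,\partial_{\theta}\tilde{\varphi}^{a}(x,\theta),\dots\big)=0. \tag{4}$$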

The dependent variables are now $\tilde{\varphi}^{a}(x,\theta)$ and $S(x)$ , while the independent variables are $x$ and $\theta$ . As is readily checked, each solution $(\tilde{\varphi}^{a},S)$ of Eq. (4) yields a solution $\varphi^{a}$ of Eq. (1), with $\varphi^{a}$ given by Eq. (2). It is in this sense that Eq. (4) extends the original equation (1).
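As an illustration that is not taken from the paper, the extension procedure can be carried out by hand for the inviscid Burgers equation, chosen here only because it is first order and quadratically nonlinear. The sketch below applies the three steps described above (ansatz, chain rule, replacement of $S$ by a free angle $\theta$):

```latex
% Illustrative example (an assumption, not from the paper): inviscid Burgers equation
\begin{align*}
  &\text{original PDE:}
    && \partial_{t}\varphi + \varphi\,\partial_{x}\varphi = 0, \\
  &\text{ansatz:}
    && \varphi(x,t) = \tilde{\varphi}(x,t,S(x,t)), \\
  &\text{chain rule:}
    && \partial_{t}\varphi = \big(\partial_{t}\tilde{\varphi}
       + \partial_{t}S\,\partial_{\theta}\tilde{\varphi}\big)\big|_{\theta=S},
       \quad
       \partial_{x}\varphi = \big(\partial_{x}\tilde{\varphi}
       + \partial_{x}S\,\partial_{\theta}\tilde{\varphi}\big)\big|_{\theta=S}, \\
  &\text{extended system:}
    && \partial_{t}\tilde{\varphi} + \partial_{t}S\,\partial_{\theta}\tilde{\varphi}
       + \tilde{\varphi}\,\big(\partial_{x}\tilde{\varphi}
       + \partial_{x}S\,\partial_{\theta}\tilde{\varphi}\big) = 0 .
\end{align*}
```

Evaluating the last line at $\theta=S(x,t)$ returns the original equation for $\varphi$, which is the sense in which the extended system extends the original one.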

In this paper we will study the nonlinear WKB extension procedure, i.e. the passage from Eq. (1) to Eq. (4), as an interesting mathematical construction in its own right, independent of any asymptotic methods. Specifically, we will be concerned with nonlinear WKB extension as it applies to a particular class of PDE from fluid mechanics known as Euler-Poincaré equations [Holm, Marsden, and Ratiu (1998)]. Such equations describe the evolution of ideal, i.e. dissipation-free, fluids. In the Euler-Poincaré setting, we will address the question of whether structural properties of the original system of PDE (1) are inherited by the extended equations (4). We will be particularly interested in the fate of variational structure and particle relabeling symmetry, the latter being the source of circulation invariants in ideal fluid models.

The methods of Whitham [Whitham (1965b)] are sufficient to study the fate of variational structure under nonlinear WKB extension when the system (1) is equivalent to the Euler-Lagrange equations associated with a classical field theory. As we will review, Whitham’s method of averaged Lagrangians provides a variational principle for the extended system in this case. However, conventional Euler-Poincaré variational principles for ideal fluid flow do not fit into the mold of variational principles used in classical field theory. Therefore Whitham’s methods cannot be applied directly to show that the system (4) is variational when Eq. (1) is an Euler-Poincaré fluid equation.

The essential difficulty can be understood through a close look at the ideal isothermal Euler equations

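$$\rho\,(\partial_{t}\bm{u}+\bm{u}\cdot\nabla\bm{u})=-c^{2}\nabla\rho \tag{5}$$

$$\partial_{t}\rho+\nabla\cdot(\rho\bm{u})=0, \tag{6}$$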

where the unknown fields are the fluid velocity $\bm{u}(\bm{x},t)$ and the mass density $\rho(\bm{x},t)$, and $c$ is a constant representing the speed of small-amplitude sound waves. This system of equations, which has the form (1), arises from an Euler-Poincaré variational principle in the following sense [Holm, Marsden, and Ratiu (1998)]. Let $Q$ be a compact region in $\mathbb{R}^{3}$ that represents the fluid container, and let $Q_{0}$ be a diffeomorphic copy of $Q$ equipped with a non-vanishing function $\rho_{0}:Q_{0}\rightarrow\mathbb{R}$ that represents a reference configuration of fluid elements. A path $t\mapsto\bm{g}(t)\in\text{Diff}(Q_{0},Q)$ in the space of diffeomorphisms $Q_{0}\rightarrow Q$ is a critical point of the functional

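$$\mathcal{A}_{\rho_{0}}(\bm{g})=\int_{t_{1}}^{t_{2}}\left[\int_{Q_{0}}\frac{1}{2}|\dot{\bm{g}}(\bm{x}_{0})|^{2}\rho_{0}(\bm{x}_{0})\,d\bm{x}_{0}-\int_{Q_{0}}c^{2}\rho_{0}(\bm{x}_{0})\ln\left(\frac{\rho_{0}(\bm{x}_{0})}{\det(\nabla_{0}\bm{g})(\bm{x}_{0})}\right)d\bm{x}_{0}\right]dt \tag{7}$$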

if and only if Eq. (5) is satisfied with $\bm{u}$ and $\rho$ defined according to

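$$\bm{u}(\bm{x})=\dot{\bm{g}}(\bm{g}^{-1}(\bm{x})), \tag{8}$$

$$\rho(\bm{x})=\frac{\rho_{0}(\bm{g}^{-1}(\bm{x}))}{\det(\nabla_{0}\bm{g})(\bm{g}^{-1}(\bm{x}))}, \tag{9}$$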

where we have suppressed the time argument $t$ for the sake of presentation. In particular, it is a consequence of these definitions that $\rho$ as defined in Eq. (9) satisfies the continuity equation (6). Thus, each critical point of $\mathcal{A}_{\rho_{0}}$ corresponds to a solution of the ideal isothermal Euler equations. Conversely, given a solution of the ideal isothermal Euler equations, there is some $\rho_{0}$ such that the time-dependent flow map of $\bm{u}$ is a critical point of $\mathcal{A}_{\rho_{0}}$ . It is therefore appropriate to say that the system (5)-(6) is variational. However, the field that appears in the variational principle is $\bm{g}(\bm{x}_{0},t)$ instead of $\bm{u}(\bm{x},t)$ or $\rho(\bm{x},t)$ , as one might expect from experience with classical field theory. In fact, $\bm{g}$ is not even defined on the same domain as $\bm{u}$ and $\rho$ . It is therefore not at all obvious how, or if, Whitham’s averaged Lagrangian technique can be applied to yield a variational principle for the nonlinear WKB extension of Eqs. (5)-(6),

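$$\tilde{\rho}\,(\partial_{t}\tilde{\bm{u}}+\tilde{\bm{u}}\cdot\nabla\tilde{\bm{u}}+\Omega\,\partial_{\theta}\tilde{\bm{u}})=-c^{2}\nabla\tilde{\rho}-c^{2}\nabla S\,\partial_{\theta}\tilde{\rho} \tag{10}$$

$$\partial_{t}\tilde{\rho}+\nabla\cdot(\tilde{\rho}\tilde{\bm{u}})+\partial_{\theta}(\Omega\tilde{\rho})=0 \tag{11}$$

$$\Omega=\partial_{t}S+\tilde{\bm{u}}\cdot\nabla S, \tag{12}$$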

where $\tilde{\rho}=\tilde{\rho}(\bm{x},t,\theta)$ and $\tilde{\bm{u}}(\bm{x},t,\theta)$ comprise the multi-component field $\tilde{\varphi}^{a}$ in Eq. (4) and $S=S(\bm{x},t)$ is the phase function. For instance, one question that arises when attempting to apply Whitham’s averaging to the action functional $\mathcal{A}_{\rho_{0}}$ is “what is the appropriate nonlinear WKB ansatz for the mapping $\bm{g}$ ?” The naive guess $\bm{g}(\bm{x}_{0},t)=\tilde{\bm{g}}(\bm{x}_{0},t,S(\bm{x}_{0},t))$ does not make sense because the proper spatial domain of the phase function $S$ is $Q$ — not $Q_{0}$ . (In WKB theory, phases are assigned to spatial locations, not fluid element labels.)
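The claim that the extended isothermal Euler system, Eqs. (10)-(12), reduces to Eqs. (5)-(6) can be checked with the chain rule. The following is a sketch of that computation, assuming $\bm{u}(\bm{x},t)=\tilde{\bm{u}}(\bm{x},t,S(\bm{x},t))$ and $\rho(\bm{x},t)=\tilde{\rho}(\bm{x},t,S(\bm{x},t))$, with $\Omega=\partial_{t}S+\tilde{\bm{u}}\cdot\nabla S$ and every tilde quantity on the right-hand sides evaluated at $\theta=S(\bm{x},t)$:

```latex
\begin{align*}
  \partial_{t}\bm{u} + \bm{u}\cdot\nabla\bm{u}
    &= \partial_{t}\tilde{\bm{u}} + \tilde{\bm{u}}\cdot\nabla\tilde{\bm{u}}
       + \big(\partial_{t}S + \tilde{\bm{u}}\cdot\nabla S\big)\,\partial_{\theta}\tilde{\bm{u}}
     = \partial_{t}\tilde{\bm{u}} + \tilde{\bm{u}}\cdot\nabla\tilde{\bm{u}}
       + \Omega\,\partial_{\theta}\tilde{\bm{u}}, \\
  \nabla\rho
    &= \nabla\tilde{\rho} + \nabla S\,\partial_{\theta}\tilde{\rho}, \\
  \partial_{t}\rho + \nabla\cdot(\rho\bm{u})
    &= \partial_{t}\tilde{\rho} + \nabla\cdot(\tilde{\rho}\tilde{\bm{u}})
       + \partial_{t}S\,\partial_{\theta}\tilde{\rho}
       + \nabla S\cdot\partial_{\theta}(\tilde{\rho}\tilde{\bm{u}}) \\
    &= \partial_{t}\tilde{\rho} + \nabla\cdot(\tilde{\rho}\tilde{\bm{u}})
       + \partial_{\theta}(\Omega\tilde{\rho}).
\end{align*}
```

The last equality uses only the fact that $S$ does not depend on $\theta$. Multiplying the first line by $\rho$ and adding $c^{2}$ times the second shows that a solution of the extended system, restricted to $\theta=S$, satisfies the original momentum equation; the third and fourth lines do the same for the continuity equation.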

We are by no means the first to consider the interplay between WKB theory and Euler-Poincaré variational principles. Before the terminology “Euler-Poincaré variational principle” was even invented, Dewar [Dewar (1970)] (and independently Bretherton [Bretherton (1971)]) proposed the ansatz

[TABLE]

for the fluid configuration map that appears in the Euler-Poincaré variational principle for magnetohydrodynamic (MHD) flow [Newcomb (1962)]. The intuition leading to (13) is that $\overline{\bm{g}}$ represents the “mean” configuration of Lagrangian fluid elements. Under the assumptions that $\bm{\xi}$ is small and $S$ varies rapidly, this idea leads to a variational model of small-amplitude locally-plane waves interacting with a slowly-varying MHD background. In the context of purely hydrodynamic flow, Gjaja and Holm [Gjaja and Holm (1996)] explored this idea further, and uncovered the consequences of the mean relabeling symmetry present in averaged Lagrangians built upon Eq. (13). Here mean fluid particle relabeling symmetry refers to invariance of the averaged Lagrangian under the replacement

$$\overline{\bm{g}}_{t}\mapsto\overline{\bm{g}}_{t}\circ\overline{\bm{\eta}},$$

where $\overline{\bm{\eta}}:Q_{0}\rightarrow Q_{0}$ is any diffeomorphism that preserves the density $\rho_{0}\,d\bm{x}_{0}$. Perhaps surprisingly, none of this previous work manages to provide a variational principle for the usual WKB extension of the fluid equations, e.g. Eqs. (10)-(11). Instead, the ansatz (13) leads to an alternative, ostensibly inequivalent extension of the fluid equations [Gjaja and Holm (1996)], and Whitham averaging produces a variational structure for this alternative system. It is reasonable to refer to this alternative extension as an extension in the mean Eulerian frame because the quantity $\overline{\bm{x}}=\overline{\bm{g}}(\bm{x}_{0},t)$ gives the phase average of the Eulerian fluid element location. Thus, nonlinear WKB extension in the mean Eulerian frame is known to be variational. However, the variational structure of nonlinear WKB extension in the conventional Eulerian frame has never been found.

In what follows, we will prove that nonlinear WKB extension in the Eulerian frame is in fact variational. The proof will make use of Whitham averaging, but will not make use of the ansatz (13). In fact, even the more general notion of separating quantities into mean and fluctuating parts will not play a role in the argument. Exploiting this fact, we will also prove that the extended equations admit as a symmetry group the space of loops of particle relabeling transformations. Remarkably, this loop group [Pressley and Segal (1988)] is much larger than the group of mean relabeling transformations present in the work of Gjaja and Holm [Gjaja and Holm (1996)]. The presence of this loop group symmetry will allow us to prove that the nonlinear WKB extension of Euler-Poincaré fluid equations in the Eulerian frame admits a family of circulation invariants parameterized by $S^{1}$. This result extends the circulation theorem of Gjaja and Holm, which may be seen as a consequence of invariance under the subgroup of constant loops. Finally, in order to demonstrate the utility of our results, we will apply them to derive a systematic, all-orders variational model of weakly-nonlinear high-frequency acoustic waves interacting with a longer-scale isothermal compressible flow. This example may be regarded as a fresh take on the analysis of Bretherton [Bretherton (1971)].

Our discussion will be organized in the following manner. In Section II we will prove that applying Whitham averaging to the Lagrangian of a classical field theory is equivalent to applying the usual nonlinear WKB extension procedure directly to the Euler-Lagrange equations. In particular, we will show that Whitham’s averaged Lagrangian is the Lagrangian for the nonlinear WKB extension of a classical field theory. In Section III we will show how fluid equations arising from Euler-Poincaré variational principles with local Lagrangians may be recast as classical field theories. In Section IV we will then combine the results of Sections II and III to produce the variational structure underlying the nonlinear WKB extension (in the Eulerian frame) of Euler-Poincaré fluid equations. We will investigate the relabeling symmetries of this new variational principle in Section V. In particular, we will prove that the symmetry group of the WKB extension includes the space of loops of particle relabeling transformations, and identify the corresponding momentum map using Noether’s theorem. Finally, we will apply our results to high-frequency acoustic waves interacting with longer-scale compressible isothermal flow in Section VI. After presenting our results, we will discuss in Section VII the relationship of our work with existing literature, in particular with the work of Gjaja and Holm [Gjaja and Holm (1996)] and the theory of generalized Lagrangian means.

As a word of caution, unless indicated otherwise, we will assume in this paper that all mappings are $C^{\infty}$. We make this assumption in spite of the fact that some of the PDEs we will encounter may not have a good existence and uniqueness theory in the smooth setting. In addition, the discussion contained in Section VI will proceed at the level of formal asymptotics. Regularity assumptions mentioned in Section VI are merely included to ensure that coefficients in various asymptotic expansions may be computed.

II A basic theorem on Whitham averaging

In this section we provide an anachronistic review of the nonlinear WKB extension procedure as it applies to general first-order classical field theories. Our goal is to prove that the nonlinear WKB extension of a field theory satisfies a variational principle. We will build upon this result in subsequent sections when uncovering the variational structure of nonlinear WKB extensions of Euler-Poincaré fluid equations. Essentially all of the ideas in this section can be found in the work of Whitham [Whitham (1965b)].

For the purposes of our discussion, a first-order classical field theory will be defined as follows.

Definition 1.

A first-order classical field theory is a triple $(M,\mathcal{C},\mathcal{L})$ comprising a manifold $M$ , a space of functions $\mathcal{C}$ , and a function $\mathcal{L}$ with the following properties.

•

The spacetime $M$ is an $m$ -dimensional space presented as the product of a vector space with a torus of some dimension. The natural coordinates on $M$ are denoted $x^{\mu}$ , $\mu\in\{1,\dots,m\}$ .

•

The space of fields $\mathcal{C}$ is a vector space of functions $\varphi:M\rightarrow F$ , where the fiber $F$ is an $f$ -dimensional space of the same type as $M$ , but with a possibly different dimension. The natural coordinates on $F$ are denoted $\varphi^{a}$ , $a\in\{1,\dots,f\}$ .

•

The Lagrangian density $\mathcal{L}$ is a real-valued function on $M\times F\times D$ , where $D$ is the space of $f\times m$ matrices with components $v^{a}_{\mu}$ , $a\in\{1,\dots,f\}$ , $\mu\in\{1,\dots,m\}$ .

To each first-order classical field theory and compact subset $U\subset M$ , we associate the local action functional

$$A_{U}(\varphi)=\int_{U}\mathcal{L}(x,\varphi(x),\partial\varphi(x))\,dx. \tag{16}$$

Here $\partial\varphi(x)\in D$ has entries $[\partial\varphi(x)]^{a}_{\mu}=\partial_{\mu}\varphi^{a}(x)$ . We say that a field $\varphi$ is a critical point of $A_{U}$ if

$$\frac{d}{d\epsilon}\bigg|_{0}A_{U}(\varphi+\epsilon\delta\varphi)=0 \tag{17}$$

for all $\delta\varphi\in\mathcal{C}$ that vanish on $\partial U$ .

Suppose that $\mathcal{C}$ contains all smooth fields with compact support. Then it is a standard result in the calculus of variations that $\varphi$ is a critical point of $A_{U}$ for all $U\subset M$ if and only if $\varphi$ satisfies the system of second-order PDE known as the Euler-Lagrange equations:

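$$\frac{\partial\mathcal{L}}{\partial\varphi^{a}}(x,\varphi(x),\partial\varphi(x))=\frac{\partial}{\partial x^{\mu}}\left(\frac{\partial\mathcal{L}}{\partial v^{a}_{\mu}}(x,\varphi(x),\partial\varphi(x))\right). \tag{18}$$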

In this setting, we refer to the first-order classical field theory $(M,\mathcal{C},\mathcal{L})$ as ordinary.

Definition 2.

An ordinary first-order classical field theory is a first-order classical field theory $(M,\mathcal{C},\mathcal{L})$ where the space of fields $\mathcal{C}$ contains all smooth fields $\varphi:M\rightarrow F$ with compact support.

Given an ordinary first-order classical field theory $(M,\mathcal{C},\mathcal{L})$ , the nonlinear WKB extension procedure described in the introduction may be applied to the theory’s Euler-Lagrange equations. It will be convenient to refer to the resulting extended system as the nonlinear WKB extension of $(M,\mathcal{C},\mathcal{L})$ .

Definition 3.

The nonlinear WKB (NL-WKB) extension of the ordinary first-order classical field theory $(M,\mathcal{C},\mathcal{L})$ is the nonlinear WKB extension of the field theory’s Euler-Lagrange equations (18). That is, the NL-WKB extension of $(M,\mathcal{C},\mathcal{L})$ is the system of partial differential equations

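$$\frac{\partial\mathcal{L}}{\partial\varphi^{a}}(j(x,\theta))=\left(\frac{\partial}{\partial x^{\mu}}+\partial_{\mu}S(x)\frac{\partial}{\partial\theta}\right)\left(\frac{\partial\mathcal{L}}{\partial v^{a}_{\mu}}(j(x,\theta))\right), \tag{19}$$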

where

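$$j(x,\theta)=\big(x,\tilde{\varphi}(x,\theta),\partial\tilde{\varphi}(x,\theta)+\partial_{\theta}\tilde{\varphi}(x,\theta)\,\partial S(x)\big) \tag{20}$$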

is convenient shorthand notation, and $\tilde{\varphi}:M\times S^{1}\rightarrow F$ and $S:M\rightarrow S^{1}$ are the unknown fields in the extended system. We regard $\partial_{\theta}\tilde{\varphi}(x,\theta)$ and $\partial S(x)$ as $f\times 1$ and $1\times m$ matrices, respectively.
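To make Definition 3 concrete, here is a worked sketch for a nonlinear Klein-Gordon-type field. The example is not from the paper: the spacetime coordinates $(t,x)$, the single field component ($f=1$), and the potential $W$ are assumptions chosen for illustration.

```latex
% Illustrative example (assumption): L = v_t^2/2 - v_x^2/2 - W(phi), with M = R^2, F = R
\begin{align*}
  &\text{Lagrangian density:}
    && \mathcal{L}(x,\varphi,v) = \tfrac{1}{2}v_{t}^{2} - \tfrac{1}{2}v_{x}^{2} - W(\varphi), \\
  &\text{Euler-Lagrange equation (18):}
    && -W'(\varphi) = \partial_{t}\big(\partial_{t}\varphi\big)
       - \partial_{x}\big(\partial_{x}\varphi\big), \\
  &\text{NL-WKB extension (19):}
    && -W'(\tilde{\varphi})
       = \big(\partial_{t} + \partial_{t}S\,\partial_{\theta}\big)
         \big(\partial_{t}\tilde{\varphi} + \partial_{t}S\,\partial_{\theta}\tilde{\varphi}\big)
       - \big(\partial_{x} + \partial_{x}S\,\partial_{\theta}\big)
         \big(\partial_{x}\tilde{\varphi} + \partial_{x}S\,\partial_{\theta}\tilde{\varphi}\big).
\end{align*}
```

In this example $\partial\mathcal{L}/\partial v_{t}=v_{t}$ and $\partial\mathcal{L}/\partial v_{x}=-v_{x}$, and the third argument of $j(x,\theta)$ supplies the combinations $\partial_{\mu}\tilde{\varphi}+\partial_{\mu}S\,\partial_{\theta}\tilde{\varphi}$ that appear inside the outer derivatives.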

Our goal is to describe elements of the relationship between the ordinary field theory $(M,\mathcal{C},\mathcal{L})$ and its NL-WKB extension. Due to the following Lemma, we expect this relationship to be strong.

Lemma 1.

If $(\tilde{\varphi},S)$ is a solution of the NL-WKB extension of $(M,\mathcal{C},\mathcal{L})$ , then $\varphi(x)=\tilde{\varphi}(x,S(x))$ is a solution of the Euler-Lagrange equations associated with $\mathcal{L}$ . Conversely, if $\varphi$ is a solution of $\mathcal{L}$ ’s Euler-Lagrange equations, then $\tilde{\varphi}(x,\theta)=\varphi(x)$ , $S(x)=0$ is a solution of the NL-WKB extension of $(M,\mathcal{C},\mathcal{L})$ .

Proof.

That $\varphi(x)=\tilde{\varphi}(x,S(x))$ satisfies the Euler-Lagrange equations associated with $\mathcal{L}$ is a straightforward application of the chain rule. The converse statement follows from the fact that Eq. (19) reduces to Eq. (18) when $\tilde{\varphi}(x,\theta)=\varphi(x)$ and $S(x)=0$ . ∎
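The converse direction of Lemma 1 can be spelled out in one line. As a sketch, assuming a profile that does not depend on $\theta$ and a constant phase, every $\theta$-derivative and every derivative of $S$ drops out of the shorthand $j$:

```latex
\begin{align*}
  \tilde{\varphi}(x,\theta) = \varphi(x),\quad S(x) = 0
  \quad&\Longrightarrow\quad
  j(x,\theta) = \big(x,\varphi(x),\partial\varphi(x)\big), \\
  \left(\frac{\partial}{\partial x^{\mu}} + \partial_{\mu}S(x)\frac{\partial}{\partial\theta}\right)
  \left(\frac{\partial\mathcal{L}}{\partial v^{a}_{\mu}}(j(x,\theta))\right)
  &= \frac{\partial}{\partial x^{\mu}}
  \left(\frac{\partial\mathcal{L}}{\partial v^{a}_{\mu}}(x,\varphi(x),\partial\varphi(x))\right).
\end{align*}
```

The extended equation therefore collapses to the Euler-Lagrange equation for $\varphi$ itself.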

In order to go beyond Lemma 1 in our description of the relationship between an ordinary classical field theory and its nonlinear WKB extension, it is useful to understand the heuristic origins of the nonlinear WKB extension procedure. The key idea is scale separation. Suppose $\varphi$ is a solution of the Euler-Lagrange equations associated with $(M,\mathcal{C},\mathcal{L})$ that locally has the appearance of a plane wave. Formally, we may then write $\varphi(x)=\tilde{\varphi}(x,S(x))$ , where $\tilde{\varphi}:M\times S^{1}\rightarrow F$ is a profile and $S$ is a rapidly oscillating phase function. The derivatives of $\varphi$ are apparently given by

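$$\partial_{\mu}\varphi^{a}(x)=\partial_{\mu}\tilde{\varphi}^{a}(x,S(x))+\partial_{\mu}S(x)\,\partial_{\theta}\tilde{\varphi}^{a}(x,S(x)). \tag{21}$$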

In light of the Euler-Lagrange equations (18), the profile and phase function must therefore satisfy

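$$\frac{\partial\mathcal{L}}{\partial\varphi^{a}}(j(x,S(x)))=\left(\frac{\partial}{\partial x^{\mu}}+\partial_{\mu}S(x)\frac{\partial}{\partial\theta}\right)\left(\frac{\partial\mathcal{L}}{\partial v^{a}_{\mu}}(j(x,\theta))\right)\bigg|_{\theta=S(x)}, \tag{22}$$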

where we have used the shorthand notation $j(x,\theta)$ introduced in Definition 3. Because the phase function is, by hypothesis, rapidly rotating, we can extract more information from Eq. (22) by considering the latter in a spacetime region that is small compared with the long spacetime scale, but large compared with the short spacetime scale. In such a region, we may regard the argument $x$ in $j(x,S(x))$ as being fixed, while the argument $S(x)$ retains its rapidly oscillating character. If we make the (very weak) assumption that $S(x)$ makes at least one complete rotation in our intermediate-scale region, we may therefore conclude that the following strengthened version of Eq. (22) must be satisfied:

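$$\frac{\partial\mathcal{L}}{\partial\varphi^{a}}(j(x,\theta))=\left(\frac{\partial}{\partial x^{\mu}}+\partial_{\mu}S(x)\frac{\partial}{\partial\theta}\right)\left(\frac{\partial\mathcal{L}}{\partial v^{a}_{\mu}}(j(x,\theta))\right), \tag{23}$$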

where $\theta\in S^{1}$ is now arbitrary. Equation (23) reproduces the NL-WKB extension of $(M,\mathcal{C},\mathcal{L})$ . Thus, the NL-WKB extension of $(M,\mathcal{C},\mathcal{L})$ can be deduced by applying heuristic arguments based on scale separation to the Euler-Lagrange equations associated with $\mathcal{L}$ .
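The chain-rule formula for the derivatives of $\varphi(x)=\tilde{\varphi}(x,S(x))$, on which the heuristic argument above rests, can be checked numerically. The sketch below is only an illustration in one spacetime dimension: the profile `profile`, the phase `phase`, and the finite-difference step are hypothetical choices, not anything specified in the paper.

```python
import math


def profile(x, theta):
    # Hypothetical profile, 2*pi-periodic in its second argument.
    return (1.0 + x * x) * math.cos(theta)


def phase(x):
    # Hypothetical rapidly varying phase function S(x).
    return 40.0 * x


def phi(x):
    # Nonlinear WKB ansatz: phi(x) = profile(x, S(x)).
    return profile(x, phase(x))


def central_diff(f, x, h=1e-6):
    # Second-order central finite difference.
    return (f(x + h) - f(x - h)) / (2.0 * h)


def chain_rule_derivative(x):
    # d_x profile + dS/dx * d_theta profile, evaluated at theta = S(x).
    theta = phase(x)
    d_profile_dx = central_diff(lambda y: profile(y, theta), x)
    d_profile_dtheta = central_diff(lambda t: profile(x, t), theta)
    return d_profile_dx + central_diff(phase, x) * d_profile_dtheta


def max_mismatch(points):
    # Largest gap between differentiating phi directly and the chain-rule formula.
    return max(abs(central_diff(phi, x) - chain_rule_derivative(x)) for x in points)


if __name__ == "__main__":
    print(max_mismatch([0.1 * k for k in range(11)]))
```

The two ways of computing the derivative should agree up to finite-difference error, which is the one-dimensional content of the chain-rule step.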

Now consider the application of similar heuristic arguments to the variational principle associated with $(M,\mathcal{C},\mathcal{L})$ . Suppose once more that $\varphi$ is a solution of the field theory that is locally a plane wave. Then, as before, we may write $\varphi(x)=\tilde{\varphi}(x,S(x))$ , where $\tilde{\varphi}:M\times S^{1}\rightarrow F$ is a profile and $S$ is a rapidly oscillating phase function. Moreover, the action $A_{U}$ evaluated on this special $\varphi$ can be written

$$A_{U}(\varphi)=\int_{U}\mathcal{L}(j(x,S(x)))\,dx.$$

Because the phase function $S$ is rapidly oscillating by hypothesis, we may partition the integration domain $U=\cup_{i}U_{i}$ into cells with diameters that are large compared with the short scale and small compared with the long scale, and then write $A_{U}(\varphi)=\sum_{i}A_{U_{i}}(\varphi)$. In each of the integrals $A_{U_{i}}$ the first argument of $j(x,S(x))$ may be replaced with the center $x_{i}$ of cell $U_{i}$ without appreciably changing the value of the integral. Moreover, because $S(x)$ varies rapidly in $U_{i}$, the dominant contribution to the integral $A_{U_{i}}$ is given by averaging over $S(x)$ according to

$$A_{U_{i}}\approx\frac{1}{2\pi}\int_{0}^{2\pi}\int_{U_{i}}\mathcal{L}(j(x_{i},\theta))\,dx\,d\theta.$$

If we now interpret the previously established formula $A_{U}(\varphi)=\sum_{i}A_{U_{i}}(\varphi)$ as a Riemann sum, we conclude that the action functional evaluated on a locally-plane $\varphi$ is given approximately by

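$$A_{U}(\varphi)\approx\frac{1}{2\pi}\int_{0}^{2\pi}\int_{U}\mathcal{L}(j(x,\theta))\,dx\,d\theta\equiv\tilde{A}_{U\times S^{1}}(\tilde{\varphi},S),$$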

where we have introduced the extended action functional $\tilde{A}_{U\times S^{1}}(\tilde{\varphi},S)$ . Moreover, because $\varphi$ is by assumption a critical point of $A_{U}$ , this argument suggests that

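$$\frac{d}{d\epsilon}\bigg|_{0}\tilde{A}_{U\times S^{1}}(\tilde{\varphi}+\epsilon\delta\tilde{\varphi},S+\epsilon\delta S)\approx 0,$$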

where $\delta\tilde{\varphi}(x,\theta)$ and $\delta S(x)$ are arbitrary functions that vanish when $x\in\partial U$ . That is, the NL-WKB extension of $(M,\mathcal{C},\mathcal{L})$ at least approximately satisfies a variational principle.
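The averaging step in this heuristic argument can be illustrated numerically. The sketch below compares the action evaluated on a rapidly oscillating field with its phase average over $\theta$. Everything specific in it is an assumption made for illustration: the one-dimensional domain $U=[0,1]$, the Lagrangian $\mathcal{L}=\tfrac{1}{2}v^{2}-\tfrac{1}{2}\varphi^{2}$, the profile $(1+x)\cos\theta$, the phase $S(x)=x/\epsilon$, and the value of `EPS`.

```python
import math

EPS = 1e-3  # hypothetical ratio of the short scale to the long scale


def amplitude(x):
    # Hypothetical slowly varying amplitude a(x).
    return 1.0 + x


def d_amplitude(x):
    # Derivative a'(x) of the amplitude above.
    return 1.0


def lagrangian(phi, v):
    # Illustrative Lagrangian density L = v^2/2 - phi^2/2.
    return 0.5 * v * v - 0.5 * phi * phi


def jet(x, theta):
    # (phi, v) for the profile a(x) cos(theta) and phase S(x) = x / EPS:
    # v = d_x profile + (dS/dx) * d_theta profile.
    phi = amplitude(x) * math.cos(theta)
    v = d_amplitude(x) * math.cos(theta) - amplitude(x) * math.sin(theta) / EPS
    return phi, v


def direct_action(n=200_000):
    # Midpoint rule for the integral over [0, 1] of L(j(x, S(x))).
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * h
        total += lagrangian(*jet(x, x / EPS))
    return total * h


def averaged_action(nx=2000, ntheta=32):
    # Midpoint rule for (1 / 2 pi) * double integral of L(j(x, theta)) dx dtheta.
    hx = 1.0 / nx
    htheta = 2.0 * math.pi / ntheta
    total = 0.0
    for i in range(nx):
        x = (i + 0.5) * hx
        for k in range(ntheta):
            total += lagrangian(*jet(x, (k + 0.5) * htheta))
    return total * hx * htheta / (2.0 * math.pi)


def relative_gap():
    # Relative difference between the direct and phase-averaged actions.
    direct = direct_action()
    averaged = averaged_action()
    return abs(direct - averaged) / abs(averaged)


if __name__ == "__main__":
    print(relative_gap())
```

For this particular choice the relative gap should be small and shrink as `EPS` decreases, consistent with the approximate equality between $A_{U}(\varphi)$ and the extended action. It is a sanity check on the heuristic, not a proof.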

Somewhat surprisingly, the result suggested by the previous heuristic argument is correct. The extended action functional provides the NL-WKB extension of an ordinary first-order classical field theory with the following variational formulation.

Definition 4.

Given an ordinary first-order classical field theory $(M,\mathcal{C},\mathcal{L})$, the looping of $(M,\mathcal{C},\mathcal{L})$ is the first-order classical field theory $(\tilde{M},\tilde{\mathcal{C}},\tilde{\mathcal{L}})$ prescribed as follows.

•

The looped spacetime $\tilde{M}=M\times S^{1}$ is the trivial $S^{1}$ bundle over $M$ .

•

The looped space of fields $\tilde{\mathcal{C}}$ comprises maps $\Phi:\tilde{M}\rightarrow\tilde{F}$ of the form $\Phi(x,\theta)=(\tilde{\varphi}(x,\theta),S(x))$ with $\tilde{\varphi}:\tilde{M}\rightarrow F$ , $S:M\rightarrow S^{1}$ , and $\tilde{F}=F\times S^{1}$ .

•

Set $\tilde{D}=M_{(f+1)\times(m+1)}(\mathbb{R})$, the space of real-valued $(f+1)\times(m+1)$ matrices. Let $X=(x,\theta)\in\tilde{M}$ and $\Phi=(\tilde{\varphi},S)\in\tilde{F}$. The looped Lagrangian density $\tilde{\mathcal{L}}:\tilde{M}\times\tilde{F}\times\tilde{D}\rightarrow\mathbb{R}$ is given by

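$$\tilde{\mathcal{L}}(X,\Phi,V)=\frac{1}{2\pi}\mathcal{L}(x,\tilde{\varphi},\tilde{v}+\zeta\kappa), \tag{28}$$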

where the matrix $V\in\tilde{D}$ has the block structure

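$$V=\begin{pmatrix}\tilde{v}&\zeta\\ \kappa&\alpha\end{pmatrix},\quad\tilde{v}\in M_{f\times m}(\mathbb{R}),\quad\zeta\in M_{f\times 1}(\mathbb{R}),\quad\kappa\in M_{1\times m}(\mathbb{R}),\quad\alpha\in\mathbb{R}.$$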

Remark 1.

The last component of $\Phi=(\tilde{\varphi},S)\in\tilde{\mathcal{C}}$ , being a function of $x$ alone, cannot be localized near an arbitrary angle $\theta$ . Thus, $\tilde{\mathcal{C}}$ does not contain all functions with compact support, meaning $(\tilde{M},\tilde{\mathcal{C}},\tilde{\mathcal{L}})$ is not ordinary. The Euler-Lagrange equations therefore do not take the standard form (18). The appropriate modification of the Euler-Lagrange equations will be found in the process of proving the next theorem.
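As a sketch of what Definition 4 produces in a specific case, consider again a nonlinear Klein-Gordon-type Lagrangian. The example is not from the paper: the coordinates $(t,x)$, the single field component, and the potential $W$ are assumptions. When the looped Lagrangian is evaluated on fields, the blocks of $V=\partial\Phi$ are $\tilde{v}=\partial\tilde{\varphi}$, $\zeta=\partial_{\theta}\tilde{\varphi}$, $\kappa=\partial S$, and $\alpha=\partial_{\theta}S=0$.

```latex
% Illustrative example (assumption): L = v_t^2/2 - v_x^2/2 - W(phi)
\begin{align*}
  \tilde{\mathcal{L}}(X,\Phi,V)
    &= \frac{1}{2\pi}\left[\tfrac{1}{2}\big(\tilde{v}_{t} + \zeta\kappa_{t}\big)^{2}
       - \tfrac{1}{2}\big(\tilde{v}_{x} + \zeta\kappa_{x}\big)^{2}
       - W(\tilde{\varphi})\right], \\
  \tilde{A}_{U\times S^{1}}(\tilde{\varphi},S)
    &= \frac{1}{2\pi}\int_{0}^{2\pi}\!\!\int_{U}
       \left[\tfrac{1}{2}\big(\partial_{t}\tilde{\varphi}
         + \partial_{t}S\,\partial_{\theta}\tilde{\varphi}\big)^{2}
       - \tfrac{1}{2}\big(\partial_{x}\tilde{\varphi}
         + \partial_{x}S\,\partial_{\theta}\tilde{\varphi}\big)^{2}
       - W(\tilde{\varphi})\right] dx\,dt\,d\theta .
\end{align*}
```

Note that $\tilde{\mathcal{L}}$ depends on neither $S$ itself nor $\alpha$, only on $\kappa$, which is why the phase enters the extended action exclusively through its spacetime derivatives.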

Theorem 1 (Whitham averaging).

Let $(\tilde{M},\tilde{\mathcal{C}},\tilde{\mathcal{L}})$ be the looping of the ordinary first-order classical field theory $(M,\mathcal{C},\mathcal{L})$ . If $\Phi=(\tilde{\varphi},S)$ is a solution of $(M,\mathcal{C},\mathcal{L})$ ’s NL-WKB extension, then $\Phi$ is a critical point of $(\tilde{M},\tilde{\mathcal{C}},\tilde{\mathcal{L}})$ ’s local action functional $\tilde{A}_{U\times S^{1}}$ for each $U\subset M$ . Conversely, if $\Phi=(\tilde{\varphi},S)\in\tilde{\mathcal{C}}$ is a critical point of $\tilde{A}_{U\times S^{1}}$ for each $U\subset M$ , then $(\tilde{\varphi},S)$ is a solution of $(M,\mathcal{C},\mathcal{L})$ ’s NL-WKB extension.

Proof.

First suppose that $\Phi=(\tilde{\varphi},S)\in\tilde{\mathcal{C}}$ is a critical point of $\tilde{A}_{U\times S^{1}}$ for each $U\subset M$ . Introduce the indices $\tilde{a}\in\{1,\dots,f+1\}$ , $\tilde{\mu}\in\{1,\dots,m+1\}$ , as well as the shorthand notation

[TABLE]

(We refer the reader to the text below Eq. (16) for the definition of $\partial\Phi$ .) Then it must be true that

[TABLE]

for all $\delta\Phi\in\tilde{\mathcal{C}}$ that vanish on $\partial U\times S^{1}$ . Because $\delta\Phi^{a}=\delta\tilde{\varphi}^{a}$ when $a\in\{1,\dots,f\}$ and $\delta\tilde{\varphi}(x,\theta)$ is arbitrary away from the boundary, Eq. (37) implies

[TABLE]

Likewise, because $\delta\Phi^{f+1}=\delta S$ and $\delta S(x)$ is arbitrary away from the boundary, we have

[TABLE]

Here we have used $\frac{\partial\tilde{\mathcal{L}}}{\partial S}=0$ , which follows from Eq. (28). We will refer to Eqs. (38) and (39) as the Euler-Lagrange equations associated with the looping $(\tilde{M},\tilde{\mathcal{C}},\tilde{\mathcal{L}})$ , or the looped Euler-Lagrange equations. By reading the previous argument in reverse, we see that $\Phi$ is a critical point of $\tilde{A}_{U\times S^{1}}$ for all $U\subset M$ if and only if $\Phi$ satisfies the looped Euler-Lagrange equations.

The derivatives of $\tilde{\mathcal{L}}$ that appear in Eqs. (38) and (39) may be expressed in terms of derivatives of $\mathcal{L}$ using the definition (28) according to

[TABLE]

Therefore the looped Euler-Lagrange equations are equivalent to

[TABLE]

where $j(x,\theta)$ was defined in Eq. (20). In particular, Eq. (44) reproduces the NL-WKB extension of $(M,\mathcal{C},\mathcal{L})$ , i.e. Eq. (19). This proves that each $\Phi$ that is a critical point of $\tilde{A}_{U\times S^{1}}$ for all $U\subset M$ is also a solution of the NL-WKB extension of $(M,\mathcal{C},\mathcal{L})$ .

Now suppose conversely that $\Phi=(\tilde{\varphi},S)$ is a solution of the NL-WKB extension of $(M,\mathcal{C},\mathcal{L})$ . The first of the looped Euler-Lagrange equations, i.e. Eq. (44), is then clearly satisfied. However it is not immediately clear that the second equation (45) is also satisfied. To establish Eq. (45) first note that we have the following identity:

[TABLE]

where we have used the chain rule on the first line, and the NL-WKB extension of $(M,\mathcal{C},\mathcal{L})$ (cf. Eq. (19)) on the second line. Next integrate Eq. (46) over $S^{1}$ and apply integration by parts as follows:

[TABLE]

where we have used the equality of mixed partial derivatives. This completes the proof. ∎
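
To make the two looped Euler-Lagrange equations concrete, here is a worked example that does not appear in the text: a nonlinear Klein-Gordon field with potential $W$. It assumes the looped Lagrangian is obtained by the substitution $\partial_{\mu}\varphi\rightarrow\partial_{\mu}\tilde{\varphi}+\partial_{\mu}S\,\partial_{\theta}\tilde{\varphi}$ suggested by the WKB ansatz; the symbols $W$ and $D_{\mu}$ are introduced only for this illustration, and overall normalization factors are ignored.

```latex
% Illustrative theory (not from the source): nonlinear Klein-Gordon field, M = R^2, f = 1.
\mathcal{L} = \tfrac12 (\partial_t \varphi)^2 - \tfrac12 (\partial_x \varphi)^2 - W(\varphi).

% Assumed looped Lagrangian, with D_\mu = \partial_\mu + \partial_\mu S\,\partial_\theta:
\tilde{\mathcal{L}} = \tfrac12 (D_t \tilde{\varphi})^2 - \tfrac12 (D_x \tilde{\varphi})^2 - W(\tilde{\varphi}).

% Variation in \tilde{\varphi}(x,t,\theta), arbitrary away from the boundary:
D_t D_t \tilde{\varphi} - D_x D_x \tilde{\varphi} + W'(\tilde{\varphi}) = 0.

% Variation in S(x,t); \delta S is \theta-independent, so a \theta-integral survives:
\partial_t \oint \partial_\theta \tilde{\varphi}\, D_t \tilde{\varphi}\, d\theta
  - \partial_x \oint \partial_\theta \tilde{\varphi}\, D_x \tilde{\varphi}\, d\theta = 0.
```

Multiplying the first equation by $\partial_{\theta}\tilde{\varphi}$ and integrating over $\theta$ reproduces the second. The terms $\oint\partial_{\theta}W(\tilde{\varphi})\,d\theta$ and $\oint\tfrac{1}{2}\partial_{\theta}(D_{\mu}\tilde{\varphi})^{2}\,d\theta$ vanish by periodicity, using $D_{\mu}\partial_{\theta}=\partial_{\theta}D_{\mu}$. This is the same pattern of chain rule, integration by parts, and equality of mixed partials used in the converse direction of the proof.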

Remark 2.

The preceding proof demonstrated the presence of a redundancy in the looped Euler-Lagrange equations (44)-(45). The reason for this redundancy is the presence of a gauge symmetry. The gauge group is given by smooth functions $\psi:M\rightarrow S^{1}$ with addition as the group composition law. The action of $\psi$ on $(\tilde{\varphi},S)$ is given by $\psi\cdot(\tilde{\varphi},S)=(\tilde{\varphi}^{\prime},S^{\prime})$ , where

[TABLE]

The redundancy in the looped Euler-Lagrange equations, i.e. the fact that Eq. (44) implies Eq. (45), may be seen as a consequence of gauge symmetry by applying Noether’s second theorem. The presence of this gauge symmetry could have been anticipated by noting that the nonlinear WKB ansatz does not uniquely specify the phase function $S$ .
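
The non-uniqueness of $S$ can be checked numerically. The sketch below assumes the gauge action takes the natural form $\tilde{\varphi}^{\prime}(x,\theta)=\tilde{\varphi}(x,\theta+\psi(x))$, $S^{\prime}=S-\psi$; the source's own formula may differ by a sign convention. The profile, phase, and shift functions are hypothetical choices made only for illustration.

```python
import math

def phi_tilde(x, theta):
    # Hypothetical profile: 2*pi-periodic in theta, slowly varying in x.
    return (1.0 + 0.3 * x) * math.cos(theta) + 0.1 * math.sin(2.0 * theta + x)

def S(x):
    # Hypothetical phase function.
    return 5.0 * x + 0.5 * x * x

def psi(x):
    # Hypothetical gauge function M -> S^1 (represented by a real lift).
    return 0.7 * math.sin(x)

def gauge_transform(profile, phase, shift):
    # Assumed action: shift the profile's angle by +psi and the phase by -psi.
    def new_profile(x, theta):
        return profile(x, theta + shift(x))
    def new_phase(x):
        return phase(x) - shift(x)
    return new_profile, new_phase

def physical_field(profile, phase, x):
    # WKB ansatz: phi(x) = phi_tilde(x, S(x)).
    return profile(x, phase(x))

phi_tilde_new, S_new = gauge_transform(phi_tilde, S, psi)
samples = [-1.0, 0.0, 0.4, 2.5]
max_diff = max(
    abs(physical_field(phi_tilde, S, x) - physical_field(phi_tilde_new, S_new, x))
    for x in samples
)
print(max_diff)
```

Both pairs $(\tilde{\varphi},S)$ and $(\tilde{\varphi}^{\prime},S^{\prime})$ reproduce the same physical field up to floating-point rounding, which is the sense in which the ansatz does not determine the phase function.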

III Euler-Poincaré fluids as classical field theories

While the results from Section II are useful for identifying variational principles that govern the nonlinear WKB extension of a large class of dissipation-free PDEs, they are not immediately applicable to many of the PDEs that appear in fluid dynamics. In particular, they cannot be applied directly to the fluid-mechanical PDEs that arise from Euler-Poincaré variational principles (Holm, Marsden, and Ratiu, 1998). The essential issue is that, as we will review, Euler-Poincaré variational principles do not fit into the mold of classical field theory. The purpose of this section is to construct an alternative variational principle for Euler-Poincaré fluid equations to which Whitham averaging (i.e. Theorem 1) can be profitably applied. In the following section, we will show how to use Whitham averaging to identify a variational principle for the NL-WKB extension of Euler-Poincaré fluid equations.

We will restrict our attention to a large subclass of Euler-Poincaré fluid equations defined as follows.

Definition 5 (LBEP equations).

Given a function $\mathcal{L}_{\text{EP}}:\mathbb{R}^{3}\times\mathbb{R}\times\mathbb{R}^{3}\rightarrow\mathbb{R}:(\bm{u},\rho,\bm{\nabla}\rho)\mapsto\mathcal{L}_{\text{EP}}(\bm{u},\rho,\bm{\nabla}\rho)$ such that $\bm{u}\mapsto(\partial\mathcal{L}_{\text{EP}}/\partial\bm{u})(\bm{u},\rho,\bm{\nabla}\rho)$ is a diffeomorphism for each $(\rho,\bm{\nabla}\rho)\in\mathbb{R}\times\mathbb{R}^{3}$, the associated local barotropic Euler-Poincaré fluid equations (LBEP equations) are the system of PDEs

[TABLE]

where the unknown fields are $(\rho(\bm{x},t),\bm{u}(\bm{x},t))\in\mathbb{R}\times\mathbb{R}^{3}$ and all derivatives of $\mathcal{L}_{\text{EP}}$ are evaluated at $(\bm{u}(\bm{x},t),\rho(\bm{x},t),\bm{\nabla}\rho(\bm{x},t))$ . The function $\mathcal{L}_{\text{EP}}$ is called the Euler-Poincaré Lagrange density and our convention for the tensor divergence is $(\bm{\nabla}\cdot T)_{j}=\partial_{i}T_{ij}$ . We have also introduced the notation $\otimes$ for the point-wise tensor product, i.e. the tensor product over the ring $C^{\infty}(Q)$ .

Upon introducing the EP Hamiltonian density $\mathcal{H}_{\text{EP}}$ , the LBEP equations may also be conveniently written in terms of the momentum density $\bm{p}=\partial\mathcal{L}_{\text{EP}}/\partial\bm{u}$ as follows.

Definition 6.

Let $\underline{\bm{u}}:\mathbb{R}^{3}\times\mathbb{R}\times\mathbb{R}^{3}\rightarrow\mathbb{R}^{3}$ be defined implicitly by the formula

[TABLE]

The EP Hamiltonian density $\mathcal{H}_{\text{EP}}:\mathbb{R}^{3}\times\mathbb{R}\times\mathbb{R}^{3}\rightarrow\mathbb{R}$ is defined by

[TABLE]
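
As an illustration of Definition 6, consider the familiar barotropic fluid. This example is not taken from the text: the internal energy density $U(\rho)$ is a hypothetical choice, the Legendre-transform form $\mathcal{H}_{\text{EP}}=\bm{p}\cdot\underline{\bm{u}}-\mathcal{L}_{\text{EP}}$ is assumed, and the diffeomorphism hypothesis holds only for $\rho>0$.

```latex
% Assumed barotropic Lagrange density with internal energy density U(\rho), \rho > 0:
\mathcal{L}_{\text{EP}}(\bm{u},\rho,\bm{\nabla}\rho) = \tfrac12 \rho\,|\bm{u}|^2 - U(\rho).

% Momentum density and the implicitly defined velocity:
\bm{p} = \frac{\partial \mathcal{L}_{\text{EP}}}{\partial \bm{u}} = \rho\,\bm{u},
\qquad \underline{\bm{u}}(\bm{p},\rho,\bm{\nabla}\rho) = \bm{p}/\rho.

% Legendre transform, assuming H_EP = p . u - L_EP evaluated at u = \underline{u}:
\mathcal{H}_{\text{EP}}(\bm{p},\rho,\bm{\nabla}\rho) = \frac{|\bm{p}|^2}{2\rho} + U(\rho).

% Consistency checks:
\frac{\partial \mathcal{H}_{\text{EP}}}{\partial \bm{p}} = \frac{\bm{p}}{\rho} = \underline{\bm{u}},
\qquad
\frac{\partial \mathcal{H}_{\text{EP}}}{\partial \rho}
  = -\frac{|\bm{p}|^2}{2\rho^2} + U'(\rho)
  = -\left.\frac{\partial \mathcal{L}_{\text{EP}}}{\partial \rho}\right|_{\bm{u}=\underline{\bm{u}}}.
```

In this example $\mathcal{H}_{\text{EP}}$ is the usual kinetic-plus-internal energy density, and the two checks are the standard Legendre-transform identities relating derivatives of $\mathcal{H}_{\text{EP}}$ to those of $\mathcal{L}_{\text{EP}}$.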

Lemma 2 (LBEP equations, momentum form).

The LBEP equations for the unknown fields $(\rho(\bm{x},t),\bm{u}(\bm{x},t))\in\mathbb{R}\times\mathbb{R}^{3}$ are equivalent to the following system of PDEs for the unknown fields $(\rho(\bm{x},t),\bm{p}(\bm{x},t))\in\mathbb{R}\times\mathbb{R}^{3}$ :

[TABLE]

We will refer to Eqs. (54) and (55) as the momentum form of the LBEP equations, or mLBEP equations for brevity.

Proof.

By differentiating the definition (53) of $\mathcal{H}_{\text{EP}}$ and substituting the definition (52) of $\underline{\bm{u}}$ , we obtain

[TABLE]

If $(\rho,\bm{p})$ is a solution of the mLBEP equations, then the identities (56)-(58) imply that $(\rho,\underline{\bm{u}}(\bm{p},\rho,\bm{\nabla}\rho))$ is a solution of the LBEP equations. This shows that $I:(\rho,\bm{p})\mapsto(\rho,\underline{\bm{u}}(\bm{p},\rho,\bm{\nabla}\rho))$ maps solutions of the mLBEP equations into solutions of the LBEP equations. If $(\rho,\bm{u})$ is a solution of the LBEP equations, then the identities (56)-(58) imply that $(\rho,\partial\mathcal{L}_{\text{EP}}/\partial\bm{u}(\bm{u},\rho,\bm{\nabla}\rho))$ is a solution of the mLBEP equations. This shows that the mapping $I$ is surjective. Injectivity of $I$ follows from the hypothesis that $\bm{u}\mapsto\partial\mathcal{L}_{\text{EP}}/\partial\bm{u}$ is a diffeomorphism. The mapping $I$ therefore establishes a bijection between solutions of the LBEP equations and solutions of the mLBEP equations. ∎

We aim to identify an ordinary first-order classical field theory whose associated Euler-Lagrange equations reproduce the LBEP equations. In order to illustrate why this task is non-trivial, let us briefly review the Euler-Poincaré variational formulation of the LBEP equations described in Ref. Holm, Marsden, and Ratiu, 1998. The basic idea is to introduce the space of Lagrangian configuration maps $\bm{g}:Q_{0}\rightarrow Q$ . A Lagrangian configuration map is a diffeomorphism that assigns to each particle label $\bm{x}_{0}\in Q_{0}$ the current Eulerian position of that fluid particle $\bm{x}=\bm{g}(\bm{x}_{0})$ . The space $Q_{0}$ is referred to as the label space, while the space $Q$ is the fluid container. Set $Q=(S^{1})^{3}$ , fix a positive function $\rho_{0}:Q_{0}\rightarrow\mathbb{R}$ , and consider the action

[TABLE]

where $\bm{g}:[t_{1},t_{2}]\rightarrow\text{Diff}(Q_{0},Q)$ . Define the fluid velocity $\bm{v}$ and the mass density $\rho$ according to

[TABLE]

When the Lagrangian $L_{\rho_{0}}:T\text{Diff}(Q_{0},Q)\rightarrow\mathbb{R}$ is given by

[TABLE]

we can establish a close relationship between the LBEP equations and the Euler-Lagrange equations associated with $L_{\rho_{0}}$ . To see this, observe first that because $\bm{v}$ and $\rho$ are defined in terms of the configuration map $\bm{g}$ , they cannot be varied independently. Instead, variations of $\bm{g}$ induce variations of $\bm{v}$ and $\rho$ as follows. Given a variation $\delta\bm{g}$ of $\bm{g}$ , we may construct an “Eulerianized” variation $\bm{\xi}=\delta\bm{g}\circ\bm{g}^{-1}$ . The variations of $\bm{v}$ and $\rho$ may then be computed as

[TABLE]

The Euler–Lagrange (EL) equations may therefore be obtained by varying the action $\mathcal{A}_{\rho_{0}}$ with respect to $t\mapsto\bm{g}(t)$ and making judicious use of the induced variation formulas (63)-(64). This leads to

[TABLE]

Substituting Eqs. (63) and (64) and integrating by parts leads to

[TABLE]

where $\bm{p}$ is the momentum density

[TABLE]

Equation (66) is a generalized momentum conservation equation. It can be written in conservative form as well. A simple calculation leads to

[TABLE]

Note that by substituting Eq. (67) and identifying $\bm{v}$ with $\bm{u}$ , Eq. (68) becomes the momentum equation (51) of the LBEP equations. In addition, since the fluid mass density $\rho$ is defined in terms of $\bm{g}$ by Eq. (9), it satisfies by construction the continuity equation

[TABLE]

In particular, Eq. (69) is not a consequence of the Euler-Lagrange equations. It follows that the Euler-Lagrange equations associated with the action $\mathcal{A}_{\rho_{0}}$ may be regarded as a second-order ordinary differential equation in the variable $\bm{g}\in\text{Diff}(Q_{0},Q)$ , or, equivalently, a first-order ordinary differential equation in the variables $(\bm{g},\bm{u})\in\text{Diff}(Q_{0},Q)\times\mathfrak{X}(Q)\approx T\text{Diff}(Q_{0},Q)$ ; the evolution equation for $\bm{g}$ is $\bm{v}=\dot{\bm{g}}\circ\bm{g}^{-1}=\bm{u}$ and the evolution equation for $\bm{u}$ is given by substituting $\bm{v}=\bm{u}$ in Eq. (68). Note in particular that if $t\mapsto(\bm{g}(t),\bm{u}(t))$ is a solution of the Euler-Lagrange equations, then $t\mapsto(\bm{u}(t),\rho(t))$ is a solution of the LBEP equations when Eq. (69) is used to define the mass density $\rho$ .

Following the discussion in Sec. II, we would like to find a variational formulation for the NL-WKB extension of the LBEP equations. Whitham averaging would seem to be a natural tool for this task. However, there are certain idiosyncrasies of the EP action principle $\delta\mathcal{A}_{\rho_{0}}=0$ that prevent us from directly applying the Whitham averaging theorem (Theorem 1). These are the following. First, the Lagrangian $L_{\rho_{0}}$ depends on the configuration map $\bm{g}$, whose domain is the label space $Q_{0}\neq Q$. In contrast, Theorem 1 applies to Lagrangians defined on spaces of fields over spacetime $M=Q\times\mathbb{R}$. Second, for any given $\rho_{0}$, not all of the solutions of the LBEP equations can be recovered from the EP action principle $\delta\mathcal{A}_{\rho_{0}}=0$. Indeed, while the space of solutions of the LBEP equations on $Q=(S^{1})^{3}$ is parameterized by initial data $(\rho,\bm{u})\in C_{+}^{\infty}(Q)\times\mathfrak{X}(Q)$, solutions of the Euler-Lagrange equations associated with $\mathcal{A}_{\rho_{0}}$ cannot accommodate all possible initial $\rho$. In fact, solutions of the Euler-Lagrange equations for $\bm{g}$ can only recover solutions of the LBEP equations with initial $\rho$ that may be related to the parameter $\rho_{0}$ by some diffeomorphism $\bm{g}:Q_{0}\rightarrow Q$ using the formula (9). Note in particular that initial $\rho$ with $\int\rho\,d^{3}\bm{x}\neq\int\rho_{0}\,d^{3}\bm{x}_{0}$ cannot be obtained in this manner. In contrast, the PDEs addressed by the Whitham averaging theorem all have the property that solutions of the PDE are precisely the critical points of a single action functional, rather than a family of action functionals like $\mathcal{A}_{\rho_{0}}$. Finally, and most superficially, since the mass density $\rho$ is defined using the Jacobian of the mapping $\bm{g}$ (see Eq. (9)), second derivatives of $\bm{g}$ appear in the Lagrangian $L_{\rho_{0}}$. This suggests that the theory of Whitham averaging described in Section II for first-order field theories cannot be applied.

In order to eventually bring Whitham averaging to bear on the problem of NL-WKB extension of the LBEP equations, we will construct an alternative variational formulation of the LBEP equations that fits into the framework of first-order classical field theory. We proceed as follows. (1) First, let us introduce the inverse of the configuration map, $\bm{h}\doteq\bm{g}^{-1}$ , which is also known as the back-to-labels map. Conveniently, $\bm{h}$ is a mapping from the spatial domain $Q$ to the label space $Q_{0}$ , and may therefore be regarded as a $Q_{0}$ -valued field. The motivation here is that while the ansatz $\bm{g}(\bm{x}_{0})=\tilde{\bm{g}}(\bm{x}_{0},S(\bm{x}_{0}))$ requires evaluating the phase function on points in the label space, the ansatz $\bm{h}(\bm{x})=\tilde{\bm{h}}(\bm{x},S(\bm{x}))$ does not. We then substitute $\bm{g}=\bm{h}^{-1}$ into the action Eq. (59). The velocity field $\bm{v}$ , which was originally given by $\bm{v}\doteq\dot{\bm{g}}\circ\bm{g}^{-1}$ , is now written in terms of $\bm{h}$ as

[TABLE]
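
For the reader's convenience, the expression for $\bm{v}$ in terms of $\bm{h}$ can be recovered in two lines from the defining relation $\bm{h}=\bm{g}^{-1}$. The derivation below is a sketch; the index convention $(\bm{v}\cdot\bm{\nabla}\bm{h})_{j}=v_{i}\partial_{i}h_{j}$ is an assumption chosen to agree with the relation $\bm{v}=-\partial_{t}\bm{h}\cdot(\bm{\nabla}\bm{h})^{-1}$ used later in the text.

```latex
% h(t) = g(t)^{-1}, so for every label x_0:
\bm{h}(\bm{g}(\bm{x}_0,t),t) = \bm{x}_0.

% Differentiate in t at fixed x_0 and evaluate at x = g(x_0,t), with v = \dot{g} \circ g^{-1}:
\partial_t \bm{h}(\bm{x},t) + \bm{v}(\bm{x},t)\cdot\bm{\nabla}\bm{h}(\bm{x},t) = 0.

% \nabla h is invertible because h is a diffeomorphism:
\bm{v} = -\,\partial_t\bm{h}\cdot(\bm{\nabla}\bm{h})^{-1},
\qquad (\bm{v}\cdot\bm{\nabla}\bm{h})_j = v_i\,\partial_i h_j.
```

A one-dimensional sanity check: for $g(x_{0},t)=x_{0}e^{t}$ one has $h(x,t)=xe^{-t}$ and $\dot{g}\circ g^{-1}=x$, while $-\partial_{t}h/\partial_{x}h=xe^{-t}/e^{-t}=x$ as well.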

It can be shown that variations of $\bm{v}$ and $\rho$ with respect to $\bm{h}$ are still given by Eq. (63) and Eq. (64), but now the vector field $\bm{\xi}$ is written as $\bm{\xi}=-\delta\bm{h}\cdot(\bm{\nabla}\bm{h})^{-1}$ . (2) Next we construct the parameter-dependent phase space Lagrangian $\mathsf{L}_{\rho_{0}}(\bm{h},\dot{\bm{h}},\bm{p},\dot{\bm{p}})$ given by

[TABLE]

The associated parameter-dependent phase space action functional,

[TABLE]

is defined on the space of paths $[t_{1},t_{2}]\rightarrow\text{Diff}(Q,Q_{0})\times\mathfrak{X}(Q)$ . This implies that variations are to be applied to $\bm{h}$ and $\bm{p}$ independently while holding the values of $\bm{h}$ and $\bm{p}$ fixed at $t_{1}$ and $t_{2}$ . (3) Finally, we introduce a scalar function $\chi:Q\rightarrow\mathbb{R}$ as a Lagrange multiplier that enforces the continuity equation as in Section 4.2 of Ref. Cotter and Holm, 2012. This leads to the parameter-independent phase space Lagrangian $\mathsf{L}(\bm{h},\dot{\bm{h}},\bm{p},\dot{\bm{p}},\rho,\dot{\rho},\chi,\dot{\chi})$ given by

[TABLE]

where the velocity $\bm{v}$ is defined in terms of $\bm{h}$ as in Eq. (70). The Lagrangian $\mathsf{L}$ is intrinsically a function on $T\mathcal{C}_{0}$, where $\mathcal{C}_{0}$ is the space of frozen field configurations.

Definition 7.

The space of frozen field configurations is the infinite-dimensional manifold $\mathcal{C}_{0}=\text{Diff}(Q,Q_{0})\times\mathfrak{X}(Q)\times C^{\infty}_{+}(Q)\times C^{\infty}(Q)$ , where $C^{\infty}_{+}(Q)$ is the set of smooth positive functions on $Q$ , and $\mathfrak{X}(Q)$ is the set of vector fields on $Q$ . That is, $\mathcal{C}_{0}$ comprises maps $Q\ni\bm{x}\mapsto(\bm{h}(\bm{x}),\bm{p}(\bm{x}),\rho(\bm{x}),\chi(\bm{x}))\in Q_{0}\times\mathbb{R}^{3}\times\mathbb{R}\times\mathbb{R}$ , where $\bm{h}$ is a diffeomorphism and $\rho(\bm{x})>0$ for all $\bm{x}\in Q$ .

Correspondingly, the parameter-independent phase space action functional,

[TABLE]

is defined on the space of paths $[t_{1},t_{2}]\rightarrow\mathcal{C}_{0}$ , which implies that variations should be applied to $\bm{h}$ , $\bm{p}$ , $\rho$ , and $\chi$ independently.

The following proposition shows that the Euler-Lagrange equations associated with $\mathsf{A}$ define a system of PDEs that completely recover the LBEP equations.

Proposition 1.

A path $t\mapsto(\bm{h}(t),\bm{p}(t),\rho(t),\chi(t))\in\mathcal{C}_{0}$ is a critical point of the action functional $\mathsf{A}$ in Eq. (74) if and only if $\bm{h}$, $\bm{p}$, $\rho$, and $\chi$ satisfy the following system of PDEs:

[TABLE]

Moreover, every solution $t\mapsto(\bm{u}(t),\rho(t))$ of the LBEP equations may be obtained from some solution $t\mapsto(\bm{h}(t),\bm{p}(t),\rho(t),\chi(t))$ of the Euler-Lagrange equations associated with $\mathsf{A}$ by defining $\bm{u}(t)$ using the Legendre transform

[TABLE]

Proof.

The EL equations associated with the variational principle $\delta\mathsf{A}=0$ may be derived as follows. When varying the momentum density $\bm{p}$ , one immediately finds

[TABLE]

Thus, just as in finite-dimensional Hamiltonian systems, the fluid velocity is given by the partial derivative of the Hamiltonian with respect to the momentum density. Varying the action with respect to the scalar field $\chi$ leads to the continuity equation

[TABLE]

Varying the action with respect to $\rho$ gives

[TABLE]

As before, since $\bm{v}$ depends on $\bm{h}$ , the induced variations of $\bm{v}$ are given by Eq. (63). Therefore variations of $\bm{h}$ lead to

[TABLE]

Equations (80)–(83) are the EL equations associated with the action (74). In order to see that they are equivalent to Eqs. (75)-(78), first substitute Eqs. (81) and (82) into Eq. (83) in order to obtain

[TABLE]

Then use the identity

[TABLE]

together with Eq. (80) to write Eq. (84) as Eq. (76). Equations (75), (77), and (78) are finally seen to be equivalent to Eqs. (80), (81), and (82) in light of the relations $\bm{v}=\partial\mathcal{H}_{\text{EP}}/\partial\bm{p}=-\partial_{t}\bm{h}\cdot(\bm{\nabla}\bm{h})^{-1}$ .

In order to see that every solution of the LBEP equations may be obtained from solutions of Eqs. (75)-(78), we merely observe that Eqs. (76)-(77) are precisely the momentum form of the LBEP equations. This system of PDEs was shown to be equivalent to the LBEP equations in Lemma 2. We may say that the mLBEP equations are embedded within the Euler-Lagrange equations associated with $\mathsf{A}$ .

∎

Remark 3.

It is to be noted that, in order to apply Whitham averaging to the EP action principle, it is not strictly necessary to construct the Hamiltonian formulation given in Eqs. (53) and (74) with $\bm{p}$ as an additional dynamical variable. One could instead simply introduce the back-to-labels map $\bm{h}$ and add the term involving the Lagrange multiplier in order to construct the action. However, having $\bm{p}$ as an argument of the action functional is convenient when performing WKB asymptotics on the NL-WKB extension of the generalized fluid system. This will be further discussed in Sec. VI, where we will apply our results to high-frequency acoustic waves interacting with a compressible isothermal flow.

A simple corollary of Proposition 1 is that the LBEP equations may be formulated as the Euler-Lagrange equations associated with an ordinary first-order classical field theory.

Theorem 2 (LBEP field theory).

Let $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$ be the ordinary first-order classical field theory defined as follows.

• $M_{\text{EP}}=Q\times\mathbb{R}$.

• $\mathcal{C}_{\text{EP}}$ is the space of smooth functions $\varphi:M\rightarrow F$, where $F=Q_{0}\times\mathbb{R}^{3}\times\mathbb{R}\times\mathbb{R}$.

• Write a general element $\partial\varphi\in D=M_{8\times 4}(\mathbb{R})$ as

[TABLE]

and a general element $\varphi\in F$ as $(\bm{h},\bm{p},\rho,\chi)\in Q_{0}\times\mathbb{R}^{3}\times\mathbb{R}\times\mathbb{R}$ . The Lagrangian density $\mathfrak{L}_{\text{EP}}:M\times F\times D\rightarrow\mathbb{R}$ is given by

[TABLE]

The Euler-Lagrange equations associated with $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$ are equivalent to Eqs. (75)-(78).

Remark 4.

The Euler-Lagrange equations associated with $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$ comprise a larger system of PDEs than the LBEP equations. However, Proposition 1 shows that the LBEP equations are embedded within the Euler-Lagrange equations associated with $\mathfrak{L}_{\text{EP}}$. In this sense it seems reasonable to attempt to uncover properties of the LBEP equations by studying $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$. On the other hand, it also seems plausible that the additional variables present in $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$’s Euler-Lagrange equations might render the study of $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$ even more complicated than studying the LBEP equations directly. We will show in Section IV that $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$ does provide useful information about the LBEP equations because it can be combined with Whitham averaging in order to identify a variational formulation for the NL-WKB extension of the LBEP equations. In Section V we will show that symmetries of $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$ and of its looping explain why the additional fields present in $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$ do not spoil its utility.

IV Variational structure of nonlinear WKB in the Eulerian frame

In this section we will use the results from the previous two sections to identify a variational principle for the NL-WKB extension of the LBEP equations. We will frame our discussion in terms of the momentum form of the LBEP equations.

Definition 8.

Introduce the operators $\partial_{t}^{S}=\partial_{t}+\partial_{t}S\,\partial_{\theta}$ and $\bm{\nabla}^{S}=\bm{\nabla}+\bm{\nabla}S\,\partial_{\theta}$ . The NL-WKB extension of the LBEP equations is the system of PDEs

[TABLE]

where the derivatives of the EP Hamiltonian density are evaluated at $(\widetilde{\bm{p}},\widetilde{\rho},\bm{\nabla}^{S}\widetilde{\rho})$ . The unknown fields are $(\widetilde{\rho}(\bm{x},\theta,t),\widetilde{\bm{p}}(\bm{x},\theta,t),S(\bm{x},t))\in\mathbb{R}\times\mathbb{R}^{3}\times S^{1}$ . For the sake of brevity, we will refer to this system of PDEs as the extLBEP equations.
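
The operators $\partial_{t}^{S}$ and $\bm{\nabla}^{S}$ are the derivatives that the chain rule produces when a field is evaluated on the phase, e.g. $\rho(\bm{x},t)=\widetilde{\rho}(\bm{x},S(\bm{x},t),t)$. The following finite-difference check is a minimal sketch in one space dimension; the profile `rho_tilde`, the phase `S`, and the sample point are hypothetical choices made for illustration only.

```python
import math

def rho_tilde(x, theta, t):
    # Hypothetical profile: 2*pi-periodic in theta, slowly varying in x and t.
    return 1.0 + 0.2 * math.sin(theta) * math.cos(x) + 0.05 * t * math.cos(2.0 * theta)

def S(x, t):
    # Hypothetical phase function.
    return 3.0 * x - 2.0 * t + 0.1 * x * t

def rho(x, t):
    # WKB ansatz: evaluate the profile on the phase.
    return rho_tilde(x, S(x, t), t)

def central(f, s, h=1e-5):
    # Second-order central difference of a one-argument function at s.
    return (f(s + h) - f(s - h)) / (2.0 * h)

x0, t0 = 0.3, 0.7
theta0 = S(x0, t0)
d_theta = central(lambda th: rho_tilde(x0, th, t0), theta0)

# Left side: d/dt of the physical field. Right side: (d_t + d_t S d_theta) rho_tilde at theta = S.
dt_direct = central(lambda s: rho(x0, s), t0)
dt_shifted = central(lambda s: rho_tilde(x0, theta0, s), t0) + central(lambda s: S(x0, s), t0) * d_theta

# Same comparison for the spatial derivative (the 1D analogue of nabla^S).
dx_direct = central(lambda s: rho(s, t0), x0)
dx_shifted = central(lambda s: rho_tilde(s, theta0, t0), x0) + central(lambda s: S(s, t0), x0) * d_theta

err_t = abs(dt_direct - dt_shifted)
err_x = abs(dx_direct - dx_shifted)
print(err_t, err_x)
```

The two errors are at the level of the finite-difference truncation, consistent with reading the extLBEP equations as the momentum-form LBEP equations with $\partial_{t}\rightarrow\partial_{t}^{S}$ and $\bm{\nabla}\rightarrow\bm{\nabla}^{S}$.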

The rationale behind the existence of a variational formulation for the extLBEP equations is as follows. According to Theorem 2, the LBEP equations may be realized as a subset of the Euler-Lagrange equations arising from the ordinary first-order classical field theory $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$ . Because the LBEP equations are a subset of $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$ ’s Euler-Lagrange equations, the extLBEP equations must be a subset of $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$ ’s NL-WKB extension. Indeed, applying the NL-WKB extension procedure to Eqs. (75)-(78) involves applying NL-WKB extension to Eqs. (76)-(77), the latter of which are equivalent to the momentum form of the LBEP equations. But by Theorem 1 the NL-WKB extension of $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$ arises as the Euler-Lagrange equations associated with the looping of $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$ , i.e. $(\widetilde{M}_{\text{EP}},\widetilde{\mathcal{C}}_{\text{EP}},\widetilde{\mathfrak{L}}_{\text{EP}})$ . (See Definition 4.) Therefore the variational principle furnished by $(\widetilde{M}_{\text{EP}},\widetilde{\mathcal{C}}_{\text{EP}},\widetilde{\mathfrak{L}}_{\text{EP}})$ ’s action functional must serve as a variational principle for the extLBEP equations. In summary, we have proved the following.

Proposition 2.

Let $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$ be defined as in Theorem 2, and let $(\widetilde{M}_{\text{EP}},\widetilde{\mathcal{C}}_{\text{EP}},\widetilde{\mathfrak{L}}_{\text{EP}})$ be the looping of $(M_{\text{EP}},\mathcal{C}_{\text{EP}},\mathfrak{L}_{\text{EP}})$ . Consider $\Phi\in\widetilde{\mathcal{C}}_{\text{EP}}$ with components $\Phi(\bm{x},t,\theta)=(\widetilde{\bm{h}}(\bm{x},t,\theta),\widetilde{\bm{p}}(\bm{x},t,\theta),\widetilde{\rho}(\bm{x},t,\theta),\widetilde{\chi}(\bm{x},t,\theta),S(\bm{x},t))\in Q_{0}\times\mathbb{R}^{3}\times\mathbb{R}\times\mathbb{R}\times S^{1}$ . The field $\Phi$ is a critical point of $(\widetilde{M}_{\text{EP}},\widetilde{\mathcal{C}}_{\text{EP}},\widetilde{\mathfrak{L}}_{\text{EP}})$ ’s local action functional $\tilde{A}_{U\times S^{1}}$ for each $U\subset M$ if and only if $\Phi$ ’s component functions satisfy the system of PDEs

[TABLE]

where the derivatives of $\mathcal{H}_{\text{EP}}$ are evaluated at $(\widetilde{\bm{p}},\widetilde{\rho},\bm{\nabla}^{S}\widetilde{\rho})$ . In particular, the extLBEP equations are recovered as a subset of the Euler-Lagrange equations associated with $(\widetilde{M}_{\text{EP}},\widetilde{\mathcal{C}}_{\text{EP}},\widetilde{\mathfrak{L}}_{\text{EP}})$ .

Proposition 2 gives the variational structure of the extLBEP equations in a manner that treats space and time on an equal footing. In order to analyze the extLBEP equations as a dynamical system, it is also important to formulate Proposition 2 in terms of evolving fields on space instead of “static” fields on spacetime. For this purpose, it is useful to introduce the looped frozen field configurations and the extLBEP Lagrangian.

Definition 9.

The space of looped frozen field configurations $\ell\mathcal{C}_{0}$ is the collection of smooth mappings $S^{1}\rightarrow\mathcal{C}_{0}$. (Cf. Definition 7.) We will identify elements of $\ell\mathcal{C}_{0}$ with mappings $Q\times S^{1}\ni(\bm{x},\theta)\mapsto(\widetilde{\bm{h}}(\bm{x},\theta),\widetilde{\bm{p}}(\bm{x},\theta),\widetilde{\rho}(\bm{x},\theta),\widetilde{\chi}(\bm{x},\theta))\in Q_{0}\times\mathbb{R}^{3}\times\mathbb{R}\times\mathbb{R}$, where $\bm{x}\mapsto\widetilde{\bm{h}}(\bm{x},\theta)$ is a diffeomorphism for each $\theta$, and $\widetilde{\rho}(\bm{x},\theta)>0$ for all $(\bm{x},\theta)\in Q\times S^{1}$.

Definition 10.

The extLBEP Lagrangian is the functional $\widetilde{\mathsf{L}}:T(\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1}))\rightarrow\mathbb{R}$ whose value at $(\widetilde{\bm{h}},\widetilde{\bm{p}},\widetilde{\rho},\widetilde{\chi},S,\dot{\widetilde{\bm{h}}},\dot{\widetilde{\bm{p}}},\dot{\widetilde{\rho}},\dot{\widetilde{\chi}},\dot{S})\in T(\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1}))$ is given by

[TABLE]

where

[TABLE]

and $\fint$ is defined by $\fint g(\theta)\,d\theta=(2\pi)^{-1}\int_{0}^{2\pi}g(\theta)\,d\theta$ .

In terms of $\widetilde{\mathsf{L}}$ and $\ell\mathcal{C}_{0}$ , the proper reformulation of Proposition 2 is the following.

Theorem 3.

Let $\gamma:[t_{1},t_{2}]\rightarrow\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})$ be a smooth curve with components $\gamma=(\widetilde{\bm{h}},\widetilde{\bm{p}},\widetilde{\rho},\widetilde{\chi},S)$ . The curve $\gamma$ is a (fixed-endpoint) critical point of the functional

[TABLE]

if and only if the component functions $(\widetilde{\bm{h}},\widetilde{\bm{p}},\widetilde{\rho},\widetilde{\chi},S)$ satisfy Eqs. (97)-(100).

This theorem can be deduced directly from Proposition 2 by unpacking definitions. However, in order to more clearly highlight the mechanisms underlying the variational formulation of the extLBEP equations, we will give a direct proof of Theorem 3 that proceeds without recourse to Proposition 2.

Before proceeding with the proof, we will first establish a generalization of the famous Lin constraint formula (Newcomb, 1962; Bretherton, 1970) from variational hydrodynamics.

Definition 11.

Given any set $T$ and a mapping $\psi:Q\times S^{1}\rightarrow T$ , define the phase shift of $\psi$ by $S$ , $\psi^{S}:Q\times S^{1}\rightarrow T$ , using the formula

[TABLE]

Note that $\psi^{S}$ is not the usual exponentiation operation used in elementary arithmetic.
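
The properties of the phase shift that the subsequent proofs rely on can be summarized as follows. This is a sketch that assumes the sign convention $\psi^{S}(\bm{x},\theta)=\psi(\bm{x},\theta+S(\bm{x}))$ and, for the derivative identities, that $T$ is a manifold and $\psi$ is smooth; with the opposite sign convention the roles of $S$ and $-S$ are exchanged.

```latex
% Assumed convention for the phase shift:
\psi^{S}(\bm{x},\theta) = \psi(\bm{x},\theta + S(\bm{x})).

% Evaluating at \theta = 0 recovers the WKB ansatz:
\psi^{S}(\bm{x},0) = \psi(\bm{x},S(\bm{x})).

% Shifts compose additively, so shifting by -S undoes shifting by S:
(\psi^{S})^{-S} = \psi.

% Chain rule: derivatives of a shifted map are shifts of \nabla^S-derivatives:
\bm{\nabla}(\psi^{S})
  = (\bm{\nabla}\psi + \bm{\nabla}S\,\partial_\theta\psi)^{S}
  = (\bm{\nabla}^{S}\psi)^{S},
\qquad \partial_\theta(\psi^{S}) = (\partial_\theta\psi)^{S}.
```

The last line is why quantities built from $\bm{\nabla}\widetilde{\bm{h}}+\bm{\nabla}S\otimes\partial_{\theta}\widetilde{\bm{h}}$ appear whenever an identity for ordinary diffeomorphisms is applied to a phase-shifted loop of diffeomorphisms.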

Lemma 3 (WKB Lin constraint formula).

Let $(t,\epsilon)\mapsto\widetilde{\bm{h}}_{t,\epsilon}\in\ell\text{Diff}(Q,Q_{0})$ be a smooth 2-parameter family of maps $S^{1}\rightarrow\text{Diff}(Q,Q_{0})$ . Let $(t,\epsilon)\mapsto S_{t,\epsilon}\in C^{\infty}(Q,S^{1})$ be a smooth 2-parameter family of maps $Q\rightarrow S^{1}$ . Then the pair of parameter velocities,

[TABLE]

satisfies the identity

[TABLE]

where $[\widetilde{\bm{v}}_{t,\epsilon},\widetilde{\bm{\xi}}_{t,\epsilon}]=\widetilde{\bm{v}}_{t,\epsilon}\cdot\bm{\nabla}\widetilde{\bm{\xi}}_{t,\epsilon}-\widetilde{\bm{\xi}}_{t,\epsilon}\cdot\bm{\nabla}\widetilde{\bm{v}}_{t,\epsilon}$ denotes the $\theta$ -wise vector field commutator.

Proof.

Let $\bm{F}_{\lambda_{1},\lambda_{2}}\in\text{Diff}(Q,Q_{0})$ be any smooth two-parameter family of diffeomorphisms. The ordinary Lin constraint formula says that the parameter velocities $\bm{w}_{k}=-(\partial_{\lambda_{k}}\bm{F})\cdot(\bm{\nabla}\bm{F})^{-1}$ satisfy

[TABLE]

In the formula (108) set $\lambda_{1}=t$ , $\lambda_{2}=\epsilon$ , and $\bm{F}_{t,\epsilon}=\widetilde{\bm{h}}_{t,\epsilon}^{S_{t,\epsilon}}$ . (Regard $\theta$ as a third parameter that comes along for the ride.) We then have

[TABLE]

with

[TABLE]

Phase shifting the formula (109) by $-S$ and applying the chain rule then leads to Eq. (107).

∎
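
The ordinary Lin constraint formula invoked in the proof can be verified by hand in one space dimension. The computation below is a sketch for a scalar family $h(x,t,\epsilon)$ with $h_{x}\neq 0$, using the parameter velocities $v=-h_{t}/h_{x}$ and $\xi=-h_{\epsilon}/h_{x}$; that the resulting sign and arrangement coincide with those of Eq. (108) is an assumption.

```latex
% One space dimension; subscripts denote partial derivatives.
v = -\frac{h_t}{h_x}, \qquad \xi = -\frac{h_\epsilon}{h_x}.

% Cross-differentiate; the terms containing h_{t\epsilon} cancel:
\partial_\epsilon v - \partial_t \xi
  = \frac{h_t\,h_{x\epsilon} - h_\epsilon\,h_{xt}}{h_x^{2}}.

% Commutator [v,\xi] = v\,\xi_x - \xi\,v_x; the terms containing h_{xx} cancel:
v\,\xi_x - \xi\,v_x
  = \frac{h_t\,h_{x\epsilon} - h_\epsilon\,h_{xt}}{h_x^{2}}.

% Hence, in this convention,
\partial_\epsilon v = \partial_t \xi + [v,\xi].
```

Reading $\partial_{\epsilon}$ as the variation $\delta$ gives the familiar statement that the induced variation of the velocity is $\delta v=\partial_{t}\xi+[v,\xi]$.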

Remark 5.

In the above proof, if we had instead set $\lambda_{1}=t,\lambda_{2}=\theta$ and applied the usual Lin constraint formula, the resulting identity would have been

[TABLE]

where $\widetilde{\bm{\zeta}}=-\partial_{\theta}\widetilde{\bm{h}}\cdot(\bm{\nabla}^{S}\widetilde{\bm{h}})^{-1}$ is the $\theta$ -parameter velocity. This identity will be used in the proof of Theorem 3.

Proof of Theorem 3.

According to the WKB Lin constraint formula (107), the first variation of the velocity $\widetilde{\bm{v}}$ is given by

[TABLE]

where the loop of vector fields $\widetilde{\bm{\xi}}=-(\delta\widetilde{\bm{h}}+\delta S\,\partial_{\theta}\widetilde{\bm{h}})\cdot(\bm{\nabla}\widetilde{\bm{h}}+\bm{\nabla}S\otimes\partial_{\theta}\widetilde{\bm{h}})^{-1}$ . The first (fixed-endpoint) variation of the action $\widetilde{\mathsf{A}}$ is therefore given by

[TABLE]

where we have temporarily introduced the shorthand notation $\widetilde{\bm{P}}=\widetilde{\bm{p}}+\widetilde{\rho}\,\bm{\nabla}^{S}\widetilde{\chi}$ . Alternatively, we may isolate all of the variations of $S$ by writing $\widetilde{\bm{\xi}}=\widetilde{\bm{\xi}}_{0}+\delta S\,\widetilde{\bm{\zeta}}$ with $\widetilde{\bm{\zeta}}=-(\partial_{\theta}\widetilde{\bm{h}})\cdot(\bm{\nabla}\widetilde{\bm{h}}+\bm{\nabla}S\otimes\partial_{\theta}\widetilde{\bm{h}})^{-1}$ and $\widetilde{\bm{\xi}}_{0}=-\delta\widetilde{\bm{h}}\cdot(\bm{\nabla}^{S}\widetilde{\bm{h}})^{-1}$ , thereby obtaining

[TABLE]

where the specific wave action density $\widetilde{\mathcal{I}}$ is given by

[TABLE]

Because $\delta\widetilde{\chi},\delta\widetilde{\rho},\delta\widetilde{\bm{p}},\delta S,$ and $\widetilde{\bm{\xi}}_{0}$ are arbitrary, $\delta\widetilde{\mathsf{A}}=0$ if and only if

[TABLE]

Notice that in moving from the first variation formula (115) to Eqs. (117)-(121), we have used the relation $\widetilde{\bm{v}}=\partial\mathcal{H}_{\text{EP}}/\partial\widetilde{\bm{p}}$ in order to eliminate $\widetilde{\bm{v}}$ in favor of $\partial\mathcal{H}_{\text{EP}}/\partial\widetilde{\bm{p}}$. In order to finish the proof, we will now show that Eqs. (117)-(121) are equivalent to Eqs. (97)-(100) by (a) demonstrating that Eq. (118) is equivalent to Eq. (98), and (b) proving that the wave action conservation law (121) is implied by Eqs. (97)-(100).

(a): Notice that Eq. (118) may be written as $\mathsf{M}\widetilde{\bm{P}}=0$ , where $\mathsf{M}$ is the linear differential operator whose action on any time-dependent loop of vector fields $\widetilde{\bm{w}}$ is given by

[TABLE]

Because $\widetilde{\bm{P}}=\widetilde{\bm{p}}+\widetilde{\rho}\bm{\nabla}^{S}\widetilde{\chi}$ , Eq. (118) is also equivalent to $\mathsf{M}\widetilde{\bm{p}}=-\mathsf{M}(\widetilde{\rho}\bm{\nabla}^{S}\widetilde{\chi})$ . By a direct calculation involving the continuity equation (99), we have

[TABLE]

where we have used the Euler-Lagrange equation (120) in the second line. Moreover, because

[TABLE]

when the derivatives of $\mathcal{H}_{\text{EP}}$ are evaluated at $(\widetilde{\bm{p}},\widetilde{\rho},\bm{\nabla}^{S}\widetilde{\rho})$ , the sum $\mathsf{M}(\widetilde{\rho}\bm{\nabla}^{S}\widetilde{\chi})+\bm{\nabla}^{S}\mathcal{H}_{\text{EP}}$ is given by

[TABLE]

Using Eq. (125) to evaluate the right-hand side of $\mathsf{M}\widetilde{\bm{p}}=-\mathsf{M}(\widetilde{\rho}\bm{\nabla}^{S}\widetilde{\chi})$ leads directly to Eq. (98).

(b): A simple way to see that Eqs. (97)-(100) imply the wave action conservation law is to analyze the quantity

[TABLE]

Using Eqs. (98), (99), (100), and the identity (112) introduced in Remark 5, the quantity $\gamma$ may be written

[TABLE]

Upon applying the identity

[TABLE]

we also have

[TABLE]

The $\theta$ -average of $\gamma$ is therefore

[TABLE]

which establishes the wave action conservation law (121). ∎

V Looping the relabeling group

One of the most remarkable features of the variational principle introduced in Theorem 3 is its associated symmetry group. Let $G$ be the group of symmetries of the LBEP phase space action functional (74). (Think of $G$ as the symmetry group for the LBEP equations before applying nonlinear WKB extension.) To $G$ we may associate the loop group $\ell G$ (Pressley and Segal, 1988), which comprises mappings $S^{1}\rightarrow G$ . In this section, we will show that $\ell G$ is a group of symmetries for the action functional (103) from Theorem 3. We say the symmetry group $G$ becomes looped in passing from the LBEP equations to their nonlinear WKB extension. In particular, the subgroup of $G$ given by particle relabeling transformations becomes looped when passing from the LBEP equations to their nonlinear WKB extension. Using Noether’s theorem, we will deduce the conserved quantity associated with loops of relabeling transformations, and thereby infer the analogue of Kelvin’s circulation theorem for Eulerian nonlinear WKB. Notably, this circulation theorem represents a kind of extension of the circulation theorem discussed in Ref. Gjaja and Holm, 1996; the latter may be seen as a consequence of symmetry under the group of mean (i.e. $\theta$ -independent) relabeling transformations, while the circulation theorem discussed in this section is a consequence of symmetry under the larger group of loops of relabeling transformations. (We will discuss the relationship between these two notions of circulation in greater detail in the next section, where we will apply our theoretical results to a concrete example of wave-mean-flow interaction.) Finally, we will use the loops of relabeling transformations to give a group-theoretic explanation for the one-way coupling between $(\widetilde{\bm{p}},\widetilde{\rho})$ and $(\widetilde{\bm{h}},\widetilde{\chi})$ in the extLBEP equations. In so doing, we will have demonstrated that the nonlinear WKB extension of an Euler-Poincaré fluid theory fits into a general pattern that was emphasized by Marsden and Weinstein (Ref. Marsden and Weinstein, 1982): many dissipation-free models from continuum mechanics arise as quotients of variational models by an appropriate symmetry group.

We begin by recalling the definition of a loop group. Let $G$ be a group with elements $g\in G$ and product $g_{1}g_{2}\in G$ . The loop group $\ell G$ associated with $G$ is the set of all mappings $S^{1}\rightarrow G$ . When $G$ carries a manifold structure, we also require the mappings to be smooth. If $\widetilde{g}$ denotes a typical element of $\ell G$ , the group multiplication $\widetilde{g}_{1}*\widetilde{g}_{2}$ in $\ell G$ is given by $(\widetilde{g}_{1}*\widetilde{g}_{2})(\theta)=\widetilde{g}_{1}(\theta)\widetilde{g}_{2}(\theta)$ . Thus, the product on $\ell G$ is given by “parallelizing” the product on $G$ over the loop parameter $\theta\in S^{1}$ . Accordingly, the identity and inverse map for $\ell G$ are given by $e_{\ell G}(\theta)=e_{G}$ and $(\widetilde{g}^{-1})(\theta)=(\widetilde{g}(\theta))^{-1}$ . Note that we use the same symbol for the inverse operations in $G$ and $\ell G$ .
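As a concrete illustration of the pointwise ("parallelized") product, the following minimal Python sketch takes $G$ to be the group of invertible real $2\times 2$ matrices and represents an element of $\ell G$ as a $2\pi$-periodic function of $\theta$. The choice of $G$, the two sample loops, and all function names are assumptions made purely for illustration; they are not part of the constructions used in this work.

```python
import math

def mat_mul(a, b):
    """Product of two 2x2 matrices stored as nested tuples."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def mat_inv(a):
    """Inverse of an invertible 2x2 matrix."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return ((a[1][1] / det, -a[0][1] / det),
            (-a[1][0] / det, a[0][0] / det))

IDENTITY = ((1.0, 0.0), (0.0, 1.0))

def loop_product(g1, g2):
    """Pointwise product: (g1 * g2)(theta) = g1(theta) g2(theta)."""
    return lambda theta: mat_mul(g1(theta), g2(theta))

def loop_inverse(g):
    """Pointwise inverse: (g^{-1})(theta) = (g(theta))^{-1}."""
    return lambda theta: mat_inv(g(theta))

def loop_identity(theta):
    """Constant loop at the identity of G."""
    return IDENTITY

# Two hypothetical smooth loops in G (both have unit determinant).
def rotation_loop(theta):
    c, s = math.cos(theta), math.sin(theta)
    return ((c, -s), (s, c))

def shear_loop(theta):
    return ((1.0, math.sin(theta)), (0.0, 1.0))

def max_abs_diff(a, b):
    """Largest entrywise difference between two 2x2 matrices."""
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))
```

Evaluating `loop_product(rotation_loop, loop_inverse(rotation_loop))` at any $\theta$ returns the identity matrix up to round-off, while the two orderings of `rotation_loop` and `shear_loop` generally differ, reflecting that $\ell G$ inherits non-commutativity from $G$.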

Next we turn to establishing the main result of this section.

Theorem 4.

Let $G$ be a group. Suppose $\Phi:\mathcal{C}_{0}\times G\rightarrow\mathcal{C}_{0}$ is a right $G$ -action on the space of frozen field configurations that leaves the action functional (74) invariant. Let $\widetilde{\Phi}_{0}$ be the right $\ell G$ -action on $\ell\mathcal{C}_{0}$ given by “parallelizing” the action $\Phi$ , i.e.

[TABLE]

Then there is a right $\ell G$ -action $\widetilde{\Phi}$ on $\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})$ given by

[TABLE]

that leaves the action functional (103) invariant. (Recall that the notation $\cdot^{S}$ was defined in Eq. (104).)

Proof.

Given a smooth curve $\hat{\gamma}:[t_{1},t_{2}]\rightarrow\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})$ , introduce the component curves $\hat{\gamma}_{1},\hat{\gamma}_{2}$ satisfying $\hat{\gamma}(t)=(\hat{\gamma}_{1}(t),\hat{\gamma}_{2}(t))\in\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})$ for all $t\in[t_{1},t_{2}]$ . Consider the action functional $\widetilde{\mathsf{A}}_{0}$ defined on the space of such curves by the formula

[TABLE]

We recall that $\mathsf{L}$ is the parameter-independent phase space Lagrangian for the LBEP equations introduced in Eq. (73). The intuition behind Eq. (133) is as follows. For each $\theta\in S^{1}$ , we may evaluate the action $\mathsf{A}$ in Eq. (74) on the curve $t\mapsto\hat{\gamma}_{1}(\theta,t)\in\mathcal{C}_{0}$ , thereby obtaining the real number $\mathsf{A}(\theta)$ . The value of $\widetilde{\mathsf{A}}_{0}(\hat{\gamma})$ is then given by averaging $\mathsf{A}(\theta)$ over $S^{1}$ . Because $\mathsf{A}(\theta)$ is $G$ -invariant for each $\theta\in S^{1}$ , it follows that the “parallelized” $G$ -action $\widetilde{\Phi}_{0}$ leaves the action $\widetilde{\mathsf{A}}_{0}$ invariant.

As is generally true in Lagrangian mechanics, equivalent formulations of the variational problem $\delta\widetilde{\mathsf{A}}_{0}=0$ may be obtained by applying invertible transformations to the “generalized coordinates,” which in this case may be identified with the space $\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})$ . In particular, we may apply the mapping $T:(\hat{\widetilde{\bm{h}}},\hat{\widetilde{\bm{p}}},\hat{\widetilde{\rho}},\hat{\widetilde{\chi}},S)\mapsto(\widetilde{\bm{h}},\widetilde{\bm{p}},\widetilde{\rho},\widetilde{\chi},S)$ , where

[TABLE]

After applying the transformation $T$ , the action functional $\widetilde{\mathsf{A}}_{0}$ is transformed into the action functional $\widetilde{\mathsf{A}}_{0}^{*}$ , whose value at $\gamma=(\gamma_{1},\gamma_{2}):[t_{1},t_{2}]\rightarrow\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})$ is given by

[TABLE]

Because $\widetilde{\mathsf{A}}_{0}(\hat{\gamma})$ is by hypothesis invariant under the transformation

[TABLE]

for each $\widetilde{g}\in\ell G$ , the quantity $\widetilde{\mathsf{A}}_{0}^{*}({\gamma})$ must be invariant under the transformation given by

[TABLE]

for each $\widetilde{g}\in\ell G$ . Note that we have recognized the definition (132) of $\widetilde{\Phi}$ in the last line of Eq. (137). We have therefore shown that the $\ell G$ -action $\widetilde{\Phi}$ leaves the action functional $\widetilde{\mathsf{A}}_{0}^{*}$ invariant.

In order to complete the proof, we will now show by direct calculation that $\widetilde{\mathsf{A}}_{0}^{*}$ is in fact equal to the action defined in Eq. (103). Write ${\gamma}_{1}=({\widetilde{\bm{h}}},{\widetilde{\bm{p}}},{\widetilde{\rho}},{\widetilde{\chi}})$ and ${\gamma}_{2}=S$ . According to Eqs. (135) and (73), the value of $\widetilde{\mathsf{A}}_{0}^{*}(\gamma)$ is given by

[TABLE]

Using the derivative identities

[TABLE]

along with similar identities for $\widetilde{\chi}$ and $\widetilde{\rho}$ , we may also write

[TABLE]

where we have defined $\widetilde{\bm{v}}=-(\partial^{S}_{t}\widetilde{\bm{h}})\cdot(\bm{\nabla}^{S}\widetilde{\bm{h}})^{-1}$ as was done earlier in Eq. (102). Now apply the integral identity $\fint\int_{Q}f^{S}(\bm{x},\theta)\,d^{3}\bm{x}\,d\theta=\fint\int_{Q}f(\bm{x},\theta)\,d^{3}\bm{x}\,d\theta$ , which is valid for all integrable $f:Q\times S^{1}\rightarrow\mathbb{R}$ , to obtain

[TABLE]

By Eq. (101), Eq. (141) is just the formula (103) defining the action functional $\widetilde{\mathsf{A}}$ . ∎
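The integral identity invoked in the final step of the proof can be checked numerically. The sketch below assumes that the superscript $S$ denotes the phase shift $f^{S}(\bm{x},\theta)=f(\bm{x},\theta+S(\bm{x}))$ (the sign convention of Eq. (104) is assumed here; the identity holds for either sign), replaces $Q$ by the unit interval, and uses a hypothetical integrand and phase function. All names are illustrative.

```python
import math

def f(x, theta):
    """Hypothetical integrand, 2*pi-periodic in theta."""
    return (1.0 + x * x) * (2.0 + math.cos(theta) + 0.5 * math.sin(2.0 * theta))

def S(x):
    """Hypothetical phase function on the unit interval."""
    return 3.0 * x + math.sin(x)

def f_shifted(x, theta):
    """Assumed convention: f^S(x, theta) = f(x, theta + S(x))."""
    return f(x, theta + S(x))

def loop_average_integral(func, nx=200, ntheta=64):
    """Midpoint rule in x on [0, 1], uniform average over theta in [0, 2*pi)."""
    total = 0.0
    for i in range(nx):
        x = (i + 0.5) / nx
        for j in range(ntheta):
            theta = 2.0 * math.pi * j / ntheta
            total += func(x, theta)
    return total / (nx * ntheta)

lhs = loop_average_integral(f_shifted)  # average of the shifted integrand
rhs = loop_average_integral(f)          # average of the unshifted integrand
```

The two values agree to round-off because, at each fixed $\bm{x}$, shifting $\theta$ by $S(\bm{x})$ merely relabels points on the circle before averaging.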

Theorem 4 says that the symmetry group of the phase-space action for the LBEP equations becomes looped when applying the nonlinear WKB extension procedure. As a consequence, we expect momentum maps for the LBEP equations to be looped by nonlinear WKB extension as well. The next proposition shows that this is true.

Proposition 3.

Endow $\mathcal{C}_{0}$ with the symplectic form $-\mathbf{d}\Theta$ , where the $1$ -form $\Theta$ is given by

[TABLE]

with $\bm{\xi}=-\delta\bm{h}\cdot(\bm{\nabla}\bm{h})^{-1}$ . Endow $\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})$ with the presymplectic form $-\mathbf{d}\widetilde{\Theta}$ , where

[TABLE]

with $\widetilde{\bm{\xi}}=-(\delta\widetilde{\bm{h}}+\delta S\partial_{\theta}\widetilde{\bm{h}})\cdot(\bm{\nabla}^{S}\widetilde{\bm{h}})^{-1}$ . Let $G$ be a Lie group. Suppose there is a right $G$ -action on $\mathcal{C}_{0}$ that preserves $\Theta$ , and therefore admits an $\text{Ad}^{*}$ -equivariant momentum map $\mu:\mathcal{C}_{0}\rightarrow\mathfrak{g}^{*}$ given by

[TABLE]

for each $X\in\mathfrak{g}$ . (The vector field $X_{\mathcal{C}_{0}}$ is the infinitesimal generator on $\mathcal{C}_{0}$ in the direction $X\in\mathfrak{g}$ .) Then the looped $\ell G$ -action given by Theorem 4 admits an $\text{Ad}^{*}$ -equivariant presymplectic momentum map $\widetilde{\mu}:\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})\rightarrow\ell\mathfrak{g}^{*}$ given by

[TABLE]

Proof.

Given any group action $\Psi:M\times H\rightarrow M$ , where $M$ is a set and $H$ is the group, it will be convenient to introduce the maps $\Psi_{h}:M\rightarrow M$ for each $h\in H$ , where $\Psi_{h}(m)=\Psi(m,h)$ . Let $\Phi:\mathcal{C}_{0}\times G\rightarrow\mathcal{C}_{0}$ be the right $G$ -action that preserves $\Theta$ , $\widetilde{\Phi}_{0}:\ell\mathcal{C}_{0}\times\ell G\rightarrow\ell\mathcal{C}_{0}$ the parallelization of $\Phi$ defined by Eq. (131), and $\widetilde{\Phi}:\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})\times\ell G\rightarrow\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})$ the $\ell G$ action provided by Theorem 4. By hypothesis, the action $\Phi$ preserves the $1$ -form $\Theta$ in the sense that $\Phi_{g}^{*}\Theta=\Theta$ for each $g\in G$ .

Let $T:\ell\mathcal{C}_{0}\times C^{\infty}(Q,\mathbb{R})\rightarrow\ell\mathcal{C}_{0}\times C^{\infty}(Q,\mathbb{R})$ be the diffeomorphism given by $T:(\hat{\widetilde{\bm{h}}},\hat{\widetilde{\bm{p}}},\hat{\widetilde{\rho}},\hat{\widetilde{\chi}},S)\mapsto(\widetilde{\bm{h}},\widetilde{\bm{p}},\widetilde{\rho},\widetilde{\chi},S)$ , with

[TABLE]

and $\widetilde{\Theta}_{0}$ the $1$ -form on $\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})$ defined by

[TABLE]

The proof of Theorem 4 shows that the pullback of $\widetilde{\Theta}$ along $T$ is given by $\widetilde{\Theta}_{0}$ .

Because $\Phi_{g}^{*}\Theta=\Theta$ for each $g\in G$ , we have $(\widetilde{\Phi}_{0\widetilde{g}}\times I)^{*}\widetilde{\Theta}_{0}=\widetilde{\Theta}_{0}$ for each $\widetilde{g}\in\ell G$ , where $I:C^{\infty}(Q,S^{1})\rightarrow C^{\infty}(Q,S^{1})$ is the identity mapping on the space of phase functions. Therefore the mapping $\widetilde{\mu}_{0}:\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})\rightarrow\ell\mathfrak{g}^{*}$ defined by

[TABLE]

for each $\widetilde{X}\in\ell\mathfrak{g}$ is an $\text{Ad}^{*}$ -equivariant presymplectic momentum map with respect to the presymplectic form $-\mathbf{d}\widetilde{\Theta}_{0}$ . (Note that we have used the same notation for the pairings between $\mathfrak{g},\mathfrak{g}^{*}$ and $\ell\mathfrak{g},\ell\mathfrak{g}^{*}$ .) In other words, we have

[TABLE]

for each $\widetilde{X}\in\ell\mathfrak{g}$ .

The pushforward of Eq. (149) along $T$ is

[TABLE]

But because $\widetilde{\Phi}_{\widetilde{g}}=T\circ(\widetilde{\Phi}_{0\widetilde{g}}\times I)\circ T^{-1}$ , the infinitesimal generator of $\widetilde{X}$ with respect to the group action $\widetilde{\Phi}$ is just $\widetilde{X}_{\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})}=T_{*}(\widetilde{X}_{\ell\mathcal{C}_{0}}\oplus 0)$ . Therefore $\widetilde{\mu}=\widetilde{\mu}_{0}\circ T^{-1}$ is a presymplectic momentum map with respect to $-\mathbf{d}\widetilde{\Theta}$ . In order to show that this map coincides with the $\widetilde{\mu}$ given in the statement of the proposition, it is enough to note that Eq. (148) implies

[TABLE]

∎

Theorem 4 and Proposition 3 apply to any (Lie) subgroup of the symmetry group for the action functional (74) whatsoever. They apply in particular to the group of isometries of the fluid container $Q$ , which corresponds to momentum conservation. From the perspective of dissipation-free fluid models, however, a more interesting subgroup is the group of particle relabeling transformations of the reference fluid container $Q_{0}$ . Before applying nonlinear WKB extension, this group of symmetries is responsible for the well-known Kelvin circulation theorem. Let us now use Proposition 3 to describe what happens to Kelvin’s circulation theorem after applying the nonlinear WKB extension procedure.

Proposition 4.

The mapping $\widetilde{\mu}:\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})\rightarrow(\ell\mathfrak{X}(Q_{0})\times C^{\infty}(Q_{0},\mathbb{R}))^{*}$ given by

[TABLE]

is a $(\ell\mathfrak{X}(Q_{0})\times C^{\infty}(Q_{0},\mathbb{R}))^{*}$ -valued first integral of the extLBEP equations.

Corollary 1.

(Kelvin’s theorem for Eulerian WKB) Given a family of closed curves $C_{0}(\theta)\subset Q_{0}$ parameterized by $\theta\in S^{1}$ and a solution $(\widetilde{\bm{h}},\widetilde{\bm{p}},\widetilde{\rho},\widetilde{\chi})$ of the extLBEP equations, the integral

[TABLE]

is constant in time for each $\theta\in S^{1}$ , where $C(\theta)=[\widetilde{\bm{h}}^{S}(\theta)]^{-1}(C_{0}(\theta))$ .

Remark 6.

See the remark after Lemma 4 for the argument that proves this Corollary.

Before proving Proposition 4, we will first review the corresponding result for the (pre-WKB extension) LBEP equations that was proved, for instance, in Ref. Cotter and Holm, 2012. For this, we introduce the following group, which contains the particle relabeling group $\text{Diff}(Q_{0})$ as a subgroup.

Definition 12.

The infinite-dimensional group $\mathcal{G}=\text{Diff}(Q_{0})\ltimes C^{\infty}(Q_{0})$ consists of pairs $(\bm{\eta},\tau)\in\text{Diff}(Q_{0})\times C^{\infty}(Q_{0})$ with the group product given by

[TABLE]

Lemma 4.

There is a right $\mathcal{G}=\text{Diff}(Q_{0})\ltimes C^{\infty}(Q_{0})$ -action on $\mathcal{C}_{0}$ that leaves the $1$ -form $\Theta$ in Eq. (142) invariant. The associated momentum map is given by

[TABLE]

Remark 7.

This result was established using Noether’s theorem in Ref. Cotter and Holm, 2012. Because the LBEP Hamiltonian $\int_{Q}\mathcal{H}(\bm{p},\rho,\bm{\nabla}\rho)\,d^{3}\bm{x}$ is $\mathcal{G}$ -invariant, standard arguments imply that $\mu$ is constant in time along solutions of Eqs. (75)-(78). In particular, because $\bm{h}_{*}\left[\rho\,d^{3}\bm{x}\right]$ is constant in time, and $\rho$ is non-vanishing, the $1$ -form

[TABLE]

is constant in time. Therefore the integral

[TABLE]

is constant in time for any closed curve $C_{0}\subset Q_{0}$ . This is the usual statement of Kelvin’s circulation theorem.

We may now prove Proposition 4 by directly applying Proposition 3 with $G=\mathcal{G}$ .

Proof of Proposition 4.

By Lemma 4, there is a right $\mathcal{G}$ -action on $\mathcal{C}_{0}$ that preserves $\Theta$ and admits an $\text{Ad}^{*}$ -equivariant momentum map. Proposition 3 therefore implies that

[TABLE]

defines a presymplectic $\text{Ad}^{*}$ -equivariant momentum map on $(\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1}),-\mathbf{d}\widetilde{\Theta})$ . Moreover, because the Hamiltonian functional $\fint\int_{Q}\mathcal{H}(\widetilde{\bm{p}},\widetilde{\rho},\bm{\nabla}^{S}\widetilde{\rho})\,d^{3}\bm{x}\,d\theta$ is $\ell\mathcal{G}$ -invariant, it follows that $\widetilde{\mu}$ is constant in time along solutions of Eqs. (97)-(100). ∎

We will now conclude this Section by giving a group-theoretic explanation for the one-way coupling between $\widetilde{\bm{p}},\widetilde{\rho},S$ and $\widetilde{\bm{h}},\widetilde{\chi}$ in the Euler-Lagrange equations associated with the action functional (103). Because $\ell\mathcal{G}$ leaves the action functional (103) invariant, solutions of the corresponding Euler-Lagrange equations are mapped into other solutions by $\ell\mathcal{G}$ . The following proposition shows that the quotient of the space of solutions of the Euler-Lagrange equations by $\ell\mathcal{G}$ may be identified with the space of solutions of the extLBEP equations. This “explains” the one-way coupling as a consequence of $\ell\mathcal{G}$ -invariance.

Proposition 5.

Let $\widetilde{\mathcal{C}}_{\widetilde{\mathsf{A}}}$ denote the space of solutions of the Euler-Lagrange equations associated with the action functional (103). Let $\widetilde{\mathcal{C}}_{\text{extLBEP}}$ denote the space of solutions of the extLBEP equations. There is a canonical bijection

[TABLE]

Proof.

According to Theorem 4 (see Eq. (132)), the right action of $\ell\mathcal{G}$ on $\ell\widetilde{\mathcal{C}}_{0}\times C^{\infty}(Q,S^{1})$ that leaves the action $\widetilde{\mathsf{A}}$ invariant is given by

[TABLE]

Apparently the quotient of $\ell\widetilde{\mathcal{C}}_{0}\times C^{\infty}(Q,S^{1})$ by $\ell\mathcal{G}$ may be identified with triples $(\widetilde{\bm{p}},\widetilde{\rho},S)\in\ell\mathfrak{X}(Q)\times\ell C_{+}^{\infty}(Q)\times C^{\infty}(Q,S^{1})$ using the quotient map $\pi:\ell\widetilde{\mathcal{C}}_{0}\times C^{\infty}(Q,S^{1})\rightarrow\ell\mathfrak{X}(Q)\times\ell C_{+}^{\infty}(Q)\times C^{\infty}(Q,S^{1}):(\widetilde{\bm{h}},\widetilde{\bm{p}},\widetilde{\rho},\widetilde{\chi},S)\mapsto(\widetilde{\bm{p}},\widetilde{\rho},S)$ .

If $t\mapsto\Gamma(t)=(\widetilde{\bm{h}}(t),\widetilde{\bm{p}}(t),\widetilde{\rho}(t),\widetilde{\chi}(t),S(t))$ is a solution of the Euler-Lagrange equations associated with $\widetilde{\mathsf{A}}$ , i.e. Eqs. (97)-(100), then $\gamma=\pi\circ\Gamma$ is a solution of the extLBEP equations because the extLBEP equations are a subset of the Euler-Lagrange equations. Thus there is a mapping $\Pi:\widetilde{\mathcal{C}}_{\widetilde{\mathsf{A}}}\rightarrow\widetilde{\mathcal{C}}_{\text{extLBEP}}$ . If we can show that $\Pi$ is in fact a quotient map for the $\ell\mathcal{G}$ -action on $\widetilde{\mathcal{C}}_{\widetilde{\mathsf{A}}}$ , the proof will be complete.

To that end, suppose that $t\mapsto\gamma(t)=(\widetilde{\bm{p}}(t),\widetilde{\rho}(t),S(t))$ is a solution of the extLBEP equations. Given $(\widetilde{\bm{h}}_{0},\widetilde{\chi}_{0})\in\ell\text{Diff}(Q,Q_{0})\times\ell C^{\infty}(Q)$ , the method of characteristics gives a unique curve $t\mapsto(\widetilde{\bm{h}}(t),\widetilde{\chi}(t))$ with $(\widetilde{\bm{h}}(0),\widetilde{\chi}(0))=(\widetilde{\bm{h}}_{0},\widetilde{\chi}_{0})$ satisfying Eqs. (97) and (100) with $S$ and the derivatives of $\mathcal{H}_{\text{EP}}$ evaluated along the solution $\gamma$ . Therefore the mapping $\Pi$ is surjective, and the preimage of $\gamma$ under $\Pi$ may be identified with the space of initial values $(\widetilde{\bm{h}}_{0},\widetilde{\chi}_{0})\in\ell\text{Diff}(Q,Q_{0})\times\ell C^{\infty}(Q)$ . The latter space is an entire $\ell\mathcal{G}$ -orbit in $\widetilde{\mathcal{C}}_{\widetilde{\mathsf{A}}}$ , because if $(\widetilde{\bm{h}}_{0},\widetilde{\chi}_{0})$ and $(\widetilde{\bm{h}}_{0}^{\prime},\widetilde{\chi}_{0}^{\prime})$ are two elements of $\Pi^{-1}(\{\gamma\})$ , then $(\widetilde{\bm{h}}_{0}^{\prime},\widetilde{\chi}_{0}^{\prime})=\widetilde{\Phi}\bigg((\widetilde{\bm{h}}_{0},\widetilde{\chi}_{0}),(\widetilde{\bm{\eta}},\widetilde{\tau})\bigg)$ provided we set

[TABLE]

It follows that $\Pi$ is a quotient map for the $\ell\mathcal{G}$ -action on $\widetilde{\mathcal{C}}_{\widetilde{\mathsf{A}}}$ . ∎

VI Example: Eulerian variational NL-WKB for isothermal fluids

In this section, we present a pedagogical example of how the methods developed so far can be useful for obtaining reduced, asymptotic models describing wave–mean-flow interactions. Specifically, here we study the time-averaged interaction between a small-amplitude, high-frequency acoustic wave and a slowly-varying isothermal perfect fluid. Due to the somewhat involved calculations given in this section, we present our results in three parts. In the first part, we introduce the governing Eulerian equations of motion and perform an intuitive asymptotic expansion up to leading order in some asymptotic parameter. We give an elementary proof that the resulting leading-order equations describing wave–mean-flow interactions are variational. In the second part, we combine the theory developed in this work with results from slow-manifold theory (Fenichel, 1979; Verhulst, 2005) to explain why the wave–mean-flow equations ought to be variational. Finally, in the third part, we present additional details of a systematic derivation of the variational principle describing wave–mean-flow interactions in isothermal fluids.

VI.1 Governing equations for isothermal fluids and intuitive asymptotic expansion

The governing equations for an isothermal fluid are given by

[TABLE]

where $c_{s}\in\mathbb{R}$ is the sound speed. Since we are interested in studying the effects of a high-frequency acoustic wave, let us explicitly introduce a scale separation into the equations. To do this, we use the NL-WKB extension of the equations above. Hence, we write

[TABLE]

In the above, we have explicitly denoted the scale separation by rescaling the phase function $S$ such that $S\mapsto S/\epsilon$ , where $\epsilon\ll 1$ is a small dimensionless parameter that represents the ratio of the wave period (or wavelength) to the characteristic timescale (or length scale) of the mean flow.

Since we are interested in linear, small-amplitude waves, we parameterize the density and momentum-density fields as follows:

[TABLE]

Here $\overline{\rho}$ and $\overline{\bm{p}}$ respectively represent the slowly-varying density and momentum-density fields of the background fluid. Note that $\overline{\rho}$ and $\overline{\bm{p}}$ are independent of $\theta$ and thus are the $\theta$ -averaged fields. In contrast, $\widehat{\rho}$ and $\widehat{\bm{p}}$ are the fluctuating density and momentum-density fields, respectively. (To make the above parameterization unique, we assume that the $\theta$ -averages of $\widehat{\rho}$ and $\widehat{\bm{p}}$ are zero.) Since we consider small-amplitude waves, we scale the amplitude of the fluctuations according to the small parameter $\epsilon\ll 1$ in Eq. (167). With these assumptions, we can then deduce the following.

Proposition 6.

To lowest order in $\epsilon$ , the fields $(\widehat{\bm{p}},S)$ introduced in Eqs. (165) and (166) satisfy

[TABLE]

Solutions $(\widehat{\bm{p}},S)$ corresponding to linear acoustic oscillations are given by

[TABLE]

where $\bm{e}_{\bm{k}}=\bm{\nabla}S/|\bm{\nabla}S|$ .

Proof.

After inserting Eqs. (167) into Eq. (165), one can see that the $\theta$ -dependent part must satisfy Eq. (168) to lowest order in $\epsilon$ . Likewise, inserting Eqs. (167) into Eq. (166) shows that the averaged momentum density $\overline{\bm{p}}$ satisfies

[TABLE]

where the overline denotes an average over the $\theta$ variable; e.g., $\overline{\widehat{\rho}\,\widehat{\bm{p}}}=\fint_{S^{1}}\widehat{\rho}\widehat{\bm{p}}\,\mathrm{d}\theta$ . Subtracting Eq. (172) from Eq. (166) leads to Eq. (169) to lowest order in $\epsilon$ .

Solutions corresponding to acoustic oscillations are obtained by projecting Eq. (169) onto $\bm{\nabla}S$ . Then, we substitute Eq. (168) and obtain

[TABLE]

For acoustic waves, $\widehat{\rho}\neq 0$ . Hence, the term inside the brackets must be zero. Taking the positive root leads to the dispersion relation in Eq. (171). Then, it is straightforward to verify that the expression for $\widehat{\bm{p}}$ given in Eq. (170) satisfies Eqs. (168) and (169). ∎
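The $\theta$-average used in this proof is straightforward to evaluate numerically. The following minimal sketch assumes single-harmonic fluctuations $\widehat{\rho}=a\cos\theta$ and $\widehat{p}=b\cos\theta$ for one momentum component (the amplitudes $a$, $b$ and the function names are hypothetical) and computes both the mean of a fluctuation and the averaged product, whose exact values are $0$ and $ab/2$.

```python
import math

def theta_average(func, n=128):
    """Uniform-grid approximation of the average over theta in [0, 2*pi)."""
    return sum(func(2.0 * math.pi * j / n) for j in range(n)) / n

a, b = 0.3, -1.2  # hypothetical fluctuation amplitudes

def rho_hat(theta):
    """Assumed density fluctuation with zero theta-average."""
    return a * math.cos(theta)

def p_hat(theta):
    """Assumed fluctuation of one momentum-density component."""
    return b * math.cos(theta)

# The fluctuation itself averages to zero.
mean_rho_hat = theta_average(rho_hat)
# The product of two fluctuations does not: its exact average is a*b/2.
mean_product = theta_average(lambda t: rho_hat(t) * p_hat(t))
```

Quadratic averages of this kind do not vanish even though each fluctuation has zero mean, which is why products of fluctuating fields can feed back on the $\theta$-averaged equations.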

Remark 8.

It is to be noted that the lowest-order (in $\epsilon$ ) equations for the density and momentum-density fluctuations [Eqs. (168) and (169)] lead to a time-evolution equation for the phase $S$ (which was not present in the NL WKB extension of the original fluid equations) and a constraint equation for fluctuations in the momentum density so that $\widehat{\bm{p}}=\widehat{\bm{p}}^{\star}(\overline{\rho},\overline{\bm{p}},\widehat{\rho},\bm{\nabla}S)$ . In order to fully describe the temporal wave dynamics, we must also deduce a time-evolution equation for $\widehat{\rho}$ . We shall come back to this point later.

As is well known from hydrodynamic theory, high-frequency waves can exert a ponderomotive (or time-averaged) force on a slowly-varying bulk fluid. This effect typically appears in the form of a Reynolds-stress term in the momentum equation. In the following, we shall deduce the time-evolution equation for $\overline{\rho}$ and $\overline{\bm{p}}$ while taking into account the lowest-order corrections due to wave–mean-flow interactions.

Proposition 7.

The governing equations for $\overline{\rho}$ and $\overline{\bm{p}}$ with leading-order effects due to wave interactions are

[TABLE]

where $\mathcal{I}\colon Q\to\mathbb{R}$ is the wave action density

[TABLE]

Proof.

Since Eq. (165) is linear in $\widetilde{\rho}$ and $\widetilde{\bm{p}}$ , $\theta$ -averaging Eq. (165) immediately leads to Eq. (174). To obtain Eq. (175), one first inserts Eq. (167) into Eq. (172). After Taylor expanding up to $\mathcal{O}(\epsilon^{2})$ , one obtains

[TABLE]

where $T_{\rm Reynolds}$ is the Reynolds stress tensor

[TABLE]

Finally, substituting the expression for $\widehat{\bm{p}}$ in Eq. (170) into $T_{\rm Reynolds}$ leads to Eq. (175). ∎

With the above equations, we can now deduce a dynamical equation for the wave action density $\mathcal{I}$ . This is given in the next proposition.

Proposition 8.

The governing equation to leading order for the wave action density $\mathcal{I}$ is

[TABLE]

where $\bm{v}_{\rm g}\doteq\overline{\bm{p}}/\overline{\rho}+c_{s}\bm{e}_{\bm{k}}$ is the wave group velocity.

Remark 9.

One can in principle prove this result by calculating the time derivative of $\mathcal{I}$ and then substituting the governing equations for the mean and fluctuating quantities [Eqs. (168), (169), (174), and (175)], as well as the dispersion relation (171). The ensuing calculation is tedious since one must take into account corrections to the leading-order solution for the momentum density [Eq. (170)]. Alternatively, the proof may be constructed as a straightforward corollary of results presented in Sec. IV and later in the present Section. Because this alternative approach is simpler, we will postpone the proof until we have proved Proposition 9.
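The group velocity quoted in Proposition 8 can be checked against a dispersion relation by finite differences. The sketch below assumes the Doppler-shifted acoustic form $\omega(\bm{k})=\bm{k}\cdot\bm{u}+c_{s}|\bm{k}|$ with $\bm{u}=\overline{\bm{p}}/\overline{\rho}$ and $\bm{k}=\bm{\nabla}S$. This form is an assumption chosen to be consistent with the stated $\bm{v}_{\rm g}$, not a transcription of Eq. (171), and the numerical values and function names are hypothetical.

```python
import math

def omega(k, u, c_s):
    """Assumed dispersion relation: omega = k.u + c_s |k|."""
    k_norm = math.sqrt(sum(ki * ki for ki in k))
    return sum(ki * ui for ki, ui in zip(k, u)) + c_s * k_norm

def group_velocity_fd(k, u, c_s, h=1e-6):
    """Central-difference approximation of d(omega)/dk."""
    v = []
    for i in range(3):
        k_plus, k_minus = list(k), list(k)
        k_plus[i] += h
        k_minus[i] -= h
        v.append((omega(k_plus, u, c_s) - omega(k_minus, u, c_s)) / (2.0 * h))
    return v

def group_velocity_formula(k, u, c_s):
    """Closed form v_g = u + c_s e_k with e_k = k / |k|."""
    k_norm = math.sqrt(sum(ki * ki for ki in k))
    return [ui + c_s * ki / k_norm for ki, ui in zip(k, u)]

k = (1.0, -2.0, 0.5)   # hypothetical wavevector (grad S)
u = (0.1, 0.2, -0.3)   # hypothetical mean velocity p_bar / rho_bar
c_s = 1.5              # hypothetical sound speed

v_fd = group_velocity_fd(k, u, c_s)
v_exact = group_velocity_formula(k, u, c_s)
```

Under this assumed dispersion relation, the finite-difference gradient `v_fd` matches `v_exact` to within the truncation error of the central difference.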

We may now summarize the results obtained so far as the following set of equations governing the leading-order wave-mean-flow interaction between a sound wave and a bulk isothermal flow.

Definition 13 (Wave–mean-flow equations).

The governing equations describing a high-frequency, small-amplitude acoustic wave interacting with a slowly-varying, isothermal bulk fluid are

[TABLE]

where $\mathcal{I}$ is the wave action density (176) and $\bm{v}_{\rm g}\doteq\overline{\bm{p}}/\overline{\rho}+c_{s}\bm{e}_{\bm{k}}$ is the wave group velocity.

As written, Eqs. (180)–(183) are closed in the sense that they possess a (formally) well-posed initial value problem. Perhaps surprisingly, these equations also follow from a variational principle! This is shown in the theorem below.

Theorem 5 (Effective action for wave–mean-flow interactions).

Let $\overline{\mathcal{C}}_{0}=\mathcal{C}_{0}\times C^{\infty}_{+}(Q)\times C^{\infty}(Q,S^{1})$ . (The space $\mathcal{C}_{0}$ is introduced in Definition 7.) That is, $\overline{\mathcal{C}}_{0}$ comprises maps $Q\ni\bm{x}\mapsto(\overline{\bm{h}}(\bm{x}),\overline{\bm{p}}(\bm{x}),\overline{\rho}(\bm{x}),\overline{\chi}(\bm{x}),\mathcal{I}(\bm{x}),S(\bm{x}))\in Q_{0}\times\mathbb{R}^{3}\times\mathbb{R}\times\mathbb{R}\times\mathbb{R}\times S^{1}$ , where $\overline{\bm{h}}$ is a diffeomorphism and $\overline{\rho}(\bm{x})>0$ , $\mathcal{I}(\bm{x})>0$ for all $\bm{x}\in Q$ . Consider the action $\overline{\mathsf{A}}_{\rm T}$ defined on the space of paths $[t_{1},t_{2}]\rightarrow\overline{\mathcal{C}}_{0}$ by the formula

[TABLE]

where the Lagrangian $\overline{\mathsf{L}}_{\rm T}$ is given by

[TABLE]

Here the mean velocity $\overline{\bm{v}}$ is defined as $\overline{\bm{v}}\doteq-\partial_{t}\overline{\bm{h}}\cdot(\bm{\nabla}\overline{\bm{h}})^{-1}$ , and the Hamiltonian for the isothermal fluid is given by

[TABLE]

The parameters $c_{s}$ and $\rho_{0}$ are the sound speed and reference mass density, respectively. Equations (180)–(183) are embedded in the Euler–Lagrange equations obtained when varying the action $\overline{\mathsf{A}}_{\rm T}$ with respect to $\overline{\bm{h}}$ , $\overline{\bm{p}}$ , $\overline{\rho}$ , $\overline{\chi}$ , $\mathcal{I}$ , and $S$ .

Proof.

Since $\overline{\mathsf{A}}_{\rm T}$ is a functional of paths $[t_{1},t_{2}]\rightarrow\overline{\mathcal{C}}_{0}$ , we can vary the fields $(\overline{\bm{h}},\overline{\bm{p}},\overline{\rho},\overline{\chi},\mathcal{I},S)$ independently. Moreover, varying the action with respect to the fields $(\overline{\bm{h}},\overline{\bm{p}},\overline{\rho},\overline{\chi})\in\mathcal{C}_{0}$ follows a procedure almost identical to that given in the proof of Proposition 1. Varying the action with respect to $\overline{\bm{p}}$ leads to $\overline{\bm{v}}=\overline{\bm{p}}/\overline{\rho}$ . Varying the action with respect to the scalar field $\overline{\chi}$ gives

[TABLE]

Substituting $\overline{\bm{v}}=\overline{\bm{p}}/\overline{\rho}$ into the equation above trivially leads to Eq. (180). Varying the action with respect to $\overline{\rho}$ gives

[TABLE]

Varying $\overline{\bm{h}}$ leads to

[TABLE]

By substituting Eqs. (187) and (188) into Eq. (189) and following a similar algebraic manipulation as in the proof of Proposition 1, one can recover Eq. (181) from Eq. (189). Finally, varying the action $\overline{\mathsf{A}}_{\rm T}$ with respect to $\mathcal{I}$ and $S$ leads to Eqs. (182) and (183), respectively. Thus, we have shown that Eqs. (180)–(183) are embedded in the Euler–Lagrange equations associated to the action $\overline{\mathsf{A}}_{\rm T}$ . ∎

VI.2 Why are the wave–mean-flow equations variational?

The proof of Theorem 5 gave no indication as to why the leading-order wave-mean-flow equations arise from a variational principle. We now want to give a principled explanation for this result using the machinery developed in this paper. One indication that the wave–mean-flow equations might be variational is that the parent isothermal fluid equations (163) and (164), where we started our asymptotic analysis, also come from a variational principle. This can be easily proven because the isothermal fluid equations comprise a special case of the LBEP equations discussed in Sec. III. Hence, we can readily write a variational principle for Eqs. (163) and (164), which is given below.

Corollary 2.

Let $\mathsf{A}_{\rm T}$ be the action functional defined on the space of paths $[t_{1},t_{2}]\rightarrow\mathcal{C}_{0}$ such that

[TABLE]

where

[TABLE]

and the Hamiltonian $\mathcal{H}_{\text{T}}$ is defined in Eq. (186).

A path $t\mapsto(\bm{h}(t),\bm{p}(t),\rho(t),\chi(t))\in\mathcal{C}_{0}$ is a critical point of the action functional (190) for isothermal fluids if and only if $\bm{h}$ , $\bm{p}$ , $\rho$ , and $\chi$ satisfy the following system of PDEs:

[TABLE]

Proof.

This is directly verified by substituting Eqs. (190)–(191) into Eqs. (75)–(78) in Proposition 1. ∎

*Remark 10**.*

It is clear that the isothermal fluid equations (163) and (164) are simply embedded into the LBEP equations above.

The next indication is that, in order to explicitly introduce a scale separation into the fluid equations, the next step we used in the analysis of Sec. VI.1 was passing to the NL-WKB extension of the isothermal fluid equations [see Eqs. (165) and (166)]. Following our results from Sec. IV, these equations are variational as well! The precise statement of this observation is as follows.

Corollary 3.

Let $\gamma:[t_{1},t_{2}]\rightarrow\ell\mathcal{C}_{0}\times C^{\infty}(Q,S^{1})$ be a smooth curve with components $\gamma=(\widetilde{\bm{h}},\widetilde{\bm{p}},\widetilde{\rho},\widetilde{\chi},S)$ . Let the functional $\widetilde{\mathsf{A}}(\gamma)=\int_{t_{1}}^{t_{2}}\widetilde{\mathsf{L}}_{\rm T}(\gamma(t),\partial_{t}\gamma(t))\,dt$ be defined such that

[TABLE]

where $\widetilde{\bm{v}}$ is defined in Eq. (102). Then, the curve $\gamma$ is a (fixed-endpoint) critical point of $\widetilde{\mathsf{A}}(\gamma)$ if and only if the component functions $(\widetilde{\bm{h}},\widetilde{\bm{p}},\widetilde{\rho},\widetilde{\chi},S)$ satisfy

[TABLE]

Proof.

This is immediately verified by substituting Eq. (196) into the result in Theorem 3. ∎

The question that now remains to be answered is whether the variational structure underlying the NL-WKB extension of the isothermal fluid equations is somehow compatible with the asymptotics leading to the wave–mean-flow equations (180)–(183). A geometrically satisfying way to address this question is through the application of a dynamical systems tool known as slow-manifold reduction.

The concept of slow manifolds originated from the theory of fast-slow dynamical systems, which essentially are singularly perturbed dynamical systems (Fenichel, 1979; Verhulst, 2005). Before explaining the role played by slow manifolds in our example, we will first give a quick overview of slow manifold theory.

Definition 14 (Fast-slow dynamical system).

Let $X,Y$ be Banach spaces and $\epsilon\ll 1$ . A fast-slow dynamical system is an ODE on $X\times Y$ of the form

[TABLE]

with $D_{y}f_{0}(x,y)\colon Y\to Y$ an isomorphism when $(x,y)\in f_{0}^{-1}(\{0\})$ . The functions $f_{\epsilon}$ and $g_{\epsilon}$ are required to depend smoothly on $\epsilon$ in such a manner that $f_{\epsilon},g_{\epsilon}=O(1)$ as $\epsilon\rightarrow 0$ .

By convention, the variable $y$ is called the “fast” variable, while the variable $x$ is called the “slow” variable. For fast-slow dynamical systems, it then follows that invariant manifolds given as graphs over the slow variables satisfy a nonlinear (functional) PDE. This is illustrated below.

Lemma 5.

Suppose a fast-slow dynamical system admits an invariant manifold $I_{\epsilon}$ of the form $I_{\epsilon}=\{(x,y)\in X\times Y\mid y=y_{\epsilon}^{\star}(x)\}$ for some smooth map $y_{\epsilon}^{\star}\colon X\to Y$. Then,

[TABLE]

for each $x\in X$ .

Proof.

Supposing $y=y_{\epsilon}^{\star}(x)$, one then inserts this into $\epsilon\dot{y}=f_{\epsilon}(x,y)$. Using the chain rule and substituting the time-evolution equation $\dot{x}=g_{\epsilon}(x,y)$ for $x$ leads to the claimed result. ∎

Definition 15 (Slow manifold).

If $I_{\epsilon}$ is an invariant manifold given as the graph of $y^{\star}_{\epsilon}\colon X\to Y$ , $I_{\epsilon}$ is a slow manifold when $y^{\star}_{\epsilon}(x)$ is a formal power series solution of Eq. (202).

Of the invariant manifolds given as graphs, slow manifolds play a special role for several reasons. First of all, slow manifolds are unique; i.e., if $I_{\epsilon}$ and $I_{\epsilon}^{\prime}$ are two slow manifolds, then $I_{\epsilon}=I_{\epsilon}^{\prime}$ . Moreover, the formal power series expansion of the graphing function $y^{\star}_{\epsilon}(x)$ may be obtained using explicit formulas. In addition, dynamics restricted to the slow manifold is indeed slow; it is simple to check that the time derivatives of both the fast and slow variables are $O(1)$ on the slow manifold. The slow manifold may therefore be interpreted intuitively as the region in phase space where the fast degrees of freedom are not excited.
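To make the order-by-order construction concrete, the following Python sketch treats a toy fast-slow system that is not taken from this paper: $\dot{x}=y$, $\epsilon\dot{y}=x-y$ with $X=Y=\mathbb{R}$, for which $D_{y}f_{0}=-1$ is invertible. Inserting the linear ansatz $y^{\star}_{\epsilon}(x)=c(\epsilon)\,x$ into the invariance condition $\epsilon\,Dy^{\star}_{\epsilon}(x)[g_{\epsilon}(x,y^{\star}_{\epsilon}(x))]=f_{\epsilon}(x,y^{\star}_{\epsilon}(x))$ obtained in the proof of Lemma 5 gives $\epsilon c^{2}=1-c$, so the coefficients of $c=\sum_{n}c_{n}\epsilon^{n}$ obey $c_{0}=1$ and $c_{n+1}=-\sum_{j+k=n}c_{j}c_{k}$. All function names are illustrative.

```python
import math

def slaving_coefficients(order):
    """Coefficients c_0..c_order of c(eps) in the toy slaving function y*(x) = c(eps) * x."""
    coeffs = [1.0]  # c_0 solves f_0(x, y) = x - y = 0, i.e. y = x
    for n in range(order):
        # Order eps^(n+1) of eps*c^2 = 1 - c gives c_{n+1} = -sum_{j+k=n} c_j c_k
        convolution = sum(coeffs[j] * coeffs[n - j] for j in range(n + 1))
        coeffs.append(-convolution)
    return coeffs

def truncated_slope(eps, order):
    """Partial sum of the formal power series for c(eps)."""
    return sum(c * eps**n for n, c in enumerate(slaving_coefficients(order)))

def exact_slope(eps):
    """Root of eps*c^2 + c - 1 = 0 that tends to 1 as eps -> 0."""
    return (-1.0 + math.sqrt(1.0 + 4.0 * eps)) / (2.0 * eps)

coeffs = slaving_coefficients(4)
eps = 0.05
error = abs(truncated_slope(eps, 4) - exact_slope(eps))
```

In this toy problem the quadratic $\epsilon c^{2}+c-1=0$ has a second root that diverges as $\epsilon\rightarrow 0$. It also defines an invariant graph, but not one with a formal power series in $\epsilon$, which illustrates why the slow manifold is singled out among invariant graphs.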

For the purposes of the present discussion, a crucial result on slow-manifold dynamics is that they inherit Hamiltonian structure from the parent fast-slow system whenever the larger system has such a structure. One way to state this fact precisely is as follows.

Theorem 6 (Inheritance of Hamiltonian structure).

Consider a fast-slow system satisfying the variational principle

[TABLE]

with $\delta(x(t_{1}),y(t_{1}))=\delta(x(t_{2}),y(t_{2}))=0$ . Here $\Theta_{\epsilon}$ is an $\epsilon$ -dependent one-form on $X\times Y$ , and $H_{\epsilon}$ is an $\epsilon$ -dependent smooth function on $X\times Y$ . Suppose a slow manifold exists where $y=y^{\star}_{\epsilon}(x)$ . Then, the slow dynamics for the variable $x\in X$ satisfy the variational principle

[TABLE]

with $\delta x(t_{1})=\delta x(t_{2})=0$. Here $\Theta_{\rm slow}(x)[\delta x]\doteq\Theta_{\epsilon}(x,y_{\epsilon}^{\star}(x))[\delta x,Dy_{\epsilon}^{\star}(x)[\delta x]]$, and $H_{\rm slow}(x)\doteq H_{\epsilon}(x,y_{\epsilon}^{\star}(x))$.

Proof.

With the given boundary conditions, it is clear that $\delta A=0$ holds even if the trajectory $t\mapsto(x(t),y(t))$ that is subject to variations lies in the slow manifold, i.e. $y(t)=y^{\star}_{\epsilon}(x(t))$ , since $\delta y(t_{1,2})=Dy^{\star}_{\epsilon}(x(t_{1,2}))\,\delta x(t_{1,2})=0$ . In particular, $\delta A=0$ when variations are constrained to lie along the slow manifold. This is equivalent to saying that the first variation of $A_{\text{slow}}$ , which is $A$ restricted to paths contained in the slow manifold, is zero along a solution contained in the slow manifold. After restriction to the slow manifold, the two terms in the integrand of $A$ may be written

[TABLE]

and $H_{\epsilon}(x,y_{\epsilon}^{\star}(x))=H_{\text{slow}}(x)$ . (We have omitted writing the time dependence explicitly.) Thus, $A_{\text{slow}}$ may be written as in the Theorem statement. Moreover, we have already argued $\delta A_{\text{slow}}=0$ along any solution of the fast-slow system contained in the slow manifold. This completes the proof. ∎
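As a rough numerical illustration of Theorem 6, consider a toy phase-space Lagrangian that is not from this paper: $L=p\,\dot{q}+\epsilon\,y_{1}\dot{y}_{2}-H$ with $H=\tfrac{1}{2}p^{2}+\tfrac{1}{2}(y_{1}^{2}+y_{2}^{2})+q\,y_{1}$. Its Euler–Lagrange equations are $\dot{q}=p$, $\dot{p}=-y_{1}$, $\epsilon\dot{y}_{1}=-y_{2}$, $\epsilon\dot{y}_{2}=y_{1}+q$, so $x=(q,p)$ is slow and $y=(y_{1},y_{2})$ is fast. To the order needed here the slaving function is $y_{1}^{\star}\approx-q$, $y_{2}^{\star}\approx\epsilon p$, which gives $\Theta_{\rm slow}=p\,dq+O(\epsilon^{2})$ and $H_{\rm slow}=\tfrac{1}{2}p^{2}-\tfrac{1}{2}q^{2}+O(\epsilon^{2})$. The sketch below integrates the full system with a hand-written RK4 step, starting on the approximate slow manifold, and compares the result with the reduced dynamics $\dot{q}=p$, $\dot{p}=q$. The step count and the value of $\epsilon$ are arbitrary choices.

```python
import math

def full_rhs(state, eps):
    """Right-hand side of the toy fast-slow system (q, p slow; y1, y2 fast)."""
    q, p, y1, y2 = state
    return (p, -y1, -y2 / eps, (y1 + q) / eps)

def rk4_step(state, dt, eps):
    """One classical fourth-order Runge-Kutta step."""
    def nudged(k, h):
        return tuple(s + h * ki for s, ki in zip(state, k))
    k1 = full_rhs(state, eps)
    k2 = full_rhs(nudged(k1, dt / 2), eps)
    k3 = full_rhs(nudged(k2, dt / 2), eps)
    k4 = full_rhs(nudged(k3, dt), eps)
    return tuple(
        s + dt / 6 * (a + 2 * b + 2 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )

def integrate_full(q0, p0, eps, t_final, steps):
    """Integrate the full system from a point on the approximate slow manifold."""
    state = (q0, p0, -q0, eps * p0)  # y1 = -q, y2 = eps * p
    dt = t_final / steps
    for _ in range(steps):
        state = rk4_step(state, dt, eps)
    return state

def reduced_solution(q0, p0, t):
    """Exact flow of the reduced Hamiltonian H_slow = p^2/2 - q^2/2."""
    return (q0 * math.cosh(t) + p0 * math.sinh(t),
            q0 * math.sinh(t) + p0 * math.cosh(t))

eps = 0.01
q_full, p_full, y1_full, y2_full = integrate_full(1.0, 0.0, eps, 1.0, 2000)
q_red, p_red = reduced_solution(1.0, 0.0, 1.0)
mismatch = abs(q_full - q_red)
```

For small $\epsilon$ the slow component of the full trajectory stays close to the reduced Hamiltonian flow, and the fast variable $y_{1}$ remains close to its slaved value $-q$, which is the behavior the theorem leads one to expect.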

We will now argue that Theorem 6 may be used to systematically derive the variational principle for the leading-order wave–mean-flow equations (180)–(183). The first thing to be verified is that the NL-WKB extension of the isothermal fluid equations (197)–(200), together with the dispersion relation (183) for specifying the dynamics of $S$ , indeed form a fast-slow system. If this is the case, then by Theorem 6, slow-manifold reduction will allow us to construct a variational principle for the slow, wave–mean-flow system. Our argument will then be complete if we can show that the variational principle given by Theorem 6 reproduces the variational principle from Theorem 5. The rest of this subsection will be devoted to establishing that Eqs. (197)–(200), together with the dispersion relation (183), comprise a fast-slow system. The following subsection will sketch the details of manipulating $A_{\text{slow}}$ from Theorem 6 in order to produce $\overline{\mathsf{A}}_{\rm T}$ from Theorem 5.

As in the previous section, we consider only high-frequency, small-amplitude waves. Hence, we adopt the parameterization given in Eqs. (167) for the fields $\widetilde{\rho}$ and $\widetilde{\bm{p}}$. Although it is not technically necessary for the slow-manifold analysis, we shall also parameterize the back-to-labels map $\widetilde{\bm{h}}$ by following the generalized-Lagrangian-mean (GLM) approach proposed by Andrews and McIntyre (1978). For more information on this approach, we also recommend the works by Holm and Gjaja (Holm, 2002a, b; Gjaja and Holm, 1996; Holm, 2002c) as well as Bühler’s accessible book (Bühler, 2009). In GLM theory, one introduces a space $\overline{Q}$ that is diffeomorphic to $Q$ and that is interpreted as the collection of “mean” Eulerian positions. Then $\widetilde{\bm{h}}$ is written as the composition of a mean component $\overline{\bm{h}}\in\text{Diff}(\overline{Q},Q_{0})$ and a fluctuating component $\widehat{\bm{\tau}}\in\text{Diff}(Q,\overline{Q})$, i.e. $\widetilde{\bm{h}}=\overline{\bm{h}}\circ\widehat{\bm{\tau}}$. Additionally, in order to uniquely specify $\widehat{\bm{\tau}}$, we take $\widehat{\bm{\tau}}$ to be a near-identity transformation of the form

[TABLE]

where $\widehat{\bm{\alpha}}:Q\rightarrow\mathbb{R}^{3}$ satisfies $\fint\widehat{\bm{\alpha}}\,d\theta=0$ . Finally, we shall parameterize the Lagrange multiplier $\widetilde{\chi}$ according to

[TABLE]

where $\fint\widehat{\chi}\,d\theta=0$ .

Proposition 9.

With the parameterizations given in Eqs. (167), (206), and (207), Eqs. (197)–(200), together with the dispersion relation (183), are equivalent to a fast-slow dynamical system.

Proof.

We begin by inspecting the equations of motion for the mean fields. Upon $\theta$ -averaging Eqs. (197)–(200), we immediately obtain

[TABLE]

where in the first equation we used $\overline{\partial_{t}^{S/\epsilon}\widetilde{\bm{h}}}=\partial_{t}\overline{\bm{h}}+\mathcal{O}(\epsilon^{4})$ and $\overline{(\widetilde{\bm{p}}/\widetilde{\rho})\cdot\bm{\nabla}^{S/\epsilon}\widetilde{\bm{h}}}=-(\overline{\bm{p}}/\overline{\rho})\cdot\bm{\nabla}\overline{\bm{h}}+\mathcal{O}(\epsilon^{2})$ . We have also introduced the shorthand notation $\overline{Q}=\fint Q\,d\theta$ for denoting averages over $\theta$ . When comparing Eqs. (208)–(211) to Definition 14, it so far seems that the variables $(\overline{\bm{h}},\overline{\bm{p}},\overline{\rho},\overline{\chi})\in\mathcal{C}_{0}$ should be included amongst the slow variables. Additionally, according to the dispersion relation (183), the time derivative $\partial_{t}S=O(1)$ , suggesting that $S$ should be a slow variable.

Let us next examine the dynamical equations for the fluctuating quantities. A straightforward calculation leads to

[TABLE]

where we have omitted $\mathcal{O}(\epsilon)$ terms related to nonlinearities in the fluctuations and $\mathcal{O}(\epsilon)$ terms involving spatial derivatives. These omissions are motivated by the fact that, in order to prove the singularly-perturbed dynamical system $\epsilon\dot{y}=f_{\epsilon}(x,y)$ , $\dot{x}=g_{\epsilon}(x,y)$ is in fact a fast-slow system, it is enough to check that $f_{\epsilon},g_{\epsilon}=O(1)$ and that $D_{y}f_{0}$ is invertible along the zero level of $f_{0}$ .

At first glance, Eqs. (212)–(215) seem to suggest that $\widehat{\bm{\alpha}},\widehat{\bm{p}},\widehat{\rho}$ and $\widehat{\chi}$ should be fast variables. Indeed, the time derivative of each of these fields is generically $O(\epsilon^{-1})$. However, there happens to be a non-trivial combination of these quantities whose time derivative is $O(1)$. It is straightforward to verify that the field $\widehat{\lambda}\colon Q\times S^{1}\to\mathbb{R}$ given by

[TABLE]

satisfies $\partial_{t}\widehat{\lambda}=O(1)$ . This suggests that a viable set of slow variables might be $x=(\overline{\bm{h}},\overline{\bm{p}},\overline{\rho},\overline{\chi},\widehat{\lambda})$ with corresponding fast variables $y=(\widehat{\bm{\alpha}},\widehat{\bm{p}},\widehat{\chi})$ . The rest of the proof will be devoted to showing that, when expressed in terms of $x$ and $y$ , Eqs. (197)–(200), together with the dispersion relation (183), do in fact comprise a fast-slow dynamical system.

In order to write Eqs. (197)–(200) and the dispersion relation (183) in terms of $x$ and $y$, it is only necessary to exchange the dependent variable $\widehat{\rho}$ with the new dependent variable $\widehat{\lambda}$. Because this change of dependent variables is independent of $\epsilon$, our calculations so far already demonstrate that $dx/dt=O(1)$, as required of the slow variables in Definition 14. In order to prove that we have identified the correct fast and slow variables, we therefore only have to show that $\epsilon\,dy/dt=f_{0}(x,y)+O(\epsilon)$ and that $D_{y}f_{0}(x,y)$ is invertible along the zero level of $f_{0}$.

In order to identify $f_{0}(x,y)$ , we substitute the definition of $\widehat{\lambda}$ given by Eq. (216) into Eqs. (212), (213), and (215), thereby obtaining

[TABLE]

These expressions show that $f_{0}(x,y)$ is of the form $f_{0}(x,y)=A(x)[y]+C(x)$ , where $A(x):Y\rightarrow Y$ is a linear map and $C(x)\in Y$ is independent of $y$ . In particular, the derivative of $f_{0}$ with respect to $y$ is given by $D_{y}f_{0}(x,y)=A(x)$ . Thus, for any $(x,y)\in X\times Y$ , $D_{y}f_{0}(x,y)$ is invertible if and only if $A(x)$ is invertible.

We will now complete the proof by showing that $A(x)$ is invertible for all $x\in X$ that satisfy $\nabla S(\bm{x})\neq 0$ for all $\bm{x}\in Q$. Fix $\delta y=(\delta\widehat{\bm{\alpha}},\delta\widehat{\bm{p}},\delta\widehat{\chi})\in Y$. If there is a $y=(\widehat{\bm{\alpha}},\widehat{\bm{p}},\widehat{\chi})$ that solves the equation $A(x)[y]=\delta y$, then, by Eqs. (217)–(219), $y$ must satisfy

[TABLE]

By decomposing Eq. (221) into components parallel and perpendicular to $\nabla S$ , it is straightforward to show that $\partial_{\theta}\widehat{\bm{p}}$ must be given by

[TABLE]

which implies $\widehat{\bm{p}}=(c_{s}|\nabla S|)^{-1}\mathbb{T}\cdot I[\delta\widehat{\bm{p}}]$ , where $I[\delta\widehat{\bm{p}}]$ is the unique $\theta$ -antiderivative of $\delta\widehat{\bm{p}}$ with zero mean, i.e.

[TABLE]

By substituting this expression for $\widehat{\bm{p}}$ into Eqs. (220) and (222), it follows that $\widehat{\bm{\alpha}}$ and $\widehat{\chi}$ must be given by

[TABLE]

where $I^{2}[\delta\widehat{\bm{p}}/\overline{\rho}]=I[I[\delta\widehat{\bm{p}}/\overline{\rho}]]$ denotes the antiderivative operator applied two times. Thus, if there is a $y$ satisfying $A(x)[y]=\delta y$ , then that $y$ is unique. Conversely, by substituting the above expressions for $y$ back into $A(x)[y]=\delta y$ , we conclude that a solution $y$ exists for any $\delta y$ . Therefore $A(x)$ is invertible as claimed. ∎
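The operator $I$ used in the proof can be illustrated numerically. The Python sketch below, whose names are illustrative and do not come from the paper, approximates the zero-mean $\theta$-antiderivative of a periodic profile sampled on a uniform grid by dividing each nonzero Fourier mode by $ik$. It assumes the input has zero $\theta$-mean, as the fluctuating fields here do, and it uses a direct $O(n^{2})$ discrete Fourier transform to stay within the standard library.

```python
import cmath
import math

def zero_mean_antiderivative(samples):
    """Zero-mean periodic antiderivative of samples given at theta_j = 2*pi*j/n."""
    n = len(samples)
    # Discrete Fourier coefficients, normalized so that
    # f(theta_j) = sum_k coeffs[k] * exp(2*pi*i*k*j/n).
    coeffs = [
        sum(samples[j] * cmath.exp(-2j * math.pi * k * j / n) for j in range(n)) / n
        for k in range(n)
    ]
    result = []
    for j in range(n):
        theta = 2 * math.pi * j / n
        total = 0j
        for k in range(n):
            m = k if k <= n // 2 else k - n  # signed wavenumber
            if m == 0 or (n % 2 == 0 and k == n // 2):
                continue  # drop the mean and the unpaired Nyquist mode
            total += coeffs[k] / (1j * m) * cmath.exp(1j * m * theta)
        result.append(total.real)
    return result

n = 32
thetas = [2 * math.pi * j / n for j in range(n)]
profile = [math.cos(t) + 0.5 * math.sin(2 * t) for t in thetas]
once = zero_mean_antiderivative(profile)
twice = zero_mean_antiderivative(zero_mean_antiderivative([math.cos(t) for t in thetas]))
```

Here `once` approximates $\sin\theta-\tfrac{1}{4}\cos 2\theta$, and `twice` approximates $-\cos\theta$, the double antiderivative $I^{2}$ of $\cos\theta$.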

*Remark 11**.*

In the above Proposition, the dispersion relation (183) plays two important roles. First, it supplies an evolution equation for $S$. This is necessary for the Proposition to work because if the dispersion relation were not imposed, then Eqs. (197)–(200) would not specify a dynamical system, let alone a fast-slow dynamical system. Second, it ensures that the system supports wave motion whose asymptotic behavior is captured by the NL-WKB ansatz.

*Remark 12**.*

As is true of all fast-slow systems, Eqs. (197)–(200), together with the dispersion relation (183), admit a slow manifold. In terms of the fast and slow variables identified in the proof of the Proposition, the slow manifold is a subset of $X\times Y$ of the form $I_{\epsilon}=\{(x,y)\in X\times Y\mid y=y^{\star}_{\epsilon}(x)\}$, where $y^{\star}_{\epsilon}$ is the so-called slaving function. Using the expressions for the inverse of $A(x)$ from the proof, it is straightforward to find the leading-order term in the slaving function, $y^{\star}_{0}(x)=(\widehat{\bm{\alpha}}_{0}^{\star},\widehat{\bm{p}}_{0}^{\star},\widehat{\chi}_{0}^{\star})$. We have

[TABLE]

While it was convenient to introduce the dependent variable $\widehat{\lambda}$ for the sake of showing equivalence with a fast-slow system, now that the existence of the slow manifold has been established, we are free to express the slow manifold in terms of $\widehat{\rho}$ instead of $\widehat{\lambda}$. By a slight abuse of notation, the slow manifold may be written in terms of $\widehat{\rho}$ as

[TABLE]

where now $y_{\epsilon}^{\star}=(\widehat{\bm{\alpha}}^{\star}_{\epsilon},\widehat{\bm{p}}^{\star}_{\epsilon},\widehat{\chi}_{\epsilon}^{\star})$ is a function of $(\overline{\bm{h}},\overline{\bm{p}},\overline{\chi},\widehat{\rho},S)$. In this alternate representation, the leading-order terms in the slaving functions are given by:

[TABLE]

We remind the reader that the antiderivative operator $I$ was defined in Eq. (224).

*Remark 13**.*

We are now in a good position to prove Proposition 8.

Proof of Proposition 8.

In Corollary 3, we demonstrated that the NL–WKB extension of the isothermal fluid equations (165) and (166) is variational. Additionally, Theorem 3 shows that all extLBEP fluid equations imply a wave-action conservation equation (121). Upon substituting the action (196) and the Hamiltonian (186) into Eq. (121), we obtain

[TABLE]

The specific wave action density $\widetilde{\mathcal{I}}$ is given by Eq. (116), which we rewrite below for clarity:

[TABLE]

Here $\widetilde{\bm{\zeta}}=-(\partial_{\theta}\widetilde{\bm{h}})\cdot(\bm{\nabla}\widetilde{\bm{h}}+\epsilon^{-1}\bm{\nabla}S\otimes\partial_{\theta}\widetilde{\bm{h}})^{-1}$ . Let us now calculate the terms in Eqs. (234) and (235) by substituting the leading-order slaving functions (231)–(233). Specifically, when inserting $\widetilde{\bm{h}}=\overline{\bm{h}}\circ\widehat{\bm{\tau}}$ and $\widehat{\bm{\tau}}(\bm{x})\simeq\bm{x}+\epsilon^{2}\widehat{\bm{\alpha}}_{0}^{\star}(\bm{x})$ into $\widetilde{\bm{\zeta}}$ , we obtain

[TABLE]

We then substitute this result as well as the parameterizations (167), (206), and (207) and the leading-order slaving functions (231)–(233) into the first term in Eq. (234). We obtain

[TABLE]

where $\mathcal{I}$ is the wave action density defined in Eq. (176). For the second term in Eq. (234), a similar calculation leads to

[TABLE]

Finally, inserting Eqs. (237) and (238) into Eq. (234) leads to our claim in Proposition 8. ∎

VI.3 Calculation of the effective action $\overline{\mathsf{A}}_{\rm T}$

In Sec. VI.2, we gave the general arguments explaining why the wave–mean-flow equations (180)–(183) are variational. We also broadly discussed how to calculate the effective action (184) for wave–mean-flow interactions by using slow-manifold reduction (see Theorem 6). In this section, we present some of the technical details involved in obtaining Eq. (184).

We start from the NL–WKB extended action (196) for the isothermal fluid equations. As was done in the proof of Theorem 4, we apply a phase shift to the fields. This gives

[TABLE]

where the superscript “$S/\epsilon$” denotes that the angle $\theta$ is shifted by $S/\epsilon$ (see Definition 11).

As explained in Theorem 6, after a slow manifold $I_{\epsilon}$ has been identified, one can restrict the action of the parent fast-slow system to the slow manifold $I_{\epsilon}$ in order to obtain an effective action for the slow variables only. Following this same procedure, we substitute the parameterizations (167), (206), and (207) into Eq. (239). We then restrict the fast variables $(\widehat{\bm{\alpha}},\widehat{\bm{p}},\widehat{\chi})$ to the slow manifold by using Eqs. (231)–(233). This leads to

[TABLE]

where $\widetilde{\bm{p}}^{\star S/\epsilon}_{0}=\overline{\bm{p}}+\epsilon\widehat{\bm{p}}^{\star S/\epsilon}_{0}$ and similarly for the rest of the variables restricted to the slow manifold. From here on, we shall consider the Lagrangian (240) restricted to the lowest-order slaving functions. Hence, to simplify our notation, we shall omit the “0” subscript when referring to the lowest-order slaving functions (231)–(233).

Before explicitly substituting the expressions for $\widehat{\bm{\alpha}}^{\star}$, $\widehat{\bm{p}}^{\star}$, and $\widehat{\chi}^{\star}$ into Eq. (240), it is convenient to first perform a variable transformation. (The following transformation is closely related to the well-known oscillation-center transform used in kinetic theories for plasma–wave interactions; see Dewar, 1973.) First, we note that the velocity

[TABLE]

can be written as

[TABLE]

where $(\widetilde{\tau}^{\star S/\epsilon})^{*}$ is the pullback associated to $\widetilde{\bm{\tau}}^{\star S/\epsilon}$ , $\overline{\bm{v}}$ is the mean Lagrangian velocity

[TABLE]

and $\widetilde{\bm{\nu}}^{\star S/\epsilon}$ is the velocity associated to $\widetilde{\bm{\tau}}^{\star S/\epsilon}$ :

[TABLE]

In order to simplify the expression for the velocity (242), it is convenient to apply the pushforward $\widetilde{\tau}^{\star S/\epsilon}_{*}$ to the symplectic part of the Lagrangian (240). This leads to

[TABLE]

Some remarks should be given on the symbols appearing in Eq. (245). First, the term $\widetilde{\tau}^{\star S/\epsilon}_{*}\widetilde{\bm{p}}^{\star S/\epsilon}$ is understood as the pushforward $\widetilde{\tau}^{\star S/\epsilon}_{*}$ acting on $\widetilde{\bm{p}}^{\star S/\epsilon}$, which is treated as a one-form density in the domain $Q$. Similarly, $\widetilde{\tau}^{\star S/\epsilon}_{*}\widetilde{\rho}^{S/\epsilon}$ is interpreted as the pushforward $\widetilde{\tau}^{\star S/\epsilon}_{*}$ acting on $\widetilde{\rho}^{S/\epsilon}$, which is treated as a three-form in the domain $Q$. We also introduced a new variable

[TABLE]

which is a scalar in the domain $Q$ . Finally, we used the Lie derivative theorem to establish the identities $\partial_{t}\widetilde{\chi}^{\star S/\epsilon}=\partial_{t}[(\widetilde{\tau}^{\star S/\epsilon})^{*}\widetilde{\varphi}^{\star S/\epsilon}]=(\widetilde{\tau}^{\star S/\epsilon})^{*}\partial_{t}\widetilde{\varphi}^{\star S/\epsilon}+(\widetilde{\tau}^{\star S/\epsilon})^{*}\mathfrak{L}_{\widetilde{\bm{\nu}}^{\star S/\epsilon}}\widetilde{\varphi}^{\star S/\epsilon}$ and $\mathfrak{L}_{\widetilde{\bm{v}}^{\star S/\epsilon}}\widetilde{\chi}^{\star S/\epsilon}=(\widetilde{\tau}^{\star S/\epsilon})^{*}\mathfrak{L}_{\overline{\bm{v}}-\widetilde{\bm{\nu}}^{\star S/\epsilon}}\widetilde{\varphi}^{\star S/\epsilon}$ . Note that, by applying the pushforward $\widetilde{\tau}^{\star S/\epsilon}_{*}$ , we were able to replace the oscillating velocity $\widetilde{\bm{v}}^{\star S/\epsilon}$ appearing in the $\widetilde{\bm{v}}^{\star S/\epsilon}\cdot\bm{\nabla}\widetilde{\chi}^{\star S/\epsilon}$ term of Eq. (239) with the mean Lagrangian velocity $\overline{\bm{v}}$ . This was originally the main motivation for the transformation.

The next step is to explicitly calculate the terms $\widetilde{\bm{\nu}}^{\star S/\epsilon}$ , $\widetilde{\tau}^{\star S/\epsilon}_{*}\widetilde{\bm{p}}^{\star S/\epsilon}$ , $\widetilde{\tau}^{\star S/\epsilon}_{*}\widetilde{\rho}^{S/\epsilon}$ , and $\widetilde{\varphi}^{\star S/\epsilon}$ appearing in Eq. (245). Let us first start by calculating $\widetilde{\bm{\nu}}^{\star S/\epsilon}$ in Eq. (244). Since $(\widetilde{\bm{\tau}}^{\star S/\epsilon})^{-1}$ is a near-identity transformation, one can verify that $(\widetilde{\bm{\tau}}^{\star S/\epsilon})^{-1}(\bm{x})=\bm{x}-\epsilon^{2}\widehat{\bm{\alpha}}^{\star S/\epsilon}(\bm{x})+\mathcal{O}(\epsilon^{3})$ . Substituting this into Eq. (244) gives

[TABLE]

Here the term “$\epsilon^{2}\mathrm{Osc}$” means that we have neglected $\mathcal{O}(\epsilon^{3})$ terms and that we have omitted writing fluctuating terms that are $\mathcal{O}(\epsilon^{2})$ whose $\theta$-average is zero. Since we are only calculating the effective Lagrangian up to $\mathcal{O}(\epsilon^{2})$, it is safe to omit those terms since they will not contribute anything once the Lagrangian (245) is explicitly $\theta$-averaged. More specifically, the term $\epsilon(\partial_{t}S)\partial_{\theta}\widehat{\bm{\alpha}}^{\star S/\epsilon}$ is kept because it could later multiply another $\mathcal{O}(\epsilon)$ term in the Lagrangian (245). The term $\epsilon^{2}(\bm{\nabla}S\cdot\partial_{\theta}\widehat{\bm{\alpha}}^{\star S/\epsilon})(\partial_{t}S)\partial_{\theta}\widehat{\bm{\alpha}}^{\star S/\epsilon}$ is also kept because it is quadratic in $\widehat{\rho}$, so it has a non-zero $\theta$-average. The term $\epsilon^{2}(\partial_{t}\widehat{\bm{\alpha}}^{\star})^{S/\epsilon}$ is omitted because it is oscillatory, so its $\theta$-average is zero. Also, the term $\epsilon^{2}\partial_{\theta}((\bm{\nabla}S\cdot\widehat{\bm{\alpha}}^{\star S/\epsilon})(\partial_{t}S)\partial_{\theta}\widehat{\bm{\alpha}}^{\star S/\epsilon})$ is omitted because it is a total derivative in $\theta$, which will vanish when integrating over $\theta$. Finally, substituting the expression for $\widehat{\bm{\alpha}}^{\star S/\epsilon}$ in Eq. (231) gives

[TABLE]

where $\bm{e}_{\bm{k}}\doteq\bm{\nabla}S/|\bm{\nabla}S|$ .
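The bookkeeping used above (terms linear in a zero-mean fluctuation and total $\theta$-derivatives average to zero, while quadratic terms generally do not) can be checked numerically. The Python sketch below uses a hypothetical single-harmonic profile $\sin\theta$ as a stand-in for a zero-mean fluctuation; it is not the slaving function of Eq. (231), and the helper names are illustrative.

```python
import math

def theta_average(func, n=256):
    """Uniform-grid approximation of (1/2pi) * integral of func over [0, 2pi)."""
    return sum(func(2 * math.pi * j / n) for j in range(n)) / n

def alpha(theta):
    """Hypothetical zero-mean profile standing in for a fluctuating field."""
    return math.sin(theta)

def d_alpha(theta):
    """First theta-derivative of alpha."""
    return math.cos(theta)

def d2_alpha(theta):
    """Second theta-derivative of alpha."""
    return -math.sin(theta)

# A term linear in the fluctuation is purely oscillatory and averages to zero.
linear_term = theta_average(d_alpha)

# A term quadratic in the fluctuation has a non-zero mean.
quadratic_term = theta_average(lambda th: d_alpha(th) ** 2)

# A total derivative d/dtheta (alpha * d_alpha) = d_alpha^2 + alpha * d2_alpha averages to zero.
total_derivative_term = theta_average(
    lambda th: d_alpha(th) ** 2 + alpha(th) * d2_alpha(th)
)
```

Only `quadratic_term` survives the average, which mirrors why the quadratic contribution is retained in the expression for $\widetilde{\bm{\nu}}^{\star S/\epsilon}$ while the oscillatory and total-derivative contributions are dropped.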

Let us now proceed by calculating the term $\widetilde{\tau}^{\star S/\epsilon}_{*}\widetilde{\rho}^{S/\epsilon}$ appearing in Eq. (245). Remembering that the density should be considered as a 3-form, we obtain

[TABLE]

where in the last line, we substituted Eq. (231) so that $\bm{\nabla}S\cdot(\partial_{\theta}\widehat{\bm{\alpha}}^{\star S/\epsilon}\overline{\rho})=\widehat{\rho}^{S/\epsilon}$ . We also used the well-known formula for the determinant of a near-identity matrix:

[TABLE]

where at the end, the terms in parentheses cancel because $\widehat{\bm{\alpha}}^{\star}$ is parallel to $\bm{\nabla}S$ .
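The cancellation mechanism can be illustrated with a small numerical check. The Python sketch below assumes that the formula referred to above is the standard expansion $\det(\mathbb{1}+\epsilon\mathsf{M})=1+\epsilon\,\mathrm{tr}\,\mathsf{M}+\tfrac{1}{2}\epsilon^{2}[(\mathrm{tr}\,\mathsf{M})^{2}-\mathrm{tr}(\mathsf{M}^{2})]+O(\epsilon^{3})$, and it uses a rank-one matrix $\bm{u}\otimes\bm{v}$ as one sufficient condition for the bracket to vanish. The vectors and matrices are arbitrary test data, not quantities from the paper.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def trace(m):
    return m[0][0] + m[1][1] + m[2][2]

def matmul(x, y):
    return [[sum(x[i][k] * y[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def outer(u, v):
    """Rank-one matrix u (x) v."""
    return [[ui * vj for vj in v] for ui in u]

def identity_plus(m, eps):
    """Near-identity matrix 1 + eps * m."""
    return [[(1.0 if i == j else 0.0) + eps * m[i][j] for j in range(3)] for i in range(3)]

def second_order_det(m, eps):
    """Second-order expansion of det(1 + eps * m)."""
    tr = trace(m)
    tr_sq = trace(matmul(m, m))
    return 1.0 + eps * tr + 0.5 * eps**2 * (tr**2 - tr_sq)

eps = 0.1

# Rank-one perturbation: the second-order bracket vanishes identically.
u, v = (1.0, 2.0, 3.0), (0.5, -1.0, 2.0)
rank_one = outer(u, v)
bracket = trace(rank_one) ** 2 - trace(matmul(rank_one, rank_one))
rank_one_det = det3(identity_plus(rank_one, eps))

# Generic perturbation: the expansion is off by exactly eps^3 * det(m) in 3D.
generic = [[1.0, 2.0, 0.0], [0.0, -1.0, 3.0], [2.0, 1.0, 1.0]]
generic_gap = det3(identity_plus(generic, eps)) - second_order_det(generic, eps)
```

For the rank-one case the determinant reduces to $1+\epsilon\,\bm{u}\cdot\bm{v}$ with no quadratic correction, which is the kind of cancellation invoked in the text.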

In a similar manner, we can calculate the term $\widetilde{\tau}^{\star S/\epsilon}_{*}\widetilde{\bm{p}}^{\star S/\epsilon}$. Note, however, that we should consider $\widetilde{\bm{p}}^{\star S/\epsilon}$ as a one-form density so that the above is written as $\widetilde{\tau}^{\star S/\epsilon}_{*}(\widetilde{\bm{p}}^{\star S/\epsilon}\cdot d\bm{x}\otimes d^{3}\bm{x})$. A direct calculation leads to

[TABLE]

where in the last line, we substituted the expressions for $\widehat{\bm{\alpha}}^{\star}$ and $\widehat{\bm{p}}^{\star}$ in Eqs. (231) and (232).

Finally, a far simpler calculation of $\widetilde{\varphi}^{\star S/\epsilon}$ introduced in Eq. (246) gives

[TABLE]

We now insert Eqs. (247), (249), (251), and (252) into the Lagrangian (245). Starting from the first integral in Eq. (245), we substitute the obtained expressions for $\widetilde{\tau}^{\star S/\epsilon}_{*}\widetilde{\bm{p}}^{\star S/\epsilon}$ and $\widetilde{\bm{\nu}}^{\star S/\epsilon}$. We then Whitham-average (i.e., $\theta$-average) the Lagrangian and only keep terms up to $\mathcal{O}(\epsilon^{2})$. We obtain

[TABLE]

where $\mathcal{I}$ is the wave action density introduced in Eq. (176). For the next term of the Lagrangian (245), we substitute Eqs. (249) and (252). This leads to

[TABLE]

In a similar manner, substituting Eqs. (167) and (232) gives the following for the $\theta$ -averaged Hamiltonian:

[TABLE]

When combining the results in Eqs. (253)–(255), we obtain the effective action given in Eq. (184), which we rewrite below for convenience:

[TABLE]

In summary, in this section we have presented additional details for calculating the effective action for wave–mean-flow interactions in an isothermal fluid. Our method was primarily based on slow-manifold reduction, whose general ideas were presented in Sec. VI.2. Our derivation followed two main steps. First, we restricted the variational principle to the slow manifold. This is essentially done by substituting the expressions obtained for the fast variables. Second, we identified a transformation that facilitated computations of the wave–mean-flow action.

VII Discussion

In this article we have identified the variational structure underlying the nonlinear WKB method (Whitham, 1965a; Miura and Kruskal, 1974) as it applies to ideal fluid equations in the Eulerian frame. This work therefore complements previous studies on variational nonlinear WKB in the mean Eulerian frame (Dewar, 1970; Bretherton, 1971; Gjaja and Holm, 1996). Our main results concern what we have termed the nonlinear WKB extension procedure, which is the technique used for generating a system of equations governing the profile functions appearing in the nonlinear WKB ansatz. Our results may be summarized as follows. (i) Given Eulerian fluid equations arising from an Euler-Poincaré variational principle (Holm, Marsden, and Ratiu, 1998), we have shown that the enlarged system resulting from the nonlinear WKB extension procedure also arises from a variational principle. (ii) This new variational principle inherits a “looped” version of the original system’s symmetry group. After recognizing that a subgroup of this looped group comprises a looped version of the particle relabeling group, we have used Noether’s theorem to identify a family of circulation invariants parameterized by $S^{1}$. (iii) By combining the newly discovered class of variational principles with ideas from the theory of slow manifold reduction, we have presented an example of a systematic procedure for identifying variational principles governing the self-consistent interaction between (possibly nonlinear) locally-plane waves and mean flows.

Our analysis made use of several technical assumptions that are straightforward to relax. In particular, we restricted our attention to barotropic fluid equations that arise from a local Lagrangian. A more general equation of state involving an advected entropy could readily be incorporated into our discussion by an enterprising reader. Similarly, it would not be prohibitively difficult to allow for spatial non-locality in the Lagrangian. (On the other hand, temporal nonlocality would not be simple to include.) More generally, extensions of our work to fluid systems not discussed in this paper may readily be accommodated as long as the proof of Theorem 4 remains intact.

Two key technical features that distinguish our work from much of the previous work on variational fluid mechanics are (i) our use of the inverse of the Lagrangian configuration map $\bm{h}=\bm{g}^{-1}$ , and (ii) our use of the fluid phase-space Lagrangian (akin to $L=p\dot{q}-H(q,p)$ ). The use of $\bm{h}(\bm{x})$ instead of $\bm{g}(\bm{x}_{0})$ allowed us to reformulate the Euler-Poincaré approach to fluid variational principles in terms of conventional classical field theory, which in turn enabled us to apply Whitham’s averaged Lagrangian technique in the Eulerian frame. Our inspiration for this shift in perspective came from Ref. Beig and Schmidt, 2003, which explains the use of $\bm{h}$ within the theory of relativistic elastic solids. Using the phase-space Lagrangian allowed us to apply Theorem 6 on the inheritance of Hamiltonian structure in order to explain the variational principle underlying the interaction between small-amplitude acoustic waves and a compressible barotropic mean flow. This same idea was used in Refs. Burby, 2017 and Burby and Sengupta, 2018 to explain the Hamiltonian structures underlying magnetohydrodynamics and kinetic magnetohydrodynamics, respectively.

It is most interesting to compare the approach we have introduced here for variational modeling of wave-mean-flow interaction with earlier approaches (Dewar, 1970; Bretherton, 1971; Gjaja and Holm, 1996; Holm, 2002a, b, c) based on generalized Lagrangian mean (GLM) theory (Andrews and McIntyre, 1978; Bühler, 2009). As an intuitively appealing way of representing waves superimposed on a mean flow, previous authors have decomposed the Lagrangian configuration map $\bm{g}$ as the composition of a mean configuration map with a fluctuating configuration map. This decomposition forms the foundation of GLM theory. The mean configuration map takes values in (and in fact defines) the mean Eulerian frame, while the fluctuating configuration map takes values in the conventional Eulerian frame. When the averaging operation is identified with WKB phase averaging, the prevailing trend has then been to express the fluctuating configuration map in terms of the WKB ansatz. This effectively amounts to applying the WKB method within the mean Eulerian frame. While this approach obfuscates the connection between wave-mean-flow dynamics and the conventional Eulerian-frame WKB method, especially at higher orders in asymptotic expansions, it is compatible with variational formulations of fluid dynamics in a simple manner. Indeed, it is straightforward to decompose the Lagrangian configuration map in an Euler-Poincaré variational principle using the GLM ansatz, and then apply WKB phase averaging to the result (Dewar, 1970; Bretherton, 1971; Gjaja and Holm, 1996). In contrast, the perspective taken in our new approach is that, in principle, there is no need to introduce the mean Eulerian frame in order to identify wave-mean-flow variational principles. Instead, one can start from our new variational principle for the nonlinear WKB extension of the Eulerian-frame fluid equations, and then apply slow manifold reduction to obtain the desired reduced variational principle for wave-mean-flow interaction. Aside from maintaining a clear link with the Eulerian-frame WKB procedure, a benefit of this approach is that it systematically incorporates the closure (a.k.a. “slaving” or “balance”) relations needed to express the rapidly-varying fluctuations in terms of slowly-varying mean quantities, thereby eliminating the risk of unwanted fast modes creeping into the variational principle. (For an example of the latter phenomenon, see the Hamiltonian models in Refs. Burby et al., 2015 and Brizard and Tronci, 2016 for low-frequency dynamics of strongly-magnetized plasmas. Those models support high-frequency electromagnetic waves that must be handled with care.) Interestingly, however, our calculations have revealed that it is practically expedient to express our variational principle in terms of mean-Eulerian-frame quantities. In so doing, the Lagrangian simplifies dramatically. In fact, it is not at all clear that the Lagrangian expressed in terms of conventional Eulerian-frame quantities behaves well with respect to truncation, i.e., when high-order terms in the asymptotic expansions of the slaving functions are dropped. Thus, GLM theory plays an important practical role in our new formalism, even though it is not a necessary ingredient at a conceptual level.
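The GLM-style decomposition described in this paragraph can be written schematically as follows. The notation is assumed here purely for illustration ($\overline{\bm{g}}$ for the mean configuration map, $\bm{\Xi}$ for the fluctuating configuration map, $\bm{\xi}$ for the fluctuating displacement, $\langle\cdot\rangle$ for the phase average); it is a sketch of the general idea, and the cited references may use different conventions.

$$
\bm{g}=\bm{\Xi}\circ\overline{\bm{g}},\qquad
\bm{\Xi}(\bm{x},t)=\bm{x}+\bm{\xi}(\bm{x},t),\qquad
\langle\bm{\xi}\rangle=0,
$$

$$
\bm{\xi}(\bm{x},t)=\tilde{\bm{\xi}}\big(\bm{x},t,S(\bm{x},t)/\epsilon\big),\qquad
\tilde{\bm{\xi}}(\bm{x},t,\theta+2\pi)=\tilde{\bm{\xi}}(\bm{x},t,\theta).
$$

In this sketch $\bm{x}$ labels a point in the mean Eulerian frame, so the WKB ansatz in the second line is imposed on a field defined over the mean frame rather than over the conventional Eulerian frame.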

Given the dichotomy between our new method and the established mean-Eulerian frame approach, it is also interesting to ask how the family of circulation invariants given in Corollary 1 relates to the mean circulation invariants of Refs. Bretherton, 1971 and Gjaja and Holm, 1996. Because the family of circulation invariants identified in Corollary 1 is parameterized by the angle $\theta\in S^{1}$ , it may be averaged over $\theta$ to obtain a mean circulation invariant for the nonlinear WKB extension of the ideal barotropic fluid equations. In particular, the averaged circulation is constant along solutions that lie in the slow manifold. Thus, the $S^{1}$ -mean of our family of circulation invariants restricted to the slow manifold is a circulation invariant for the wave-mean-flow dynamics. It is in this manner that mean circulation invariants of the types found in Refs. Bretherton, 1971 and Gjaja and Holm, 1996 emerge from our formalism. As an illustration of this point, we prove in Appendix A that the average of our family of circulation invariants restricted to the slow manifold is equivalent to the circulation invariant associated with mean particle relabeling symmetry of the wave–mean-flow Lagrangian (196).
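The averaging step in this argument can be made explicit with a short worked equation. Here $\Gamma(\theta)$ is a symbol introduced only for this illustration; it stands for the circulation invariant of Corollary 1 at phase $\theta$, evaluated along a given solution.

$$
\frac{d}{dt}\Gamma(\theta)=0\ \ \text{for every }\theta\in S^{1}
\quad\Longrightarrow\quad
\frac{d}{dt}\,\frac{1}{2\pi}\int_{0}^{2\pi}\Gamma(\theta)\,d\theta
=\frac{1}{2\pi}\int_{0}^{2\pi}\frac{d}{dt}\Gamma(\theta)\,d\theta=0 .
$$

Since the implication holds along every solution of the extended system, it holds in particular along solutions that lie in the slow manifold.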

In the future, we plan to use the tools developed in this article to capture the effects of harmonic generation and corrections to ray trajectories caused by space-dependent wave polarization (Littlejohn and Flynn, 1991; Ruiz and Dodin, 2017) in wave-mean-flow problems arising in fluids and plasmas.

Acknowledgements.

The authors would like to thank Richard Montgomery and Cesare Tronci for a number of helpful discussions of this work at MSRI. This material is based upon work supported by the National Science Foundation under Grant No. DMS-1440140 while one of the authors (JWB) was in residence at the Mathematical Sciences Research Institute in Berkeley, California, during the Fall 2018 semester. In addition, research presented in this article was supported by (1) the Laboratory Directed Research and Development program of Los Alamos National Laboratory under project number 20180756PRD4, and (2) Sandia National Laboratories. Sandia National Laboratories is a multimission laboratory managed and operated by National Technology and Engineering Solutions of Sandia, LLC., a wholly owned subsidiary of Honeywell International, Inc., for the U.S. DOE National Nuclear Security Administration under contract DE-NA-0003525. This paper describes objective technical results and analysis. Any subjective views or opinions that might be expressed in the paper do not necessarily represent the views of the U.S. DOE or the U.S. Government.

Appendix A Proof of mean circulation theorem

Corollary 4 (Kelvin’s theorem for wave–mean-flow system).

Given a closed curve $C_{0}\subset Q_{0}$ and a solution $(\overline{\bm{h}},\overline{\bm{p}},\overline{\rho},\overline{\chi},\mathcal{I},S)$ of the wave–mean-flow equations (180)–(183), then

[TABLE]

where $\overline{C}=\overline{\bm{h}}^{-1}(C_{0})$ .

Thus, in the wave–mean-flow framework developed here, the circulation theorem (257) is now a closed contour integral of the fluid momentum minus a term related to the wave momentum. The modification of Kelvin’s circulation theorem due to wave effects has been noticed before (Bretherton, 1971). In essence, this result shows that waves can affect the vorticity of the bulk fluid. The last term in Eq. (257) is sometimes referred to as “wave pseudomomentum” (Gjaja and Holm, 1996; Bühler, 2009).

Proof.

Equation (257) can be proven by following a procedure similar to that used in Lemma 4. The only difference is that now the symplectic form associated with $\overline{\mathcal{C}}_{0}$ is $-\bm{\mathrm{d}}\overline{\Theta}$ , where the 1-form $\overline{\Theta}$ is given by

[TABLE]

and $\overline{\bm{\xi}}\doteq-\delta\overline{\bm{h}}\cdot(\bm{\nabla}\overline{\bm{h}})^{-1}$ . Alternatively, we can also show the result in Eq. (257) by using the general Kelvin’s theorem for Eulerian WKB obtained in Corollary 1. Indeed, since solutions along the slow manifold [see, e.g., Eqs. (231)–(233)] are solutions of the NL–WKB isothermal fluid equations (197)–(200), then Eq. (257) can be obtained by restricting Eq. (153) onto the slow manifold and averaging over the phase $\theta$ . Specifically, we have

[TABLE]

Following a calculation similar to that in Eq. (251), we can calculate the pushforward appearing in the integral above:

[TABLE]

where, in the last line, some $\mathcal{O}(\epsilon^{2})$ terms were written as a total derivative with respect to $\theta$ . Since their $\theta$ -average is zero, we have not written them out and instead collect them under the symbol “ $\epsilon^{2}\mathrm{Osc}$ ”. Finally, averaging over $\theta$ and substituting the expressions for $\widehat{\bm{p}}^{\star S/\epsilon}$ and $\mathcal{I}$ leads to
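The reason a total $\theta$-derivative drops out of the phase average is periodicity. As a brief sketch, let $f(\bm{x},\theta)$ be any smooth function that is $2\pi$-periodic in $\theta$ (the symbol $f$ is introduced here only for illustration and stands for whatever appears under the derivative). Then

$$
\frac{1}{2\pi}\int_{0}^{2\pi}\partial_{\theta}f(\bm{x},\theta)\,d\theta
=\frac{f(\bm{x},2\pi)-f(\bm{x},0)}{2\pi}=0 .
$$

This is why terms of this form contribute nothing after averaging over $\theta$.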

[TABLE]

Inserting this into Eq. (259) finishes the proof. ∎

Bibliography

1. G. B. Whitham, “Non-linear dispersive waves,” Proc. Roy. Soc. Lond. A 283, 238 (1965a).
2. R. M. Miura and M. D. Kruskal, “Application of a non linear WKB method to the Korteweg-De Vries equation,” SIAM J. Appl. Math. 26, 376 (1974).
3. D. D. Holm, J. E. Marsden, and T. S. Ratiu, “The Euler-Poincaré equations and semidirect products with applications to continuum theories,” Adv. Math. 137, 1 (1998).
4. G. B. Whitham, “A general approach to linear and non-linear dispersive waves using a Lagrangian,” J. Fluid Mech. 22, 273 (1965b).
5. R. L. Dewar, “Interaction between hydromagnetic waves and a time-dependent inhomogeneous medium,” Phys. Fluids 13, 2710 (1970).
6. F. P. Bretherton, “The general linearised theory of wave propagation,” (Am. Math. Soc., 1971) Chap. 6, pp. 61–102.
7. W. A. Newcomb, Nucl. Fusion Suppl. Pt. 2, 451 (1962).
8. I. Gjaja and D. D. Holm, “Self-consistent Hamiltonian dynamics of wave mean-flow interaction for a rotating stratified incompressible fluid,” Physica D 98, 343 (1996).
