This paper introduces the stochastic Lagrangian formalism as a theoretical framework for analyzing statistical dynamics of systems governed by stochastic differential equations, emphasizing its invariance properties and applications.
Contribution
It presents a comprehensive review of stochastic Lagrangian formalism and demonstrates its application to physically relevant stochastic dynamical systems.
The approach highlights invariance properties of statistical dynamics.
03
Applications to physical systems illustrate the formalism's utility.
Abstract
The concept of stochastic Lagrangian and its use in statistical dynamics is illustrated theoretically, and with some examples. Dynamical variables undergoing stochastic differential equations are stochastic processes themselves, and their realization probability functional within a given time interval arises from the interplay between the deterministic parts of dynamics and noise statistics. The stochastic Lagrangian is a tool to formulate realization probabilities via functional integrals, once the statistics of noises involved in the stochastic dynamical equations is known. In principle, it allows to highlight the invariance properties of the statistical dynamics of the system. In this work, after a review of the stochastic Lagrangian formalism, some applications of it to physically relevant cases are illustrated.
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Full text
Stochastic Lagrangians for Statistical Dynamics
Massimo Materassi
Abstract
The concept of stochastic Lagrangian and its use in statistical dynamics
is illustrated theoretically, and with some examples.
Dynamical variables undergoing stochastic differential equations are
stochastic processes themselves, and their realization probability
functional within a given time interval arises from the interplay
between the deterministic parts of dynamics and noise statistics.
The stochastic Lagrangian is a tool to formulate realization probabilities
via functional integrals, once the statistics of noises involved in
the stochastic dynamical equations is known. In principle, it allows
to highlight the invariance properties of the statistical dynamics
of the system.
In this work, after a review of the stochastic Lagrangian formalism,
some applications of it to physically relevant cases are illustrated.
After Isaac Newton’s big work about “the motion of bodies” [1],
it was understood that physical systems are generally governed by
equations of motion, expressing the variability with time of
the state of the system in terms of the state of what
acts on the system. When one writes F=ma, the time
variability dtdp of “the state of motion” of
the pointlike particle p (namely, its momentum) is put in
relationship with “the force” F as
[TABLE]
and this force is a function of the state of what acts on the
system. As an example, think about Kepler’s problem, in which the
point of mass m at position x undergoes the gravitational
force exerted by the presence of another body of mass M at position
x⊙, according to the law
[TABLE]
the state of what acts on the pointlike particle is described indeed
by the force F=∣x⊙−x∣3GMm(x⊙−x),
containing the value of the gravitational mass M of the second
body and its relative position with respect to the point particle
of momentum p, i.e. x⊙−x. By the way,
notice that in the force the state of the system itself appears as
x, rendering (1) and (2)
proper differential equations for the state of the system ψ=(x,p).
As Newton’s equation F=ma is re-written considering
also the relationship between momentum and velocity, one writes
[TABLE]
here it is stressed that the “force” is a function of the system
state, of time t and of the “state of the environment” acting
on the system, indicated as Xenv. A system as (3)
is properly a system of ordinary differential equations (ODE),
in which the variability of the state ψ, of the portion of universe
we are intersted in, is put in relationship with how this portion
of universe interacts with the environment. Similar systems of ODEs
may be written as:
[TABLE]
In such a formulation, the system is described by a state ψ∈V,
where V is a certain mathematical ambient through which
the state ψ moves as time flows; V is referred to
as the phase space of the system.
Calculus teaches us that when ODEs as (4)
are equipped with some *initial condition *ψ(t0)=ψ0,
the problem
[TABLE]
admits a unique solution ψ(t)=Ψφ(t,ψ0)
for all t≥t0. To be honest, this happens only if the Cauchy
Problem (5) has the expression φ(ψ,t,Xenv)
that has particularly favorable conditions with respect to ψ,
which we assume to happen “always” (however, see [2]
for a not-that-problematic counterexample). From here on, the statement
[TABLE]
means that there exists a (suitably regular) map Ψφ:R×V↦V
depending on the dynamics φ, associating the state ψ(t)
to the initial condition ψ(t0) in an injective
way. This map Ψφ is what one calls evolution.
As a note, let us introduce here the system velocity spaceW, or dynamical flow space, so that φ(ψ,t,Xenv)∈W
and φ:V×R×Venv↦W
(here Venv is the phase space of the environment
acting on the system).
Uniqueness of the solution of initial value problems (5)
nourishes the Deterministic Paradigm (DP), according to which
once a system’s initial conditions are given, its future history will
be completely determined, as long as what acts on it is known for
all the future times. This is strongly accepted all through the Classical
Physics, and after all it is true also in Quantum Mechanics [3],
just considering the ODE (4) to
be Schrödinger equation iℏdtd∣ψ⟩=H∣ψ⟩,
according to which the motion of the quantum state ∣ψ⟩
is a perfectly deterministic trajectory through the quantum state
Hilbert space HS.
The intelligent criticism to the DP (that actually turns out to be
a generalization of the DP itself) must be based on the observation
that, in order for (5) to have a unique solution,
the quantities Xenv and ψ0 appearing there,
and the full mathematical construction of φ(ψ,t,Xenv),
must be known perfectly, i.e. with no uncertainty.
For instance, as the initial condition ψ0 is not known perfectly,
instead one knows just ψ0 to belong to some subset A0⊆V
of the phase space V, as
[TABLE]
one must admit the state of the system at time t≥t0 to be
any stateΨ(t,ψ(t0)) for
anyψ(t0)∈A0, i.e.:
[TABLE]
being A0 a finite size set111Stating anything about the “size” of a set in V has
not sense until a proper definition of “size”, or better “measure”,
is defined on V, which hasn’t been done, and won’t be
done, here. Of course, if V is a metric space, the built-in
distance d:V×V↦R+ may
be used to define the size of A⊆V, as ℓ(A)=max(d(x,y))/x,y∈A,
and this is also true if a measure is defined on V, as
when it is treated as the sample space of some probability. In the
latter case, if one is able to define a probability density on V,
a physically useful measure of A could be the Shannon entropy associated
to this probability relative to all the points in A: again, this
is beyond the scope of the present exposition., and so the set Ψφ(t,A0). Put in a simpler
way, with a coarse grained knowledge of the initial condition as ψ(t0)∈A0
instead of the sharp ψ(t0)=ψ0, one has to
get content with a coarse grained knowledge of the state ψ(t)
at later times. The uncertainty in the initial conditions will render
uncertain the evolution of the system, setting a natural limit to
the DP due to our finite precision and to how fast a finite size initial
condition set A0 may be deformed by the dynamics in (5):
the whole querelle between the DP and chaos theory comes
precisely from the capacity of non-linear dynamics to deform the initial
condition set A0 and diffuse it all over extended and complicated
regions of the phase space, rendering the evolution unpredictable
to an arbitrarily high precision.
In this work we will examine a different “limit of the DP”: namely,
we will deal with what happens when some elements of the mathematical
expression of φ are known only to some statistical
extent, i.e. when this mathematical function of ψ, t and
Xenv has terms whose exact values is unknown, and of
which one can only state they appear according to some given probability
distribution. As a simple example, consider ψ to be a real
variable, and the dynamics φ to read
[TABLE]
where K is a coefficient, and the time-dependent term γ(t)
is an addendum that is “extracted from some real set Ξ” at
each t>t0, according to the probability distribution function
Pt. A situation as that written in (8)
represents the case in which the dynamics φ is obtained,
at each time, by a first “completely known” term Kψ2
plus some term γ about which one only knows that it can be
a value within the set Ξ: as nothing sharper can be stated on
γ, the value γ(t) comes randomly
withinΞ at each different time, with a probability Pt(γ∗)δ
to fall in any interval [γ∗−2δ,γ∗+2δ].
Terms as the γ(t) in (8)
are referred to as stochastic terms or, more simply, noises.
In general, “noise terms” may appear in a variety of ways in the
dynamics φ, due to different “physical” reasons: typically,
as one distinguishes the dynamical variables ψ assigning the
state of the system from “everything else” encoded in Xenv,
noise terms will sensibly describe the degree of uncertainty about
Xenv, which is the other possible source of uncertainty
in (5), next to the initial conditions. For
instance, in (8) one might imagine that
ψ self-evolves with Kψ2, and undergoes the action of
the “random kicks” γ.
Considering that noise terms typically come from what one refers to
as “environment”, the general form of (8)
may read:
[TABLE]
In the presence of noise terms, the equation (4)
is named stochastic dynamical equation (SDE). The stochastic
version of (5) will read:
[TABLE]
The most relevant fact passing from (5) to
(10) is that the uniqueness of
the solution is lost, even in the presence of perfect knowledge of
the initial condition ψ(t0)=ψ0. Indeed,
depending on what point in Ξ is picked at each time t to play
the role of γ(t), one has different possible
curves
[TABLE]
hence *a different “histories” *φ(ψ,t,Xenv(γ(t)))
of the dynamics of the system. Since the noises γ may describe
any continuous trajectory for t∈[ti,tf],
as illustrated in the cartoon of Figure 1,
also the corresponding dynamics φ(ψ,t,Xenv(γ(t)))
will have any continuous shape as a curve in W,
see Figure 2. This explains in a pictorial
way the fail of uniqueness of the solution to the stochastic Cauchy
problem (10): the randomness transits
from noises γ to dynamics φ, and from dynamics to
the solution of the Cauchy problem ψ(t), see Figure
3 for the final step of the pictorial explanation.
Note that, even if all of the three solutions to (10)
start at the same initial value, they develop very differently and
the uniqueness invoked by DP is lost.
The random nature of each term γ(t) in (9)
and (10) renders it useful to consider
the sample space Σ(ti,tf)
of all the possible continuous trajectories γ(t)
in Ξ with t∈[ti,tf],
i.e. the sample space of all the possible realizations of the stochastic
process γ:R↦Ξ. The set Σ(ti,tf)
is infinite dimensional, namely Σ(ti,tf)⊆C(R,Ξ);
a probability measure defined on it, should be possibly based upon
the functional counterpart of what an ordinary probability distribution
function is for finite dimensional sample spaces, P[γ;ti,tf).
In this script
[TABLE]
the square bracket left to γ means that P depends
on the infinite number of values γ(t) (one per
each t∈[ti,tf]), while the
round bracket right after tf indicates that P
depends also on the two real variables ti and tf.
The P[γ;ti,tf)
is referred to as realization probability functional (RPF)
of γ, and its normalization condition reads:
[TABLE]
In this (11) the symbol [dγ]
indicates the *functional integral on *Σ(ti,tf)
of the stochastic terms, namely the continuous product:
[TABLE]
(see [4] for a thorough explanation of this). About
the RPF P[γ;ti,tf),
it must be stressed that this will be expressable as the continuous
product of all the Pt(γ)dt (see (9)
and (10)), for t∈[ti,tf],
only for time-δ-correlated noises, see § 2.2:
assigning the functional P[γ;ti,tf)
is then more powerful than giving all the time-local PDFs Pt(γ),
because P[γ;ti,tf)
contains any type of correlation among noises, that the collection
of the PDFs does not, and indeed the latter is equivalent to the RPF
only for the δ-correlated stochastic variables. In fully mathematical
terms, one could write:
[TABLE]
In what follows, the program is to start with some particular form
of the SDE in (10), in which one will
understand how to pass from the ensemble statistics of noises γ(t)
to that of the system realizations ψ(t), considering
the interplay between the deterministic and the stochastic features
of φ:
[TABLE]
where A[ψ;ti,tf)
is the RPF of the stochastic process ψ:[ti,tf]↦V,
while the script ↦φ means that this pass
is done thanks to the form of φ.
For the types of SDE we are going to work with, the scheme to perform
the transition (12) includes the introduction
of the concept of stochastic Lagrangian (SL), that will be
introduced in § 2. Then, in § 3 the
machinery will be applied to some particular case of physical relevance.
2 Functional Formalism for Statistical Dynamics
The mathematical quantity ψ, that describes the state of the
system, is in general given by a certain number of components, indicated
as ψI, so locally V will be described as some
Rn (or Cn) space, as I=1,...,n. Remarkably,
an infinite dimensional state may well be necessary, e.g. in a classical
continuum theory, in which the system is described by the local properties
of the continuum, so that the index I of ψ will rather be
some continuous position x in the three dimensional space.
The functional formalism is described here for a general finite
dimensional system, while in [5, 6, 7, 8]
an infinite dimensional system, namely magneto-hydrodynamics, is considered.
Let’s go back to the SDE in (10), and
consider a particular class of functions φ(ψ,t,Xenv(γ(t)))
of the noises γ. Let us assume that the noises enter the dynamical
equation (DE) of ψ both in an additive form, as (8),
and in a multiplicative way. In particular, as we assume the phase
space V to be n-dimensional, let us consider two
stochastic n-vectors, one of components fI and the other
of components gJ, whose statistics is assigned through their
RPF P[f,g;ti,tf),
and appearing in the SDE of ψ as:
[TABLE]
In (13) one has the co-existence of deterministic
expressions, depending on ψ
[TABLE]
with that of the *noise terms *f and g, of which one supposes
to know the statistics via P[f,g;ti,tf).
Equation (13) might appear rather particular
and simple, as a form of SDE: it is clearly more complicated than
the simplest case (8), but one may well
think more involuted expressions, as for instance
[TABLE]
being V=R for simplicity. The point is that:
•
the “simple” form (13) is the one allowing
for the formalism of stochastic Lagrangian, that is of interest here;
•
however, as well explained in [5, 6, 7, 8],
and in [9], important realistic systems as space plasmas
may be described by (13).
Our work will then concentrate on the stochastic Cauchy problem:
[TABLE]
As the paper [10] by Phythian is the first paper introducing
this formalism, we will refer to (13) as Langevin-Phythian
Equation (LPE) (as far as the Author is aware of, the formalism of
the stochastic Lagrangian222Please note that the binomial “stochastic Lagrangian” is used,
in the fluid dynamics literature, as a couple of adjectives characterizing
the Lagrangian, i.e. material, description of a fluid in the presence
of stochastic terms, while here “stochastic Lagrangian” is not
a couple of adjectives, because they mean “the Lagrangian function
of a stochastic theory”, hence “stochastic” is the attribution
of the noun “Lagrangian”. Indeed, the term “Lagrangian” means
the same thing as in Quantum Field Theory, not in Fluid Dynamics! has been introduced in [10] and in [11]).
Before going ahead, notice that in the SDE (13),
and in the system (14),
noises seem to appear “directly” without the mediation of the
environmental variables Xenv in (10):
this is only an appearance, as the reader may get convinced of going
through [5]. In that case, e.g., noise terms are
identified with the terms −∂×(ζ⋅J),
ρJ and −ρ∂p, being
ζ the plasma conductivity tensor, J the electric current,
ρ the plasma mass density and p the plasma pressure: as the
state of the system ψ=(V,B) included,
there, the plasma bulk velocity and the magnetic induction vector,
the quantities ζ, J, ρ and p could be understood
as “environmental variables” forcing ψ. Following this suggestion,
one forms noises with quantities determined by the microscopic nature
of the continuum, the microscopic stochastically treated degrees
of freedom (μSTDoF, see also [12]
for this concept) of which are the “environment” for the otherwise
isolated fluid variable system.
The dynamics governed by conditions (14)
will produce a multi-history evolution for ψ: as pictorially
indicated in Figure 3, the system history between
ti and tf is “statistically distributed”
along many elements of Σ(ti,tf),
according to the RPF A[ψ;ti,tf)
that will depend on the noise RPF P[f,g;ti,tf)
via
[TABLE]
that is the version of (12) adapted to the dynamics
(14). The theoretical program
we want to pursue here is to calculate the map just mimicked in (15),
i.e. to obtain a (closed as possible) expression of the RPF of ψ
from that of the RPF of the noises f and g, and the deterministic
parts Λ and Γ of (14)
(precisely, the competition between chance and necessity about
which Haken speaks in his book [13] about “Synergetics”).
Once the RPF A[ψ;ti,tf)
is given, the following program may be realized:
evaluate any statistical quantity ⟨F⟩,
for any functional F[ψ], on the ensemble Σ(ti,tf)
of trajectories through V admitted for the dynamics (14);
2. 2.
calculate the transition probability Pψi↦ψf(ti,tf)
for the system from an initial condition ψi=ψ(ti)
and a final one ψf=ψ(tf).
Both any ⟨F⟩ and the transition probability
Pψi↦ψf(ti,tf)
may be expressed in terms of A[ψ;ti,tf).
The quantity ⟨F⟩ in Point 1
is calculated as
[TABLE]
where it is intended that F[ψ] depends on the whole
trajectory ψ(t) for t∈[ti,tf]:
actually, the prescription (16) may work
also for a time-local functionF(ψ), that
can always be expressed as an integration with the presence of some
δ(t−t^), provided t^ is the instant
of interest to calculate the function.
The transition probability Pψi↦ψf(ti,tf)
may be calculated as the integration of A[ψ;ti,tf)
on all the possible configurations ψ(t)/t∈(ti,tf),
while the initial and final configurations ψ(ti)
and ψ(tf) are not integrated on, and
kept fixed to the values of interest ψi and ψf
respectively. One may write
[TABLE]
This may be also written as
[TABLE]
2.1 Stochastic Lagrangian
The program stated in Points 1 and
2 before needs the knowledge of A[ψ;ti,tf):
in this work the RPF of ψ is represented through a time-local
function \mathchar32L(dtdψ,ψ),
so that
[TABLE]
Clearly, the term N0(ti,tf)
is a normalization factor, so that the condition
[TABLE]
holds. The time-local function \mathchar32L(dtdψ,ψ)
is referred to as stochastic Lagrangian (SL).
Since the relationship between this \mathchar32L(dtdψ,ψ)
and the RPF A of the stochastic process is the same that
one has between the classical Lagrangian function and the quantum
amplitude in Quantum Mechanics [14], also here
one may expect the “symmetries” of \mathchar32L
to turn into invariances of the system statistics. One has to add
that, for real ψ, as transition probabilities Pψi↦ψf(ti,tf)
must be real positive numbers, the quantity \mathchar32L(dtdψ,ψ)
must be an imaginary number, as it happens indeed in real cases
[5, 7].
About the stochastic Lagrangian \mathchar32L(dtdψ,ψ)
one has to stress that it is a function intrinsically different
from what one calls “Lagrangian” in Analytical Mechanics: indeed,
this Lagrangian \mathchar32L(dtdψ,ψ),
contains the maximum time-derivative appearing in the ODE of the system
(13), that is an intrinsically first
order ODE, while the Lagrangian of Analytical Mechanics L(dtdq,q)
does not contain the maximum order time-derivative of the evolution
equation of the system, as Euler-Lagrange equations dtd(∂q˙∂L)−∂q∂L=0
are* second order ODEs* (here the dot means time-derivative,
as q˙=defdtdq
and ψ˙=defdtdψ). In §
3.1, where the case of a point
particle undergoing classical and stochastic forces is treated, the
difference between \mathchar32L
and the would-be-Lagrangian L of the point prticle without
noise is apparent: indeed, where the traditional Lagrangian would
be L(x˙,x), a function of position
and velocity, the stochastic Lagrangian is some more complicated function
\mathchar32L(x˙,p˙,x,p),
depending on position, velocity, momentum and momentum derivative.
Last difference we need to stress between the Analytical Mechanics
Lagrangian L and the stochastic Lagrangian \mathchar32L
is that, while L contains “all the Physics” of the system only
for conservative systems, \mathchar32L
encodes the whole Physics of any stochastic system in the form
(13), regardless it is conservative (Hamiltonian,
see § 3.2.1), or dissipative (metriplectic,
see § 3.2.2).
It is also of use to give the definition of stochastic action\underaccent\mathchar24S[ψ,ti,tf)
as the time-integral of the Lagrangian \mathchar32L:
[TABLE]
so that A=N0exp(−i\underaccent\mathchar24S).
The caveat to make no confusion between this \underaccent\mathchar24S
and the action of Analytical Mechanics is the same one as that of
not confusing the stochastic \mathchar32L
and the mechanical Lagrangian L discussed just now.
The construction of the SL is described in [10], and
inspired by the previous literature cited therein. In particular,
thanks to the mathematical nature of the LPE (13),
it is possible to introduce the kernel A[ψ;ti,tf)
starting from the definition of an ensemble statistical average ⟨F⟩,
based on noise statistics
[TABLE]
as noises represent the only element giving to (14)
a stochastic character. The definition of the average ⟨F⟩f,g
is obviously
[TABLE]
our program is to obtain an expression of A so that
[TABLE]
The first step taken in [10] to obtain A
satisfying (22), is to consider n auxiliary variables
χI=1,...,n that in a sense represent the Fourier momenta
conjugated with the additive noises fI=1,...,n in the functional
space Σ(ti,tf). Then, an
auxiliary kernel, depending on ψ and χ is defined as:
[TABLE]
In (23) the quantity A0(ti,tf)
is a normalization factor, so that
[TABLE]
the factor C[χ,Γ;ti,tf)
is the term in which the noise statistics ends up being encoded, as
[TABLE]
Once the auxiliary kernel A[ψ,χ;ti,tf)
is constructed, the definition of the physical kernel A[ψ;ti,tf)
reads simply:
[TABLE]
The calculation indicated in (24) is nothing
but obvious, its feasibility is definitely not for grant. Indeed,
summing over all the possible histories χ(t)/t∈[ti,tf]
is still matter of being able to do a functional integration, that
we know is not an easy task in general. In § 2.2
we will see this integration is rather tractable if noises f and
g are Gaussian fluctuations δ-correlated in time, as in
(25).
The great effort is then re-casting the RPF defined as (24)
and (23) in the Lagrangian form (19).
2.2 δ-correlated Gaussian noise
The example in which the calculations indicated in (23)
and (24) are completely, and rather easily,
feasible, is that in which f(t) and g(t)
are time-δ-correlated noises with t-local Gaussian PDF.
Let us assume that the two noises f and g have t-local probability
density functions
[TABLE]
As noises taken at different times are independent of each other,
the whole RPF is a continuous product of the time-local PDFs, i.e.:
[TABLE]
(in the expression (29) the
emergence of the integration ∫titfdt...
in the argument of the exponential, as well as the power 2dt
in the normalization factor, come from the continuous product, see
[4]). About the coefficients λ and μ
in (25) and (29),
one has to note that their relationship with the standard deviation
σf and σg of noises reads:
[TABLE]
(all the quantities in (30) are Rn,n
matrices). A further assumption necessary for (29)
to represent the correct RPF of those noises, is that f and g
are uncorrelated with each other, i.e.:
[TABLE]
The RPF (29) gives rise to
the average over noises:
[TABLE]
Without loss of generality one may put
[TABLE]
so that the factor C[χ,Γ;ti,tf)
defined in (23) is calculated as follows:
[TABLE]
(in these scripts, repeated nearby indices mean to be summed over).
This (32) is usefully re-written separating the functional
depending only on ψ from the one depending also on χ:
[TABLE]
We are now in the position of calculating A[ψ,χ;ti,tf)
by using this expression in (23), as done
in § A.2, with the important (and not
generality-losing) assumption (31). Moreover, a suitable
symmetric T(V)⊗T(V)
tensor
[TABLE]
may be defined, that is easily shown to be positive definite
[TABLE]
where the symbol ∥τ∥ is the determinant
of the matrix of τ. Moreover, a vector β in V
is defined as:
[TABLE]
so that all in all one remains with:
[TABLE]
that is basically something Gaussian in the χ variables333The fact that this A[ψ,χ;ti,tf)
is Gaussian in χ is not at all a surprise: indeed, the parts
depending on χ in (23) are: the factor
e−i∫titfdt[ψ˙IχI−ΛI(ψ)χI−2i∂ψI∂ΛI(ψ)],
the exponential of a linear composition of the χs, and the C[χ,Γ;ti,tf),
i.e. the functional Fourier transform of the Gaussian RPF P[f,g;ti,tf),
i.e. a Gaussian again.. The result (98) in § A.2
is interesting because, as the definition of J in (33)
is considered
[TABLE]
it is possible to define a total SL
[TABLE]
This is used as:
[TABLE]
The SL of the problem (14)
with f and g time-δ-correlated Gaussian noises, so that
A=N0e−i∫titf\mathchar32Ldt,
is the function (39).
In what follows, relevant examples of system stirred by time-δ-correlated
Gaussian noises will be described, making reference to the SL (39).
3 Examples
In this Section some mechanical examples are reported, to show few,
physically relevant cases in which the construction of A
and \mathchar32L is performed completely.
These are: the case of a pointlike particle of classical mechanics,
that of Hamiltonian and metriplectic systems, and that of a general
Leibniz dynamics. Only in the very simple case of the particle undergoing
a deterministic viscous force, plus noise, the functional measure
[dψ] appearing in (16)
and (17) is calculated explicitly.
For sake of feasibility, all the noises in the examples are supposed
to be time-δ-correlated and Gaussian.
From the examples mentioned here, the reader should learn at least
a general idea about the meaning and use of SLs; however, a more complex
and practical application of this formulation may be found in papers
[5, 6, 15, 7, 8, 9, 4],
where the same formulation described here is applied to space physics
and geophysical examples.
3.1 Point Particle with noise
Consider a point particle described by its position x and
its momentum p, with the relationship p=mdtdx.
Consider also Newton’s DE
[TABLE]
where the force vector F(x,p,t) may
be given by “any” force law. In general, the convenient thing
is to consider an F given by the sum of a pure deterministic,
smooth addendum Φ(x,p,t),
plus a noise terms ξ(t), the statistics of which
is going to be specified in a moment:
[TABLE]
The equations of motion read:
[TABLE]
In equations (41), one has six components
of the state vector
[TABLE]
while the independent additive noises are only three ones, the ξ
components. In order to be cautiously compliant with the scheme of
[10], let us introduce an auxiliary noise term η(t),
so that the ODEs (41) become
[TABLE]
the complete equivalence between these (43)
with the ODEs (41) will be weakly
restored, in the sense that the noises η will be supposed
to have their PDF δ-like peaked in zero identically, i.e.
for all times t.
The first thing to do in order to apply what described in § 2.2
to the system (43), one has to
construct the Lagrangian (39).
As no multiplicative noise exists in the example (43),
one can adapt what obtained before to systems with pure additive
noise just putting everywhere:
[TABLE]
This reduces the SL to:
[TABLE]
Also, the expressions of β and τ must be modified according
to (44): the definition of β in (36)
reads:
The set of necessary functions to write the SL in (47)
is completed with the T(V)-vector
Λ collecting the deterministic part of the dynamics, that
from (43) reads:
In order to reproduce the system (41)
one has to consider an identically zero noise η, which
is realized in (50) “simply” considering
the δ-family444In the formulae (51) and (52),
and in similar formulae below, the “limit” is to be intended in
the distributional sense, or weak limit: limα→α0φ(x,α)=φ0(x)
means
limα→α0∫dxφ(x,α)f(x)=∫dxφ0(x)f(x)∀f,
with f being a function in the suitable space rendering each ∫dxφ(x,α)f(x)
and ∫dxφ0(x)f(x) finite.
[TABLE]
so that we are interested in:
[TABLE]
being N0(ti,tf)
the suitable, necessary normalization constant, generally different
from the N0(ti,tf) in (50).
Note that in (52) the symbol δ[dtdx−mp]
is a functionalδ, being effective at any time t∈[ti,tf],
so that the prescription dtdx=mp
will be enforced as the integration over [d3x], or
over [d3p], takes place.
The example of Φ one might consider reads:
[TABLE]
This force is the sum of a gradient force −∂V(x),
as in classical gravitation, elasticity or electrostatics, a Lorenz
force mqp×B(x) mimicking
the effect of a magnetic field B on a particle of electric
chage q, and a dissipative, viscous friction force −mζp.
Considering the calculations in § B.1,
one may write
The factor N0(ti,tf)e2m3ζ(tf−ti)
may be put all together defining M0(ζ,m;ti,tf)=defN0(ti,tf)e2m3ζ(tf−ti),
so we may write:
[TABLE]
(in (55) the components Φh
are those forming the vector Φ in (53))
Some comments are needed about this result: the exponential
[TABLE]
will, again, converge to a Dirac δ functional of the argument
p˙h−Φh(x,p,t) as σ(ξh)
tends to zero, i.e. as the noise vector ξ is distributed
according to a more and more peaked Gaussian, provided a suitable
term O(σ(ξh)1)
is admitted in the normalization factor M0(ζ,m;ti,tf).
In other words: in the “classical limit” σ(ξh)→0,
in which one has the certainty of having zero ξ, the RPF
à la Phythian of the stochastic trajectories admitted by (41)
converges to the certainty that the trajectory will obey the “silent555“Silent” here means “without noise”, classical, fully
deterministic. ODEs” dtdx=mp and dtdp=Φ.
Before going to more examples of stochastic
systems, we would like to repeat, through the formalism introduced
here, the calculations reported in § 6.1 of the book [14],
and make an explicit calculation of the functional measure[dψ] in the case of the point particle undergoing
additive noise. We will do this for the simplified case in which Φ
is a pure viscous force, i.e. Φ=−mζp,
so the ODEs of the system read:
[TABLE]
The kernel in use reads:
[TABLE]
The term m2ζλ(i)p˙ipi
in the integral can be simplified:
[TABLE]
so one re-writes:
[TABLE]
This kernel is the central quantity allowing for the discussion of
the statistical dynamics of the system (56),
as it represents the statisical weight of the realization
[TABLE]
chosen in the integrand
[TABLE]
The current use of the functional A[x,p;ti,tf)
needs the determination of the constant N~0(ζ;ti,tf),
that is prescribed in order to obtain:
[TABLE]
so that its value is determined as:
[TABLE]
As in the kernel in (57), and hence in
(59), no dependence on x appears
in the integrand, the integration
is to make use of time-discretization as done by Feynman and Hibbs
in their book [14], in which they subdivide
the interval [ti,tf] into ϵ-long
pieces
[TABLE]
The full integration ∫[dp] may be represented
as
[TABLE]
while the integration in the exponential in (60)
is discretized as:
[TABLE]
Defining in a closed form the functional measure ∫[dp]
precisely means finding sensible expressions for the factors Wp(k)
in (62).
Feynman and Hibbs suggest the way to interpret the quantity p˙i(k)
as:
[TABLE]
while the quantity pi(k) is better approximated as
[TABLE]
The replacements (64) and (65) are
peformed, and calculations are done in § B.1.
The integral in (63) reads:
[TABLE]
In anticipation of calculating the limit ϵ→0 invoked
in (62), the expressions calculated in (100)
are reported neglecting o(ϵ), so that one may
re-write:
[TABLE]
giving rise to the following approximation for the term \underaccent\mathchar24Sp
in (66):
apart from the integration over pi(ti),
because it contains also the factor emζλ(i)pi2(ti)
inherited from (57). So, it is useful to
separate the integration over the initial momentum pi(ti)
from all the other ones in (69):
[TABLE]
The quantity J0 reads:
[TABLE]
The integral Ik instead reads:
[TABLE]
It also the case to calculate I0 as:
[TABLE]
All in all, one may write:
[TABLE]
where one has given the definition:
[TABLE]
Once the quantity N~0−1(ζ;ti,tf)
has been calculated as done in § B.2, and represented
as in (73), one may write:
[TABLE]
where the definition Vp(t)=def∫d3p(t)
is intended. If N~0(ζ;ti,tf)
in inserted in the first expression in (74), one has
The relationship (76) may
be of use when ensemble calculations are to be performed via A[x,p;ti,tf).
Going back to (74), if one wants to keep finite N~0(ζ;ti,tf),
the factor ∏k=0NWp(k) must be
chosen accordingly. Provided one defines
[TABLE]
as suggested in § B.3, the factor N~0
remains finite. This amounts to defining the functional measure:
[TABLE]
so that a concrete nature to the definition (62)
is given (in (78) the finite constant
n0≡N~0, used in § B.3,
has been put equal to 1, that means: the necessity of a normalization
factor for A[x,p;ti,tf)
has been reabsorbed into the functional measure [dp]).
3.2 Leibniz systems
Leibniz systems are dynamical systems whose ODEs may be expressed
in tensor terms as
[TABLE]
where the quantity F is a function from V to R
indicated as Leibniz-generator of the motion, while T∈T(V)⊗T(V)
is a tensor, referred to as Leibniz tensor [16].
Suppose to deal with a stochastic dynamics, the deterministic part
of which is (79), i.e. some version
of (14) with
[TABLE]
Provided the SDE reads
[TABLE]
and f and g are Gaussian time-δ-correlated noises with
the statistics described in § 2.2, one
must work with the functionals (40). The
calculations needed are shown in § C,
and one has:
[TABLE]
The more tractable version of (81) is the one with additive
noises only: to obtain this case in the smoothest way possible one
may resort to (44) and to the relationship
(30). Then, one will write:
[TABLE]
In the following two §§, the Hamiltonian and the metriplectic systems
are treated, as particularly relevant cases of Leibniz systems.
3.2.1 Hamiltonian Systems
A Hamiltonian system is a DS as in (79),
with TIJ being a Jocobi tensor [17];
moreover, the dyamics generator F is what they call Hamiltonian
of the system, namely its energy:
[TABLE]
In the relationships (81) and (82),
this means in practical that TIJ=−TJI, and that TSIJ=0;
besides this, TIJ will be rather indicated as JIJ, i.e.
the Jacobi tensor. All in all, one may write the functional formalism
for the statistical dynamics of a general Hamiltonian system, stirred
by both additive f and multiplicative g Gaussian time-δ-correlated
noises, as follows:
[TABLE]
If only additive noises appear, then one has to adapt directly (82),
i.e.:
[TABLE]
3.2.2 Metriplectic Systems
Metriplectic systems are Leibniz systems in which the tensor T
is the sum of a Jacobi tensor J and a semimetric tensor G. Moreover,
the generator F takes the form of free energy function,
namely the sum of a Hamiltonian H and an entropy S, weighted
by some coefficient α: F=H+αS. The two tensors J
and G must have the particular following relationships with the
gradients of H and S:
[TABLE]
As explained thoroughly in [12] and references
therein, a metriplectic system is basically the algebrization of an
energetically closed, otherwise Hamiltonian system, to which some
dissipation is added, due to degrees of freedom whose entropy is represented
by S. All in all, one has energy conservation H˙=0 and
entropy growth S˙≥0, thanks to the ODEs:
[TABLE]
The stochastic version of the metriplectic system is simply written
as:
[TABLE]
Of course, if the noises f and g have completely general statistics,
then all the formalism in § 2.1 is
applicable, but we are interested to the simpler particular case of
time-δ-correlated Gaussian noises, so that what reported in
the general part of this § 3.2 is sufficient.
Particular reference is made to the relationship
[TABLE]
as the compatibility conditions (85)
are considered, the quantity −2i∂ψI∂[TIJ(ψ)∂ψJ∂F(ψ)]
will be calculated as:
[TABLE]
When the additive noise f and the multiplicative one g are both
present, the kernel of the metriplectic stochastic system reads:
[TABLE]
The kernel for the metriplectic system with purely additive,
Gaussian, time-δ-correlated noisef will instead read:
[TABLE]
4 Conclusions
The whole science about “stochastic processes” appears to advance
a criticism to the DP described in § 1, which
is, instead, an upgrade of it. Indeed, non-deterministic finite
dimensional systems, to which one resorts because of the only statistical
knowledge of some elements of the initial value problem, are promoted
to deterministic infinite-dimensional systems, as the mathematical
quantities used to described them are not their state variables ψ
but, rather, statistical distributions of ψ.
In statistical dynamics the central tool for such a theory are master
equations [18], i.e. the evolution equations for probability
distributions [13]. An alternative way to describe
the same problem, appearing even more powerful to the Author of this
paper, is that of functional formalism [10],
in which the goal is to calculate the probability that a certain particular
history ψ^(t) takes place between ti
and tf, what we refer to as RPF in this work. The program
of functional formalism in Statistical Dynamics is, then, to start
from the knowledge of the noise history probability P[γ;ti,tf)
and of the equations of motion of ψ, and find the history probability
A[ψ;ti,tf) of the
state variables ψ of the system, namely studying the map (12).
Under the particular condition that the SDE of the system is in the
form of the ODE in (10), i.e. the Langevin-Phythian
equation
[TABLE]
(f and g being noises of assigned RPF P[f,g;ti,tf)),
it is possible to obtain A[ψ;ti,tf)
in the form of imaginary exponential of a time-local function \mathchar32L(ψ˙(t),ψ(t),t),
referred to as the stochastic Lagrangian:
[TABLE]
As the whole statistical dynamics of such a system is constructed
by using A[ψ;ti,tf),
to some extent one could say it is encoded in \mathchar32L(ψ˙,ψ,t),
precisely as it happens for Quantum Mechanics when Feynman’s path
integral approach is adopted [14]. Once all
the ingredients in (91) are found, one will
use them to calculate “anything” of the ensemble statistics of
the system, e.g. averages, transition probabilities or correlations:
[TABLE]
This promises to be a very powerful tool.
Two big difficulties exist in obtaining things in (91),
and using them as in (92): first of all, as
underlined in § 2.1, obtaining the
RPF of ψ getting rid of the auxiliary variables χ is in
general a rather hard task, that appears to be “simple” only if
f and g are time-δ-correlated Gaussian noises, see §
2.2; this impeds to obtain A[ψ;ti,tf)
from the more immediately defined A[χ,ψ;ti,tf)
(this difficulty is partly circumvented by the recipes in [11],
where the kernel A[χ,ψ;ti,tf)
is used to calculate suskeptibilities and other statistical observables).
Moreover, in all the statistical quantities as those in (92),
there is the necessity of writing in a closed form the functional
measure[dψ], that is the celebrated big
difficulty of any branch of Physics making use of path integrals!
Here, we have made the explicit calculation of the functional measure
∫[dψ]...=def... only in the
very simple case in which a point particle of Newton’s mechanics undergoes
the action of a deterministic viscous force and a stochastic one ξ
(of course, Gaussian and time-δ-correlated!), so that its
SDE reads: dtdp=−mζp+ξ,
see § 3.1 from page 3.1
on, and all the big calculations in §§ B.2
and B.3.
As we have shown in §§ 3.1
and 3.2, the functional formalism of
the stochastic Lagrangian appears to be applicable to a very wide
range of physical problems, from Newton’s mechanics plus noise, to
Leibniz systems, included Hamiltonian or metriplectic systems, see
§§ 3.2.1 and 3.2.2.
Fluid dynamical and plasma physics examples are treated in the references
quoted here, and for sure much wider fields will be covered in the
future.
Acknowledgements
At the end of this work, I want to thank Giuseppe Consolini (INAF,
Italy) and Tom Chang (MIT, Kavli Institute, USA), who always encourage
me not to stop fantasizing about mathematical objects to describe
the beauty of the world we’re gifted to be living in.
Special thanks also go to Giorgio Longhi (University of Florence,
Italy) and Ruggero Vaia (CNR-ISC, Italy) for making me curious of
hacking functional integrals.
Last but not least, I feel indebted with Polish and Romanian people,
who are using the beautiful letters “Ł” and “Ş” respectively,
that I could use for the stochastic Lagrangian \mathchar32L
and stochastic action \underaccent\mathchar24S: mathematics is also
a way to acknowledge the wonderful attitude of mankind to create symbols
mimicking our voices.
∗e−∫titfdt∑I=1n[2μ(I)ΓIJ(ψ)∂ψH∂ΓIH(ψ)+ig0IΓIJ(ψ)+iδIJ(ψ˙I−ΛI(ψ))]χJ.
As the tensor in (34) is defined, with the
property (35), calculations may go ahead as follows:
[TABLE]
Due to the aforementioned nice property, the passage (24)
can be obtained rather easily, by making a functional integration
of the A[ψ,χ;ti,tf) in
(37) in [dχ]. This is done as follows:
The factor N[ψ,μ,λ;ti,tf)
just introduced in the foregoing passages is a correction that should
avoid mathematical non-senses when the calculations
[TABLE]
are performed, i.e. part of the functional measure [dχ].
One may continue as:
[TABLE]
The relationship on which one relies is obtained from the theory of
n-dimensional multi-variate Gaussian processes, with non-diagonal
covariance matrix
[TABLE]
in order to use (97) in (96)
one has simply to make the following identifications:
[TABLE]
so that one may write:
[TABLE]
This is placed into (96), and the calculation goes
ahead:
about this, as the matrix τ in (34)
appears to be positive definite, see also (35), what
we note in A[ψ;ti,tf)
is, first of all, that it contains a negative square term in the velocities
−41(τ−1)JHψ˙Jψ˙H,
that will help in the convergence of A[ψ;ti,tf)
and the determination of the functional measure [dψ].
In order to obtain the relationships (81) one starts
from:
[TABLE]
The only qualitative implication of having a Leibniz system in (101)
is the calculation of the term −2i∂ψI∂[TIJ(ψ)∂ψJ∂F(ψ)]:
this is performed as follows:
where what really matters in the second addendum is the
symmetric part of the Leibniz tensor, due to the symmetric nature
of the Hessian ∂ψI∂ψK∂2F(ψ).
Bibliography18
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[1] Newton, I. Philosophiae Naturalis Principia Mathematica - Editio Tertia . G. & J. Innys, Royal Society, 1726.
2[2] Materassi, M., Collini, D., Barletti, L. Pack behaviour in inter-algal competition , in preparation.
3[3] Dirac P.A.M. The Principles of Quantum Mechanics . 3rd Edition, Oxford University Press, Amen House, London E.C.4 (1947).
4[4] Materassi, M. Stochastic Field Theory for the Ionospheric Fluctuations in Equatorial Spread F , Chaos, Solitons and Fractals 121 (2019) 186–210.
5[5] Materassi, M., Consolini, G., Turning the resistive MHD into a stochastic field theory , (2008) Nonlinear Processes in Geophysics, 15 (4), pp. 701-709.
6[6] Materassi, M., Stochastic Lagrangian for the 2D Visco-Resistive Magneto-Hydrodynamics , Plasma Phys. Control. Fusion 52 (2010) 075004.
7[7] Materassi M., G. Consolini, The Stochastic Tetrad MHD via Functional Formalism , J. Plasma Phys.(2015), vol. 81 , 495810602, Cambridge University Press 2015, doi:10.1017/S 0022377815001105.
8[8] Quattrociocchi, V., Consolini, G., Marcucci, M. F., Materassi, M., On Geometrical Invariants of Magnetic Field Gradient Tensor in Turbulent Space Plasmas: Scale Variability in the Inertial Range , The Astrophysical Journal, 878 (2), 2019.