Translation Invariant Diffusions in the Space of Tempered Distributions
B. Rajeev
Indian Statistical Institute
8th Mile, Mysore Road
Bangalore - 560 059
email: [email protected]
Abstract
In this paper we prove existence and pathwise uniqueness
for a class of stochastic differential equations (with coefficients
σij,bi and initial condition y in the space of tempered
distributions) that maybe viewed as a generalisation of Ito’s
original equations with smooth coefficients . The solutions are
characterized as the translates of a finite dimensional diffusion
whose coefficients σij⋆y~,bi⋆y~
are assumed to be locally Lipshitz.Here ⋆ denotes convolution
and y~ is the distribution which on functions, is realised
by the formula y~(r):=y(−r) . The expected value of the
solution satisfies a non linear evolution equation which is related
to the forward Kolmogorov equation associated with the above finite
dimensional diffusion.
AMS Classification : 60H (Stochastic Analysis), 60J (Markov
Processes)
Key words and phrases : Stochastic ordinary differential
equations, Stochastic partial differential equations, non linear
evolution equations, translations, diffusions , Hermite-Sobolev
spaces, Monotonicity inequality.
1 Introduction
In this paper we consider a generalisation of Ito’s well known
equation viz.
[TABLE]
where σ=(σij)1≤i,j,≤d and b=(b1,⋯,bd), and where σij and bi
are functions on Rd and (Bt) is a given
d-dimensional Brownian motion ([7]). When σ,b are
smooth functions we can use Ito’s formula and the duality ⟨f,δXt⟩=f(Xt) for the random distribution
δXt∈S′ to arrive at the stochastic
differential equation satisfied by the S′-valued process
(Yt)≡(δXt) viz.
[TABLE]
Here A=(A1,⋯Ad) , with
L,Aj being non linear operators from S′ to S′
given by
[TABLE]
for all ϕ∈S′ where ⟨σ,ϕ⟩ij,⟨b,ϕ⟩i is the (i,j)th entry ⟨σij,ϕ⟩ (respectively ith entry
⟨bi,ϕ⟩ ) of the matrix ⟨σ,ϕ⟩ (respectively the vector ⟨b,ϕ⟩) and
the superscript t denotes transpose. The above equation (1.2)
maybe considered an equation for a system of diffusing particles
,the initial configuration being specified by a tempered
distribution y , with Ito’s original equation corresponding to
the position of a single diffusing particle (Xt). For a single
particle , the above equation in S′ is equivalent to Ito’s
original equation. This can be seen by action on both sides of (1.2)
by a test function f∈S and noting that the resulting
equation is the Ito formula for f(Xt), with Xt given by
equation (1.1). We also note that the Ito formula for f(Xt) is
the point of departure for the celebrated martingale formulation of
Stroock and Varadhan ([15]). For an alternate approach to
constructing (finite dimensional)‘symmetric’ diffusions, see
[2]
In Section 3, we construct solutions to the above equation in S′ when the coefficients σij,bi and the initial
configuration y are allowed to be elements in S′ subject
to the condition that the convolutions σij⋆y~,bi⋆y~ be locally Lipshitz functions on Rd(see Theorem 3.4, Remark 3.7 below). We may take σij and
bi to be arbitrary distributions - for example a Dirac
distribution - provided y is sufficiently smooth or conversely y
to be ‘arbitrary’ and σij,bi to be smooth. Our methods
also extend to the case when the coefficients of L and Ai are
non linear functions (see Remark 3.9). We use the countable
Hilbertian structure of S′ viz. the fact that S′=p∈R⋃Sp, where the Sp are Hilbert spaces, to formulate our equations
(\cite[cite][\@@bibrefKI2],\cite[cite][\@@bibrefKX]).The solutions (Yt(y)) of equation (1.2),
starting at y at time zero , are given explicitly as follows : Let
τx,x∈Rd denote the translation operators
acting on S′. Then Yt(y)=τzt(y)(y) where
(zt(y)) is a finite d- dimensional diffusion starting at zero and
whose drift and diffusion coefficients are given by the convolutions
viz. σij⋆y~ and bi⋆y~. Note
that since zt(y) depends on y, the translation
τzt(y)(y) is non linear (unlike the Brownian case
[13]). Existence and pathwise uniqueness of (Yt(y))are then
a consequence of existence and pathwise uniqueness for (zt(y))
and the Ito formula ( [12]). Our proof of the above
representation uses the uniqueness of solutions of equation (1.2)
with random operator coefficients
Aˉ(s,ω),Lˉ(s,ω) (see \cite[cite][\@@bibrefP],\cite[cite][\@@bibrefKR] )
that satisfy the monotonicity inequality (see [4] and
Theorem 3.5 below).
In Corollary(3.8), we show that Yt(y) has the translational
invariance property viz. the solution corresponding to an initial
condition which is a translate of y∈S′ by x∈Rd viz. Yt(τx(y)) is equal to τx+zt(τx(y))(y) and Xt(x,y):=x+zt(τx(y)) is the
solution of the SDE starting at x at t=0, which is satisfied by
(zt(y)) for x=0. Consequently, the state space Sp of
the flow generated by the solutions of (1.2) viz. (Yt(y)) splits
up into ‘irreducible’ components C(y):={τx(y);x∈Rd} on each of which (Yt(y)) behaves as the finite
dimensional diffusion (Xt(x,y)). This allows us to describe the
Markov properties of (Yt(y)) in terms of that of that of
(Xt(x,y)). We do this in Section 4.
In Section 5, Theorem 5.4, we study the deterministic non linear
evolution equation that results on taking expectations in equation
(1.2). We represent the solutions in terms of a ‘non linear
convolution’. These are related to the fundamental solutions of the
forward equation associated with the diffusion Xt(x,y) (see
Remark 5.5) We describe the latter in Theorem 5.6. This extends some
results of [14] to the case when the coefficients are non
smooth.
Physically, the form of the solutions viz. Yt=τzty
suggests a conservation law. Indeed, when the initial distribution
y is given by an integrable function y(r) and when the solution
Yt or equivalently zt is defined for all t≥0 i.e. there
is no explosion, then we have for all t≥0,
[TABLE]
The form of the
solution further suggests that the diffusion of a system of
particles according to (1.2), starting in an initial configuration
given say, by a function y=y(r), in a diffusive medium
represented by the coefficients σij and bi is
equivalent to a translational flow of the same initial
configuration, in a new medium - represented by coefficients
σij⋆y~ and bi⋆y~ - which is the
result of an interaction between the initial particles and the
original diffusive medium. We refer to [1] for some related
ideas.
2 Preliminaries
Let (Ω,F,{Ft}t≥0,P) be a filtered probability space
satisfying the usual conditions viz. 1) (Ω,F,P) is a complete probability space. 2)
F0 contains all A∈F, such that P(A)=0, and 3) Ft=s>t⋂Fs. On this probability space is given a standard d-dimensional
Ft- Brownian motion (Bt)≡(Bt1,…,Btd).
We will denote the filtration generated by (Bt) as (FtB). Consider now the equation (1.1). Let
σˉij,bˉi be locally Lipshitz functions on
Rd for i,j=1,⋯,d. Let σˉ:=(σˉij) and (bˉ):=(bˉ1,⋯,bˉd)
. We use the notation R^d=Rd∪{∞} for the one point compactification of Rd.
Theorem 2.1
*Let σˉ,bˉ,(Bt) be as above. Then
∃ η:Ω→(0,∞],η an (FtB) stopping time and an R^d-valued,
(FtB) adapted process (Xt)t≥0 such that
-
*For all ω∈Ω, X.(ω):[0,η(ω))→Rd, is continuous
and *Xt(ω)=∞, t≥η(ω)
2. 2.
*a.s. (P), η(ω)<∞ implies *t↑η(ω)limXt(ω)=∞.
3. 3.
a.s.(P),
[TABLE]
for 0≤t<η(ω).
The solution (Xt,η) is (pathwise)
unique i.e. if (Xt1,η1) is another solution then P{η=η1,Xt=Xt1,0≤t<η}=1.
Proof : We refer to [5], Chapter IV, for the proofs.
Theorem 2.3 for the proof of existence and Theorem 3.1 for the proof
of uniqueness .\hfill□
Let α,β∈Z+d:={(x1,⋯,xd):xi≥0,xiinteger}. Let xα:=x1α1…xdαd and ∂β:=∂1β1…∂dβd. For a multi index α, we use the notation ∣α∣:=i=1∑dαi. Let S denote the space
of rapidly decreasing smooth real functions on Rd with
the topology given by the family of semi norms
∧α,β,defined for f∈S and multi
indices α,β by ∧α,β(f):=xsup ∣xα∂βf(x)∣. Then {S,∧α,β}:α,β∈Z+d} is a Fréchet space. S′ will
denote its continuous dual. The duality between S and
S′ will be denoted by ⟨ψ,ϕ⟩ for
ϕ∈S and ψ∈S′. For x∈Rd the translation operators τx:S→S are defined as τxf(y):=f(y−x) for f∈S and then for ϕ∈S′ by duality : ⟨τxϕ,f⟩:=⟨ϕ,τ−xf⟩.Let {hk;k∈Z+d} be the orthonormal basis in the real Hilbert
space L2(Rd,dx)⊃S consisting of the
Hermite functions (see for eg. [16]); here dx denotes Lebesgue
measure. Let ⟨⋅,⋅⟩0 be the inner product
in L2(Rd,dx). For f∈S and p∈R define the inner product ⟨f,g⟩p on
S as follows :
[TABLE]
The
corresponding norm will be denoted by ∥⋅∥p. We define
the Hilbert space Sp as the completion of S with
respect to the norm ∥⋅∥p. The following basic relations
hold between the Sp spaces (see for eg. [8],
[9]): For 0<q<p, S⊂Sp⊂Sq⊂L2=S0⊂S−q⊂S−p⊂S′. Further,
S′=p∈R⋃Sp and
p∈R⋂Sp=S. If
{hkp:k∈Z+d} denotes the orthonormal basis in
Sp consisting of the (suitably normalised) Hermite
functions, then the dual space Sp′ maybe identified with
S−p , via the basis {hk−p:k∈Z+d} of S−p. For ϕ∈S and ψ∈S′ the bilinear form (ϕ,ψ)→⟨ψ,ϕ⟩ also gives the duality between Sp(⊃S) and S−p(⊂S′). It is also well
known that ∂i:Sp→Sp−21 are bounded linear operators for every p∈R and i=1,⋯,d. Suppose for i,j=1,⋯,d,
σij,bi∈S−p. Let σ:=(σij)1,≤i,j,≤d and b:=(b1⋯bd).We
can then define for j=1,⋯,d the non linear operators Aj,L:Sp→Sp−1 as follows:
[TABLE]
Here ⟨σ,ϕ⟩ is the matrix whose (i,j)th
entry ⟨σ,ϕ⟩ij is ⟨σij,ϕ⟩ , ⟨σ,ϕ⟩t its transpose
and ⟨b,ϕ⟩ is the vector whose ith entry
⟨b,ϕ⟩i is ⟨bi,ϕ⟩. The d-
tuple of operators (A1,…,Ad) will denoted by A.
3 Stochastic Differential Equations in S′
We now consider a stochastic partial differential equation in
S′ driven by the Brownian motion (Bt) and differential
operators Ai,i=1,⋯,d and L defined above with given
coefficients σij,bi,i,j=1,⋯d in the space S−p for some fixed p∈R and initial condition y∈Sp viz.
[TABLE]
Note that if (Yt) is an Sp valued ,locally
bounded , (Ft) adapted process then
Ai(Ys),i=1,⋯,d and L(Ys) are Sp−1 valued,
adapted , locally bounded processes and hence the stochastic
integrals ∫0tAi(Ys) dBsi and ∫0tL(Ys) ds are well
defined Sp−1 valued, continuous Ft adapted
processes and in addition, the former processes are Ft
local martingales . We then have the following definition of a
‘local’ strong solution of equation(3.4) .
Definition 3.1
Let p∈R. Let y∈Sp, σij,bi∈S−p and {Bt,Ft} the given standard
(Ft) Brownian motion. Let δ be an arbitrary
state,viewed as an isolated point of Sp^:=Sp∪{δ}. By an Sp^ valued, strong,
(local) solution of equation (3.4), we mean a pair (Yt(y),η)
where η:Ω→(0,∞] is an FtB
stop time and (Yt(y)) an
Sp^ valued (FtB)
adapted process such that
-
For all ω∈Ω, Y.(y,ω):[0,η(ω))→Sp is a continuous map
and
Yt(y,ω)=δ,t≥η(ω)
2. 2.
a.s. (P) the following equation holds in Sp−1 for 0≤t<η(ω),
[TABLE]
Remark 3.2
By usual stopping arguments, the Sp−1 valued
stochastic integrals on the right hand side in equation
(3.5) can be shown to be finite , almost surely , for 0≤t<η. Also , all functions f:Sp→R
are extended to functions on Sp^ by setting
f(δ):=0.
Definition 3.3
We say that pathwise uniqueness holds for equation (3.4) iff
given (Bt), a standard Brownian motion on a probability space
(Ω,F,P) , and given y∈Sp, σij,bi∈S−p, and any two Sp valued strong solutions
(Yti(y),ηi),i=1,2 of equation (3.4), we have P{Ys1(y)=Ys2(y),0≤s<η1∧η2}=1.**
Theorem 3.4
Let y∈Sp, σij,bi∈S−p and {Bt,Ft} the given standard
(Ft) Brownian motion. Suppose the Rd2
valued function σˉ(x):=(⟨σij,τxy⟩) and the Rd valued function bˉ(x)=(⟨bi,τxy⟩) are locally Lipshitz. Then equation
(3.4) has a unique strong solution.
To prove the uniqueness assertion in Theorem 3.4 , we need Lemma
(3.6) below . It is of independent interest since it characterises
the solutions of equation (3.4). The proof of the lemma in turn
depends on the so called ‘Monotonicity inequality’ which we now
state. Given real numbers σij,bi,i,j=1,⋯d, let
σ=(σij) and σt the transpose of σ.
We define the constant coefficient differential operators Aoj,j=1⋯d and Lo as follows :
[TABLE]
[TABLE]
Theorem 3.5
Let α>0.Then there exists a constant C=C(α,p,d)>0 depending only on α,p and d such that for all ϕ∈Sp,
and all σi,j,bi,i,j=1,⋯,d with α≥i,jmax{∣σi,j∣,∣bi∣}, we have
[TABLE]
Proof : See [4].
Lemma 3.6
Suppose (3.4) has an Sp-valued
strong solution (Yt(y),η). Define the continuous
semi-martingale in Rd as follows:
[TABLE]
Then
a.s P, Yt(y)=τzt(y)(y), for 0≤t<η .
Proof: We use the notation Yt,zt for Yt(y),zt(y)
respectively. We apply the Ito formula in Theorem 2.3 of [12]
to the process (τzty) to get the following equation in
Sp−1 :
[TABLE]
where
(Aˉj(s)),j=1⋯d and (Lˉ(s)) are operator
valued Ft adapted processes such that for j=1,⋯,d
and each (s,ω), Aˉj(s,ω)and Lˉ(s,ω)
are linear operators from Sp into Sp−1 ,
defined as follows :
[TABLE]
[TABLE]
Note that almost surely for 0≤t<η, (Yt) satisfies the same equation as (τzty) in
Sp−1 i.e.
[TABLE]
Let (σn)
be a sequence of (Ft) stopping times, such that almost
surely, 0≤σn<η , σn↑η and such
that for each n≥1, the processes
(⟨σij,Yσn∧s⟩)and (⟨bi,Yσn∧s⟩) are uniformly bounded for all i,j.
Define for n≥1, Xtn:=Yσn∧t−τzσn∧ty. Let for n\geq 1,$$C_{n} be the
constant obtained from Theorem 3.5 above applied to the operators
Aoi=Ai(s,ω),i=1,⋯,d and Lo=L(s,ω)
for fixed (s,ω) with s≤σn(ω),n≥1 and
[TABLE]
where the supremum is taken over 1≤i,j,≤d, s≤σn(ω) and ω∈Ω.We then have,
using integration by parts,
[TABLE]
where (Mtn) is a continuous
local martingale. The Gronwall inequality now implies that for
every t≥0 and n≥1, ⇒E∥Xt∧σnn∥p−12=0. It follows that Yt=τzt(y)
almost surely for 0≤t<η.\hfill□
Remark 3.7
For a function f we use the notation f~ to
denote the function f~(r):=f(−r). For a distribution y
we define the distribution y~ via duality i.e. ⟨y~,f⟩:=⟨y,f~⟩ ∀f∈S. We note that σijˉ(x):=⟨σij,τxy⟩=σij⋆y~(x) where
⋆ denotes convolution . It is well known that when
σij∈S′ and y∈S the convolution
σij⋆y is C∞(see Theorem 30.2,
[3]).
We have the following Corollary to Lemma 3.6.
Corollary 3.8
Let y∈Sp,σij,bi∈S−p,i,j=1,⋯,d and x∈Rd. Suppose
that the coefficients in equation (2.3) are given as σˉij=σij⋆y~ and bˉi=bi⋆y~. Let (Yt(τx(y)),η) be a solution
of equation (3.4) with initial condition τx(y). Let
(zt(τx(y))) be given by Lemma 3.6 with Yt=τzt(τx(y))(τx(y))=τx+zt(τx(y))(y),t<η. Then the process (x+zt(τx(y))) solves equation
(2.3) upto the random time η, with initial condition x.
**Proof ** Let Xt:=x+zt(τx(y)),t<η. We have
a.s. for t<η,
[TABLE]
Since σˉ(x)=⟨σ,τx(y)⟩ and bˉ=⟨b,τx(y)⟩, the
result follows from Theorem 2.1.\hfill□
Proof of Theorem 3.4 Assume that σˉij and bˉi are locally Lipschitz functions on Rd..
Let (zt(y),η) be a solution of equation (2.3). zt:=zt(y).
Define the Sp valued process
(Yt) as follows :
[TABLE]
Note that σˉij(zt)=⟨σij,τzty⟩=⟨σij,Yt⟩ and similarly bˉi(zt)=⟨bi,Yt⟩,t<η. Applying the Ito formula in Theorem 2.3 of
[12] to the process (τzty) as in the proof of Lemma
3.6, we get , almost surely for t<η,
[TABLE]
Hence (Yt,η) is a solution. Suppose (Yt1,η1) and
(Yt2,η2) are two solutions. Define the processes (zti)
as follows : For 0≤t<ηi,
[TABLE]
We take zti:=∞ for t>ηi.Then by Lemma 3.6
,Yi(t)=τzti(ϕ),t<ηi. But then by Cor (3.8),(zti,ηi),i=1,2 solves equation (2.3) with initial condition
x=0, upto ηi,i=1,2. The proof of uniqueness in Theorem
(3.1) of [5] can be applied here (with appropriate
modification to take care of the limit at η1∧η2) to
conclude that, almost surely , zt1=zt2 for 0≤t<η1∧η2. It follows that almost surely ,Yt1=Yt2 for 0≤t<η1∧η2.\hfill□
Remark 3.9
Instead of the linear functionals ⟨σij,.⟩,⟨bi,.⟩:Sp→R as coefficients of L and Ai, we could
take more general (non linear) functions σij, bi:Sp→R as the coefficients in L and the
Ai’s . Theorem 3.4 remains true provided σij(τxy),bi(τxy) are locally Lipschitz functions on Rd for
y∈Sp.
Consider the ordinary differential equation
[TABLE]
where b=(b1,⋯,bd) is a smooth vector
field on Rd. The following corollary generalises this
equation when the bi are tempered distributions on Rd.
Corollary 3.10
Suppose σ≡0,bi∈S−p,y∈Sp.
Then the unique solution of the first order evolution equation viz.
[TABLE]
is
given by {Yt,0≤t<η} where Yt:=τzty
for 0≤t<η and {zt,0≤t<η} is the unique
solution of the ordinary differential equation
[TABLE]
We can characterise the unique global solutions of equation (3.4) in
terms of the unique solutions of equation (2.3) described in Theorem
2.1.
Theorem 3.11
Let σij,bi,σˉij,bˉi,y
be as in Theorem 3.4. Then
there exists a solution (Yt(y),η) of equation (3.4) with the
property that Yt(y)=τzt(y)(y),t<η, where
(zt(y),η) is the unique solution of equation (2.3) given by
Theorem (2.1). In particular, almost surely, on the set {η<∞}, t↑ηlimzt(y)=∞. The
solution is pathwise unique, i.e. if (Yt1,η1) is another
such solution, then P{η=η1,Yt(y)=Yt1,0≤t<η}=1.
Proof : Let (zt(y),η) be the unique solution of
equation(2.3) given by Theorem(2.1) and define Yt(y):=τzt(y),t<η and Yt(y):=δ,t≥η. Then
as in the proof of Theorem(3.4), (Yt(y),η) is a solution of
equation(3.4). The uniqueness follows from the uniqueness of
(zt(y),η). \hfill□
The following proposition charactarises the global solutions of equation(3.4)
as processes in S′ . It shows that they are
better behaved than their finite dimensional counterparts. It
concerns the purely analytical behaviour of
τxy as ∣x∣→∞.
Proposition 3.12
Let σi,j,bi,y be as in Theorem 3.4, with y=0. Let
Yt(y)=τzt(y)y be the unique solution upto time η of
equation(3.4), where (zt(y)) solves equation(2.3) with x=0
and σˉij,bˉi as in Corollary (3.8). Fix ω∈Ω. Then, zt(y,ω)→∞ as t→η(ω) whenever Yt(y,ω)→0
weakly in S′ as t→η(ω).
Conversely, suppose one of the following two conditions is satisfied
viz.
-
y* is a square integrable function
i.e. y∈L2(Rd)=S0.*
2. 2.
y* has compact support i.e. y∈E′.*
Then, zt(y,ω)→∞ as t→η(ω) implies Yt(y,ω)→0 weakly in S′.
Proof : Let Yt(ω):=Yt(y,ω) and zt(ω):=zt(y,ω). Suppose first that for ω∈Ω,Yt(ω)→0 weakly in S′ and assume
that zt(ω)↛∞. Since the neighborhoods of
∞ in Rd are complements of compact sets , if
zt(ω)↛∞, then there exists a ball
B(0,r) of radius r around zero and a sequence tn↑η(ω) such that ztn(ω)∈B(0,r) for all n≥1. The compactness of B(0,r) implies the existence of a
subsequence of (tn), denoted again by tn, and z∈B(0,r)
such that ztn(ω)→z. The continuity of the
translations and the weak convergence of Ytn(ω) to zero
now forces τz(y)=0. This implies y=0, a contradiction.
For the converse suppose first that y∈L2(Rd). If
zt(ω)→∞ as t→η(ω)
then for ϕ=0∈S,
[TABLE]
Since y∈L2(Rd), the second
integral can be made small, independent of t by choosing n
large. For the first integral, we can choose t sufficiently close
to η(ω) so that ϕ(x−zt(ω)) is small, uniformly
for ∣x∣≤n, proving the case when y∈L2(Rd).
Suppose now that the second case holds i.e. y has compact support.
Let support y⊆K and let N= order(y)+2d. Then there
exist continuous functions gα,∣α∣≤N,\mboxsupport gα⊆V where V is an open set
having compact closure, containing K, such that
[TABLE]
See
[3], Theorem 24.5, Corollary 3. Then for ϕ=0∈S,
[TABLE]
The same arguments as in the first case applied to each
of the terms in the above sum will now also prove the second case.
\hfill□
Remark 3.13
It is easy to see that zt(y,ω)→∞
does not in general imply that Yt(y,ω)→0 weakly
in S′. For example if d=1 and y=c , a non
zero constant , then y∈S−p for some p>0. If
σ,b are integrable functions with non zero integrals over
R then it is easy to see that η=∞ and
zt(y,ω)→∞ almost surely on the one hand ,
while on the other Yt(y,ω)=c for all t≥0.
4 (Yt(y)) as a Markov Process on S^p.
In this section, we study the Markov properties of the solutions of
equation (3.4) viz. (Yt(y)). For this purpose, it is essential to
obtain a version of (Yt(y)) which is jointly measurable in y as
well. It is of course no accident and certainly not unreasonable
that the Markov properties of Yt(y) derive from that of the
diffusion (X(x,y,t)) generated by equation(2.3).In Proposition 4.1
below, we obtain a version of ((X(x,y,t))) which is also jointly
measurable in x and y. Let σij,bi∈S−p,i,j=1,⋯,d and y∈Sp. Let
σˉij(x,y)=⟨σij,τxy⟩
and bˉi(x,y)=⟨bi,τxy⟩ be locally
Lipschitz functions on Rd (earlier denoted by
σˉij(x),bˉi(x)).
Then by Theorem 2.1, equation(2.3)has a solution for each x∈Rd.
We will denote the Borel σ field on R^d by Bd.
Proposition 4.1
Let (Ω,F,{Ft}t≥0,P) be a filtered probability space
satisfying the usual conditions and (Bt) be a standard
Ft Brownian motion on it. Then,there exists a map X:R^d×S^p×[0,∞)×Ω→R^d which is
Bd⊗B(S^p)⊗B[0,∞)⊗F measurable and such that for each
(x,y)∈Rd×Sp,(X(x,y,t)) is a
solution of equation (2.3) given by Theorem 2.1.
Proof: We note that σˉ(x,y) and bˉ(x,y)
are jointly measurable in (x,y). Choose for each n≥1, and
i,j=1,⋯d, Lipshitz functions in x σijn(x,y),bin(x,y) , measurable in (x,y)such that σijn(x,y)=σˉij(x,y), bin(x,y)=bˉi(x,y),∣x∣≤n.
Consider equation(2.3) with σˉij, bˉi replaced
by σijn(x),bin(x) viz.
[TABLE]
Let for
n\geq 1,k\geq 1,$$(X^{n,k}(x,y,t)) be defined iteratively (in
vector form) by
[TABLE]
with Xn,0(x,y,t):=x for all t≥0. It is a well known property of stochastic integrals that
they are measurable with respect to a parameter when the integrands
are measurable with respect to the same parameter (see [6],
Theorem 17.25). Using an inductive argument and the fact that
σijn(x,y),bin(x,y) are jointly measurable in (x,y) it
follows that the integrals in the right hand side of the above
equation have jointly measurable versions in (x,y,t,ω) and
the same follows for Xn,k(x,y,t,ω). Then by the method of
successive approximations , we get solutions
(Xn(x,y,t,ω)),of equation(4.6), where Xn(x,y,t,ω)=k→∞limXn,k(x,y,t,ω) , jointly
measurable in (x,y,t,ω) and for each (x,y), progressively
measurable in (t,ω)(see [5],Chapter IV, Theorem 3.1).
Note that for each (x,y),(Xn(x,y,t,ω)) also solves
equation(2.3) upto the random time ηn(x,y,ω) defined as
[TABLE]
It is easy to see that
ηn(x,y,ω) is jointly measurable in (x,y,ω).
Further,if we denote by η(x,y,ω) the explosion time for
the solution (X(x,y,t)) of equation (2.3) starting at x∈Rd, then η(x,y,ω):=n→∞limηn(x,y,ω). As a consequence, η(x,y,ω) is
a measurable function of (x,y,ω)∈Rd×Sp×Ω. The map X may now be defined as
[TABLE]
Clearly for each (x,y)∈Rd×Sp we have , by the
uniqueness of solutions, X(x,y,t∧ηn)=Xn(x,y,t∧ηn). We define X(x,y,t,ω)=∞ for all
(t,ω) if x=∞ or y=δ. \hfill□.
We define the transition probability function Pˉ(x,y,t,A) of
the diffusion (X(x,y,t)) in the usual way : For 0≤t≤∞,x∈Rd,y∈Sp and A∈Bd,
[TABLE]
Note that t=∞ is
included in the definition of Pˉ(x,y,t,A) by taking
X∞=∞.We take Pˉ(x,y,t,A):=IA(∞),t≥0, if x=∞. We can define an induced transition
probability on Sp as follows : First we extend the map
τx(y). We define τ∞(y):=δ,y∈S^p and τx(δ):=δ. Thus τx:S^p→S^p,x∈Rd^. For y∈Sp,0≤t≤∞, and B∈B(S^p) define
[TABLE]
where τ.−1(y)(B)={z:τz(y)∈B}. We take P(y,t,B):=IB(δ),t≥0,
if y=δ and define Yt(y)=δ if t=∞ or y=δ. We note that because X(0,y,t)=zt(y),t<η(0,y) we have
[TABLE]
The strong
Markov property for (Yt(y)) is now a simple consequence of that
for the process (X(x,y,t)).
Proposition 4.2
Let y∈Sp and let T be an Ft
stopping time . Then for 0≤s≤∞, and B∈B(S^p), we have ,
almost surely ,
[TABLE]
Proof : Since the result holds trivially at s=∞, we
assume s<∞. We have, using the strong Markov property of
the process (X(x,y,t))
[TABLE]
On the other hand,
[TABLE]
where in the last equality we have made use
of the observation made preceeding the statement of the proposition
and the fact that Ys(τx(y))=τzs(τx(y))(τx(y)). Hence
[TABLE]
This completes the proof of the
Proposition.\hfill□
Let Bp={f:S^p→R;f bounded and measurable,f(δ)=0}. Let
∥.∥p,∞ denote the norm on Bp given by
∥f∥p,∞:=y∈Spsup∣f(y)∣. Then
(Bp,∥.∥p,∞) is a Banach space. Let (Tt)0≤t<∞
denote the linear operators Tt:Bp→Bp given by Ttf(y):=E(f(Yt(y))), for y∈Sp
and f∈Bp.
Corollary 4.3
(Tt)t≥0* is a semi-group of linear
operators on Bp with T0=Identity, Tt1=1 and
Ttf≥0 whenever f≥0.*
Let (L,Dom(L)) denote the infinitesimal generator
of (Tt). Recall that f∈Bp belongs to Dom(L) iff the limit of t1(Ttf−f) as t tends to zero
exists in Bp and further L(f):=t→0limt1(Ttf−f). We shall denote by
(Tˉty) and Lˉy the semi-group and
infinitesimal generator, respectively on Bd and Dom(Lˉy)⊂Bd, associated with the
diffusion (X(x,y,t)) generated by (σˉ(x,y)) and
bˉ(x,y). Here Bd is the Banach space of bounded
measurable functions on R^d endowed with the
supremum norm.
For y∈Sp, let C(y)⊂Sp be defined by
C(y):={y′∈Sp:y′=τx(y),x∈Rd}. C^(y):=C(y)∪{δ}.Then
observe that Sp=y∈Sp⋃C(y)
and for y1=y2 either C(y1)⋂C(y2)=ϕ or
C(y1)=C(y2). We shall consider C(y) as a measurable space
with the σ−field induced by B(S^p). Let
[TABLE]
Since P{Yt(y)∈C(y′) for some t≥0}=0 if C(y)⋂C(y′)=ϕ we have (the restriction) Tt∣Bp(y):Bp(y)→Bp(y) is a
semi-group on (Bp(y),∥.∥p,∞), for every y∈Sp. We shall continue to denote the restrictions of
Tt and L to Bp(y) and Bp(y)⋂Dom(L) respectively by (Tt) and L
or by Tty and Ly if there is a risk of confusion.
Let F:Rd+n→R,F∈C∞(Rd+n) such that support(F)⊂K×Rn for some compact K⊂Rd.
Let φi∈S,i=1,⋯d. Fix y∈Sp.Define f≡fy:Rd×Sp→R as follows :
If y′=τx(y) define
[TABLE]
If y′∈/C(y) we define fy(x,y′)=0.
Define fˉy(x):=fy(x,τx(y)):Rd→R and fˉy(∞)=0. Let σˉt(x,y)
denote the transpose of the matrix given by σˉ(x,y):=(σˉij(x,y)). We will denote by Lˉy the second
order differential operator in the variable x given as
[TABLE]
Proposition 4.4
Let y∈Sp. Then fˉy(x)∈CK∞(Rd)⊂Dom(Lˉ).
Consequently,for x∈Rd,
fy(x,.)∈Dom(L) and
[TABLE]
Lfy(x,y′)=0* if y′∈/C(y).*
Proof: It is clear that fˉy∈CK∞. Further,
using the compactness of K and Ito’s formula it can be shown that
CK∞⊂Dom(Lˉ). It is easily seen from the
definitions that Ttfy(x,τx(y))=Tˉtyfˉy(x) and
Ttfy(x,y′)=0,y′∈/C(y). In particular,
[TABLE]
The result follows.\hfill□
Remark 4.5
Let F(x,z):=g(x)z, where g∈CK∞,K=B(0,r), the ball of radius r centred at 0, and g(x)=1,x∈B(0,r1) for some r1<r. Then for y∈Sp
fy(x,y′)=g(x)⟨ϕ,y′⟩,ϕ∈S,y′∈C(y);fy(x,y′)=0,y′∈/C(y). It is easy to see that
[TABLE]
where L is the operator in equation(3.4).
Remark 4.6
Assume y∈Sp is such that τxy=y
for any x=0. Let j:C^(y)→R^d be defined by j(τx(y)):=x. Then j is one-one and
onto and we provide C^(y) with a topology and corresponding
Borel structure By that makes j:(C^(y),By)→(R^d,Bd) a Borel isomorphism
with inverse τ.(y):R^d→C^(y).
We can extend j as a map j:Bp(y)→Bd such that fˉ∈Bd iff f∈Bp(y), where f=fˉ∘j. This extends to semi
groups viz. Tty=Tˉty∘j. In other words, the
Markov process (Yt(y))≡(τzt(y)(y)) on the state
space C^(y) is ‘isomorphic’ to the Markov process
(X(x,y,t))) on R^d. Note however that the
topology on C^(y) is that of R^d and is
different from the one induced from S^p. Proposition
3.12 is a reflection of the same phenomenon.
5 A Non Linear Evolution Equation .
In this section we derive a non
linear evolution equation associated with
the operator L with the initial condition y∈Sp viz.
[TABLE]
We construct solutions of (5.7) via what maybe
called non linear convolutions, that we define below. This is also
closely related to the notion of stochastic representation of
solutions to evolution equations of the type (4.6). See
[13],[14]. The solutions of equation(5.7)are also
related to the solutions of the forward equation for the diffusion
(X(x,y,t))(see below, equation(5.8), Remark 5.5.) We recall from
[13] that τx:Sp→Sp are
bounded linear operators for all p∈R.
Definition 5.1
Let p∈R and let q≤p. Suppose
h:Sp→Sq and f:Rd→R be Borel measurable maps. For y∈Sp, the convolution h(y)∘f is defined to be the
element of Sq given by the Bochner integral h(y)∘f:=Rd∫h(τxy)f(x) dx provided the
integral exists i.e. provided Rd∫∥h(τx(y))∥q∣f(x)∣ dx<∞.More generally, let μ be a
finite measure on the Borel sigma field of Rd and
h(y) be as above. The convolution h(y)∘μ is defined as h(y)∘μ:=Rd∫h(τxy)μ(dx)
provided Rd∫∥h(τx(y))∥q μ(dx)<∞.
Remark 5.2
Our notation is a compromise between two contrasting
interpretations of the above definition. We could interpret the
above definition as an extension of the notion of convolution of a
functions h:Rd→R and a finite
measure μ on Rd, to that of convolution of μ
and a map h:Sp→Sq. The notation then
would be h∘μ(y), where h∘μ:Sp→Sq. The definition also affords an
interpretation as an extension of the notion of convolution of a
tempered distribution y and a measure μ via the map h. We
may view it then as a non linear convolution between y and μ
where the nonlinearity arises because of the map h. An appropriate
notation then could be y∘hf or y⋆hf. In any
case, our definition of convolution reduces to the usual convolution
between two distributions y and μ, if we take q=p and
h(y)=y in Definition 5.1. We will use the notation y⋆μ for the usual convolution between the distribution y and the
measure μ. We further note that when h is non linear as in
definition (5.1), h(y)∘μ is , in general, different from
the ordinary convolution h(y)⋆μ, between the distribution
h(y) and the measure μ.. In our application of the notion of
convolution to construct solutions of equation(5.7) however, there
is an additional source of non-linearity viz. μ would also
depend on y.
Definition 5.3
Let y∈Sp. We say that a continuous map
ψ(.,y):[0,∞)→Sp, is a solution by
convolution of the initial value problem (5.7) iff there exists
kernels μ(t,dx) on [0,∞)×Rd such that
-
Rd∫∥τx(y)∥p μ(t,dx)<∞,t≥0, and Rd∫∥ϕ(x)τx(y)∥pμ(t,dx) <∞,for all t≥0,and for all ϕ,where ϕ(x)=(⟨σ,τx(y)⟩⟨σ,τx(y)⟩t)ij,i,j=1,⋯d,or ϕ(x)=⟨bi,τx(y)⟩,i=1,⋯,d. In particular if L:Sp→Sp−1 is as in equation (3.4), then the convolution L(y)∘μ(t,.) exists in Sp−1 for all t≥0.
2. 2.
ψ(t,y)=y∘μt,t≥0
3. 3.
ψ(t,y) is continuously differentiable for t∈(0,∞) and we have
[TABLE]
Recall from Section 4 ,the transition probability measure for the
solutions (Yt(y)) of equation(3.4) viz. P(y,t,B) and the
transition probability measure for the process (zt(y)) given by
Corollary 3.8 viz. Pˉ(0,y,t,A). We recall that P(y,t,B)=Pˉ(0,y,t,τ.−1(y)(B)). The following theorem constructs
the solutions of the initial value problem equation(5.7) via a
stochastic representation.
Theorem 5.4
Let σij,bi,y be as in Theorem 3.4 and
let {(Yt(y)),η} be the unique Sp valued solution of
equation (3.4) given by Theorem 3.11. Let σˉij(x,y):=⟨σij,τx(y)⟩,bˉi(x,y):=⟨bi,τx(y)⟩,i,j=1,⋯,d and suppose that for fixed
y, these are bounded and continuous functions of x. Let
(zt(y)) be the unique solution of equation (2.3), as in Theorem
3.11. Then ψ(t,y):=E(Yt(y)),t≥0 defines an Sp valued continuous map that solves the initial value problem
(5.7) by convolution .
**Proof : ** We note that under the assumptions on
σˉij(.,y),bˉi(.,y), (zt(y)) has moments of
all orders and further for all t≥0,
[TABLE]
In particular, η=∞,a.s.
Note that ψ(t,y) is well defined : for z∈Rd,
[TABLE]
where P(x) is a polynomial in x∈R of degree 2∣p∣+1
(see [13]) .
It follows from our assumption on the moments of (zt)
that
[TABLE]
for
every t≥0. In particular ψ(t,y):=EYt(y) exists as
a Bochner integral in Sp. Further it is clear that ψ(t,y)=y∘Pˉ(0,y,t,.) The theorem is proved by taking
expectations in equation (3.5) satisfied by (Yt(y)). We first
show that this is indeed a legitimate operation.
Using the fact that the moments of zs:=zs(y) are finite, we
have for each t≥0,
[TABLE]
where C′=C′(d,∥σi,j∥−p,∥bi∥−p,i,j=1⋯d)
and C=C(d,∥σi,j∥−p,∥bi∥−p,i,j=1⋯d,∥y∥p) are positive constants depending on the indicated
quantities. A similar calculation verifies that for each t≥0
[TABLE]
We can
thus take expectations in equation (3.5) to get
[TABLE]
That ψ(t,y) is
continuously differentiable and satisfies equation (5.7)now follows
from the above equation and the continuity of E(L(Ys(y))). This
completes the proof of Theorem 5.4. \hfill□
For ϕ,y∈S′ such that the products
σˉij(.,y)ϕ, bˉi(.,y)ϕ,i,j=1,⋯d
are tempered distributions in the variable x, we define the
operator Lˉ∗,y as follows :
[TABLE]
We note that Lˉ∗,y is the formal adjoint of the second
order differential operator Lˉ associated with the diffusion
with coefficients σˉij,bˉi,i,j=1,⋯,d. The initial value problem (5.7)is closely connected with
solutions of the forward Kolmogorov equation for Lˉ viz.
[TABLE]
When ψ is a distribution with compact support and
σˉij,bˉi are smooth , solutions to the above
equation maybe obtained by convolution with the transition
probability measure Pˉ(0,y,t,.) (see [14], Theorem
4.5.). We extend that result in Theorem 5.6 when the coefficients
are only bounded and continuous.
Remark 5.5
To see the connection between solutions of equation (5.7)
and solutions of equation(5.8), consider the following integrated
version of equation(5.8) with ψt=Pˉ(0,y,t,.) viz.
[TABLE]
By convolving with y∈Sp and using the relation
[TABLE]
we get ,
[TABLE]
which is equivalent to equation
(5.7). On the other hand we can Fourier transform the above to get
back (5.9): Suppose y has compact support. For a tempered
distribution ϕ, the Fourier transform of ϕ is denoted by
ϕ^. For each ξ∈Rd,
[TABLE]
Since y
has compact support, y^(ξ), is by the Paley-Wiener theorem
an entire function and hence we may cancel off y^(ξ) in the
above equation to get, almost surely with respect to Lebesgue
measure on Rd,
[TABLE]
Inverting the Fourier transform , we get back equation (5.9).
Theorem 5.6
Suppose that y∈Sp, σij,bi,∈S−p,i,j=1,⋯d
and in addition are such that σˉij(.,y),bˉi(.,y) are bounded continuous functions. Let q>4d. Then for any x∈Rd,the map t→Pˉ(x,y,t,.):[0,∞)→S−q
is differentiable and satisfies the forward equation (5.8) with
ψ=δx.
Proof : Let (X(x,y,t)) be the unique solution of (2.3). Then
by our assumptions on σˉij,bˉi , the moments of
all orders of (X(x,y,t)) exist and are locally bounded functions
of t.. Further since δz∈S−q if and only if
q>4d(see [14]), we have δX(x,y,t)∈S−q. In particular, as in the proof of the previous
theorem, E∥δX(x,y,t)∥−q<∞. It follows that
Pˉ(x,y,t,.)=EδX(x,y,t), where the right hand side
is an element of S−q and the equality holds there. Using
the Ito formula in [12] we get,
[TABLE]
where the operator valued processes (Lˉ(s,ω)) and
(Aˉ(s,ω)) are defined for fixed x and y as follows.
Aˉ(s,ω):=(Aˉ1(s,ω),⋯,Aˉd(s,ω)) and for ϕ∈S′,
[TABLE]
Similarly
[TABLE]
As in the proof of Theorem 5.4, we can take expectations
in equation (5.10) to get
[TABLE]
Note that we have
the representation Pˉ(x,y,t,.)=Rd∫δz(.)Pˉ(x,y,t,dz), where the right hand side is a
Bochner integral in S−q. Given a bounded continuous
function ϕ we can use this representation to define the product
ϕPˉ(x,y,t,.) as an element of S−q as follows :
[TABLE]
where the right hand side is
a Bochner integral in S−q. Hence
Lˉ∗Pˉ(x,y,t,.) is a well defined tempered distribution
in S−q−1. It is now easy to see , by acting on test
functions, that Lˉ∗,yPˉ(x,y,t,.)=E(Lˉ(t)δX(x,y,t)). In particular from (5.11) we get
[TABLE]
Since the moments of X(x,y,t)
are locally bounded functions of t, it follows using the dominated
convergence theorem that the integrand in the right hand side in
equation (5.12) is continuous in t. The desired conclusion now
follows from equation (5.12). \hfill□
Acknowledgements : The author would like to thank an anonymous
referee for his careful reading of the manuscript and for his useful
comments and suggestions.