The expected signature of Brownian motion stopped on the boundary of a circle has finite radius of convergence
Horatio Boedihardjo, Joscha Diehl, Marc Mezzarobba, Hao Ni

TL;DR
This paper investigates the properties of the expected signature of Brownian motion stopped at a boundary, providing a counterexample to a previously posed question about its moment conditions and convergence.
Contribution
It presents the first example showing that the expected signature of Brownian motion may not satisfy the moment condition, challenging prior assumptions.
Findings
Expected signature has finite radius of convergence
Counterexample to the moment condition question
Challenges previous beliefs about expected signatures
Abstract
The expected signature is an analogue of the Laplace transform for rough paths. Chevyrev and Lyons showed that, under certain moment conditions, the expected signature determines the laws of signatures. Lyons and Ni posed the question of whether the expected signature of Brownian motion up to the exit time of a domain satisfies Chevyrev and Lyons' moment condition. We provide the first example where the answer is negative.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The expected signature of Brownian motion stopped on the boundary of a circle has finite radius of convergence
Horatio Boedihardjo University of Reading. HB gratefully acknowledges EPSRC’s support (EP/R008205/1).
Joscha Diehl Universität Greifswald
Marc Mezzarobba Sorbonne Université, CNRS, Laboratoire d’informatique de Paris 6, LIP6, F-75005 Paris, France. MM was supported in part by ANR grant ANR-14-CE25-0018-01 (FastRelax).
Hao Ni University College London. HN acknowledges the support by the EPSRC under the program grant EP/S026347/1 and by the Alan Turing Institute under the EPSRC grant EP/N510129/1.
Abstract
The expected signature is an analogue of the Laplace transform for probability measures on rough paths. A key question in the area has been to identify a general condition to ensure that the expected signature uniquely determines the measures. A sufficient condition has recently been given by Chevyrev and Lyons, and requires a strong upper bound on the expected signature. While the upper bound was verified for many well-known processes up to a deterministic time, it was not known whether the required bound holds for random time. In fact even the simplest case of Brownian motion up to the exit time of a planar disc was open. For this particular case we answer this question using a suitable hyperbolic projection of the expected signature. The projection satisfies a three-dimensional system of linear PDEs, which (surprisingly) can be solved explicitly, and which allows us to show that the upper bound on the expected signature is not satisfied.
Contents
- 1 Introduction
- 2 Differentiability of the development of expected signature
- 3 A polar decomposition for the development
- 4 ODE for
- 5 Solving the ODE for
- 6 Concluding
- 7 Appendix
1 Introduction
Let a probability measure on a subset of the real line have moments of all orders. Under which conditions do these moments pin down the probability measure uniquely? This is the well-studied moment problem. When the subset is compact, the answer is always affirmative. In the noncompact case uniqueness is more delicate (see [Sch2017]).
In stochastic analysis one is usually concerned with measures on some space of paths, the prime example being Wiener measure on the space of continuous functions. It turns out that for many purposes a good replacement for “monomials” in this setting are the iterated integrals of paths. The collection of all of these integrals is called the iterated-integrals signature.
For smooth paths (and with respect to a time horizon ), it is defined, using Riemann-Stieltjes integration, as
[TABLE]
It is well-known (see [BGLY2016] and references therein) that
- •
, where is the group of grouplike elements,
- •
completely characterizes the path up to reparametrization and up to tree-likeness.
Let now be a stochastic process. For fixed , is usually not smooth, so that we have to assume that the stochastic process posesses a “reasonable” integration theory. In particular assume that integrals of the form , exist for a large class of functions and that the fundamental theorem of calculus holds,
[TABLE]
An important example is Brownian motion with Stratonovich integration. Other examples include: the Young integration against a fractional Brownian motion with Hurst parameter strictly larger than , …(see [FV10]).
The iterated-integrals signature defined by the expression (1) — now, using the given integration theory — is then a random variable. Let us assume that we can take its expectation level-by-level, i.e. for all (we postpone the discussion of the choice of norm on to later)
[TABLE]
We can then define expected signature level-by-level
[TABLE]
where denotes the expectation level-by-level ( for “projective”) of a -valued random variable. The question arises:
Does completely characterize the law of ?
As we have seen above, the computation of already incurs a loss of information: the parametrization of and any tree-like parts are lost. The relevant question is hence
Does completely characterize the law of , up to parametrization and tree-likeness?
Since this formulation is a bit awkard, and since the (deterministic) step is completely understood, we can instead focus on
Does completely characterize the law of on ?
A sufficient condition for this to be the case is given in [ChevyrevLyons]: if has infinite radius of convergence, that is
[TABLE]
for all then the law of on is the unique law with this (projective) expected value. Here denotes projection onto tensors of length .
Let us give two examples. Let be a probability measure on having all moments and define
[TABLE]
Consider the stochastic process , where is distributed according to . Since is smooth, its signature is well-defined and actually has the simple form
[TABLE]
Then
[TABLE]
and a sufficient condition for to have infinite radius of convergence is , for some .111The condition is of course more than enough in the classical moment problem to have uniqueness for the law on ([SR75, Example X.6.4]). Then [ChevyrevLyons, Proposition 6.1] applies, and the law of on is uniquely determined by these moments.
Consider now the expected signature of a standard Brownian motion calculated up to some fixed time . It is known (see for example [LV2004, Proposition 4.10]) that
[TABLE]
It follows that
[TABLE]
and hence
[TABLE]
for any . Again, by [ChevyrevLyons, Proposition 6.1], the law of is uniquely determined by .
The decay rate of expected signature of the stochastic process up to the exit time from a bounded domain is a very challenging problem, even for the simple Brownian motion case. The literature on the decay rate of the expected signature focuses on the case for the fixed time interval, e.g. [Passeggeri2016] and [FrizRiedel2014], which heavily relies on the Gaussian tail of the increment. However, the increment of stopped processes would violate this assumption. The question on the finiteness of the convergence radius of the expected signature of the Brownian motion was firstly proposed in [LyonsNi] in 2015. It has remained open until our paper provides the first negative example, i.e. a 2-dimensional Brownian motion up to the unit disk, which implies the lower bound of the decay rate of the expected signature in this case. In [LyonsNi, Theorem 3.6], using Sobolev estimates, under certain smoothness and boundedness condition of the domain, geometric upper bounds for the decay rate of the expected signature of stopped Brownian is established (see also [ChevyrevLyons, Example 6.20] for a probabilistic approach). It is even more challenging to establish a non-trivial lower bound of the decay rate of the expected signature in this case. Our work may shed some light on how to use the PDE approach to derive the lower bound of the decay rate of the stopped diffusion processes.
Concretely, we consider the Brownian motion in started at some point in the unit circle , and stopped at hitting the boundary, that is
[TABLE]
In the notation introduced above, we are interested in
[TABLE]
where . In [LyonsNi] it was shown that for every and , the th term of satisfies the following PDE:
[TABLE]
with the boundary condition that for each ,
[TABLE]
Additionally, one has and for all . Using this, they were able to obtain the bound for some ([LyonsNi, Theorem 3.6]). This bound is not enough to decide whether the radius of convergence for is infinite or not, but it is enough to deduce that has radius of converge strictly larger than [math]. In this work we show that the radius of convergence is indeed finite.
Recall from [ChevyrevLyons, Proposition 6.1] that if are -valued random variables such that and has an infinite radius of convergence, then . Our main theorem, proven in Section 6, is the following.
Theorem**.**
The expected signature of a two-dimensional Brownian motion stopped upon exiting the unit disk has a finite radius of convergence.
The condition of having an infinite radius of convergence is equivalent to lying in . Here is defined as the closure of under the coarsest topology such that for all normed algebras and all , the extension is continuous. Recall that for , we may define firstly as a map on the -times algebraic tensor product , by the relation
[TABLE]
and then extended it to by linearity.
We want to show that does not lie in the space . It is sufficient to show that there exists , and , such that diverges as tends to a finite number . In fact, we will choose to be
[TABLE]
Such a map first appeared in [HamblyLyons] to study the signature of bounded variation paths and is also subsequently used in [LyonsXu].
We proceed as follows. In Section 2, for we let act on . For small enough, we show that resulting linear map in , evaluated at is smooth in and solves a certain PDE. Using rotational invariance of Brownian motion, in Section 3 we rewrite said PDE solution in polar coordinates. In Sections 4 and 5, we obtain an explicit solution for the PDE (still, for small enough) in terms of Bessel functions. Finally, in Section 6 we show that the solution blows up as for some , proving our main theorem. The Appendix, Section 7, contains some auxiliary results on PDEs.
2 Differentiability of the development of expected signature
We first need two technical lemmas which assert that the development of the expected signature is twice differentiable, and satisfies the PDE we expect it to. In the lemma below we will adopt the multi-index notation
[TABLE]
Observe that
[TABLE]
Let be the projective norm.
Lemma 1**.**
The function is twice continuously differentiable. There exists a constant such that for all , all and all satisfying , one has the bound
[TABLE]
Moreover, there exists such that for all
[TABLE]
is twice differentiable in and if , then
[TABLE]
Proof.
Let . By Theorem 16 in the appendix, the function is twice continuously differentiable (it is in fact infinitely differentiable on ). By Lemma 15 in the Appendix, there exists such that for all
[TABLE]
where the norm is the Sobolev norm on the unit disc with respect to the variable ,
[TABLE]
By Theorem 2.2 in [LyonsNi], which bounds the values of a function in terms of the Sobolev norm of , there is some constant such that for all and ,
[TABLE]
Since is a linear image of , the function is twice continuously differentiable in , and moreover, there exists such that for all ,
[TABLE]
This bound also allows us to deduce that for , the series
[TABLE]
converges uniformly and hence the series
[TABLE]
is twice continuously differentiable and the derivatives can be taken inside the infinite summation. ∎
Lemma 2**.**
There exists such that if , the function defined by
[TABLE]
is twice continuously differentiable on , and satisfies
[TABLE]
with for . Here denotes the canonical basis of .
Proof.
If we apply the linear map to the PDE (5), then we have a matrix-valued PDE
[TABLE]
together with the boundary condition
[TABLE]
We may multiply both sides with , sum to infinity and apply to the vector to get
[TABLE]
By Lemma 1, each of the infinite sums converges and we may take the derivatives outside the infinite sum. ∎
3 A polar decomposition for the development
Let . Recall that
[TABLE]
We may consider as a linear endomorphism of mapping to .
Lemma 3**.**
For any linear map ,
[TABLE]
where is the transpose of .
Proof.
Note that
[TABLE]
In what follows, we will use the notation
[TABLE]
Corollary 4**.**
Let be the rotation map
[TABLE]
Then
[TABLE]
Proof.
Brownian motion starting at has the same distribution as the rotated Brownian motion , where starts from . Let denote the Stratonovich differential. Then
[TABLE]
By Lemma 3
[TABLE]
As is orthogonal, we have
[TABLE]
Therefore,
[TABLE]
Corollary 5**.**
Define the functions by
[TABLE]
where is the function defined by (9). In polar coordinates, the expression of reads
[TABLE]
Additionally, there exists such that if , then are twice continuously differentiable functions in the variable for all .
Proof.
The functions are twice continuously differentiable because is twice continuously differentiable by Lemma 2. Let be the rotation of angle . For , the definition of gives
[TABLE]
By Corollary 4, one has
[TABLE]
where
[TABLE]
4 ODE for
Lemma 6**.**
There exists such that for , the functions defined in Lemma 5 satisfy
[TABLE]
and
[TABLE]
Proof.
By Corollary 5,
[TABLE]
As are twice continuously differentiable for , we may substitute (4) for into the equation in Lemma 2, which gives for
[TABLE]
Using the identities
[TABLE]
the right hand side of the PDE in Lemma 2 is
[TABLE]
Equating (13) and (14) gives the first equation as
[TABLE]
As this holds for all , we may equate the coefficients of and to obtain
[TABLE]
and from the second equation,
[TABLE]
and therefore,
[TABLE]
Combining (15) and (16) and multiplying the equations throughout by , we have
[TABLE]
for all . By continuity of the second derivatives of (see Lemma 5), the equations hold for all . Again using the continuity of the second derivatives of , we may substitute into (17) to get
[TABLE]
Using the boundary conditions for in Lemma 2, we have
[TABLE]
5 Solving the ODE for
Lemma 7**.**
Let
[TABLE]
where are the Bessel functions of the first kind. Fix such that . Then the real-valued functions defined for all by222Note that the two determinations of the square root in the definition of yield the yield the same , and .
[TABLE]
are the unique solution of the differential system (12) satisfying the boundary conditions stated in Lemma 6.
Proof.
Recall that, for , Bessel’s differential equation
[TABLE]
has a canonical basis of solutions consisting of the ordinary Bessel functions and [DLMF, §10.2]. The function is entire, while is analytic on , and diverges to as along the positive reals.
Let denote a cylinder function, i.e., a linear combination with coefficients that do not depend on [DLMF, §10.2(ii)]. One has [DLMF, (10.6.2), (10.6.3)]
[TABLE]
The equation for is exactly Bessel’s equation with and ; its general solution for is hence . Since diverges as and is nonzero, the initial condition at [math] forces . Similary, the initial condition at implies and therefore , unless , in which case any is a solution.
Let us turn to the coupled equations for and . Make the ansatz
[TABLE]
where are yet unspecified complex numbers. The equation involving becomes
[TABLE]
The change of variable passing from to transforms Bessel’s equation into
[TABLE]
and the relations (21) yield , so (23) holds when
[TABLE]
i.e., when .
Similarly, the equation involving rewrites as
[TABLE]
and the last term on the left-hand side is equal to by (21). Thus, (24) reduces to Bessel’s equation provided that .
In summary, the functions (22) define a solution of (12) for any choice of in the definition of and , such that . The latter condition is equivalent to
[TABLE]
Letting now denote a fixed root of , say the one in (18), the choices
[TABLE]
provide us with four linearly independent333This follows, for instance, from the expressions [DLMF, (10.2.2), (10.8.1)] and the fact that . solutions, which hence form a basis of the solution space of the system of two linear differential equation of order two.
The asymptotic behaviour of at the origin, [DLMF, (10.7.4)], shows that linear combinations involving any of the last two solutions (25) are incompatible with the conditions . Therefore, one has
[TABLE]
for some . The conditions , translate into a linear system for of determinant
[TABLE]
(where we have used the fact that [DLMF, (10.11.9)]). When , the unique solution is . Since , this leads to the expressions (19). ∎
6 Concluding
Lemma 8**.**
In the notation of Lemma 7, there exists such that , viewed as a function of , has a pole at .
Proof.
Let us first show that has a zero lying in the interval . Consider the series expansions [DLMF, (10.2.2)]
[TABLE]
For and such that , the remainders starting at index of both series are bounded by
[TABLE]
In particular, for or with , we have . For , the quantity (27) is bounded by . By replacing and by the first five terms of the series (26) in the expression of and propagating this bound by the triangle inequality, one can check that . A similar calculation shows that . Since is a continuous function of , it follows that vanishes for some .
We still need to check that the numerator of in (19) does not vanish at . One has . Taking in (27) yields an expression of the form
[TABLE]
where one can check that , , . For all , this implies
[TABLE]
The claim follows. ∎
Remark 9**.**
Instead of doing the calculation sketched in the proof manually, one can easily prove the result using a computer implementation of Bessel functions that provides rigorous error bounds. For example, using the interval arithmetic library Arb [Johansson2017] via SageMath, the check that has a zero goes as follows. The quantities of the form [x.xxx +/- eps] appearing in the output are guaranteed to be rigorous enclosures of the corresponding real quantities. We check the presence of a zero in the interval instead of because having a tighter estimate simplifies the second step.
sage: zeta = CBF(sqrt((-1+Isqrt(7))/2)) sage: alpha = zeta^3/2 + zeta sage: lb, ub = CBF(282/100), CBF(283/100) sage: (alpha.conjugate()(lbzeta).bessel_J(0) ....: (lbzeta.conjugate()).bessel_J(1)) [-13.208370024264 +/- 4.16e-13] + [-0.003639973760 +/- 4.63e-13]I sage: (alpha.conjugate()(ubzeta).bessel_J(0) ....: (ubzeta.conjugate()).bessel_J(1)) [-13.424373315124 +/- 4.75e-13] + [0.005782411521 +/- 4.38e-13]*I
One can then verify as follows that the image by the function of the interval only contains elements of negative imaginary part.
sage: crit = lb.union(ub); crit # convex hull (real interval) [2.8 +/- 0.0301] sage: alpha.conjugate()(critzeta.conjugate()).bessel_J(1) [+/- 0.0707] + [-4e+0 +/- 0.303]*I
Theorem 10**.**
The series expansion with respect to of has a finite radius of convergence.
Remark 11**.**
Since the condition of [ChevyrevLyons] of uniqueness of laws is only sufficient, the questions remains on whether there exists another law on having the same moments as .
Proof.
Assume for contradiction that has an infinite radius of convergence. Then is an entire function in . We also know from Corollary 5 that there exists such that for real
[TABLE]
where are defined by Lemma 7. By the Identity theorem, are entire functions and (28) holds for all . This contradicts Lemma 8, and therefore has a finite radius of convergence. ∎
7 Appendix
Let be a domain in .
Definition 12**.**
Let be a locally integrable function in and be a multi-index. Then a locally integrable function such that for every ,
[TABLE]
will be called weak derivative of and is denoted by . By convention, if .
Definition 13**.**
The Sobolev space for is defined to be the set of all -valued functions such that for every multi-index with , the weak partial derivative belongs to , i.e.
[TABLE]
It is endowed with the Sobolev norm defined as follows:
[TABLE]
When , this norm coincides with the -norm, i.e.
[TABLE]
Theorem 14**.**
Let be a second order differential operator with coefficients . Let be a weak solution of
[TABLE]
Suppose that the ellipticity condition holds. Let , . Let be a bounded domain of class and let the coefficients of be of class . Then
[TABLE]
with depending on , and on the -norms for the .
Proof.
It is proved by using Theorem 8.13 in [gilbarg2015elliptic] and setting the boundary condition . ∎
In the following we prove Lemma 15 for , which is a generalization of Lemma 3.11 for the case in [LyonsNi].
Lemma 15**.**
Let be a bounded domain of class in , where . Then there exists a constant only depending on and , such that for every positive integer ,
[TABLE]
Proof.
The proof of Lemma 3.11 in [LyonsNi] can be applied here directly, except for that we need to check that , which is proved in the following theorem 16. ∎
Theorem 16**.**
Suppose that is a non-empty bounded domain in . It follows is infinitely differentiable in componentwise sense, i.e. for all index , is infinitely differentiable for all .
Proof.
Based on Theorem 3.2 in [LyonsNi], it shows that
[TABLE]
where is a smooth distribution with compact support , is the convolution, and be a map from to defined by:
[TABLE]
Since is smooth (in polynomial form) and is a smooth function with compact support, is a smooth function with compact support. It is easy to show that for any partial derivative is integrable.
[TABLE]
On the other hand, as well, and so we have
[TABLE]
Thus is infinitely differentiable, since . ∎
