A functional limit theorem for coin tossing Markov chains

Stefan Ankirchner; Thomas Kruse; Mikhail Urusov

arXiv:1902.06249·math.PR·May 13, 2020

A functional limit theorem for coin tossing Markov chains

Stefan Ankirchner, Thomas Kruse, Mikhail Urusov

PDF

TL;DR

This paper establishes a functional limit theorem for a class of Markov chains that move up or down by state-dependent amounts, enabling approximation of complex continuous Markov processes, including those with irregular features.

Contribution

It introduces a new functional limit theorem for Markov chains with state-dependent jumps, extending approximation capabilities to irregular continuous Markov processes.

Findings

01

Approximate all one-dimensional regular continuous strong Markov processes in natural scale.

02

Applicable to processes not characterized by stochastic differential equations.

03

Illustrated with sticky Brownian motion and Cantor set slowed Brownian motion.

Abstract

We prove a functional limit theorem for Markov chains that, in each step, move up or down by a possibly state dependent constant with probability $1/2$ , respectively. The theorem entails that the law of every one-dimensional regular continuous strong Markov process in natural scale can be approximated with such Markov chains arbitrarily well. The functional limit theorem applies, in particular, to Markov processes that cannot be characterized as solutions to stochastic differential equations. Our results allow to practically approximate such processes with irregular behavior; we illustrate this with Markov processes exhibiting sticky features, e.g., sticky Brownian motion and a Brownian motion slowed down on the Cantor set.

Equations254

X_{0}^{h} = y and X_{(k + 1) h}^{h} = X_{k h}^{h} + a_{h} (X_{k h}^{h}) ξ_{k + 1}, for k \in N_{0} .

X_{0}^{h} = y and X_{(k + 1) h}^{h} = X_{k h}^{h} + a_{h} (X_{k h}^{h}) ξ_{k + 1}, for k \in N_{0} .

X_{t}^{h} = X_{⌊ t / h ⌋ h}^{h} + (t / h - ⌊ t / h ⌋) (X_{(⌊ t / h ⌋ + 1) h}^{h} - X_{⌊ t / h ⌋ h}^{h}), t \in [0, \infty) .

X_{t}^{h} = X_{⌊ t / h ⌋ h}^{h} + (t / h - ⌊ t / h ⌋) (X_{(⌊ t / h ⌋ + 1) h}^{h} - X_{⌊ t / h ⌋ h}^{h}), t \in [0, \infty) .

\frac{1}{2} \int_{(y - a_{h} (y), y + a_{h} (y))} (a_{h} (y) - ∣ u - y ∣) m (d u) = h,

\frac{1}{2} \int_{(y - a_{h} (y), y + a_{h} (y))} (a_{h} (y) - ∣ u - y ∣) m (d u) = h,

0 < m ([a, b]) < \infty.

0 < m ([a, b]) < \infty.

X_{0}^{h} = y and X_{(k + 1) h}^{h} = X_{k h}^{h} + a_{h} (X_{k h}^{h}) ξ_{k + 1}, for k \in N_{0} .

X_{0}^{h} = y and X_{(k + 1) h}^{h} = X_{k h}^{h} + a_{h} (X_{k h}^{h}) ξ_{k + 1}, for k \in N_{0} .

X_{t}^{h} = X_{⌊ t / h ⌋ h}^{h} + (t / h - ⌊ t / h ⌋) (X_{(⌊ t / h ⌋ + 1) h}^{h} - X_{⌊ t / h ⌋ h}^{h}) .

X_{t}^{h} = X_{⌊ t / h ⌋ h}^{h} + (t / h - ⌊ t / h ⌋) (X_{(⌊ t / h ⌋ + 1) h}^{h} - X_{⌊ t / h ⌋ h}^{h}) .

y \in K sup \frac{1}{2} \int_{(y - a_{h} (y), y + a_{h} (y))} (a_{h} (y) - ∣ u - y ∣) m (d u) - h \in o (h), h \to 0.

y \in K sup \frac{1}{2} \int_{(y - a_{h} (y), y + a_{h} (y))} (a_{h} (y) - ∣ u - y ∣) m (d u) - h \in o (h), h \to 0.

d (x, y) = n = 1 \sum \infty 2^{- n} (∥ x - y ∥_{C [0, n]} \land 1), x, y \in C ([0, \infty), R),

d (x, y) = n = 1 \sum \infty 2^{- n} (∥ x - y ∥_{C [0, n]} \land 1), x, y \in C ([0, \infty), R),

E [F (X^{h, y})] \to E_{y} [F (Y)], h \to 0.

E [F (X^{h, y})] \to E_{y} [F (Y)], h \to 0.

G_{α, β} (u, v) = \frac{( β - u \lor v ) ( u \land v - α )}{β - α}, u, v \in [α, β]

G_{α, β} (u, v) = \frac{( β - u \lor v ) ( u \land v - α )}{β - α}, u, v \in [α, β]

E_{y} [H_{α, β} (Y)] = \int_{(α, β)} G_{α, β} (y, u) m (d u)

E_{y} [H_{α, β} (Y)] = \int_{(α, β)} G_{α, β} (y, u) m (d u)

E_{y} [H_{y - a, y + a} (Y)]

E_{y} [H_{y - a, y + a} (Y)]

= \int_{(y - a, y)} \frac{1}{2} (u - y + a) m (d u) + \int_{{y}} \frac{1}{2} a m (d u) + \int_{(y, y + a)} \frac{1}{2} (y + a - u) m (d u)

= \frac{1}{2} \int_{(y - a, y + a)} (a - ∣ u - y ∣) m (d u) .

y \in K sup E_{y} [H_{y - a_{h} (y), y + a_{h} (y)} (Y)] - h \in o (h), h \to 0,

y \in K sup E_{y} [H_{y - a_{h} (y), y + a_{h} (y)} (Y)] - h \in o (h), h \to 0,

\int_{(y - a_{h} (y), y + a_{h} (y))} (a_{h} (y) - ∣ u - y ∣) m (d u) = \int_{I} (a_{h} (y) - ∣ u - y ∣)^{+} m (d u) .

\int_{(y - a_{h} (y), y + a_{h} (y))} (a_{h} (y) - ∣ u - y ∣) m (d u) = \int_{I} (a_{h} (y) - ∣ u - y ∣)^{+} m (d u) .

a_{h} (y) = sup {a \geq 0 : y \pm a \in I and \frac{1}{2} \int_{(y - a, y + a)} (a - ∣ z - y ∣) m (d z) \leq h}

a_{h} (y) = sup {a \geq 0 : y \pm a \in I and \frac{1}{2} \int_{(y - a, y + a)} (a - ∣ z - y ∣) m (d z) \leq h}

\frac{1}{2} \int_{(y - a_{h} (y), y + a_{h} (y))} (a_{h} (y) - ∣ z - y ∣) m (d z) = h .

\frac{1}{2} \int_{(y - a_{h} (y), y + a_{h} (y))} (a_{h} (y) - ∣ z - y ∣) m (d z) = h .

K ∋ y \mapsto \frac{1}{2} \int_{I} (a_{0} - ∣ u - y ∣)^{+} m (d u) \in (0, \infty)

K ∋ y \mapsto \frac{1}{2} \int_{I} (a_{0} - ∣ u - y ∣)^{+} m (d u) \in (0, \infty)

d Y_{t} = η (Y_{t}) d W_{t},

d Y_{t} = η (Y_{t}) d W_{t},

η (x) \neq = 0 \forall x \in I^{\circ},

η (x) \neq = 0 \forall x \in I^{\circ},

η^{- 2} \in L_{loc}^{1} (I^{\circ})

m (d x) = \frac{2}{η ^{2} ( x )} d x .

m (d x) = \frac{2}{η ^{2} ( x )} d x .

\int_{(y - a_{h} (y), y + a_{h} (y))} (a_{h} (y) - ∣ u - y ∣) m (d u) = 2 a_{h} (y)^{2} \int_{- 1}^{1} \frac{1 - ∣ z ∣}{η ^{2} ( y + a _{h} ( y ) z )} d z .

\int_{(y - a_{h} (y), y + a_{h} (y))} (a_{h} (y) - ∣ u - y ∣) m (d u) = 2 a_{h} (y)^{2} \int_{- 1}^{1} \frac{1 - ∣ z ∣}{η ^{2} ( y + a _{h} ( y ) z )} d z .

h \to 0 lim (y \in K sup \frac{a _{h} ( y ) ^{2}}{h} \int_{- 1}^{1} \frac{1 - ∣ z ∣}{η ^{2} ( y + a _{h} ( y ) z )} d z - 1) = 0.

h \to 0 lim (y \in K sup \frac{a _{h} ( y ) ^{2}}{h} \int_{- 1}^{1} \frac{1 - ∣ z ∣}{η ^{2} ( y + a _{h} ( y ) z )} d z - 1) = 0.

\int_{- 1}^{1} \frac{1 - ∣ z ∣}{η ^{2} ( y + a z )} d z = \frac{1}{( σ y ) ^{2}} \int_{- 1}^{1} \frac{1 - ∣ z ∣}{( 1 + a z / y ) ^{2}} d z = - \frac{1}{( σ a ) ^{2}} lo g (1 - \frac{a ^{2}}{y ^{2}}) .

\int_{- 1}^{1} \frac{1 - ∣ z ∣}{η ^{2} ( y + a z )} d z = \frac{1}{( σ y ) ^{2}} \int_{- 1}^{1} \frac{1 - ∣ z ∣}{( 1 + a z / y ) ^{2}} d z = - \frac{1}{( σ a ) ^{2}} lo g (1 - \frac{a ^{2}}{y ^{2}}) .

h \to 0 lim (y \in K sup \frac{1}{h σ ^{2}} lo g (1 - \frac{a _{h} ( y ) ^{2}}{y ^{2}}) + 1) = 0.

h \to 0 lim (y \in K sup \frac{1}{h σ ^{2}} lo g (1 - \frac{a _{h} ( y ) ^{2}}{y ^{2}}) + 1) = 0.

X_{0}^{E u, h} = y and X_{(k + 1) h}^{E u, h} = X_{k h}^{E u, h} + η (X_{k h}^{E u, h}) (W_{(k + 1) h} - W_{k h}), for k \in N_{0} .

X_{0}^{E u, h} = y and X_{(k + 1) h}^{E u, h} = X_{k h}^{E u, h} + η (X_{k h}^{E u, h}) (W_{(k + 1) h} - W_{k h}), for k \in N_{0} .

y \in K sup \int_{- 1}^{1} \frac{η ^{2} ( y ) ( 1 - ∣ z ∣ )}{η ^{2} ( y + h η ( y ) z )} d z - 1 = y \in K sup \int_{- 1}^{1} \frac{η ^{2} ( y ) - η ^{2} ( y + h η ( y ) z )}{η ^{2} ( y + h η ( y ) z )} (1 - ∣ z ∣) d z \to 0,

y \in K sup \int_{- 1}^{1} \frac{η ^{2} ( y ) ( 1 - ∣ z ∣ )}{η ^{2} ( y + h η ( y ) z )} d z - 1 = y \in K sup \int_{- 1}^{1} \frac{η ^{2} ( y ) - η ^{2} ( y + h η ( y ) z )}{η ^{2} ( y + h η ( y ) z )} (1 - ∣ z ∣) d z \to 0,

∣ η (y) - η (y + h η (y) z) ∣ \leq ε .

∣ η (y) - η (y + h η (y) z) ∣ \leq ε .

y \in K sup \int_{- 1}^{1} \frac{η ^{2} ( y ) - η ^{2} ( y + h η ( y ) z )}{η ^{2} ( y + h η ( y ) z )} (1 - ∣ z ∣) d z \leq C ε .

y \in K sup \int_{- 1}^{1} \frac{η ^{2} ( y ) - η ^{2} ( y + h η ( y ) z )}{η ^{2} ( y + h η ( y ) z )} (1 - ∣ z ∣) d z \leq C ε .

l_{h} := l + in f {a \in (0, \frac{r - l}{2}] : a < \infty and \frac{1}{2} \int_{(l, l + 2 a)} (a - ∣ u - (l + a) ∣) m (d u) \geq h},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

A functional limit theorem for

coin tossing Markov chains

Stefan Ankirchner Stefan Ankirchner, Institute of Mathematics, University of Jena, Ernst-Abbe-Platz 2, 07745 Jena, Germany. Email: [email protected], Phone: +49 (0)3641 946275.

Thomas Kruse Institute of Mathematics, University of Gießen, Arndtstr. 2, 35392 Gießen, Germany. Email: [email protected], Phone: +49 (0)641 9932102.

Mikhail Urusov Mikhail Urusov, Faculty of Mathematics, University of Duisburg-Essen, Thea-Leymann-Str. 9, 45127 Essen, Germany. Email: [email protected], Phone: +49 (0)201 1837428.

Abstract

We prove a functional limit theorem for Markov chains that, in each step, move up or down by a possibly state dependent constant with probability $1/2$ , respectively. The theorem entails that the law of every one-dimensional regular continuous strong Markov process in natural scale can be approximated with such Markov chains arbitrarily well. The functional limit theorem applies, in particular, to Markov processes that cannot be characterized as solutions to stochastic differential equations. Our results allow to practically approximate such processes with irregular behavior; we illustrate this with Markov processes exhibiting sticky features, e.g., sticky Brownian motion and a Brownian motion slowed down on the Cantor set.

Keywords: one-dimensional Markov process; speed measure; Markov chain approximation; functional limit theorem; sticky Brownian motion; sticky reflection; slow reflection; Brownian motion slowed down on the Cantor set.

2010 MSC: Primary: 60F17; 60J25; 60J60. Secondary: 60H35; 60J22.

Introduction

Let $(\xi_{k})_{k\in\mathbb{N}}$ be an iid sequence of random variables, on a probability space with a measure $P$ , satisfying $P(\xi_{1}=\pm 1)=\frac{1}{2}$ . Given $y\in\mathbb{R}$ , $h\in(0,\infty)$ and a function $a_{h}:\mathbb{R}\to\mathbb{R}$ , we denote by $(X^{h}_{kh})_{k\in\mathbb{N}_{0}}$ the Markov chain defined by

[TABLE]

We choose as the Markov chain’s index set the set of non-negative multiples of $h$ because we interpret $h$ as the length of a time step. We extend $(X^{h}_{kh})_{k\in\mathbb{N}_{0}}$ to a continuous-time process by linear interpolation, i.e., we set

[TABLE]

Let $\overline{h}\in(0,\infty)$ and let $(a_{h})_{h\in(0,\overline{h})}$ be a family of real functions and $(X^{h})_{h\in(0,\overline{h})}$ the associated family of extended Markov chains defined as in (2). A fundamental problem of probability theory is to find conditions on $(X^{h})_{h\in(0,\overline{h})}$ such that the laws of the processes $X^{h}$ , $h\in(0,\overline{h})$ , converge in some sense as $h\to 0$ . In this article we provide an asymptotic condition on the family $(a_{h})_{h\in(0,\overline{h})}$ guaranteeing that the laws of the processes $X^{h}$ , $h\in(0,\overline{h})$ , converge as $h\to 0$ to the law of a one-dimensional regular continuous strong Markov process (in the sense of Section VII.3 in [39] or Section V.7 in [40]). In what follows we use the term general diffusions for the latter class of processes. Recall that a general diffusion $Y=(Y_{t})_{t\in[0,\infty)}$ has a state space that is an open, half-open or closed interval $I\subseteq\mathbb{R}$ . We denote by $I^{\circ}=(l,r)$ the interior of $I$ , where $-\infty\leq l<r\leq\infty$ . Moreover, the law of any general diffusion is uniquely characterized by its speed measure $m$ on $I$ , its scale function and its boundary behavior. Throughout the introduction we assume that $Y$ is in natural scale and that every accessible boundary point is absorbing (see the beginning of Section 1 and Section 6 on how to incorporate diffusions in general scale and with reflecting boundary points). This setting covers, in particular, solutions of driftless SDEs with discontinuous and fast growing diffusion coefficient (see Section 2) and also diffusions with sticky features (see Section 7), which cannot be modeled by SDEs whenever a sticky point is located in the interior of the state space.

Our main result, Theorem 1.1, shows that if a family of functions $(a_{h})_{h\in(0,\overline{h})}$ satisfies for all $y\in I^{\circ}$ , $h\in(0,\overline{h})$ the equation

[TABLE]

with a precision of order $o(h)$ uniformly in $y$ over compact subsets of $I^{\circ}$ (see Condition (A) below for a precise statement), then the associated family $(X^{h})_{h\in(0,\overline{h})}$ converges in distribution, as $h\to 0$ , to the general diffusion $Y$ with speed measure $m$ . We show that for every general diffusion a family of functions $(a_{h})_{h\in(0,\overline{h})}$ satisfying (3) exists implying that every general diffusion can be approximated by a Markov chain of the form (1). Equation (3) dictates how to compute the functions $(a_{h})_{h\in(0,\overline{h})}$ and therefore paves the way to approximate the distribution of a general diffusion numerically (see, e.g., Section 8).

The central idea in the derivation of Equation (3) is to embed for every $h\in(0,\overline{h})$ the Markov chain $(X^{h}_{kh})_{k\in\mathbb{N}_{0}}$ into $Y$ with a sequence of stopping times. To explain this idea assume for the moment that the state space is $I=\mathbb{R}$ . For every $h\in(0,\overline{h})$ let $\tau^{h}_{0}=0$ and then recursively define $\tau^{h}_{k+1}$ as the first time $Y$ exits the interval $(Y_{\tau^{h}_{k}}-a_{h}(Y_{\tau^{h}_{k}}),Y_{\tau^{h}_{k}}+a_{h}(Y_{\tau^{h}_{k}}))$ after $\tau^{h}_{k}$ . It follows that the discrete-time process $(Y_{\tau^{h}_{k}})_{k\in\mathbb{N}_{0}}$ has the same law as the Markov chain $(X^{h}_{kh})_{k\in\mathbb{N}_{0}}$ . Instead of controlling now directly the spatial errors $|Y_{\tau^{h}_{k}}-Y_{kh}|$ , we first analyze the temporal errors $|\tau^{h}_{k}-kh|$ , $k\in\mathbb{N}_{0}$ . We show that for every $y\in\mathbb{R}$ , $a\in[0,\infty)$ the expected time it takes $Y$ started in $y$ to leave the interval $(y-a,y+a)$ is equal to $\frac{1}{2}\int_{(y-a,y+a)}(a-|u-y|)\,m(du)$ . In particular, if $a_{h}$ satisfies (3) for all $y\in I$ , it follows that for all $k\in\mathbb{N}_{0}$ the time lag $\tau^{h}_{k+1}-\tau^{h}_{k}$ between two consecutive stopping times is in expectation equal to $h$ . In this case we refer to $(X^{h}_{kh})_{k\in\mathbb{N}_{0}}$ as Embeddable Markov Chain with Expected time Lag $h$ (we write shortly $(X^{h}_{kh})_{k\in\mathbb{N}_{0}}\in\text{EMCEL}(h)$ ).

For some diffusions $Y$ one can construct EMCEL approximations explicitly (see, e.g., Section 7). For cases where (3) cannot be solved in closed form, we perform a perturbation analysis and show that it suffices to find for all $h\in\overline{h}$ , $y\in I^{\circ}$ a number $a_{h}(y)$ satisfying (3) with an error of order $o(h)$ uniformly in $y$ belonging to compact subsets of $I^{\circ}$ . We prove that for the associated stopping times $(\tau^{h}_{k})_{k\in\mathbb{N}_{0}}$ the temporal errors $|\tau^{h}_{k}-kh|$ , $k\in\mathbb{N}_{0}$ , converge to [math] as $h\to 0$ in every $L^{\alpha}$ -space, $\alpha\in[1,\infty)$ . This ultimately implies convergence of $(X^{h})_{h\in(0,\overline{h})}$ to $Y$ in distribution as $h\to 0$ .

To illustrate the benefit of the perturbation analysis, we construct in Section 8 approximations for a Brownian motion slowed down on the Cantor set (see Figure 3). Moreover, we note that our main result, Theorem 1.1, is not only applicable to perturbations of the EMCEL approximation but can also be used to derive new convergence results for other approximation methods such as, e.g., weak Euler schemes (see Corollary 2.3).

The idea to use embeddings in order to prove a functional limit theorem goes back to Skorokhod. In the seminal book [42] scaled random walks are embedded into Brownian motion in order to prove Donsker’s invariance principle. In [5] we embed Markov chains into the solution process of an SDE and prove a functional limit theorem where the limiting law is that of the SDE. In [42] and [5] the approximating Markov chains have to be embeddable with a sequence of stopping times $(\tau_{k})_{k\in\mathbb{N}_{0}}$ such that the expected distance between two consecutive stopping times is exactly equal to $h$ , the time discretization parameter. In contrast, in the present article we require that the expected distance between consecutive embedding stopping times is only approximately equal to $h$ . We show that for the convergence of the laws it is sufficient to require that the difference of the expected distance and $h$ is of the order $o(h)$ . Moreover, compared to [5], we allow for a larger class of limiting distributions. Indeed, our setting includes processes that cannot be characterized as the solution of an SDE, e.g., diffusions with sticky points.

There are further articles in the literature using random time grids to approximate a Markov process, under the additional assumption that it solves a one-dimensional SDE. In [17] the authors first fix a finite grid in the state space of the diffusion. Then they construct a Bernoulli random walk on this grid that can be embedded into the diffusion. The authors determine the expected time for attaining one of the neighboring points by solving a PDE.

[37] describes a similar approximation method for the Cox-Ingersoll-Ross (CIR) process. Also here the authors first fix a grid on $[0,\infty)$ and then construct a random walk on the grid that can be embedded into the CIR process. In contrast to [17], the authors in [37] compute the distributions of the embedding stopping times (and not only their expected value) by solving a parabolic PDE. In the numerical implementation of the scheme the authors then draw the random time increments from these distributions and thereby obtain a scheme that is exact along a sequence of stopping times. Note that in contrast to [17] and [37], in our approach the space grid is not fixed a priori. Instead, we approximately fix the expected time lag between the consecutive embedding stopping times.

Yet a further scheme that uses a random time partition to approximate a diffusion $Y$ with discontinuous coefficients is suggested in [36]. In contrast to our approach the distribution of the time increments is fixed there. More precisely, the authors of [36] use the fact that the distribution of $Y$ sampled at an independent exponential random time is given by the resolvent of the process. Consequently, if it is possible to generate random variables distributed according to the resolvent kernel, one obtains an exact simulation of $Y$ at an exponentially distributed time. Iterating this procedure and letting the parameter of the exponential distribution go to infinity provides an approximation of $Y$ .

We remark that embeddings along random time grids have been recently employed in [20] and [21] in order to obtain convergence rates of (F)BSDE approximations driven by Bernoulli increments.

Recall that, while approximating solutions of SDEs on deterministic time grids usually employs Euler-type schemes (1) with Gaussian increments $(\xi_{k})_{k\in\mathbb{N}}$ , we use Bernoulli increments in our paper. In this connection, we would like to mention that, from the numerical perspective, convergence results along equidistant time grids, including approximations by the weak Euler schemes with Bernoulli increments, can be found, e.g., in Section 14.1 of [32]. From the more theoretical perspective, we refer to Theorem 7.4.1 in [16] and Theorem IX.4.8 in [28], which are some general functional limit theorems of the Trotter-Kato type for approximating diffusions. A discussion of how to approximate controlled diffusions by Markov chains with Bernoulli increments can be found in [35]. Another perspective on schemes with Bernoulli increments is suggested in [11] and [22], where, on certain machines (like field programmable gate arrays), such schemes are shown to be more efficient for simulation algorithms.

While in our paper a continuous-time Markov process is approximated via (linearly-interpolated) discrete-time Markov chains, there is an alternative approach, pioneered in [43], where the approximating processes are themselves continuous-time Markov processes. For a recent account, see [10] and references therein. A generalization of the latter approach for variable-speed random walks on trees contained in [6] is worth mentioning as well.

The article is organized as follows. In Section 1 we rigorously formulate and discuss the functional limit theorem. In Section 2 we discuss some of its implications for diffusions that can be described as solution of SDEs. In Sections 3 and 4 we explain, for a given general diffusion, how to embed an approximating coin tossing Markov chain into the diffusion and prove some properties of the embedding stopping times. Section 5 provides the proof of the functional limit theorem, where we, in particular, need the material discussed in Sections 3 and 4. The functional limit theorem is shown under the additional assumption that if a boundary point is attainable, then it is absorbing. In Section 6 we explain how one can extend the functional limit theorem to general diffusions with reflecting boundary points. In the last two sections we illustrate our main result with diffusions exhibiting some stickiness. In Section 7 we construct coin tossing Markov chains approximating sticky Brownian motion, with and without reflection, respectively. In Section 8 we first describe a Brownian motion that is slowed down on the Cantor set, and secondly we explicitly construct coin tossing Markov chains that approximate this process arbitrarily well.

1 Approximating general diffusions with Markov chains

Let $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\geq 0},(P_{y})_{y\in I},(Y_{t})_{t\geq 0})$ be a one-dimensional continuous strong Markov process in the sense of Section VII.3 in [39]. We refer to this class of processes as general diffusions in the sequel. We assume that the state space is an open, half-open or closed interval $I\subseteq\mathbb{R}$ . We denote by $I^{\circ}=(l,r)$ the interior of $I$ , where $-\infty\leq l<r\leq\infty$ , and we set $\overline{I}=[l,r]$ . Recall that by the definition we have $P_{y}[Y_{0}=y]=1$ for all $y\in I$ . We further assume that $Y$ is regular. This means that for every $y\in I^{\circ}$ and $x\in I$ we have that $P_{y}[H_{x}(Y)<\infty]>0$ , where $H_{x}(Y)=\inf\{t\geq 0:Y_{t}=x\}$ . If there is no ambiguity, we simply write $H_{x}$ in place of $H_{x}(Y)$ . Moreover, for $a<b$ in $\overline{I}$ we denote by $H_{a,b}=H_{a,b}(Y)$ the first exit time of $Y$ from $(a,b)$ , i.e. $H_{a,b}=H_{a}\wedge H_{b}$ . Without loss of generality we suppose that the diffusion $Y$ is in natural scale. If $Y$ is not in natural scale, then there exists a strictly increasing continuous function $s:I\to\mathbb{R}$ , the so-called scale function, such that $s(Y_{t})$ , $t\geq 0$ , is in natural scale. Let $m$ be the speed measure of the Markov process $Y$ (see VII.3.7 and VII.3.10 in [39]).111There are different conventions concerning the normalization of the speed measure. We follow the convention of [39] and [8] and note that our speed measure is thus twice as large as the one for example found in [40]. Recall that for all $a<b$ in $I^{\circ}$ we have

[TABLE]

Finally, we also assume that if a boundary point is accessible, then it is absorbing. We drop this assumption in Section 6, where we extend our approximation method to Markov processes with reflecting boundaries. The extension works for both instantaneous and slow reflection.

Let $\overline{h}\in(0,\infty)$ and suppose that for every $h\in(0,\overline{h})$ we are given a measurable function $a_{h}\colon\overline{I}\to[0,\infty)$ such that $a_{h}(l)=a_{h}(r)=0$ and for all $y\in I^{\circ}$ we have $y\pm a_{h}(y)\in I$ . We refer to each function $a_{h}$ as a scale factor. We next construct a sequence of Markov chains associated to the family of scale factors $(a_{h})_{h\in(0,\overline{h})}$ . To this end we fix a starting point $y\in I^{\circ}$ of $Y$ . Let $(\xi_{k})_{k\in\mathbb{N}}$ be an iid sequence of random variables, on a probability space with a measure $P$ , satisfying $P(\xi_{k}=\pm 1)=\frac{1}{2}$ . We denote by $(X^{h}_{kh})_{k\in\mathbb{N}_{0}}$ the Markov chain defined by

[TABLE]

We extend $(X^{h}_{kh})_{k\in\mathbb{N}_{0}}$ to a continuous-time process by linear interpolation, i.e., for all $t\in[0,\infty)$ , we set

[TABLE]

To highlight the dependence of $X^{h}=(X^{h}_{t})_{t\in[0,\infty)}$ on the starting point $y\in I^{\circ}$ we also sometimes write $X^{h,y}$ .

To formulate our main result we need the following condition.

Condition (A) For all compact subsets $K$ of $I^{\circ}$ it holds that

[TABLE]

Theorem 1.1.

*Assume that Condition (A) is satisfied. Then, for any $y\in I^{\circ}$ , the distributions of the processes $(X^{h,y}_{t})_{t\in[0,\infty)}$ under $P$ converge weakly to the distribution of $(Y_{t})_{t\in[0,\infty)}$ under $P_{y}$ , as $h\to 0$ ; i.e., for every bounded and continuous functional222As usual, we equip $C([0,\infty),\mathbb{R})$ with the topology of uniform convergence on compact intervals, which is generated, e.g., by the metric

$d(x,y)=\sum_{n=1}^{\infty}2^{-n}\left(\|x-y\|_{C[0,n]}\wedge 1\right),\quad x,y\in C([0,\infty),\mathbb{R}),$

where $\|\cdot\|_{C[0,n]}$ denotes the sup norm in $C([0,n],\mathbb{R})$ . $F\colon C([0,\infty),\mathbb{R})\to\mathbb{R}$ , it holds that*

[TABLE]

Remark 1.2.

To better explain Condition (A), for every $\alpha<\beta$ in $I$ , we introduce the Green function $G_{\alpha,\beta}\colon[\alpha,\beta]^{2}\to\mathbb{R}$ of $Y$ by the formula

[TABLE]

(recall that $Y$ is in natural scale) and observe that, for all $y\in[\alpha,\beta]$ ,

[TABLE]

(see, e.g., Section VII.3 in [39]). It follows that, for any $y\in I^{\circ}$ and $a>0$ such that $y\pm a\in I$ , it holds

[TABLE]

Thus, Condition (A) is an analytic condition that is equivalent to requiring that the scale factors $(a_{h})_{h\in(0,\overline{h})}$ satisfy

[TABLE]

for any compact subset $K$ of $I^{\circ}$ .

Remark 1.3.

It is worth noting that Condition (A) is, in fact, nearly necessary for weak convergence (8) (see Example 2.1).

Remark 1.4.

For all $y\in I^{\circ}$ , $h\in(0,\overline{h})$ it holds that

[TABLE]

This yields an alternative representation of Condition (A) which is occasionally used below.

It is important to note that for every speed measure $m$ there exists a family of scale factors such that Condition (A) is satisfied and hence every general diffusion $Y$ can be approximated by Markov chains of the form (5). Indeed, for all $y\in I^{\circ}$ , $h\in(0,\overline{h})$ let $\widehat{a}_{h}(l)=\widehat{a}_{h}(r)=0$ and

[TABLE]

and denote by $(\widehat{X}^{h})_{h\in(0,\overline{h})}$ the associated family of processes defined in (5) and (6). Then the proof of Corollary 1.5 below shows that for all compact subsets $K$ of $I^{\circ}$ there exists $h_{0}\in(0,\overline{h})$ such that for all $y\in K$ , $h\in(0,h_{0})$ it holds that

[TABLE]

In particular, the family $(\widehat{a}_{h})_{h\in(0,\overline{h})}$ satisfies Condition (A) and we show in Section 3 below that the Markov chain $(\widehat{X}^{h}_{kh})_{k\in\mathbb{N}_{0}}$ is embeddable into $Y$ with a sequence of stopping times with expected time lag $h$ . We refer to $(\widehat{X}^{h}_{t})_{t\in[0,\infty)}$ , $h\in(0,\overline{h})$ , as EMCEL approximations and write shortly $(\widehat{X}^{h}_{kh})_{k\in\mathbb{N}_{0}}\in\text{EMCEL}(h)$ .

Corollary 1.5.

For every $y\in I^{\circ}$ the distributions of the EMCEL approximations $(\widehat{X}^{h,y}_{t})_{t\in[0,\infty)}$ under $P$ converge weakly to the distribution of $(Y_{t})_{t\in[0,\infty)}$ under $P_{y}$ as $h\to 0$ .

Proof.

Let $K$ be a compact subset of $I^{\circ}$ . Without loss of generality assume that $K=[l_{0},r_{0}]$ with $l<l_{0}<r_{0}<r$ . Let $a_{0}=\frac{r-r_{0}}{2}\wedge\frac{l_{0}-l}{2}\wedge 1$ . It follows with dominated convergence that the function

[TABLE]

is continuous. In particular, it is bounded away from zero, i.e., there exists $h_{0}\in(0,\overline{h})$ such that for all $y\in K$ it holds that $\frac{1}{2}\int_{I}(a_{0}-|u-y|)^{+}\,m(du)\geq h_{0}$ . Next observe that for all $y\in K$ the function $[0,a_{0}]\ni a\mapsto\frac{1}{2}\int_{I}(a-|u-y|)^{+}\,m(du)\in[0,\infty)$ is continuous and strictly increasing. Hence for all $y\in K$ , $h\in(0,h_{0})$ the supremum in (11) is a maximum and it holds that $\frac{1}{2}\int_{(y-\widehat{a}_{h}(y),y+\widehat{a}_{h}(y))}(\widehat{a}_{h}(y)-|u-y|)\,m(du)=h.$ In particular, Condition (A) is satisfied and the statement of Corollary 1.5 follows from Theorem 1.1. ∎

2 Application to SDEs

A particular case of our setting is the case, where $Y$ is a solution to the driftless SDE

[TABLE]

where $\eta\colon I^{\circ}\to\mathbb{R}$ is a Borel function satisfying the Engelbert-Schmidt conditions

[TABLE]

( $L^{1}_{\mathrm{loc}}(I^{\circ})$ denotes the set of Borel functions locally integrable on $I^{\circ}$ ). Under (13)–(14) SDE (12) has a unique in law weak solution (see [15] or Theorem 5.5.7 in [30]). This means that there exists a pair of processes $(Y,W)$ on a filtered probability space $(\Omega,\mathcal{F},(\mathcal{F}_{t}),P)$ , with $(\mathcal{F}_{t})$ satisfying the usual conditions, such that $W$ is an $(\mathcal{F}_{t})$ -Brownian motion and $(Y,W)$ satisfies SDE (12). The process $Y$ possibly reaches the endpoints $l$ or $r$ in finite time. By convention we force $Y$ to stay in $l$ (resp., $r$ ) in this case. This can be enforced in (12) by extending $\eta$ to $\overline{I}$ with $\eta(l)=\eta(r)=0$ . In this example $Y$ is a regular continuous strong Markov process with the state space being the interval with the endpoints $l$ and $r$ (whether $l$ and $r$ belong to the state space is determined by the behavior of $\eta$ near the boundaries). Moreover, $Y$ is in natural scale, and its speed measure on $I^{\circ}$ is given by the formula

[TABLE]

In this situation a change of variables shows that it holds for all $h\in(0,\overline{h})$ , $y\in I^{\circ}$ that

[TABLE]

Condition (A) hence becomes that for every compact subset $K$ of $I^{\circ}$ it holds that

[TABLE]

Example 2.1 (Brownian motion).

In the special case where $Y=W$ is a Brownian motion (i.e., $I=\mathbb{R}$ , $\eta(x)\equiv 1$ ), Condition (A) requires that for all compact sets $K\subset\mathbb{R}$ it holds that $\sup_{y\in K}\left|\frac{a_{h}(y)^{2}}{h}-1\right|\to 0$ as $h\to 0$ . In particular, Condition (A) is satisfied for the choice $a_{h}(y)=\sqrt{h}$ , $h\in(0,\infty)$ , $y\in\mathbb{R}$ , and we recover from Theorem 1.1 Donsker’s functional limit theorem for the scaled simple random walk.

Moreover, in the case of a Brownian motion it is natural to restrict ourselves to space-homogeneous (i.e., constant) scale factors $a_{h}(y)\equiv a_{h}$ , $h\in(0,\overline{h})$ , so that Condition (A) takes the form $\lim_{h\to 0}\frac{a_{h}^{2}}{h}=1$ . It is straightforward to show that the latter condition is also necessary for the weak convergence of approximations (5)–(6) driven by space-homogeneous scale factors to the Brownian motion.

Example 2.2 (Geometric Brownian motion).

Let $\sigma>0$ and assume that $\eta$ satisfies for all $x\in(0,\infty)$ that $\eta(x)=\sigma x$ . Then the solution $Y$ of (12) with positive initial value $Y_{0}=y\in(0,\infty)$ is a geometric Brownian motion. Its state space is $I=(0,\infty)$ and both boundary points are inaccessible. Note that for all $y\in(0,\infty)$ , $a\in(0,y)$ it holds that

[TABLE]

Hence, Condition (A) requires that for all compact sets $K\subset(0,\infty)$ it holds that

[TABLE]

To obtain the EMCEL approximation of $Y$ we solve for all $y\in(0,\infty)$ , $h\in(0,\infty)$ the equation $\frac{1}{h\sigma^{2}}\log\left(1-\frac{a^{2}}{y^{2}}\right)+1=0$ in $a$ and obtain $\widehat{a}_{h}(y)=y\sqrt{1-e^{-\sigma^{2}h}}$ . Note that also the usual choice $a_{h}(y)=\sqrt{h}\sigma y$ , $y\in(0,\infty)$ , $h\in(0,1/\sigma^{2})$ , which corresponds to the weak Euler scheme for geometric Brownian motion, satisfies (17).

Convergence of the weak Euler scheme

Throughout this subsection we assume that $I=\mathbb{R}$ . A common method to approximate solutions of SDEs is the Euler scheme. For equations of the form (12) with initial condition $Y_{0}=y$ the Euler scheme $(X^{Eu,h}_{kh})_{k\in\mathbb{N}_{0}}$ with time step $h\in(0,\infty)$ is given by

[TABLE]

Weak Euler schemes are variations of the Euler scheme, where the normal increments $W_{(k+1)h}-W_{kh}$ , $k\in\mathbb{N}_{0}$ , are replaced by an iid sequence of centered random variables with variance $h$ . Therefore, with the choice $a_{h}(y)=\sqrt{h}\eta(y)$ , $h\in(0,\infty)$ , $y\in\mathbb{R}$ , the Markov chain $(X^{h}_{kh})_{k\in\mathbb{N}_{0}}$ defined in (5) represents a weak Euler scheme with Rademacher increments.

In this subsection we show how Theorem 1.1 can be used to derive new convergence results for weak Euler schemes. To this end let the setting of Section 2 be given and let $a_{h}(y)=\sqrt{h}\eta(y)$ , $h\in(0,\infty)$ , $y\in\mathbb{R}$ . Then it follows from (16) that Condition (A) is equivalent to assuming that for every compact subset $K\subset\mathbb{R}$ we have

[TABLE]

as $h\to 0$ .

Suppose that $\eta$ is continuous, let $K\subset\mathbb{R}$ be compact and let $\varepsilon>0$ . Then $\eta$ is bounded on $K$ and since every continuous function is uniformly continuous on compact sets, we obtain that there exists $h_{0}\in(0,\infty)$ such that for all $h\in(0,h_{0}]$ , $y\in K$ , $z\in[-1,1]$ it holds that

[TABLE]

By (13) and the continuity of $\eta$ the function $\eta^{2}$ is strictly bounded away from [math] on every compact subset of $\mathbb{R}$ and hence we obtain that there exists $C\in[0,\infty)$ such that for all $h\in(0,h_{0}]$ it holds that

[TABLE]

It follows with (18) that Condition (A) is satisfied. Therefore we obtain the following Corollary of Theorem 1.1.

Corollary 2.3.

Assume the setting of Section 2 with $I=\mathbb{R}$ and that $\eta$ is continuous. Let $a_{h}\colon\mathbb{R}\to\mathbb{R}$ satisfy $a_{h}(y)=\sqrt{h}\eta(y)$ for all $h\in(0,\infty)$ , $y\in\mathbb{R}$ . Then for all $y\in\mathbb{R}$ the distributions of the processes $(X^{h,y}_{t})_{t\in[0,\infty)}$ under $P$ converge weakly to the distribution of $(Y_{t})_{t\in[0,\infty)}$ under $P_{y}$ , as $h\to 0$ .

Remark 2.4.

Corollary 2.3 complements convergence results for the Euler scheme for example obtained in [45] and [25]. Theorem 2.2 in [45] shows weak convergence of the Euler scheme if $\eta$ has at most linear growth and is discontinuous on a set of Lebesgue measure zero. Theorem 2.3 in [25] establishes almost sure convergence of the Euler scheme if $\eta$ is locally Lipschitz continuous. Moreover, [25] allows for a multidimensional setting and a drift coefficient. In contrast, Corollary 2.3 above applies to the weak Euler scheme and does not require linear growth or local Lipschitz continuity of $\eta$ .

Remark 2.5.

As stated in Corollary 1.5, EMCEL approximations can be constructed for every general diffusion. In particular, they can be used in cases where $\eta$ is not continuous and where (weak) Euler schemes do not converge (see, e.g., Section 5.4 in [4]). In Sections 7 and 8 we consider further irregular examples.

3 Embedding the chains into the Markov process

In this section we construct the embedding stopping times. Throughout the section we assume the setting of Section 1.

We need to introduce an auxiliary subset of $I^{\circ}$ . To this end, if $l>-\infty$ , we define, for all $h\in(0,\overline{h})$ ,

[TABLE]

where we use the convention $\inf\emptyset=\infty$ . If $l=-\infty$ , we set $l_{h}=-\infty$ . Similarly, if $r<\infty$ , then we define, for all $h\in(0,\overline{h})$ ,

[TABLE]

If $r=\infty$ , we set $r_{h}=\infty$ . It is worth noting that $l$ is inaccessible if and only if $l_{h}=l$ for all $h\in(0,\overline{h})$ ; and, similarly, $r$ is inaccessible if and only if $r_{h}=r$ for all $h\in(0,\overline{h})$ . This is verified in Remark 4.2 below. The auxiliary subset is defined by

[TABLE]

Now we have everything at hand to start constructing a sequence of embedding stopping times. Suppose $Y$ starts at a point $y\in I^{\circ}$ and fix $h\in(0,\overline{h})$ . Set $\tau^{h}_{0}=0$ . Let $\sigma^{h}_{1}=H_{y-a_{h}(y),y+a_{h}(y)}$ . Recall that we have

[TABLE]

(see Remark 1.2). We now define $\tau^{h}_{1}$ by distinguishing two cases.

Case 1: $y\in I_{h}$ (i.e., $y\in(l_{h},r_{h})$ or $y\pm a_{h}(y)\in I^{\circ}$ ). In this case we set $\tau^{h}_{1}=\sigma^{h}_{1}$ .

Case 2: $y\notin I_{h}$ (i.e., $y\notin(l_{h},r_{h})$ and ( $y+a_{h}(y)=r$ or $y-a_{h}(y)=l$ )). In this case we deterministically extend $\sigma^{h}_{1}$ so as to make it have expectation $h$ . Observe that by the definition of $l_{h}$ and $r_{h}$ we have in this case $E_{y}[\sigma^{h}_{1}]\leq h$ . Moreover, we can assume in this case that it must hold that $P_{y}(Y_{\sigma^{N}_{1}}\in\{l,r\})=\frac{1}{2}$ (only in the case $\max\{|l|,|r|\}<\infty$ , $y=\frac{l+r}{2}$ and $a_{h}(y)=\frac{r-l}{2}$ this probability is $1$ , but we exclude this case by considering a sufficiently small $h$ , so that $a_{h}(\frac{l+r}{2})<\frac{r-l}{2}$ ; notice that Condition (A) implies that, for any $y\in I^{\circ}$ , $\lim_{h\to 0}a_{h}(y)=0$ ). We define $\tau^{h}_{1}$ by

[TABLE]

Observe that the definition implies $E_{y}[\tau^{h}_{1}]=h$ and that the three random variables $Y_{\tau^{h}_{1}}$ , $Y_{\sigma^{h}_{1}}$ and $X^{h,y}_{h}$ have all the same law.

We can proceed in a similar way to define the subsequent stopping times. Let $k\in\mathbb{N}$ . Suppose that we have already constructed $\tau^{h}_{k}$ . We first define $\sigma^{h}_{k+1}=\inf\{t\geq\tau^{h}_{k}:|Y_{t}-Y_{\tau^{h}_{k}}|=a_{h}(Y_{\tau^{h}_{k}})\}$ . On the event $\{Y_{\tau^{h}_{k}}\in I_{h}\}$ we set $\tau^{h}_{k+1}=\sigma^{h}_{k+1}$ . On the event $\{Y_{\tau^{h}_{k}}\notin I_{h}\}$ we extend $\sigma^{h}_{k+1}$ as follows. Note that $Y_{\tau^{h}_{k}}$ takes only finitely many values. Let $v\in I\setminus(l_{h},r_{h})$ be a possible value of $Y_{\tau^{h}_{k}}$ such that $v-a_{h}(v)=l$ or $v+a_{h}(v)=r$ . Consider the event $A=\{Y_{\tau^{h}_{k}}=v\}$ . Observe that $c:=E_{y}[\sigma^{h}_{k+1}-\tau^{h}_{k}|A]\leq h$ . We extend $\sigma^{h}_{k+1}$ on the event $A$ by setting

[TABLE]

(notice that $P_{y}(Y_{\sigma^{h}_{k+1}}\in\{l,r\}|A)=\frac{1}{2}$ ). This implies that $E_{y}[\tau^{h}_{k+1}-\tau^{h}_{k}|\mathcal{F}_{\tau^{h}_{k}}]=h$ on the event $\{Y_{\tau^{h}_{k}}\notin I_{h}\}$ . Moreover, the processes $(Y_{\tau^{h}_{j}})_{j\in\{0,\ldots,k+1\}}$ and $(X^{h,y}_{jh})_{j\in\{0,\ldots,k+1\}}$ have the same law. To sum up, we have the following.

Proposition 3.1.

For all $h\in(0,\overline{h})$ and $y\in I^{\circ}$ the sequence of stopping times $(\tau^{h}_{k})_{k\in\mathbb{N}_{0}}$ satisfies

$\operatorname{Law}_{P_{y}}\left(Y_{\tau^{h}_{k}};k\in\mathbb{N}_{0}\right)=\operatorname{Law}_{P}\left(X^{h,y}_{kh};k\in\mathbb{N}_{0}\right).$ ** 2. 2.

For all $k\in\mathbb{N}_{0}$ we have

[TABLE]

4 Higher moment estimates for exit times

In this section we provide some moment estimates for the exit times of $Y$ from intervals. We use the estimates in the next section to prove convergence in probability of $\sup_{k\in\{1,\ldots,\lfloor T/h\rfloor\}}\left|\tau^{h}_{k}-kh\right|$ to zero, where $(\tau^{h}_{k})_{k\in\mathbb{N}_{0}}$ is the sequence of embedding stopping times from Section 3. This is a crucial ingredient in the proof of our main result, Theorem 1.1.

We introduce the function $q\colon I^{\circ}\times\overline{I}\to[0,\infty]$ defined by

[TABLE]

where for $u<y$ we set $m((y,u)):=-m((u,y))$ . Notice that, for $y\in I^{\circ}$ , the function $q(y,\cdot)$ is decreasing on $[l,y]$ and increasing on $[y,r]$ . For our analysis the key property of $q$ is that it makes the process $q(y,Y_{t})-(t\wedge H_{l,r})$ , $t\in[0,\infty)$ , a $P_{y}$ -local martingale (see Lemma 4.1 below). Moreover, $q$ plays a central role in Feller’s test for explosions: for any $y\in I^{\circ}$ ,

[TABLE]

(see, e.g., Lemma 2.1 in [2] or Theorem 3.3 in [3]). Consequently, $q$ is finite on $I^{\circ}\times I$ . Notice that for all $y,z\in I^{\circ}$ and $x\in I$ we have

[TABLE]

where $\frac{\partial^{0}q}{\partial x}(y,x)=\frac{1}{2}(\frac{\partial^{+}q}{\partial x}+\frac{\partial^{-}q}{\partial x})(y,x)$ .

The following lemma identifies a local martingale associated to $Y$ .

Lemma 4.1.

Let $y\in I^{\circ}$ . Then the process $q(y,Y_{t})-(t\wedge H_{l,r})$ , $t\in[0,\infty)$ , is a $P_{y}$ -local martingale.

Proof.

Let $a,b\in I$ with $a<y<b$ . We first show that $q(y,Y_{t\wedge H_{a,b}})-t\wedge H_{a,b}$ , $t\in[0,\infty)$ , is a $P_{y}$ -martingale. It follows from (9) that

[TABLE]

where, in the third equality, we use the representation $\int_{(a,y)}1_{\{u<x\}}\,du$ for $x-a$ (and the similar one for $b-x$ ) and apply Fubini’s theorem. Thus,

[TABLE]

Next, we observe that for all $t\in[0,\infty)$ it holds

[TABLE]

On the event $\{H_{a,b}>t\}$ we have $q(y,Y_{H_{a,b}})-H_{a,b}=q(y,Y_{H_{a,b}})\circ\theta_{t}-H_{a,b}\circ\theta_{t}-t$ , where $\theta_{t}$ denotes the shift operator for $Y$ (see Chapter III in [39]). The Markov property and (24) imply that on the event $\{H_{a,b}>t\}$ we have $P_{y}$ -a.s.

[TABLE]

Formula (23) yields for all $z\in I^{\circ}$

[TABLE]

Since $E_{z}[Y_{H_{a,b}}-z]=0$ for all $z\in I^{\circ}$ , equation (4) implies that on the event $\{H_{a,b}>t\}$ we have $P_{y}$ -a.s.

[TABLE]

Together with (25) this yields for all $t\in[0,\infty)$

[TABLE]

which shows that $q(y,Y_{t\wedge H_{a,b}})-t\wedge H_{a,b}$ , $t\in[0,\infty)$ , is a $P_{y}$ -martingale.

The statement of the lemma follows via a localization argument. If $l\notin I$ , then choose a decreasing sequence $(l_{n})_{n\in\mathbb{N}}\subseteq I$ with $l_{1}<y$ and $\lim_{n\to\infty}l_{n}=l$ . If $l\in I$ , set $l_{n}=l$ for all $n\in\mathbb{N}$ . Similarly, if $r\notin I$ , then choose an increasing sequence $(r_{n})_{n\in\mathbb{N}}\subseteq I$ with $r_{1}>y$ and $\lim_{n\to\infty}r_{n}=r$ , and if $r\in I$ , then set $r_{n}=r$ for all $n\in\mathbb{N}$ . The sequence of stopping times $\inf\{t\geq 0\colon X_{t}\notin[l_{n},r_{n}]\}$ , $n\in\mathbb{N}$ , is then a localizing sequence for the process $q(y,Y_{t})-(t\wedge H_{l,r})$ , $t\in[0,\infty)$ . ∎

Remark 4.2.

We still owe verifying that, for $l_{h}$ and $r_{h}$ introduced in Section 3, $l$ is inaccessible if and only if $l_{h}=l$ for all $h\in(0,\overline{h})$ ; and $r$ is inaccessible if and only if $r_{h}=r$ for all $h\in(0,\overline{h})$ . Let us prove this equivalence for the left boundary point. To this end, the following identity is helpful: for $y\in I^{\circ}$ and $a\in(0,\infty)$ such that $y\pm a\in\overline{I}$ , it holds

[TABLE]

Indeed, if $y\pm a\in I$ , then (27) follows from (10) and (24). If we only have $y\pm a\in\overline{I}$ , then (27) follows by the monotone convergence argument.

Now, if $l=-\infty$ , then the equivalence is clear. Let $l>-\infty$ . Consider a small enough $a>0$ . For the integral in the definition of $l_{h}$ , we have due to (27)

[TABLE]

where $l+a,l+2a\in I^{\circ}$ , and hence the second term on the right-hand side is finite, i.e., the integral is infinite if and only if the first term on the right-hans side is infinite. We conclude by applying Feller’s test (21): $l$ is inaccessible if and only if the integral in (28) is infinite for all small $a>0$ , and the latter holds if and only if $l_{h}=l$ for all $h\in(0,\overline{h})$ .

The next result provides conditions guaranteeing that moments of a stopping time $\tau$ can be bounded against an integral with respect to the distribution of $Y_{\tau}$ .

Theorem 4.3.

Let $\alpha\in[1,\infty)$ and let $y\in I$ . Let $\tau$ be a stopping time such that $\tau\leq H_{l,r}(Y)$ , $P_{y}$ -a.s. and the process $(q^{\alpha}(y,Y_{\tau\wedge t}))_{t\in[0,\infty)}$ is of class (D) under $P_{y}$ . Then $\tau<\infty$ , $P_{y}$ -a.s. and it holds that

[TABLE]

Proof.

If $y\in I\setminus I^{\circ}$ , then $\tau=0$ and (29) is satisfied. For the remainder of the proof we assume that $y\in I^{\circ}$ . We first show by contradiction that $\tau<\infty$ $P_{y}$ -a.s. So assume that $P_{y}(\tau=\infty)>0$ . Since on $\{\tau=\infty\}$ we necessarily have $\tau=H_{l,r}(Y)$ , we obtain

[TABLE]

For any $n\in\mathbb{N}$ let $\sigma_{n}=\inf\{t\in[0,\infty):q^{\alpha}(y,Y_{\tau\wedge t})\geq n\}$ . Then we obtain that

[TABLE]

which contradicts the uniform integrability of $\{q^{\alpha}(y,Y_{\sigma_{n}\wedge\tau})\}_{n\in\mathbb{N}}$ . Thus, we proved that $\tau<\infty$ $P_{y}$ -a.s. and, in particular, that $Y_{\tau}$ on the right-hand side of (29) is well-defined.

According to Lemma 4.1 the process $N_{t}:=q(y,Y_{t})-(t\wedge H_{l,r}(Y))$ , $t\geq 0$ , is a $P_{y}$ -local martingale. The product formula yields for all $t\in[0,H_{l,r}(Y)]$

[TABLE]

Note that $(\int_{0}^{t}s^{\alpha-1}dN_{s})_{t\geq 0}$ is a local martingale and let $(\tau^{\prime}_{n})_{n\in\mathbb{N}}$ be a localizing sequence for it. Set $\tau_{n}:=n\wedge\tau^{\prime}_{n}$ for all $n\in\mathbb{N}$ . In particular, it holds $E_{y}[\tau_{n}^{\alpha}]<\infty$ for all $n\in\mathbb{N}$ . With inequality (30) and Hölder’s inequality we obtain for all $n\in\mathbb{N}$ that

[TABLE]

This implies for all $n\in\mathbb{N}$

[TABLE]

By monotone convergence the left-hand side converges to $E_{y}[\tau^{\alpha}]$ as $n\to\infty$ . Since the process $(q^{\alpha}(y,Y_{\tau\wedge t}))_{t\in[0,\infty)}$ is of class (D), it follows that the family $(q^{\alpha}(y,Y_{\tau_{n}\wedge\tau}))_{n\in\mathbb{N}}$ is uniformly integrable. Vitali’s convergence theorem implies that $E_{y}[q^{\alpha}(y,Y_{\tau_{n}\wedge\tau})]\to E_{y}[q^{\alpha}(y,Y_{\tau})]$ as $n\to\infty$ . Therefore we obtain

[TABLE]

which is precisely (32). ∎

Remark 4.4.

Under the assumption of Theorem 4.3 we have equality in (29) for $\alpha=1$ , i.e., $E_{y}[\tau]=E_{y}[q(y,Y_{\tau})]$ . Indeed, the inequality $\leq$ is provided by (29), while, for the reverse inequality $\geq$ , use that equality holds in (30) for $\alpha=1$ , localize (30), compute expectations of the both sides and apply Fatou’s lemma.

From Theorem 4.3 we obtain the following moment estimate for first exit times.

Corollary 4.5.

Let $\alpha\in[1,\infty)$ , let $y\in I$ and let $a\in[0,\infty)$ be such that $[y-a,y+a]\subseteq I$ . Then it holds that

[TABLE]

Proof.

Clearly, $H_{y-a,y+a}(Y)\leq H_{l,r}(Y)$ , $P_{y}$ -a.s. Moreover, under $P_{y}$ , the process $(q^{\alpha}(y,Y_{H_{y-a,y+a}(Y)\wedge t}))_{t\in[0,\infty)}$ is bounded (see (21) and (22)) and hence of class (D). Inequality (32) then follows from Theorem 4.3. ∎

5 Proof of Theorem 1.1

In this section we prove Theorem 1.1 and show that under Condition (A) the processes $(X^{h})_{h\in(0,\overline{h})}$ converge in distribution to $Y$ . We use the embedding stopping times $(\tau^{h}_{k})_{k\in\mathbb{N}_{0}}$ constructed in Section 3 and control the temporal errors $|\tau^{h}_{k}-kh|$ , $h\in(0,\overline{h})$ , $k\in\mathbb{N}_{0}$ . To this end, for every $h\in(0,\overline{h})$ we apply the Doob decomposition to the process $(\tau^{h}_{k}-kh)_{k\in\mathbb{N}_{0}}$ and write $\tau^{h}_{k}-kh=M^{h}_{k}+A^{h}_{k}$ , $k\in\mathbb{N}_{0}$ , for a martingale $M^{h}$ and a predictable process $A^{h}$ . Condition (A) guarantees that $A^{h}$ converges to [math] as $h\to 0$ (see Proposition 5.3 below). We show that also the martingale part $M^{h}$ can be nicely controlled (see Proposition 5.2 below).

For all $h\in(0,\overline{h})$ let $(\tau^{h}_{k})_{k\in\mathbb{N}}$ be the sequence of embedding stopping times defined in Section 3. Then we have the following result about the time lags $\rho^{h}_{k}=\tau^{h}_{k}-\tau^{h}_{k-1}$ , $h\in(0,\overline{h})$ , $k\in\mathbb{N}$ , between consecutive embedding stopping times.

Lemma 5.1.

Let $\alpha\in[1,\infty)$ , $h\in(0,\overline{h})$ and $y\in I^{\circ}$ . Then it holds that

[TABLE]

Proof.

By construction of the sequence $(\tau_{k}^{h})_{k\in\mathbb{N}_{0}}$ (in particular, recall (19)) it holds for all $k\in\mathbb{N}$ that

[TABLE]

This and the strong Markov property of $Y$ imply for all $k\in\mathbb{N}$ that

[TABLE]

Notice that

[TABLE]

because the function $x\mapsto x^{\alpha}$ , $x\in[0,\infty)$ , is convex, increasing and starts in zero. It follows from the triangle inequality, Corollary 4.5 and (35) that for all $z\in I$ it holds

[TABLE]

Combining this with (34) completes the proof. ∎

Below, for a random variable $\xi$ , it is convenient to use the notation

[TABLE]

for all $\alpha\in(0,\infty)$ , even though it is not a norm for $\alpha\in(0,1)$ . Notice that $\|\xi\|_{L^{\alpha}(P_{y})}\leq\|\xi\|_{L^{\beta}(P_{y})}$ for $0<\alpha<\beta$ by the Jensen inequality.

Proposition 5.2.

Let $\alpha\in(0,\infty)$ . Then there exists a constant $C(\alpha)\in(0,\infty)$ such that, for all $T\in(0,\infty)$ , $y\in I^{\circ}$ and $h\in(0,\overline{h})$ , it holds that

[TABLE]

Proof.

Without loss of generality we consider $\alpha\in[2,\infty)$ . Throughout the proof we fix $T\in(0,\infty)$ , $y\in I^{\circ}$ , $h\in(0,\overline{h})$ and let $N=\lfloor T/h\rfloor$ . For all $k\in\{0,\ldots,N\}$ we define $\mathcal{G}_{k}=\mathcal{F}_{\tau^{h}_{k}}$ and $M_{k}=\tau^{h}_{k}-\sum_{n=1}^{k}E[\rho^{h}_{n}|\mathcal{G}_{n-1}]$ . Notice that $(M_{k})_{k\in\{0,\ldots,N\}}$ is a $(\mathcal{G}_{k})_{k\in\{0,\ldots,N\}}$ -martingale. The Burkholder-Davis-Gundy inequality ensures that there exists a constant $\widetilde{C}(\alpha)\in(0,\infty)$ (only depending on $\alpha$ ) such that

[TABLE]

This, together with Jensen’s inequality, proves that

[TABLE]

Then Lemma 5.1 proves (36). ∎

Proposition 5.3.

Let $\alpha\in(0,\infty)$ . Then, for all $T\in(0,\infty)$ , $y\in I^{\circ}$ and $h\in(0,\overline{h})$ , it holds that

[TABLE]

Proof.

Without loss of generality we consider $\alpha\in[1,\infty)$ . Throughout the proof we fix $T\in(0,\infty)$ , $y\in I^{\circ}$ , $h\in(0,\overline{h})$ and let $N=\lfloor T/h\rfloor$ . For all $k\in\{0,\ldots,N\}$ we define $\mathcal{G}_{k}=\mathcal{F}_{\tau^{h}_{k}}$ . The triangle inequality ensures that

[TABLE]

By Proposition 3.1, on the event $\{Y_{\tau^{h}_{n-1}}\in I_{h}\}$ we have

[TABLE]

On the event $\{Y_{\tau^{h}_{n-1}}\notin I_{h}\}$ we have $|E_{y}[\rho^{h}_{n}|\mathcal{G}_{n-1}]-h|=0$ . Therefore,

[TABLE]

This proves (38). ∎

By combining the two preceding theorems we obtain a result about uniform in $k$ convergence of the embedding stopping times $(\tau^{h}_{k})$ in spaces $L^{\alpha}(P_{y})$ , as $h\to 0$ . To this end, we impose a slightly stronger condition than Condition (A), namely,

[TABLE]

Corollary 5.4.

Assume (42). Let $\alpha\in(0,\infty)$ , $T\in(0,\infty)$ and $y\in I^{\circ}$ . Then it holds

[TABLE]

Proof.

The proof is an application of Propositions 5.2 and 5.3. The fact that the right-hand side of (38) converges to zero as $h\to 0$ is a direct consequence of (42) (also recall Remark 1.2). Similarly, (42) implies

[TABLE]

The remaining property

[TABLE]

(cf. (36)), follows from the fact that, by the definition of $I_{h}$ , we have

[TABLE]

This concludes the proof. ∎

Proof of Theorem 1.1.

For any $h\in(0,\overline{h})$ , we define the continuous-time process $Y^{h}=(Y^{h}_{t})_{t\in[0,\infty)}$ by linear interpolation of $(Y_{\tau^{h}_{k}})_{k\in\mathbb{N}_{0}}$ . More precisely, we set

[TABLE]

Notice that $Y^{h}_{kh}=Y_{\tau^{h}_{k}}$ , $k\in\mathbb{N}_{0}$ , and Proposition 3.1 easily extends to

[TABLE]

Therefore, in order to prove Theorem 1.1, it is sufficient to show that the processes $Y^{h}=(Y^{h}_{t})_{t\in[0,\infty)}$ converge to the process $Y=(Y_{t})_{t\in[0,\infty)}$ in probability $P_{y}$ uniformly on compact intervals, i.e., that, for all $T\in(0,\infty)$ , it holds

[TABLE]

where $\|\cdot\|_{C[0,T]}$ denotes the sup norm in $C([0,T],\mathbb{R})$ . In what follows, we use the notation

[TABLE]

for this mode of convergence.

1. In the first step, we prove (43) under assumption (42), which is stronger than Condition (A). To this end, fix $T\in(0,\infty)$ and $\varepsilon>0$ . Take an arbitrary $T^{\prime}\in(0,\infty)$ , $T^{\prime}>T$ , and choose $\delta\in\left(0,\frac{T^{\prime}-T}{2}\right)$ such that $P_{y}(A(\delta))>1-\frac{\varepsilon}{2}$ , where

[TABLE]

Corollary 5.4 implies

[TABLE]

hence, there exists $\gamma\in(0,\overline{h})$ such that $P_{y}(C(h,\delta))>1-\frac{\varepsilon}{2}$ whenever $h\in(0,\gamma)$ , where

[TABLE]

A somewhat tedious check shows that, if $h\in(0,\delta)$ ,

[TABLE]

Thus, we get $P_{y}(\|Y^{h}-Y\|_{C[0,T]}>\varepsilon)<\varepsilon$ whenever $h\in(0,\gamma\wedge\delta)$ . This completes the proof of the first step.

2. We now prove (43) under Condition (A). Consider strictly monotone sequences $\{l_{n}\}_{n\in\mathbb{N}}$ and $\{r_{n}\}_{n\in\mathbb{N}}$ with $l_{n}\searrow l$ and $r_{n}\nearrow r$ . We define compact subintervals $K_{n}$ of $I^{\circ}$ by setting $K_{n}=[l_{n},r_{n}]$ , $n\in\mathbb{N}$ , and modified scale factors $\widetilde{a}^{n}_{h}\colon\overline{I}\to[0,\infty)$ by setting

[TABLE]

$n\in\mathbb{N}$ , $h\in(0,\overline{h})$ , where the scale factors $\widehat{a}_{h}$ , $h\in(0,\overline{h})$ , are the ones from the EMCEL algorithm (recall (11)). Let $(\widetilde{\tau}^{n,h}_{k})_{k\in\mathbb{N}_{0}}$ be the associated sequences of the embedding stopping times and $\widetilde{Y}^{n,h}=(\widetilde{Y}^{n,h}_{t})_{t\in[0,\infty)}$ the analogues of the process $Y^{h}=(Y^{h}_{t})_{t\in[0,\infty)}$ for the modified scale factors $\widetilde{a}^{n}_{h}$ , $n\in\mathbb{N}$ , $h\in(0,\overline{h})$ . Since the scale factors $(a_{h})_{h\in(0,\overline{h})}$ satisfy Condition (A), the modified scale factors $(\widetilde{a}^{n}_{h})_{h\in(0,\overline{h})}$ satisfy (42) for each $n\in\mathbb{N}$ . By the first step of the proof,

[TABLE]

for any fixed $n\in\mathbb{N}$ .

Fix $T\in(0,\infty)$ and $\varepsilon>0$ . For any $n\in\mathbb{N}$ , we define the events

[TABLE]

Notice that the expression $Y_{H_{l,r}(Y)}$ in the above formula for $A_{n}$ is well-defined and finite. Indeed, this is the position of $Y$ at an accessible boundary (because $H_{l,r}(Y)\leq T+2<\infty$ ), while an infinite boundary cannot be accessible (because $Y$ is in natural scale). As $H_{l_{n},r_{n}}(Y)\nearrow H_{l,r}(Y)$ $P_{y}$ -a.s., as $n\to\infty$ , and $Y$ is continuous, we can choose a sufficiently big $n_{0}\in\mathbb{N}$ such that $P_{y}(A_{n_{0}})<\frac{\varepsilon}{3}$ and $P_{y}(B_{n_{0}})<\frac{\varepsilon}{3}$ . We also take an arbitrary $T^{\prime}\in(T,T+1)$ . Corollary 5.4 applied to the modified scale factors $(\widetilde{a}^{n_{0}}_{h})_{h\in(0,\overline{h})}$ , which satisfy (42), yields that there exists $\gamma>0$ such that, for any $h\in(0,\gamma)$ , we have

[TABLE]

For $h\in(0,\overline{h})$ , we define the event

[TABLE]

(the notation $D^{c}$ means the complement of an event $D$ ). Notice that $P_{y}(C_{h})>1-\varepsilon$ whenever $h\in(0,\gamma)$ . Furthermore, on $C_{h}$ we have either

[TABLE]

or

[TABLE]

Together with (44), this proves $\|Y^{h}-Y\|_{C[0,T]}\xrightarrow[]{P_{y}}0$ as $h\to 0$ . As $T\in(0,\infty)$ is arbitrary, we obtain (43). This concludes the proof. ∎

6 Reflecting boundaries

Throughout the preceding sections we assume that if a boundary point is accessible, then it is absorbing. In this section we explain how one can drop this assumption, i.e. how one can extend our functional limit theorem, Theorem 1.1, to Markov processes with reflecting boundaries.

The idea is to reduce the reflecting case to the inaccessible or absorbing case. Indeed, for every Markov process $Z$ with reflecting boundaries one can find a Markov process $Y$ on an extended state space and a Lipschitz function $f$ such that $Y$ has inaccessible or absorbing boundaries and $Z\stackrel{{\scriptstyle d}}{{=}}f(Y)$ .

We illustrate the reduction for a Markov process $Z$ in natural scale with state space $I_{Z}=[l,\infty)$ , where $l>-\infty$ is a reflecting boundary. We denote by $m_{Z}$ the speed measure of $Z$ . Since $l$ is non-absorbing, it must hold that $m_{Z}(\{l\})<\infty$ . Notice that $m_{Z}(\{l\})=0$ corresponds to instantaneous reflection, while $m_{Z}(\{l\})\in(0,\infty)$ to slow reflection.

To proceed with the construction, we first remark that it holds

[TABLE]

Indeed, in terms of the Feller boundary classification (see Table 15.6.2 in [31]), as the accessible boundary point $l$ is reflecting, it can only be regular, which implies (45).

Now let $Y$ be a Markov process in natural scale with state space $I_{Y}=\mathbb{R}$ and speed measure $m_{Y}$ satisfying

[TABLE]

which is a valid speed measure on $I_{Y}=\mathbb{R}$ (i.e., (4) holds) due to (45). Then $l+|Y-l|$ has the same distribution as $Z$ (see Proposition VII.3.10 in [39]).

Let $(a_{h})_{h\in(0,\overline{h})}$ satisfy the assumptions of Theorem 1.1. Then $(X^{h})_{h\in(0,\overline{h})}$ converges in distribution to $Y$ as $h\to 0$ . This implies that the processes $(l+|X^{h}-l|)_{h\in(0,\overline{h})}$ converge in distribution to $Z$ as $h\to 0$ .

In a similar way, a Markov process $Z$ on a bounded interval $I_{Z}$ with endpoints $l$ and $r$ ( $l<r$ ), where $l\in I_{Z}$ is reflecting and $r$ is inaccessible (resp., absorbing), can be reduced to a Markov process $Y$ with state space $I_{Y}$ , which is the interval with endpoints $2l-r$ and $r$ , where both these endpoints are inaccessible (resp., absorbing).

A Markov process $Z$ with two reflecting boundaries can be reduced to a Markov process with state space $\mathbb{R}$ . To explain this, suppose for simplicity that the state space of $Z$ is $[0,1]$ . Define $Y$ as the Markov process on $\mathbb{R}$ with speed measure $m_{Y}$ satisfying

[TABLE]

Let $f\colon\mathbb{R}\to[0,1]$ be the periodic function with period $2$ satisfying $f(x)=|x|$ , $x\in[-1,1]$ . Then the process $f(Y)$ has the same distribution as $Z$ (cf. Proposition VII.3.10 in [39]).

7 Examples with sticky points

In this section we apply Theorem 1.1 to sticky Brownian motions on $\mathbb{R}$ and on $[0,\infty)$ , where the sticky point is zero. In the latter case one also speaks about slow (or sticky) reflection at [math]. Recent years have witnessed an increased interest in the sticky Brownian motion and related processes, see [29], [7], [14], [26], [12] and references therein. Newly, diffusions with slow reflection were applied in [13] to provide bounds (via sticky couplings) for the distance between two multidimensional diffusions with different drifts. Stickiness is a convenient concept for modeling repulsive interactions between particles, and, motivated by natural questions from physics, it is discussed in multi- and infinitedimensional situations in [18], [23], [24], [33], [34]. Diffusions with slow reflection also attracted interest in economic theory, where such processes characterize optimal continuation values in dynamic principal-agent problems (see, e.g., [46] and [38]).

On the contrary, the literature on approximations of diffusions with atoms in the speed measure is scarce. We remark that [1] provides a sequence of random walks that converges in distribution to the Brownian motion on $\mathbb{R}$ sticky at zero. The random walks considered there are forced to stay in zero for some time whenever they visit zero. In contrast to our approach, the approximating processes are not Markov chains. [19] constructs Markov chains that converge in distribution to the Brownian motion on $[0,\infty)$ with slow reflection at [math]. The approximating Markov chains considered there exhibit sticky behavior in zero in the sense that once the Markov chains reach zero they stay there with positive probability also in the next time period. The recent work [9] proposes an approximation of the Brownian motion on $[0,\infty)$ with slow reflection at [math] by continuous-time pure jump Markov processes $Y^{\delta}$ , $\delta\in(0,\infty)$ , with uniform grids $\{0,\delta,2\delta,\ldots\}$ as state spaces. The jump times are exponentially distributed and the mean waiting time at the interior points $\{\delta,2\delta,\ldots\}$ is of order $\delta^{2}$ whereas it is of order $\delta$ at the origin [math].

7.1 Brownian motion on $\mathbb{R}$ with sticky point [math]

Brownian motion on $\mathbb{R}$ sticky at [math] is a Markov process $Y$ in natural scale with state space $I=\mathbb{R}$ and speed measure

[TABLE]

where $\sigma,\theta\in(0,\infty)$ and $\lambda(dx)$ denotes the Lebesgue measure.333Since there exist different conventions concerning the normalization of the speed measure (cf. Footnote 1), our representation of $m$ in (46) may differ by a factor of $2$ from related representations found in the literature (cf., e.g., [7]). Such a process $Y$ behaves like $\sigma$ times a Brownian motion outside zero, but spends a positive amount of time at zero having no intervals of zeros. Notice that the bigger $\theta$ is, the less time $Y$ spends at zero; $\theta=\infty$ corresponds to a standard Brownian motion (times $\sigma$ ).

It is instructive to compute the function $q(y,x)$ , $y,x\in\mathbb{R}$ , of (20)

[TABLE]

( $x^{+}=\max\{x,0\}$ , $x^{-}=-\min\{x,0\}$ ) and to observe that, for any $y\in\mathbb{R}$ , the function $q(y,\cdot)$ has a kink at zero.

We now determine, for every $h\in(0,\infty)$ , a function $\widehat{a}_{h}\colon\mathbb{R}\to(0,\infty)$ such that the associated Markov chain $(\widehat{X}^{h}_{hk})_{k\in\mathbb{N}_{0}}$ , defined in (5), belongs to EMCEL $(h)$ . Indeed, one can explicitly determine for all $y\in\mathbb{R}$ the real number $\widehat{a}_{h}(y)$ satisfying

[TABLE]

We state the closed-form representations of $\widehat{a}_{h}$ in the next Lemma.

Lemma 7.1.

For all $h\in(0,\infty)$ and $y\in\mathbb{R}$ let

[TABLE]

Then Equation (47) is satisfied for all $h\in(0,\infty)$ and $y\in\mathbb{R}$ .

Proof.

Throughout the proof fix $h\in(0,\infty)$ and $y\in\mathbb{R}$ . For every $a\in[0,\infty)$ it holds that

[TABLE]

Assume first that $|y|\geq\sigma\sqrt{h}$ . Then it holds that $|y|\geq\widehat{a}_{h}(y)$ and hence (47) is satisfied. Next assume that $|y|<\sigma\sqrt{h}$ . In this case it holds that $\widehat{a}_{h}(y)>|y|$ . Moreover it holds that

[TABLE]

This proves (47) in the case $|y|<\sigma\sqrt{h}$ . The proof is thus completed. ∎

Since $\widehat{a}_{h}$ satisfies Equation (47) exactly, it follows that Condition (A) is satisfied. Theorem 1.1 implies that the processes $(X^{h})$ converge in distribution to $Y$ as $h\to 0$ . Figure 1 depicts two realizations of a Brownian motion on $\mathbb{R}$ sticky at [math] with $\sigma=1$ and different values for $\theta$ as well as the empirical distribution function of $\widehat{X}^{h}_{1}$ with $h=10^{-3}$ .

7.2 Brownian motion on $[0,\infty)$ with slow reflection at [math]

In this section we consider a Brownian motion on $[0,\infty)$ with slow reflection at [math]. We first define this process, as in Warren [44], as the solution of SDE (49) below. We subsequently show that its distribution is identical to the distribution of $|Y|$ , where $Y$ is the general diffusion analyzed in Section 7.1. From this perspective, the main difference between the processes studied in this section and those in Section 7.1 is the state space.

Let $\sigma,\theta\in(0,\infty)$ . According to Theorem IV.7.2 in [27] the stochastic differential equation

[TABLE]

possesses a weak solution that is unique in law. However, it is worth noting that neither existence of a strong solution nor pathwise uniqueness hold for (49) (see [14] and references therein). The next result shows that $Z$ is a regular diffusion on $[0,\infty)$ and identifies the associated speed measure.

Lemma 7.2.

The solution $Z$ of (49) is a regular continuous strong Markov process in natural scale with state space $I_{Z}=[0,\infty)$ and with speed measure

[TABLE]

Proof.

Strong Markov property of $Z$ is implied by the uniqueness in law for (49). Clearly, $Z$ is regular with state space $I_{Z}=[0,\infty)$ and in natural scale. By Itô’s formula, we have

[TABLE]

for $C^{2}$ functions $f\colon[0,\infty)\to\mathbb{R}$ . Therefore, the generator $\mathcal{A}$ of $Z$ takes the form

[TABLE]

for $f\in C^{2}_{0}([0,\infty))$ (this means that the function itself and its first and second derivative vanish at infinity) satisfying the boundary condition

[TABLE]

By Theorem VII.3.12 in [39], we have $\mathcal{A}f(z)=\frac{d}{dm_{Z}}f^{\prime}(z)$ in the interior of the state space, i.e., for $z>0$ , while, by Proposition VII.3.13 in [39], it holds $f^{\prime}(0)=m(\{0\})\mathcal{A}f(0)$ on the boundary. Together with (51), this implies (50) and concludes the proof. ∎

It follows from Section 6 that $Z\stackrel{{\scriptstyle d}}{{=}}|Y|$ , where $Y$ is a diffusion in natural scale with state space $I_{Y}=\mathbb{R}$ and speed measure $m_{Y}(dz)=\frac{2}{\sigma^{2}}\,\lambda(dz)+\frac{2}{\theta}\,\delta_{0}(dz)$ , i.e., $Y$ is the process studied in Section 7.1 (cf. (46)). In particular, $Z$ can be approximated by $(|\widehat{X}^{h}|)_{h\in(0,\infty)}$ , where each $\widehat{X}^{h}$ is the EMCEL $(h)$ constructed in Section 7.1.

Warren [44] determines for all $t\in(0,\infty)$ the conditional law of $Z_{t}$ given the driving Brownian motion $W$ . As a consequence, we obtain for all $t\in(0,\infty)$ closed form representations of the cumulative distribution function and the expected value of $Z_{t}$ . The precise formulas are provided in Lemma 7.3 below, where we, without loss of generality, consider $\sigma=1$ . The notations $P_{0}$ for the probability measure and $E_{0}$ for the corresponding expectation operator emphasize that the formulas are given for the case $Z_{0}=0$ . We use these formulas to analyze the empirical rate of convergence of EMCEL approximations. The results are presented in Figure 2.

Lemma 7.3.

Let $Z$ be a solution of (49) with $\sigma=1$ . For every $t\in(0,\infty)$ the cumulative distribution function $F(\cdot;t)\colon[0,\infty)\to[0,1]$ of $Z_{t}$ satisfies

[TABLE]

where $\Phi(x)=\int_{-\infty}^{x}\frac{1}{\sqrt{2\pi}}e^{-y^{2}/2}\,dy$ is the cumulative distribution function of the standard normal distribution. Moreover, it holds that

[TABLE]

Proof.

Fix $t\in(0,\infty)$ throughout the proof. Lévy’s distributional theorem implies $W_{t}+\sup_{s\in[0,t]}(-W_{s})\stackrel{{\scriptstyle d}}{{=}}|W_{t}|$ (see Theorem VI.2.3 in [39]). Then it follows from Theorem 1 in [44] that for all $z\in[0,\infty)$ it holds

[TABLE]

This proves (52). Moreover, this implies that the density function of $Z_{t}$ starting in [math] satisfies

[TABLE]

for all $z\in(0,\infty)$ . Observe that for all $z\in(0,\infty)$ it holds

[TABLE]

This implies for all $z\in(0,\infty)$ that

[TABLE]

This and Fubini’s theorem prove that

[TABLE]

This completes the proof. ∎

Remark 7.4.

Another possibility to get (52) and (53) is as follows. An explicit formula for the transition density of a sticky Brownian motion is provided in Part I, Appendix 1, Section 8 of [8]. This yields formula (56) for the density of $Z_{t}$ (notice that the factor $4$ , which is not present in [8], is due to the facts that the mentioned formula in [8] is given for a sticky Brownian motion on $\mathbb{R}$ and the transition density in [8] is given with respect to the speed measure, i.e., twice the Lebesgue measure outside zero). Now (53) follows by the same calculation as above, while the distribution function (52) can be recovered by integrating the density and taking into account the atom at zero.

8 Brownian motion slowed down on the Cantor set

In this section we apply our results to construct a family of Markov chains $(X^{h})_{h\in(0,1)}$ that converge in distribution to the general diffusion $Y$ on $\mathbb{R}$ with speed measure $m(dx)=m_{C}(dx)+2\,dx$ , where $m_{C}$ is the Cantor distribution. Such a process $Y$ can be understood as a Brownian motion slowed down on the Cantor set.

For later reference we briefly recall a way to construct the Cantor distribution. To this end let $\mathcal{C}$ be the collection of all subsets of $[0,1]$ and let $\Psi\colon\mathcal{C}\to\mathcal{C}$ be the map given by

[TABLE]

Next, we define recursively a sequence $(C_{n})_{n\in\mathbb{N}_{0}}$ of subsets of $[0,1]$ . Let $C_{0}=[0,1]$ and for $n\in\mathbb{N}$ let

[TABLE]

The Cantor set is defined as $C=\cap_{n\in\mathbb{N}}C_{n}$ .

We define for all $n\in\mathbb{N}$ the probability measure $m_{n}$ on $(\mathbb{R},\mathcal{B}(\mathbb{R}))$ by $m_{n}(dx)=\left(\frac{3}{2}\right)^{n}1_{C_{n}}(x)\,dx$ . Note that $m_{n}$ is absolutely continuous with respect to the Lebesgue measure $\mu_{L}$ . It follows from the proof of Theorem 3.1 in [41] that the sequence $(m_{n})_{n\in\mathbb{N}}$ converges in distribution to a probability measure $m_{C}$ on $(\mathbb{R},\mathcal{B}(\mathbb{R}))$ and that for all $n\in\mathbb{N}$ it holds

[TABLE]

Moreover, it holds that $m_{C}(C)=1$ (in particular, $m_{C}$ is concentrated on $[0,1]$ ), $\mu_{L}(C)=0$ and, for all $x\in\mathbb{R}$ , $m_{C}(\{x\})=0$ , i.e., $m_{C}$ is a singular-continuous measure.

Proposition 8.1.

Let $m$ be the measure on $\mathbb{R}$ given by $m(dx)=m_{C}(dx)+2\,dx$ and let $Y$ be the associated diffusion. Let $n\colon(0,1)\to\mathbb{N}$ be a function satisfying $\lim_{h\to 0}2^{n(h)}{\sqrt{h}}=\infty$ . Then there exists for all $h\in(0,1)$ , $y\in\mathbb{R}$ a unique solution $a_{h}(y)\in(0,\sqrt{h}]$ of the equation

[TABLE]

Let $(X^{h})_{h\in(0,1)}$ be the family of Markov chains defined in (5) and (6) (with scale factors $a_{h}$ , $h\in(0,1)$ , given by the solution of (61)). Then for all $y\in\mathbb{R}$ the distributions of $(X^{h,y}_{t})_{t\in[0,\infty)}$ , $h\in(0,1)$ , under $P$ converge weakly to the distribution of $(Y_{t})_{t\in[0,\infty)}$ under $P_{y}$ , as $h\to 0$ .

Proof.

First observe that for all $y\in\mathbb{R}$ the mapping

[TABLE]

is continuous and strictly increasing. This ensures existence of a unique solution $a_{h}(y)\in[0,\infty)$ of (61). It follows from

[TABLE]

that $a_{h}(y)\leq\sqrt{h}$ for all $h\in(0,1)$ , $y\in\mathbb{R}$ . Moreover, it follows that for all $h\in(0,1)$ and $y\in\mathbb{R}$ it holds

[TABLE]

Next, observe that formula (27), definition (20) of $q$ and the fact that $m_{C}$ and $m_{n(h)}$ do not possess atoms ensure that it holds for all $h\in(0,1)$ and $y\in\mathbb{R}$ that

[TABLE]

This, (60) and the fact that $a_{h}(y)\leq\sqrt{h}$ show that, for all $h\in(0,1)$ and $y\in\mathbb{R}$ , we have

[TABLE]

Combining (64) and (66) and using the assumption $\lim_{h\to 0}2^{n(h)}{\sqrt{h}}=\infty$ shows that

[TABLE]

Hence, Condition (A) is satisfied and weak convergence of $X^{h}$ to $Y$ follows from Theorem 1.1. ∎

Proposition 8.1 provides the way to simulate approximations of the Brownian trajectories slowed down on the Cantor set (see Figure 3).

Acknowledgement

We thank the anonymous referee for comments that helped improve the exposition. Thomas Kruse and Mikhail Urusov acknowledge the support from the German Research Foundation through the project 415705084.

Bibliography46

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] M. Amir. Sticky Brownian motion as the strong limit of a sequence of random walks. Stochastic Process. Appl. , 39(2):221–237, 1991.
2[2] S. Ankirchner, N. Kazi-Tani, M. Klein, and T. Kruse. Stopping with expectation constraints: 3 points suffice. Electron. J. Probab. , 24:Paper No. 66, 16 pp., 2019.
3[3] S. Ankirchner, M. Klein, T. Kruse, and M. Urusov. On a certain local martingale in a general diffusion setting. Preprint, hal-01700656 , 2018.
4[4] S. Ankirchner, T. Kruse, and M. Urusov. Numerical approximation of irregular SD Es via Skorokhod embeddings. J. Math. Anal. Appl. , 440(2):692–715, 2016.
5[5] S. Ankirchner, T. Kruse, and M. Urusov. A functional limit theorem for irregular SD Es. Ann. Inst. Henri Poincaré Probab. Stat. , 53(3):1438–1457, 2017.
6[6] S. Athreya, W. Löhr, and A. Winter. Invariance principle for variable speed random walks on trees. The Annals of Probability , 45(2):625–667, 2017.
7[7] R. F. Bass. A stochastic differential equation with a sticky point. Electron. J. Probab. , 19:no. 32, 22, 2014.
8[8] A. N. Borodin and P. Salminen. Handbook of Brownian motion—Facts and Formulae . Probability and its Applications. Birkhäuser Verlag, Basel, second edition, 2002.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

A functional limit theorem for

Abstract

Introduction

1 Approximating general diffusions with Markov chains

Theorem 1.1**.**

Remark 1.2**.**

Remark 1.3**.**

Remark 1.4**.**

Corollary 1.5**.**

Proof.

2 Application to SDEs

Example 2.1** (Brownian motion).**

Example 2.2** (Geometric Brownian motion).**

Convergence of the weak Euler scheme

Corollary 2.3**.**

Remark 2.4**.**

Remark 2.5**.**

3 Embedding the chains into the Markov process

Proposition 3.1**.**

4 Higher moment estimates for exit times

Lemma 4.1**.**

Proof.

Remark 4.2**.**

Theorem 4.3**.**

Proof.

Remark 4.4**.**

Corollary 4.5**.**

Proof.

5 Proof of Theorem 1.1

Lemma 5.1**.**

Proof.

Proposition 5.2**.**

Proof.

Proposition 5.3**.**

Proof.

Corollary 5.4**.**

Proof.

Proof of Theorem 1.1.

6 Reflecting boundaries

7 Examples with sticky points

7.1 Brownian motion on R\mathbb{R}R with sticky point [math]

Lemma 7.1**.**

Proof.

7.2 Brownian motion on [0,∞)[0,\infty)[0,∞) with slow reflection at [math]

Lemma 7.2**.**

Proof.

Lemma 7.3**.**

Proof.

Remark 7.4**.**

8 Brownian motion slowed down on the Cantor set

Proposition 8.1**.**

Proof.

Acknowledgement

Theorem 1.1.

Remark 1.2.

Remark 1.3.

Remark 1.4.

Corollary 1.5.

Example 2.1 (Brownian motion).

Example 2.2 (Geometric Brownian motion).

Corollary 2.3.

Remark 2.4.

Remark 2.5.

Proposition 3.1.

Lemma 4.1.

Remark 4.2.

Theorem 4.3.

Remark 4.4.

Corollary 4.5.

Lemma 5.1.

Proposition 5.2.

Proposition 5.3.

Corollary 5.4.

7.1 Brownian motion on $\mathbb{R}$ with sticky point [math]

Lemma 7.1.

7.2 Brownian motion on $[0,\infty)$ with slow reflection at [math]

Lemma 7.2.

Lemma 7.3.

Remark 7.4.

Proposition 8.1.