Parameter estimation for the stochastic heat equation with   multiplicative noise from local measurements

Josef Jan\'ak; Markus Rei{\ss}

arXiv:2303.00074·math.ST·February 22, 2024

Parameter estimation for the stochastic heat equation with multiplicative noise from local measurements

Josef Jan\'ak, Markus Rei{\ss}

PDF

Open Access

TL;DR

This paper develops and compares new estimators for the diffusivity parameter in the stochastic heat equation with multiplicative noise, demonstrating improved statistical properties and robustness through theoretical analysis and simulations.

Contribution

It introduces two novel estimators that account for quadratic variation, offering smaller variance and applicability at low noise levels, with proven asymptotic properties.

Findings

01

New estimators have smaller (conditional) variance.

02

Estimates remain consistent and asymptotically normal.

03

Simulation results confirm theoretical advantages.

Abstract

For the stochastic heat equation with multiplicative noise we consider the problem of estimating the diffusivity parameter in front of the Laplace operator. Based on local observations in space, we first study an estimator that was derived for additive noise. A stable central limit theorem shows that this estimator is consistent and asymptotically mixed normal. By taking into account the quadratic variation, we propose two new estimators. Their limiting distributions exhibit a smaller (conditional) variance and the last estimator also works for vanishing noise levels. The proofs are based on local approximation results to overcome the intricate nonlinearities and on a stable central limit theorem for stochastic integrals with respect to cylindrical Brownian motion. Simulation results illustrate the theoretical findings.

Figures4

Click any figure to enlarge with its caption.

Tables1

Table 1. Table 1: Monte Carlo mean and standard deviation of estimators for different σ ( ∙ ) 𝜎 ∙ \sigma({\scriptstyle\bullet}) .

Mean (SD)	ANE ${\hat{ϑ}}_{δ}$	MNE ${\tilde{ϑ}}_{δ}$	SMNE $ϑ_{δ}^{⋆}$
$σ_{1}$	0.050183 (0.0104)	0.050057 (0.0104)	0.050059 (0.0104)
$σ_{2}$	0.050632 (0.0111)	0.050163 (0.0104)	0.050163 (0.0104)
$σ_{3}$	0.052085 (0.0522)	0.050115 (0.0115)	0.050232 (0.0135)

Equations283

⎩ ⎨ ⎧ d X (t) X (0) X (t) ∣_{\partial Λ} = ϑ Δ X (t) d t + σ (X (t)) d W (t), 0 < t \leq T, = X_{0}, = 0, 0 < t \leq T .

⎩ ⎨ ⎧ d X (t) X (0) X (t) ∣_{\partial Λ} = ϑ Δ X (t) d t + σ (X (t)) d W (t), 0 < t \leq T, = X_{0}, = 0, 0 < t \leq T .

\overline{σ} := x \in R sup σ (x) < \infty.

\overline{σ} := x \in R sup σ (x) < \infty.

(B (u) v) (x) = σ (u (x)) v (x), x \in Λ, u, v \in L^{2} (Λ),

(B (u) v) (x) = σ (u (x)) v (x), x \in Λ, u, v \in L^{2} (Λ),

⟨ X (t), z ⟩ = ⟨ X_{0}, z ⟩ + ϑ \int_{0}^{t} ⟨ X (s), Δ z ⟩ d s + \int_{0}^{t} ⟨ z, σ (X (s)) d W (s) ⟩, P - a . s .

⟨ X (t), z ⟩ = ⟨ X_{0}, z ⟩ + ϑ \int_{0}^{t} ⟨ X (s), Δ z ⟩ d s + \int_{0}^{t} ⟨ z, σ (X (s)) d W (s) ⟩, P - a . s .

X (t) = S_{ϑ} (t) X_{0} + \int_{0}^{t} S_{ϑ} (t - s) σ (X (s)) d W (s), P - a . s .

X (t) = S_{ϑ} (t) X_{0} + \int_{0}^{t} S_{ϑ} (t - s) σ (X (s)) d W (s), P - a . s .

Λ_{δ, x_{0}}

Λ_{δ, x_{0}}

z_{δ, x_{0}} (x)

X_{δ, x_{0}} (t)

X_{δ, x_{0}} (t)

X_{δ, x_{0}}^{Δ} (t)

d X_{δ, x_{0}} (t)

d X_{δ, x_{0}} (t)

\hat{ϑ}_{δ} = \frac{\int _{0}^{T} X _{δ, x_{0}}^{Δ} ( t ) d X _{δ, x_{0}} ( t )}{\int _{0}^{T} ( X _{δ, x_{0}}^{Δ} ( t ) ) ^{2} d t} .

\hat{ϑ}_{δ} = \frac{\int _{0}^{T} X _{δ, x_{0}}^{Δ} ( t ) d X _{δ, x_{0}} ( t )}{\int _{0}^{T} ( X _{δ, x_{0}}^{Δ} ( t ) ) ^{2} d t} .

ϑ \int_{0}^{T} (X_{δ, x_{0}}^{Δ} (t))^{2} d t + \int_{0}^{T} X_{δ, x_{0}}^{Δ} (t) ⟨ σ (X (t)) K_{δ, x_{0}}, d W (t) ⟩

ϑ \int_{0}^{T} (X_{δ, x_{0}}^{Δ} (t))^{2} d t + \int_{0}^{T} X_{δ, x_{0}}^{Δ} (t) ⟨ σ (X (t)) K_{δ, x_{0}}, d W (t) ⟩

δ^{- 1} (\hat{ϑ_{δ}} - ϑ) = \frac{M _{δ}}{I _{δ}^{1/2}} ∙ \frac{( δ ^{2} I _{δ} ) ^{1/2}}{δ ^{2} J _{δ}},

δ^{- 1} (\hat{ϑ_{δ}} - ϑ) = \frac{M _{δ}}{I _{δ}^{1/2}} ∙ \frac{( δ ^{2} I _{δ} ) ^{1/2}}{δ ^{2} J _{δ}},

M_{δ}

M_{δ}

I_{δ}

J_{δ}

\int_{0}^{T} ⟨ Y_{δ} (t), d W (t) ⟩ s t ab l y \int_{0}^{T} s (t) d B (t)

\int_{0}^{T} ⟨ Y_{δ} (t), d W (t) ⟩ s t ab l y \int_{0}^{T} s (t) d B (t)

δ^{- 1} (\hat{ϑ}_{δ} - ϑ) s t ab l y \frac{( 2 ϑ ) ^{1/2} ∥ K ∥ _{L^{2} (R)}}{∥ K ^{'} ∥ _{L^{2} (R)}} ∙ \frac{( \int _{0}^{T} σ ^{4} ( X ( t , x _{0} )) d t ) ^{1/2}}{\int _{0}^{T} σ ^{2} ( X ( t , x _{0} )) d t} ∙ Z

δ^{- 1} (\hat{ϑ}_{δ} - ϑ) s t ab l y \frac{( 2 ϑ ) ^{1/2} ∥ K ∥ _{L^{2} (R)}}{∥ K ^{'} ∥ _{L^{2} (R)}} ∙ \frac{( \int _{0}^{T} σ ^{4} ( X ( t , x _{0} )) d t ) ^{1/2}}{\int _{0}^{T} σ ^{2} ( X ( t , x _{0} )) d t} ∙ Z

⟨ X_{δ, x_{0}} ⟩_{t} = \int_{0}^{t} ∥ σ (X (s)) K_{δ, x_{0}} ∥^{2} d s, t \in [0, T] .

⟨ X_{δ, x_{0}} ⟩_{t} = \int_{0}^{t} ∥ σ (X (s)) K_{δ, x_{0}} ∥^{2} d s, t \in [0, T] .

\tilde{ϑ}_{δ} = \frac{\int _{0}^{T} \frac{X _{δ, x_{0}}^{Δ} ( t )}{∥ σ ( X ( t )) K _{δ, x_{0}} ∥ ^{2}} d X _{δ, x_{0}} ( t )}{\int _{0}^{T} \frac{( X _{δ, x_{0}}^{Δ} ( t ) ) ^{2}}{∥ σ ( X ( t )) K _{δ, x_{0}} ∥ ^{2}} d t} .

\tilde{ϑ}_{δ} = \frac{\int _{0}^{T} \frac{X _{δ, x_{0}}^{Δ} ( t )}{∥ σ ( X ( t )) K _{δ, x_{0}} ∥ ^{2}} d X _{δ, x_{0}} ( t )}{\int _{0}^{T} \frac{( X _{δ, x_{0}}^{Δ} ( t ) ) ^{2}}{∥ σ ( X ( t )) K _{δ, x_{0}} ∥ ^{2}} d t} .

\tilde{ϑ}_{δ} - ϑ = \frac{\int _{0}^{T} \frac{X _{δ, x_{0}}^{Δ} ( t )}{∥ σ ( X ( t )) K _{δ, x_{0}} ∥ ^{2}} ⟨ σ ( X ( t )) K _{δ, x_{0}} , d W ( t )⟩}{\int _{0}^{T} \frac{( X _{δ, x_{0}}^{Δ} ( t ) ) ^{2}}{∥ σ ( X ( t )) K _{δ, x_{0}} ∥ ^{2}} d t} =: \frac{M ~ _{δ}}{I ~ _{δ}} .

\tilde{ϑ}_{δ} - ϑ = \frac{\int _{0}^{T} \frac{X _{δ, x_{0}}^{Δ} ( t )}{∥ σ ( X ( t )) K _{δ, x_{0}} ∥ ^{2}} ⟨ σ ( X ( t )) K _{δ, x_{0}} , d W ( t )⟩}{\int _{0}^{T} \frac{( X _{δ, x_{0}}^{Δ} ( t ) ) ^{2}}{∥ σ ( X ( t )) K _{δ, x_{0}} ∥ ^{2}} d t} =: \frac{M ~ _{δ}}{I ~ _{δ}} .

\delta^{-1}(\tilde{\vartheta}_{\delta}-\vartheta)\stackrel{{\scriptstyle d}}{{\rightarrow}}N\Big{(}0,\frac{2\vartheta\|K\|_{L^{2}(\mathbb{R})}^{2}}{T\|K^{\prime}\|_{L^{2}(\mathbb{R})}^{2}}\Big{)}.

\delta^{-1}(\tilde{\vartheta}_{\delta}-\vartheta)\stackrel{{\scriptstyle d}}{{\rightarrow}}N\Big{(}0,\frac{2\vartheta\|K\|_{L^{2}(\mathbb{R})}^{2}}{T\|K^{\prime}\|_{L^{2}(\mathbb{R})}^{2}}\Big{)}.

ε_{δ} \to 0, ε_{δ}^{- 1} δ^{η} \to 0

ε_{δ} \to 0, ε_{δ}^{- 1} δ^{η} \to 0

ϑ_{δ}^{⋆} = \frac{\int _{0}^{T} \frac{X _{δ, x_{0}}^{Δ} ( t )}{∥ σ ( X ( t )) K _{δ, x_{0}} ∥ ^{2} + ε _{δ}^{2}} d X _{δ, x_{0}} ( t )}{\int _{0}^{T} \frac{( X _{δ, x_{0}}^{Δ} ( t ) ) ^{2}}{∥ σ ( X ( t )) K _{δ, x_{0}} ∥ ^{2} + ε _{δ}^{2}} d t} .

ϑ_{δ}^{⋆} = \frac{\int _{0}^{T} \frac{X _{δ, x_{0}}^{Δ} ( t )}{∥ σ ( X ( t )) K _{δ, x_{0}} ∥ ^{2} + ε _{δ}^{2}} d X _{δ, x_{0}} ( t )}{\int _{0}^{T} \frac{( X _{δ, x_{0}}^{Δ} ( t ) ) ^{2}}{∥ σ ( X ( t )) K _{δ, x_{0}} ∥ ^{2} + ε _{δ}^{2}} d t} .

\forall x, y \in Λ : ∣ σ (x) - σ (y) ∣ \leq C ∣ x - y ∣^{β_{σ}} .

\forall x, y \in Λ : ∣ σ (x) - σ (y) ∣ \leq C ∣ x - y ∣^{β_{σ}} .

\forall t, s \in [0, T], x, y \in Λ : E (X (t, x) - X (s, y))^{2} \leq C (∣ t - s ∣^{2 β_{t}} + ∣ x - y ∣^{2 β_{x}}) .

\forall t, s \in [0, T], x, y \in Λ : E (X (t, x) - X (s, y))^{2} \leq C (∣ t - s ∣^{2 β_{t}} + ∣ x - y ∣^{2 β_{x}}) .

δ^{- 1} (ϑ_{δ}^{⋆} - ϑ) = \frac{M _{δ}^{⋆}}{( I _{δ}^{⋆} ) ^{1/2}} ∙ \frac{( δ ^{2} I _{δ}^{⋆} ) ^{1/2}}{δ ^{2} J _{δ}^{⋆}}

δ^{- 1} (ϑ_{δ}^{⋆} - ϑ) = \frac{M _{δ}^{⋆}}{( I _{δ}^{⋆} ) ^{1/2}} ∙ \frac{( δ ^{2} I _{δ}^{⋆} ) ^{1/2}}{δ ^{2} J _{δ}^{⋆}}

M_{δ}^{⋆}

M_{δ}^{⋆}

I_{δ}^{⋆}

J_{δ}^{⋆}

T^{⋆} = \int_{0}^{T} 1 (σ (X (t, x_{0})) \neq = 0) d t .

T^{⋆} = \int_{0}^{T} 1 (σ (X (t, x_{0})) \neq = 0) d t .

δ^{- 1} (ϑ_{δ}^{⋆} - ϑ) s t ab l y \frac{( 2 ϑ ) ^{1/2} ∥ K ∥ _{L^{2} (R)}}{( T ^{⋆} ) ^{1/2} ∥ K ^{'} ∥ _{L^{2} (R)}} ∙ Z,

δ^{- 1} (ϑ_{δ}^{⋆} - ϑ) s t ab l y \frac{( 2 ϑ ) ^{1/2} ∥ K ∥ _{L^{2} (R)}}{( T ^{⋆} ) ^{1/2} ∥ K ^{'} ∥ _{L^{2} (R)}} ∙ Z,

\frac{( \int _{0}^{T} σ ^{4} ( X ( t , x _{0} )) d t ) ^{1/2}}{\int _{0}^{T} σ ^{2} ( X ( t , x _{0} )) d t} \geq \frac{1}{T ^{⋆}} \geq \frac{1}{T}

\frac{( \int _{0}^{T} σ ^{4} ( X ( t , x _{0} )) d t ) ^{1/2}}{\int _{0}^{T} σ ^{2} ( X ( t , x _{0} )) d t} \geq \frac{1}{T ^{⋆}} \geq \frac{1}{T}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic processes and financial applications · Financial Risk and Volatility Modeling · Statistical Methods and Inference

Full text

Parameter estimation for the stochastic heat equation with multiplicative noise from local measurements

Josef Janák Markus Reiß Karlsruher Institut für Technologie, Institut für Stochastik, Kaiserstraße 12, 76131 Karlsruhe, Germany. Email: [email protected]ät zu Berlin, Institut für Mathematik, Unter den Linden 6, 10099 Berlin, Germany. Email: [email protected].

Abstract

For the stochastic heat equation with multiplicative noise we consider the problem of estimating the diffusivity parameter in front of the Laplace operator. Based on local observations in space, we first study an estimator, derived in [3] for additive noise. A stable central limit theorem shows that this estimator is consistent and asymptotically mixed normal. By taking into account the quadratic variation, we propose two new estimators. Their limiting distributions exhibit a smaller (conditional) variance and the last estimator also works for vanishing noise levels. The proofs are based on local approximation results to overcome the intricate nonlinearities and on a stable central limit theorem for stochastic integrals with respect to cylindrical Brownian motion. Simulation results illustrate the theoretical findings.

Keywords: Local measurements, stochastic partial differential equation, multiplicative noise, drift estimation, augmented MLE, martingale representation theorem, stable limit theorem.

2020 MSC: Primary 60H15, 60F05; secondary 62G05, 35J15.

1 Introduction

We consider estimation of the diffusivity parameter $\vartheta>0$ in the stochastic heat equation with multiplicative noise

[TABLE]

Here, $W$ is a cylindrical Brownian motion with values in $L^{2}(\Lambda)$ , $dW_{t}/dt$ is also referred to as space-time white noise. The function $\sigma:\operatorname{{\mathbb{R}}}\to\operatorname{{\mathbb{R}}}_{+}$ generates a multiplicative noise, see Section 2 below for precise assumptions. Multiplicative noise appears naturally in stochastic partial differential equations (SPDEs) as a scaling limit or to ensure positivity of the solution, see e.g. the examples given in [16], [17] or [4].

Diffusivity estimation has emerged as a benchmark inference problem for SPDEs. The spectral estimation approach, initiated by [10], has been shown to give reliable estimation results even for more general semi-linear equations like the stochastic Navier-Stokes equation [7], yet always assuming additive noise. In [8] a specific case of multiplicative noise has been treated which leads to geometric Brownian motions in the spectral decomposition of the Laplacian. In [5] also Bayesian estimators have been developed and analysed in this setting.

Similarly, discrete observations of the solution $X$ in time and space give rise to realised $p$ -variation estimators for quite general classes of SPDEs. Most notably, in [17] a precise convergence analysis of $p$ -variation of $X(t,x)$ in space $x$ with $p=2$ and in time $t$ with $p=4$ is given, which leads to a consistent diffusivity estimator in the multiplicative noise case, while convergence rates or asymptotic normality are not considered. Estimation of the multiplicative noise function $\sigma({\scriptstyle\bullet})$ from discrete observations is treated in [6] with intriguing phenomena arising in central limit theorems for $p$ -variations.

Recently, methods for local observations in space have provided a new methodology for linear and semi-linear SPDEs with additive noise [3, 2]. This has enabled the estimation of diffusivity in a stochastic cell motility model from experimental data [1]. The underlying SPDE with additive noise describes chemical concentrations, for which, however, a multiplicative noise structure might be more natural as well as more in line with the empirical data than additive noise.

Starting point of our work is the question whether the additive noise estimator (ANE) derived in [3] for local observations of a stochastic heat equation with additive noise is robust against a multiplicative noise misspecification the same way, as it is against nonlinear reaction terms [2]. Technically, we cannot use a splitting technique to separate the nonlinear from the linear part and we must derive new tools to analyse the estimation error. This is achieved by a stepwise disentanglement and localisation of the statistics, carried out in Proposition 5.7 below. The result is that the estimator has the same rate as for additive noise, but it is asymptotically mixed normal under stable convergence with a suboptimal conditional variance for varying $\sigma({\scriptstyle\bullet})$ .

Therefore we improve the ANE by taking into account the varying quadratic variation of the martingale term in the ANE. The multiplicative noise estimator (MNE) obtained this way satisfies a central limit theorem with smaller variance provided the multiplicative noise $\sigma({\scriptstyle\bullet})$ is bounded away from zero. Since in many cases it is natural that $\sigma({\scriptstyle\bullet})$ vanishes at some boundary values, we improve the MNE to the stabilised multiplicative noise estimator (SMNE), satisfying a stable central limit theorem with small conditional variance even when $\sigma({\scriptstyle\bullet})$ vanishes sometimes.

The exact setting is introduced in Section 2. The construction of the estimators, the main asymptotic results and an application to confidence intervals are presented in Section 3. In Section 4 we discuss the implementation of the estimators and their behaviour for three fundamentally different noise specifications. The detailed proofs are delegated to Section 5. The stable convergence results require a martingale representation theorem in terms of cylindrical Brownian motion and rely on asymptotic orthogonality of martingales by spatial localisation, which might be of independent interest. This material is therefore gathered in Section 6.

2 The model

2.1 Notation

We write $\mathbb{R}_{+}:=[0,\infty)$ , $a\wedge b:=\min(a,b)$ and $a\vee b:=\max(a,b)$ . By $A_{\delta}\lesssim B_{\delta}$ we mean that there exists some constant $C>0$ such that $A_{\delta}\leq CB_{\delta}$ for all values $\delta$ under consideration. Here, we work with $\delta\in(0,1)$ or with the convergence $\delta\to 0$ . Convergence in probability and convergence in distribution are denoted by $\stackrel{{\scriptstyle\mathbb{P}}}{{\rightarrow}}$ and $\stackrel{{\scriptstyle d}}{{\rightarrow}}$ , respectively. The symbol $\xrightarrow{stably}$ denotes stable convergence, see e.g. [12], Chapter VIII., Section 5. We say that $X_{\delta}\xrightarrow{stably}X$ holds on an event $G$ if $X_{\delta}{\bf 1}_{G}\xrightarrow{stably}X{\bf 1}_{G}$ . $A_{\delta}=O_{\mathbb{P}}(B_{\delta})$ for random variables $A_{\delta},B_{\delta}$ means that $A_{\delta}/B_{\delta}$ is tight, that is, $\sup_{\delta}\mathbb{P}(|A_{\delta}|>C|B_{\delta}|)\rightarrow 0$ as $C\rightarrow\infty$ . The notation $A_{\delta}=o_{\mathbb{P}}(B_{\delta})$ stands for $A_{\delta}/B_{\delta}\stackrel{{\scriptstyle\mathbb{P}}}{{\rightarrow}}0$ as $\delta\to 0$ .

Let $\Lambda$ be an open bounded interval in $\mathbb{R}$ and consider the space $L^{2}(\Lambda)$ equipped with the usual $L^{2}$ -norm $\|{\scriptstyle\bullet}\|:=\|{\scriptstyle\bullet}\|_{L^{2}(\Lambda)}$ and the scalar product $\left\langle{\scriptstyle\bullet},{\scriptstyle\bullet}\right\rangle:=\left\langle{\scriptstyle\bullet},{\scriptstyle\bullet}\right\rangle_{L^{2}(\Lambda)}$ . $H^{s}(\Lambda)$ denotes the $L^{2}$ -Sobolev space of order $s$ on $\Lambda$ and $H_{0}^{1}(\Lambda)$ the space of all $f\in H^{1}(\Lambda)$ with $f(x)=0$ for $x\in\partial\Lambda$ . The Laplace operator is given by $\Delta z=z^{\prime\prime}$ for $z\in H^{2}(\operatorname{{\mathbb{R}}})$ , even though we consider only the one-dimensional case.

2.2 The stochastic heat equation

Let $(\Omega,\mathscr{F},(\mathscr{F}_{t})_{0\leq t\leq T},\mathbb{P})$ be a stochastic basis equipped with the cylindrical Brownian motion $W$ taking values in $L^{2}(\Lambda)$ . The filtration $(\mathscr{F}_{t})_{0\leq t\leq T}$ is assumed to be generated by the cylindrical Brownian motion and augmented by $\mathbb{P}$ –null sets. We study the stochastic heat equation (1.1) with multiplicative noise. The initial value $X_{0}\in L^{2}(\Lambda)$ is supposed to be deterministic and continuous on $\bar{\Lambda}$ . We require throughout the following two assumptions.

Assumption (S).

The function $\sigma:\mathbb{R}\rightarrow\mathbb{R}_{+}$ is continuous and

[TABLE]

The stochastic term $\sigma(X(t))\,dW(t)$ is therefore understood as $B(X(t))dW(t)$ with the multiplicative Nemytskii operators $B(u):L^{2}(\Lambda)\rightarrow L^{2}(\Lambda)$ , defined by

[TABLE]

noting that $\sigma(u({\scriptstyle\bullet}))\in L^{\infty}(\Lambda)$ holds for bounded $\sigma$ .

Assumption (X).

The stochastic partial differential equation (1.1) admits a weak solution $(X(t),0\leq t\leq T)$ taking values in $L^{2}(\Lambda)$ , which is continuous in both (time and space) variables, i.e., $X\in C([0,T];C(\Lambda;\mathbb{R}))$ $\mathbb{P}$ –a.s., and satisfies for any $z\in H_{0}^{1}(\Lambda)\cap H^{2}(\Lambda)$ and $t\in[0,T]$

[TABLE]

Sufficient conditions for Assumption (X) will be discussed in Example 3.7 below. Since a weak solution is also a mild solution (see e.g. Theorem 6.5 in [9]), there is an additional representation of the solution to equation (1.1). Let $(S_{\vartheta}(t),t\geq 0)$ be the strongly continuous semigroup on $L^{2}(\Lambda)$ generated by $\vartheta\Delta$ . The solution $(X(t),0\leq t\leq T)$ satisfies the variation-of-constants formula

[TABLE]

2.3 The observation scheme

As motivated in [3, 1], we observe the solution process $(X(t,x),\,t\in[0,T],x\in\Lambda)$ only locally in space around some point $x_{0}\in\Lambda$ , which remains fixed like the observation time $T\in(0,\infty)$ . More precisely, the observations are given by a spatial convolution of the solution process with a kernel $K_{\delta,x_{0}}$ , localising at $x_{0}$ as the resolution $\delta$ tends to zero. This kernel might for instance model the point spread function in microscopy.

For $z\in L^{2}(\mathbb{R})$ and $\delta\in(0,1)$ introduce the scalings

[TABLE]

Let $K\in H^{2}(\mathbb{R})$ be a function of compact support in $\Lambda_{1,x_{0}}$ . The compact support ensures that $K_{\delta,x_{0}}$ is localising around $x_{0}$ as $\delta\to 0$ and that $K_{\delta,x_{0}}\in H_{0}^{1}(\Lambda)\cap H^{2}(\Lambda)$ . The scaling with $\delta^{-1/2}$ simplifies calculations due to $\|K_{\delta,x_{0}}\|=\|K\|_{L^{2}(\mathbb{R})}$ , while the basic estimators are invariant with respect to kernel scaling.

Local measurements of (1.1) at the point $x_{0}$ with resolution level $\delta\in(0,1)$ are described by the real-valued processes $(X_{\delta,x_{0}}(t),0\leq t\leq T)$ and $(X_{\delta,x_{0}}^{\Delta}(t),0\leq t\leq T)$ given by

[TABLE]

The process $(X_{\delta,x_{0}}(t),0\leq t\leq T)$ satisfies $X_{\delta,x_{0}}(0)=\left\langle X_{0},K_{\delta,x_{0}}\right\rangle$ and by partial integration

[TABLE]

3 Estimation methods and main results

3.1 The additive noise estimator

We study first the augmented maximum likelihood estimator $\hat{\vartheta}_{\delta}$ from [3], derived for the stochastic heat equation with additive space-time white noise.

Definition 3.1.

The additive noise estimator (ANE) $\hat{\vartheta}_{\delta}$ of the parameter $\vartheta>0$ is defined as

[TABLE]

According to (2.5), the numerator $\int_{0}^{T}X_{\delta,x_{0}}^{\Delta}(t)\,dX_{\delta,x_{0}}(t)$ equals

[TABLE]

and the fundamental error decomposition is given by

[TABLE]

where

[TABLE]

The term $\mathcal{I}_{\delta}$ is incorporated because it gives the quadratic variation of the martingale $\mathcal{M}_{\delta}$ in time. It turns out that $\delta^{2}{\mathcal{I}_{\delta}}$ converges in probability to the limit $(2\vartheta)^{-1}\|K^{\prime}\|_{L^{2}(\mathbb{R})}^{2}\|K\|_{L^{2}(\mathbb{R})}^{2}\int_{0}^{T}\sigma^{4}(X(t,x_{0}))\,dt$ as $\delta\to 0$ , while $\delta^{2}\mathcal{J}_{\delta}$ converges to $(2\vartheta)^{-1}\|K^{\prime}\|_{L^{2}(\mathbb{R})}^{2}\int_{0}^{T}\sigma^{2}(X(t,x_{0}))\,dt$ , see Proposition 5.10 below. Since the quadratic variation $\mathcal{I}_{\delta}$ does not become asymptotically deterministic, we cannot rely on a standard martingale central limit theorem to prove asymptotic normality of $\mathcal{M}_{\delta}/\mathcal{I}_{\delta}^{1/2}$ . Therefore we employ the concept of stable convergence, which allows to formulate mixed normal limits and to derive data-driven confidence intervals, see e.g. [12] for a general introduction. In Section 6 we prove a general martingale representation theorem and a stable limit theorem for martingales with respect to cylindrical Brownian motion filtrations. As a consequence, we obtain the following result, when specialising Corollary 6.3 to our setting involving the kernels $K_{\delta,x_{0}}$ :

Proposition 3.2.

Let $(Y_{\delta}(t),0\leq t\leq T)$ for $\delta\in(0,1)$ be $L^{2}(\Lambda)$ -valued processes, progressively measurable with respect to the cylindrical Brownian filtration $({\mathscr{F}}_{t})_{0\leq t\leq T}$ and satisfying $\int_{0}^{T}\|Y_{\delta}(t)\|^{2}\,dt<\infty$ . If

(C1)

$\int_{0}^{T}\|Y_{\delta}(t)\|^{2}\,dt\stackrel{{\scriptstyle\mathbb{P}}}{{\rightarrow}}\int_{0}^{T}s^{2}(t)\,dt$ * as $\delta\rightarrow 0$ for some progressively measurable real-valued process $(s(t),0\leq t\leq T)$ with $\int_{0}^{T}s^{2}(t)\,dt<\infty$ ,*

(C2’)

the support inclusion $\operatorname{supp}(Y_{\delta}(t))\subseteq\operatorname{supp}(K_{\delta,x_{0}})$ holds Lebesgue-almost everywhere in $\Lambda$ for all $t\in[0,T]$ ,

then a stable limit theorem for the stochastic integrals holds as $\delta\to 0$ :

[TABLE]

with an independent scalar Brownian motion $(B(t),0\leq t\leq T)$ (on an extension of the original filtered probability space).

The main point of this result is that the limiting Brownian motion $B$ becomes independent because the support of $Y_{\delta}(t)$ shrinks asymptotically to the point $x_{0}$ . Here we shall apply the proposition with $Y_{\delta}(t)=\delta X_{\delta,x_{0}}^{\Delta}(t)\sigma(X(t))K_{\delta,x_{0}}$ . Our first main result is that the additive noise estimator $\hat{\vartheta}_{\delta}$ satisfies a stable central limit theorem with rate $\delta$ .

Theorem 3.3.

Grant Assumptions (S) and (X). Then the ANE $\hat{\vartheta}_{\delta}$ satisfies on the event $\{\int_{0}^{T}\sigma^{2}(X(t,x_{0}))\,dt>0\}$

[TABLE]

as $\delta\rightarrow 0$ , where $Z\sim N(0,1)$ is independent of $\mathscr{F}_{T}$ .

Proof.

The detailed proof is deferred to Section 5.4. ∎

This result establishes a very desirable robustness property of the ANE $\hat{\vartheta}_{\delta}$ : Even though it was designed for estimation in the stochastic heat equation with additive noise, the ANE still converges with the same rate $\delta$ to the true parameter under multiplicative noise.

3.2 The multiplicative noise estimator

We aim at improving the ANE by adjusting the estimator in such a way that the denominator contains already the quadratic variation of the martingale part in the numerator. To that end, we need to incorporate the term $\|\sigma(X(t))K_{\delta,x_{0}}\|^{2}$ that is not observed directly, but is still attainable from the data. The quadratic variation of the observed semi-martingale $(X_{\delta,x_{0}}(t),0\leq t\leq T)$ equals

[TABLE]

So we have access to $\|\sigma(X(t))K_{\delta,x_{0}}\|^{2}$ by differentiation of the realized quadratic variation. For discrete time data, sampled at high-frequency, standard spot volatility estimators can be used to access this term, see Section 4 below. This way we obtain a second estimator, taking into account the multiplicative noise in the stochastic heat equation.

Definition 3.4.

The multiplicative noise estimator (MNE) $\tilde{\vartheta}_{\delta}$ of the parameter $\vartheta>0$ is defined as

[TABLE]

Let us remark that the MNE $\tilde{\vartheta}_{\delta}$ can also be derived like the ANE $\hat{\vartheta}_{\delta}$ in [3], maximising a corresponding pseudo-likelihood in the multiplicative noise case. Since this is done under the correct model specification, we expect better estimation properties.

Using the representation of $dX_{\delta,x_{0}}(t)$ from (2.5), we obtain

[TABLE]

Since $\|\sigma(X(t))K_{\delta,x_{0}}\|^{2}$ appears in the denominators, we require a lower bound on $\sigma({\scriptstyle\bullet})$ in the following theorem.

Theorem 3.5.

Grant Assumptions (S), (X) and assume $\underline{\sigma}=\inf_{x\in\mathbb{R}}\sigma(x)>0$ . Then as $\delta\rightarrow 0$

[TABLE]

Proof.

The proof is deferred to Section 5.4. ∎

3.3 The stabilised multiplicative noise estimator

The lower bound $\underline{\sigma}>0$ on $\sigma({\scriptstyle\bullet})$ required for the MNE $\tilde{\vartheta}_{\delta}$ can be restrictive. For instance, when the random field $X(t,x)$ shall not take negative values, models usually require that $\lim_{x\downarrow 0}\sigma(x)=0$ . To cover this case as well, we stabilise the denominators by adding a number $\varepsilon_{\delta}^{2}$ which tends to zero slowly as $\delta\rightarrow 0$ .

Definition 3.6.

Let $\varepsilon_{\delta}=\varepsilon(\delta)$ be a real function satisfying for any $\eta>0$

[TABLE]

as $\delta\to 0$ . Then the stabilised multiplicative noise estimator (SMNE) $\vartheta_{\delta}^{\star}$ of the parameter $\vartheta>0$ is defined as

[TABLE]

Condition (3.8) says that $\varepsilon_{\delta}$ tends to zero more slowly than any polynomial. It is satisfied for $\varepsilon_{\delta}=\frac{1}{\log(\delta^{-1})}$ . To analyse the asymptotic properties of the SMNE $\vartheta_{\delta}^{\star}$ , we need to strengthen Assumptions (S) and (X) slightly.

Assumption (S’).

Assumption (S) is satisfied and $\sigma$ is $\beta_{\sigma}$ –Hölder continuous, i.e., for some $\beta_{\sigma}\in(0,1]$ there exists a constant $C>0$ such that

[TABLE]

Assumption (X’).

Assumption (X) is satisfied and moreover the solution $X$ is in quadratic mean $\beta_{x}$ –Hölder continuous in the space variable and $\beta_{t}$ –Hölder continuous in the time variable, i.e., for some $\beta_{x},\beta_{t}\in(0,1]$ there is a constant $C>0$ with

[TABLE]

Example 3.7.

If $\sigma({\scriptstyle\bullet})$ is Lipschitz continuous, then standard contraction arguments for the stochastic convolution and the regularity of the Green function for the heat equation yield Assumption (X’) with $\beta_{x}=1/2$ and $\beta_{t}=1/4$ , provided the initial condition $X_{0}$ is $1/2$ –Hölder continuous. Assumption (X) holds already for a continuous initial condition and Lipschitz continuous $\sigma({\scriptstyle\bullet})$ . In fact, standard proofs for pathwise Hölder regularity go via the Kolmogorov-Chentsov theorem and thus establish (3.10), compare Theorem 2.1 in [17] or Corollary 3.4 in [19] for a slightly more involved case on an unbounded domain.

The intriguing questions of weak existence, regularity and pathwise uniqueness for the stochastic heat equation with $\beta_{\sigma}$ -Hölder continuous multiplicative noise have so far only found partial answers. We refer to Theorem 1.3 in [16], which yields our Assumption (X) in case $\beta_{\sigma}>3/4$ in case of an unbounded domain. For their continuity result the authors assert that the results in [18], formulated for coloured noise in space, work analogously for the space-time white noise case. Equations (10) and (19) in [18] then establish Hölder regularity of $X$ in the sense of Assumption (X’).

We turn to the analysis of the stabilised multiplicative noise estimator. The error decomposition for the SMNE $\vartheta_{\delta}^{\star}$ follows from (3.9) and (2.5):

[TABLE]

with

[TABLE]

The term $\mathcal{I}_{\delta}^{\star}$ is the quadratic variation of the martingale part $\mathcal{M}_{\delta}^{\star}$ . The limits of $\mathcal{I}_{\delta}^{\star}$ and $\mathcal{J}_{\delta}^{\star}$ for $\delta\to 0$ involve an (in general random) time length $T^{\star}$ during which $\sigma(X(t,x_{0}))$ does not vanish, see Proposition 5.10 below. So, we use again the stable limit theorem of Proposition 3.2 and derive a central limit theorem for the SMNE $\vartheta_{\delta}^{\star}$ with rate $\delta$ , without assuming a lower bound on $\sigma({\scriptstyle\bullet})$ .

Theorem 3.8.

Grant Assumptions (S’) and (X’) with (3.8). Introduce

[TABLE]

Then as $\delta\rightarrow 0$ on the event $\{T^{\star}>0\}$

[TABLE]

where $Z\sim N(0,1)$ is independent of $\mathscr{F}_{T}$ .

Proof.

The proof is deferred to Section 5.4. ∎

Remark 3.9.

From the series of inequalities

[TABLE]

we infer that the (conditional) asymptotic variance of the SMNE lies between those of the ANE and the MNE. Remember, however, that the asymptotics for the MNE were derived under the condition $\underline{\sigma}>0$ , implying $T^{\star}=T$ . The extreme case $\sigma({\scriptstyle\bullet})\equiv 0$ leads to the deterministic heat equation, which for the initial condition $X_{0}=0$ remains zero all the time and does not allow for inference on $\vartheta$ . This type of degeneracy is excluded for the SMNE by the condition $T^{\star}>0$ .

3.4 Data-driven confidence intervals

The asymptotic (mixed) normality of the three estimators allows us to prescribe asymptotic confidence intervals for the parameter $\vartheta$ . The asymptotic conditional variances depend on quantities unknown to the statistician. Yet, in all three error decompositions (3.2), (3.6) and (3.11) it is shown in the proofs that the martingale term divided by the square root of its quadratic variation is asymptotically standard Gaussian. Dividing each error decomposition by the respective second factor on the right-hand side directly gives an asymptotic confidence statement.

Corollary 3.10.

Let $\alpha\in(0,1)$ . Based on the three estimators $\hat{\vartheta}_{\delta}$ , $\tilde{\vartheta}_{\delta}$ and $\vartheta_{\delta}^{\star}$ the confidence intervals for $\vartheta$

[TABLE]

with the standard Gaussian $(1-\alpha/2)$ -quantile $q_{1-\alpha/2}$ have each asymptotic coverage $1-\alpha$ as $\delta\to 0$ under the assumptions of Theorems 3.3, 3.5 and 3.8, respectively.

Note that the confidence intervals only rely on the observation processes $(X_{\delta,x_{0}}^{\Delta}(t),0\leq t\leq T)$ , $(X_{\delta,x_{0}}(t),0\leq t\leq T)$ and the quadratic variation of the latter. Even the kernel $K$ and the resolution level $\delta$ need not be known. In the next section we shall see how the estimation methods can be implemented when only data is available that is discretely sampled in time.

4 Implementation and simulation results

We illustrate the main results in a setting similar to the experimental setup in [1], where the diffusivity parameter $\vartheta$ was estimated in a concrete stochastic model for cell repolarisation.

Consider the stochastic heat equation (1.1) with $\Lambda=(0,L)$ for $L=20$ , $T=30$ , $\vartheta=0.05$ . The initial condition $X_{0}$ is a smooth approximation of the function $f(x)=4\times\mathbf{1}_{[L/4,3L/4]}(x)+2\times\mathbf{1}_{(0,L/4)\cup(3L/4,L)}(x)$ . We present the results for three different functions $\sigma$ :

[TABLE]

We have chosen $\sigma_{2}({\scriptstyle\bullet})$ to have Hölder regularity 0.8 in line with Example 3.7 and not to vanish completely at zero so that all three estimators are applicable. $\sigma_{3}({\scriptstyle\bullet})$ generates strong noise level fluctuations so that the quality of the estimators should differ significantly.

An approximate solution is computed on a regular time-space grid $\{(t_{j},y_{k}):t_{j}=Tj/N,y_{k}=Lk/M,j=0,\ldots,N,k=0,\ldots,M\}$ with $N=48\,000$ and $M=800$ by the Euler-Maruyama scheme. For the drift part, we use the finite difference approximation of $\Delta$ that is applied implicitly, while $\sigma({\scriptstyle\bullet})$ in the stochastic term is applied to the current state of the solution explicitly, compare Algorithm 10.8 in [15]. The mesh sizes fulfill $T/N\asymp(L/M)^{2}$ , ensuring the Courant-Friedrichs-Lewy (CFL) condition for stable simulations [15]. Heat maps for typical realisations with multiplicative noise $\sigma_{2}(X(t))$ and $\sigma_{3}(X(t))$ are displayed in Figure 1. Under $\sigma_{2}({\scriptstyle\bullet})$ we see that fluctuations are larger for higher temperature levels, while at the boundary it cools down to zero almost deterministically. Under $\sigma_{3}({\scriptstyle\bullet})$ excitations by strong noise at the interface values 2 and 4 are counteracted by the diffusion, which leads to almost noiseless inner regions with strong fluctuations of the interfaces in time. The spatial gradient at the interfaces is very large, which is no numerical artefact, but due to expulsion by noise.

As in [1] we employ the smooth compactly supported kernel

[TABLE]

and we localise around the central point $x_{0}=L/2$ with $\delta=0.03\times L$ . Based on these local measurements, the estimators $\hat{\vartheta}_{\delta}$ (ANE), $\tilde{\vartheta}_{\delta}$ (MNE) and $\vartheta_{\delta}^{\star}$ (SMNE) are computed.

The term $Z(t):=\|\sigma(X(t))K_{\delta,x_{0}}\|^{2}$ is accessed by the following procedure. In view of (3.4), $Z(t)$ presents the spot volatility of $X_{\delta,x_{0}}$ at time $t$ , which we estimate by

[TABLE]

i.e., by taking the average disintegrated realised quadratic variation over the past $D=800$ values ( $=0.5$ time units). The averaging acts as a smoothing, which is a standard approach for spot volatility estimation [11].

Finally, we choose the stabilising value $\varepsilon_{\delta}^{2}=\frac{0.001}{\log(10/\delta)}$ such that it satisfies condition (3.8) and lies within the range of typical values of $\|\sigma(X({\scriptstyle\bullet}))K_{\delta,x_{0}}\|^{2}$ . The possible issue could be that if the term $\varepsilon_{\delta}^{2}$ is much smaller than $\|\sigma(X({\scriptstyle\bullet}))K_{\delta,x_{0}}\|^{2}$ , the SMNE would practically become the MNE. On the other hand, if the term $\varepsilon_{\delta}^{2}$ dominated $\|\sigma(X({\scriptstyle\bullet}))K_{\delta,x_{0}}\|^{2}$ , then the SMNE would practically coincide with the ANE. So, in practice we recommend to estimate the spot volatility first and then to adjust $\varepsilon_{\delta}^{2}$ accordingly.

Figure 2 displays simulation results for the estimators of the parameter $\vartheta$ obtained after $1\,000$ Monte Carlo runs for each of the functions $\sigma_{1}$ , $\sigma_{2}$ and $\sigma_{3}$ . The red lines in the histograms indicate the asymptotic distribution, obtained as a mixture of $1\,000$ Gaussian densities that (individually, for each run) follow the theoretical results established in Theorems 3.3, 3.5 and 3.8. Monte Carlo mean and standard deviation for every case are stated in Table 1.

In the additive noise case $\sigma_{1}$ all three estimators perform similarly well. In this case we have equalities in (3.13) and the resulting asymptotic distributions coincide. In the “Hölder” multiplicative noise case $\sigma_{2}$ , the estimator ANE performs slightly worse than the two alternatives. Since $\sigma_{2}({\scriptstyle\bullet})\geq 0.01>0$ , we have $T^{\star}=T$ and the estimators MNE and SMNE deliver similar results.

For $\sigma_{3}$ the histogram of the ANE in Figure 2(bottom, left) is much more spread out, but has not yet entered the asymptotic regime with a very flat asymptotic density. There are, however, quite a few outliers (12.6 %) outside the interval $[0,0.1]$ , which are not shown and which are caught pretty well by the tails of the asymptotic density. Note also that the corresponding empirical standard deviation in Table 1 is very high with about half the length of the interval $[0,0.1]$ . The estimators MNE and SMNE give a significant improvement here with an error distribution that is almost unchanged with respect to the cases $\sigma_{1}$ and $\sigma_{2}$ . It is worth noting that the assumption $\underline{\sigma}>0$ from Theorem 3.5 for the MNE is violated by $\sigma_{3}$ and we also had $T^{\star}<T$ , but with a minor difference only. In the discrete numerical setting we use the threshold $10^{-6}$ to determine whether $\sigma(X(t,x_{0}))$ is zero or not.

Further unreported simulations show that the performance of the estimators is not influenced by the location of the central observation point $x_{0}$ , unless $x_{0}$ is located very close to the boundary. In fact, if several local measurements (localised around points $\{x_{0}^{j}:j=1,\ldots J\}$ ) are available, it is possible to combine local estimators, see [1], where such an approach is explained and used. Also, simulation results for varying $\delta$ confirm the convergence rate $\delta$ as $\delta\rightarrow 0$ . Note, however, that the spatial discretisation for reliable simulations must always be much finer than $\delta$ . In our setup with $\delta=0.6$ and $L/M=0.025$ , the localised kernel $K_{\delta,x_{0}}$ was evaluated discretely on $48$ grid points.

In conclusion, the two newly proposed estimators MNE and SMNE performed as well as the ANE or even better than the ANE and their asymptotic distribution matches the results obtained in Section 3. The ANE provides good estimation accuracy also under multiplicative noise, but its accuracy suffers under strongly varying $\sigma({\scriptstyle\bullet})$ .

5 Proofs

5.1 Fundamental asymptotics

We need some properties of the rescaled operators and semigroups. Let $(S_{\vartheta,\delta,x_{0}}(t),t\geq 0)$ be the strongly continuous semigroup generated by $\vartheta\Delta$ with Dirichlet boundary conditions on $L^{2}(\Lambda_{\delta,x_{0}})$ and note that both semigroups $(S_{\vartheta}(t),t\geq 0)$ and $(S_{\vartheta,\delta,x_{0}}(t),t\geq 0)$ , are self-adjoint. We cite Lemma 3.1 from [3]:

Lemma 5.1.

For $\delta\in(0,1)$ the following holds:

(i)

If $z\in H_{0}^{1}(\Lambda_{\delta,x_{0}})\cap H^{2}(\Lambda_{\delta,x_{0}})$ , then $\Delta z_{\delta,x_{0}}=\delta^{-2}\left(\Delta z\right)_{\delta,x_{0}}$ .

(ii)

If $z\in L^{2}(\Lambda_{\delta,x_{0}})$ , then $S_{\vartheta}(t)z_{\delta,x_{0}}=\left(S_{\vartheta,\delta,x_{0}}(\delta^{-2}t)z\right)_{\delta,x_{0}}$ , $t\geq 0$ .

The deterministic flow of the initial condition will become negligible due to the next result.

Lemma 5.2.

For an initial condition $X_{0}\in C(\bar{\Lambda})$ we have

[TABLE]

Proof.

Lemma A.7(ii) in [3] shows for $X_{0}\in L^{p}(\Lambda)$ , $p\geq 2$ , that

[TABLE]

Because of $X_{0}\in C(\bar{\Lambda})\subseteq L^{\infty}(\Lambda)$ we may choose $p=4$ and obtain the result. ∎

Lemma 5.3.

Grant Assumptions (S) and (X). For any $t\in[0,T]$ ,

[TABLE]

holds as $\delta\rightarrow 0$ . The limit holds true for any $\omega\in\Omega$ , i.e., surely.

Proof.

For any $t\in[0,T]$ , we have by the continuity of $\sigma$ and $K$ , provided by Assumptions (S) and (X),

[TABLE]

Here dominated convergence is applied with integrable majorant $\overline{\sigma}^{2}K^{2}({\scriptstyle\bullet})$ . ∎

We will need an inequality for the rescaled semigroup $S_{\vartheta,\delta,x_{0}}$ . It encapsulates essentially the hypercontractivity of the heat semigroup.

Lemma 5.4.

For $\alpha\in[0,2]$ , $\delta\in(0,1)$ and $0\leq v\leq\delta^{-2}T$ we have

[TABLE]

Proof.

For $\alpha=0$ we apply Lemma A.6(ii) from [3] with $w=\Delta K$ , $\alpha^{\prime}=1$ and $d=1$ . The constant involved only depends on $T$ and $K$ , which are fixed here.

For $\alpha=2$ we combine Proposition 3.5(i) in [3] and the second inequality from Lemma A.2(iii) in [3] such that

[TABLE]

with $\lVert z\rVert_{L^{1}\cap L^{2}(\operatorname{{\mathbb{R}}})}:=\lVert z\rVert_{L^{1}(\operatorname{{\mathbb{R}}})}+\lVert z\rVert_{L^{2}(\operatorname{{\mathbb{R}}})}$ , using that $K\in H^{2}(\operatorname{{\mathbb{R}}})$ has compact support and is fixed throughout.

For $\alpha\in(0,2)$ we use the Hölder inequality with weight function $w(x)=(S_{\vartheta,\delta,x_{0}}(v)\Delta K)(x)^{2}$ and $p=2/\alpha$ to obtain

[TABLE]

where we used the first two parts of the proof. ∎

We need a uniform bound on the second and fourth centered moment of $X_{\delta,x_{0}}^{\Delta}(t)$ in the sequel.

Lemma 5.5.

For $\delta\rightarrow 0$ we have

[TABLE]

in particular we have $\operatorname{Var}(X_{\delta,x_{0}}^{\Delta}(t))=O(\delta^{-2})$ uniformly over $t\in[0,T]$ .

Proof.

By (2.4) and (2.2) we have

[TABLE]

The Burkholder-Davis-Gundy inequality yields with a constant $C_{4}\geq 1$

[TABLE]

Setting $g(t_{0},s)=\sigma(X(s))S_{\vartheta}(t_{0}-s)\Delta K_{\delta,x_{0}}$ and $t_{0}=t$ , we obtain

[TABLE]

Note that inequality (5.2) holds uniformly over $t\in[0,T]$ and gives the result. The second part follows via Jensen’s inequality. ∎

5.2 Approximation of quadratic variations and related terms

We are ready to study the asymptotic behaviour of the terms $\mathcal{J}_{\delta}$ , $\mathcal{I}_{\delta}$ , $\tilde{\mathcal{I}}_{\delta}$ , $\mathcal{J}_{\delta}^{\star}$ and $\mathcal{I}_{\delta}^{\star}$ . We will analyse them simultaneously, using a generalisation $\mathcal{L}_{\delta}$ .

First, let $f_{\delta}:L^{2}(\Lambda)\rightarrow\mathbb{R}$ be a continuous (and possibly non-linear) functional of the state, satisfying one of the following:

(F1)

There exists $C>0$ such that for any $z,y\in L^{2}(\Lambda)$ , $\delta\in(0,1)$

[TABLE]

(F2)

There exists $C>0$ such that for any $z,y\in L^{2}(\Lambda)$ , $\delta\in(0,1)$

[TABLE]

where $(\varepsilon_{\delta})$ is satisfying (3.8).

Lemma 5.6.

(i)

Functionals $f_{\delta}({\scriptstyle\bullet})\in\{1,\|\sigma({\scriptstyle\bullet})K_{\delta,x_{0}}\|^{2},\|\sigma({\scriptstyle\bullet})K_{\delta,x_{0}}\|^{-2}\}$ satisfy condition (F1), provided $\underline{\sigma}>0$ in the last case.

(ii)

Functionals $f_{\delta}({\scriptstyle\bullet})\in\{\frac{1}{\|\sigma({\scriptstyle\bullet})K_{\delta,x_{0}}\|^{2}+\varepsilon_{\delta}^{2}},\frac{\|\sigma({\scriptstyle\bullet})K_{\delta,x_{0}}\|^{2}}{(\|\sigma({\scriptstyle\bullet})K_{\delta,x_{0}}\|^{2}+\varepsilon_{\delta}^{2})^{2}}\}$ satisfy condition (F2).

Proof.

(i). For $f_{\delta}({\scriptstyle\bullet})\equiv 1$ the statement is trivial.

For $f_{\delta}({\scriptstyle\bullet})=\|\sigma({\scriptstyle\bullet})K_{\delta,x_{0}}\|^{2}$ we have a uniform upper bound $\overline{\sigma}^{2}\|K\|_{L^{2}(\mathbb{R})}^{2}$ . Moreover, for any $z,y\in L^{2}(\Lambda)$ and $\delta\in(0,1)$ we have

[TABLE]

where we used $|A^{2}-B^{2}|=|A-B||A+B|$ together with the upper bound in the first inequality and then we followed up with the reverse triangle inequality.

For $f_{\delta}({\scriptstyle\bullet})=\|\sigma({\scriptstyle\bullet})K_{\delta,x_{0}}\|^{-2}$ , there is a uniform upper bound $\underline{\sigma}^{-2}\|K\|_{L^{2}(\mathbb{R})}^{-2}$ . Moreover, for any $z,y\in L^{2}(\Lambda)$ and $\delta\in(0,1)$ we have

[TABLE]

Therefore the proof is finished as in the previous case.

(ii). For $f_{\delta}({\scriptstyle\bullet})=\frac{1}{\|\sigma({\scriptstyle\bullet})K_{\delta,x_{0}}\|^{2}+\varepsilon_{\delta}^{2}}$ the upper bound $\varepsilon_{\delta}^{-2}$ is smaller than $\varepsilon_{\delta}^{-4}$ . To prove the second part, we use $|A^{2}-B^{2}|=|A+B||A-B|$ , the upper bound $\overline{\sigma}$ and the reverse triangle inequality to obtain

[TABLE]

For $f_{\delta}({\scriptstyle\bullet})=\frac{\|\sigma({\scriptstyle\bullet})K_{\delta,x_{0}}\|^{2}}{(\|\sigma({\scriptstyle\bullet})K_{\delta,x_{0}}\|^{2}+\varepsilon_{\delta}^{2})^{2}}$ , the upper bound $\overline{\sigma}^{2}\|K\|_{L^{2}(\mathbb{R})}^{2}\varepsilon_{\delta}^{-4}$ applies. Moreover, algebraic calculations and the technique above yield

[TABLE]

Therefore, the proof is finished as in the previous case. ∎

Introduce

[TABLE]

In the following proposition we present different expressions that are equal to $\mathcal{L}_{\delta}$ up to terms that are of lower order than $\delta^{-2}$ . This is the major ingredient for the proofs of the main results, noting that the techniques developed in [3] cannot be used here due to the multiplicative noise structure.

Proposition 5.7.

Grant Assumptions (S), (X) with $f_{\delta}$ satisfying condition (F1) or grant Assumptions (S’), (X’) with (3.8) and $f_{\delta}$ satisfying condition (F2). Then $\mathcal{L}_{\delta}$ from (5.3) equals up to additive terms of order $o_{\mathbb{P}}(\delta^{-2})$ for $\delta\rightarrow 0$ :

(i)

$\mathcal{L}_{\delta}^{(i)}=\int_{\delta}^{T}f_{\delta}(X(t))(X_{\delta,x_{0}}^{\Delta}(t)-\operatorname{{\mathbb{E}}}X_{\delta,x_{0}}^{\Delta}(t))^{2}\,dt,$ **

(ii)

$\mathcal{L}_{\delta}^{(ii)}=\int_{\delta}^{T}f_{\delta}(X(t-\delta))(X_{\delta,x_{0}}^{\Delta}(t)-\operatorname{{\mathbb{E}}}X_{\delta,x_{0}}^{\Delta}(t))^{2}\,dt,$ **

(iii)

$\mathcal{L}_{\delta}^{(iii)}=\int_{\delta}^{T}f_{\delta}(X(t-\delta))(\int_{0}^{t}\sigma(X(s,x_{0}))\langle S_{\vartheta}(t-s)\Delta K_{\delta,x_{0}},dW(s)\rangle)^{2}\,dt$ ,

(iv)

$\mathcal{L}_{\delta}^{(iv)}=\int_{\delta}^{T}f_{\delta}(X(t-\delta))(\int_{t-\delta}^{t}\sigma(X(s,x_{0}))\langle S_{\vartheta}(t-s)\Delta K_{\delta,x_{0}},dW(s)\rangle)^{2}\,dt$ ,

(v)

$\mathcal{L}_{\delta}^{(v)}=\int_{\delta}^{T}f_{\delta}(X(t-\delta))\sigma^{2}(X(t-\delta,x_{0}))(\int_{t-\delta}^{t}\langle S_{\vartheta}(t-s)\Delta K_{\delta,x_{0}},dW(s)\rangle)^{2}\,dt$ ,

(vi)

$\mathcal{L}_{\delta}^{(vi)}=\int_{\delta}^{T}f_{\delta}(X(t-\delta))\sigma^{2}(X(t-\delta,x_{0}))\operatorname{{\mathbb{E}}}(\int_{t-\delta}^{t}\langle S_{\vartheta}(t-s)\Delta K_{\delta,x_{0}},dW(s)\rangle)^{2}\,dt$ .

(vii)

$\mathcal{L}_{\delta}^{(vii)}=(2\vartheta)^{-1}\lVert K^{\prime}\rVert_{L^{2}(\mathbb{R})}^{2}\delta^{-2}\int_{0}^{T}f_{\delta}(X(t))\sigma^{2}(X(t,x_{0}))\,dt$ .

Remark 5.8.

The overall idea is to achieve the representation $\mathcal{L}_{\delta}^{(vii)}$ via slight consecutive alterations. In point (i) we shorten the outer integral to the interval $[\delta,T]$ , in (ii), we present a slight time shift of the solution in the functional, i.e., $f_{\delta}(X(t-\delta))$ . In point (iii) the function $\sigma(X(s,{\scriptstyle\bullet}))$ is fixed in the space-point $x_{0}$ , in (iv) the stochastic integral is shortened, in (v) the function $\sigma(X(s,x_{0}))$ is fixed at the time-point $t-\delta$ . In (vi) the expectation of the squared stochastic integral is implemented via conditional independence. Finally, in (vii) the expectation is approximated and the integral extended again.

Proof.

We present the proof with Assumptions (S’) and (X’) and $f_{\delta}$ satisfying condition (F2) with (3.8) in mind. The other case is analogous and much simpler, mostly because it does not use the function $\varepsilon_{\delta}$ at all. The proof of (ii) is shown for both cases. The order $O(\varepsilon_{\delta}^{-4})$ for $|f_{\delta}|$ is used frequently as the first step of the proof.

(i). Compute

[TABLE]

using Lemma 5.5. Therefore, the remainder term is of order $o_{\mathbb{P}}(\delta^{-2})$ due to $\varepsilon_{\delta}^{-4}\delta\to 0$ by (3.8).

(ii). By the Cauchy-Schwarz inequality, we have

[TABLE]

Lemma 5.5 gives $\mathbb{E}\int_{\delta}^{T}(X_{\delta,x_{0}}^{\Delta}(t)-\operatorname{{\mathbb{E}}}X_{\delta,x_{0}}^{\Delta}(t))^{4}\,dt=O(\delta^{-4})$ so that the second factor is of order $O_{\operatorname{{\mathbb{P}}}}(\delta^{-2})$ . Hence, it suffices to establish $\int_{\delta}^{T}(f_{\delta}(X(t))-f_{\delta}(X(t-\delta)))^{2}dt=o_{\operatorname{{\mathbb{P}}}}(1)$ .

When we consider Assumptions (S’), (X’) and $f_{\delta}$ satisfying condition (F2), we obtain

[TABLE]

by the Hölder continuity of $\sigma$ and $X({\scriptstyle\bullet},x)$ and by $\lVert K_{\delta,x_{0}}\rVert=\lVert K\rVert_{L^{2}(\operatorname{{\mathbb{R}}})}$ . This upper bound converges to zero by (3.8).

When we consider Assumptions (S), (X) and $f_{\delta}$ satisfying condition (F1), we have

[TABLE]

The integrand in (5.6) converges to zero by the continuity of $X$ and $\sigma$ (i.e., Assumptions (X) and (S)). The integrable majorant $4\overline{\sigma}^{2}K^{2}({\scriptstyle\bullet})$ serves for the $dy$ –integral in (5.6) as well as for the $dt$ –integral in (5.5). The proof that the second factor converges almost surely to zero is accomplished by the dominated convergence theorem.

(iii). By the upper bound on $|f_{\delta}|$ , we have

[TABLE]

Denoting

[TABLE]

using $|A^{2}-B^{2}|=|A+B||A-B|$ and the Cauchy-Schwarz inequality, we may follow up (5.7) with

[TABLE]

We start with the first factor after $\varepsilon_{\delta}^{-4}$ . Itô’s isometry yields

[TABLE]

and this is of order $O(\delta^{-2})$ , compare (5.2). The second factor is given by

[TABLE]

Using Assumptions (S’) and (X’), we have

[TABLE]

and consequently for the integrand in (5.10)

[TABLE]

Without loss of generality assume $\beta_{x}\beta_{\sigma}\in(0,1)$ and $\beta_{x}\beta_{\sigma}\neq 1/3$ (otherwise decrease $\beta_{x}$ or $\beta_{\sigma}$ slightly). Using Lemma 5.4, we obtain from (5.11)

[TABLE]

Since the bound applies uniformly to all $t\in[0,T]$ , we deduce from (5.10)

[TABLE]

in view of (3.8), which remained to be proved.

(iv). We obtain, using the Cauchy-Schwarz inequality,

[TABLE]

In the analysis of $(I)$ , we use Itô’s isometry, the scaling properties from Lemma 5.1 and Lemma 5.4 with $\alpha=0$ . We obtain

[TABLE]

due to (3.8). Given the result for $(I)$ , term $(II)$ is also of order $o(\delta^{-2})$ because the squared second factor is of order $O(\delta^{-2})$ :

[TABLE]

using Itô isometry and arguing as in (5.2).

(v). Denoting

[TABLE]

we compute

[TABLE]

Itô’s isometry yields

[TABLE]

Hence, $\mathbb{E}\int_{\delta}^{T}G_{+}^{2}(t)\,dt=O(\delta^{-2})$ follows from the boundedness of $\sigma$ and the argument in (5.2).

For the term with $G_{-}$ we have from Assumptions (S’) and (X’) that $\mathbb{E}(\sigma(X(t-v,x_{0}))-\sigma(X(t-\delta,x_{0})))^{2}\lesssim\delta^{2\beta_{t}\beta_{\sigma}}$ uniformly over $t$ and $v$ . The same argument thus gives here $\mathbb{E}\int_{\delta}^{T}G_{-}^{2}(t)\,dt=O(\delta^{2\beta_{t}\beta_{\sigma}-2})$ . Insertion into (5.13) and noting condition (3.8) yields the overall rate $o(\delta^{-2})$ .

(vi). Introduce for $t\in[\delta,T]$

[TABLE]

and compute

[TABLE]

We analyse the integral on two sets: (a) $t>u+\delta$ and (b) $u+\delta>t>u$ .

(a) $t>u+\delta$ . We condition on $\mathcal{F}_{t-\delta}$ and obtain

[TABLE]

using that $\tilde{H}(t)$ is independent of $\mathcal{F}_{t-\delta}$ with $\operatorname{{\mathbb{E}}}\tilde{H}(t)=0$ and that the other factors are $\mathcal{F}_{t-\delta}$ -measurable.

(b) $u+\delta>t>u$ . In this case, we use the upper bounds for $\sigma$ and $f_{\delta}$ and bound (5.14) (up to a positive constant) by

[TABLE]

where the last step involves the Cauchy-Schwarz inequality. The analogous calculations to (5.2) yield

[TABLE]

uniformly over $t\in[\delta,T]$ . We conclude $(\mathbb{E}(\mathcal{L}_{\delta}^{(v)}-\mathcal{L}_{\delta}^{(vi)})^{2})^{1/2}\lesssim\varepsilon_{\delta}^{-4}\delta^{-3/2}$ , implying $\lvert\mathcal{L}_{\delta}^{(v)}-\mathcal{L}_{\delta}^{(vi)}\rvert=o_{\operatorname{{\mathbb{P}}}}(\delta^{-2})$ due to (3.8).

(vii). By translating the integrand in $\mathcal{L}_{\delta}^{(vii)}$ and by Itô’s isometry we have

[TABLE]

By the uniforms bounds on $f_{\delta}$ and $\sigma$ the last term is of order $O(\delta^{-2}\delta\varepsilon_{\delta}^{-4})$ .

Next, we compute by the scaling properties in Lemma 5.1 and by partial integration

[TABLE]

Now, bounding the scalar product by the Cauchy-Schwarz inequality and Lemma 5.4 with $\alpha=0$ , we see that it is of order $O(\delta^{3/4})$ . Using the upper bounds for $f_{\delta}$ and $\sigma$ again, we thus obtain

[TABLE]

by (3.8). ∎

5.3 Asymptotics of quadratic variations and related terms

We shall include the initial condition and consider the terms

[TABLE]

where $X_{\delta,x_{0}}^{\Delta}(t)$ is not centered. Having achieved the representation $\mathcal{L}_{\delta}^{(vii)}$ , we are now ready to determine the limit of $\delta^{2}\tilde{\mathcal{L}}_{\delta}$ as $\delta\rightarrow 0$ .

Corollary 5.9.

Grant Assumptions (S), (X) with $f_{\delta}$ satisfying condition (F1) or grant Assumptions (S’), (X’) with (3.8) and $f_{\delta}$ satisfying condition (F2). Suppose as $\delta\to 0$

[TABLE]

for some random variable $\Psi$ . Then $\tilde{\mathcal{L}}_{\delta}$ from (5.16) satisfies for $\delta\rightarrow 0$

[TABLE]

Proof.

The corresponding convergence for the centered version $\delta^{2}{\mathcal{L}}_{\delta}$ follows directly from the representation (vii) in Proposition 5.7.

For $\tilde{\mathcal{L}}_{\delta}$ note that

[TABLE]

We apply the upper bound to $|f_{\delta}|$ from condition (F1) (respectively (F2)) to the first term and obtain its convergence to zero by Lemma 5.2 using $\varepsilon_{\delta}^{-4}\delta^{-11/6}=o(\delta^{-2})$ due to (3.8) in the second case. The second term converges to zero in probability using the Cauchy-Schwarz inequality and $\delta^{2}{\mathcal{L}}_{\delta}=O_{P}(1)$ from above. This gives the result for $\delta^{2}\tilde{\mathcal{L}}_{\delta}$ . ∎

Proposition 5.10.

The following holds as $\delta\rightarrow 0$ :

(i)

under Assumptions (S) and (X):

[TABLE]

(ii)

under Assumptions (S) and (X):

[TABLE]

(iii)

under Assumptions (S) and (X) and assuming $\underline{\sigma}>0$ :

[TABLE]

(iv)

under Assumptions (S’) and (X’) with (3.8):

[TABLE]

(v)

under Assumptions (S’) and (X’) with (3.8):

[TABLE]

Proof.

We apply Proposition 5.9 to the functionals proposed in Lemma 5.6.

(i). We use $f_{\delta}(X(t))=1$ so that $\Psi=\int_{0}^{T}\sigma^{2}(X(t,x_{0}))\,dt$ and the result follows.

(ii). We use $f_{\delta}(X(t))=\|\sigma(X(t))K_{\delta,x_{0}}\|^{2}$ . Since $f_{\delta}(X(t))\rightarrow\sigma^{2}(X(t,x_{0}))\|K\|_{L^{2}(\mathbb{R})}^{2}$ by Lemma 5.3, the limit $\Psi=\int_{0}^{T}\sigma^{4}(X(t,x_{0}))\|K\|_{L^{2}(\mathbb{R})}^{2}\,dt$ follows by dominated convergence because $\overline{\sigma}^{4}\|K\|_{L^{2}(\mathbb{R})}^{2}$ is an integrable majorant.

(iii). We use $f_{\delta}(X(t))=\|\sigma(X(t))K_{\delta,x_{0}}\|^{-2}$ . Since $f_{\delta}(X(t))\sigma^{2}(X(t,x_{0}))\rightarrow\|K\|_{L^{2}(\mathbb{R})}^{-2}$ , we obtain the limit $\Psi=T\|K\|_{L^{2}(\mathbb{R})}^{-2}$ . The integrable majorant for the $dt$ –integral over $f_{\delta}(X(t))\sigma^{2}(X(t,x_{0}))$ can be taken as $\frac{\overline{\sigma}^{2}}{\underline{\sigma}^{2}\|K\|_{L^{2}(\mathbb{R})}^{2}}$ (up to a multiplicative constant).

(iv). We use $f_{\delta}(X(t))=\frac{1}{\|\sigma(X(t))K_{\delta,x_{0}}\|^{2}+\varepsilon_{\delta}^{2}}$ . Since we have only an upper bound to $\sigma({\scriptstyle\bullet})$ in this case, obtaining an integrable majorant to $\frac{\sigma^{2}(X(t,x_{0}))}{\|\sigma(X(t))K_{\delta,x_{0}}\|^{2}+\varepsilon_{\delta}^{2}}$ is not so straightforward. We use $|A^{2}-B^{2}|=|A+B||A-B|$ , the upper bound $\overline{\sigma}$ and Assumptions (S’) and (X’) to obtain

[TABLE]

This gives the uniform majorant in $\delta$ and $t$

[TABLE]

for sufficiently small $\delta$ due to (3.8). Next, we determine the pointwise limit of the $L^{1}(\operatorname{{\mathbb{P}}})$ -distance. In view of (5.18) and $\delta^{\beta_{x}\beta_{\sigma}}\varepsilon_{\delta}^{-2}\to 0$ due to (3.8) it suffices to note for all $t$

[TABLE]

We conclude that the integral converges in $L^{1}(\operatorname{{\mathbb{P}}})$ and thus in probability to

[TABLE]

(v). We use $f_{\delta}(X(t))=\frac{\|\sigma(X(t))K_{\delta,x_{0}}\|^{2}}{(\|\sigma(X(t))K_{\delta,x_{0}}\|^{2}+\varepsilon_{\delta}^{2})^{2}}$ . An integrable majorant in $t$ is given by $2\|K\|_{L^{2}(\mathbb{R})}^{-2}$ and the limit $\Psi=\lVert K\rVert_{L^{2}(\operatorname{{\mathbb{R}}})}^{-2}T^{\star}$ is determined as in the previous case. ∎

5.4 Proof of the main theorems

Proof of Theorem 3.3.

Consider the error decomposition (3.2). The asymptotic properties of $\delta^{2}\mathcal{I}_{\delta}$ and $\delta^{2}\mathcal{J}_{\delta}$ are established in Proposition 5.10(i),(ii), therefore the second factor converges in probability to the random variable

[TABLE]

For the first factor, let $Y_{\delta}(t,x):=\delta X_{\delta,x_{0}}^{\Delta}(t)\sigma(X(t,x))K_{\delta,x_{0}}(x)$ and check the two conditions of Proposition 3.2. Since

[TABLE]

by Proposition 5.10(ii), condition (C1) is satisfied with $s(t)=\frac{\|K^{\prime}\|_{L^{2}(\mathbb{R})}\|K\|_{L^{2}(\mathbb{R})}}{(2\vartheta)^{1/2}}\sigma^{2}(X(t,x_{0}))$ . The support condition (C2’) follows directly from the definition of $Y_{\delta}(t)$ as a multiple of $K_{\delta,x_{0}}$ . Proposition 3.2 thus shows $\delta\mathcal{M}_{\delta}\xrightarrow{stably}\int_{0}^{T}s(t)\,dB(t)$ as $\delta\rightarrow 0$ with an independent scalar Brownian motion $B$ . On the event $\{\int_{0}^{T}\sigma^{2}(X(t,x_{0}))\,dt>0\}$ we infer $\frac{\delta\mathcal{M}_{\delta}}{\delta\mathcal{I}_{\delta}^{1/2}}\xrightarrow{stably}Z$ where $Z\sim N(0,1)$ is independent of the $\sigma$ –algebra $\mathscr{F}_{T}$ . We conclude by applying Slutsky’s lemma. ∎

Proof of Theorem 3.5.

The decomposition of the error follows from (3.6):

[TABLE]

A standard continuous martingale central limit theorem (e.g., [14], Theorem 1.19) provides the convergence of the first factor to $N(0,1)$ in distribution.

Proposition 5.10(iii) yields the convergence

[TABLE]

The result follows by applying Slutsky’s lemma. ∎

Proof of Theorem 3.8.

Consider the error decomposition (3.11). The asymptotic properties of $\delta^{2}\mathcal{I}_{\delta}^{\star}$ and $\delta^{2}\mathcal{J}_{\delta}^{\star}$ are established in Proposition 5.10(iv),(v). Therefore the second factor converges in probability to the random variable

[TABLE]

For the first factor, let $Y_{\delta}(t,x):=\frac{\delta X_{\delta,x_{0}}^{\Delta}(t)\sigma(X(t,x))K_{\delta,x_{0}}(x)}{\|\sigma(X(t))K_{\delta,x_{0}}\|^{2}+\varepsilon_{\delta}^{2}}$ and check the two conditions of Proposition 3.2. Since

[TABLE]

by Proposition 5.10(v), condition (C1) is satisfied with $s(t)=\frac{\|K^{\prime}\|_{L^{2}(\mathbb{R})}}{(2\vartheta)^{1/2}\|K\|_{L^{2}(\mathbb{R})}}\mathbf{1}(\sigma(X(t,x_{0}))\neq 0)$ . The support condition (C2’) follows again directly by definition of $Y_{\delta}(t)$ .

Proposition 3.2 thus yields $\delta\mathcal{M}_{\delta}^{\star}\xrightarrow{stably}\int_{0}^{T}s(t)\,dB(t)$ with an independent scalar Brownian motion $B$ . Hence, $\frac{\delta\mathcal{M}_{\delta}^{\star}}{\delta(\mathcal{I}_{\delta}^{\star})^{1/2}}\xrightarrow{stably}Z$ holds on $\{T^{\star}>0\}$ with $Z\sim N(0,1)$ independent of the $\sigma$ –algebra $\mathscr{F}_{T}$ . The proof is concluded by applying Slutsky’s lemma. ∎

6 A stable limit theorem for cylindrical Brownian martingales

Let $H$ be a separable Hilbert space and $(e_{k})_{k\geq 1}$ a complete orthonormal system in $H$ . Let $(W_{k}(t),t\geq 0)_{k\geq 1}$ be a sequence of independent real-valued standard Brownian motions. Then $W(t)=\sum_{k\geq 1}W_{k}(t)e_{k}$ is an $H$ -valued cylindrical Brownian motion (e.g., Proposition 4.11 in [9]). Consider the filtered probability space $(\Omega,{\mathscr{F}},({\mathscr{F}}_{t})_{t\geq 0},\mathbb{P})$ , on which $(W_{k}(t),t\geq 0)_{k\geq 1}$ are defined and where the Brownian filtration $({\mathscr{F}}_{t})_{t\geq 0}$ is the filtration generated by $(W_{k}(t),t\geq 0)_{k\geq 1}$ and augmented by $\mathbb{P}$ –null sets.

We start with a Hilbert space-valued Brownian martingale representation theorem, which follows by approximation from the finite-dimensional version, but does not seem readily available in the literature.

Proposition 6.1.

Let $(M(t),t\geq 0)$ be a square-integrable real-valued martingale with respect to $({\mathscr{F}}_{t})_{t\geq 0}$ and with càdlàg paths, $M(0)=0$ . Then there exist progressively measurable processes $(F_{k}(t),t\geq 0)_{k\geq 1}$ satisfying $\sum_{k=1}^{\infty}\int_{0}^{T}\mathbb{E}F_{k}^{2}(t)\,dt<\infty$ for all $T>0$ and $\mathbb{P}$ –a.s. (with $L^{2}(\operatorname{{\mathbb{P}}})$ -convergence)

[TABLE]

where $F(s):=\sum_{k=1}^{\infty}F_{k}(s)e_{k}$ .

Proof.

Define the subfiltrations $({\mathscr{F}}_{t}^{(K)})_{t\geq 0}$ generated by $(W_{k}(t),t\geq 0)_{1\leq k\leq K}$ and consider

[TABLE]

By the tower property for $s<t$ , we have

[TABLE]

and another application of the tower property yields

[TABLE]

We conclude that $(M^{(K)}(t),t\geq 0)$ forms an $L^{2}(\operatorname{{\mathbb{P}}})$ -martingale with respect to the $K$ -dimensional Brownian filtration $({\mathscr{F}}_{t}^{(K)})_{t\geq 0}$ . By standard martingale theory (e.g., Theorem 1.3.13 in [13]) we may choose a càdlàg version of $(M^{(K)}(t),t\geq 0)$ , which we shall do henceforth.

Theorem 3.4.15 in [13] therefore shows that there are $(F_{k}(t),t\geq 0)_{1\leq k\leq K}$ satisfying $\sum_{k=1}^{K}\int_{0}^{T}\mathbb{E}F_{k}^{2}(t)\,dt<\infty$ for all $T>0$ and $\mathbb{P}$ –a.s.

[TABLE]

The uniqueness result of that theorem also shows that for each $K$ the $F_{k}$ , $k=1,\ldots,K$ , can be chosen to not depend on $K$ because by independence of $W_{K}$ from $(W_{k},1\leq k\leq K-1)$ , $K\geq 2$ , we have

[TABLE]

Since ${\mathscr{F}}_{t}$ is generated by $\bigcup_{K\geq 1}{\mathscr{F}}_{t}^{(K)}$ , the $L^{2}$ -martingale convergence theorem gives $\lim_{K\rightarrow\infty}M^{(K)}(t)=M(t)$ in $L^{2}(\operatorname{{\mathbb{P}}})$ -convergence for every $t\geq 0$ . Hence, also $\sum_{k=1}^{K}\int_{0}^{t}F_{k}(s)\,dW_{k}(s)$ converges in $L^{2}(\operatorname{{\mathbb{P}}})$ for $K\rightarrow\infty$ . By Itô’s isometry this shows that the $L^{2}(\operatorname{{\mathbb{P}}})$ -norms converge: $\sum_{k=1}^{\infty}\int_{0}^{t}\mathbb{E}F_{k}^{2}(s)\,ds<\infty$ . The limit $\sum_{k=1}^{\infty}\int_{0}^{t}F_{k}(s)\,dW_{k}(s)=\int_{0}^{t}\left\langle F(s),dW(s)\right\rangle$ is then well defined as an element of $L^{2}(\operatorname{{\mathbb{P}}})$ . Moreover, it equals the limit of $M^{(K)}(t)$ whence

[TABLE]

holds $\mathbb{P}$ –a.s. for each fixed $t\geq 0$ . Using the càdlàg path versions on each side, this entails equality for all $t\geq 0$ with probability one. ∎

Theorem 6.2.

Let $(Y_{\delta}(t),t\geq 0)$ for $\delta>0$ be progressively measurable $H$ -valued processes on $(\Omega,{\mathscr{F}},({\mathscr{F}}_{t})_{t\geq 0},\mathbb{P})$ with $\int_{0}^{T}\|Y_{\delta}(t)\|^{2}\,dt<\infty$ and all $T\geq 0$ (or $T\in[0,T_{max}]$ ). Assume for all $T$ :

(C1)

$\int_{0}^{T}\|Y_{\delta}(t)\|^{2}\,dt\stackrel{{\scriptstyle\mathbb{P}}}{{\rightarrow}}\int_{0}^{T}s^{2}(t)\,dt$ * as $\delta\rightarrow 0$ for some progressively measurable real-valued process $(s(t),t\geq 0)$ with $\int_{0}^{T}s^{2}(t)\,dt<\infty$ ,*

(C2)

$\int_{0}^{T}\left\langle Y_{\delta}(t),F(t)\right\rangle\,dt\stackrel{{\scriptstyle\mathbb{P}}}{{\rightarrow}}0$ * as $\delta\rightarrow 0$ for all progressively measurable $H$ -valued processes $(F(t),t\geq 0)$ .*

Then the following stable limit theorem for stochastic integrals holds:

[TABLE]

with an independent scalar Brownian motion $(B(t),t\geq 0)$ (on an extension of the original filtered probability space).

Proof.

Since $M_{\delta}(T):=\int_{0}^{T}\left\langle Y_{\delta}(t),dW(t)\right\rangle$ is a continuous martingale with quadratic variation $C_{\delta}(T)=\int_{0}^{T}\|Y_{\delta}(t)\|^{2}\,dt$ , we can apply Theorem IX.7.3(b) in [12] with the trivial processes $Z_{t}=0$ , $B_{t}=0$ (in that Theorem), so that it remains to check for all $T$ :

(i)

$C_{\delta}(T)\stackrel{{\scriptstyle\mathbb{P}}}{{\rightarrow}}\int_{0}^{T}s^{2}(t)\,dt$ , 2. (ii)

$\left\langle M_{\delta},N\right\rangle_{T}\stackrel{{\scriptstyle\mathbb{P}}}{{\rightarrow}}0$ for all bounded càdlàg-martingales $N$ on $(\Omega,{\mathscr{F}},({\mathscr{F}}_{t})_{t\geq 0},\mathbb{P})$ with $N(0)=0$ .

Condition (i) is satisfied by assumption (C1). For condition (ii) we use the Brownian martingale representation from Proposition 6.1 to represent $N(T)=\int_{0}^{T}\left\langle F(t),dW(t)\right\rangle$ with progressively measurable coordinates $(F_{k}(t),t\geq 0)_{k\geq 1}$ and $\sum_{k=1}^{\infty}\int_{0}^{T}\mathbb{E}F_{k}^{2}(t)\,dt<\infty$ for all $T\geq 0$ . Then $\left\langle M_{\delta},N\right\rangle_{T}=\int_{0}^{T}\left\langle Y_{\delta}(t),F(t)\right\rangle\,dt$ holds and condition (ii) follows from Assumption (C2). ∎

Corollary 6.3.

Theorem 6.2 holds for $H=L^{2}(\Lambda)$ if condition (C2) is replaced by the following support condition:

(C2’)

There exist deterministic Borel sets $A(\delta)\subseteq[0,T]\times\Lambda$ with $\operatorname{supp}(Y_{\delta^{\prime}})\subseteq A(\delta)$ Lebesgue-almost everywhere for all $0<\delta^{\prime}\leq\delta$ and $\lambda(A(\delta))\rightarrow 0$ as $\delta\rightarrow 0$ , where $\lambda$ denotes Lebesgue measure on $[0,T]\times\Lambda$ .

Proof.

Set $F_{\delta}(t)=F(t)\mathbf{1}_{A(\delta)^{C}}$ for $F({\scriptstyle\bullet})$ in condition (C2) of Theorem 6.2. Then by $\lambda(A(\delta))\rightarrow 0$ and dominated convergence, $F_{\delta}\rightarrow F$ holds in $L^{2}([0,T]\times\Lambda)$ for each $\omega\in\Omega$ . Clearly, for each $\delta>0$ the support property gives

[TABLE]

By the triangle and the Cauchy-Schwarz inequality and using condition (C1), we obtain

[TABLE]

with convergence in probability. ∎

Acknowledgment

We are grateful to Randolf Altmeyer for very helpful discussions, in particular bringing up the ideas for Proposition 5.7. Insightful comments and questions by Gregor Pasemann, Eric Ziebell and Pavel Kříž have lead to several improvements. This research has been funded by Deutsche Forschungsgemeinschaft (DFG) - SFB1294/1 - 318763901.

Bibliography19

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] R. Altmeyer, T. Bretschneider, J. Janák, M. Reiß (2022) Parameter estimation in an SPDE model for cell repolarisation, SIAM/ASA J. Uncert. Quantif. 10 , 179–199.
2[2] R. Altmeyer, I. Cialenco, G. Pasemann (2020) Parameter estimation for semilinear SPD Es from local measurements, Preprint , ar Xiv:2004.14728.
3[3] R. Altmeyer, M. Reiß (2021) Nonparametric estimation for linear SPD Es from local measurements, Ann. Appl. Probab. 31 , 1–38.
4[4] J. Blath, M. Hammer, F. Nie (2022) The stochastic Fisher-KPP equation with seed bank and on/off branching coalescing Brownian motion, Stoch. PDE: Anal. Comp. , 1–46.
5[5] Z. Cheng, I. Cialenco, R. Gong (2020) Bayesian estimations for diagonalizable bilinear SPD Es, Stoch. Proc. Appl. 130 , 845–877.
6[6] C. Chong (2019) High-frequency analysis of parabolic stochastic PD Es with multiplicative noise: Part I, Preprint , ar Xiv:1908.04145.
7[7] I. Cialenco, N. Glatt-Holtz (2011) Parameter estimation for the stochastically perturbed Navier-Stokes equations, Stoch. Proc. Appl. 121 , 701–724.
8[8] I. Cialenco, S. Lototsky (2009) Parameter estimation in diagonalizable bilinear stochastic parabolic equations, Stat. Infer. Stoch. Proc. 12 , 203–219.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Parameter estimation for the stochastic heat equation with multiplicative noise from local measurements

Abstract

1 Introduction

2 The model

2.1 Notation

2.2 The stochastic heat equation

Assumption (S)****.

Assumption (X)****.

2.3 The observation scheme

3 Estimation methods and main results

3.1 The additive noise estimator

Definition 3.1**.**

Proposition 3.2**.**

Theorem 3.3**.**

Proof.

3.2 The multiplicative noise estimator

Definition 3.4**.**

Theorem 3.5**.**

Proof.

3.3 The stabilised multiplicative noise estimator

Definition 3.6**.**

Assumption (S’)****.

Assumption (X’)****.

Example 3.7**.**

Theorem 3.8**.**

Proof.

Remark 3.9**.**

3.4 Data-driven confidence intervals

Corollary 3.10**.**

4 Implementation and simulation results

5 Proofs

5.1 Fundamental asymptotics

Lemma 5.1**.**

Lemma 5.2**.**

Proof.

Lemma 5.3**.**

Proof.

Lemma 5.4**.**

Proof.

Lemma 5.5**.**

Proof.

5.2 Approximation of quadratic variations and related terms

Lemma 5.6**.**

Proof.

Proposition 5.7**.**

Remark 5.8**.**

Proof.

5.3 Asymptotics of quadratic variations and related terms

Corollary 5.9**.**

Proof.

Proposition 5.10**.**

Proof.

5.4 Proof of the main theorems

Proof of Theorem 3.3.

Proof of Theorem 3.5.

Proof of Theorem 3.8.

6 A stable limit theorem for cylindrical Brownian martingales

Proposition 6.1**.**

Proof.

Theorem 6.2**.**

Proof.

Corollary 6.3**.**

Proof.

Acknowledgment

Assumption (S).

Assumption (X).

Definition 3.1.

Proposition 3.2.

Theorem 3.3.

Definition 3.4.

Theorem 3.5.

Definition 3.6.

Assumption (S’).

Assumption (X’).

Example 3.7.

Theorem 3.8.

Remark 3.9.

Corollary 3.10.

Lemma 5.1.

Lemma 5.2.

Lemma 5.3.

Lemma 5.4.

Lemma 5.5.

Lemma 5.6.

Proposition 5.7.

Remark 5.8.

Corollary 5.9.

Proposition 5.10.

Proposition 6.1.

Theorem 6.2.

Corollary 6.3.