Conditionally Gaussian Random Sequences for an Integrated Variance   Estimator with Correlation between Noise and Returns

Stefano Peluso; Antonietta Mira; Pietro Muliere

arXiv:1905.11793·stat.CO·May 29, 2019

Conditionally Gaussian Random Sequences for an Integrated Variance Estimator with Correlation between Noise and Returns

Stefano Peluso, Antonietta Mira, Pietro Muliere

PDF

TL;DR

This paper introduces a new integrated variance estimator that effectively handles correlation between microstructure noise and returns, using a generalized sampling algorithm, and demonstrates improved accuracy on real financial data.

Contribution

It proposes a novel estimator and a generalized sampling algorithm to account for noise-return correlation, filling a gap in existing financial variance estimation methods.

Findings

01

Outperforms existing estimators in simulation studies.

02

Shows improved accuracy on intra-day Microsoft prices.

03

Demonstrates robustness to noise-return dependence.

Abstract

Correlation between microstructure noise and latent financial logarithmic returns is an empirically relevant phenomenon with sound theoretical justification. With few notable exceptions, all integrated variance estimators proposed in the financial literature are not designed to explicitly handle such a dependence, or handle it only in special settings. We provide an integrated variance estimator that is robust to correlated noise and returns. For this purpose, a generalization of the Forward Filtering Backward Sampling algorithm is proposed, to provide a sampling technique for a latent conditionally Gaussian random sequence. We apply our methodology to intra-day Microsoft prices, and compare it in a simulation study with established alternatives, showing an advantage in terms of root mean square error and dispersion.

Tables1

Table 1. Table 1: Bias, standard deviation and RMSE for the methods in Kalnina and Linton, ( 2008 ) (KL), Zhang et al., ( 2005 ) (Z), Bandi and Russell, ( 2011 ) (BR), Jacod et al., ( 2009 ) (JAC), the small sample adjusted estimator of Jacod et al., ( 2009 ) (JAC ADJ), Barndorff-Nielsen et al., ( 2011 ) (BN), Xiu, ( 2010 ) (X) and for our methodology (LIP), over 500 trading days, in the simulation setting with correlation between microstructure noise and financial latent return fixed to ± 0.10 plus-or-minus 0.10 \pm 0.10 .

	$ρ = - 0.10$			$ρ = + 0.10$
Method	Bias $\times$ 1000	Std $\times$ 1000	RMSE $\times$ 1000	Bias $\times$ 1000	Std $\times$ 1000	RMSE $\times$ 1000
KL	-4.53	12.62	13.41	-5.42	11.93	13.11
Z	-2.74	11.37	11.70	-3.24	10.72	11.19
BR	-1.02	11.72	11.76	-1.53	11.04	11.14
JAC	-0.39	2.97	3.00	-0.26	3.01	3.03
JAC ADJ	-0.09	2.99	2.99	0.04	3.03	3.03
BN	0.89	2.35	2.52	1.19	2.41	2.69
X	0.03	1.35	1.35	0.10	1.40	1.40
LIP	0.56	0.56	0.80	-0.10	0.90	0.90

Equations110

θ_{t + 1}

θ_{t + 1}

ξ_{t + 1}

\delta(x,y)=\left\{\begin{array}[]{ll}1,&x=y\\ 0,&x\neq y\end{array}\right.

\delta(x,y)=\left\{\begin{array}[]{ll}1,&x=y\\ 0,&x\neq y\end{array}\right.

m (t + 1)

m (t + 1)

γ (t + 1)

θ_{t + 1}

θ_{t + 1}

ξ_{t + 1}

ξ_{t + 1}

ξ_{t + 1}

\displaystyle\left\{\begin{array}[]{l}A_{0}(t)=\tilde{A}_{0}(t)+\tilde{A}_{1}(t)a_{0}(t)\\ A_{1}(t)=\tilde{A}_{1}(t)a_{1}(t)\\ B_{1}(t)=\tilde{A}_{1}(t)b_{1}(t)+\tilde{B}_{1}(t)\\ B_{2}(t)=\tilde{A}_{1}(t)b_{2}(t)+\tilde{B}_{2}(t).\\ \end{array}\right.

\displaystyle\left\{\begin{array}[]{l}A_{0}(t)=\tilde{A}_{0}(t)+\tilde{A}_{1}(t)a_{0}(t)\\ A_{1}(t)=\tilde{A}_{1}(t)a_{1}(t)\\ B_{1}(t)=\tilde{A}_{1}(t)b_{1}(t)+\tilde{B}_{1}(t)\\ B_{2}(t)=\tilde{A}_{1}(t)b_{2}(t)+\tilde{B}_{2}(t).\\ \end{array}\right.

θ_{t + 1}

θ_{t + 1}

ξ_{t + 1}

m (t + 1)

m (t + 1)

γ (t + 1)

p (θ^{T} ∣ ξ^{T}, \tilde{D} (1), \dots, \tilde{D} (T)) \propto t = 1 \prod T ϕ (V_{t} W_{t}, V_{t}),

p (θ^{T} ∣ ξ^{T}, \tilde{D} (1), \dots, \tilde{D} (T)) \propto t = 1 \prod T ϕ (V_{t} W_{t}, V_{t}),

V_{t}^{- 1}

V_{t}^{- 1}

W_{t}

θ_{1}, \dots, θ_{T} ∣ ξ_{1}, \dots, ξ_{T}, \tilde{D} (1), \dots, \tilde{D} (T)

θ_{1}, \dots, θ_{T} ∣ ξ_{1}, \dots, ξ_{T}, \tilde{D} (1), \dots, \tilde{D} (T)

p (θ^{T} ∣ ξ^{T}) \propto t = 1 \prod T ϕ (V_{t} W_{t}, V_{t}),

p (θ^{T} ∣ ξ^{T}) \propto t = 1 \prod T ϕ (V_{t} W_{t}, V_{t}),

V_{t}^{- 1}

V_{t}^{- 1}

W_{t}

Σ_{t}

d θ_{t} = c (t) d Z_{t}

d θ_{t} = c (t) d Z_{t}

ξ_{(t + 1) / T}

ξ_{(t + 1) / T}

θ_{(t + 1) / T}

m (t + 1)

m (t + 1)

γ (t + 1)

m (t + 1)

m (t + 1)

γ (t + 1)

p (θ_{t / T} ∣ θ_{(t + 1) / T}, \dots, θ_{T}, ξ^{T})

p (θ_{t / T} ∣ θ_{(t + 1) / T}, \dots, θ_{T}, ξ^{T})

γ^{*} := \frac{1}{2} ((2 \tilde{B}_{1} + b_{1})^{2} + 4 \tilde{B}_{2}^{2} - (2 \tilde{B}_{1} + b_{1})),

γ^{*} := \frac{1}{2} ((2 \tilde{B}_{1} + b_{1})^{2} + 4 \tilde{B}_{2}^{2} - (2 \tilde{B}_{1} + b_{1})),

\frac{( 1 - \frac{B ~ _{1} + b _{1}}{b _{1}} ) ^{2}}{B ~ _{2}^{2}} + γ^{*- 1} < γ_{0}^{*- 1},

\frac{( 1 - \frac{B ~ _{1} + b _{1}}{b _{1}} ) ^{2}}{B ~ _{2}^{2}} + γ^{*- 1} < γ_{0}^{*- 1},

\frac{1}{b _{1}^{2}} \tilde{B}_{1}^{4} + \frac{2}{b _{1}} \tilde{B}_{1}^{3} > \frac{b _{1}^{2} + 4 B ~ _{2}^{2}}{b _{1}} \tilde{B}_{1}^{2} + (b_{1} + b_{1}^{2} + 4 \tilde{B}_{2}^{2}) \tilde{B}_{1} .

\frac{1}{b _{1}^{2}} \tilde{B}_{1}^{4} + \frac{2}{b _{1}} \tilde{B}_{1}^{3} > \frac{b _{1}^{2} + 4 B ~ _{2}^{2}}{b _{1}} \tilde{B}_{1}^{2} + (b_{1} + b_{1}^{2} + 4 \tilde{B}_{2}^{2}) \tilde{B}_{1} .

(ξ_{(t + 1) / T} θ_{(t + 1) / T}) ∣ θ_{t}, b_{1} (t), B_{1} (t), B_{2} (t) \sim Φ {(θ_{t} θ_{t}), (B_{1}^{2} (t) + B_{2}^{2} (t) b_{1} (t) B_{1} (t) b_{1} (t) B_{1} (t) b_{1}^{2} (t))},

(ξ_{(t + 1) / T} θ_{(t + 1) / T}) ∣ θ_{t}, b_{1} (t), B_{1} (t), B_{2} (t) \sim Φ {(θ_{t} θ_{t}), (B_{1}^{2} (t) + B_{2}^{2} (t) b_{1} (t) B_{1} (t) b_{1} (t) B_{1} (t) b_{1}^{2} (t))},

ξ_{(t + 1) / T} ∣ θ_{(t + 1) / T}, θ_{t}, b_{1} (t), B_{1} (t), B_{2} (t)

ξ_{(t + 1) / T} ∣ θ_{(t + 1) / T}, θ_{t}, b_{1} (t), B_{1} (t), B_{2} (t)

{θ_{(i)}^{T}, \tilde{B}_{1 (i)}^{T}, \tilde{B}_{2 (i)}^{T}, b_{1 (i)}^{T}}_{i = 1}^{M},

{θ_{(i)}^{T}, \tilde{B}_{1 (i)}^{T}, \tilde{B}_{2 (i)}^{T}, b_{1 (i)}^{T}}_{i = 1}^{M},

\frac{1}{M - M _{0}} i = M_{0} + 1 \sum M t = 1 \sum T (θ_{(t + 1) / T, (i)} - θ_{t, (i)})^{2}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Conditionally Gaussian Random Sequences for an Integrated Variance Estimator with Correlation between Noise and Returns

Stefano Peluso 111Corresponding author. Università Cattolica del Sacro Cuore, Department of Statistical Sciences and Università della Svizzera italiana, Data Science Lab, ICS. Largo Gemelli 1 20123 Milan. E-mail: [email protected]

Antonietta Mira 222Università della Svizzera italiana, Data Science Lab, ICS and Università dell’Insubria, E-mail: [email protected]

Pietro Muliere 333Bocconi University of Milan, E-mail: [email protected]

Abstract

Correlation between microstructure noise and latent financial logarithmic returns is an empirically relevant phenomenon with sound theoretical justification. With few notable exceptions, all integrated variance estimators proposed in the financial literature are not designed to explicitly handle such a dependence, or handle it only in special settings. We provide an integrated variance estimator that is robust to correlated noise and returns. For this purpose, a generalization of the Forward Filtering Backward Sampling algorithm is proposed, to provide a sampling technique for a latent conditionally Gaussian random sequence. We apply our methodology to intra-day Microsoft prices, and compare it in a simulation study with established alternatives, showing an advantage in terms of root mean square error and dispersion.

Keywords: Forward Filtering and Backward Sampling; Integrated Variance; Kalman Filtering; State Space Models.

1 Introduction

Many statistical problems can be formulated as State Space models, where a latent stochastic process $\{\theta_{t}\}$ evolves in time with dynamics given by a transition equation $\theta_{t+1}=a_{1}(t)\theta_{t}+b_{1}(t)\epsilon_{1}(t+1)$ , and with the process observed noisily through $\{\xi_{t}\}$ , which evolves following the measurement equation $\xi_{t+1}=\tilde{A}_{1}(t)\theta_{t+1}+\tilde{B}_{2}(t)\epsilon_{2}(t+1)$ , where $\epsilon_{1}(t)$ and $\epsilon_{2}(t)$ are Gaussian random variables and $a_{1}(t)$ , $b_{1}(t)$ , $\tilde{A}_{1}(t)$ and $\tilde{B}_{2}(t)$ are time-varying parameters. Kalman, (1960) proposed the celebrated Kalman filtering algorithm as optimal solution, in mean square sense, to the filtering problem, that is the problem of estimating the unobservable $\theta_{t}$ by means of observations $\xi^{t}=\{\xi_{1},\dots,\xi_{t}\}$ . The Kalman filter is the starting point in Fruwirth-Schnatter, (1994) and Carter and Kohn, (1994) for an iterative procedure, today commonly known as Forward Filtering Backward Sampling (FFBS), for obtaining posterior samples of $\{\theta_{t}\}$ .

Liptser and Shiryayev, (1972); Liptser and Shiryayev, 2001a ; Liptser and Shiryayev, 2001b introduce the so-called conditionally Gaussian random sequences, whose main two features are: (a) dependence of model parameters from past observations or from other random quantities, with the remaining randomness expressed in terms of Gaussian random variables, (b) correlation between $\xi_{t}$ and $\theta_{t}$ , introduced through the presence, in both the transition and measurement equations, of common Brownian motions.

The Mixture Kalman filter of Chen and Liu, (2000) and the Gibbs samplers of Sethuraman, (1994) and Carter and Kohn, (1996) are some relevant examples that, among other things, generalize Kalman filtering and posterior sampling of the latent stochastic process along direction (a) above. Other simulation techniques such as the Extended Kalman filter of Gelb, (1972), the Monte Carlo filter of Kitagawa, (1987) and the Particle filter of Gordon et al., (1993) do not require conditional Gaussianity, but are based on some form of approximation. Gelb, (1972) provides a suboptimal solution to the filtering problem, by linearizing the transition equation. Kitagawa, (1987) and Gordon et al., (1993) approximate the posterior distribution of the latent stochastic process through a weighted set of particles. A Kalman filter robust to the presence of outliers is proposed in Ruckdeschel et al., (2014). de Jong and Shephard, (1995) and Durbin and Koopman, (2002) also developed FFBS algorithms. In particular, the methodology of de Jong and Shephard, (1995) defines the conditionally linear Gaussian state space model in terms of a single source of error, and it is able to reproduce correlated shocks between the measurement and the transition equations. Relying on the method of de Jong and Shephard, (1995), Czado and Song, (2008) develop a new simulation smoother for binomial longitudinal data.

Harvey and Shepard, (1996) and Sandmann and Koopman, (1998) point out the empirical relevance of direction (b) for modeling the asymmetric behavior often found in stock prices, and Hull and White, (1987) emphasize the role of correlation between observed prices and latent stochastic volatility, suggesting that, neglecting it, it can cause significant biases in financial option pricing. Among others, Brandt and Kang, (2004) and Jacquier et al., (2004) further study this phenomenon in the financial economic literature. Another concrete situation where neglecting correlation in the two equations can be misleading is our motivating example, that is integrated volatility estimation in presence of dependence between microstructure noise and latent financial logarithmic returns, a phenomenon empirically found in Hansen and Lunde, (2006) and theoretically justified in Diebold and Strasser, (2013). Many integrated variance estimators proposed in the literature (Andersen et al., 2003; Ait-Sahalia et al., 2005; Zhang et al., 2005; Zhang, 2006; Ait-Sahalia et al., 2010; Barndorff-Nielsen et al., 2011; Corsi et al., 2015; Peluso et al., 2014) are not designed to handle such a dependence, except for Barndorff-Nielsen et al., (2008), but only in the special setting of a linear model of endogeneity. To our knowledge, the only papers that deal with this endogenous noise are Kalnina and Linton, (2008), which propose a robust version of Zhang et al., (2005), and the pre-averaging estimator of Jacod et al., (2009). Bandi and Russell, (2011), despite assuming exogenous noise, still provide a good benchmark method, since it is empirically found to perform well even if the underlying assumptions are violated.

It is now well recognized that the proper use of intra-day financial price observations leads to precise and accurate measurement and forecasting of unobservable measures, through the so-called realized estimators. See, for instance, the beta estimator proposed by Barndorff-Nielsen et al., (2011) and Golosnoy, (2016), the realized multivariate covariance of Barndorff-Nielsen et al., (2011); Peluso et al., (2014); Corsi et al., (2015) and Shephard and Xiu, (2016), the correlation studied in Barndorff-Nielsen and Shephard, (2004); Audrino and Corsi, (2010) and Bertram et al., (2013). In the present paper we propose a realized variance estimator of the daily integrated volatility that is robust to the presence of correlation between microstructure noise and latent returns, generalizing the setting of Ait-Sahalia et al., (2010). For this purpose, we extend the FFBS algorithm from standard State Space models to the more general context of Liptser and Shiryayev, (1972) in an exact form (with no approximations involved). Therefore our contribution is twofold: (i) the generalization of the FFBS algorithm from State Space models to conditionally Gaussian random sequences, an extension of interest in itself, since it solves the filtering and smoothing problem in a more general context; (ii) the inclusion of the new FFBS algorithm into a MCMC scheme that provides a Bayesian integrated variance estimator robust to correlation between microstructure noise and return, to our knowledge the first Bayesian estimator with these properties. The main advantages of a Bayesian estimator of the integrated variance relying on a system of observational and transition equations are that (i) the latent stochastic price process can be obtained as a byproduct, (ii) from the MCMC iterations any function of the integrated variance or of the latent price process (for instance, integrated quarticity) can be derived, (iii) not only a point estimate, but a whole posterior distribution of the quantity to estimate can be obtained, therefore providing uncertainty quantification of the integrated variance estimate.

The algorithm is presented in Section 3, after an introduction to conditionally Gaussian random sequences in Section 2. The motivating financial problem with related simulated studies and a real application to Microsoft data is detailed in Section 4, and finally the conclusions are drawn in Section 5. Matlab codes for the proposed algorithm and the data supporting the findings in this study are available on request from the corresponding author. The data for the empirical application are not publicly available due to privacy restrictions.

2 Conditionally Gaussian Random Sequences

In this section we introduce the theoretical framework developed in Liptser and Shiryayev, (1972) (see also Liptser and Shiryayev, 2001a and Liptser and Shiryayev, 2001b ), with focus on the recursive equations of conditionally Gaussian random sequences for the solution of the filtering problem.

On a probability space $(\Omega,\mathcal{F},P)$ , the random sequence $\{\theta_{t},\xi_{t}\}_{t}$ , $t=1,2,\dots$ , with $\theta_{t}=(\theta_{1}(t),\dots,\theta_{k}(t))$ and $\xi_{t}=(\xi_{1}(t),\dots,\xi_{l}(t))$ , defines the system of recursive equations

[TABLE]

where $\epsilon_{1}(t)=(\epsilon_{1,1}(t),\dots,\epsilon_{1,k}(t))$ and $\epsilon_{2}(t)=(\epsilon_{2,1}(t),\dots,\epsilon_{2,l}(t))$ are independent Gaussian random variables with expected value $\mathbb{E}(\epsilon_{i,j}(t))=0$ and $\mathbb{E}(\epsilon_{i_{1},j_{1}}(t)\epsilon_{i_{2},j_{2}}(s))=\delta(i_{1},i_{2})\delta(j_{1},j_{2})\delta(t,s)$ , for all $i$ and $j$ , where

[TABLE]

In the sequel, $\theta_{t}$ and $\xi_{t}$ are, respectively, unobservable and observed vectors, with $\theta_{0}|\xi_{0}\sim\Phi(m,\gamma)$ , that is Gaussian with mean $m$ and variance $\gamma$ . $a_{0}(t,\omega)$ and $A_{0}(t,\omega)$ are vector functions, and $a_{1}(t,\omega)$ , $A_{1}(t,\omega)$ , $b_{1}(t,\omega)$ , $b_{2}(t,\omega)$ , $B_{1}(t,\omega)$ and $B_{2}(t,\omega)$ are matrix functions, square integrable and measurable at time $t$ . All the vector and matrix functions at time $t$ are collected in $D(t,\omega)$ . In Liptser and Shiryayev, (1972), $D(t,\omega)$ is assumed to be $\mathcal{F}_{t}^{\xi}$ -measurable, where $\mathcal{F}_{t}^{\xi}=\sigma\{\omega:\ \xi_{0},\dots,\xi_{t}\}$ is the $\sigma$ -algebra generated by the random variables $\xi_{0},\dots,\xi_{t}$ . This assumption will be relaxed in Section 3, where measurability with respect to $\sigma$ -algebras generated by other random variables will be considered. Denote by $b\circ b=b_{1}b_{1}^{*}+b_{2}b_{2}^{*}$ , $b\circ B=b_{1}B_{1}^{*}+b_{2}B_{2}^{*}$ and $B\circ B=B_{1}B_{1}^{*}+B_{2}B_{2}^{*}$ where $X^{*}$ is the transposed matrix of $X$ and $X^{+}=Y^{*}(YY^{*})^{-2}Y$ is the pseudo-inverse matrix of $X$ , with $Y$ such that $Y^{*}Y=X$ . For ease of notation we suppress the dependence on $\omega$ .

Theorem 2.1.

Suppose that $\mathbb{E}(||\theta_{0}||^{2}+||\xi_{0}||^{2})<\infty$ , $|(a_{1}(t))_{ij}|<L$ and $|(A_{1}(t))_{ij}|<L$ , where $L$ is a positive constant. Then, $\theta_{t}|\xi_{0},\dots,\xi_{t}\sim\Phi(m(t),\gamma(t))$ , where $m(t)$ and $\gamma(t)$ are determined from the recursive equations

[TABLE]

with the initial conditions $m(0)=m$ and $\gamma(0)=\gamma$ .

Proof.

See Liptser and Shiryayev, (1972), Theorem 3.2. ∎

An important special case is when $D(t)$ is not a random, but a deterministic function of time $t$ . In this case, if the vector $(\theta_{0},\xi_{0})$ is Gaussian, the process $(\theta_{t},\xi_{t})$ will also be Gaussian, with known covariance $\gamma(t)$ . In this setting it is possible to reformulate the system of recursive equations (1) and (2) so that the dependence between $\xi_{t}$ and $\theta_{t}$ is explicit, and to recover the Kalman filter as special case.

The random sequence $\{\theta_{t},\xi_{t}\}_{t}$ is known as conditionally Gaussian since it follows a Gaussian distribution at any specific time $t$ , conditionally on the knowledge of $D(t)$ . Note that this is not a restrictive assumption, since unconditionally the dependence in time and space is not necessarily linear (for instance when the distribution of $a_{1}(t)$ depends on $\theta_{t}$ ), and the disturbances are location-scale mixture of Gaussian random variables. A wide class of continuous distributions may be constructed as location-scale mixture of Normal distributions, such as contaminated Normals, Student’s t, Logistic, Laplace and Stable distributions. As specified in Marron and Wand, (1992), one way of seeing that the class of Normal mixture densities is very broad results by recalling that any density, even strongly multi-modal and asymmetric, can be approximated arbitrarily well by a Normal mixture. This is a setting of interest in finance, where we often observe skewed distributions of returns (see, among others, Barndorff-Nielsen, 1997 and Azzalini and Capitanio, 2003). Furthermore, distributions of returns can be contaminated by outliers that are not easy to detect and correct for, and that can severely distort a non robust estimation methodology, causing for instance relevant consequences on asset allocation studies (Best and Grauer, 1992). Finally, as pointed out in Engle and Smith, (1999), multi-modal distributions can model situations of regime switches, known to have a relevance in option pricing (see for instance Buffington and Elliott, 2002) and mean-variance portfolio selection (Zhou and Yin, 2003, among others).

3 Sampling Algorithm of the Latent Process

System (1) and (2) can be reformulated to highlight the relation between $\xi_{t}$ and $\theta_{t}$ , so that the sequence of the observations can be interpreted as a realization of a stochastic Markovian latent process with measurement noise:

[TABLE]

where $a_{0}(t),a_{1}(t),b_{1}(t),b_{2}(t),\tilde{A}_{0}(t),\tilde{A}_{1}(t),\tilde{B}_{1}(t),\tilde{B}_{2}(t)$ are stored in $\tilde{D}(t)$ . This alternative representation is more common in the econometrics, financial and engineering literature, and it can be derived from the system (1) and (2) since, substituting (5) in (6), $\xi_{t+1}$ can be written as

[TABLE]

clarifying that the relation between $D(t)$ and $\tilde{D}(t)$ is given by

[TABLE]

Given the system (5)-(6), from Theorem 2.1 it follows that $\theta_{t}|\xi_{1},\dots,\xi_{t}\sim\Phi(m(t),\gamma(t))$ , where $m(t)$ and $\gamma(t)$ are obtained by the recursive equations (3) and (4), but with $A_{0}(t)$ , $A_{1}(t)$ , $B_{1}(t)$ and $B_{2}(t)$ replaced by the respective right hand sides in (11). When $b_{2}(t)=0$ and $\tilde{B}_{1}(t)=0$ (or, equivalently, when $b_{1}(t)=0$ and $\tilde{B}_{2}(t)=0$ ) for all $t$ , system (5)-(6) simplifies to

[TABLE]

for which the filtering problem can be solved through the Kalman filtering iterations:

[TABLE]

In the simplified setting of model (12)-(13), Fruwirth-Schnatter, (1994) and Carter and Kohn, (1994) introduce the Forward Filtering and Backward Sampling (FFBS) algorithm, to sample $\theta^{T}$ a posteriori from

[TABLE]

where

[TABLE]

Exploiting an extended factorization of the posterior density of $\theta$ , induced by the shared Brownian motions, we derive a generalized version of the FFBS algorithm, to jointly sample

[TABLE]

from the system (5)-(6) (an equivalent algorithm for the system (1)-(2) can also be formulated). For easier reference in the sequel, we refer to this algorithm as G-FFBS.

Proposition 3.1.

Given $\xi^{T}$ generated from model (5)-(6), then

[TABLE]

where

[TABLE]

Proof.

See Appendix A. ∎

The proposed generalization over the traditional FFBS finds relevant empirical justification in the motivating example that will be discussed in Section 4. The algorithm requires a forward step in which the quantities of interest $m(t)$ and $\gamma(t)$ are computed following Theorem 2.1, and a backward step where the latent process is sampled according to the factorization in (22). In the traditional FFBS algorithm, the factor at time $t$ in (22) is proportional to $p(\theta_{t+1}|\theta_{t})p(\theta_{t}|\xi^{t})$ , whilst in the proposed G-FFBS algorithm, there is an additional term $p(\xi_{t+1}|\theta_{t+1},\theta_{t})$ , since the correlation between measurement and transition errors generates a conditional dependence between $\xi_{t+1}|\theta_{t+1}$ and $\theta_{t}$ . When $\tilde{B}_{1}(t)=0$ and $b_{2}(t)=0$ for all $t$ or when $\tilde{B}_{2}(t)=0$ and $b_{1}(t)=0$ , there is no correlation between the two errors, the conditional independence of the observations is restored, and G-FFBS reduces to FFBS.

For posterior inference on any function of the latent stochastic process $g(\theta^{T})$ , three cases can be distinguished: (i) $\tilde{D}(t)$ is measurable at time $t$ , (ii) $\tilde{D}(t)$ is unknown at time $t$ but with known dynamics, (iii) $\tilde{D}(t)$ is unknown at time $t$ and with unknown dynamics. In case (i), $\tilde{D}(t)$ is measurable at time $t$ with respect to the $\sigma$ -algebra generated by $\xi^{T}$ or by some other observables, and all samples from $\theta^{T}|\xi^{T}$ can be obtained through the G-FFBS. In case (ii) a simple procedure for posterior inference requires to recursively estimate $\tilde{D}(t)$ by $\hat{D}(t)$ , which is estimated by the known dynamics, and then use $\hat{D}(t)$ instead of $\tilde{D}(t)$ in the G-FFBS (see Smith and West, 1983 and Campagnoli et al., 2001 for, respectively, a biometric and a financial application). When in (iii), $\tilde{D}(t)$ is unknown and cannot be parametrically forecasted: a complete Bayesian model has to be specified, with prior $\pi(\tilde{D}(1),\dots,\tilde{D}(T))$ , and MCMC procedures are used to sample from the joint posterior $\mathbb{P}(\theta^{T},\tilde{D}(1),\dots,\tilde{D}(T)|\xi^{T})$ , by repeatedly sampling at each iteration

•

$\mathbb{P}(\theta^{T}|\tilde{D}(1),\dots,\tilde{D}(T),\xi^{T})$ ,

•

$\mathbb{P}(\tilde{D}(1),\dots,\tilde{D}(T)|\theta^{T},\xi^{T})\propto\mathbb{P}(\theta^{T},\xi^{T}|\tilde{D}(1),\dots,\tilde{D}(T))\,\pi(\tilde{D}(1,\omega),\dots,\tilde{D}(T,\omega)).$

The first step is executed through G-FFBS, and the whole algorithm is a Gibbs sampler (Geman and Geman, 1984; Gelfand and Smith, 1990) or a Metropolis-Hastings sampler (Metropolis et al., 1953; Hastings, 1970), depending on wheather $\pi(\tilde{D}(1),\dots,\tilde{D}(T))$ is a conjugate prior or not.

We conclude this section with a note on model parameters identifiability. If proper priors are adopted, in a Bayesian setting different values of parameters corresponding to the same likelihood value do not arise identifiability issues, with the exception of degenerate cases when the prior and the posterior distribution concide. To better understand this point let us collect in $\{\tilde{D}(t)\}$ all parameters $\tilde{D}(1),\dots,\tilde{D}(T))$ . If, for different values of $\{\tilde{D}(t)\}$ , say $\{\tilde{D}_{1}(t)\}$ and $\{\tilde{D}_{2}(t)\}$ , $\mathbb{P}(\theta^{T},\xi^{T}|\{\tilde{D}_{1}(t)\})$ and $\mathbb{P}(\theta^{T},\xi^{T}|\{\tilde{D}_{2}(t)\})$ are the same, there are no identifiability problems as long as $\mathbb{P}(\{\tilde{D}(t)\}|\xi^{T})$ differs from $\mathbb{P}(\{\tilde{D}(t)\})$ for at least one value of $\{\tilde{D}(t)\}$ . If $\mathbb{P}(\theta^{T},\xi^{T}|\{\tilde{D}_{1}(t)\})=\mathbb{P}(\theta^{T},\xi^{T}|\{\tilde{D}_{2}(t)\})$ and also $\mathbb{P}(\{\tilde{D}_{1}(t)\})=\mathbb{P}(\{\tilde{D}_{2}(t)\})$ , we can only conclude that $\{\tilde{D}(t)\}$ has the same posterior probability in correspondence of $\{\tilde{D}_{1}(t)\}$ and $\{\tilde{D}_{2}(t)\}$ , but still $\{\tilde{D}(t)\}$ has a proper posterior distribution. The case of $\mathbb{P}(\{\tilde{D}(t)\}|\xi^{T})=\mathbb{P}(\{\tilde{D}(t)\})$ occurs when the data does not provide any information on $\{\tilde{D}(t)\}$ , a degenerate case verified only when $\mathbb{P}(\xi^{T}|\{\tilde{D}(t)\})$ is constant for all values of $\{\tilde{D}(t)\}$ .

4 Robust Integrated Variance Estimation

4.1 Problem context

In this section the developed sampling algorithm is applied to our motivating problem. Suppose that the logarithmic price of a given financial asset follows, within the trading day, the diffusion process

[TABLE]

where $c(t)$ is the instantaneous volatility and $\{Z_{t}\}_{t}$ is the standard Brownian motion. $IV=\int c^{2}(t)dt$ is known as integrated variance and is of interest as a measure of the true daily volatility. For estimation we use the discrete approximation of the continuous-time process above: $\theta_{(t+1)/T}=\theta_{t/T}+c_{t/T}\sqrt{1/T}Z_{t}$ , where we have restated the time subscripts of the trading day in the interval $[0,1]$ , $T^{-1}$ is the discrete time interval between adjacent observations, $\theta_{t/T}-\theta_{(t-1)/T}=O_{p}(T^{-1/2})$ and $Z_{t}$ is a standard Gaussian. $IV$ is a latent quantity, usually estimated with the so-called realized variance $RV=\sum_{t=1}^{T}(\theta_{t/T}-\theta_{(t-1)/T})^{2}$ , the sum of all intra-day high frequency observed logarithmic returns. $RV$ is a consistent and efficient estimator of $IV$ (Andersen et al., 2003) when there is no microstructure noise, that is when $\theta_{t/T}$ for $t=1,\dots,T$ is directly observed. When microstructure noise is introduced, we observe $\xi_{t/T}$ instead of $\theta_{t/T}$ , and the computable realized variance becomes $\tilde{RV}=\sum_{t=1}^{T}(\xi_{t/T}-\xi_{(t-1)/T})^{2}$ . Note that we do not specify the continuous-time version of the measurement equation: the observed price relates to the latent price only through the microstructure noise, consequence of trades occurring at discrete times. Unfortunately, $\tilde{RV}$ loses the good properties of $RV$ , since it is biased and inconsistent for the true integrated variance. As this problem arises mainly when the frequency of observations approaches infinity (that is when the maximum distance between adjacent measurement times approaches zero), it can be attenuated by sparse sampling, but this involves a loss of information because of the discarded data. Recently, some authors have followed the approach suggested by Ait-Sahalia et al., (2005) of sampling as often as possible and modeling the noise. In particular, a first consistent estimator of $IV$ for financial data contaminated by microstructure noise has been proposed in Zhang et al., (2005) (whose order of convergence is improved in Zhang, 2006), later followed by Barndorff-Nielsen et al., (2008), that propose a kernel-based estimator. There have been numerous extensions of the framework with noisy observations that account for additional empirically observed data irregularities, as asyncronicity of multivariate log prices, serially dependent microstructure noise, positivity of the estimator, skewness and kurtosis, presence of outliers, lead-lag effects (see, for instance, Geske and Torous, 1991; Ait-Sahalia et al., 2010; Barndorff-Nielsen et al., 2011; Corsi et al., 2015; Peluso et al., 2014; Hubert et al., 2014; Buccheri et al., 2018). Less attention has been posed on the dependence between microstructure noise and latent financial logarithmic returns, empirically found in Hansen and Lunde, (2006). Also, common microstructure theories from financial economics literature justify a correlation between latent returns and microstructure noise (Diebold and Strasser, 2013) by the presence of uninformed trades, risk aversion and market makers learning speed. All the estimators mentioned above are not designed for such a dependence, except for Barndorff-Nielsen et al., (2008), but only for a linear model of endogeneity. Kalnina and Linton, (2008) robustifies the estimator of Zhang et al., (2005) to the presence of endogenous noise, and Jacod et al., (2009) propose a generalized pre-averaging estimator of the integrated variance accounting for various noise structures. The kernel estimator of Bandi and Russell, (2011) also shows robustness properties that justify its adoption in a setting with correlation between microstructure noise and latent returns.

4.2 The proposed estimator

The framework of conditionally Gaussian sequences, with the sampling algorithm introduced above, can be used to propose a new estimator of the integrated variance that is robust to the presence of correlation between microstructure noise and latent returns. Consider the bivariate system

[TABLE]

in which $a_{0}(t)=\tilde{A}_{0}(t)=b_{2}(t)=0$ and $a_{1}(t)=\tilde{A}_{1}(t)=1$ for all $t$ . Model (14)-(15) is completed by characterizing the prior distributions: $\tilde{B}_{1}(t)\sim\phi(\mu_{B,t},\sigma^{2}_{B,t})$ , $b_{1}(t)\sim\phi(\mu_{b,t},\sigma^{2}_{b,t})$ and finally $\tilde{B}_{2}(t)\sim IG(\alpha_{B,t},\beta_{B,t})$ . The correlation between microstructure noise and true returns is introduced through the random variable $\epsilon_{1}$ , appearing in both the equations. Note that Hansen and Lunde, (2006) found microstructure noise and latent returns negatively correlated: with a Gaussian prior on $B_{1}(t)$ it is possible to center, a priori, this correlation on a negative value. Furthermore, Diebold and Strasser, (2013) point out that a negative correlation appears more realistic, and that markets with no evidence of significant negative correlation are likely subject to an extraordinary microstructure effect such as high risk aversion.

The full conditional distribution of $\theta^{T}$ is sampled with the G-FFBS. The forward step of the G-FFBS algorithm is performed through the following filtering iterations:

[TABLE]

Note that if $\tilde{B}_{1}(t)=0\ \forall t$ , the filtering iterations (16) and (17) simplify to the Kalman filter iterations (Kalman, 1960):

[TABLE]

For the backward sampling step, $\theta^{T}|\xi^{T}$ are sampled from (22), where

[TABLE]

with $V_{t}$ and $W_{t}$ defined in Appendix B.

Note that $B_{1}(t)=\tilde{B}_{1}(t)+b_{1}(t)$ and $B_{2}(t)=\tilde{B}_{2}(t)$ . The correlation between transition and measurement error can be removed by fixing $\tilde{B}(t)=0$ . In this case, $B_{1}(t)=b_{1}(t)$ and, as expected, $V_{t}=\left(1-\frac{\gamma(t)}{b_{1}^{2}(t)+\gamma(t)}\right)\gamma(t)$ and $W_{t}V_{t}=\left(1-\frac{\gamma(t)}{b_{1}^{2}(t)+\gamma(t)}\right)m(t)+\frac{\gamma(t)}{b_{1}^{2}(t)+\gamma(t)}\theta_{(t+1)/T}$ , as in the usual FFBS.

4.3 Some properties of the estimator

The difference between FFBS and G-FFBS can be crucial for the estimation of the latent stochastic process. We highlight that the result in (18) serves the purpose of sampling the latent stochastic process, and therefore the implied realized variance, from its correct posterior distribution under the general setting of conditionally Gaussian random sequences. Therefore, under our modeling assumptions, the consistency to the correct values is guaranteed by the MCMC properties. Unbiasedness in finite sample is not assured, unless one implements appropriately built unbiased MCMC schemes (Jacob et al., , 2017), which is beyond the scope of our paper. In finite samples we can say that the estimate of the integrated variance is optimal in the mean square sense, that is no other estimator can have a lower mean square error under our modeling assumptions, since the posterior mean is also the solution to the smoothing problem of conditionally Gaussian random sequences, solution known to be optimal in the mean square sense (Liptser and Shiryayev, 2001b, ).

To study the asymptotic FFBS bias in a simplified setting, in this section we assume that in the model for observations and latent process expressed in Equations (14) and (15) the parameter values are constant in $t$ or they eventually stabilize to some steady state, starting from some value of $t$ . Then for all $t=1,\dots,T$ , $\tilde{B}_{1}(t)=\tilde{B}_{1}$ , $\tilde{B}_{2}(t)=\tilde{B}_{2}$ and $b_{1}(t)=b_{1}$ , with $\gamma$ converging to

[TABLE]

which reduces to $\gamma_{0}^{*}:=\frac{1}{2}\left(\sqrt{b_{1}^{2}+4\tilde{B}_{2}^{2}}-b_{1}\right)$ when correlation is neglected. We can assume the existence and uniqueness of such a limit since the conditions for asymptotic properties of the optimal linear filtering are satisfied (Theorem 14.3 of Liptser and Shiryayev, 2001b ). Ignoring correlation results in a negative asymptotic bias if $V_{t}$ , computed for the model with no correlation, is lower than the corresponding quantity in the full model. Equivalently, looking at the functional form of $V_{t}$ in Appendix B, the asymptotic negative bias resulting from neglecting the correlation occurs when

[TABLE]

which, after some algebra, can be written as

[TABLE]

For specific annualized values of $b_{1}$ and $\tilde{B}_{2}$ , the difference between $V_{t}$ computed with and without correlation is shown in Figure 1. Omitting the correlation implies a negative bias in correspondence of $\tilde{B}_{1}$ values at which the black solid line $\tilde{B}_{1}^{4}/b_{1}^{2}+2\tilde{B}_{1}^{3}/b_{1}$ is above the red dashed line $(\sqrt{b_{1}^{2}+4\tilde{B}_{2}^{2}}/b_{1})\tilde{B}_{1}^{2}+(b_{1}+\sqrt{b_{1}^{2}+4\tilde{B}_{2}^{2}})\tilde{B}_{1}$ , and a positive bias vice-versa. Therefore the direction of the asymptotic bias tends to follow the sign of $\tilde{B}_{1}$ , with the exception of more extreme negative or positive $\tilde{B}_{1}$ , for which the bias direction is reversed. Also note that the distortion is not symmetric for negative and positive $\tilde{B}_{1}$ .

For instance, for a correlation $\rho$ between microstructure noise and financial latent return taking values in the set $\pm\{0.15,0.30,0.75,0.90\}$ , a noise to signal ratio ( $NTS$ ) of 1.5 and an annualized transition error variance of 0.06, we simulate, for each value of $\rho$ , 500 trading days, with $T=23400$ seconds per business day. To fix the correlation to the desired level, we generate the data imposing $\tilde{B}_{2}=\sqrt{(1-\rho^{2})b_{1}^{2}\cdot NTS}$ and $\tilde{B}_{1}=\sqrt{\rho^{2}b_{1}^{2}\cdot NTS}$ (scenarios with positive correlation) or $\tilde{B}_{1}=-\sqrt{\rho^{2}b_{1}^{2}\cdot NTS}$ (scenarios with negative correlation). In this way, $\rho=sgn(b_{1})\tilde{B}_{1}/(\sqrt{\tilde{B}_{1}^{2}+\tilde{B}_{2}^{2}})$ . For each day we compute the estimated quadratic variation for FFBS and G-FFBS, that is the sum of the squared first differences in $\theta_{1/T},\theta_{2/T},\dots,\theta_{1}$ sampled from distribution in (18) (G-FFBS) and from (18) with $\tilde{B}_{1}=0$ (FFBS), and we compare them in Figure 2. It is clear that neglecting $\rho$ has an impact on the inference of the latent process. As expected, the distance between the two methodologies widens in the magnitude of the correlation: see in the left figure how FFBS worsens with higher and higher negative correlations introduced in the system, against a G-FFBS algorithm that remains unbiased. But, as expected from (20) and its graphical representation in Figure 1, the FFBS bias direction does not necessarily follow the sign of the correlation: negative correlation is imposed through a negative $\tilde{B}_{1}$ , but in the case of $\rho=-0.90$ , the annualized $\tilde{B}_{1}=-0.27$ is outside the region $(-0.245,0)\cup(0.281,\infty)$ for which the bias would be negative. The results are similar in the right panel, when positive correlations of 0.15, 0.75 and 0.9 are hypothesized: more and more correlation worsens the quadratic variation estimated by FFBS, but, as expected, asymmetrically relative to the scenarios with negative correlation: the impact of a higher correlation seems worse, and in the most extreme scenario with $\rho=0.9$ , the bias does not become negative since $\tilde{B}_{1}=0.27$ , inside the region $(-\infty,-0.245)\cup(0,0.281)$ of positive FFBS bias.

4.4 Other MCMC steps

To sample from the remaining full conditional distributions, note that

[TABLE]

and

[TABLE]

The full conditionals of $\tilde{B}_{1}(t)$ and $\tilde{B}_{2}^{2}(t)$ are in standard form and provided in Appendix B. On the other hand, we sample $b_{1}(t)$ with a Hamiltonian step (see Chapter 5 in Brooks et al., 2011 for an introduction to the algorithm). The motivation for using this step is its ability to exploit the information in the full conditional gradient of $b_{1}(t)$ , for a faster exploration of the parameter space, thus overcoming the random walk behavior of the Metropolis-Hastings step in a highly dimensional space. We refer the Reader to Appendix C for the details on the Hamiltonian step. Note that, when there is no correlation (that is when $\tilde{B}_{1}(t)=0$ ), the sampler can be reduced to the Gibbs algorithm in Peluso et al., (2014).

The output of the whole algorithm is a collection of samples

[TABLE]

where $M$ is the number of iterations of the MCMC scheme. Then the proposed estimator of the integrated variance is

[TABLE]

where $M_{0}<M$ is the burn-in, that is the number of samples discarded at the beginning of the MCMC chain to allow the simulation process to reach its stationary regime. To summarize, the procedure for obtaining the IV estimator is the following: **

For iterations $i=1\dots,M$

(a)

Sample $\theta^{T}_{(i)}$ from the G-FFBS algorithm in Proposition 3.1, assuming $a_{0}(t)=\tilde{A}_{0}(t)=b_{2}(t)=0$ and $a_{1}(t)=\tilde{A}_{1}(t)=1$ for all $t$ 2. (b)

Sample $\tilde{B}^{T}_{1(i)}$ from the full conditional (23) in Appendix B 3. (c)

Sample $\tilde{B}^{T}_{2(i)}$ from the full conditional (24) in Appendix B 4. (d)

Sample $b^{T}_{1(i)}$ from the Hamiltonian step highlighted in Appendix C 2. 2.

Compute the estimator given in Equation (21).

We simulate 500 trading days, for $M=1000$ , $M_{0}=500$ and correlations $\pm 0.10$ , starting all the chains from values significantly different from the true ones. The hyper-parameters are $\mu_{B,t}=-1.48\cdot 10^{-5}$ , $\sigma^{2}_{B,t}=1.53\cdot 10^{-10}$ , $\mu_{b,t}=1.21\cdot 10^{-4}$ , $\sigma^{2}_{b,t}=1.02\cdot 10^{-8}$ , $\alpha_{B,t}=2.1$ and $\beta_{B,t}=1.99\cdot 10^{-8}$ for all $t$ , fixed so that they are at least 20% higher or lower than the true values used to generate the datasets. Our methodology is compared with the estimators of Kalnina and Linton, (2008), Bandi and Russell, (2011) and Jacod et al., (2009) (for Jacod et al., 2009, both the adjusted and unadjusted estimators for small sample sizes are implemented). For completeness, we add to the comparison other popular estimators, as the quasi-maximum likelihood estimator of Xiu, (2010), the realized kernel of Barndorff-Nielsen et al., (2011), and the two-scale estimator proposed by Zhang et al., (2005). The method of Kalnina and Linton, (2008) requires the choice of the tuning parameter $K$ : we use $K=T^{2/3}$ , since it performs well in the simulations in Kalnina and Linton, (2008) and it is what the authors suggest in their empirical study. Alternative values of $K$ are shown in Kalnina and Linton, (2008) to perform worse and depend on unobservable quantities estimated with a slow-decaying bias. For the estimator proposed in Bandi and Russell, (2011), the tuning parameters are chosen according to the rule of thumb proposed in Equation (26) of Bandi and Russell, (2011), in simulation computed using the true values and in the application below to Microsoft Corporation, using the corresponding values in Table 1 of Bandi and Russell, (2006). Finally, the tuning parameters of Jacod et al., (2009) are fixed, using their notation, to $k_{n}=51$ , $\theta=k_{n}/\sqrt{T}$ and $g(x)=x\wedge(1-x)$ , as in their simulation studies. The results are reported in Figure 3 and in Table 1: there is a clear advantage for our methodology in terms of dispersion and root mean square error (RMSE). The quasi-maximum likelihood estimator performs particularly well in terms of bias, even if it shows some relevant positive dispersion that contributes to increase the RMSE to a level higher than that of the method we propose.

We also run the algorithm on 1-second frequency logarithmic prices of Microsoft Corporation, for the period April 1, 2007 - June 30, 2008, and the estimated annualized quadratic variations are reported in Figure 4. A practical implication of the differences in the estimation of Microsoft integrated variances is a Gaussian Value At Risk that deviates, on average over the period studied, from 2% to 6% of a hypothetical initial investment.

5 Conclusions

Overwhelming evidence contrasts the independent microstructure noise assumption, in favour of market noise correlated with increments in the efficient price, with important implications for volatility estimation based on high-frequency data (Hansen and Lunde, 2006). Furthermore, such a dependence naturally arises in common microstructure models, as discussed in depth in Diebold and Strasser, (2013). On the other hand, with the notable exceptions of Barndorff-Nielsen et al., (2008), Kalnina and Linton, (2008) and Zhang et al., (2005), several results in the literature analyze high-frequency volatility estimation assuming that the noise process is independent of the efficient price. In the present paper we use the theoretical framework of the conditionally Gaussian random sequences of Liptser and Shiryayev, (1972); Liptser and Shiryayev, 2001a ; Liptser and Shiryayev, 2001b , to propose a new integrated variance estimator that is robust to correlation between microstructure noise and latent returns. To this aim, we adopt a Bayesian perspective and sample a posteriori the latent price process through a generalization of the Forward Filtering Backward Sampling algorithm of Fruwirth-Schnatter, (1994) and Carter and Kohn, (1994). An application to Microsoft 1-second logarithmic prices is provided, and a simulation study shows an improved performance of our estimator in terms of RMSE and dispersion, relative to the alternatives in the literature. Our methodology can be implemented in other financial problems, for instance to generalize the framework of Barndorff-Nielsen, (1997) to normal inverse Gaussian financial logarithmic returns with measurement error, or, following the approaches of Harvey et al., (1992) and Harvey et al., (1994), in ARCH and Stochastic Volatility models.

Acknowledgements

Stefano Peluso acknowledges support from the Swiss National Science Foundation (SNF), Grant No. P1TIP1_155031. Antonietta Mira also gratefully acknowledges the financial support by SNF.

Appendix A: Proof of Proposition 3.1

Using the notation $x^{t}=\{x_{1},\dots,x_{t}\}$ and suppressing the dependence on $\tilde{D}(1),\dots,\tilde{D}(T)$ , G-FFBS exploits the factorization

[TABLE]

Noting that

[TABLE]

$\xi_{t+1}|\theta_{t+1},\theta_{t}\sim\phi(\mu_{t},\Sigma_{t})$ , where

[TABLE]

Thus the factor $p(\theta_{t}|\theta_{t+1},\dots,\theta_{T},\xi^{T})$ in (22) can be expressed as

[TABLE]

Appendix B: Auxiliary quantities and full conditionals not mentioned in the main text

Quantities $V_{t}$ and $V_{t}W_{t}$ for Equation (18):

[TABLE]

Full conditionals of $\tilde{B}_{1}(t)$ and $\tilde{B}_{2}^{2}(t)$ in Section 4:

[TABLE]

Appendix C: Hamiltonian step of Section 4

The Hamiltonian step is performed through the following iterative procedure:

Sample the auxiliary momentum variable $p\{1\}$ from $\Phi(0,1)$ , 2. 2.

Propose $b_{1}(t)^{*}$ from the Leapfrog algorithm. In details, fix $k\{1\}$ to the current value of $b_{1}(t)$ . For step size $\epsilon$ and number of iterations $L$ :

[TABLE]

For $i=1,\dots,L-1$ :

[TABLE]

Finally,

[TABLE]

and the proposed value is $b_{1}(t)^{*}=k\{1+L\epsilon\}$ . 3. 3.

Evaluate potential and kinetic energies $U$ and $Z$ at proposed and current values:

[TABLE] 4. 4.

Accept $b_{1}(t)^{*}$ with probability

[TABLE]

Bibliography62

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Ait-Sahalia et al., (2010) Ait-Sahalia, Y., Fan, J., and Xiu, D. (2010). High-Frequency Covariance Estimates with Noisy and Asynchronous Financial Data. JASA , 105:1504–1517.
2Ait-Sahalia et al., (2005) Ait-Sahalia, Y., Mykland, P., and Zhang, L. (2005). How Often to Sample a Continuous-Time Process in the presence of Market Microstructure Noise. The Review of Financial Studies , 18:351–416.
3Andersen et al., (2003) Andersen, T., Bollerslev, T., Diebold, F., and Labys, P. (2003). Modeling and Foreasting Realized Volatility. Econometrica , 71:529–626.
4Audrino and Corsi, (2010) Audrino, F. and Corsi, F. (2010). Modeling tick-by-tick realized correlations. Computational Statistics and Data Analysis , 54:2372–2382.
5Azzalini and Capitanio, (2003) Azzalini, A. and Capitanio, A. (2003). Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t distribution. Journal of the Royal Statistical Society B , 65:579–602.
6Bandi and Russell, (2006) Bandi, F. and Russell, J. (2006). Separating Microstructure Noise from Volatility. Journal of Financial Economics , 79:655–692.
7Bandi and Russell, (2011) Bandi, F. and Russell, J. (2011). Market Microstructure Noise, Integrated Variance Estimators, and the Accuracy of Asymptotic Approximations. Journal of Econometrics , 160:145–159.
8Barndorff-Nielsen, (1997) Barndorff-Nielsen, O. (1997). Normal inverse Gaussian distributions and Stochastic Volatility Modelling. Scand. J. of Stat. , 24:1–13.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Conditionally Gaussian Random Sequences for an Integrated Variance Estimator with Correlation between Noise and Returns

Abstract

1 Introduction

2 Conditionally Gaussian Random Sequences

Theorem 2.1**.**

Proof.

3 Sampling Algorithm of the Latent Process

Proposition 3.1**.**

Proof.

4 Robust Integrated Variance Estimation

4.1 Problem context

4.2 The proposed estimator

4.3 Some properties of the estimator

4.4 Other MCMC steps

5 Conclusions

Acknowledgements

Appendix A: Proof of Proposition 3.1

Appendix B: Auxiliary quantities and full conditionals not mentioned in the main text

Appendix C: Hamiltonian step of Section 4

Theorem 2.1.

Proposition 3.1.