TL;DR
This paper introduces a new class of stochastic PDE models for limit order book dynamics that are analytically tractable, allowing for efficient estimation and capturing key market features.
Contribution
It develops a novel SPDE framework with conditions for finite-dimensional realization, enabling practical modeling of limit order book dynamics with low-dimensional Markov processes.
Findings
Model reproduces statistical properties of price changes
Analyzes role of parameters in price and order book dynamics
Provides intuitive financial interpretation of variables
Abstract
We propose an analytically tractable class of models for the dynamics of a limit order book, described through a stochastic partial differential equation (SPDE) with multiplicative noise for the order book centered at the mid-price, along with stochastic dynamics for the mid-price which is consistent with the order flow dynamics. We provide conditions under which the model admits a finite dimensional realization driven by a (low-dimensional) Markov process, leading to efficient estimation and computation methods. We study two examples of parsimonious models in this class: a two-factor model and a model with mean-reverting order book depth. For each model we analyze in detail the role of different parameters, the dynamics of the price, order book depth, volume and order imbalance, provide an intuitive financial interpretation of the variables involved and show how the model reproduces…
| Ticker | Date | |||||||
|---|---|---|---|---|---|---|---|---|
| INTC | 2016-11-15 | 5179.0 | 5641.7 | 0.151 | 0.156 | 0.133 | 0.134 | -0.077 |
| 2016-11-16 | 5565.0 | 5672.5 | 0.082 | 0.118 | 0.111 | 0.124 | -0.070 | |
| 2016-11-17 | 5776.5 | 7363.2 | 0.144 | 0.109 | 0.118 | 0.116 | -0.019 | |
| MSFT | 2016-11-15 | 3035.6 | 3855.9 | 0.522 | 0.426 | 0.292 | 0.292 | -0.092 |
| 2016-11-16 | 2839.9 | 3562.1 | 0.409 | 0.395 | 0.239 | 0.240 | -0.071 | |
| 2016-11-17 | 4149.0 | 5762.5 | 0.300 | 0.239 | 0.202 | 0.208 | -0.146 | |
| QQQ | 2016-11-15 | 4686.9 | 5489.2 | 2.467 | 1.972 | 0.724 | 0.639 | -0.177 |
| 2016-11-16 | 4801.0 | 5142.6 | 2.041 | 1.845 | 0.632 | 0.677 | -0.177 | |
| 2016-11-17 | 6414.0 | 6226.4 | 1.428 | 1.281 | 0.510 | 0.506 | -0.224 | |
| SPY | 2016-11-15 | 3903.4 | 4877.9 | 1.949 | 1.689 | 0.737 | 0.666 | -0.176 |
| 2016-11-16 | 3773.4 | 4486.4 | 1.324 | 1.763 | 0.578 | 0.657 | -0.156 | |
| 2016-11-17 | 3693.0 | 4115.4 | 1.355 | 1.405 | 0.597 | 0.543 | -0.181 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
label=
A stochastic partial differential equation model for limit order book dynamics
Rama Cont
and
Marvin S. Müller
Mathematical Institute, University of Oxford.
Department of Mathematics, ETH Zürich
current address: 2Xideas AG, Seestrasse 39, CH-8700 Küsnacht
(Date: 19th March 2021)
Abstract.
We propose an analytically tractable class of models for the dynamics of a limit order book, described through a stochastic partial differential equation (SPDE) with multiplicative noise for the order book centered at the mid-price, along with stochastic dynamics for the mid-price which is consistent with the order flow dynamics. We provide conditions under which the model admits a finite dimensional realization driven by a (low-dimensional) Markov process, leading to efficient estimation and computation methods. We study two examples of parsimonious models in this class: a two-factor model and a model with mean-reverting order book depth. For each model we analyze in detail the role of different parameters, the dynamics of the price, order book depth, volume and order imbalance, provide an intuitive financial interpretation of the variables involved and show how the model reproduces statistical properties of price changes, market depth and order flow in limit order markets.
2010 Mathematics Subject Classification:
35R60, 60H15, 91B26, 91G80
M. S. M. is grateful for generous support from the Swiss National Science Foundation through SNF grant and from the ETH Foundation at ETH Zurich. The authors thank Martin Keller-Ressel for comments and discussions.
Contents
Financial instruments such as stocks and futures are increasingly traded in electronic, order-driven markets, in which orders to buy and sell are centralized in a limit order book and market orders are executed against the best available offers in the limit order book. The dynamics of prices in such markets are not only interesting from the viewpoint of market participants –for trading and order execution– but also from a fundamental perspective, since they provide a detailed view of the dynamics of supply fand demand and their role in price formation.
The availability of a large amount of high frequency data on order flow, transactions and price dynamics on these markets has instigated a line of research which, in contrast to traditional market microstructure models which make assumptions on the behavior and preferences of various types of agents, focuses on the statistical modeling of aggregate order flow and its relation with price dynamics, in a quest to understand the interplay between price dynamics and order flow of various market participants [Cont, 2011].
A fruitful line of approach to these questions has been to model the stochastic dynamics of the limit order book, which centralizes all buy and sell orders, either as a queueing system [Luckock, 2003, Smith et al., 2003, Cont et al., 2010, Cont and De Larrard, 2012, Cont and de Larrard, 2013, Kelly and Yudovina, 2018] or, at a coarse-grained level, through a (stochastic) partial differential equation describing the evolution of the distribution of buy and sell orders [Lasry and Lions, 2007, Caffarelli et al., 2011, Burger et al., 2013, Carmona and Webster, 2013, Markowich et al., 2016, Hambly et al., 2020, Horst and Kreher, 2018]. These PDE models may be viewed as scaling limits of discrete point process models [Cont and De Larrard, 2012, Hambly et al., 2020, Horst and Kreher, 2018].
Although joint modeling of order flow at all price levels in the limit order book is more appealling, (S)PDE models have lacked the analytical and computational tractability needed for applications; as a result, most analytical results have been derived using reduced-form models of the best bid-ask queues [Cont and De Larrard, 2012, Cont and de Larrard, 2013, Chavez-Casillas and Figueroa-Lopez, 2017, Huang et al., 2017].
We propose a class of stochastic models for the dynamics of the limit order book which represent the dynamics of the entire order book while retaining at the same time the analytical and computational tractability of low-dimensional Markovian models, and provides realistic dynamics for the joint dynamics of the market price and order book depth. Starting with a description of the dynamics of the limit order book via a stochastic partial differential equation (SPDE) with multiplicative noise, we show that in many cases, the solutions of this equation may be parameterized in terms of a low-dimensional diffusion process, which then makes the model computationally tractable. In particular, we are able to derive analytical relations between model parameters and various observable quantities. This feature may be used for calibrating model parameters to match statistical features of the order flow and leads to empirically testable predictions, which we proceed to test using high frequency time series of order flow in electronic equity markets.
Outline Section 1 introduces a description of the dynamics of a limit order book through a stochastic partial differential equation (SPDE). We describe the various terms in the equation, their interpretation and discuss the implications for price dynamics (Section 1.3). This class of models is part of a more general family of SPDEs driven by semimartingales, introduced in Sec. 1.5 and studied in Sec. 2.
We then focus on two analytically tractable examples: a two-factor model (Section 3) and a model with mean-reverting depth and imbalance (Section 4). For each model we perform a detailed analysis of the role of different parameters and study the dynamics of the price, order book depth, volume and order imbalance, provide an intuitive financial interpretation of the variables involved and show how the model may be estimated from financial time series of price, volume and order flow.
1. A stochastic PDE model for limit order book dynamics
We consider a market for a financial asset (stock, futures contract, etc.) in which buyers and sellers may submit limit orders to buy or sell a certain quantity of the asset at a certain price, and market orders for immediate execution against the best available price.111In the following we do not distinguish market orders and marketable limit orders i.e. limit orders with a price better than the best price on the opposite side. Limit orders awaiting execution are collected in the limit order book, an example of which is shown in Figure 1: at any time , the state of the limit order book is summarized by the volume of orders awaiting execution at price levels on a grid with mesh size given by the minimum price increment or *tick * size . By convention we associate negative volumes with buy orders and positive volumes with sell orders, as shown in Figure 1. An admissible order book configuration is then represented by a function such that
[TABLE]
(resp. ) is called the bid (resp. ask) price and represents the price associated with the best buy (resp. sell) offer. The quantity
[TABLE]
is called the mid-price and the difference is called the bid-ask spread. In the example shown in Figure 1, and the bid-ask spread is equal in this case to the tick size, which is 1 cent.
One modelling approach has been to represent the dynamics of as a spatial (marked) point process [Luckock, 2003, Cont et al., 2010, Cont and de Larrard, 2013, Kelly and Yudovina, 2018]. These models preserve the discrete nature of the dynamics at high frequencies but can become computationally challenging as one tries to incorporate realistic dynamics. In particular, price dynamics, which is endogenous in such models, is difficult to study, even when the order flow is a Poisson point process.
When the bid-ask spread and tick size are much smaller than the price level, as is often the case, another modelling approach is to use a continuum approximation for the order book, describing it through its density representing the volume of orders per unit price:
[TABLE]
The evolution of the density of buy and sell orders is then described through a partial differential equation (PDE). A deterministic description of the dynamics of order densities through a system of coupled partial differential equations was proposed by [Lasry and Lions, 2007] and studied in detail by [Chayes et al., 2009, Caffarelli et al., 2011, Burger et al., 2013]. In the Lasry-Lions model, the evolution of the density of buy and sell orders is described by a pair of diffusion equations coupled through the dynamics of the price, which represents the free boundary between prices of buy and sell orders. This model is appealing in many respects, especially in terms of analytical tractability, but leads to a deterministic price process which decays to a constant price, so does not provide any insight into the relation between liquidity, depth, order flow and price volatility. [Markowich et al., 2016] explore some stochastic extensions of this model but essentially show that these extensions do not provide realistic price dynamics.
We adopt here this continuum approach for the description of the limit order book, but describe instead its dynamics through a stochastic partial differential equation, paying close attention to price dynamics and its relation with order flow.
The model we propose shares some features with [Lasry and Lions, 2007], but also has some essential differences. Unlike the Lasry-Lions model, which is a free boundary problem in which the dynamics of the price is implicitly determined, we formulate the model as a stochastic partial differential equation in relative price coordinates, which leads to a *stochastic * moving boundary problem in absolute price coordinates. This leads to a more realistic joint dynamics for the market price and order book depth which can be related to empirical observations. Our model also relates to the classes of models studied in [Horst and Kreher, 2018, Hambly et al., 2020] as scaling limits of discrete queueuing systems.
We now describe our model in some detail.
1.1. State variables and scaling transformations
We focus on the case where the tick size and the bid-ask spread are small compared to the typical price level and consider a limit order book described in terms of a mid-price and the density of orders at each price level , representing buy orders for , and sell orders for . We use the convention, shown in Figure 1, of representing buy orders with a negative sign and sell orders with a positive sign, so
[TABLE]
Limit orders are executed against market orders according to price priority and their position in the queue; execution of a limit order only occurs if they are located at the best (buy/sell) prices. This means that price dynamics is determined by the interaction of market orders with limit orders of opposite type at or near the interface defined by the best price [Cont et al., 2010]. Due to this fact, most limit orders flow are submitted close to the best price levels: the frequency of limit order submissions is highly inhomogeneous as a function of distance to the best price and concentrated near the best price. As shown in previous empirical studies, order flow intensity at a given distance from the best price can be considered as a stationary variable in a first approximation [Bouchaud et al., 2009, Cont et al., 2010]. For this reason, in a stochastic description it is more convenient to model the dynamics of order flow in the reference frame of the (mid-)price . We define
[TABLE]
where represents a distance from the mid-price. We refer to as the centered order book density.
The simplest way of centering is to set but other, nonlinear, scalings may be of interest. Although limit orders may be placed at any distance from the bid/ask prices, price dynamics is dominated by the behavior of the order book a few levels above and below the mid price [Cont and De Larrard, 2012]. This region becomes infinitesimal if the tick size is naively scaled to zero, suggesting that the correct scaling limit is instead one in which we choose as coordinate a scaled version , as classically done in boundary layer analysis of PDEs [Schlichting and Gersten, 2017], in order to zoom into the relevant region:
[TABLE]
for bid and ask side, respectively. We will consider examples of such nonlinear scalings when discussing applications to high-frequency data in Sections 3 and 4.
These arguments also justify limiting the range of the argument to a bounded interval , setting for . This amounts to assuming that no orders are submitted at price levels at distances from the mid-price and that orders previously submitted at some price are cancelled as soon as i.e. when the mid-price moves away from by more than . When is a large multiple of daily volatility, this is a realistic assumption. In some market (for example futures contracts), limit orders can be in fact only submitted within a range of the mid-price.
1.2. Dynamics of the centered limit order book
Empirical studies on intraday order flow in electronic markets reveal the coexistence of two, very different types of order flow operating at different frequencies [Lehalle and Laruelle, 2018].
On one hand, we observe the submission (and cancellation) of orders queueing at various price levels on both sides of the market price by regular market participants. Cancellation may occur in several ways: we distinguish *outright * cancellations, which we model as proportional to current queue size, from cancellations with replacement (‘order modifications’), in which an order is cancelled and immediately replaced by another one of the same type, usually at a neighboring price limit. The former results in a net decrease in the volume of the order book whereas the latter is conservative and simply shifts orders across neighboring levels of the book. Further decomposing this conservative flow into a symmetric and antisymmetric part leads to two terms in the dynamics of : a diffusion term representing the cancellation of orders and their (symmetric) replacement by orders at neighboring price levels and a convection (or transport) term representing the cancellation of orders and their replacement by orders closer to the mid-price. Denoting by the gradient in the variable , the net effect of this order flow on the order book may thus be described as a superposition of
- •
a term (resp. representing the rate of buy (resp. sell) order submissions at a distance from the best price;
- •
a term (resp. ) representing (outright) proportional cancellation of limit buy (resp. sell) orders at a distance from the mid-price (where ).
- •
a convection term (resp. ) with which models the replacement of buy (resp. sell) orders by orders closer to the mid-price (i.e. closer to , hence the signs in these terms): in the reference frame where the origin is the mid-price, this translates into a flow of volume towards the origin;
- •
a diffusion term (resp. ) which represents the cancellation and symmetric replacement of orders at a distance from the mid-price.
Another component of order flow is the one generated by high-frequency traders (HFT). These market participants buy and sell at very high frequency and under tight inventory constraints, submitting and cancelling large volumes of limit orders near the mid-price and resulting in an order flow whose net contribution to total order book volume is zero on average over longer time intervals but whose sign over small time intervals fluctuates at high frequency. At the coarse-grained time scale of the average (non-HF) market participants, these features may be modeled as a multiplicative noise term of the form
- •
for buy orders () and for sell orders ()
where is a two-dimensional Wiener process (with possibly correlated components). The multiplicative nature of the noise accounts for the high-frequency cancellations associated with HFT orders.
The impact of these different order flow components may be summarized by the following stochastic partial differential equation for the centered order book density :
[TABLE]
Here , , , , , , and , although the equation may be equally considered without these sign restrictions.
Note that, unlike the Lasry-Lions model, there is no ‘smooth pasting’ condition at : in general : the difference is in fact random and represents an imbalance in the flow of buy and sell orders, which drives price dynamics. This important feature is discussed in Section 1.3 below.
Remark 1.1*.*
In simple price impact models used in the literature on optimal trade execution it is assumed that the relation between price impact and order size is deterministic. This corresponds to the case which leads to a constant centered order book profile . These terms thus correspond to deformations of the centered order book profile due to new order book events and lead to a stochastic market impact of trades dependent on the current state of the order book.
The existence of a solution satisfying the boundary and sign constraints is not obvious but we will see in Section 2 that (1.2) is well-posed: it follows from [Da Prato and Zabczyk, 2014, Theorem 6.7] and [Milian, 2002, Theorem 3] that, when , , then for all there exists a unique weak solution of (1.2) (see Definition 2.2 below) and, when and this solution satisfies
[TABLE]
We will study the mathematical properties of the solution in more detail below.
1.3. Price dynamics
The dynamics of the limit order book determines the dynamics of the bid and ask price, which corresponds to the location of the best (buy and sell) orders. The dynamics of the price should thus be related to the arrival and execution of orders in the order book.
To understand the relation between price dynamics and order flow, let us take a step back and consider an order book with discrete price levels, multiples of a tick size , orders per level on the bid side and orders per level on the ask side. Price changes during a time interval are triggered through the interaction of the net order flow, or order flow imbalance (OFI) and the outstanding limit orders at the top of the order book [Cont et al., 2014]. As illustrated in Figure 2, an order flow imbalance of on the ask side over a short time interval represents an excess of buy orders, which will then be executed against limit sell orders sitting on the ask side and move the ask price by ticks, resulting in a price move of . Similarly, an order flow imbalance on the bid side will move the bid price up by ticks. Using our sign conventions for buy/sell volumes, this leads to the following dynamics:
[TABLE]
so the dynamics of the mid price is given by
[TABLE]
This relation is exact (up to rounding) in the case of a discrete order book with constant depth per level (and thus, no empty levels), as shown in Figure 2. However, in a dynamic setting where the order book may have an arbitrary profile which randomly shifts at each instant, one can only expect a ‘homogenized’ version of (1.4) to hold:
[TABLE]
where is an impact coefficient which relates order imbalance to price movements. This relation between order flow imbalance and price movements has been empirically verified in equity markets [Cont et al., 2014], and we shall use it as a basis for defining the relation between price dynamics and order flow in our model.
Let us now see how the relation (1.5) translates in terms of the variables in our model. Denoting by (resp. ) the volume of buy (resp. sell) limit orders at the top of the book (i.e. the first or average of the first few levels). Given a mid-price , we define a scaling transformation as discussed in Section 1.1, with continuously differentiable inverse and such that . The volume in the best ask queue is then given by
[TABLE]
may be similarly defined for the bid side. These quantities represent the depth at the top of the book; we will refer to them as ‘market depth’. In the case of linear scaling , using a second order expansion in yields
[TABLE]
Similarly, for the bid side
[TABLE]
Substituting these expressions in (1.5), we obtain the following dynamics of the mid-price:
[TABLE]
We observe that price dynamics is entirely determined by the order flow at the top of the book and the depth of the limit order book around the mid-price. The tick size , used in the derivation, does not appear anymore in (1.9). The only trace of the microstructure is the impact coefficient which relates the order flow imbalance to the magnitude of the price change, and whose amplitude may vary across assets.
Remark 1.2*.*
Equation (1.9) requires left and right-differentiability of at the origin. This can be guaranteed whenever takes values in the Sobolev space , for some which will be the case in our model. Note however that, in contrast to [Lasry and Lions, 2007], in general : the difference between these two quantities is proportional to the order flow imbalance which drives price moves.
Remark 1.3*.*
As noted in Remark 1.1, in the case corresponds to a constant centered order book profile . In this case, Equation (1.9) implies i.e. the price is constant, which is consistent with a zero net order flow. This is a (desirable) consequence of the consistency between the price dynamics (1.9) and the order book dynamics (1.2).
1.4. Dynamics in absolute price coordinates
The model above describes dynamics of the order book in relative price coordinates, i.e. as a function of the (scaled) distance from the mid-price. The density of the limit order book parameterized by the (absolute) price level is given by
[TABLE]
where we extend to by setting for . Assume follows an (arbitrary) Itô process
[TABLE]
where and is predictable and integrable and and are predictable and square-integrable processes. This includes the case of price dynamics (1.9), which can be used to express in terms of and model parameters. We will not go into such detail here but will return to this in the examples in Section 3 and 4. Define
[TABLE]
Using a (generalized) Itô-Wentzell formula (see Appendix A), we can show that is the solution of a stochastic moving boundary problem [Mueller, 2018]:
[TABLE]
for , and
[TABLE]
for with the moving boundary conditions
[TABLE]
We refer to (1.13) as a stochastic boundary condition at .
Here, we assumed for simplicity that , . A more detailed discussion of this result is given in Appendix A.
1.5. Linear evolution models for order book dynamics
We will now describe a more general class of linear models for order book dynamics, rich enough to cover the examples we discussed so far, but also covering all level-1 models where the best bid and ask queue are modeled by positive semimartingales. Generally, the densities of orders in the bid and ask side will take values in some function spaces and , respectively. We assume that orders at relative price level for will be cancelled. The relative price levels are on the bid side , and on the ask side . Then, in order to preserve the interpretation of a density it will be reasonable to ask and . From mathematical side, we will assume that and are real separable Hilbert spaces. For notational convenience we now also set .
The density of limit orders at relative price level and time is given by , such that is an -valued adapted process. The initial state is described by , such that is an element in . The (averaged) intra-book dynamics are modeled by linear operators , for , which we assume to be densely defined and such that for there exist weak solutions in of the equations
[TABLE]
for each initial state .
The random order arrivals and cancellations are assumed to be proportional and are modelled by cadlag semimartingales and , which we assume to have jumps greater than almost surely. We assume the initial order book state is denoted by and we write , .
Model 1.4* (Linear Homogeneous Evolution).*
The general form of the linear homogeneous model is
[TABLE]
for , and . can be alternatively expressed as
[TABLE]
where and are solutions of (1.14), see Theorem 2.5 below. If, in addition, and are of bounded variation, then we obtain the price dynamics (1.9).
Corollary 1.5**.**
Assume the setting of Model 1.4 and, in addition, that is an eigenfunction of with eigenvalue , for and . Then, (1.14) can be solved explicitly and
[TABLE]
Remark 1.6*.*
In case that and are (local) martingales, the eigenvalues and play the role of net order arrival rates on bid and ask side, respectively.
Model 1.7* (Linear models with source terms).*
A more realistic setting assumes in addition an influx/outflow of orders at a rate which depends on the distance to the mid price [Cont et al., 2010]. The equation then becomes:
[TABLE]
for , with initial condition .
As we will discuss in Section 4, an interesting case is when (resp. ) is an eigenfunction of (resp. ) associated with some eigenvalue (resp. ). Then by Theorem 2.10 we obtain
[TABLE]
where, for , is the solution of
[TABLE]
Remark 1.8*.*
If , the state of the order book is mean reverting to the state . We will give an example of such a mean-reverting order book model in Section 4.
Remark 1.9*.*
Any model for the dynamics of the order book implies a model for price dynamics via (1.9). In particular this implies a relation between price volatility and parameters describing order flow, in the spirit of [Cont and de Larrard, 2013]. We will derive this relation for the examples studied in the sequel and use it to construct a model-based intraday volatility estimator.
In the next section, we will study this class of models from a mathematical point of view. We will then continue with the analysis of the two examples mentioned above in Sections 3 and 4.
2. Linear stochastic PDE models with multiplicative noise
In order to further study the properties of the SPDE model (1.2), we require a more explicit characterization of the solution, in order to compute various quantities of interest and estimate model coefficients from observations. A useful approach is to look for a finite dimensional realization of the infinite-dimensional process :
Definition 2.1** (Finite dimensional realizations).**
A process taking values in an (infinite-dimensional) function space is said to admit a finite dimensional realization of dimension if there exists an -valued stochastic process and a map
[TABLE]
Availability of a finite dimensional realization for the SPDE (1.2) makes simulation, computation and estimation problems more tractable, especially if the process is a low-dimensional Markov process. Existence of such finite-dimensional realizations for stochastic PDEs have been investigated for SPDEs arising in filtering [Lévine, 1991] and interest rate modelling [Filipovic and Teichmann, 2003, Gaspar, 2006].
We will now show that finite dimensional realizations may indeed be constructed for a class of SPDEs which includes (1.2), and use this representation to perform an analytical study of these models.
2.1. Homogeneous equations
We now consider a more general class of linear homogeneous evolution equations with multiplicative noise taking values in a real separable Hilbert space . Typically, will be a function space such as for some interval . We consider the following class of evolution equations:
[TABLE]
where is a real càdlàg semimartingale whose jumps satisfy a. s. and a linear operator on whose adjoint we denote by . We assume that is dense, and is closed. Since is closed we have that also is dense and that [Yosida, 1995, Theorem VII.2.3].
Definition 2.2**.**
An adapted -valued stochastic process is an (analytical) weak solution of (2.1) with initial condition if, for all , is càdlàg a. s. and for each , a. s.
[TABLE]
The case corresponds to a notion of weak solution for the PDE:
[TABLE]
That is, for all ,
[TABLE]
where the integral on the right hand side is assumed to exist.222Note that this slightly differs from the classical formulation of weak solutions for PDEs. In particular, this yields that is continuous.
Remark 2.3*.*
By considering bid and ask side separately, we can bring (1.2) into the form of (2.1), where is a Brownian motion and is given by on , or , with domain
[TABLE]
where is the closure in of test functions with compact support in .
Denote by the stochastic exponential of . We recall the following useful lemma (see e.g. [Karatzas and Kardaras, 2007, Lemma 3.4]):
Lemma 2.4**.**
Let
[TABLE]
Then, almost surely, for all . Moreover,
[TABLE]
Theorem 2.5**.**
Let , . Then every weak solution of (2.1) is of the form
[TABLE]
where is a weak solution of (2.2).
Remark 2.6*.*
In particular, the SPDE (2.1) admits a two dimensional realization in the sense of Definition 2.1 with factor process and .
Proof.
Set , , and for write , . Since is continuous and is scalar and càdlàg, we get that is càdlàg. Note that is of finite variation and is a semimartingale, so that also is a semimartingale. Moreover, by Itô product rule and since is of finite variation and continuous,
[TABLE]
which is (2.1). Now, let be a solution of (2.1) and set
[TABLE]
and , . Recall that by Lemma 2.4 we have for all . Set , and, as above, fix and write and . By Itô’s product rule and Lemma 2.4,
[TABLE]
Thus, is a weak solution of (2.2). ∎
Example 2.7**.**
Let be the generator of a strongly continuous semigroup . Then, for define
[TABLE]
which is a weak solution of (2.2). By Theorem 2.5.
[TABLE]
is a weak solution of (2.1).
Remark 2.8*.*
If is an eigenfunction of with eigenvalue , then, is the unique locally -integrable solution of (2.2), and the unique solution of (2.1) is given by
[TABLE]
2.2. Inhomogeneous equations
We keep the assumptions on , and from the previous section and let . We now consider the inhomogeneous linear evolution equations
[TABLE]
Definition 2.9**.**
A weak solution of (2.7) is an adapted -valued stochastic process such that for all the mapping is càdlàg and
[TABLE]
almost surely.
We exclude the cases or which correspond to the homogeneous case discussed above. Let us first consider the case where admits at least one eigenfunction.
Theorem 2.10**.**
Suppose that is an eigenfunction for with eigenvalue , and let and be the solution of
[TABLE]
Then:
- (i)
The stochastic process defined by , , is a solution of (2.7) with initial condition . 2. (ii)
Let, in addition, be such that there exists a weak solution of the deterministic equation
[TABLE]
Then, is a solution of (2.7) with initial condition . 3. (iii)
Let be such that there exists a weak solution of (2.7) with initial condition . Then, , is a weak solution of (2.9).
Remark 2.11*.*
Let and be given by (2.8) with respective initial data , , . Then, in fact , which is consistent with choosing different values for in (ii).
Proof.
Part (i) follows by direct a computation: Let , then for ,
[TABLE]
Similarly, we obtain that any solution of (2.7) with initial data can be written as
[TABLE]
where is the solution of the homogeneous problem (2.1) with initial data and is a solution of (2.7) with initial data . Then, part (i) and Theorem 2.5 finish the proof of (ii) and (iii). ∎
It is then readily verified using Itô’s formula that the unique solution of (2.8) is given by
[TABLE]
where
[TABLE]
We now focus on the case for a real Brownian motion and a constant . Then, we will consider regular two-dimensional realizations of the form , where
- (a)
is a diffusion process with state space , satisfying
[TABLE]
for measurable functions , , where has non-empty interior, for all and is locally integrable on . 2. (b)
such that for all , the maps defined by , , , are in .
Examples of such regular two-dimensional realizations are given by Theorem 2.10.(i).
Theorem 2.12**.**
Let , , for and a real Brownian motion , and assume that (2.7) admits a regular finite-dimensional realization , . Then is an eigenfunction of for some eigenvalue , and there exists an invertible transformation such that for , almost surely
[TABLE]
where is given by (2.8).
Proof.
Let , and
[TABLE]
An application of the Itô formula yields
[TABLE]
Comparing the martingale term with (2.7), we see that satisfies the ODE
[TABLE]
-a. e. and hence, must be of the form
[TABLE]
for some and in the interior of . The regularity property of the representation guarantees that is well-defined and strictly monotone increasing. We stress that is in fact independent of . Setting , we see that satisfies
[TABLE]
for the drift function .
Note that for each , the mapping is linear continuous from into . Since is dense, by Riesz representation theorem for each there exists such that
[TABLE]
Since , is differentiable and (2.13) becomes, for ,
[TABLE]
Comparing the drift terms with (2.7) yields for , and ,
[TABLE]
Evaluating at two different points and subtracting we obtain that
[TABLE]
for all , and . We conclude that there exists a constant such that
[TABLE]
Thus must be of the form for . Inserting into (2.15) we obtain that
[TABLE]
Since is dense the equation holds for all . Due to the assumption that and is non-zero, also and we get . In particular, is independent of and (2.16) yields
[TABLE]
This means that . Since , see e. g. [Yosida, 1995, Theorem VII.2.3], we have and
[TABLE]
By density of in this yields that , i.e. must be an eigenfunction of with eigenvalue .
Putting everything together, we have shown that where
[TABLE]
Rescaling by concludes the proof. ∎
2.3. Linear SDEs & Pearson diffusions
Let again for some and a real Brownian motion . The factor processes appearing above are then special cases of the linear SDE
[TABLE]
studied e.g. in [Kloeden and Platen, 1992, Ch. 4] or [Kallenberg, 2002, Prop. 21.2]. Well-known special cases are the geometric Brownian motion () and the Ornstein-Uhlenbeck-process (). Relevant in our context is the less common case , on which we focus now. Using (2.11), the solution is given by
[TABLE]
where
[TABLE]
Solutions of (2.18) have also been studied in the context of reciprocal gamma diffusions (see e.g. the ‘Case 4’ in [Forman and Sørensen, 2008]) or also Pearson diffusions. These are generalizations of (2.18) that allow for a square-root term in the diffusion coefficient.
Proposition 2.13**.**
Assume that , and . Then, has unique invariant distribution , which is an Inverse Gamma distribution with shape parameter and scale parameter and, for any bounded measurable function ,
[TABLE]
Proof.
First, note that
[TABLE]
define a scale density and speed measure for . Then, one can easily verify that is strictly positive and recurrent on , see e. g. [Karatzas and Shreve, 1987, Prop. 5.5.22]. Moreover, and so the unique invariant distribution of is
[TABLE]
The remaining results then follow from e. g. [Borodin and Salminen, 2012, II.35] or [Revuz and Yor, 1999, X.3.12]. ∎
, , satsifies the ODE
[TABLE]
Thus,
[TABLE]
Remark 2.14*.*
Let , and be the stationary solution of
[TABLE]
that is, is chosen distributed according to inverse gamma distribution with shape parameter and scale parameter . Then, as shown in [Bibby et al., 2005], the autocorrelation function of is given by
[TABLE]
To study price dynamics it is also useful to examine the reciprocal process . When , is the unique solution of
[TABLE]
In particular, with given in (2.20),
[TABLE]
When , (2.24) is called the stochastic logistic equation.
2.4. Positivity, stationarity and martingale property
Let us first come back to the linear homogeneous situation. On average, market makers do not accumulate inventory, which suggests to consider the baseline case of balanced order flowfor which is a (local) martingale. If is a local martingale with a. s., then, from the properties of stochastic exponentials we obtain that:
- •
The weak solution of the homogeneous equation (2.1) is a local martingale, if and only if the initial condition is harmonic: and .
- •
If is a martingale and , then is a martingale.
In the Brownian motion case, from the discussion in the previous section we directly obtain:
Corollary 2.15**.**
Let where is a standard Brownian motion and , and be the solution of the inhomogeneous equation (2.7), where is an eigenfunction of with eigenvalue and , for some . If and , then
[TABLE]
where has an Inverse Gamma distribution with shape parameter and scale parameter .
Remark 2.16*.*
The Inverse Gamma distribution has a Pareto (right) tail with tail index in this case: the -th moment of if and only if .
So far, we have set aside the positivity constraint for . By Theorem 2.5 this reduces to analysis of the deterministic equation. In the case of second-order elliptic operators, positivity results from the comparison principle, whenever the initial condition is positive:
Assumption 2.17**.**
Let be an interval and suppose that is a uniformly elliptic operator of the form
[TABLE]
with Dirichlet boundary conditions, and where and are smooth and bounded coefficients, and in particular for all .
In addition, the principal eigenvalue of , has an eigenfunction which is positive on [Evans, 2010, Sec. 6.5]. Note that the factor process has state space both in Theorem 2.5 and 2.5. We thus obtain the following corollary.
Corollary 2.18** (Positivity).**
Under Assumption 2.17,
- (i)
If is positive on , then the solution of (2.2) and the solution of (2.1) are a.s. positive on . 2. (ii)
If is the principal eigenfunction of , then the finite-dimensional realization of (2.7) is a.s. positive on .
This simple result thus guarantees the existence of a solution with the correct sign, thereby avoiding recourse to ‘reflected’ solutions as in [Hambly et al., 2020] and considerably simplifying the analysis of our model.
3. A two-factor model
We now study the simplest example of model satisfying Assumption 2.17, namely the case of constant coefficients , , , , , , , ;
[TABLE]
together with the sign condition:
[TABLE]
In the following, we will write and .
3.1. Spectral representation of solutions
A spectral representation of the operator may be used to obtain an analytical solution to this model.
Proposition 3.1**.**
Let or and , , , and consider the linear operator
[TABLE]
on , with . The eigenvalues of are real and given by
[TABLE]
with corresponding eigenfunctions
[TABLE]
In particular the only positive eigenfunction is .
Proof.
First we note that that is an eigenfunction of with eigenvalue , if and only if
[TABLE]
is an eigenfunction of with zero Dirichlet boundary conditions, for eigenvalue . Details of calculations are given in [Cont, 2005]. The operator with domain is self-adjoint, has compact resolvent [Cont, 2005] and eigenvalues
[TABLE]
Eigenfunctions of with eigenvalue are solutions of the Sturm-Liouville problem
[TABLE]
with zero boundary conditions, which yields that must be of the form
[TABLE]
The zero boundary conditions at [math] and imply for some so
[TABLE]
Translating this from to yields the result. ∎
Define the following bilinear forms:
[TABLE]
and
[TABLE]
which define equivalent inner products, respectively for and . For , and , define
[TABLE]
Let
[TABLE]
Then is an orthonormal basis of and is an orthonormal basis for and solutions for the SPDE may be constructed using an expansion along these bases:
Proposition 3.2**.**
Let , , .Then defined by
[TABLE]
is the unique continuous weak solution of (3.1) in the sense of Definition 2.2.
Proof.
The unique continuous solutions of the respective deterministic equations are given by and , where and are the Dirichlet semigroups generated by
[TABLE]
on and , respectively. Thus, from Theorem 2.5 we get
[TABLE]
and are linear continuous so that for each , ,
[TABLE]
By Proposition 3.1 (resp. ) are eigenfunctions of (resp. ) and thus also of (resp. ). This yields the desired representation, where the series converge in . To obtain pointwise convergence, we note that for and , by Cauchy-Schwarz inequality, Parseval’s identity and integral criterion for sequences, for ,
[TABLE]
When , then the weights of the spectral decomposition decay exponentially in for large . This justifies approximating the solution by the first few terms. Note also that the only positive eigenfunctions are the principal eigenfunctions and so the sign constraints (3.2) only if the projection of the solution along the principal eigenfunctions dominates the other terms in the expansion. This motivates us to focus on solutions which live in the first eigenspace. This occurs if the initial condition is a (positive) linear combination of and . We will later show that this assumption is supported by market data. This leads to a finite-dimensional realization which satisfies the sign constraints (3.2):
Corollary 3.3**.**
Let resp. and define
[TABLE]
The unique solution of (3.1)–(3.2) with initial condition is given by
[TABLE]
[TABLE]
[TABLE]
In particular, , and
[TABLE]
The normalization (3.18) allows to interpret the variables in terms of order book volume and depth: (resp. ) represents the volume of sell (resp. buy) orders, while (resp. ) represents the depth at the top of the book. In this simple two-factor model, these two are proportional to each other: they may be decoupled by considering multifactor specifications involving higher-order eigenfunctions.
The drift parameter (resp. ) thus represents the net growth rate of decrease of the volume of sell (resp. buy) orders. As shown in (3.20), this net growth rate results from the superposition of several effects:
- •
submission/ cancellation of limit sell (resp. buy) orders by directional sellers (resp. buyers) at rate (resp. ); this may be interpreted as the ‘low frequency’ component of the order flow;
- •
replacement of limit orders by new ones closer to the mid-price, at rate (resp. );
- •
cancellation of limit orders as the mid-price moves away (i.e. at distance from the mid-price), at rate (resp. ).
In the case of a balanced order flow for which there is no systematic accumulation or depletion of limit orders away from the mid-price, these terms compensate each other and the volume of limit orders in any interval is a (local) martingale. The following result follows from the remarks in Section 2.4:
Corollary 3.4** (Balanced order flow).**
The order book density is a local martingale (in ), if and only if
[TABLE]
for some and
[TABLE]
Remark 3.5* (Balance between high- and low-frequency order flow).*
The balance condition (3.22) expresses a balance between the slow arrival of directional orders, represented by the terms and , and the fast replacement of orders inside the book, represented by the terms and , and finally the cancellation of limit orders deep inside the book, at rate
This balance between order flow at various frequencies may be seen as a mathematical counterpart of the observations made by [Kirilenko et al., 2017] on the nature of intraday order flow.
3.2. Shape of the order book
An implication of the above results is that the average profile of the order book is given, up to a constant, by the principal eigenfunctions :
[TABLE]
Dropping the indices , the normalized profile of the order book has the form:
[TABLE]
where is such that :
[TABLE]
Figure 3 shows this function for different values of : has a unique maximum at
[TABLE]
The position of the maximum moves closer to the origin as is increased. For we have , and, on the other hand as . Typically, the order book profile for liquid large–tick securities a few ticks from the mid price. Figure 4 shows the average order book profile for QQQ; similar results were found in [Bouchaud et al., 2009, Cont et al., 2010]. This suggests is of the order of a few ticks, so we are interested in the parameter range for which is large.
The value at the maximum is
[TABLE]
which grows linearly as , as shown in Figure 3, where we have plotted , normalized by its -norm, for various values of with and .
The above results are valuable for calibrating the model parameters , and to reproduce the average profile (for each side) of the order book.
can be estimated from the position of the maximum using (3.24). Note that, when is large then
[TABLE]
The height of this maximum gives a further constraint on parameters, using (3.25).
We will use this result for parameter estimation in Section 3.6.
3.3. Dynamics of order book volume
As noted in Corollary 3.3, and may be identified as the volume of sell (resp. buy) limit orders: they follow (correlated) geometric Brownian motions:
[TABLE]
where . The average volume of the order book satisfies
[TABLE]
Intraday studies of order book volume show it to be stable away from the open and close. Here if and only if is a martingale, i. e. .
3.4. Dynamics of price and market depth
Recall from the discussion in Section 1.3 that the order book dynamics yield the price process
[TABLE]
where is an impact coefficient and and represent the depth at the top of the order book [Cont et al., 2014]:
[TABLE]
Using the results in Corollary 3.3, we obtain the following price dynamics:
[TABLE]
where
[TABLE]
The price dynamics can thus be written as
[TABLE]
where is a Brownian motion and is the mid price volatility, which may be expressed in terms of parameters describing the order flow:
[TABLE]
The implied price dynamics thus corresponds to the Bachelier model:
- •
The drift term only depends on the rate of relative increase of the bid/ask depth, not the actual depths and .
- •
The quadratic variation of the mid price is decreases with the correlation between the buy and sell order flow. This correlation, generated by market makers, reduces price volatility.
Remark 3.6*.*
Replacing and by arbitrary semimartingales and with jumps bounded from below by , yields the following price dynamics:
[TABLE]
In particular, this relation links price jumps to large changes (‘jumps’) in order flow imbalance:
[TABLE]
3.5. Absolute price coordinates: stochastic moving boundary problem
The model above describes dynamics of the order book in relative price coordinates, i.e. as a function of the (scaled) distance from the mid-price. The density of the limit order book parameterized by the (absolute) price level is given (in the case of linear scaling) by
[TABLE]
where we extend to by setting for . As observed in Section 3.4, the mid-price dynamics is given by
[TABLE]
The dynamics of may then be described, via an application of the Itô-Wentzell formula, as the solution of a stochastic moving boundary problem [Mueller, 2018]:
Theorem 3.7** (Stochastic moving boundary problem).**
The order book density , as a function of the price level is a solution, in the sense of distributions, of the stochastic moving boundary problem
[TABLE]
for , and
[TABLE]
for with the moving boundary conditions
[TABLE]
in the following sense: is an continuous -valued stochastic process and for all and ,
[TABLE]
where we denote, for , ,
[TABLE]
for .
Remark 3.8*.*
Note that (3.36) is a stochastic boundary condition at .
The proof, given in Appendix A, is based on Krylov’s extended Itô-Wentzell formula [Krylov, 2011, Theorem 1.1].
3.6. Parameter estimation
We now describe a method for estimating model parameters. We use time series of order books for NASDAQ stocks and ETFs, from the LOBSTER database.
Given that we do not observe separately the various components of the order flow as in (3.1), we use the relations discussed in Sec. 3.2 to calibrate the parameters , and the shape parameter
[TABLE]
for each side of the order book. We set to the largest value in our data set, (). Parameters may be calibrated either through
- (a)
a least squares fit of (3.23) to the average order book profile, or 2. (b)
calibrating parameters to reproduce the position and height of the maximum of the order book profile.
Remark 3.9*.*
The estimator based on the maximum position of the peak is fast in computation but the fixed price level grid in the data restricts the set possible values for estimation of . In particular, the estimator is sensitive to the location of the maximum (i.e. the mode of the order book profile).
We show results for a set of NASDAQ stocks and ETFs. Figure 4 shows how the model reproduces the average book profile for QQQ at NASDAQ on 17th November 2017. In Figure 5 we see the coefficient estimated across various 30-min windows during the trading day. The one-factor model based on the principal eigenfunction yields a reasonable approximation for the average order book profile, which justifies our assumptions on the dynamics in Section 1.2.
For low-price/large tick stocks, the average order book profiles may differ from the exponential-sine shape. For such stocks, we use the nonlinear scaling described in Section 1.1, leading to an average order book profile:
[TABLE]
where is the best price. Figure 6 shows such a nonlinear fit for the average order book profile of SIRI.
4. Mean-reverting models
4.1. A class of models with mean-reversion
We now return to the full model (1.2) with non-zero source terms representing the rate of arrival of new limit orders at a distance from the best price:
[TABLE]
with the sign condition
[TABLE]
where, as above , , , , , , , are constants and . As above, we denote and . We will show that, when and are negative and , for all , this class of models leads to mean reverting dynamics for the order book profile, consistent with the observation that intraday dynamics of order book volume and queue size over intermediate time scales (hours, day) typically exhibit mean reversion rather than a trend.
Projecting the equation on the eigenfunctions , as in Section 3, we see that, due to the fast increase in the eigenvalues (3.4), solutions starting from a generic initial condition may be approximated by their projection on the principal eigenfunctions (we will justify this below in Proposition 4.2) and the main contribution of heterogeneous order arrivals arises from the projection of (resp. ) on (resp. ).
This motivates the following specfication, which leads to a tractable class of models:
[TABLE]
Theorem 2.10 then gives explicit solutions to (1.2). Recall the notations (3.10) and (3.9) and define and by
[TABLE]
where , . The solution of the SPDE may then be obtained as follows:
Proposition 4.1**.**
- (i)
The unique -continuous solution of (1.2) – (4.1) for a general initial condition is given by
[TABLE] 2. (ii)
For an initial condition of the form
[TABLE]
the unique -continuous solution of (1.2) – (4.1) is given by
[TABLE]
Proof.
We obtain the general solution of the linear homogeneous equation from Proposition 3.2. The series representation of results from the spectral decomposition, Proposition 3.1 and Theorem 2.10. ∎
4.2. Long time asymptotics and stationary solutions
In order to derive properties of the ’average’ order book profile, we now examine whether the order book profile has an ergodic behavior and describe stationary solutions. The following result describes the long-term dynamics and shows that this dynamics is well approximated by projecting the initial condition on the principal eigenfunctions as done in (4.3):
Proposition 4.2**.**
Let be the unique solution of (1.2) – (4.1) for a general initial condition and define:
[TABLE]
If and , then:
- (i)
The long-term dynamics of the order book is well approximated by the dynamics (4.4) projected along the principal eigenfunctions:
[TABLE] 2. (ii)
* has a unique stationary distribution and*
[TABLE]
where are given by (4.1) and (resp. ) is an Inverse Gamma random variable with shape parameter (resp. ) and scale parameter (resp. ). 3. (iii)
If furthermore and , then
[TABLE]
Proof.
For , let
[TABLE]
This term is indeed finite by integral criterion for series, see e. g. proof of Proposition 3.2. Denote the unique solution of the linear homogeneous equation (3.1) for an initial condition . Recall from Theorem 2.10 that
[TABLE]
It suffices now to prove the results for the ask side and note that the calculations will be analogous for the bid side. Using the representation of from Proposition 3.2 we get for all and all ,
[TABLE]
which, as , converges to [math] provided that . This proves (i).
To show (iii), a similar calculation but using the orthogonality of the decomposition in Proposition 3.2 yields
[TABLE]
If , then this converges to [math] as . Since defines an equivalent norm on , this finishes the proof of (iii).
Assertion (ii) follows from Proposition 2.13. Indeed, recall that are ergodic processes whose unique invariant distribution is given by an Inverse Gamma distribution with shape parameter and scale parameters , . Denote by and random variables with these distribution For any , we have the convergence in distribution
[TABLE]
Since almost sure convergence yields convergence in distribution, by part (i) this yields that (4.8) holds also for with arbitrary initial data . ∎
4.3. Dynamics of order book volume
Consider now the ‘projected’ dynamics as in the setting of Proposition 4.1.(ii). The dynamics of the order book volume is then given by
[TABLE]
where and , defined in (4.2), represent the volume of buy (resp. sell) orders in the order book.
Since we can write
[TABLE]
for some Brownian motion , independent of . Then,
[TABLE]
In particular, the quadratic variation (‘realized variance’) of the order book volume is given by
[TABLE]
For the symmetric and perfectly correlated case, is itself a reciprocal gamma diffusion:
Corollary 4.3**.**
Assume the setting of Proposition 4.1.(ii) and, in addition, that , and . Then, is the unique solution of
[TABLE]
with .
In all cases, we get from (2.22) that for , ,
[TABLE]
and
[TABLE]
4.4. Joint dynamics of mid-price and market depth
We now consider the mid price and market depths dynamics in the situation of Proposition 4.1.(ii). As discussed in Sections 1.3 and Section 3.4 for the linear homogeneous models, the dynamics of the mid-price is given by
[TABLE]
where is an impact coefficient, while the bid/ask depths follow
[TABLE]
Thus, the dynamics of the market depths are given by
[TABLE]
for some mean reversion levels . We thus obtain the joint dynamics of price and market depth:
[TABLE]
where and are independent Brownian motions. The mid-price itself has quadratic variation , where
[TABLE]
Over a small time interval
[TABLE]
where is a standard Gaussian variable. In particular the conditional probability of an upward mid-price move of size is given by
[TABLE]
where denotes the cumulative distribution function of the standard normal distribution.
Remark 4.4*.*
Using (2.22), the expected order flow over a small time interval on each side of the book is given by for ,
[TABLE]
Remark 4.5* (Mean-reverting order book imbalance).*
The imbalance between buy and sell depth is a frequently used indicator for predicting short term price moves [Cartea et al., 2018, Cont and de Larrard, 2013, Lipton et al., 2014]). In this model, the depth imbalance has the following dynamics:
[TABLE]
In the symmetric case, when , , (4.17) becomes
[TABLE]
This quantity is decreasing in the depth imbalance : this is a consequence of the mean reversion in order book depth. In the symmetric case
[TABLE]
so the model reproduces the empirical observation that order book imbalance is mean reverting [Cartea et al., 2018].
Note that the model predicts mean reversion of market depths on the scale of which corresponds to seconds for the ETFs QQQ and SPY and around 10 seconds for large tick stocks such as MSFT and INTC (see Table 1). For time scales smaller than , the direction of price moves is highly correlated with order flow imbalance, as shown in empirical studies of equity markets [Cont et al., 2014].
4.5. Parameter estimation
We now discuss estimation of model parameters from a discrete set of observations of the bid/ask volumes on a uniform time grid . Let us rewrite the dynamics of and in the form of reciprocal Gamma diffusions:
[TABLE]
with , , . We use method of moments estimators as in [Leonenko and Šuvak, 2010] for and and a martingale estimation function [Bibby and Sørensen, 1995] for the autocorrelation parameters , : we define
[TABLE]
Combining Proposition 2.13 and Remark 2.16 with [Leonenko and Šuvak, 2010, Theorem 6.3] we obtain that if and , then has finite th moment and the estimators are consistent and asymptotically normal.
For the autocorrelation parameters and we use the martingale estimation function [Bibby and Sørensen, 1995, Section 2]:
[TABLE]
where
[TABLE]
Given , this yields the estimators
[TABLE]
Convergence of this estimator is discussed in [Bibby and Sørensen, 1995, Theorem 3.2].
We apply these estimators to high-frequency limit order book time series for NASDAQ stocks and ETFs, obtained from the LOBSTER database, arranged into equally spaced observations over time intervals of size and . For each observation we use as market depth the average volume of order in the first two price levels, respectively on bid and ask side.333The source code for the implementation is available online [Cont and Mueller, 2018]. Below we show sample results for ETFs (SPY and QQQ) and liquid stocks,(MSFT and INTC).
Figure Table 1 shows estimated parameter values across different days for INTC, MSFT, QQQ and SPY. We observe negative values of correlation across bid and ask order flows which is consistent with observations in [Carmona and Webster, 2013].
Figures 7 and 8 show intraday variation of estimators for , , , and computed over 15-minute windows.
There are various estimators for intraday price volatility in this model, which allows to test the model. Recall that in (4.16) we expressed price volatility in terms of the parameters describing the order flow:
[TABLE]
where is the impact coefficient. We call this the RV estimator.
Another estimator is obtained by first estimating and using the martingale estimation function (4.22) then computing the price volatility using Equation (4.25). We label this the RCG estimator.
Finally, one can compute the realized variance of the price over a 30 minute time window using price changes over 10 ms intervals. Comparing these different estimators is a qualitative test of the model.
Figure 9 compares these estimators, computed over 30 minute time windows: we observe that the model-based estimators are of the same order and closely track the intraday realized price volatility, which shows that the model captures correctly the qualitative relation between order flow and volatility.
Appendix A Dynamics in absolute price coordinates
We now discuss in more detail the generalized Itô-Wentzell formula for distribution-valued processes, which is used in Section 3.5 to derive the dynamics of the (non-centered) order book density . Let be the space of smooth compactly supported functions on , its dual, the space of generalized functions. We denote by and the first two derivatives in the sense of distributions and by the duality product on .
A -valued stochastic process on a filtered probability space is called -predictable if for all the real valued process is predictable.
Let and and , be predictable valued processes. We assume that for all , and all , almost surely
[TABLE]
Let be independent scalar Brownian motions. We consider an equation of the form
[TABLE]
Definition A.1**.**
A -valued stochastic process is called a solution of (A.2) in the sense of distributions with initial condition if for and
[TABLE]
holds almost surely.
The following change of variable formula is a special case of a result by Krylov [Krylov, 2011, Theorem 1.1]:
Theorem A.2** (Generalized Itô-Wentzell formula).**
Let be a solution of (A.2) in the sense of distributions and let be a locally integrable process with representation
[TABLE]
where and are real-valued predictable processes. Define the -valued process by , for , . Then is a solution of
[TABLE]
in the sense of distributions.
Remark A.3*.*
It is worth noting that the correlation of and contributes the term
[TABLE]
We now apply the above Itô-Wentzell formula in order to derive the dynamics of the order book density , in non-centered coordinates, in the setting considered in Sections 3 and 4.
Let and . For , . Then, (1.2) with initial condition admits a unique (analytically) strong solution denoted by . Let be the trivial extension of to , i. e.
[TABLE]
Note that . Recall that and in the previous discussions denoted the weak derivatives on , and we get that and
[TABLE]
where denotes a point mass at . Define
[TABLE]
so that
[TABLE]
The Cauchy-Schwartz inequality shows that (A.1) is satisfied. Assume now that the mid price follows the dynamics
[TABLE]
for some integrable predictable process . Define
[TABLE]
Then, Theorem A.2 yields that for we get
[TABLE]
i. e. is a solution of the stochastic moving boundary problem,
[TABLE]
To define what we mean by solution in this context we introduce the mappings
[TABLE]
Define now the functions , , as
[TABLE]
for , , , , .
Following [Mueller, 2018, Definition 1.11], a solution of (A.13) is an -continuous stochastic process , taking values in
[TABLE]
such that is given by (A.10) and, in the sense of distributions,
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Bibby et al., 2005] Bibby, B. M., Skovgaard, I. M., and Sørensen, M. (2005). Diffusion-type models with given marginal distribution and autocorrelation function. Bernoulli , 11(2):191–220.
- 2[Bibby and Sørensen, 1995] Bibby, B. M. and Sørensen, M. (1995). Martingale estimation functions for discretely observed diffusion processes. Bernoulli , 1(1-2):17–39.
- 3[Borodin and Salminen, 2012] Borodin, A. N. and Salminen, P. (2012). Handbook of Brownian motion-facts and formulae . Birkhäuser.
- 4[Bouchaud et al., 2009] Bouchaud, J.-P., Farmer, J., and Lillo, F. (2009). How markets slowly digest changes in supply and demand. In Hens, T. and Schenk-Hoppe, K. R., editors, Handbook of financial markets: dynamics and evolution , pages 57–160. Elsevier.
- 5[Burger et al., 2013] Burger, M., Caffarelli, L., Markowich, P. A., and Wolfram, M.-T. (2013). On a Boltzmann-type price formation model. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences , 469(2157).
- 6[Caffarelli et al., 2011] Caffarelli, L. A., Markowich, P. A., and Pietschmann, J.-F. (2011). On a price formation free boundary model by Lasry and Lions. Comptes Rendus Mathematique , 349(11):621 – 624.
- 7[Carmona and Webster, 2013] Carmona, R. and Webster, K. (2013). The Self-Financing Equation in High Frequency Markets. Ar Xiv 1312.2302.
- 8[Cartea et al., 2018] Cartea, A., Donnelly, R., and Jaimungal, S. (2018). Enhancing trading strategies with order book signals. Applied Mathematical Finance , 25(1):1–35.
