Mean field games with controlled jump-diffusion dynamics: Existence results and an illiquid interbank market model
Chiara Benazzoli, Luciano Campi, Luca Di Persio

TL;DR
This paper investigates mean field games with controlled jump-diffusion dynamics, establishing existence of solutions and applying the results to a model of an illiquid interbank market with numerical analysis.
Contribution
It extends mean field game theory to jump-diffusion processes with controlled coefficients and provides existence results along with a practical interbank market model.
Findings
Existence of solutions in a relaxed mean field game with jump-diffusion dynamics.
Conditions for Markovian optimal strategies in jump-diffusion setting.
Numerical results for an illiquid interbank market model.
Abstract
We study a family of mean field games with a state variable evolving as a multivariate jump diffusion process. The jump component is driven by a Poisson process with a time-dependent intensity function. All coefficients, i.e. drift, volatility and jump size, are controlled. Under fairly general conditions, we establish existence of a solution in a relaxed version of the mean field game and give conditions under which the optimal strategies are in fact Markovian, hence extending to a jump-diffusion setting previous results established in [30]. The proofs rely upon the notions of relaxed controls and martingale problems. Finally, to complement the abstract existence results, we study a simple illiquid inter-bank market model, where the banks can change their reserves only at the jump times of some exogenous Poisson processes with a common constant intensity, and provide some numerical…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Mean field games with controlled jump-diffusion dynamics: Existence results and an illiquid interbank market model
Chiara Benazzoli
Department of Mathematics, University of Trento, Italy
,
Luciano Campi
Department of Statistics, London School of Economics, UK
and
Luca Di Persio
Department of Computer Science, University of Verona, Italy
Abstract.
We study a family of mean field games with a state variable evolving as a multivariate jump diffusion process. The jump component is driven by a Poisson process with a time-dependent intensity function. All coefficients, i.e. drift, volatility and jump size, are controlled. Under fairly general conditions, we establish existence of a solution in a relaxed version of the mean field game and give conditions under which the optimal strategies are in fact Markovian, hence extending to a jump-diffusion setting previous results established in [31]. The proofs rely upon the notions of relaxed controls and martingale problems. Finally, to complement the abstract existence results, we study a simple illiquid inter-bank market model, where the banks can change their reserves only at the jump times of some exogenous Poisson processes with a common constant intensity, and provide some numerical results.
Keywords: mean field games, jump measures, controlled martingale problem, relaxed controls, martingale measure, illiquid interbank market model.
1. Introduction
Mean field games (MFGs, henceforth) were introduced by Lasry and Lions in [32, 33, 34] and, independently, by Huang and co-authors in [24], as optimization problems approximating large population symmetric stochastic differential games, where the interaction between the players is of mean field type. A solution of the limit MFG allows to construct approximate Nash equilibria for the corresponding -player games if is large enough; see, e.g., [10], [11], [14], [24], [28] as well as the recent book [9]. This approximation result is also practically relevant since a direct computation of Nash equilibria in the -player game with very large is usually not feasible even numerically, due to the curse of dimensionality. Moreover, MFGs represent a very flexible framework for applications in various areas including but not limited to finance, economics and crowd dynamics (see [22, 9] for a good sample of applications), which partly explain the increasing literature on the subject.
In this paper, we study a family of MFGs with controlled jumps, that can be shortly described as follows. Let be a finite time horizon. We consider the controlled state variable taking values in and following the dynamics
[TABLE]
where is a probability measure on the Skorokhod space of right-continuous with left limit functions, represents a control process with values in a fixed action space , is a standard multivariate Brownian motion and is a compensated Poisson process with some time-dependent intensity . Moreover, we assume that and are independent. The precise meaning of and appearing in the dynamics above will be given in the next section. Intuitively, represents a flow of probability measures on the state space , while the left limit at time with respect to the weak convergence.
Given some running cost and some final cost , the aim is to find a control solving the following minimization problem
[TABLE]
over all control processes as above and such that the so-called mean field condition is fulfilled: the measure has to be equal to the law of the optimally controlled state variable.
This model is the natural limit of a symmetric nonzero-sum -player game for large enough, where each player controls the drift, the volatility as well as the jump sizes of her/his own private state and the states are coupled through their empirical distribution. In more detail, consider the following dynamics for the private state, , of player :
[TABLE]
where is the control of player at time , is the empirical distribution of the vector of the private states of all players, is a sequence of independent multivariate Brownian motions and is a sequence of compensated Poisson processes with the same intensity . The goal of each player is minimizing some objective function, given by
[TABLE]
over her/his controls. This is a symmetric stochastic differential game, where the agents interact through the empirical mean of their private states, entering in both the drift and the jump component. According to mean field game theory, we expect that as the number of player gets larger and larger, the -player game just described tends in some sense to the minimization problem in (1), that is the solution to the latter would provide a good approximation of some Nash equilibrium in the -player game. In the present paper, we focus on the existence of solutions for the limit MFG and we postpone to future research the other equally important issues of uniqueness of the MFG solution and approximation of Nash equilibria of the -player game when is large.
While the uncontrolled counter-part of MFG, that is particle systems and propagation of chaos for jump processes, has been thoroughly studied in the probabilistic literature (see, e.g., [21, 26] and the very recent preprint [3]), MFGs with jumps have not attracted much attention so far. In particular, MFGs for jump-diffusions have not been considered so far in the literature. Indeed, most of the existing articles focus on non-linear dynamics with continuous paths, with the exception of some papers such as [20], [23], [28], and the pre-print [15]. The paper [23] deals with stochastic control of McKean-Vlasov type (see [12] for a comparison between MFG and McKean-Vlasov control), whereas [28] uses methods based on potential theory and nonlinear Markov processes. The article [20] performs a detailed analysis of MFGs for continuous-time Markov chains, and the very recent [15] studies MFGs with finitely many states via a probabilistic approach based on a Poisson-type approximation. Finally, we cite also the paper [19], where MFGs with singular controls are considered for the first time, hence introducing the possibility of jumps in the state variables.
The approach we use in this paper is based on weak formulation of stochastic controls, relaxed controls and martingale problems, which is very much inspired by Lacker [31]. We extend to MFGs with jump-diffusive state variables, where the jump size as well as drift and volatility coefficients are controlled, the main results established by D. Lacker in [31]. More in detail, our main contributions can be summarized as follows:
- •
Using the very powerful relaxed approach to MFGs introduced by Lacker [31], we establish abstract existence results for a class of MFGs with a multivariate state variable of jump-diffusion type, under suitable growth conditions on its coefficients and on the cost functional, hence including the case of unbounded coefficients and controls. The existence of Markovian solutions is also considered and sufficient conditions granting it are given. The issue of approximation of the -player games by means of the MFG solution is considered in a companion paper [4], while the one of uniqueness is postponed to future research.
- •
We complement the abstract existence results with an illiquid interbank market toy model, inspired by the systemic risk model proposed by Carmona and co-authors in [13]. Within this model, using techniques based on forward/backward stochastic differential equations, we can compute explicitly the Nash equilibrium and see the convergence to the solution of the limit MFG. Moreover, we perform some numerical experiments showing the role of illiquidity in driving the evolution over time of the controls and the state variables.
The paper is structured as follows. Section 2 provides the relaxed version of the MFG with controlled jumps together with all the assumptions on the coefficients, while in Section 3 we state and prove the main existence results in the bounded case. In Section 4, we extend the existence result to the unbounded case, i.e. coefficients and controls can be unbounded. Section 5 identifies sufficient conditions guaranteeing the existence of a Markovian MFG solution. In Section 6 we study Nash equilibria and their convergence towards the MFG solution in the illiquid interbank model; some numerical experiments are also performed. Finally, the paper ends with an appendix, collecting most of the technical results used in the proofs of the main theorems.
2. The relaxed MFG problem with controlled jumps
Following the approach in [31], we are going to study a relaxed version of the MFG briefly described in the introduction. Slightly more precisely, we will re-define the state variable and the controls on a suitable canonical space supporting all the randomness sources involved in the SDE with jumps above, so that the solution to the MFG will be identified with a probability measure on that space, that can be seen as the joint law of the pair state/control as in [31]. Therefore, finding a relaxed solution to the MFG above will boil down to finding a fixed point for a suitably defined set-valued map. The rest of this section sets up the main assumptions on the state variable and the cost functions as well as the precise definition of relaxed mean field game with controlled jumps, while the issue of existence of solutions will be addressed in the next section.
2.1. Notation
We start with some notation that will be used throughout the whole paper. Let denote the set of all càdlàg functions with values in , i.e. the set of all right continuous with left limit functions , endowed with the Skorokhod topology . We will refer to [6] for all the definitions and results on this topology when needed. Given some metric space , denotes the set of all probability measures defined on the measurable space , where is the Borel -field of . will be equipped with the topology induced by the weak convergence of measures. Moreover, , for some , denotes the set all such that for some (hence for all) . In this paper, the set will always be equipped with the Wasserstein metric
[TABLE]
where
[TABLE]
Finally for any measure for being either or we will use the notation
[TABLE]
Product spaces will always be endowed with the product -fields. Given any smooth enough scalar function , will denote its gradient, while its Hessian. Finally, for any matrix we denote its trace.
2.2. Assumptions
In what follows we will make use of the following standing assumptions on the coefficients , the costs and the initial probability distribution of the state process .
Assumption A**.**
Let be given real numbers.
- (A.1)
* is a distribution in .* 2. (A.2)
The intensity function is measurable and bounded by some constant . 3. (A.3)
The coefficients
[TABLE]
as well as the costs and are continuous functions in all their variables. 4. (A.4)
There exists a constant such that for all , , and we have
[TABLE] 5. (A.5)
There exists some positive constants such that for each and we have
[TABLE]
Without loss of generality we can assume . 6. (A.6)
The control space is a closed subset of for some integer .
Few comments on the assumptions above are in order. Conditions (A.2), (A.3) and (A.4) ensure the existence of a unique strong solution of the SDE (8) governing the evolution of the state variable. The growth conditions postulated in Assumptions (A.4) and (A.5) are widely used in Lemma A.4 and Lemma A.5, which establish crucial compactness and continuity properties needed in the fixed point argument, hence the existence of a MFG solution when the space of actions is compact. In particular, we stress that (compared to [31]) we need to impose an extra Lipschitz continuity of the coefficients as well as of the running cost with respect to the measure, while in [31] only continuity is needed. This is due to the fact that we have to work with the Shorokhod space, where the flows of measures are not necessarily continuous in time (see the proofs of Lemmas A.4 and A.5).
Moreover, they play an important role also when extending the existence of a MFG solution to the unbounded case. This will be done by truncating the coefficients first, and then passing to the limit along the sequence of MFG solutions corresponding to the truncated data. The growth conditions on the coefficients and the costs allow to safely pass to the limit while preserving the optimality.
Remark 1*.*
Observe that we have chosen a volatility coefficient with linear growth, i.e. in [31] notation, for the sake of simplicity. The two interesting cases covered by our assumptions essentially are and , so that our toy model of illiquid interbank market of Section 6 is partially covered (see our Remark 8). More specifically the state variable therein fits the general theory, while the costs do not as they are quadratic in the control. We stress that even in the continuous paths framework in [31], quadratic costs are not allowed here in full generality (see Lacker’s example in [31, Section 7]).
2.3. Relaxed controls
Now we give some background on relaxed controls. The use of relaxed controls has a long tradition in optimal control theory, they have the advantage of linearizing the objective functionals while having very convenient compactness properties. We refer to [7] and the references therein for an overview of such an approach.
Let be a measure on the set , equipped with the product -field , such that its first marginal equals the Lebesgue measure, i.e. , for all , and its second marginal is a probability distribution over .
The set of all measures of this type and satisfying
[TABLE]
will be denoted by and each element is called relaxed control. Since for all , we can renormalize all such measures and endow with the (modified) -Wasserstein metric , given by
[TABLE]
Notice that such a metric turns into a complete separable metric space. Moreover, when the action space is compact so is .
Relaxed controls can be seen as a generalization of regular controls. Indeed, every relaxed control can be related to a unique (up to a.e. equality) measure-valued map such that . Moreover a control is said to be strict if for some -valued measurable function , where denotes the Dirac delta function at the point .
2.4. Admissible laws
Let and let be the Borel -algebra induced on by the Skorokhod norm . Moreover, we also consider the measurable space , where is the Borel -field associated with the (modified) -Wasserstein distance defined in (5). Let be endowed with the product -field. A generic element is denoted by , and, with a slight abuse of notation, we denote (resp. ) its projection onto (resp. ).
The product space will be equipped with the canonical filtration associated to , which is defined as the product filtration for , where and , which are the canonical filtrations generated, respectively, by and by .
Let be the linear integro-differential operator defined on , i.e. the set of all infinitely differentiable functions with compact support, by
[TABLE]
for each .
For each test function and for any measure , is the operator defined by
[TABLE]
where with defined as for . Notice that a.e. under the Lebesgue measure we have , where is defined similarly as the image of via the mapping given by , .
Definition 1**.**
Let be a given probability measure in . We say that is an admissible law if it satisfies the following conditions:
- (1)
; 2. (2)
; 3. (3)
* is a -martingale for each .*
We denote by the set of all admissible laws.
Remark 2*.*
Observe that, according to Definition 1 above, represents a set-valued correspondence . Moreover, for each probability distribution , is nonempty if the martingale problem (7) (with initial distribution ) admits at least one solution. The latter is guaranteed by the fact that the SDE for has one strong solution by Assumption A. Finally, is a convex set for each , i.e. any convex combination with and , is still an element of .
We conclude this part on admissible laws with the following equivalent characterization of the elements of as weak solutions to a suitable stochastic differential equation with jumps. For the notion of martingale measure and related stochastic integrals, which are used in the result below, we refer to the article [17]. Similar techniques as therein lead to the following result. We provide a sketch of the proof below.
Lemma 2.1**.**
Let be a given probability measure in . Then equals the set of all probability measures on some filtered probability space , with satisfying the usual conditions, supporting orthogonal -adapted (continuous) martingale measures on with intensity , a random Poisson measure on with compensator , and an -adapted process satisfying the following equation
[TABLE]
with initial distribution and where denotes the compensated random measure, i.e. .
Proof.
Since any probability measures satisfying the properties in the statement clearly belongs to , we only need to show the other inclusion. The latter follows as in [17, Theorem IV-2], which is in turn based on the representation of continuous martingales as integrals with respect to martingale measures in [17, Theorem III-10]. Such a result can be extended to any càdlàg local martingale by using the standard decomposition , where is a continuous local martingale and is a purely discontinuous one. Theorem III-10 in [17] can be applied to the continuous part, while Théorème 13 in [16] can be used to represent as integral with respect to some (compensated) Poisson random measure in a possibly enlarged probability space. ∎
Remark 3*.*
Observe that in Lemma 2.1, which provides an equivalent definition of the admissible laws , the measurable space and the filtration are not specified in advance. However, by definition, is an element in and the solution process has càdlàg path. Therefore, by considering the measurable map
[TABLE]
we can induce a measure on the canonical space. Furthermore, has the same law under as it does under . Therefore we will assume that means that is defined on the canonical space.
Remark 4*.*
Note that under Assumption A, in particular Lipschitz continuity and growth conditions on the coefficients , the SDE with jumps (8) admits a unique strong solution.
2.5. Relaxed mean field game
For any probability measure , the objective cost functional of the minimization problem is defined by
[TABLE]
Thus, solving the relaxed MFG with controlled jump component means finding a probability measure so that the expected cost under is minimal with respect to all admissible laws, i.e.
[TABLE]
together a suitable mean field condition. In order to give a rigorous formulation of the latter and hence complete the description of relaxed MFG, we need to introduce the set-valued map defined as
[TABLE]
where is the expected cost given by
[TABLE]
Remark 5*.*
Observe that whenever . Indeed, by definition of the set , any satisfies , hence in view of Lemma A.1. Therefore and being the conclusion holds.
We can at last give the rigorous definition of relaxed MFG solutions in a setting where the jump component is controlled as well.
Definition 2**.**
A relaxed MFG solution is a probability distribution such that , i.e. it provides a fixed point for the set-valued map
[TABLE]
A relaxed MFG solution is said to be Markovian (resp. strict Markovian) if the -marginal of , i.e. , satisfies for a measurable function (resp. for a measurable function ).
The next sections address the issues of existence of relaxed MFG solutions in both cases for bounded and unbounded coefficients as well as the existence of Markovian solutions.
3. Existence of a relaxed MFG solution in the bounded case
In this section we prove the existence of a relaxed solution of our MFG with controlled jump-diffusion dynamics under the additional assumptions of boundedness of the coefficients and compactness of the action space . Moreover, we will also see that, under some rather standard convexity property, one can provide a (strict) Markovian MFG solution. The technical results that we use in the proofs can be found in the Appendix A. The following assumption will be in force throughout the whole section.
Assumption B**.**
The coefficients are bounded and the action space is compact.
Theorem 3.1**.**
Under Assumptions A and B, there exists a relaxed MFG solution.
Proof.
According to Definition 2, a probability distribution is a relaxed MFG solution if it is a fixed point for the correspondence given by
[TABLE]
where denotes the correspondence defined in (10). In order to prove the existence of such a , we apply the Kakutani-Fan-Glicksberg fixed point theorem (see, e.g., [1, Theorem 17.55]) to a restriction of to a suitably chosen domain. Indeed, that theorem applies to convex-values correspondences with closed graph and nonempty, compact, convex domain. Therefore we look for a convex compact subset , containing , and consider the restriction of on , which we will denote by . We split the remainder of the proof into four steps.
Step 1: the map is upper hemicontinuous with non-empty compact convex values (hence it has closed graph by [1, Theorem 17.11]).
Lemma A.4 implies the joint continuity of the function defined in (11), whereas Lemma A.5 yields that is continuous and has nonempty compact values, and therefore by applying the Berge Maximum Theorem (see [1, Theorem 17.31]), we have that the correspondence is indeed upper hemicontinuous with nonempty compact values. By continuity of the map so is (cf. [1, Theorem 17.23]). Recall that, by Remark 2, is convex for each . Hence by linearity111In this context, by linearity of the map we mean linearity with respect to convex combinations, namely: for any Borel set and for all . Also , being an expected value, is linear with respect to convex combinations in the underlying probability . and continuity of and of , it follows that also and are convex sets for each .
Step 2: identification of a suitable auxiliary compact convex subset of .
Let be a given initial law. Let be a set of the probability measures in , such that:
- (i)
; 2. (ii)
, where denotes the constant appearing in equation (39) of Lemma A.1, which is independent of ; 3. (iii)
is adapted to a filtration and satisfies
[TABLE]
for and , with defined in (55) independently of .
We need to prove that is a nonempty, compact and convex subset of . First, is convex by construction: consider with , and corresponding filtrations and as in condition (iii) above. Conditions (i) and (ii) are easily satisfied by since the initial distribution is the same for all the probabilities and the constant does not depend on them. Condition (iii) for also holds with the same constant as in (12) and the filtration .
Now, we show that is relatively compact in . Observe that is tight since it satisfies the sufficient criterion for tightness [36, Lemma 3.11] 222Notice that even though Whitt’s result is stated only for , a careful inspection reveals that it holds for any as well since it relies on his Theorem 3.3 where the exponent is any .. Indeed since the constant in (12) is independent of , it suffices to choose (in the notation of [36, Lemma 3.11]) , and the tightness follows. Applying [2, Prop. 7.1.5] and the bound (ii) in the definition of we get that is bounded in . Therefore is relatively compact, hence its closure for the -Wasserstein metric is compact in and convex.
Step 3: identification of a compact subset of .
We can now define as follows
[TABLE]
Since is compact and convex and is a continuous and linear map, turns out to be a convex, compact subset of as requested in Kakutani-Fan-Glicksberg fixed point theorem.
Step 4: prove that , i.e. for all .
In order to show that the range of is contained in we prove that for each , so that for all and therefore . Let , then it satisfies conditions (i) and (ii) by construction (see Definition 2.1 and Lemma A.1). The validity of condition (iii) can be proved arguing as in Proposition A.6. Indeed, using the same notation therein, we have that for each
[TABLE]
giving the same bound as in (12) with constant , which does not depend on . Note that since is nonempty (see Remark 2) then so is .
Since all the hypotheses of the Kakutani-Fan-Glicksberg fixed point theorem are satisfied, we can conclude that there exists a fixed point for the correspondence , that is a fixed point also for , which is by definition a relaxed MFG solution. ∎
4. Existence of a relaxed MFG solution in the unbounded case
By adapting to our setting the arguments used in [31, Section 5], we extend the existence result in the previous section to the case when , and have linear growth and the action space is not necessarily compact. The main idea is to work with an approximation of these functions, namely their truncated and therefore bounded versions. Then, by a convergence argument, it must be shown that the limit of the mean field game solutions found in the truncated setting is indeed a solution for the unbounded case. The main result can be easily formulated as follows:
Theorem 4.1**.**
Under Assumption A, there exists a relaxed MFG solution.
Our proof follows closely [31, Section 5], hence instead of giving all details, which would be rather redundant, we will sketch the main arguments and we will give more details only on those parts which are jump-specific.
In preparation for the proof, let us introduce some notation. For any , let be the truncated version of the coefficients , i.e. is the pointwise projection of into the ball centered at the origin with radius in and analogously for and in their respective value spaces. Moreover, we denote the intersection of with the ball centered at the origin with radius , where we recall that is the constant appearing in Assumption (A.4) granting Lipschitz continuity as well as growth conditions on the coefficients of the state variable. Since is closed by assumption, there exists such that for all the set is nonempty and compact, hence the truncated data set satisfies Assumptions A and B and, furthermore, properties (A.4) and (A.5) are fulfilled with the same constants () independent of .
Due to Theorem 3.1 in the previous section, for all there exists a relaxed MFG solution corresponding to the data set , which can be viewed as a probability measure on since can be naturally embedded in due to the inclusion . Let be the operator defined as in (A.2) with the truncated data replacing . Now, we can define the set of admissible laws as the set of all measures such that
- (1)
; 2. (2)
; 3. (3)
for all functions , the process
[TABLE]
is a -martingale.
We also define . Due to the embedding of in , we can identify (resp. ) with the set of admissible laws (resp. optimal laws) of the MFG with data . Finally, any relaxed MFG solution for the -truncated data can be viewed as a probability with satisfying the mean field condition . We are now ready to give the proof of Theorem 4.1.
Proof of Theorem 4.1.
The proof is structured in three steps.
Step 1: the sequence of relaxed MFG solutions is relatively compact in . Furthermore, we have
[TABLE]
To prove the two uniform bounds in (13) one proceeds verbatim as in [31, Lemma 5.1]. Regarding the relative compactness of the sequence , it follows from Proposition A.6.
Step 2: For any , limit in of a convergent subsequence of , we have , where , and
[TABLE]
The proof of this step is very close to the one of [31, Lemma 5.2] except for some additional details due to the presence of jumps in the state variable. We sketch the main argument by stressing the few differences with Lacker’s proof. First of all, we have , while the bound (14) is a consequence of the LHS in (13) and a standard application of Fatou’s lemma.
To conclude this step it suffices to show that is an admissible law for the non-truncated MFG, i.e. . This boils down to showing that is a -martingale for all relying in turn on the fact that for all the process is a -martingale due to . First, notice that for all
[TABLE]
Proceeding exactly as in the proof of [31, Lemma 5.2], we obtain the following bounds for the terms containing :
[TABLE]
where is a constant such that for all , is large enough (more precisely, for ), and
[TABLE]
The same bound holds for the term with as well (recall that in our case ). Regarding the term coming from the jump part, we have the following estimates
[TABLE]
where we used the fact that only if . In a similar way as for in [31, Lemma 5.2], one can show that implies due to Assumption (A.4) and by definition of in (16). Since , taking yields that -a.s.
[TABLE]
where we set . Therefore, combining the bounds above we obtain for all and -a.s.
[TABLE]
The estimate above, together with the ones in Step 1, implies the convergence
[TABLE]
Finally, using the continuity of in , granted by Lemmas A.2 and A.3, we can conclude this step as in the proof of [31, Lemma 5.2].
Step 3: The limit point is optimal, i.e. . First of all, let with . Then, one can show that there exists a sequence of probabilities such that
[TABLE]
where denotes the objective corresponding to the truncated data. Indeed, this can be shown as in the proof of [31, Lemma 5.3] with the difference that estimates for the jump part are needed as well. They can be obtained in a standard way using Burkholder-Davis-Gundy inequalities and the Assumption (A.4). Therefore, since is optimal for each we have
[TABLE]
so that thanks to (17) and to the fact that is lower semicontinuous (cf. Lemma A.4) we obtain
[TABLE]
The optimality of follows since is arbitrary.
The proof is completed since we have exhibited an admissible law , which is optimal (Step 3) and satisfies the mean field condition (Step 2). Hence is a relaxed MFG solution. ∎
5. Existence of a Markovian MFG solution
The following theorem extends to our setting the results in [31, Theorem 3.7], showing that for any element , for , there exists a Markovian control with a lower cost than . We need to introduce one further assumption:
Assumption C**.**
For all , the subset
[TABLE]
of is convex.
Remark 6*.*
The additional assumption is a classical one in the relaxed approach to optimal control. We just notice that the toy model in Section 6 fulfils it. Slightly more generally, any model where is convex subset of , is affine in and independent of , (where have constant entries), is a constant matrix and is convex in , satisfies Assumption C.
Now, we can state and prove the main result of this section.
Theorem 5.1**.**
Under Assumption A, let be a relaxed MFG solution. Assume that the coefficients satisfy the following property: there exist some constant and some continuous function such that
[TABLE]
for all . Then there exists a Markovian MFG solution. Moreover, if also Assumption C holds, there exists a strict Markovian MFG solution.
Proof.
Let be a relaxed MFG solution. In order to show the existence of a Markovian MFG solution, first we exhibit a (possibly different) probability measure , with , such that the following holds:
Property M**.**
- (M.1)
; 2. (M.2)
* for all ;* 3. (M.3)
* for a measurable function .*
Condition (M.1) implies that under this new probability measure the expected cost is minimized with respect to all the admissible laws, i.e. , whereas (M.2) assures that the marginals of are preserved under the new probability . Condition (M.3) guarantees the Markovianity of .
To obtain a Markovian MFG solution, we apply the mimicking results in [29, Corollary 4.9]. In order to do that, we need to check that satisfies properties (i)-(vi) in [29, pp. 611-612].333The results in [29] are established for a time-homogeneous generator. However, results in Section 4.2 of that article can be applied to our setting by suitably choosing the space of controls in [29] as , the relaxed control being as , for all . Let us note that assumptions (i)-(iii) are easily checked, while our assumption (19) readily implies (v). It remains to show that (iv) (positive maximum principle, cf. [18], page 165) is also fulfilled. Let and be such that , then we have
[TABLE]
which is negative for all since for all . Therefore, we can apply the very general results established in [29, Corollary 4.9] granting the existence of a process , defined on some filtered probability space , and of a measurable function such that
[TABLE]
is an -adapted -martingale for all . Moreover
[TABLE]
Let . Hence and it satisfies conditions (M.2) and (M.3) by construction. Moreover we have that
[TABLE]
where equality (a) follows from the equivalent distribution of the processes involved, i.e. for all , equality (b) is provided by (20) and equality (c) is just the tower property of conditional expectations. Therefore satisfies condition (M.1) so that . By proceeding as in the proof of [31, Corollary 3.8], we can easily conclude that with , i.e. is a Markovian relaxed MFG solution.
For the existence of a strict Markovian MFG solution, assume that the set defined in (18) is convex for all . Hence by applying the same arguments as in the second part of the proof of Theorem 3.7 in [31], we get the existence of a measurable function such that . ∎
Remark 7*.*
It is worth noticing that property (19) is guaranteed, for instance, whenever the coefficients are bounded in all their variables. In order to prove the existence of a Markovian MFG solution beyond that assumption, one could extend the mimicking approach taken in [8] to a jump-diffusion model like ours. However, such an approach, which consists in solving the problem first in discrete time and then pass to the limit along finer and finer partition, is quite a delicate one and it seems to rely heavily on path continuity. Finally, analogue mimicking results have been proved in [5] for jump-diffusions dynamics. Unfortunately, their assumptions on the jump measure are too strong for our setting and not so easy to check when compared to Kurtz and Stockbridge [29] results.
6. Application: a toy model for an illiquid inter-bank market
In this section we consider a symmetric -player game with controlled jumps fitting the general framework studied in the previous sections. The model shows the economic relevance of the general MFG considered in the previous section and, furthermore, it is simple enough to allow for explicit computations of the equilibrium in -player case as well as when . The convergence to the MFG solution is also studied. The model is in our opinion an interesting variation of the systemic risk model proposed by Carmona and coauthors in [13], parsimoniously modified for including some illiquidity phenomenon.
More precisely, we consider banks which lend to and borrow from a central bank in an interbank borrowing market. The aim of each bank is to keep its monetary reserves away from critical levels. After the financial crisis of 2008, banks are required by international regulation to store an adequate amount of liquid assets, cash, to manage possible market stress. At the same time, holding too much cash is costly for banks because of its low return. We will use the average monetary reserves of the system as the benchmark for the reserves’ level of each bank and so the cost function will penalize every deviation from this mean value as in [13].
The main difference with the model in [13] is that therein the banks can control their reserves continuously over time, i.e. they can choose a rate at which they lend or borrow money, while in the present paper the interbank market is illiquid. This means that the banks can borrow or lend money only at some exogenously given instants, that are modelled as jump times of a Poisson process with a certain intensity . The intensity can be viewed as an health indicator of the whole system: for instance, when the intensity is low, the probability of being able to control the reserves in the next instant is also low, the system is becoming very illiquid. This kind of situation typically arises during financial crises.
Let us turn to the mathematical description of the model. Let denote the monetary log-reserves of bank , whose evolution is given by
[TABLE]
where is an -dimensional Brownian motion and is an -dimensional Poisson process, each component with a constant intensity . We assume that the system starts at time from i.i.d. random variables such that for all , and that initial conditions, Brownian motions and Poisson processes are all independent.
The main difference with respect to [13] is that the control appears only in the jump component, hence the bank cannot change its reserves continuously over time but only at the jump times of the Poisson process . Notice that while the banks can borrow/lend money at different times, being the ’s independent, the jump intensity is the same for each bank.
We denote by the empirical mean of the monetary log-reserves, i.e.
[TABLE]
hence the dynamics of bank reserves can be rewritten in the mean field form as
[TABLE]
and the dynamics of the average state, , can be expressed as
[TABLE]
The dynamics of the monetary reserves are coupled through their drifts by means of the average state of the system as in [13]. Let and . Bank controls the size of the jumps (if any) at time , , in order to minimize a cost functional given by
[TABLE]
where and are quadratic functions as in [13], namely the running cost function is given by
[TABLE]
where and the terminal cost function is
[TABLE]
Note that both player ’s cost functions depend on the other players’ strategies through the mean, i.e. and . The parameter is to control the incentive to borrowing or lending: the bank will want to borrow (i.e. ) if is smaller than the empirical mean and lend (i.e. ) if is greater than the mean . We have taken the shape of the objective functions from [13] (see therein for more details on the financial interpretation of such cost functions).
The parameters and are strictly positive, so that the quadratic terms in both costs penalize departure from the average. Moreover we assume that
[TABLE]
which guarantees the convexity of in both variables.
Remark 8*.*
Observe that the interbank illiquid model of this section fits the general theory developed in the previous sections only in the case as running costs which are quadratic in the measure are ruled out by Assumption (A.5). However, since one of the main goal of this section is giving an explicit characterization of a Nash equilibrium (in the -player game) and an MFG solution (for the limit problem), we will be using other techniques based on forward/backward SDEs, allowing to treat the case as well without any additional effort. Indeed, the approach followed in the previous sections is very powerful in giving abstract existence results, while not very suitable to compute the equilibria.
In the next sub-sections, we will compute a Nash equilibria in open-loop as well as closed-loop form (see Remark 9 below). Moreover, we will study the corresponding MFG when , whose solution will be strict Markovian. Our approach is heavily based on BSDE and it follows closely the one in [13]. Nonetheless, we give all details at least in the open-loop case for reader’s convenience. Finally, we will conclude with some simulations and some financial comments on the model.
6.1. Nash equilibrium in open-loop strategies
We are searching for a Nash equilibrium among all admissible open-loop strategies , that are real-valued predictable processes satisfying the integrability condition \mathbb{E}\big{[}\int_{0}^{T}|\gamma^{i}_{t}|dt\big{]}<\infty for all . We denote the set of all such admissible controls by .
For the problem of the -th bank, we consider the Hamiltonian
[TABLE]
defined by
[TABLE]
The adjoint processes , and are defined as the solutions of the following BSDEs with jumps
[TABLE]
for .
In order to find a candidate for the optimal control , it suffices to minimize the Hamiltonian with respect to , leading to
[TABLE]
In order to prove that is a Nash equilibrium we show that when the other players are following , then is the best response of player . To do that, we need to solve the BSDE with jumps (23). It is natural to consider the ansatz
[TABLE]
where is the Kronecker delta and is a deterministic scalar function of class , satisfying the terminal condition , so that . Differentiating the ansatz, we have that solves the following SDE:
[TABLE]
where denotes the average value of all control processes. Comparing (23) and (26) under the ansatz (25) yields
[TABLE]
for all .
Moreover, by (25) and (28), it follows that the candidate given in solves
[TABLE]
and therefore the candidate optimal best response turns out to be
[TABLE]
To complete the description of , we need to provide a characterization of the function . From the definition of the Hamiltonian in (6.1) it follows that the drift coefficient in the SDE (23) for as adjoint process is given by
[TABLE]
and therefore, under the ansatz (25) and the related implications, equation (23) becomes
[TABLE]
Since both equations (26) and (6.1) hold simultaneously, we have that the following equality holds
[TABLE]
and this implies that must solve the ODE
[TABLE]
with terminal condition . For more details on how such an ODE can be solved at least in implicit form we refer the reader to Appendix B.
6.2. Approximate Nash equilibria
We conclude the theoretical study of our model by computing the solution of the MFG obtained from the -player game as . In particular, we will see that the -player Nash equilibria computed in the previous sections tend towards the MFG solution.
Let be a given càdlàg function, which represents a candidate for the limit of when . Consider the following one-player minimization problem
[TABLE]
subject to the dynamics
[TABLE]
where is a standard Brownian motion and a Poisson process with intensity . The processes and are assumed to be independent. Then, for a given càdlàg function , the optimal control is chosen in the class of admissible controls, which are real-valued predictable processes such that . Moreover, in order for to be a MFG solution, the mean field condition has to be satisfied, i.e. for a.e. .
As in the -player case, we solve the problem via the Pontryagin maximum principle. In this case, the Hamiltonian is given by
[TABLE]
and the first order condition implies that the candidate optimal strategy is
[TABLE]
The corresponding adjoint forward-backward equations are as follows: the forward process solves
[TABLE]
while the BSDE for the adjoint processes is given by
[TABLE]
Note that this time the optimization problem is one-dimensional and therefore , and are real-valued stochastic processes.
Since equations (33)-(34) are linear we can firstly solve for the expectations and . By taking the expectation in both sides of equations (33) and (34) and using the martingale property of the integrals with respect to the Brownian motion and the compensated Poisson process, we have
[TABLE]
which becomes
[TABLE]
under the mean field condition .
In order to solve the BSDE (34), we make the same ansatz as before, that is , which by identification implies
[TABLE]
for some deterministic function of class with final value , so that as required by the BSDE (34). Notice that these processes can be obtained by those in (25)-(27)-(28) by letting .
If we plug the ansatz in the BSDE (34), we find that the process solves the following SDE
[TABLE]
and, at the same time, by differentiating the ansatz , we have that
[TABLE]
where once again we used the identity and the equalities , which are due to the fact that both processes are proportional to . By matching the two SDEs for , we find that solves the following Cauchy problem
[TABLE]
and therefore the optimal control turns out to be
[TABLE]
Observe that this can also be obtained as limit for of the Nash equilibrium computed before in the -player game (see equations (29) and (31)). Figure 1 displays the behaviour of , solution of the ODE (31), for different values of players’ number . As increases, the graph of quickly converges to the solution we found in the game with an infinite number of players, given in equation (36).
Remark 9*.*
In the closed-loop case, each player has complete information on the private states of the other participants, and therefore his best reply is chosen among all Markovian strategies of the form , , for some real-valued function . Following the same approach as before, based on the Pontryagin maximum principle and BSDEs with jumps, we obtain a Nash equilibrium in closed-loop form given by
[TABLE]
where the function , analogously as in the open loop case, solves a suitable ODE with the same terminal condition as before, i.e. . We do not develop further on this since, qualitatively speaking, the two equilibria, open loop and closed loop, produces the same kind of behaviour, which is illustrated by some numerical experiments in the next section in the open loop case. Moreover, in the closed loop case too we have convergence of the -player Nash equilibrium above towards the MFG solution (37) as . Therefore, we have decided to focus on the differences between our model and [13] which are due to illiquidity.
6.3. Simulations
Previous computations in Section 6.1 and Remark 9 shows that the open-loop optimal strategies (see equation (29)) have the form
[TABLE]
for some deterministic function solving a suitable ODE. In this section, we examine the dependence on the parameters of the model (in particular, ) of the open-loop strategies and log-reserves at the Nash equilibrium computed before. The same analysis applied to the closed-loop case would lead to the same qualitative conclusions (cf. Remark 9).
First, figure 2 shows a typical scenario of our model. It can be observed that the optimal strategy is such that at each jump time for the reserves of a given bank, i.e. when the bank can borrow or lend money, the reserves move closer to the average level of the reserves in the system. This is clearly expected due to the fact that such an average is the benchmark for each bank. There are two reasons why they do not match exactly and the second one is linked to presence of jumps in the model. First, reaching can be too costly. Second, the choice of each bank, say bank , at time depends on the difference between its reserve and the average reserves immediately before time , with the aim of reducing such difference. But at the same time, might have a jump at time , as a consequence of the jump in the reserves of bank . So even if we could have .
Now, let us focus on the variation of the equilibrium strategies due to changes in the intensity of the Poisson processes, which we recall it represents the liquidity parameter of the inter-bank market. It is more convenient for the analysis to consider the function defined as
[TABLE]
so that the open-loop optimal strategy can be expressed as , for . Hence, whenever one of the Poisson processes, say , jumps, bank would modify its reserves by an amount which is proportional to the difference just before the jump, with a proportionality factor .
Then, routine computation shows that solves the following ODE:
[TABLE]
with final value . Notice that the terminal value is increasing in .
All our numerical experiments revealed that such a monotonicity behaviour propagates to the whole time interval, i.e. the proportionality factor is increasing in for all . Here, in figures 3 to 6, we show only the behaviour of as function of time with , , , and more importantly for different values of . Moreover, observe that (see Fig 5) the final value depends on the parameter whenever is different than zero.
In the Nash equilibrium we have found, when is small, hence the interbank market is very illiquid in the sense that banks will have (in expectation) very few possibilities to change their reserves, the reserves will change very little proportionally to . On the other hand, when is large, so that in expectation banks will have many occasions to lend/borrow money from the central bank, changes in their reserves will be very big proportionally to . Therefore, focusing on the first case, we notice that instead of compensating the lack of liquidity ( small), banks seem to amplify it by borrowing and lending very little.
Another interesting feature one can notice from the figures is that when is small, the proportionality factor is increasing in time. When the market is very illiquid, there are very few possibility for the banks to change their reserves during the time period, so that when the maturity is approaching, the banks knowing that they are running out of time to move their reserves closer to the average reserve , they amplify their efforts, whence an increasing . An analogue interpretation can be provided for the opposite situation of a time-decreasing when is large.
7. Conclusions
In this paper, we have generalised the controlled martingale problem approach developed in [31] to MFGs with a multidimensional state variable following a jump-diffusion dynamics, where drift, diffusion coefficient and jump sizes are controlled. Some extra Lipschitz continuity in the measures for costs and coefficients was needed, due to presence of jumps in the flows of measures. Slightly more precisely, we have established existence of relaxed MFG solutions in the unbounded case as well of strict Markovian solutions under a sort boundedness assumption on the data of the problem. Moreover, using forward/backward SDE methods, we have studied a toy model for an illiquid interbank model in the spirit of [13]. We have provided an explicit Nash equilibrium for the -player game and the corresponding solution for the limit MFG and performed some numerics illustrating the properties of the equilibrium. Many further extensions are possible, here we mention the one that appears as the most appealing in view of applications: considering in the state variable dynamics a jump process with a compensator depending on the state, measure as well as strategies would pave the road to models whose behaviour of the players can affect directly or indirectly the frequency of jumps arrival. In this case, one would have to deal with non-standard SDE with jumps (as, e.g., in [25]), so that a different approach would need to be used. Hence, we have decided to postpone this further generalisation to future research.
Acknowledgements
L. Campi and L. Di Persio would like to thank the FBK (Fondazione Bruno Kessler) - CIRM (Centro Internazionale per la Ricerca Matematica) for funding their “Research in Pairs” project “McKean-Vlasov dynamics with Lévy noise with applications to systemic risk” for the period November 1-8, 2015, with the active participation of C. Benazzoli as well. Moreover L. Di Persio and C. Benazzoli would like to thank the “Gruppo Nazionale per l’Analisi Matematica, la Probabilità e le loro Applicazioni (GNAMPA)” for funding the projects “Stochastic Partial Differential Equations and Stochastic Optimal Transport with Applications to Mathematical Finance”, coordinated by L. Di Persio (2016), and “Metodi di controllo ottimo stocastico per l’analisi di problem debt management”, coordinated by A. Marigonda (2017). Both such projects supported the research object of the present paper.
Appendix A Intermediate results for the existence of MFG solutions
This appendix collects useful results which were used in the proof of Theorems 3.1 and 5.1.
A.1. A-priori estimates for the state variables
In the following, we will write as a shortcut for . The next lemma is fairly standard (compare, e.g., Lacker’s Lemma 4.3), nonetheless with give all details for reader’s convenience.
Lemma A.1**.**
Let . Under Assumption A, there exists a constant such that for any and we have
[TABLE]
As a consequence, . Furthermore, if , we have
[TABLE]
Proof.
Consider a given probability measure and a related admissible law . By Lemma 2.1, there exists a constant such that
[TABLE]
In what follows, the value of the constant may change from line to line, however we will indicate what it depends on. By Jensen’s inequality and using the growth conditions on in Assumption A, we have for all
[TABLE]
By the Burkholder-Davis-Gundy inequality (see [35, Theorem 48, Ch. IV.4]), the expected supremum of the Itô integral in (40) can be bounded as follows
[TABLE]
Lastly, we have to consider the integral in (40). Using the Burkholder-Davis-Gundy inequality again, see [35, Theorem 48, Ch. IV.4], it follows that
[TABLE]
and since has linear growth in uniformly in , it is found that
[TABLE]
Observe that the previous two bounds hold if . On the other hand, if the conclusion still holds since and then arguing as before.
Combining all the previous estimates, we get that there exists a positive constant such that
[TABLE]
Hence the estimates (39) follows from an application of Gronwall’s lemma. As a consequence, when , we have
[TABLE]
so that another application of Gronwall’s lemma gives the second estimate. ∎
A.2. Some continuity results
The following result is an extension of Corollary A.5 in [31], where the space of continuous functions is replaced with the Skorokhod space of all right-continuous with left limit functions taking values in some metric space .
Lemma A.2**.**
Let be a complete separable metric space. Let be a jointly measurable function in all its variables and jointly continuous in for each . Assume that for some constant and some one of the following two properties is satisfied
- (1)
, for all ; 2. (2)
, for all .
Hence, if (1) (resp. (2)) is fulfilled, the following function
[TABLE]
is upper semicontinuous (resp. continuous).
Proof.
We prove first that the function
[TABLE]
is jointly continuous. In order to do that we adapt and expand the arguments provided in the proof of [30, Lemma 1.5]. Using [31, Prop. A.1] it suffices to show that when in as , we have for all continuous functions such that , for all . We use the notation for the measure associated to as in (42).
By definition of Skorokhod topology, there exists a sequence of time changes , i.e. strictly increasing continuous functions mapping onto , such that
[TABLE]
Let for all and any Borel set . Since is tight, for all there exists a compact set such that . Clearly, the same property holds for as well as . Moreover, being continuous, we have
[TABLE]
Therefore we can show
[TABLE]
Indeed, letting denote the complement of , we have
[TABLE]
where the second summand above can be made arbitrarily small, uniformly in , since both integrands are in their respective spaces due to the polynomial growth of the function . Furthermore, the first summand can be handled as follows:
[TABLE]
where the first summand goes to zero thanks to (44), while the second one does too due to in the (modified) -Wasserstein distance defined in (5).
Finally, the upper semicontinuity (resp. continuity) of the function (41) is obtained by applying [31, Corollary A.4] (resp. [31, Lemma A.3]). ∎
Lemma A.3**.**
Let a convergent sequence to . Then, for any
[TABLE]
Proof.
First, recall that convergence with respect to implies also weak convergence, hence Skorokhod’s representation theorem ensures that there exist -valued random variables and , defined on a common probability space such that
[TABLE]
with
[TABLE]
where denotes any metric consistent with Skorokhod -topology (see, e.g., [18, Chp. 3, Sect. 5]). By the triangular inequality, we obtain
[TABLE]
and
[TABLE]
where , which does not depend on . Then, since the convergence in implies convergence for a.e , by applying a slightly more general version of dominated convergence (e.g. [27, Theorem 1.21]), we have
[TABLE]
Then, by applying dominated convergence once again in view of
[TABLE]
we can conclude that
[TABLE]
The two previous lemmas together with the assumptions of Lipschitz continuity in of and the growth conditions of both and (cf. Assumption A), yields the following continuity properties of the objective functional in (11).
Lemma A.4**.**
The operator defined in (11) is upper semi-continuous under Assumption A. Moreover, if Assumption B is also in force, the operator is continuous.
Proof.
The upper semi-continuity is an easy consequence of Lemmas A.2 and A.3, while the continuity follows from the compactness of . More precisely, Lemma A.2 is used to prove the semi-continuity of in , while the one in is granted by Lemma A.3. ∎
We conclude this part of the appendix with establishing a continuity property for the set-valued map , giving the set of all admissible laws corresponding to measures , as given in Definition 1.
Lemma A.5**.**
Under Assumptions A and B, the set-valued correspondence given in Definition 1 is continuous with relatively compact image in .
Proof.
We split the proof in three steps.
Step 1: the image of is relatively compact. Using [31, Lemma A.2], we prove that is relatively compact in by proving that and are relatively compact sets in and respectively. The compactness of in equipped with the -Wasserstein metric follows from the compactness of , and therefore of . On the other hand, the compactness of follows from Proposition A.6.
Step 2: is upper hemicontinuous. In order to show that is upper hemicontinuous, we use the closed graph theorem (see, e.g., [1, Theorem 17.11]) by proving that is closed, i.e. for each and for each convergent sequence with , it holds that . According to Definition 1, we have to show that and that , defined as in (7) on the canonical probability space , is a -martingale for all .
The first condition is satisfied since convergence in probability implies convergence in distribution and therefore , whose law is given by .
Regarding the second condition, let be such that , and let be a continuous, -measurable, bounded function. By the martingale property of , we have for all . Hence to prove that is a -martingale for all it suffices to prove
[TABLE]
First of all, note that the functional can be bounded, uniformly on . Indeed, by Taylor’s theorem, we have
[TABLE]
where , , . Therefore
[TABLE]
The global continuity of the functions , , and guarantees that also the function is for all test functions . Moreover, since, according to Assumptions A and B, all such coefficients are bounded, and , and are Lipschitz continuous with respect to the variable uniformly in , then also is Lipschitz with respect to . Indeed
[TABLE]
where . Therefore we can conclude that
[TABLE]
is continuous. Indeed, the continuity with respect to is provided by Lemma A.2, whereas the continuity with respect to is an application of Lemma A.3 since by the previous computation it follows that
[TABLE]
Therefore for each continuous bounded function
[TABLE]
and moreover, since and is bounded and continuous, we have that
[TABLE]
This means
[TABLE]
which implies , i.e. is a martingale and .
Step 3: is lower hemicontinuous. We are left with proving that is also lower hemicontinuous. Let and be a sequence in the same space converging to . Then, for every we need to exhibit a sequence such that in . Let be a filtered probability space, orthogonal continuous -martingale measures on with intensity and a Poisson random measure on with intensity measure , where is a bounded function of time. Let be the unique strong solution of the following SDE:
[TABLE]
with initial condition . Lipschitz continuity and growth conditions on the coefficients grant existence and uniqueness of strong solution of equation (46). Then by uniqueness we have . For each , define as the process solving
[TABLE]
We want to show that
[TABLE]
Let . Then, for a suitable positive constant we have
[TABLE]
Since is Lipschitz continuous in and , we have that
[TABLE]
Regarding the stochastic integral in (48), we can apply Jensen and Burkholder-Davis-Gundy inequalities to obtain
[TABLE]
Lastly, applying the Burkholder-Davis-Gundy inequality to the Poisson measure integral in (48) combined with the boundedness of the intensity function , we have that
[TABLE]
Notice that the last integral in all estimates above converges to zero as due to Lemma A.3.
Therefore, combining the previous results and applying the Gronwall’s inequality, we conclude that .
By defining we have found the desired sequence such that in . To conclude, we have to show that is indeed in . satisfies condition (1) in Definition 1 by construction, and condition (3) can be checked by applying Itô’s formula to , for each . ∎
A.3. A compactness result
Proposition A.6**.**
Let be a given positive constant and let . Let be a given initial law. Define as the set of laws of -valued random variables defined on some filtered probability space with , such that:
- (1)
, where are orthogonal -martingale measures on with intensity and is a random measure with intensity , where the function is measurable and bounded; 2. (2)
; 3. (3)
* are measurable functions such that*
[TABLE]
for all ; 4. (4)
.
Hence is relatively compact in .
Proof.
Since is a Polish space under metric, Prokhorov’s theorem (cf. [6, Theorem 5.1, Theorem 5.2]) ensures that a family of probability measures on is relatively compact if and only if it is tight. In order to prove the tightness, we will use the Aldous’s criterion provided in [6, Theorem 16.10]. By proceeding as in the previous Lemma A.1, there exists a constant (uniform in ) such that
[TABLE]
which implies that is bounded by a constant which depends upon only through the initial distribution , which is the same for all laws in . Hence, we have
[TABLE]
Then we are left with proving that
[TABLE]
where denotes the family of all stopping times with values in a.s. For each and each stopping time , there exists a constant such that
[TABLE]
Without loss of generality, we may assume that . By applying Burkholder-Davis-Gundy inequality, the property (51) and the boundedness of the intensity function as in the proof of Lemma A.1, there exists a constant such that, for all and all stopping times , we have
[TABLE]
From this point onwards we can proceed as in the proof of [31, Proposition B.4], which gives
[TABLE]
Hence Aldous’ criterion applies and the proof is completed.∎
Appendix B Details on the computation of in equation (31)
We want to solve the final value Cauchy problem
[TABLE]
where
[TABLE]
Then, if we perform the time reversal we can consider the equivalent Cauchy problem
[TABLE]
Moreover we look for a solution whose graph belongs to the domain , where , so that the optimal strategy given in (29) is well defined for all . Notice that if and , which we can safely assume to rule out trivialities.
Note that since and are continuous functions (in the smaller domain ) then we can apply standard results for existence and uniqueness. Therefore the function solves the following Cauchy problem
[TABLE]
for appropriate values of which do not depend on time :
[TABLE]
Let
[TABLE]
Then solves the following ODE:
[TABLE]
where
[TABLE]
Lastly, by choosing
[TABLE]
we have that
[TABLE]
where
[TABLE]
Equation (56) can be solved in parametric form as
[TABLE]
where is a suitable constant. Therefore,
[TABLE]
Then, computing the integral in the definition of gives
[TABLE]
Finally, in order to compute explicitly, we need to invert this relation and plug it into .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] C.. Aliprantis and K. Border “Infinite dimensional analysis: a hitchhiker’s guide” Springer Science & Business Media, 2006
- 2[2] L. Ambrosio, N. Gigli and G. Savaré “Gradient flows: in metric spaces and in the space of probability measures” Springer Science & Business Media, 2008
- 3[3] L. Andreis, P. Dai Pra and M. Fischer “Mc Kean-Vlasov limit for interacting systems with simultaneous jumps” In ar Xiv preprint ar Xiv:1704.01052 , 2017
- 4[4] C. Benazzoli, L. Campi and L. Di Persio “ ε 𝜀 \varepsilon -Nash equilibrium in stochastic differential games with mean-field interaction and controlled jumps” In ar Xiv preprint ar Xiv:1710.05734 , 2017
- 5[5] A. Bentata and R. Cont “Mimicking the marginal distributions of a semimartingale” In ar Xiv preprint ar Xiv:0910.3992 , 2009
- 6[6] P. Billingsley “Convergence of probability measures” John Wiley & Sons, 2013
- 7[7] Vivek S Borkar “Controlled diffusion processes” In Probability surveys 2 The Institute of Mathematical Statisticsthe Bernoulli Society, 2005, pp. 213–244
- 8[8] G. Brunick and S. Shreve “Mimicking an Itô process by a solution of a stochastic differential equation” In The Annals of Applied Probability 23.4 Institute of Mathematical Statistics, 2013, pp. 1584–1628
