Nonzero-Sum Submodular Monotone-Follower Games: Existence and Approximation of Nash Equilibria
Jodi Dianetti, Giorgio Ferrari

TL;DR
This paper proves the existence of Nash equilibria in complex N-player stochastic games with singular controls, introduces approximation methods for these equilibria, and provides algorithms for their computation.
Contribution
It establishes Nash equilibrium existence in non-Markovian monotone-follower games and introduces a convergent approximation scheme with practical algorithms.
Findings
Existence of Nash equilibria in non-Markovian settings.
Convergence of approximate Nash equilibria to the original game.
Algorithm for computing Nash equilibria in monotone-follower games.
Abstract
We consider a class of N-player stochastic games of multi-dimensional singular control, in which each player faces a minimization problem of monotone-follower type with submodular costs. We call these games "monotone-follower games". In a not necessarily Markovian setting, we establish the existence of Nash equilibria. Moreover, we introduce a sequence of approximating games by restricting, for each natural number n, the players' admissible strategies to the set of Lipschitz processes with Lipschitz constant bounded by n. We prove that, for each n, there exists a Nash equilibrium of the approximating game and that the sequence of Nash equilibria converges, in the Meyer-Zheng sense, to a weak (distributional) Nash equilibrium of the original game of singular control. As a byproduct, such a convergence also provides approximation results of the equilibrium values across the two classes of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEconomic theories and models · Stochastic processes and financial applications · Game Theory and Voting Systems
Nonzero-Sum Submodular Monotone-Follower Games:
Existence and Approximation of Nash Equilibria
Jodi Dianetti
and
Giorgio Ferrari
J. Dianetti: Center for Mathematical Economics (IMW), Bielefeld University, Universitätsstrasse 25, 33615, Bielefeld, Germany
G. Ferrari: Center for Mathematical Economics (IMW), Bielefeld University, Universitätsstrasse 25, 33615, Bielefeld, Germany
Abstract.
We consider a class of -player stochastic games of multi-dimensional singular control, in which each player faces a minimization problem of monotone-follower type with submodular costs. We call these games monotone-follower games. In a not necessarily Markovian setting, we establish the existence of Nash equilibria. Moreover, we introduce a sequence of approximating games by restricting, for each , the players’ admissible strategies to the set of Lipschitz processes with Lipschitz constant bounded by . We prove that, for each , there exists a Nash equilibrium of the approximating game and that the sequence of Nash equilibria converges, in the Meyer-Zheng sense, to a weak (distributional) Nash equilibrium of the original game of singular control. As a byproduct, such a convergence also provides approximation results of the equilibrium values across the two classes of games. We finally show how our results can be employed to prove existence of open-loop Nash equilibria in an -player stochastic differential game with singular controls, and we propose an algorithm to determine a Nash equilibrium for the monotone-follower game.
Keywords: nonzero-sum games; singular control; submodular games; Meyer-Zheng topology; Pontryagin maximum principle; Nash equilibrium; stochastic differential games; monotone-follower problem.
AMS subject classification: 91A15, 06B23, 49J45, 60G07, 91A23, 93E20.
1. Introduction
We consider a class of stochastic -player games over a finite time-horizon in which each player, indexed by , faces a multi-dimensional singular stochastic control problem of monotone-follower type. On a complete probability space, consider a multi-dimensional càdlàg (i.e., right-continuous with left limits) process and, for , multi-dimensional continuous semimartingales with nonnegative components. Denote by the right-continuous extension of the filtration generated by and , augmented by the sets of zero probability. We call monotone-follower game the game in which each player is allowed to choose a multi-dimensional control in the set of admissible strategies
[TABLE]
in order to minimize the cost functional
[TABLE]
where . Here and and are suitable nonnegative convex cost functions.
Next, we introduce a sequence of approximating games with regular controls in the following way. For each , define the -Lipschitz game as the game in which players are restricted to pick a Lipschitz control in the set of admissible -Lipschitz strategies
[TABLE]
in order to minimize the cost functionals .
Our main contributions are the following.
- (1)
Under submodularity conditions on the functions and , we establish the existence of Nash equilibria for the monotone-follower and the -Lipschitz games. 2. (2)
We show connections across these two classes of games. In particular:
- (i)
Any sequence obtained by choosing, for each , a Nash equilibrium of the -Lipschitz game is relatively compact in the Meyer-Zheng topology, and any accumulation point of this sequence is the law of a weak Nash equilibrium of the monotone-follower game (see Definition 4 below). That is, any accumulation point is a Nash equilibrium on a suitable probability space on which are defined processes and such that their joint law coincides with the joint law of and . 2. (ii)
The -dimensional vector whose components are the expected costs associated to any weak Nash equilibrium obtained through the previous approximation is a Nash equilibrium payoff. Moreover, for each , there exist large enough and a Nash equilibium of the -Lipschitz game which is an -Nash equilibrium of the monotone-follower game.
Furthermore, we provide applications of our results to deduce existence of Nash equilibria for a class of stochastic differential games with singular controls and non-Markovian random costs. Also, in the spirit of [47], we construct an algorithm to determine a Nash equilibrium of the monotone-follower game.
To the best of our knowledge, general existence and approximation results for Nash equilibria in -player non-Markovian stochastic games of multi-dimensional singular control appear in this paper for the first time.
1.1. Background literature.
A singular stochastic control problem appears for the first time in [6], where the problem of controlling the motion of a spaceship has been addressed. Later on, examples of solvable singular stochastic control problems have been studied in [7].
Singular stochastic control problems of monotone-follower type have been introduced and studied in [31] and [33]. A monotone-follower problem is the problem of tracking a stochastic process by a nondecreasing process in order to optimize a certain performance criterion. Since then, this class of problems has found many applications in economics and finance (see [5], [15], [18], among many others), operations research (see, e.g., [24] and [28]), queuing theory (see, e.g., [35]), mathematical biology (see, e.g., [1]), aerospace engineering (see, e.g., [41]), and insurance mathematics (see [39], among others).
The literature on singular stochastic control problems experienced results on existence of minima (or maxima) (see [21] and [29], among others), characterization of the optimizers through first order conditions (see, e.g., [4], [5] and [13]), as well as connections to optimal stopping problems (see, e.g., [33] or the more recent [10]) and to constrained backward stochastic differential equations [11]. We also mention the recent work [37], as their version of the monotone-follower problem is the single-agent version (in weak formulation) of our game.
The number of contributions on games of singular controls is still quite limited (see [19], [22], [26], [27], [36], [45], [51]), although these problems have received an increasing interest in the recent years. We briefly discuss here some of these works. In [45] it is determined a symmetric Nash equilibrium of a monotone-follower game with symmetric payoffs (i.e., the cost functional is the same for all players), and it is provided a characterization of any equilibria through a system of first order conditions. The same approach is also followed in [22] for a game in which players are allowed to choose a regular control and a singular control. Such a problem has been motivated by a question arising in public economic theory. A general characterization of Nash equilibria through the Pontryagin Maximum Principle approach has been investigated in the recent [51] for regular-singular stochastic differential games. Connections between nonzero-sum games of singular control and games of optimal stopping have been tackled in [19]. It is also worth mentioning some recent works on mean field games with singular controls (see [23] and [25]) and their connection to symmetric -player games (see [27]). A complete analysis of a Markovian -player stochastic game in which players can control an underlying diffusive dynamic through a control of bounded-variation is provided in the recent [26]. There, the authors derive a Nash equilibrium by solving a system of moving free boundary problems. General existence results for stochastic games with multi-dimensional singular controls and non-Markovian costs were, however, missing in the literature, and this has motivated our study.
1.2. Our results
We now provide more details on our results by discussing the ideas and techniques of their proofs.
The existence results. Going back to the seminal ideas of J. Nash, a typical way to prove existence of Nash equilibria is to show existence of a fixed point for the best-reply map. In the spirit of [47], our strategy to prove existence of Nash equilibria in the monotone-follower game and in the -Lipschitz game is to exploit the submodular structure of our games in order to apply a lattice-theoretical fixed point theorem: the Tarski’s fixed point theorem (see [46]). We proceed as follows. We first endow the spaces of admissible strategies and (defined above) with a lattice structure. While the lattice is complete, the same does not hold true for . To overcome this problem, we show that, under suitable assumptions, each “reasonable” strategy lives in a bounded subset of , and we restrict our analysis to this subset. We then prove that the best-reply maps are non empty. To accomplish this task in the -Lipschitz game, we employ the so-called classical direct method. Indeed, since each strategy is forced to be -Lipschitz, then the sequence of time-derivatives of any minimizing sequence is bounded in . Hence, Banach-Saks’ theorem, together with the lower semi-continuity and the convexity of the costs, allows to conclude existence of the minima. On the other hand, for the monotone-follower game we use some more recent techniques already employed to prove existence of optimizers in singular stochastic control problems (see [5]). Assuming a uniform coercivity condition on the costs (which has to be, anyway, necessarily satisfied in any Nash equilibria; see Remark 2.5 below) we can use a theorem by Y.M. Kabanov (see Lemma 3.5 in [30]) which gives relative sequential compactness, in the Cesàro sense, of any minimizing sequence. Then, exploiting again the lower semi-continuity and the convexity of the cost functions, we conclude existence of the minima. Next, we show that the best-reply maps preserve the order in the spaces of admissible strategies, and for this the submodular condition is essential. The existence then follows by invoking Tarski’s fixed point theorem.
Our result also generalizes to the monotone-follower game in which players are allowed to choose both a regular control and a singular control (see Remark 2.7). Moreover, some of our assumptions are not needed if we impose finite-fuel constraints (see Remark 2.6).
It is worth stressing that our proof strongly hinges on the submodularity assumption, which, however, is a typical requirement in many problems arising in applications (see, e.g., [43], [47], [49], or the books [48] and [50] and the references therein).
The approximation results. Singular control problems naturally arise to overcome the ill-posedness of standard stochastic control problems in which the control linearly affects the dynamics of the state variable, and the cost of control is proportional to the effort. Some kind of connection between regular control problems with the linear structure described above and singular control problems is then expected, and actually already discussed in the literature (see, e.g, the early [41] for an analytical approach, and [37] for a probabilistic approach). In Theorem 21 of [37], it is shown that any sequence obtained by choosing, for each , a minimizer of the monotone-follower problem when the class of admissible controls is restricted to the set of -Lipschitz controls, suitably approximates a (weak) optimal solution to the original monotone-follower problem.
We prove that any sequence of Nash equilibria of the -Lipschitz game is weakly relatively compact, and that any accumulation point is a weak Nash equilibrium of the monotone-follower game. We first show that this sequence satisfies a tightness criterion for the Meyer-Zheng topology. Then, we prove that any Nash equilibrium of the -Lipschitz game necessarily satisfies a system of stochastic equations. After changing the underlying probability space by a Skorokhod representation, we pass to the limit in these systems of equations and we deduce that any accumulation point solves a new system of stochastic equations. These equations can be viewed as a version of the Pontryagin maximum principle, and they are sufficient to ensure that the limit point is a Nash equilibrium in the new probability space, hence a weak Nash equilibrium.
As a byproduct of this result, we are able to show that, for each , there exists large enough such that the Nash equilibrium of the -Lipschitz game is an -Nash equilibrium of the monotone-follower game. This gives a clearer interpretation of the weak Nash equilibrium found through the approximation: the -dimensional vector whose components are the expected costs associated to the weak Nash equilibrium is, in fact, a Nash equilibrium payoff (as defined in [12]) of the monotone-follower game.
Applications and examples. Our existence result applies to deduce existence of open-loop Nash equilibria in stochastic differential games with singular controls and non-Markovian random costs, whenever a certain structure is preserved by the dynamics. For the sake of illustration, we consider the case in which the dynamics of the state variable of each player are a linearly controlled geometric Brownian motion and a linearly controlled Ornstein-Uhlenbeck process.
Moreover, we consider the algorithm introduced by Topkis (see Algorithm II in [47]) for submodular games: given as initial point the constantly null profile strategy, this algorithm consists of an iteration of the best-reply map. We show that, also in our setting, this algorithm converges to a Nash equilibrium.
1.3. Organization of the paper
In Section 2.1 we introduce the monotone-follower game. Sections 2.2 and 3 are devoted to the existence theorems of Nash equilibria for the submodular monotone-follower game and for the -Lipschitz game, respectively. The approximation results are contained in Section 4. The application of our result to suitable stochastic differential games is provided in Section 5, together with the proof of the convergence to a Nash equilibrium of a certain algorithm. In Appendix A we recall some results about the Meyer-Zheng topology.
1.4. Notations.
In the rest of this paper, for and , we denote by the scalar product in , as well as by the Euclidean norm in . For and , we will write if for each , as well as if for each . Moreover, we set and , where and for each . Finally, for , and , for each set and, for , set .
2. The Monotone-Follower Game
2.1. Definition of the Monotone-Follower Game
Fix a complete probability space , a finite time horizon , an integer and . Consider a càdlàg process , and, for , assume to be given continuous semimartingales , and set . Denote by the right-continuous extension of the filtration generated by and , augmented by the -null sets.
Define the space of admissible strategies
[TABLE]
and let denote the set of admissible profile strategies. In order to avoid confusion, in the following we will denote profile strategies in bold letters.
For each , consider measurable functions . We define the monotone-follower game as the game in which each player is allowed to choose an admissible strategy in order to minimize the cost functional
[TABLE]
where and . Here and in the sequel the integrals with respect to are defined by
[TABLE]
where the integrals on the right hand side are intended in the standard Lebesgue-Stieltjes sense on the interval .
We recall the notion of Nash equilibrium.
Definition 1**.**
An admissible profile strategy is a Nash equilibrium if, for every , we have and
[TABLE]
Letting denote the set of all subset of , for each define the best-reply map by
[TABLE]
Observe that the maps are constant in the variable . Moreover define the map
[TABLE]
and notice that the set of Nash equilibria coincides with the set of fixed points of the map R which have finite values; that is, the set of such that and for every .
Remark 2.1**.**
The notion of equilibrium introduced above is that of the so-called Open-Loop Nash equilibrium. We focus on this specific class of equilibria since serious conceptual – so far unsolved – problems arise when one tries to define a game of singular controls with Closed-Loop strategies (see [3] for a discussion, and also [22] and [45]).
We now specify the structural hypothesis on the costs.
Assumption 2.2**.**
For each and for assume that:
- (1)
for each , the function is lower semi-continuous, and strictly convex; 2. (2)
for each the function has decreasing differences in , i.e.
[TABLE]
for each such that ; 3. (3)
for each , the function is submodular, i.e.
[TABLE]
for each .
Under Conditions 2 and 3 of Assumption 2.2 we refer to the game introduced above as to the submodular monotone-follower game (see [47] for a static deterministic -player submodular game, and Theorem 2.6.1 and Corollary 2.6.1 at p. 44 in [48] for further discussion on these conditions). The submodular structure of our game will play a fundamental role in the following.
2.2. Existence of Nash Equilibria in the Submodular Monotone-Follower Game
Define the space of extended admissible strategies
[TABLE]
and, on it, we define the order relation such that, for , one has
[TABLE]
Moreover, we can endow the space with a lattice structure, defining the processes and as
[TABLE]
In the same way, on the set of extended profile strategies , define, for , an order relation by
[TABLE]
together with the lattice structure
[TABLE]
We now provide an existence result for the submodular monotone-follower game.
Theorem 2.3**.**
Let Assumption 2.2 hold and assume that the following uniform coercivity condition is satisfied: there exist two constants such that, for each ,
[TABLE]
Suppose, moreover, that there exists a constant such that, for each ,
[TABLE]
Then the set of Nash equilibria is non empty, and the partially ordered set is a complete lattice.
Proof.
Our aim is to prove existence of a Nash equilibrium by applying Tarski’s fixed point theorem (see Theorem 1 in [46]) to the map (cf. (2.3)). For this, the assumption on the submodularity of and will play a crucial role.
First of all, recalling and from (2.5) and (2.6), define the constant , and introduce the set of restricted admissible strategies
[TABLE]
and the set of restricted profile strategies as . In the following steps we will identify the proper framework allowing us to apply Tarski’s fixed point theorem.
(Step 1) The best-reply maps are well defined.
Fix and take . We have to prove that there exists a unique such that
[TABLE]
and, moreover, that . Clearly, by (2.2), we have .
Let be a minimizing sequence for the functional . Thanks to the coercivity conditions (2.5) on the costs, we deduce that
[TABLE]
We can then use (a minimal adjustment of) Lemma 3.5 in [30], to find a càdlàg, nondecreasing, nonnegative, -adapted process , and a subsequence of (not relabeled) such that, -a.s.,
[TABLE]
where we set, -a.s.
[TABLE]
Moreover, from the limit in (2.8) we have that there exists a -null set such that, for each there exists a subset of null Lebesgue measure, such that
[TABLE]
The latter convergence allows us to invoke Fatou’s lemma which, together with the limit in (2.8) and thanks to the lower semi-continuity of the costs, allows us to conclude that
[TABLE]
where we have used the convexity of and and the minimizing property of . Hence is a minimizer for ; in fact, is the unique minimizer of by strict convexity of the costs.
It remains to prove that , and to accomplish that we argue by contradiction. If there exists such that , then we have and hence, by the coercivity condition (2.5) together with (2.6), we deduce that
[TABLE]
which contradicts the optimality of .
(Step 2) The best-reply maps are increasing, i.e. if are such that , then .
First of all, observe that, by an integration by parts (see, e.g., Corollary 2 at p. 68 in [44]), the cost functional rewrites as
[TABLE]
where denotes the left-limit of . Thanks to the optimality of we have the inequality
[TABLE]
which by (2.10) and setting and , can be rewritten as
[TABLE]
By the submodularity Condition 3 in Assumption 2.2, we have
[TABLE]
and
[TABLE]
Moreover, one can easily verify that
[TABLE]
and
[TABLE]
[TABLE]
so that, by (2.11), we deduce that
[TABLE]
Now, by Condition 2 in Assumption 2.2, we have
[TABLE]
and finally, by (2.16), we conclude that
[TABLE]
Hence minimizes as well as and, by uniqueness, it must be . That is , which shows the claimed monotonicity.
(Step 3) The lattices and are complete.
We prove the claim only for the lattice , since an analogous rationale applies to show that the lattice is complete.
To prove that the lattice is complete we have to show that each subset of has a least upper bound and a greatest lower bound. We now prove only the existence of a least upper bound, since the existence of a greatest lower bound follows by similar arguments.
Consider a subset of , where is a set of indexes. Define . For each we set
[TABLE]
and we recall that there exists a countable subset of such that
[TABLE]
Define next the right-continuous process by
[TABLE]
Observe that is -adapted by right-continuity of the filtration. Hence, lies in , and clearly for each .
Consider next an element of such that for each . For and there exists a -null set such that for all . Defining then , we have for all and , which, by (2.18), implies that for all . Finally, introducing the -null set , we have for all and , and, by right-continuity, we deduce that . Thus, is the least upper bound of .
(Step 4) There exist increasing maps such that for each .
For each , define as the least upper bound of the set in the complete lattice . If , then and, since is increasing, for each such that , which implies that . Moreover, if are such that , then we have and hence that . It only remains to prove that . In order to accomplish that, we observe that, for each such that we have that and, since is increasing, ; that is, the set is closed under taking maxima. This implies that there exists a sequence such that the sequence is increasing and, moreover,
[TABLE]
where the latter equality is due to the monotone convergence theorem. Finally, by Step 1 we have that for each , which, by (2.20), implies that
(Step 5) Existence of Nash equilibria.
By the previous steps the lattice is complete and the map from the set of extended profile strategies into itself is monotone increasing. Then, by Tarski’s fixed point theorem (see [46], Theorem 1), the set of fixed point of the map is a non empty complete lattice. Now, by Step 4, the image of the map is contained in , and the map coincides with the map on . This implies that the set of fixed points of is equal to the set of fixed point of , and since such a set coincides with the set of Nash equilibria, the proof is completed. ∎
2.3. Some Remarks
In this subsection we collect some remarks concerning assumptions and extensions of the previous theorem.
Remark 2.4** (Comments on the Conditions of Theorem 2.3).**
A few comments are worth being done.
- (1)
Condition (2.5) is satisfied if, for example, there exists a constant such that
[TABLE]
or if are such that . 2. (2)
The role of Condition (2.6) is to force Nash equilibria, whenever they exist, to live in the bounded subset of . If there exist measurable functions such that, for each and for each , we have and , with
[TABLE]
then Condition (2.6) is satisfies with .
Remark 2.5**.**
Consider the case . The costs relative to Player 1 are , while the costs of Player 2 can be generic functions satisfying our requirements. Then, all the assumptions of Theorem 2.3 are satisfied, with the exception of the coercivity condition (2.5), which is not satisfied by . If now were a Nash equilibrium, then for the first player we could write
[TABLE]
which is clearly a contradiction. This example shows that, at least in the Nash equilibria, the coercivity condition (2.5) is necessarily satisfied.
Remark 2.6** (Finite-Fuel Constraint).**
Many models in the literature on monotone-follower problems enjoy a so-called finite fuel constraint (see e.g. [32] for a seminal paper, and the more recent [4] and [16]). This can be realized by requiring that the admissible control strategies stay bounded either -a.s. or in expectation. In our game, if we suppose that, for each , the strategies of player belongs to the set a proof similar to that of Theorem 2.3 still shows existence of Nash equilibria without need of Conditions 2.5 and 2.6.
Remark 2.7** (An Extension of Theorem 2.3 with Regular-Singular Controls).**
We here discuss how to extend Theorem 2.3 to a game in which players can choose both a regular and a singular control.
Fix a square integrable random variable and define the space of regular controls as the set of -valued -progressively measurable processes such that We consider the game of regular-singular controls, in which each player is allowed to choose an admissible strategy in order to minimize the cost functional
[TABLE]
Define on the order relation by setting, for , if and only if -a.e. Next, consider on the lattice the topology of intervals (see, e.g., p. 250 in [9]); that is, the topology for which the topology of closed sets is generated by the family of sets and for . Since the topology is included in the weak topology of and is bounded, then is compact in the topology . Therefore, by a characterization of complete lattices (see Theorem 20 at p. 250 in [9]), it follows that the lattice is complete. Then, existence of Nash equilibria follows proceding as in the proof of Theorem 2.3.
3. The -Lipschitz Game
In the notation of Section 2, for each , define the space of -Lipschitz strategies
[TABLE]
and the space of -Lipschitz profile strategies as . The set (resp. ) inherits from (resp. ) the order relation (resp. ) together with the associated lattice structure.
For each , the set of -Lipschitz profile strategies , together with the cost functionals , define a game to which we will refer to as the -Lipschitz game. We say that an -Lipschitz profile strategy is a Nash equilibrium of the -Lipschitz game if, for each , we have and
[TABLE]
Theorem 3.1** (Existence of Nash Equilibria for the Submodular -Lipschitz Game).**
Let Assumption 2.2 hold. Then, for each , the set of Nash equilibria of the -Lipschitz game is non empty, and the partially ordered set is a complete lattice.
Proof.
As in the proof of Theorem 2.3, we identify the proper framework in order to apply Tarski’s fixed point theorem. The completeness of the lattice follows by observing that the least upper bound (as well as the greatest lower bound) of any subset is still Lipschitz with Lipschitz constant bounded by . Moreover, as in the proof of Proposition 26 at p. 109 in [37], we deduce that, for each and each , there exists a unique (by strict convexity of the costs) such that
[TABLE]
By employing arguments as those in the Step 2 of the proof of Theorem 2.3 we conclude that the map is monotone increasing in the complete lattice . Then, the thesis of the theorem follows from Tarski’s fixed point theorem. ∎
4. Existence and Approximation of Weak Nash Equilibria in the Submodular Monotone-Follower Game
In this section we will investigate connections between the monotone-follower game and the -Lipschitz games.
4.1. Weak Formulation of the Monotone-Follower Game.
For and an arbitrary , we introduce the following measurable spaces:
- •
denotes the set of -valued continuous function on with nonnegative components, endowed with the Borel -algebra generated by the uniform convergence norm;
- •
denotes the Skorokhod space of -valued càdlàg functions, defined on , endowed with the Borel -algebra generated by the Skorokhod topology;
- •
denotes the Skorokhod space of -valued nondecreasing, nonnegative càdlàg functions, defined on , endowed with the Borel -algebra generated by the Skorokhod topology.
Also, let , and denote the set of probability measures on the Borel -algebras of , and , respectively. Finally, denote by the set of probability measures on the product -algebra.
Moreover, denote by the canonical projection, i.e., set for each and . Also, for a probability measure , denote by the right continuous extension of the filtration on generated by the canonical projections and , augmented by the -null sets.
We now give a weak formulation of the monotone-follower game. Assume to be given a distribution such that the projection process is a semimartingale with respect to the filtration .
Definition 2**.**
We call a a 5-tuple such that is a complete probability space, is an -valued càdlàg process, is an -valued continuous, nonnegative semimartingale with respect to the filtration , and .
For each basis , we then give the relative notion of admissible strategy.
Definition 3**.**
Given a basis , an admissible strategy associated to is an -valued càdlàg, nondecreasing, nonnegative process on the probability space . We denote by the set of admissible strategies associated to the basis . Moreover, we define the space of admissible profile strategies associated to the basis as
Given a basis , for each and each admissible strategy we define the cost functionals
[TABLE]
where , and denotes the expectation under the probability measure .
We finally introduce a notion of equilibrium that we will refer to as weak Nash equilibrium.
Definition 4** (Weak Nash Equilibrium).**
Given a basis and an admissible profile strategy , we say that the couple is a weak Nash equilibrium if, for every , we have
[TABLE]
4.2. Assumptions and a Preliminary Lemma
In this subsection we specify the main assumptions of this section, we introduce some notations, and we provide a preliminary lemma.
Assumption 4.1**.**
Let Assumption 2.2 hold and, for each , assume that:
- (1)
* and are continuous and continuously differentiable in the variable .* 2. (2)
There exist such that the -dimensional gradients and of the functions and with respect to the (-dimensional) variable satisfy
[TABLE]
for each and .
Moreover, there exist measurable functions such that and , with
[TABLE]
and
[TABLE]
where for some . 3. (3)
There exists a constant such that
[TABLE]
and the total conditional variation (see definition (A.2) in the Appendix A) of over the interval is finite; that is, .
For a basis , a profile strategy and an index , we define the continuous (non adapted) subgradient process by setting
[TABLE]
Furthermore, if is such that for a certain , then, exploiting the convexity of and and integrating by parts, we obtain the following subgradient inequality
[TABLE]
Fix a basis and recall that is the right-continuous extension of the filtration generated by and , augmented by the -null sets. For each , consider a Nash equilibrium of the -Lipschitz game as in Theorem 3.1. The next lemma shows that any Nash equilibria of the -Lipschtz game satisfy certain first order conditions. The proof of this claim follows arguments analogus to those used in the proof of Proposition 27 in [37].
Lemma 4.2**.**
For every and every , set . Then, under Assumption 4.1, defining , we have
[TABLE]
4.3. Existence and Approximation of Weak Nash Equilibria
We now state and prove the main result of this section, which can be thought of as a game-theoretic version of Theorem 21 in [37].
For an arbitrary , consider on the space the topology given by the convergence in the uniform norm. Furthermore, on the space consider the pseudopath topology ; that is, the topology on induced by the convergence in the measure on the interval , where denotes the Lebesgue measure, and denotes the Dirac measure at the terminal time . The space is a closed subset of the topological space , and the Borel -algebra induced by the topology , coincides with the -algebra induced by the Skorokhod topology (see also the Appendix in [37]). Notice that the topological spaces and are separable, but not Polish (see, e.g., [42]). Finally, on the product space , consider the product topology, and on consider the topology of weak convergence of probability measures.
Fix a basis and consider, for each , a Nash equilibrium of the -Lipschitz game as in Theorem 3.1. Define, for , the law in ; with a slight abuse of terminology, we will refer to the law as the law of the Nash equilibrium . We then have the following theorem.
Theorem 4.3**.**
Under Assumption 4.1 the following statements hold.
- (1)
The sequence of the laws of the Nash equilibria of the -Lipschitz games is weakly relatively compact in . 2. (2)
Any accumulation point is the law of a weak Nash equilibrium of the monotone-follower game; that is, there exist a basis and an admissible profile strategy , such that is a weak Nash equilibrium of the monotone-follower game and .
Proof.
We prove the two claims of the theorem separately.
Proof of Claim 1. By assumption we have . Moreover, by employing arguments similar to those in the proof of Proposition 28 at p. 110 in [37], we find
[TABLE]
where is as in Assumption 4.1. Therefore, from Lemma A.1, we can deduce that the sequence is tight in , and that in tight in . Furthermore, since the space is Polish, is regular, and hence is tight in (see, e.g., Remark 13.27 at p. 260 in [34]). This implies that the sequence is tight in .
By Prokhorov’s theorem (see, e.g., Theorem 13.29 at p. 261 in [34]), there exists a subsequence of indexes (still denoted by ) and a probability measure such that the sequence converges weakly to . The first claim of the theorem is thus proved.
Proof of Claim 2. Thanks to an extension of Skorokhod’s theorem for separable spaces (see Theorem 3 in [20]), there exists a probability space , and, on it, a sequence of -valued random variables, and a -valued random variable , such that and . Furthermore, this representation is such that, for almost all , we have
[TABLE]
as well as
[TABLE]
Define then . Since is constantly , then the same holds for its limit; that is, , and this implies that is a basis.
Next, for every and , recalling (4.5), we define on the probability space the subgradient processes and . By the convergence at the terminal time (4.10) together with Fatou’s lemma and the estimate we have
[TABLE]
Let and define the measurable function by
[TABLE]
Being constantly equal to , the sequence is tight in . This allows to assume without loss of generality (modulo a further subsequence, a new Skorokhod representation of the sequence , and exploiting the measurability of ), that converges to , -a.s. Furthermore, by (4.3) in Assumption 4.1, we have The latter, together with the -a.s. convergence of , the convergence in (4.10), and the integrability proved in (4.11), implies that, for -almost all , there exists a constant such that
[TABLE]
Thus, for -almost all , we can find, by continuity of , another constant such that
[TABLE]
Hence, for -almost all , the bounded continuous function coincides with the function when evaluated along the sequence and at the limit point .
Considering fixed and bounded by , this allows to use equation (A.1), together with standard arguments exploiting the compactness of , in order to deduce that, -a.s.
[TABLE]
The latter, thanks to (4.9) and (4.10) and to the continuity of , implies that,
[TABLE]
The following claims summarize two key properties of the processes and that will guarantee that is a weak Nash equilibrium as in Definition 4.
For every , we now prove that the following hold -a.s.:
- (2.a)
for every ; 2. (2.b)
.
(Proof of 2.a) We begin by proving that in . For , from the convergence proved in (4.13) we have that -a.e. converges to . Moreover, for as in Assumption 4.1, by the growth condition (4.1) we easily find that
[TABLE]
for a suitable constant . Using then the integrability condition (4.3) in Assumption 4.1 and the estimates (4.8) (recall that by assumption ), we have
[TABLE]
which implies that the sequence is uniformly integrable. From Theorem 6.25 at p. in [34], we deduce then that in . Now, from the second equation in (4.7) in Lemma 4.2, we find
[TABLE]
and by continuity of we conclude that -a.s.
[TABLE]
(Proof of 2.b) Computations analogous to those employed in (4.14) yield
[TABLE]
as well as,
[TABLE]
Now, the estimates (4.8), (4.11), (4.17) and (4.18) imply that
[TABLE]
which, together with the convergence established in (4.13), allows us to use Lemma A.2 in Appendix A in order to deduce that
[TABLE]
where we have used the first equality of (4.7) in Lemma 4.2 and that, for each , -a.s. This implies, thanks to the non negativity of established in (4.16), that -a.s.
[TABLE]
i.e. (2.b) is proved.
It does remain to conclude that the couple is a weak Nash equilibrium of the game. Fix , and consider an admissible strategy . By (4.6) and Claims (2.a) and (2.b) we have
[TABLE]
which in fact completes the proof. ∎
4.4. On Lipschitz -Nash Equilibria for the Monotone-Follower Game
In this subsection we prove another connection between the Lipschitz games and the monotone-follower game by showing that -Nash equilibria of the monotone-follower game can be realized as Nash equilibria of the -Lipschitz game, for sufficiently large. The proof of this result exploits Theorem 4.3, combined with a contradiction scheme.
As in Subsection 4.3, in the following we consider fixed a basis , and, for each , let be a Nash equilibrium of the -Lipschitz game as in Theorem 3.1.
Theorem 4.4**.**
Suppose that Assumption 4.1 holds and that there exists a constant such that
[TABLE]
for each and .
Then, for each , there exists such that the Nash equilibrium of the -Lipschitz game is an -Nash equilibrium of the monotone-follower game; that is, for each
[TABLE]
Proof.
We argue by contraddiction and we suppose that the thesis is false. Then, there exists such that, for each , there exist and an admissible strategy with
[TABLE]
Since the number of indexes of the players is finite, we can suppose that there exists such that, for each ,
[TABLE]
Recall now that, for each , is a Nash equilibrium for the -Lipschitz game and notice that the process constantly equal to zero is admissible. Hence, from (4.21), and using the coercivity condition (4.4) and the integrability condition (4.2) in Assumption 4.1, we find
[TABLE]
which implies that
[TABLE]
With arguments analogous to those employed in the proof of Claim 1 of Theorem 4.3, from the tightness condition (4.22) we deduce that there exists a subsequence of indexes (still denoted by ) and a probability measure such that the sequence converges weakly to .
Then, thanks again to an extension of Skorokhod’s theorem (see Theorem 3 in [20]), there exists a probability space , and, on it, a sequence of -valued random variables, and a -valued random variable , such that and . Furthermore, this representation is such that, for -almost all , we have
[TABLE]
as well as
[TABLE]
A rationale similar to that yelding (4.12) can be employed to show that, -a.s.,
[TABLE]
where we have also used that and are continuous. Furthermore, thanks to the growth condition (4.20), for as in Assumption 4.1, we can find a suitable constant such that
[TABLE]
where the integrability of the right-hand side follows from Condition (4.3) and the estimate (4.8). Finally, the limit in (4.25), together with the uniform integrability in (4.26), allows us to conclude that
[TABLE]
With a similar reasoning we also find
[TABLE]
Moreover, Condition (4.3) yields
[TABLE]
The latter, together with (4.8) and (4.11), allows to use Lemma A.2 in Appendix A in order to deduce that
[TABLE]
which, together with (4.28), gives
[TABLE]
Fix now and define the sequence of processes by as well as the process . Observe that, for each , from (4.21) and the definition of we have
[TABLE]
Moreover, notice that the convergence established in (4.24) implies that, -a.s., the sequence converges to in the measure on .
Now, since the sequence is bounded by the constant , we can use again Lemma A.2 in Appendix A to deduce that
[TABLE]
Hence, thanks to (4.30), (4.27) and (4.32), for each fixed we can pass to the limit in the inequality (4.31), in order to obtain that
[TABLE]
Finally, by the monotone convergence theorem, we can take the limit as in the latter inequality to deduce that
[TABLE]
On the other hand, the probability measure is an accumulation point of the sequence , and hence, by Theorem 4.3, the couple is a weak Nash equilibrium of the monotone-follower game, with . Moreover, is an admissible strategy associated to the basis ; this implies that
[TABLE]
which, together with (4.33), leads to a contradiction, and thus completes the proof. ∎
Remark 4.5**.**
Theorem 4.4 can also be understood in a different way. Fix a weak Nash equilibrium which is an accumulation point of a sequence of Nash equilibria of the -Lipschitz game on a fixed basis , and define
[TABLE]
Then, is a Nash equilibrium payoff of the monotone-follower game (see, e.g., Definition 2.7 in [12], or [38]), in the sense that, for each , there exists such that, for each , we have:
- (1)
* for each*; 2. (2)
.
Moreover, Theorem 4.4 shows that the Nash equilibrium payoff is such that, for each , the profile strategy , which satisfies the conditions of the definition above, can be chosen as a Nash equilibrium of the -Lipschitz game, for large enough.
Remark 4.6**.**
Notice that the submodularity conditions (2) and (3) in Assumption 2.2 are not necessarily needed in the proof of Theorem 4.3 and 4.4. Indeed, only the requirement that, for each , there exists a Nash equilibrium for the -Lipschitz game is needed. The latter games can be seen as stochastic differential games, where the set of strategies is the set of progressively measurable stochastic processes , with degenerate dynamics . This fact suggests that, whenever the submodularity requirement does not hold, one might exploit, on a case by case basis, existence results on equilibria for sochastic differential games (see, e.g., [14] and references therein for results on stochastic differential games).
5. Applications and Examples
5.1. Existence of Equilibria in a Class of Stochastic Differential Games.
This subsection is devoted to show that Theorem 2.3 applies to deduce existence of open loop Nash equilibria in stochastic differential games with singular controls, whenever a certain structure is preserved by the dynamics. For the sake of illustration, we propose the following model.
Fix a filtered probability space satisfying the usual conditions and consider on it standard -Brownian motions . Suppose to be given, for , measurable functions , as well as constants and continuous -adapted stochastic processes . Assume moreover to be given an -adapted process with càdlàg components. The set of admissible strategies is defined as the set of nondecreasing, nonnegative, càdlàg, -adapted, -valued stochastic processes, whereas denotes the set of asmissible profile strategies.
We consider the -player stochastic differential game of singular controls in which, for , player chooses an admissible strategy to control her private state, which evolves according to the stochastic differential equation
[TABLE]
in order to minimize her expected cost
[TABLE]
Observe that, for , the solution to equation (5.1) is given by
[TABLE]
where the processes and are defined by
[TABLE]
Assumption 5.1**.**
Let and satisfy Assumption 2.2. Suppose moreover that:
- (1)
for each , there exist functions such that
[TABLE]
with
[TABLE] 2. (2)
there exists a constant such that, for each , we have for each .
Theorem 5.2**.**
Under Assumption 5.1, there exists an open-loop Nash equilibrium of the previously introduced stochastic differential game.
Proof.
Thanks to (5.2), the cost functional of player can be rewritten in terms of (cf. (5.3)), that is
[TABLE]
This leads to define the new functions by
[TABLE]
as well as the continuous processes by . These definitions allows us to introduce new cost functionals in terms of new profile strategies setting
[TABLE]
Notice that, by (5.4) and the definition of in (5.3) as a function of , we have that
[TABLE]
Furthermore, for each there exists a unique such that for each . This means that solving the stochastic differential game in the class of profile strategies and with cost functionals is equivalent to solve the monotone-follower game for and cost functionals . The rest of the proof is then mainly devoted to show that the costs and , together with the processes , satisfy the conditions of Theorem 2.3.
Since the functions and satisfy Assumption 2.2, for each the functions and are clearly continuous and strictly convex. Moreover, for and such that , we have for each , since the components of are positive. Therefore, because has decreasing differences, we deduce that
[TABLE]
which means that has decreasing difference as well. In the same way it is possible to show that has decreasing differences, and this allows to conclude that the functions and satisfy Assumption 2.2. Moreover, thanks to (1) in Assumption 5.1, Condition 2.6 is clearly satisfied with for each .
We prove now that the functionals satisfy a slightly different version of Condition 2.5. The superlinear condition (2) in Assumption 5.1 implies that
[TABLE]
where is the probability measure on given by
[TABLE]
and equivalent to .
We can therefore apply Theorem 2.3 (in fact a slightly different version of it, in which the expectation in Condition 2.5 is replaced by the expectation under an equivalent probability measure) to deduce existence of a Nash equilibrium of the monotone-follower game with cost functionals . Hence the process defined by
[TABLE]
is an open-loop Nash equilibrium of the stochastic differential game. ∎
Remark 5.3**.**
The same arguments employed in the proof of Theorem 5.2 apply if we replace the dynamics of the controlled geometric Brownian motion in (5.1) by the dynamics of a controlled Ornstein–Uhlenbeck process
[TABLE]
for some parameters and . Mean-reverting dynamics (as the Ornstein–Uhlenbeck one) find important application in the energy and commodity markets (see, e.g., [8] or Chapter 2 in [40]).
5.2. An Algorithm to Approximate the Least Nash Equilibrium
In this subsection we prove that, also in our setting, the algorithm introduced by Topkis (see Algorithm II in [47]) for submodular games converges to the least Nash equilibrium of the game.
According to the notation of Section 2, define the sequence of processes in the following way:
- •
;
- •
for each , set .
Theorem 5.4**.**
Suppose that the assumptions of Theorem 2.3 hold. Assume, moreover, that there exists a constant such that, for each ,
[TABLE]
Then the sequence is monotone increasing in the lattice and it converges to the least Nash equilibrium of the game.
Proof.
Since the map is increasing (cf. Step 2 in the proof of Theorem 2.3), the sequence is clearly monotone increasing with respect to the order relation in .
Define now the process as the least upper bound of the sequence in the lattice . Recall the construction of and (cf. (2.18) and (2.19) in Step 3 in the proof of Theorem 2.3). Notice that, since the sequence is increasing in the lattice , there exists a -null set such that
[TABLE]
Take now and . If does not converge to , then we find such that,
[TABLE]
for each such that . This implies that , which means that is in the set of discontinuity points of . Thus, we conclude that there exists a -null set such that,
[TABLE]
since, for each , the latter convergence is verified in by the definition of .
We next show that the limit point is a Nash equilibrium. By Step 1 in the proof of Theorem 2.3, we know that there exists a suitable constant such that, for each , . Hence, by the monotone convergence theorem, we deduce that
[TABLE]
which in turn implies that Fix then and . If , then, by the coercivity condition (2.5), we would automatically have Hence, without loss of generality, we can assume that
[TABLE]
Now, since minimizes , for each we can write
[TABLE]
Moreover, the limit in (5.6), together with conditions (5.5) and the estimates (5.7) and (5.8), allows us to invoke the dominated convergence theorem and to take the limit as goes to infinity in the last inequality in order to deduce that . Hence is a Nash equilibrium.
Finally, we prove that is the least Nash equilibrium. Suppose that is another Nash equilibrium. By definition we have . If, for an arbitrary , we have , then, since the map is increasing and is a fixed point of , we have . Hence, by induction, we deduce that for each , which in turn implies that , since is the least upper bound of the sequence . ∎
Appendix A Meyer-Zheng Convergence
In this appendix we recall some fact about the so-called Meyer-Zheng topology (see [42]) and we provide some results concerning the tightness of càdlàg processes in such a topology.
Pseudopath topology. Recall that we have defined (cf. Subsection 4.3) the pseudopath topology on the space as the topology induced by the convergence in the measure on the interval , where denotes the Lebesgue measure and denotes the Dirac measure at the terminal point . Notice that we introduce the pseudo-path topology through its characterization proved in Lemma 1 in [42]. Observe that the topology is metrizable. If is a sequence of functions in converging to a function in the pseudopath topology , then we have that (see, e.g., Appendix A.3. at p. 116 in [37])
[TABLE]
for each bounded continuous function .
Meyer-Zheng topology and tightness criteria. The Meyer-Zheng topology on is the topology of weak convergence of probability measures on the topological space .
For a given filtered probability space consider a càdlàg process , and consider the conditional variation of over the interval , defined as
[TABLE]
where the supremum is taken over all the partitions , .
We finally prove, for the sake of completeness, a slightly different version of the classical Meyer-Zheng tightness criterion (see Theorem 4 at p. 360 in [42]), that is useful in many occasions during our study. Notice that, differently to Theorem 34 at p. 116 in [37], the next lemma allows us to handle a stochastic cost of control .
Lemma A.1**.**
The following tightness criteria hold true.
- (1)
Let be a sequence of -valued càdlàg processes defined on such that
[TABLE]
Then is tight in . 2. (2)
Let be a sequence of nondecreasing, nonnegative, -valued càdlàg processes defined on such that
[TABLE]
Then is tight in .
Proof.
We will prove only the claim (1), since the proof of claim (2) follows by an analogous rationale.
Let be the space of -valued càdlàg functions on , with the Borel -algebra generated by the Skorokhod topology. On the half line , consider the measure given by . On consider the pseudopath topology ; that is, the topology induced by the convergence in the measure on the interval . Define, moreover, the space as the set of elements of which are constant on , and notice that is a closed subset of . Also, observe that the extension map , defined by
[TABLE]
is an omeomorphism between the topological spaces and .
Now, using the uniform boundedness of , we notice that the sequence satisfies the requirement of Theorem 4 in [42], and, as shown in its proof, it follows that the sequence is tight in . Furthermore, since is a closed subset of , we have that is tight in . Finally, since the map is an omeomorphism, we conclude that the sequence is tight in in the Meyer-Zheng topology. ∎
We finally summarize in a lemma a result on the convergence of stochastic integrals.
Lemma A.2**.**
Let be a sequence of -valued continuous processes which converges -a.s. to an -valued continuous process uniformly on . Let be a sequence of nondecreasing, nonnegative, -valued càdlàg processes defined on , which converges -a.s. to nondecreasing, nonnegative, -valued cadlag process in the pseudopath topology . Suppose, moreover, that there exists two constant such that
[TABLE]
Then
[TABLE]
Proof.
We will prove that for each subsequence of indexes there exists a further subsequence for which the limit in (A.5) holds true.
Consider then a subsequence of indexes (not relabeled). From Condition (A.4), Hölder’s inequality with as in the assumptions easily reveals that
[TABLE]
Since , by the reflexivity of , there exists a subsequence of indexes and a random variable , for which
[TABLE]
where the equality of the two limits follows from the -a.s. uniform convergence of to and from the integrability condition (A.4).
Next, since by Condition (A.4) the sequence is bounded in , by Lemma 3.5 in [30] there exist a nondecreasing, nonnegative, -valued càdlàg process defined on and a subsequence (not relabeled) of such that, -a.s.,
[TABLE]
where we have set, -a.s.
[TABLE]
Moreover, for , the limit in (A.8) and an integration by parts, together with the limit in (A.1) (observing that the sequence is -a.s. bounded), imply that, -a.s.,
[TABLE]
Therefore, by the fundamental lemma of the Calculus of Variation (see Theorem 1.24 at p. 26 in [17]), the right-continuity of and , and the convergence of to , we have for all , -a.s. This identification allows to conclude, using (A.7) and uniform integrability estimates as in (A.6), that
[TABLE]
The latter, combined with (A.7), completes the proof of the lemma. ∎
Acknowledgements. Financial support by the German Research Foundation (DFG) through the Collaborative Research Centre 1283 “Taming uncertainty and profiting from randomness and low regularity in analysis, stochastics and their application” is gratefully acknowledged. We are grateful to Peter Bank, Markus Fischer, Ulrich Horst, Max Nendel, Frank Riedel, and Jan-Henrik Steg for fruitful conversations.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Alvarez, L. H., & Shepp, L. A. (1998). Optimal harvesting of stochastically fluctuating populations. Journal of Mathematical Biology, 37(2), 155-177.
- 2[2] Amir, R. (1989). A lattice-theoretic approach to a class of dynamic games. In “System-Theoretic Methods in Economic Modelling I”, 1345-1349.
- 3[3] Back, K., & Paulsen, D. (2009). Open-loop equilibria and perfect competition in option exercise games. The Review of Financial Studies, 22(11), 4531-4552.
- 4[4] Bank, P. (2005). Optimal control under a dynamic fuel constraint. SIAM Journal on Control and Optimization, 44(4), 1529-1541.
- 5[5] Bank, P., & Riedel, F. (2001). Optimal consumption choice with intertemporal substitution. The Annals of Applied Probability, 11(3), 750-788.
- 6[6] Bather, J. A., & Chernoff, H. (1967). Sequential decisions in the control of a spaceship. In “Fifth Berkeley Symposium on Mathematical Statistics and Probability” (Vol. 3, 181-207).
- 7[7] Beneš, V. E., Shepp, L. A., & Witsenhausen, H. S. (1980). Some solvable stochastic control problems. Stochastics: An International Journal of Probability and Stochastic Processes, 4(1), 39-83.
- 8[8] Benth, F. E., Kholodnyi, V. A., & Laurence, P. (Eds.). (2014). Quantitative Energy Finance: Modeling, Pricing, and Hedging in Energy and Commodity Markets. Springer Science & Business Media.
