Unique ergodicity of deterministic zero-sum differential games
Antoine Hochart

TL;DR
This paper investigates the conditions under which deterministic zero-sum differential games exhibit unique ergodicity, characterized by the convergence of value functions, extending classical dynamical systems concepts.
Contribution
It provides necessary and sufficient conditions for unique ergodicity in such games, involving symmetric criteria and the concept of dominions.
Findings
Necessary and sufficient conditions for ergodicity are established.
The notion extends classical ergodicity to game-theoretic settings.
Conditions involve symmetric properties and invariant subsets called dominions.
Abstract
We study the ergodicity of deterministic two-person zero-sum differential games. This property is defined by the uniform convergence to a constant of either the infinite-horizon discounted value as the discount factor tends to zero, or equivalently, the averaged finite-horizon value as the time goes to infinity. We provide necessary and sufficient conditions for the unique ergodicity of a game. This notion extends the classical one for dynamical systems, namely when ergodicity holds with any (suitable) perturbation of the running payoff function. Our main condition is symmetric between the two players and involve dominions, i.e., subsets of states that one player can make approximately invariant.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Unique ergodicity of
deterministic zero-sum differential games
Antoine Hochart
Facultad de Ingeniería y Ciencia, Universidad Adolfo Ibáñez, Diagonal Las Torres 2640, Santiago, Chile
(Date: January 7, 2020)
Abstract.
We study the ergodicity of deterministic two-person zero-sum differential games. This property is defined by the uniform convergence to a constant of either the infinite-horizon discounted value as the discount factor tends to zero, or equivalently, the averaged finite-horizon value as the time goes to infinity. We provide necessary and sufficient conditions for the unique ergodicity of a game. This notion extends the classical one for dynamical systems, namely when ergodicity holds with any (suitable) perturbation of the running payoff function. Our main condition is symmetric between the two players and involve dominions, i.e., subsets of states that one player can make approximately invariant.
Key words and phrases:
Differential games, Hamilton-Jacobi equations, viscosity solutions, ergodicity, limit value
2010 Mathematics Subject Classification:
Primary: 91A23, 49N70; Secondary: 37A99, 49L25, 35F21, 35B40.
The author is supported by FONDECYT grant 3180662.
1. Introduction
We study the ergodic problem for deterministic two-player zero-sum differential games. Such games are defined by a nonlinear system in controlled by two players,
[TABLE]
where the first player chooses the actions and the second player, the actions . Given a continuous and bounded payoff function , player 1 intends to minimize one of the following payoff functionals, whereas player 2 intends to maximize it:
[TABLE]
in the infinite-horizon discounted game, or
[TABLE]
in the game played in finite horizon . We assume that the data are -periodic in the state variable so that the state space can be identify with the -torus . We also restrict our study to the lower game, in which player 1 adapts her control to player 2’s actions, but note that all the results can be readily adapted to the upper game or the situation in which the classical Isaacs condition holds.
The value of the discounted (lower) game and the one of the finite-horizon (lower) game, denoted respectively by and , are the payoffs at equilibrium and can be characterized as the viscosity solutions of, respectively, a stationary Hamilton-Jacobi PDE and an evolutionary Hamilton-Jacobi PDE involving the Hamiltonian of the (lower) game.
The ergodic problem for zero-sum differential games or for Hamilton-Jacobi equations, its PDE counterpart, concerns the asymptotic behavior of the value functions and . More precisely, it deals with the uniform convergence toward a constant of when the discount factor goes to zero, and of when the horizon goes to infinity. The problem has been much studied since the seminal work of Lions, Papanicolaou and Varadhan [LPV87]. For optimal control (i.e., one-player) problems, let us mention the work of Arisawa [Ari97, Ari98] and for two-player games, the one of Alvarez and Bardi [AB03, AB07] or Cardaliaguet [Car10]. More recently, the ergodic control problem has been studied by Quincampoix and Renault [QR11], Gaitsgory and Quincampoix [GQ13], Cannarsa and Quincampoix [CQ15] or Buckdahn, Quincampoix and Renault [BQR15], for situations in which the limit value is not necessarily constant with respect to the initial state. Let us further mention the work of Khlopin [Khl18] on Abelian-Tauberian properties, or Ziliotto [Zil17, Zil19] on counterexamples to the convergence of the values, which illustrate the connection between the discrete setting (i.e., repeated games) and the continuous setting (which we consider here)111Note that the counterexample to Hamilton-Jacobi homogenization given in [Zil17] has been preceded by counterexamples for the convergence of the value of repeated games given by Vigeral [Vig13] and Ziliotto [Zil16]..
An important problem is then to characterize the differential games which are ergodic. Typical results require that the nonlinear system (or a subsystem, if it is decomposable) be uniformly controllable by one player, that is, any point is controllable to any other point by this player, either exactly or approximately, asymptotically or in bounded time (see e.g., [Ari98, Bet05, AB07]). Such conditions are independent of the payoff function and thus imply that the game is in fact uniquely ergodic. The latter notion, which was originally defined for dynamical systems, readily extends to differential games: a game is uniquely ergodic if it is ergodic for all perturbations of the payoff function that only depend on the state variable. In [Ari97], Arisawa showed that a converse property holds for systems controlled by one player and proved the existence of an ergodic attractor when unique ergodicity holds. But for two-player games, these controllability conditions totally lack symmetry, focusing only on one player.
The purpose of this article is to study the unique ergodicity property for differential games. We introduce a “dominion condition” which is in essence symmetrical between the two players. Each dominion is associated with one player and, roughly speaking, corresponds to a nonempty subset of states that this player can make approximately invariant for the dynamics. We show that if a game is uniquely ergodic, then the players do not have disjoint dominions. To prove this result, we use an Hamilton-Jacobi PDE approach. Under specific controllability assumptions (independence of with respect to the state variable or uniform time estimates on the dynamics) we further prove that the “dominion condition” is in fact equivalent to unique ergodicity. Thus our results generalize the unique ergodicity property of dynamical systems222We refer the reader to [AB03, Sec. 6.1] for the connections between classical ergodic theory and ergodicity of games or Hamiltonians., as well as the analysis of Arisawa in [Ari97, Ari98] for optimal control problems. In particular, let us observe that if a system is uniformly controllable by one player, then, whatever assumptions are made on the controllability (asymptotic or bounded time, exact or approximate), it implies that the other player has a unique trivial dominion, namely the whole state space, and so that the “dominion condition” trivially holds.
We finally mention that the notion of dominion coincides with the ones of leadership domain and discriminating domain in viability theory (see e.g., [Car96]), and therefore also relates with the notions of B-set and approachability in repeated games with vector payoffs, as shown by As Soulaimani, Quincampoix and Sorin in [ASQS09]. However, the ideas developed in this article were first inspired by the study of the ergodic problem for zero-sum repeated games, i.e., games played in discrete time (see the companion articles [Hoc19] and [AGH20]). In order to remain consistent with the latter work, we have chosen to use the term “dominion” instead of “domain”, although the two terms could be interchanged.
The paper is organized as follows. Section 2 is dedicated to preliminaries on differential games, their value functions and the Hamilton-Jacobi PDE approach to ergodicity. This section only provides some notation and classical results. It can therefore be safely skipped by readers familiar with the subject. In Section 3, we introduce and study the unique ergodicity property for general Hamiltonians, that is, the property of an Hamiltonian to be ergodic for any suitable perturbation. This (slightly) generalizes a characterization by Alvarez and Bardi in [AB10]. In Section 4, we introduce the notion of dominion and study the unique ergodicity property for differential games following a PDE approach. In Section 5, we study the unique ergodicity of differential games relying only on a dynamical system approach. Finally in Section 6, we characterize dominions in operator-theoretic terms, which establishes the link with the notion of discriminating / leadership domain in viability theory.
2. Preliminaries
We introduce here some notation as well as standard definitions and results on differential games and their PDE approach. Readers familiar with the subject can safely skip the section.
2.1. Framework and standing assumptions
We start by describing the setting of deterministic two-player zero-sum differential games that we study in this article. Consider first the controlled nonlinear system 1 where the map is from to , with nonempty compact metric spaces. We assume throughout the paper that is continuous in all variables and Lipschitz continuous in the state variable, uniformly in the control variables, i.e., denoting by the standard Euclidean norm,
[TABLE]
for some constant and for all , and . Player 1 (resp., player 2) chooses a control (resp., ) in the set of Lebesgue measurable functions from to (resp., ), which we denote by (resp., )333In order to simplify the notation, we shall equally denote by and single elements of and , respectively, and controls of player 1 and player 2, i.e., elements of and , respectively. The distinction should be clear from the context. The Cauchy-Lipschitz theorem implies that Equation 1 has a unique solution, which we denote by and for which the differential equation holds for almost all .
We are further given a bounded continuous payoff function , where we let , the supremum norm of . Then, for any trajectory of the controlled system 1, we mainly consider in this article the discounted payoff functional 2, associated with the game played in infinite horizon with a discount factor on the running payoff. The objective of player 1 is to minimize the latter functional, whereas player 2 intends to maximize it. We shall also briefly mention the payoff functional 3 associated with the game played in a finite horizon .
Additionally to the classical conditions on and already mentioned above (and which we reproduce below), we make throughout the paper the following assumption. Before stating it, let us recall that a modulus of continuity is a nondecreasing function , vanishing and continuous at [math], that is, such that .
Assumption A0 (Standing assumption).
- (i)
The function is continuous in all variables and uniformly Lipschitz continuous in the state variable, the function is bounded continuous, the action spaces and are nonempty compact sets. 2. (ii)
The payoff function is uniformly continuous with respect to the state variable, uniformly with respect to the control variables, i.e., there exists a modulus of continuity such that
[TABLE] 3. (iii)
The functions and are -periodic in the state variable, i.e., for ,
[TABLE]
Let us remark that Item (iii) implies that the state space can be identify with the -torus . Although we shall work mostly in , we draw the attention of the reader to the fact that sometimes, we will consider objects in the quotient space. Moreover, Item (iii) together with the continuity of entails the boundedness of this function. We therefore let .
2.2. Value functions and Hamilton-Jacobi PDEs
We introduce here the concept of value function and then characterize it in terms of viscosity solution of some Hamilton-Jacobi PDE. We keep the presentation to a minimum and refer the reader to the classical monograph [BCD97] for more details.
Let us start with the definition of nonanticipating strategies.
Definition 2.1** (Nonanticipating strategy).**
A nonanticipating strategy for the first player is a map such that for any time and any controls of player 2, if for almost all then for almost all . We denote by the set of nonanticipating strategies for player 1.
The set of nonanticipating strategies for the second player is defined accordingly.
We then introduce the (unnormalized) value functions. When player 2 chooses a control and player 1 is allowed to adapt her response to this control, i.e., when she chooses a nonanticipating strategy , we are considering the lower game, which we denote by . The lower value function associated with the infinite-horizon discounted payoff functional is then defined by
[TABLE]
On the other hand, if player 1 is bound to choose a control to which player 2 can adapt by choosing a nonanticipating strategy , then we are considering the upper game, denoted by , and the upper value function is given by
[TABLE]
When the game is played in a finite horizon , the value functions are defined similarly by, respectively,
[TABLE]
We always have (resp., ) and the differential game is said to have a value at state if there is equality. The latter holds under the classical Isaacs condition (which we recall at the end of the section). However, in this work, we do not need to make such an assumption: all the results presented in the article hold in the lower as well as in the upper game. Owing to the symmetry of and , we shall only consider from now on the lower game, and therefore drop the “-” superscript for simplicity of the notation. We leave to the reader the straightforward adaptation of the results to the upper game (or to the situation in which Isaacs’ condition holds).
We readily deduce from the above definitions that the two normalized value functions and are bounded by and -periodic. It is also known that they are respectively continuous on and Lipschitz continuous on for all times . Furthermore, they can be characterized as viscosity solutions444In this paper, the solutions of PDEs will always be in the continuous viscosity sense. of some PDEs, called Hamilton-Jacobi-Isaacs’ equations. These equations involve the (lower) Hamiltonian, defined by
[TABLE]
where is the standard scalar product on . The next result illustrates this fact. Note that, given any real function , we denote by its partial derivative with respect to the time variable , and by its gradient with respect to the state variable .
Theorem 2.2** (see [BCD97, Ch. III, Prop. 2.8, 3.5]).**
Under A0, the value function is the unique continuous viscosity solution of the Hamilton-Jacobi PDE
[TABLE]
and the value function is the unique continuous viscosity solution of the Hamilton-Jacobi PDE
[TABLE]
The upper value functions are characterized by the same PDEs after replacing the lower Hamiltonian with the upper Hamiltonian
[TABLE]
Consequently, if Isaacs’ condition holds, that is, if
[TABLE]
then the lower and the upper value functions are equal.
2.3. Ergodicity and PDE approach
In this article, we are interested in the asymptotic behavior of the value functions, that is, in the behavior of as the discount factor goes to [math] (resp., in the behavior of as the time horizon goes to ). More specifically, we study the so-called ergodic problem, that is, the situation in which there exists a constant such that the normalized value tends to as goes to [math] (resp., tends to as goes to ) uniformly in . This property is called ergodicity of the game.
Thanks to Theorem 2.2, the latter problem can be studied by a PDE approach. With this in mind, we shall sometimes consider arbitrary Hamiltonians defined on that satisfy the following properties. Note that these properties are inherited from the Hamiltonian defined in 4.
Assumption A1.
- (i)
The Hamiltonian is continuous. 2. (ii)
is -periodic in the first variable, i.e, for all and ,
[TABLE] 3. (iii)
There is a modulus of continuity such that, for all ,
[TABLE] 4. (iv)
There is a function that is positively homogeneous of degree one in the second variable, and a constant such that, for all ,
[TABLE]
Let us make few comments about these assumptions. First, Items (i), (ii) and (iii) imply that the PDEs HJδ and HJ have a unique continuous viscosity solution. In particular, Item (iii) implies that the comparison principle for viscosity solutions holds. Second, the map introduced in Item (iv) is called the recession function of . The positive homogeneity of degree one means that
[TABLE]
for all and all . A consequence is that
[TABLE]
uniformly in , and so is necessarily unique, continuous and -periodic in the first variable. Let us observe that if is the Hamiltonian associated with the lower game , as defined in 4, then
[TABLE]
Following a PDE approach, the existence and the value of the ergodic constant can be related with the viscosity solutions of the following cell problem:
[TABLE]
The next result explains this connection. In its statement, we abbreviate upper semicontinuous as u.s.c. and lower semicontinuous as l.s.c. Note that the result was shown in [AB03] for second-order Hamilton-Jacobi PDEs.
Theorem 2.3** ([AB03, Thm. 4]).**
Let be an arbitrary Hamiltonian satisfying Items (i), (ii) and (iii) of A1. The following assertions are equivalent.
- (i)
If is the solution of the stationary problem **HJδ***, then converges uniformly in to a constant as goes to [math].* 2. (ii)
If is the solution of the Cauchy problem HJ, then converges uniformly in to a constant as goes to . 3. (iii)
There exists a constant such that
[TABLE]
Moreover, if one of these assertions is true, then .
When an arbitrary Hamiltonian satisfies one (hence all) of the above assertions, we say that it is ergodic. We refer the reader to [AB03, Sec. 6] for a detailed discussion on the connections between classical ergodic theory of deterministic dynamical systems and ergodicity of Hamiltonians.
3. Unique ergodicity of Hamiltonians
In this section, we introduce the central concept of this article, namely unique ergodicity, which we first apply to arbitrary Hamiltonians.
Unique ergodicity is a property that originally applies to dynamical systems. Although its definition (existence of a unique invariant probability measure) cannot be readily extended to differential games or arbitrary Hamiltonians, its characterization in terms of long time averages of any continuous function along the trajectories makes this extension possible.
Alvarez and Bardi in [AB10] used this terminology of unique ergodicity and studied the property for two-player controlled systems. However, we mention that before this work, the property was already studied for controlled systems, without being given any explicit name (see for instance [Ari97, Ari98]).
3.1. Definition and characterization
Definition 3.1** (Uniquely ergodic Hamiltonian).**
Let be an Hamiltonian satisfying Items (i), (ii) and (iii) in A1. We say that is uniquely ergodic if, for every continuous and -periodic function , the perturbed Hamiltonian is ergodic, i.e., one (hence all) of the assertions in Theorem 2.3 holds with .
In the remainder, we denote by the space of continuous and -periodic real functions over .
We next give a characterization of unique ergodicity which is very similar to Proposition in [AB10] (as a matter of fact, most of the proof is borrowed from the latter reference, which we have chosen to reproduce for the sake of completeness). However, our result differs from the one of Alvarez and Bardi in two ways. First, it is not restricted to Hamiltonians associated with differential games but it applies to arbitrary Hamiltonians. Second, our definition of unique ergodicity is slightly more general, in the sense that we only need to consider perturbations of Hamiltonians of the form .
Theorem 3.2** (compare with [AB10, Prop. 2.3]).**
Let be an Hamiltonian satisfying A1. It is uniquely ergodic if and only if the following assertions hold:
- •
(Structural equicontinuity)* for every continuous and -periodic function , if denotes the solution of HJδ with the Hamiltonian , then the family is equicontinuous;*
- •
(Strong maximum principle)* the constant functions are the only continuous viscosity solutions of the PDE*
[TABLE]
where is the recession function of .
Proof.
Let us first assume that is uniquely ergodic. Let and, for , let be the solution of HJδ with the Hamiltonian . Since satisfies A1, the standard comparison principle for viscosity solutions holds. A first straightforward application of this principle yields that the family is uniformly bounded by . Then, using this fact and once again the comparison principle, we get that
[TABLE]
for all . Since the solution of HJδ is continuous, we further deduce that the function is continuous on . Together with the hypothesis that converges uniformly in to a constant when goes to 0, it entails the equicontinuity of .
To show that the second point (strong maximum principle) holds, let us consider any continuous viscosity solution of HJ∞. Fix and denote by the solution of HJδ with the Hamiltonian , i.e., the solution of
[TABLE]
Let us show that (where is the constant defined in Item (iv) of A1 for the Hamiltonian ) is a viscosity subsolution of 7. To that end, for any , let us consider any continuously differentiable function such that has a local maximum point at . Then the function has also a local maximum at , which implies that . The positive homogeneity of yields . We then have
[TABLE]
This inequality proves that is a viscosity subsolution of 7 at any point . Since is continuous, so is , and therefore the comparison principle applies, leading to . Similarly, we can show that is a viscosity supersolution of 7, hence that .
Since is uniquely ergodic, we know that converges to some constant when goes to [math]. Thus, passing to the limit in the latter inequalities, we get
[TABLE]
for all and all , which yields
[TABLE]
Letting goes to , we obtain that for all , hence that is constant. This concludes the necessary part of the proof.
We now prove the sufficient part. To that end, we assume that the structural equicontinuity property and that the strong maximum principle hold true. Let be any function in and let us denote by the solution of Equation HJδ with the Hamiltonian . We have already mentioned at the beginning of the proof that the family is uniformly bounded. Since it is also equicontinuous by hypothesis, the Arzelà-Ascoli theorem entails the existence of a subsequence that converges uniformly to some continuous and -periodic function .
Multiplying HJδ by , we get that the function solves in the equation
[TABLE]
with being -periodic. Since converges as goes to [math] to locally uniformly in , the stability property of viscosity solutions yields that the uniform limit is solution of HJ∞, hence constant since the strong maximum principle applies. We then deduce that Item (iii) of Theorem 2.3 is satisfied. Indeed the implication (i) (iii) remains true if, instead of the whole family , there is only a subsequence of that converges uniformly to a constant (for the details, see the proof of [AB03, Thm. 4]). Thus the Hamiltonian is ergodic which proves that is uniquely ergodic. ∎
With a straightforward adaption of the proof, which we leave to the reader, we can also get a sufficient condition of ergodicity with the following weaker hypothesis.
Proposition 3.3**.**
Let be an arbitrary Hamiltonian satisfying A1. If the family , where is the solution of HJδ, is equicontinuous and if the strong maximum principle holds, then is ergodic. ∎
Example 3.4*.*
Consider a differential game with state space in whose dynamics is defined for all by
[TABLE]
with . Then, as we shall see in the next section (see Example 3.6), for any payoff function satisfying A0, the family of value functions is equicontinuous. On the other hand, the recession operator of the Hamiltonian of the game is
[TABLE]
and we know that HJ∞ has a nonconstant solution if and only if (see e.g., [Car10]). Thus, the game is ergodic if is irrational.
3.2. Equicontinuity of
Theorems 3.2 and 3.3 tell us that (unique) ergodicity relies on two distinct properties. As we shall see in Section 4, the strong maximum principle is a qualitative feature of the underlying dynamical system, which can be systematically characterized. On the other hand, the (structural) equicontinuity property appears more difficult to apprehend and is rather related with quantitative aspects of the underlying dynamics (e.g., controllability assumptions with specific time estimates). We next review two sufficient conditions on any Hamiltonian that guarantee the equicontinuity of the family . Let us mention that for both conditions, the equicontinuity property is stable by perturbations of with functions , that is, equicontinuity is “structural” in the sense of Theorem 3.2.
The first of these conditions is a classic: it is well known that equicontinuity of holds if is coercive in the second variable, i.e., if
[TABLE]
uniformly in . More precisely, this property implies that the family is uniformly Lipschitz continuous. This yields in particular the existence of a corrector, that is, a solution to CP (see [LPV87]).
Secondly, the equicontinuity property holds if is uniformly continuous in , uniformly with respect to , i.e., if there exists a modulus of continuity such that
[TABLE]
for all and all .
Indeed, the equicontinuity of readily follows from the comparison principle, after noticing that and are respectively subsolution and supersolution of HJδ (see [Car10]).
Example 3.5*.*
Assume that , where the function is continuous and is continuous and -periodic. Then satisfies 8, hence the structural equicontinuity property holds.
Example 3.6*.*
Assume that is the Hamiltonian of a deterministic zero-sum differential game for which the function that controls the dynamics only depends on the control variables and not on the state, that is, for some continuous function and for all . Then writes
[TABLE]
and one can easily see that it satisfies condition 8 with modulus of continuity . Thus the structural equicontinuity property holds. Observe that if for all and some , then we recover as a special case the previous example.
4. Unique ergodicity of games via PDE approach
In the whole section, we fix a deterministic zero-sum differential game in its lower form, , which satisfies A0. We denote by the Hamiltonian of the game, defined in 4, and by its recession operator 6.
Since the values and of the game are characterized as viscosity solutions of Hamilton-Jacobi-Isaacs PDEs (Theorem 2.2), we can define the (unique) ergodicity of by applying the definitions to its Hamiltonian . This leads to the following definition.
Definition 4.1** (Ergodicity of differential games).**
The differential game is ergodic if the normalized value converges uniformly in to a constant when goes to [math] (or equivalently if converges uniformly to a constant when goes to ).
The game is uniquely ergodic if for every continuous and -periodic function , the perturbed game with running payoff , all other data being equal, is ergodic.
Thus, Theorem 3.2 or Proposition 3.3 already provides conditions for (unique) ergodicity. The purpose of this section is to give other conditions, which rely on the main tool of this article, namely dominions. We first introduce this concept, which only rely on the controlled system 1, and then use it to characterize the (unique) ergodicity property.
4.1. Dominions
Informally speaking, dominions are subsets of state that can be made approximately invariant by one player for an arbitrary period of time. This is an adaptation to the framework of differential games of a notion that was used to study zero-sum repeated games, played in discrete time (see in particular the companion works [AGH20, Hoc19]). However, as we will prove in Section 6, the notion coincides with the one of leadership domain and discriminating domain which appears in viability theory (see, e.g., [Car96]).
Before giving the formal definition of a dominion, and with the aim of simplifying the notation, let us further mention that we shall hereafter write , instead of , the solution of the controlled system 1 induced by a strategy of player 1 and a control of player 2. Also, we let be the distance of a point to a subset , that is,
[TABLE]
Definition 4.2** (Dominions).**
A dominion of the first player in the lower game is a nonempty closed set such that for every initial position in , player 1 can force the state to remain approximately in for any arbitrary period of time, meaning that
[TABLE]
Dominions for the second player are defined accordingly. Specifically, a dominion of player 2 in is a nonempty closed set such that
[TABLE]
The definition of dominions in the upper game is identical after switching the identity of the players. As we shall see in Section 6, when Isaacs’ condition holds, the definitions in the lower and the upper game coincide.
We next illustrate the notion of dominion with two examples. In the first one, Isaacs’ condition holds true, which allows us to choose for each player the more convenient definition. In the second example however, we provide a game for which the sets of dominions for each player are not the same in the lower and the upper form.
Example 4.3*.*
Consider the game already introduced in Example 3.4, whose controlled system is defined in by the function
[TABLE]
with (and with any payoff function satisfying A0). Let us observe that Isaacs’ condition holds true for :
[TABLE]
Hence, according to Remark 6.3 in Section 6, the dominions are the same in the lower and the upper game, and we can use for each player the simplest definition in order to describe them, namely for player 1: dominions as defined in ; for player 2: dominions as defined in . Following this observation, we can easily see that any line of the form
[TABLE]
with and is a dominion of player 1. Indeed, in the lower game, if she uses the strategy against all , then will be invariant for any initial point in it. Dually, any line of the form
[TABLE]
with is a dominion of player 2. Indeed, in the upper game, he can choose the strategy against all to ensure the invariance of .
Example 4.4*.*
Contrary to the latter example, let us now illustrate the situation in which Isaacs’ condition fails and the set of dominions of each player is not the same in the lower and the upper game. So, consider a differential game with state space in whose dynamics is defined for all by
[TABLE]
and the payoff function is any continuous function that is -periodic in . For such a game, we have whereas for all , which proves that Isaacs’ condition does not hold.
Then observe that in the lower game, any single point is a dominion of player 1 whereas the dominions of player 2 are all of the form . Symmetrically, the set of dominions of player 2 in the upper game contains any singleton, whereas the set of dominions for player 1 contains only intervals of the form . We further mention that the lower and the upper game are both uniquely ergodic, as we shall see with Theorem 4.13.
Before going on with ergodicity conditions, let us recall that the state space is essentially the -torus . However, the image of a closed set in is not necessarily closed, which is problematic when considering dominions. To illustrate this issue, think of the dominions and described in Example 4.3 when or are irrational, i.e., when their image in is dense. For this reason, we introduce the following definition of “dominion in the torus”. Note that we let be the quotient map.
Definition 4.5** (Dominion in the torus).**
A set is a dominion in the torus of some player if for some dominion of that player in .
Note that if is a dominion, then \pi^{-1}\big{(}\,\overline{\pi(D)}\,\big{)}=\overline{\pi^{-1}(\pi(D))} is also a dominion in . Furthermore, the latter set is -translation-invariant, meaning that for every x\in\pi^{-1}\big{(}\,\overline{\pi(D)}\,\big{)} and every , we have x+k\in\pi^{-1}\big{(}\,\overline{\pi(D)}\,\big{)}.
4.2. Necessary condition for unique ergodicity
We provide here a necessary condition for unique ergodicity involving dominions in the torus. The result is based on the very simple idea that a player will leverage one of his dominion if the payoff is more favorable on this dominion than in the rest of the states.
Proposition 4.6**.**
If the differential game is uniquely ergodic, then the intersection of every dominion of player 1 with every dominion of player 2 in the torus is nonempty, that is, for every dominion of player 1 and every dominion of player 2 in , we have
[TABLE]
To prove this result, we will need the following technical lemmas, which give an equivalent characterization of the dominions. In their statement, we denote by the set of points whose distance to a subset is not greater than , i.e.,
[TABLE]
Also, we denote by the indicator function of , defined by if and if . Let us further recall the following standard estimate on the trajectories of 1 (where ):
[TABLE]
for all , , and .
Lemma 4.7**.**
A nonempty closed set is a dominion of player 1 in if and only if for some (hence all) ,
[TABLE]
Proof.
We first assume that is a dominion of player 1 and fix some discount factor . Let and . For any horizon , there is a strategy of player 1 such that, for all controls of player 2 and all times , . So, for all we have
[TABLE]
Hence we get
[TABLE]
for all . Taking the limit as goes to , and since the integral is bounded above by , we finally get that
[TABLE]
We now assume that is not a dominion of player 1. Since it is nonempty, it means that there exist some , and such that for all strategies of player 1, player 2 can choose a control for which at some . Using the estimate 9 we deduce that
[TABLE]
hence . Note that, since , the estimate 9 necessarily implies .
For any we then have
[TABLE]
Thus
[TABLE]
which concludes the proof. ∎
With a minor adaptation of the proof, which we leave to the reader, we can show a dual characterization of dominions for the second player.
Lemma 4.8**.**
A nonempty closed set is a dominion of player 2 in if and only if for some (hence all) ,
[TABLE]
Remark 4.9*.*
We can also give a similar characterization of dominions replacing the discounted averages with the time averages
[TABLE]
We can now give the proof of the necessary condition for unique ergodicity.
Proof of Proposition 4.6.
We prove the contrapositive and, to this end, we suppose that there exist in a dominion of player 1, denoted , and a dominion of player 2, denoted , such that . Since the sets \pi^{-1}\big{(}\,\overline{\pi(D^{1\backslash 2})}\,\big{)} are also dominions in , we can assume without loss of generality that are -translation-invariant and that . So we can find such that and also have an empty intersection (recall that ). We then consider any function satisfying
[TABLE]
where equals if and any positive real otherwise. Thus, the function satisfies, for all ,
[TABLE]
Let be any discount factor. From the above inequalities, we deduce that for all , all strategies of player 1 and all controls of player 2,
[TABLE]
Let us denote by the unnormalized value of the discounted game with the perturbed running payoff . Taking the supremum over and then the infimum over in the latter inequalities, we deduce from Lemma 4.7 that for all , and from Lemma 4.8 that for all . Thus, if and , we have
[TABLE]
which proves that the perturbed game is not ergodic, hence that the game is not uniquely ergodic. ∎
Remark 4.10* (Comparison with one-player controlled systems).*
It is instructive to compare the latter necessary condition of unique ergodicity with the result of Arisawa in [Ari97], which deals with optimal control problems, i.e., problems for systems controlled by one player (who is minimizing and which we call player 1). In this paper, she proved that if the controlled system is uniquely ergodic, then there exists an ergodic attractor which satisfies the following properties.
- (P)
is closed, connected and positively invariant. 2. (D)
is nonempty and if and only if for any and any , there exists and such that and . 3. (A)
has the following time-averaged attracting property: for any neighborhood of and any ,
[TABLE]
For such controlled systems, if we introduce a second player as a dummy to cast the problem within the framework of two-player differential games, then it readily follows from the definition that the dominions of player 2 correspond to the nonempty closed and positively invariant sets (indeed, every positive orbit through any point in a dominion of player 2 is within any -neighborhood of the dominion for any arbitrary period of time). Let us observe that these sets are also dominions of player 1 and that the intersection of two dominions of player 2, if nonempty, is another dominion of player 2.
Then, applying Proposition 4.6, we deduce that if unique ergodicity holds, there is a unique minimal nonempty closed positively invariant set in the torus and that this set intersect every dominion of player 1 in the torus. We claim that this set is the ergodic attractor described in [Ari97] and that the two results are equivalent. Indeed it follows from the properties (P) and (D) that the ergodic attractor is the unique minimal dominion of player 2 (the uniqueness comes from the connectedness in (P) and the minimality from (D)) and property (A) implies that any dominion of player 1 cannot be disjoint from . Conversely, if is the unique minimal dominion of player 2 whose existence stems from Proposition 4.6, then property (P) is readily verified. Furthermore, its minimality implies that any point is approximately controllable to any other point . Then, since every dominion of player 1 meets , and particularly the closure of any positive orbit, we can show that property (D) holds. Finally using (P) and (D) we can then prove that (A) holds, as is done in [Ari97].
4.3. Sufficient condition for unique ergodicity
In this subsection, we give a sufficient condition of unique ergodicity which is derived from Theorem 3.2. We start with a lemma that relates the solutions of HJ∞ to dominions in .
Lemma 4.11**.**
Let be any continuous viscosity solution of HJ∞. Then is a dominion of player 1 in and is a dominion of player 2.
Proof.
Let us first consider the differential game with the same definition as except for the payoff function which is replaced with . The Hamiltonian associated to this game is and since, for any , the function is solution to HJδ with the latter Hamiltonian, we deduce from Theorem 2.2 that it is the (unnormalized) value of the infinite-horizon discounted game. Thus, for all points in and all positive factors we have
[TABLE]
Now set and let us assume, without loss of generality, that . Also, since the case with constant is trivial, we can assume that . In view of Lemma 4.7, we fix arbitrary positive constants and . Again, if is the whole space , then the equality in Lemma 4.7 trivially holds, so we assume that is small enough so that . Then, denoting by the infimum of on the complement of , which is necessarily positive, we can write
[TABLE]
for all . By plugging this inequality into the right-hand side of 11, we obtain for all
[TABLE]
After simplification, this yields, for all ,
[TABLE]
Since the converse inequality is obviously true, we deduce that there is in fact equality and thus, by Lemma 4.7, that is a dominion of player 1.
With very similar arguments and using Lemma 4.8 instead of Lemma 4.7, we can show that is a dominion of player 2. ∎
We know that if the value function converges uniformly to some function then it is solution to HJ∞. This entails the following corollary
Corollary 4.12**.**
Assume that the value function of the game converges uniformly to some function . Then and are dominions of player 1 and player 2, respectively.
A straightforward consequence is that if and have a nonempty intersection, then is constant and the game is ergodic. We can extend this result to unique ergodicity with the help of Theorem 3.2 and thus provide a converse to Proposition 4.6.
Theorem 4.13**.**
Assume that in the differential game , the intersection of every dominion of player 1 with every dominion of player 2 in the torus is nonempty. Then the strong maximum principle (see Theorem 3.2) holds, i.e., the constant functions are the only solutions of HJ∞.
If, moreover, the structural equicontinuity property is true, then is uniquely ergodic if and only if the two players do not have disjoint dominions in the torus.
Proof.
Let be any solution of HJ∞. Let and . Since is -periodic and continuous, it passes to the quotient into a continuous map on the torus whose minimum (resp., maximum) is attained on (resp., ). Hence, are necessarily closed and we have D^{1\backslash 2}=\pi^{-1}\big{(}\,\overline{\pi(D^{1\backslash 2})}\,\big{)}. Using now Lemma 4.11, we deduce that hence is nonempty. So is constant.
The rest of the proof follows from Proposition 4.6 and Theorem 3.2. ∎
Note that if the controlled system 1 is Lipschitz continuous, meaning that there is a positive constant for which
[TABLE]
then the family is equi-Lipschitz for any payoff function . In that case we can use the latter theorem to characterize unique ergodicity in terms of dominions. This is in particular the case if the function does not depend on the state variable.
Example 4.14*.*
Let us go back to the game introduced in Examples 3.4 and 4.3, whose dynamics is defined in by the function
[TABLE]
with and whose payoff function is any function satisfying A0. We already mentioned that the family of value functions is equicontinuous (see Example 3.6 or the above remark). Hence the structural equicontinuity property holds.
If is a rational number then, for any , the lines
[TABLE]
are dominions of player 1 and player 2, respectively, and their quotient images in the torus are closed and disjoint for suitable and . Thus, according to Theorem 4.13, the game is not uniquely ergodic.
Assume now that is not a rational number and consider in any dominions and of player 1 and player 2, respectively. We next show that their intersection in the torus is not empty. Let us fix two points, and , in these dominions. By definition, given and , player 1 has a strategy such that for every action of player 2, we have for all . In particular, if is the constant control equal to , then we have
[TABLE]
that is, the (continuous) trajectory of the dynamical system has the property that is included in the cone for all (see Figure 1). Likewise, with the same and , player 2 has a strategy such that for all and all , and if player 1 chooses the constant control equal to , then we have
[TABLE]
that is, the trajectory of the system is such that is included in the cone for all (see Figure 1).
Then, the parameter being fixed, either there is some time such that the images in the torus of the two trajectories mentioned above intersect on the time interval at some point , or for all times their images always remain disjoint, which is possible only if they are contained in the parallel half-lines starting in and , respectively, and directed by the vector . Indeed, since , the images of these half-lines in the torus are dense, and therefore any deviation of a trajectory from one of these half-lines eventually leads to the intersection of the two trajectories.
If there are only finitely many points as described above, then we deduce that and respectively contain the latter half-lines and therefore both dominions correspond to the trivial dominion in , composed of the whole state space. If there are infinitely many points , then any limit point is, by construction, contained in both and . In any case, we deduce that the players do not have disjoint dominions in the torus and so, according to Theorem 4.13, that the game is uniquely ergodic.
5. Unique ergodicity of games via controllability approach
In this section, as usual, we fix a deterministic zero-sum differential game in its lower form, , which satisfies A0. However, we assume that the controlled system 1 is not Lipschitz continuous (and in particular that ), so that equicontinuity of cannot be guaranteed. We also make the standard assumption that the payoff function is Lipschitz continuous in uniformly in , i.e., that there exists such that
[TABLE]
We then have the following classical regularity property of the value function.
Proposition 5.1** (see [BCD97, Ch. VIII, Prop. 1.8]).**
If is Lipschitz continuous in uniformly in , then, for any discount factor , the value function is Hölder continuous with exponent and constant independent of :
[TABLE]
In view of unique ergodicity, the requirement that the payoff function be uniformly Lipschitz continuous in prevents us from considering perturbations that only lie in . If we want to use the latter proposition (which we need to prove the main theorem of this section), we need to restrict the perturbations to the set of Lipschitz continuous and -periodic functions. Fortunately, this is not a major restriction. Indeed, in the proof of Proposition 4.6 it is possible to consider a perturbation function satisfying 10 and which is Lipschitz. Thus we have the following stronger result.
Proposition 5.2**.**
Assume that for every Lipschitz continuous and -periodic function from to , the perturbed differential game with payoff function is ergodic. Then, in the game , the players do not have disjoint dominions in . ∎
To compensate the lack of equicontinuity of we also need to introduce the following controllability assumption, which involves sets of points that are reachable by one player. Let us first describe precisely these sets.
Given any strategy of player 1, we define the reachable set from a point for player 2 by
[TABLE]
On the other hand, for all strategies of player 1, let us associate a control of player 2. Then, we define the reachable set from for player 1 by
[TABLE]
Furthermore, we say that the map is nonanticipating if for all and almost all implies that for almost all . That is, if and coincide almost surely on , then the same is true for and .
We emphasize that the purpose of the following assumption is only to provide a uniform bound on the time needed to get arbitrarily close to any reachable point. We further mention that the estimate is borrowed from [Ari98] (see also [Bet05]).
Assumption A2 (Uniform time estimate).
There exist constants and such that, for all ,
- •
for all , for all and all , there is a control and a time for which ;
- •
for all nonanticipating map , for all and all , there is a strategy and a time for which .
We can now give the condition for the (somewhat modified version of) unique ergodicity of differential games. Notice that in the proof of this result, we use the fact that the sets and are dominions of player 1 and player 2, respectively. We postpone the precise statement and the proof of this fact afterward.
Theorem 5.3**.**
In the differential game , suppose that A2 holds and that the payoff function is Lipschitz continuous in uniformly in . The following assertions are equivalent:
- (i)
for every function which is Lipschitz continuous in uniformly in and -periodic in , the modified game with running payoff is ergodic; 2. (ii)
for every Lipschitz continuous and -periodic function , the perturbed game with running payoff is ergodic; 3. (iii)
the players do not have disjoint dominions in the torus.
Proof.
The implication (i) (ii) is trivial and we already know from Proposition 5.2 that (ii) (iii). So we only need to prove that (iii) (i). And since the payoff function is arbitrary and assertion (iii) does not depend on it, if we prove that is ergodic, the result will be true for any other payoff function .
Let be any discount factor and let be a fixed positive real. Let be any points in . From the dynamic programming principle, there exists a strategy of player 1 (which depends only on , and ) such that
[TABLE]
for all times and all controls . Similarly, for all , there exists a control of player 2 (which depends only on , , and ) such that
[TABLE]
for all times . Furthermore, the map can be chosen nonanticipating, as defined above (indeed, for the controls to satisfy these conditions, we can chose them so that ).
Let and be the closures of the sets of reachable points from and by player 2 and player 1, respectively, being fixed the strategy and the nonanticipating map .
We know from subsequent Lemma 5.4 that these sets are respectively a dominion of player 1 and a dominion of player 2. Hence there exists a point z\in\pi^{-1}\big{(}\,\overline{\pi(D^{1})}\cap\overline{\pi(D^{2})}\,\big{)}. This implies that there are , and such that
[TABLE]
Moreover, A2 guarantees the existence of a control , a strategy and times such that
[TABLE]
Combining these inequalities, we get
[TABLE]
Since the inequalities 12 and 13 hold uniformly in , we can now write them at times and respectively, and then use the estimates that we have just established. We recall that, for small enough, the function is Hölder continuous with exponent and constant . We also recall that is -periodic. Let for simplicity. From 12 we get
[TABLE]
where we use the fact that and .
On the other hand, from 13 we get
[TABLE]
Here we use the fact that and .
Combining the two inequalities and letting , we obtain, for all ,
[TABLE]
Since , choosing such that with , we observe that the right-hand side of the latter inequality converges to zero as vanishes, which yields
[TABLE]
Since the points and are arbitrary and the bound in 14 does not depend on them, we deduce that
[TABLE]
uniformly in .
The rest of the proof is classical (see for instance [Ari98]), but one may also notice that the latter uniform limit together with the continuity of on (see the proof of Theorem 3.2) entails the equicontinuity of the family . We can then conclude with Propositions 3.3 and 4.13. ∎
In order to complete the proof and conclude the section, we prove the following.
Lemma 5.4**.**
Given a strategy of player 1, the topological closure of the reachable set from any point for player 2, , is a dominion of player 1.
Dually, given a map from to which is nonanticipating, the closure of the reachable set from for player 1, , is a dominion of player 2.
Proof.
We show in detail that is a dominion of player 1, and leave to the reader the details of the proof for , which follows the same lines. Nevertheless we will highlight the important changes.
First, for any point , we show that we can construct a strategy of player 1 such that for all controls of player 2 and all times . Indeed, there exist and such that . Then, for any control , let us introduce the control obtained by concatenating and in the following way:
[TABLE]
We further define the strategy of player 1 as follows: for all . It is straightforward to verify that is nonanticipating and, moreover, that for all , . Thus, for all we have .
Consider now and fix some and . There exists such that . Let be the strategy of player 1 defined above, which ensures that for all and . We have the following standard estimate on the trajectories of 1:
[TABLE]
from which we deduce that, for all ,
[TABLE]
Thus, for all and we have , which finally proves that is a dominion for player 1.
For the proof is identical, up to the changes in players’ role. The main difference concerns the construction, for any point and any strategy of player 1, of a control of player 2 such that for all . We next detail this construction. Let and be such that . Let us also define, for any , the control by . We then define a nonanticipating strategy as follows:
[TABLE]
If we set , one can check that for all (in particular we have because the map is nonanticipating and so for almost all ). Hence the result. ∎
6. Operator-theoretic characterization of dominions
In this final section, we characterize dominions in operator-theoretic terms. Thus, we show that the notion of dominion coincides with the one of leadership domain and discriminating domain which appears in viability theory555We mention that the notion of discriminating / leadership domain, hence of dominion, relates with the ones of B-set and approachability in repeated games with vector payoffs. Indeed, In [ASQS09], As Soulaimani, Quincampoix and Sorin proved that the B-sets for one player (which provide a sufficient condition for approachability) coincide with the discriminating domains for that player in an associated differential game. (see, e.g., [Car96]). This characterization stems from the similarities that exist between dominions on the one hand, and the interpretation of discriminating and leadership domains, on the other hand. Indeed, the latter, which are originally defined by means of inequalities involving , can also be characterized in terms of invariant dynamics (see, e.g., [Car96]). This correspondence between the two notions can be readily established for leadership domains and dominions of player 2 in the lower game (see Theorem 2.3, ibid.). As for the correspondence between discriminating domains and dominions of player 1, it is not as straightforward since the interpretation theorem (Theorem 2.1, ibid.) requires convexity properties. Such assumptions – typically, must be convex and , affine in – are commonly assumed in viability theory but are not needed here. Nevertheless, by adapting the proof of the latter result to our setting, we are able to show that dominions of the first player in can indeed be characterized as discriminating domains. We next state precisely these results.
To this end, we need to introduce the following definition. A vector is a proximal normal to a subset of at point if . We denote by the set of proximal normals to at . Note that, if we let be the set of projections of any point onto , i.e.,
[TABLE]
then the definition of a proximal normal implies that for every vector and every scalar , we have .
We now provide the operator-theoretic characterizations of dominions. The first one, for dominions of player 2 in , comes readily from the correspondence of the latter with leadership domains in viability theory.
Theorem 6.1** ([Car96, Thm. 2.3]).**
A nonempty closed set is a dominion of player 2 in the lower game if and only if
[TABLE]
We next give a similar characterization for dominions of player 1, which relates them with discriminating domains.
Theorem 6.2**.**
A nonempty closed set is a dominion of player 1 in the lower game if and only if
[TABLE]
Proof.
We first prove the necessary part and suppose that is a dominion of player 1. Toward a contradiction, let us assume that there exists a positive constant , some and some such that
[TABLE]
Since the function is upper semicontinuous and is compact, there exists an action such that
[TABLE]
Let be the constant control equal to , i.e., for all .
Since is a dominion of player 1, given and there exists a strategy such that for all . In order to simplify the notation, let . Then, for all , choosing any point in , the set of projections of on , we have
[TABLE]
where we use the fact that and that since .
On the other hand, for almost all we have
[TABLE]
To establish the last inequality, we used the estimate 9; the Lipschitz continuity of (with Lipschitz constant ); and 15. Let . After integrating the latter inequality we get, for all ,
[TABLE]
which, combined with 16, yields
[TABLE]
Note that to square 16, we need to assume that , which is possible because is different from [math] (otherwise 15 would not hold). In the latter inequality, the positive constants , and are fixed, whereas and are arbitrary. Hence, by choosing and rewriting 17 with we obtain
[TABLE]
which is a contradiction if is small enough. This concludes the proof of the necessary part.
We now prove the sufficient part and assume that for all points in and all proximal normals in , we have
[TABLE]
We then fix and positive constants and . Our aim is to construct recursively on the subintervals of a well-chosen partition of , a nonanticipating strategy of player 1 such that for all and all controls of player 2. The mesh of the partition (which shall depend only on , , and the data of the problem) will be chosen a posteriori, so we assume for now that it is fixed. Also, for any we shall fix a point in which we denote by .
We start by selecting an arbitrary element in and set for all and . Note that is obviously nonanticipating on , that is, for any controls that coincide almost everywhere on , we have for (almost) all .
Next we assume that has been defined on with and that it is nonanticipating on this interval. Given any control , if , then we set on . Otherwise, letting (for simplicity) and , we introduce the set-valued map defined from to by
[TABLE]
Let us observe that depends on the control only through . Thus, if two controls and are equal almost everywhere on , then and therefore they define the same set-valued map.
Since is continuous, is measurable and has closed values. Moreover, since by definition, 18 implies that the domain of is , i.e., is nonempty for all . Hence, according to the Measurable Selection Theorem (see [AF09, Thm. 8.1.3]), admits a measurable selection . Then we set for all . It is readily seen that is nonanticipating on , whence on after repeating the induction step until . For , we set for all .
To conclude the proof, it remains to show that on for every control of player 2. So we fix and let . We also let and . For all and for almost all we have
[TABLE]
To establish the latter inequality, we used the estimate 9 and the fact that either , in which case by definition of , or which implies . By integration we then obtain
[TABLE]
for all and . Grönwall’s inequality yields
[TABLE]
and thus
[TABLE]
If we apply the latter inequality to , we can use it to show by induction that, for all ,
[TABLE]
Combining now the last two inequalities, we deduce that for all and all ,
[TABLE]
Since and if , we finally get, for all ,
[TABLE]
The proof is complete once we have observed that we can choose the mesh of the partition, , depending only on , , and , so that the right-hand side in the latter inequality is lower than . ∎
Remark 6.3* (Dominions in the upper game and with Isaacs’ condition).*
Similar characterizations for the dominions in the upper game can be obtained after switching the identity of the players (the fact that one player is minimizing and the other maximizing does not come into account here). Thus, a nonempty closed set is a dominion of player 1 (resp., player 2) in if and only if
[TABLE]
As a consequence, the classical min-max inequality yields that a dominion of player 1 in is also a dominion in , and symmetrically, a dominion of player 2 in is also a dominion in (see Example 4.4 for an illustration of this situation). These observations are consistent with the fact that player 1 (resp., player 2) has more information in the lower game (resp., in the upper game), hence has an advantage in this game. Furthermore, if Isaacs’ condition 5 applies to , then the set of dominions for each player is the same in the lower and the upper game.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AB 03] O. Alvarez and M. Bardi, Singular perturbations of nonlinear degenerate parabolic PD Es: a general convergence result , Arch. Ration. Mech. Anal. 170 (2003), no. 1, 17–61.
- 2[AB 07] by same author, Ergodic problems in differential games , Advances in dynamic game theory, Ann. Internat. Soc. Dynam. Games, vol. 9, Birkhäuser Boston, Boston, MA, 2007, pp. 131–152.
- 3[AB 10] by same author, Ergodicity, stabilization, and singular perturbations for Bellman-Isaacs equations , Mem. Amer. Math. Soc. 204 (2010), no. 960, vi+77.
- 4[AF 09] J.-P. Aubin and H. Frankowska, Set-valued analysis , Modern Birkhäuser Classics, Birkhäuser Boston, Inc., Boston, MA, 2009, Reprint of the 1990 edition.
- 5[AGH 20] M. Akian, S. Gaubert, and A. Hochart, A game theory approach to the existence and uniqueness of nonlinear Perron-Frobenius eigenvectors , Discrete Contin. Dyn. Syst. 40 (2020), no. 1, 207–231.
- 6[Ari 97] M. Arisawa, Ergodic problem for the Hamilton-Jacobi-Bellman equation. I. Existence of the ergodic attractor , Ann. Inst. H. Poincaré Anal. Non Linéaire 14 (1997), no. 4, 415–438.
- 7[Ari 98] by same author, Ergodic problem for the Hamilton-Jacobi-Bellman equation. II , Ann. Inst. H. Poincaré Anal. Non Linéaire 15 (1998), no. 1, 1–24.
- 8[ASQS 09] S. As Soulaimani, M. Quincampoix, and S. Sorin, Repeated games and qualitative differential games: approachability and comparison of strategies , SIAM J. Control Optim. 48 (2009), no. 4, 2461–2479.
