Unique ergodicity of deterministic zero-sum differential games

Antoine Hochart

arXiv:1908.03643·math.OC·January 8, 2020·Dyn. Games Appl.

Unique ergodicity of deterministic zero-sum differential games

Antoine Hochart

PDF

TL;DR

This paper investigates the conditions under which deterministic zero-sum differential games exhibit unique ergodicity, characterized by the convergence of value functions, extending classical dynamical systems concepts.

Contribution

It provides necessary and sufficient conditions for unique ergodicity in such games, involving symmetric criteria and the concept of dominions.

Findings

01

Necessary and sufficient conditions for ergodicity are established.

02

The notion extends classical ergodicity to game-theoretic settings.

03

Conditions involve symmetric properties and invariant subsets called dominions.

Abstract

We study the ergodicity of deterministic two-person zero-sum differential games. This property is defined by the uniform convergence to a constant of either the infinite-horizon discounted value as the discount factor tends to zero, or equivalently, the averaged finite-horizon value as the time goes to infinity. We provide necessary and sufficient conditions for the unique ergodicity of a game. This notion extends the classical one for dynamical systems, namely when ergodicity holds with any (suitable) perturbation of the running payoff function. Our main condition is symmetric between the two players and involve dominions, i.e., subsets of states that one player can make approximately invariant.

Equations231

{\dot{X_{t}} = f (X_{t}, a_{t}, b_{t}), t > 0, X_{0} = x,

{\dot{X_{t}} = f (X_{t}, a_{t}, b_{t}), t > 0, X_{0} = x,

J_{δ} (x, a, b) = \int_{0}^{\infty} e^{- δ s} ℓ (X_{s}, a_{s}, b_{s}) d s

J_{δ} (x, a, b) = \int_{0}^{\infty} e^{- δ s} ℓ (X_{s}, a_{s}, b_{s}) d s

J (t, x, a, b) = \int_{0}^{t} ℓ (X_{s}, a_{s}, b_{s}) d s

J (t, x, a, b) = \int_{0}^{t} ℓ (X_{s}, a_{s}, b_{s}) d s

∣ f (x, a, b) - f (y, a, b) ∣ ⩽ L_{f} ∣ x - y ∣

∣ f (x, a, b) - f (y, a, b) ∣ ⩽ L_{f} ∣ x - y ∣

∣ ℓ (x, a, b) - ℓ (y, a, b) ∣ ⩽ ω_{ℓ} (∣ x - y ∣), \forall x, y \in R^{n}, \forall a \in A, \forall b \in B .

∣ ℓ (x, a, b) - ℓ (y, a, b) ∣ ⩽ ω_{ℓ} (∣ x - y ∣), \forall x, y \in R^{n}, \forall a \in A, \forall b \in B .

φ (x + k, a, b) = φ (x, a, b), \forall k \in Z^{n}, \forall x \in R^{n}, \forall a \in A, \forall b \in B .

φ (x + k, a, b) = φ (x, a, b), \forall k \in Z^{n}, \forall x \in R^{n}, \forall a \in A, \forall b \in B .

v_{δ}^{-} (x) = α \in A in f b \in B sup J_{δ} (x, α [b], b) .

v_{δ}^{-} (x) = α \in A in f b \in B sup J_{δ} (x, α [b], b) .

v_{δ}^{+} (x) = β \in B sup a \in A in f J_{δ} (x, a, β [a]) .

v_{δ}^{+} (x) = β \in B sup a \in A in f J_{δ} (x, a, β [a]) .

v^{-} (t, x) = α \in A in f b \in B sup J (t, x, α [b], b) and v^{+} (t, x) = β \in B sup a \in A in f J (t, x, a, β [a]) .

v^{-} (t, x) = α \in A in f b \in B sup J (t, x, α [b], b) and v^{+} (t, x) = β \in B sup a \in A in f J (t, x, a, β [a]) .

H(x,p)=H^{-}(x,p)=\min_{b\in B}\max_{a\in A}\big{\{}-\langle f(x,a,b),p\rangle-\ell(x,a,b)\big{\}},\quad x,p\in\mathbb{R}^{n},

H(x,p)=H^{-}(x,p)=\min_{b\in B}\max_{a\in A}\big{\{}-\langle f(x,a,b),p\rangle-\ell(x,a,b)\big{\}},\quad x,p\in\mathbb{R}^{n},

{δ u (x) + H (x, D u (x)) = 0, in R^{n}, u Z^{n} -periodic,

{δ u (x) + H (x, D u (x)) = 0, in R^{n}, u Z^{n} -periodic,

_{t}

_{t}

H^{+}(x,p)=\max_{a\in A}\min_{b\in B}\big{\{}-\langle f(x,a,b),p\rangle-\ell(x,a,b)\big{\}},\quad x,p\in\mathbb{R}^{n}.

H^{+}(x,p)=\max_{a\in A}\min_{b\in B}\big{\{}-\langle f(x,a,b),p\rangle-\ell(x,a,b)\big{\}},\quad x,p\in\mathbb{R}^{n}.

\max_{a\in A}\min_{b\in B}\big{\{}-\langle f(x,a,b),p\rangle-\ell(x,a,b)\big{\}}\\ =\min_{b\in B}\max_{a\in A}\big{\{}-\langle f(x,a,b),p\rangle-\ell(x,a,b)\big{\}},\quad\forall x,p\in\mathbb{R}^{n},

\max_{a\in A}\min_{b\in B}\big{\{}-\langle f(x,a,b),p\rangle-\ell(x,a,b)\big{\}}\\ =\min_{b\in B}\max_{a\in A}\big{\{}-\langle f(x,a,b),p\rangle-\ell(x,a,b)\big{\}},\quad\forall x,p\in\mathbb{R}^{n},

H (x + k, p) = H (x, p) .

H (x + k, p) = H (x, p) .

\left|{H(x,p)-H(y,p)}\right|\leqslant\omega\big{(}\left|{x-y}\right|(1+\left|{p}\right|)\big{)}.

\left|{H(x,p)-H(y,p)}\right|\leqslant\omega\big{(}\left|{x-y}\right|(1+\left|{p}\right|)\big{)}.

∣ H (x, p) - H_{\infty} (x, p) ∣ ⩽ M_{H} .

∣ H (x, p) - H_{\infty} (x, p) ∣ ⩽ M_{H} .

H_{\infty} (x, ν p) = ν H_{\infty} (x, p)

H_{\infty} (x, ν p) = ν H_{\infty} (x, p)

ν \to + \infty lim \frac{H ( x , ν p )}{ν} = H_{\infty} (x, p)

ν \to + \infty lim \frac{H ( x , ν p )}{ν} = H_{\infty} (x, p)

H_{\infty}(x,p)=\min_{b\in B}\max_{a\in A}\big{\{}-\langle f(x,a,b),p\rangle\big{\}},\quad x,p\in\mathbb{R}^{n}.

H_{\infty}(x,p)=\min_{b\in B}\max_{a\in A}\big{\{}-\langle f(x,a,b),p\rangle\big{\}},\quad x,p\in\mathbb{R}^{n}.

{c + H (x, D w (x)) = 0, in R^{n}, w Z^{n} -periodic .

{c + H (x, D w (x)) = 0, in R^{n}, w Z^{n} -periodic .

sup {c \in R ∣ there is an u.s.c. subsolution of \lx@cref refnum eq:cell-problem} = λ_{3} = in f {c \in R ∣ there is a l.s.c. supersolution of \lx@cref refnum eq:cell-problem} .

sup {c \in R ∣ there is an u.s.c. subsolution of \lx@cref refnum eq:cell-problem} = λ_{3} = in f {c \in R ∣ there is a l.s.c. supersolution of \lx@cref refnum eq:cell-problem} .

{H_{\infty} (x, D w (x)) = 0, in R^{n}, w Z^{n} -periodic,

{H_{\infty} (x, D w (x)) = 0, in R^{n}, w Z^{n} -periodic,

∥ u_{δ} - u_{δ^{'}} ∥_{\infty} ⩽ M_{g} \frac{1}{δ} - \frac{1}{δ ^{'}}

∥ u_{δ} - u_{δ^{'}} ∥_{\infty} ⩽ M_{g} \frac{1}{δ} - \frac{1}{δ ^{'}}

{δ u (x) - ρw (x) + H (x, D u (x)) = 0, in R^{n}, u Z^{n} -periodic .

{δ u (x) - ρw (x) + H (x, D u (x)) = 0, in R^{n}, u Z^{n} -periodic .

δ w_{δ}^{ρ} (x) - ρw (x) + H (x, D φ (x)) = - M_{H} + H (x, D φ (x)) ⩽ H_{\infty} (x, D φ (x)) ⩽ 0.

δ w_{δ}^{ρ} (x) - ρw (x) + H (x, D φ (x)) = - M_{H} + H (x, D φ (x)) ⩽ H_{\infty} (x, D φ (x)) ⩽ 0.

ρw (x) - M_{H} ⩽ λ_{ρ} ⩽ ρw (y) + M_{H}

ρw (x) - M_{H} ⩽ λ_{ρ} ⩽ ρw (y) + M_{H}

w (x) - w (y) ⩽ \frac{2 M _{H}}{ρ} .

w (x) - w (y) ⩽ \frac{2 M _{H}}{ρ} .

δ u (x) + δ g (x) + δ H (x, δ^{- 1} D u (x)) = 0

δ u (x) + δ g (x) + δ H (x, δ^{- 1} D u (x)) = 0

f (x, a, b) = (a γ b), a, b \in [- 1, 1],

f (x, a, b) = (a γ b), a, b \in [- 1, 1],

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Unique ergodicity of

deterministic zero-sum differential games

Antoine Hochart

Facultad de Ingeniería y Ciencia, Universidad Adolfo Ibáñez, Diagonal Las Torres 2640, Santiago, Chile

[email protected]

(Date: January 7, 2020)

Abstract.

We study the ergodicity of deterministic two-person zero-sum differential games. This property is defined by the uniform convergence to a constant of either the infinite-horizon discounted value as the discount factor tends to zero, or equivalently, the averaged finite-horizon value as the time goes to infinity. We provide necessary and sufficient conditions for the unique ergodicity of a game. This notion extends the classical one for dynamical systems, namely when ergodicity holds with any (suitable) perturbation of the running payoff function. Our main condition is symmetric between the two players and involve dominions, i.e., subsets of states that one player can make approximately invariant.

Key words and phrases:

Differential games, Hamilton-Jacobi equations, viscosity solutions, ergodicity, limit value

2010 Mathematics Subject Classification:

Primary: 91A23, 49N70; Secondary: 37A99, 49L25, 35F21, 35B40.

The author is supported by FONDECYT grant 3180662.

1. Introduction

We study the ergodic problem for deterministic two-player zero-sum differential games. Such games are defined by a nonlinear system in $\mathbb{R}^{n}$ controlled by two players,

[TABLE]

where the first player chooses the actions $a_{t}\in A$ and the second player, the actions $b_{t}\in B$ . Given a continuous and bounded payoff function $\ell$ , player 1 intends to minimize one of the following payoff functionals, whereas player 2 intends to maximize it:

[TABLE]

in the infinite-horizon discounted game, or

[TABLE]

in the game played in finite horizon $t$ . We assume that the data are $\mathbb{Z}^{n}$ -periodic in the state variable $x\in\mathbb{R}^{n}$ so that the state space can be identify with the $n$ -torus $\mathbb{R}^{n}/\mathbb{Z}^{n}$ . We also restrict our study to the lower game, in which player 1 adapts her control to player 2’s actions, but note that all the results can be readily adapted to the upper game or the situation in which the classical Isaacs condition holds.

The value of the discounted (lower) game and the one of the finite-horizon (lower) game, denoted respectively by $v_{\delta}(x)$ and $v(t,x)$ , are the payoffs at equilibrium and can be characterized as the viscosity solutions of, respectively, a stationary Hamilton-Jacobi PDE and an evolutionary Hamilton-Jacobi PDE involving the Hamiltonian of the (lower) game.

The ergodic problem for zero-sum differential games or for Hamilton-Jacobi equations, its PDE counterpart, concerns the asymptotic behavior of the value functions $v_{\delta}(x)$ and $v(t,x)$ . More precisely, it deals with the uniform convergence toward a constant of $\delta v_{\delta}(x)$ when the discount factor $\delta$ goes to zero, and of $v(t,x)/t$ when the horizon $t$ goes to infinity. The problem has been much studied since the seminal work of Lions, Papanicolaou and Varadhan [LPV87]. For optimal control (i.e., one-player) problems, let us mention the work of Arisawa [Ari97, Ari98] and for two-player games, the one of Alvarez and Bardi [AB03, AB07] or Cardaliaguet [Car10]. More recently, the ergodic control problem has been studied by Quincampoix and Renault [QR11], Gaitsgory and Quincampoix [GQ13], Cannarsa and Quincampoix [CQ15] or Buckdahn, Quincampoix and Renault [BQR15], for situations in which the limit value is not necessarily constant with respect to the initial state. Let us further mention the work of Khlopin [Khl18] on Abelian-Tauberian properties, or Ziliotto [Zil17, Zil19] on counterexamples to the convergence of the values, which illustrate the connection between the discrete setting (i.e., repeated games) and the continuous setting (which we consider here)111Note that the counterexample to Hamilton-Jacobi homogenization given in [Zil17] has been preceded by counterexamples for the convergence of the value of repeated games given by Vigeral [Vig13] and Ziliotto [Zil16]..

An important problem is then to characterize the differential games which are ergodic. Typical results require that the nonlinear system (or a subsystem, if it is decomposable) be uniformly controllable by one player, that is, any point $x$ is controllable to any other point $y$ by this player, either exactly or approximately, asymptotically or in bounded time (see e.g., [Ari98, Bet05, AB07]). Such conditions are independent of the payoff function $\ell$ and thus imply that the game is in fact uniquely ergodic. The latter notion, which was originally defined for dynamical systems, readily extends to differential games: a game is uniquely ergodic if it is ergodic for all perturbations of the payoff function $\ell$ that only depend on the state variable. In [Ari97], Arisawa showed that a converse property holds for systems controlled by one player and proved the existence of an ergodic attractor when unique ergodicity holds. But for two-player games, these controllability conditions totally lack symmetry, focusing only on one player.

The purpose of this article is to study the unique ergodicity property for differential games. We introduce a “dominion condition” which is in essence symmetrical between the two players. Each dominion is associated with one player and, roughly speaking, corresponds to a nonempty subset of states that this player can make approximately invariant for the dynamics. We show that if a game is uniquely ergodic, then the players do not have disjoint dominions. To prove this result, we use an Hamilton-Jacobi PDE approach. Under specific controllability assumptions (independence of $f$ with respect to the state variable or uniform time estimates on the dynamics) we further prove that the “dominion condition” is in fact equivalent to unique ergodicity. Thus our results generalize the unique ergodicity property of dynamical systems222We refer the reader to [AB03, Sec. 6.1] for the connections between classical ergodic theory and ergodicity of games or Hamiltonians., as well as the analysis of Arisawa in [Ari97, Ari98] for optimal control problems. In particular, let us observe that if a system is uniformly controllable by one player, then, whatever assumptions are made on the controllability (asymptotic or bounded time, exact or approximate), it implies that the other player has a unique trivial dominion, namely the whole state space, and so that the “dominion condition” trivially holds.

We finally mention that the notion of dominion coincides with the ones of leadership domain and discriminating domain in viability theory (see e.g., [Car96]), and therefore also relates with the notions of B-set and approachability in repeated games with vector payoffs, as shown by As Soulaimani, Quincampoix and Sorin in [ASQS09]. However, the ideas developed in this article were first inspired by the study of the ergodic problem for zero-sum repeated games, i.e., games played in discrete time (see the companion articles [Hoc19] and [AGH20]). In order to remain consistent with the latter work, we have chosen to use the term “dominion” instead of “domain”, although the two terms could be interchanged.

The paper is organized as follows. Section 2 is dedicated to preliminaries on differential games, their value functions and the Hamilton-Jacobi PDE approach to ergodicity. This section only provides some notation and classical results. It can therefore be safely skipped by readers familiar with the subject. In Section 3, we introduce and study the unique ergodicity property for general Hamiltonians, that is, the property of an Hamiltonian to be ergodic for any suitable perturbation. This (slightly) generalizes a characterization by Alvarez and Bardi in [AB10]. In Section 4, we introduce the notion of dominion and study the unique ergodicity property for differential games following a PDE approach. In Section 5, we study the unique ergodicity of differential games relying only on a dynamical system approach. Finally in Section 6, we characterize dominions in operator-theoretic terms, which establishes the link with the notion of discriminating / leadership domain in viability theory.

2. Preliminaries

We introduce here some notation as well as standard definitions and results on differential games and their PDE approach. Readers familiar with the subject can safely skip the section.

2.1. Framework and standing assumptions

We start by describing the setting of deterministic two-player zero-sum differential games that we study in this article. Consider first the controlled nonlinear system 1 where the map $f$ is from $\mathbb{R}^{n}\times A\times B$ to $\mathbb{R}^{n}$ , with $A,B$ nonempty compact metric spaces. We assume throughout the paper that $f$ is continuous in all variables and Lipschitz continuous in the state variable, uniformly in the control variables, i.e., denoting by $\left|{\cdot}\right|$ the standard Euclidean norm,

[TABLE]

for some constant $L_{f}\geqslant 0$ and for all $x,y\in\mathbb{R}^{n}$ , $a\in A$ and $b\in B$ . Player 1 (resp., player 2) chooses a control $t\mapsto a_{t}$ (resp., $t\mapsto b_{t}$ ) in the set of Lebesgue measurable functions from $[0,+\infty)$ to $A$ (resp., $B$ ), which we denote by $\mathscr{A}$ (resp., $\mathscr{B}$ )333In order to simplify the notation, we shall equally denote by $a$ and $b$ single elements of $A$ and $B$ , respectively, and controls of player 1 and player 2, i.e., elements of $\mathscr{A}$ and $\mathscr{B}$ , respectively. The distinction should be clear from the context. The Cauchy-Lipschitz theorem implies that Equation 1 has a unique solution, which we denote by $X^{x,a,b}_{t}$ and for which the differential equation holds for almost all $t>0$ .

We are further given a bounded continuous payoff function $\ell:\mathbb{R}^{n}\times A\times B\to\mathbb{R}$ , where we let $M_{\ell}=\left\|\ell\right\|_{\infty}$ , the supremum norm of $\ell$ . Then, for any trajectory of the controlled system 1, we mainly consider in this article the discounted payoff functional 2, associated with the game played in infinite horizon with a discount factor $\delta>0$ on the running payoff. The objective of player 1 is to minimize the latter functional, whereas player 2 intends to maximize it. We shall also briefly mention the payoff functional 3 associated with the game played in a finite horizon $t>0$ .

Additionally to the classical conditions on $f$ and $\ell$ already mentioned above (and which we reproduce below), we make throughout the paper the following assumption. Before stating it, let us recall that a modulus of continuity is a nondecreasing function $\omega:[0,+\infty)\to[0,+\infty)$ , vanishing and continuous at [math], that is, such that $\lim_{r\to 0}\omega(r)=\omega(0)=0$ .

Assumption A0 (Standing assumption).

(i)

The function $f$ is continuous in all variables and uniformly Lipschitz continuous in the state variable, the function $\ell$ is bounded continuous, the action spaces $A$ and $B$ are nonempty compact sets. 2. (ii)

The payoff function $\ell$ is uniformly continuous with respect to the state variable, uniformly with respect to the control variables, i.e., there exists a modulus of continuity $\omega_{\ell}$ such that

[TABLE] 3. (iii)

The functions $f$ and $\ell$ are $\mathbb{Z}^{n}$ -periodic in the state variable, i.e., for $\varphi\in\{f,\ell\}$ ,

[TABLE]

Let us remark that Item (iii) implies that the state space can be identify with the $n$ -torus $\mathbb{R}^{n}/\mathbb{Z}^{n}$ . Although we shall work mostly in $\mathbb{R}^{n}$ , we draw the attention of the reader to the fact that sometimes, we will consider objects in the quotient space. Moreover, Item (iii) together with the continuity of $f$ entails the boundedness of this function. We therefore let $M_{f}=\left\|f\right\|_{\infty}$ .

2.2. Value functions and Hamilton-Jacobi PDEs

We introduce here the concept of value function and then characterize it in terms of viscosity solution of some Hamilton-Jacobi PDE. We keep the presentation to a minimum and refer the reader to the classical monograph [BCD97] for more details.

Let us start with the definition of nonanticipating strategies.

Definition 2.1 (Nonanticipating strategy).

A nonanticipating strategy for the first player is a map $\alpha:\mathscr{B}\to\mathscr{A}$ such that for any time $t>0$ and any controls $b^{1},b^{2}\in\mathscr{B}$ of player 2, if $b^{1}_{s}=b^{2}_{s}$ for almost all $s\leqslant t$ then $\alpha[b^{1}]_{s}=\alpha[b^{2}]_{s}$ for almost all $s\leqslant t$ . We denote by $\mathfrak{A}$ the set of nonanticipating strategies for player 1.

The set $\mathfrak{B}$ of nonanticipating strategies $\beta:\mathscr{A}\to\mathscr{B}$ for the second player is defined accordingly.

We then introduce the (unnormalized) value functions. When player 2 chooses a control $b\in\mathscr{B}$ and player 1 is allowed to adapt her response to this control, i.e., when she chooses a nonanticipating strategy $\alpha\in\mathfrak{A}$ , we are considering the lower game, which we denote by $\Gamma^{-}$ . The lower value function associated with the infinite-horizon discounted payoff functional is then defined by

[TABLE]

On the other hand, if player 1 is bound to choose a control $a\in\mathscr{A}$ to which player 2 can adapt by choosing a nonanticipating strategy $\beta\in\mathfrak{B}$ , then we are considering the upper game, denoted by $\Gamma^{+}$ , and the upper value function is given by

[TABLE]

When the game is played in a finite horizon $t>0$ , the value functions are defined similarly by, respectively,

[TABLE]

We always have $v^{-}_{\delta}(x)\leqslant v^{+}_{\delta}(x)$ (resp., $v^{-}(t,x)\leqslant v^{+}(t,x)$ ) and the differential game is said to have a value at state $x$ if there is equality. The latter holds under the classical Isaacs condition (which we recall at the end of the section). However, in this work, we do not need to make such an assumption: all the results presented in the article hold in the lower as well as in the upper game. Owing to the symmetry of $\Gamma^{-}$ and $\Gamma^{+}$ , we shall only consider from now on the lower game, and therefore drop the “-” superscript for simplicity of the notation. We leave to the reader the straightforward adaptation of the results to the upper game (or to the situation in which Isaacs’ condition holds).

We readily deduce from the above definitions that the two normalized value functions $x\mapsto\delta v_{\delta}(x)$ and $(t,x)\mapsto v(t,x)/t$ are bounded by $M_{\ell}$ and $\mathbb{Z}^{n}$ -periodic. It is also known that they are respectively continuous on $\mathbb{R}^{n}$ and Lipschitz continuous on $[0,T]\times\mathbb{R}^{n}$ for all times $T>0$ . Furthermore, they can be characterized as viscosity solutions444In this paper, the solutions of PDEs will always be in the continuous viscosity sense. of some PDEs, called Hamilton-Jacobi-Isaacs’ equations. These equations involve the (lower) Hamiltonian, defined by

[TABLE]

where $\langle\cdot,\cdot\rangle$ is the standard scalar product on $\mathbb{R}^{n}$ . The next result illustrates this fact. Note that, given any real function $(t,x)\mapsto\varphi(t,x)$ , we denote by $\partial_{t}\varphi$ its partial derivative with respect to the time variable $t$ , and by $D\varphi$ its gradient with respect to the state variable $x$ .

Theorem 2.2 (see [BCD97, Ch. III, Prop. 2.8, 3.5]).

Under A0, the value function $v_{\delta}$ is the unique continuous viscosity solution of the Hamilton-Jacobi PDE

[TABLE]

and the value function $(t,x)\mapsto v(t,x)$ is the unique continuous viscosity solution of the Hamilton-Jacobi PDE

[TABLE]

The upper value functions are characterized by the same PDEs after replacing the lower Hamiltonian $H$ with the upper Hamiltonian

[TABLE]

Consequently, if Isaacs’ condition holds, that is, if

[TABLE]

then the lower and the upper value functions are equal.

2.3. Ergodicity and PDE approach

In this article, we are interested in the asymptotic behavior of the value functions, that is, in the behavior of $v_{\delta}(x)$ as the discount factor $\delta$ goes to [math] (resp., in the behavior of $v(t,x)$ as the time horizon $t$ goes to $+\infty$ ). More specifically, we study the so-called ergodic problem, that is, the situation in which there exists a constant $\lambda\in\mathbb{R}$ such that the normalized value $\delta v_{\delta}(x)$ tends to $\lambda$ as $\delta$ goes to [math] (resp., $v(t,x)/t$ tends to $\lambda$ as $t$ goes to $+\infty$ ) uniformly in $x$ . This property is called ergodicity of the game.

Thanks to Theorem 2.2, the latter problem can be studied by a PDE approach. With this in mind, we shall sometimes consider arbitrary Hamiltonians $(x,p)\mapsto H(x,p)$ defined on $\mathbb{R}^{n}\times\mathbb{R}^{n}$ that satisfy the following properties. Note that these properties are inherited from the Hamiltonian defined in 4.

Assumption A1.

(i)

The Hamiltonian $H:\mathbb{R}^{n}\times\mathbb{R}^{n}\to\mathbb{R}$ is continuous. 2. (ii)

$H$ is $\mathbb{Z}^{n}$ -periodic in the first variable, i.e, for all $x,p\in\mathbb{R}^{n}$ and $k\in\mathbb{Z}^{n}$ ,

[TABLE] 3. (iii)

There is a modulus of continuity $\omega:[0,+\infty)\to[0,+\infty)$ such that, for all $x,y,p\in\mathbb{R}^{n}$ ,

[TABLE] 4. (iv)

There is a function $H_{\infty}:\mathbb{R}^{n}\times\mathbb{R}^{n}\to\mathbb{R}$ that is positively homogeneous of degree one in the second variable, and a constant $M_{H}\geqslant 0$ such that, for all $x,p\in\mathbb{R}^{n}$ ,

[TABLE]

Let us make few comments about these assumptions. First, Items (i), (ii) and (iii) imply that the PDEs HJδ and HJ ${}_{\text{t}}$ have a unique continuous viscosity solution. In particular, Item (iii) implies that the comparison principle for viscosity solutions holds. Second, the map $H_{\infty}$ introduced in Item (iv) is called the recession function of $H$ . The positive homogeneity of degree one means that

[TABLE]

for all $x,p\in\mathbb{R}^{n}$ and all $\nu>0$ . A consequence is that

[TABLE]

uniformly in $(x,p)$ , and so $H_{\infty}$ is necessarily unique, continuous and $\mathbb{Z}^{n}$ -periodic in the first variable. Let us observe that if $H$ is the Hamiltonian associated with the lower game $\Gamma^{-}$ , as defined in 4, then

[TABLE]

Following a PDE approach, the existence and the value of the ergodic constant $\lambda$ can be related with the viscosity solutions of the following cell problem:

[TABLE]

The next result explains this connection. In its statement, we abbreviate upper semicontinuous as u.s.c. and lower semicontinuous as l.s.c. Note that the result was shown in [AB03] for second-order Hamilton-Jacobi PDEs.

Theorem 2.3 ([AB03, Thm. 4]).

Let $H$ be an arbitrary Hamiltonian satisfying Items (i), (ii) and (iii) of A1. The following assertions are equivalent.

(i)

If $u_{\delta}$ is the solution of the stationary problem **HJδ***, then $\delta u_{\delta}(x)$ converges uniformly in $x$ to a constant $\lambda_{1}\in\mathbb{R}$ as $\delta$ goes to [math].* 2. (ii)

If $u$ is the solution of the Cauchy problem HJ ${}_{\text{t}}$ , then $u(t,x)/t$ converges uniformly in $x$ to a constant $\lambda_{2}\in\mathbb{R}$ as $t$ goes to $+\infty$ . 3. (iii)

There exists a constant $\lambda_{3}$ such that

[TABLE]

Moreover, if one of these assertions is true, then $\lambda_{1}=\lambda_{2}=\lambda_{3}$ .

When an arbitrary Hamiltonian $H$ satisfies one (hence all) of the above assertions, we say that it is ergodic. We refer the reader to [AB03, Sec. 6] for a detailed discussion on the connections between classical ergodic theory of deterministic dynamical systems and ergodicity of Hamiltonians.

3. Unique ergodicity of Hamiltonians

In this section, we introduce the central concept of this article, namely unique ergodicity, which we first apply to arbitrary Hamiltonians.

Unique ergodicity is a property that originally applies to dynamical systems. Although its definition (existence of a unique invariant probability measure) cannot be readily extended to differential games or arbitrary Hamiltonians, its characterization in terms of long time averages of any continuous function along the trajectories makes this extension possible.

Alvarez and Bardi in [AB10] used this terminology of unique ergodicity and studied the property for two-player controlled systems. However, we mention that before this work, the property was already studied for controlled systems, without being given any explicit name (see for instance [Ari97, Ari98]).

3.1. Definition and characterization

Definition 3.1 (Uniquely ergodic Hamiltonian).

Let $H:\mathbb{R}^{n}\times\mathbb{R}^{n}\to\mathbb{R}$ be an Hamiltonian satisfying Items (i), (ii) and (iii) in A1. We say that $H$ is uniquely ergodic if, for every continuous and $\mathbb{Z}^{n}$ -periodic function $g:\mathbb{R}^{n}\to\mathbb{R}$ , the perturbed Hamiltonian $g+H$ is ergodic, i.e., one (hence all) of the assertions in Theorem 2.3 holds with $g+H$ .

In the remainder, we denote by $\mathcal{C}^{0}_{\text{per}}(\mathbb{R}^{n})$ the space of continuous and $\mathbb{Z}^{n}$ -periodic real functions over $\mathbb{R}^{n}$ .

We next give a characterization of unique ergodicity which is very similar to Proposition $2.3$ in [AB10] (as a matter of fact, most of the proof is borrowed from the latter reference, which we have chosen to reproduce for the sake of completeness). However, our result differs from the one of Alvarez and Bardi in two ways. First, it is not restricted to Hamiltonians associated with differential games but it applies to arbitrary Hamiltonians. Second, our definition of unique ergodicity is slightly more general, in the sense that we only need to consider perturbations of Hamiltonians of the form $g\in\mathcal{C}^{0}_{\text{per}}(\mathbb{R}^{n})$ .

Theorem 3.2 (compare with [AB10, Prop. 2.3]).

Let $H:\mathbb{R}^{n}\times\mathbb{R}^{n}\to\mathbb{R}$ be an Hamiltonian satisfying A1. It is uniquely ergodic if and only if the following assertions hold:

•

(Structural equicontinuity)* for every continuous and $\mathbb{Z}^{n}$ -periodic function $g:\mathbb{R}^{n}\to\mathbb{R}$ , if $u_{\delta}$ denotes the solution of HJδ with the Hamiltonian $g+H$ , then the family $\{\delta u_{\delta}\}_{0<\delta\leqslant 1}$ is equicontinuous;*

•

(Strong maximum principle)* the constant functions are the only continuous viscosity solutions of the PDE*

[TABLE]

where $H_{\infty}$ is the recession function of $H$ .

Proof.

Let us first assume that $H$ is uniquely ergodic. Let $g\in\mathcal{C}^{0}_{\text{per}}(\mathbb{R}^{n})$ and, for $\delta\in(0,1]$ , let $u_{\delta}$ be the solution of HJδ with the Hamiltonian $g+H$ . Since $g+H$ satisfies A1, the standard comparison principle for viscosity solutions holds. A first straightforward application of this principle yields that the family $\{\delta u_{\delta}\}_{0<\delta\leqslant 1}$ is uniformly bounded by $M_{g}=\left\|g(\cdot)+H(\cdot,0)\right\|_{\infty}$ . Then, using this fact and once again the comparison principle, we get that

[TABLE]

for all $\delta,\delta^{\prime}\in(0,1]$ . Since the solution of HJδ is continuous, we further deduce that the function $(\delta,x)\mapsto\delta u_{\delta}(x)$ is continuous on $(0,1]\times\mathbb{R}^{n}$ . Together with the hypothesis that $\delta u_{\delta}(x)$ converges uniformly in $x$ to a constant when $\delta$ goes to 0, it entails the equicontinuity of $\{\delta u_{\delta}\}_{0<\delta\leqslant 1}$ .

To show that the second point (strong maximum principle) holds, let us consider any continuous viscosity solution $w$ of HJ∞. Fix $\rho>0$ and denote by $u^{\rho}_{\delta}$ the solution of HJδ with the Hamiltonian $-\rho w+H$ , i.e., the solution of

[TABLE]

Let us show that $w^{\rho}_{\delta}=\frac{1}{\delta}(\rho w-M_{H})$ (where $M_{H}$ is the constant defined in Item (iv) of A1 for the Hamiltonian $H$ ) is a viscosity subsolution of 7. To that end, for any $x\in\mathbb{R}^{n}$ , let us consider any continuously differentiable function $\varphi$ such that $w^{\rho}_{\delta}-\varphi$ has a local maximum point at $x$ . Then the function $w-\frac{\delta}{\rho}\varphi$ has also a local maximum at $x$ , which implies that $H_{\infty}(x,\frac{\delta}{\rho}D\varphi(x))\leqslant 0$ . The positive homogeneity of $H_{\infty}$ yields $H_{\infty}(x,D\varphi(x))\leqslant 0$ . We then have

[TABLE]

This inequality proves that $w^{\rho}_{\delta}$ is a viscosity subsolution of 7 at any point $x$ . Since $w$ is continuous, so is $w^{\rho}_{\delta}$ , and therefore the comparison principle applies, leading to $w^{\rho}_{\delta}=\frac{1}{\delta}(\rho w-M_{H})\leqslant u^{\rho}_{\delta}$ . Similarly, we can show that $\frac{1}{\delta}(\rho w+M_{H})$ is a viscosity supersolution of 7, hence that $u^{\rho}_{\delta}\leqslant\frac{1}{\delta}(\rho w+M_{H})$ .

Since $H$ is uniquely ergodic, we know that $\delta u^{\rho}_{\delta}$ converges to some constant $\lambda_{\rho}$ when $\delta$ goes to [math]. Thus, passing to the limit in the latter inequalities, we get

[TABLE]

for all $x,y\in\mathbb{R}^{n}$ and all $\rho>0$ , which yields

[TABLE]

Letting $\rho$ goes to $+\infty$ , we obtain that $w(x)-w(y)\leqslant 0$ for all $x,y\in\mathbb{R}^{n}$ , hence that $w$ is constant. This concludes the necessary part of the proof.

We now prove the sufficient part. To that end, we assume that the structural equicontinuity property and that the strong maximum principle hold true. Let $g$ be any function in $\mathcal{C}^{0}_{\text{per}}(\mathbb{R}^{n})$ and let us denote by $u_{\delta}$ the solution of Equation HJδ with the Hamiltonian $g+H$ . We have already mentioned at the beginning of the proof that the family $\{\delta u_{\delta}\}_{0<\delta\leqslant 1}$ is uniformly bounded. Since it is also equicontinuous by hypothesis, the Arzelà-Ascoli theorem entails the existence of a subsequence that converges uniformly to some continuous and $\mathbb{Z}^{n}$ -periodic function $w$ .

Multiplying HJδ by $\delta$ , we get that the function $\delta u_{\delta}$ solves in $\mathbb{R}^{n}$ the equation

[TABLE]

with $u$ being $\mathbb{Z}^{n}$ -periodic. Since $(x,r,p)\mapsto\delta r+\delta g(x)+\delta H(x,\delta^{-1}p)$ converges as $\delta$ goes to [math] to $(x,r,p)\mapsto H_{\infty}(x,p)$ locally uniformly in $\mathbb{R}^{n}\times\mathbb{R}\times\mathbb{R}^{n}$ , the stability property of viscosity solutions yields that the uniform limit $w$ is solution of HJ∞, hence constant since the strong maximum principle applies. We then deduce that Item (iii) of Theorem 2.3 is satisfied. Indeed the implication (i) $\Rightarrow$ (iii) remains true if, instead of the whole family $\{\delta u_{\delta}\}$ , there is only a subsequence of $\{\delta u_{\delta}\}$ that converges uniformly to a constant (for the details, see the proof of [AB03, Thm. 4]). Thus the Hamiltonian $g+H$ is ergodic which proves that $H$ is uniquely ergodic. ∎

With a straightforward adaption of the proof, which we leave to the reader, we can also get a sufficient condition of ergodicity with the following weaker hypothesis.

Proposition 3.3.

Let $H$ be an arbitrary Hamiltonian satisfying A1. If the family $\{\delta u_{\delta}\}_{0<\delta\leqslant 1}$ , where $u_{\delta}$ is the solution of HJδ, is equicontinuous and if the strong maximum principle holds, then $H$ is ergodic. ∎

*Example 3.4**.*

Consider a differential game with state space in $\mathbb{R}^{2}$ whose dynamics is defined for all $x\in\mathbb{R}^{2}$ by

[TABLE]

with $0<\gamma\leqslant 1$ . Then, as we shall see in the next section (see Example 3.6), for any payoff function $\ell$ satisfying A0, the family of value functions $\{\delta v_{\delta}\}_{0<\delta\leqslant 1}$ is equicontinuous. On the other hand, the recession operator of the Hamiltonian of the game is

[TABLE]

and we know that HJ∞ has a nonconstant solution if and only if $\gamma\in\mathbb{Q}$ (see e.g., [Car10]). Thus, the game is ergodic if $\gamma$ is irrational.

3.2. Equicontinuity of $\boldsymbol{\{\delta u_{\delta}\}}$

Theorems 3.2 and 3.3 tell us that (unique) ergodicity relies on two distinct properties. As we shall see in Section 4, the strong maximum principle is a qualitative feature of the underlying dynamical system, which can be systematically characterized. On the other hand, the (structural) equicontinuity property appears more difficult to apprehend and is rather related with quantitative aspects of the underlying dynamics (e.g., controllability assumptions with specific time estimates). We next review two sufficient conditions on any Hamiltonian $H$ that guarantee the equicontinuity of the family $\{\delta u_{\delta}\}$ . Let us mention that for both conditions, the equicontinuity property is stable by perturbations of $H$ with functions $g\in\mathcal{C}^{0}_{\text{per}}(\mathbb{R}^{n})$ , that is, equicontinuity is “structural” in the sense of Theorem 3.2.

The first of these conditions is a classic: it is well known that equicontinuity of $\{\delta u_{\delta}\}$ holds if $H$ is coercive in the second variable, i.e., if

[TABLE]

uniformly in $x$ . More precisely, this property implies that the family $\{u_{\delta}\}$ is uniformly Lipschitz continuous. This yields in particular the existence of a corrector, that is, a solution to CP (see [LPV87]).

Secondly, the equicontinuity property holds if $H$ is uniformly continuous in $x$ , uniformly with respect to $p$ , i.e., if there exists a modulus of continuity $\omega$ such that

[TABLE]

for all $x,y\in\mathbb{R}^{n}$ and all $p\in\mathbb{R}^{n}$ .

Indeed, the equicontinuity of $\{\delta u_{\delta}\}$ readily follows from the comparison principle, after noticing that $u_{\delta}(\cdot+h)-\delta^{-1}\omega(\left|{h}\right|)$ and $u_{\delta}(\cdot+h)+\delta^{-1}\omega(\left|{h}\right|)$ are respectively subsolution and supersolution of HJδ (see [Car10]).

*Example 3.5**.*

Assume that $H(x,p)=\widetilde{H}(p)-\tilde{\ell}(x)$ , where the function $\widetilde{H}:\mathbb{R}^{n}\to\mathbb{R}$ is continuous and $\tilde{\ell}:\mathbb{R}^{n}\to\mathbb{R}$ is continuous and $\mathbb{Z}^{n}$ -periodic. Then $H$ satisfies 8, hence the structural equicontinuity property holds.

*Example 3.6**.*

Assume that $H$ is the Hamiltonian of a deterministic zero-sum differential game $\Gamma^{-}$ for which the function $f$ that controls the dynamics only depends on the control variables and not on the state, that is, $f(x,a,b)=\tilde{f}(a,b)$ for some continuous function $\tilde{f}:A\times B\to\mathbb{R}^{n}$ and for all $x$ . Then $H$ writes

[TABLE]

and one can easily see that it satisfies condition 8 with modulus of continuity $\omega_{\ell}$ . Thus the structural equicontinuity property holds. Observe that if $\ell(x,a,b)=\tilde{\ell}(x)$ for all $x,a,b$ and some $\tilde{\ell}\in\mathcal{C}^{0}_{\text{per}}(\mathbb{R}^{n})$ , then we recover as a special case the previous example.

4. Unique ergodicity of games via PDE approach

In the whole section, we fix a deterministic zero-sum differential game in its lower form, $\Gamma^{-}$ , which satisfies A0. We denote by $H$ the Hamiltonian of the game, defined in 4, and by $H_{\infty}$ its recession operator 6.

Since the values $v_{\delta}(\cdot)$ and $v(t,\cdot)$ of the game $\Gamma^{-}$ are characterized as viscosity solutions of Hamilton-Jacobi-Isaacs PDEs (Theorem 2.2), we can define the (unique) ergodicity of $\Gamma^{-}$ by applying the definitions to its Hamiltonian $H$ . This leads to the following definition.

Definition 4.1 (Ergodicity of differential games).

The differential game $\Gamma^{-}$ is ergodic if the normalized value $\delta v_{\delta}(x)$ converges uniformly in $x$ to a constant when $\delta$ goes to [math] (or equivalently if $v(t,x)/t$ converges uniformly to a constant when $t$ goes to $+\infty$ ).

The game $\Gamma^{-}$ is uniquely ergodic if for every continuous and $\mathbb{Z}^{n}$ -periodic function $g:\mathbb{R}^{n}\to\mathbb{R}$ , the perturbed game with running payoff $(x,a,b)\mapsto\ell(x,a,b)+g(x)$ , all other data being equal, is ergodic.

Thus, Theorem 3.2 or Proposition 3.3 already provides conditions for (unique) ergodicity. The purpose of this section is to give other conditions, which rely on the main tool of this article, namely dominions. We first introduce this concept, which only rely on the controlled system 1, and then use it to characterize the (unique) ergodicity property.

4.1. Dominions

Informally speaking, dominions are subsets of state that can be made approximately invariant by one player for an arbitrary period of time. This is an adaptation to the framework of differential games of a notion that was used to study zero-sum repeated games, played in discrete time (see in particular the companion works [AGH20, Hoc19]). However, as we will prove in Section 6, the notion coincides with the one of leadership domain and discriminating domain which appears in viability theory (see, e.g., [Car96]).

Before giving the formal definition of a dominion, and with the aim of simplifying the notation, let us further mention that we shall hereafter write $X^{x,\alpha,b}_{t}$ , instead of $X^{x,\alpha{[b]},b}_{t}$ , the solution of the controlled system 1 induced by a strategy $\alpha\in\mathfrak{A}$ of player 1 and a control $b\in\mathscr{B}$ of player 2. Also, we let $\operatorname{dist}_{K}(x)$ be the distance of a point $x\in\mathbb{R}^{n}$ to a subset $K\subset\mathbb{R}^{n}$ , that is,

[TABLE]

Definition 4.2 (Dominions).

A dominion of the first player in the lower game $\Gamma^{-}$ is a nonempty closed set $D\subset\mathbb{R}^{n}$ such that for every initial position in $D$ , player 1 can force the state to remain approximately in $D$ for any arbitrary period of time, meaning that

[TABLE]

Dominions for the second player are defined accordingly. Specifically, a dominion of player 2 in $\Gamma^{-}$ is a nonempty closed set $D\subset\mathbb{R}^{n}$ such that

[TABLE]

The definition of dominions in the upper game $\Gamma^{+}$ is identical after switching the identity of the players. As we shall see in Section 6, when Isaacs’ condition holds, the definitions in the lower and the upper game coincide.

We next illustrate the notion of dominion with two examples. In the first one, Isaacs’ condition holds true, which allows us to choose for each player the more convenient definition. In the second example however, we provide a game for which the sets of dominions for each player are not the same in the lower and the upper form.

*Example 4.3**.*

Consider the game already introduced in Example 3.4, whose controlled system is defined in $\mathbb{R}^{2}$ by the function

[TABLE]

with $0<\gamma\leqslant 1$ (and with any payoff function $\ell$ satisfying A0). Let us observe that Isaacs’ condition holds true for $H_{\infty}$ :

[TABLE]

Hence, according to Remark 6.3 in Section 6, the dominions are the same in the lower and the upper game, and we can use for each player the simplest definition in order to describe them, namely for player 1: dominions as defined in $\Gamma^{-}$ ; for player 2: dominions as defined in $\Gamma^{+}$ . Following this observation, we can easily see that any line of the form

[TABLE]

with $x\in\mathbb{R}^{2}$ and $-1\leqslant\mu\gamma\leqslant 1$ is a dominion of player 1. Indeed, in the lower game, if she uses the strategy $\alpha[b]=\mu\gamma b$ against all $b\in\mathscr{B}$ , then $V^{1}_{\mu}$ will be invariant for any initial point in it. Dually, any line of the form

[TABLE]

with $-\gamma\leqslant\nu\leqslant\gamma$ is a dominion of player 2. Indeed, in the upper game, he can choose the strategy $\beta[a]=\frac{\nu}{\gamma}a$ against all $a\in\mathscr{A}$ to ensure the invariance of $V^{2}_{\nu}$ .

*Example 4.4**.*

Contrary to the latter example, let us now illustrate the situation in which Isaacs’ condition fails and the set of dominions of each player is not the same in the lower and the upper game. So, consider a differential game with state space in $\mathbb{R}$ whose dynamics is defined for all $x\in\mathbb{R}$ by

[TABLE]

and the payoff function is any continuous function that is $\mathbb{Z}$ -periodic in $x$ . For such a game, we have $H_{\infty}^{-}(x,p)=\max(0,-p)$ whereas $H_{\infty}^{+}(x,p)=\min(0,-p)$ for all $p\in\mathbb{R}$ , which proves that Isaacs’ condition does not hold.

Then observe that in the lower game, any single point is a dominion of player 1 whereas the dominions of player 2 are all of the form $[x,+\infty)$ . Symmetrically, the set of dominions of player 2 in the upper game contains any singleton, whereas the set of dominions for player 1 contains only intervals of the form $[x,+\infty)$ . We further mention that the lower and the upper game are both uniquely ergodic, as we shall see with Theorem 4.13.

Before going on with ergodicity conditions, let us recall that the state space is essentially the $n$ -torus $\mathbb{R}^{n}/\mathbb{Z}^{n}$ . However, the image of a closed set in $\mathbb{R}^{n}/\mathbb{Z}^{n}$ is not necessarily closed, which is problematic when considering dominions. To illustrate this issue, think of the dominions $V^{1}_{\mu}$ and $V^{2}_{\nu}$ described in Example 4.3 when $\mu$ or $\nu$ are irrational, i.e., when their image in $\mathbb{R}^{2}/\mathbb{Z}^{2}$ is dense. For this reason, we introduce the following definition of “dominion in the torus”. Note that we let $\pi:\mathbb{R}^{n}\to\mathbb{R}^{n}/\mathbb{Z}^{n}$ be the quotient map.

Definition 4.5 (Dominion in the torus).

A set $K\subset\mathbb{R}^{n}/\mathbb{Z}^{n}$ is a dominion in the torus of some player if $K=\overline{\pi(D)}$ for some dominion $D\subset\mathbb{R}^{n}$ of that player in $\Gamma^{-}$ .

Note that if $D\subset\mathbb{R}^{n}$ is a dominion, then $\pi^{-1}\big{(}\,\overline{\pi(D)}\,\big{)}=\overline{\pi^{-1}(\pi(D))}$ is also a dominion in $\mathbb{R}^{n}$ . Furthermore, the latter set is $\mathbb{Z}^{n}$ -translation-invariant, meaning that for every $x\in\pi^{-1}\big{(}\,\overline{\pi(D)}\,\big{)}$ and every $k\in\mathbb{Z}^{n}$ , we have $x+k\in\pi^{-1}\big{(}\,\overline{\pi(D)}\,\big{)}$ .

4.2. Necessary condition for unique ergodicity

We provide here a necessary condition for unique ergodicity involving dominions in the torus. The result is based on the very simple idea that a player will leverage one of his dominion if the payoff is more favorable on this dominion than in the rest of the states.

Proposition 4.6.

If the differential game $\Gamma^{-}$ is uniquely ergodic, then the intersection of every dominion of player 1 with every dominion of player 2 in the torus is nonempty, that is, for every dominion $D^{1}$ of player 1 and every dominion $D^{2}$ of player 2 in $\mathbb{R}^{n}$ , we have

[TABLE]

To prove this result, we will need the following technical lemmas, which give an equivalent characterization of the dominions. In their statement, we denote by $K_{\varepsilon}$ the set of points whose distance to a subset $K\subset\mathbb{R}^{n}$ is not greater than $\varepsilon>0$ , i.e.,

[TABLE]

Also, we denote by $\mathbf{1}_{K}$ the indicator function of $K$ , defined by $\mathbf{1}_{K}(x)=1$ if $x\in K$ and $\mathbf{1}_{K}(x)=0$ if $x\notin K$ . Let us further recall the following standard estimate on the trajectories of 1 (where $\left\|f\right\|_{\infty}=M_{f}$ ):

[TABLE]

for all $x,y\in\mathbb{R}^{n}$ , $a\in\mathscr{A}$ , $b\in\mathscr{B}$ and $t\geqslant 0$ .

Lemma 4.7.

A nonempty closed set $D\subset\mathbb{R}^{n}$ is a dominion of player 1 in $\Gamma^{-}$ if and only if for some (hence all) $\delta>0$ ,

[TABLE]

Proof.

We first assume that $D$ is a dominion of player 1 and fix some discount factor $\delta>0$ . Let $x\in D$ and $\varepsilon>0$ . For any horizon $T>0$ , there is a strategy $\bar{\alpha}\in\mathfrak{A}$ of player 1 such that, for all controls $b\in\mathscr{B}$ of player 2 and all times $t\in[0,T]$ , $X^{x,\bar{\alpha},b}_{t}\in D_{\varepsilon}$ . So, for all $b\in\mathscr{B}$ we have

[TABLE]

Hence we get

[TABLE]

for all $T>0$ . Taking the limit as $T$ goes to $+\infty$ , and since the integral is bounded above by $1$ , we finally get that

[TABLE]

We now assume that $D$ is not a dominion of player 1. Since it is nonempty, it means that there exist some $\bar{x}\in D$ , $\varepsilon>0$ and $T>0$ such that for all strategies $\alpha$ of player 1, player 2 can choose a control $b$ for which $X^{\bar{x},\alpha,b}_{t}\notin D_{2\varepsilon}$ at some $t\in[0,T]$ . Using the estimate 9 we deduce that

[TABLE]

hence $X^{\bar{x},\alpha,b}_{s}\notin D_{\varepsilon}$ . Note that, since $X^{\bar{x},\alpha,b}_{t}\notin D_{2\varepsilon}$ , the estimate 9 necessarily implies $t-\frac{\varepsilon}{M_{f}}>0$ .

For any $\delta>0$ we then have

[TABLE]

Thus

[TABLE]

which concludes the proof. ∎

With a minor adaptation of the proof, which we leave to the reader, we can show a dual characterization of dominions for the second player.

Lemma 4.8.

A nonempty closed set $D\subset\mathbb{R}^{n}$ is a dominion of player 2 in $\Gamma^{-}$ if and only if for some (hence all) $\delta>0$ ,

[TABLE]

*Remark 4.9**.*

We can also give a similar characterization of dominions replacing the discounted averages with the time averages

[TABLE]

We can now give the proof of the necessary condition for unique ergodicity.

Proof of Proposition 4.6.

We prove the contrapositive and, to this end, we suppose that there exist in $\mathbb{R}^{n}$ a dominion of player 1, denoted $D^{1}$ , and a dominion of player 2, denoted $D^{2}$ , such that $\overline{\pi(D^{1})}\cap\overline{\pi(D^{2})}=\emptyset$ . Since the sets $\pi^{-1}\big{(}\,\overline{\pi(D^{1\backslash 2})}\,\big{)}$ are also dominions in $\mathbb{R}^{n}$ , we can assume without loss of generality that $D^{1\backslash 2}$ are $\mathbb{Z}^{n}$ -translation-invariant and that $D^{1}\cap D^{2}=\emptyset$ . So we can find $\varepsilon>0$ such that $D^{1}_{\varepsilon}$ and $D^{2}_{\varepsilon}$ also have an empty intersection (recall that $D^{1\backslash 2}_{\varepsilon}=\{x\in\mathbb{R}^{n}\mid\operatorname{dist}_{D^{1\backslash 2}}(x)\leqslant\varepsilon\}$ ). We then consider any function $g\in\mathcal{C}^{0}_{\text{per}}(\mathbb{R}^{n})$ satisfying

[TABLE]

where $M_{\ell}$ equals $\left\|\ell\right\|_{\infty}$ if $\ell\neq 0$ and any positive real otherwise. Thus, the function $g$ satisfies, for all $x\in\mathbb{R}^{n}$ ,

[TABLE]

Let $\delta>0$ be any discount factor. From the above inequalities, we deduce that for all $x\in\mathbb{R}^{n}$ , all strategies $\alpha$ of player 1 and all controls $b$ of player 2,

[TABLE]

Let us denote by $v_{\delta}^{g}$ the unnormalized value of the discounted game with the perturbed running payoff $(x,a,b)\mapsto\ell(x,a,b)+g(x)$ . Taking the supremum over $b\in\mathscr{B}$ and then the infimum over $\alpha\in\mathfrak{A}$ in the latter inequalities, we deduce from Lemma 4.7 that $\delta v_{\delta}^{g}(x)\leqslant M_{\ell}$ for all $x\in D^{1}$ , and from Lemma 4.8 that $2M_{\ell}\leqslant\delta v_{\delta}^{g}(y)$ for all $y\in D^{2}$ . Thus, if $x\in D^{1}$ and $y\in D^{2}$ , we have

[TABLE]

which proves that the perturbed game is not ergodic, hence that the game $\Gamma^{-}$ is not uniquely ergodic. ∎

*Remark 4.10** (Comparison with one-player controlled systems).*

It is instructive to compare the latter necessary condition of unique ergodicity with the result of Arisawa in [Ari97], which deals with optimal control problems, i.e., problems for systems controlled by one player (who is minimizing and which we call player 1). In this paper, she proved that if the controlled system is uniquely ergodic, then there exists an ergodic attractor $D$ which satisfies the following properties.

(P)

$D$ is closed, connected and positively invariant. 2. (D)

$D$ is nonempty and $y\in D$ if and only if for any $x\in\mathbb{R}^{n}$ and any $\varepsilon>0$ , there exists $T_{\varepsilon}>0$ and $a_{\varepsilon}\in\mathscr{A}$ such that $\lim_{\varepsilon\to 0}T_{\varepsilon}=+\infty$ and $\left|{y-X^{x,a_{\varepsilon}}_{T_{\varepsilon}}}\right|<\varepsilon$ . 3. (A)

$D$ has the following time-averaged attracting property: for any neighborhood $U$ of $D$ and any $x\in\mathbb{R}^{n}$ ,

[TABLE]

For such controlled systems, if we introduce a second player as a dummy to cast the problem within the framework of two-player differential games, then it readily follows from the definition that the dominions of player 2 correspond to the nonempty closed and positively invariant sets (indeed, every positive orbit through any point in a dominion of player 2 is within any $\varepsilon$ -neighborhood of the dominion for any arbitrary period of time). Let us observe that these sets are also dominions of player 1 and that the intersection of two dominions of player 2, if nonempty, is another dominion of player 2.

Then, applying Proposition 4.6, we deduce that if unique ergodicity holds, there is a unique minimal nonempty closed positively invariant set in the torus and that this set intersect every dominion of player 1 in the torus. We claim that this set is the ergodic attractor $D$ described in [Ari97] and that the two results are equivalent. Indeed it follows from the properties (P) and (D) that the ergodic attractor $D$ is the unique minimal dominion of player 2 (the uniqueness comes from the connectedness in (P) and the minimality from (D)) and property (A) implies that any dominion of player 1 cannot be disjoint from $D$ . Conversely, if $D$ is the unique minimal dominion of player 2 whose existence stems from Proposition 4.6, then property (P) is readily verified. Furthermore, its minimality implies that any point $x\in D$ is approximately controllable to any other point $y\in D$ . Then, since every dominion of player 1 meets $D$ , and particularly the closure of any positive orbit, we can show that property (D) holds. Finally using (P) and (D) we can then prove that (A) holds, as is done in [Ari97].

4.3. Sufficient condition for unique ergodicity

In this subsection, we give a sufficient condition of unique ergodicity which is derived from Theorem 3.2. We start with a lemma that relates the solutions of HJ∞ to dominions in $\Gamma^{-}$ .

Lemma 4.11.

Let $w$ be any continuous viscosity solution of HJ∞. Then $\operatorname{arg\,min}w$ is a dominion of player 1 in $\Gamma^{-}$ and $\operatorname{arg\,max}w$ is a dominion of player 2.

Proof.

Let us first consider the differential game with the same definition as $\Gamma^{-}$ except for the payoff function $\ell$ which is replaced with $w$ . The Hamiltonian associated to this game is $H_{\infty}(x,p)-w(x)$ and since, for any $\delta>0$ , the function $\delta^{-1}w$ is solution to HJδ with the latter Hamiltonian, we deduce from Theorem 2.2 that it is the (unnormalized) value of the infinite-horizon discounted game. Thus, for all points $x$ in $\mathbb{R}^{n}$ and all positive factors $\delta$ we have

[TABLE]

Now set $D=\operatorname{arg\,min}w$ and let us assume, without loss of generality, that $\min w=0$ . Also, since the case with $w$ constant is trivial, we can assume that $D\neq\mathbb{R}^{n}$ . In view of Lemma 4.7, we fix arbitrary positive constants $\varepsilon$ and $\delta$ . Again, if $D_{\varepsilon}=\{x\in\mathbb{R}^{n}\mid\operatorname{dist}_{D}(x)\leqslant\varepsilon\}$ is the whole space $\mathbb{R}^{n}$ , then the equality in Lemma 4.7 trivially holds, so we assume that $\varepsilon$ is small enough so that $D_{\varepsilon}\neq\mathbb{R}^{n}$ . Then, denoting by $m_{\varepsilon}$ the infimum of $w$ on the complement of $D_{\varepsilon}$ , which is necessarily positive, we can write

[TABLE]

for all $x\in\mathbb{R}^{n}$ . By plugging this inequality into the right-hand side of 11, we obtain for all $x\in\mathbb{R}^{n}$

[TABLE]

After simplification, this yields, for all $x\in D$ ,

[TABLE]

Since the converse inequality is obviously true, we deduce that there is in fact equality and thus, by Lemma 4.7, that $D$ is a dominion of player 1.

With very similar arguments and using Lemma 4.8 instead of Lemma 4.7, we can show that $\operatorname{arg\,max}w$ is a dominion of player 2. ∎

We know that if the value function $\delta v_{\delta}$ converges uniformly to some function $v$ then it is solution to HJ∞. This entails the following corollary

Corollary 4.12.

Assume that the value function $\delta v_{\delta}$ of the game $\Gamma^{-}$ converges uniformly to some function $v$ . Then $\operatorname{arg\,min}v$ and $\operatorname{arg\,max}v$ are dominions of player 1 and player 2, respectively.

A straightforward consequence is that if $\operatorname{arg\,min}v$ and $\operatorname{arg\,max}v$ have a nonempty intersection, then $v$ is constant and the game is ergodic. We can extend this result to unique ergodicity with the help of Theorem 3.2 and thus provide a converse to Proposition 4.6.

Theorem 4.13.

Assume that in the differential game $\Gamma^{-}$ , the intersection of every dominion of player 1 with every dominion of player 2 in the torus is nonempty. Then the strong maximum principle (see Theorem 3.2) holds, i.e., the constant functions are the only solutions of HJ∞.

If, moreover, the structural equicontinuity property is true, then $\Gamma^{-}$ is uniquely ergodic if and only if the two players do not have disjoint dominions in the torus.

Proof.

Let $w$ be any solution of HJ∞. Let $D^{1}=\operatorname{arg\,min}w$ and $D^{2}=\operatorname{arg\,max}w$ . Since $w$ is $\mathbb{Z}^{n}$ -periodic and continuous, it passes to the quotient into a continuous map on the torus whose minimum (resp., maximum) is attained on $\pi(D^{1})$ (resp., $\pi(D^{2})$ ). Hence, $\pi(D^{1\backslash 2})$ are necessarily closed and we have $D^{1\backslash 2}=\pi^{-1}\big{(}\,\overline{\pi(D^{1\backslash 2})}\,\big{)}$ . Using now Lemma 4.11, we deduce that $\overline{\pi(D^{1})}\cap\overline{\pi(D^{2})}$ hence $D^{1}\cap D^{2}$ is nonempty. So $w$ is constant.

The rest of the proof follows from Proposition 4.6 and Theorem 3.2. ∎

Note that if the controlled system 1 is Lipschitz continuous, meaning that there is a positive constant $L$ for which

[TABLE]

then the family $\{\delta v_{\delta}\}$ is equi-Lipschitz for any payoff function $\ell$ . In that case we can use the latter theorem to characterize unique ergodicity in terms of dominions. This is in particular the case if the function $f$ does not depend on the state variable.

*Example 4.14**.*

Let us go back to the game introduced in Examples 3.4 and 4.3, whose dynamics is defined in $\mathbb{R}^{2}$ by the function

[TABLE]

with $0<\gamma\leqslant 1$ and whose payoff function $\ell$ is any function satisfying A0. We already mentioned that the family of value functions $\{\delta v_{\delta}\}_{0<\delta\leqslant 1}$ is equicontinuous (see Example 3.6 or the above remark). Hence the structural equicontinuity property holds.

If $\gamma$ is a rational number then, for any $x,y\in\mathbb{R}^{2}$ , the lines

[TABLE]

are dominions of player 1 and player 2, respectively, and their quotient images in the torus $\mathbb{R}^{2}/\mathbb{Z}^{2}$ are closed and disjoint for suitable $x$ and $y$ . Thus, according to Theorem 4.13, the game is not uniquely ergodic.

Assume now that $\gamma$ is not a rational number and consider in $\mathbb{R}^{2}$ any dominions $D^{1}$ and $D^{2}$ of player 1 and player 2, respectively. We next show that their intersection in the torus is not empty. Let us fix two points, $x\in D^{1}$ and $y\in D^{2}$ , in these dominions. By definition, given $\varepsilon>0$ and $T>0$ , player 1 has a strategy $\alpha_{\varepsilon}\in\mathfrak{A}$ such that for every action $b\in\mathscr{B}$ of player 2, we have $\operatorname{dist}_{D^{1}}(X^{x,\alpha_{\varepsilon},b}_{t})\leqslant\varepsilon$ for all $t\in[0,T]$ . In particular, if $b$ is the constant control equal to $1$ , then we have

[TABLE]

that is, the (continuous) trajectory of the dynamical system has the property that $X^{x,\alpha_{\varepsilon},b}_{t}-X^{x,\alpha_{\varepsilon},b}_{s}$ is included in the cone $C_{\gamma}^{1}=\{z=(z_{1},z_{2})^{\intercal}\in\mathbb{R}^{2}\mid-z_{2}\leqslant\gamma z_{1}\leqslant z_{2}\}$ for all $0\leqslant s\leqslant t\leqslant T$ (see Figure 1). Likewise, with the same $\varepsilon$ and $T$ , player 2 has a strategy $\beta_{\varepsilon}\in\mathfrak{B}$ such that $\operatorname{dist}_{D^{2}}(X^{y,a,\beta_{\varepsilon}}_{t})\leqslant\varepsilon$ for all $t\in[0,T]$ and all $a\in\mathscr{A}$ , and if player 1 chooses the constant control equal to $1$ , then we have

[TABLE]

that is, the trajectory of the system is such that $X^{y,a,\beta_{\varepsilon}}_{t}-X^{y,a,\beta_{\varepsilon}}_{s}$ is included in the cone $C_{\gamma}^{2}=\{z=(z_{1},z_{2})^{\intercal}\in\mathbb{R}^{2}\mid-\gamma z_{1}\leqslant z_{2}\leqslant\gamma z_{1}\}$ for all $0\leqslant s\leqslant t\leqslant T$ (see Figure 1).

Then, the parameter $\varepsilon$ being fixed, either there is some time $T$ such that the images in the torus $\mathbb{R}^{2}/\mathbb{Z}^{2}$ of the two trajectories mentioned above intersect on the time interval $[0,T]$ at some point $z_{\varepsilon}\in\mathbb{R}^{2}/\mathbb{Z}^{2}$ , or for all times $T$ their images always remain disjoint, which is possible only if they are contained in the parallel half-lines starting in $x$ and $y$ , respectively, and directed by the vector $(1,\gamma)^{\intercal}$ . Indeed, since $\gamma\notin\mathbb{Q}$ , the images of these half-lines in the torus are dense, and therefore any deviation of a trajectory from one of these half-lines eventually leads to the intersection of the two trajectories.

If there are only finitely many points $z_{\varepsilon}$ as described above, then we deduce that $D^{1}$ and $D^{2}$ respectively contain the latter half-lines and therefore both dominions correspond to the trivial dominion in $\mathbb{R}^{2}/\mathbb{Z}^{2}$ , composed of the whole state space. If there are infinitely many points $z_{\varepsilon}$ , then any limit point is, by construction, contained in both $D^{1}$ and $D^{2}$ . In any case, we deduce that the players do not have disjoint dominions in the torus and so, according to Theorem 4.13, that the game is uniquely ergodic.

5. Unique ergodicity of games via controllability approach

In this section, as usual, we fix a deterministic zero-sum differential game in its lower form, $\Gamma^{-}$ , which satisfies A0. However, we assume that the controlled system 1 is not Lipschitz continuous (and in particular that $L_{f}>0$ ), so that equicontinuity of $\{\delta v_{\delta}\}$ cannot be guaranteed. We also make the standard assumption that the payoff function $\ell$ is Lipschitz continuous in $x$ uniformly in $(a,b)$ , i.e., that there exists $L_{\ell}>0$ such that

[TABLE]

We then have the following classical regularity property of the value function.

Proposition 5.1 (see [BCD97, Ch. VIII, Prop. 1.8]).

If $\ell$ is Lipschitz continuous in $x$ uniformly in $(a,b)$ , then, for any discount factor $\delta<L_{f}$ , the value function $\delta v_{\delta}$ is Hölder continuous with exponent $\delta/L_{f}$ and constant $L$ independent of $\delta$ :

[TABLE]

In view of unique ergodicity, the requirement that the payoff function $\ell$ be uniformly Lipschitz continuous in $x$ prevents us from considering perturbations $g$ that only lie in $\mathcal{C}^{0}_{\text{per}}(\mathbb{R}^{n})$ . If we want to use the latter proposition (which we need to prove the main theorem of this section), we need to restrict the perturbations $g$ to the set of Lipschitz continuous and $\mathbb{Z}^{n}$ -periodic functions. Fortunately, this is not a major restriction. Indeed, in the proof of Proposition 4.6 it is possible to consider a perturbation function $g$ satisfying 10 and which is Lipschitz. Thus we have the following stronger result.

Proposition 5.2.

Assume that for every Lipschitz continuous and $\mathbb{Z}^{n}$ -periodic function $g$ from $\mathbb{R}^{n}$ to $\mathbb{R}$ , the perturbed differential game with payoff function $(x,a,b)\mapsto\ell(x,a,b)+g(x)$ is ergodic. Then, in the game $\Gamma^{-}$ , the players do not have disjoint dominions in $\mathbb{R}^{n}/\mathbb{Z}^{n}$ . ∎

To compensate the lack of equicontinuity of $\{\delta v_{\delta}\}$ we also need to introduce the following controllability assumption, which involves sets of points that are reachable by one player. Let us first describe precisely these sets.

Given any strategy $\alpha\in\mathfrak{A}$ of player 1, we define the reachable set from a point $x\in\mathbb{R}^{n}$ for player 2 by

[TABLE]

On the other hand, for all strategies $\alpha\in\mathfrak{A}$ of player 1, let us associate a control $b_{\alpha}\in\mathscr{B}$ of player 2. Then, we define the reachable set from $x\in\mathbb{R}^{n}$ for player 1 by

[TABLE]

Furthermore, we say that the map $\alpha\mapsto b_{\alpha}$ is nonanticipating if $\alpha^{1}[b]_{s}=\alpha^{2}[b]_{s}$ for all $b\in\mathscr{B}$ and almost all $s\in[0,t]$ implies that $(b_{\alpha^{1}})_{s}=(b_{\alpha^{2}})_{s}$ for almost all $s\in[0,t]$ . That is, if $\alpha^{1}$ and $\alpha^{2}$ coincide almost surely on $[0,t]$ , then the same is true for $b_{\alpha^{1}}$ and $b_{\alpha^{2}}$ .

We emphasize that the purpose of the following assumption is only to provide a uniform bound on the time needed to get arbitrarily close to any reachable point. We further mention that the estimate is borrowed from [Ari98] (see also [Bet05]).

Assumption A2 (Uniform time estimate).

There exist constants $\gamma\in[0,1)$ and $C>0$ such that, for all $\varepsilon>0$ ,

•

for all $\alpha\in\mathfrak{A}$ , for all $x\in\mathbb{R}^{n}$ and all $y\in\overline{R^{1}_{\alpha}}(x)$ , there is a control $b\in\mathscr{B}$ and a time $t\leqslant C(-\log\varepsilon)^{\gamma}$ for which $|{y-X^{x,\alpha,b}_{t}}|\leqslant\varepsilon$ ;

•

for all nonanticipating map $\mathfrak{A}\to\mathscr{B},\alpha\mapsto b_{\alpha}$ , for all $x\in\mathbb{R}^{n}$ and all $y\in\overline{R^{2}_{b\centerdot}}(x)$ , there is a strategy $\alpha\in\mathfrak{A}$ and a time $t\leqslant C(-\log\varepsilon)^{\gamma}$ for which $|{y-X^{x,\alpha,b_{\alpha}}_{t}}|\leqslant\varepsilon$ .

We can now give the condition for the (somewhat modified version of) unique ergodicity of differential games. Notice that in the proof of this result, we use the fact that the sets $\overline{R^{1}_{\alpha}}(x)$ and $\overline{R^{2}_{b\centerdot}}(x)$ are dominions of player 1 and player 2, respectively. We postpone the precise statement and the proof of this fact afterward.

Theorem 5.3.

In the differential game $\Gamma^{-}$ , suppose that A2 holds and that the payoff function $\ell$ is Lipschitz continuous in $x$ uniformly in $(a,b)$ . The following assertions are equivalent:

(i)

for every function $\ell^{\prime}:\mathbb{R}^{n}\times A\times B\to\mathbb{R}$ which is Lipschitz continuous in $x$ uniformly in $(a,b)$ and $\mathbb{Z}^{n}$ -periodic in $x$ , the modified game with running payoff $\ell^{\prime}$ is ergodic; 2. (ii)

for every Lipschitz continuous and $\mathbb{Z}^{n}$ -periodic function $g:\mathbb{R}^{n}\to\mathbb{R}$ , the perturbed game with running payoff $(x,a,b)\mapsto\ell(x,a,b)+g(x)$ is ergodic; 3. (iii)

the players do not have disjoint dominions in the torus.

Proof.

The implication (i) $\Rightarrow$ (ii) is trivial and we already know from Proposition 5.2 that (ii) $\Rightarrow$ (iii). So we only need to prove that (iii) $\Rightarrow$ (i). And since the payoff function $\ell$ is arbitrary and assertion (iii) does not depend on it, if we prove that $\Gamma^{-}$ is ergodic, the result will be true for any other payoff function $\ell^{\prime}$ .

Let $\delta>0$ be any discount factor and let $\varepsilon$ be a fixed positive real. Let $x,y$ be any points in $\mathbb{R}^{n}$ . From the dynamic programming principle, there exists a strategy $\alpha^{\!1}\in\mathfrak{A}$ of player 1 (which depends only on $\delta$ , $\varepsilon$ and $x$ ) such that

[TABLE]

for all times $t>0$ and all controls $b\in\mathscr{B}$ . Similarly, for all $\alpha\in\mathfrak{A}$ , there exists a control $b_{\alpha}\in\mathscr{B}$ of player 2 (which depends only on $\delta$ , $\varepsilon$ , $y$ and $\alpha$ ) such that

[TABLE]

for all times $t>0$ . Furthermore, the map $\alpha\mapsto b_{\alpha}$ can be chosen nonanticipating, as defined above (indeed, for the controls $b_{\alpha}$ to satisfy these conditions, we can chose them so that $v_{\delta}(x)-\varepsilon\leqslant J_{\delta}(x,\alpha,b_{\alpha}$ ).

Let $D^{1}=\overline{R_{\alpha^{\!1}}^{1}}(x)$ and $D^{2}=\overline{R_{b\centerdot}^{2}}(y)$ be the closures of the sets of reachable points from $x$ and $y$ by player 2 and player 1, respectively, being fixed the strategy $\alpha^{\!1}$ and the nonanticipating map $\alpha\mapsto b_{\alpha}$ .

We know from subsequent Lemma 5.4 that these sets are respectively a dominion of player 1 and a dominion of player 2. Hence there exists a point $z\in\pi^{-1}\big{(}\,\overline{\pi(D^{1})}\cap\overline{\pi(D^{2})}\,\big{)}$ . This implies that there are $z^{1}\in D^{1}$ , $z^{2}\in D^{2}$ and $k,l\in\mathbb{Z}^{n}$ such that

[TABLE]

Moreover, A2 guarantees the existence of a control $b^{1}\in\mathscr{B}$ , a strategy $\alpha^{\!2}\in\mathfrak{A}$ and times $t_{1},t_{2}\leqslant C(-\log\varepsilon)^{\gamma}$ such that

[TABLE]

Combining these inequalities, we get

[TABLE]

Since the inequalities 12 and 13 hold uniformly in $t$ , we can now write them at times $t_{1}$ and $t_{2}$ respectively, and then use the estimates that we have just established. We recall that, for $\delta$ small enough, the function $\delta v_{\delta}$ is Hölder continuous with exponent $\delta/L_{f}$ and constant $L$ . We also recall that $v_{\delta}$ is $\mathbb{Z}^{n}$ -periodic. Let $T_{\varepsilon}=C(-\log\varepsilon)^{\gamma}$ for simplicity. From 12 we get

[TABLE]

where we use the fact that $\delta v_{\delta}(z)+M_{\ell}\geqslant 0$ and $e^{-\delta t_{1}}\leqslant 1$ .

On the other hand, from 13 we get

[TABLE]

Here we use the fact that $-\delta v_{\delta}(z)+M_{\ell}\geqslant 0$ and $e^{-\delta t_{2}}\leqslant 1$ .

Combining the two inequalities and letting $M=\max\{M_{\ell},L\}$ , we obtain, for all $\delta,\varepsilon>0$ ,

[TABLE]

Since $T_{\varepsilon}=C(-\log\varepsilon)^{\gamma}$ , choosing $\varepsilon$ such that $\log\varepsilon=-\delta^{-(1+\omega)}$ with $0<\omega<\frac{1}{\gamma}-1$ , we observe that the right-hand side of the latter inequality converges to zero as $\delta$ vanishes, which yields

[TABLE]

Since the points $x$ and $y$ are arbitrary and the bound in 14 does not depend on them, we deduce that

[TABLE]

uniformly in $x,y\in\mathbb{R}^{n}$ .

The rest of the proof is classical (see for instance [Ari98]), but one may also notice that the latter uniform limit together with the continuity of $(\delta,x)\mapsto\delta v_{\delta}(x)$ on $(0,1]\times\mathbb{R}^{n}$ (see the proof of Theorem 3.2) entails the equicontinuity of the family $\{\delta v_{\delta}\}_{0<\delta\leqslant 1}$ . We can then conclude with Propositions 3.3 and 4.13. ∎

In order to complete the proof and conclude the section, we prove the following.

Lemma 5.4.

Given a strategy $\alpha\in\mathfrak{A}$ of player 1, the topological closure of the reachable set from any point $x\in\mathbb{R}^{n}$ for player 2, $\overline{R_{\alpha}^{1}}(x)$ , is a dominion of player 1.

Dually, given a map $\alpha\mapsto b_{\alpha}$ from $\mathfrak{A}$ to $\mathscr{B}$ which is nonanticipating, the closure of the reachable set from $x$ for player 1, $\overline{R_{b\centerdot}^{2}}(x)$ , is a dominion of player 2.

Proof.

We show in detail that $\overline{R_{\alpha}^{1}}(x)$ is a dominion of player 1, and leave to the reader the details of the proof for $\overline{R_{b\centerdot}^{2}}(x)$ , which follows the same lines. Nevertheless we will highlight the important changes.

First, for any point $y\in R_{\alpha}^{1}(x)$ , we show that we can construct a strategy $\bar{\alpha}\in\mathfrak{A}$ of player 1 such that $X^{y,\bar{\alpha},b}_{s}\in R_{\alpha}^{1}(x)$ for all controls $b\in\mathscr{B}$ of player 2 and all times $s\geqslant 0$ . Indeed, there exist $\bar{b}\in\mathscr{B}$ and $t\geqslant 0$ such that $y=X^{x,\alpha,\bar{b}}_{t}$ . Then, for any control $b\in\mathscr{B}$ , let us introduce the control $\bar{b}|b$ obtained by concatenating $\bar{b}$ and $b$ in the following way:

[TABLE]

We further define the strategy $\bar{\alpha}$ of player 1 as follows: $\bar{\alpha}[b]_{s}=\alpha[\bar{b}|b]_{s+t}$ for all $s\geqslant 0$ . It is straightforward to verify that $\bar{\alpha}$ is nonanticipating and, moreover, that for all $s\geqslant 0$ , $X^{y,\bar{\alpha},b}_{s}=X^{x,\alpha,\bar{b}|b}_{s+t}$ . Thus, for all $s\geqslant 0$ we have $X^{y,\bar{\alpha},b}_{s}\in R_{\alpha}^{1}(x)$ .

Consider now $z\in\overline{R_{\alpha}^{1}}(x)\setminus R_{\alpha}^{1}(x)$ and fix some $\varepsilon>0$ and $T\geqslant 0$ . There exists $y\in R_{\alpha}^{1}(x)$ such that $\left|{y-z}\right|\leqslant\varepsilon e^{-L_{f}T}$ . Let $\bar{\alpha}\in\mathfrak{A}$ be the strategy of player 1 defined above, which ensures that $X^{y,\bar{\alpha},b}_{s}\in R_{\alpha}^{1}(x)$ for all $b\in\mathscr{B}$ and $s\geqslant 0$ . We have the following standard estimate on the trajectories of 1:

[TABLE]

from which we deduce that, for all $s\geqslant 0$ ,

[TABLE]

Thus, for all $b\in\mathscr{B}$ and $s\in[0,T]$ we have $\operatorname{dist}_{R_{\alpha}^{1}(x)}(X^{z,\bar{\alpha},b}_{s})\leqslant\varepsilon$ , which finally proves that $\overline{R_{\alpha}^{1}}(x)$ is a dominion for player 1.

For $R_{b\centerdot}^{2}(x)$ the proof is identical, up to the changes in players’ role. The main difference concerns the construction, for any point $y\in R_{b\centerdot}^{2}(x)$ and any strategy $\alpha\in\mathfrak{A}$ of player 1, of a control $\bar{b}\in\mathscr{B}$ of player 2 such that $X^{y,\alpha,\bar{b}}_{s}\in R_{b\centerdot}^{2}(x)$ for all $s\geqslant 0$ . We next detail this construction. Let $\bar{\alpha}\in\mathfrak{A}$ and $t\geqslant 0$ be such that $y=X^{x,\bar{\alpha},b_{\bar{\alpha}}}_{t}$ . Let us also define, for any $b\in\mathscr{B}$ , the control $\sigma_{t}b$ by $(\sigma_{t}b)_{s}=b_{s+t}$ . We then define a nonanticipating strategy $\bar{\alpha}|\alpha$ as follows:

[TABLE]

If we set $\bar{b}=\sigma_{t}b_{\bar{\alpha}|\alpha}$ , one can check that $X^{y,\alpha,\bar{b}}_{s}=X^{x,\bar{\alpha}|\alpha,b_{\bar{\alpha}|\alpha}}_{s+t}$ for all $s\geqslant 0$ (in particular we have $X^{x,\bar{\alpha}|\alpha,b_{\bar{\alpha}|\alpha}}_{t}=X^{x,\bar{\alpha},b_{\bar{\alpha}}}_{t}=y$ because the map $\alpha\mapsto b_{\alpha}$ is nonanticipating and so $(b_{\bar{\alpha}|\alpha})_{s}=(b_{\bar{\alpha}})_{s}$ for almost all $s\in[0,t]$ ). Hence the result. ∎

6. Operator-theoretic characterization of dominions

In this final section, we characterize dominions in operator-theoretic terms. Thus, we show that the notion of dominion coincides with the one of leadership domain and discriminating domain which appears in viability theory555We mention that the notion of discriminating / leadership domain, hence of dominion, relates with the ones of B-set and approachability in repeated games with vector payoffs. Indeed, In [ASQS09], As Soulaimani, Quincampoix and Sorin proved that the B-sets for one player (which provide a sufficient condition for approachability) coincide with the discriminating domains for that player in an associated differential game. (see, e.g., [Car96]). This characterization stems from the similarities that exist between dominions on the one hand, and the interpretation of discriminating and leadership domains, on the other hand. Indeed, the latter, which are originally defined by means of inequalities involving $H_{\infty}$ , can also be characterized in terms of invariant dynamics (see, e.g., [Car96]). This correspondence between the two notions can be readily established for leadership domains and dominions of player 2 in the lower game (see Theorem 2.3, ibid.). As for the correspondence between discriminating domains and dominions of player 1, it is not as straightforward since the interpretation theorem (Theorem 2.1, ibid.) requires convexity properties. Such assumptions – typically, $A$ must be convex and $f$ , affine in $a$ – are commonly assumed in viability theory but are not needed here. Nevertheless, by adapting the proof of the latter result to our setting, we are able to show that dominions of the first player in $\Gamma^{-}$ can indeed be characterized as discriminating domains. We next state precisely these results.

To this end, we need to introduce the following definition. A vector $p\in\mathbb{R}^{n}$ is a proximal normal to a subset $K$ of $\mathbb{R}^{n}$ at point $x\in K$ if $\operatorname{dist}_{K}(x+p)=\left|{p}\right|$ . We denote by ${N\!P}_{K}(x)$ the set of proximal normals to $K$ at $x$ . Note that, if we let ${P}_{K}z$ be the set of projections of any point $z\in\mathbb{R}^{n}$ onto $K$ , i.e.,

[TABLE]

then the definition of a proximal normal implies that for every vector $p\in{N\!P}_{K}(x)$ and every scalar $\nu\in(0,1)$ , we have ${P}_{K}(x+\nu p)=\{x\}$ .

We now provide the operator-theoretic characterizations of dominions. The first one, for dominions of player 2 in $\Gamma^{-}$ , comes readily from the correspondence of the latter with leadership domains in viability theory.

Theorem 6.1 ([Car96, Thm. 2.3]).

A nonempty closed set $D$ is a dominion of player 2 in the lower game $\Gamma^{-}$ if and only if

[TABLE]

We next give a similar characterization for dominions of player 1, which relates them with discriminating domains.

Theorem 6.2.

A nonempty closed set $D$ is a dominion of player 1 in the lower game $\Gamma^{-}$ if and only if

[TABLE]

Proof.

We first prove the necessary part and suppose that $D$ is a dominion of player 1. Toward a contradiction, let us assume that there exists a positive constant $\eta$ , some $x\in D$ and some $p\in{N\!P}_{D}(x)$ such that

[TABLE]

Since the function $b\mapsto\min_{a\in A}\langle f(x,a,b),p\rangle$ is upper semicontinuous and $B$ is compact, there exists an action $\bar{b}\in B$ such that

[TABLE]

Let $b\in\mathscr{B}$ be the constant control equal to $\bar{b}$ , i.e., $b_{t}=\bar{b}$ for all $t\geqslant 0$ .

Since $D$ is a dominion of player 1, given $\varepsilon>0$ and $T>0$ there exists a strategy $\alpha\in\mathfrak{A}$ such that $\operatorname{dist}_{D}(X^{x,\alpha,b}_{t})\leqslant\varepsilon$ for all $t\in[0,T]$ . In order to simplify the notation, let $X_{t}=X^{x,\alpha,b}_{t}$ . Then, for all $t\in[0,T]$ , choosing any point $y_{t}$ in ${P}_{D}X_{t}$ , the set of projections of $X_{t}$ on $D$ , we have

[TABLE]

where we use the fact that $y_{t}\in D$ and that $\left|{x+p-y_{t}}\right|\geqslant\operatorname{dist}_{D}(x+p)=\left|{p}\right|$ since $p\in{N\!P}_{D}(x)$ .

On the other hand, for almost all $t\in[0,T]$ we have

[TABLE]

To establish the last inequality, we used the estimate 9; the Lipschitz continuity of $f$ (with Lipschitz constant $L_{f}$ ); and 15. Let $C=M_{f}(M_{f}+L_{f}\left|{p}\right|)$ . After integrating the latter inequality we get, for all $t\in[0,T]$ ,

[TABLE]

which, combined with 16, yields

[TABLE]

Note that to square 16, we need to assume that $\varepsilon\leqslant\left|{p}\right|$ , which is possible because $p$ is different from [math] (otherwise 15 would not hold). In the latter inequality, the positive constants $\left|{p}\right|$ , $C$ and $\eta$ are fixed, whereas $\varepsilon$ and $T$ are arbitrary. Hence, by choosing $T=\eta/C$ and rewriting 17 with $t=T$ we obtain

[TABLE]

which is a contradiction if $\varepsilon$ is small enough. This concludes the proof of the necessary part.

We now prove the sufficient part and assume that for all points $x$ in $D$ and all proximal normals $p$ in ${N\!P}_{D}(x)$ , we have

[TABLE]

We then fix $x\in D$ and positive constants $\varepsilon$ and $T$ . Our aim is to construct recursively on the subintervals $[t_{k},t_{k+1})$ of a well-chosen partition $\{t_{k}=k\frac{T}{N}\}_{0\leqslant k\leqslant N}$ of $[0,T]$ , a nonanticipating strategy $\alpha$ of player 1 such that $\operatorname{dist}_{D}(X^{x,\alpha,b}_{t})\leqslant\varepsilon$ for all $t\in[0,T]$ and all controls $b$ of player 2. The mesh $\theta=\frac{T}{N}$ of the partition (which shall depend only on $x$ , $\varepsilon$ , $T$ and the data of the problem) will be chosen a posteriori, so we assume for now that it is fixed. Also, for any $z\in\mathbb{R}^{n}$ we shall fix a point in ${P}_{D}z$ which we denote by ${p}_{D}(z)$ .

We start by selecting an arbitrary element $\bar{a}$ in $A$ and set $\alpha[b]_{t}=\bar{a}$ for all $b\in\mathscr{B}$ and $t\in[0,t_{1})$ . Note that $\alpha$ is obviously nonanticipating on $[0,t_{1})$ , that is, for any controls $b^{1},b^{2}\in\mathscr{B}$ that coincide almost everywhere on $[0,t_{1})$ , we have $\alpha[b^{1}]_{t}=\alpha[b^{2}]_{t}$ for (almost) all $t\in[0,t_{1})$ .

Next we assume that $\alpha$ has been defined on $[0,t_{k})$ with $0<k<N$ and that it is nonanticipating on this interval. Given any control $b\in\mathscr{B}$ , if $X^{x,\alpha,b}_{t_{k}}\in D$ , then we set $\alpha[b]_{t}=\bar{a}$ on $[t_{k},t_{k+1})$ . Otherwise, letting $X_{k}=X^{x,\alpha,b}_{t_{k}}$ (for simplicity) and $y_{k}={p}_{D}(X_{k})$ , we introduce the set-valued map $\Phi$ defined from $B$ to $A$ by

[TABLE]

Let us observe that $\Phi$ depends on the control $b$ only through $X_{k}$ . Thus, if two controls $b^{1}$ and $b^{2}$ are equal almost everywhere on $[0,t_{k})$ , then $X^{x,\alpha,b^{1}}_{t_{k}}=X^{x,\alpha,b^{2}}_{t_{k}}$ and therefore they define the same set-valued map.

Since $f$ is continuous, $\Phi$ is measurable and has closed values. Moreover, since $X_{k}-y_{k}\in{N\!P}_{D}(y_{k})$ by definition, 18 implies that the domain of $\Phi$ is $B$ , i.e., $\Phi(b^{\prime})$ is nonempty for all $b^{\prime}\in B$ . Hence, according to the Measurable Selection Theorem (see [AF09, Thm. 8.1.3]), $\Phi$ admits a measurable selection $\phi:B\to A$ . Then we set $\alpha[b]_{t}=\phi(b_{t})$ for all $t\in[t_{k},t_{k+1})$ . It is readily seen that $\alpha$ is nonanticipating on $[0,t_{k+1})$ , whence on $[0,T)$ after repeating the induction step until $t_{k+1}=T$ . For $t=T$ , we set $\alpha[b]_{T}=\bar{a}$ for all $b\in\mathscr{B}$ .

To conclude the proof, it remains to show that $\operatorname{dist}_{D}(X^{x,\alpha,b}_{t})\leqslant\varepsilon$ on $[0,T]$ for every control $b$ of player 2. So we fix $b\in\mathscr{B}$ and let $X_{t}=X^{x,\alpha,b}_{t}$ . We also let $X_{k}=X_{t_{k}}$ and $y_{k}={p}_{D}(X_{k})$ . For all $k\in\{0,\dots,N-1\}$ and for almost all $t\in[t_{k},t_{k+1}]$ we have

[TABLE]

To establish the latter inequality, we used the estimate 9 and the fact that either $X_{k}\notin D$ , in which case $\langle f(y_{k},\alpha[b]_{t},b_{t}),X_{k}-y_{k}\rangle\leqslant 0$ by definition of $\alpha$ , or $X_{k}\in D$ which implies $X_{k}-y_{k}=0$ . By integration we then obtain

[TABLE]

for all $k\in\{0,\dots,N-1\}$ and $t\in[t_{k},t_{k+1}]$ . Grönwall’s inequality yields

[TABLE]

and thus

[TABLE]

If we apply the latter inequality to $t=t_{k+1}$ , we can use it to show by induction that, for all $k\in\{1,\dots,N\}$ ,

[TABLE]

Combining now the last two inequalities, we deduce that for all $k\in\{0,\dots,N-1\}$ and all $t\in[t_{k},t_{k+1}]$ ,

[TABLE]

Since $k\theta\leqslant N\theta=T$ and $(e^{x}-1)^{-1}\leqslant x^{-1}$ if $x>0$ , we finally get, for all $t\in[0,T]$ ,

[TABLE]

The proof is complete once we have observed that we can choose the mesh of the partition, $\theta=\frac{T}{N}$ , depending only on $M_{f}$ , $L_{f}$ , $T$ and $\varepsilon$ , so that the right-hand side in the latter inequality is lower than $\varepsilon^{2}$ . ∎

*Remark 6.3** (Dominions in the upper game and with Isaacs’ condition).*

Similar characterizations for the dominions in the upper game $\Gamma^{+}$ can be obtained after switching the identity of the players (the fact that one player is minimizing and the other maximizing does not come into account here). Thus, a nonempty closed set $D$ is a dominion of player 1 (resp., player 2) in $\Gamma^{+}$ if and only if

[TABLE]

As a consequence, the classical min-max inequality yields that a dominion of player 1 in $\Gamma^{+}$ is also a dominion in $\Gamma^{-}$ , and symmetrically, a dominion of player 2 in $\Gamma^{-}$ is also a dominion in $\Gamma^{+}$ (see Example 4.4 for an illustration of this situation). These observations are consistent with the fact that player 1 (resp., player 2) has more information in the lower game (resp., in the upper game), hence has an advantage in this game. Furthermore, if Isaacs’ condition 5 applies to $H_{\infty}$ , then the set of dominions for each player is the same in the lower and the upper game.

Bibliography23

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[AB 03] O. Alvarez and M. Bardi, Singular perturbations of nonlinear degenerate parabolic PD Es: a general convergence result , Arch. Ration. Mech. Anal. 170 (2003), no. 1, 17–61.
2[AB 07] by same author, Ergodic problems in differential games , Advances in dynamic game theory, Ann. Internat. Soc. Dynam. Games, vol. 9, Birkhäuser Boston, Boston, MA, 2007, pp. 131–152.
3[AB 10] by same author, Ergodicity, stabilization, and singular perturbations for Bellman-Isaacs equations , Mem. Amer. Math. Soc. 204 (2010), no. 960, vi+77.
4[AF 09] J.-P. Aubin and H. Frankowska, Set-valued analysis , Modern Birkhäuser Classics, Birkhäuser Boston, Inc., Boston, MA, 2009, Reprint of the 1990 edition.
5[AGH 20] M. Akian, S. Gaubert, and A. Hochart, A game theory approach to the existence and uniqueness of nonlinear Perron-Frobenius eigenvectors , Discrete Contin. Dyn. Syst. 40 (2020), no. 1, 207–231.
6[Ari 97] M. Arisawa, Ergodic problem for the Hamilton-Jacobi-Bellman equation. I. Existence of the ergodic attractor , Ann. Inst. H. Poincaré Anal. Non Linéaire 14 (1997), no. 4, 415–438.
7[Ari 98] by same author, Ergodic problem for the Hamilton-Jacobi-Bellman equation. II , Ann. Inst. H. Poincaré Anal. Non Linéaire 15 (1998), no. 1, 1–24.
8[ASQS 09] S. As Soulaimani, M. Quincampoix, and S. Sorin, Repeated games and qualitative differential games: approachability and comparison of strategies , SIAM J. Control Optim. 48 (2009), no. 4, 2461–2479.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Unique ergodicity of

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

2. Preliminaries

2.1. Framework and standing assumptions

Assumption A0 (Standing assumption).

2.2. Value functions and Hamilton-Jacobi PDEs

Definition 2.1** (Nonanticipating strategy).**

Theorem 2.2** (see [BCD97, Ch. III, Prop. 2.8, 3.5]).**

2.3. Ergodicity and PDE approach

Assumption A1.

Theorem 2.3** ([AB03, Thm. 4]).**

3. Unique ergodicity of Hamiltonians

3.1. Definition and characterization

Definition 3.1** (Uniquely ergodic Hamiltonian).**

Theorem 3.2** (compare with [AB10, Prop. 2.3]).**

Proof.

Proposition 3.3**.**

Example 3.4*.*

3.2. Equicontinuity of {δuδ}\boldsymbol{\{\delta u_{\delta}\}}{δuδ​}

Example 3.5*.*

Example 3.6*.*

4. Unique ergodicity of games via PDE approach

Definition 4.1** (Ergodicity of differential games).**

4.1. Dominions

Definition 4.2** (Dominions).**

Example 4.3*.*

Example 4.4*.*

Definition 4.5** (Dominion in the torus).**

4.2. Necessary condition for unique ergodicity

Proposition 4.6**.**

Lemma 4.7**.**

Proof.

Lemma 4.8**.**

Remark 4.9*.*

Proof of Proposition 4.6.

Remark 4.10* (Comparison with one-player controlled systems).*

4.3. Sufficient condition for unique ergodicity

Lemma 4.11**.**

Proof.

Corollary 4.12**.**

Theorem 4.13**.**

Proof.

Example 4.14*.*

5. Unique ergodicity of games via controllability approach

Proposition 5.1** (see [BCD97, Ch. VIII, Prop. 1.8]).**

Proposition 5.2**.**

Assumption A2 (Uniform time estimate).

Theorem 5.3**.**

Proof.

Lemma 5.4**.**

Proof.

6. Operator-theoretic characterization of dominions

Theorem 6.1** ([Car96, Thm. 2.3]).**

Theorem 6.2**.**

Proof.

Remark 6.3* (Dominions in the upper game and with Isaacs’ condition).*

Definition 2.1 (Nonanticipating strategy).

Theorem 2.2 (see [BCD97, Ch. III, Prop. 2.8, 3.5]).

Theorem 2.3 ([AB03, Thm. 4]).

Definition 3.1 (Uniquely ergodic Hamiltonian).

Theorem 3.2 (compare with [AB10, Prop. 2.3]).

Proposition 3.3.

*Example 3.4**.*

3.2. Equicontinuity of $\boldsymbol{\{\delta u_{\delta}\}}$

*Example 3.5**.*

*Example 3.6**.*

Definition 4.1 (Ergodicity of differential games).

Definition 4.2 (Dominions).

*Example 4.3**.*

*Example 4.4**.*

Definition 4.5 (Dominion in the torus).

Proposition 4.6.

Lemma 4.7.

Lemma 4.8.

*Remark 4.9**.*

*Remark 4.10** (Comparison with one-player controlled systems).*

Lemma 4.11.

Corollary 4.12.

Theorem 4.13.

*Example 4.14**.*

Proposition 5.1 (see [BCD97, Ch. VIII, Prop. 1.8]).

Proposition 5.2.

Theorem 5.3.

Lemma 5.4.

Theorem 6.1 ([Car96, Thm. 2.3]).

Theorem 6.2.

*Remark 6.3** (Dominions in the upper game and with Isaacs’ condition).*