Mean-field games of optimal stopping: a relaxed solution approach

G\'eraldine Bouveret; Roxana Dumitrescu; Peter Tankov

arXiv:1812.06196·math.OC·July 9, 2020·SIAM J. Control. Optim.

Mean-field games of optimal stopping: a relaxed solution approach

G\'eraldine Bouveret, Roxana Dumitrescu, Peter Tankov

PDF

TL;DR

This paper introduces a relaxed solution framework for mean-field games involving optimal stopping, establishing existence, uniqueness, and a numerical method for equilibrium computation.

Contribution

It develops a relaxed optimal stopping approach for mean-field games, proving equilibrium existence, uniqueness, and connecting relaxed solutions to pure strategies.

Findings

01

Proved existence of relaxed Nash equilibrium.

02

Established conditions for pure strategy optimality.

03

Presented a convergent numerical method for potential games.

Abstract

We consider the mean-field game where each agent determines the optimal time to exit the game by solving an optimal stopping problem with reward function depending on the density of the state processes of agents still present in the game. We place ourselves in the framework of relaxed optimal stopping, which amounts to looking for the optimal occupation measure of the stopper rather than the optimal stopping time. This framework allows us to prove the existence of the relaxed Nash equilibrium and the uniqueness of the associated value of the representative agent under mild assumptions. Further, we prove a rigorous relation between relaxed Nash equilibria and the notion of mixed solutions introduced in earlier works on the subject, and provide a criterion, under which the optimal strategies are pure strategies, that is, behave in a similar way to stopping times. Finally, we present a…

Equations285

d X_{t}^{i} = μ (t, X_{t}^{i}) d t + σ (t, X_{t}^{i}) d W_{t}^{i},

d X_{t}^{i} = μ (t, X_{t}^{i}) d t + σ (t, X_{t}^{i}) d W_{t}^{i},

E [\int_{0}^{τ} e^{- ρt} \tilde{f} (t, X_{t}^{i}, m_{t}^{n}) d t + e^{- ρ (τ \land T)} g (τ \land T, X_{τ \land T}^{i})],

E [\int_{0}^{τ} e^{- ρt} \tilde{f} (t, X_{t}^{i}, m_{t}^{n}) d t + e^{- ρ (τ \land T)} g (τ \land T, X_{τ \land T}^{i})],

m_{t}^{N} (d x) = \frac{1}{N} i = 1 \sum N δ_{X_{t}^{i}} (d x) 1_{t \leq τ^{i}},

m_{t}^{N} (d x) = \frac{1}{N} i = 1 \sum N δ_{X_{t}^{i}} (d x) 1_{t \leq τ^{i}},

τ max E [\int_{0}^{τ} e^{- ρt} \tilde{f} (t, X_{t}, m_{t}) d t + e^{- ρ (τ \land T)} g (τ \land T, X_{τ \land T})],

τ max E [\int_{0}^{τ} e^{- ρt} \tilde{f} (t, X_{t}, m_{t}) d t + e^{- ρ (τ \land T)} g (τ \land T, X_{τ \land T})],

d X_{t} = μ (t, X_{t}) d t + σ (t, X_{t}) d W_{t} .

d X_{t} = μ (t, X_{t}) d t + σ (t, X_{t}) d W_{t} .

m_{t} (A) = \int m_{0} (d x) P [X_{t}^{x} \in A; τ^{m, x} > t], A \in B (R^{d}), t \in [0, T] .

m_{t} (A) = \int m_{0} (d x) P [X_{t}^{x} \in A; τ^{m, x} > t], A \in B (R^{d}), t \in [0, T] .

τ sup E [exp (\int_{0}^{τ} r_{s} d s) 1_{{θ > τ} \cup {θ = \infty}}] .

τ sup E [exp (\int_{0}^{τ} r_{s} d s) 1_{{θ > τ} \cup {θ = \infty}}] .

d X_{t}^{i} = μ (t, X_{t}^{i}) d t + σ (t, X_{t}^{i}) d W_{t}^{i}, X_{0}^{i} = x^{i} \in O,

d X_{t}^{i} = μ (t, X_{t}^{i}) d t + σ (t, X_{t}^{i}) d W_{t}^{i}, X_{0}^{i} = x^{i} \in O,

\sup_{0\leq t\leq T}\mathbb{E}[|\!|X^{i}_{t}|\!|^{p}]<\infty,{\color[rgb]{0,0,0}\text{ for all }p\geq 1.}

\sup_{0\leq t\leq T}\mathbb{E}[|\!|X^{i}_{t}|\!|^{p}]<\infty,{\color[rgb]{0,0,0}\text{ for all }p\geq 1.}

{\color[rgb]{0,0,0}\mathcal{L}f(t,x)=\nabla_{X}f(t,x)^{\top}\mu(t,x)+\frac{1}{2}Tr[\sigma^{\top}(H_{X}f)\sigma],}

{\color[rgb]{0,0,0}\mathcal{L}f(t,x)=\nabla_{X}f(t,x)^{\top}\mu(t,x)+\frac{1}{2}Tr[\sigma^{\top}(H_{X}f)\sigma],}

τ max E [\int_{0}^{τ \land τ_{O}^{i}} e^{- ρt} \tilde{f} (t, X_{t}^{i}, m_{t}^{N}) d t + e^{- ρ (τ \land τ_{O}^{i} \land T)} g (τ \land τ_{O}^{i} \land T, X_{τ \land τ_{O}^{i} \land T}^{i})],

τ max E [\int_{0}^{τ \land τ_{O}^{i}} e^{- ρt} \tilde{f} (t, X_{t}^{i}, m_{t}^{N}) d t + e^{- ρ (τ \land τ_{O}^{i} \land T)} g (τ \land τ_{O}^{i} \land T, X_{τ \land τ_{O}^{i} \land T}^{i})],

\displaystyle m^{N}_{t}(dx)=\frac{1}{{\color[rgb]{0,0,0}N}}\sum_{i=1}^{{\color[rgb]{0,0,0}N}}\delta_{X^{i}_{t}}(dx)\mathbf{1}_{t\leq{\color[rgb]{0,0,0}\tau^{i}}\wedge\tau_{\mathcal{O}}^{i}},

\displaystyle m^{N}_{t}(dx)=\frac{1}{{\color[rgb]{0,0,0}N}}\sum_{i=1}^{{\color[rgb]{0,0,0}N}}\delta_{X^{i}_{t}}(dx)\mathbf{1}_{t\leq{\color[rgb]{0,0,0}\tau^{i}}\wedge\tau_{\mathcal{O}}^{i}},

τ max E [\int_{0}^{τ \land τ_{O}^{i}} f (t, X_{t}^{i}, m_{t}^{n}) d t] .

τ max E [\int_{0}^{τ \land τ_{O}^{i}} f (t, X_{t}^{i}, m_{t}^{n}) d t] .

J_{N}^{i} (τ) \geq J_{N}^{i} ([τ^{- i}, σ]),

J_{N}^{i} (τ) \geq J_{N}^{i} ([τ^{- i}, σ]),

J_{N}^{i} (θ) := E [\int_{0}^{θ^{i} \land τ_{O}^{i}} f (t, X_{t}^{i}, m_{t}^{N}) d t],

J_{N}^{i} (θ) := E [\int_{0}^{θ^{i} \land τ_{O}^{i}} f (t, X_{t}^{i}, m_{t}^{N}) d t],

d X_{t}^{x} = μ (t, X_{t}^{x}) d t + σ (t, X_{t}^{x}) d W_{t},

d X_{t}^{x} = μ (t, X_{t}^{x}) d t + σ (t, X_{t}^{x}) d W_{t},

τ \in T^{W} ([0, T]) max E [\int_{0}^{τ \land τ_{O}^{x}} f (t, X_{t}^{x}, m_{t}) d t],

τ \in T^{W} ([0, T]) max E [\int_{0}^{τ \land τ_{O}^{x}} f (t, X_{t}^{x}, m_{t}) d t],

m_{t} (A) = \int m_{0} (d x) P [X_{t}^{x} \in A; τ^{m, x} \land τ_{O}^{x} > t], A \in B (O), t \in [0, T] .

m_{t} (A) = \int m_{0} (d x) P [X_{t}^{x} \in A; τ^{m, x} \land τ_{O}^{x} > t], A \in B (O), t \in [0, T] .

τ \in T^{W} ([0, T]) max E [\int_{0}^{τ \land τ_{O}^{x}} f (t, X_{t}^{x}) d t],

τ \in T^{W} ([0, T]) max E [\int_{0}^{τ \land τ_{O}^{x}} f (t, X_{t}^{x}) d t],

\overset{m}{ˉ}_{t} (A) = \int_{O} m_{0}^{*} (d x) P [X_{t}^{x} \in A; τ_{O}^{x} > t] .

\overset{m}{ˉ}_{t} (A) = \int_{O} m_{0}^{*} (d x) P [X_{t}^{x} \in A; τ_{O}^{x} > t] .

\int_{0}^{T} \int_{O} (f (t, x))_{-} \overset{m}{ˉ}_{t} (d x) d t < \infty,

\int_{0}^{T} \int_{O} (f (t, x))_{-} \overset{m}{ˉ}_{t} (d x) d t < \infty,

\int_{0}^{T} \int_{O} f (t, x) m_{t} (d x) d t,

\int_{0}^{T} \int_{O} f (t, x) m_{t} (d x) d t,

\int_{O} u (0, x) m_{0}^{*} (d x) + \int_{0}^{T} \int_{O} {\frac{\partial u}{\partial t} + L u} \overset{m}{^}_{t} (d x) d t \geq 0,

\int_{O} u (0, x) m_{0}^{*} (d x) + \int_{0}^{T} \int_{O} {\frac{\partial u}{\partial t} + L u} \overset{m}{^}_{t} (d x) d t \geq 0,

E [\int_{0}^{τ \land τ_{O}^{x}} f (t, X_{t}^{x}) d t] = \int_{[0, T] \times O} f (t, y) m_{t}^{x} (d y) d t .

E [\int_{0}^{τ \land τ_{O}^{x}} f (t, X_{t}^{x}) d t] = \int_{[0, T] \times O} f (t, y) m_{t}^{x} (d y) d t .

u (0, x) + \int_{[0, T] \times O} (\frac{\partial u}{\partial t} + L u) (t, y) m_{t} (d y) d t = E [u (τ \land τ_{O}^{x} \land T, X_{τ \land τ_{O}^{x} \land T})] \geq 0.

u (0, x) + \int_{[0, T] \times O} (\frac{\partial u}{\partial t} + L u) (t, y) m_{t} (d y) d t = E [u (τ \land τ_{O}^{x} \land T, X_{τ \land τ_{O}^{x} \land T})] \geq 0.

(\frac{\partial u}{\partial t} + L u) (t, x) = g (t, x) (t, x) \in [0, T] \times O, u (T, x) = 0, \forall x \in O,

(\frac{\partial u}{\partial t} + L u) (t, x) = g (t, x) (t, x) \in [0, T] \times O, u (T, x) = 0, \forall x \in O,

i, j = 1 \sum d a_{i, j} (t, x) ξ^{i} ξ^{j} \geq γ ∣ ξ ∣^{2} .

i, j = 1 \sum d a_{i, j} (t, x) ξ^{i} ξ^{j} \geq γ ∣ ξ ∣^{2} .

X_{t}^{x} = \int_{0}^{t} μ (X_{s}^{x}) d s + \int_{0}^{t} σ (X_{s}^{x}) d W_{s},

X_{t}^{x} = \int_{0}^{t} μ (X_{s}^{x}) d s + \int_{0}^{t} σ (X_{s}^{x}) d W_{s},

D_{k t}^{i}

D_{k t}^{i}

D_{k t}^{ij}

+ \int_{0}^{t} \partial_{x_{l} x_{n}}^{2} σ^{k m} (X_{s}^{x}) D_{l s}^{i} D_{l s}^{j} d W_{s}^{m} + \int_{0}^{t} \partial_{x_{l}} σ^{k m} (X_{s}^{x}) D_{l s}^{ij} d W_{s}^{m} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Mean-field games of optimal stopping:

a relaxed solution approach

Géraldine Bouveret Smith School, University of Oxford, South Parks Road, Oxford, OX1 3QY, United Kingdom, Email: [email protected]

Roxana Dumitrescu Department of Mathematics, King’s College London, Strand, London, WC2R 2LS, United Kingdom, Email: [email protected]

Peter Tankov ENSAE Paris, 5 avenue Henry Le Chatelier, 91120 Palaiseau, France, Email: [email protected]

Abstract

We consider the mean-field game where each agent determines the optimal time to exit the game by solving an optimal stopping problem with reward function depending on the density of the state processes of agents still present in the game. We place ourselves in the framework of relaxed optimal stopping, which amounts to looking for the optimal occupation measure of the stopper rather than the optimal stopping time. This framework allows us to prove the existence of a relaxed Nash equilibrium and the uniqueness of the associated value of the representative agent under mild assumptions. Further, we prove a rigorous relation between relaxed Nash equilibria and the notion of mixed solutions introduced in earlier works on the subject, and provide a criterion, under which the optimal strategies are pure strategies, that is, behave in a similar way to stopping times. Finally, we present a numerical method for computing the equilibrium in the case of potential games and show its convergence.

Keywords: Mean-field games, optimal stopping, relaxed solutions, infinite-dimensional linear programming

AMS: 91A55, 91A13, 60G40

1 Introduction

The purpose of this paper is to study a large-population stochastic differential game of optimal stopping, where each agent finds the optimal time to exit the game by solving an optimal stopping problem with instantaneous reward function depending on the density of the state processes of agents still present in the game. To motivate the mean-field game (MFG) framework, we first provide a formulation with a finite number of agents. Assume that each agent $i=1,2,...,{N}$ has a private state process $X^{i}$ , whose dynamics is given by the stochastic differential equation (SDE),

[TABLE]

where the Brownian motions $W^{i}$ , $i=1,\dots,N$ are independent.

The objective of each agent $i$ is to maximize over all possible stopping times $\tau$ the reward functional

[TABLE]

with

[TABLE]

where $\tau^{i}$ represents the optimal stopping time of the agent $i$ . Agents have the same state process coefficients and objective functions, and the optimal stopping problems are coupled only through the empirical measure $m^{N}$ . Since the objective functions are coupled, it is natural to look for Nash equilibria.

Stochastic differential games with a large number $n$ of players are rarely tractable. The MFG approach amounts to looking for a Nash equilibrium in the limiting regime, when the number of players $n$ goes to infinity. Following this approach, we study the MFG of optimal stopping, which can be seen as an infinite-agent version of the above game. In this approach, we first solve for a fixed flow of sub-probability measures $(m_{t})_{0\leq t\leq T}$ the optimal stopping problem

[TABLE]

with

[TABLE]

Then, given $\tau^{m,x}$ the optimal stopping time for the agent with initial condition $x$ , and the initial measure $m_{0}$ , we look for the flow of measures $(m_{t})_{0\leq t\leq T}$ such that

[TABLE]

Note that in $\eqref{part2}$ , the probability is not a conditional but joint probability. A solution (Nash equilibrium) of the MFG problem is the flow of measures $(m_{t})_{0\leq t\leq T}$ , which is the fixed point of the mapping defined by the right-hand side of (1.1).

In this paper, we prove the existence of the Nash equilibrium for the MFG problem and the uniqueness of the associated value of the representative agent. To this aim, we use the relaxed solution approach, which converts the stochastic optimal stopping problem into a linear programming problem over a space of measures. The decision variable is no longer the optimal stopping time, but rather the distribution of the killed state process.

Introducing relaxed solutions facilitates existence proofs: the existence is proven by using Fan-Glicksberg’s fixed-point theorem. The relaxed solutions are related to the mixed strategies introduced in [bertucci2017optimal], and we establish a rigorous relation between the two. Finally, we propose an implementable numerical scheme for computing a Nash equilibrium in the case of potential games, and show its convergence. An application of these results to a resource-sharing problem will be developed in a companion paper.

MFG theory has been introduced by P.-L. Lions and J.-M. Lasry in a series of papers [lasry2006jeux, lasry2006jeux1, lasry2007mean] using an analytic approach and studied independently at about the same time by [huang2006large]. Later on, a probabilistic approach has been developed in a series of papers by Carmona, Delarue, and their co-authors [carmona2013probabilistic, carmona2013mean, carmona2018probabilistic, carmona2016mean, lacker2015mean] and so on.

The analytic method consists in finding the Nash equilibria through a coupled system of nonlinear partial differential equations: a Hamilton-Jacobi-Bellman equation (backward in time), which describes the optimal control problem of the representative agent when the distribution $\mu$ is given, and a Kolmogorov-type equation (forward in time) which describes the evolution of the density under the optimal control. In the probabilistic approach, the system of PDEs is replaced by a coupled system of forward-backward stochastic differential equations of McKean-Vlasov type.

MFGs of optimal stopping have been considered in the literature only very recently, and our understanding of this type of games remains limited. [nutz2018mean] considers a MFG problem where the agents interact through the proportion of players that have already stopped and each agent solves a specific optimal stopping problem of the form

[TABLE]

There, the process $r$ creates an incentive for the agent to stay in the game, while the possibility of default at a random time $\theta$ creates an incentive to leave. The distribution of $\theta$ depends on the proportion $\rho_{t}$ of players who have already stopped in such a way that the departure of other agents creates an incentive for the agent under consideration to leave as well (this type of game is known as preemption game). In a similar spirit but with greater generality, [carmona2017mean] consider MFGs of timing, whose formulation is motivated by a dynamic model of bank run in a continuous time setting. As in [nutz2018mean], the payoff of each agent depends on the proportion of players who have already stopped, and the departure of players creates an additional incentive for the players still in the game to leave as well. Both papers ([nutz2018mean] and [carmona2017mean]) adopt a purely probabilistic approach.

In contrast to these two references, [bertucci2017optimal] studies a MFG of optimal stopping, which is similar to the one considered in this paper, i.e. where the interaction takes place through the density of states of agents remaining in the game, rather than the proportion of players that have already stopped. In this reference and in our paper, the departure of players creates an incentive for the players still in the game to stay, a type of behavior known as ’war of attrition’, which is characteristic of resource-sharing problems. In [bertucci2017optimal] the state process has constant coefficients and evolves in a bounded domain, and the MFG of optimal stopping is solved through a coupled system of a Hamilton-Jacobi-Bellman variational inequality and a Fokker-Planck equation.

[bertucci2017optimal] makes a number of significant contributions to the literature. In particular, he provides an example of non-existence of Nash equilibrium with pure strategies in optimal stopping MFG, and introduces the notion of mixed strategies in this context, for which existence may be recovered. However, the existence proofs in this paper are not fully clear to us.111To be precise, the weak convergence of the flow $m^{\varepsilon}$ established in the proof of existence of a mixed solution in both stationary and parabolic cases (Theorems 1.6 for the stationary case and Theorem 2.1 for the parabolic case) is not sufficient to conclude that $\int f(m^{\varepsilon})dm^{\varepsilon}$ converges.

To clarify the existence question and solve the MFG of optimal stopping problem in greater generality (with variable coefficients and in unbounded domains), we adopt, in this paper, a completely different approach, based on the relaxed solution technique.

The approach of relaxed solutions/controls is a relatively popular method of compactification of stochastic control problems to establish existence of solutions, which comes in several different flavors. In, e.g., [el1987compactification] and a number of other papers, the authors reformulate the control problem as a relaxed controlled martingale problem. A similar approach is used by [lacker2015mean] in the context of (standard) MFG. In the second approach, especially popular for infinite-horizon and ergodic control problems, the control problem is reformulated as a linear programming problem on the space of measures, and one looks for the joint occupation density of the state process and the control. We refer the reader to, e.g., [buckdahn2011stochastic] and [stockbridge1990time], for a link between these two formulations. The literature on relaxed solutions for individual optimal stopping problems is quite limited. [SC2002] propose a linear programming formulation for the infinite-horizon optimal stopping of a Markov diffusion process, using two measures: the occupation measure of the process and the joint distribution of the stopping time and the stopped process. [HS] extend this result to processes with singular components such as reflected diffusions. In contrast to these two references, in our paper we propose a different formulation based only on the occupation measure of the process killed at the stopping time. To the best of our knowledge, ours is the first paper which uses relaxed solutions in order to solve optimal stopping problems of mean-field type.

The literature on numerical schemes for MFG is well developed in the case of MFG with regular controls (see e.g. [BC2015]), but very little is known in the case of MFG with optimal stopping. In the latter case [B2018] proposes an algorithm, which works only under the assumption that the instantaneous reward function is strictly monotonic with respect to the measure, which is quite restrictive for applications. We propose instead a different algorithm, which allows to consider the case of a non-strictly monotonic reward function.

The structure of the paper is the following. In Section 2, we present the model and give the mean-field formulation of the problem. In Section 3, we introduce the relaxed formulation of the single-agent optimal stopping problem and establish the existence of a relaxed solution. In Section 4, we study the relaxed optimal stopping problem in the MFG context and give conditions for the existence of a Nash equilibrium and uniqueness of the Nash equilibrium value. In Section 5, we establish the relation between the relaxed and strong formulation of both single-agent and MFG optimal stopping problems. Finally, in Section 6, we present the numerical algorithm and provide convergence results.

2 The model

We fix a terminal time horizon $T<\infty$ , and introduce a possibly unbounded open domain ${\color[rgb]{0,0,0}{\mathcal{O}}\subseteq\mathbb{R}^{d}}$ on which the state processes of the agents will evolve. The space of bounded positive measures on ${\mathcal{O}}$ will be denoted by $\mathcal{M}({\mathcal{O}})$ , and the space of probability measures on $\mathcal{O}$ will be denoted by $\mathcal{P}({\mathcal{O}})$ . In the sequel, any element $x\in\mathbb{R}^{d}$ will be identified to a column vector with $i$ -th component $x^{i}$ and Euclidian norm $|\!|x|\!|.$ Similarly, for any matrix $A\in\mathbb{R}^{d\times K}$ we denote by $|\!|A|\!|$ its Euclidian norm.

N-players game formulation

Consider $N$ agents whose states $X^{i}$ , $i=1,\dots,N$ follow the diffusion-type dynamics

[TABLE]

where the $K$ -dimensional Brownian motions $W^{i}$ , $i=1,\dots,N$ are independent and the coefficients $\mu$ and $\sigma$ satisfy the following assumption.

Assumption 1 (X-SDE).

The coefficients $\mu:[0,T]\times\mathcal{O}\mapsto\mathbb{R}^{d}$ and $\sigma:[0,T]\times\mathcal{O}\mapsto\mathbb{R}^{d\times K}$ are assumed to be Lipschitz continuous in the second variable, uniformly in $t\in[0,T]$ and bounded.

By classical results on SDEs, this assumption guarantees the existence of a strong solution to (2.1) satisfying

[TABLE]

We denote by $\mathcal{L}$ the infinitesimal generator of this process

[TABLE]

with $\nabla_{X}f:=(\partial_{x_{1}}f,...,\partial_{x_{d}}f)^{\top}$ , $H_{X}f$ the Hessian matrix of $f$ with respect to $x$ and $Tr$ the trace operator.

Each agent aims to determine the optimal stopping time $\tau_{i}$ valued in $[0,T]$ by solving the optimal stopping problem

[TABLE]

where $\rho>0$ is a discount factor, $\tilde{f}:[0,T]\times{\mathcal{O}}\times\mathcal{M}({\mathcal{O}})\to\mathbb{R}$ is the running reward function, $g:[0,T]\times{\mathcal{O}}\to\mathbb{R}$ is the terminal reward, $m^{N}_{t}$ is defined by

[TABLE]

with $\tau^{i}$ a stopping time with respect to the filtration generated by the Brownian motions of all agents, corresponding to agent $i$ and $\tau_{\mathcal{O}}^{i}$ the exit time from the domain $\mathcal{O}$ of agent $i$ . The assumptions on $\tilde{f}$ will be specified later, and $g$ is assumed to belong to $C^{1,2}([0,T]\times{\mathcal{O}})$ and has derivatives of order $1$ in $t$ and of orders $1$ and $2$ in $x$ of polynomial growth in $x$ uniformly in $t$ . Letting $f(t,x,\mu)=e^{-\rho t}(\tilde{f}(t,x,\mu)-\rho g(t,x)+\frac{\partial g}{\partial t}+\mathcal{L}g),$ the optimal stopping problem becomes (up to a constant),

[TABLE]

We now formulate the notion of Nash equilibrium for the optimal stopping game with $N$ players. To this purpose, let $\mathcal{T}$ be the set of stopping times with respect to the filtration generated by the Brownian motions of all agents, taking values between [math] and $T$ . Given a strategy vector ${\tau}:=(\tau^{1},\tau^{2},...,\tau^{N}){\color[rgb]{0,0,0}\in\mathcal{T}^{N}}$ and an individual strategy $\sigma{\color[rgb]{0,0,0}\in\mathcal{T}}$ , let $[\tau^{-i},\sigma]$ indicate the strategy vector that is obtained from $\tau$ by replacing $\tau^{i}$ , the strategy of player $i$ , with $\sigma$ .

Definition 2.1 (Nash Equilibrium $N$ -players game).

A strategy vector ${\tau}:=(\tau^{1},\tau^{2},...,\tau^{N}){\color[rgb]{0,0,0}\in\mathcal{T}^{N}}$ is called a Nash equilibrium for the $N$ players game, if for every $i\in\{1,2,..,N\}$ and every $\sigma{\color[rgb]{0,0,0}\in\mathcal{T}}$ , we have

[TABLE]

where, for each $\theta{\color[rgb]{0,0,0}\in\mathcal{T}^{N}}$ ,

[TABLE]

where $m_{t}^{N}$ is given by $\eqref{empiricalmeas}$ with $\tau^{i}$ replaced by $\theta^{i}$ , for each $i$ .

MFG formulation

In the limit of a large number of agents, we expect, from the law of large numbers, that the empirical measure $m^{N}_{t}$ converges to a deterministic limiting distribution $m_{t}$ for each $t\in[0,T]$ . The problem of each agent therefore consists in finding the optimal stopping time in the filtration generated by the individual noise of this agent only, and it is sufficient to work on a probability space supporting a single Brownian motion.

Let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space supporting a standard $K$ -dimensional Brownian motion $W$ . We denote by $\mathbb{F}^{W}$ the natural filtration of $W$ completed with the sets of measure zero. In the MFG formulation, the state of the representative agent with initial value $x$ follows the dynamics

[TABLE]

where we write $X^{x}_{\cdot}$ as a shorthand for $X^{(0,x)}_{\cdot}$ . As intimated in the introduction, the first step of the MFG approach consists in solving the following optimal stopping problem for the agent

[TABLE]

where $\mathcal{T}^{W}([0,T])$ is the set of $\mathbb{F}^{W}$ -stopping times with values in $[0,T]$ and $\tau_{\mathcal{O}}^{x}\equiv\tau_{\mathcal{O}}^{(0,x)}$ is the exit time from the domain $\mathcal{O}$ of this agent with initial value $x$ . Then, given the optimal stopping time (solution of the problem (2.4)) for the agent with initial condition $x$ , $\tau^{m,x}$ , and the initial measure $m_{0}\in\mathcal{P}({\mathcal{O}})$ , the second step consists in finding the flow of measures $(m_{t})_{0\leq t\leq T}$ such that

[TABLE]

In other words, the solution of the optimal stopping MFG problem is the flow of measures $(m_{t})_{0\leq t\leq T}$ , which is the fixed point of the mapping defined by the right-hand side of (2.5). In the sequel, such solution will be called a pure solution. As shown in [bertucci2017optimal], pure solutions for optimal stopping MFG problems do not always exist, and for this reason in the sequel we shall consider relaxed solutions. A relaxed solution is close in spirit to the mixed solution introduced in [bertucci2017optimal], precise relationship between the two notions will be established later in the paper.

3 Relaxed formulation of the single-agent optimal stopping

problem

The relaxed formulation of the optimal stopping problem consists in finding the occupation measure of the representative agent rather than the stopping time. We first provide a relaxed formulation of the standard optimal stopping problem in this section and then move to the relaxed formulation of the MFG problem in the following one. First, we introduce the necessary notations.

Let $V$ be the space of flows of (signed) bounded measures on ${\mathcal{O}}$ : $(m_{t}(\cdot))_{0\leq t\leq T}\in V$ is such that: for every $t\in[0,T]$ , $m_{t}$ is a (signed) bounded measure on ${\mathcal{O}}$ , for every $A\in\mathcal{B}({\mathcal{O}})$ , the mapping $t\mapsto m_{t}(A)$ is measurable, and $\int_{0}^{T}\int_{{\mathcal{O}}}m_{t}(dx)\,dt<\infty$ . To each flow $m\in V$ , we associate a signed measure on $[0,T]\times{\mathcal{O}}$ defined by $\mu(dt,dx):=m_{t}(dx)\,dt$ . The space $V$ , endowed with the topology of weak convergence (that is, $\int fd\mu^{n}\mapsto\int fd\mu$ for every function $f$ continuous and bounded) is a locally convex Hausdorff topological space (see e.g. [V61]).

Consider the optimal stopping problem

[TABLE]

In this section we study a relaxed version of this optimal stopping problem, where the process $X$ starts with an initial distribution $m^{*}_{0}\in\mathcal{P}(\mathcal{O})$ instead of a fixed value, and which is formulated in terms of flows of measures rather than stopping times. We let $\bar{m}_{t}$ denote the distribution of the process $X$ , started with the initial distribution $m_{0}^{*}$ and killed at the first exit time from $\mathcal{O}$ . In other words,

[TABLE]

We impose the following minimal assumption on the reward function $f$ . We shall see below in Corollary 3.4 that this assumption is sufficient for the problem to be well defined, but stronger assumptions will be imposed for existence of solution.

Assumption 2 ( $f$ -min).

The map $f:[0,T]\times\mathcal{O}\mapsto\mathbb{R}$ is measurable and satisfies

[TABLE]

where $()_{-}$ denotes the negative part.

The previous assumption was not sufficient to guarantee that the integral in (3.2) is well defined.

Definition 3.1 (Relaxed optimal stopping problem).

For a given initial distribution $m^{*}_{0}\in\mathcal{P}({\mathcal{O}})$ , the relaxed formulation of the optimal stopping problem (3.1) consists in finding the flow of measures $(m^{*}_{t})_{0\leq t\leq T}$ , which maximizes the cost functional

[TABLE]

over $\hat{m}\in\mathcal{A}(m^{*}_{0})$ , where the set $\mathcal{A}(m^{*}_{0})\subseteq V$ contains all flows of positive bounded measures $(\hat{m}_{t})_{0\leq t\leq T}\in V$ satisfying

[TABLE]

for all $u\in C^{1,2}([0,T]\times{\mathcal{O}})$ such that $u\geq 0$ and $\frac{\partial u}{\partial t}+\mathcal{L}u$ is bounded on $[0,T]\times\mathcal{O}$ .

The rest of this section is devoted to the solution of the relaxed optimal stopping problem. A precise connection with the strong (classical) formulation of the optimal stopping problem will be established in Section 5. To gain some intuition about this definition right away, remark that for a stopping time $\tau\in\mathcal{T}^{W}([0,T])$ , we can introduce the occupation measure $m^{x}_{t}(A):=\mathbb{E}[\mathbf{1}_{A}(X^{x}_{t})\mathbf{1}_{t\leq\tau\wedge\tau^{x}_{\mathcal{O}}}]$ . Then the objective function of the optimal stopping problem writes

[TABLE]

On the other hand, by Itô’s formula, for a positive and regular test function $u$ , one has

[TABLE]

In Lemmas 3.3 and 3.5, we study the properties of the set $\mathcal{A}(m^{*}_{0})$ . First note that this set is clearly nonempty since it contains the flow $m_{t}(dx)\equiv 0$ . To proceed, we need a regularity assumption on the coefficients $\mu$ and $\sigma$ . We distinguish two cases depending on the type of boundary of $\mathcal{O}$ .

Assumption 3 (X-PDE).

The coefficients $\mu$ and $\sigma$ are such that for every $C^{\infty}$ bounded function $g:[0,T]\times{\mathcal{O}}\to\mathbb{R}$ with bounded derivatives of all orders, the equation

[TABLE]

has a $C^{1,2}$ solution $u$ on $[0,T]\times{\mathcal{O}}$ such that $\frac{\partial u}{\partial x}$ has a polynomial growth in $x$ , uniformly in $t$ , and such that one of the following two conditions holds:

i.

The boundary of $\mathcal{O}$ is unattainable: for all $x\in\mathcal{O}$ , $\tau^{x}_{\mathcal{O}}>T$ a.s.

ii.

The solution $u$ belongs to $C([0,T]\times\overline{\mathcal{O}})$ and satisfies $u(t,x)=0$ for $(t,x)\in[0,T]\times\partial\mathcal{O}$ .

Remark 3.2.

Assumption (X-PDE) holds in a variety of different settings. Below, while not aiming to give the sharpest possible conditions, we present some examples of such settings.

•

Let $\mathcal{O}=\mathbb{R}^{d}$ and assume that the operator $\mathcal{L}$ is uniformly parabolic: there exists $\gamma>0$ such that for all $(t,x)\in[0,T]\times\mathbb{R}^{d}$ and $\xi\in\mathbb{R}^{d}$ , the $d\times d$ matrix $a=\sigma^{\top}\sigma$ satisfies

[TABLE]

Furthermore, suppose that the coefficients $a_{ij}$ are bounded, uniformly Hölder continuous in $x$ and uniformly continuous in $t$ , and the coefficients $\mu_{i}$ are Hölder continuous in $x$ uniformly on compacts and continuous in $t$ . Then, by Theorem 4.4.6 in [F75], equation (3.4) admits a $C^{1,2}$ solution, and the polynomial growth of $\frac{\partial u}{\partial x}$ follows from the estimate (4.4.12) in the above reference.

•

Let $\mathcal{O}$ be a bounded domain with $C^{1}$ boundary and assume that (3.5) is satisfied and the coefficients $a_{ij}$ and $\mu_{i}$ are uniformly Hölder continuous in $(t,x)$ on $[0,T]\times\mathcal{O}$ . Then, by Theorem 4.3.6 in [F75] equation (3.4) admits a $C^{1,2}$ solution.

•

As our last example we consider a situation where the condition (3.5) need not be satisfied. For simplicity, we restrict ourselves to the setting of homogeneous equations, that is, the coefficients $\mu$ and $\sigma$ do not depend on $t$ , but the argument may be extended to the general case. Suppose that the boundary of $\mathcal{O}$ is unattainable and that $\partial_{x_{i}}\mu$ , $\partial^{2}_{x_{i}x_{j}}\mu$ , $\partial_{x_{i}}\sigma$ and $\partial^{2}_{x_{i}x_{j}}\sigma$ are bounded and locally Lipschitz. This ensures that equation (2.1) admits a unique strong solution,

[TABLE]

and, applying Theorem V.39 in [protter] twice (first to the process $X$ and then to its first order tangent flow), we conclude that the mapping $x\mapsto X^{x}_{t}$ is twice continuously differentiable, and the derivatives $D^{i}_{kt}:=\partial_{x_{i}}X^{x}_{kt}$ and $D^{ij}_{kt}:=\partial^{2}_{x_{i}x_{j}}X^{x}_{kt}$ are given by the solutions of the following system of equations (where we use the Einstein convention of summing over repeated indices and $\delta^{i}_{k}$ denotes the Kroneker symbol).

[TABLE]

Moreover, by standard arguments (e.g., Theorem V.66 in [protter] and Gronwall’s lemma), from boundedness of derivatives of $\mu$ and $\sigma$ it follows that for some constant $K$ ,

[TABLE]

Let us define

[TABLE]

Then, by dominated convergence, the derivatives $\partial_{t}u$ , $\partial x_{i}u$ and $\partial^{2}_{x_{i}x_{j}}u$ exist, are bounded, continuous, and given by the following expressions.

[TABLE]

Furthermore, by the Markov property, for $h\in(t,T)$ ,

[TABLE]

and an application of the Itô formula yields:

[TABLE]

where we removed the superscript $(t,x)$ to save space. Dividing both sides by $h-t$ and passing to the limit $h\to t$ , we get (3.4).

Lemma 3.3.

Let Assumptions (X-SDE) and (X-PDE) be satisfied. Fix $m^{*}_{0}\in\mathcal{P}({\mathcal{O}})$ .

i.

Let $g:\mathcal{O}\mapsto\mathbb{R}^{+}$ be a continuous function with polynomial growth. Then almost everywhere on $t\in[0,T]$ , and for $m\in\mathcal{A}(m^{*}_{0})$ ,

[TABLE]

ii.

Let $g\in C^{2}(\mathcal{O};\mathbb{R})$ such that $g$ , $|\!|\nabla_{{}_{X}}g|\!|$ and $|\!|H_{{}_{X}}g|\!|$ are bounded. Then, for $m\in\mathcal{A}(m^{*}_{0})$ and for every $\psi\in C^{1}([0,T])$ ,

[TABLE]

for some $C>0$ .

Proof.

Part i. Assume that $f$ and $g$ are $C^{\infty}$ bounded positive functions with bounded derivatives of all orders, and let $u$ be the solution of

[TABLE]

described in Assumption (X-PDE). By Itô’s formula, for $x\in\mathcal{O}$ ,

[TABLE]

Taking the expectation and using the equation satisfied by $u$ , the fact that $|\!|\nabla_{X}u|\!|$ has polynomial growth and the a priori estimates on the strong solution of the SDE (i.e. $\sup_{t}\mathbb{E}[|\!|X_{t}|\!|^{p}]<\infty$ , for all $p\geq 1$ ), we get

[TABLE]

which means that $u$ is an admissible test function in the sense of Definition 3.1. Substituting the above expression for $u$ into the constraint (3.3), we have

[TABLE]

Since $f$ is arbitrary, this implies that

[TABLE]

$t$ -almost everywhere on $[0,T]$ . The result may be extended to a positive continuous function $g$ with polynomial growth by considering a sequence of functions $g^{l,n,m}(x):=g^{l}(x)\phi^{n,m}(x)$ , where $g^{l}:=\rho^{l}\star g$ converges uniformly on compact sets to $g$ (see Prop. 4.21 in [B2010]), $\phi^{n,m}:=\rho^{m}\star\psi^{n}$ converges pointwise to $\psi^{n}$ , where $(\rho^{l})_{l\geq 1}$ , $(\rho^{m})_{m\geq 1}$ are two sequences of mollifiers and $\psi^{n}(x):=\textbf{1}_{x\in K^{n}}$ , with $K^{n}$ a sequence of increasing compact sets approximating the open set $\mathcal{O}$ (exhaustion by compact sets of the set $\mathcal{O}$ ). Note that all elements of the sequence of functions $(g^{l,n,m})_{l,n,m}$ admit bounded derivatives of all orders (since they are continuous and have compact support). The result follows by applying first Lebesgue’s Theorem, when taking the limit with respect to $l$ and $m$ and then the monotone convergence theorem when letting $n\rightarrow\infty$ .

Part ii. First remark that

[TABLE]

is bounded on $[0,T]$ . This implies that it is enough to prove the result for $\psi\in C^{\infty}([0,T])$ , because for $\psi\in C^{1}([0,T])$ , the derivative $\psi^{\prime}$ may be approximated by smooth functions in the uniform norm.

By Itô formula, for $s\leq\tau^{(t,x)}_{\mathcal{O}}$ ,

[TABLE]

Taking the expectation and integrating by parts we obtain

[TABLE]

for some constant $C<\infty$ , due to the bounds on $g$ , $|\!|\nabla_{X}g|\!|$ , $|\!|H_{X}g|\!|$ , $|\!|\mu|\!|$ and $|\!|\sigma|\!|$ . Then we can define the function

[TABLE]

which is an admissible test function by the same argument as the one used in the first part. This proves that

[TABLE]

and since $u(0,x)\leq 2C\|\psi\|_{\infty}$ for all $x\in\mathcal{O}$ , we get the statement of the lemma. ∎

Corollary 3.4.

Under the assumptions of Lemma 3.3, let $m^{*}_{0}\in\mathcal{P}({\mathcal{O}})$ , and let $\bar{m}_{t}(dx)$ be the distribution of the process $X$ started with initial distribution $m^{*}_{0}$ and killed at the first exit time from $\mathcal{O}$ . Then for every $m\in\mathcal{A}(m^{*}_{0})$ , $m_{t}\leq\bar{m}_{t}$ , $dt$ -almost everywhere on $[0,T]$ . In particular, if $\bar{m}_{t}$ has a density then $m_{t}$ does as well.

Proof.

Approximating the indicator function with a sequence of continuous bounded functions and using the dominated convergence theorem, the first part of the above lemma yields for all $a,b\in{\mathcal{O}}$ with $a<b$ (where the inequality is interpreted componentwise),

[TABLE]

where $\bar{p}^{x}(t,dz)$ is the transition distribution of the process $X$ killed at $\tau^{x}_{\mathcal{O}}$ . ∎

In the following lemma we continue the study of the properties of the set $\mathcal{A}(m^{*}_{0})$ . The compactness of this set is established under the following assumption.

Assumption 4 ( $m^{*}_{0}$ -Compact).

The initial distribution $m^{*}_{0}\in\mathcal{P}({\mathcal{O}})$ satisfies

[TABLE]

Lemma 3.5.

Let Assumptions (X-SDE) and ( $m^{*}_{0}$ -Compact) be satisfied. Then the set $\mathcal{A}(m^{*}_{0})$ is sequentially compact.

Proof.

Let us first show the tightness of the associated set of measures on $[0,T]\times{\mathcal{O}}$ . For $i=1,\dots,d$ , define the function

[TABLE]

with $A\geq 0$ . Remark that

[TABLE]

and $\partial_{x_{j}}\phi^{i}_{A}(x)=0$ , for all $j\neq i$ , from which it is easy to see that $\phi^{i}_{A}$ is twice continuously differentiable on its entire domain, and that the expressions $\nabla_{{}_{X}}\phi^{i}_{A}(x)$ , $x^{T}\nabla_{{}_{X}}\phi^{i}_{A}(x)$ , $H_{{}_{X}}\phi^{i}_{A}(x)$ and $x^{\top}H_{{}_{X}}\phi^{i}_{A}(x)x$ are bounded on ${\mathcal{O}}$ by a constant independent from $A$ . In addition, as $A\to\infty$ , $\phi^{i}_{A}(x)$ converges in a monotone fashion to the limiting function $\phi^{i*}(x)=\ln\{1+|x_{i}|^{3}\}$ . Now, consider the test function $u_{A}(t,x)=(T-t)\sum_{i=1}^{d}\phi^{i}_{A}(x)$ . It follows that

[TABLE]

for $m\in\mathcal{A}(m^{*}_{0})$ . From the boundedness of $\mu$ and $\sigma$ and the above observations, we deduce that the expression within the brackets in the last term is bounded uniformly on $A$ . The limits of the first two terms, on the other hand, are computed by monotone convergence. Letting $\phi^{*}(x):=\sum_{i=1}^{d}\phi^{i*}(x)$ , we conclude that there exists a constant $C<\infty$ such that

[TABLE]

from which the tightness follows 222For sake of clarity, we precise the tightness criteria. Let $F$ be a topological space equipped with its Borel sigma-field. Let $(\mu_{i})_{i\in I}$ be a flow of measures on $(F,\mathcal{B}(F))$ . If there exists a measurable function $\phi:F\mapsto[0,\infty]$ with compact level sets such that $C:=\sup_{i\in I}\int_{F}\phi(x)d\mu_{i}(x)<\infty$ , then $(\mu_{i})_{i\in I}$ is tight (the proof follows immediately by the measure version of the Markov inequality).. Moreover, taking $g=1$ in Lemma 3.3 we see that $\mathcal{A}(m^{*}_{0})$ is uniformly bounded. Therefore, by Prokhorov’s theorem (Theorem 8.6.2 in [bogachev2007measure]), from any sequence of flows of measures $(m^{n})_{n\geq 1}\subseteq\mathcal{A}(m^{*}_{0})$ , one can extract a subsequence, also denoted by $(m^{n})_{n\geq 1}$ , such that the sequence of associated measures on $[0,T]\times{\mathcal{O}}$ , $(\mu^{n})_{n\geq 1}$ converges weakly to a limiting measure $\mu^{*}$ . By weak convergence, the measure $\mu^{*}$ also satisfies the constraints of $\mathcal{A}(m^{*}_{0})$ i.e., for every test function $u$ ,

[TABLE]

Taking the test function $u(t,x)=\int_{t}^{T}f(s)ds$ with $f$ a positive continuous function, we have

[TABLE]

We conclude that $\mu^{*}$ is a bounded measure and the measure $\int_{\mathcal{O}}\mu^{*}(dt,dx)$ on $[0,T]$ is absolutely continuous with respect to the Lebesgue measure, which means that we can write $\mu^{*}(dt,dx)=m^{*}_{t}(dx)\,dt$ for some $m^{*}\in\mathcal{A}(m^{*}_{0})$ . The positivity of the limiting measure flow follows from weak convergence and absolute continuity. ∎

The following proposition is an existence result for the relaxed optimal stopping problem. We need the following assumption on $f$ .

Assumption 5 ( $f$ -Exist).

One of the following alternative conditions holds true:

i.

The mapping $(t,x)\mapsto f(t,x)$ is continuous on $[0,T]\times{\mathcal{O}}$ and satisfies

[TABLE]

where $\bar{m}_{t}$ is the distribution at time $t$ of the process $X$ started with initial distribution $m_{0}^{*}$ .

ii.

The function $f$ is of the form

[TABLE]

where $n\geq 1$ and for each $i$ , $g_{i}\in C^{2}(\mathcal{O};\mathbb{R})$ is such that $g_{i}$ , $|\!|\nabla_{{}_{X}}g_{i}|\!|$ and $|\!|H_{{}_{X}}g_{i}|\!|$ are bounded., and $\bar{f}_{i}$ is bounded measurable.

Proposition 3.6.

Let Assumptions (X-SDE), (X-PDE), ( $m^{*}_{0}$ -Compact) and ( $f$ -Exist) be satisfied. Then there exists $m^{*}\in\mathcal{A}(m^{*}_{0})$ which maximizes the functional

[TABLE]

over all $m\in\mathcal{A}(m^{*}_{0})$ .

Proof.

Choose a maximizing sequence of flows of measures $(m^{n})_{n\geq 1}{\subseteq}\mathcal{A}(m^{*}_{0})$ . By Lemma 3.5, it has a subsequence, also denoted by $(m^{n})_{n\geq 1}$ , which converges weakly to a limit $m^{*}\in\mathcal{A}(m^{*}_{0})$ . To show that $m^{*}$ is a maximizer of (3.2), we consider separately the two alternative assumptions of the proposition.

Suppose that Assumption i. holds true. Fix $\varepsilon>0$ . By the continuity of $f$ and the integrability assumption, there exists $0\leq M<\infty$ such that

[TABLE]

Then, by weak convergence and by Corollary 3.4,

[TABLE]

Since $\varepsilon$ is arbitrary, $m^{n}$ is a maximizing sequence and $m^{*}\in\mathcal{A}(m^{*}_{0})$ , this finishes the proof.

Suppose now that Assumption ii. holds true instead. Without loss of generality it is enough to consider the case where $n=1$ , and we omit the index $i$ . Consider the mapping $G^{n}:[0,T]\mapsto\mathbb{R}$ defined by $G^{n}(t)=\int_{{\mathcal{O}}}g(x)m^{n}_{t}(dx)$ . By Lemma 3.3 and Proposition 3.6 in [ambrosio2000functions], $G^{n}$ is then of bounded variation on $[0,T]$ . Then, by Theorem 3.23 in the above reference, up to taking a subsequence, we may assume that the sequence of mappings $(G^{n})_{n\geq 1}$ converges in $L^{1}([0,T])$ to some mapping $G^{*}$ . On the other hand, in view of the weak convergence, for any continuous function $f:[0,T]\mapsto\mathbb{R}$ ,

[TABLE]

This shows that $G^{*}(t)=\int_{{\mathcal{O}}}g(x)m^{*}_{t}(dx)$ . We conclude that

[TABLE]

as $n\to\infty$ . ∎

4 Relaxed formulation of the optimal stopping MFG problem

We now give the definition of Nash equilibrium for the relaxed MFG optimal stopping problem. For the problem to be well-defined, we impose the following minimal assumption on the reward function $f$ :

Assumption 6 ( $f$ -min-MFG).

For every $m\in\mathcal{A}(m^{*}_{0})$ , the map

[TABLE]

is measurable and satisfies

[TABLE]

Definition 4.1.

Given the initial distribution $m^{*}_{0}$ , a flow of measures $m^{*}\in\mathcal{A}(m^{*}_{0})$ is a Nash equilibrium for the relaxed MFG optimal stopping problem (or “relaxed Nash equilibrium”) if

[TABLE]

and

[TABLE]

for all $m\in\mathcal{A}(m^{*}_{0})$ .

In other words, the set of Nash equilibria coincides with the set of fixed points of the set-valued mapping ${\Theta:\mathcal{A}(m_{0}^{*})\to 2^{\mathcal{A}(m_{0}^{*})},{\rm\,\,with\,\,}2^{\mathcal{A}(m_{0}^{*})}{\rm\,\,the\,family\,of}}$ ${{\rm sets\,over\,\,}\mathcal{A}(m_{0}^{*})}$ , defined by

[TABLE]

which is well defined whenever the function $(t,x)\mapsto f(t,x,m_{t})$ satisfies the conditions of Proposition 3.6.

The next theorem estalishes existence of the MFG equilibrium under the following assumption.

Assumption 7 ( $f$ -Exist-MFG).

Let the reward function $f$ be of the form

[TABLE]

where, for each $i$ , $g_{i},\bar{g}_{i}\in C^{2}(\mathcal{O};\mathbb{R})$ are such that $g_{i},\bar{g}_{i}$ , $|\!|\nabla_{{}_{X}}g_{i}|\!|$ , $|\!|\nabla_{{}_{X}}\bar{g}_{i}|\!|$ , $|\!|H_{{}_{X}}g_{i}|\!|$ , $|\!|H_{{}_{X}}\bar{g}_{i}|\!|$ are bounded, and $\bar{f}_{i}$ is bounded measurable and continuous with respect to its second argument.

Theorem 4.2.

Let Assumptions (X-SDE), (X-PDE), ( $m^{*}_{0}$ -Compact) and ( $f$ -Exist-MFG) be satisfied. Then there exists a Nash equilibrium for the relaxed MFG problem.

Proof.

We shall use the Fan-Glicksberg fixed-point theorem (Theorem 7.1 in [mclennan2018advanced]). We have seen that $V$ is a locally convex space; moreover, the subset $\mathcal{A}(m_{0}^{*})\subseteq V$ is compact (by Lemma 3.5 and since $\mathcal{A}(m_{0}^{*})$ is included in the space of positive and finite measures on a separable metric space, which is metrizable), convex and nonempty. The mapping $\Theta$ is clearly convex. Therefore, to prove that it has a fixed point it suffices to check that it is upper semicontinuous. In other words, we check that it has a closed graph (see Proposition 5.1.3 in [mclennan2018advanced]), where the graph is defined by

[TABLE]

To show that ${\text{Gr}(\Theta)}$ is closed it suffices to check that for any two sequences $(m^{n})_{n\geq 1}\subseteq\mathcal{A}(m^{*}_{0})$ and $(\bar{m}^{n})_{n\geq 1}\subseteq\mathcal{A}(m^{*}_{0})$ which converge weakly to $m\in\mathcal{A}(m^{*}_{0})$ and $\bar{m}\in\mathcal{A}(m^{*}_{0})$ respectively, and such that

[TABLE]

for every $\hat{m}\in\mathcal{A}(m^{*}_{0})$ , we have

[TABLE]

for every $\hat{m}\in\mathcal{A}(m^{*}_{0})$ . To prove this, it is enough to show that, up to taking a subsequence,

[TABLE]

and

[TABLE]

We will only show that $\eqref{conv1}$ holds true, since the convergence given by (4.2) follows by the same arguments. It is enough to consider the case $K=1$ and we drop the index $i$ . We therefore need to prove

[TABLE]

where we write $g*m$ as a shorthand for $\int_{\mathcal{O}}g(x)m(dx)$ . As in the proof of Proposition 3.6, we may show that $\bar{g}*\bar{m}^{n}$ converges to $\bar{g}*\bar{m}$ in $L^{1}([0,T])$ . Similarly, we may show that $g*m^{n}$ converges to $g*m$ in $L^{1}([0,T])$ . Since $f$ is continuous, $\bar{f}(t,\bar{g}*\bar{m}^{n}_{t})\,g*m^{n}_{t}$ converges almost everywhere to $\bar{f}(t,\bar{g}*\bar{m}_{t})\,g*m_{t}$ . Further, by Corollary 3.4, $g*m^{n}_{t}$ is uniformly bounded, and (4.3) follows from the dominated convergence theorem. ∎

Uniqueness of the Nash value for the relaxed MFG problem

We prove here the uniqueness result of the Nash equilibrium value for the relaxed problem, which holds under the following assumption on the map $f$ .

Assumption 8 ( $f$ -Uniq-MFG).

The function $f$ takes the following form

[TABLE]

where $g\in C^{2}(\mathcal{O};\mathbb{R})$ is such that $g$ , $|\!|\nabla_{{}_{X}}g|\!|$ and $|\!|H_{{}_{X}}g|\!|$ are bounded., $h:[0,T]\times\mathcal{O}\mapsto\mathbb{R}$ is continuous, with polynomial growth in $x$ and $\bar{f}:[0,T]\times\mathbb{R}\mapsto\mathbb{R}$ is bounded measurable, continuous and decreasing in the second argument.

Remark 4.3.

Note that under Assumption ( $f$ -Uniq-MFG), the function $f$ satisfies for each $t$ and all $m^{1}\in\mathcal{A}(m^{*}_{0})$ and $m^{2}\in\mathcal{A}(m^{*}_{0})$ the following antimonotonicity condition

[TABLE]

Theorem 4.4 (Uniqueness of the Nash value).

Let $m^{*}$ and $\bar{m}$ be two Nash equilibria for the relaxed problem and let Assumption ( $f$ -Uniq-MFG) be satisfied. Then,

[TABLE]

almost everywhere on $[0,T]$ , and in particular they lead to the same value of the relaxed fixed point problem, that is $\int_{0}^{T}\int_{\mathcal{O}}f(t,x,m^{*})m^{*}_{t}(dx)=\int_{0}^{T}\int_{\mathcal{O}}f(t,x,\bar{m})\bar{m}_{t}(dx)$ .

Proof.

Since $m^{*}$ is a Nash equilibrium, we get that

[TABLE]

Since $\bar{m}$ is also a Nash equilibrium, we obtain

[TABLE]

From the two above inequalities, we derive that

[TABLE]

The antimonotonicity property of the map ${f}$ then implies that

[TABLE]

almost everywhere on $[0,T]$ , or in other words that

[TABLE]

almost everywhere on $[0,T]$ , which implies that

[TABLE]

almost everywhere on $[0,T]$ . Integrating over $[0,T]\times{\mathcal{O}}$ we see that the two equilibria lead to the same value. ∎

Remark 4.5.

A natural question is to see if one can use a relaxed Nash equilibrium corresponding to the MFG game problem in order to construct a $\varepsilon$ -Nash equilibria for the $N$ -player game. A possible way to do it, is to show that, given $m^{*}$ a relaxed MFG equilibrium, then the empirical measures

[TABLE]

correspond to a $\varepsilon$ -Nash equilibria, where, for every $i\in\{1,2,...,N\}$ , $\tau_{i}$ maximizes

[TABLE]

This problem is left for further research.

5 Relation between the relaxed and the strong formulation of the single-agent optimal stopping and of the MFG problem and relation with mixed solutions

In this section we provide the relation between the relaxed and the strong formulation of the single-agent optimal stopping problem and of the MFG problem, as well as with the mixed solutions introduced in [bertucci2017optimal]. We make here the following additional assumption.

Assumption 9 (X-Reg).

i.

The domain ${\mathcal{O}}$ is an open bounded domain of $\mathbb{R}^{d}$ , with boundary $\Gamma:=\partial\mathcal{O}$ of class $C^{2}$ and the process $X_{\cdot}$ started with initial distribution $m^{*}_{0}$ and killed at the first exit time of $\mathcal{O}$ has a distribution $\bar{m}_{t}$ , which, for each $t$ , has a square integrable density with respect to the Lebesgue measure. 2. ii.

$\sigma$ satisfies the uniform ellipticity condition.

Remark 5.1.

Let $\mathcal{O}$ be as in Assumption (X-Reg), assume that $\sigma$ satisfies the uniform ellipticity condition, that the coefficients $a=\sigma^{\top}\sigma$ and $\mu$ are uniformly Lipschitz continuous on $[0,T]\times\mathcal{O}$ and that the initial distribution $m^{*}_{0}$ admits a bounded density with respect to the Lebesgue measure. Then, by Theorem 3.16 in [friedman83], the operator $\mathcal{L}$ admits a Green function $G(x,t;\xi,T)$ , which is continuous in $\xi$ for all $T>t$ . Moreover, the Green function admits an Aronson-type estimate of the form

[TABLE]

see Equation (16.16) in [ladyzh]. This means that the solution $u$ to the equation

[TABLE]

with boundary condition $u|_{\partial\mathcal{O}}=0$ and terminal condition $u(T,\xi)=\phi(\xi)$ is given by

[TABLE]

On the other hand, by Theorem 5.2 in [F75], this solution is given by

[TABLE]

We conclude that the Green function coincides with the density of the process started at $(t,x)$ and killed at the first exist time from $\mathcal{O}$ . The density of the process started with the initial distribution $m_{0}^{*}$ is therefore given by

[TABLE]

Since $m^{*}_{0}$ is bounded by assumption, we conclude using the bound (5.1) that the density $\bar{m}_{t}(\xi)$ is uniformly bounded on $[0,T]$ . Note that the process satisfying the conditions given in this remark also satisfies the assumptions (X-SDE) and (X-PDE) (see Remark 3.2).

Note that, by Corollary 3.4, we derive that $m_{t}$ admits a square integrable density with respect to the Lebesgue measure, for each $m\in\mathcal{A}(m_{0}^{*})$ and for a.e. $t\in(0,T]$ .

Let $W$ be a standard $K$ -dimensional Brownian motion and $X_{0}$ be a random variable with distribution $m^{*}_{0}$ , independent from $W$ . We suppose that $X_{0}$ is valued in $\mathcal{O}$ and that $m^{*}_{0}$ admits a square integrable density with respect to the Lebesgue measure. In the sequel, we denote by $\mathbb{F}$ the filtration given by $\mathcal{F}_{t}=\sigma(W_{s},0\leq s\leq t)\vee\sigma(X_{0})\vee\mathcal{N}$ , where $\mathcal{N}$ denotes the sets of zero measure. Moreover, $\mathcal{T}([t,T])$ denotes the set of stopping times with respect to this filtration with values in $[t,T]$ . We also denote by $\mathcal{T}^{t}_{W}([t,T])$ the set of stopping times with respect to the (completed) filtration generated by the translated Brownian motion $W^{t}_{s}:=W_{s}-W_{t}$ , $s\geq t$ , with values in $[t,T]$ .

We address first the case of the single-agent optimal stopping problem.

Theorem 5.2.

[Single-Agent optimal stopping problem] Let Assumptions (X-SDE), (X-PDE), (X-Reg) and ( $f$ -Exist)(ii) be satisfied. Let $v$ be the value function of the following optimal stopping problem

[TABLE]

with $(t,x)\in[0,T]\times\mathbb{R}$ and $\tau_{{\mathcal{O}}}^{(t,x)}:=\inf\{s\geq t:\,\,\,X_{s}^{(t,x)}\notin{\mathcal{O}}\}$ . We have

i.

$\int_{\mathcal{O}}v(0,x)m^{*}_{0}(dx)=\underset{m\in\mathcal{A}(m^{*}_{0})}{\sup}\int_{0}^{T}\int_{\mathcal{O}}f(s,x)m_{s}(dx)ds.$ 2. ii.

Let $x\in\mathcal{O}$ and define $\bar{\tau}^{x}:=\inf\{0\leq s\leq T:\,\,v(s,X_{s}^{x})=0\}$ . Then the measure $m^{*}$ given by $m_{t}^{*}(A):=\int_{\mathcal{O}}m^{*}_{0}(dx)\mathbb{P}[X_{t}^{x}\in A,t<\bar{\tau}^{x}]$ for all $A\in\mathcal{B}({\mathcal{O}})$ is a maximizer of the map $m\in\mathcal{A}(m_{0}^{*})\mapsto\int_{0}^{T}\int_{\mathcal{O}}f(s,x)m_{s}(dx)ds.$ 3. iii.

Let $\bar{m}$ be a maximizer of the map $m\in\mathcal{A}(m_{0}^{*})\mapsto\int_{0}^{T}\int_{\mathcal{O}}f(s,x)m_{s}(dx)ds.$ Then it satisfies:

a.

$\int_{\mathcal{S}}f(t,x)\bar{m}_{t}(dx)dt=0$ , with $\mathcal{S}:=\{(t,x)\in[0,T]\times{\mathcal{O}}:\,\,v(t,x)=0\}.$ 2. b.

For all ${C_{c}^{\infty}}$ functions $\phi$ such that $\operatorname{supp}\phi\subseteq\{(t,x)\in[0,T]\times{\mathcal{O}}:\,\,v>0\}$ , the following holds

[TABLE]

Proof.

Part i. By Theorem 4.7, Chapter 3, in [bensoussan1982applications], the value function defined by (5.2) is a solution belonging to $W^{2,1,2}(\mathcal{Q})$ , with $\mathcal{Q}:=(0,T)\times\mathcal{O}$ 333The Sobolev space $W^{2,1,2}(\mathcal{Q})$ represents the set of functions $u$ such that $\partial_{t}u,\partial_{x_{i}}u,\partial_{x_{i}x_{j}}u\in L^{2}(\mathcal{Q})$ , with $i,j=\overline{1,d}$ , where the derivatives are understood in the sense of distributions., which satisfies the following variational inequality

[TABLE]

First note that, by Lemma A.1, we have

[TABLE]

By classical results on optimal stopping and associated reflected Backward SDEs with random terminal time $T\wedge\mathcal{\tau}_{\mathcal{O}}^{X_{0}}$ (see e.g. Proposition 2.3 in [el1997reflected]), we get

[TABLE]

where

[TABLE]

Note that, by definition of the value function $v$ , we have $\bar{\tau}^{X_{0}}\leq\tau_{\mathcal{O}}^{X_{0}}\wedge T$ a.s.

Taking now the expectation in $\eqref{stop}$ , we derive that

[TABLE]

Remark that the occupational measure associated with the diffusion process $X_{\cdot}$ killed at the stopping time $\bar{\tau}^{X_{0}}$ , that is $m_{t}(A):=\int_{\mathcal{O}}m^{*}_{0}(dx)\mathbb{P}[X_{t}^{x}\in A,t<\bar{\tau}^{x}]$ , belongs to $\mathcal{A}(m_{0}^{*})$ . Therefore, we have

[TABLE]

We now show the converse inequality. Fix $m\in\mathcal{A}(m_{0}^{*})$ . Using a classical method of regularisation by convolution with a standard mollifier, with respect to both time and space (see, e.g., an extension of Meyers-Serrin’s result - Theorem 3, p. 252, in [evans]), the value function $v$ can be approximated by a sequence of functions $\varphi^{n}\in C^{\infty}([0,T]\times{\mathcal{O}},\mathbb{R}^{+})$ such that $\varphi^{n}\rightarrow v$ in $W^{2,1,2}(\mathcal{Q})\cap C([0,T],L^{2}({\mathcal{O}}))$ as $n\rightarrow\infty$ and $\partial_{t}\varphi^{n}+\mathcal{L}\varphi^{n}$ is bounded. Since $(\varphi^{n})_{n\geq 1}$ are admissible test functions, they verify the constraint (3.3). Therefore, using the assumptions on $m$ and passing to the limit, we derive that the value function $v$ satisfies

[TABLE]

From the above inequality, we derive that

[TABLE]

Since $v$ satisfies the variational inequality $\eqref{ineq}$ and due to the positivity of $m$ and Assumption ( $f$ -Reg), we get

[TABLE]

Combining the two above relations and by arbitrariness of $m\in\mathcal{A}(m_{0}^{*})$ , we get

[TABLE]

Part ii. Since the stopping time $\bar{\tau}^{X_{0}}$ given by $\eqref{opt}$ is optimal for the stopping problem $\eqref{valuefct11}$ , we derive that

[TABLE]

with $m^{*}$ defined by $m_{t}^{*}(A):=\int_{\mathcal{O}}m^{*}_{0}(dx)\mathbb{P}[X_{t}^{x}\in A,t<{\bar{\tau}}^{x}]$ for all $A\in\mathcal{B}({\mathcal{O}})$ .

Using part i. and the fact that $m^{*}\in\mathcal{A}(m_{0}^{*})$ , the result follows.

Part iii. Let $m^{*}$ be defined in part ii. Since by the results above it is a maximizer, we have $\int_{0}^{T}\int_{\mathcal{O}}f(t,x)\bar{m}_{t}(dx)dt=\int_{0}^{T}\int_{\mathcal{O}}f(t,x)m^{*}_{t}(dx)dt$ . Therefore

[TABLE]

where the last relation follows since $v$ satisfies the variational inequality $\eqref{ineq}$ . Now, since $-\frac{\partial v}{\partial t}-\mathcal{L}v=0$ a.e. on $\{v=0\}$ and $m^{*}$ satisfies $\eqref{eqq}$ , we get

[TABLE]

Using the above relation, the inequality $\eqref{eqqq}$ and the fact that $f\leq 0$ a.e. on $\{v=0\}$ , we finally obtain that

[TABLE]

and

[TABLE]

Let us now show that (5.11) implies that

[TABLE]

for all $C_{c}^{\infty}$ functions $\phi$ such that $\operatorname{supp}\phi{\color[rgb]{0,0,0}\subseteq}\{(t,x)\in[0,T]\times{\mathcal{O}}:\,\,v>0\}$ .

First note that, by the same approximation procedure as the one used for the value function $v$ in Part i. (using an extension of Meyers-Serrin’s result), any non-negative function $u$ in $W^{2,1,2}(\mathcal{Q})$ satisfies the constraint (5.8).

Let $\phi$ be a $C^{\infty}_{c}$ non-negative function such that $\operatorname{supp}\phi\subseteq\{(t,x)\in[0,T]\times{\mathcal{O}}:\,\,v>0\}$ . Up to an appropriate scale factor, one can assume that $\phi\leq v$ . Suppose that

[TABLE]

Subtracting $\eqref{equation1}$ from (5.11), we obtain that

[TABLE]

Since $v-\phi$ is a non-negative function belonging to $W^{2,1,2}(\mathcal{Q})$ , we get a contradiction. This implies that for all non-negative $C^{\infty}_{c}$ functions $\phi$ such that $\operatorname{supp}\phi\subseteq\{(t,x)\in[0,T]\times{\mathcal{O}}:\,\,v>0\}$ we have

[TABLE]

The result can be extended to an arbitrary $C^{\infty}_{c}$ function $\phi$ (which also takes negative values) such that $\operatorname{supp}\phi\subseteq\{(t,x)\in[0,T]\times{\mathcal{O}}:\,\,v>0\}$ . Using appropriate scaling factors and similar arguments as above, one can show that $\int_{0}^{T}\int_{{\mathcal{O}}}\left\{\frac{\partial\phi}{\partial t}+\mathcal{L}\phi\right\}\bar{m}_{t}(dx)dt+\int_{\mathcal{O}}\phi(0,x)m^{*}_{0}(dx)<0$ and $\int_{0}^{T}\int_{{\mathcal{O}}}\left\{\frac{\partial\phi}{\partial t}+\mathcal{L}\phi\right\}\bar{m}_{t}(dx)dt+\int_{\mathcal{O}}\phi(0,x)m^{*}_{0}(dx)>0$ cannot be satisfied. Hence, for all $C_{c}^{\infty}$ functions $\phi$ such that $\operatorname{supp}\phi\subseteq\{(t,x)\in[0,T]\times{\mathcal{O}}:\,\,v>0\}$ , we have

[TABLE]

∎

We now illustrate the relation between the relaxed and strong formulation of the optimization problem in the MFG context, as well as the relation with the mixed solutions introduced in [bertucci2017optimal].

Theorem 5.3.

[MFG optimal stopping problem] Let Assumptions (X-SDE), (X-PDE), (X-Reg) and ( $f$ -Exist-MFG) be satisfied. Let $m^{*}$ be a Nash equilibrium of the relaxed MFG problem and let $v$ be the value function of the optimal stopping problem

[TABLE]

with $(t,x)\in[0,T]\times\mathbb{R}$ and $\tau_{{\mathcal{O}}}^{(t,x)}:=\inf\{s\geq t:\,\,\,X_{s}^{(t,x)}\notin{\mathcal{O}}\}$ .

We have

i.

Relation with the strong formulation

$\int_{\mathcal{O}}v(0,x)m^{*}_{0}(dx)=\int_{0}^{T}\int_{\mathcal{O}}f(s,x,m_{s}^{*})m^{*}_{s}(dx)ds.$ 2. ii.

Relation with mixed solutions

$m^{*}$ satisfies

a.

$\int_{\mathcal{S}}f(t,x,m_{t}^{*})m^{*}_{t}(dx)dt=0$ , with $\mathcal{S}:=\{(t,x)\in[0,T]\times{\mathcal{O}}:\,\,v(t,x)=0\}.$ 2. b.

For all $C^{\infty}_{c}$ functions $\phi$ such that $\operatorname{supp}\phi\subseteq\{(t,x)\in[0,T]\times{\mathcal{O}}:\,\,v>0\}$ , the following holds

[TABLE]

Proof.

The proof follows by using the results obtained in Theorem 5.2 applied to the instantaneous reward function $f(\cdot,m^{*})$ (which satisfies Assumption ( $f$ -Exist)(ii), so that Theorem 5.2 can be applied), together with the Nash equilibrium property of $m^{*}$ . ∎

Remark 5.4.

It follows from the variational inequality $\eqref{ineq}$ that $f\leq 0$ on $\{v=0\}$ . Therefore, if $f\neq 0$ on $\{v=0\}$ , $\int_{\{v=0\}}m^{*}_{t}(dx)\,dt=0$ . Such a solution is called a pure solution in [bertucci2017optimal], meaning that the agent will exit the game immediately upon entering the exercise region.

6 Fixed-point algorithm and convergence in the case of potential games

We first show that, in the case of potential games, the search for MFG equilibrium reduces to the maximization of a functional. The reward function of a potential game satisfied the following assumption.

Assumption 10 ( $f$ -Pot).

The reward function is of the form

[TABLE]

where for each $i$ , $\bar{f}_{i}$ is bounded, measurable in $t$ , and continuous and decreasing in the second argument, and $g_{i}\in C^{2}(\mathcal{O};\mathbb{R})$ such that $g_{i}$ , $|\!|\nabla_{{}_{X}}g_{i}|\!|$ and $|\!|H_{{}_{X}}g_{i}|\!|$ are bounded. Moreover, for each $i$ , there exists $\overline{F}_{i}:[0,T]\times\mathbb{R}\mapsto\mathbb{R}$ such that $\partial_{x}\overline{F}_{i}(t,x)=\bar{f}_{i}(t,x)$ and $\overline{F}_{i}(\cdot,0)\in L^{1}([0,T])$ .

Proposition 6.1.

Let Assumption ( $f$ -Pot) be satisfied. Then $m^{*}\in\mathcal{A}(m^{*}_{0})$ is a Nash equilibrium of the relaxed optimal stopping problem if and only if

[TABLE]

where

[TABLE]

Proof.

Assume that $m^{*}$ is a Nash equilibrium. By definition we then have

[TABLE]

Since $\bar{f}_{i}$ is decreasing in the second argument, $\bar{F}_{i}$ is concave in the second argument, and by concavity this implies that $F(m^{*})\geq F(m)$ . Conversely, assume that $m^{*}$ is a maximizer of $F$ . For every $\alpha\in[0,1]$ and every $m\in\mathcal{A}(m^{*}_{0})$ , then,

[TABLE]

which implies that

[TABLE]

where $\xi_{t}\in[g*m^{*}_{t},\alpha g*m_{t}+(1-\alpha)g*m^{*}_{t}]$ . Making $\alpha$ tend to [math] and using the dominated convergence theorem, we conclude that

[TABLE]

∎

We propose now a fixed-point algorithm for potential games. We use the notations of Proposition 6.1.

Algorithm

•

Fix $m^{0}\in\mathcal{A}(m_{0}^{*});$

•

For $k=0$ to $N$

$\bullet$

Compute $u^{k}$ the solution of the obstacle problem (5) associated with $f(\cdot,m^{k})$ ;

$\bullet$

Let $\tilde{m}^{k}\in\mathcal{A}(m_{0}^{*})$ be such that $\tilde{m}_{t}^{k}(A)=\int_{\mathcal{O}}m^{*}_{0}(dx)\mathbb{P}[X_{t}^{x}\in A;\,\,t<\tau_{k}^{x}]$ , for all $A\in\mathcal{B}({\mathcal{O}})$ , where $\tau_{k}^{x}:=\inf\{0\leq t\leq T:\,\,u^{k}(t,X^{x}_{t})=0\}$ 444Note that, for each $k$ from [math] to $N$ , we extend $u^{k}$ such that $u^{k}(t,x)=0$ for all $t\in[0,T]$ and $x\notin\mathcal{O}$ . Therefore, we have $\tau_{k}^{x}\leq\tau_{\mathcal{O}}^{x}\wedge T$ a.s.;

$\bullet$

Let $\rho^{k}$ be a maximizer of $\rho\mapsto F(m^{k}+\rho(\tilde{m}^{k}-m^{k}));$

$\bullet$

Set $m^{k+1}:=m^{k}+\rho^{k}(\tilde{m}^{k}-m^{k})$ ;

$\bullet$

Set $k\leftarrow k+1$ .

In the above algorithm, $N$ represents the number of iterations.

For each $m\in\mathcal{A}(m^{*}_{0})$ , define

[TABLE]

Lemma 6.2.

Let Assumptions (X-SDE), (X-PDE), (X-Reg) and ( $f$ -Pot) be satisfied. The set-valued map $m\in\mathcal{A}(m_{0}^{*})\mapsto\mathcal{C}(m)$ has a closed graph and $\bar{m}\in\mathcal{A}(m^{*}_{0})$ is a relaxed Nash equilibrium if and only if it satisfies $\bar{m}\in\mathcal{C}(\bar{m})$ .

Proof.

Let $(m^{n})_{n\geq 1}\in\mathcal{A}(m_{0}^{*})$ be a sequence converging weakly to some $\hat{m}\in\mathcal{A}(m_{0}^{*})$ and $\bar{m}^{n}=m^{n}+\hat{\rho}^{n}(m^{n^{*}}-m^{n})$ such that $\bar{m}^{n}\in\mathcal{C}(m^{n})$ weakly converging to some $\bar{m}$ . Let us prove that $\bar{m}\in\mathcal{C}(\hat{m})$ . Taking subsequences if necessary, we can assume that $\hat{\rho}^{n}$ converges to some $\hat{\rho}\in[0,1]$ and $m^{n^{*}}$ weakly converges to some $m^{*}$ .

Since $m^{n^{*}}$ maximizes the map $m\mapsto\int_{0}^{T}\int_{\mathcal{O}}f(m^{n}_{{t}})m_{t}(dx)dt$ , we get that

[TABLE]

for all $m\in\mathcal{A}(m_{0}^{*})$ . For simplicity, we consider here the case $K=1$ and drop the index $i$ . Using the same arguments as those in the proof of Theorem 4.2, we may say that, up to taking subsequences, the sequence $(g*m^{{\color[rgb]{0,0,0}n}})_{n\geq 1}$ (resp. $(g*m^{{\color[rgb]{0,0,0}n}^{*}})_{n\geq 1}$ ) converges in $L^{1}([0,T])$ to $g*\hat{m}$ (resp. $g*m^{*}$ ). Due to the continuity of $\bar{f}$ , we derive that $\bar{f}(t,g*m^{{\color[rgb]{0,0,0}n}}_{{{\color[rgb]{0,0,0}t}}})g*m^{{\color[rgb]{0,0,0}n}^{*}}_{{{\color[rgb]{0,0,0}t}}}$ (resp. $\bar{f}(t,g*m^{{\color[rgb]{0,0,0}n}}_{{{\color[rgb]{0,0,0}t}}})g*m_{{\color[rgb]{0,0,0}t}}$ ) converges for a.e. $t$ to $\bar{f}(t,g*\hat{m}_{{\color[rgb]{0,0,0}t}})g*m^{*}_{{\color[rgb]{0,0,0}t}}$ (resp. $\bar{f}(t,g*\hat{m}_{{\color[rgb]{0,0,0}t}})g*m_{{\color[rgb]{0,0,0}t}}$ ). By Corollary 3.4, $g*m^{{\color[rgb]{0,0,0}n}^{*}}$ is uniformly bounded, therefore, by appealing to the dominated convergence theorem, we derive

[TABLE]

for all $m\in\mathcal{A}(m_{0}^{*})$ , that is

[TABLE]

Now it remains to show that $\hat{\rho}$ is a maximizer of $\rho\mapsto F(\hat{m}+\rho(m^{*}-\hat{m}))$ . For each $n$ , we have $F(m^{{\color[rgb]{0,0,0}n}}+\hat{\rho}^{{\color[rgb]{0,0,0}n}}(m^{{\color[rgb]{0,0,0}n}^{*}}-m^{{\color[rgb]{0,0,0}n}}))\geq F(m^{{\color[rgb]{0,0,0}n}}+\rho(m^{{\color[rgb]{0,0,0}n}^{*}}-m^{{\color[rgb]{0,0,0}n}}))$ , for all $\rho\in[0,1]$ , for all $n$ . Taking the limit $n\rightarrow\infty$ and using similar arguments as above, as well as the assumptions on $F$ , we get

[TABLE]

To conclude, we have $\bar{m}\in\mathcal{C}(\hat{m})$ .

It is clear that, if $m\in\mathcal{A}(m_{0}^{*})$ is a relaxed Nash equilibrium, then it satisfies $m\in\mathcal{C}(m)$ . Conversely, one can show that if $m\in\mathcal{C}(m)$ , then $m$ corresponds to a relaxed Nash equilibrium. Indeed, if $m\in\mathcal{C}(m)$ , then we have $\hat{\rho}=0$ or $m^{*}=m$ . If $\hat{\rho}=0$ , then $\int_{0}^{T}\int_{\mathcal{O}}f(m_{{\color[rgb]{0,0,0}t}})(m_{{\color[rgb]{0,0,0}t}}^{*}{\color[rgb]{0,0,0}(}dx{\color[rgb]{0,0,0})}-m_{{\color[rgb]{0,0,0}t}}{\color[rgb]{0,0,0}(}dx{\color[rgb]{0,0,0})})dt\leq 0.$ Since $m^{*}$ is a maximizer of the map $m^{\prime}\mapsto\int_{0}^{T}\int_{{\mathcal{O}}}f(m_{{\color[rgb]{0,0,0}t}})m_{{\color[rgb]{0,0,0}t}}^{\prime}{\color[rgb]{0,0,0}(}dx{\color[rgb]{0,0,0})}dt$ , we derive that $\int_{0}^{T}\int_{\mathcal{O}}f(m_{{\color[rgb]{0,0,0}t}})(m_{{\color[rgb]{0,0,0}t}}^{*}{\color[rgb]{0,0,0}(}dx{\color[rgb]{0,0,0})}-m_{{\color[rgb]{0,0,0}t}}{\color[rgb]{0,0,0}(}dx{\color[rgb]{0,0,0})})dt=0,$ which implies that $m$ corresponds to a relaxed Nash equilibrium. If $m^{*}=m$ , the conclusion is clear. ∎

We now give the following convergence result.

Theorem 6.3.

Let Assumptions (X-SDE), (X-PDE) , (X-Reg) and ( $f$ -Pot) be satisfied. Then the cluster points of the sequence $(m^{n})_{{\color[rgb]{0,0,0}n\geq 1}}$ generated by the previous algorithm belong to the set of relaxed Nash equilibria and the sequence $(u^{n}(0,x))_{{\color[rgb]{0,0,0}n\geq 1}}$ converges for all $x\in{\mathcal{O}}$ to $\bar{u}(0,x)$ , the value function of the obstacle problem associated with cost functional $f(\cdot,\bar{m})$ , where $\bar{m}$ is a relaxed Nash equilibrium.

Proof.

First note that, by using the definition of $\tilde{m}^{n}$ and Theorem 5.2 part ii., we get that $\tilde{m}^{n}\in\underset{m^{\prime}\in\mathcal{A}(m^{*}_{0})}{\arg\max}\int_{0}^{T}\int_{{\mathcal{O}}}f(m^{n}_{{\color[rgb]{0,0,0}t}})m_{{\color[rgb]{0,0,0}t}}^{\prime}{\color[rgb]{0,0,0}(}dx{\color[rgb]{0,0,0})}dt$ . We thus have $m^{{\color[rgb]{0,0,0}{n+1}}}\in\mathcal{C}(m^{{\color[rgb]{0,0,0}{n}}})$ , for all $n$ .

Let $(m^{k_{n}})_{{\color[rgb]{0,0,0}n\geq 1}}$ be a sequence converging weakly to some $m$ , and taking a subsequence again if necessary, we may also assume that $m^{k_{n}+1}$ converges to some $m_{1}$ . As by the previous theorem the set-valued map $m\in\mathcal{A}(m_{0}^{*})\mapsto\mathcal{C}(m)$ has a closed graph, we have $m_{1}\in\mathcal{C}(m)$ , that is $m_{1}=m+\hat{\rho}(m^{*}-m)$ , with $m^{*}\in\underset{m^{\prime}\in\mathcal{A}(m^{*}_{0})}{\arg\max}\int_{0}^{T}\int_{\mathcal{O}}f(m_{{\color[rgb]{0,0,0}t}})m_{{\color[rgb]{0,0,0}t}}^{\prime}{\color[rgb]{0,0,0}(}dx{\color[rgb]{0,0,0})}dt$ and $\hat{\rho}\in\underset{\rho\in[0,1]}{\arg\max}\,\,F(m+\rho(m^{*}-m)).$

Now, since the sequence $(F(m^{n}))_{{\color[rgb]{0,0,0}n\geq 1}}$ is increasing, one has $F(m)=F(m_{1}).$ Assume now that $m$ is not a Nash equilibrium, that is $m\notin\underset{m^{\prime}\in\mathcal{A}(m^{*}_{0})}{\arg\max}\int_{0}^{T}\int_{\mathcal{O}}f(m_{{\color[rgb]{0,0,0}t}})m_{{\color[rgb]{0,0,0}t}}^{\prime}{\color[rgb]{0,0,0}(}dx{\color[rgb]{0,0,0})}dt$ . Therefore, $\int_{0}^{T}\int_{\mathcal{O}}f(m_{{\color[rgb]{0,0,0}t}})(m_{{\color[rgb]{0,0,0}t}}^{*}{\color[rgb]{0,0,0}(}dx{\color[rgb]{0,0,0})}-m_{{\color[rgb]{0,0,0}t}}{\color[rgb]{0,0,0}(}dx{\color[rgb]{0,0,0})})dt>0$ . Moreover, using Lemma 6.2, we have $m\notin\mathcal{C}(m)$ which implies that $\hat{\rho}>0$ . Hence, we conclude that $F(m_{1})=F(m+\hat{\rho}(m^{*}-m))>F(m)$ , which represents a contradiction.

Let us now prove the convergence of the sequence $(u^{n}(0,x))_{{\color[rgb]{0,0,0}n\geq 1}}$ for all $x\in{\mathcal{O}}$ .

Since all Nash equilibria $m$ lead to the same value (see Theorem 4.4), we can define $\bar{u}$ as being the solution of the obstacle problem associated with $f(\cdot,\bar{m})$ , with $\bar{m}$ a Nash equilibrium.

Let $u^{k_{n}}$ be a given subsequence. Up to subtracting a subsequence again, one can assume that $m^{k_{n}}$ converges weakly to some $m^{*}\in\mathcal{A}(m_{0}^{*})$ , which, by the results above, is a relaxed Nash equilibrium.

Fix $x\in\mathcal{O}$ . We have:

[TABLE]

Using again the convergence in $L^{1}([0,T])$ of $g*m^{k_{n}}$ to $g*m^{*}$ , the assumptions on $f$ together with Lebesgue Theorem, we get that the last term of the above inequality converges to 0. We can conclude that from every subsequence of $u^{k_{n}}(0,x)$ , we can extract a further subsequence which converges to $\bar{u}(0,x)$ . The result follows. ∎

7 Acknowledgement

Peter Tankov gratefully acknowledges financial support from the LABEX ECODEC (ANR-11-IDEX-0003/LabexEcodec/ANR-11-LABX-0047) and from the FIME Research Initiative.

Appendix A Appendix

We show here that the representation (5.2) remains true when the initial condition $\xi$ is random. More precisely, we have the following result.

Lemma A.1.

Let $\xi\in{\color[rgb]{0,0,0}L}^{2}({\mathcal{O}},\mathcal{F}_{0})$ . Then we have

[TABLE]

Proof.

The proof is based on quite classical arguments and we give it here for the reader’s convenience. Let us first consider a simple random variable $\xi^{n}\in{\color[rgb]{0,0,0}L}^{2}({\mathcal{O}},\mathcal{F}_{0})$ , being such that there exists $n\in\mathbb{N}$ , $A_{1},A_{2},...,A_{n}\in\mathcal{F}_{0}$ and $x_{1},x_{2},...,x_{n}\in{\mathcal{O}}$ such that

[TABLE]

By using the definitions of $\xi^{n}$ and $v(t,x)$ , we obtain

[TABLE]

Now, in the general case, we approximate $\xi$ by a sequence of simple random variables $\xi^{n}$ of the form given by (A.2). The continuity of $v$ with respect to $x$ implies that

[TABLE]

We have

[TABLE]

Since $\xi^{n}{\color[rgb]{0,0,0}\rightarrow}\xi$ a.s. as $n{\color[rgb]{0,0,0}\rightarrow}\infty$ , we get that $\tau_{\mathcal{O}}^{{\color[rgb]{0,0,0}\xi^{n}}}{\color[rgb]{0,0,0}\rightarrow}\tau_{\mathcal{O}}^{{\color[rgb]{0,0,0}\xi}}$ a.s. as $n{\color[rgb]{0,0,0}\rightarrow}\infty$ due to the continuity property of the first passage time for elliptic diffusions (see Proposition 4.4. in [pardoux1998backward]). Using the continuity property of the solution of the SDE with respect to the initial condition, together with the assumptions on $f$ and Lebesgue Theorem, it follows that

[TABLE]

By (A.3) and (A.4) and the uniqueness of the limit, we get (A.1). ∎

Bibliography33

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] \harvarditem [Ambrosio et al.]Ambrosio, Fusco \harvardand Pallara 2000 ambrosio 2000 functions Ambrosio, L., Fusco, N. \harvardand Pallara, D. \harvardyearleft 2000 \harvardyearright , Functions of bounded variation and free discontinuity problems , Vol. 254, Clarendon Press Oxford.
2[2] \harvarditem Benamou \harvardand Carlier 2015 BC 2015 Benamou, J. \harvardand Carlier, G. \harvardyearleft 2015 \harvardyearright , ‘Augmented lagrangian methods for transport optimization, mean field games and degenerate elliptic equations’, Journal of Optimization Theory and Applications, 167(1):1-26 .
3[3] \harvarditem Bensoussan \harvardand Lions 1982 bensoussan 1982 applications Bensoussan, A. \harvardand Lions, J.-L. \harvardyearleft 1982 \harvardyearright , Applications of variational inequalities in stochastic control , North Holland Publishing Company.
4[4] \harvarditem Bertucci 2017 bertucci 2017 optimal Bertucci, C. \harvardyearleft 2017 \harvardyearright , ‘Optimal stopping in mean field games, an obstacle problem approach’, Journal de Mathématiques Pures et Appliquées .
5[5] \harvarditem Bertucci 2018 B 2018 Bertucci, C. \harvardyearleft 2018 \harvardyearright , ‘A remark on uzawa’s algorithm and an application to mean-field games systems’, https://arxiv.org/pdf/1810.01181.pdf .
6[6] \harvarditem Bogachev 2007 bogachev 2007 measure Bogachev, V. I. \harvardyearleft 2007 \harvardyearright , Measure theory , Springer Science & Business Media.
7[7] \harvarditem Brezis 2010 B 2010 Brezis, H. \harvardyearleft 2010 \harvardyearright , ‘Functional analysis, sobolev spaces and partial differential equations’, Springer .
8[8] \harvarditem [Buckdahn et al.]Buckdahn, Goreac \harvardand Quincampoix 2011 buckdahn 2011 stochastic Buckdahn, R., Goreac, D. \harvardand Quincampoix, M. \harvardyearleft 2011 \harvardyearright , ‘Stochastic optimal control and linear programming approach’, Applied Mathematics & Optimization 63 (2), 257–276.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Mean-field games of optimal stopping:

Abstract

1 Introduction

2 The model

N-players game formulation

Assumption 1** (X-SDE).**

Definition 2.1** (Nash Equilibrium NNN-players game).**

MFG formulation

3 Relaxed formulation of the single-agent optimal stopping

Assumption 2** (fff-min**).

Definition 3.1** (Relaxed optimal stopping problem).**

Assumption 3** (X-PDE).**

Remark 3.2**.**

Lemma 3.3**.**

Proof.

Corollary 3.4**.**

Proof.

Assumption 4** (m0∗m^{*}_{0}m0∗​-Compact**).

Lemma 3.5**.**

Proof.

Assumption 5** (fff-Exist**).

Proposition 3.6**.**

Proof.

4 Relaxed formulation of the optimal stopping MFG problem

Assumption 6** (fff-min-MFG**).

Definition 4.1**.**

Assumption 7** (fff-Exist-MFG**).

Theorem 4.2**.**

Proof.

Uniqueness of the Nash value for the relaxed MFG problem

Assumption 8** (fff-Uniq-MFG**).

Remark 4.3**.**

Theorem 4.4** **(Uniqueness of the Nash value).

Proof.

Remark 4.5**.**

5 Relation between the relaxed and the strong formulation of the single-agent optimal stopping and of the MFG problem and relation with mixed solutions

Assumption 9** (X-Reg).**

Remark 5.1**.**

Theorem 5.2**.**

Proof.

Theorem 5.3**.**

Proof.

Remark 5.4**.**

6 Fixed-point algorithm and convergence in the case of potential games

Assumption 10** (fff-Pot**).

Proposition 6.1**.**

Proof.

Lemma 6.2**.**

Proof.

Theorem 6.3**.**

Proof.

7 Acknowledgement

Appendix A Appendix

Lemma A.1**.**

Proof.

Assumption 1 (X-SDE).

Definition 2.1 (Nash Equilibrium $N$ -players game).

Assumption 2 ( $f$ -min).

Definition 3.1 (Relaxed optimal stopping problem).

Assumption 3 (X-PDE).

Remark 3.2.

Lemma 3.3.

Corollary 3.4.

Assumption 4 ( $m^{*}_{0}$ -Compact).

Lemma 3.5.

Assumption 5 ( $f$ -Exist).

Proposition 3.6.

Assumption 6 ( $f$ -min-MFG).

Definition 4.1.

Assumption 7 ( $f$ -Exist-MFG).

Theorem 4.2.

Assumption 8 ( $f$ -Uniq-MFG).

Remark 4.3.

Theorem 4.4 (Uniqueness of the Nash value).

Remark 4.5.

Assumption 9 (X-Reg).

Remark 5.1.

Theorem 5.2.

Theorem 5.3.

Remark 5.4.

Assumption 10 ( $f$ -Pot).

Proposition 6.1.

Lemma 6.2.

Theorem 6.3.

Lemma A.1.