Probably Approximately Correct Nash Equilibrium Learning

Filiberto Fele; Kostas Margellos

arXiv:1903.10387·math.OC·October 15, 2020

Probably Approximately Correct Nash Equilibrium Learning

Filiberto Fele, Kostas Margellos

PDF

TL;DR

This paper introduces a data-driven, probabilistically robust method for computing Nash equilibria in uncertain multi-agent games, with theoretical guarantees and decentralized computation, demonstrated on electric vehicle charging.

Contribution

It develops a PAC learning framework for Nash equilibrium computation with robustness certificates and a decentralized solution approach for scenario-based games.

Findings

01

Provides probabilistic robustness guarantees for Nash equilibria.

02

Enables decentralized equilibrium computation.

03

Validates approach on electric vehicle charging problem.

Abstract

We consider a multi-agent noncooperative game with agents' objective functions being affected by uncertainty. Following a data driven paradigm, we represent uncertainty by means of scenarios and seek a robust Nash equilibrium solution. We treat the Nash equilibrium computation problem within the realm of probably approximately correct (PAC) learning. Building upon recent developments in scenario-based optimization, we accompany the computed Nash equilibrium with a priori and a posteriori probabilistic robustness certificates, providing confidence that the computed equilibrium remains unaffected (in probabilistic terms) when a new uncertainty realization is encountered. For a wide class of games, we also show that the computation of the so called compression set - a key concept in scenario-based optimization - can be directly obtained as a byproduct of the proposed solution methodology.…

Figures5

Click any figure to enlarge with its caption.

Tables1

Table 1. TABLE I: Empirical validation of the a posteriori result of Theorem 8 .

$d^{*}$ [ - ]	4	6	7	9
Empirical $V (x^{*})$ [%]	0.98	1.09	1.26	1.33
$ε (d^{*})$ - Thm 8 [%]	8.06	9.76	10.55	12.06
$ε (d^{*})$ - bound (7) [%]	5.30	6.11	6.49	7.22

Equations94

J_{i} (x_{i}, x_{- i}) = f_{i} (x_{i}, x_{- i}) + m \in {1, \dots, M} max g (x_{i}, x_{- i}, θ_{m}),

J_{i} (x_{i}, x_{- i}) = f_{i} (x_{i}, x_{- i}) + m \in {1, \dots, M} max g (x_{i}, x_{- i}, θ_{m}),

\Omega=\big{\{}x^{\ast}=(x^{\ast}_{i})_{i\in\mathcal{N}}\in\mathcal{X}\colon\\ x^{\ast}_{i}\in\operatorname*{arg\,min}_{x_{i}\in\mathcal{X}_{i}}J_{i}(x_{i},x^{\ast}_{-i}),\,\forall i\in\mathcal{N}\big{\}}.

\Omega=\big{\{}x^{\ast}=(x^{\ast}_{i})_{i\in\mathcal{N}}\in\mathcal{X}\colon\\ x^{\ast}_{i}\in\operatorname*{arg\,min}_{x_{i}\in\mathcal{X}_{i}}J_{i}(x_{i},x^{\ast}_{-i}),\,\forall i\in\mathcal{N}\big{\}}.

(u - v)^{⊺} ((\nabla_{u_{i}} f_{i} (u))_{i \in N} - (\nabla_{v_{i}} f_{i} (v))_{i \in N}) \geq χ^{f} ∥ u - v ∥^{2},

(u - v)^{⊺} ((\nabla_{u_{i}} f_{i} (u))_{i \in N} - (\nabla_{v_{i}} f_{i} (v))_{i \in N}) \geq χ^{f} ∥ u - v ∥^{2},

(u - v)^{⊺} (\nabla_{u} g (u, θ) - \nabla_{v} g (v, θ)) \geq χ^{g} ∥ u - v ∥^{2},

V (x^{*}) = \mathds P {θ \in Θ : x^{*} \in / Ω^{+}}

V (x^{*}) = \mathds P {θ \in Θ : x^{*} \in / Ω^{+}}

ε (M) = 1, and k = 0 \sum M - 1 (k M) (1 - ε (k))^{M - k} = β .

ε (M) = 1, and k = 0 \sum M - 1 (k M) (1 - ε (k))^{M - k} = β .

\mathds P^{M} {(θ_{1}, \dots, θ_{M}) \in Θ^{M} : V (x^{*}) \leq ε (d^{*})} \geq 1 - β,

\mathds P^{M} {(θ_{1}, \dots, θ_{M}) \in Θ^{M} : V (x^{*}) \leq ε (d^{*})} \geq 1 - β,

\frac{β}{M + 1} m = k \sum M (k m) t^{m - k} - (k M) t^{M - k} = 0.

\frac{β}{M + 1} m = k \sum M (k m) t^{m - k} - (k M) t^{M - k} = 0.

\mathds P^{M} {(θ_{1}, \dots, θ_{M}) \in Θ^{M} : V (x^{*}) \leq ε ((n + 1) N)} \geq 1 - β .

\mathds P^{M} {(θ_{1}, \dots, θ_{M}) \in Θ^{M} : V (x^{*}) \leq ε ((n + 1) N)} \geq 1 - β .

V_{c} (x^{*}) = \mathds P {θ \in Θ : g (x^{*}, θ) > m \in {1, \dots, M} max g (x^{*}, θ_{m})} .

V_{c} (x^{*}) = \mathds P {θ \in Θ : g (x^{*}, θ) > m \in {1, \dots, M} max g (x^{*}, θ_{m})} .

x_{i} \in ν_{i} \in X_{i} arg min f_{i} (ν_{i}, x_{- i}) + \overset{g}{^} (ν_{i}, x_{- i}, y) m = 1 \sum M y_{m} g (ν_{i}, x_{- i}, θ_{m}),

x_{i} \in ν_{i} \in X_{i} arg min f_{i} (ν_{i}, x_{- i}) + \overset{g}{^} (ν_{i}, x_{- i}, y) m = 1 \sum M y_{m} g (ν_{i}, x_{- i}, θ_{m}),

m \in {1, \dots, M} max g (x, θ_{m}) = y \in Δ max m = 1 \sum M y_{m} g (x, θ_{m}),

m \in {1, \dots, M} max g (x, θ_{m}) = y \in Δ max m = 1 \sum M y_{m} g (x, θ_{m}),

y \in ν \in Δ arg max \overset{g}{^} (x, ν) .

y \in ν \in Δ arg max \overset{g}{^} (x, ν) .

F(x,y)=\left[\begin{array}[]{c}\left(\nabla_{x_{i}}f_{i}(x)+\nabla_{x_{i}}\hat{g}(x,y)\right)_{i\in\mathcal{N}}\\[6.0pt] -\left(\nabla_{y_{m}}\hat{g}(x,y)\right)_{m=1}^{M}\end{array}\right].

F(x,y)=\left[\begin{array}[]{c}\left(\nabla_{x_{i}}f_{i}(x)+\nabla_{x_{i}}\hat{g}(x,y)\right)_{i\in\mathcal{N}}\\[6.0pt] -\left(\nabla_{y_{m}}\hat{g}(x,y)\right)_{m=1}^{M}\end{array}\right].

find subject to z^{*} \in X \times Δ (z - z^{*})^{⊺} F (z^{*}) \geq 0, \forall z \in X \times Δ.

find subject to z^{*} \in X \times Δ (z - z^{*})^{⊺} F (z^{*}) \geq 0, \forall z \in X \times Δ.

JF(x,y)=\left[\begin{array}[]{cc}J_{x}F^{x}(x,y)&J_{y}F^{x}(x,y)\\ -J_{x}F^{y}(x,y)&-J_{y}F^{y}(x,y)\end{array}\right]

JF(x,y)=\left[\begin{array}[]{cc}J_{x}F^{x}(x,y)&J_{y}F^{x}(x,y)\\ -J_{x}F^{y}(x,y)&-J_{y}F^{y}(x,y)\end{array}\right]

ν^{⊺} (\nabla_{x_{i} x_{j}}^{2} f_{i} (x))_{i, j \in N} ν ν^{⊺} \nabla_{xx}^{2} \overset{g}{^} (x, y) ν \geq χ^{f} ∥ ν ∥^{2}, \geq χ^{g} ∥ ν ∥^{2} .

ν^{⊺} (\nabla_{x_{i} x_{j}}^{2} f_{i} (x))_{i, j \in N} ν ν^{⊺} \nabla_{xx}^{2} \overset{g}{^} (x, y) ν \geq χ^{f} ∥ ν ∥^{2}, \geq χ^{g} ∥ ν ∥^{2} .

ν^{⊺} ((\nabla_{x_{i} x_{j}}^{2} f_{i} (x))_{i, j \in N} + \nabla_{xx}^{2} \overset{g}{^} (x, y)) ν = ν^{⊺} J_{x} F^{x} (x, y) ν \geq (χ^{f} + χ^{g}) ∥ ν ∥^{2}, \forall ν \in R^{n N},

ν^{⊺} ((\nabla_{x_{i} x_{j}}^{2} f_{i} (x))_{i, j \in N} + \nabla_{xx}^{2} \overset{g}{^} (x, y)) ν = ν^{⊺} J_{x} F^{x} (x, y) ν \geq (χ^{f} + χ^{g}) ∥ ν ∥^{2}, \forall ν \in R^{n N},

z^{*} =

z^{*} =

subject to (z - z^{*})^{⊺} F (z^{*}) \geq 0, \forall z \in X \times Δ.

x_{i} = ν_{i} \in X_{i} arg min f_{i} (ν_{i}, x_{- i}) + \overset{g}{^} (ν_{i}, x_{- i}, y) + \frac{η}{2} ∥ (ν_{i}, x_{- i}, y) ∥^{2} + \frac{τ}{2} ∥ ν_{i} - \overset{x}{ˉ}_{i} ∥^{2},

x_{i} = ν_{i} \in X_{i} arg min f_{i} (ν_{i}, x_{- i}) + \overset{g}{^} (ν_{i}, x_{- i}, y) + \frac{η}{2} ∥ (ν_{i}, x_{- i}, y) ∥^{2} + \frac{τ}{2} ∥ ν_{i} - \overset{x}{ˉ}_{i} ∥^{2},

y = ν \in Δ arg max \overset{g}{^} (x, ν) - \frac{η}{2} ∥ (x, ν) ∥^{2} - \frac{τ}{2} ∥ ν - \overset{y}{ˉ} ∥^{2},

y = ν \in Δ arg max \overset{g}{^} (x, ν) - \frac{η}{2} ∥ (x, ν) ∥^{2} - \frac{τ}{2} ∥ ν - \overset{y}{ˉ} ∥^{2},

(z - z^{*})^{⊺} (F (z^{*}) + η z^{*} + τ (z^{*} - \overset{z}{ˉ})) \geq 0, \forall z, \overset{z}{ˉ} \in X \times Δ.

(z - z^{*})^{⊺} (F (z^{*}) + η z^{*} + τ (z^{*} - \overset{z}{ˉ})) \geq 0, \forall z, \overset{z}{ˉ} \in X \times Δ.

(u - v)

(u - v)

= (u - v)^{⊺} (F (u) - F (v)) + (η + τ) ∥ u - v ∥^{2}

\geq τ ∥ u - v ∥^{2},

γ^{*} = m \in {1, \dots, M} max g (x^{*}, θ_{m}) .

γ^{*} = m \in {1, \dots, M} max g (x^{*}, θ_{m}) .

H_{θ} = {(x, γ) : g (x, θ) \leq γ} .

H_{θ} = {(x, γ) : g (x, θ) \leq γ} .

\mathds{P}^{M}\big{\{}(\theta_{1},\ldots,\theta_{M})\in\Theta^{M}:\\ \mathds{P}\{\theta\in\Theta\colon(x^{\ast},\gamma^{\ast})\notin H_{\theta}\}\leq\varepsilon(d^{*})\big{\}}\geq 1-\beta,

\mathds{P}^{M}\big{\{}(\theta_{1},\ldots,\theta_{M})\in\Theta^{M}:\\ \mathds{P}\{\theta\in\Theta\colon(x^{\ast},\gamma^{\ast})\notin H_{\theta}\}\leq\varepsilon(d^{*})\big{\}}\geq 1-\beta,

(x^{*}, γ^{*}) \in H_{θ_{m}}, \forall m \in {1, \dots, M} .

(x^{*}, γ^{*}) \in H_{θ_{m}}, \forall m \in {1, \dots, M} .

(x_{i}^{*}, γ^{*}) \in

(x_{i}^{*}, γ^{*}) \in

subject to

(x^{*}, y^{*}) \in X \times Δ min \frac{1}{2} ∥ (x^{*}, y^{*}) ∥^{2}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Probably Approximately Correct

Nash Equilibrium Learning

Filiberto Fele and Kostas Margellos Research was supported by the UK Engineering and Physical Sciences Research Council (EPSRC) under grant agreement EP/P03277X/1. The authors are with the Department of Engineering Science, University of Oxford, OX1 3PJ, UK {filiberto.fele, kostas.margellos}@eng.ox.ac.uk

Abstract

We consider a multi-agent noncooperative game with agents’ objective functions being affected by uncertainty. Following a data driven paradigm, we represent uncertainty by means of scenarios and seek a robust Nash equilibrium solution. We treat the Nash equilibrium computation problem within the realm of probably approximately correct (PAC) learning. Building upon recent developments in scenario-based optimization, we accompany the computed Nash equilibrium with a priori and a posteriori probabilistic robustness certificates, providing confidence that the computed equilibrium remains unaffected (in probabilistic terms) when a new uncertainty realization is encountered. For a wide class of games, we also show that the computation of the so called compression set — a key concept in scenario-based optimization — can be directly obtained as a byproduct of the proposed solution methodology. Finally, we illustrate how to overcome differentiability issues, arising due to the introduction of scenarios, and compute a Nash equilibrium solution in a decentralized manner. We demonstrate the efficacy of the proposed approach on an electric vehicle charging control problem.

Index Terms:

Nash equilibria, Robust game theory, Scenario approach, Variational inequalities, Electric vehicles.

I Introduction

Game theory has attracted significant attention in the control systems community [1], and has found numerous applications ranging from smart grid [2, 3, 4] and electricity markets [5, 6], to communication networks [7] and regulatory compliance [8, 9]. The concept of Nash equilibrium (NE) is central in this context, as it defines no-regret strategies for noncooperative, selfish agents [10, 11]. As a result, NE has been a popular solution for multi-agent distributed and decentralized control architectures, investigating also the connections with social welfare; see, e.g., [12, 13, 14, 15, 16, 17] and references therein.

Uncertainty has been widely addressed in Nash games, by adopting stochastic or worst-case approaches. In the first case, both chance-constrained (risk-averse) [18, 19, 20] or expected payoff criteria [21, 22, 23, 24, 25] have been considered. For tractability, these methods typically involve assumptions on the underlying probability distribution of uncertainty realization. In the second case, results build upon robust control theory [26, 27, 28]; however, these rely on certain assumptions on the geometry of the uncertainty set; see [29, 20].

In this paper we consider a multi-agent NE seeking problem with uncertainty affecting agents’ objective functions. We depart from existing paradigms and follow a data driven methodology, where we represent uncertainty by a finite set of scenarios that could either be extracted from historical data, or by means of some prediction model (e.g., regression, Markov chains, neural networks) [30]. Adopting a data driven methodology poses a main challenge: NE are inherently random as they depend on the observed scenarios. Therefore, our objective is to investigate the sensitivity of the resulting NE to the uncertainty, in a probabilistic sense. More specifically, our contributions can be outlined as follows:

We treat the NE computation problem in a probably approximately correct (PAC) learning framework [31, 32, 33], and employ the so called scenario approach [34]. Building on [35] we first provide an a posteriori certificate on the probability that a NE remains unaltered upon a new realization of the uncertainty. We then rely on [36] and provide an a priori probabilistic certificate on the equilibrium sensitivity, under an additional non-degeneracy assumption (see Section II for a definition). The obtained results are distribution-free, and as such the underlying probability distribution of the uncertainty could be unknown and the only requirement is the availability of samples.

Blending the scenario approach with game theory has only recently appeared in the literature [37, 38]. The validity of the probabilistic statements presented in this paper extends to games admitting multiple NE. Moreover, all our a posteriori statements allow for degenerate problem instances; the latter circumvents the need of verifying the non-degeneracy assumption which, unlike convex optimization programs, is often not satisfied in games.

Under the additional assumption that the game under consideration admits a unique NE, or for aggregative games with multiple equilibria but a unique aggregate solution, we show that a compression set (a key concept in learning and generalization — see Section II for a definition) can be directly computed by inspection of the solution returned by the proposed algorithm. This feature has significant computational advantages as it prevents the use of a greedy mechanism (see, e.g., [35]), which would require running up to numerical convergence multiple times (possibly as many as the number of samples) a NE seeking algorithm (see Section V).
We provide a constructive proof of the existence of a single-valued mapping from the set of observed scenarios to a NE of the robust game, where the latter possibly admits multiple equilibria (and multiple maximisers). More specifically, we build an iterative algorithm for decentralized NE computation. To circumvent nondifferentiability issues and incorporate an equilibrium selection mechanism we bridge the results in [39] and [7] that involve resorting to an augmented $\min-\max$ game. The proposed scheme enjoys the same convergence properties as state-of-the-art decentralized algorithms for monotone games [7] (see Section III).

Note that the results presented in this paper do not contemplate constraints coupling agents’ strategies. The latter give rise to generalized NE problems; we refer the reader to [12, 15, 40, 41] for details.

In Section II we introduce the scenario-based Nash game, pose the main problem, and present the main results of the paper. Section III provides a decentralized construction of a solution algorithm for the game under study, while Section IV contains the proof of the main results. In Section V we provide for a wide class of games a computationally efficient methodology to determine an upper bound to the cardinality of the compression set. Section VI provides an electric-vehicle charging control case study, while Section VII concludes the paper and provides some directions for future work.

II Scenario based multi-agent game

II-A Gaming set-up

Let the set $\mathcal{N}=\{1,\ldots,N\}$ designate a finite population of agents. The decision vector, henceforth referred to as strategy, of each agent $i\in\mathcal{N}$ is denoted by $x_{i}\in\mathbb{R}^{n}$ and satisfies individual constraints encoded by the set $\mathcal{X}_{i}\subset\mathbb{R}^{n}$ . We denote by $x=(x_{i})_{i\in\mathcal{N}}\in\mathcal{X}\subset\mathbb{R}^{nN}$ the collection of all agents’ strategies, where $\mathcal{X}=\mathcal{X}_{1}\times\cdots\times\mathcal{X}_{N}$ . For any agent $i\in\mathcal{N}$ , $x_{-i}=(x_{j})_{j\in\mathcal{N}\setminus\{i\}}$ denotes the collection of strategies of all other agents.

Let $\theta$ be an uncertain vector taking values over some set $\Theta$ , endowed with a $\sigma$ -algebra $\mathscr{Q}$ , and let $\mathds{P}$ denote the probability measure defined over $\mathscr{Q}$ . For all subsequent derivations fix any $M\in\mathbb{N}$ , and let $(\theta_{1},\ldots,\theta_{M})\in\Theta^{M}$ be a finite collection of independent and identically distributed (i.i.d.) scenarios/realizations of the uncertain vector $\theta$ , that we will henceforth designate as $M$ -multisample. For given strategies of the remaining agents $x_{-i}$ , each agent $i\in\mathcal{N}$ aims at minimizing with respect to $x_{i}$ the function

[TABLE]

where $f_{i}:\mathbb{R}^{nN}\to\mathbb{R}$ expresses a deterministic objective, different for each agent $i$ but still dependent on the strategies of all agents, while $g:\mathbb{R}^{nN}\times\Theta\to\mathbb{R}$ encodes a common component in the agents’ objective function that depends on the uncertain vector. Agents are interested in minimizing their local objective $f_{i}$ and the worst-case (maximum) value $g$ can take among a finite set of scenarios. The electric vehicle charging control problem of Section VI provides a natural interpretation of such a set-up, where electric vehicles are selfish entities each one with a possibly different utility function $f_{i}$ ; however, they could be participating in the same aggregation plan or belonging to a centrally managed fleet, thus giving rise to a common $g$ .

We consider a noncooperative game among the $N$ agents, described by the tuple $\mathcal{G}=\langle\mathcal{N},(\mathcal{X}_{i})_{i\in\mathcal{N}},(J_{i})_{i\in\mathcal{N}},\{\theta_{m}\}_{m=1}^{M}\rangle$ , where $\mathcal{N}$ is the set of agents/players, $\mathcal{X}_{i}$ , $J_{i}$ are respectively the strategy set and the cost function for each agent $i\in\mathcal{N}$ , and $\{\theta_{1},\ldots,\theta_{M}\}$ is a finite collection of samples. We consider the following solution concept for $\mathcal{G}$ :

Definition 1 (Nash equilibrium).

Let $\Omega\subseteq\mathcal{X}$ denote the set of Nash equilibria of $\mathcal{G}$ , defined as

[TABLE]

We impose the following standing assumptions:

Assumption 2.

*(i) For any $\theta\in\Theta$ , and any $x_{-i}\in\mathcal{X}_{-i}=\Pi_{j\neq i\in\mathcal{N}}\mathcal{X}_{j}$ , $f_{i}(\cdot,x_{-i})+g(\cdot,x_{-i},\theta)$ is convex and continuous differentiable, while the local constraint set $\mathcal{X}_{i}$ is nonempty, compact and convex for all $i\in\mathcal{N}$ .

(ii) For any $\theta\in\Theta$ , and for all $i\in\mathcal{N}$ , the functions $g$ and $f_{i}$ are twice differentiable on an open convex set containing $\mathcal{X}$ .

(iii) The pseudo-gradient $(\nabla_{x_{i}}f_{i}(x))_{i\in\mathcal{N}}$ is monotone with constant $\chi^{f}\in\mathbb{R}$ , while $\nabla_{x}g(x,\theta)$ is monotone with constant $\chi^{g}\in\mathbb{R}$ for any fixed $\theta$ , i.e., for any $u,v\in\mathbb{R}^{nN}$ , and $\theta\in\Theta$ ,*

[TABLE]

and $\chi^{f}+\chi^{g}\geq 0$ .

We wish to emphasize that convexity of $f_{i}(\cdot,x_{-i})+g(\cdot,x_{-i},\theta)$ (for any fixed $x_{-i}$ and $\theta$ ) does not require $f_{i}(\cdot,x_{-i})$ and $g(\cdot,x_{-i},\theta)$ to be both convex; indeed, Assumption 2 allows either function to be weakly convex. Similarly, it is not required that both $\chi^{f},\chi^{g}$ are non-negative, but only $\chi^{f}+\chi^{g}\geq 0$ . Note that a sufficient (but stronger) condition for the monotonicity requirements to be satisfied is for $f_{i}+g$ to be jointly convex with respect to $x$ .

II-B Problem statement

As every NE $x^{\ast}\in\Omega$ is a random vector due to its dependency on the $M$ -multisample, a question that naturally arises is how sensitive a NE is against a new realization of the uncertainty. More formally, let $x^{\ast}\in\Omega$ be a NE of the game with $M$ samples. Consider a new realization $\theta\in\Theta$ , and let $\mathcal{G}^{+}=\langle\mathcal{N},(\mathcal{X}_{i})_{i\in\mathcal{N}},(J_{i})_{i\in\mathcal{N}},\{\theta_{m}\}_{m=1}^{M}\cup\{\theta\}\rangle$ be the game defined over the $M+1$ scenarios $\{\theta_{1},\ldots,\theta_{M},\theta\}$ ; denote by $\Omega^{+}$ the set of the associated NE. Then, for all $x^{\ast}\in\Omega$ , let

[TABLE]

denote the probability that a NE of $\mathcal{G}$ does not “remain” a NE of $\mathcal{G}^{+}$ , i.e., of the game characterized by the extraction of an additional sample. Note that $V(x^{\ast})$ is in turn a random variable, as its argument depends on the multisample $\{\theta_{1},\ldots,\theta_{M}\}$ . To provide a rigorous answer to the above question we will study the generalization properties of $x^{\ast}$ within a probably approximately correct (PAC) learning framework. With a given confidence/probability with respect to the product measure $\mathds{P}^{M}$ (as the samples are extracted in an i.i.d. fashion), we aim at quantifying $V(x^{\ast})$ .

To achieve such a characterization we provide some basic definitions. Let $\Phi\colon\Theta^{M}\to\Omega$ be a single-valued mapping from the set of $M$ -multisamples to the set of equilibria of $\mathcal{G}$ .

Remark 3.

The game $\mathcal{G}$ , the set of NE $\Omega$ , the mapping $\Phi$ (as well as of other associated quantities introduced in the sequel) depend on $M$ via the $M$ -multisample employed. Therefore, they are parameterized by $M$ , giving rise to a family of games, NE sets and mappings. To ease notation we do not show this dependency explicitly. Also, the dimension of the domain of $\Phi$ is to be intended in accordance with $M$ .

Definition 4 (Support sample [36]).

Fix any i.i.d. $M$ -multisample $(\theta_{1},\ldots,\theta_{M})\in\Theta^{M}$ , and let $x^{\ast}=\Phi(\theta_{1},\ldots,\theta_{M})$ be a NE of $\mathcal{G}$ . Let $x^{\circ}=\Phi(\theta_{1},\ldots,\theta_{s-1},\theta_{s+1},\ldots,\theta_{M})$ be the solution obtained by discarding the sample $\theta_{s}$ . We call the latter a support sample if $x^{\circ}\neq x^{\ast}$ .

Definition 5 (Compression set — adapted from [35]).

Fix any i.i.d. $M$ -multisample $(\theta_{1},\ldots,\theta_{M})\in\Theta^{M}$ , and let $x^{\ast}=\Phi(\theta_{1},\ldots,\theta_{M})$ be a NE of $\mathcal{G}$ . Consider any subset $\mathcal{C}\subseteq\{\theta_{1},\ldots,\theta_{M}\}$ and let $x^{\circ}=\Phi(\mathcal{C})$ . We call $\mathcal{C}$ a compression set if $x^{\circ}=x^{\ast}$ .

The notion of compression set has appeared in the literature under different names; its properties are studied in full detail in [35], where it is designated as support subsample. Here we adopt the term compression set as in [31, 33] to avoid confusion with Definition 4.

Let $\mathfrak{C}(\theta_{1},\ldots,\theta_{M})$ be the collection of all compression sets associated with the $M$ -multisample $\{\theta_{1},\ldots,\theta_{M}\}$ . We refer to the cardinality $|\mathcal{C}|$ of some compression set $\mathcal{C}\in\mathfrak{C}(\theta_{1},\ldots,\theta_{M})$ as the compression cardinality $d^{*}$ (we do not make explicit the dependence on the specific $\mathcal{C}$ to ease notation, as the results below hold for any compression set in $\mathfrak{C}$ ). Note that $\mathfrak{C}$ — hence also $d^{*}$ — is itself a random variable as it depends on the $M$ -multisample.

Definition 6 (Non-degeneracy — adapted from [42]).

For any $M\in\mathbb{N}$ , with $\mathds{P}^{M}$ -probability equal to $1$ , the NE $x^{\ast}=\Phi(\theta_{1},\ldots,\theta_{M})$ coincides with the NE returned by $\Phi$ when the latter takes as argument only the support samples. The corresponding game is then said to be non-degenerate; otherwise it is called degenerate.

It follows that for non-degenerate problems the support samples form a compression set with $\mathds{P}^{M}$ -probability 1. For degenerate problems the notions in Definitions 4 and 5 do not necessarily coincide; in particular, the support samples form a strict subset of any compression set in $\mathfrak{C}$ . For a detailed discussion on degeneracy in scenario-based contexts, we refer the reader to [43, 42].

II-C Main results

We first show that a single-valued mapping $\Phi\colon\Theta^{M}\to\Omega$ from the set of $M$ -multisamples to the set of NE of the game $\mathcal{G}$ indeed exists.

Proposition 7.

Under Assumption 2 there exists a single-valued decentralized mapping $\Phi\colon\Theta^{M}\to\Omega$ .

The mapping can be computed in a decentralized manner, thus fitting the inherent structure of the game. Its construction, and hence the proof of Proposition 7, is provided in Section III.

II-C1 A posteriori certificate

We provide an a posteriori quantification of an upper bound for $V(x^{\ast})$ . This is summarized in the following theorem.

Theorem 8.

Consider Assumption 2. Fix $\beta\in(0,1)$ and let $\varepsilon\colon\{0,\ldots,M\}\to[0,1]$ be a function satisfying

[TABLE]

Let $x^{\ast}=\Phi(\theta_{1},\ldots,\theta_{M})$ , where $(\theta_{1},\ldots,\theta_{M})\in\Theta^{M}$ is an i.i.d. sample from $\Theta$ . We then have that

[TABLE]

where $d^{*}\leq M$ is the cardinality of any compression set of $\{\theta_{1},\ldots,\theta_{M}\}$ .

Theorem 8 shows that — with confidence at least $1-\beta$ — the probability that a NE $x^{\ast}$ of $\mathcal{G}$ does not “remain” an equilibrium of $\mathcal{G}^{+}$ (i.e., when an additional sample $\theta\in\Theta$ is considered) is at most $\varepsilon(d^{*})$ . Note that (6) captures the generalization properties of $x^{\ast}$ , where $1-\beta$ accounts for the ‘probably’ and $\varepsilon(d^{*})$ for the ‘approximately correct’ term used within a PAC learning framework. The value of $\varepsilon(\cdot)$ is defined in accordance to [35] and depends on the observed compression cardinality $d^{*}$ , which in turn depends on the random multiextraction $\{\theta_{1},\ldots,\theta_{M}\}$ thus giving rise to the a posteriori nature of the result. As a consequence, the level of conservatism of the obtained certificate depends on $d^{*}$ ; the smaller the cardinality of the computed compression set, the tighter the bound; see Section V for a detailed elaboration on the computation of $d^{*}$ . The proof of Theorem 8 is provided in Section IV-A.

In the case of a non-degenerate game (see Definition 6), the bound could be significantly improved by means of the wait-and-judge analysis of [42]: specifically, by Theorem 2 in [42], we can replace the expression for $\varepsilon(\cdot)$ in (5) with $\varepsilon(k)=1-t(k)$ , where $t(k)$ is the unique solution in $(0,1)$ of

[TABLE]

However, note that non-degeneracy is a condition in general difficult to verify even in convex optimization settings, a challenge that becomes more prominent in games.

II-C2 A priori certificate

We now provide an a priori quantification of an upper-bound of $V(x^{\ast})$ . This is summarized in the following theorem.

Theorem 9.

Consider Assumption 2, and further assume that the game is non-degenerate according to Definition 6. Fix $\beta\in(0,1)$ and consider $\varepsilon\colon\{0,\ldots,M\}\to[0,1]$ be a function satisfying (5). Let $x^{\ast}=\Phi(\theta_{1},\ldots,\theta_{M})$ , where $(\theta_{1},\ldots,\theta_{M})\in\Theta^{M}$ is an i.i.d. multisample. We then have that

[TABLE]

The proof of Theorem 9 is provided in Section IV-B. Although similar in form to Theorem 8, the bound on $V(x^{\ast})$ provided by Theorem 9 additionally relies on the developments in [36, 43]: these results are independent of the given multisample and linked instead to the problem structure. In this way, $\varepsilon(\cdot)$ is evaluated on the sample-independent quantity $(n+1)N$ , expressing the dimension $nN$ of the agents’ decision space plus $N$ additional variables, explained by the epigraphic reformulation introduced in the proof of Theorem 9. If we further assume that for all $i\in\mathcal{N}$ , for every fixed $x_{-i}\in\mathcal{X}_{-i}$ and $\theta\in\Theta$ , both $f_{i}(\cdot,x_{-i})$ and $g(\cdot,x_{-i},\theta)$ are convex, we would only need one epigraphic variable, hence the argument of $\varepsilon(\cdot)$ could be replaced by $nN+1$ (see Section IV-B).

Since we strengthen here the assumptions of Theorem 9 by imposing a non-degeneracy condition (see Definition 6), (5) could be directly replaced by the tighter expression (7). We wish to emphasize that, even if the non-degeneracy assumption holds, it may still be preferable to calculate the cardinality $d^{*}$ in an a posteriori fashion, as in certain problems the latter might be significantly lower compared to $(n+1)N$ . This is also the case in the electric vehicle charging control problem of Section VI.

Corollary 10.

Let $x^{\ast}=\Phi(\theta_{1},\ldots,\theta_{M})$ and define

[TABLE]

Then,

Under the assumptions of Theorem 8, (6) holds with $V_{c}(x^{\ast})$ in place of $V(x^{\ast})$ . 2. 2.

Under the assumptions of Theorem 9, (8) holds with $V_{c}(x^{\ast})$ in place of $V(x^{\ast})$ .

Corollary 10 shows that, with given confidence, the probability that $g(x^{\ast},\theta)$ — hence also each agent’s objective function — deteriorates when a new realization of the uncertainty is encountered can be bounded both in an a posteriori and an a priori fashion as in Theorem 8 and Theorem 9, respectively. These statements are established within the proofs of Theorems 8 and 9 (see (24)).

Remark 11.

The results of Theorems 8 and 9 can be adapted to the case where the uncertain part of the objective function is different for each agent, i.e., if $g$ is replaced by $g_{i}$ , $i\in\mathcal{N}$ . We keep our presentation with a common $g$ since for this case we are able to construct $\Phi$ in a decentralized manner, as shown in Section III; the decentralized computation of $\Phi$ if the uncertain part of the objective function is different for each agent encompasses additional challenges (see [39, Rem. 1]) and is outside the scope of our paper.

III Decentralized NE computation

In this section we show how to construct $\Phi$ , necessary ingredient in the proof of Proposition 7. In particular, we show that the image of $\Phi$ corresponds to the limit of a decentralized algorithm that returns a NE of the game $\mathcal{G}$ . To achieve this, we characterise the NE of $\mathcal{G}$ as solutions to a variational inequality (VI) [44]. We then leverage results in the literature to obtain sufficient conditions for the existence of equilibria, and set the foundations for the design of a decentralized NE computation mechanism [7].

III-A VI analysis

It can be observed that the presence of the $\max$ operator renders agents’ objective functions (1) non-differentiable. To circumvent the computation of sub-gradients and exploit the wide range of algorithms available to solve VIs in the differentiable case, we follow the method in [39] and define the augmented game $\widehat{\mathcal{G}}$ between $N+1$ agents. In $\widehat{\mathcal{G}}$ each player $i\in\mathcal{N}$ , given $x_{-i}$ and $y=(y_{m})_{m=1}^{M}$ , computes

[TABLE]

where $\hat{g}(x,y)$ follows from the equivalence, holding for any $x$ ,

[TABLE]

where $\Delta=\left\{y\in\mathbb{R}^{M}\colon y\geq 0,\sum_{m=1}^{M}y_{m}=1\right\}$ is the simplex in $\mathbb{R}^{M}$ [45, Lemma 6.2.1]. The additional agent (could be thought of as a coordinating authority), given $x$ , will act instead as a maximizing player for the uncertain component of $J_{i}$ , $i\in\mathcal{N}$ , i.e.,

[TABLE]

Note that, for any $M$ -multisample $(\theta_{1},\ldots,\theta_{M})\in\Theta^{M}$ , the objective functions in (9) and (11) are differentiable by Assumption 2. We now link the NE of the augmented game $\widehat{\mathcal{G}}$ to a VI. To this end, we define the mapping $F(x,y)\colon\mathcal{X}\times\Delta\to\mathbb{R}^{(nN+M)}$ as the pseudo-gradient [44, §1.4.1]

[TABLE]

Letting $z=(x,y)$ , the VI problem takes the form [44, §1.4.2]

[TABLE]

The constraints in (13) represent the concatenation of the first-order optimality conditions for the $N+1$ individual problems described by (9) and (11). In the following, we refer to the problem described by (13) as VI $(F,\mathcal{X}\times\Delta)$ .

While the VI $(F,\mathcal{X}\times\Delta)$ describes (under the conditions in Assumption 2) the NE of $\widehat{\mathcal{G}}$ , it turns out that the former can be also linked to the equilibria of $\mathcal{G}$ , as formalized next.

Proposition 12.

Under Assumption 2, there always exists a solution of VI $(F,\mathcal{X}\times\Delta)$ . Denote such a solution by $z^{\ast}=(x^{\ast},y^{\ast})$ . We then have that $x^{\ast}$ is a NE of $\mathcal{G}$ .

Proof.

The existence of a solution for the VI $(F,\mathcal{X}\times\Delta)$ is guaranteed by [44, Cor. 2.2.5] under Assumption 2 and the compactness of $\Delta$ . Denote such a solution by $z^{\ast}$ . A link between the solutions of the VI and those of the augmented game is established by [44, Prop. 1.4.2]: $z^{\ast}=(x^{\ast},y^{\ast})$ is a solution of $\widehat{\mathcal{G}}$ if and only if it solves VI $(F,\mathcal{X}\times\Delta)$ . The link with the original game $\mathcal{G}$ is provided by [39, Thm. 1]: for any NE $(x^{\ast},y^{\ast})$ of the game $\widehat{\mathcal{G}}$ , $x^{\ast}$ is a NE of $\mathcal{G}$ , which concludes the proof. ∎

III-B Monotonicity of the augmented VI operator

The development of algorithms for the solution of VI problems relies upon the monotonicity of the mapping $F$ in (13), which plays a role analogous to convexity in optimization [7].

Definition 13 (Monotonicity).

A mapping $F\colon\mathcal{D}\to\mathbb{R}^{m}$ , with $\mathcal{D}\subseteq\mathbb{R}^{m}$ closed and convex, is

•

monotone on $\mathcal{D}$ if $(u-v)^{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}}(F(u)-F(v))\geq 0$ for all $u,v\in\mathcal{D}$ ,

•

strongly monotone on $\mathcal{D}$ if there exists $c>0$ such that $(u-v)^{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}}(F(u)-F(v))\geq c\|u-v\|^{2}$ for all $u,v\in\mathcal{D}$ .

The following result is instrumental in our analysis:

Lemma 14.

Let Assumptions 2 hold. Then $F(x,y)$ in (12) is monotone on $\mathcal{X}\times\Delta$ .

Proof.

By Assumption 2(ii), $F(x,y)$ is continuously differentiable on its domain. Let $F^{x}$ and $F^{y}$ denote the first $nM$ and the last $M$ rows of $F$ , respectively, i.e., $F^{x}(x,y)=\left(\nabla_{x_{i}}f_{i}(x)+\nabla_{x_{i}}g(x,y)\right)_{i\in\mathcal{N}}$ , and $F^{y}(x,y)=\left(\nabla_{y_{m}}g(x,y)\right)_{m=1}^{M}$ . By definition of the Jacobian we have

[TABLE]

where $J_{x}F^{x}(x,y)=(\nabla^{2}_{x_{i}x_{j}}f_{i}(x))_{i,j\in\mathcal{N}}+\nabla^{2}_{xx}\hat{g}(x,y)$ , $J_{y}F^{x}=(J_{x}F^{y}(x,y))^{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}}=(\nabla^{2}_{x_{i}y}(f_{i}(x)+\hat{g}(x,y)))_{i\in\mathcal{N}}$ , and $J_{y}F^{y}(x,y)=0$ ; notice that $(\nabla^{2}_{x_{i}x_{j}}f_{i}(x))_{i,j\in\mathcal{N}}$ is a matrix with $\nabla^{2}_{x_{i}x_{j}}f_{i}(x)$ being its $(i,j)$ -th entry.

Due to the particular block structure in (14), $JF(x,y)\succeq 0$ if and only if $J_{x}F^{x}(x,y)\succeq 0$ . To show this, note that by from (2), for all $(x,y)\in\mathcal{X}\times\Delta$ and all $\nu\in\mathbb{R}^{nN}$ ,

[TABLE]

Summing the above inequalities yields

[TABLE]

which, since $\chi^{f}+\chi^{g}\geq 0$ , corresponds to $J_{x}F^{x}(x,y)\succeq 0$ . The statement then follows directly from [44, Prop. 2.3.2], thus concluding the proof. ∎

A direct consequence of the monotonicity of $F$ is that by [7, Thm. 41], VI $(F,\mathcal{X}\times\Delta)$ may admit multiple solutions: this fact together with [44, Prop. 1.4.2] — stating the correspondence between the solutions of the VI $(F,\mathcal{X}\times\Delta)$ and the NE of $\widehat{\mathcal{G}}$ — implies that the game $\widehat{\mathcal{G}}$ can admit multiple NE.

III-C Decentralized algorithm for monotone VI and equilibrium selection

Specific algorithmic design is needed to tackle the convergence properties of a monotone mapping; we refer the reader to [46, 7, 47, 48] for a deep discussion on this topic. For our scope, proximal algorithms can be used to retrieve a solution of a monotone VI by solving a particular sequence of strongly monotone problems, derived by regularizing the original problem. To construct a decentralized mapping $\Phi$ that is single-valued, as required by Proposition 7, a tie-break rule needs to be put in place to single a particular NE out of the possibly many (see Section III-B). Such a tie-break rule is needed even if only one NE is returned by the given algorithm, to prevent the case where different initial conditions produce different NE.

We address the above by employing a proximal algorithm based on [7, Algorithm 4], which allows us to select the minimum Euclidean norm NE; the choice of the Euclidean norm is not restrictive, and a wide range of strictly convex objective function can be used as a selector instead (see [7, Thm. 21]). Thus, more formally, we consider the following refinement of (13)

[TABLE]

We link the VI problem in (17) to the regularized game $\widehat{\mathcal{G}}^{\tau,\bar{z}}$ , where $\tau\in\mathbb{R}_{+}$ and $\bar{z}=(\bar{x},\bar{y})$ are the designated step size and centre of regularization, respectively. Given the tuple $(x_{-i},y,\bar{x}_{i})$ , each player $i\in\mathcal{N}$ solves the following problem

[TABLE]

while the additional agent (player $N+1$ ), given $(x,\bar{y})$ , solves

[TABLE]

with $\eta\in\mathbb{R}_{+}$ . Note that Assumption 2 still holds for (18)–(19). By taking the pseudo-gradient of the above as in (12), we have from [44, Prop. 1.4.2] that $z^{\ast}=(x^{\ast},y^{\ast})$ is a NE of $\widehat{\mathcal{G}}^{\tau,\bar{z}}$ if and only if it satisfies the VI

[TABLE]

The next lemma shows that the regularized game admits a unique NE.

Lemma 15.

Consider Assumptions 2. Let $F$ be as in (12), and fix $\eta\geq 0$ . Then, for any $\tau>0$ and $\bar{z}\in\mathcal{X}\times\Delta$ , the regularized game $\widehat{\mathcal{G}}^{\tau,\bar{z}}$ defined by (18)–(19) admits a unique NE.

Proof.

To establish uniqueness of the NE it suffices to show that $F^{\tau,\bar{z}}$ is strongly monotone [7, Thm. 41]. Fix any $u,v\in\mathbb{R}^{nN+M}$ . Let $F^{\tau,\bar{z}}(u)=F(u)+\eta u+\tau(u-\bar{z})$ , and define $F^{\tau,\bar{z}}(v)$ similalrly. We have

[TABLE]

where the inequality follows from the fact that $\eta\in\mathbb{R}_{+}$ , and $(u-v)^{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}}(F(u)-F(v))\geq 0$ since $F$ is monotone due to Lemma 14. By Definition 13, (21) implies that $F^{\tau,\bar{z}}$ is strongly monotone, thus concluding the proof. ∎

Now let $S^{\tau}(\cdot)$ denote the solution of the VI $(F^{\tau,\cdot},\mathcal{X}\times\Delta)$ . Building on Lemma 15, we aim at determining a NE of $\mathcal{G}$ by updating the centre of regularization of $\widehat{\mathcal{G}}^{\tau,\cdot}$ on the basis of an iterative method in the form $\bar{z}^{(k+1)}=S^{\tau}(\bar{z}^{(k)})$ , until convergence to the fixed point $z^{\ast}=S^{\tau}(z^{\ast})$ . The latter corresponds to the (unique under Lemma 15) NE of $\widehat{\mathcal{G}}^{\tau,z^{\ast}}$ , which satisfies (17). Algorithm 1 provides the means to establish such a connection; this is formalised in the following proposition.

Proposition 16 (Thm. 21 [7]).

*Let Assumptions 2 hold. Let $\{\eta^{(k)}\}_{k=0}^{\infty}$ be any sequence satisfying $\eta^{(k)}>0$ for all $k$ , $\sum_{k=0}^{\infty}\eta^{(k)}=\infty$ , and $\lim_{k\rightarrow\infty}\eta^{(k)}=0$ . Select a big enough $\bar{\tau}>0$ and let $\{\bar{z}^{(k)}\}_{k=0}^{\infty}$ denote the sequence generated by Algorithm 1. For any $\tau\geq\bar{\tau}$ , (i) there exists $\gamma_{\mathrm{inn}},\gamma_{\mathrm{out}}>0$ such that $\{\bar{z}^{(k)}\}_{k=0}^{\infty}$ is bounded; (ii) there exists $z^{\ast}=(x^{\ast},y^{\ast})$ such that $\|\bar{z}^{(k)}-z^{\ast}\|\rightarrow 0$ for $k\rightarrow\infty$ , where $z^{\ast}$ is a solution of (17), and (iii) $x^{\ast}\in\Omega$ . *

The reader is referred to [7, Lemma. 20] for a lower bound on $\bar{\tau}$ in Algorithm 1. The latter asymptotically converges to a solution of (17), while by Proposition 12, (17b) it is equivalent to the game $\widehat{\mathcal{G}}$ , whose solution set is nonempty and is also contained in $\Omega$ due to the second part of Proposition 12.

Proof of Proposition 7: Algorithm 1 and its analysis, leading to Proposition 16, serves as an implicit construction of a decentralized, single-valued mapping $\Phi\colon\Theta^{M}\to\Omega$ , thus establishing Proposition 7. $\blacksquare$

Note that $\Phi$ is single-valued, hence the returned solution is independent of the initial condition used in Algorithm 1. However, in the proof of Theorem 9 it becomes insightful to make this dependency explicit. Thus, for the analysis of Section IV-B we will introduce the notation $\Phi_{x_{0}}$ , with $x_{0}=\bar{x}^{(0)}\in\mathcal{X}$ . (Notice that $\bar{y}^{(0)}$ , which also appears in the initial condition $\bar{z}^{(0)}$ of Algorithm 1, depends on the $M$ -multisample; as the latter is already an argument of $\Phi$ , we only include $x_{0}$ as a subscript.)

IV Proofs of a posteriori and a priori certificates

IV-A Proof of Theorem 8

Fix any $M\in\mathbb{N}$ . Consider $(\theta_{1},\ldots,\theta_{M})\in\Theta^{M}$ , and let $d^{*}\leq M$ be the cardinality of any given compression set of $\{\theta_{1},\ldots,\theta_{M}\}$ (recall that it depends on the observation of the $M$ -multisample). Let $x^{\ast}=\Phi(\theta_{1},\ldots,\theta_{M})\in\Omega$ , and

[TABLE]

For any $\theta\in\Theta$ , consider the set

[TABLE]

Fix $\beta\in(0,1)$ and consider $\varepsilon(\cdot)$ defined as in (5). Under Assumptions 2, $\Phi$ is single-valued by Proposition 7. By [35, Thm. 1] we then have that

[TABLE]

if the following consistency condition holds for $H_{\theta}$ (see [33] for a definition)

[TABLE]

To show the latter, notice that for each $i\in\mathcal{N}$ , by the NE definition (Definition 1), $(x^{\ast}_{i},\gamma^{\ast})$ will belong to the set of minimizers of the following epigraphic reformulation of (2)

[TABLE]

By (26b) it follows then that (25) is satisfied, thus establishing (24). Note that for the result of [35] to be invoked, (26) is not required to be a convex optimization program, hence the fact that for each $i\in\mathcal{N}$ , for any $\theta\in\Theta$ , only $f_{i}(\cdot,x^{\ast}_{-i})+g(\cdot,x^{\ast}_{-i},\theta)$ is assumed to be convex by Assumption 2 is sufficient.

By the definition of $\gamma^{\ast}$ and $H_{\theta}$ , (24) implies that with confidence at least $1-\beta$ , $\mathds{P}\{\theta\in\Theta\colon g(x^{\ast},\theta)>\max_{m\in\{1,\ldots,M\}}g(x^{\ast},\theta_{m})\}\leq\varepsilon(d^{*})$ , thus establishing the first part of Corollary 10.

We now proceed to demonstrate the claim in (6). Recall that, by (12), (17) and Proposition 12, we can obtain $x^{\ast}\in\Omega$ as solution of the following optimization program (note the slight abuse of notation as by $(x^{\ast},y^{\ast})$ we denote both the optimizer and the corresponding decision vector)

[TABLE]

where $(x^{\ast},y^{\ast})$ is a NE of $\widehat{\mathcal{G}}$ . By definition of $\hat{g}$ in (9), and recalling $\nabla_{y_{m}}(\sum_{m=1}^{M}y^{\ast}_{m}g(x^{\ast},\theta_{m}))=g(x^{\ast},\theta_{m})$ , (27b) can be equivalently written as

[TABLE]

As (28) holds for all $y\in\Delta$ , we have that (27b) is equivalent to the following inequality being satisfied for all $x\in\mathcal{X}$ ,

[TABLE]

where the equality follows from (10). For a given $\theta\in\Theta$ , recall from Section II-C the definition of the game $\mathcal{G}^{+}$ associated with the samples $\{\theta_{1},\ldots,\theta_{M}\}\cup\{\theta\}$ , and the associated set of NE $\Omega^{+}$ . Moreover, let $\widehat{\mathcal{G}}^{+}$ denote the associated augmented game. Analogously to (29), any solution $(x^{+},y^{+})\in\mathcal{X}\times\Delta^{+}$ , where $\Delta^{+}$ is the simplex in $\mathbb{R}^{M+1}$ , of the augmented game $\widehat{\mathcal{G}}^{+}$ will satisfy the following VI:

[TABLE]

Note the analogy between (30) and (29), with the additional terms corresponding to the new sample $\theta$ ( $y_{M+1}$ is the additional decision variable corresponding to the new sample).

We are interested in quantifying the probability of $x^{\ast}\in\Omega^{+}$ . To this end, notice that if $g(x^{\ast},\theta)\leq\gamma^{\ast}$ , then $x^{+}=x^{\ast}$ and $y^{+}=(y^{\ast{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}}},\,0)^{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}}$ constitute a feasible pair for (30). This is due to the fact that under this choice $y_{M+1}^{+}=0$ and hence

[TABLE]

thus (30) reduces to (29). Applying Proposition 12 to $\mathcal{G}^{+}$ and $\widehat{\mathcal{G}}^{+}$ , we have that if $(x^{+},y^{+})$ satisfies (30) (i.e., it is a NE of the augmented game $\widehat{\mathcal{G}}^{+}$ ) then $x^{+}\in\Omega^{+}$ . Therefore, $x^{\ast}\in\Omega^{+}$ whenever $g(x^{\ast},\theta)\leq\gamma^{\ast}$ , or in other words

[TABLE]

By (24) and (32), (6) follows, thus concluding the proof. $\blacksquare$

IV-B Proof of Theorem 9

Let $\mathcal{C}_{0}\subseteq\{\theta_{1},\ldots,\theta_{M}\}$ be the minimal cardinality compression set for the minimum norm NE $x^{\ast}$ returned by Algorithm 1; note that under the non-degeneracy assumption it will be unique and it will coincide with the set of support samples. Following the discussion at the end of Section III $\Phi_{x_{0}}$ denotes a mapping that returns $x^{\ast}$ (e.g., the one induced by Algorithm 1), where we make explicit the dependence on the initial condition $x_{0}\in\mathcal{X}$ . As $\Phi$ is single-valued, by definition of a compression set we have that $x^{\ast}=\Phi_{x_{0}}(\theta_{1},\ldots,\theta_{M})=\Phi_{x_{0}}(\mathcal{C}_{0})$ , for all $x_{0}\in\mathcal{X}$ .

Consider now the following optimization program.

[TABLE]

where $\gamma_{i}$ , $i\in\mathcal{N}$ , are epigraphic variables, and in (33a) we have equality and not inclusion since the set of minimizers is a singleton due to the regularization term $\tau\|x_{i}-x^{\ast}_{i}\|^{2}$ . Note that (33) is separable across $i\in\mathcal{N}$ , with each subproblem corresponding to an epigraphic reformulation of the fixed point characterization of the regularized problem (18) for $\eta=0$ (this is the value of $\eta^{(k)}$ when the proposed algorithm has converged to $x^{\ast}$ ) and $\bar{x}_{i}=x^{\ast}_{i}$ , for all $i\in\mathcal{N}$ . The latter is identical to $\Phi_{x_{0}}$ with $x_{0}=x^{\ast}$ . It follows from (33) that $x^{\ast}=\Phi_{x^{\ast}}(\theta_{1},\ldots,\theta_{M})$ . Also note that we have introduced one epigraphic variable per agent $i\in\mathcal{N}$ ; we will invoke in the sequel the fact that (33) is convex due to Assumption 2 . However, if we further assume that for all $i\in\mathcal{N}$ , for every fixed $x_{-i}\in\mathcal{X}_{-i}$ and $\theta\in\Theta$ , the functions $f_{i}(\cdot,x_{-i})$ and $g(\cdot,x_{-i},\theta)$ are each convex, we would only need one epigraphic variable, as we could perform an epigraphic reformulation only for $g$ ; this would give rise to the constraint in (26b), which is common to all agents.

Let $\mathcal{C}$ denote a minimal cardinality compression set for $x^{\ast}$ in (33). We claim that $\mathcal{C}_{0}\subseteq\mathcal{C}$ . To show this, assume for the sake of contradiction that there exists $k\in\{1,\ldots,M\}$ such that $\theta_{k}\in\mathcal{C}_{0}$ but $\theta_{k}\notin\mathcal{C}$ . Consider the set $\{\theta_{1},\ldots,\theta_{M}\}\setminus\{\theta_{k}\}\supseteq\mathcal{C}$ , and notice that this has to be a compression set for $x^{\ast}$ in (33) as it is a superset of $\mathcal{C}$ . By Definition 5, this implies that $x^{\ast}=\Phi_{x^{\ast}}(\{\theta_{1},\ldots,\theta_{M}\}\setminus\{\theta_{k}\})$ (recall that the solution of (33) is given by $x^{\ast}=\Phi_{x^{\ast}}(\theta_{1},\ldots,\theta_{M})$ ). However, $\theta_{k}\in\mathcal{C}_{0}$ ; as the latter coincides with the set of support samples due to the imposed non-degeneracy assumption, we have by Definition 4 $x^{\ast}\neq\Phi_{x^{\ast}}(\{\theta_{1},\ldots,\theta_{M}\}\setminus\{\theta_{k}\})$ . This establishes a contradiction, showing that $\mathcal{C}_{0}\subseteq\mathcal{C}$ (hence $|\mathcal{C}_{0}|\leq|\mathcal{C}|$ ).

By Assumptions 2, (33) is a convex scenario program, and admits a unique solution due to the fact that the objective function in (33a) is strictly convex. Moreover, it has a non-empty feasibility region in view of Proposition 12. Therefore, by [36], [43], we have that any minimal cardinality compression set $\mathcal{C}$ has cardinality upper-bounded by $(n+1)N$ , i.e., the number of decision variables in (33). Therefore, $|\mathcal{C}_{0}|\leq|\mathcal{C}|\leq(n+1)N$ . As a result, $|\mathcal{C}_{0}|$ can be upper-bounded by the a priori known quantity $(n+1)N$ . As Theorem 8 holds for any compression cardinality $d^{*}\geq|\mathcal{C}_{0}|$ , we can apply it with $d^{*}=(n+1)N$ . Hence, Theorem 9 as well as the second part of Corollary 10 directly follow, concluding the proof. $\blacksquare$

V Computation of the compression set cardinality

The result of Theorem 8 relies on the computation of the compression cardinality $d^{*}$ , which by Definition 5 is bounded by $M$ . An a posteriori estimate of the compression cardinality can be obtained through different methodologies, whose design may be tuned on the specific case. It follows from (6) that the closer the estimate to the minimal cardinality of the compression sets in $\mathfrak{C}$ , the less conservative the probabilistic guarantees on the robustness performance of the solution. In [35, §II] a greedy procedure is outlined to estimate (an upper bound to) the minimal compression cardinality; for completeness, we summarize this procedure in Algorithm 2. According to this, a compression set $\mathcal{C}$ is constructed progressively by removing samples one by one (step 2). Only if their removal leaves the solution unaltered they are discarded (step 3-4); this is then repeated till no further sample can be removed without changing $x^{\ast}$ .

However, there are two drawbacks: first, the computational cost is generally high, as $\Phi(\cdot)$ should be evaluated at least $M$ times, where each of these operations typically involves an asymptotic scheme (as, e.g., in Algorithm 1); second, in practice, limited numerical accuracy makes the evaluation of the condition at Step 3 of Algorithm 2 amenable to numerical errors.

To alleviate these, we provide a computationally efficient way to determine a compression set, and hence $d^{*}$ , by direct inspection of the NE. To achieve this, we impose certain NE uniqueness requirements. However, it should be noted that for the wide class of aggregative games, the additional structure required in the proposition below implies only uniqueness of an aggregate strategy, where multiple equilibria may exist. This is summarized in the following proposition.

Proposition 17.

Consider Assumption 2. Further assume that for all $M\in\mathbb{N}$ , either

$\mathcal{G}$ * admits a unique NE;* 2. 2.

or, $g$ depends on the aggregate strategy111With a slight abuse of notation, in the second part of the proposition it is to be understood that for all $i\in\mathcal{N}$ and for any given $x_{-i}$ , Assumption 2 refer to the function $f_{i}(\cdot,x_{-i})+g(\sigma(\cdot,x_{-i}),\theta)$ . * $\sigma(x)\colon x\mapsto\sum_{i\in\mathcal{N}}x_{i}$ , and $\mathcal{G}$ admits a unique NE aggregate $\sigma(x)$ .*

Then, the set $\mathcal{Y}^{\ast}\triangleq\{m\in\{1,\ldots,M\}\colon y^{\ast}_{m}>0\}$ corresponds to the indices of a compression set, i.e., $x^{\ast}=\Phi(\theta_{1},\ldots,\theta_{M})=\Phi(\{\theta_{m}\}_{m\in\mathcal{Y}^{\ast}})$ .

Proof.

*Part 1: Uniqueness of NE. * Fix any $(\theta_{1},\ldots,\theta_{M})\in\Theta^{M}$ and notice that it forms a (trivial) compression set for $x^{\ast}$ . Let $(x^{\ast},y^{\ast})$ be a solution of the augmented game $\widehat{\mathcal{G}}$ , where $y^{\ast}=(y^{\ast}_{m})_{m=1}^{M}$ .

To prove that $x^{\ast}=\Phi(\{\theta_{m}\}_{m\in\mathcal{Y}^{\ast}})$ it suffices to show that the solution returned by $\Phi$ remains unaltered after removing all samples from $\{\theta_{1},\ldots,\theta_{M}\}$ whose associated component of $y^{\ast}$ is zero. To this end, suppose that at least one such sample exists: without loss of generality, assume $y^{\ast}_{M}=0$ (i.e., that sample has index $M$ ). We will first show that $\{\theta_{1},\ldots,\theta_{M-1}\}$ is a compression set, i.e., $x^{\ast}=\Phi(\theta_{1},\ldots,\theta_{M-1})$ . Let $\mathcal{G}^{-}=\langle\mathcal{N},(\mathcal{X}_{i})_{i\in\mathcal{N}},(J_{i})_{i\in\mathcal{N}},\{\theta_{j}\}_{j=1}^{M-1}\rangle$ be the game with samples $\{\theta_{1},\ldots,\theta_{M-1}\}$ . Moreover, let $\widehat{\mathcal{G}}^{-}$ denote the associated augmented game, and $\Delta^{-}$ the simplex in $\mathbb{R}^{M-1}$ . Since $(x^{\ast},y^{\ast})$ is an NE of $\widehat{\mathcal{G}}$ , it will satisfy the VI in (29). At the same time, every solution $(x^{-},y^{-})\in\mathcal{X}\times\Delta^{-}$ of the augmented game $\widehat{\mathcal{G}}^{-}$ satisfies the following VI:

[TABLE]

Set $x^{-}=x^{\ast}$ and $y^{-}=(y^{\ast}_{m})_{m=1}^{M-1}$ . Under this choice $(x^{-},y^{-})$ satisfies (V), as $\max_{m\in\{1,\ldots,M-1\}}g(x^{\ast},\theta_{m})\leq\max_{m\in\{1,\ldots,M\}}g(x^{\ast},\theta_{m})$ . Equivalently, $(x^{\ast},(y^{\ast}_{m})_{m=1}^{M-1})$ is an NE for $\widehat{\mathcal{G}}^{-}$ , and by applying Proposition 12 to $\mathcal{G}^{-}$ and $\widehat{\mathcal{G}}^{-}$ we have that $x^{\ast}$ is an NE for $\mathcal{G}^{-}$ . However, due to the uniqueness assumption, $x^{\ast}$ has to be the only NE of $\mathcal{G}^{-}$ , showing that $x^{\ast}=\Phi(\theta_{1},\ldots,\theta_{M-1})$ .

Following the same procedure, removing one by one all samples for which the associated elements of $y^{\ast}$ are zero, shows that $x^{\ast}=\Phi(\theta_{1},\ldots,\theta_{M})=\Phi(\{\theta_{m}\}_{m\in\mathcal{Y}^{\ast}})$ , thus concluding the proof of the first part.

Part 2: Uniqueness of NE aggregate. The proof follows the same arguments as in Part 1 with the following modifications. The derivation until the discussion right after (V) remains unaltered, showing that $(x^{\ast},(y^{\ast}_{m})_{m=1}^{M-1})$ is a NE of $\widehat{\mathcal{G}}^{-}$ . To prove that $x^{\ast}=\Phi(\theta_{1},\ldots,\theta_{M-1})$ it suffices to show that $(x^{\ast},(y^{\ast})_{m=1}^{M-1})$ is the minimum norm NE of $\widehat{\mathcal{G}}^{-}$ . We thus assume for the sake of contradiction that $(\hat{x},\hat{y})\in\mathcal{X}\times\Delta^{-}$ is the NE of $\widehat{\mathcal{G}}^{-}$ that achieves the minimum norm, i.e., $\|(\hat{x},\hat{y})\|^{2}<\|(x^{\ast},(y^{\ast})_{m=1}^{M-1})\|^{2}$ . We distinguish two cases:

Case 1: $g(\sigma(\hat{x}),\theta_{M})\leq\max_{m\in\{1,\ldots,M-1\}}g(\sigma(\hat{x}),\theta_{m})$ . Under this condition observe that $(\hat{x},(\hat{y}^{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}},0)^{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}})$ satisfies the VI in (29) for the game with $M$ samples. However, as $(x^{\ast},y^{\ast})$ is the minimum norm equilibrium for that game, we have that $\|(x^{\ast},y^{\ast})\|^{2}\leq\|(\hat{x},(\hat{y}^{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}},0)^{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}})\|^{2}$ . Overall, recalling $y^{\ast}_{M}=0$ , $\|(x^{\ast},(y^{\ast})_{m=1}^{M-1})\|^{2}=\|(x^{\ast},y^{\ast})\|^{2}\leq\|(\hat{x},(\hat{y}^{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}},0)^{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}})\|^{2}=\|(\hat{x},\hat{y})\|^{2}$ , thus establishing a contradiction. We can then show that $x^{\ast}=\Phi(\theta_{1},\ldots,\theta_{M})=\Phi(\{\theta_{m}\}_{m\in\mathcal{Y}^{\ast}})$ as in the last paragraph of Part 1.

Case 2: $g(\sigma(\hat{x}),\theta_{M})>\max_{m\in\{1,\ldots,M-1\}}g(\sigma(\hat{x}),\theta_{m})$ . We will show that, under our assumptions, this case cannot occur. By the uniqueness assumption we have that $\sigma(\hat{x})=\sigma(x^{\ast})$ for any equilibrium $\hat{x}\neq x^{\ast}$ (the NE is not necessarily unique, but all equilibria have the same aggregate). We then have

[TABLE]

for any $m\in{1,\ldots,M-1}$ . Since (V) holds for any $m$ ,

[TABLE]

Consider now (26b). By direct computation of the KKT optimality conditions [45, §6.2.1] of (26) and (11), respectively, it can be verified that the decision variable $y\in\Delta$ introduced in (9)–(11) is a shadow price for the constraint (26b). Then, by the complementary slackness condition,

[TABLE]

Since $y_{M}=0$ implies $g(\sigma(x^{\ast}),\theta_{M})\leq\gamma^{\ast}$ we obtain

[TABLE]

From (38) it follows $\max_{m\in\{1,\ldots,M-1\}}g(\sigma(x^{\ast}),\theta_{m})=\max_{m\in\{1,\ldots,M\}}g(\sigma(x^{\ast}),\theta_{m})\geq g(\sigma(x^{\ast}),\theta_{M})$ , which contradicts (V) and concludes the proof. ∎

Based on the shadow price interpretation of $y$ (see proof of Proposition 17) notice that if $g(x^{\ast},\theta_{m})<\gamma^{\ast}$ (inactive constraint) then $y^{\ast}_{m}=0$ . Note that samples with $y^{\ast}_{m}=0$ can be removed without altering $x^{\ast}$ due to the imposed uniqueness requirements; otherwise, the feasibility region of the VI in (V) may enlarge, possibly resulting in a different minimum norm NE. Moreover, it should be noted that Proposition 17 does not provide guarantees that a minimal cardinality compression set is determined; this can be obtained by Algorithm 2 (see also [35]). However, the important implication of Proposition 17 is that the cardinality of a compression set is readily available by inspecting $y^{\ast}$ .

VI Case study: Electric vehicle charging control

VI-A Problem set-up

We consider a stylized electric vehicle (EV) charging control problem, with EVs being risk-averse, selfish entities interested in minimizing their own cost. Let $\{1,\ldots,N\}$ index the finite population of EV vehicles/agents. We denote by $x_{i}\in\mathbb{R}^{n}$ the demand profile each EV seeks to determine over $n$ time slots, where for simplicity these are taken to be of unit length (1 hour). Vehicles’ charging strategy is in response to a pricing signal received from a coordinator, which in turn depends on the demand profiles of all agents. We consider price to be an affine function of the aggregate strategy $\sigma(x)\colon x\mapsto\sum_{i\in\mathcal{N}}x_{i}$ , but other choices are also supported by our theoretical analysis. Price is subject to uncertainty, e.g., externalities acting on the energy spot market, encoded by the random variable $\theta\in\Theta$ , which we model by means of scenarios. In particular, each scenario is a realization of prices along the considered $n$ -slot interval. Note that these scenarios are i.i.d., however, each of them is a finite horizon path, whose entries can be correlated. Each agent $i=1,\ldots,N$ aims at minimizing

[TABLE]

where $A_{m}\in\mathbb{R}^{n\times n}$ , for $m=0,1,\ldots,M$ , are diagonal matrices, and $b_{m}\in\mathbb{R}^{n}$ . Moreover, we assume the charging operations are subject to $\mathcal{X}_{i}=\{x_{i}\in\mathbb{R}^{n}:\;\mathbf{1}^{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}}x_{i}\geq E_{i},\;0\leq x_{ij}\leq P_{i},~{}\forall j=1,\ldots,n\}$ , where $E_{i},P_{i}\in\mathbb{R}$ designate the desired final state of charge (SoC) and the maximum power deliverable by the charger, respectively.

We analyse the results of several randomly generated cases, differing in the parameters characterizing the EV constraints $\mathcal{X}_{i}$ , selected from a uniform random distribution: specifically, $P_{i}\in[6,15]$ kW, and $E_{i}$ is chosen to be feasible in the specified time interval ( $\sim$ 0–35 kWh per 12 h interval). The pairs $\{A_{m},b_{m}\}_{m=1}^{M}$ are i.i.d. extracted from a lognormal distribution for the diagonal entries of $A_{m}$ , and a uniform distribution for the vectors $b_{m}$ . The nominal electricity price, i.e., the diagonal entries $\{a_{t}\}_{t=1}^{n}$ of the matrix $A_{0}$ , have been derived by rescaling a winter weekday demand profile in the UK [49], whereas $b_{0}=0$ .

It should be noted that even though this example fits in the class of aggregative games, it does not necessarily meet the uniqueness requirement of the second part of Proposition 17. However, we have empirically observed that the main conclusion of the proposition still holds, namely, the minimum norm solution returned by Algorithm 1 remains unaltered when the algorithm is fed only with the samples with indices in $\mathcal{Y}^{\ast}$ . Informally, this happens due to the fact that for any feasible problem instance $\mathbf{1}^{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}}x_{i}\geq E_{i}$ will always be binding at the optimum, and as result $\mathbf{1}^{\mathchoice{\raisebox{0.75346pt}{$ \displaystyle\intercal $}}{\raisebox{0.75346pt}{$ \textstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptstyle\intercal $}}{\raisebox{0.75346pt}{$ \scriptscriptstyle\intercal $}}}\sigma(x)$ will be constant for any NE $x$ (see also transparent plane in Figure 3); a detailed investigation of this issue is topic of current research.

VI-B Simulation results

The NE charging schedules have been obtained by implementing Algorithm 1 with $\gamma_{\mathrm{inn}}=10^{-14}$ , $\gamma_{\mathrm{out}}=10^{-5}$ and $\tau\in[4,6]$ . At each iteration of the proposed algorithm a quadratic optimization needs to be solved; this was performed on a dual-core 7th gen. Intel processor using MATLAB. Figures 1–2 show the convergence of Algorithm 1 in the computation of the NE for $N=20$ EVs, $n=24$ and $M=500$ . A dominant linear convergence rate can be observed, and less than 4000 outer loop iterations were needed to meet the desired exit accuracy $\gamma_{\mathrm{out}}$ . The inner loop enjoys similar convergence rate (not shown for space reasons), and less than 30 iterations (15 in average) are needed to achieve an error smaller than $\gamma_{\mathrm{inn}}$ .

To validate the a posteriori result of Theorem 8, Table I shows the average robustness performance of several solutions (with $N=20$ , $n=24$ ) obtained from different sets of $M=500$ samples, grouped according to the a posteriori observed compression cardinality $d^{*}$ ; we have set $\beta=10^{-6}$ . The violation rate $V(x^{\ast})$ of each solution is empirically computed using $10^{6}$ newly extracted samples (according to the same aforementioned distributions) and counting the fraction of them that result in a change of the computed NE. Consistently with [35], we note that the observed value of $d^{*}$ is indicative of the confidence level on the equilibrium robustness. The experimental results are compared with the theoretical bound provided by Theorem 8 (third row). For non-degenerate problems, the conservatism of the latter can be reduced by employing the tighter expression for $\varepsilon(\cdot)$ reported in (7), leading to the fourth row of Table I. However, note that in general it is difficult to verify whether a given problem is non-degenerate, thus preventing the use of (7). We observe that a bound of $\sim$ 2–3% could have been achieved with Theorem 8 by increasing the sample size to $M=2000$ .

A visual representation of the concept of compression set is given in Fig. 3. The plot depicts the curves expressing the uncertain cost term $Ng(x^{\ast},\theta_{m})$ associated to a subset $m\in\{53,72,282,566\}$ of the $M=1000$ samples used for the derivation of the NE $x^{\ast}$ . Values are plotted as a function of the aggregate demand $(\sigma(x)|_{t=1},\sigma(x)|_{t=2})$ on an interval around $\sigma(x^{\ast})$ . In this case the (minimal) compression cardinality is $d^{*}=2$ , with $\{\theta_{53},\theta_{282}\}$ supporting the solution together with the constraint on the target SoC which is binding in this case (transparent plane). Note that in this instance the constraints on the power rate $P_{i}$ are not active, and omitted from the plot for clarity.

We now investigate numerically the validity of the a priori result of Theorem 9. Fig. 4 shows the average compression set cardinality $d^{*}$ (solid line) observed over 50 trials, corresponding to different randomly generated cases corresponding to different values of $n$ and $M$ . In all cases, $d^{*}$ is bounded by $(n+1)N$ as suggested by Theorem 9. In fact, the empirically calculated cardinality $d^{*}$ is significantly lower, suggesting that in this case study an a posteriori quantification is less conservative. Moreover, in all our numerical investigations we noticed that $d^{*}\leq n$ , i.e., the empirical estimate of the compression set cardinality is independent of the number of agents and is bounded by the number of individual decision variables (dashed line). We conjecture that for the aggregative EV charging game considered here, involving affine price functions, the so called support rank (see [50] for a definition) offers a tighter bound on the compression set cardinality compared to the total number of decision variables $(n+1)N$ .

VII Concluding remarks

We considered the problem of NE computation in multi-agent games in the presence of uncertainty, and accompanied them with a priori and a posteriori certificates regarding the probability that the NE equilibrium remains unchanged when a new uncertainty realization is encountered.

Current work is concentrated towards relaxing the uniqueness requirements underpinning the compression set quantification of Section V for the class of aggregative games.

Bibliography50

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] F. Lamnabhi-Lagarrigue, A. Annaswamy, S. Engell, A. Isaksson, P. Khargonekar, R. M. Murray, H. Nijmeijer, T. Samad, D. Tilbury, and P. V. den Hof, “Systems & control for the future of humanity, research agenda: Current and future roles, impact and grand challenges,” Annual Reviews in Control , vol. 43, pp. 1 – 64, 2017.
2[2] H. Le Cadre, I. Mezghani, and A. Papavasiliou, “A game-theoretic analysis of transmission-distribution system operator coordination,” European Journal of Operational Research , vol. 274, no. 1, pp. 317 – 339, 2019.
3[3] A. De Paola, F. Fele, D. Angeli, and G. Strbac, “Distributed coordination of price-responsive electric loads: A receding horizon approach,” in 2018 IEEE Conference on Decision and Control (CDC) , Dec 2018, pp. 6033–6040.
4[4] I. Atzeni, L. G. Ordóñez, G. Scutari, D. P. Palomar, and J. R. Fonollosa, “Noncooperative day-ahead bidding strategies for demand-side expected cost minimization with real-time adjustments: A GNEP approach,” IEEE Transactions on Signal Processing , vol. 62, no. 9, pp. 2397–2412, May 2014.
5[5] R. Kamat and S. Oren, “Two-settlement systems for electricity markets under network uncertainty and market,” Power. Journal of Regulatory Economics , vol. 25, pp. 5–37, Jan 2004.
6[6] J. Yao, I. Adler, and S. Oren, “Modelling and computation of two-settlement oligopolistic equilibrium in a congested electricity network,” Operations Research , vol. 56, no. 1, pp. 34–47, Jan 2008.
7[7] G. Scutari, F. Facchinei, J. S. Pang, and D. P. Palomar, “Real and complex monotone communication games,” IEEE Transactions on Information Theory , vol. 60, no. 7, pp. 4197–4231, July 2014.
8[8] J. Krawczyk, “Numerical solutions to coupled-constraint (or generalised Nash) equilibrium problems,” Computational Management Science , vol. 4, pp. 183–204, Nov 2007.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Probably Approximately Correct

Abstract

Index Terms:

I Introduction

II Scenario based multi-agent game

II-A Gaming set-up

Definition 1** (Nash equilibrium).**

Assumption 2**.**

II-B Problem statement

Remark 3**.**

Definition 4** (Support sample [36]).**

Definition 5** (Compression set — adapted from [35]).**

Definition 6** (Non-degeneracy — adapted from [42]).**

II-C Main results

Proposition 7**.**

II-C1 A posteriori certificate

Theorem 8**.**

II-C2 A priori certificate

Theorem 9**.**

Corollary 10**.**

Remark 11**.**

III Decentralized NE computation

III-A VI analysis

Proposition 12**.**

Proof.

III-B Monotonicity of the augmented VI operator

Definition 13** (Monotonicity).**

Lemma 14**.**

Proof.

III-C Decentralized algorithm for monotone VI and equilibrium selection

Lemma 15**.**

Proof.

Proposition 16** (Thm. 21 [7]).**

IV Proofs of a posteriori and a priori certificates

IV-A Proof of Theorem 8

IV-B Proof of Theorem 9

V Computation of the compression set cardinality

Proposition 17**.**

Proof.

VI Case study: Electric vehicle charging control

VI-A Problem set-up

VI-B Simulation results

VII Concluding remarks

Definition 1 (Nash equilibrium).

Assumption 2.

Remark 3.

Definition 4 (Support sample [36]).

Definition 5 (Compression set — adapted from [35]).

Definition 6 (Non-degeneracy — adapted from [42]).

Proposition 7.

Theorem 8.

Theorem 9.

Corollary 10.

Remark 11.

Proposition 12.

Definition 13 (Monotonicity).

Lemma 14.

Lemma 15.

Proposition 16 (Thm. 21 [7]).

Proposition 17.