Probably Approximately Correct Nash Equilibrium Learning
Filiberto Fele, Kostas Margellos

TL;DR
This paper introduces a data-driven, probabilistically robust method for computing Nash equilibria in uncertain multi-agent games, with theoretical guarantees and decentralized computation, demonstrated on electric vehicle charging.
Contribution
It develops a PAC learning framework for Nash equilibrium computation with robustness certificates and a decentralized solution approach for scenario-based games.
Findings
Provides probabilistic robustness guarantees for Nash equilibria.
Enables decentralized equilibrium computation.
Validates approach on electric vehicle charging problem.
Abstract
We consider a multi-agent noncooperative game with agents' objective functions being affected by uncertainty. Following a data driven paradigm, we represent uncertainty by means of scenarios and seek a robust Nash equilibrium solution. We treat the Nash equilibrium computation problem within the realm of probably approximately correct (PAC) learning. Building upon recent developments in scenario-based optimization, we accompany the computed Nash equilibrium with a priori and a posteriori probabilistic robustness certificates, providing confidence that the computed equilibrium remains unaffected (in probabilistic terms) when a new uncertainty realization is encountered. For a wide class of games, we also show that the computation of the so called compression set - a key concept in scenario-based optimization - can be directly obtained as a byproduct of the proposed solution methodology.…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Probably Approximately Correct
Nash Equilibrium Learning
Filiberto Fele and Kostas Margellos Research was supported by the UK Engineering and Physical Sciences Research Council (EPSRC) under grant agreement EP/P03277X/1. The authors are with the Department of Engineering Science, University of Oxford, OX1 3PJ, UK {filiberto.fele, kostas.margellos}@eng.ox.ac.uk
Abstract
We consider a multi-agent noncooperative game with agents’ objective functions being affected by uncertainty. Following a data driven paradigm, we represent uncertainty by means of scenarios and seek a robust Nash equilibrium solution. We treat the Nash equilibrium computation problem within the realm of probably approximately correct (PAC) learning. Building upon recent developments in scenario-based optimization, we accompany the computed Nash equilibrium with a priori and a posteriori probabilistic robustness certificates, providing confidence that the computed equilibrium remains unaffected (in probabilistic terms) when a new uncertainty realization is encountered. For a wide class of games, we also show that the computation of the so called compression set — a key concept in scenario-based optimization — can be directly obtained as a byproduct of the proposed solution methodology. Finally, we illustrate how to overcome differentiability issues, arising due to the introduction of scenarios, and compute a Nash equilibrium solution in a decentralized manner. We demonstrate the efficacy of the proposed approach on an electric vehicle charging control problem.
Index Terms:
Nash equilibria, Robust game theory, Scenario approach, Variational inequalities, Electric vehicles.
I Introduction
Game theory has attracted significant attention in the control systems community [1], and has found numerous applications ranging from smart grid [2, 3, 4] and electricity markets [5, 6], to communication networks [7] and regulatory compliance [8, 9]. The concept of Nash equilibrium (NE) is central in this context, as it defines no-regret strategies for noncooperative, selfish agents [10, 11]. As a result, NE has been a popular solution for multi-agent distributed and decentralized control architectures, investigating also the connections with social welfare; see, e.g., [12, 13, 14, 15, 16, 17] and references therein.
Uncertainty has been widely addressed in Nash games, by adopting stochastic or worst-case approaches. In the first case, both chance-constrained (risk-averse) [18, 19, 20] or expected payoff criteria [21, 22, 23, 24, 25] have been considered. For tractability, these methods typically involve assumptions on the underlying probability distribution of uncertainty realization. In the second case, results build upon robust control theory [26, 27, 28]; however, these rely on certain assumptions on the geometry of the uncertainty set; see [29, 20].
In this paper we consider a multi-agent NE seeking problem with uncertainty affecting agents’ objective functions. We depart from existing paradigms and follow a data driven methodology, where we represent uncertainty by a finite set of scenarios that could either be extracted from historical data, or by means of some prediction model (e.g., regression, Markov chains, neural networks) [30]. Adopting a data driven methodology poses a main challenge: NE are inherently random as they depend on the observed scenarios. Therefore, our objective is to investigate the sensitivity of the resulting NE to the uncertainty, in a probabilistic sense. More specifically, our contributions can be outlined as follows:
- We treat the NE computation problem in a probably approximately correct (PAC) learning framework [31, 32, 33], and employ the so called scenario approach [34]. Building on [35] we first provide an a posteriori certificate on the probability that a NE remains unaltered upon a new realization of the uncertainty. We then rely on [36] and provide an a priori probabilistic certificate on the equilibrium sensitivity, under an additional non-degeneracy assumption (see Section II for a definition). The obtained results are distribution-free, and as such the underlying probability distribution of the uncertainty could be unknown and the only requirement is the availability of samples.
Blending the scenario approach with game theory has only recently appeared in the literature [37, 38]. The validity of the probabilistic statements presented in this paper extends to games admitting multiple NE. Moreover, all our a posteriori statements allow for degenerate problem instances; the latter circumvents the need of verifying the non-degeneracy assumption which, unlike convex optimization programs, is often not satisfied in games.
-
Under the additional assumption that the game under consideration admits a unique NE, or for aggregative games with multiple equilibria but a unique aggregate solution, we show that a compression set (a key concept in learning and generalization — see Section II for a definition) can be directly computed by inspection of the solution returned by the proposed algorithm. This feature has significant computational advantages as it prevents the use of a greedy mechanism (see, e.g., [35]), which would require running up to numerical convergence multiple times (possibly as many as the number of samples) a NE seeking algorithm (see Section V).
-
We provide a constructive proof of the existence of a single-valued mapping from the set of observed scenarios to a NE of the robust game, where the latter possibly admits multiple equilibria (and multiple maximisers). More specifically, we build an iterative algorithm for decentralized NE computation. To circumvent nondifferentiability issues and incorporate an equilibrium selection mechanism we bridge the results in [39] and [7] that involve resorting to an augmented game. The proposed scheme enjoys the same convergence properties as state-of-the-art decentralized algorithms for monotone games [7] (see Section III).
Note that the results presented in this paper do not contemplate constraints coupling agents’ strategies. The latter give rise to generalized NE problems; we refer the reader to [12, 15, 40, 41] for details.
In Section II we introduce the scenario-based Nash game, pose the main problem, and present the main results of the paper. Section III provides a decentralized construction of a solution algorithm for the game under study, while Section IV contains the proof of the main results. In Section V we provide for a wide class of games a computationally efficient methodology to determine an upper bound to the cardinality of the compression set. Section VI provides an electric-vehicle charging control case study, while Section VII concludes the paper and provides some directions for future work.
II Scenario based multi-agent game
II-A Gaming set-up
Let the set designate a finite population of agents. The decision vector, henceforth referred to as strategy, of each agent is denoted by and satisfies individual constraints encoded by the set . We denote by the collection of all agents’ strategies, where . For any agent , denotes the collection of strategies of all other agents.
Let be an uncertain vector taking values over some set , endowed with a -algebra , and let denote the probability measure defined over . For all subsequent derivations fix any , and let be a finite collection of independent and identically distributed (i.i.d.) scenarios/realizations of the uncertain vector , that we will henceforth designate as -multisample. For given strategies of the remaining agents , each agent aims at minimizing with respect to the function
[TABLE]
where expresses a deterministic objective, different for each agent but still dependent on the strategies of all agents, while encodes a common component in the agents’ objective function that depends on the uncertain vector. Agents are interested in minimizing their local objective and the worst-case (maximum) value can take among a finite set of scenarios. The electric vehicle charging control problem of Section VI provides a natural interpretation of such a set-up, where electric vehicles are selfish entities each one with a possibly different utility function ; however, they could be participating in the same aggregation plan or belonging to a centrally managed fleet, thus giving rise to a common .
We consider a noncooperative game among the agents, described by the tuple , where is the set of agents/players, , are respectively the strategy set and the cost function for each agent , and is a finite collection of samples. We consider the following solution concept for :
Definition 1** (Nash equilibrium).**
Let denote the set of Nash equilibria of , defined as
[TABLE]
We impose the following standing assumptions:
Assumption 2**.**
*(i) For any , and any , is convex and continuous differentiable, while the local constraint set is nonempty, compact and convex for all .
(ii) For any , and for all , the functions and are twice differentiable on an open convex set containing .
(iii) The pseudo-gradient is monotone with constant , while is monotone with constant for any fixed , i.e., for any , and ,*
[TABLE]
and .
We wish to emphasize that convexity of (for any fixed and ) does not require and to be both convex; indeed, Assumption 2 allows either function to be weakly convex. Similarly, it is not required that both are non-negative, but only . Note that a sufficient (but stronger) condition for the monotonicity requirements to be satisfied is for to be jointly convex with respect to .
II-B Problem statement
As every NE is a random vector due to its dependency on the -multisample, a question that naturally arises is how sensitive a NE is against a new realization of the uncertainty. More formally, let be a NE of the game with samples. Consider a new realization , and let be the game defined over the scenarios ; denote by the set of the associated NE. Then, for all , let
[TABLE]
denote the probability that a NE of does not “remain” a NE of , i.e., of the game characterized by the extraction of an additional sample. Note that is in turn a random variable, as its argument depends on the multisample . To provide a rigorous answer to the above question we will study the generalization properties of within a probably approximately correct (PAC) learning framework. With a given confidence/probability with respect to the product measure (as the samples are extracted in an i.i.d. fashion), we aim at quantifying .
To achieve such a characterization we provide some basic definitions. Let be a single-valued mapping from the set of -multisamples to the set of equilibria of .
Remark 3**.**
The game , the set of NE , the mapping (as well as of other associated quantities introduced in the sequel) depend on via the -multisample employed. Therefore, they are parameterized by , giving rise to a family of games, NE sets and mappings. To ease notation we do not show this dependency explicitly. Also, the dimension of the domain of is to be intended in accordance with .
Definition 4** (Support sample [36]).**
Fix any i.i.d. -multisample , and let be a NE of . Let be the solution obtained by discarding the sample . We call the latter a support sample if .
Definition 5** (Compression set — adapted from [35]).**
Fix any i.i.d. -multisample , and let be a NE of . Consider any subset and let . We call a compression set if .
The notion of compression set has appeared in the literature under different names; its properties are studied in full detail in [35], where it is designated as support subsample. Here we adopt the term compression set as in [31, 33] to avoid confusion with Definition 4.
Let be the collection of all compression sets associated with the -multisample . We refer to the cardinality of some compression set as the compression cardinality (we do not make explicit the dependence on the specific to ease notation, as the results below hold for any compression set in ). Note that — hence also — is itself a random variable as it depends on the -multisample.
Definition 6** (Non-degeneracy — adapted from [42]).**
For any , with -probability equal to , the NE coincides with the NE returned by when the latter takes as argument only the support samples. The corresponding game is then said to be non-degenerate; otherwise it is called degenerate.
It follows that for non-degenerate problems the support samples form a compression set with -probability 1. For degenerate problems the notions in Definitions 4 and 5 do not necessarily coincide; in particular, the support samples form a strict subset of any compression set in . For a detailed discussion on degeneracy in scenario-based contexts, we refer the reader to [43, 42].
II-C Main results
We first show that a single-valued mapping from the set of -multisamples to the set of NE of the game indeed exists.
Proposition 7**.**
Under Assumption 2 there exists a single-valued decentralized mapping .
The mapping can be computed in a decentralized manner, thus fitting the inherent structure of the game. Its construction, and hence the proof of Proposition 7, is provided in Section III.
II-C1 A posteriori certificate
We provide an a posteriori quantification of an upper bound for . This is summarized in the following theorem.
Theorem 8**.**
Consider Assumption 2. Fix and let be a function satisfying
[TABLE]
Let , where is an i.i.d. sample from . We then have that
[TABLE]
where is the cardinality of any compression set of .
Theorem 8 shows that — with confidence at least — the probability that a NE of does not “remain” an equilibrium of (i.e., when an additional sample is considered) is at most . Note that (6) captures the generalization properties of , where accounts for the ‘probably’ and for the ‘approximately correct’ term used within a PAC learning framework. The value of is defined in accordance to [35] and depends on the observed compression cardinality , which in turn depends on the random multiextraction thus giving rise to the a posteriori nature of the result. As a consequence, the level of conservatism of the obtained certificate depends on ; the smaller the cardinality of the computed compression set, the tighter the bound; see Section V for a detailed elaboration on the computation of . The proof of Theorem 8 is provided in Section IV-A.
In the case of a non-degenerate game (see Definition 6), the bound could be significantly improved by means of the wait-and-judge analysis of [42]: specifically, by Theorem 2 in [42], we can replace the expression for in (5) with , where is the unique solution in of
[TABLE]
However, note that non-degeneracy is a condition in general difficult to verify even in convex optimization settings, a challenge that becomes more prominent in games.
II-C2 A priori certificate
We now provide an a priori quantification of an upper-bound of . This is summarized in the following theorem.
Theorem 9**.**
Consider Assumption 2, and further assume that the game is non-degenerate according to Definition 6. Fix and consider be a function satisfying (5). Let , where is an i.i.d. multisample. We then have that
[TABLE]
The proof of Theorem 9 is provided in Section IV-B. Although similar in form to Theorem 8, the bound on provided by Theorem 9 additionally relies on the developments in [36, 43]: these results are independent of the given multisample and linked instead to the problem structure. In this way, is evaluated on the sample-independent quantity , expressing the dimension of the agents’ decision space plus additional variables, explained by the epigraphic reformulation introduced in the proof of Theorem 9. If we further assume that for all , for every fixed and , both and are convex, we would only need one epigraphic variable, hence the argument of could be replaced by (see Section IV-B).
Since we strengthen here the assumptions of Theorem 9 by imposing a non-degeneracy condition (see Definition 6), (5) could be directly replaced by the tighter expression (7). We wish to emphasize that, even if the non-degeneracy assumption holds, it may still be preferable to calculate the cardinality in an a posteriori fashion, as in certain problems the latter might be significantly lower compared to . This is also the case in the electric vehicle charging control problem of Section VI.
Corollary 10**.**
Let and define
[TABLE]
Then,
Under the assumptions of Theorem 8, (6) holds with in place of . 2. 2.
Under the assumptions of Theorem 9, (8) holds with in place of .
Corollary 10 shows that, with given confidence, the probability that — hence also each agent’s objective function — deteriorates when a new realization of the uncertainty is encountered can be bounded both in an a posteriori and an a priori fashion as in Theorem 8 and Theorem 9, respectively. These statements are established within the proofs of Theorems 8 and 9 (see (24)).
Remark 11**.**
The results of Theorems 8 and 9 can be adapted to the case where the uncertain part of the objective function is different for each agent, i.e., if is replaced by , . We keep our presentation with a common since for this case we are able to construct in a decentralized manner, as shown in Section III; the decentralized computation of if the uncertain part of the objective function is different for each agent encompasses additional challenges (see [39, Rem. 1]) and is outside the scope of our paper.
III Decentralized NE computation
In this section we show how to construct , necessary ingredient in the proof of Proposition 7. In particular, we show that the image of corresponds to the limit of a decentralized algorithm that returns a NE of the game . To achieve this, we characterise the NE of as solutions to a variational inequality (VI) [44]. We then leverage results in the literature to obtain sufficient conditions for the existence of equilibria, and set the foundations for the design of a decentralized NE computation mechanism [7].
III-A VI analysis
It can be observed that the presence of the operator renders agents’ objective functions (1) non-differentiable. To circumvent the computation of sub-gradients and exploit the wide range of algorithms available to solve VIs in the differentiable case, we follow the method in [39] and define the augmented game between agents. In each player , given and , computes
[TABLE]
where follows from the equivalence, holding for any ,
[TABLE]
where is the simplex in [45, Lemma 6.2.1]. The additional agent (could be thought of as a coordinating authority), given , will act instead as a maximizing player for the uncertain component of , , i.e.,
[TABLE]
Note that, for any -multisample , the objective functions in (9) and (11) are differentiable by Assumption 2. We now link the NE of the augmented game to a VI. To this end, we define the mapping as the pseudo-gradient [44, §1.4.1]
[TABLE]
Letting , the VI problem takes the form [44, §1.4.2]
[TABLE]
The constraints in (13) represent the concatenation of the first-order optimality conditions for the individual problems described by (9) and (11). In the following, we refer to the problem described by (13) as VI.
While the VI describes (under the conditions in Assumption 2) the NE of , it turns out that the former can be also linked to the equilibria of , as formalized next.
Proposition 12**.**
Under Assumption 2, there always exists a solution of VI. Denote such a solution by . We then have that is a NE of .
Proof.
The existence of a solution for the VI is guaranteed by [44, Cor. 2.2.5] under Assumption 2 and the compactness of . Denote such a solution by . A link between the solutions of the VI and those of the augmented game is established by [44, Prop. 1.4.2]: is a solution of if and only if it solves VI. The link with the original game is provided by [39, Thm. 1]: for any NE of the game , is a NE of , which concludes the proof. ∎
III-B Monotonicity of the augmented VI operator
The development of algorithms for the solution of VI problems relies upon the monotonicity of the mapping in (13), which plays a role analogous to convexity in optimization [7].
Definition 13** (Monotonicity).**
A mapping , with closed and convex, is
- •
monotone on if (u-v)^{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}}(F(u)-F(v))\geq 0 for all ,
- •
strongly monotone on if there exists such that (u-v)^{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}}(F(u)-F(v))\geq c\|u-v\|^{2} for all .
The following result is instrumental in our analysis:
Lemma 14**.**
Let Assumptions 2 hold. Then in (12) is monotone on .
Proof.
By Assumption 2(ii), is continuously differentiable on its domain. Let and denote the first and the last rows of , respectively, i.e., , and . By definition of the Jacobian we have
[TABLE]
where , J_{y}F^{x}=(J_{x}F^{y}(x,y))^{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}}=(\nabla^{2}_{x_{i}y}(f_{i}(x)+\hat{g}(x,y)))_{i\in\mathcal{N}}, and ; notice that is a matrix with being its -th entry.
Due to the particular block structure in (14), if and only if . To show this, note that by from (2), for all and all ,
[TABLE]
Summing the above inequalities yields
[TABLE]
which, since , corresponds to . The statement then follows directly from [44, Prop. 2.3.2], thus concluding the proof. ∎
A direct consequence of the monotonicity of is that by [7, Thm. 41], VI may admit multiple solutions: this fact together with [44, Prop. 1.4.2] — stating the correspondence between the solutions of the VI and the NE of — implies that the game can admit multiple NE.
III-C Decentralized algorithm for monotone VI and equilibrium selection
Specific algorithmic design is needed to tackle the convergence properties of a monotone mapping; we refer the reader to [46, 7, 47, 48] for a deep discussion on this topic. For our scope, proximal algorithms can be used to retrieve a solution of a monotone VI by solving a particular sequence of strongly monotone problems, derived by regularizing the original problem. To construct a decentralized mapping that is single-valued, as required by Proposition 7, a tie-break rule needs to be put in place to single a particular NE out of the possibly many (see Section III-B). Such a tie-break rule is needed even if only one NE is returned by the given algorithm, to prevent the case where different initial conditions produce different NE.
We address the above by employing a proximal algorithm based on [7, Algorithm 4], which allows us to select the minimum Euclidean norm NE; the choice of the Euclidean norm is not restrictive, and a wide range of strictly convex objective function can be used as a selector instead (see [7, Thm. 21]). Thus, more formally, we consider the following refinement of (13)
[TABLE]
We link the VI problem in (17) to the regularized game , where and are the designated step size and centre of regularization, respectively. Given the tuple , each player solves the following problem
[TABLE]
while the additional agent (player ), given , solves
[TABLE]
with . Note that Assumption 2 still holds for (18)–(19). By taking the pseudo-gradient of the above as in (12), we have from [44, Prop. 1.4.2] that is a NE of if and only if it satisfies the VI
[TABLE]
The next lemma shows that the regularized game admits a unique NE.
Lemma 15**.**
Consider Assumptions 2. Let be as in (12), and fix . Then, for any and , the regularized game defined by (18)–(19) admits a unique NE.
Proof.
To establish uniqueness of the NE it suffices to show that is strongly monotone [7, Thm. 41]. Fix any . Let , and define similalrly. We have
[TABLE]
where the inequality follows from the fact that , and (u-v)^{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}}(F(u)-F(v))\geq 0 since is monotone due to Lemma 14. By Definition 13, (21) implies that is strongly monotone, thus concluding the proof. ∎
Now let denote the solution of the VI. Building on Lemma 15, we aim at determining a NE of by updating the centre of regularization of on the basis of an iterative method in the form , until convergence to the fixed point . The latter corresponds to the (unique under Lemma 15) NE of , which satisfies (17). Algorithm 1 provides the means to establish such a connection; this is formalised in the following proposition.
Proposition 16** (Thm. 21 [7]).**
*Let Assumptions 2 hold. Let be any sequence satisfying for all , , and . Select a big enough and let denote the sequence generated by Algorithm 1. For any , (i) there exists such that is bounded; (ii) there exists such that for , where is a solution of (17), and (iii) . *
The reader is referred to [7, Lemma. 20] for a lower bound on in Algorithm 1. The latter asymptotically converges to a solution of (17), while by Proposition 12, (17b) it is equivalent to the game , whose solution set is nonempty and is also contained in due to the second part of Proposition 12.
Proof of Proposition 7: Algorithm 1 and its analysis, leading to Proposition 16, serves as an implicit construction of a decentralized, single-valued mapping , thus establishing Proposition 7.
Note that is single-valued, hence the returned solution is independent of the initial condition used in Algorithm 1. However, in the proof of Theorem 9 it becomes insightful to make this dependency explicit. Thus, for the analysis of Section IV-B we will introduce the notation , with . (Notice that , which also appears in the initial condition of Algorithm 1, depends on the -multisample; as the latter is already an argument of , we only include as a subscript.)
IV Proofs of a posteriori and a priori certificates
IV-A Proof of Theorem 8
Fix any . Consider , and let be the cardinality of any given compression set of (recall that it depends on the observation of the -multisample). Let , and
[TABLE]
For any , consider the set
[TABLE]
Fix and consider defined as in (5). Under Assumptions 2, is single-valued by Proposition 7. By [35, Thm. 1] we then have that
[TABLE]
if the following consistency condition holds for (see [33] for a definition)
[TABLE]
To show the latter, notice that for each , by the NE definition (Definition 1), will belong to the set of minimizers of the following epigraphic reformulation of (2)
[TABLE]
By (26b) it follows then that (25) is satisfied, thus establishing (24). Note that for the result of [35] to be invoked, (26) is not required to be a convex optimization program, hence the fact that for each , for any , only is assumed to be convex by Assumption 2 is sufficient.
By the definition of and , (24) implies that with confidence at least , , thus establishing the first part of Corollary 10.
We now proceed to demonstrate the claim in (6). Recall that, by (12), (17) and Proposition 12, we can obtain as solution of the following optimization program (note the slight abuse of notation as by we denote both the optimizer and the corresponding decision vector)
[TABLE]
where is a NE of . By definition of in (9), and recalling , (27b) can be equivalently written as
[TABLE]
As (28) holds for all , we have that (27b) is equivalent to the following inequality being satisfied for all ,
[TABLE]
where the equality follows from (10). For a given , recall from Section II-C the definition of the game associated with the samples , and the associated set of NE . Moreover, let denote the associated augmented game. Analogously to (29), any solution , where is the simplex in , of the augmented game will satisfy the following VI:
[TABLE]
Note the analogy between (30) and (29), with the additional terms corresponding to the new sample ( is the additional decision variable corresponding to the new sample).
We are interested in quantifying the probability of . To this end, notice that if , then and y^{+}=(y^{\ast{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}}},\,0)^{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}} constitute a feasible pair for (30). This is due to the fact that under this choice and hence
[TABLE]
thus (30) reduces to (29). Applying Proposition 12 to and , we have that if satisfies (30) (i.e., it is a NE of the augmented game ) then . Therefore, whenever , or in other words
[TABLE]
By (24) and (32), (6) follows, thus concluding the proof.
IV-B Proof of Theorem 9
Let be the minimal cardinality compression set for the minimum norm NE returned by Algorithm 1; note that under the non-degeneracy assumption it will be unique and it will coincide with the set of support samples. Following the discussion at the end of Section III denotes a mapping that returns (e.g., the one induced by Algorithm 1), where we make explicit the dependence on the initial condition . As is single-valued, by definition of a compression set we have that , for all .
Consider now the following optimization program.
[TABLE]
where , , are epigraphic variables, and in (33a) we have equality and not inclusion since the set of minimizers is a singleton due to the regularization term . Note that (33) is separable across , with each subproblem corresponding to an epigraphic reformulation of the fixed point characterization of the regularized problem (18) for (this is the value of when the proposed algorithm has converged to ) and , for all . The latter is identical to with . It follows from (33) that . Also note that we have introduced one epigraphic variable per agent ; we will invoke in the sequel the fact that (33) is convex due to Assumption 2 . However, if we further assume that for all , for every fixed and , the functions and are each convex, we would only need one epigraphic variable, as we could perform an epigraphic reformulation only for ; this would give rise to the constraint in (26b), which is common to all agents.
Let denote a minimal cardinality compression set for in (33). We claim that . To show this, assume for the sake of contradiction that there exists such that but . Consider the set , and notice that this has to be a compression set for in (33) as it is a superset of . By Definition 5, this implies that (recall that the solution of (33) is given by ). However, ; as the latter coincides with the set of support samples due to the imposed non-degeneracy assumption, we have by Definition 4 . This establishes a contradiction, showing that (hence ).
By Assumptions 2, (33) is a convex scenario program, and admits a unique solution due to the fact that the objective function in (33a) is strictly convex. Moreover, it has a non-empty feasibility region in view of Proposition 12. Therefore, by [36], [43], we have that any minimal cardinality compression set has cardinality upper-bounded by , i.e., the number of decision variables in (33). Therefore, . As a result, can be upper-bounded by the a priori known quantity . As Theorem 8 holds for any compression cardinality , we can apply it with . Hence, Theorem 9 as well as the second part of Corollary 10 directly follow, concluding the proof.
V Computation of the compression set cardinality
The result of Theorem 8 relies on the computation of the compression cardinality , which by Definition 5 is bounded by . An a posteriori estimate of the compression cardinality can be obtained through different methodologies, whose design may be tuned on the specific case. It follows from (6) that the closer the estimate to the minimal cardinality of the compression sets in , the less conservative the probabilistic guarantees on the robustness performance of the solution. In [35, §II] a greedy procedure is outlined to estimate (an upper bound to) the minimal compression cardinality; for completeness, we summarize this procedure in Algorithm 2. According to this, a compression set is constructed progressively by removing samples one by one (step 2). Only if their removal leaves the solution unaltered they are discarded (step 3-4); this is then repeated till no further sample can be removed without changing .
However, there are two drawbacks: first, the computational cost is generally high, as should be evaluated at least times, where each of these operations typically involves an asymptotic scheme (as, e.g., in Algorithm 1); second, in practice, limited numerical accuracy makes the evaluation of the condition at Step 3 of Algorithm 2 amenable to numerical errors.
To alleviate these, we provide a computationally efficient way to determine a compression set, and hence , by direct inspection of the NE. To achieve this, we impose certain NE uniqueness requirements. However, it should be noted that for the wide class of aggregative games, the additional structure required in the proposition below implies only uniqueness of an aggregate strategy, where multiple equilibria may exist. This is summarized in the following proposition.
Proposition 17**.**
Consider Assumption 2. Further assume that for all , either
* admits a unique NE;* 2. 2.
or, depends on the aggregate strategy111With a slight abuse of notation, in the second part of the proposition it is to be understood that for all and for any given , Assumption 2 refer to the function . * , and admits a unique NE aggregate .*
Then, the set corresponds to the indices of a compression set, i.e., .
Proof.
*Part 1: Uniqueness of NE. * Fix any and notice that it forms a (trivial) compression set for . Let be a solution of the augmented game , where .
To prove that it suffices to show that the solution returned by remains unaltered after removing all samples from whose associated component of is zero. To this end, suppose that at least one such sample exists: without loss of generality, assume (i.e., that sample has index ). We will first show that is a compression set, i.e., . Let be the game with samples . Moreover, let denote the associated augmented game, and the simplex in . Since is an NE of , it will satisfy the VI in (29). At the same time, every solution of the augmented game satisfies the following VI:
[TABLE]
Set and . Under this choice satisfies (V), as . Equivalently, is an NE for , and by applying Proposition 12 to and we have that is an NE for . However, due to the uniqueness assumption, has to be the only NE of , showing that .
Following the same procedure, removing one by one all samples for which the associated elements of are zero, shows that , thus concluding the proof of the first part.
Part 2: Uniqueness of NE aggregate. The proof follows the same arguments as in Part 1 with the following modifications. The derivation until the discussion right after (V) remains unaltered, showing that is a NE of . To prove that it suffices to show that is the minimum norm NE of . We thus assume for the sake of contradiction that is the NE of that achieves the minimum norm, i.e., . We distinguish two cases:
Case 1: . Under this condition observe that (\hat{x},(\hat{y}^{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}},0)^{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}}) satisfies the VI in (29) for the game with samples. However, as is the minimum norm equilibrium for that game, we have that \|(x^{\ast},y^{\ast})\|^{2}\leq\|(\hat{x},(\hat{y}^{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}},0)^{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}})\|^{2}. Overall, recalling , \|(x^{\ast},(y^{\ast})_{m=1}^{M-1})\|^{2}=\|(x^{\ast},y^{\ast})\|^{2}\leq\|(\hat{x},(\hat{y}^{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}},0)^{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}})\|^{2}=\|(\hat{x},\hat{y})\|^{2}, thus establishing a contradiction. We can then show that as in the last paragraph of Part 1.
Case 2: . We will show that, under our assumptions, this case cannot occur. By the uniqueness assumption we have that for any equilibrium (the NE is not necessarily unique, but all equilibria have the same aggregate). We then have
[TABLE]
for any . Since (V) holds for any ,
[TABLE]
Consider now (26b). By direct computation of the KKT optimality conditions [45, §6.2.1] of (26) and (11), respectively, it can be verified that the decision variable introduced in (9)–(11) is a shadow price for the constraint (26b). Then, by the complementary slackness condition,
[TABLE]
Since implies we obtain
[TABLE]
From (38) it follows , which contradicts (V) and concludes the proof. ∎
Based on the shadow price interpretation of (see proof of Proposition 17) notice that if (inactive constraint) then . Note that samples with can be removed without altering due to the imposed uniqueness requirements; otherwise, the feasibility region of the VI in (V) may enlarge, possibly resulting in a different minimum norm NE. Moreover, it should be noted that Proposition 17 does not provide guarantees that a minimal cardinality compression set is determined; this can be obtained by Algorithm 2 (see also [35]). However, the important implication of Proposition 17 is that the cardinality of a compression set is readily available by inspecting .
VI Case study: Electric vehicle charging control
VI-A Problem set-up
We consider a stylized electric vehicle (EV) charging control problem, with EVs being risk-averse, selfish entities interested in minimizing their own cost. Let index the finite population of EV vehicles/agents. We denote by the demand profile each EV seeks to determine over time slots, where for simplicity these are taken to be of unit length (1 hour). Vehicles’ charging strategy is in response to a pricing signal received from a coordinator, which in turn depends on the demand profiles of all agents. We consider price to be an affine function of the aggregate strategy , but other choices are also supported by our theoretical analysis. Price is subject to uncertainty, e.g., externalities acting on the energy spot market, encoded by the random variable , which we model by means of scenarios. In particular, each scenario is a realization of prices along the considered -slot interval. Note that these scenarios are i.i.d., however, each of them is a finite horizon path, whose entries can be correlated. Each agent aims at minimizing
[TABLE]
where , for , are diagonal matrices, and . Moreover, we assume the charging operations are subject to \mathcal{X}_{i}=\{x_{i}\in\mathbb{R}^{n}:\;\mathbf{1}^{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}}x_{i}\geq E_{i},\;0\leq x_{ij}\leq P_{i},~{}\forall j=1,\ldots,n\}, where designate the desired final state of charge (SoC) and the maximum power deliverable by the charger, respectively.
We analyse the results of several randomly generated cases, differing in the parameters characterizing the EV constraints , selected from a uniform random distribution: specifically, kW, and is chosen to be feasible in the specified time interval (0–35 kWh per 12 h interval). The pairs are i.i.d. extracted from a lognormal distribution for the diagonal entries of , and a uniform distribution for the vectors . The nominal electricity price, i.e., the diagonal entries of the matrix , have been derived by rescaling a winter weekday demand profile in the UK [49], whereas .
It should be noted that even though this example fits in the class of aggregative games, it does not necessarily meet the uniqueness requirement of the second part of Proposition 17. However, we have empirically observed that the main conclusion of the proposition still holds, namely, the minimum norm solution returned by Algorithm 1 remains unaltered when the algorithm is fed only with the samples with indices in . Informally, this happens due to the fact that for any feasible problem instance \mathbf{1}^{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}}x_{i}\geq E_{i} will always be binding at the optimum, and as result \mathbf{1}^{\mathchoice{\raisebox{0.75346pt}{\displaystyle\intercal}}{\raisebox{0.75346pt}{\textstyle\intercal}}{\raisebox{0.75346pt}{\scriptstyle\intercal}}{\raisebox{0.75346pt}{\scriptscriptstyle\intercal}}}\sigma(x) will be constant for any NE (see also transparent plane in Figure 3); a detailed investigation of this issue is topic of current research.
VI-B Simulation results
The NE charging schedules have been obtained by implementing Algorithm 1 with , and . At each iteration of the proposed algorithm a quadratic optimization needs to be solved; this was performed on a dual-core 7th gen. Intel processor using MATLAB. Figures 1–2 show the convergence of Algorithm 1 in the computation of the NE for EVs, and . A dominant linear convergence rate can be observed, and less than 4000 outer loop iterations were needed to meet the desired exit accuracy . The inner loop enjoys similar convergence rate (not shown for space reasons), and less than 30 iterations (15 in average) are needed to achieve an error smaller than .
To validate the a posteriori result of Theorem 8, Table I shows the average robustness performance of several solutions (with , ) obtained from different sets of samples, grouped according to the a posteriori observed compression cardinality ; we have set . The violation rate of each solution is empirically computed using newly extracted samples (according to the same aforementioned distributions) and counting the fraction of them that result in a change of the computed NE. Consistently with [35], we note that the observed value of is indicative of the confidence level on the equilibrium robustness. The experimental results are compared with the theoretical bound provided by Theorem 8 (third row). For non-degenerate problems, the conservatism of the latter can be reduced by employing the tighter expression for reported in (7), leading to the fourth row of Table I. However, note that in general it is difficult to verify whether a given problem is non-degenerate, thus preventing the use of (7). We observe that a bound of 2–3% could have been achieved with Theorem 8 by increasing the sample size to .
A visual representation of the concept of compression set is given in Fig. 3. The plot depicts the curves expressing the uncertain cost term associated to a subset of the samples used for the derivation of the NE . Values are plotted as a function of the aggregate demand on an interval around . In this case the (minimal) compression cardinality is , with supporting the solution together with the constraint on the target SoC which is binding in this case (transparent plane). Note that in this instance the constraints on the power rate are not active, and omitted from the plot for clarity.
We now investigate numerically the validity of the a priori result of Theorem 9. Fig. 4 shows the average compression set cardinality (solid line) observed over 50 trials, corresponding to different randomly generated cases corresponding to different values of and . In all cases, is bounded by as suggested by Theorem 9. In fact, the empirically calculated cardinality is significantly lower, suggesting that in this case study an a posteriori quantification is less conservative. Moreover, in all our numerical investigations we noticed that , i.e., the empirical estimate of the compression set cardinality is independent of the number of agents and is bounded by the number of individual decision variables (dashed line). We conjecture that for the aggregative EV charging game considered here, involving affine price functions, the so called support rank (see [50] for a definition) offers a tighter bound on the compression set cardinality compared to the total number of decision variables .
VII Concluding remarks
We considered the problem of NE computation in multi-agent games in the presence of uncertainty, and accompanied them with a priori and a posteriori certificates regarding the probability that the NE equilibrium remains unchanged when a new uncertainty realization is encountered.
Current work is concentrated towards relaxing the uniqueness requirements underpinning the compression set quantification of Section V for the class of aggregative games.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. Lamnabhi-Lagarrigue, A. Annaswamy, S. Engell, A. Isaksson, P. Khargonekar, R. M. Murray, H. Nijmeijer, T. Samad, D. Tilbury, and P. V. den Hof, “Systems & control for the future of humanity, research agenda: Current and future roles, impact and grand challenges,” Annual Reviews in Control , vol. 43, pp. 1 – 64, 2017.
- 2[2] H. Le Cadre, I. Mezghani, and A. Papavasiliou, “A game-theoretic analysis of transmission-distribution system operator coordination,” European Journal of Operational Research , vol. 274, no. 1, pp. 317 – 339, 2019.
- 3[3] A. De Paola, F. Fele, D. Angeli, and G. Strbac, “Distributed coordination of price-responsive electric loads: A receding horizon approach,” in 2018 IEEE Conference on Decision and Control (CDC) , Dec 2018, pp. 6033–6040.
- 4[4] I. Atzeni, L. G. Ordóñez, G. Scutari, D. P. Palomar, and J. R. Fonollosa, “Noncooperative day-ahead bidding strategies for demand-side expected cost minimization with real-time adjustments: A GNEP approach,” IEEE Transactions on Signal Processing , vol. 62, no. 9, pp. 2397–2412, May 2014.
- 5[5] R. Kamat and S. Oren, “Two-settlement systems for electricity markets under network uncertainty and market,” Power. Journal of Regulatory Economics , vol. 25, pp. 5–37, Jan 2004.
- 6[6] J. Yao, I. Adler, and S. Oren, “Modelling and computation of two-settlement oligopolistic equilibrium in a congested electricity network,” Operations Research , vol. 56, no. 1, pp. 34–47, Jan 2008.
- 7[7] G. Scutari, F. Facchinei, J. S. Pang, and D. P. Palomar, “Real and complex monotone communication games,” IEEE Transactions on Information Theory , vol. 60, no. 7, pp. 4197–4231, July 2014.
- 8[8] J. Krawczyk, “Numerical solutions to coupled-constraint (or generalised Nash) equilibrium problems,” Computational Management Science , vol. 4, pp. 183–204, Nov 2007.
