Distributed generalized Nash equilibrium seeking in aggregative games on time-varying networks
Giuseppe Belgioioso, Angelia Nedi\'c, Sergio Grammatico

TL;DR
This paper introduces a fully-distributed algorithm for finding generalized Nash equilibria in aggregative games over dynamic networks, enabling agents to reach equilibrium without direct access to the aggregate decision.
Contribution
It presents the first fully-distributed algorithm for generalized Nash equilibrium seeking in aggregative games on time-varying networks with partial information.
Findings
Algorithm converges under monotone operator splitting framework
Works on time-varying communication networks
Handles partial-decision information scenarios
Abstract
We design the first fully-distributed algorithm for generalized Nash equilibrium seeking in aggregative games on a time-varying communication network, under partial-decision information, i.e., the agents have no direct access to the aggregate decision. The algorithm is derived by integrating dynamic tracking into a projected pseudo-gradient algorithm. The convergence analysis relies on the framework of monotone operator splitting and the Krasnosel'skii-Mann fixed-point iteration with errors.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Distributed generalized Nash equilibrium seeking in aggregative games on time-varying networks
Giuseppe Belgioioso, Angelia Nedić and Sergio Grammatico G. Belgioioso is with the Control Systems group, TU Eindhoven, The Netherlands. A. Nedić is with the School of Electrical, Computer, and Energy Engineering, Arizona State University, USA. S. Grammatico is with the Delft Center for Systems and Control (DCSC), TU Delft, The Netherlands. E-mail addresses: [email protected], [email protected], [email protected]. This work was partially supported by NWO (projects OMEGA, 613.001.702; P2P-TALES, 647.003.003), the ERC (project COSMOS, 802348) and by the Office of Naval Research grant no. N000141612245.
Abstract
We design the first fully-distributed algorithm for generalized Nash equilibrium seeking in aggregative games on a time-varying communication network, under partial-decision information, i.e., the agents have no direct access to the aggregate decision. The algorithm is derived by integrating dynamic tracking into a projected pseudo-gradient algorithm. The convergence analysis relies on the framework of monotone operator splitting and the Krasnosel’skii–Mann fixed-point iteration with errors.
I Introduction
An aggregative game is a collection of inter-dependent optimization problems associated with noncooperative decision makers, or agents, where each agent is affected by some aggregate effect of all the agents [1]. Remarkably, aggregative games arise in several applications, such as demand side management in the smart grid [2], e.g. for charging/discharging electric vehicles [3], demand-response regulation in competitive markets [4], congestion control in traffic and communication networks [5]. The common denominator is the presence of a large number of selfish agents, whose aggregate actions may disrupt the shared infrastructure, e.g. the power grid or the transportation network, if left uncontrolled.
Designing solution methods for multi-agent equilibrium problems in noncooperative games has recently gained high research interest. Several authors have developed semi-decentralized and distributed equilibrium seeking algorithms for games without coupling constraints [6] and, more recently, for games with coupling constraints [7, 8, 9, 10].
With focus on the generalized Nash equilibrium (GNE) problem, the formulations in [9, 10] have introduced an elegant approach based on monotone operator theory [11] to characterize the equilibrium solutions as the zeros of a monotone operator. Not only is the monotone-operator-theoretic approach general – e.g., unlike variational inequalities, smoothness of the cost functions is not required – but also computationally viable, since several algorithmic methods to solve monotone inclusions are already well established, e.g. operator-splitting methods [11, §26].
However, in the aforementioned literature on noncooperative equilibrium computation, it is assumed that the agents have direct access to the decisions of all their competitors, allowing every agent to evaluate its cost function without the need of extra communication. This game setup is known as full-decision information.
In aggregative games, this ideal scenario is achieved via the so-called semi-decentralized communication structure, where a central node gathers and broadcasts the aggregation variable to all the agents, see e.g. [7]-[9].
Recently, in the broader context of noncooperative games, the authors in [12, 13] propose fully-distributed algorithms for equilibrium seeking under partial-decision information, i.e., each agent can only observe the decision of some neighboring agents, while its cost function possibly depends on all the other agents’ decision. In [12], to deal with the lack of information, the agents are endowed with auxiliary variables, namely, the estimates of the decisions of the other agents. Then, a consensus protocol is combined with accelerated projected-pseudo-gradient dynamics to steer the estimates towards their real value and, consequently, the decisions to a Nash equilibrium, in the same time-scale. In [13], similar ideas are developed in the general framework of monotone operator theory to design an algorithm for games with coupling constraints. The algorithms proposed in [12, 13] require a number of auxiliary variables (i.e., the estimates of the decisions of all the other agents) which is proportional to the number of agents in the game. From a practical perspective, this can be regarded as a drawback in terms of memory storage and communication requirements, especially in games with very large number of agents.
Scalability with respect to the population size indeed motivates us to focus on aggregative games. In this context, the authors in [14] propose an algorithm that relies on dynamic tracking, a technique that allows a group of agents to locally track the average of some reference inputs, extensively used in distributed optimization for gradient tracking, e.g. [15]. Specifically, the authors embed dynamic tracking of the aggregate decision in a projected-pseudo-gradient update to compute a Nash equilibrium in a fully-distributed fashion (i.e., without the need of a central coordinator). In the context of aggregative games with coupling constraints, an algorithm is proposed in [16], however with important limitations: it requires a very large number of distributed communication rounds before each strategy update; convergence is guaranteed to approximate solutions (i.e., Nash equilibria) only; the communication network must be time-invariant.
More recently, two fully-distributed algorithms [17, 18], for generalized aggregative games over time-invariant and connected networks, have been proposed to compute an exact solution (i.e., GNE), without the need of multiple communication rounds before every strategy update. To cope with the lack of information, both algorithms introduce local estimates and dynamic tracking of the aggregate decision. In [17], global convergence is proved under strong monotonicity of the pseudo-gradient, by leveraging a rescrited-monotonicity property of this mapping in the exteneded space of strategies and estimates. In our preliminary work [18], this assumption is relaxed to cocoercivity at the cost of having vanishing step-sizes, which typically imply slow convergence. Unfortunately, the extension of both methodologies to cover time-varying communication networks is currently missing, since the operator theoretic framework on the basis of their convergence analysis fails when the underlying mappings vary over time.
Contribution
In this paper, we solve these technical issues and propose the first discrete-time, fully-distributed algorithm to compute a generalized Nash equilibrium in aggregative games with coupling constraints over a time-varying and repeatedly-connected communication network. The algorithm is obtained by combining dynamic tracking, projected-pseudo-gradient and Krasnosel’skii–Mann dynamics. The key approach to prove convergence of our proposed algorithm relies on applying and tailoring the framework of operator splitting methods [11] and fixed-point iteration with errors [19].
Organization of the paper
In Section II, we formalize the generalized Nash equilibrium seeking problem for aggregative games over a time-varying communication network. In Section III, we present a fully-distributed algorithm and discuss its interpretation from an operator theoretic and fixed-point perspective. In Section IV, we establish global convergence of the proposed method. To corroborate the theory, in Section V, we study the performance of the proposed method on a Nash–Cournot game. Concluding remarks and future research directions are discussed in Section VI.
Basic notation
denotes the set of real numbers, and the set of extended real numbers. () denotes a matrix/vector with all elements equal to [math] (); to improve clarity, we may add the dimension of these matrices/vectors as subscript. Given two sets, and , we denote as their Cartesian product. Given sets, , we denote with \mathrm{conv}(\mathcal{S}_{1},\ldots,\mathcal{S}_{N})=\big{\{}a_{1}x_{1}+\ldots+a_{N}x_{N}\,|\;\sum_{i=1}^{N}a_{i}=1,\,a_{i}\in\mathbb{R}_{\geq 0},\,x_{i}\in\mathcal{S}_{i},\,\forall i\in\{1,\ldots,N\}\big{\}} the convex hull of their union. denotes the Kronecker product between the matrices and . For a square matrix , where is the entry in position , its transpose is ; () stands for positive definite (semidefinite) matrix; denotes the largest singular value of ; denotes the infinity norm. If , denotes the -induced norm, such that , we omit the subscript when . Given matrices , denotes a block diagonal matrix with as diagonal blocks. Given vectors , , , ; given a vector , .
Operator theoretic definitions
denotes the identity operator. The mapping denotes the indicator function for the set , i.e., if , otherwise. For a closed set , the mapping denotes the projection onto , i.e., . The set-valued mapping denotes the normal cone operator for the set , i.e., if , otherwise. For a function , ; denotes its subdifferential set-valued mapping, defined as . A set-valued mapping is (strictly) monotone if for all , , ; is restricted-(strictly) monotone with respect to (w.r.t.) if for all , , , ; is strongly monotone, with , if for all , , ; and denote the set of fixed points and of zeros, respectively. A single-valued mapping is -Lipschitz continuous, with , if for all ; is nonexpansive if it is -Lipschitz continuous; is -averaged, with , if , for all ; is -cocoercive, with , if is -averaged.
II Problem statement
Consider a set of agents indexed by . The -th agent is characterized by a local strategy set and a cost function , which depends on the decision of agent , , and on the aggregate of all agent decisions, i.e.,
[TABLE]
Moreover, we assume that the collective strategy profile must satisfy a coupling constraint, described by the affine function , where , , and , are local parameters known to agent only. In summary, the aim of each agent , given the decision variables of the other agents, i.e., , is to choose a strategy that solves its local optimization problem, according to the game setup above, i.e.,
[TABLE]
where the last constraint is equivalent to .
Remark 1
Affine coupling constraints, as considered in this paper, are very common in the literature of noncooperative games, e.g. [8, 10, 13, 16], and cover several applications where they typically arise in the form of upper and lower limits on the available shared resources, e.g. [2]-[5].
Assumption 1
For all and any fixed , the function is convex and continuously differentiable, is non-empty, compact and convex. The global feasible set is non-empty and satisfies Slater’s constraint qualification.
From a game-theoretic perspective, our goal is to distributively compute a generalized Nash equilibrium of the aggregative game described by the inter-dependent optimization problems in (4).
Definition 1** (Generalized Nash equilibrium)**
*A collective strategy is a generalized Nash equilibrium (GNE) of the game in (4) if, for all : *
[TABLE]
II-A Communication networks
We consider a time-varying network to model the communications among agents over time. At each stage , the communication is described by an undirected graph , where is the set of vertices (agents) and is the set of edges. An unordered pair of vertices belongs to if and only if agents and can exchange information. The set of neighbors of agent at stage is defined as . Next, we assume the graphs sequence to be connected.
Assumption 2
There exists an integer such that the graph is connected, for all .
This assumption ensures that the intercommunication intervals are bounded for agents that communicate directly. In other words, every agent sends information to each of its neighboring agents at least once every time intervals.
We consider a mixing matrix associated with , whose elements satisfy the following assumption.
Assumption 3
For all , the matrix satisfies the following conditions:
- (i)
(Edge utilization) Let , . If , , for some ; otherwise; 2. (ii)
(Positive diagonal) For all , ; 3. (iii)
(Double-stochasticity) , .
Assumption 3 is strong but typical for multiagent coordination and optimization, e.g. [15, 20]. For an undirected graph it can be fulfilled, for example, by using Metropolis weights:
[TABLE]
Finally, let us introduce the so-called transition matrices from time to :
[TABLE]
for , where , for all . The following statement shows the convergence properties of the transition matrix .
Lemma 1** ([21, Lemma 5.3.1])**
Let Assumptions 2, 3 hold true. Then, the following statements hold:
- (i)
, for all . 2. (ii)
The convergence rate of is geometric, i.e., for all , where and
[TABLE]
with as in Assumption 2 and as in Assumption 3.
II-B GNE as zeros of a monotone operator
As first step, we characterize a GNE of the game in terms of the KKT conditions of the coupled optimization problems in (4). For each agent , let us introduce the Lagrangian function , defined as
[TABLE]
where is the dual variable of agent associated with the coupling constraints, and is the indicator function. It follows from [22, §12.2.3] that the set of strategies is a GNE of the game in (4) if and only if the following coupled KKT conditions are satisfied for some :
[TABLE]
Within all the possible GNE, we focus on an important subclass of equilibria, namely the variational GNE (v-GNE), that enjoy some relevant structural properties, such as “larger social stability” and “economic fairness” and corresponds to the solution set of the KKT conditions in (8) with equal dual variables, i.e., [23, Theorem 3.1]. The next proposition characterizes the subclass of v-GNE as the solution to a specific variational inequality problem111For a single-valued mapping and a set , the variational inequality problem VI is the problem of finding a vector such that , for all , [24, Def. 1.1.1]. , or equivalently as the zero set of the set-valued mapping
[TABLE]
where , , is the normal cone operator associated with a set and is the so-called pseudo-gradient mapping (PG) defined as
[TABLE]
Proposition 1
Let Assumption 1 hold. Then, the following statements are equivalent:
- (i)
* is a variational GNE of the game in (4);* 2. (ii)
* such that, the pair is a solution to the KKT in (8), for all ;* 3. (iii)
* is a solution to VI*; 4. (iv)
* such that . *
Proof:
The equivalences (i)(ii)(iii) are proven in [23, Th. 3.1] while (iii)(iv) follows by [25, Th. 3.1]. ∎
The following assumptions on the PG in (10) are standard (e.g. [8, Th. 3], [10, Assumption 2], [26, Assumption 3]) and sufficient to ensure the convergence of standard GNE seeking algorithms based on projected-pseudo-gradient dynamics.
Assumption 4
* in (10) is cocoercive over . *
When is strongly monotone and Lipschitz, then is also cocoercive. However, in general, cocoercive mappings are not necessarily strongly monotone, e.g. the gradient of a (non-strictly) convex and smooth function.
To emphasize the structure of in (10), we define
[TABLE]
that satisfies , for all . Then, we define the extended pseudo-gradient mapping (EPG)
[TABLE]
where each component mapping is given by (11). With this notation, we have . Next, we assume Lipschitz continuity of the EPG, which is usual in the context of games under partial-decision information, see e.g. [13, Assumption 3], [14, Assumption 3], [17, Assumption 4].
Assumption 5
Let be the set whose elements are convex combination of the elements from the local sets ’s. The mapping in (12) is uniformly Lipschitz continuous over , with , i.e., there exists such that, for all and ,
[TABLE]
**
Remark 2** (Existence and uniqueness of a v-GNE)**
It follows by [27, Cor. 2.2.5] that VI(,) has a non-empty and compact solution set, since is non-empty, compact and convex and is continuous, by Assumption 1. Furthermore, when is strictly monotone, then the solution to VI(,), (i.e., the v-GNE of the game), is unique [27, Th. 2.3.3].
II-C Boundedness of the dual variables
In the next statement, we formally establish the boundedness of the dual solution set of VI() or, equivalently, of the dual part of the monotone inclusion .
Lemma 2
Let Assumptions 1 hold true. If , then , where is bounded.
Proof:
The boundedness of the dual solution set follows by [25, Proposition 3.3] since VI(, ) has a non-empty bounded solution set by Remark 2 and there exists a vector satisfying Slater’s constraint qualification by Assumption 1. ∎
Let us denote with the largest entry of all the optimal dual vectors. The agents can locally build a bounded superset of the optimal dual set as follows: [28, p. 21]. In the context of distributed constrained optimization, a local estimate of can be constructed based on a Slater’s vector, see [29, §4.2], [30, §3.A (2)]. The extension of these estimation methods to generalized noncooperative games would rely on Lagrangian duality theory for variational inequalities [25]. In practice, each agent does not need an accurate estimate of the optimal dual solution set and can simply construct a local superset by taking large enough.
II-D A standard semi-decentralized algorithm
It follows by Proposition 1 that the original GNE seeking problem corresponds to the following monotone inclusion problem:
[TABLE]
Next, we recall a standard semi-decentralized GNE seeking algorithm obtained by solving the monotone inclusion problem in (13) by means of a preconditioned forward-backward (pFB) splitting [26, Alg. 1].
Algorithm 1. Semi-decentralized v-GNE seeking
Iterate until convergence
[TABLE]
Remark 3
The local auxiliary variables ’s are introduced to cast Algorithm 1 in a more compact form. The average measures the violation of the coupling constraints, technically, is the “reflected violation” of the constraints at iteration .
If the step sizes and are chosen small enough, then the sequence generated by Algorithm 1 converges to some , where is a v-GNE, see [26, Th. 1] for a formal proof of convergence.
We note that Algorithm 1 is not distributed. In fact, at each iteration , a central coordinator is needed to:
- (i)
gather and broadcast the average strategy ; 2. (ii)
gather the average quantity ; 3. (iii)
update and broadcast the dual variable .
III A distributed GNE seeking algorithm
III-A Towards a fully distributed algorithm
A first step towards a fully-distributed algorithm consists of endowing each agent with a copy, , of the dual variable and enforcing consensus on the local copies. Consider the set-valued mapping , obtained by augmenting in (9) with the local copies of the dual variable:
[TABLE]
where , , , and represents the projection onto the disagreement space.
Remark 4
When the local copies of the dual variable are equal, i.e., , where is the consensus subspace of dimension , the first row block of corresponds to that of , while each of the components of the second row block of describes the same complementarity condition, namely, the second row block of .
We note that the mapping in (14) can be written as the sum of two operators, i.e.,
[TABLE]
where is a skew-symmetric linear mapping defined as
[TABLE]
The formulation is called splitting of , and will be exploited in different ways later on. The next lemma shows that is maximally monotone and that is cocoercive and strictly monotone with respect to the consensus subspace of the dual variables, i.e., .
Lemma 3
Let Assumptions 1, 4 hold true. The following statements hold:
- (i)
* in (16) is maximally monotone on ;* 2. (ii)
* in (15) is *cocoercive, with and restricted-strictly monotone w.r.t. , i.e., for all , , it holds that ; 3. (iii)
* is maximally monotone on and restricted-strictly monotone w.r.t. . *
Proof:
See Appendix -A. ∎
The next proposition exploits the restricted-strict monotonicity of to shows that the v-GNE of the original game are fully characterized by the zeros of .
Proposition 2
Let Assumption 1 hold true. The following statements hold:
- (i)
, 2. (ii)
If , then is a v-GNE and , with .
Proof:
See Appendix -B. ∎
To find a zero of , we exploit a preconditioned version of the forward-backward method [11, §25.6] on the splitting (15)-(16), similarly to [10, 26], thus obtaining Algorithm 2.
The next theorem establishes global convergence of Algorithm 2 to a v-GNE if the step-sizes are chosen according to the following choices.
Assumption 6
Take , where as in Assumption 4. Set the global parameter and denote . Set the step-sizes as follows:
- (i)
, for all , 2. (ii)
, for all , 3. (iii)
* such that for all and . *
Note that the design choice , for all , always satisfies Assumption 6 (iii).
Theorem 1
Let Assumptions 1, 4 hold. If the step-sizes and are set as in Assumption 6, then the sequence generated by Algorithm 2 converges to some , where is a v-GNE of the game in (4).
Proof:
See Appendix -C. ∎
Remark 5** (Algorithm 2 as a fixed-point iteration)**
Our convergence analysis is based on the same operator theoretic framework in [10]-[26]. Specifically, we recast the dynamics generated by Algorithm 2 as the fixed-point iteration
[TABLE]
where is the stacked vector of the iterates and is the so-called pFB operator, defined as
[TABLE]
*where , in (15)-(16) characterize the splitting of , and is the so-called preconditioning matrix, here chosen as *
[TABLE]
, . Then, we show that, if the step sizes in the main diagonal of are set according to Assumption 6, the mapping is averaged with respect to the -induced norm, i.e., . Hence, the fixed-point iteration (18) converges to some , where is a v-GNE. See Appendix -C for a complete convergence analysis.
To conclude this section, we note that the projected-pseudo-gradient updates in Algorithm 2 can be cast compactly as
[TABLE]
where
[TABLE]
and .
Unlike Algorithm 1, Algorithm 2 does not directly rely on the actions of a central coordinator, namely, dual update and broadcast communication. However, it requires an all-to-all information exchange (or, equivalently, a complete communication graph) at each iteration , since the local updating rule of each agent necessitates the knowledge of:
- (i)
the average strategy , 2. (ii)
the average dual variable , 3. (iii)
the average quantity .
III-B A fully-distributed algorithm via dynamic tracking
To implement Algorithm 2 fully-distributively under the more realistic communication assumptions in Section II-A, we approximate its updates by endowing each agent with some surrogate variables (or estimates), i.e., , and , that dynamically track the averages , and , respectively. Then, to mitigate the errors due to the inexactness of the surrogate variables, we relax the projected-pseudo-gradient iterations by means of a Krasnosel’skii–Mann (KM) process [11, eq.(5.12)], whose step-sizes are set according to the following design choice.
Assumption 7
The sequence satisfies the following conditions:
- (i)
(non-increasing) , for all ; 2. (ii)
(non-summable) ; 3. (iii)
(square-summable) .
For example, Assumption 7 is satisfied for step sizes of the form where .
The proposed algorithm relies on agents constructing an estimate of the averages by mixing information drawn from local neighbors and making a subsequent relaxed projected-pseudo-gradient step, as in Algorithm 2. To build the estimates , , , at every iteration , agent receives ’s, ’s, ’s from its neighbors, , and aligns its intermediate estimates according to the following rules:
[TABLE]
Then, on the basis of , and , agent updates its strategy , its dual variable and the new estimates as formalized in Algorithm 3.
Note that the projected-pseudo-gradient updates in Algorithm 3 can be recast in a compact form as
[TABLE]
where
[TABLE]
and for some .
IV Convergence analysis
To prove the convergence of Algorithm 3, we rely on the framework of the inexact Krasnosel’skii–Mann fixed-point iteration [19, Alg. 5.4]. Informally speaking, our goal is to show that the error deriving from the inexactness of the estimates ’s, ’s and ’s vanishes to zero fast enough, in which case, also generated by Algorithm 3 globally converges to a v-GNE. Technically, we aim at exploiting [19, Th. 5.5], which establishes convergence of an inexact version of the KM iteration in (18), i.e.,
[TABLE]
when is nonexpansive and the step-size and error sequences, and , respectively, satisfy
- (C.1)
, 2. (C.2)
.
Note that Algorithm 3 can be written as the KM with errors in (25) where and the error at stage is
[TABLE]
where and denote the iterates generated by Algorithm 2 (defined in (21) and (22), respectively). In other words, represents the distance between the iterates in the ideal case of full-decision information (i.e., where the agents have an exact knowledge of the averages , and ) and the iterates of Algorithm 3, in which the averages are replaced by the estimates , and , built on-line by mixing information drawn from local neighboring agents only.
The main technical challenge to invoke [19, Th. 5.5] and, in turn, prove the convergence of Algorithm 3 is to find a step-size sequence , that complies with (C.1), such that the relaxed error sequence satisfies (C.2). We immediately note that if is chosen as in Assumptions 7, then it already satisfies (C.1). In the following subsection, we show that (C.2) is also satisfied.
IV-A Analysis of the relaxed error sequence
In the next lemma, we recall a fundamental invariance property of dynamic tracking, namely, at each stage , the averages among the estimates ’s, ’s, and ’s are equivalent to the correspondent averages we aim to track.
Lemma 4
Let Assumption 3 hold true and set the initial conditions as in Algorithm 3, for all . Then, the following equations hold for all :
- (i)
; 2. (ii)
; 3. (iii)
.
Proof:
See Appendix -D. ∎
The following assumption on the dual sequences generated by Algorithm 3 is instrumental for the subsequent lemma.
Assumption 8
The sequence generated by Algorithm 3 is bounded, i.e., there exists such that , for all .
For example, in the context of distributed constrained optimization, Assumption 8 can be enforced by changing the local dual updates by projecting onto a local bounded set that contains the optimal dual set [29], [30]. See Section II-C for a discussion on how to locally build such supersets.
The next lemma provides upper bounds for the estimation errors at each stage of Algorithm 3.
Lemma 5
Let Assumptions 1-3, 8 hold true. Then, there exist some positive constants , , , and and a vanishing scalar sequence defined as
[TABLE]
with as in (7) and as in Assumption 7, such that the following upper bounds hold for all :
- (i)
; 2. (ii)
; 3. (iii)
.
Proof:
See Appendix -E. ∎
By exploiting the upper bounds in Lemma 5 and a result on the convergence of scalar sequences, which is recalled next, we can show that the estimates asymptotically converge to their correspondent aggregate true values.
Lemma 6** ([31, Lemma 3.1])**
Let be a sequence.
- (a)
If and , then . 2. (b)
If for all , and , then .
Proposition 3
Let Assumptions 1-3 hold true. Then, the following statements hold:
- (i)
; 2. (ii)
; 3. (iii)
.
Proof:
(i) From the upper bound in Lemma 5 (i), we have
[TABLE]
where , since by Lemma 1, and by Lemma 6 (a), since and by Assumption 7. Hence, . The proofs of (ii) and (iii) are analogous. ∎
Next, we derive an upper bound for the error in (26) that directly depends on the estimation errors in Lemma 5.
Lemma 7
Let Assumptions 1-3, 8 hold true. Then, the following bound holds for all :
[TABLE]
Proof:
See Appendix -F. ∎
Finally, by combining the upper bounds in Lemma 5 and 7 and exploiting a result on the convergence of scalar sequences, i.e., Lemma 6 (b), we show that condition (C.2) holds, namely, the relaxed error sequence is summable.
Lemma 8
Let Assumptions 1-3, 8 hold true. The sequence , with as in (26), is summable, i.e.,
[TABLE]
Proof:
See Appendix -G. ∎
Now, we can prove the convergence of Algorithm 3.
Theorem 2
Let Assumptions 1-5, 8 hold true, the step sizes be set as in Assumption 6, and as in Assumption 7. Then, the sequence generated by Algorithm 3 globally converges to some , where is a v-GNE of the game in (4).
Proof:
For all , the iterations of Algorithm 3 can be cast as the Krasnosel’skii–Mann process with errors , where , as in (19) and as in (26). By [19, Th. 5.5], the sequence converges to some , since is averaged, thus nonexpansive, by Lemma 11, and (C.1)(C.2) hold, by Assumption 7 and Lemma 8, respectively. To conclude, we note that , by [11, Prop. 25.1 (iv)], and that , with as in (14), since , by Lemma 9, and . Since , then is a v-GNE of the game in (4), by Proposition 2 (ii). ∎
V Numerical simulations
In this section, we study the performance of the proposed algorithm on a class of network Nash–Cournot games with market capacity constraints. Such games represent an instance of generalized aggregative Nash games. In Section V-A, we describe the player cost functions and strategy sets and verify that the necessary assumptions are satisfied. In Section V-B, we compare the performance of our algorithm against a standard semi-decentralized method (Algorithm 1).
V-A Generalized network Nash–Cournot game
We extend the network Nash–Cournot game model proposed in [14, §IV] with additional market capacity constraints. Specifically, consider firms that compete over markets. Let firm ’s production and sales at location be denoted by and , respectively, while its cost of production at location is denoted by and defined as follows:
[TABLE]
where and are scaling parameters for agent .
The goods sold by firm at location fetch a revenue , where denote the sales price at location and represents the aggregate sales at location . The market price is set according to an inverse demand function which depends on the aggregate of the network, i.e.,
[TABLE]
where is the overall demand for location . Each firm has a production limitation at location , described by . Moreover, the overall production in each market must meet the correspondent demand and do not exceed a maximum capacity . Hence, the coupling constraints , for all , have to be satisfied.
Overall, each firm , given the strategies of the other firms, aims at solving the following optimization problem:
[TABLE]
Effectively, the payoff function of firm is parametrized by nodal aggregate sales and its constraints depend on the other firms’ strategies, thus leading to a generalized aggregative game. In this example, we assume that the firms communicate over a dynamic network to cope with the lack of aggregate information, which is necessary to compute their optimal production and sale strategies.
Next, we show that the proposed network Nash–Cournot game does satisfy our technical setup. Let denote the strategy vector of agent and denote the collective strategy profile. The cost function of agent is quadratic, convex in , continuously differentiable and can be cast in a compact form as
[TABLE]
where , and . The local feasible set of firm is non-empty (for an adequate choice of ’s ), convex, compact and reads as .
The coupling constraints are affine and can be written in compact form as in (4), with and , for all . Thus, Assumption 1 is satisfied.
The pseudo gradient mapping is affine and reads as
[TABLE]
with
[TABLE]
and . By a direct inspection of the eigenvalues of , we can show that is strongly monotone and Lipschitz continuous, when the coefficients ’s are positive. Hence, Assumption 4 is satisfied. In particular, it follows by [27, p.79] that is cocoercive with . Moreover, since is strongly monotone and the sets are compact, it follows by Remark 2 that there exists a unique v-GNE. The mapping is affine and reads as
[TABLE]
Similarly, it can be shown that is Lipschitz continuous with . Thus, Assumption 5 is satisfied.
V-B Simulations studies
In our numerical study we consider a network Nash-Cournot game played by firms, i.e., , over markets, i.e., . All the parameters of the game are drawn from uniform distributions and fixed over the course of the entire simulations. Specifically, for all and , we set the parameters of production cost in (28) as and , where denotes the uniform distribution over an interval with . We set the production capacities of firm as for all and for all . Moreover, the demand at market is set as , while the market capacity as for all .
At each iteration , the firms communicate according to a randomly generated and connected small world, where each node has 4 neighbors. To create a doubly stochastic mixing matrix , we exploit the Metropolis weighting rules in (5). Thus, Assumptions 2 and 3 are satisfied. The agents update their decisions and their estimates as in Algorithm 3. The step-sizes are set according to Assumption 6, where the global parameter is set larger than the theoretical lower bound , where and as in (31).
In Figure 1, we show the trajectories of the sequences of normalized residuals for different choices of the step-size sequence . Moreover, we compare the trajectories of Algorithm 3 with those obtained with Algorithm 1 [26, Alg. 1], which is a semi-decentralized algorithm and works under the assumption of full-decision information, i.e., the firms have access to the real aggregate information at each stage of the algorithm. As expected, the semi-decentralized algorithm converges faster than the fully-distributed counterpart. Interestingly, we notice that convergence is achieved also in the case of fixed relaxation step in the KM process, e.g. for all , which is not supported by our theoretical analysis.
In Figure 2, we compare the trajectories of the consensus disagreement of the dual variables for two choices of the step-size sequence .
VI Conclusion
For a general class of aggregative games with linear coupling constraints over time-varying communication networks, we have designed the first single-layer, fully-distributed algorithm to compute a variational generalized Nash equilibrium. Global convergence can be established via monotone-operator-theoretic and fixed-point arguments, integrated with a dynamic tracking methodology.
The analysis approach in this paper is genuinely novel, hence opens up a number of new research directions. Motivated by the numerical results of Section V, it would be valuable to explore the computational aspects of the proposed method, e.g. how the connectivity of the communication networks influences the convergence speed. Whether or not the proposed algorithm converges with fixed step sizes in the Krasnosel’skii-Mann process is currently an open question. Finally, it would be highly valuable to relax the assumption of double-stochasticity of the mixing matrices.
-A Proof of Lemma 3
(i) is the sum of two terms: in (20) which is a linear, skew symmetric mapping, thus maximally monotone [11, Ex. 20.30]; and which is maximally monotone since is the direct sum of maximally monotone operators [11, Prop. 20.23] (i.e, the normal cones of the closed convex sets and ). Hence, the maximal monotonicity of follows by [11, Cor. 24.4 (i)] since .
(ii) is cocoercive, by Assumption 4, and is cocoercive by [24, p.79], since is a linear, positive semi-definite mapping with . It follows that the direct sum is cocoercive, for all such that . Now, we show that is restricted-strictly monotone w.r.t. . Let us recall that and are the (-dimensional) consensus and disagreement subspaces, respectively. Moreover, each vector , can be split as , with and . Consider now , hence , with and . Let , hence and . The following inequalities show that in (16) is restricted-strictly monotone w.r.t. :
[TABLE]
where , with projection onto the disagreement subspace and is the second smallest eigenvalue of . The first inequality follows by the cocoercivity of (Assumption 4) and since , namely, the projection onto the disagreement subspace of the consensual terms is zero.
(iii): The maximal monotonicity of follows by [11, Cor. 24.4 (i)], since is cocoercive (thus maximally monotone [11, Example 20.31]), is maximally monotone and . Moreover, since is also restricted-strictly monotone with respect to then enjoys the same property.
-B Proof of Proposition 2
(i) By Proposition 1, there exists such that , where is a v-GNE. Define , with , then we have . In fact, each component of the first row block of reads as . While, each component of the second row block of reads as , since and . Hence, .
(ii) From the first part of the proof, we know that there exists such that . Now, we show that all the zeros of lie in . By contradiction, let and assume . Then, , and Lemma 3 (iii) yields , which is impossible. Therefore, , namely . Now, by substituting into (since ) we recover that , which, by Proposition 1, holds if and only if is a v-GNE.
-C Proof of Theorem 1
To prove convergence of Algorithm 2 we follow the same technical reasoning of the proof in [10, Alg. 1]. Specifically, the proof is divided in two parts to show that:
- (1)
Algorithm 2 corresponds to the fixed-point iteration in (18), i.e., , where is the so-called pFB operator. 2. (2)
If the step sizes are set as in Assumption 6, then is an averaged operator. Hence, (18) globally converges to some . Since , with as in (14), then is a v-GNE, by Proposition 2.
(1): Let us recast Algorithm 2 in a compact form as
[TABLE]
Since , and , it follows from (32) that , which leads to
[TABLE]
where we used . Similarly, since and , it follows from (33) that , which leads to
[TABLE]
Let , then the inclusions in (36)(37) can be cast in compact form as
[TABLE]
where , and as in (15), (16) and (20), respectively. By making explicit in (38), we obtain
[TABLE]
which corresponds to , where is the pFB operator in (19). Finally, it follows by (34)(35) that , which concludes the proof.
(2): Next, we introduce some technical statements that we exploit later on in this proof.
Lemma 9
Let the step-sizes satisfy Assumption 6. Then the following statements hold:
- (i)
, with as in Assumption 6, 2. (ii)
.
Proof:
(i): By the generalized Gershgorin circular theorem [32, Th. 2], each eigenvalue of the matrix in (20) satisfies at least one of the following inequalities:
[TABLE]
Hence, if we set the step-sizes as in Assumption 6, the inequalities (40)-(41) yield to . It follows that the smallest eigenvalue of , i.e., , satisfies . Thus, is positive semi-definite.
(ii): Let be the largest eigenvalue of . We have that . Moreover, . Hence . ∎
Lemma 10
Let Assumptions 1 and 4 hold and the step-sizes satisfy Assumption 6. The following properties hold in the -induced norm (i.e., ):
- (i)
* is **cocoercive and is *averaged; 2. (ii)
* is maximally monotone and is *averaged.
Proof:
(i): Since is single-valued and nonsingular, by Lemma 9 (i), for each
[TABLE]
where the last inequality follows by Lemma 9 (ii). By (42) and the cocoercivity of (Lemma 3 (ii))
[TABLE]
In other words, is cocoercive in the induced norm. It follows from [11, Prop. 4.33] that is averaged in the induced norm.
(ii): is maximally monotone in the induced norm, since is maximally monotone by Lemma 3 (i). By [11, Prop. 23.7], the resolvent mapping is averaged (or firmly-nonexpansive, see [11, Remark 4.24]) in the induced norm, since is maximally monotone in the same norm. ∎
Lemma 11
Let Assumptions 1, 4 hold and the step-sizes satisfy Assumption 6. Then, the pFB operator is averaged in the induced norm (i.e., ), with .
Proof:
By [11, Proposition 4.4] , the mapping is averaged with respect to , since composition of and which are and averaged in , respectively, by Lemma 10. Moreover, , since , by Assumption 6. ∎
The fixed-point iteration (18), that corresponds to Algorithm 2 by the first part of this proof, is the Krasnosel’skii-Mann iteration on the mapping , which is averaged, with , by Lemma 11. The convergence of (18) to some follows by [11, Prop. 5.15]. To conclude, we note that , by [11, Prop. 25.1 (iv)], and that , with as in (14), since , by Lemma 9 (i), and . Since the limit point , by Proposition 2 (i), then is a v-GNE of the game in (4), by Proposition 2 (ii), thus concluding the proof.
-D Proof of Lemma 4
We prove equation (i) by induction. At step zero, holds if the estimates are initialized as , for all . At step , we assume that . To conclude the proof, we show that relation (i) holds at step :
[TABLE]
The first equality follows from the updating rule of the ’s in Algorithm 3, the second follows by definition of , i.e., , the third follows since the mixing matrix is column stochastic, i.e., , by Assumption 3, while the last equality follows from the induction step , i.e., . The proof of equations (ii) and (iii) are analogous.
-E Proof of Lemma 5
For easy of notation, this proof is developed for the scalar case, i.e., . In this case, we can write .
(i): The update of the estimates ’s in Algorithm 3 can be written in a compact form as
[TABLE]
By telescoping (44), we obtain
[TABLE]
where the transition matrices ’s are defined in (6). By rearranging (44), we can write . Then, by exploiting the equivalence in (45), we have
[TABLE]
Now, consider , which may be written as follows:
[TABLE]
By Lemma 4, we have that , which leads to
[TABLE]
From equations (46) and (47), we have the following:
[TABLE]
where (a) follows from the Cauchy–Schwarz inequality, while (b) since for all , by Lemma 1. Next, we find an upper bound for in (48). The update of the decisions ’s can be written in a compact form as . We note that , for all since is obtained by projecting onto and is a convex combination of elements of the convex set . Since all the sets ’s are compact, by Assumption 1, it follows that for some constant , we have
[TABLE]
By combining (49) and (48), we obtain
[TABLE]
where we exploited the initialization step of Algorithm 3, i.e., , from which .
(ii): The update of the estimates ’s in Algorithm 3 can be written in a compact form as
[TABLE]
By telescoping (50), we obtain
[TABLE]
To upper bound , we note that the dual update in Alg. 3 reads in compact form as and that the dual sequence is positive and norm bounded, by Assumption 8. Hence, we have , that substituted into (51) gives
[TABLE]
(iii): The update of the estimates ’s in Algorithm 3 can be written in a compact form as
[TABLE]
By telescoping (52) (as explained in (45)), we obtain
[TABLE]
Now, consider , which may be written as follows:
[TABLE]
By Lemma 4, we have that , for all , which leads to
[TABLE]
From the relations (53) and (54), we have the following:
[TABLE]
where the first equality, (a), follows by substituting (53) and (54) to and , respectively, (b) follows from Cauchy–Schwartz inequality and (c) since for all , by Lemma 1. Now we build an upper bound for in (55):
[TABLE]
where (a) follows from the triangular inequality and (b) follows from (49)
Next, we build an upper bound for the term in the right hand side of (56).
[TABLE]
where (a) follows by exploiting the compact update of in (23), (b) follows by the nonexpansiveness of the projection operator, (c) follows by exploiting, in sequence, the triangular inequality, the Lipschitz continuity of (Assumption 5), and the Cauchy-Schwartz inequality, finally (d) follows from the relation .
Now, we find an upper bound the last two terms in (57).
[TABLE]
where (a) follows since by (44), (b) from the triangular inequality, (c) by summing and subtracting within the fist term, (d) by the triangular inequality and substituting to the bound in (49), (e) by substituting to the upper bound derived in Lemma 5 (i) and to a bound similarly derived, (f) follows by noticing that , since by Assumption 3.
Similarly, for the last addend in (57), we can derive the following bound:
[TABLE]
Finally, by combining (57) with (58) and (59), we obtain an upper bound for , i.e.,
[TABLE]
Now, by substituting (60) into (56), we obtain
[TABLE]
where is the scalar (vanishing) sequence in (27), with is as in (7) and as in Assumption 7. Finally, by combining (61) and (55), we obtain the upper bound in Lemma 5 (iii).
-F Proof of Lemma 7
From , it follows that
[TABLE]
Next, we upper bound , where and are defined in (23) and (21), respectively.
[TABLE]
where the first inequality (a) follows by the nonexpansivity of the projection operator, and (b) follows by the triangular inequality and the Lipschitz continuity of (Assumption 5).
Now, consider , where and are defined in (24) and (22), respectively. By exploiting the nonexpansiveness of the projection operator, we have
[TABLE]
Finally, by combining (64) and (63) with (62) we obtain the upper bound in Lemma 7.
-G Proof of Lemma 8
By substituting the bounds on the estimation errors of Lemma 5 into the error bound in Lemma 7, we obtain
[TABLE]
where , , and are positive constants defined as , , and . Now, we show that each term on the right-hand side of (65) is summable, hence also the sequence is such, i.e., .
Term 1: To establish the convergence of , we note that , for all , by Assumption 7, implying that , since by Lemma 1.
Term 2: Since , for all (Assumption 7), the following relations hold for the second term in the right-hand side of (65):
[TABLE]
It follows by Lemma 6 (b) that , since , for all (Assumption 7) and .
Term 3: By exploting the definition of the sequence in Lemma 5, we can write
[TABLE]
By exploiting the same technical reasoning in (i) and (ii), we can show that each term on the right-hand side of the previous inequality globally converges. Therefore, we conclude that .
Term 4: Since , for all (Assumption 7), the following hold for the last term in the right-hand side of (65):
[TABLE]
It follows by Lemma 6 (b) that , since by (iii), and .
To conclude, since all the terms in the right-hand side of (65) are summable, then we have .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Jensen, “Aggregative games and best-reply potentials,” Economic Theory, Springer , vol. 43, pp. 45–66, 2010.
- 2[2] W. Saad, Z. Han, H. Poor, and T. Başar, “Game theoretic methods for the smart grid,” IEEE Signal Processing Magazine , pp. 86–105, 2012.
- 3[3] Z. Ma, D. Callaway, and I. Hiskens, “Decentralized charging control of large populations of plug-in electric vehicles,” IEEE Trans. on Control Systems Technology , vol. 21, no. 1, pp. 67–78, 2013.
- 4[4] N. Li, L. Chen, and M. A. Dahleh, “Demand response using linear supply function bidding,” IEEE Transactions on Smart Grid , vol. 6, no. 4, pp. 1827–1838, 2015.
- 5[5] J. Barrera and A. Garcia, “Dynamic incentives for congestion control,” IEEE Trans. on Automatic Control , vol. 60, no. 2, pp. 299–310, 2015.
- 6[6] S. Grammatico, F. Parise, M. Colombino, and J. Lygeros, “Decentralized convergence to Nash equilibria in constrained deterministic mean field control,” IEEE Trans. on Automatic Control , vol. 61, no. 11, pp. 3315–3329, 2016.
- 7[7] S. Grammatico, “Dynamic control of agents playing aggregative games with coupling constraints,” IEEE Trans. on Automatic Control , vol. 62, no. 9, pp. 4537 – 4548, 2017.
- 8[8] D. Paccagnan, B. Gentile, F. Parise, M. Kamgarpour, and J. Lygeros, “Nash and wardrop equilibria in aggregative games with coupling constraints,” IEEE Transactions on Automatic Control , vol. 64, no. 4, pp. 1373–1388, 2018.
