Single-timescale distributed GNE seeking for aggregative games over networks via forward-backward operator splitting
Dian Gadjov, Lacra Pavel

TL;DR
This paper introduces a novel single-timescale distributed algorithm for aggregative games with affine constraints, enabling agents with partial information to reach a variational GNE efficiently through forward-backward operator splitting.
Contribution
It presents the first single-layer, single-timescale distributed method for variational GNE seeking in aggregative games with affine coupling constraints.
Findings
Algorithm converges under constant step sizes.
Works with partial information and local communication.
Utilizes forward-backward splitting for convergence proof.
Abstract
We consider aggregative games with affine coupling constraints, where agents have partial information on the aggregate value and can only communicate with neighbouring agents. We propose a single-layer distributed algorithm that reaches a variational generalized Nash equilibrium, under constant step sizes. The algorithm works on a single timescale, i.e., does not require multiple communication rounds between agents before updating their action. The convergence proof leverages an invariance property of the aggregate estimates and relies on a forward-backward splitting for two preconditioned operators and their restricted (strong) monotonicity properties on the consensus subspace.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Single-timescale distributed GNE seeking for aggregative games over networks via forward-backward operator splitting
Dian Gadjov and Lacra Pavel This work was supported by NSERC Discovery Grant.D. Gadjov and L. Pavel are with Dept. of Electrical and Computer Engineering, University of Toronto. [email protected], [email protected]
Abstract
We consider aggregative games with affine coupling constraints, where agents have partial information on the aggregate value and can only communicate with neighbouring agents. We propose a single-layer distributed algorithm that reaches a variational generalized Nash equilibrium, under constant step sizes. The algorithm works on a single timescale, i.e., does not require multiple communication rounds between agents before updating their action. The convergence proof leverages an invariance property of the aggregate estimates and relies on a forward-backward splitting for two preconditioned operators and their restricted (strong) monotonicity properties on the consensus subspace.
1 Introduction
Research in aggregative games has surged in recent years, due to their suitability to model decision problems in various application domains: from demand-side management for the smart grid, [1] and electric vehicles, [2], to network congestion control, [3]. Aggregative games are non-cooperative games in which each (player’s) agent’s cost depends on some aggregate effect of all other agents’ actions. Often agents have shared coupling constraints, and the relevant equilibrium concept is the generalized Nash equilibrium (GNE).
Many settings involve a large number of agents with private cost functions and constraints, who are willing and able to exchange information with their neighbours only, [4], [5]. Motivated by the above, in this note we develop a single-layer/single-timescale, discrete-time distributed GNE seeking algorithm for aggregative games, which is guaranteed to converge exactly to a variational GNE when using fixed-step sizes. Agents update simultaneously their action and an aggregate-estimate, based on local communication. To the best of our knowledge, this is the first such algorithm in the literature for aggregative games. Our novel contributions are to relate the algorithm to a preconditioned forward-backward splitting of a pair of monotone operators, and develop conditions for distributed convergence.
Literature Review: The problem of finding a (generalized) Nash Equilibrium (G)NE when agents know the actions of all other agents (full-decision information) has been studied thoroughly, e.g. [6, 7, 8, 9], or via a centralized operator-splitting [10]. In recent years, a rapidly growing field originated by [11, 12] is focused on developing algorithms that relax this full-information assumption to (G)NE computation over networks. Fully distributed NE seeking algorithms have been proposed for general games with decoupled constraints, either in continuous time, e.g. [13],[14] or in discrete time (with fixed-step sizes), e.g. [15], or [16] (with fast convergence). However, these algorithms require each agent to maintain an estimate of the actions of all other agents, hence are inefficient for aggregative games. The same holds for the fully distributed GNE seeking algorithm proposed in [17] for generally coupled games. For aggregative games, existing algorithms can be classified as semi-decentralized (when a central coordinator is required), or distributed (when only local communication is used). Semi-decentralized GNE algorithms have been proposed for aggregative games in [18, 19, 20, 21] (discrete-time) and [22] (continuous-time). An elegant operator theory approach is used to show that global convergence can be achieved with fixed-step sizes under (strict) strong monotonicity of the pseudo-gradient of the game, either to a variational generalized Nash equilibrium (GNE), or to an aggregative (Wardrop) equilibrium (GAE). The algorithms require a central coordinator to broadcast the aggregate value and to ensure the coupling constraints are met.
In a distributed setting a central coordinator/node does not exist, and distributed algorithms are more difficult to develop. For aggregative games, such distributed algorithms have been mostly developed for NE seeking with no coupled constraints. The first work to propose distributed NE algorithms for aggregative games over networks was [11]. In order to cope with not knowing the aggregate value, each agent maintains an estimate of the true aggregate, built based on local communication with neighbours, and uses it in the action update instead of the true aggregate. The algorithm proposed in [11] requires diminishing step-sizes for exact convergence, while only guaranteeing convergence to a neighbourhood of the NE when using fixed-step sizes, under (strict) strong monotonicity of the pseudo-gradient of the game. The recent algorithm proposed in [23] requires an increasing number of communication rounds before each action update. In both cases, the NE seeking algorithms effectively operate as if on two time-scales (fast aggregate estimate, slow action update) with each agent using an estimate that is near the true aggregate.
Fewer results exist for distributed GNE seeking in aggregative games with coupled constraints. In fact, the only such distributed discrete-time algorithm for aggregative games that we are aware of is the one proposed in [24]. The algorithm requires that players exchange information for a fixed number of communication rounds before every action update and is only guaranteed to reach an -GNE. The -GNE approaches a variational GNE if the fixed number of communication rounds between action updates goes to infinity, which is impractical. A continuous-time dynamics for aggregative games with equality constraints is proposed in [25]. However, convergence in continuous-time does not guarantee convergence in discrete-time.
*Contributions: * Motivated by the above, in this paper we propose a distributed, single-layer discrete-time GNE seeking algorithm for aggregative games with affine coupling constraints, that has guaranteed exact convergence to a variational GNE when using constant step-sizes. To the best of our knowledge, to date there does not exist such an algorithm tailored for aggregative games. We note that the algorithm recently proposed in [17] for generally coupled games is applicable to aggregative games, but because it does not exploit the aggregative structure is computationally inefficient when used for these types of games. In our algorithm, each agent maintains an estimate of the aggregate and multiplier, and exchanges them with its neighbours over an undirected static connected graph, in order to learn the true aggregate value and ensure that the coupling constraints are satisfied. Each agent updates its action and its aggregate estimates in the same iteration. Compared to the distributed NE algorithm in [11], and the GNE algorithm in [24] for aggregative games, our algorithm does not require diminishing step-sizes nor multiple communication rounds between each action update. It is related to a preconditioned forward-backward operator splitting iteration, inspired by the distributed framework for GNE seeking in general games conceptualized in [26] for full-decision information, and extended in [17] to partial-decision information. We note that the algorithm in [17] requires each agent to keep an estimate of all other agents’ actions; if applied to aggregative games where the coupling is only through the average, this can be not scalable/inefficient.
The distributed algorithm we develop here exploits the aggregative coupling structure in the cost. Each agent exchanges and maintains only an estimate of the aggregate (dimensionally independent on the number of agents, hence scalable) and an estimate of the dual multiplier. Thus players do not need to share action information, which might be private information. We use proof techniques similar to those in [17] with the following differences. The algorithm is tailored for the special structure of aggregative games. Unlike [17], here the estimate is a separate variable that needs to track the aggregate decision. Because of this, the update equation of the aggregate estimate has an additional correction term. This is introduced to account for the own action’s effect on the average and acts as a dynamic-tracking term. Secondly, this correction term allows us to exploit an invariance property of the aggregate estimate. This invariance property plays a critical role in ensuring convergence with fixed-step sizes and in obtaining a better bound compared to the algorithm in [17]. Thirdly, because of this correction term we need to use slightly different splitting operators and a different metric matrix, which needs handled separately. A conference version appears in [27]; here we provide all proofs and additional numerical results.
Notations. For a vector , denotes its transpose and the norm induced by inner product . For a symmetric positive-definite matrix , , and denote its minimum and maximum eigenvalues. The -induced inner product is and the -induced norm, . For a matrix , let denote the 2-induced matrix norm, where is its maximum singular value. For , denotes the stacked vector obtained from vectors , is the diagonal matrix with along the diagonal. and are the null and range space of matrix , while stands for its entry. , and denote the identity matrix, the all-ones and the all-zero vector of dimension , respectively. We may also simply use to denote an all-zero matrix of appropriate dimensions. Denote as the Cartesian product of sets , . For a function let .
2 Background
Monotone Operators
The following are from [28]. Let be a set-valued operator. The domain of is where is the empty set, and the range of is . The graph of is . The zero set of is . is called monotone if , , and strongly monotone if such that , . is maximally monotone if is not strictly contained in the graph of any other monotone operator. The resolvent of is , where is the identity operator. The fixed points of are .
An operator is nonexpansive if . , where denotes the class of -averaged operators, if and only if , . is also called firmly nonexpansive. If is maximally monotone then is firmly nonexpasive, [[28], Prop. 23.7]. Let the projection of onto be , with , where is the normal cone. If is closed and convex, is firmly nonexpansive [[28], Prop. 4.8]. is called -cocoercive if for , i.e., .
Graph Theory
Let graph describe the information exchange among a set of agents, where is the edge set. If agent can get information from agent , then and agent belongs to agent ’s neighbour set . is undirected when if and only if . is connected if there is a path between any two nodes. Let be the weighted adjacency matrix, with if and otherwise. Let , where . Assume that so the weighted Laplacian of is . When is connected and undirected, [math] is a simple eigenvalue of , , . All other eigenvalues of are positive, ordered in ascending order as , with , where is the maximal weighted degree of .
3 Game Formulation
Consider a group of agents (players) , where each player controls its local decision (action/strategy) . Denote as the decision profile of all agents. Equivalently, we also write where denotes the decision profile of all agents except player . Agent aims to optimize its objective function (coupled to other players’ decisions) with respect to its own decision over its feasible decision set . Let the globally shared, affine coupled constrained set be
[TABLE]
where is a private feasible set of player , and , is local player information. Let .
We focus on average aggregative games where the cost function of each agent depends on the average of all agents’ actions , denoted as to explictly indicate this dependency. In the remainder of the paper we use either or , depending on the context. Given the other’s actions , the objective of each player is to solve the following optimization problem with coupled constraints,
[TABLE]
A generalized Nash equilibrium (GNE) is such that
[TABLE]
Assumption 1
For each player , given any , is continuously differentiable and convex in and is a compact convex set. The constraint set is non-empty and satisfies Slater’s constraint qualification.
Assumption 1 is a standard assumption which ensures existence of a generalized Nash equilibrium (GNE).
Given the optimization problem (1) over for each agent , let its Lagrangian be defined as , where , and the dual multiplier is . Then, the KKT conditions that an optimal solution with satisfies can be written as
[TABLE]
where , with
[TABLE]
and where is the normal cone operator.
By Theorem 8, §4 in [8] when satisfies (2) for all then is a GNE of the game. Given as a GNE of game, the corresponding Lagrangian multipliers may be different for the players, i.e., . A GNE with the same Lagrangian multiplier for all agents is called variational GNE, [8], with the economic interpretation of no price discrimination and better stability/sensitivity properties, [29]. A variational GNE of the game is defined as solution of the variational inequality :
[TABLE]
where denotes the pseudo-gradient of the game defined as , i.e., the stacked vector with the partial gradients for all . solves if and only if there exists a such that the KKT conditions are satisfied, [9, §10.1],
[TABLE]
where . Assumption 1 guarantees the existence of a solution to , by [9, Corollary 2.2.5]. By [8, Thm. 9, §4], every solution of is a GNE of game. Furthermore, if together with satisfies the KKT conditions (3) for then satisfies the KKT conditions (2) with , hence is a variational GNE of game.
Given this aggregative game, our aim is to design a distributed iterative algorithm that finds a variational GNE under partial-decision information over a network.
Assumption 2
* is strongly monotone and Lipschitz continuous, i.e., and such that , , and *
Assumption 2 is commonly used in algorithms with fixed-step sizes, [11, 12, 30, 18, 19, 20], [23, 24, 25, 17, 26],[15], and guarantees that a unique variational GNE exists, [9]. We consider that each agent does not have information on the other agents actions or on the aggregate value, , and that agents communicate only locally with neighbouring agents, over a communication graph .
Assumption 3
The communication graph is undirected and connected.
Remark 1
In the remainder of the paper, to simplify notation, we assume that , . The results can be extended immediately to the case , .
4 Distributed Algorithm
In this section we present our proposed algorithm. To offset the lack of full information, each agent maintains a local estimate of the aggregate and a local multiplier , and exchanges them with its neighbours over , in order to learn the true aggregate value and the Lagrange multiplier . Each agent also maintains an additional auxiliary variable , used for the coordination of the coupling constraints and to reach consensus of the local multipliers.
Let denote the tuple with agent ’s decision variable , local aggregate estimate , local multiplier , and auxiliary variable at iteration , respectively. The goal is that over time each agent will have the same aggregate estimate, equal to the average of the agents actions, the same multiplier, and its decision will correspond to a variational GNE with the coupled constraints met. The proposed distributed algorithm is given below.
*Algorithm 1: *
where , , , , and are positive step sizes. Note that Algorithm 1 is fully distributed and instead of , each agent is using (own estimate of the aggregate) to evaluate , its own partial gradient.
To write the algorithm more compactly, let , , , . Let where and be the extended pseudo-gradient. Note that when all , , hence
[TABLE]
Thus, if in each agent is evaluating the gradient with the true action aggregate value, in each agent is using instead its own estimate of the aggregate, .
With these we can write Algorithm 1 compactly as,
[TABLE]
where , , , , , , , and is the Laplacian matrix of the graph , , , , , and .
Remark 2
The algorithm is similar to the algorithm in [17] but it is tailored for aggregative games. The update for is a projected-gradient descent of the local Lagrangian function using player’s aggregate estimate to evaluate the pseudo-gradient. Unlike [17], here the estimate is separate from the action and needs to track the aggregate of all actions. As a result, besides the consensus term it has an additional correction term to account for the action’s effect on the average. This acts as a dynamic-tracking term and, as shown in the following, leads to an invariance property of the aggregate estimate, which turns out to be critical for the convergence proof. The auxiliary variable is updated by a discrete-time integration for the multiplier consensual error. Lastly, the update for is a combination of a projected-gradient ascent of the local Lagrangian and a proportional-integral term for the multiplier consensus error.
We prove next the important invariance property that (the average of all agents’ aggregate estimates) is always equal to the actions’ true aggregate, .
Lemma 1
Suppose Assumptions 1 and 3 hold. Then the following properties hold for iterates generated by Algorithm 1 or (4).
(i) , hence , for all .
(ii) Any fixed point of Algorithm 1 is such , for all , where and satisfy (3), and is a variational GNE.
*Proof: * See Appendix.
We exploit the invariance in Lemma 1(i) and, from (4), we next construct an auxiliary iteration with respect to the consensus subspace of the aggregate estimates, which will prove instrumental for the convergence analysis. Let denote the -dimensional consensus subspace for all agents’ aggregate estimates, i.e., and let be its orthogonal complement. Note that and . Any can be decomposed as , where and , by using the projection matrices and . Note that , where .
Any generated by (4) can be decomposed as where, by using the invariance property in Lemma 1(i),
[TABLE]
Using this decomposition together with (4), consider:
[TABLE]
where , , . The next result relates precisely iterates generated by Algorithm 1 or (4) to iterates generated by (6).
Lemma 2
Suppose Assumptions 1 and 3 hold. Then, any sequence generated by Algorithm 1 or (4) with initial conditions can be derived from some sequence generated by (6) with initial conditions , as in
[TABLE]
*Proof: * The proof follows an induction argument. Due to the initial conditions, , hence (7) holds for .
Suppose (7) holds at step . Then, from (cf. (6)) with , it follows that hence by the -update in (4), . Next, using (7) into the right-hand side of the -update in (6), yields cf. (4),
[TABLE]
Hence the first and third relations in (7) hold at step , and using them on the right-hand side of the -update in (6) yields , where is generated by (4).
Lastly, we show the second relation in (7) at step . Thus,
[TABLE]
where we used relations for at step cf. (4) and for at step cf. (7). Using , it follows that , cf. the -update in (4), and the argument is complete. Lemma 2 is instrumental in what follows. Based on it, convergence of Algorithm 1 or (4) can be established once convergence of (6) is shown. Note that any generated by (6) is such that , for all . We show next that (6) can be written as a preconditioned forward-backward iteration,
[TABLE]
where , for two operators and and matrix defined as,
[TABLE]
where is an all-zero matrix of appropriate dimension.
Lemma 3
Suppose that Assumptions 1, 2 and 3 hold and let , , and be defined as in (10). Suppose that and is maximally monotone. Then the following hold:
(i): Iterates (6) are equivalently written as (9), i.e.,
[TABLE]
where and .
(ii) Any fixed point of (6) is a zero of and a fixed point of . Furthermore, any such point satisfies , , , where and satisfy the KKT conditions (3), hence is a variational GNE.
*Proof: * (i) The equivalence for , , and between (9) and (6) can be shown by expanding (9) with , and as in (10), cancelling terms. Similarly, for the update, by expanding (9), replacing the term with cf. (6), using , cancelling terms and using , yields (6). Since by assumption, (9) is equivalent to , which can be written as (11), based on the fact that is singled-valued (cf. [28, Prop. 23.7] by maximally monotone).
(ii): Let be the fixed point of (6) or (9), hence is a fixed point of . By continuity, the following equivalences hold, . Thus, using (10), it follows that satisfies: , , , and By Assumption 3, the second and third relations imply that (since ) and , for a . Using these in the first and fourth relations together with , leads to which, using , is the first relation in (3). From , premultiplying by as in the proof of Lemma 1(ii), the second one in (3)holds, so is a variational GNE.
Remark 3
Lemma 2 and 3 show that Algorithm 1 is related to a forward-backward iteration (9) or (11), [28], with a preconditioning matrix . The symmetric matrix is needed, as pointed out in [26] and [17], to be able to distribute the backward step. Our proof techniques are similar to those in [17], however, the operators and are slightly different, while the metric matrix has a different structure, with a new block involving the projection matrix which needs handled separately. One of the reasons is that in the general setup in [17] the action is part of the estimate vector, while in our setup the actions and the aggregate estimate are separate and only consensus on the aggregate is needed.
5 Convergence Analysis
In this section we show convergence of Algorithm 1. Based on Lemma 2 its convergence can be established once convergence of (6) is shown. In turn, (6) is equivalent to (9) or (11), (cf. Lemma 3). Convergence of iteration (11) is guaranteed when is a cocoercive and is a monotone operator, cf. Theorem 25.8 in [28]. However, is defined in terms of the extended pseudo-gradient for which monotonicity properties are not guaranteed to hold on the augmented space of actions and aggregate estimates (only strong monotonicity of is assumed). Instead, we will show that satisfies a *restrictive cocoercive * property and that is maximally monotone, and then similar properties for and . This turns out to be sufficient to prove convergence because the restrictive property is with respect to the aggregate consensus subspace where the zeros of lie, cf. Lemma 3(ii). To show the *restrictive cocoercive * property of we balance the strong monotonicity of the pseudo-gradient (on the aggregate consensus subspace ) with that of the Laplacian on its orthogonal component , under a Lipschitz assumption on the extended pseudo-gradient .
Assumption 4
The extended pseudo-gradient is Lipschitz continuous in both arguments, i.e., , s.t.
**
**
Assumption 4 is not restrictive and is also used in other distributed (G)NE algorithms over networks, e.g. [11], [24], [15]. A sufficient condition for it to hold is that is C1 with bounded Jacobian. For quadratic games, it is automatically satisfied.
The following lemma establishes a (restricted) monotonicity property on part of the operator, (10), denoted .
Lemma 4
Let be defined as,
[TABLE]
where . Suppose that Assumptions 1 to 4 hold. Then, for any and any with ,
[TABLE]
where
[TABLE]
Furthermore, is restricted monotone if and strongly monotone if the inequality is strict.
*Proof: * With as in (12), for any , with , the left-hand side of (13) is written as
[TABLE]
Note that by and Assumption 2. Also,
[TABLE]
by Assumption 4. Since , ,
[TABLE]
by Assumption 3. Using these inequalities into (14) yields
[TABLE]
from which (13) follows.
Remark 4
The condition in Lemma 4 is critical to ensure the restricted monotonicity of and the convergence of the proposed method, and the parameter provides the extra degree of freedom to satisfy it. Lemma 4 shows that is monotone when restricted to a subspace where , i.e., when . This means that restricted monotonicity is with respect to the -dimensional aggregate consensus subspace (independent of ). This is unlike [17] where the restricted monotonicity is with respect to the -dimensional consensus subspace (cf. Lemma 3 in [17]). In fact, the bound in (13) is tighter than the one in Lemma 3 in [17] (or Theorem 5 in [14]) and does not depend on (number of agents). We emphasize that we are able to obtain this better bound due to the invariance property of the aggregate estimate. The invariance property is induced by the extra term in the aggregate estimate update in Algorithm 1, that is not present in the algorithm in [17].
Using the fact that is restricted monotone we can now show properties for the operators and , (10).
Lemma 5
Suppose Assumptions 1 to 4 hold and . Then the following hold for operators and , (10).
(i) is maximally monotone.
(ii) is -restricted cocoercive: for any with and any with , the following holds
[TABLE]
where , .
*Proof: * (i): is written as the sum of two operators, one being a Cartesian product of normal cone operators, hence maximally monotone operator, and the other one being a skew-symmetric matrix, hence also maximally monotone, with full domain. Thus itself is maximally monotone, [28].
(ii): Let , and (12). Then
[TABLE]
Using (13) and for the second term, it follows that for any and ,
[TABLE]
On the other hand, using as in (12), Lipschitz properties of , bounds on the eigenvalues of the Laplacian matrix , we obtain
[TABLE]
where . Using this in the foregoing leads to the inequality in (ii).
The following lemma shows how agents can select step sizes independently such that .
Lemma 6
Given any and , if step sizes are selected such that
[TABLE]
where , then the matrix and .
*Proof: * See the Appendix.
The next result shows that and satisfy a monotonicity property in the -induced norm.
Lemma 7
Suppose Assumptions 1 to 4 hold and . Take any where is from Lemma 5, and step sizes , and chosen to satisfy Lemma 6. Then under the -induced norm the following hold:
(i) is maximally monotone and .
(ii) is -restricted cocoercive and is restricted nonexpansive and furthermore, the following holds for any and any ,
[TABLE]
*Proof: * The proof is based on properties of and in Lemma 5, by Lemma 6 and resolvent properties for maximally monotone operators, [28], similar to Lemma 7 in [26] and Lemma 6 in [17] and is omitted, due to space constraints.
After establishing these properties on operators and , we show next convergence to the variational GNE.
Theorem 1
Suppose Assumptions 1 to 4 hold and . Take any , where is as in Lemma 5, and step sizes , and chosen to satisfy Lemma 6. Then the action of each player generated by Algorithm 1 converges to the corresponding component in the GNE , all agents’ aggregate estimates converge to the same value (true aggregate) for all , all agent’s local multipliers converge to the same multiplier for all , corresponding to the KKT condition (3).
*Proof: * Lemma 2 shows that any sequence generated by Algorithm 1 or (4) can be obtained by some sequence generated by (6), via , cf. (7). Thus, if we show that any sequence of (6) converges to , then convergence of any sequence of Algorithm 1 follows, and moreover, by (7), its limit is , where .
To show convergence of recall that, by Lemma 3(i), we can write any of (6) as , where and and, by Lemma 3(ii), any fixed point is a fixed point of and satisfies . By Lemma 7, is restricted nonexpansive and the inequality in Lemma 7(ii) holds with respect to the consensus subspace where , and . Thus operators and satisfy restricted average properties as in Lemma 6 in [17]. Then, using an argument as in the proof of Theorem 25.8 in [28] for averaged operators (see proof of Theorem 2 in [17]), it follows that any sequence converges to a fixed-point of , which by Lemma 3(ii), satisfies , , , where is the variational GNE and the corresponding multiplier. By Lemma 2, any generated by Algorithm 1 or (4) converges, and, by (7), its limit is , hence converges to , the variational GNE.
Remark 5
Given a globally known such that , (depending on as in Lemma 5) and , players can independently select step sizes , , to satisfy the upper bounds in Lemma 6. The result in Theorem 1 provides conditions for distributed convergence of the GNE seeking algorithm on a single time-scale. This is more challenging to establish than in a semi-decentralized setting where the true aggregate is provided by a coordinator. In a distributed setting, when each agent updates action and estimates the aggregate simultaneously, effectively each has to track the changing aggregate (which depends on the other agents’ changing actions) while also updating its own action. Herein, we achieve distributed convergence on a single time-scale by using operator-splitting techniques and balancing the lack of monotonicity of the pseudo-gradient off the consensus subspace with monotonicity of the Laplacian.
6 Numerical Simulations
In this section we consider a Nash-Cournot game over a network, as in [14][11][24], for a single market with production constraints and globally coupling market capacity constraints, where , , , . The cost function for each agent is , where is the production cost and is the demand price. The variational GNE is on the boundary at . For this cost function , . We compare the results with those of the algorithm in [24], in which each agent performs communication rounds to update the local multiplier, then updates its action, followed by another communication rounds to update its aggregate estimate. The algorithm in [24] converges to an -GNE, which approaches the variational GNE if goes to infinity. For simulation we chose and step size , which means that there are communication rounds before an action update, unlike Algorithm 1 where only are needed. We compare them for a star and for a ring communication network topology, respectively. All initial conditions are set to 0. For Algorithm 1 we set the weighted adjacency matrix, if , while for the algorithm [24] the mixing matrix is (star graph) and (ring graph).
6.1 Star Communication Graph
For a star communication graph topology and , and we can use in Lemma 4, which yields , , . We set , , for which the bounds on the step sizes in Lemma 6 are , , . Although the step sizes can be different for each agent, for simplicity they are taken to be the same: , . The agents’ production plots are shown in 1 for Algorithm 1 and in 2 for the algorithm in [24], respectively, where the x-axis shows the communication iterations. In the plots above the variational GNE components are denoted by stars on the y-axis to the right. In 3 we compare the normalized error, of agents’ actions from the variational GNE for the two algorithms. We see that the algorithm in [24] converges to an -GNE (if is taken larger, one can get arbitrary close to the true GNE).
We note that in most cases the number of communications is the most important, but there may be situations where communications are cheap and number of action updates might be more relevant. In 4 we plot the evolution of production for algorithm [24], where this time the x-axis counts the number of action updates. Thus, every action iteration in 4 corresponds to 400 communication iterations (). The variational GNE components are denoted by stars on the y-axis to the right. The plots for Algorithm 1 versus number of action updates look similar to those in 1 (2 communications/action update) and we omit them.
6.2 Ring Communication Graph
For a ring communication graph and , we use which yields , , . We set , , for which the bounds on the step sizes in Lemma 6 are , , . For simplicity the step sizes are taken to be the same: , . The agents’ production plots are shown in 5 for Algorithm 1 and in 6 for the algorithm in [24], where the x-axis counts the number of communications. 7 compares the normalized error, from the variational GNE for the two algorithms. Compared to the star topology, it can be seen that convergence is slower. Thus convergence is affected by the network topology, (lower connectivity leads to slower convergence). On the other hand, we note that the bounds on the step sizes in Lemma 6 are conservative. In 8 we plot the normalized error results when Algorithm 1 is run with ten times larger step sizes, indicating fast convergence to the variational GNE.
7 Conclusion
In this paper we proposed a distributed algorithm that globally converges to a variational GNE in aggregative games with affine coupling constraints. The algorithm employs simultaneous action and aggregate estimate update, based on local communication, and uses fixed-step sizes. We proved its convergence by forward-backward operator splitting for two preconditioned operators. We specifically exploited the invariance of the estimate average to show that the operators are restricted monotone. As future directions we can mention extension to time-varying and directed communication graphs.
Appendix
*Proof of Lemma 1: * (i) Note that from (4), using , it follows that for all . Since , the first claim follows by induction, and the second one follows by using , .
(ii) Let be a fixed point of Algorithm 1 or (4). Then, i.e., which, by Assumption 3 implies that for all , for some . From the update for in (4) if follows that i.e., which by Assumption 3 implies that for all , for some , hence . Using part (i) it follows that , i.e., in steady-state all agents have the same estimate equal to the action aggregate value. From the update of in (4) it follows that , using the fact that . With , this yields , where . Since , , and with , , , this yields component-wise, which is the first KKT condition, (3). From the update for in (4) it follows that, , which with , , leads to or , for some with , . Premultiplying by and using (by Assumption 3) and , , yields , or , i.e., (by Corollary 16.39 in [28]), which gives the second KKT condition, (3). Therefore, is a variational GNE and its multiplier.
*Proof of Lemma 6 * The proof is based on the Schur complement and a diagonal dominance argument. Firstly, given any , , let the step sizes be selected as in the statement, and let , or in vector form, , where . Using in (10) we can write where
[TABLE]
[TABLE]
and is an all-zero matrix of appropriate dimension.
Since , using Schur’s complement, the top-left block matrix in is positive semi-definite if
[TABLE]
or, equivalently, if which holds since is a projection matrix. Thus is positive semi-definite.
By Ghershgorin’s theorem, a sufficient condition for to be positive semi-definite is to be diagonally dominant, i.e.,
[TABLE]
for all . Since , if and are selected as in the lemma statement, it follows that all inequalities in (16) hold, hence is positive semi-definite. Thus, is positive semi-definite, and therefore is positive definite.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] W. Saad, Z. Han, H. Poor, and T. Başar, “Game theoretic methods for the smart grid,” IEEE Signal Processing Magazine , pp. 86–105, 2012.
- 2[2] Z. Ma, S. Zou, L. Ran, X. Shi, and I. Hiskens, “Efficient decentralized coordination of large-scale plug-in electric vehicle charging,” Automatica , vol. 69:35-47, 2016.
- 3[3] J. Barrera and A. Garcia, “Dynamic incentives for congestion control,” IEEE Trans. on Automatic Control , vol. 60(2):299-310, 2015.
- 4[4] C. Tekin, M. Liu, R. Southwell, J. Huang, and S. Ahmad, “Atomic congestion games on graphs and their applications in networking,” IEEE Trans. on Networking , vol. 20(5):1541-1552, 2012.
- 5[5] K. Bimpikis, S. Ehsani, and R. Ilkilic, “Cournot competition in networked markets,” Proc. 15th ACM Conf. Econ. & Comput. , p. 733, 2014.
- 6[6] K. Arrow and G. Debreu, “Existence of an equilibrium for a competitive economy,” Econometrica , vol. 22(3):265-290, 1954.
- 7[7] T. Başar and G. Olsder, Dynamic Noncooperative Game Theory, 2nd Edition . Society for Industrial and Applied Mathematics, 1999.
- 8[8] F. Facchinei and C. Kanzow, “Generalized Nash equilibrium problems,” Annals of Operat. Research , vol. 175, no. 1, pp. 177–211, 2010.
