A distributed primal-dual algorithm for computation of generalized Nash equilibria with shared affine coupling constraints via operator splitting methods
Peng Yi, Lacra Pavel

TL;DR
This paper introduces a distributed primal-dual algorithm based on operator splitting methods for computing generalized Nash equilibria with shared affine constraints in networked noncooperative games, ensuring convergence under mild conditions.
Contribution
It develops a novel distributed primal-dual algorithm utilizing operator splitting for variational GNE computation with shared constraints, allowing decentralized implementation and convergence guarantees.
Findings
Algorithm converges to variational GNE under fixed step-sizes.
Distributed approach requires only local information and neighbor communication.
Numerical simulations demonstrate the algorithm's efficiency in network Cournot competition.
Abstract
In this paper, we propose a distributed primal-dual algorithm for computation of a generalized Nash equilibrium (GNE) in noncooperative games over network systems. In the considered game, not only each player's local objective function depends on other players' decisions, but also the feasible decision sets of all the players are coupled together with a globally shared affine inequality constraint. Adopting the variational GNE, that is the solution of a variational inequality, as a refinement of GNE, we introduce a primal-dual algorithm that players can use to seek it in a distributed manner. Each player only needs to know its local objective function, local feasible set, and a local block of the affine constraint. Meanwhile, each player only needs to observe the decisions on which its local objective function explicitly depends through the interference graph and share information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A distributed primal-dual algorithm for computation of generalized Nash equilibria with shared affine coupling constraints via operator splitting methods
Peng Yi, Lacra Pavel
Department of Electrical and Computer Engineering, University of Toronto
[email protected], [email protected]
Abstract
In this paper, we propose a distributed primal-dual algorithm for computation of a generalized Nash equilibrium (GNE) in noncooperative games over network systems. In the considered game, not only each player’s local objective function depends on other players’ decisions, but also the feasible decision sets of all the players are coupled together with a globally shared affine inequality constraint. Adopting the variational GNE, that is the solution of a variational inequality, as a refinement of GNE, we introduce a primal-dual algorithm that players can use to seek it in a distributed manner. Each player only needs to know its local objective function, local feasible set, and a local block of the affine constraint. Meanwhile, each player only needs to observe the decisions on which its local objective function explicitly depends through the interference graph and share information related to multipliers with its neighbors through a multiplier graph. Through a primal-dual analysis and an augmentation of variables, we reformulate the problem as finding the zeros of a sum of monotone operators. Our distributed primal-dual algorithm is based on forward-backward operator splitting methods. We prove its convergence to the variational GNE for fixed step-sizes under some mild assumptions. Then a distributed algorithm with inertia is also introduced and analyzed for variational GNE seeking. Finally, numerical simulations for network Cournot competition are given to illustrate the algorithm efficiency and performance.
keywords:
Network system; generalized Nash equilibrium; multi-agent systems; distributed algorithm; operator splitting;
††journal: Elseviert1t1footnotetext: This work was supported by NSERC Discovery Grant (261764).
1 Introduction
Engineering network systems, like power grids, communication networks, transportation networks and sensor networks, play a foundation role in modern society. The efficient and secure operation of various network systems relies on efficiently solving decision and control problems arising in those large scale network systems. In many decision problems, the nodes can be regarded as agents that need to make local decisions possibly limited by the shared network resources within local feasible sets. Meanwhile, each agent has a local cost/utility function to be optimized, which depends on the decisions of other agents. The traditional manner for solving such decision problems over networks is the centralized optimization approach, which relies on a control center to gather the data of the problem and to optimize the social welfare (usually taking the form of the sum of local objective functions) within the local and global constraints. The centralized optimization approach may not be suitable for decision problems over large scale networks, since it needs bidirectional communication between all the network nodes and the control center, it is not robust to the failure of the center node, and the computational burden for the center is unbearable. It is also not preferable because the privacy of each agent might be compromised when the data is transferred to the center. Recently, a distributed optimization approach is proposed as an alternative methodology for solving decision problems in network systems (Yi, Hong & Liu (2016), Shi, Ling, Wu and Yin (2015) and Zeng, Yi, Hong & Xie (2016)). In the distributed optimization approach, the data is distributed throughout the network nodes and there is no control center, and each agent in the network can just utilize its local data and share information with its neighbour agents to compute its local decision that corresponds to the optimal solution of the social welfare optimization problem. Therefore, the distributed optimization approach overcomes the drawbacks of the centralized optimization approach by decomposing the data, computation and communication to each agent. Moreover, each agent has the authority and autonomy to formulate its own objective function without worrying about privacy leaking out. However, both approaches adopt the same solution concept, that is the optimal social welfare solution with local and global constraints, as the solution criterion of decision problems in network systems.
However, optimal solutions of social welfare may not be proper solution concepts in many applications. In fact, with the deregulation and liberalization of markets, there is no guarantee that the agents will not deliberately deviate from their local optimal solutions to increase (decrease) own local utility (cost), possibly by deceiving to utilize more network resources. In this paper, we consider the game theoretic approach where each agent in the network has its own local autonomy and rationality. In such a setup of multiple interacting rational players making decisions in a noncooperative environment, Nash Equilibrium (NE) is a more reasonable solution. In an NE, no player can increase (decrease) its local utility (cost) by unilaterally changing its local decision, therefore, no agent has the incentive to deviate from it. In other words, NE is a self-enforceable solution in the sense that once NE is computed all the agents will execute that NE. Recently, there has been increasing interest in adopting game theory and NE as the modeling framework and solution concept for various network decision problems, like wireless communication systems (Scutari,Palomar, Facchinei, and Pang, (2010) and Menache & Ozdaglar (2011)), network flow control (Alpcan & Basar (2005)), optical networks (Pan & Pavel (2009)) and smart grids (Ye & Hu (2016)).
Moreover, in engineering network systems, not only the local objective function of each agent depends on other players’ decisions, but also the feasible set of each local decision could depend on other agents’ decisions, because the agents may compete for the utilization of some shared or common network resources like bandwidth, spectrum or power. This type of network decision problems can be modeled as noncooperative games with coupling constraints, and generalized Nash equilibrium (GNE) can be adopted as its solution. The study of GNE dates back to the social equilibrium concept proposed by Debreu (1952), and flourished in the last two decades with applications in practical problems like environment pollution games (Krawczyk & Uryasev (2000)), power market design (Contreras, Klusch, Krawczyk (2004)), optical networks (Pavel (2007)), wireless communication (Facchinei & Pang (2010)). Interested readers can refer to Facchinei & Kanzow (2010) for a historical review of GNE, and refer to Fischer, Herrich & Schonefeld (2014) for recent developments, and to Facchinei & Pang (2010) for a technical treatment.
Even though NE or GNE is a reasonable and expectable solution as a result of multiple rational agents making decisions in a noncooperative manner, how to arrive at an NE (GNE) is by no way a self-evident task. Each player needs to know the complete game information, including objective functions and feasible decision sets of all the other agents, in order to compute NE in an introspective manner. It gets much more complicated for computing GNE, because the agents also have to consider the coupling in the feasible decision sets. In fact, as of yet there is no universal manner to efficiently compute GNE in games with coupling constraints (Harker (1991) and Fischer, Herrich & Schonefeld (2014)), except for games with shared coupling constraints (Facchinei, Fischer & Piccialli (2007)). Moreover, for games in large scale network systems, it is quite unrealistic and undesirable to assume that each agent could have the complete information of the whole network, because this implies prohibitive communication and computation burden and no privacy protection. Therefore, each player (agent) should compute its local decision corresponding with an NE or GNE in a distributed manner, somehow resembling the distributed optimization approach. In other words, each agent should only utilize its local objective function, local feasible set and possible local data related to coupling constraints, and should only share information with its neighbouring agents to compute its local decision in the NE (GNE). This turns out to be an emerging research topic and gets studied in Salehisadaghiani & Pavel (2016a), Koshal,Nedić & Shanbhag (2016), Parise, Gentile, Grammatico & Lygeros (2015), Ye & Hu (2016) and Swenson, Kar & Xavier (2015), etc.
Motivated by the above, in this work we consider a distributed algorithm for iterative computation of GNE in noncooperative games with shared affine coupling constraints over network systems. The considered noncooperative game has each agent’s local objective function depending on other agents’ decisions as specified by an interference graph, and has also an affine constraint shared by all agents, coupling all players’ feasible decision sets. The considered game model covers many practical problems, like the power market model in Contreras, Klusch, Krawczyk (2004), environment pollution game in Krawczyk & Uryasev (2000), power allocation game in communication systems in Yin, Shanbhag & Mehta (2011). A (centralized) numerical algorithm was recently studied in Schiro, Pang & Shanbhag (2013) for quadratic objective functions and in Dreves & Sudermann (2016) for linear objective functions. Generally speaking, the GNE of the considered game may not be unique. In this work, we adopt the variational GNE, that corresponds with the solution of a variational inequality proposed in Facchinei, Fischer & Piccialli (2007), to be a refinement GNE solution. The variational GNE is a particular type of the normalized equilibrium proposed in Rosen (1965), and enjoys a nice economical interpretation that all the agents have the same shadow price for shared network resources without any discrimination as pointed in Kulkarni & Shanbhag (2012). Furthermore, the variational GNE enjoys a sensitivity and stability property (Facchinei & Pang (2010) and Facchinei & Kanzow (2010)), hence we adopt it as the desirable solution.
We propose a new type of distributed algorithm that agents can use to compute the variational GNE by only manipulating their local data and communicating with neighbouring agents. Observing that the KKT condition of the corresponding variational inequality requires all agents to reach consensus on the multiplier of the shared affine constraint, we introduce a local copy of the multiplier and an auxiliary variable for each player. To enforce the consensus of local multipliers, we use a reformulation that incorporates the Laplacian matrix of a connected graph. Motivated by the forward-backward operator splitting method for finding zeros of a sum of monotone operators (refer to Bauschke & Combettes (2011)) and the recent primal-dual algorithm proposed in Condat (2013) for optimization problems with linear composition terms, we propose a novel distributed algorithm for iterative computation of GNE. The main idea is to introduce a suitable metric matrix and to split the equivalent reformulation into two monotone operators. An operator splitting method has been adopted for NE computation in a centralized manner in Briceno-Arias & Combettes (2013). A different splitting idea is adopted here appropriate for distributed GNE computation. Moreover, a distributed algorithm with inertia is also proposed and investigated, motivated by the acceleration algorithms in Alvarez & Attouch (2001), Attouch, Chbani, Peypouquet, and Redont (2016), Iutzeler & Hendrickx (2016) and Lorenz & Pock (2015), most of which only focused on optimization problems. The convergence of the proposed algorithms is verified under suitable fixed step-size choice and some mild assumptions on the objective functions and communication graphs.
The recent works of Zhu & Frazzoli (2016), Yu, van der Schaar & Sayed (2016), Liang, Yi and Hong (2016) and Paccagnan, Gentile, Parise, Kamgarpour & Lygeros (2016) are closely related with this work since all of them are concerned with the distributed algorithm for seeking GNE of noncooperative games with coupling constraints. Zhu & Frazzoli (2016) address the GNE seeking for the case where each player has non-shared local coupling constraints. Assuming that each player can observe other players’ decisions on which its local objective function and local constraint functions depend through the interference graph, Zhu & Frazzoli (2016) propose a distributed primal-dual GNE seeking algorithm based on variational inequality methods, and show algorithm convergence under diminishing step-sizes. Yu, van der Schaar & Sayed (2016) investigate the distributed GNE seeking under stochastic data observations. The authors assume that the coupling constraints have a locally shared property that if one player has its one of local constraints dependent on the decision of another player, then this constraint must be shared between those two players. Their algorithm design is based on a penalty-type gradient method. Under the assumption that each player can observe the decisions on which its local objective function and constraint functions depend, Yu, van der Schaar & Sayed (2016) utilize a gradient type algorithm to seek the pure NE of the game derived by penalizing the coupling and local constraints. They show that their algorithm can reach a region near the pure penalized NE with a constant step-size, which will approach a GNE if the penalizing parameter goes to infinity. Both Liang, Yi and Hong (2016) and Paccagnan, Gentile, Parise, Kamgarpour & Lygeros (2016) consider the distributed algorithm for seeking a variational GNE of the aggregative game with globally shared affine coupling constraints. This represents a particular type of game where the players’ local objective functions depend on some aggregative variables of all agents’ decisions. Liang, Yi and Hong (2016) assume that each player has local copies of both the aggregative variables and the multipliers, and combine a finite-time convergent continuous-time consensus dynamics and a projected gradient flow to derive their distributed GNE seeking dynamics. Meanwhile, Paccagnan, Gentile, Parise, Kamgarpour & Lygeros (2016) adopt the asymmetric projection algorithm for variational inequalities to design their variational GNE seeking algorithm. However Paccagnan, Gentile, Parise, Kamgarpour & Lygeros (2016) assume that there is an additional central node for the update of the common multiplier, and only address quadratic objective functions.
Compared with these works, our paper has following contributions,
(i): The considered noncooperative game model is completely general, thus a generalization of the aggregative game in Liang, Yi and Hong (2016) and Paccagnan, Gentile, Parise, Kamgarpour & Lygeros (2016). We further assume that the shared affine coupling constraint is also decomposed such that each player only knows its local contribution to the global constraint, that is only a sub-block matrix of the whole constraint matrix. In this sense, no player knows exactly the shared constraints, hence, our problem model is also different from the ones in Yu, van der Schaar & Sayed (2016) and Zhu & Frazzoli (2016). The decomposition of the coupling constraints, together with the localization of player’s local objective function and local feasibility set, is quite appealing for iterative computation of GNE in large-scale network systems because this reduces the data transmission and computation burden, and protects the players’ privacies.
(ii):The proposed distributed algorithms can compute the variational GNE iteratively under a more localized data structure and information observing structure compared to previous ones. Firstly, each player only utilizes the local objective function and local feasible set, and its local sub-block matrix of the affine constraints. Secondly, we assume the players have two (different) information observing graphs, i.e., interference graph and multiplier graph. Each player only needs to observe the decisions that its local objective function directly depends on through the interference graph. This type of information observation assumption has also been adopted in Yu, van der Schaar & Sayed (2016) and Zhu & Frazzoli (2016). Meanwhile, each player only needs to share information related to multipliers with its neighbouring agents through another multiplier graph. Here it is not required that each player should know the decisions that coupling constraints depend on, as assumed in Yu, van der Schaar & Sayed (2016) and Zhu & Frazzoli (2016). Therefore, our information sharing (observing) structure is more localized and sparse.
(iii): The algorithm development and convergence analysis is motivated by the operator splitting method (Bauschke & Combettes (2011)), different from the penalized method adopted in Yu, van der Schaar & Sayed (2016) and the variational inequality approach in Liang, Yi and Hong (2016), Zhu & Frazzoli (2016) and Paccagnan, Gentile, Parise, Kamgarpour & Lygeros (2016). Based on this operator splitting approach, we prove the algorithm converges to the variational GNE under fixed step-sizes. Note that neither convergence nor non-bias estimation is achieved in Yu, van der Schaar & Sayed (2016) under fixed step-sizes, while Zhu & Frazzoli (2016) achieve convergence with diminishing step-sizes. On the other hand, compared with Briceno-Arias & Combettes (2013), this paper addresses the GNE seeking under coupling constraints, adopts a different splitting technique, and achieves fully distributed computations. The operator splitting method is powerful and provides additional insights. Moreover, a distributed algorithm with inertia is proposed and analyzed, resembling the acceleration algorithms in optimization (Nesterov (2013) and Iutzeler & Hendrickx (2016)). The algorithm performance is illustrated via numerical experiments of network Cournot competitions with bounded market capacities.
The paper is organized as follows. Section 2 gives the notations and preliminary background. Section 3 formulates the noncooperative game and gives the distributed algorithm for iterative computation of a GNE. Section 4 shows how the operator splitting method motivates the algorithm development, and Section 5 presents the algorithm convergence analysis. Then a distributed GNE seeking algorithm with inertia is proposed and analyzed in Section 6. Finally, a network Cournot competition with bounded market capacities is formulated with numerical studies in Section 7, while concluding remarks are given in Section 8.
2 Notations and preliminary background
In this section, we review the notations and some preliminary notions in monotone operators and graph theory.
Notations: In the following, () denotes the dimesional (nonnegative) Euclidean space. For a column vector (matrix ), () denotes its transpose. denotes the inner product of , and denotes the norm induced by inner product . Given a symmetric positive definite matrix , denote the -induced inner product . The -matrix induced norm, , is defined as . Denote by any matrix induced norm in the Euclidean space. Denote and . For column vectors , is understood componentwise. represents the block diagonal matrix with matrices on its main diagonal. and denote the null space and range space of matrix , respectively. Denote as the column vector stacked with column vectors . denotes the identity matrix in . For a matrix , or stands for the matrix entry in the th row and th column of . We also use to denote the th element in column vector . Denote or as the Cartesian product of the sets . Denote as the interior of , and as the boundary set of . Define the projection of onto a set by . A set is a convex set if . An extended value proper function is a convex function if .
2.1 Monotone operators
The following concepts are reviewed from Bauschke & Combettes (2011). Let be a set-valued operator. Denote as the identity operator, i.e, . The domain of is where stands for the empty set, and the range of is . The graph of is , then the inverse of is defined through its graph as . The zero set of operator is . Define the resolvent of operator as . An operator is called monotone if , we have Moreover, it is maximally monotone if is not strictly contained in the graph of any other monotone operator. is single-valued and if is maximally monotone *** Proposition 23.7 in Bauschke & Combettes (2011). For a proper lower semi-continuous convex (l.s.c.) function , its sub-differential is a set-valued operator and
[TABLE]
is a maximally monotone operator *** Theorem 20.40 in Bauschke & Combettes (2011) . Then is called the proximal operator of *** Proposition 16.34 in Bauschke & Combettes (2011), i.e.
[TABLE]
Define the indicator function of set as For a closed convex set , is a proper l.s.c. function. is just the normal cone operator of set , that is ***Example 16.12 of Bauschke & Combettes (2011) and
[TABLE]
In this case, we also have ***Example 23.4 of Bauschke & Combettes (2011)
[TABLE]
For a single-valued operator , a point is a fixed point of if , and the set of fixed points of is denoted as . The composition of operators and , denoted by , is defined via its graph . We also use to denote the composition when they are single-valued. Similarly, their sum is defined as . Suppose operators and are maximally monotone and , then is also maximally monotone***Corollary 24.4 in Bauschke & Combettes (2011). Further suppose that is single-valued, then ***Proposition 25.1 in Bauschke & Combettes (2011), which helps to formulate the basic forward-backward operator splitting algorithm for finding zeros of a sum of monotone operators.
2.2 Graph theory
The following concepts are reviewed from Mesbahi & Egerstedt (2010). The information sharing or exchanging among the agents is described by graph . is the set of agents, and the edge set contains all the information interactions. If agent can get information from agent , then and agent belongs to agent ’s neighbor set , and . is said to be undirected when if and only if . A path of graph is a sequence of distinct agents in such that any consecutive agents in the sequence correspond to an edge of graph . Agent is said to be connected to agent if there is a path from to . is said to be connected if any two agents are connected.
Define the weighted adjacency matrix of with if and otherwise. Assume for undirected graphs. Define the weighted degree matrix Then the weighted Laplacian of graph is When graph is a connected and undirected graph, 0 is a simple eigenvalue of Laplacian with the eigenspace , and , , while all other eigenvalues are positive. Denote the eigenvalues of in an ascending order as , then by Courant-Fischer Theorem,
[TABLE]
Denote as the maximal weighted degree of graph , then we have the following estimation,
[TABLE]
3 Problem formulation and distributed algorithm
3.1 Game formulation
Consider a group of agents (players) that seek the generalized Nash equilibrium (GNE) of a noncooperative game with coupling constraints defined as follows. Each player controls its local decision (strategy or action) . Denote as the decision profile, i.e., the stacked vector of all the agents’ decisions where . Denote as the decision profile stacked vector of the agents’ decisions except player . Agent aims to optimize its local objective function within its feasible decision set. The local objective function for agent is . Notice that the local objective function of agent is coupled with other players’ decisions (however, may not be explicitly coupled with all other players’ decisions). Moreover, the feasible decision set of player also depends on the decisions of the other players with denoting a set-valued map that maps to the feasible decision set of agent . The aim of agent is to find the best-response strategy set given the other players’ decision ,
[TABLE]
The GNE of the game in (7) is obtained at the intersection of all the players’ best-response sets, and is defined as:
[TABLE]
Here we consider the GNE seeking in noncooperative games where the couplings between players’ feasible sets are specified by globally shared affine constraints. Denote
[TABLE]
where is a private feasible decision set of player , and with , and . Denote . Given the globally shared set (which may not be known by any agents), the following set-valued map gives the feasible decision set map of agent : , or in other words:
[TABLE]
Hence, each agent has a local feasible constraint , and there exists a coupling constraint shared by all agents with sub-matrix characterizing how agent is involved in the coupling constraint (shares the global resource). Notice that agent may only know its local , in which case the globally shared affine constraint couples the agents’ feasible decision sets, but is not known by any agents.
Remark 3.1
We consider affine coupling constraints for various reasons. Even though not as general as the nonlinear constraints considered in Pavel (2007) and Zhu & Frazzoli (2016), this setup does enjoy quite strong modeling flexibility. As pointed out in page 191 of Facchinei & Kanzow (2010), “ However, it should be noted that the jointly convex assumption on the constraints practically is likely to be satisfied only when the joint constraints are linear, i.e. of the form for some suitable matrix and vector .” In fact, many existing generalized Nash game models adopt affine coupling constraints, as well documented in Schiro, Pang & Shanbhag (2013) and Dreves & Sudermann (2016).
Assumption 1
For player , is a differentiable convex function with respect to given any fixed , and is a closed convex set. has nonempty interior point (Slater’s condition).
Suppose is a GNE of game (7), then for agent , is the optimal solution to the following convex optimization problem:
[TABLE]
Define a local Lagrangian function for agent with multiplier as
[TABLE]
When is an optimal solution to (9), there exists such that the following optimality conditions (KKT) are satisfied:
[TABLE]
These can be equivalently written in the following form by using (10) and the definition of the normal cone operator in (3)
[TABLE]
In fact, since when , it must hold that and when (12) is satisfied. Furthermore, . If , then and hence ; and if , we have hence . Therefore, and . Denote . Therefore, by Theorem 4.6 in Facchinei & Kanzow (2010) when satisfies KKT (12) for , is a GNE of the game in (7)
According to the above discussions, given as a GNE of game in (7), its corresponding Lagrangian multipliers for the globally shared affine coupling constraint may be different for the agents, i.e., . In this work, we aim to seek a GNE with the same Lagrangian multiplier for all the agents, which has a nice interpretation from the viewpoint of variational inequality.
Define
[TABLE]
which is usually called the pseudo-gradient. The variational inequality (VI) approach to find a GNE of game (7) is to find the solution of the following :
[TABLE]
Let us check the KKT condition for in (14). In fact, is a solution to in (14) if and only if is the optimal solution to the following optimization problem:
[TABLE]
According to the optimization formulation of in (15), if solves , there exists such that the following optimality conditions (KKT) are satisfied:
[TABLE]
By comparing the two sets of KKT conditions in (12) and (16) we obtain,
Theorem 3.2
***Theorem 2.1 of Facchinei, Fischer & Piccialli (2007)
Suppose Assumption 1 holds. Every solution of in (14) is a GNE of game in (7). Furthermore, if together with satisfies the KKT conditions for the , i.e., (16), then together with satisfies the KKT conditions for the GNE, i.e., (12).
The solution of in (14) is termed as a variational GNE or normalized equilibrium of the game with coupling constraints in (7). A variational GNE enjoys an economical interpretations of no price discrimination and has better stability and sensitivity properties, therefore, can be regarded as a refinement of GNE (refer to Kulkarni & Shanbhag (2012) for more discussions). This paper will propose a novel distributed algorithm for the agents to find a solution of in (14), thus provides a distributed coordination mechanism such that the coupling constraint is met and a variational GNE of the game is found.
Define two operators and , both from to as follows,
[TABLE]
By the definition of in (13), the KKT conditions (16) can be equivalently written as . Notice that is a maximally monotone operator (similar arguments for this can be found in Lemma 5.14). Hence, if has some additional properties, then solving can be converted to the problem of finding zeros of a sum of monotone operators.
Assumption 2
* defined in (13) is strongly monotone with parameter over : , and Lipschitz continuous over : .*
Remark 3.3
*Assumption 2 has also been adopted in Yu, van der Schaar & Sayed (2016), Paccagnan, Gentile, Parise, Kamgarpour & Lygeros (2016) and Zhu & Frazzoli (2016). Assumption 2 guarantees that there exists a unique solution to in (14) **Theorem 2.3.3 of Facchinei and Pang (2007), thus guarantees the existence of a GNE for game in (7). However, the GNE of (7) may not be unique. The algorithm for computing all the GNE of noncooperative games with coupling constraints is still an opening research topic, and interested readers can refer to Nabetani, Tseng, and Fukushima (2011). This work aims to provide a distributed algorithm for iterative computation of a variational GNE of the considered game, which enjoys nice economical interpretations and stability properties.
3.2 Distributed algorithm
In practice, each player only knows its private information in game (7), especially when the players interact over large scale networks. It is quite natural that each player can only know its local objective function and local feasible set , which cannot be shared with other players, because those data contain its local private information, such as cost function, preference and action ability. Moreover, matrix specifies how player participates in the resource allocation or market behavior, hence also contains private information, and can be decomposed as . Thus matrix can be regarded as a map from decision space to resource space , while vector can be regarded as a local contribution or observation for the global resource. For example, if there are total markets, and each player produces a kind of product with amount of , then satisfying and just specifies how each player allocates its production to each market. In this case, if is a local observation of market capacity vector, then the true market capacities can be taken as .
Therefore, we assume that player only knows its local , , and matrix and with and . In other words, player has a local first-order oracle of which returns given , meanwhile player can manipulate , and for its local computation.
The players need to find the solution to in (14) in a distributed manner by local information observation and sharing, hence find a variational GNE of the game in (7) without any coordinator. To facilitate the local coordination between agents, here we specify two graphs, and , related to the local information observations or exchanging between players.
Graph , termed as interference graph, is defined according to the dependence relationships between the agents’ objective functions and the other players’ decisions, which is also called graphical model for games in computer science (refer to Kearns, Littman, & Singh (2001)). For graph , if the objective function of agent , explicitly depends on the decision of player . We define as the set of interference neighbors whose decisions directly influence the objective function of player . Therefore, the objective function of player can also be written as , and the local oracle of player returns , i.e., , given . The local oracle might compute by approximating with local objective function value observations (taking the simultaneous perturbation stochastic approximation in Spall (1992) as an example), or by utilizing the estimation techniques developed in Salehisadaghiani & Pavel (2016b).
On the other hand, for the coordination of the feasibility of action sets and the consensus of local multipliers in Theorem *, we also assume that the agents can exchange certain local information through a multiplier graph . if player can receive certain information from player , while the information to be shared through will be specified later. Thereby, player has its multiplier neighbors . is the weighted adjacency matrix associated with multiplier graph , and is the corresponding weighted Laplacian matrix.
Assumption 3
* is undirected and connected. .*
Remark 3.4
We assume that each agent can observe the decisions which its local objective function directly depends on through interference graph . Therefore, player can get its local gradient . This type of local information observation model has also been adopted in Yu, van der Schaar & Sayed (2016) and Zhu & Frazzoli (2016). On the other hand, player ’s feasible decision set may depend on any other player ’s decision even if does not explicitly depend on the decision of player , i.e. . In fact, player ’s feasible decision set implicitly depends on all other players’ decisions through the globally shared affine coupling constraint: . To ensure that the globally shared coupling constraint is satisfied and all the agents have the same local multipliers, all players must coordinate which necessarily requires that multiplier graph must be connected. Therefore, and could be two different information observation or information sharing graphs because they serve for different purposes.
We are ready to present the main distributed algorithm after the introduction of algorithm notations. Agent (player) controls its local decision variable and local Lagragian multiplier . Meanwhile, we also assume that player has a local auxiliary variable for the coordinations needed to satisfy the affine coupling constraint and to reach consensus of the local multipliers . As indicated before, player can compute by observing the adversary players’ decisions that its local objective function directly depends on, that is the decisions of its interference neighbors in . On the other hand, player can also share information related to the local multiplier and local auxiliary variable with its multiplier neighbours in through multiplier network .
Next we show the distributed algorithm for agent .
Algorithm 3.5
**
[TABLE]
are fixed constant step-sizes of player , and is the weighted adjacency matrix of .*
Algorithm 3.5 runs sequentially as follows. At the iteration time , player gets by observing through interference graph , and updates with (18); meanwhile, player gets through multiplier graph , and updates by (19). Then player gets through multiplier graph and updates with (20) that also employs the most recent local information .
Intuitively speaking, (18) employs the projected gradient descent of the local Lagrangian function in (10). (19) can be regarded as the discrete-time intergration for the consensual errors of local copies of multipliers, which will ensure the consensus of eventually. In fact, a similar dynamics has been employed in distributed optimization in Lei, Chen & Fang (2016). Finally, (20) updates local multiplier by a combination of the projected gradient ascent of local Lagrangian function (10) and a proportional-integral dynamics for consensual errors of multipliers. Section 4 will give a detailed algorithm development from the viewpoint of operator splitting methods.
Algorithm 3.5 is a totally distributed algorithm and has following key features:
i). The full data decomposition and privacy protection is achieved since each player only needs to know its local objective function and local feasible set .
ii). The matrix is decomposed and each block is kept by player , hence the privacy of each player is protected because describes how player is involved in the market or competition.
iii). Each player only needs to observe the decisions which its local objective function directly depends on, and only needs to share information with its multiplier neighbours, through and , respectively. Both graphs usually have sparse edge connections, therefore, the observation or communication burden is relieved. Furthermore, the information observation related with decisions and the information sharing related with multipliers is decoupled and accomplished with different graphs and , respectively. Therefore, those two information sharing processes can work in a parallel manner, and can be designed independently.
iv). The algorithm converges with fixed step-sizes under some mild conditions, and works in a Gauss-Seidel manner that utilizes the most recent information when updating . Moreover, (18) and (19) can even be computed in parallel for player .
4 Algorithm development
In this section, we first show how Algorithm 3.5 is developed and provide the motivations behind the algorithm’s convergence analysis. Then we verify that the limiting point of Algorithm 3.5 solves the in (14), and thus finds a variational GNE of the game in (7).
Algorithm 3.5 is inspired by the forward-backward splitting methods for finding zeros of the sum of monotone operators (Bauschke & Combettes (2011)) and the primal-dual algorithm for optimization with linear composition terms by Condat (2013). The key difference are the specific operator splitting form and the augmentation of variables to achieve distributed computations. Next, we systematically show how to write Algorithm 3.5 in the form of a forward-backward operator splitting algorithm.
Let us define some notations to write Algorithm 3.5 in a compact form. Denote , , , , , , , and . where is the weighted Laplacian matrix of multiplier graph .
Using these notations, the definition of pseudo-gradient in (13) and ***Proposition 23.16 of Bauschke & Combettes (2011), Algorithm 3.5 can be written in a compact form as:
Algorithm 4.6
**
[TABLE]
**
Using the fact that in (4) and the definition of resolvent operator as , equation (21) be be written as , or equivalently,
[TABLE]
Notice that , and . Furthermore, is a cone and , hence . Therefore, (24) can be written as
[TABLE]
Moreover, is a cone, , and . Then with similar arguments, equation (23) can be written as:
[TABLE]
or equivalently,
[TABLE]
Therefore, equation(22) together with (25) and (27) can be written in a compact form as:
[TABLE]
Denote
[TABLE]
Notice that matrix is symmetric due to and .
Denote . Define the operators and as follows,
[TABLE]
Remark 4.7
Operators and in (49) can be regarded as an extension of operators and in (17) by augmenting to and introducing auxiliary variables . Moreover, operators in (49) utilize of Laplacian matrix to ensure the consensus of , and utilize to ensure the feasibility of affine coupling constraints.
The next result shows that Algorithm 3.5, or equivalently Algorithm 4.6, can be regarded as a forward-backward operator splitting method for finding zeros of a sum of operators, or an iterative computation of fixed points of a composition of operators.
Lemma 4.8
Suppose in (42) is positive definite, and operators and in (49) are maximally monotone. Denote and . Then any limiting point of Algorithm 3.5, i.e., , is a zero of and is a fixed point of .
Proof: Denote , then using (34), (42) and (49), Algorithm 3.5 can written in a compact form as follows:
[TABLE]
Since is symmetric and positive definite, we can write equation (63) as , or equivalently,
[TABLE]
Since is maximally monotone (refer to Lemma 5.16), is single-valued. Then by the definition of the inverse of a set-valued operator, Algorithm 3.5 is written as
[TABLE]
Suppose that Algorithm 3.5, or equivalently (65), converges to a limiting point . Then by the continuity of the right hand of Algorithm 3.5 (In fact, the right hand of Algorithm 3.5 is Lipschitz continuous due to Assumption 2 and the nonexpansive property of projection operator), . Therefore, any limiting point of Algorithm 3.5 is a fixed point of the composition , and Algorithm 3.5 can be regarded as an iterative computation of fixed points of .
By Theorem 25.8 of Bauschke & Combettes (2011), (65) is also the forward-backward splitting algorithm for finding zeros of a sum of monotone operators, hence for any limiting point . Since is positive definite, any limiting point , i.e., , also belongs to . In fact,
[TABLE]
Remark 4.9
The iteration is also known as Picard iteration for iteratively approximating fixed points of (refer to Berinde (2007)). Lemma 5.15 will give a sufficient condition for to be positive definite. Lemma 5.14 will give the condition that ensures and to be maximally monotone.
The next result shows that any limiting point of Algorithm 3.5, that is, any zero point of operator , is a variational GNE of game (7).
Theorem 4.10
Suppose that Assumptions 1-3 hold. Consider operators and defined in (49), and operators and defined in (17). Then the following statements hold:
(i): Given any , then solves the in (14), hence is a variational GNE of game in (7). Moreover , and the multiplier together with satisfy the KKT condition in (16), i.e., .
(ii): and .
Proof: (i): By the definition of operators , in (49), we have,
[TABLE]
Suppose that . From the second line of (66),
[TABLE]
It follows that since is the weighted Laplacian of multiplier graph and is connected due to Assumption 3.
Then by the first line of (66), combined with and , we have
[TABLE]
or equivalently,
[TABLE]
By the third line of (66) and using , it follows that
[TABLE]
This implies that there exist , such that
[TABLE]
Multiplying both sides of above equation with and combining with , we have
[TABLE]
Due to the fact that if ***Corollary 16.39 of Bauschke & Combettes (2011), we have
[TABLE]
By (68) and (69), for any , the KKT condition for in (14), i.e. (16), is satisfied for , . We conclude that solves in (14), and is a variational GNE of game (7) by Theorem *. It also follows that together with satisfy the KKT condition in (16). This also implies .
(ii) Under Assumptions 1 and 2, the considered game in (7) has a unique variational GNE , and there exists such that the KKT condition (16) is satisfied, i.e. . Therefore .
Then we need to show that there exists such that .
Take . Because , the second line of (66) is satisfied.
Since , . Using , . Therefore, the first line of (66) is satisfied with .
Moreover, with , . Then we need to show that there exists , such that the third line of (66) is satisfied. Take such that . Since and , take . Then . That is . By the fundamental theorem of linear algebra***Page 405 of Meyer (2000), and since is also symmetric. Notice that , hence . Noticing that , there exists such that the third line of (66) is satisfied with . Therefore, .
5 Convergence Analysis
In this section, we prove the convergence of Algorithm 3.5 by giving a sufficient step-size choice condition. The analysis is based on the compact reformulation (65). We will first show that all the prerequisites in Lemma 4.8 can be satisfied under suitable step-sizes. Then (65), i.e., Algorithm 3.5, can be regarded as a forward-backward splitting algorithm for finding zeros of a sum of monotone operators, or equivalently, an iterative computation of fixed points of a composition of operators.
In fact, some existing NE (GNE) algorithms can also be regarded as a type of iterative computation of fixed points of operators, such as the best-response learning dynamics (Parise, Gentile, Grammatico & Lygeros (2015)), relaxation algorithms based on Nikaido-Isoda function (Krawczyk & Uryasev (2000) and Contreras, Klusch, Krawczyk (2004)) and the proximal-best response algorithm in Facchinei & Pang (2010). Most of above works built their algorithm convergence analysis on the contractive property of underlying operators. However, the contractivity assumption on operators is usually quite restrictive. Herein we resort to the theory of averaged operators and firmly nonexpansive operators for convergence analysis. Firstly we give some basic definitions and properties of averaged operators and firmly nonexpansive operators***Chapter 4 and Chapter 20 of Bauschke & Combettes (2011). All the following results are valid in Hilbert spaces, thus they hold in Euclidean spaces with any matrix induced norm , given as a symmetric positive definite matrix. Denote by an arbitrary matrix induced norm in a finite dimensional Euclidean space.
An operator is nonexpansive if it is Lipschitzian, i.e., An operator is averaged if there exists a nonexpansive operator such that . Denote the class of averaged operators as . If , then is also called firmly nonexpansive operator.
Lemma 5.11
***Proposition 4.25 of Bauschke & Combettes (2011)
Given an operator and , the following three statements are equivalent:
(i): ;
(ii):
(iii): .
By (iii) of Lemma (*), if and only if
[TABLE]
The operator is called cocoercive (or inverse strongly monotone) if is firmly nonexpansive, i.e.,
[TABLE]
Lemma 5.12
***Theorem 18.15 of Bauschke & Combettes (2011)
For a convex differentiable function with Lipschitzian gradient, we have to be cocoercive, i.e.,
[TABLE]
Lemma * is known as Baillon-Haddad theorem, and one elementary proof can be found in Theorem 2.1.5 of Nesterov (2013).
Lemma 5.13
***Proposition 23.7 of Bauschke & Combettes (2011)
If operator is maximally monotone, then is firmly nonexpasive and is nonexpansive.
Hence, the projection operator onto a closed convex set is firmly nonexpansive since and is maximally monotone***Example 20.41 and Proposition 4.8 of Bauschke & Combettes (2011).
In the following, we analyze the maximal monotonicity of operators , in (49), the positive definite property of matrix , and the properties of operators and defined in Lemma 4.8 by giving sufficient step-sizes choice conditions, which are shown in Lemma 5.14, 5.15 and 5.16, respectively.
Lemma 5.14
Suppose Assumptions 1-3 hold. Given an Euclidean space with norm , then
(i): Operator in (49) is cocoercive with where is the maximal weighted degree of multiplier graph , i.e., , and are parameters in Assumption 2;
(ii): Operator in (49) is maximally monotone.
Proof: (i): According to the definition of in (49) and the definition of cocercive in (71), we need to prove that
[TABLE]
Notice that is the gradient of function . Moveover, is a convex function since is positive semi-definite due to Assumption 3 ***Proposition 17.10 of Bauschke & Combettes (2011). It can easily be verified that is Lipschitz continuous (notice that the eigenvalues of are just the elements in ), therefore, by Lemma *
[TABLE]
Since and by (6) where is the maximal weighted degree of multiplier graph , we have .
Meanwhile by Assumption 2, is strongly monotone and Lipschitz continuous over . By , we have
[TABLE]
Taking and adding (74) and (75) yields (73). Thus operator is cocoercive. By the definition of cocoercive in (71), operator is also monotone. Since operator is also single-valued, it is also maximally monotone.
(ii): The operator in (49) can be written as:
[TABLE]
Since is symmetric, is a skew-symmetric matrix, i.e., . Hence, is maximally monotone***Example 20.30 of Bauschke & Combettes (2011).
can be written as the direct sum of . Both and are maximally monotone as normal cones of closed convex sets. Obviously, is also maximally monotone as a single-valued operator. Furthermore, the direct sum of maximally monotone operators is also maximally monotone***Proposition 20.23 of Bauschke & Combettes (2011), hence is also maximally monotone.
Obviously, , hence is also maximally monotone*** Corollary 24.4 of Bauschke & Combettes (2011).
Lemma 5.15
Given any , if each player takes , and as its local fixed step-sizes in Algorithm 3.5 that satisfy:
[TABLE]
then matrix defined in (42) is positive definite, and is positive semi-definite.
Proof: It is sufficient to show that is positive semi-definite.
[TABLE]
where , , and . One sufficient condition for matrix to be positive semi-definite is that it is diagonally dominant with nonnegative diagonally elements, that is for every row of the matrix the diagonal entry is larger than or equal to the sum of the magnitudes of all the other (non-diagonal) entries in that row. This is equivalent to require that,
[TABLE]
It can easily be verified that if each agent chooses its local step-sizes satisfying (76), then (81) is satisfied.
Given a globally known parameter , each agent can independently choose its local step sizes ,, and with the rule given in (76).
Suppose that the step-sizes in Algorithm 3.5 are chosen such that in (42) is positive definite. Thus we can define a norm induced by matrix for a finite Euclidean space as . The next result investigates the properties of operators and defined in Lemma 4.8 under induced norm .
Lemma 5.16
Suppose Assumptions 1-3 hold. Take where is the maximal weighted degree of multiplier graph , and are parameters in Assumption 2. Take . Suppose that the step-sizes in Algorithm 3.5 are chosen to satisfy (76). Then the operators and with in (42) and in (49), and operators defined as Lemma 4.8 satisfy the following properties under the induced norm :
(i). is cocoercive, and .
(ii). is maximally monotone, and .
Proof: (i): By the definition of cocoercivity in (71), we need to prove , . Noticing that by the choice of parameters , we have that matrix is positive semi-definite from Lemma 5.15. Denote and as the maximal and minimal eigenvalues of matrix . It must hold that . Furthermore, ***Proposition 5.2.7 and 5.2.8 in Meyer (2000), therefore, we also have . Notice that the operator is single-valued and is also nonsingular, so that for any ,
[TABLE]
By the cocoercive property of in Lemma 5.14 and the above inequality,
[TABLE]
Therefore, the operator is cocoercive under the induced norm .
Moreover, is firmly nonexpansive by the definition of cocoercive operator. This implies that there exists a nonexpansive operator such that . Then
[TABLE]
since by the assumption that and is also nonexpansive.
(ii). is symmetric positive definite and nonsingular. For any and , and . Then since is monotone by Lemma 5.14. Therefore, is monotone under the matrix induced product .
Furthermore, take , and , for any other . For any , we have . , or equivalently, . Since is maximally monotone, then . We conclude that which implies that is maximally monotone.
Therefore, by Lemma * is firmly nonexpansive under the matrix induced norm .
Summarizing the above results, take , , and satisfying (76), then , and in the Euclidean space with matrix induced norm . The next result shows the convergence of Algorithm 3.5 based on its compact reformulation (65) and the properties of and .
Theorem 5.17
Suppose Assumptions 1-3 hold. Take where is the maximal weighted degree of multiplier graph , and are parameters in Assumption 2. Take . The step-sizes in Algorithm 3.5 are chosen to satisfy (76). Then with Algorithm 3.5, each player has its local strategy converging to its corresponding component in the variational GNE of game (7), and the local Lagrangian multipliers of all the agents converge to the same Lagrangian multiplier corresponding with KKT condition (16), i.e.,
[TABLE]
Proof: With Lemma 5.14 and 5.15, Algorithm 3.5 can be written in a compact form (65) according to Lemma 4.8, i.e., . The convergence analysis will be conducted via the analysis of this iterative computation of fixed points of .
Firstly, by (i) and (ii) of Lemma * and the fact that are averaged operators due to Lemma 5.16, are also nonexpansive operators under the matrix induced norm . Take any , or equivalently any fixed point of , i.e., , and then by Lemma 4.8 and (65),
[TABLE]
Hence the sequence is non-increasing and bounded from below. By the monotonic convergence theorem, is bounded and converges for every .
By Lemma 5.16, and . Denote . Then with (ii) of Lemma * and (65) we have,
[TABLE]
where the first inequality follows by and the second inequality follows by , both utilizing (ii) of Lemma *. Notice that
[TABLE]
For the second and third terms on the right hand side of (85),
[TABLE]
where the second equality follows from (86) by setting , and .
Combining (85) and (LABEL:equ_thm_57_2) yields ,
[TABLE]
Using (88) from [math] to and adding all inequalities yields
[TABLE]
Taking limit as we have, . Since , it follows that converges and .
Since is bounded and converges, is a bounded sequence. There exists a subsequence that converges to . Notice that the composition is (Lipschitz) continuous and single-valued, because (65) is just an equivalent expression of Algorithm 3.5, and obviously the right hand side of Algorithm 3.5 is continuous. . Since is continuous, and , passing to limiting point, we have . Therefore, the limiting point is a fixed point of , or equivalently, .
Setting in (LABEL:equ_thm_5_2), we have is bounded and converges. Since there exists a subsequence that converges to , it follows that converges to zero. Therefore, . By Theorem 4.10, this just implies (83).
6 Distributed algorithm with inertia
In this section, we propose a distributed algorithm with inertia for variational GNE seeking, which possibly accelerates the convergence under some mild additional computation burden.
There are various modifications of Picard fixed point iteration to achieve the possible acceleration of convergence speed, and most of them fall into the domains of relaxation algorithm and inertial algorithm (Refer to Iutzeler & Hendrickx (2016) for reviews and numerical comparisons for optimization problems). The relaxation algorithm that simply combines the current operator output with previous iterate, leads to the well-known Krasnosel’skiĭ-Mann type of fixed point iteration***Chapter 5 of Bauschke & Combettes (2011), and has been utilized in (generalized) Nash equilibrium computation in Contreras, Klusch, Krawczyk (2004) and Krawczyk & Uryasev (2000). Meanwhile, inertial algorithms in operator splitting methods have received attention in recent years, such as Alvarez & Attouch (2001), Attouch, Chbani, Peypouquet, and Redont (2016), Lorenz & Pock (2015) and Rosasco, Villa & Vũ (2016). These efforts are partially motivated by the heavy ball method in Polyak (1987) and Nesterov’s acceleration algorithm in Nesterov (2013)) for optimization problems and their recent success in machine learning applications (refer to Wibisono, Wilson, and Jordan, (2016)). In particular, Nesterov’s acceleration algorithm is proved to enjoy an optimal convergence speed with a specific step-size choice. Thereby, in this work we consider a distributed algorithm with inertia for variational GNE seeking given as below:
Algorithm 6.18
**
[TABLE]
is a fixed step-size in the acceleration phase, and are fixed step-sizes of player , and is the weighted adjacency matrix of multiplier graph .*
Compared with Algorithm 3.5, Algorithm 6.18 has two phases. In the acceleration phase, each player uses the local state information of the last two steps to get predictive variables by a simple linear extrapolation. In the update phase, the players just feed the predictive variables to Algorithm 3.5 to get the next iterates. Hence compared with Algorithm 3.5, Algorithm 6.18 has only an additional simple local computation burden. Obviously, Algorithm 6.18 is also totally distributed, and shares all the features of Algorithm 3.5. However, there is an additional need to choose a proper step-size .
In the following two subsections, we will first give some intuitive interpretation of Algorithm 6.18 from the viewpoint of a discretization of continuous-time dynamical systems, and then prove its convergence.
6.1 Interpretations from viewpoints of dynamical systems
The interpretation of inertial (acceleration) algorithms from a continuous-time dynamical system viewpoint can be found in Polyak (1987) and most recently in Wibisono, Wilson, and Jordan, (2016) for optimization problems and in Attouch, Chbani, Peypouquet, and Redont (2016) for proximal point algorithms. Here we give a comparative development of Algorithm 3.5 and Algorithm 6.18 just for illustrations of the differences behind the algorithms.
Firstly, let us show that Algorithm 3.5, or equivalently its compact reformulation (65) in Lemma 4.8, can be interpreted as the discretization of the following dynamical system:
[TABLE]
In fact, for differential inclusion (95), we have the following implicit/explicit discretization with step-size of ,
[TABLE]
Denote and take , then (96) can be written as
[TABLE]
Therefore, the implicit/explicit discretization of (95) is exactly (64) that leads to (65), or equivalently Algorithm 3.5. Moreover, the explicit discretization corresponds with the forward step, and the implicit discretization corresponds with the backward step. That’s the reason why Algorithm 3.5 is called a forward-backward splitting algorithm.
Adopt similar compact notations as in Section 4, and denote , , and . And further denote . Then by similar arguments as in Section 4 and using operators and defined in (49) and defined in (42), Algorithm 6.18 can be written in a compact form (assume is maximally monotone),
[TABLE]
where is defined as in Section 4.
We can show that Algorithm 6.18, or equivalently (98)-(99), can be interpreted as the discretization of the following second-order continuous-time dynamical system,
[TABLE]
In fact, for differential inclusion (100) consider the following type of implicit/explicit discretization,
[TABLE]
where is an interpolation point to be determined later. Denote and take , then (101) can be written as
[TABLE]
Denote and take , then (102) can be written as
[TABLE]
(103) leads to equations (98)-(99), or equivalently Algorithm 6.18.
Remark 6.19
Compared with (95), (100) is a second order dynamical system with an additional inertial term , hence (100) enjoys better convergence properties than (95). Therefore, it is expected that Algorithm 6.18, as a discretization of (100), would have better convergence properties than Algorithm 3.5.
6.2 Convergence analysis
The following result proves the convergence of Algorithm 6.18 by providing sufficient step-size choices for as well as . The sufficient choice condition for can be ensured by solving a simple algebra inequality. The proof idea of the following result is motivated by inertial algorithms works for optimization and operator splitting such as Alvarez & Attouch (2001), Attouch, Chbani, Peypouquet, and Redont (2016), Rosasco, Villa & Vũ (2016), Iutzeler & Hendrickx (2016), and especially Lorenz & Pock (2015). However, since this work considers a noncooperative game setup and adopts a fixed step-size in the distributed algorithm, Theorem 6.20’s proof is also provided for completeness.
Theorem 6.20
Suppose Assumptions 1-3 hold. Take where is the maximal weighted degree of multiplier graph , and are parameters in Assumption 2. Given a sufficient small , take and in Algorithm 6.18 such that . Suppose that player chooses its step-sizes in Algorithm 6.18 satisfying (76). Then with Algorithm 6.18, players’ local strategies converge to the variational GNE of game in (7), and the local multipliers of all the agents converge to the same multiplier corresponding with KKT condition (16), i.e.,
[TABLE]
Proof: By the choice of , operator is cocoercive and operator is maximally monotone due to Lemma 5.14. By the choice of and , is positive semi-definite due to Lemma 5.15, and and satisfy the properties in Lemma 5.16. Therefore, Algorithm 6.18 can be exactly written in the compact form of (98)-(99) with similar arguments as in Lemma 4.8.
Resorting to Theorem 4.10, we only need to show that Algorithm 6.18 converges and its limiting point belongs to . In fact, any limiting point of Algorithm 6.18 satisfies as shown next. Suppose , then and using the continuity of the right hand of (99). Therefore, because is a positive definite matrix.
The following relationship (similar with the cosine rule) will be heavily utilized in the convergence analysis.
[TABLE]
which can be verified by directly expanding with .
The proof is divided into three parts:
Part 1: Given any , follows a recursive inequality as follows,
[TABLE]
- 2.
Part 2: Given any , and .
- 3.
Part 3: We first show the convergence of given any , and then show the convergence of Algorithm 6.18.
Part 1: Given any point , we first prove a recursive inequality (106) for .
Using (105) to expand the left hand of (106) yields,
[TABLE]
where the last step is derived by incorporating (98).
To tackle the second term on the right hand of (LABEL:equ_thm_6_1), we proceed as follows. By (99) we have, or equivalently,
[TABLE]
Because , we also have
[TABLE]
Due to the maximal monotonicity of proved in Lemma 5.14,
[TABLE]
By incorporating (108) and (109) into (110), we have
[TABLE]
or equivalently,
[TABLE]
Using (111) for the second term on the right hand of (LABEL:equ_thm_6_1) yields
[TABLE]
By Lemma 5.14, is cocoercive. For the second term on the right hand of (112), we have
[TABLE]
where the first inequality is obtained by the cocoercive property (71) and the second inequality is obtained by .
Combining (112) with (LABEL:equ_thm_6_7) we have
[TABLE]
or equivalently we have
[TABLE]
Utilizing the equality (105) we also have,
[TABLE]
Combining (114) and (115), we have,
[TABLE]
Next using (98) and (105) for the first and third terms on the right hand of (LABEL:equ_thm_6_10) yields
[TABLE]
For the second term on the right hand of (LABEL:equ_thm_6_10), , since . Incorporating this and (LABEL:equ_thm_6_11) into (LABEL:equ_thm_6_10)
[TABLE]
Since , we derive (106).
Part 2: In this step, we will prove and .
Denote . Then is symmetric and positive definite since and . The first inequality of (LABEL:equ_thm_6_12) can also be written as:
[TABLE]
For the first term on the right hand of (119),
[TABLE]
where the first equality follows from (105), the second equality follows from (98), and the third inequality follows from .
Denote . Then is also symmetric and positive definite, since and . Combining (119) with (LABEL:equ_thm_6_15),
[TABLE]
Denote , then
[TABLE]
where the third inequality follows by (LABEL:equ_thm_6_16).
Given a sufficient small , choose and such that , then
[TABLE]
Therefore, (LABEL:equ_thm_6_17) yields . Therefore, . By the definition of , . Therefore, . By Lemma 1 at Page 44 of Polyak (1987), , and is bounded sequence.
We also have . Then . Let goes to infinity, we have,
[TABLE]
Therefore, , and .
Part 3: In this part, we first show the convergence of given any , and then show the convergence of Algorithm 6.18.
Denote and , and recall (106), we have Apply this relationship recursively,
[TABLE]
Summing (124) from to ,
[TABLE]
Let , then since ,
[TABLE]
Noticing that , , and hence the sequence , being a nonnegative and non-decreasing sequence, converges and is bounded.
Consider another sequence . Since is nonnegative and is bounded, is bounded from below. Furthermore, is a non-increasing sequence. In fact,
[TABLE]
where the second inequality follows from the definition of , . As is a non-increasing sequence and bounded from below, converges.
Therefore, , being the sum of two convergent sequences and , also converges.
We are ready to show the convergence of Algorithm 6.18 using the results in Part 1 and Part 2. Since is a bounded sequence, it has a convergent subsequence that converges to . Because by Part 2, we have and . Pass to limiting point of , then we have because the righthand side of (98)-(99) is continuous. Hence, . Taking in (106) of Part 1, we also have converges by Part 3. Because a subsequence converges to zero, the whole sequence converges to zero. Therefore, the whole sequence of converges to . Resorting to Theorem 4.10 gives the desired result.
Remark 6.21
A sufficient and simple choice of parameters to ensure the conditions in Theorem (6.20) is , and . In fact, implies that could be be any real number . If we take , then the quadratic inequality becomes . Since , takes value strictly less than zero when takes [math]. By the continuity of quadratic equation, there always exists that ensures the above quadratic inequality given any and .
7 Network Cournot game and simulation studies
There are various practical problems that can be well modeled by the game in (7), such as the river basin pollution game in Krawczyk & Uryasev (2000), the power market competition in Contreras, Klusch, Krawczyk (2004), plug-in electric vehicles charging management in Paccagnan, Gentile, Parise, Kamgarpour & Lygeros (2016), and communication network congestion game in Yin, Shanbhag & Mehta (2011). All above examples can be regarded as the type of the network Cournot game described below, which is a generalization of the network Cournot competition in Bimpikis, Ehsani, & Ilkilic (2014) by introducing additional market capacity constraints or equivalently globally shared coupling affine constraints. This type of network Cournot game with affine coupling constraints also appeared in the numerical studies of Yu, van der Schaar & Sayed (2016).
7.1 Network Cournot game
Suppose that there are companies (players) with labels and markets with labels . Company decides its strategy to participate in the competition in markets by producing and delivering amounts of products to the markets it connects with. The production limitation of company is . Company has a local matrix that specifies which market it will participate in. The -th column of , that is , has only one element being and all other elements being [math], and has its -th element being if and only if player delivers amount of production to the market . Therefore, matrices can be used to specify a bipartite graph that represents the connections between the companies and the markets. Denote , , and . Then is just the total product supply to all the markets given the action profile of all the companies. Market has a maximal capacity of , therefore, it should be satisfied that where . Suppose that is a price vector function that maps the total supply of each market to the corresponding market’s price. Each company has also a local production cost function . Then the local objective function of company (player) is .
Overall, in this network Cournot game, each company needs to solve the following optimization problem given the other companies’ profile ,
[TABLE]
Obviously, the above network Cournot game in (127) is a particular problem of game in (7). Some practical decision problems in engineering networks can be well described by the network Cournot game in (127), such as the rate control game in communication network (Yin, Shanbhag & Mehta (2011)) and the demand response game in smart grids (Ye & Hu (2016)).
Example 7.22** (Rate control game)**
Consider a group of source-destination pairs (nodes) in a communication network, that is , to decide their data rates in a non-cooperative setting. The data is transferred through a group of communication links (channels), that is , and each link has a maximal data rate capacity of . Assume that an additional layer has decided the routine table for each source-destination pair , which is encoded by . Each column of has only one element being and all the other elements being zero, and the -th element of column is if utilizes the link and transfers data rate on link . The local decision variable of is the data rate on each link that it utilizes, denoted by . also has a local feasibility constraint . Denote , , and . The total data rate on each link should be less than the capacity of that link: Given the data rate profile of all the nodes , the payoff function of , , takes the form as , where is the utility of source , and is a delay function that maps the total data rate on each link to the unit delay of that link. Thereby, the data rate control game can be well described by the network Cournot game in (127).
Example 7.23** (Demand response game)**
Given a distribution network in power grids, suppose that there are time periods, and each period has a desirable minimal total load shedding . Suppose that there are load managers (energy management units or players) in the network, and each load manager can decide a local vector as its local load shedding vector in some specific time periods. Each load manager also has a local matrix that specifies which time period player will participate. For th column of , it has one element being while all other elements being zero. The th element of the th column of is if load manager decides to decrease its load by at time . Denote , and , . Naturally, it is required that the total load shedding of all the load managers should meet the minimal value, . Each player has a local feasible constraint , and a cost function due to local load shedding. is the payment price vector function that maps total load shedding of each period to the payment price vector, therefore, is the payment awards of player for its load shedding. The disutility function of player is given all the players’ action profile . All in all, the demand response management game is well described by the network Cournot game model in (127).
Moreover, the Assumptions 1 and 2 for Algorithm 3.5 and 6.18 can easily be satisfied for many practical cost functions and price functions. For example, take company ’s production cost function to be a strongly convex function with Lipschitz continuous gradients (A quadratic function with being a symmetric and positive definite matrix and is one possible choice). The price of market is taken as the linear function of the total supplying (known as a linear inverse demand function in economics) with . Denote , , . Then is the vector price function. The payments of company by selling product to the markets that it connects with is just . Therefore, the objective function of company is,
[TABLE]
Denote , and , and
[TABLE]
then
[TABLE]
Notice that can be written as
[TABLE]
where is a block matrix defined as and . Therefore, is positive semi-definite matrix. Hence, the Jacobian matric of , is postive definite since the cost functions are strongly convex. Therefore, is strongly monotone***Proposition 2.3.2 of Facchinei and Pang (2007) and Lipschitz continuous, and Assumptions 1 and 2 are satisfied.
Remark 7.24
Notice that in the network Cournot game of (127), each player ’s local objective function only depends on the decisions of the players that participate the same markets as player . Denote as the set of integers from to . Mathematically, if and only if , and such that and . Since is usually a sparse matrix, the interference graph of network Cournot game also has sparse edge connections.
7.2 Simulation studies
In the studies, we adopt a similar simulation setting as Yu, van der Schaar & Sayed (2016) without considering the stochastic factors. Consider companies and markets, and the connection relationship between the companies and the markets is depicted in Figure 1. If there is an edge from to in Figure 1, then company participates the competition in market by producing and delivering products to market . Each company has a local constraint as where each component of is randomly drawn from . Each market has a maximal capacity of and is randomly drawn from . The local objective function is taken as (128). The local cost function of company is which is a strongly convex function with Lipschitz continuous gradient. Here is randomly drawn from , and each component of is randomly drawn from . The price function is taken as the linear function , and and are randomly drawn from and , respectively.
With Figure 1 and the definition of objective function in (128), the interference graph can be easily obtained and is depicted in Figure 2. Meanwhile, we adopte the multiplier graph shown in Figure 3. The weighted adjacency matrix of multiplier graph has all its nonzero elements to be .
With Figure 1 and the definition of objective function in (128), the interference graph can be easily obtained and is depicted in Figure 2. Meanwhile, we adopte the multiplier graph shown in Figure 3. The weighted adjacency matrix of multiplier graph has all its nonzero elements to be .
Set the step-sizes in Algorithm 3.5 as for all companies, and for Algorithm 6.18 set while other step-sizes are the same as Algorithm 3.5. The initial starting points and of both algorithms are set to be zeros.
The trajectories of selected algorithm performance indexes, including , , , , and , are shown in Figures 4, 5 and 6. The trajectories of the local decisions of some companies are shown in Figure 7, and the trajectories of , which stands for the -th component of local Lagrangian multiplier , are shown in Figure 8.
8 Conclusions
In this paper, we proposed a primal-dual distributed algorithm based on operator splitting methods for iterative computation of a variational GNE in noncooperative games with globally shared affine coupling constraints. The algorithm is motivated by the forward-backward operator splitting method for finding zeros of a sum of monotone operators. Each player only needs to knows its local information, especially a block of the affine coupling constraints. The proposed algorithm is proved to converge with fixed step-sizes under some mild assumptions by exploiting properties of composition of averaged operators. Furthermore, a distributed algorithm with inertia is also proposed and analyzed for possible acceleration of convergence speed. Numerical simulation studies for a network Cournot game demonstrate the efficiency of the proposed algorithms and the superior convergence speed of the inertial algorithm.
Many challenging and exciting topics are still open for distributed NE/GNE seeking. Here we only list some problems with probable solution hints. Finding all the generalized Nash equilibria has its only interests, and this could be partially solved by combining the design in this paper with the parameterized variational inequality method in Nabetani, Tseng, and Fukushima (2011). The algorithm requires that each player is able to observe all its neighbors’ decisions through the interference graph . This assumption could be relaxed by adopting the local consensus dynamics in Salehisadaghiani & Pavel (2016b), and then it could only be required that the players were able to observe parts of its neighbors’ decisions through a maximal triangle-free spanning subgraph of . The methodology of this paper could be extended for stochastic GNE seeking with noisy gradient observations and noisy information sharing by resorting to the stochastic forward-backward splitting algorithm in Rosasco, Villa & Vũ (2016). The strong monotonicity assumption on the pseudo-gradient might be relaxed to monotonicity assumption by utilizing the forward-backward-forward splitting method in Briceno-Arias & Combettes (2013).
Reference
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Alpcan & Basar (2005) Alpcan, T. and Basar, T., 2005. Distributed algorithms for Nash equilibria of flow control games. In Advances in dynamic games (pp. 473-498). Birkhauser Boston.
- 2Alvarez & Attouch (2001) Alvarez, F. and Attouch, H., 2001. An inertial proximal method for maximal monotone operators via discretization of a nonlinear oscillator with damping. Set-Valued Analysis, 9(1-2), pp.3-11.
- 3Attouch, Chbani, Peypouquet, and Redont (2016) Attouch, H., Chbani, Z., Peypouquet, J. and Redont, P., 2016. Fast convergence of inertial dynamics and algorithms with asymptotic vanishing viscosity. Mathematical Programming, pp.1-53.
- 4Bauschke & Combettes (2011) Bauschke, H.H. and Combettes, P.L., 2011. Convex analysis and monotone operator theory in Hilbert spaces. Springer Science & Business Media.
- 5Berinde (2007) Berinde, V., 2007. Iterative approximation of fixed points. Berlin, Germany: Springer.
- 6Bimpikis, Ehsani, & Ilkilic (2014) Bimpikis, K., Ehsani, S. and Ilkilic, R., 2014. Cournot competition in networked markets. In EC (p. 733).
- 7Briceno-Arias & Combettes (2013) Briceno-Arias, L.M. and Combettes, P.L., 2013. Monotone operator methods for Nash equilibria in non-potential games. In Computational and Analytical Mathematics (pp. 143-159). Springer New York.
- 8Combettes & Vũ (2014) Combettes, P.L. and Vũ, B.C., 2014. Variable metric forward-backward splitting with applications to monotone inclusions in duality. Optimization, 63(9), pp.1289-1318.
