Input-Feedforward-Passivity-Based Distributed Optimization Over Jointly Connected Balanced Digraphs
Mengmou Li, Graziano Chesi, Yiguang Hong

TL;DR
This paper introduces a novel passivity-based distributed optimization algorithm for directed graphs, ensuring exponential convergence without global information, and demonstrates its effectiveness through numerical examples.
Contribution
It proposes a new input feedforward passivity framework and a derivative feedback algorithm that work over directed, weight-balanced graphs without requiring eigenvalue knowledge.
Findings
The algorithm guarantees exponential convergence on strongly connected topologies.
It is robust to randomly changing weight-balanced digraphs.
Numerical examples validate the effectiveness of the proposed methods.
Abstract
In this paper, a distributed optimization problem is investigated via input feedforward passivity. First, an input-feedforward-passivity-based continuous-time distributed algorithm is proposed. It is shown that the error system of the proposed algorithm can be decomposed into a group of individual input feedforward passive (IFP) systems that interact with each other using output feedback information. Based on this IFP framework, convergence conditions of a suitable coupling gain are derived over weight-balanced and uniformly jointly strongly connected (UJSC) topologies. It is also shown that the IFP-based algorithm converges exponentially when the topology is strongly connected. Second, a novel distributed derivative feedback algorithm is proposed based on the passivation of IFP systems. While most works on directed topologies require knowledge of eigenvalues of the graph Laplacian, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Input-Feedforward-Passivity-Based Distributed Optimization Over Jointly Connected Balanced Digraphs
Mengmou Li, Graziano Chesi, and Yiguang Hong A preliminary version of this work was presented in the 58th IEEE Conference on Decision and Control, Nice, France [1].The work of Y. Hong was supported by National Natural Science Foundation of China under Grant 61733018.M. Li and G. Chesi are with the Department of Electrical and Electronic Engineering, The University of Hong Kong, Hong Kong, China (e-mail: [email protected]; [email protected]).Y. Hong is with the Key Laboratory of Systems and Control, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, 100190, Beijing, China (e-mail: [email protected]).
Abstract
In this paper, a distributed optimization problem is investigated via input feedforward passivity. First, an input-feedforward-passivity-based continuous-time distributed algorithm is proposed. It is shown that the error system of the proposed algorithm can be decomposed into a group of individual input feedforward passive (IFP) systems that interact with each other using output feedback information. Based on this IFP framework, convergence conditions of a suitable coupling gain are derived over weight-balanced and uniformly jointly strongly connected (UJSC) topologies. It is also shown that the IFP-based algorithm converges exponentially when the topology is strongly connected. Second, a novel distributed derivative feedback algorithm is proposed based on the passivation of IFP systems. While most works on directed topologies require knowledge of eigenvalues of the graph Laplacian, the derivative feedback algorithm is fully distributed, namely, it is robust against randomly changing weight-balanced digraphs with any positive coupling gain and without knowing any global information. Finally, numerical examples are presented to illustrate the proposed distributed algorithms.
Index Terms:
Continuous-time algorithms, input feedforward passivity, weight-balanced digraphs, uniformly jointly strongly connected topologies, derivative feedback.
I Introduction
Distributed optimization over multi-agent systems has been widely investigated in recent years, due to its broad applications in various aspects including wireless networks, smart grids, and machine learning. In addition to discrete-time algorithms (e.g., [2, 3, 4]), a variety of continuous-time distributed algorithms have been proposed to solve distributed optimization problems [5, 6, 7, 8]. Continuous-time algorithms can be implemented in hardware devices like analog circuits [9], and achieve tasks such as motion coordination of multi-agent systems [10]. Studying optimization in the continuous-time domain benefits from numerous control techniques for stability analysis and also opens up the possibility to address commonly encountered problems in large-scale networks, such as disturbance rejection [11], robustness to delays or uncertainties [12, 13], or channel constraints [14]. However, most of the proposed algorithms are only for undirected topologies and not applicable to directed topologies [5, 6, 7, 8]. To deal with this difficulty, some parameters in the algorithms can be tuned to stabilize the dynamics [15, 16], while some variants of the standard proportional-integral algorithm are proposed [14, 17]. However, most of these methods often employ coordinate transformation along with complicated Lyapunov function candidates in convergence analysis, which does not preserve network structures, and requires eigenvalues of the graph Laplacian to design some parameters [14, 15, 17, 18]. Compared with these methods, a systematic approach that focuses more on the distributed interconnection of agents in the network is needed.
It is well known that dissipativity (as well as its special case, passivity) is a useful tool for stability analysis and control design [19, 20, 21]. Recently, there emerged some passivity-based algorithms on distributed optimization under some communication constraints [22, 23, 13, 24]. However, these passivity-based algorithms can only be applied over undirected graphs, while it is shown that output consensus can be achieved over directed graphs through simple output feedback interconnections of passive systems [19, 20]. Motivated by these works, we aim to study distributed algorithms over directed graphs via passivity techniques. On one hand, we conjecture that it is in general difficult to directly construct a distributed algorithm that can be interpreted as output feedback interconnections of passive systems. On the other hand, works in [25, 26, 27] point out that output consensus can be achieved over directed graphs even among IFP (or passivity-short) systems. Therefore, if a distributed algorithm inherits input feedforward passivity, it can be directly applied to weight-balanced digraphs through output feedback interconnections. As a byproduct of having the IFP properties, the distributed algorithm is also applicable over uniformly jointly strongly connected (UJSC) topologies. This feature is remarkable since it greatly reduces communication costs, and hence is more practical in large-scale networks. Though the problem of UJSC switching topologies has been considered in discrete-time algorithms [2, 3, 4], to the best of our knowledge, it has never been addressed in the continuous-time domain, due to the difficulties in stability analysis under the time-varying nature and lack of connectedness of topologies.
In this paper, we investigate the distributed optimization problem via input feedforward passivity. First, we propose an IFP-based distributed algorithm whose error system is decomposed into a group of individual IFP systems that interact with each other using output feedback information. Based on this IFP framework, we study the distributed algorithm over directed and UJSC weight-balanced topologies and derive convergence conditions of a suitable coupling gain for the algorithm. We also show that this IFP-based algorithm converges exponentially when the graph is strongly connected. Second, we propose a novel distributed derivative feedback algorithm based on the passivation of IFP systems. While most works on directed topologies in the literature require knowledge of eigenvalues of the graph Laplacian [14, 15, 17, 18], we show that the derivative feedback algorithm is fully distributed, namely, it is robust against randomly changing weight-balanced digraphs with any positive coupling gain and without knowing any global information. In other words, the derivative feedback algorithm is applicable over gossip-like balanced digraphs [28], reducing communication costs. It is worth mentioning that [16] develops a fully distributed adaptive algorithm. However, it does not apply to switching or UJSC topologies. The challenges in our work lie in the construction of a group of verifiable nonlinear IFP systems that solves the distributed optimization problem, the design of the fully distributed algorithm, and the convergence analysis of the proposed algorithms.
Moreover, our analytical method differs from most works from the literature in that we first characterize passivity from single-agent level, and then address the stability based on the output feedback interconnection model of these agents over networks. This method compares favorably to some other works [15, 14, 17] since it bypasses coordinate transformation and preserves network structures in convergence analysis. Besides, it also allows potential applications of mature passivity-based techniques in the study of network issues arising in distributed optimization.
A preliminary version of this work appeared in [1], where only the IFP-based algorithm has been proposed. In this work, we propose the IFP-based algorithm with a possibly time-varying coupling parameter, construct more practical conditions that are easier to verify in a distributed sense, and show exponential convergence of the IFP-based algorithm over strongly connected digraphs. Moreover, a fully distributed algorithm is proposed.
The rest of this paper is organized as follows. In Section II, some background knowledge of convex analysis, graph theory, and passivity is reviewed and the problem formulation is given. In Section III, an IFP-based distributed algorithm is proposed and studied over weight-balanced UJSC topologies. In Section IV, a fully distributed algorithm over weight-balanced UJSC digraphs is proposed. In Section V, numerical examples are presented to illustrate effects of the two algorithms. Finally, the paper is concluded in Section VI.
II Preliminaries and Problem Formulation
II-A Notation
Let and be the sets of real and integer numbers, respectively. The Kronecker product is denoted as . Let denote the 2-norm of a vector and also the induced 2-norm of a matrix. The determinant of a square matrix is denoted as . Given a symmetric matrix , the notation means that is positive definite (positive semi-definite). Denote the eigenvalues of in ascending order as . Let and denote the identity matrix and zero matrix (or vector) of proper sizes, respectively. denotes the vector of ones. denotes the column vector stacked with vectors . The notation denotes a (block) diagonal matrix with its th diagonal element (block) being . The notation is used to denote a times continuously differentiable function.
II-B Convex Analysis
A differentiable function is convex over a convex set if and only if , , and is strictly convex if and only if the strict inequality holds for any . It is -strongly convex if and only if , . An equivalent condition for the strong convexity is the following: , . An operator is -Lipschitz continuous over a set if , .
II-C Graph Theory
The information exchanging network is represented by a graph , where is the node set of all agents, is the edge set. The edge means that agent can obtain information from agent , and , where is agent ’s neighbor set. We assume in this work that there are no self-loops in , i.e., and . The graph is said to be undirected if and directed otherwise. A sequence of successive edges is a directed path from agent to agent . is said to be strongly connected if there exists a directed path between any two agents. The adjacency matrix is defined as , where ; if , and , otherwise. The in-degree and out-degree of the th agent are and , respectively. The graph is said to be weight-balanced if . The in-degree matrix is . The Laplacian matrix of is defined as . When is weight-balanced, it satisfies that and . A time-varying graph is said to be uniformly jointly strongly connected (UJSC) if there exists a such that for any , the union is strongly connected.
II-D Passivity
Consider a nonlinear dynamics described by
[TABLE]
where , and are the state, input and output, respectively, and , and are the state, input and output spaces, respectively. The functions , represent system and output dynamics, respectively, and are assumed to be sufficiently smooth, i.e., for large enough integer .
Let us give the definition of passivity and input feedforward passivity for a nonlinear system based on [29, Definition 6.3], [30, Definition 2.12].
Definition 1**.**
System (1) is said to be passive from to if there exists a continuously differentiable positive semi-definite function , called the storage function, such that
[TABLE]
Moreover, it is said to be input feedforward passive (IFP) if , for some , denoted as IFP().
The sign of the IFP index denotes an excess or shortage of passivity. Specifically, when , the system is said to be input strictly passive (ISP). When , the system is said to be input feedforward passivity-short (IFPS). If we define a new output as , then the IFP system becomes passive from to . Throughout this paper, we consider the storage function to be positive definite and radially unbounded.
II-E Problem Formulation
Let us formulate the problem and give some necessary assumptions in this subsection. Consider the distributed convex optimization problem among a group of agents in the node set ,
[TABLE]
where and each local objective function satisfies the following assumption.
Assumption 1**.**
Each is and -strongly convex, with its gradient being -Lipschitz continuous.
This assumption also implies that and , . Note that 1 is widely adopted in the literature, see, e.g., [14, 31]. It is required in this paper to ensure IFP properties and to estimate IFP indices of agents. In addition, it is shown later that the Lipschitz requirement can be relaxed by selecting proper parameters in the algorithms.
Under 1, the necessary and sufficient condition of optimality for problem (3) is [32, Section 5.5.3]
[TABLE]
Denote as agent ’s local estimation of the global optimal solution and let , then problem (3) is equivalent to [15]
[TABLE]
where the constraints are consensus constraints for agents to reach a common value. Under 1 and due to (4), the optimal solution to problem (5) should satisfy
[TABLE]
Consider the distributed optimization over UJSC weight-balanced digraphs. To the best of our knowledge, this problem has never been addressed in the continuous-time domain.
Assumption 2**.**
The agents interact with each other through a sequence of UJSC digraphs , where is weight-balanced pointwise in time and , .
This assumption does not restrict the switching logic of provided it is UJSC for a finite . Note that the time interval is only imposed to ensure convergence performance, and our results in this work hold as long as is strongly connected in a probabilistic sense [28]. We will propose two algorithms in the following sections. The information of is required for the first algorithm, while it is not used at all for the second algorithm. Here the trivial case of is omitted.
III IFP-Based Distributed Algorithm
In this section, we propose a distributed algorithm based on input feedforward passivity and study its stability over UJSC balanced topologies.
III-A IFP-Based Distributed Algorithm
We propose an IFP-based distributed algorithm as follows.
and are local variables and input for the th agent, respectively; , and are constant parameters and is the coupling gain for the diffusive couplings (7c). To ease the discussion on parameters, we assume that are arbitrary parameters, while is a finite and possibly time-varying coupling gain to be designed. The initial condition is required to ensure the optimality of the equilibrium point, which will be specified in subsequent analysis. A simple initial choice can be , .
Initially, each agent in (7a) estimates the optimal value by local gradient descent. Since , may not be the same, an auxiliary variable is introduced to compensate for the difference of local gradients and ensure the existence of an equilibrium. Then, a diffusive coupling protocol (7c) is added to (7a), (7b) in order to drive the dynamics to reach a consensus on the final optimal value. Algorithm 1 is a distributed algorithm since each agent only exchanges information with neighboring agents.
Denote , . Agents in Algorithm 1 are interconnected through diffusive couplings , . By eliminating , the compact form of the overall closed-loop system is written as
[TABLE]
where , and is the graph Laplacian of .
Remark 1**.**
Algorithm 1* in the form of (8) is a generalization of algorithms developed in [14]. Specifically, let and , then Algorithm 1 reduces to the distributed algorithm in [14]. When , and , Algorithm 1 reduces to the simplified algorithm in [14]. Compared with [14], Algorithm 1 includes more general cases whose convergence cannot be proved by methods in [14], e.g., when is time-varying, when is negative, and when is independent of . Moreover, it is shown later that this generalized algorithm is valid over UJSC topologies in addition to directed and strongly connected switching topologies [14], and has an exponential convergence rate when the graph is strongly connected.*
Lemma 1**.**
Under Assumptions 1 and 2. If there exists an equilibrium point to system (8) that satisfies , where , , then is also unique with being the optimal solution to problem (3).
Proof.
The equilibrium point satisfies
[TABLE]
where the term in (9a) is zero and omitted since (9b) implies . Since the graph is UJSC, for all implies that . Next, multiplying (9a) by from the left, one has,
[TABLE]
which satisfies (6). Therefore, is the optimal solution to problem (3). Besides, the strong convexity of in 1 implies that is unique [32, Section 9.1.2]. Thus, by (9a), is unique as well. ∎
Hereafter, we call the optimal point. The convergence of Algorithm 1 will be addressed in Section III-C.
III-B Input Feedforward Passivity of the Error System
In this subsection, we show that the error subsystem of each agent inherits the input feedforward passivity, which is a crucial step before the convergence analysis over UJSC balanced digraphs, and the design of a passivated algorithm in Section IV.
By Lemma 1, for agent , one has
[TABLE]
Denote , . Then, the group of error subsystems between (7) and (10) is
[TABLE]
where is defined as the output of the th error subsystem. Then the input , can be rewritten as
[TABLE]
or in a compact form, where , . Assume that, corresponding to the real agents, there exists a group of virtual agents such that the th virtual agent possesses the subsystem . Then, Algorithm 1 can be seen as output feedback interconnections of these virtual agents. In fact, no information of is needed for communication since . Then, each agent possesses the same information as its corresponding virtual agent.
Next, we show that each error subsystem in (11) is IFP() with index .
Lemma 2**.**
Under 1, each error subsystem in (11) is IFP() from input to output with respect to the storage function
[TABLE]
where and .
Proof.
See the Appendix. ∎
As pointed out by [27], it is in general difficult to derive the exact IFP index for a nonlinear system, and only its lower bound can be obtained by specifying the storage function. With the storage function (2), the lower bound of the exact IFP index can be obtained locally by solving the minimax problem
[TABLE]
When each is quadratic, , the error system (11) becomes a linear system. The exact IFP index for a linear system can be easily obtained by solving an LMI related to the positive real lemma [33, Lemma 2]. The problem of reducing this gap between the lower bound and the exact index of IFP remains open and is left for the future work.
Remark 2**.**
It is in general not difficult to obtain by solving (13) since local objective functions are usually of simple forms. Even when the local objective functions are complicated, problem (13) can be relaxed to
[TABLE]
where is the Lipschitz index defined in 1. Here (14) can be easily solved, providing a lower bound of the exact IFP index, which we can denote as the new . It can also be observed that when , (13) reduces to . The IFP index of agent is only related to the strong convexity index . In this case, the Lipschitz continuity of the gradients is not required.
III-C Algorithm Over UJSC Balanced Topologies
In this subsection, we analyze the convergence of Algorithm 1 over weight-balanced and UJSC switching topologies based on output feedback interconnections of subsystems in (11). Meanwhile, the effort in constructing candidate Lyapunov functions in convergence analysis is greatly reduced.
Definition 2**.**
The group of agents , is said to achieve output consensus if their outputs satisfy .
Theorem 1** ([1]).**
Under Assumptions 1 and 2, the states of Algorithm 1 will converge to the optimal point and solve problem (3) if and the coupling gain satisfies
[TABLE]
where is the smallest value of IFP index , denotes the nonzero smallest eigenvalue, and was defined in Section II-A.
It can be proved through the Lyapunov function , where was defined in (2), and by the fact that and have the same null space. The details of the proof with constant can be found in the conference paper [1]. Condition (15) requires the calculation of eigenvalues, which may be difficult to verify in a large-scale network. Thus, a more practical condition is derived in a different manner as follows, which is easier to verify or estimate for the design of the coupling gain in a distributed sense.
Theorem 2**.**
Under Assumptions 1 and 2, the states of Algorithm 1 with initial condition will converge to the optimal point and solve problem (3) if the positive coupling gain satisfies
[TABLE]
where is the in/out-degree of the th agent.
Proof.
Let , where was defined in (2). By the proof of Lemma 2, , thus, is radially unbounded. Suppose (16) holds. Then, following Lemma 2, the derivative of gives
[TABLE]
where , the third equality follows from (12), the fourth equality follows from , the second inequality follows from the Cauchy-Schwarz inequality, and the last inequality follows from (16).
Then exists and is finite. implies that the states , are bounded. The systems trajectories are bounded within the domain . By the first term in (see (33) in the appendix) and the jointly connectedness of , only if and , , where was defined in Lemma 2. Define the domain . Clearly, is bounded for any bounded , . Invoking the LaSalle’s Invariance Principle for nonautonomous systems [34], we conclude that the system states ultimately reach the domain . Then output consensus is achieved by Definition 2. Recalling (12), one has when output consensus is achieved. Therefore, , , or equivalently, , as , i.e., the states of (8) asymptotically converge to an equilibrium point.
Since , given the initial condition ,
[TABLE]
where the third equality follows from the Kronecker product and the last follows from . Then Lemma 1 holds, implying that the equilibrium point is the unique optimal point. Consequently, the states of Algorithm 1 will asymptotically converge to the optimal point. ∎
The case of fixed directed topologies can be seen as a special case of switching topologies, then the convergence is also guaranteed. Readers can refer to our conference paper [1] for more technical details. In addition, an exponential convergence rate can be obtained when the graph is strongly connected, as stated in the following.
Theorem 3**.**
Suppose that the conditions in Theorem 2 hold. In addition, if the communication digraph is fixed, strongly connected and weight-balanced, and the coupling gain for a constant , then the states of Algorithm 1 with initial condition will exponentially converge to the optimal point.
Proof.
See the Appendix. ∎
It can be observed from the proof that the exponential convergence also holds over time-varying strongly connected weight-balanced graphs provided , for some constants , .
Remark 3**.**
Note that only weight-balanced graphs are considered here. The consensus over unbalanced graphs can be guaranteed similarly [25, 27] with , where is the th element of the left eigenvalue of . However, the sum of local objective functions will have a shift from the global optimum [4]. Thus, some modification is needed. This problem may be solved by adding a state to estimate the left eigenvalues of (e.g., [18]), which we will leave to future work.
III-D Discussion on the coupling gain
In this subsection, we proceed to discuss the parameters and the design of the coupling gain for Algorithm 1.
By Lemma 2, the subsystem is IFP regardless of values of . Let be the threshold of when . Clearly, by Theorem 2, meaning that there always exists a small enough to synchronize the outputs. Thus, can be arbitrarily chosen within the range specified in Algorithm 1. Intuitively, the larger , , are, the faster the convergence rate is. However, the choices of these parameters will affect the IFP index by (13), and hence affect the feasible range of .
In fact, for proper parameters, there is usually a wide feasible range for the coupling gain. Let us take for instance the quadratic functions (i.e., linear time-invariant systems in (11)) from the perspective of passivity, with . When the strong convexity parameter , it can be shown by solving the LMI in [33, Lemma 2] that the IFP index is infinitesimal for each agent. Therefore, can be arbitrarily large based on the above theorems, which corresponds with the observation in [14, Remark 2], where it is said that can be chosen to be any positive value for the algorithm to converge in numerical examples, rendering fully distributed in practice. However, this is in general not true. When , each agent is IFPS with a large-magnitude index, which indicates that the coupling gain cannot be arbitrarily large. The trajectories of systems are not guaranteed to converge if is not within the feasible range. A numerical example is shown in Section V for this discussion. Consequently, the design of coupling gain is not fully distributed and requires global information like Laplacian eigenvalues or in/out-degrees.
Theorems 1 and 2 provide sufficient conditions for convergence for Algorithm 1, where the former requires eigenvalues related to the Laplacian and the latter only uses in/out-degrees. The calculation of eigenvalues could be time-consuming, especially in a large-scale network. There are many distributed algorithms to estimate Laplacian eigenvalues, e.g., [35, 36]. However, these algorithms are obviously not as simple as obtaining the maximum value of needed in Theorem 2 by locally comparing them among neighboring agents. For example, when the graph is fixed and strongly connected, let for agent , , and consider the following iteration,
[TABLE]
Obviously, the result can be obtained in a finite number of iterations since it is simply broadcasting the maximum value along the directed path. The number of iterations needed is no greater than the longest path in the graph, while the longest path is no greater than the total number of nodes. Thus, the condition in Theorem 2 should be easier to verify than the one in Theorem 1 in a distributed sense.
Note that applying a time-varying can be beneficial. When the graph is strongly connected, all agents should select an identical coupling gain to ensure optimality. However, when the graph is not strongly connected at some time , it is hard to communicate and obtain an identical for all agents. In this case, we can still use (19) to choose different coupling gains for agents in different disjoint subgraphs without affecting convergence to the optimal point. Suppose that the node set consists of isolated subsets and let denote the th subset at time . At this time, by the weight-balanced property, all the disjoint subgraphs of are strongly connected respectively [19, 1]. Then each subgroup of agents at time is considered as an isolated system, and thus convergence is guaranteed by Theorem 2. Following similar lines of the proof of Theorem 2, we have and hence
[TABLE]
By Lemma 1 the optimality is preserved.
The above discussion is summarized as the following corollary.
Corollary 1**.**
Under Assumptions 1 and 2, the states of Algorithm 1 with initial condition will converge to the optimal point and solve problem (3) if the coupling gain for agent satisfies
[TABLE]
and , , for all .
IV Distributed Derivative Feedback Algorithm
Note that Algorithm 1 still depends on the global information . In this section, we propose a fully distributed derivative feedback algorithm based on the passivation of Algorithm 1.
IV-A Passivation And Derivative Feedback
The derivative feedback is widely used in distributed algorithms to ensure convergence or to modify algorithms for directed graphs [37, 17, 38, 8]. In this subsection, we design a new distributed algorithm and reveal that the input-feedforward passivation of IFPS agents through an internal feedforward loop is a form of derivative feedback.
Let us consider again each error subsystem in (11). Suppose is IFP(), , then we apply a passivation through feedforward of input. Define a new output as for the th subsystem
[TABLE]
where is the IFP index of agent . The transformation is shown in Figure 1. Obviously, the transformed system is passive.
Lemma 3**.**
Under 1, each subsystem defined by (11) and (21), is passive from input to output with respect to the storage function (2).
Proof.
Adopt the same storage function (2), then following similar lines of the proof of Lemma 2, one has and
[TABLE]
∎
Adopt the diffusive couplings of , as new inputs,
[TABLE]
then a novel distributed algorithm is constructed as follows.
By eliminating and with (24c) and (24b), respectively, Algorithm 2 can be rewritten in a compact form
[TABLE]
where . We can observe that there exist derivative feedback terms in (25). Since each agent only requires information from neighboring agents, Algorithm 2 is a distributed algorithm.
Before proceeding to next step, note that the diffusive couplings of the new outputs bring algebraic loops [39, Section 8.3] into the overall closed-loop system. Thus, we have to check whether the feedback interconnection is well-posed.
The equation (25b) can be rewritten as
[TABLE]
Notice that should be nonsingular such that system (25) can be rewritten in the following explicit form, ensuring the well-posedness of the feedback interconnection [40].
[TABLE]
When the IFP indices are the same, e.g., , , where was defined in Theorem 1, then the nonsingularity of is obvious following a matrix decomposition. However, when the IFP indices take different values, more analysis of this term is needed. We propose the following lemma.
Lemma 4**.**
The matrix is nonsingular.
Proof.
See the appendix. ∎
IV-B Algorithm Over UJSC Balanced Topologies
Next, we derive the following theorem stating that Algorithm 2 is fully distributed without global coordination.
Theorem 4**.**
Under Assumptions 1 and 2, the states of Algorithm 2 with initial condition will converge to the optimal point and solve problem (3) given any coupling gain .
Proof.
When , system (25) reduces to system (8), meaning that the derivative term does not affect the equilibrium set of system (7). Besides, given the initial condition ,
[TABLE]
where the third equality follows from rules of the Kronecker product and the initial condition, the last follows from . It can also be shown by using the explicit expression (27b) of that , satisfying Lemma 1. Thus, the equilibrium point of Algorithm 2 with initial condition is still the optimal point to the distributed optimization problem (3).
The information of is not required for exchange. Then Algorithm 2 can be implemented by output feedback interconnections of virtual agents . Since is passive from input to output by Lemma 3, the consensus analysis among passive agents is similar to that among IFP agents with IFP indices being zero. Specifically, let , where was defined in (2). Substituting (23) into (22), we obtain
[TABLE]
where and and the last inequality follows from the fact that . Following similar lines of the proof of Theorem 2, the states of Algorithm 2 with initial condition will asymptotically converge to the optimal point. ∎
Similarly, Theorem 4 can be directly applied to fixed weight-balanced strongly connected digraphs as a special case of UJSC topologies, as stated in the following corollary.
Corollary 2**.**
Suppose the communication digraph is fixed, strongly connected and weight-balanced. Then, under 1, the states of Algorithm 2 with initial condition will converge to the optimal point for any .
IV-C Discussion on the Derivative Feedback Algorithm
Though Algorithm 2 requires agents to exchange with each other more information like derivatives of states, its advantages are significant.
Compared with most works on directed topologies, Algorithm 2 is robust against randomly changing weight-balanced digraphs with any positive coupling gain and independent of any global information. Since the time interval for UJSC graphs is not used in the proofs, can be relaxed to being strongly connected in a probabilistic sense, namely, it is applicable over gossip-like balanced digraphs [28]. It can be observed from Figure 1 that this modified algorithm can be easily realized by adding a local input feedforward loop to each subsystem . Since the input of the th virtual agent is the same as the input of the real agent , the input feedforward of virtual agents is actually the same as the input feedforward of real agents. Also, note that the passivation is achieved by each agent locally, no global information is needed beforehand. Thus, Algorithm 2 is a fully distributed algorithm.
It might be difficult to characterize the exact convergence rate of Algorithm 2 due to the existence of the derivative terms. Nevertheless, we will show in a numerical example in Section V that its empirical convergence rate is similar to the one of Algorithm 1.
V Numerical Examples
Example 1
We present a numerical example to show effects of the proposed algorithms over directed and switching topologies. Consider a network of agents possessing the following local objective functions , respectively.
[TABLE]
By calculation, we obtain ; , ; , ; , . Let . Then, we obtain that each subsystem in (11) is IFPS with , , , and . Next, we consider two cases of topologies.
Case 1: the agents are connected through a ring graph that is strongly connected and weight-balanced, as shown in Figure 2.
Case 2: for every second, the graph switches randomly among three modes as shown in Figure 3.
The threshold coupling gains are obtained as in (16) for both cases. We implement Algorithms 1 and 2 using the solver ode45 with auto-adjust variable stepsize in MATLAB over these two cases, and with , satisfying the initial condition. Let in Case 1. To illustrate Corollary 1, we adopt different coupling gains for different disjoint subgraphs in Case 2. Specifically, let and for Mode 1; and for Mode 2; and for Mode 3.
The state trajectories are shown in Figures 4, 5 and 6. It can be observed that the trajectories of , asymptotically converge to the optimal solution , in both cases. The residuals of both algorithms over the time-invariant graph in Case 1 are shown in Figure 6. We can observe that Algorithm 2 has a convergence rate very similar to Algorithm 1 despite the existence of derivative terms, and the residuals of both algorithms decrease exponentially.
Example 2
We present another example to compare the two algorithms. Consider a network of agents interconnected through the same graph as Figure 2. The local objective functions are
[TABLE]
Let . By solving the LMI in [33, Lemma 2] with the YALMIP Toolbox [41], we obtain that the agents are IFPS with , , and . Then by (16), the coupling gain threshold is obtained as . According to 2, when , the trajectories of Algorithm 1 will converge to the optimal point. Then, we implement the two distributed algorithms using the solver ode45 with auto-adjust variable stepsize in MATLAB, and with , satisfying the initial condition. The trajectories of the two algorithms asymptotically converge to the optimal solution when , as shown in Figure 7.
When the coupling gain is outside the feasibility range, Algorithm 1 is not guaranteed to converge. The error system (11) is a linear system:
[TABLE]
where . Clearly, it is unstable when , which accords with our discussion in Section III-D. On the other hand, Algorithm 2 should be valid with any positive by Theorem 4. To show this, we compare the two distributed algorithms with the same settings except . It can be observed from Figure 8 that Algorithm 1 is unstable while the trajectories of , in Algorithm 2 asymptotically converge to the optimal solution. Here the IFP indices are different and agents are passivated locally in Algorithm 2.
Example 3
We present an example of agents to show the scalability of the proposed algorithms. The local objective functions are , , where , are randomly generated with uniform distributions. The weight-balanced digraph is also randomly generated per second, where the probability that the edge exists is , , and the in/out-degree is no greater than , . Let , then by calculation, the IFP indices satisfy , . Thus, we can roughly obtain for Algorithm 1. For Algorithm 2, there is no restriction on the coupling gain, which we set as . We implement the two algorithms using ode45 in MATLAB with , , and obtain Figure 9. It can be observed that the proposed algorithms have good scalability.
Note that an important feature of the algorithms is that the graph Laplacian at any time can be very sparse. The time interval in 2 is imposed only to ensure the convergence performance and was not used in the proofs. In practice, our results hold as long as the graph is strongly connected in a probabilistic sense. In this example, usually has zero eigenvalue with a multiplicity greater than , which greatly reduces communication.
VI Conclusion
This paper has investigated a distributed optimization problem via input feedforward passivity. An input-feedforward-passivity framework has been adopted to construct a distributed algorithm that is applicable over weight-balanced digraphs. Moreover, a novel distributed derivative feedback algorithm, which is fully distributed, has been proposed via the input-feedforward passivation. The proposed algorithms have been studied over directed and uniformly jointly strongly connected balanced topologies. Convergence conditions of a suitable coupling gain for the IFP-based distributed algorithm have been derived, while it has been shown that the distributed derivative feedback algorithm is robust against randomly changing weight-balanced digraphs with any positive coupling gain and without knowing any global information.
It is worth mentioning that there are also some limitations in this work and several directions can be considered in future work. For instance, requiring continuous communication is difficult in practice, which can be resolved by applying discrete-time communication or discretization [42]. Also, one could extend the IFP-based distributed algorithms to solve constrained problems or enhance robustness. Lastly, one could consider relaxing the strong convexity requirement using more advanced feedback techniques.
Acknowledgments
The authors would like to thank the Associate Editor and the Reviewers for their valuable comments. The first author would also like to thank Shunya Yamashita and Prof. Takeshi Hatanaka from Tokyo Tech for their useful suggestions.
Appendix
-A Proof of Lemma 2
Under 1, one has , where
[TABLE]
is a positive definite matrix satisfying [31, Lemma 1]
[TABLE]
Clearly, is invertible and is also positive definite. Then, the th subsystem in (11) can be written as
[TABLE]
Since , one has and . Denote , or equivalently,
[TABLE]
Let us consider the storage function
[TABLE]
where is a positive parameter such that . By the strong convexity of , one has
[TABLE]
Then, by substituting the above inequality into , one gets
[TABLE]
where . By the Schur complement [43, Proposition 8.2.4], if and only if and
[TABLE]
Select such that , then the above inequality holds, and . Hence, and if and only if .
It follows from the gradient of that
[TABLE]
where
[TABLE]
Then the derivative of satisfies
[TABLE]
where the first inequality follows from the strong convexity of , the last inequality follows from the inequality of arithmetic and geometric means and , and . Since parameters in and are bounded, given finite , a constant can be obtained. Thus, the subsystem is IFP().
-B Proof of Theorem 3
Consider the case where . Adopt the Lyapunov function candidate , where was defined in (2) and is to be decided.
Let us look at the storage function again. It has been proven in the proof of Lemma 2 that
[TABLE]
where . Denote
[TABLE]
To obtain the smallest eigenvalue of , let us solve . By the Schur complement [43, Proposition 8.2.3], it is equivalent to solving
[TABLE]
where in the above since is not an eigenvalue to . Notice that by (29), then there exists an invertible matrix such that
[TABLE]
where is a diagonal matrix. Then (-B) becomes
[TABLE]
where is the th diagonal element of . Since and by Lemma 2, the roots , to (35) are positive. Solving (35), we obtain
[TABLE]
Denote as a function of with an abuse of notation. The smallest eigenvalue of satisfies
[TABLE]
then
Next, let us derive the upper bound of . By the strong convexity,
[TABLE]
Substituting the above inequality into , we have
[TABLE]
where . Denote the matrix
[TABLE]
By similar application of the Schur complement [43, Proposition 8.2.4], we can obtain that .
Let , satisfying Lemma 2, then . Moreover, is monotonically increasing with respect to . Thus, (36) leads to
[TABLE]
Denote , and . It satisfies that and due to (29). By the definition of , we obtain
[TABLE]
Let , then by (13), where was defined in (33). Since by Lemma 2, let us assume without loss of generality that . Choose a constant , where is the in/out-degree, and then denote
[TABLE]
Substituting into (33), the time derivative of satisfies
[TABLE]
where , the second inequality follows from (38) and the inequality of arithmetic and geometric means similarly to (33), and the last inequality follows from the definition of . The above manipulation provides a term containing in order to prove negative semi-definiteness of later.
Therefore, the time derivative of satisfies
[TABLE]
Replacing by in (17), we have
[TABLE]
where
[TABLE]
and by the definition of and condition (16).
Let us define as the stacked vector of the average value of , i.e., . Observe that for any vector , . In addition, , which implies that is orthogonal to all eigenvectors of associated with zero eigenvalues. Consequently, we have
[TABLE]
where the equality follows from , , and is the smallest nonzero eigenvalue of .
We can also observe that
[TABLE]
where the second equality follows from the Kronecker product and the last equality is due to (III-C). Consequently,
[TABLE]
where the first inequality follows from (40), (-B), the second inequality follows from the Young’s inequality with , the equality follows from (30), and the last inequality follows from (29). Observe that is negative definite if and , i.e., the following conditions hold,
[TABLE]
Choose , then the above conditions become and . Though , are functions of , it is obvious that when . Then there always exists a small enough such that the above conditions are satisfied and is negative definite.
Next, by calculations, , where is the smallest eigenvalue of . Then, by the exponential stability theorem [29, Theorem 4.10], we have , where
[TABLE]
due to (-B), and
[TABLE]
Recall that and if and only if due to (29). Then we finally obtain
[TABLE]
for any .
The cases for and can be proved similarly by taking and , respectively.
-C Proof of Lemma 4
To prove the nonsingularity, we first give some lemmas as follows.
Lemma 5**.**
Given a real matrix , the matrix is invertible if and only if is not an eigenvalue of .
Proof.
is invertible if and only if . In other words, the characteristic polynomial at is nonzero, i.e., . Since eigenvalues are the roots of the characteristic polynomial, it means that is not an eigenvalue of , or equivalent, is not an eigenvalue of . ∎
Lemma 6**.**
Let be the Laplacian matrix for a weight-balanced graph , be a diagonal matrix and , then the eigenvalues of or have non-negative real parts.
Proof.
Let us consider the case of . Since is diagonal, by direct calculation,
[TABLE]
Since and is weight-balanced, we have for all , implying that is diagonal dominant. By the Gershgorin circle theorem [43, Fact 4.10.16], the real parts of the eigenvalues of remain non-negative. The case of can be proved similarly. ∎
We are now ready to prove the nonsingularity.
Take without loss of generality. Recall that , by Lemma 2. The eigenvalues of have non-negative real parts by Lemma 6. In addition, , thus is not an eigenvalue of . By Lemma 5, is invertible and hence nonsingular.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Li, G. Chesi, and Y. Hong, “Input-feedforward-passivity-based distributed optimization over directed and switching topologies,” in 2019 IEEE 58th Conference on Decision and Control (CDC) . IEEE, 2019, pp. 6056–6061.
- 2[2] A. Nedić and A. Olshevsky, “Distributed optimization over time-varying directed graphs,” IEEE Transactions on Automatic Control , vol. 60, no. 3, pp. 601–615, 2015.
- 3[3] A. Nedić, A. Olshevsky, and W. Shi, “Achieving geometric convergence for distributed optimization over time-varying graphs,” SIAM Journal on Optimization , vol. 27, no. 4, pp. 2597–2633, 2017.
- 4[4] P. Xie, K. You, R. Tempo, S. Song, and C. Wu, “Distributed convex optimization with inequality constraints over time-varying unbalanced digraphs,” IEEE Transactions on Automatic Control , vol. 63, no. 12, pp. 4331–4337, 2018.
- 5[5] J. Wang and N. Elia, “Control approach to distributed optimization,” in 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton) . IEEE, 2010, pp. 557–561.
- 6[6] P. Yi, Y. Hong, and F. Liu, “Distributed gradient algorithm for constrained optimization with application to load sharing in power systems,” Systems & Control Letters , vol. 83, pp. 45–52, 2015.
- 7[7] M. Li, “Generalized lagrange multiplier method and KKT conditions with an application to distributed optimization,” IEEE Transactions on Circuits and Systems II: Express Briefs , vol. 66, no. 2, pp. 252–256, 2019.
- 8[8] X. Zeng, P. Yi, Y. Hong, and L. Xie, “Distributed continuous-time algorithms for nonsmooth extended monotropic optimization problems,” SIAM Journal on Control and Optimization , vol. 56, no. 6, pp. 3973–3993, 2018.
