Distributed Adaptive Gradient Optimization Algorithm
Peng Lin, Wei Ren

TL;DR
This paper introduces two distributed adaptive gradient algorithms for multi-agent systems to optimize convex functions, demonstrating convergence through Lyapunov analysis and numerical validation.
Contribution
It presents novel adaptive algorithms that utilize relative information to enhance distributed convex optimization in multi-agent systems.
Findings
Algorithms successfully minimize convex objectives over time.
Convergence proven using Lyapunov functions and system analysis.
Numerical examples validate theoretical results.
Abstract
In this paper, a distributed optimization problem with general differentiable convex objective functions is studied for single-integrator and double-integrator multi-agent systems. Two distributed adaptive optimization algorithm is introduced which uses the relative information to construct the gain of the interaction term. The analysis is performed based on the Lyapunov functions, the analysis of the system solution and the convexity of the local objective functions. It is shown that if the gradients of the convex objective functions are continuous, the team convex objective function can be minimized as time evolves for both single-integrator and double-integrator multi-agent systems. Numerical examples are included to show the obtained theoretical results.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Mathematical and Theoretical Epidemiology and Ecology Models · Adaptive Dynamic Programming Control
Distributed Adaptive Gradient Optimization Algorithm
††thanks: This work was supported by the National Science Foundation under Grant ECCS-1307678 and ECCS-1611423, and the National Natural Science Foundation of China (61203080,61573082,61528301).
Peng Lin and Wei Ren Peng Lin is with the School of Information Science and Engineering, Central South University, Changsha 410083, China. Wei Ren is with the Department of Electrical and Computer Engineering, University of California, Riverside, CA92521, USA. E-mail: lin[email protected], [email protected].
Abstract
In this paper, a distributed optimization problem with general differentiable convex objective functions is studied for single-integrator and double-integrator multi-agent systems. Two distributed adaptive optimization algorithm is introduced which uses the relative information to construct the gain of the interaction term. The analysis is performed based on the Lyapunov functions, the analysis of the system solution and the convexity of the local objective functions. It is shown that if the gradients of the convex objective functions are continuous, the team convex objective function can be minimized as time evolves for both single-integrator and double-integrator multi-agent systems. Numerical examples are included to show the obtained theoretical results.
Keywords: Optimization, Consensus, Distributed Adaptive algorithm
I Introduction
As an important branch of distributed control, distributed optimization has attracted more and more attention from the control community [1, 2, 6, 7, 5, 8, 9, 3, 4, 10, 11, 12, 13, 14, 15, 16]. The aim is to use a distributed approach to minimize a team optimization function composed of a sum of local objective functions where each local objective function is known to only one agent. In the past few years, researchers have obtained many results about distributed optimization problems from different perspectives. For example, based on gradient descent method, articles [1, 2, 3, 4, 5] studied distributed optimization problems with and without state constraints, while by introducing a dynamic integrator, articles [11, 12, 13] investigated distributed optimization problems for general strongly connected balanced directed graphs. Recently, some researchers turned their attention to try to solve the distributed optimization problem from a view point of nonsmooth approaches. For example, article [14] proposed several algorithms using nonsmooth functions to solve the distributed optimization problem with the consideration of finite-time consensus optimization convergence. Also, articles [5, 16] introduced adaptive algorithms using nonsmooth functions to solve the distributed optimization problem for general differentiable convex functions or general linear multi-agent systems. However, in [14, 5, 16], it is required that the gradients or subgradients of the local objective functions be bounded or a period of the previous information should be used for each agent.
To this end, we will continue the work of [5] to study the distributed optimization problem for general differentiable objective function using nonsmooth functions. Two distributed adaptive optimization algorithm is introduced which uses the relative information to construct the gain of the interaction term. The analysis is performed based on the Lyapunov functions, the analysis of the system solution and the convexity of the local objective functions. It is shown that if the gradients of the convex objective functions are continuous, the team convex objective function can be minimized as time evolves for both single-integrator and double-integrator multi-agent systems.
Notations. denotes the set of all dimensional real column vectors; denotes the index set ; denotes the th component of the vector ; denotes the transpose of the vector ; denotes the Euclidean norm of the vector ; denotes the differential operator with respect to ; denotes the gradient of the function at ; denotes a component-wise sign function of ; and denotes the projection of the vector onto the closed convex set , i.e., .
II Preliminaries
In this section, we introduce preliminary results about graph theory and convex functions (see [19, 17, 5]).
Consider a multi-agent system consisting of agents. Each agent is regarded as a node in an undirected graph of order where is the set of nodes, is the set of edges, and is the weighted adjacency matrix. An edge of denotes that agents and can obtain information from each other. The weighted adjacency matrix is defined as and if and otherwise. The set of neighbors of node is denoted by The Laplacian of the graph , denoted by , is defined as and for all . A path is a sequence of edges of the form , where . The graph is connected, if there is a path from every node to every other node.
Lemma 1**.**
[19] If the graph is connected, then its Laplacian has a simple eigenvalue at [math] with associated eigenvector 1 and all its other eigenvalues are positive and real. **
Lemma 2**.**
[17] Let be a differentiable convex function. is minimized if and only if .
Lemma 3**.**
[5] Under Assumption 1, all and are nonempty closed bounded convex sets for all .
III Distributed Optimization Problem
Suppose that each agent has the following dynamics
[TABLE]
where is the state of agent , and is the control input of agent . Our objective is to use only local information to design for all agents to cooperatively solve the following optimization problem
[TABLE]
Assumption 1**.**
[5] Each set X_{i}\triangleq{\big{\{}}s{\big{|}}\nabla f_{i}(s)=0{\big{\}}} is nonempty and bounded.
Assumption 2**.**
[5] The length of the time interval between any two contiguous switching times is no smaller than a given constant, denoted by .
IV Main Results
IV-A Single-Integrator Multi-Agent Systems
In this section, we design a distributed adaptive algorithm for (1) to solve the optimization problem (4) for general convex local objective functions. The algorithm is given by
[TABLE]
for all . In (10), the role of the term, , is to make all agents converge to a consensus point, while the second term, , is the negative gradient of which is used to minimize .
Remark 1**.**
As algorithm (10) uses the sign functions that is nonsmooth, the system (1) using (10) would be discussed in the Filippov sense [18].
Theorem 1**.**
Suppose that the graph is undirected and connected for all , is continuous with respect to for all and Assumptions 1 and 2 hold. For system (1) with algorithm (10), all agents reach a consensus in finite time and minimize the team objective function (4) as . **
Proof: First, we prove that all are bounded for all . Under Assumption 1, from Lemma 3, we have that all and are nonempty closed bounded convex sets for all . It is clear that , and for all and some closed bounded set . Let be sufficiently large for any and all such that for all . Since , from the convexity of the function , we have .
Construct a Lyapunov function candidate as for some . Calculating along the solutions of system (1) with (10), we have
[TABLE]
Since the graph is undirected, it follows that
[TABLE]
From the convexity of the function , we have . It follows that . If for some , we have for all and hence . This implies that all remain in . Note that each is continuous with respect to for all , and is bounded. Thus, for all and some constant .
Next, we prove that all agents reach a consensus as . Let denote the contiguous switching times for all such that for some two integers and all and for all and all . Suppose that consensus is not reached as and . It is clear that . Moreover, from the dynamics of , we have that for some constant . Since each is bounded and for all and all , where denotes the time just before , is bounded for all and hence for all . That is, consensus is reached as , which yields a contradiction. Suppose that . Similar to the proof of Theorem 2 in [5], it can be proved that all agents reach a consensus in finite time.
Summarizing the above analysis, consensus can be reached as . Let . Note that each is continuous with respect to . There is a constant for any such that and for all . Recall that . Consider the Lyapunov function candidate for . Calculating , we have
[TABLE]
Note that when , . It follows that there exists a constant such that for . Since can be arbitrarily small, it follows that . It follows from Lemma 2 that the team objective function (4) is minimized as .
Remark 2**.**
In [5], a distributed algorithm was proposed to solve the optimization problem. However, it is required that a period of the previous information should be used for each agent. In contrast to [5], in this paper, the previous information is not used and the current information is sufficient for the proposed algorithm to make all agents minimize the team objective function as time evolves.
IV-B Double-Integrator Multi-agent Systems
In this part, our goal is to extend the results in Subsection A to second-order multi-agent systems with the following dynamics
[TABLE]
where and are the position and velocity states of agent and is the control input. To solve the distributed optimization problem, we use the following algorithm
[TABLE]
where is the feedback damping gain of the agents.
Let . The system (30) with (36) can be written as
[TABLE]
For convenience of expression, we assume in the proof of the following theorem.
Theorem 2**.**
Suppose that the graph is undirected and connected for all , is continuous with respect to for all and Assumptions 1 and 2 hold. For system (1) with algorithm (10), all agents reach a consensus in finite time and minimize the team objective function (4) as . **
Proof: Construct a Lyapunov function candidate as for some . Let , , and be a matrix with each entry [\Phi(t)]_{ij}=\left\{\begin{array}[]{lll}-\sum_{k=1,k\neq i}^{n}[\Phi(t)]_{ik},\mbox{if}~{}i=j,\\ -\frac{q_{ij}(t)}{2\|x_{j}(t)-x_{i}(t)\|},\mbox{if}~{}i\neq j~{}\mbox{and}~{}(i,j)\in\mathcal{E}(\mathcal{G}(t))\\ 0,\mbox{otherwise}.\end{array}\right. Regarding and as the Laplacians of some certain undirected graphs, it follows from Lemma 1 that and .
Calculating , we have
[TABLE]
where the first inequality uses the convexity of . Then by a similar approach to the proof of Theorem 1, it can be proved that all and remain in a bounded closed convex set, denoted by , for all such that , and for all . Note that each is continuous with respect to . Thus, for all and some constant .
Next, we prove that all agents reach a consensus as . Let denote the contiguous switching times for all such that for some two integers and all and for all and all . Suppose that consensus is not reached as and . It is clear that . Moreover, from the dynamics of , we have that for some constant . Note that for all and for all and all , where denotes the time just before . Hence for all . That is, for all . Since for all and all , it follows from the dynamics of each agent that for all and all . Since and for all , it follows that each is bounded for all . Hence for all .
Recall that for all and all . Clearly, for all . Since and for all , is bounded for all . Since , is bounded. Thus, is bounded for all . Thus, is also bounded. This means that . By a similar approach to prove that for all , using the continuity of , it can be proved that for all . It follows from the definition of that for all .
Suppose that . Then from the dynamics of , there must exist a pair of agents, denoted by , such that . In the following, we prove that there exist a pair of agents, denoted by , such that , and . If this is not true, we have and for some constant , all and all with and . Since , there exists a sufficiently large constant for any such that for all . By simple calculations based on (39), when and for , we have . When there exist at least an agent such that and for and either or holds, we have for . Let denote the contiguous switching times for all such that the case holds when and for all and the case when there exist at least an agent such that and for and either or holds for all and all . Note that . Calculating based on the Newton’s Law, we have that
[TABLE]
Since , from the dynamics of , we have and hence from (45) we have . That is, there exist a pair of agents such that , and . Similarly, it can be proved that there exist a pair of agents such that , and . By analogy, it can be proved that for all . Then there is a constant such that is far larger than for all and all . Since for all , it follows from (39) that is far smaller than for and all . Adopting a group of agents such that and for all and where denotes the operator of the convex closure. It is clear that for all . If for some , we have . This yields a contradiction. Thus, .
Based on the above analysis, using a approach similar to the proof of Theorem 1, the team objective function (4) is minimized as .
V Simulations
Consider a multi-agent system consisting of 8 agents in a plane. The communication graph is switched among the connected subgraphs of the graph in Fig. 1. The local objective functions are , and , where and denote the two components of . By simple calculations, when , we have that . From Lemma 2, the minimum set of the team objective function (4) is . The simulation results are shown in Figs. 2 and 3. It can be observed that the team objective function (4) is minimized as , which are consistent with Theorems 1 and 2.
VI Conclusions
In this paper, a distributed optimization problem with general differentiable convex objective functions was studied for single-integrator and double-integrator multi-agent systems. Two distributed adaptive optimization algorithm was introduced by using the relative information to construct the gain of the interaction term. The analysis was performed based on the Lyapunov functions, the analysis of the system solution and the convexity of the local objective functions. It was shown that if the gradients of the convex objective functions are continuous, the team convex objective function can be minimized as time evolves for both single-integrator and double-integrator multi-agent systems.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Nedi c ´ ´ c \acute{\mathrm{c}} , A. Ozdaglar, P. A. Parrilo, “Constrained consensus and optimization in multi-agent networks”, IEEE Transactions on Automatic Control, vol. 55, no. 4, pp.922-938, 2010.
- 2[2] A. Nedi c ´ ´ c \acute{\mathrm{c}} , and A. Ozdaglar, “Distributed Subgradient Methods for Multi-agent Optimization”, IEEE Transactions on Automatic Control, vol. 54, no. 1, pp. 48-61, 2009.
- 3[3] G. Shi, K. H. Johansson and Y. Hong, “Reaching an Optimal Consensus: Dynamical Systems That Compute Intersections of Convex Sets”, IEEE Transactions on Automatic Control, vol. 58, no. 3, pp. 610-622, 2013.
- 4[4] S. Liu, Z. Qiu and L. Xie, “Continuous-Time Distributed Convex Optimization with Set Constraints”, in proceedings of the IFAC World Congress, vol. 29, pp. 9762-9767, 2014.
- 5[5] P. Lin, W. Ren and J. A. Farrell, “Distributed continuous-time optimization: nonuniform gradient gains, finite-time convergence, and convex constraint set,” IEEE Transactions on Automatic Control, 2017, available online.
- 6[6] M. Zhu and S. Mart ı ´ ´ ı \mathrm{\acute{{\mathrm{{{\imath}}}}}} nez, “An Approximate Dual Subgradient Algorithm for Multi-Agent Non-Convex Optimization”, IEEE Transactions on Automatic Control, vol. 58, no. 6, pp. 1534-1539. 2013.
- 7[7] B. Johansson, T. Keviczky, M. Johansson, and K. H. Johansson, “Subgradient methods and consensus algorithms for solving convex optimization problems,” in Proceedings of IEEE Conference on Decision and Control, 2008, pp. 4185-4190.
- 8[8] J. Lu, C. Y. Tang, P. Regier and T. D. Bow, “Gossip Algorithms for Convex Consensus Optimization Over Networks,” IEEE Transactions on Automatic Control, vol. 56, no. 12, pp. 2917-2923. 2011.
