Distributed Optimization for a Class of High-order Nonlinear Multi-agent Systems with Unknown Dynamics
Yutao Tang

TL;DR
This paper develops distributed adaptive control algorithms for high-order nonlinear multi-agent systems with unknown dynamics, enabling agents to collaboratively reach the global optimum despite nonlinear uncertainties.
Contribution
It introduces a novel embedded control design and adaptive controllers for high-order nonlinear agents, addressing unknown dynamics in distributed optimization.
Findings
Agents' outputs converge to the global optimal solution.
Estimated parameters converge to true values under persistence of excitation.
Algorithms are validated through simulation examples.
Abstract
In this paper, we study a distributed optimization problem for a class of high-order multi-agent systems with unknown dynamics. In comparison with existing results for integrators or linear agents, we need to overcome the difficulties brought by the unknown nonlinearities and also the optimization requirement. For this purpose, we employ an embedded control based design and first convert this problem into an output stabilization problem. Then, two kinds of adaptive controllers are given for these agents to drive their outputs to the global optimal solution under some mild conditions. Finally, we show that the estimated parameter vector converges to the true parameter vector under some well-known persistence of excitation condition. The efficacy of these algorithms was verified by a simulation example.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Distributed Optimization for a Class of High-order Nonlinear Multi-agent Systems with Unknown Dynamics
Yutao Tang ***This work was supported by National Natural Science Foundation of China under Grant 61503033. Y. Tang is with the School of Automation, Beijing University of Posts and Telecommunications, Beijing 100876, China (e-mail: [email protected]).
Abstract: In this paper, we study a distributed optimization problem for a class of high-order multi-agent systems with unknown dynamics. In comparison with existing results for integrators or linear agents, we need to overcome the difficulties brought by the unknown nonlinearities and also the optimization requirement. For this purpose, we employ an embedded control based design and first convert this problem into an output stabilization problem. Then, two kinds of adaptive controllers are given for these agents to drive their outputs to the global optimal solution under some mild conditions. Finally, we show that the estimated parameter vector converges to the true parameter vector under some well-known persistence of excitation condition. The efficacy of these algorithms was verified by a simulation example.
Keywords: Distributed optimization, unknown dynamics, embedded design, adaptive control, parameter convergence.
1 Introduction
Over the past few years, distributed optimization of multi-agent systems has become a hot topic due to the fast development of multi-robot networks, machine learning and big data technologies [1, 2, 3]. In a typical setting of this problem, each agent is assigned with a local cost function and the control objective is to propose distributed controls that guarantee a consensus on the optimal solution of the sum of all local cost functions. Many effective algorithms have been proposed to achieve this goal in different situations [4, 5, 6, 7, 8, 9].
Here, we follow this technical line but consider high-order continuous-time nonlinear agents unknown dynamics. While most of the existing works were only devoted to single-integrator agents, there are many distributed optimization tasks implemented by or depending on physical plants of continuous dynamics in practice, e.g. source seeking in multi-robot systems [10], attitude formation control of rigid bodies[11], optimal power dispatch over power networks [12]. Note that these physical dynamics can hardly be described well by single integrators. In fact, an example was given to show that the algorithms designed for single integrators might fail to achieve a desired performance for these high-order agents [13]. Thus, we have to take the high-order and possible nonlinear agents’ dynamics into account to achieve the distributed optimization goal.
In light of the optimization requirement for agents, the gradient-based closed-loop systems are basically nonlinear. Furthermore, the high-order feature of these agents brings many new difficulties to the associated analysis and design, which makes this problem much more challenging. In fact, very few optimization results have been obtained on this topic with continuous-time agents in the form of high-order dynamics. For example, a distributed algorithm for double integrators was proposed in [14] with an integral control idea and then extended to Euler-Lagrange agents. A class of nonlinear minimum phase agents in output feedback form with unity relative degree was considered by an internal-model design [15]. Distributed optimization problem with bounded controls was also explored for both single and double integrators [16] . However, distributed optimization of general high-order multi-agent systems is still far from being solved.
Recently, we proposed an embedded control scheme to solve this problem for general linear systems having well-defined vector relative degrees [17]. To overcome the difficulties brought by both high-order dynamics and possible nonlinearities from gradients, we divided the design into two main parts: optimal signal generator construction and reference tracker design. This embedded technique makes the control design carried out in a “separative” way, so as to simplify the whole design by almost independently tackling the optimal consensus problem for single integrators and output tracking problem of high-order agents.
Note that exact information of system matrices is required in this embedded design, which may be not available or with measurement errors in applications. This motivates us to investigate the distributed optimization problem for high-order nonlinear agents with unknown time-varying dynamics. Furthermore, we focus on the case when this unknown time-varying nonlinearity can be linearly parameterized. The objective of this paper is to extend existing embedded control results to this class of uncertain high-order nonlinear agents and achieve the distributed optimization goal.
In view of the aforementioned observations, the contribution of this paper is at least two-fold. Firstly, a distributed optimization problem was formulated and solved for a group of high-order nonlinear agents, which can be taken as an extended version of existing results for single integrators [2, 7]. Secondly, the embedded control technique proposed in [17] was further explored and extended to solve the distributed optimization problem of high-order multi-agent systems with unknown dynamics. By removing the requirement of knowing agents’ exact dynamics, this work can be taken as an adaptive extension to existing results for linear multi-agent systems [7, 16, 17] and nonlinear ones with unity relative degree [15]. Moreover, the obtained conclusions can be applied to achieve an output average consensus for these uncertain nonlinear agents, while only integrators or linear agents were considered in existing works [18, 19, 20].
The remainder of this paper is organized as follows. Problem formulation is presented in Section 2. Then the main result is presented in Sections 3 along with both stability analysis and parameter convergence. Following that, an example is given to illustrate the effectiveness of our algorithms in Section 4. Finally, concluding remarks are given in Section 5.
Notations: Let be the -dimensional Euclidean space. For a vector , denotes its Euclidean norm. (and ) denotes an -dimensional all-one (and all-zero) column vector. denotes an diagonal matrix with diagonal elements . for column vectors . A weighted directed graph (or digraph) is defined as follows, where is the set of nodes, is the set of edges, and is a weighted adjacency matrix [21]. denotes an edge leaving from node and entering node . The weighted adjacency matrix of this digraph is described by , where and ( if and only if there is an edge from agent to agent ). A path in graph is an alternating sequence of nodes and edges for . If there exists a path from node to node then node is said to be reachable from node . The neighbor set of agent is defined as for . A graph is said to be undirected if (). An undirected graph is said to be connected if there is a path between any two vertices. The weighted Laplacian of graph is defined as and .
2 Problem formulation
In this paper, we consider a collection of heterogeneous high-order nonlinear systems described by:
[TABLE]
where is the -th state variable of agent , , and are respectively the state, output, and input of agent . The function represents the unknown time-varying nonlinearities with uncertain parameter , which might result from modeling errors or external perturbations.
Associated with this multi-agent system, each agent is endowed with a differentiable local cost function . The global cost function is defined as the sum of local costs, i.e., . Moreover, we assume the local cost function is only known to agent itself and cannot be shared globally in the multi-agent network. Coupled with these nonlinear agents, we aim to design proper controllers such that the outputs of these agents asymptotically minimize the global cost function.
To clarify our following design, we focus on a class of systems in the form of (1) as follows.
Assumption 1**.**
For any , there exist a known basis function vector and an unknown parameter vector satisfying for all , and . Furthermore, can be uniformly bounded by smooth functions of .
Remark 1**.**
The unknown nonlinear dynamics of all agents are assumed to be linearly parameterized, which have been widely studied in both classical adaptive control and multi-agent coordination literature [22, 23, 24, 25]. In fact, a plenty of practical systems can be put into this form, which is general enough to cover integrators, Van der Pol systems, Duffing equations and many mechanical systems. Moreover, this time-varying feature can further be used to represent many typical external disturbances, e.g. constants and sinusoidal signals.
The following assumption is often made in convex optimization literature [26, 7, 27], which guarantees the existence and uniqueness of the optimal solution to problem (2).
Assumption 2**.**
For , the function is -strongly convex and its gradient is -Lipschitz for constants .
As usual, we assume this optimal solution is finite and denote it as , i.e.
[TABLE]
Due to the privacy of local cost function , no agent can gather enough information to determine the global optimal solution by itself. Hence, our problem cannot be solved without cooperation and information sharing among these agents.
For this purpose, we use a weighted undirected graph to describe the information sharing topology with note set . An edge between nodes and means that agent and agent can share information with each other. Suppose the following assumption is fulfilled as in many publications [7, 28, 14, 17, 9].
Assumption 3**.**
The graph is connected.
This assumption is about the connectivity of information sharing graph , which guarantees that any agent’s information can reach any other agents. Under this assumption, it is well-known that the associated Laplacian is positive semidefinite with null space spanned by [21].
The distributed optimization problem considered in this paper is readily formulated as follows.
Problem 1**.**
For given agents of the form (1), information sharing graph and local cost function , if possible, determine a distributed protocol for each agent by using its own local data and exchanged information with its neighbors such that
all the trajectories of agents are bounded over the time interval .
- 2)
their outputs of agents satisfy
[TABLE]
Remark 2**.**
The formulated problem can be taken as a combination of the well-studied topics: distributed optimization [2, 3, 7, 29] and output consensus [18, 30, 31]. Since an output consensus of the whole high-order multi-agent system must be achieved as the solution of a convex optimization problem in the formulated problem, it is certainly more challenging than the existing output consensus results for high-order agents.
Remark 3**.**
In contrast with existing distributed optimization works, the agents considered here are high-order and heterogeneous, while only single integrators are considered in many publications [4, 8, 32, 27]. Moreover, we study the case when these agents are with unknown dynamics, while the exact information of agents’ dynamics is required in [14] and [17].
Particularly, when the local cost function is chosen as , this formulation can solve an output average consensus problem for these high-order uncertain nonlinear agents and thus includes exiting results for integrators and linear systems as special cases [18, 19, 20].
As mentioned above, the main difficulty to solve the distributed optimization for these agents lies in the coupling of the high-order structure associated with agents’ dynamics and the global optimization requirement. To overcome this point, we adopt the embedded control scheme [17] and propose adaptive control laws to solve the distributed optimization problem for agent (1) in the following section.
3 Main Results
In this section, we first employ an embedded control approach to convert our problem into an output stabilization problem, and then propose distributed adaptive algorithms for these high-order nonlinear agents with unknown dynamics to achieve the optimization goal along with parameter convergence analysis.
3.1 Embedded Control Design
The embedded control approach was first proposed by Tang et al [17] to solve the distributed optimization problem for high-order linear agents. In this approach, an optimal signal generator must be constructed first by considering the same optimization problem (2) for a group of single-integrator agents, in order to asymptotically reproduce the optimal solution by a signal . Then, by taking as an output reference signal for agent , this generator is embedded in the feedback loop via some proper interfaces for original agents. In this way, the distributed optimization problem for general linear agents is divided into two simpler subproblems, i.e., construction of an optimal signal generator for single integrators and design of proper trackers for linear agents with output references, which can be independently solved in a modular way.
The first subproblem is essentially a conventional distributed optimization problem for single integrators and has been well-studied in existing literature [7, 8, 29]. To solve this problem, the following optimal signal generator was proposed for problem (2) in [17].
[TABLE]
The effectiveness of (4) was already proven in [17]. We repeat it as follows for completeness.
Lemma 1**.**
Suppose Assumptions 2 and 3 hold. Then, along the trajectory of system (4), approaches the optimal solution exponentially as for .
Suppose we know the analytical form of or at least , the optimal signal generator can be implemented independently to generate the minimizer of problem (2) as showed in this lemma. However, this requirement may cost too much in applications and even impossible in many practical circumstances. Thus, we follow an oracle-based description[33] of and consider the case when only the real-time gradient is available upon requests in our following design.
Furthermore, exact information of system matrices is required for the proposed controllers in [17]. This implies that the corresponding designs can not be directly used to solve our problem for uncertain agents in the form of (1). Thus, the distributed optimization problem considered in this paper is much more challenging than those addressed for integrators or linear agents in existing works [7, 4, 27, 8, 17].
To deal with these two issues, we adopt a certainty-equivalence design and propose an algorithm as follows:
[TABLE]
where is the estimation of uncertain vector , and the constants and function to be specified later. Here the constant is a tunable high-gain parameter to deal with the real-time gradient issue.
Remark 4**.**
Here, the variables constitute the local compensator of agent . It can be verified that this control is indeed distributed in the sense of only using the agents’ own local data and exchanged information with their neighbors. Moreover, from its nominal form (5), the above control is composed of two parts, where the last two subsystems can be taken as a modified version of optimal signal generator (4) with real-time gradients and the rest compose an adaptive tracking controller such that can track as goes to infinity in spite of those unknown dynamics, which confirms the embedded design methodology.
Under the above control law, the composite system is then:
[TABLE]
By letting , it can be further rewritten as:
[TABLE]
where , and A_{i}=\left[\begin{array}[]{c|c}0&I_{n_{x_{i}}-1}\\ \hline\cr k_{1i}&[k_{2i}\,\dots\,k_{n_{i}\,i}]\end{array}\right].
Note that the above system is almost in a singularly perturbed form except the adaption dynamics [34]. By letting , we have and . The resultant quasi-state-state model of the above composite system is exactly the optimal signal generator (4), which in turn guarantees as goes to infinity by Lemma 1.
Based on these observations, our formulated distributed optimization problem for agent (1) is converted to a decentralized output stabilization problem for the above system (7) with output . In other words, we need to determine proper function and constant such that all trajectories of (7) is bounded over and satisfying as goes to infinity.
3.2 Solvability Analysis
To solve our problem, we denote , , , , and . The whole composite system can be put into a compact form as follows.
[TABLE]
where , , , , , , is the Laplacian of , and the function is determined by .
We first choose gain constants such that the polynomial is Hurwitz for any . This implies that the following Lyapunov equation
[TABLE]
has a unique positive definite solution with compatible dimensions.
Inspired by existing Lyapunov-based designs [22, 24], we present the main result of this paper as follows.
Theorem 1**.**
Suppose Assumptions 1–3 hold. Then, the distributed optimization problem determined by (1) and (2) can be solved by controllers of the form (5) with for a small enough , where the constant is chosen as above for .
**Proof. ** The proof is mainly based on system composition techniques.
Step 1: consider the first two subsystems of (8). Let with . Its time derivative along the trajectory of the composite system (7) satisfies
[TABLE]
By Young’s inequality, it holds that
[TABLE]
for some known positive constants and .
Choose . We further have
[TABLE]
Step 2: consider the last two subsystems of (8), which can be rewritten as
[TABLE]
where the function is -Lipschitz in by Assumption 2.
By taking as perturbations, we let be the equilibrium point of unperturbed -system (i.e. when ). Note that the unperturbed system is exactly the optimal signal generator, it implies under Assumptions 2 and 3 by Lemma 1. In fact, the equilibrium point satisfies and . As a result, there exists a constant such that since the null space of is spanned by under Assumption 3. Then, one can obtain by , which implies that is an optimal solution of (2). By Assumption 2, this implies and .
Letting and gives
[TABLE]
Inspired by the proof of Lemma 1 in [17] , we let be a matrix satisfying , , and . Apparently, the matrix has a full column rank. Denote and perform a coordinate transform . The above system is equivalent to and
[TABLE]
Furthermore, the unperturbed system is globally exponential stable at the origin under Assumptions 2 and 3 by Lemma 1.
To apply the system composition arguments, one has to investigate the robustness of system (11). For this purpose, we let and apply the converse Lyapunov theorem (Theorem 4.15 in [34]) to the unperturbed subsystem, that is, there is a continuously differentiable Lyapunov function such that
[TABLE]
for some positive constants .
Along the trajectory of perturbed system (11), one can obtain:
[TABLE]
where we use the Lipschitzness of in and thus in .
Step 3: consider the stability of the composite system composed by the first two subsystems of (8) and system (11). Let with to be specified later. By using equalities (10) and (12), the derivative of with respect to along the whole composite system satisfies
[TABLE]
Note that and are both globally Lipschitz in their arguments and under Assumption 2. From (11), one can determine a known constant satisfying that
By Young’s inequality, we have
[TABLE]
Letting and gives that
[TABLE]
According to the LaSalle-Yoshizawa theorem (Theorem 2.1 in [22]), we have that all trajectories of the closed-loop system composed of (8) and (11) are bounded over the time interval and satisfy that . As immediate results, one can conclude the boundedness of , , and . Moreover, we can obtain that and , which implies that
[TABLE]
as . The proof is thus complete.
Remark 5**.**
Note that we consider uncertain high-order agents in the form of (1), which includes integrators as its special cases. Thus, the theorems can be taken as adaptive extensions of existing results with exact known dynamics[14, 19, 20, 17]. Moreover, many actuating disturbances can be represented in the form of (1), thus we provide different methods to achieve disturbance rejection from the internal model-based approach used in existing works [28, 15].
Particularly, when the analytical form of or is known to us, the optimal signal generator can be implemented independently. Following a similar proof, we can choose the gain parameter as any positive constant to solve our problem.
Theorem 2**.**
Suppose Assumptions 1–3 hold. Then, the distributed optimization problem determined by (1) and (2) can be solved by the following control
[TABLE]
where the constant is chosen as above and is arbitrary for .
Remark 6**.**
In some circumstances, we may further let with a positive definite matrix . This matrix is called an adaption gain in literature[35]. It can be used to achieve a fast adaption and then improve the transient performance of our controllers to solve the distributed optimization problem.
Remark 7**.**
Without further information of the unknown dynamics, the two controllers may fail in practical applications if there are external disturbances or noises in measurements of , although it is theoretically proved to achieve the optimization goal as goes to infinity. To tackle this problem, we can employ a -modification [35] for with sacrificing some accuracy in control performance as follows:
[TABLE]
where is a tunable parameter such that can be smaller than any desired positive constant.
3.3 Parameter Convergence
It has been shown that parameter convergence is essential in achieving robustness of the proposed adaptive controllers[35, 22, 36]. From the proof of Theorem 1, one can merely conclude that the estimator converges to some constant. In this subsection, we assert conditions under which will converge to its true value as tends to infinity.
For this purpose, we further assume the basis function satisfying the following condition.
Assumption 4**.**
For any , along the trajectory of the closed-loop system composed of (1) and (5) or (13), there exist positive constants , , such that the function is uniformly bounded and the following inequality is satisfied:
[TABLE]
Note that is ultimately bounded by Theorem 1, the boundedness of is not too strict. The inequality (15) is often known as the persistence of excitation (PE) condition, which has widely used in adaptive control literature [22, 24].
Theorem 3**.**
Suppose Assumptions 1–4 hold. Then, along the trajectory of system (1) under the controllers proposed in Theorems 1 and 2 , it holds that for .
**Proof. ** To show this theorem, we first claim that . By the proof of Theorem 1, we have . From the uniform boundedness of associated variables and Assumption 4, it follows that is also bounded. Using Barbalat’s lemma (Lemma 8.2 in [34]) to implies that , which confirms this claim.
Next, noting gives by Assumption 1. According to Lemma 1 in [37] or its proof, the two facts and provide us that under Assumption 4. The proof is thus complete.
Remark 8**.**
Since the unknown dynamics is linearly parameterized by Assumption 1, this theorem can be further modified and applied to any number of components in satisfying such a PE condition, and then used to address the parameter convergence problem in a more precise way. Specially, when the basis function is time-invariant, the -th component of is persistently excited if and guarantees the convergence of to as goes to infinity.
4 Simulations
In this section, we present a numerical example to illustrate our problem and the effectiveness of our designs.
Consider a multi-agent system including four controlled Van der Pol systems as follows.
[TABLE]
where with but unknown. The trajectories of the unforced system with different initial conditions when are depicted in Figure 1.
To make it more interesting, we further assume that agent is subject to an actuating disturbance described by
[TABLE]
where and .
The information sharing graph of this multi-agent system is depicted in Figure 2 with unity edge weights. The local cost functions are as follows.
[TABLE]
Denote . The agents are of the form (1). Note that for some constants , depending upon . By letting and , one can find that Assumption 1 is fulfilled. Moreover, both Assumptions 2 and 3 are also verified. Additionally, the optimal solution is by numerically minimizing the global cost function . According to Theorem 1, the distributed optimization problem for these agents can be solved by a controller of the form (5).
For simulations, we choose the parameters as , , , , and . The profiles of agents’ outputs are shown in Figure 3. Satisfactory performance is observed. For parameter convergence, we have , which is time-invariant and satisfies that . By some calculations, one can also determine that
[TABLE]
Using Theorem 3, we can conclude that the estimators , , will converge to their true values, while may fail. This conclusion is confirmed by Figures 4 and 5.
5 Conclusions
A distributed optimization problem was formulated for a class of high-order nonlinear systems with unknown dynamics. By using an embedded control scheme, we proposed distributed adaptive controls to solve this problem under standard assumptions and parameter convergence was also addressed. Output feedback control with directed information sharing graphs will be our future work.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. C. Derenick and J. R. Spletzer, “Convex optimization strategies for coordinating large-scale robot formations,” IEEE Trans Robot , vol. 23, no. 6, pp. 1252–1259, 2007.
- 2[2] A. Nedić, A. Ozdaglar, and P. Parrilo, “Constrained consensus and optimization in multi-agent networks,” IEEE Trans Automat Control , vol. 55, no. 4, pp. 922–938, 2010.
- 3[3] S. Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein, “Distributed optimization and statistical learning via the alternating direction method of multipliers,” Found Trends Mach Learn , vol. 3, no. 1, pp. 1–122, 2011.
- 4[4] G. Shi, K. H. Johansson, and Y. Hong, “Reaching an optimal consensus: dynamical systems that compute intersections of convex sets,” IEEE Trans Automat Control , vol. 58, no. 3, pp. 610–622, 2013.
- 5[5] M. Zhu and S. Martínez, “On distributed convex optimization under inequality and equality constraints,” IEEE Trans Automat Control , vol. 57, no. 1, pp. 151–164, 2012.
- 6[6] D. Yuan, S. Xu, and J. Lu, “Gradient-free method for distributed multi-agent optimization via push-sum algorithms,” Int J Robust Nonlin , vol. 25, no. 10, pp. 1569–1580, 2015.
- 7[7] S. S. Kia and S. Cortés, J nd Martínez, “Distributed convex optimization via continuous-time coordination algorithms with discrete-time communication,” Automatica , vol. 55, pp. 254–264, 2015.
- 8[8] S. Yang, Q. Liu, and J. Wang, “A multi-agent system with a proportional-integral protocol for distributed constrained optimization,” IEEE Trans Automat Control , vol. 62, no. 7, pp. 3461–3467, 2017.
