Multi-agent estimation and filtering for minimizing team mean-squared error
Mohammad Afshari, Aditya Mahajan

TL;DR
This paper introduces the concept of team mean-squared error (MTMSE) for multi-agent estimation, deriving closed-form solutions and demonstrating improved performance over traditional methods in vehicle platoon scenarios.
Contribution
The paper develops the MTMSE framework for multi-agent estimation, providing closed-form solutions and recursive filtering algorithms that outperform existing methods.
Findings
MTMSE estimates outperform MMSE and consensus Kalman filtering.
Closed-form expressions for MTMSE are derived for linear systems.
Simulation shows significant improvement in vehicle distance estimation.
Abstract
Motivated by estimation problems arising in autonomous vehicles and decentralized control of unmanned aerial vehicles, we consider multi-agent estimation and filtering problems in which multiple agents generate state estimates based on decentralized information and the objective is to minimize a coupled mean-squared error which we call \emph{team mean-square error}. We call the resulting estimates as minimum team mean-squared error (MTMSE) estimates. We show that MTMSE estimates are different from minimum mean-squared error (MMSE) estimates. We derive closed-form expressions for MTMSE estimates, which are linear function of the observations where the corresponding gain depends on the weight matrix that couples the estimation error. We then consider a filtering problem where a linear stochastic process is monitored by multiple agents which can share their observations (with delay) over a…
| Info structure | Dimension of local info | Performance | ||
|---|---|---|---|---|
| IS0 : | 6 | 8 | 180.46 | |
| IS1 : | 3 | 4 | 193.72 | |
| IS2 : | 3 | 3 | 252.09 | |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Multi-agent estimation and filtering for minimizing team mean-squared
error
Mohammad Afshari, and Aditya Mahajan The authors are with the Department of Electrical and Computer Engineering, McGill University, Montreal, QC, H3A-0E9, Canada. Emails: [email protected], [email protected] research was supported by the Natural Science and Engineering Research Council of Canada (NSERC). A preliminary version of this paper was presented in the 2018 IEEE Conference on Decision and Control (CDC) [1].
Abstract
Motivated by estimation problems arising in autonomous vehicles and decentralized control of unmanned aerial vehicles, we consider multi-agent estimation and filtering problems in which multiple agents generate state estimates based on decentralized information and the objective is to minimize a coupled mean-squared error which we call team mean-square error. We call the resulting estimates as minimum team mean-squared error (MTMSE) estimates. We show that MTMSE estimates are different from minimum mean-squared error (MMSE) estimates. We derive closed-form expressions for MTMSE estimates, which are linear function of the observations where the corresponding gain depends on the weight matrix that couples the estimation error. We then consider a filtering problem where a linear stochastic process is monitored by multiple agents which can share their observations (with delay) over a communication graph. We derive expressions to recursively compute the MTMSE estimates. To illustrate the effectiveness of the proposed scheme we consider an example of estimating the distances between vehicles in a platoon and show that MTMSE estimates significantly outperform MMSE estimates and consensus Kalman filtering estimates.
I Introduction
Emerging applications in autonomous vehicles and decentralized control of UAVs (unmanned aerial vehicles) give rise to estimation problems where multiple agents use local measurements to estimate the state of the shared environment in which they are operating and then use these estimates to act in the environment. In the resulting decentralized estimation problems, the objective is to minimize the weighted mean-square error between the true state and the decentralized estimates generated by all agents. We call such a coupled mean-square error as team mean-squared error and the resulting estimates as minimum team mean-squared error (MTMSE) estimates.
For example, consider a platoon of self-driving vehicles where the estimation objective is to ensure that the position estimates of each vehicle are close to the true position of the vehicle and, at the same time, the difference between the position estimates of adjacent vehicles are close to the true difference between the positions. Or consider a fleet of UAVs (unmanned aerial vehicles) where the estimation objective is to ensure that the position estimates of each UAV are close to the true position of the UAV and, at the same time, the centroid of the estimates of all UAVs is close to the true centroid of their positions. A salient feature of these examples is that there are multiple agents who generate state estimates based on different information and the objective is to minimize a weighted mean-squared error between the true state and the decentralized estimates generated by all agents.
We first start with a simple example to illustrate that MTMSE estimates are different from the standard MMSE (minimum mean-squared error) estimates. Consider a system with two agents, indexed by , which observe the state of nature with noise. In particular, the measurement of agent is
[TABLE]
where , , and are independent.
Agent generates an estimate based on its local measurements, where is any arbitrary estimation strategy. The objective is to ensure that is close to and at the same time the average of the estimates is close to . Thus, the estimation error of the estimation strategy is given by
[TABLE]
where . Naively choosing as the MMSE estimate of given , i.e., choosing
[TABLE]
gives an estimation error of
[TABLE]
This naive strategy does not minimize the team mean-squared error given by (1), even within the class of linear estimation strategies. To see this, we identify the best linear estimation strategy. Let
[TABLE]
where is same for both agents due to symmetry. The estimation error for this linear strategy is
[TABLE]
which is convex in . The value of gain which minimizes this estimation error is
[TABLE]
where . The corresponding estimation error is
[TABLE]
Note that for large , and the relative improvement
[TABLE]
is significant for moderate values of . For example, for , the relative percentage improvement is 12.5%. A plot of the relative percentage improvement as a function of the variance for different values of is shown in Fig. 1.
The relative percentage improvement as a function of for different values of is shown in Fig. 1. The improvement is significant for higher values of .
This significant improvement over MMSE estimates for a simple example motivates the central question of this paper: what are the estimation and filtering strategies that minimize the team mean-squared error? We start by modeling and answering this question for estimation in Sec. II. Then, we model and answer this question for filtering, where we assume that agents are connected over a graph and can share their measurements over a communication graph in Sec. III. We generalize the filtering results to infinite horizon setup in Sec. III-F. Finally, we present examples to illustrate that MTMSE estimates significantly outperform MMSE and consensus Kalman filtering estimates.
I-A Literature overview
Following the seminal work of Kalman [2] on recursive MMSE filtering, several variations of single- and multi-agent MMSE filtering have been investigated in the literature. However, as far as we are aware, there are only two references which have investigated estimation or filtering for the MTMSE objective [3, 4]. Both references investigated multi-agent filtering of a continuous time linear stochastic process. In [3], each agent observes a noise corrupted measurement of the state and the objective is to minimize a specific form of team mean-squared error. The key idea of [3] is to consider an augmented state and observation model and formulate the team mean-square error as the squared norm of an appropriately defined inner product of these augmented variables. It is shown that team mean-squared filtering problem can be formulated as a Hilbert space mean-squared error filtering problem and, therefore, solved using an appropriate Kalman filter. The model considered in [4] is similar except that each agent has multiple observation channels and, at each time, can select which observation channel to use. The solution approach is similar to [3].
Although [3, 4] are able to transform a MTMSE filtering problem to a Hilbert space MMSE filtering problem, the approach has several limitations. First, and most importantly, the approach of [3, 4] is only applicable to a specific form of MTMSE cost. The formulation of the team mean-squared error as a squared norm of an appropriately defined inner product does not hold for the more general team mean-squared error considered in this paper. In particular, the form of the team mean-squared error considered in the practical examples in Sec. IV cannot be written as the squared norm of an appropriate inner product. Second, the size augmented state variables used in [3, 4] scales linearly with the number of agents. In particular, for a -agent MTMSE filtering where the state is of dimension , the augmented state (and therefore the augmented estimate) is of dimension . Thus the resulting Kalman filter needs to keep track of dimensional covariance matrix. In contrast, the solution that we propose only requires a Kalman filter with a dimensional error covariance. Finally, [3, 4] did not consider sharing of measurements among the agents. Such a sharing of measurements is a key feature of the general filtering model that we consider in this paper.
Estimation problems with coupling between the estimates have been considered in the economics literature [5, 6, 7]. However, in such models, agents are strategic and want to minimize an individual estimation objective. The solution concept is identifying estimation strategies which are in Nash equilibrium which is different from the solution concept of minimizing a common team estimation error considered here.
There is a rich literature on multi-agent filtering for distributed sensor fusion [8, 9, 10, 11, 12] as well as for distributed simultaneous localization and mapping (SLAM) in robotics [13, 14, 15]. There is also a rich literature on multi-agent estimation using consensus and gossip Kalman filters [16, 17, 18, 19, 20, 21] (and references therein). However, all these methods only consider MMSE filtering. As illustrated by motivating example presented at the beginning, MTMSE estimates can be significantly different from MMSE estimates. So, the vast literature on multi-agent MMSE filtering is not directly applicable for MTMSE filtering.
I-B Contributions of the paper
The salient feature of the model is that agents are informationally decentralized and need to cooperate to minimize a common team estimation objective. Our focus is to identify the structure of estimation strategies that find MTMSE when the graph topology, system dynamics, and the noise covariances are known to all agents.
We consider the problem of minimizing the team mean-squared error in an estimation problem where the measurements of the agents may be split into a common measurement and local measurements.111If no such split is possible, then the common measurement is simply empty. Using tools from team theory [22], we show that the optimal MTMSE estimate is a sum of two terms. The first term is the MMSE estimate of the state given the common measurement. The second term is a linear function of the innovation in the local measurement given the common measurement. Furthermore, the corresponding gains are computed by solving a system of matrix equation, which can be converted into a linear system of equations using vectorization.
We then consider the problem of minimizing the sum of team mean-squared errors over time in a filtering problem where the agents share their measurements with their neighbors over a completely connected communication graph. Since the graph is completely connected, the information available at each agent can be split into common information and local information. We show that the structure of the optimal MTMSE estimates identified in the estimation setup continue to hold for filtering as well. We setup an appropriate linear system with delayed observation to derive recursive formulas for the MMSE estimate of the state based on the common information and the innovation in the local measurements given the common measurements. We also derive recursive formulas for computing various covariances needed to compute the gain which multiplies the innovation term in the optimal estimates.
Finally, we show that under standard stabilizability and detectability conditions, a time-homogeneous estimation strategy is optimal for minimizing the long-term average team mean-squared error.
A preliminary version of this paper appeared in [1], where the main result for the filtering problem (Theorem 2) was stated. The proof of Theorem 2 relies heavily on the results for the estimation problem (Theorem 1) which was not included in [1]. Neither were the generalization to infinite horizon (Theorem 3). The detailed numerical experiments and the comparison with MMSE estimate and consensus Kalman filtering (Section IV), the detailed comparison with [3, 4] (Section I), the relation between the MTMSE estimates and decentralized control (Section V-B), and the trade-off between MTMSE filter complexity and estimation accuracy (Section V-C) are new as well.
I-C Notation
Let denote the Kronecker delta function (which is one if and zero otherwise). Given a matrix , denotes its -th element, denotes its -th row, denotes its -th column, A^{\mathchoice{\raisebox{0.0pt}{\displaystyle\intercal}}{\raisebox{0.0pt}{\textstyle\intercal}}{\raisebox{0.0pt}{\scriptstyle\intercal}}{\raisebox{0.0pt}{\scriptscriptstyle\intercal}}} denotes its transpose, denotes the column vector of formed by vertically stacking the columns of . Given a vector , denotes x^{\mathchoice{\raisebox{0.0pt}{\displaystyle\intercal}}{\raisebox{0.0pt}{\textstyle\intercal}}{\raisebox{0.0pt}{\scriptstyle\intercal}}{\raisebox{0.0pt}{\scriptscriptstyle\intercal}}}x. Given matrices and , denotes the matrix obtained by putting and in diagonal blocks, and denotes the Kronecker product of the two matrices. Given matrices and with the same number of columns, denotes the matrix obtained by stacking on top of . Given a squared matrix , denotes the sum of its diagonal elements. Given a symmetric matrix , the notation and mean that is positive definite and semi-definite, respectively. is a matrix with all elements being equal to one. is a square matrix with all elements being equal to zero. is the identity matrix. We omit the subscript from when the dimension is clear from context. We sometimes consider random vectors as a set with random elements . In particular, given two random vectors and , we define to mean . Similarly, we use to mean .
Given any vector valued process and any time instances , such that , is a short hand notation for . Given matrices with the same number of rows and vectors , and denote and , respectively.
Given random vectors and , and denote the mean and variance of while denotes the covariance between and .
II Minimum team mean-squared error (MTMSE) estimation
II-A Model and problem formulation
Consider a system with agents that are indexed by the set . The agents are interested in estimating the state of nature. Agent makes a local measurement , . In addition, all agents observe a common measurement, which we denote by . We use to denote the set .
The variables are assumed to be jointly Gaussian zero-mean random variables. For any , let and
Agent generates an estimate according to an estimation rule , i.e., . Given weight matrices and , where and , the performance is measured by the team estimation error given by:
[TABLE]
Let denote the estimate of all agents. The team estimation error is a weighted quadratic function of . In particular,
[TABLE]
where and are given by
[TABLE]
We assume that the matrix is positive definite.
We now present a few examples of the estimation error function of the form (3):
Suppose , where is the local state of agent . Suppose the agents want to estimate their own local state, but at the same time, want to make sure that the average of their estimates is close to the average of their local states. In this case, the team mean-squared error function is
[TABLE]
where . This can be written in the form (3) with , and
[TABLE] 2. 2.
Suppose the agents are moving in a line (e.g., a vehicular platoon) or in a closed shape (e.g., UAVs flying in a formation) and want to estimate their local state but, at the same time, want to ensure that the difference between their estimates is close to the difference of their local states.
For example when agents are moving in a line, the team mean-squared error function is
[TABLE]
where . This can be written in the form (3) with and
[TABLE]
A similar weight matrix can be obtained for the case when agents are moving in a closed shape. 3. 3.
Suppose each agent generates an estimate of the state of nature and the objective is to minimize
[TABLE]
This can be written in the form (3) with . This cost function is equivalent to the team mean-squared error considered in [3, 4].
We are interested in the following optimization problem.
Problem 1
Given the covariance matrices and and weight matrices and , choose the estimation strategy to minimize the expected team estimation error given by
[TABLE]
Remark 1
In Problem 1, the system model is common knowledge among all agents. Thus, it may be viewed as a problem of “centralized planning and decentralized execution.” The key conceptual difficulty in the problem is that the estimates are generated using different information (recall that the information available at agent is ) with the objective of minimizing a common coupled team estimation error given by (3). This feature makes the Problem 1 conceptually different from the standard estimation problem of minimizing the MMSE error. □
II-B Optimal team estimation strategy
We define three auxiliary variables:
- •
All agents’ common estimate of state given the common measurement at all agents. We denote this estimate by and it is equal to .
- •
All agents’ common estimate of agent ’s measurement given the common measurement . We denote this estimate by and it is equal to .
- •
The innovation in the local measurement of agent with respect to the common measurement. We denote this innovation and it is equal to .
Let denote the covariance and denote the covariance . From elementary properties of Gaussian random variables, we have the following:
Lemma 1
The covariance matrices defined above are given by
. 2. 2.
.
Therefore, the auxiliary variables defined above are given by
. 2. 4.
.
Furthermore, we have
*. * 2. 6.
.
□
The result follows from elementary properties of Gaussian random variables. Then, we have the following.
Theorem 1
The estimation strategy that minimizes the team mean-squared error in Problem 1 is a linear function of the measurements. Specifically, the MTMSE estimate may be written as
[TABLE]
where the gains satisfy the following system of matrix equations:
[TABLE]
If for all , then (9) has a unique solution which can be written as
[TABLE]
[TABLE]
Furthermore, the minimum team mean-squared error is given by
[TABLE]
where and □
The proof of Theorem 1 is presented in Appendix A.
To illustrate this result, consider the two agent example presented in the introduction. In that model, there is no common measurement. So , , and therefore . Moreover, and . Therefore,
[TABLE]
Thus, the optimal gains are
[TABLE]
where and the minimum team mean-squared error is
[TABLE]
Thus, we recover the results obtained by brute force calculations in the introduction.
Remark 2
In (8), the first term of the estimate is the MMSE estimate of the current state given the common measurements. The second term may be viewed as a “correction” which depends on the innovation in the local measurement. A salient feature of the result is that the gains depend on the weight matrix . □
Remark 3
When is block diagonal, there is no cost coupling among the agents and Problem 1 reduces to separate problems. Thus, the MMSE estimates are also the MTMSE estimates. □
III Minimum team mean-squared error (MTMSE) filtering
In this section, we consider the problem of filtering to minimize team mean-squared error when agents share information over a communication graph. We start with a quick overview of graph theoretic terminology.
III-A Overview of graph theoretic terminology
A directed weighted graph is an ordered set where is the set of nodes and is the set of ordered edges, and is a weight function. An edge in is considered directed from to ; is the in-neighbor of ; is the out-neighbor of ; and and are neighbors. The set of in-neighbors of , called the in-neighborhood of , is denoted by ; the set of out-neighbors of , called the out-neighborhood, is denoted by .
In a directed graph, a directed path is a weighted sequence of distinct nodes such that . The length of a path is the weighted number of edges in the path. The geodesic distance between two nodes and , denoted by , is the shortest weight length of all paths connecting the two nodes. The weighted diameter of the graph is the largest weighted geodesic distance between any two nodes. A directed graph is called strongly connected if for every pair of nodes , there is a directed path from to and from to . A directed graph is called complete if for every pair of nodes , there is a directed edge from to and from to .
III-B Model and problem formulation
III-B1 Observation Model
Consider a linear stochastic process , , where and for ,
[TABLE]
where is a matrix and , , is the process noise. There are agents, indexed by , which observe the process with noise. At time , the measurement of agent is given by
[TABLE]
where is a matrix and , , is the measurement noise. Eq. (13) may be written in vector form as
[TABLE]
where , , and .
The agents are connected over a communication graph , which is a strongly connected weighted directed graph with vertex set . For every edge , the associated weight is a positive integer that denotes the communication delay from node to node .
Let denote the information available to agent at time . We assume that agent knows the history of all its measurements and step delayed information of its in-neighbor , , i.e.,
[TABLE]
In (14), we implicitly assume that for any .
Let denote the new information that becomes available to agent at time . Then, and for ,
[TABLE]
It is assumed that at each time , agent , communicates to all its out-neighbors. This information reaches the out-neighbor of agent at time .
Some examples of the communication graph are as follows.
Example 1
Consider a complete graph with -step delay along each edge. The resulting information structure is
[TABLE]
which is the -step delayed sharing information structure [23]. □
Example 2
Consider a strongly connected graph with unit delay along each edge. Let denote the weighted diameter of the graph and denote the -hop in-neighbors of with . The resulting information structure is
[TABLE]
which we call the neighborhood sharing information structure. □
At time agent generates an estimate of (where is a matrix) according to
[TABLE]
where is a measurable function called the estimation rule at time . The collection is called the estimation strategy of agent and is the team estimation strategy profile of all agents.
III-B2 Estimation Cost
Let denote the estimate of all agents. As in Sec. II, we assume that the estimation error is a weighted quadratic function of of the form
[TABLE]
Examples of such estimation error functions were given in Sec. II-A.
III-B3 Problem Formulation
It is assumed that the system satisfies the following assumptions.
- (A1)
The cost matrix is positive definite. 2. (A2)
The noise covariance matrices are positive definite and and are positive semi-definite. 3. (A3)
The primitive random variables are independent. 4. (A4)
For any square root of matrix such that , is stabilizable. 5. (A5)
is detectable.
We are interested in the following optimization problem.
Problem 2** (Finite Horizon)**
Given matrices , , , , , , , a communication graph (and the corresponding weights ), and a horizon , choose a team estimation strategy profile to minimize given by
[TABLE]
Problem 3** (Infinite Horizon)**
Given matrices , , , , , and a communication graph (and the corresponding weights ), choose a team estimation strategy profile to minimize given by
[TABLE]
As was the case for the estimation problem presented in Sec. II, a salient feature of the model is that the estimates are generated using different information while the objective is to minimize a common coupled estimation error given by (16) or (17). This feature makes the Problems 2 and 3 conceptually different from the standard filtering problem of minimizing the MMSE error.
Remark 4
For Problem 2, the assumption that the dynamics, measurements, and cost are time-homogeneous is made simply for convenience of notation. As will be evident from the analysis, the results for Problem 2 generalize to the setting of time-varying dynamics, measurements, and cost in a natural manner. □
III-C Roadmap of the results
The main idea behind identifying a solution for Problem 2 is as follows. We observe that the choice of the estimates only affects the instantaneous estimation error but does not affect the evolution of the system or the estimation error in the future. Therefore, the problem of choosing an estimation profile to minimize is equivalent to solving the following separate optimization problems:
[TABLE]
Since the communication graph is strongly connected, the information available at agent can be written as , where
[TABLE]
is the common information among all agents (recall that is the weighted diameter of the communication graph) and
[TABLE]
is the location information at agent . Thus, we may view Problem (18) as an estimation problem with agents where agents have local and common information and, therefore, use the results of Sec. II to derive the MTMSE filtering strategy. To do so, we define variables which are equivalent to the auxiliary variables defined in Sec. II-B:
- •
All agents’ common estimate of state given the common information at all agents. We denote this estimate by and it is equal to .
- •
All agents’ common estimate of the local information at agent given the common information. We denote this estimate by and it is equal to .
- •
The innovation in the local information at agent with respect to the common information. We denote this innovation by and it is equal to .
Furthermore, we let denote the covariance and denote the covariance .
In order to use the results of Theorem 1, we need to derive expressions for recursively updating the above variables and covariances, which we do next.
III-D Recursive expressions for auxiliary variables and covariances
The information structure of the problem is effectively equal to -step delayed information structure [23]. To derive recursive expressions for auxiliary variables and covariances, we follow the central idea of [23] and express the system variables in terms of delayed state .
III-D1 Delayed state estimates and common estimates
We define
[TABLE]
as the delayed state estimate of the state and let
[TABLE]
denote the corresponding estimation error and denote the estimation error covariance. Note that is the one-step prediction estimate in centralized Kalman filtering and can be updated as follows. Start with and for , update
[TABLE]
is the Kalman gain. Furthermore, the error covariance can be pre-computed recursively using the forward Riccati equation: and for ,
[TABLE]
where .
Now, observe that we can compute the common estimate using a -step propagation of the delayed state estimate as follows:
[TABLE]
III-D2 Local estimates and local innovation
To find a convenient expression for local innovation , we express in terms of the delayed state . For that matter, for any , define the random vector as follows:
[TABLE]
where is the weighted accumulated process noise from time to time . Note that if or . For any , we may write
[TABLE]
By definition . Thus, for any , we can identify matrix and random vectors and (which are linear functions of and ) such that
[TABLE]
As an example, we write the expressions for for the delayed sharing and neighborhood sharing information structures below. For any , define
[TABLE]
Example 1 (cont.)
For the -step delayed sharing information structure . Thus, , , and . □
Example 2 (cont.)
For the neighborhood sharing information structure, Thus,
[TABLE]
□
Now, a key-result is the following.
Lemma 2
, , , and are independent. □
Proof
Observe that and are functions of the primitive random variables up to time , while and are functions of the primitive random variables from time onwards. Thus, and are independent of and . Furthermore, (A3) implies that and are independent of each other. Note that is the estimation error when estimating given and is, therefore, uncorrelated with . Since all random variables are Gaussian, and being uncorrelated also means that they are independent. ■
Combining Lemma 2 with (27), we get
[TABLE]
Combining this with (27), we get,
[TABLE]
III-D3 Covariances
Let denote and denote . Note that these can be computed from he expressions of and , which were derived earlier based on the communication graph.
Eq. (29) and Lemma 2 imply that
[TABLE]
where is computed using (22).
Furthermore, Eqs. (25) and (29) and Lemma 2 imply that
[TABLE]
where and is computed using (22).
III-E Main result for Problem 2
As mentioned in Sec. III-C, the problem of choosing the MTMSE estimation strategy to minimize is equivalent to solving separate estimation sub-problems given by (18). Based on Theorem 1, the MTMSE estimate of each of these sub-problems is given as follows.
Theorem 2
Under assumptions (A1)–(A3), the filtering strategy which minimizes the team mean-squared error in Problem 2 is a linear function of the measurements. Specifically, the MTMSE estimates at time may be written as
[TABLE]
where and are computed using (23) and (29). The gains satisfy the following system of matrix equations
[TABLE]
where and are computed using (30) and (31). Eq. (33) has a unique solution which can be written as
[TABLE]
where
[TABLE]
Furthermore, the minimum team mean-squared error is given by
[TABLE]
where and is given by
[TABLE]
and . □
Proof
The expressions for the MTMSE estimates (32) and the corresponding gains (33) follow immediately from Theorem 1. Now, since is positive definite (which is part of (A2)), standard results from Kalman filtering [24, Section 3.4] imply that is positive definite. Using this fact in (30) implies that is positive definite. Therefore, the vectorized formula (34) follows from Lemma 5.
The expression for the minimum team mean-squared error follow from an argument similar to that in the proof of Theorem 1. The expression for follows from (23) and (25). ■
Remark 5
Remark 2 about the structure of the MTMSE estimates continues to hold for filtering setup as well. The first term in the MTMSE estimate (32) is the MMSE estimate of the current state based on the common information. The second term is a “correction” which depends on the innovation in the local measurements. □
Remark 6
As in the estimation setup, the gains which multiply the innovation in (32) are coupled and depend on the weight matrix . □
Remark 7
Since we have assumed that the dynamics are time-homogeneous, the processes , , and are stationary. Hence, for , the covariance matrices , , , and are constant. □
Remark 8
Note that when . Therefore, when the weight matrix is sparse, as is the case for the cost (6), (and, therefore, and ) need to computed only for those for which . □
III-F Main result for Problem 3
Now, we consider the infinite horizon MTMSE filtering introduced in Problem 3, which can be thought of as a “steady-state” version of Sec. III-E. We first state a standard result from centralized Kalman filtering [24].
Lemma 3
Under (A2)–(A5), for any initial covariance , the sequence given by (21) is weakly increasing and bounded (in the sense of positive semi-definiteness). Thus it has a limit, which we denote by . Furthermore,
* does not depend on .* 2. 2.
* is positive semi-definite.* 3. 3.
* is the unique solution to the following algebraic Riccati equation.*
[TABLE]
where \bar{K}=\bar{P}C^{\mathchoice{\raisebox{0.0pt}{\displaystyle\intercal}}{\raisebox{0.0pt}{\textstyle\intercal}}{\raisebox{0.0pt}{\scriptstyle\intercal}}{\raisebox{0.0pt}{\scriptscriptstyle\intercal}}}\big{[}C\bar{P}C^{\mathchoice{\raisebox{0.0pt}{\displaystyle\intercal}}{\raisebox{0.0pt}{\textstyle\intercal}}{\raisebox{0.0pt}{\scriptstyle\intercal}}{\raisebox{0.0pt}{\scriptscriptstyle\intercal}}}+R\big{]}^{-1} and . 4. 4.
The matrix is asymptotically stable.
□
Recall from Remark 7 that , and are constants for . We denote the corresponding values for as , , and . Now define:
[TABLE]
Lemma 4
Under (A2)–(A5), we have the following:
. 2. 2.
. 3. 3.
**
□
Proof
All relations follow immediately from Lemma 3 and Remark 7. ■
Theorem 3
Under (A1)–(A5), the following time-homogeneous filtering strategy minimizes the team mean-squared error for Problem 3:
[TABLE]
where (which is same as (23)), is updated using the steady state version of (20) given by
[TABLE]
and the gains satisfy the following system of matrix equations:
[TABLE]
where and are given by (39) and (40). Eq. (43) has a unique solution and can be written more compactly as
[TABLE]
where
[TABLE]
Furthermore, the optimal performance is given by
[TABLE]
where is given by (38). □
The proof of Theorem 3 is presented in Appendix C.
IV Some illustrative examples
In this section, we present a few examples to illustrate the details of the main results.
IV-A Team mean-squared estimation in a UAV formation
Consider a UAV formation with agents as shown in Fig. 2. Let and denote the state of agent . For the ease of exposition, we assume that , which could correspond to say the altitude of the UAV. Let denote the state of the system, which evolves as
[TABLE]
where is a known matrix and . The agent observes the state with noise, i.e.,
[TABLE]
where .
The communication graph is as shown in Fig. 2, where each link is assumed to have delay 2. Thus, the information structure is given by
[TABLE]
The objective is to determine the MTMSE filtering for per-step estimation error given by (5), i.e., the agents want to estimate their local state and ensure that the average of the local state estimates is close to the average of their actual states.
We first show the computations of the MTMSE estimates. Observe that and
[TABLE]
Thus, and
[TABLE]
As argued in Remark 7, the covariance matrices , , , and are constant for . Thus, we only need to compute these for and . Note that the weight matrix is dense, so we do not get the computational savings described in Remark 8.
We have the following:
- •
and for , .
- •
and for , P^{\sigma}_{i}(t)=\begin{bmatrix}\mathbf{0}_{4\times 1}&QC^{\mathchoice{\raisebox{0.0pt}{\displaystyle\intercal}}{\raisebox{0.0pt}{\textstyle\intercal}}{\raisebox{0.0pt}{\scriptstyle\intercal}}{\raisebox{0.0pt}{\scriptscriptstyle\intercal}}}\end{bmatrix}.
- •
and for , P^{w}_{ij}(t)=\mathrm{diag}(0,C_{i}QC_{j}^{\mathchoice{\raisebox{0.0pt}{\displaystyle\intercal}}{\raisebox{0.0pt}{\textstyle\intercal}}{\raisebox{0.0pt}{\scriptstyle\intercal}}{\raisebox{0.0pt}{\scriptscriptstyle\intercal}}}).
- •
and .
- •
for and all .
Substituting these, we get that and for ,
[TABLE]
Substituting these in (33) or (34) gives us the optimal gains. The MTMSE estimates can then be computed using (32) as described in Sec. V-A.
We compare the performance of MTMSE filtering strategy with two baselines. The first is MMSE strategy where, each agent ignores the cost coupling and simply generates the MMSE estimates using
[TABLE]
It can be shown that performance of the MMSE strategy is
[TABLE]
Recall that for this particular example we have .
The second is a consensus based Kalman filter as described in [16]. We do not have a closed form expression for the weighted mean square error of the consensus Kalman filter, so we evaluate the performance using Monte Carlo evaluation averaged over sample paths.
For the numerical experiments we pick
[TABLE]
, and for , , where is a vector with only the element equal to one and the rest zero, , and .
The relative improvements
[TABLE]
of the MTMSE strategy compared to MMSE strategy and consensus Kalman filtering as a function of are shown in Fig. 3. These plots show that the MTMSE strategy outperforms the MMSE and consensus Kalman filtering strategies by up to a factor of 4 and 600 in the relative improvements for and . This improvement in performance will increase with the number of agents.
IV-B Team mean-squared estimation in a vehicular platoon
Now we consider a vehicular platoon with four agents shown in Fig. 4. As before, let denote the position of the platoon. We assume that the dynamics and the observation model are similar to that described in Sec. IV-A (but with different and matrices).
The communication graph is as shown in Fig. 4. Thus, the information structure is given by
[TABLE]
The objective is to determine the MTMSE filtering for per-step estimation error given by (6), i.e., the agents want to estimate their local states and ensure that the difference between the estimates of adjacent agents is close to difference between their actual states.
We first show the computations of the MTMSE estimates. Observe that and
[TABLE]
Similar to the previous example, the covariance matrices , , , and are constant for . Thus, we need to compute these for , , and . In addition, since the cost matrix is sparse, we only need to compute and for (see Remark 8). The details for computing are similar to the previous section and are omitted due to space limitations. The MTMSE estimates can be computed using (32) as described in Sec. V-A.
We compare the performance of MTMSE filtering strategy with the MMSE strategy and the consensus Kalman filtering as before.
For the numerical experiment in this part, we pick
[TABLE]
, , and .
The relative improvements as a function of are shown in Fig. 5. These plots show that the MTMSE strategy outperforms the MMSE and consensus Kalman filtering strategies by up to a factor of 2 and 800. Again, this improvement in performance will increase with the number of agents.
V Discussion of the results
V-A Implementation of MTMSE filtering strategy
In this section, we provide the details about implementing the MTMSE filtering strategies for both the finite and infinite horizon setups.
V-A1 Implementation of finite horizon MTMSE filtering strategy
Based on Theorem 2, the MTMSE filtering strategy can be implemented as follows.
Computing the gains
The gains are computed offline as follows. First the variance are computed using the forward Riccati equation (22). Then, the covariances and are computed for all . Thereafter, the gains are computed using (21) and the gains are computed using (34).
Finally, the gains and are stored in agent .
Computing the MTMSE estimates
Agent carries out the following computations to generate . First, it computes the delayed centralized estimate using (20). Then, it uses to compute and using (23) and (28), respectively. Then, it uses and to generate the MTMSE estimate as follows
[TABLE]
V-A2 Implementation of infinite horizon MTMSE filtering strategy
Based on Theorem 3, the MTMSE filtering strategy can be implemented as follows.
Computing the gains
The gains are computed offline as follows. First the variance is computed using the forward algebraic Riccati equation (37). Then, the covariances , , and are computed for all using (38)-(40). Thereafter, the gain is computed using Lemma 3 and the gain is computed using (44). Finally, the gains and are stored in agent .
Computing the MTMSE estimates
Agent carries out the following computations to generate . First, it computes the delayed centralized estimate using (42). Then, it uses to compute and using (23) and (28), respectively. Then, it uses and to generate the MTMSE estimate as follows
[TABLE]
V-B Connection to decentralized stochastic control
One of the most celebrated results in centralized stochastic control of linear systems with quadratic cost and Gaussian disturbance (so-called LQG setup) is the separation of estimation and control. In particular, the optimal control action is equal to a gain multiplied by the current state estimate. The computation of the gain matrix and the estimate are separated from each other. The gain matrix is computed based on the solution of a backward Riccati equation where the state estimates are updated based on the Kalman filtering equation (which is a forward Riccati equation). The forward and the backward Riccati equations are decoupled and can be solved separately.
These simplifications do not hold for decentralized control of LQG systems. In general, non-linear strategies may outperform the best linear strategies. Linear strategies are known to be optimal only for specific models [25, 26, 27, 28, 29, 30]. But in these cases there is no separation of estimation and control.
The results of this paper shed light on the lack of separation in decentralized control of LQG systems. We explain this in Appendix D using the example of decentralized stochastic control with one-step delayed information structure [31, 32, 26]. For this model, we show that the decentralized control problem is equivalent to a MTMSE filtering problem, where the weight matrix depends on the solution of a backward Riccati equation. As shown in Theorem 2, the gains for MTMSE filtering depends on the weight matrix in the cost function. That is the reason that the computation of the state estimate is not separated from the computation of the controller gains.
V-C Trade-off between filter complexity and estimation accuracy
For graphs with neighborhood sharing information structure, the dimension of and are proportional to the diameter of the graph. It is possible to trade-off the implementation complexity with the filtering accuracy by “shedding” information at each agent. We explain this via the example of Sec. IV-B.
We consider two approximate information structures for this example, which we denote by and . For both these information structures, the common information is the same as before, i.e.,
[TABLE]
But the local information is a subset of the original . In particular, we assume the following.
IS1: In the first approximation, each agent just uses the measurements from a time window of size two to “correct” the common information based estimate, i.e.,
[TABLE] 2. 2.
IS2: In the second approximation, each agent justs uses its local measurements to “correct” the common information based estimate, i.e.,
[TABLE]
For completeness, we refer to the original information structure as IS0. Note that , therefore any filtering strategy based on the approximate information structure can be implemented in the original information structure . The size of (and therefore ) for the different information structures is shown in Table I.
To compare the peformance of these three information structures, we note that the structure of the weight matrix implies that is a constant. So, we evaluate for large value of () and compare the performance of the three information structures. The results are also shown in Table I.
This example shows that it is possible to trade-off the complexity of the MTMSE filter with the estimation accuracy. Note that although the two approximate information structures are almost of the same size, IS1 has better performance than IS2. This is because IS1 uses some local infomration from the neighborhood nodes, while IS2 does not. This suggested that it is better to have some information from many agents rather than a lot of information from a few agents but a more detailed investigation is needed to quantify such a comparison.
VI Conclusion
In this paper, we investigate multi-agent estimation and filtering to minimize team mean-square error. We show that the MTMSE estimates are given by
[TABLE]
The first term of the estimate is the conditional mean of the current state given the common information. The second term may be viewed as a “correction” which depends on the “innovation” in the local measurements. A salient feature of this result is that the gains depend on the weight matrix . Using illustrative examples, we show that the MTMSE estimates significantly smaller team mean-squared error as compared to MMSE strategy and consensus Kalman filtering.
The results were derived under the assumptions that the state process is a linear stochastic process and the observation channels are linear and additive Gaussian noise. In future, we plan to investigate team estimation of general stochastic processes over general measurement channels, which will give rise to non-linear filtering equations.
Finally, our focus in this paper was to establish the structure of MTMSE filtering and filtering strategies. Having identified this structure, it is possible to implement the policy efficiently in a distributed manner. For example, for the infinite horizon setup, it is possible to use a consensus Kalman filter [16, 17, 18, 19, 20, 21] to keep track of the delayed state estimate and use distributed algorithms to solve the linear system of equations using distributed algorithms [33, 34, 35].
Appendix A Proof of Theorem 1
A-A A preliminary result
In order to compute the gains and the performance, we need to compute and .
Lemma 5
For any , and of compatible dimensions, the following matrix equation
[TABLE]
for unknown of compatible dimensions can be written in vectorized form as
[TABLE]
where , , and are as defined in Theorem 1. Furthermore, define and . If , , and , , then and thus invertible. Then, Eq. (48) has a unique solution that is given by
[TABLE]
□
The proof of Lemma 5 is presented in Appendix B.
A-B Proof of Theorem 1
The key observation behind the proof is that Problem 1 may be viewed as a MTMSE filtering problem [22], where agents observe different information and want to minimize a common estimation cost. For the ease of notation, for a given agent , we let and denote the strategy and estimates of all agents. Pick an agent , and fix the strategy of all the other agents. Then the expected cost from the point of view of agent is given by
[TABLE]
where the superscript in the expectation indicates that the cost depends on the strategy of agents other than .
A necessary condition for optimality is that agent is playing a best response to the strategy of all other players, i.e.,
[TABLE]
It is shown in [22, Theorem 4], that when is convex, (51) is also a sufficient condition for optimality.
From the dominated convergence theorem, we can interchange the order of derivative and expectation to get
[TABLE]
Substituting the above in (51), we get that a necessary and sufficient condition for a strategy to be team optimal is
[TABLE]
Note here that the superscript in highlights that the expectation depends on the choice of . There is no such dependence in . Thus, the strategy given by (8) is optimal if and only if
[TABLE]
or equivalently
[TABLE]
Note that from Lemma 1, we have
[TABLE]
Substituting the above and the expression for from Lemma 1 in (54), we get that the strategy given by (8) is optimal if and only if, for all ,
[TABLE]
Since the above should hold for all , the coefficient of must be identically zero. Thus, the strategy given by (8) is optimal if and only if
[TABLE]
Furthermore, Lemma 5 implies that when , then (55) has a unique solution given by (10).
Now for the minimum value of the estimation error, consider a single term of the estimation error
[TABLE]
where follows from substituting (8), uses Lemma 1, and uses the fact that for any matrices . Thus, the expected team estimation error is
[TABLE]
where follows from (56), and follows from (55). The result now follows from observing that
[TABLE]
where the first equality follows from \operatorname{Tr}(A^{\mathchoice{\raisebox{0.0pt}{\displaystyle\intercal}}{\raisebox{0.0pt}{\textstyle\intercal}}{\raisebox{0.0pt}{\scriptstyle\intercal}}{\raisebox{0.0pt}{\scriptscriptstyle\intercal}}}B)=\operatorname{vec}(A)^{\mathchoice{\raisebox{0.0pt}{\displaystyle\intercal}}{\raisebox{0.0pt}{\textstyle\intercal}}{\raisebox{0.0pt}{\scriptstyle\intercal}}{\raisebox{0.0pt}{\scriptscriptstyle\intercal}}}\operatorname{vec}(B).
Appendix B Proof of Lemma 5
By vectorizing both sides of (48) and using \operatorname{vec}(ABC)=(C^{\mathchoice{\raisebox{0.0pt}{\displaystyle\intercal}}{\raisebox{0.0pt}{\textstyle\intercal}}{\raisebox{0.0pt}{\scriptstyle\intercal}}{\raisebox{0.0pt}{\scriptscriptstyle\intercal}}}\otimes A)\times\operatorname{vec}(B), we get
[TABLE]
Substituting and , we get (49).
If , , and , , then [32, Lemma 1] implies that and thus invertible. Hence, Eq. (48) has a unique solution that is given by (50).
Appendix C Proof of Theorem 3
is the variance of the innovation in the standard Kalman filtering equation and by positive definiteness of is positive definite. Lemma 5 implies that (43) has a unique solution that is given by (44). To show the strategy (41) is optimal, we proceed in two steps. We first identify a lower bound in optimal performance and then show that the proposed strategy achieves that lower bound.
Step 1
From Theorem 2, for any strategy , we have that
[TABLE]
Taking limits of both sides and using Lemma 4 (which implies that and ), we get
[TABLE]
Step 2
Suppose is chosen according to strategy (44) and let denote . Following (56) and (57) in the proof of Theorem 1, we have that
[TABLE]
From Lemma 4, we have that
[TABLE]
Thus, by Cesaro’s mean theorem, we get Hence, the strategy (44) achieves the lower bound of (58) and is therefore optimal.
Appendix D One-step delayed observation sharing
D-A Problem statement
In this section, we use the result of Theorem 2 to show the relationship between MTMSE filtering and control in delayed observation sharing model [31, 32, 26]. The notation used in this section is self-contained and consistent with the standard notation used in decentralized stochastic control.
Consider a decentralized control system with agents, indexed by the set . The system has a state . The initial state and the state evolves as follows:
[TABLE]
where and are matrices of appropriate dimensions. , where is the control action chosen by agent , and , is an i.i.d. process with . Each agent observes a noisy version of the state given by
[TABLE]
where , , is an i.i.d. process with . This may be written in a vector form as
[TABLE]
where , , and .
Assumption 1: The primitive random variables are independent.
In addition to its local observation , each agent also receives the one-step delayed observations of all agents. Thus, the information available to agent is given by
[TABLE]
Therefore, agent chooses the control action as follows.
[TABLE]
where is the control laws of agent at time . The collection , where is called the control strategy of the system. The performance of any control strategy is given by
[TABLE]
where is symmetric positive semi-definite matrix, is symmetric positive definite matrix, and the expectation is with respect to the joint measure on the system variables induced by the choice of .
Problem 4
Given the system dynamics and the noise statistics, choose a control strategy to minimize the total cost given by (64).
Problem 4 is a decentralized stochastic control problem. In such problems there is no separation of estimation and control (see, for example [32]). We show that this lack of separation is due to the fact that the MTMSE filtering strategy depends on the weight matrix of the estimation cost.
D-B Equivalence to MTMSE filtering
We start with a basic property of linear quadratic models. Let denote the solution to the following backward Riccati equation. and for ,
[TABLE]
Define
[TABLE]
Then, we have the following.
Lemma 6
For any control strategy , define
[TABLE]
Then, a strategy that minimizes also minimizes . □
Proof
Following [36, Chapter 8, Lemma 6.1], we can show that the total cost can be written as
[TABLE]
The third term is equal to and the first two terms do not depend on the control strategy . Thus, and have the same argmin. ■
Now, we split the state into a deterministic part and a stochastic part as follows. and
[TABLE]
Since the system is linear, we have
[TABLE]
Note that is a function of the past control actions, which are known to all agents. Now, for any control strategy , define . Then, the cost may be written as
[TABLE]
The process is an uncontrolled linear stochastic process and the cost (67) is of of the same form as the weighted mean-square cost that we have considered in this paper.
Following [25], we define which may be considered as the control-free part of the information structure.
Lemma 7
For any strategy and any agent , is equivalent to , i.e., they generate the same sigma algebra. □
Proof
The result follows from a similar argument as given in [37, Chapter 7, Section 3]. ■
Since is equivalent to , we may assume that is chosen as a function of instead of . Thus, Problem 4 is equivalent to the following MTMSE filtering problem.
Problem 5
Suppose agents observe the linear dynamical system and share their observations over a one-step delayed sharing communication graph. Thus, the information available at agent is
[TABLE]
Agent chooses an estimate of according to an estimation strategy , i.e.,
[TABLE]
to minimize an estimation cost given by (67).
Problem 5 is a MTMSE filtering problem and can be solved using Theorem 2. One can then take the solution of Problem 5 and translate it back to Problem 4 as follows.
Theorem 4
Let be the optimal strategy for Problem 5, i.e.,
[TABLE]
where
[TABLE]
and the gains are computed as per Theorem 2. Define strategy as follows:
[TABLE]
i.e.,
[TABLE]
where . Then is the optimal strategy for Problem 4. □
Proof
The change of variables implies that if is an optimal strategy for Problem 5, then given by (69) is optimal for Problem 4.
To establish (70), we need to show that . Define, and . Then by Lemma 7 we have, is equivalent to , i.e., they generate the same sigma algebra. The rest of the proof follows from the definition of . We have
[TABLE]
where follows from state splitting and and follows from the fact that is a deterministic function of . ■
The main take away is as follows. By a simple change of variables we showed that the one-step delayed observation sharing problem is equivalent to a MTMSE filtering problem, where the weight matrix of the estimation cost depends on the backward Riccati equation for the cost function. The MTMSE filtering strategy depends on the weight matrix and that is the reason why there is no separation between estimation and control. Nonetheless, the optimal gains can be computed as follows.
Solve a Riccati equation to compute the weight functions and gains . 2. 2.
Solve a Kalman filtering equation (which does not depend on ) to compute the covariances and defined in Theorem 2. 3. 3.
Use , , , and to obtain the optimal gains by solving a system of matrix equations. 4. 4.
Using Theorem 4 above, we can write the optimal strategy in terms of and .
Acknowledgment
The authors are grateful to Peter Caines, Roland Malhame, and Demosthenis Teneketzis for useful discussion and feedback.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Afshari and A. Mahajan, “Team optimal decentralized state estimation,” in IEEE Conference on Decision and Control (CDC) . IEEE, Dec. 2018.
- 2[2] R. E. Kalman, “A new approach to linear filtering and prediction problems” transaction of the asme journal of basic,” 1960.
- 3[3] S. M. Barta, “On linear control of decentralized stochastic systems,” Ph.D. dissertation, Massachusetts Institute of Technology, 1978.
- 4[4] M. S. Andersland and D. Teneketzis, “Measurement scheduling for recursive team estimation,” Journal of Optimization Theory and Applications , vol. 89, no. 3, pp. 615–636, Jun 1996.
- 5[5] R. E. Lucas, “Expectations and the neutrality of money,” Journal of Economic Theory , vol. 4, no. 2, pp. 103–124, Apr 1972.
- 6[6] S. Morris and H. S. Shin, “Social value of public information,” The American Economic Review , vol. 92, no. 5, pp. 1521–1534, 2002.
- 7[7] F. Allen, S. Morris, and H. S. Shin, “Beauty contests and iterated expectations in asset markets,” Review of Financial Studies , vol. 19, no. 3, pp. 719–752, 2006.
- 8[8] C. Sanders, E. Tacker, and T. Linton, “A new class of decentralized filters for interconnected systems,” IEEE Trans. Autom. Control , vol. 19, no. 3, pp. 259–262, Jun 1974.
