Decentralized Decision-Making Over Multi-Task Networks
Sahar Khawatmi, Abdelhak M. Zoubir, Ali H. Sayed

TL;DR
This paper introduces a distributed algorithm enabling agents in multi-task networks to collaboratively select a common objective, improving network performance through localized interactions in both static and mobile settings.
Contribution
It proposes a novel decentralized decision-making method for multi-task networks with agents observing diverse data models.
Findings
Agents successfully reach agreement on objectives through local interactions.
The approach enhances network performance in static and mobile scenarios.
Simulations validate the effectiveness of the proposed strategies.
Abstract
In important applications involving multi-task networks with multiple objectives, agents in the network need to decide between these multiple objectives and reach an agreement about which single objective to follow for the network. In this work we propose a distributed decision-making algorithm. The agents are assumed to observe data that may be generated by different models. Through localized interactions, the agents reach agreement about which model to track and interact with each other in order to enhance the network performance. We investigate the approach for both static and mobile networks. The simulations illustrate the performance of the proposed strategies.
| 2 | 3 | 4 | 5 | |
|---|---|---|---|---|
| Success rate | 99% | 98% | 99% | 99% |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy Efficient Wireless Sensor Networks · Opportunistic and Delay-Tolerant Networks · Mobile Ad Hoc Networks
Decentralized Decision-Making Over
Multi-Task Networks
Sahar Khawatmi, Abdelhak M. Zoubir111Member of the European Association for Signal Processing (EURASIP).
Technische Universität Darmstadt, Signal Processing Group
64283 Darmstadt, Germany
Email: {khawatmi, zoubir}@spg.tu-darmstadt.de
Ali H. Sayed222Member of the European Association for Signal Processing (EURASIP).
Ecole polytechnique fdrale de Lausanne EPFL, Adaptive Systems Laboratory
CH-1015 Lausanne, Switzerland
Email: [email protected]
Abstract
In important applications involving multi-task networks with multiple objectives, agents in the network need to decide between these multiple objectives and reach an agreement about which single objective to follow for the network. In this work we propose a distributed decision-making algorithm. The agents are assumed to observe data that may be generated by different models. Through localized interactions, the agents reach agreement about which model to track and interact with each other in order to enhance the network performance. We investigate the approach for both static and mobile networks. The simulations illustrate the performance of the proposed strategies.
keywords:
Decentralized processing, decision-making, multi-task networks, adaptive learning.
††journal: EURASIP signal processing journal
1 Introduction and Related Work
Bio-inspired systems are designed to mimic the behavior of some animal groups such as bee swarms, birds flying in formation, and schools of fish [1, 2, 3, 4, 5, 6]. Diffusion strategies can be used to model some of these coordinated types of behavior, as well as solve inference and estimation tasks in a distributed manner over networks [7, 8]. We may distinguish between two types of networks: single-task and multi-task networks. In single-task implementations [7, 8], the networks consist of agents that are interested in the same objective and sense data that are generated by the same model. An analogy would be a school of fish tracking a food source: all elements in the fish school sense distance and direction to the same food source and are interested in approaching it. On the other hand, multi-task networks [9, 10, 11, 12, 13, 14, 15, 16, 17] involve agents sensing data arising from different models and different clusters of agents may be interested in identifying separate models. A second analogy is a school of fish sensing information about multiple food sources.
In the latter case, agents need to decide between the multiple objectives and reach agreement on following a single objective for the entire network. In the earlier works [18, 19], a scenario was considered where agents were assumed to sense data arising from two models, and a diffusion strategy was developed to enable all agents to agree on estimating a single model. The algorithm developed in [18] relied on binary labeling and is applicable only to situations involving two models. In this work, we propose an approach for more than two models.
We consider a distributed mean-square-error estimation problem over an -agent network. The connectivity of the agents is described by a graph (see Fig. 1). Data sensed by any particular agent can arise from one of different models. The objective is to reach an agreement among all agents in the network on one common model to estimate. Two definitions are introduced: the observed model, which refers to the model from which an agent collects data, and the desired model, which refers to the model the agent decides to estimate. The agents do not know which model generated the data they collect; they also do not know which other agents in their neighborhood sense data arising from the same model. Therefore, each agent needs to determine the subset of its neighbors that observes the same model. This initial step is referred to as clustering. Since the decision-making objective depends on the clustering output, errors made during the clustering process have an impact on the global decision. In this work, we rely on the clustering technique proposed in[20] to reduce this effect.
The paper is organized as follows. The network and data model are described in Section II. We illustrate the local labeling system and the decision-making algorithm in Sections III and IV, respectively. A special case when the entire network follows the model of a specific agent is studied in Section V. Simulation results and discussion are presented in Sections VI and VII, respectively.
Notation. We use lowercase letters to denote vectors, uppercase letters for matrices, plain letters for deterministic variables, and boldface letters for random variables. denotes the expectation operator and the Euclidean norm. The symbols and denote the all-one vector and identity matrix of appropriate sizes, respectively. The th row (column) of matrix is denoted by ().
2 Network and Data Model
Consider a collection of agents distributed in space, as illustrated in Fig. 1. We represent the network topology by means of an adjacency matrix whose entries are defined as follows:
[TABLE]
where is the set of neighbors of agent (we denote its size by ). We also write to denote the same neighborhood excluding agent .
Figure 1 shows the network structure where agents with the same color observe the same model. We denote the unknown models by , each of size where . Each agent observes data generated by one of these unknown models. We denote the model observed by agent by . Figure 1 shows that agent collects data from model , in which case . For any other agent observing the same model , it will hold that . We stack the into a column vector:
[TABLE]
At every time instant , every agent has access to a scalar measurement and a regression vector . The measurements across all agents are assumed to be generated via linear regression models of the form:
[TABLE]
All random processes are assumed to be stationary. Moreover, is a zero-mean white measurement noise that is independent over space and has variance . The regression data is assumed to be a zero-mean random process, independent over time and space, and independent of for all . We denote the covariance matrix of by .
Agents do not know which model is generating their data. They also do not know which models are generating the data of their neighbors. Still, we would like to perform a learning strategy that allows agents to converge towards one of the models, while also learning which of their neighbors share the same model. Using the algorithm proposed in [20], each agent repeats the following steps involving an adaptation step followed by an aggregation step:
[TABLE]
where is the step-size used by agent . These updates generate two iterates by agent at time , and which are denoted by and , respectively. We collect the estimated vectors across all agents into the aggregate vector:
[TABLE]
In a manner similar to [20], we introduce a clustering matrix . Its structure is similar to the adjacency matrix , with ones and zeros, except that the value at location will be set to one if agent believes at time instant that its neighbor belongs to the same cluster, i.e., observes the same model:
[TABLE]
These entries help define the neighborhood set , which consists of all neighbors at time instant that agent believes share the same model. To learn the matrix over time, we apply the clustering technique proposed in [20] to create the estimated clustering matrix of size as follows: we initialize and . Where the matrix is of size . Each entry is designed using the following steps from [20], where :
[TABLE]
[TABLE]
[TABLE]
where , , and the notation denotes rounding to the nearest integer. Using the evolving neighborhoods , the entries in (5) are non-negative scalars that satisfy
[TABLE]
Although there is a multitude of models generating the data that is feeding into the agents, namely, , the objective is to develop a strategy that will allow all agents to converge towards one of these models. We refer to this particular choice as the desired model and denote it by .
In this way, an agent whose source (observed) model agrees with the desired model, i.e., , will end up tracking its own source. On the other hand, an agent whose source model is not the desired model, i.e., , will track instead although it is sensing data generated by a different model.
We define the estimate vector of agent’s desired model by . The reason behind indicating as the estimate vector of agent’s desired model instead of the network’s desired model is that the agents may have different desired models before convergence (steady-state). Once the agents reach agreement among themselves on a single model, we can then refer to as the estimate vector by agent of the network’s desired model. For the initialization at time instant , each agent assigns (i.e., at time instant , the desired model of each agent is a rough estimate of its own source model). The decision-making process drives the desired models of all agents to converge. For example, if the agents observe different models, the number of the desired models in the network will decrease with iterations gradually form models down to one model. This is achieved by switching the estimate of some agents during the decision-making process according to some conditions that are explained later. However, agents do not know which models are desired by their neighbors at each time instant . Thus, we need to develop a learning strategy that allows the agents to distinguish the individual desired models of their neighbors.
It turns out that in order for the objective of the network to be met, it is important for agents to combine the estimates of their neighbors in a judicious manner because, unbeknown to the agents, some of their neighbors may be wishing to estimate different models. If cooperation is performed blindly with all neighbors, then performance can deteriorate with agents converging to non-existing locations. For this reason, and motivated by the discussion from [18], we add the step (12) below after (4) and (5), which involves two sets of combination coefficients from two matrices and . There are two main ideas behind the construction (12). First, it is meant to let each agent cooperate only with the subset of neighbors that share the same desired model as it does. Second, it also lets each agent combine if the desired model of agent at time instant is the same as ’s observed model:
[TABLE]
Note that the matrices and are not constructed from matrix . The selection of the non-negative coefficients and is explained in Section IV.
We summarize the main five steps of the approach:
Learning the observed models of the neighbors. This step is performed by building the matrix in step (7). The information provided by each entry is whether the corresponding agents and have the same observed model or not. 2. 2.
Learning and labeling the desired model of the neighbors at each time instant . This step allows the agents to distinguish the individual desired models of their neighbors at time instant . The information provided by this step is the number of different models that are desired by neighbors and how many times each model is repeated at time among neighbors. 3. 3.
Decision-making step by switching the desired model of some agents to let the network converge to only one model. 4. 4.
Learning the desired models of the neighbors after the switching step. This step is performed by building the matrix in step (19) in Section IV. The information provided by each entry is whether the corresponding agents and have the same desired model or not after the switching step. 5. 5.
Updating the estimate vectors by sharing data thoughtfully with the subset of the neighbors that share the same desired model.
3 Local Labeling
Each agent needs to learn the desired models of its neighbors to proceed with the decision-making process and let the network converge to only one model. In this step, instead of only estimating whether two agents have the same desired model or not, the construction involves a local labeling procedure that enables every agent to estimate in real-time how many different models are desired by its neighborhood.
For this purpose, we associate with each agent an matrix with entries given by:
[TABLE]
for some small threshold . Whenever , agent believes at time instant that its neighbors and wish to estimate the same desired model. On account of that, the variables and which are used in the test (13) are presenting the current desired model of agents and , respectively. It is clear from (13) that the matrix is symmetric and has ones on the diagonal. An example is depicted in Fig. 2 where agents having the same inner color observe the same model, while the outer color indicates the model in which the agent is interested (or towards which the agent is moving in mobile networks). The corresponding matrix has the following entries:
[TABLE]
From (14) agents that share the same desired model will have identical columns in matrix , namely, if agents and have the same desired model at time instant , this implies that: . We denote the local label of each agent by agent as . The local label is updated at each time instant using the following relation:
[TABLE]
where is a function that converts the input sequence from binary to decimal. For the example in (14), we have
[TABLE]
We define the number of desired models within at time instant by . After updating matrix and generating the local labels , agent counts how many models are desired by its neighborhood to update . In the example (14), agent distinguishes at time instant three desired models , i.e., . Agent labels these three different models locally as: .
In addition, agent determines which model among these models has the maximum number of followers. A follower of a model is an agent that wishes to estimate and track this model. We define the largest set of agents belonging to and following the same desired model at time instant by . In the example, agent assigns the majority set at time instant as follows: which has the label and is repeated three times among other labels.
4 Decision-Making Over Multi-Task Networks
Using the information provided by matrix , agent can capture how many agents within its neighbors follow the same desired model at time instant . Once agent and all its neighbors agree on a single desired model, the matrix will end up being of the following form with unit entries everywhere:
[TABLE]
We define the degree of agreement by each agent among its neighbors as
[TABLE]
Equally, having means that agent and all of its neighbors have agreed on a common desired model. On the other hand, if , then the following switching step is applied:
[TABLE]
The main idea of the switching step is for each agent to make a new decision or to keep the previous one. The first case of (18) implies that agent does not belong to the majority desired model set at time instant . Therefore, agent changes its decision and switches into the desired model of the majority set . The second case in (18) is applied to prevent an unwanted equilibrium situation. This problem may arise when only two desired models remain in . In this case, if all agents in belong to the majority set, this leads to a situation in which no agent in will change its decision anymore. An example is shown in Fig. 3 where the outer color of the agents indicate the desired model. We indicate only the desired model of agent’s neighbors and their neighbors. Figure 3 shows that all agents within belong to a majority set and no agent in will change its decision anymore, e.g. agents and belong to the majority set among their neighbors, as well as agents , , , and . Namely,
[TABLE]
To break the equilibrium, an agent that recognizes these two models picks randomly one of the two desired models.
From (18), we can conjecture that the network will probably converge to the most observable model, since the initial desired model by each agent is its own observed model. This fact remains true even with the random switching in the second case of (18), because in that case the more repeated desired model within has the highest probability to be picked.
To proceed with the cooperation and sharing information among the agents within the subset that has the same desired model at time instant , we define an matrix . The coefficients are updated after the switching step (18) using a test that is quite similar to (13) and is applied between each agent and its neighbors as follows:
[TABLE]
According to matrix , each agent knows which subset of its neighbors has the same desired model as it does after the switching step at time instant . Having means that and have the same desired model at time instant . We define an combination matrix as follows:
[TABLE]
where is some function which satisfies
[TABLE]
An example of is the uniform function which generates a left-stochastic matrix where each entry is given by
[TABLE]
Matrix by itself does not have enough information for proceeding and updating the estimate . The agents still need knowledge about which data to be combined from each neighbor. Therefore, matrix is split into two matrices and . The weight of the entry goes to if the desired model of agent at time instant is the same as ’s observed model. Otherwise, obtains the weight . The coefficients and for are updated using the following steps:
[TABLE]
In (23), the case that is close to implies that the observed model of agent is the same as the desired model of agent at time instant . The estimate is updated using (12). Algorithm 1 summarizes the decision-making scheme.
5 Following the Observed Model of a Specific Agent
In this section the goal is to let the whole network follow the observed model of some specific agent , as shown in Fig. 4 where agent observes model (red), therefore, the network converges in a distributed manner to estimate the model . The first step is to spread the among agents and keep updating it over time. This step aims at having a copy (reference) of by all agents in the network. Agents keep updating the copy of for two reasons. First, to have a more accurate version of the vector , which indicates the desired model of the network. Second, to endow the algorithm to work in non-stationary situations, if drift is happening in agent ’s model.
We denote the copy vector of by agent by and refer to it as the anchor vector. Agents are informed beforehand about the index of the specific agent that they should follow. If , this implies that agent receives the anchor vector directly from agent . If not, i.e., , then agent depends on another agent that has already a copy of . Agent stores the index of this source agent. The index of the source agent of agent is denoted by . Note that the anchor vector is not the final estimate of the desired model.
The circulation process of in a distributed manner needs cooperation among agents. In case that agent has no direct link to receive data from agent , i.e., , agent gets one of the provided that . If this implies that agent has already a source to update its , regardless whether or not. In other words, means that agent finds a direct or indirect link to agent . Therefore, it is important for each agent to store the agent’s index of its source. An example is shown in Fig. 5 where but . First, the anchor vectors and the source agents for agents and at time instant (Fig. 5(a)) are given, respectively, by
[TABLE]
The anchor vectors and the source agents for agents and at time instants (Fig. 5(b) and (c)) are given, respectively, by
[TABLE]
Agents update their anchor vectors at each time instant by the following step:
[TABLE]
where for agent itself. The source of the anchor vector is updated simultaneously as follows:
[TABLE]
Similarly to the previous section, the next step is to update the coefficients using the following test:
[TABLE]
Again, having leads to the situation that agent has the anchor vector and has been informed about the decision of the network, therefore, agent can start sharing information with the other agents whose as well to estimate the desired model. The matrix will be generated using (20). Agents update the coefficients of both matrices and using the following steps:
[TABLE]
Then, the estimate is updated using Eq. (12). According to (34) and (12), agent combines if the desired model of the network (which is represented by the anchor vector of agent ) is close to the observed model of agent that is represented by . Algorithm 2 summarizes the steps of the approach for following the observed model of a specific agent .
6 Simulation Results and Discussion
6.1 Static Network
We consider a connected network with 80 randomly distributed agents. The agents observe data originating from different models. Each model is generated as follows: where , . The assignment of the agents to models is random. The maximum number of neighbors is . We set . We use the uniform combination policy to generate the coefficients and .
Figure 6 shows the statistical profile of the regressors and noise across the agents. The regressors are of size zero-mean Gaussian, independent in time and space, and have diagonal covariance matrices . Figure 7 shows the topology of one of 100 Monte Carlo experiments. Agents having the same inner color observe the same model, while the outer color indicates the desired model at steady-state.
The transient network mean-square deviation (MSD) regarding each observed model at each time instant is defined by
[TABLE]
where and each is computed for agents belonging to . The transient network mean-square deviation (MSD) for the whole network regarding the desired model at each time instant is defined by
[TABLE]
where is the desired model when the whole network agrees on one common desired model, i.e., is only computed at the instants when all . Figure 8 depicts the simulated transient mean-square deviation (MSD) of the network for all observed models and for the network desired model. Table 1 displays the success rate of the decision-making to agree on one model for different numbers of observed models, . The proposed strategy appears to provide good success rate under the simulated conditions.
Regarding the application of following the observed model of a specific agent , Fig. 9 shows the topology of one case from 100 different experiments. Agents are observing different models. Agent , which is represented by a square, is the specific agent whose observed model the whole network wishes to follow. Figure 10 shows the transient mean-square deviation MSD of 100 different experiments when a change in the model assignment occurs suddenly at time instant . The success rate of the decision-making to agree on the observed model of agent was observed to be 100% in this simulation.
6.2 Mobile Network
We consider a network with 80 randomly distributed mobile agents [2]. The agents observe data originating from four different models (sources) , where . The objective of the network is to have all agents track and move towards only one model (source). Figure 11 shows the statistical profile of the regressors and noise across the agents. Every agent updates its location according to the motion mechanism described in [19].
Figure 12 shows the maneuver of the agents over time where the models (sources) are represented by squares. Figure 13 represents the transient network mean-square deviation (MSD) obtained by averaging over 100 independent Monte Carlo experiments.
7 Conclusion
We have proposed a distributed algorithm that allows agents over multi-task networks to follow only one common model while proceeding with the estimation process. Agents use a local labeling step to distinguish the multiple desired models of their neighbors. Simulation results illustrate the operation of the algorithms and its performance.
Acknowledgements
The research for this paper was financially supported by Technische Universität Darmstadt, Signal Processing Group and NSF grant CCF-1524250.
References
- [1]
F. S. Cattivelli, A. H. Sayed, Modeling bird flight formations using diffusion adaptation, IEEE Trans. Signal Process. 59 (5) (2011) 2038–2051.
- [2]
S. Y. Tu, A. H. Sayed, Mobile adaptive networks, IEEE J. Sel. Topics Signal Process. 5 (4) (2011) 649–664.
- [3]
A. Avitabile, R. A. Morse, R. Boch, Swarming honey bees guided by pheromones, Ann. Entonomol. Soc. Am. 68 (1975) 1079–1082.
- [4]
H. Berg, Motile behavior of bacteria, Physics Today 53 (1) (2000) 24–29.
- [5]
F. Dressler, O. B. Akan, Bio-inspired networking: from theory to practice, IEEE Communications Magazine (2010) 176–183.
- [6]
S. Camazine, J. L. Deneubourg, N. R. Franks, G. T. J. Sneyd, E. Bonabeau, Self-Organization in Biological Systems, Princeton, NJ: Princeton Univ. Press, 2003.
- [7]
J. C. X. Z. A. H. Sayed, S.Y. Tu, Z. Towfic, Diffusion strategies for adaptation and learning over networks: an examination of distributed strategies and network behavior, IEEE Signal Processing Magazine 30 (3) (2013) 155–171.
- [8]
A. H. Sayed, Adaptation, learning, and optimization over networks, Found. Trends in Mach. Learn. 7 (4–5) (2014) 311–801.
- [9]
X. Zhao, A. H. Sayed, Clustering via diffusion adaptation over networks, in: Proc. International Workshop on Cognitive Inform. Processing (CIP), Baiona, Spain, 2012, pp. 1–6.
- [10]
J. Chen, C. Richard, A. H. Sayed, Diffusion LMS over multitask networks, IEEE Trans. Signal Processing 63 (11) (2015) 2733–2748.
- [11]
X. Zhao, A. H. Sayed, Distributed clustering and learning over networks, IEEE Trans. Signal Processing 63 (13) (2015) 3285–3300.
- [12]
S. Khawatmi, A. M. Zoubir, Decentralized partitioning over adaptive networks, in: Proc. IEEE International Workshop on Machine Learning for Signal Processing (MLSP), Vietri sul Mare, Salerno, Italy, 2016, pp. 1–6.
doi:10.1109/MLSP.2016.7738880.
- [13]
M. Z. Lin, M. N. Murthi, K. Premaratne, Mobile adaptive networks for pursuing multiple targets, in: Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), South Brisbane, QLD, 2015, pp. 3217–3221.
- [14]
M. Hajiabadi, H. Khoshbin, G. A. Hodtani, Transient performance analysis of adaptive multitask network based on correntropy criterion, International Journal of Adaptive Control and Signal Processing 32 (2) (2018) 229–247.
- [15]
C. Zhang, P. Zhao, S. Hao, Y. C. Soh, B. S. Lee, C. Miao, S. C. H. Hoi, Distributed multi-task classification: a decentralized online learning approach, Machine Learning 107 (4) (2018) 727–747.
- [16]
D. Acemoglu, A. Ozdaglar, Opinion dynamics and learning in social networks, Dynamic Games and Applications 1 (1) (2011) 3–49.
- [17]
S. P. Talebi, S. Werner, V. Koivunen, Kalman filtering and clustering in sensor networks, in: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2018, pp. 4309–4313.
- [18]
S. Y. Tu, A. H. Sayed, Distributed decision-making over adaptive networks, IEEE Trans. Signal Processing 62 (5) (2014) 1054–1069.
- [19]
S. Khawatmi, X. Huang, A. M. Zoubir, Distributed decision-making over mobile adaptive networks, in: Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), New Orleans, USA, 2017, pp. 3864–3868.
- [20]
S. Khawatmi, A. H. Sayed, A. M. Zoubir, Decentralized clustering and linking by networked agents, IEEE Trans. Signal Processing 65 (13) (2017) 3526–3537.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. S. Cattivelli, A. H. Sayed, Modeling bird flight formations using diffusion adaptation, IEEE Trans. Signal Process. 59 (5) (2011) 2038–2051.
- 2[2] S. Y. Tu, A. H. Sayed, Mobile adaptive networks, IEEE J. Sel. Topics Signal Process. 5 (4) (2011) 649–664.
- 3[3] A. Avitabile, R. A. Morse, R. Boch, Swarming honey bees guided by pheromones, Ann. Entonomol. Soc. Am. 68 (1975) 1079–1082.
- 4[4] H. Berg, Motile behavior of bacteria, Physics Today 53 (1) (2000) 24–29.
- 5[5] F. Dressler, O. B. Akan, Bio-inspired networking: from theory to practice, IEEE Communications Magazine (2010) 176–183.
- 6[6] S. Camazine, J. L. Deneubourg, N. R. Franks, G. T. J. Sneyd, E. Bonabeau, Self-Organization in Biological Systems, Princeton, NJ: Princeton Univ. Press, 2003.
- 7[7] J. C. X. Z. A. H. Sayed, S.Y. Tu, Z. Towfic, Diffusion strategies for adaptation and learning over networks: an examination of distributed strategies and network behavior, IEEE Signal Processing Magazine 30 (3) (2013) 155–171.
- 8[8] A. H. Sayed, Adaptation, learning, and optimization over networks, Found. Trends in Mach. Learn. 7 (4–5) (2014) 311–801.
