Study of Robust Diffusion Recursive Least Squares Algorithms with Side Information for Networked Agents
Y. Yu, R. C. de Lamare, Y. Zakharov

TL;DR
This paper introduces a robust diffusion recursive least squares algorithm for networked agents that effectively handles impulsive noise by incorporating side information and adaptive constraints, improving tracking and estimation accuracy.
Contribution
It proposes a novel RLS algorithm with a time-dependent constraint and side information, enhancing robustness and tracking in impulsive noise environments.
Findings
Outperforms existing methods in impulsive noise scenarios
Effective constraint resetting improves tracking during parameter changes
Demonstrates superior estimation accuracy through simulations
Abstract
This work develops a robust diffusion recursive least squares algorithm to mitigate the performance degradation often experienced in networks of agents in the presence of impulsive noise. This algorithm minimizes an exponentially weighted least-squares cost function subject to a time-dependent constraint on the squared norm of the intermediate estimate update at each node. With the help of side information, the constraint is recursively updated in a diffusion strategy. Moreover, a control strategy for resetting the constraint is also proposed to retain good tracking capability when the estimated parameters suddenly change. Simulations show the superiority of the proposed algorithm over previously reported techniques in various impulsive noise scenarios.
| Parameters: , , and (R-dRLS); and (DNC) |
| Initialization: , and (R-dRLS); |
| , , and (DNC) |
| R-dRLS algorithm: |
| DNC method: |
| Step 1: to compute |
| if |
| end |
| Step 2: to reset |
| if |
| , |
| elseif |
| else |
| end |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Study of Robust Diffusion Recursive Least Squares Algorithms with Side Information for Networked Agents
Abstract
This work develops a robust diffusion recursive least squares algorithm to mitigate the performance degradation often experienced in networks of agents in the presence of impulsive noise. This algorithm minimizes an exponentially weighted least-squares cost function subject to a time-dependent constraint on the squared norm of the intermediate estimate update at each node. With the help of side information, the constraint is recursively updated in a diffusion strategy. Moreover, a control strategy for resetting the constraint is also proposed to retain good tracking capability when the estimated parameters suddenly change. Simulations show the superiority of the proposed algorithm over previously reported techniques in various impulsive noise scenarios.
Index Terms— Diffusion cooperation, distributed algorithms, impulsive noises, robust recursive least squares.
1 Introduction
In the last decade, distributed adaptive algorithms for estimating parameters of interest over wireless sensor networks with multiple nodes (or agents) have attracted significant attention, due to their performance advantages and robustness [1]. The core idea is that each node performs adaptive estimation, in cooperation with its neighboring nodes. Distributed adaptive algorithms have been applied to many problems, e.g., frequency estimation in power grid [2] and spectrum estimation [3]. According to the cooperation strategy of interconnected nodes, existing algorithms can be categorized as the incremental [4], consensus [5], and diffusion [6, 7, 8] types. The diffusion type is the most popular [5], because it does not require a Hamiltonian cycle path as does the incremental type [4]; it is stable and has a better estimation performance than the consensus type [5]. Several diffusion-based distributed algorithms have been proposed such as the diffusion least mean square (dLMS) algorithm [6], diffusion conjugate gradient (dCG) [9], diffusion recursive least squares (dRLS) algorithm [7], and their modifications [10, 11, 12, 13, 14].
In practice, measurements at the network nodes can be corrupted by impulsive noise [15]. An impulsive noise process has the property that its occurence probability is small and the magnitude is typically much larger than the nominal measurement. It is well-known that the impulsive noise deteriorates significantly the performance of algorithms in the single-agent case. For distributed algorithms in the multi-agent case, impulsive noise can propagate over the entire network due to the exchange of information among nodes. To reduce the impulsive noise interference, many robust distributed algorithms have been proposed [16, 17, 18, 19, 20, 21, 22, 23, 24]. Some algorithms, e.g., the diffusion sign error LMS (dSE-LMS) [16], are based on using the instantaneous gradient-descent method to minimize an individual robust criterion. In [17], a robust variable weighting coefficients dLMS (RVWC-dLMS) algorithm was developed, which only considers the data and intermediate estimates from nodes not affected by impulsive noise; this is based on a judgement whether impulsive noise samples occur or not. However, these robust algorithms have slow convergence, especially for colored input signals.
RLS-based algorithms have a good decorrelating property for colored signals, thereby providing fast convergence. In this paper, therefore, we present a robust dRLS (R-dRLS) algorithm for distributed estimation over networks affected by impulsive noise. The R-dRLS algorithm minimizes a local exponentially weighted least-squares (LS) cost function subject to a time-dependent constraint on the squared norm of the intermediate estimate at each node. Unlike the framework in [25], we consider here a multi-agent scenario with the diffusion strategy. Furthermore, in order to equip the R-dRLS algorithm with the ability to withstand sudden changes in the environment, we also propose a diffusion-based distributed nonstationary control (DNC) method. This paper is organized as follows. In Section 2, the estimation problem in the network is described. In Section 3, the proposed algorithm is derived. In Section 4, results of simulation in impulsive noise scenarios are presented. Finally, conclusions are given in Section 5.
2 Problem Formulation
Let us consider a network that has nodes distributed over some region in space, where a link between two nodes means that they can communicate directly with each other. The neighborhood of node k is denoted by , i.e., a set of all nodes connected to node including itself. The cardinality of is denoted by . At every time instant , every node k observes a data regressor vector of size and a scalar measurement , related as:
[TABLE]
where the superscript denotes the transpose, is a parameter vector of size , and is the additive noise at node k. The regressors and are spatially independent for and all . The additive noises and are spatially and temporally independent for and . Moreover, any is independent of any . The model (1) is widely used in many applications [1, 26].
The task is to estimate , using the available data collected at nodes, i.e., . For this purpose, the global LS-based estimation problem is described as [7]:
[TABLE]
where denotes the -norm of a vector, is a regularization constant, and is the forgetting factor. The dRLS algorithm solves (2) in a distributed manner [7]. In practice, may contain impulsive noise, severely corrupting the measurement . With such noise processes, the algorithms obtained from (2), e.g., the dRLS algorithm, would fail to work.
3 Proposed Distributed Algorithm
3.1 Derivation of the R-dRLS Algorithm
We focus here on the adapt-then-combine (ATC) implementation of the diffusion strategy, which has been shown to outperform the combine-then-adapt (CTA) implementation111 In fact, the CTA version is obtained by reversing the adaptation step and combination step in the ATC version. [5]. Following the ATC-based diffusion strategy [6, 7], i.e., performing first the adaptation step and then the combination step, the R-dRLS algorithm will be derived in the sequel. We start with the adaptation step. Every node , at time instant , finds an intermediate estimate of by minimizing the individual local cost function:
[TABLE]
with , where
[TABLE]
is the time-averaged correlation matrix for the regression vector at node and is an estimate of at node at time instant . Notice that the form in (3) defines the Riemmanian distance [27] between vectors and . Setting the derivative of with respect to to zero, we obtain
[TABLE]
where stands for the output error at node and . Using the matrix inversion lemma [26], we have
[TABLE]
where is initialized as and is an identity matrix. Since , where , (5) means that every node performs an RLS update. However, with the update (5), the adverse effect of an impulsive noise sample at time instant will propagate through nodes via . This effect can last for many iterations. To make the algorithm robust in impulsive noise scenarios, we propose to minimize (3) under the following constraint:
[TABLE]
where is a positive bound. This constraint is employed to enforce the squared norm of the update of the intermediate estimate not to exceed the amount regardless of the type of noise (possibly, impulsive noise), thereby guaranteeing the robustness of the algorithm. If (5) satisfies (7), i.e.,
[TABLE]
where represents the Kalman gain vector, then (5) is a solution of the above constrained minimization problem. On the other hand, if (8) is not satisfied (usually in the case of appearance of impulsive noise samples), i.e., , we propose to replace the update (5) by a normalized form to satisfy the constraint (7), which is described by
[TABLE]
where is the sign function. Consequently, combining (5), (8) and (9), we obtain the adaptation step for each node as:
[TABLE]
Then, at the combination step, the intermediate estimates from the neigborhood of node are linearly weighed, yielding a more reliable estimate [1]:
[TABLE]
where the combination coefficients are non-negative, and satisfy:
[TABLE]
Note that, is a weight that node assigns to the intermediate estimate received from its neighbor node . In general, are determined by a static rule (e.g., the Metropolis rule [28] that we adopt in this paper) which keeps them constant in the estimation, or an adaptive rule [28]. It is evident that the bound controls the robustness of the algorithm against impulsive noise and influences its dynamic behavior, so choosing its value properly is of fundamental importance. To this end, motivated by the single-agent case in [25], is adjusted recursively based on the diffusion strategy as:
[TABLE]
where is a forgetting factor, . In (13), at every node , can be initialized as , where is a positive integer, and and are powers of signals and , respectively. The proposed algorithm is shown in Table 1.
Remark: As can be seen from (10), the operation mode of the proposed algorithm is twofold. At time instant , if , the RLS update is performed; if not, the RLS update is normalized to have a norm of value . At the early iterations, the values of can be high compared to so that the algorithm will behave as the dRLS algorithm, providing a fast convergence. Whenever an impulsive noise sample appears, due to its significant magnitude, the algorithm will work as an dRLS update multiplied by a very small ’step size’ scaling factor given by , thus suppressing the negative influence of impulsive noise on the estimation [29, 30, 31, 32, 33, 34, 35, 36, 37] and reducing the error propagation effect. The algorithm robustness to impulsive noise is further maintained, due to decreasing over the iterations. This algorithm can be considered as an improved dRLS algorithm with an additional ’step size’ scaling factor which is time-varying and between 1 and , as can be observed in (10).
3.2 DNC Method
Although the decreasing values of the sequence with the iteration prompt the R-dRLS algorithm more robust against impulsive noises, the algorithm also loses its tracking capability for a sudden change of the unknown vector . To improve the tracking capability, referring to the single-agent scenario [38], we also develop a diffusion-based DNC method, summarized in Table 1. The DNC method includes two implementation procedures.
Firstly, a variable at node is computed once for every iterations, to judge whether the unknown vector has a change or not. In this step, with denoting the ascending arrangement for its arguments, and is a vector whose first elements set to one, where is a positive integer with . Thus, the product can remove the effect of outliers (e.g., impulsive noise samples) on the rightness of the computation of . Typically, for both and , good choices are with and [38]. Note that, for large occurence probability of impulsive noise, the value of should be decreased to discard the impulsive noise samples.
Secondly, if , where is a predefined threshold, meaning a change of has occured, then we need to reset to its initial value . More importantly, is also re-initialized with . It is worth noting that since the parameters , and are not affected by each other, their choices are simple.
4 Simulation Results
Simulation examples are presented for a diffusion network with nodes. The graph describing the network is assumed to be partially connected. Adjustments to the graph can be carried out using approaches reported in [39, 40, 41, 42, 43]. The vector to be estimated has a length of and a unit norm; it is generated randomly from a zero-mean uniform distribution. To evaluate the tracking capability, changes to in the middle of iterations. The input regressor has a shifted structure, i.e., [44, 4], where is colored and generated by a second-order autoregressive system:
[TABLE]
where is a zero-mean white Gaussian process with variance shown in Fig. 1(a) for all the nodes. We employ the network mean square deviation (MSD) to assess the performance of the algorithm, i.e., , where denotes the expectation. Usually, the impulsive noise can be described by either the Bernoulli-Gaussian (BG) distribution [16, 17, 18] or the -Stable distribution [45, 27]. We consider both cases. All results are the average over 200 independent trials.
4.1 BG Distribution
The additive noise includes the background noise plus the impulsive noise , where is zero-mean white Gaussian noise with variance depicted in Fig. 1(b). The impulsive noise is described by the BG distribution, , where is a Bernoulli process with probability distribution and , and is a zero-mean white Gaussian process with variance . Here, we set as a random number in the range of , and , where denotes the power of . Fig. 2 compares the performance of the dRLS, dSE-LMS, and RVWC-dLMS algorithms with that of the proposed R-dRLS algorithm. Note that, the R-dRLS (no cooperation) algorithm performs an independent estimation at each node as presented in [25]. For RLS-type algorithms, we choose =0.995 and =0.01. As expected, the dRLS algorithm has a poor performance in the presence of impulsive noise. Both the dSE-LMS and RVWC-dLMS algorithms are significantly less sensitive to impulsive noise, but their convergence is slow. Apart from the robustness for combating impulsive noise, the proposed R-dRLS algorithm has also a fast convergence. Moreover, the proposed DNC method can retain the good tracking capability of the R-dRLS algorithm, only with a slight degradation in steady-state performance.
4.2 -Stable Distribution
The impulsive noise is now modeled by the -stable distribution with a characteristic function , where the characteristic exponent describes the impulsiveness of the noise (smaller leads to more impulsive noise samples) and represents the dispersion level of the noise. In particular, when , it reduces to the Gaussian noise. It is rare to find -stable noise with in practice [27, 46]. In this example, thus we set and . The learning performance of the algorithms is shown in Fig. 3. Fig. 4 shows the node-wise steady-state MSD of the robust algorithms (i.e., excluding the dRLS) against impulsive noise, by averaging over 500 instantaneous MSD values in the steady-state. As can be seen from Figs. 3 and 4, the proposed R-dRLS algorithm with DNC outperforms the known robust algorithms.
5 Conclusion
In this paper, the R-dRLS algorithm has been proposed, based on the minimization of an individual RLS cost function with a time-dependent constraint on the squared norm of the intermediate estimate update. The constraint is dynamically adjusted based on the diffusion strategy with the help of side information. The novel algorithm not only is robust against impulsive noise, but also has fast convergence. Furthermore, to track the change of parameters of interest, a detection method (DNC method) is proposed for re-initializing the constraint. Simulation results have verified that the proposed algorithm performs better than known algorithms in impulsive noise scenarios.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A.H. Sayed, “Adaptation, learning, and optimization over networks,” Foundations and Trends in Machine Learning , vol. 7, no. 4-5, pp. 311–801, 2014.
- 2[2] S. Kanna, D.H. Dini, Y. Xia, S.Y. Hui, and D.P. Mandic, “Distributed widely linear kalman filtering for frequency estimation in power networks,” IEEE Transactions on Signal and Information Processing over Networks , vol. 1, no. 1, pp. 45–57, 2015.
- 3[3] T.G. Miller, S. Xu, R.C. de Lamare, and H.V. Poor, “Distributed spectrum estimation based on alternating mixed discrete-continuous adaptation,” IEEE Signal Processing Letters , vol. 23, no. 4, pp. 551–555, 2016.
- 4[4] L. Li, J.A. Chambers, C.G. Lopes, and A.H. Sayed, “Distributed estimation over an adaptive incremental network based on the affine projection algorithm,” IEEE Transactions on Signal Processing , vol. 58, no. 1, pp. 151–164, 2010.
- 5[5] S.Y. Tu and A.H. Sayed, “Diffusion strategies outperform consensus strategies for distributed estimation over adaptive networks,” IEEE Transactions on Signal Processing , vol. 60, no. 12, pp. 6217–6234, 2012.
- 6[6] C.G. Lopes and A.H. Sayed, “Diffusion least-mean squares over adaptive networks: Formulation and performance analysis,” IEEE Transactions on Signal Processing , vol. 56, no. 7, pp. 3122–3136, 2008.
- 7[7] F.S. Cattivelli, C.G. Lopes, and A.H. Sayed, “Diffusion recursive least-squares for distributed estimation over adaptive networks,” IEEE Transactions on Signal Processing , vol. 56, no. 5, pp. 1865–1877, 2008.
- 8[8] J. Chen and A.H. Sayed, “Diffusion adaptation strategies for distributed optimization and learning over networks,” IEEE Transactions on Signal Processing , vol. 60, no. 8, pp. 4289–4305, 2012.
