Study of Robust Diffusion Recursive Least Squares Algorithms with Side   Information for Networked Agents

Y. Yu; R. C. de Lamare; Y. Zakharov

arXiv:1812.09985·cs.IT·February 20, 2019

Study of Robust Diffusion Recursive Least Squares Algorithms with Side Information for Networked Agents

Y. Yu, R. C. de Lamare, Y. Zakharov

PDF

TL;DR

This paper introduces a robust diffusion recursive least squares algorithm for networked agents that effectively handles impulsive noise by incorporating side information and adaptive constraints, improving tracking and estimation accuracy.

Contribution

It proposes a novel RLS algorithm with a time-dependent constraint and side information, enhancing robustness and tracking in impulsive noise environments.

Findings

01

Outperforms existing methods in impulsive noise scenarios

02

Effective constraint resetting improves tracking during parameter changes

03

Demonstrates superior estimation accuracy through simulations

Abstract

This work develops a robust diffusion recursive least squares algorithm to mitigate the performance degradation often experienced in networks of agents in the presence of impulsive noise. This algorithm minimizes an exponentially weighted least-squares cost function subject to a time-dependent constraint on the squared norm of the intermediate estimate update at each node. With the help of side information, the constraint is recursively updated in a diffusion strategy. Moreover, a control strategy for resetting the constraint is also proposed to retain good tracking capability when the estimated parameters suddenly change. Simulations show the superiority of the proposed algorithm over previously reported techniques in various impulsive noise scenarios.

Tables1

Table 1. Table 1 : Proposed R- d d \rm d RLS Algorithm with the DNC Method.

Parameters:

0 < β ≲ 1

,

λ

,

δ

and

E_{c}

(R-dRLS);

ϱ

and

t_{th}

(DNC)

Initialization:

𝒘_{k, 0} = 𝟎

,

𝑷_{k, 0} = δ^{- 1} ​ 𝑰

and

ξ_{k} ​ (0) = E_{c} ​ \frac{σ_{d, k}^{2}}{M ​ σ_{u, k}^{2}}

(R-dRLS);

Θ_{old, k} = Θ_{new, k} = 0

,

V_{t} = ϱ ​ M

, and

V_{d} = 0.75 ​ V_{t}

(DNC)

R-dRLS algorithm:

e_{k} ​ (i) = d_{k} ​ (i) - 𝒖_{k, i}^{T} ​ 𝒘_{k, i - 1}

𝑷_{k, i} = \frac{1}{λ} ​ (𝑷_{k, i - 1} - \frac{𝑷_{k, i - 1} ​ 𝒖_{k, i} ​ 𝒖_{k, i}^{T} ​ 𝑷_{k, i - 1}}{λ + 𝒖_{k, i}^{T} ​ 𝑷_{k, i - 1} ​ 𝒖_{k, i}})

𝒈_{k, i} = 𝑷_{k, i} ​ 𝒖_{k, i}

𝝍_{k, i} = 𝒘_{k, i - 1} + \min [\frac{\sqrt{ξ_{k} ​ (i - 1)}}{{∥ 𝒈_{k, i} ∥}_{2} ​ | e_{k} ​ (i) |}, 1] ​ 𝒈_{k, i} ​ e_{k} ​ (i)

𝒘_{k, i} = \sum_{m \in 𝒩_{k}} c_{m, k} ​ 𝝍_{m, i}

DNC method:

Step 1: to compute

Δ_{k} ​ (i)

if

i = n ​ V_{t}, n = 0, 1, 2, \dots

𝒂_{k, i}^{T} = 𝒪 ​ ([\frac{e_{k}^{2} ​ (i)}{{‖ 𝒖_{k, i} ‖}_{2}^{2}}, \frac{e_{k}^{2} ​ (i - 1)}{{‖ 𝒖_{k, i - 1} ‖}_{2}^{2}}, …, \frac{e_{k}^{2} ​ (i - V_{t} + 1)}{{‖ 𝒖_{k, i - V_{t} + 1} ‖}_{2}^{2}}])

Θ_{new, k} = \sum_{m \in 𝒩_{k}} c_{m, k} ​ \frac{𝒂_{m, i}^{T} ​ 𝒆}{V_{t} - V_{d}}

Δ_{k} ​ (i) = \frac{Θ_{new, k} - Θ_{old, k}}{ξ_{k} ​ (i - 1)}

end

Step 2: to reset

ξ_{k} ​ (i)

if

Δ_{k} ​ (i) > t_{th}

ζ_{k} ​ (i) = ξ_{k} ​ (0)

,

𝑷_{k, i} = 𝑷_{k, 0}

elseif

Θ_{new, k} > Θ_{old, k}

ζ_{k} ​ (i) = ξ_{k} ​ (i - 1) + (Θ_{new, k} - Θ_{old, k})

else

ζ_{k} ​ (i) = β ​ ξ_{k} ​ (i - 1) + (1 - β) ​ \min [{∥ 𝒈_{k, i} ∥}_{2}^{2} ​ e_{k}^{2} ​ (i), ξ_{k} ​ (i - 1)]

end

ξ_{k} ​ (i) = \sum_{m \in 𝒩_{k}} c_{m, k} ​ ζ_{m} ​ (i)

Θ_{old, k} = Θ_{new, k}

Equations28

d_{k} (i) = u_{k, i}^{T} w^{o} + v_{k} (i),

d_{k} (i) = u_{k, i}^{T} w^{o} + v_{k} (i),

\begin{array}[]{rcl}\begin{aligned} &\bm{w}_{i}=\arg\min\limits_{\bm{w}}\\ &\left\{\lambda^{i+1}\delta\lVert\bm{w}\rVert_{2}^{2}+\sum\limits_{j=0}^{i}\lambda^{i-j}\sum\limits_{k=1}^{N}\left(d_{k}(j)-\bm{u}_{k,i}^{T}\bm{w}\right)^{2}\right\},\end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} &\bm{w}_{i}=\arg\min\limits_{\bm{w}}\\ &\left\{\lambda^{i+1}\delta\lVert\bm{w}\rVert_{2}^{2}+\sum\limits_{j=0}^{i}\lambda^{i-j}\sum\limits_{k=1}^{N}\left(d_{k}(j)-\bm{u}_{k,i}^{T}\bm{w}\right)^{2}\right\},\end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} J_{k}(\bm{\psi}_{k,i})=&\lVert\bm{\psi}_{k,i}-\bm{w}_{k,i-1}\rVert_{\bm{Q}_{k,i}}^{2}\\ &+[d_{k}(i)-\bm{u}_{k,i}^{T}\bm{\psi}_{k,i}]^{2},\end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} J_{k}(\bm{\psi}_{k,i})=&\lVert\bm{\psi}_{k,i}-\bm{w}_{k,i-1}\rVert_{\bm{Q}_{k,i}}^{2}\\ &+[d_{k}(i)-\bm{u}_{k,i}^{T}\bm{\psi}_{k,i}]^{2},\end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \bm{R}_{k,i}\triangleq&\lambda^{i+1}\delta\bm{I}+\sum\limits_{j=0}^{i}\lambda^{i-j}\bm{u}_{k,j}\bm{u}_{k,j}^{T}\\ =&\lambda\bm{R}_{k,i-1}+\bm{u}_{k,i}\bm{u}_{k,i}^{T}\end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \bm{R}_{k,i}\triangleq&\lambda^{i+1}\delta\bm{I}+\sum\limits_{j=0}^{i}\lambda^{i-j}\bm{u}_{k,j}\bm{u}_{k,j}^{T}\\ =&\lambda\bm{R}_{k,i-1}+\bm{u}_{k,i}\bm{u}_{k,i}^{T}\end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \bm{\psi}_{k,i}=\bm{w}_{k,i-1}+\bm{P}_{k,i}\bm{u}_{k,i}e_{k}(i),\\ \end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \bm{\psi}_{k,i}=\bm{w}_{k,i-1}+\bm{P}_{k,i}\bm{u}_{k,i}e_{k}(i),\\ \end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \bm{P}_{k,i}=\frac{1}{\lambda}\left(\bm{P}_{k,i-1}-\frac{\bm{P}_{k,i-1}\bm{u}_{k,i}\bm{u}_{k,i}^{T}\bm{P}_{k,i-1}}{\lambda+\bm{u}_{k,i}^{T}\bm{P}_{k,i-1}\bm{u}_{k,i}}\right),\\ \end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \bm{P}_{k,i}=\frac{1}{\lambda}\left(\bm{P}_{k,i-1}-\frac{\bm{P}_{k,i-1}\bm{u}_{k,i}\bm{u}_{k,i}^{T}\bm{P}_{k,i-1}}{\lambda+\bm{u}_{k,i}^{T}\bm{P}_{k,i-1}\bm{u}_{k,i}}\right),\\ \end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \lVert\bm{\psi}_{k,i}-\bm{w}_{k,i-1}\rVert_{2}^{2}\leq\xi_{k}(i-1),\end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \lVert\bm{\psi}_{k,i}-\bm{w}_{k,i-1}\rVert_{2}^{2}\leq\xi_{k}(i-1),\end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \lVert\bm{g}_{k,i}\rVert_{2}\lvert e_{k}(i)\rvert\leq\sqrt{\xi_{k}(i-1)},\end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \lVert\bm{g}_{k,i}\rVert_{2}\lvert e_{k}(i)\rvert\leq\sqrt{\xi_{k}(i-1)},\end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \bm{\psi}_{k,i}=\bm{w}_{k,i-1}+\sqrt{\xi_{k}(i-1)}\frac{\bm{g}_{k,i}}{\lVert\bm{g}_{k,i}\rVert_{2}}\text{sign}(e_{k}(i)),\\ \end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \bm{\psi}_{k,i}=\bm{w}_{k,i-1}+\sqrt{\xi_{k}(i-1)}\frac{\bm{g}_{k,i}}{\lVert\bm{g}_{k,i}\rVert_{2}}\text{sign}(e_{k}(i)),\\ \end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \bm{\psi}_{k,i}=\bm{w}_{k,i-1}+\min\left[\frac{\sqrt{\xi_{k}(i-1)}}{\lVert\bm{g}_{k,i}\rVert_{2}\lvert e_{k}(i)\rvert},\;1\right]\bm{g}_{k,i}e_{k}(i).\\ \end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \bm{\psi}_{k,i}=\bm{w}_{k,i-1}+\min\left[\frac{\sqrt{\xi_{k}(i-1)}}{\lVert\bm{g}_{k,i}\rVert_{2}\lvert e_{k}(i)\rvert},\;1\right]\bm{g}_{k,i}e_{k}(i).\\ \end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \bm{w}_{k,i}=\sum\limits_{m\in\mathcal{N}_{k}}c_{m,k}\bm{\psi}_{m,i},\end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \bm{w}_{k,i}=\sum\limits_{m\in\mathcal{N}_{k}}c_{m,k}\bm{\psi}_{m,i},\end{aligned}\end{array}

m \in N_{k} \sum c_{m, k} = 1, and c_{m, k} = 0 if m \in / N_{k} .

m \in N_{k} \sum c_{m, k} = 1, and c_{m, k} = 0 if m \in / N_{k} .

\begin{array}[]{rcl}\begin{aligned} \zeta_{k}(i)=&\beta\xi_{k}(i-1)+(1-\beta)\left\|\bm{\psi}_{k,i}-\bm{w}_{k,i-1}\right\|_{2}^{2}\\ =\beta\xi_{k}&(i-1)+(1-\beta)\min[\lVert\bm{g}_{k,i}\rVert_{2}^{2}e_{k}^{2}(i),\xi_{k}(i-1)],\\ \xi_{k}(i)=&\sum\limits_{m\in\mathcal{N}_{k}}c_{m,k}\zeta_{m}(i),\end{aligned}\end{array}

\begin{array}[]{rcl}\begin{aligned} \zeta_{k}(i)=&\beta\xi_{k}(i-1)+(1-\beta)\left\|\bm{\psi}_{k,i}-\bm{w}_{k,i-1}\right\|_{2}^{2}\\ =\beta\xi_{k}&(i-1)+(1-\beta)\min[\lVert\bm{g}_{k,i}\rVert_{2}^{2}e_{k}^{2}(i),\xi_{k}(i-1)],\\ \xi_{k}(i)=&\sum\limits_{m\in\mathcal{N}_{k}}c_{m,k}\zeta_{m}(i),\end{aligned}\end{array}

u_{k} (i) = 1.6 u_{k} (i - 1) - 0.81 u_{k} (i - 2) + ϵ_{k} (i),

u_{k} (i) = 1.6 u_{k} (i - 1) - 0.81 u_{k} (i - 2) + ϵ_{k} (i),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Study of Robust Diffusion Recursive Least Squares Algorithms with Side Information for Networked Agents

Abstract

This work develops a robust diffusion recursive least squares algorithm to mitigate the performance degradation often experienced in networks of agents in the presence of impulsive noise. This algorithm minimizes an exponentially weighted least-squares cost function subject to a time-dependent constraint on the squared norm of the intermediate estimate update at each node. With the help of side information, the constraint is recursively updated in a diffusion strategy. Moreover, a control strategy for resetting the constraint is also proposed to retain good tracking capability when the estimated parameters suddenly change. Simulations show the superiority of the proposed algorithm over previously reported techniques in various impulsive noise scenarios.

Index Terms— Diffusion cooperation, distributed algorithms, impulsive noises, robust recursive least squares.

1 Introduction

In the last decade, distributed adaptive algorithms for estimating parameters of interest over wireless sensor networks with multiple nodes (or agents) have attracted significant attention, due to their performance advantages and robustness [1]. The core idea is that each node performs adaptive estimation, in cooperation with its neighboring nodes. Distributed adaptive algorithms have been applied to many problems, e.g., frequency estimation in power grid [2] and spectrum estimation [3]. According to the cooperation strategy of interconnected nodes, existing algorithms can be categorized as the incremental [4], consensus [5], and diffusion [6, 7, 8] types. The diffusion type is the most popular [5], because it does not require a Hamiltonian cycle path as does the incremental type [4]; it is stable and has a better estimation performance than the consensus type [5]. Several diffusion-based distributed algorithms have been proposed such as the diffusion least mean square (dLMS) algorithm [6], diffusion conjugate gradient (dCG) [9], diffusion recursive least squares (dRLS) algorithm [7], and their modifications [10, 11, 12, 13, 14].

In practice, measurements at the network nodes can be corrupted by impulsive noise [15]. An impulsive noise process has the property that its occurence probability is small and the magnitude is typically much larger than the nominal measurement. It is well-known that the impulsive noise deteriorates significantly the performance of algorithms in the single-agent case. For distributed algorithms in the multi-agent case, impulsive noise can propagate over the entire network due to the exchange of information among nodes. To reduce the impulsive noise interference, many robust distributed algorithms have been proposed [16, 17, 18, 19, 20, 21, 22, 23, 24]. Some algorithms, e.g., the diffusion sign error LMS (dSE-LMS) [16], are based on using the instantaneous gradient-descent method to minimize an individual robust criterion. In [17], a robust variable weighting coefficients dLMS (RVWC-dLMS) algorithm was developed, which only considers the data and intermediate estimates from nodes not affected by impulsive noise; this is based on a judgement whether impulsive noise samples occur or not. However, these robust algorithms have slow convergence, especially for colored input signals.

RLS-based algorithms have a good decorrelating property for colored signals, thereby providing fast convergence. In this paper, therefore, we present a robust dRLS (R-dRLS) algorithm for distributed estimation over networks affected by impulsive noise. The R-dRLS algorithm minimizes a local exponentially weighted least-squares (LS) cost function subject to a time-dependent constraint on the squared norm of the intermediate estimate at each node. Unlike the framework in [25], we consider here a multi-agent scenario with the diffusion strategy. Furthermore, in order to equip the R-dRLS algorithm with the ability to withstand sudden changes in the environment, we also propose a diffusion-based distributed nonstationary control (DNC) method. This paper is organized as follows. In Section 2, the estimation problem in the network is described. In Section 3, the proposed algorithm is derived. In Section 4, results of simulation in impulsive noise scenarios are presented. Finally, conclusions are given in Section 5.

2 Problem Formulation

Let us consider a network that has $N$ nodes distributed over some region in space, where a link between two nodes means that they can communicate directly with each other. The neighborhood of node k is denoted by $\mathcal{N}_{k}$ , i.e., a set of all nodes connected to node $k$ including itself. The cardinality of $\mathcal{N}_{k}$ is denoted by $n_{k}$ . At every time instant $i\geq 0$ , every node k observes a data regressor vector $\bm{u}_{k,i}$ of size $M\times 1$ and a scalar measurement $d_{k}(i)$ , related as:

[TABLE]

where the superscript $T$ denotes the transpose, $\bm{w}^{o}$ is a parameter vector of size $M\times 1$ , and $v_{k}(i)$ is the additive noise at node k. The regressors $\bm{u}_{k,i}$ and $\bm{u}_{l,j}$ are spatially independent for $k\neq l$ and all $i,j$ . The additive noises $v_{k}(i)$ and $v_{l}(j)$ are spatially and temporally independent for $k\neq l$ and $i\neq j$ . Moreover, any $\bm{u}_{k,i}$ is independent of any $v_{l}(j)$ . The model (1) is widely used in many applications [1, 26].

The task is to estimate $\bm{w}^{o}$ , using the available data collected at nodes, i.e., $\{\bm{u}_{k,i},d_{k}(i)\}_{k=1}^{N}$ . For this purpose, the global LS-based estimation problem is described as [7]:

[TABLE]

where $\lVert\cdot\rVert_{2}$ denotes the $l_{2}$ -norm of a vector, $\delta>0$ is a regularization constant, and $\lambda$ is the forgetting factor. The dRLS algorithm solves (2) in a distributed manner [7]. In practice, $v_{k}(i)$ may contain impulsive noise, severely corrupting the measurement $d_{k}(i)$ . With such noise processes, the algorithms obtained from (2), e.g., the dRLS algorithm, would fail to work.

3 Proposed Distributed Algorithm

3.1 Derivation of the R-dRLS Algorithm

We focus here on the adapt-then-combine (ATC) implementation of the diffusion strategy, which has been shown to outperform the combine-then-adapt (CTA) implementation111 In fact, the CTA version is obtained by reversing the adaptation step and combination step in the ATC version. [5]. Following the ATC-based diffusion strategy [6, 7], i.e., performing first the adaptation step and then the combination step, the R-dRLS algorithm will be derived in the sequel. We start with the adaptation step. Every node $k$ , at time instant $i$ , finds an intermediate estimate $\bm{\psi}_{k,i}$ of $\bm{w}^{o}$ by minimizing the individual local cost function:

[TABLE]

with $\bm{Q}_{k,i}=\bm{R}_{k,i}-\bm{u}_{k,i}\bm{u}_{k,i}^{T}$ , where

[TABLE]

is the time-averaged correlation matrix for the regression vector at node $k$ and $\bm{w}_{k,i-1}$ is an estimate of $\bm{w}^{o}$ at node $k$ at time instant $i-1$ . Notice that the form $\lVert\bm{x}\rVert_{\bm{Q}}^{2}\triangleq\bm{x}^{T}\bm{Q}\bm{x}$ in (3) defines the Riemmanian distance [27] between vectors $\bm{\psi}_{k,i}$ and $\bm{w}_{k,i-1}$ . Setting the derivative of $J_{k}(\bm{\psi}_{k,i})$ with respect to $\bm{\psi}_{k,i}$ to zero, we obtain

[TABLE]

where $e_{k}(i)=d_{k}(i)-\bm{u}_{k,i}^{T}\bm{w}_{k,i-1}$ stands for the output error at node $k$ and $\bm{P}_{k,i}\triangleq\bm{R}_{k,i}^{-1}$ . Using the matrix inversion lemma [26], we have

[TABLE]

where $\bm{P}_{k,i}$ is initialized as $\left.\bm{P}_{k,0}=\delta^{-1}\bm{I}\right.$ and $\bm{I}$ is an identity matrix. Since $\bm{w}_{k,i-1}=\bm{R}_{k,i-1}^{-1}\bm{z}_{k,i-1}$ , where $\bm{z}_{k,i}=\lambda\bm{z}_{k,i-1}+\bm{u}_{k,i}d_{k}(i)$ , (5) means that every node $k$ performs an RLS update. However, with the update (5), the adverse effect of an impulsive noise sample at time instant $i$ will propagate through nodes via $e_{k}(i)$ . This effect can last for many iterations. To make the algorithm robust in impulsive noise scenarios, we propose to minimize (3) under the following constraint:

[TABLE]

where $\xi_{k}(i)$ is a positive bound. This constraint is employed to enforce the squared norm of the update of the intermediate estimate not to exceed the amount $\xi_{k}(i-1)$ regardless of the type of noise (possibly, impulsive noise), thereby guaranteeing the robustness of the algorithm. If (5) satisfies (7), i.e.,

[TABLE]

where $\left.\bm{g}_{k,i}\triangleq\bm{P}_{k,i}\bm{u}_{k,i}\right.$ represents the Kalman gain vector, then (5) is a solution of the above constrained minimization problem. On the other hand, if (8) is not satisfied (usually in the case of appearance of impulsive noise samples), i.e., $\left.\lVert\bm{g}_{k,i}\rVert_{2}\lvert e_{k}(i)\rvert>\sqrt{\xi_{k}(i-1)}\right.$ , we propose to replace the update (5) by a normalized form to satisfy the constraint (7), which is described by

[TABLE]

where $\text{sign}(\cdot)$ is the sign function. Consequently, combining (5), (8) and (9), we obtain the adaptation step for each node $k$ as:

[TABLE]

Then, at the combination step, the intermediate estimates $\psi_{m,i}$ from the neigborhood $m\in\mathcal{N}_{k}$ of node $k$ are linearly weighed, yielding a more reliable estimate $\bm{w}_{k,i}$ [1]:

[TABLE]

where the combination coefficients $\{c_{m,k}\}$ are non-negative, and satisfy:

[TABLE]

Note that, $c_{m,k}$ is a weight that node $k$ assigns to the intermediate estimate $\bm{\psi}_{m,i}$ received from its neighbor node $m$ . In general, $\{c_{m,k}\}$ are determined by a static rule (e.g., the Metropolis rule [28] that we adopt in this paper) which keeps them constant in the estimation, or an adaptive rule [28]. It is evident that the bound $\xi_{k}(i)$ controls the robustness of the algorithm against impulsive noise and influences its dynamic behavior, so choosing its value properly is of fundamental importance. To this end, motivated by the single-agent case in [25], $\xi_{k}(i)$ is adjusted recursively based on the diffusion strategy as:

[TABLE]

where $\beta$ is a forgetting factor, $0<\beta\lesssim 1$ . In (13), at every node $k$ , $\xi_{k}(i)$ can be initialized as $\xi_{k}(0)=E_{c}\sigma_{d,k}^{2}/(M\sigma_{u,k}^{2})$ , where $E_{c}$ is a positive integer, and $\sigma_{d,k}^{2}$ and $\sigma_{u,k}^{2}$ are powers of signals $d_{k}(i)$ and $\bm{u}_{k,i}$ , respectively. The proposed algorithm is shown in Table 1.

Remark: As can be seen from (10), the operation mode of the proposed algorithm is twofold. At time instant $i$ , if $\lVert\bm{g}_{k,i}\rVert_{2}^{2}e_{k}^{2}(i)\leq\xi_{k}(i-1)$ , the RLS update is performed; if not, the RLS update is normalized to have a norm of value $\xi_{k}(i-1)$ . At the early iterations, the values of $\xi_{k}(i)$ can be high compared to $\lVert\bm{g}_{k,i}\rVert_{2}^{2}e_{k}^{2}(i)$ so that the algorithm will behave as the dRLS algorithm, providing a fast convergence. Whenever an impulsive noise sample appears, due to its significant magnitude, the algorithm will work as an dRLS update multiplied by a very small ’step size’ scaling factor given by $\sqrt{\xi_{k}(i-1)}/(\lVert\bm{g}_{k,i}\rVert_{2}|e_{k}(i)|)$ , thus suppressing the negative influence of impulsive noise on the estimation [29, 30, 31, 32, 33, 34, 35, 36, 37] and reducing the error propagation effect. The algorithm robustness to impulsive noise is further maintained, due to decreasing $\xi_{k}(i)$ over the iterations. This algorithm can be considered as an improved dRLS algorithm with an additional ’step size’ scaling factor which is time-varying and between 1 and $\sqrt{\xi_{k}(i-1)}/(\lVert\bm{g}_{k,i}\rVert_{2}|e_{k}(i)|)$ , as can be observed in (10).

3.2 DNC Method

Although the decreasing values of the sequence $\{\xi_{k}(i)\}$ with the iteration $i$ prompt the R-dRLS algorithm more robust against impulsive noises, the algorithm also loses its tracking capability for a sudden change of the unknown vector $\bm{w}^{o}$ . To improve the tracking capability, referring to the single-agent scenario [38], we also develop a diffusion-based DNC method, summarized in Table 1. The DNC method includes two implementation procedures.

Firstly, a variable $\varDelta_{k}(i)$ at node $k$ is computed once for every $V_{t}$ iterations, to judge whether the unknown vector has a change or not. In this step, $\bm{a}_{k,i}^{T}=\mathcal{O}\left(\left[\frac{e_{k}^{2}(i)}{\|\bm{u}_{k,i}\|_{2}^{2}},\frac{e_{k}^{2}(i-1)}{\|\bm{u}_{k,i-1}\|_{2}^{2}},\text{...},\frac{e_{k}^{2}(i-V_{t}+1)}{\|\bm{u}_{k,i-V_{t}+1}\|_{2}^{2}}\right]\right)$ with $\mathcal{O}(\cdot)$ denoting the ascending arrangement for its arguments, and $\bm{e}=[1,...,1,0,...,0]^{T}$ is a vector whose first $V_{t}-V_{d}$ elements set to one, where $V_{d}$ is a positive integer with $V_{d}<V_{t}$ . Thus, the product $\bm{a}_{k,i}^{T}\bm{e}$ can remove the effect of outliers (e.g., impulsive noise samples) on the rightness of the computation of $\varDelta_{k}(i)$ . Typically, for both $V_{t}$ and $V_{d}$ , good choices are $V_{t}=\varrho M$ with $\varrho=1\sim 3$ and $V_{d}=0.75V_{t}$ [38]. Note that, for large occurence probability of impulsive noise, the value of $V_{t}-V_{d}$ should be decreased to discard the impulsive noise samples.

Secondly, if $\varDelta_{k}(i)>t_{\text{th}}$ , where $t_{\text{th}}$ is a predefined threshold, meaning a change of $\bm{w}^{o}$ has occured, then we need to reset $\xi_{k}(i)$ to its initial value $\xi_{k}(0)$ . More importantly, $\bm{P}_{k,i}$ is also re-initialized with $\bm{P}_{k,0}$ . It is worth noting that since the parameters $\gamma,\;N_{w},\;\varrho$ , and $t_{\text{th}}$ are not affected by each other, their choices are simple.

4 Simulation Results

Simulation examples are presented for a diffusion network with $N=20$ nodes. The graph describing the network is assumed to be partially connected. Adjustments to the graph can be carried out using approaches reported in [39, 40, 41, 42, 43]. The vector $\bm{w}^{o}$ to be estimated has a length of $M=16$ and a unit norm; it is generated randomly from a zero-mean uniform distribution. To evaluate the tracking capability, $\bm{w}^{o}$ changes to $-\bm{w}^{o}$ in the middle of iterations. The input regressor $\bm{u}_{k,i}$ has a shifted structure, i.e., $\bm{u}_{k,i}=[u_{k}(i),u_{k}(i-1),...,u_{k}(i-M+1)]^{T}$ [44, 4], where $u_{k}(i)$ is colored and generated by a second-order autoregressive system:

[TABLE]

where $\epsilon_{k}(i)$ is a zero-mean white Gaussian process with variance $\sigma_{\epsilon,k}^{2}$ shown in Fig. 1(a) for all the nodes. We employ the network mean square deviation (MSD) to assess the performance of the algorithm, i.e., $\text{MSD}_{\text{net}}(i)=\frac{1}{N}\sum\limits_{k=1}^{N}E\{\|\bm{w}^{o}-\bm{w}_{k,i}\|_{2}^{2}\}$ , where $E\{\cdot\}$ denotes the expectation. Usually, the impulsive noise can be described by either the Bernoulli-Gaussian (BG) distribution [16, 17, 18] or the $\alpha$ -Stable distribution [45, 27]. We consider both cases. All results are the average over 200 independent trials.

4.1 BG Distribution

The additive noise $v_{k}(i)$ includes the background noise $\theta_{k}(i)$ plus the impulsive noise $\eta_{k}(i)$ , where $\theta_{k}(i)$ is zero-mean white Gaussian noise with variance $\sigma_{\theta,k}^{2}$ depicted in Fig. 1(b). The impulsive noise $\eta_{k}(i)$ is described by the BG distribution, $\eta_{k}(i)=b_{k}(i)\cdot g_{k}(i)$ , where $b_{k}(i)$ is a Bernoulli process with probability distribution $P[b_{k}(i)=1]=p_{r,k}$ and $\left.P[b_{k}(i)=0]=1-p_{r,k}\right.$ , and $g_{k}(i)$ is a zero-mean white Gaussian process with variance $\sigma_{g,k}^{2}$ . Here, we set $p_{r,k}$ as a random number in the range of $[0.001,0.05]$ , and $\sigma_{g,k}^{2}=1000\sigma_{y,k}^{2}$ , where $\sigma_{y,k}^{2}$ denotes the power of $y_{k}(i)=\bm{u}_{k,i}^{T}\bm{w}^{o}$ . Fig. 2 compares the performance of the dRLS, dSE-LMS, and RVWC-dLMS algorithms with that of the proposed R-dRLS algorithm. Note that, the R-dRLS (no cooperation) algorithm performs an independent estimation at each node as presented in [25]. For RLS-type algorithms, we choose $\lambda$ =0.995 and $\delta$ =0.01. As expected, the dRLS algorithm has a poor performance in the presence of impulsive noise. Both the dSE-LMS and RVWC-dLMS algorithms are significantly less sensitive to impulsive noise, but their convergence is slow. Apart from the robustness for combating impulsive noise, the proposed R-dRLS algorithm has also a fast convergence. Moreover, the proposed DNC method can retain the good tracking capability of the R-dRLS algorithm, only with a slight degradation in steady-state performance.

4.2 $\alpha$ -Stable Distribution

The impulsive noise is now modeled by the $\alpha$ -stable distribution with a characteristic function $\varphi(t)=\exp(-\gamma\lvert t\lvert^{\alpha})$ , where the characteristic exponent $\alpha\in(0,2]$ describes the impulsiveness of the noise (smaller $\alpha$ leads to more impulsive noise samples) and $\gamma>0$ represents the dispersion level of the noise. In particular, when $\alpha=2$ , it reduces to the Gaussian noise. It is rare to find $\alpha$ -stable noise with $\alpha<1$ in practice [27, 46]. In this example, thus we set $\alpha=1.15$ and $\gamma=1/15$ . The learning performance of the algorithms is shown in Fig. 3. Fig. 4 shows the node-wise steady-state MSD of the robust algorithms (i.e., excluding the dRLS) against impulsive noise, by averaging over 500 instantaneous MSD values in the steady-state. As can be seen from Figs. 3 and 4, the proposed R-dRLS algorithm with DNC outperforms the known robust algorithms.

5 Conclusion

In this paper, the R-dRLS algorithm has been proposed, based on the minimization of an individual RLS cost function with a time-dependent constraint on the squared norm of the intermediate estimate update. The constraint is dynamically adjusted based on the diffusion strategy with the help of side information. The novel algorithm not only is robust against impulsive noise, but also has fast convergence. Furthermore, to track the change of parameters of interest, a detection method (DNC method) is proposed for re-initializing the constraint. Simulation results have verified that the proposed algorithm performs better than known algorithms in impulsive noise scenarios.

Bibliography46

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A.H. Sayed, “Adaptation, learning, and optimization over networks,” Foundations and Trends in Machine Learning , vol. 7, no. 4-5, pp. 311–801, 2014.
2[2] S. Kanna, D.H. Dini, Y. Xia, S.Y. Hui, and D.P. Mandic, “Distributed widely linear kalman filtering for frequency estimation in power networks,” IEEE Transactions on Signal and Information Processing over Networks , vol. 1, no. 1, pp. 45–57, 2015.
3[3] T.G. Miller, S. Xu, R.C. de Lamare, and H.V. Poor, “Distributed spectrum estimation based on alternating mixed discrete-continuous adaptation,” IEEE Signal Processing Letters , vol. 23, no. 4, pp. 551–555, 2016.
4[4] L. Li, J.A. Chambers, C.G. Lopes, and A.H. Sayed, “Distributed estimation over an adaptive incremental network based on the affine projection algorithm,” IEEE Transactions on Signal Processing , vol. 58, no. 1, pp. 151–164, 2010.
5[5] S.Y. Tu and A.H. Sayed, “Diffusion strategies outperform consensus strategies for distributed estimation over adaptive networks,” IEEE Transactions on Signal Processing , vol. 60, no. 12, pp. 6217–6234, 2012.
6[6] C.G. Lopes and A.H. Sayed, “Diffusion least-mean squares over adaptive networks: Formulation and performance analysis,” IEEE Transactions on Signal Processing , vol. 56, no. 7, pp. 3122–3136, 2008.
7[7] F.S. Cattivelli, C.G. Lopes, and A.H. Sayed, “Diffusion recursive least-squares for distributed estimation over adaptive networks,” IEEE Transactions on Signal Processing , vol. 56, no. 5, pp. 1865–1877, 2008.
8[8] J. Chen and A.H. Sayed, “Diffusion adaptation strategies for distributed optimization and learning over networks,” IEEE Transactions on Signal Processing , vol. 60, no. 8, pp. 4289–4305, 2012.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Study of Robust Diffusion Recursive Least Squares Algorithms with Side Information for Networked Agents

Abstract

1 Introduction

2 Problem Formulation

3 Proposed Distributed Algorithm

3.1 Derivation of the R-dRLS Algorithm

3.2 DNC Method

4 Simulation Results

4.1 BG Distribution

4.2 α\alphaα-Stable Distribution

5 Conclusion

4.2 $\alpha$ -Stable Distribution