Preserving Privacy of Finite Impulse Response Systems

Giulio Bottegal; Farhad Farokhi; Iman Shames

arXiv:1706.01587·math.OC·June 7, 2017

Preserving Privacy of Finite Impulse Response Systems

Giulio Bottegal, Farhad Farokhi, Iman Shames

PDF

TL;DR

This paper proposes methods to add noise to FIR systems to protect their models from identification, balancing privacy with system performance, using optimal filtering and differential privacy techniques.

Contribution

It introduces novel noise design strategies for FIR systems that maximize identification error while controlling performance loss, combining optimal filtering and differential privacy.

Findings

01

Optimal filters for noise construction are developed.

02

Differential privacy mechanisms are applied to FIR systems.

03

Trade-offs between privacy and system performance are characterized.

Abstract

Adding input and output noises for increasing model identification error of finite impulse response (FIR) systems is considered. This is motivated by the desire to protect the model of the system as a trade secret by rendering model identification techniques ineffective. Optimal filters for constructing additive noises that maximizes the identification error subject to maintaining the closed-loop performance degradation below a limit are constructed. Furthermore, differential privacy is used for designing output noises that preserve the privacy of the model.

Equations92

y_{t} = H (q^{- 1}) r_{t} + e_{t},

y_{t} = H (q^{- 1}) r_{t} + e_{t},

y_{t} = H (q^{- 1}) r_{t} + e_{t} + w_{t} .

y_{t} = H (q^{- 1}) r_{t} + e_{t} + w_{t} .

R := r_{1} r_{2} ⋮ r_{n_{h}} ⋮ r_{N} 0 r_{1} ⋮ r_{n_{h} - 1} ⋮ r_{N - 1} 00 ⋮ r_{n_{h} - 2} ⋮ \dots \dots \dots ⋱ \dots ⋱ \dots 00 ⋮ r_{1} ⋮ r_{N - n_{h} + 1},

R := r_{1} r_{2} ⋮ r_{n_{h}} ⋮ r_{N} 0 r_{1} ⋮ r_{n_{h} - 1} ⋮ r_{N - 1} 00 ⋮ r_{n_{h} - 2} ⋮ \dots \dots \dots ⋱ \dots ⋱ \dots 00 ⋮ r_{1} ⋮ r_{N - n_{h} + 1},

\hat{h} = (R^{⊤} R)^{- 1} R^{⊤} y .

\hat{h} = (R^{⊤} R)^{- 1} R^{⊤} y .

P_{h} := E {(\hat{h} - h) (\hat{h} - h)^{⊤}} .

P_{h} := E {(\hat{h} - h) (\hat{h} - h)^{⊤}} .

w_{t} = L (q^{- 1}) v_{t},

w_{t} = L (q^{- 1}) v_{t},

L := l_{n_{l} - 1} 0 ⋮ 00 \dots l_{n_{l} - 1} ⋱ \dots 0 l_{0} \dots ⋱ 0 \dots 0 l_{0} ⋱ l_{n_{l} - 1} 0 00 ⋱ \dots l_{n_{l} - 1} \dots \dots ⋱ l_{0} \dots 00 ⋮ 0 l_{0} .

L := l_{n_{l} - 1} 0 ⋮ 00 \dots l_{n_{l} - 1} ⋱ \dots 0 l_{0} \dots ⋱ 0 \dots 0 l_{0} ⋱ l_{n_{l} - 1} 0 00 ⋱ \dots l_{n_{l} - 1} \dots \dots ⋱ l_{0} \dots 00 ⋮ 0 l_{0} .

P_{h}

P_{h}

= (R^{⊤} R)^{- 1} R^{⊤} (L L^{⊤} + σ^{2} I_{N}) R (R^{⊤} R)^{- 1} .

λ_{y} := E {y_{t}^{2} ∣ r_{t} = 0} = E {(w_{t} + e_{t})^{2}} = ∥ l ∥^{2} + σ^{2},

λ_{y} := E {y_{t}^{2} ∣ r_{t} = 0} = E {(w_{t} + e_{t})^{2}} = ∥ l ∥^{2} + σ^{2},

x_{t} = L (q^{- 1}) v_{t},

x_{t} = L (q^{- 1}) v_{t},

y_{t}

y_{t}

= H (q^{- 1}) (r_{t} + L (q^{- 1}) v_{t}) + e_{t} .

F (q^{- 1}) := H (q^{- 1}) L (q^{- 1}),

F (q^{- 1}) := H (q^{- 1}) L (q^{- 1}),

F (q^{- 1}) = k = 0 \sum n_{f} - 1 f_{k} q^{- k}, n_{f} = n_{h} + n_{l} - 1.

F (q^{- 1}) = k = 0 \sum n_{f} - 1 f_{k} q^{- k}, n_{f} = n_{h} + n_{l} - 1.

P_{h}

P_{h}

= (R^{⊤} R)^{- 1} R^{⊤} (F F^{⊤} + σ^{2} I_{N}) R (R^{⊤} R)^{- 1} .

arg max_{l \in R^{n_{l}}}

arg max_{l \in R^{n_{l}}}

s.t.

ρ :=

ρ :=

E

E

c

arg max_{l \in R^{n_{l}}}

arg max_{l \in R^{n_{l}}}

s.t.

η^{*} \in arg max_{η \in R^{n_{l}}}

η^{*} \in arg max_{η \in R^{n_{l}}}

s.t.

arg min_{l \in R^{n_{l}}} (\mbox tr (P_{h}))^{- 1} + γ_{2} λ_{y},

arg min_{l \in R^{n_{l}}} (\mbox tr (P_{h}))^{- 1} + γ_{2} λ_{y},

arg min_{l \in R^{n_{l}}} (l^{⊤} M l + c)^{- 1} + γ_{2} ∥ l ∥^{2},

arg min_{l \in R^{n_{l}}} (l^{⊤} M l + c)^{- 1} + γ_{2} ∥ l ∥^{2},

l^{*} = {0, 1/ γ λ_{1} - c / λ_{1} v_{1}, λ_{1} \leq γ_{2} c^{2}, \mbox o t h er w i se .

l^{*} = {0, 1/ γ λ_{1} - c / λ_{1} v_{1}, λ_{1} \leq γ_{2} c^{2}, \mbox o t h er w i se .

arg max_{l \in R^{n_{l}}}

arg max_{l \in R^{n_{l}}}

s.t.

\mbox tr (P_{h}) = f^{⊤} Q_{f}^{⊤} (I_{N + n_{f} - 1} \otimes E) Q_{f} f + c,

\mbox tr (P_{h}) = f^{⊤} Q_{f}^{⊤} (I_{N + n_{f} - 1} \otimes E) Q_{f} f + c,

f = H l,

f = H l,

arg max_{l \in R^{n_{l}}}

arg max_{l \in R^{n_{l}}}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Preserving Privacy of Finite Impulse Response Systems

Giulio Bottegal, Farhad Farokhi, and Iman Shames F. Farokhi and I. Shames are with the University of Melbourne, Australia. G. Bottegal is with TU Eindhoven, The Netherlands. e-mails: [email protected] (F. Farokhi), [email protected] (I. Shames), [email protected] (G. Bottegal)The work was supported by a McKenzie Fellowship and the Australian Research Council (LP130100605).

Abstract

Adding input and output noises for increasing model identification error of finite impulse response (FIR) systems is considered. This is motivated by the desire to protect the model of the system as a trade secret by rendering model identification techniques ineffective. Optimal filters for constructing additive noises that maximizes the identification error subject to maintaining the closed-loop performance degradation below a limit are constructed. Furthermore, differential privacy is used for designing output noises that preserve the privacy of the model.

I Introduction

Innovative industries invest resources (e.g., money and time for research and development) to construct new systems and to improve the performance of the previously-deployed ones. To generate revenue and offset the cost of research, they ideally want to capitalize on their achievements. This is sometimes done by restricting the use of their ideas through patents or by hiding the features of their systems as trade secrets. When opting for trade secrets, reverse engineering techniques can be used by competitors to unravel their secrets. For instance, model identification tools can be utilized to identify a black-box system or to extract the parameters of a gray-box system. The gained information can be then used to reverse the financial gains. This motivates the use of methods that can render reverse-engineering techniques ineffective. Such methods, however, most often degrade the performance of the system. Therefore, a framework for balancing the need for preserving the trade secrets against maintaining the performance of the systems is required.

In this paper, linear time-invariant discrete-time finite impulse response (FIR) system are considered. Specifically, the idea of adding noises to the input and output for increasing the error of model identification is explored. A bound on closed-loop performance degradation caused by the additive noise is enforced. An optimal filter for constructing the additive input and output noises that maximizes the identification error subject to maintaining the performance degradation below a threshold is constructed. This is done for both known and unknown input sequences. The former is useful to make the identification difficult for given inputs, such as the optimal experimental design in the model identification literature [1]. The latter, which requires statistics of the input, can accommodate the belief of the designer on the reverse engineering techniques, e.g., a frequently used input for model identification purposes is a sequence of i.i.d.111i.i.d. stands for independently and identically distributed. Gaussian noise [2]. Finally, differential privacy framework is used for designing output additive noises that make the system identification difficult without any assumptions on the utilized inputs.

In differential privacy literature, noises are added to the outcome of statistical queries from databases to preserve the privacy of individuals in the database [3]. This framework was more recently used in dynamical systems [4, 5]. In differential privacy literature, most often, additive Laplace noises are used and the parameters of the noise are selected according to the sensitivity of the outcome to variations in the data (that should be kept private). However, weaker variants of differential privacy can be achieved by additive Gaussian noises. This is advantageous as adding Laplace noise can make the designer’s task considerably more difficult (in terms of utilizing the outputs of the system), e.g., optimal state estimation when measurements are corrupted by Laplace noise results in non-linearities and memory issues [6].

To the best of our knowledge, the differential privacy has not been explored in the context of preserving the privacy of dynamical systems with the aim of protecting the model as a trade secret. This has been explored thoroughly in one of the sections of the paper. In addition, in this paper, the problem of preserving the privacy of the systems is cast as a concrete optimization problem that balances the need for keeping the privacy with that of the maintaining the performance. This provides a different approach to that of differential privacy in which constraints on the performance degradation cannot be enforced directly to optimally balance between privacy and performance. Finally, note that the problem of releasing the dynamical model of a system under privacy constraints was considered in [7]. In this paper, we take a different approach, i.e., we do not release the model of the system. We want to ensure that inferring an exact model relating inputs and outputs is made difficult.

The rest of the paper is organized as follows. The design of optimal additive input and output noise to hinder system identification is studied in Section II. Section III uses the differential privacy for constructing additive output noises. A numerical example is provided in Section IV. Some concluding remarks are presented in Section V.

II Optimal Additive Noise

Here, we investigate the use of additive noise to preserve the privacy of the model information assuming that the eavesdropper uses the best linear unbiased estimate. These results are subsequently generalized (to the case where the model of the eavesdropper is not known) when using the differential privacy framework.

II-A Problem Formulation

In this paper, for sake of simplicity of presentation, linear single-input single-output (SISO) time-invariant discrete-time systems are considered. All the derivations can be extended to multi-input multi-output (MIMO) systems. The system is described by the following equation

[TABLE]

where $H(q^{-1})$ represents the transfer function of the system, which is driven by the reference input $r_{t}$ . The output $y_{t}$ is corrupted by additive white Gaussian noise with variance $\sigma^{2}$ , which is represented by $e_{t}$ . Assume that $H(q^{-1})$ can be well-represented by a finite-impulse response (FIR) system of order $n_{h}$ , i.e., $H(q^{-1})=\sum_{k=0}^{n_{h}-1}h_{k}q^{-k}$ . Hence, the dynamics of the system is completely characterized by the vector of coefficients $h:=[h_{0}\,\ldots\,h_{n_{h}-1}]^{\top}$ . In this paper, we assume null initial conditions (that is $r_{t}=0$ for $t\leq 0$ ), though extension to any initial condition is straightforward due to the linearity of the underlying system.

Assume that an adversary is interested in inferring on the process relating $r_{t}$ to $y_{t}$ by attempting to estimate $h$ from a set of $N$ input/output measurements $\{r_{t},\,y_{t}\}_{t=1}^{N}$ . To complicate the identification process, an additional component (which is not accessible to the adversary) can be added to the input or to the output of the system to lower the identification accuracy. Let $w_{t}$ capture such an additional component, which changes the model of the system as

[TABLE]

This term can capture both the additive input and output noise as discussed, in detail, in what follows.

Assumption II.1

The malicious entity is unaware of the presence of the additive input or output noise.

This assumption is rather conservative. When using the differential privacy framework in the next section, we can avoid such assumptions. Considering a FIR model for the system and in light of Assumption II.1, the best linear unbiased estimate (BLUE) of $h$ from perspective of the malicious entity is given by the standard least-squares estimate [8, Ch. 4]. Let us introduce the vectors $y:=[y_{1}\,\ldots\,y_{N}]^{\top}$ , $e:=[e_{1}\,\ldots\,e_{N}]^{\top}$ , and $w:=[w_{1}\,\ldots\,w_{N}]^{\top}$ . Assuming that the system is at rest prior to the data collection (i.e., $r_{t}=0$ for all $t\leq 0$ ) and defining the matrix

[TABLE]

it is evident that $y=Rh+w+e.$ The least-squares estimate of $h$ is then given by

[TABLE]

Note that this estimator is not the true BLUE, which would require the knowledge of the second order statistics of $w_{t}$ . However, it is the best that the malicious entity can do without the knowledge that $w_{t}$ exists. This estimator is still unbiased because ${\mathbb{E}}\{\hat{h}\}={\mathbb{E}}\{(R^{\top}R)^{-1}R^{\top}(Rh+w+e)\}=h+(R^{\top}R)^{-1}R^{\top}{\mathbb{E}}\{w+e\}=h$ . Then, a measure of the accuracy of the estimation of the impulse response is the covariance matrix of $\hat{h}$ [8, Ch. 4], namely

[TABLE]

The additional input $w_{t}$ determines the quality of the estimated system $\hat{h}$ by entering into the expression of the parameter covariance matrix $P_{h}$ . Intuitively, the higher the power of $w_{t}$ , the higher $P_{h}$ (and thus the lower the identification accuracy). On the other hand, $w_{t}$ has an undesired effect on the output power. Therefore, the additive noise is designed to increase the total variance of $\hat{h}$ (expressed through the trace of $P_{h}$ ) while keeping low the contribution of $w_{t}$ to the variance of $y_{t}$ . Let $\lambda_{y}:={\mathbb{E}}\left[y_{t}^{2}|r_{t}=0,\,t\in\mathbb{Z}\right]$ be such contribution. Note that, if $r_{t}=0$ , the output is driven only by the stationary noise processes $e_{t}$ and $w_{t}$ and so $\lambda_{y}$ is constant in $t$ .

Problem II.2

For a given input $r$ , find an appropriate additive noise $w_{t}$ to maximize the identification error $\mbox{\rm tr}(P_{h})$ while keeping the performance degradation small by guaranteeing $\lambda_{y}\leq\gamma_{1}$ .

In Problem II.2, $\gamma_{1}$ is a pre-selected constant that reflects the maximum tolerable output variance, which is a measure of the performance degradation caused by the additive input and output noises. If $\gamma_{1}$ is very small, the optimal solution is add no noise. In this case, the closed-loop performance is far superior to protecting the model. However, if $\gamma_{1}$ is too large, the output of the system is drowned in noise and thus the system becomes practically useless.

Here, the additive noise is designed for a given sequence of inputs captured by $r$ . This might not be generally feasible as, when dealing with causal systems, the additive noise should be designed and employed prior to receiving the entire sequence of inputs. This design methodology is however very useful to make the identification difficult for a given input, such as those in optimal experimental design in the model identification literature [1]. Alternatively, a distribution for the input signal can be considered. Furthermore, the length of the experiment $N$ that the malicious entity is collecting to identify the system is also unknown a priori, and shall be treated as a random quantity.

Assumption II.3

Let $N\in\mathbb{N}$ be a random number distributed according to $\mathbb{P}\{N=\ell\}=p(\ell)$ for some $p:\mathbb{N}\rightarrow[0,1]$ such that $\sum_{\ell\in\mathbb{N}}p(\ell)=1$ . For a given $N$ , assume that $r\in\mathbb{R}^{N}$ is distributed according to the conditional probability density function $p(\cdot|N)$ such that $\mathbb{P}\{r\in\mathcal{R}|N\}=\int_{r^{\prime}\in\mathcal{R}}p(r^{\prime}|N)\mathrm{d}r^{\prime}$ for all Lebesgue-measurable sets $\mathcal{R}\subseteq\mathbb{R}^{N}$ .

Remark II.4

In general, the probability density function of the input signals might not be known in advance. In that case, an online or adaptive approach can be used to estimate the statistical properties of the input as more inputs are revealed over time and design (or update the design of) privacy-preserving filters based on the additional gathered information. The result of this paper can serve as a first step in that direction. This is because if rigorous treatment of the problem for known deterministic inputs or random inputs with known probability distributions is not well understood, the analysis of the online approach would not be possible (or straightforward to say the least).

In this case, the identification error $P_{h}$ which is used as a measure of privacy should be replaced with $\mathbb{E}\{P_{h}\}$ with the expectation being taken over random variables $r$ and $N$ . This allows us to generalize the problem of the interest as follows.

Problem II.5

For given distributions of random variables $N$ and $r$ following Assumption II.3, find an appropriate additive noise $w_{t}$ to maximize the identification error $\mbox{\rm tr}(\mathbb{E}\{P_{h}\})$ while keeping the performance degradation small by guaranteeing $\lambda_{y}\leq\gamma_{1}$ .

In this paper, two families of additive noise are considered, namely, additive output noise and additive input noise. In the remainder of this section, these two families are described.

II-A1 Additive Output Noise

Figure 1 (a) illustrates the schematic diagram of the closed-loop system with additive output noise. The additive noise $w_{t}$ is modelled by a zero-mean moving-average (MA) stochastic process of the form

[TABLE]

where $v_{t}$ is a sequence of i.i.d. zero-mean noise (which is not necessarily Gaussian) of unit variance and $L(q^{-1}):=\sum_{k=0}^{n_{l}}l_{k}q^{-k}$ is a FIR filter of prescribed order $n_{l}$ . Then, $w_{t}$ is a stationary process with zero-mean and well-defined autocovariance function [9]. The additive noise $w:=[w_{1}\,\ldots\,w_{N}]^{\top}$ can be expressed as $w=Lv$ , where $v:=[v_{-n_{l}+2}\,\ldots v_{0}\,v_{1}\,\ldots\,v_{N}]^{\top}$ and

[TABLE]

The identification error covariance, in this case, is

[TABLE]

Further, the output variance can be determined by

[TABLE]

where $l=[l_{0}\,\ldots\,l_{n_{l}-1}]^{\top}$ .

Remark II.6

It should be noted that by increasing the order of the noise generation filter $n_{l}$ , the performance can only be improved while maintaining the same privacy guarantee. This is because the optimal solution from the lower order is always feasible in the optimization problem relating to the higher order noise filters. The order of the system is thus only dictated by the available resources for preserving the privacy of the model.

II-A2 Additive Input Noise

Figure 1 (b) shows the schematic diagram of the closed-loop system with additive input noise. In this case, the additive input noise is denoted by $x_{t}$ and is modeled by a zero-mean MA stochastic process of the form

[TABLE]

where, similarly, $v_{t}$ is a sequence of i.i.d. zero-mean noise of unit variance and $L(q^{-1})$ is a FIR filter of prescribed order $n_{l}$ determining the autocorrelation of $x_{t}$ . Then, the new system is described by

[TABLE]

The additive noise $w_{t}$ , in this case, is the contribution of $x_{t}$ to the output, i.e., $w_{t}=H(q^{-1})L(q^{-1})v_{t}$ . Define

[TABLE]

which can be expressed as

[TABLE]

Note that $x:=[x_{1}\ldots x_{N}]^{\top}$ can be expressed as $x=Fv$ with $v:=[v_{-n_{f}+2}\ldots v_{0}v_{1}\,\ldots v_{N}]^{\top}$ and $F$ is defined similarly to $L$ in (6). The identification error covariance becomes

[TABLE]

Finally, it can be shown that $\lambda_{y}=\|f\|^{2}+\sigma^{2}$ , where $f=[f_{0}\,\ldots\,f_{n_{f}-1}]^{\top}$ .

II-B Deterministic Input

This part is dedicated to solving Problem II.2. The results are first presented for the output noise case.

II-B1 Additive Output Noise

For additive output noise, Problem II.2 can be rewritten as

[TABLE]

where $\gamma_{1}$ denotes the maximum tolerated output variance. Define the performance degradation ratio

[TABLE]

If the goal of the designer is to keep the performance degradation ratio below $\epsilon$ , the constant $\gamma_{1}$ can be selected to be smaller than $\sigma^{2}\epsilon$ . The following lemma is instrumental to obtain an analytic solution of (14).

Lemma II.7

Let

[TABLE]

and denote by $Q_{l}$ a selection matrix such that $\operatorname{vec}(L)=Q_{l}l$ , where $\operatorname{vec}(L)$ is a vector composed of all the columns of the matrix $L$ . Then, for the additive noise model, $\mbox{\rm tr}(P_{h})=l^{\top}Q_{l}^{\top}(I_{N+n_{l}-1}\otimes E)Q_{l}l+c$ .

Proof:

See Appendix -A, ∎

Defining $M:=Q_{l}^{\top}(I_{N+n_{l}-1}\otimes E)Q_{l}$ and noting that the term $c$ is independent of $l$ (and thus can be discarded from the optimization problem), we transform (14) into

[TABLE]

The following result can be immediately proved.

Theorem II.8

The solution of (16) is $l^{*}=\sqrt{\gamma_{1}-\sigma^{2}}\eta^{*}$ , where $\eta^{*}$ is the normalized eigenvector corresponding to the largest eigenvalue of $M$ .

Proof:

The change of variable $\eta=l/\sqrt{\gamma_{1}-\sigma^{2}}$ transforms the optimization problem in (16) to

[TABLE]

Note that $M\geq 0$ has at least one positive eigenvalue (as otherwise $M=0$ ). Therefore, Courant–Fischer–Weyl min-max principle [10, p. 58] shows $\eta^{*}$ is the normalized eigenvector corresponding to the largest eigenvalue of $M$ .∎

It can be seen that the quality of the model identification drops linearly with increasing $\gamma_{1}$ . At the same time, the performance degradation ratio increases linearly with $\gamma_{1}$ . This capture the trade-off between these two objectives. Note that, for instance, simply increasing the noise variance $\sigma^{2}$ to the upper bound $\gamma_{1}$ would determine a linear increase of the identification error, as $P_{h}$ is proportional to $\sigma^{2}$ . However, this strategy is non-optimal, and Theorem II.8 shows how to obtain the best trade-off between performance degradation and model quality degradation, namely how to get highest linear gain. A comparison between these two strategies is given in Section IV.

If, for a given application, the linear dependency between model quality degradation and system performance degradation is not suitable, one can use the following alternative formulation of the problem:

[TABLE]

where $\gamma_{2}$ determines weight on the performance versus the privacy. This formulation is useful when the constraint on the performance is not hard (i.e., the degradation does not need to be maintained under a given level but large output variations are not pleasant). This problem is rewritten as

[TABLE]

where $c$ is defined in (15).

Theorem II.9

Let $\lambda_{1}\geq\lambda_{2}\geq\ldots\geq\lambda_{n_{l}}\geq 0$ be the eigenvalues of $M$ and $v_{1},v_{2},\ldots,v_{n_{l}}$ denote the corresponding eigenvectors. The solution of (18) is

[TABLE]

Proof:

See Appendix -B. ∎

II-B2 Additive Input Noise

Similarly, Problem II.2 can be expressed as

[TABLE]

Using the same line of reasoning as in Lemma II.7, we introduce the following instrumental result.

Lemma II.10

Let $Q_{f}$ be a selection matrix such that $\operatorname{vec}(F)=Q_{f}f$ . Then, for the additive input noise model,

[TABLE]

*where $E$ and $c$ are defined in (15). *

Proof:

The proof follows the same line of reasoning as in Lemma II.7.∎

Now, note that the coefficients of the filter $L(q^{-1})$ and filter $F(q^{-1})=H(q^{-1})L(q^{-1})$ are related according to

[TABLE]

where $H\in\mathbb{R}^{n_{f}\times n_{l}}$ is a Toeplitz matrix formed by the coefficients of $h$ . Substituting (21) in (20) gives $\mbox{\rm tr}(P_{h})=l^{\top}H^{\top}Q_{f}^{\top}(I_{N+n_{f}-1}\otimes E)Q_{f}Hl+c.$ Therefore, the optimization problem in (19) can be transformed into

[TABLE]

where $M^{\prime}=H^{\top}Q_{f}^{\top}(I_{N+n_{f}-1}\otimes E)Q_{f}H$ . The following result can be immediately proved.

Theorem II.11

Assume $H^{\top}H>0$ . The solution of (22) is $l^{*}=\sqrt{\gamma_{1}-\sigma^{2}}(H^{\top}H)^{1/2}\eta^{*}$ , where $\eta^{*}$ is the normalized eigenvector corresponding to the largest eigenvalue of $(H^{\top}H)^{-1/2}M^{\prime}(H^{\top}H)^{-1/2}$ .

Proof:

Introducing $\eta=(H^{\top}H)^{-1/2}l/\sqrt{\gamma_{1}-\sigma^{2}}$ transforms the optimization problem in (16) to

[TABLE]

The rest of the proof follows the same line of reasoning as in the proof of Theorem II.8. ∎

The condition $H^{\top}H>0$ is satisfied so long as $H$ has full column rank. This is guaranteed if $h_{n_{h}}\neq 0$ , i.e., no fewer than $n_{h}$ parameters are required for describing filter $H(q^{-1})$ .

Remark II.12

The derivations of this section hold for arbitrary noise distributions as only the first and the second moments of the noise were considered. However, the choice of the Gaussian noise is highly preferred as it makes the integration of the closed-loop system with other control loops much easier. This is an important feature as, most often, off-the-shelf systems are interconnected to achieve complex tasks. Other noise distributions do not lend themselves that easily to integration as they might violate assumptions in the design of the control loops (e.g., Laplace noise results in an increased false alarm rate for fault detection schemes).

II-C Extension to regularized least-squares

We now modify the proposed privacy-preserving technique to cope with regularized least-squares estimators. The cost function associated with this type of estimators is

[TABLE]

where $K$ is a positive semidefinite matrix (usually called a kernel) inducing desired properties in the estimates $\hat{h}$ , see [11] for details on regularized methods for system identification. The solution to (23) is

[TABLE]

with obvious defintion of $C$ . This solution is biased. Further, it can be verified (see, e.g., [11]) that the mean square error (MSE) of the estimate is given by

[TABLE]

the first term on the right hand side corresponding to the bias induced by the regularization penalty. Then, the results of Theorems II.8 and II.9 hold by redefining

[TABLE]

and, accordingly, updating the definition of matrix $M$ . Note that the identification performance depends on the parameter $\eta$ , regulating the bias-variance trade off, and on the kernel matrix $K$ . These are user choices, which are not accessible to privacy-preserving device. One possible way to circumvent this issue is to consider the best possible choice of kernel, which is given by $K=hh^{\top}$ [11].

II-D Random Inputs

The problem of designing an additive output noise is only considered in this section. The results can be easily extended to the design of input noises following the same line of reasoning. Problem II.5 can be cast as

[TABLE]

Note that $\mbox{\rm tr}(P_{h})=\mathbb{E}\{c(r,N)\}+l^{\top}\mathbb{E}\{Q_{l}(N)^{\top}(I_{N+n_{f}-1}\otimes E(r,N))Q_{l}(N)\}l$ . Although having the same definition, $Q_{l}(N)$ , $E(r,N)$ , $c(r,N)$ are used instead of $Q_{l}$ , $E$ , and $c$ to emphasize they are functions of random variables $N$ and $r$ . Define $M^{\prime\prime}:=\mathbb{E}\{Q_{l}(N)^{\top}(I_{N+n_{f}-1}\otimes E(r,N))Q_{l}(N)\}$ . The optimization problem in (27) can be rewritten as

[TABLE]

Theorem II.13

The solution of (28) is $l^{*}=\sqrt{\gamma_{1}-\sigma^{2}}\eta^{*}$ , where $\eta^{*}$ is the normalized eigenvector corresponding to the largest eigenvalue of $M^{\prime\prime}$ .

Proof:

The proof follows the same line of reasoning as in Theorem II.8. ∎

Unfortunately, calculating $M^{\prime\prime}$ in an explicit from as a function of the distributions of $N$ and $r$ is generally difficult. The following remark provides a numerical algorithm for constructing an approximation of this matrix.

Remark II.14 (Monte Carlo Simulation)

Samples of possible input length $N^{i}$ , $i\in\{1,\dots,\theta\}$ , are selected randomly. For each $N^{i}$ , $\vartheta$ samples of the inputs of length $N^{i}$ can be selected. Let these samples be denoted by $r^{ij}$ . Define $\hat{M}^{\prime\prime}=(1/(\theta\vartheta))\sum_{i=1}^{\theta}\sum_{j=1}^{\vartheta}Q_{l}(N^{i})^{\top}(I_{N^{i}+n_{f}-1}\otimes E(r^{ij},N^{i}))Q_{l}(N^{i}).$ Evidently, $\mathbb{P}\{\|\hat{M}^{\prime\prime}-M^{\prime\prime}\|\geq\epsilon\}\rightarrow 0$ as both $\theta$ and $\vartheta$ tend to infinity for all $\epsilon>0$ . Therefore, by selecting enough samples, an arbitrarily close approximation of $M^{\prime\prime}$ with a high probability can be constructed.

III Relationship to Differential Privacy

Throughout this section, the design of an additive output noise is only considered. The results for the additive input noise can be constructed similarly. Furthermore, $h$ is assumed to belong to a compact set $\mathcal{H}\subseteq\mathbb{R}^{n_{h}}$ .

Definition III.1

The system is $\epsilon$ -differential private if $\mathbb{P}\{y\in\mathcal{Y}|h\}\leq\exp(\epsilon)\mathbb{P}\{y\in\mathcal{Y}|h^{\prime}\}$ for all Lebesgue-measurable sets $\mathcal{Y}\subseteq\mathbb{R}$ and $h,h^{\prime}\in\mathcal{H}$ that differ in at most only one entry, i.e., $\|h-h^{\prime}\|_{0}\leq 1$ . The system is $(\epsilon,\delta)$ -differential private if $\mathbb{P}\{y\in\mathcal{Y}|h\}\leq\exp(\epsilon)\mathbb{P}\{y\in\mathcal{Y}|h^{\prime}\}+\delta$ .

Note that a random variable $w$ is said to follow the Laplace distribution with mean $\mu$ and (scaling) parameter $b>0$ if $\mathbb{P}\{w\in\mathcal{W}\}=\int_{w\in\mathcal{W}}(2b)^{-1}\exp(-|w-\mu|/b)\mathrm{d}w$ for all Lebesgue-measurable sets $\mathcal{W}\subseteq\mathbb{R}$ .

Theorem III.2

Assume $w_{t}$ is i.i.d. Laplace random variables with $b\geq\sup_{h,h^{\prime}\in\mathcal{H}:\|h-h^{\prime}\|_{0}\leq 1}\|Rh-Rh^{\prime}\|_{1}/\epsilon$ . Then, the system is $\epsilon$ -differential private.

Proof:

See Appendix -C. ∎

Note that $\sup_{h,h^{\prime}\in\mathcal{H}:\|h-h^{\prime}\|_{0}\leq 1}\|Rh-Rh^{\prime}\|_{1}$ exists and is finite because $\mathcal{H}$ is assumed to be a compact set.

Theorem III.3

Assume $w_{t}$ is i.i.d. Laplace random variables with scaling parameter $b$ . Then, $\lambda_{y}=2b^{2}+\sigma^{2}$ .

Proof:

The proof follows from that $\lambda_{y}:={\mathbb{E}}\{y_{t}^{2}|r_{t}=0\}={\mathbb{E}}\{w_{t}^{2}\}+{\mathbb{E}}\{e_{t}^{2}\}=2b^{2}+\sigma^{2}.$ ∎

Combination of Theorems III.2 and III.3 illustrates the trade-off between preserving privacy and closed-loop performance because as $\epsilon$ tends to zero (to achieve a higher level of privacy), the performance degrades (i.e., $\lambda_{y}$ goes to infinity).

Proposition III.4

Let $\mathcal{H}:=\{h\in\mathbb{R}^{n_{h}}\,|\,\underline{h}\leq h_{i}\leq\overline{h},\forall i\}$ . Then, $\sup_{h,h^{\prime}\in\mathcal{H}:\|h-h^{\prime}\|_{0}\leq 1}\|Rh-Rh^{\prime}\|_{1}=(\overline{h}-\underline{h})\sum_{k=1}^{N}|r_{k}|$ .

Proof:

See Appendix -D.∎

Proposition III.4 illustrates that the parameter of the Laplace noise $b$ should be increased upon admitting larger input sequences. This is because, with larger $N$ , there are more data to extract the system parameters and, thus, the employed mechanism needs to be more conservative to avoid leaking the private information. Some relaxations of the differential privacy, e.g., $(\epsilon,\delta)$ -differential privacy, that lend themselves to using a Gaussian noise, e.g., [4]. Let for any $\epsilon$ and $\delta$ define $\kappa(\epsilon,\delta)=(\mathcal{Q}^{-1}(\delta)+\sqrt{\mathcal{Q}^{-1}(\delta)^{2}+2\epsilon})/2$ with $\mathcal{Q}^{-1}$ denoting the inverse of $\mathcal{Q}:x\mapsto\int_{x}^{\infty}1/\sqrt{2\pi}\exp(-u^{2}/2)\mathrm{d}u$ .

Theorem III.5

Assume $w_{t}$ is i.i.d. zero-mean Gaussian noise with $\sigma\geq\kappa(\epsilon,\delta)\sup_{h,h^{\prime}\in\mathcal{H}:\|h-h^{\prime}\|_{0}\leq 1}\|Rh-Rh^{\prime}\|_{2}/\epsilon$ . Then, the system is $(\epsilon,\delta)$ -differential private.

Proof:

The proof is similar to that of Theorem III.2 and can be found in [4]. ∎

IV Numerical Examples

Consider the discrete-time system $y_{t}=G(q^{-1})r_{t}+e_{t},$ where $G(q^{-1})=(q^{-1}-0.2q^{-2})/(1-0.9q^{-1}+0.17q^{-2}).$ Clearly, $G(q^{-1})$ is not a FIR system. This system can be approximated by the FIR filter $H(q^{-1})=q^{-1}+0.7q^{-2}+0.46q^{-3}+0.295q^{-4}+0.1873q^{-5}+0.1184q^{-6}+0.0747q^{-7}+0.0471q^{-8}+0.0297q^{-9}.$ The quality of the approximation is $\|H(q^{-1})-G(q^{-1})\|=0.0507$ . In the following, we consider the deterministic input and the random input cases.

IV-1 Deterministic inputs

We assume that a sequence of $N=200$ input samples is injected by the malicious entity. The sequence is generated by filtering a white noise process through the low-pass filter $W(q^{-1})=1/(1-0.95q^{-1})$ . We set $\sigma^{2}=1$ and $\gamma_{1}=2$ , so that we are allow to double the variance of the output. First, we consider the least-squares estimator (3). We compute the identification error, given by $\mbox{\rm tr}(P_{h})$ , of least-squares equipped with the proposed privacy preserving technique using output additive noise case with $n_{l}=10$ , and the identification error of least-squares without any privacy preserving device. To get a fair comparison, in the latter case the noise variance is equal to the total noise variance of the former case, that is $\mbox{\rm tr}(FF^{\prime})/N+\sigma^{2}$ . The noise filter designed by the privacy preserving device yields $\mbox{\rm tr}(P_{h})=0.25$ , while the variance obtained using standard least-squares is $\mbox{\rm tr}(P_{h})=0.17$ ; we have thus obtained an error increase of approximately $50\%$ .

We now consider regularized least-squares estimators, as described in Subsection II-C. We employ as regularization kernel the stable spline kernel $K_{i,j}=\beta^{\max(i,j)}$ (see [11]), with $\beta=0.7$ . The trade off parameter $\eta$ is set as $\eta=0.1$ . Using the proposed privacy preserving technique the obtained MSE of the estimated system is $0.17$ , while without privacy preservation (and with the same noise variance) we get a MSE equal to $0.13$ . Increasing $\eta$ , the privacy preserving device tends to have a milder effect on the MSE, because the regularized least-squares estimator gives higher weight to the prior knowledge, penalizing the information acquired from data.

IV-2 Random inputs

Assume that the malicious entity injects a sequence of i.i.d. zero-mean unit-variance Gaussian variables of length $N$ chosen with equal probability from $\{10,\dots,20\}$ . The approach of Subsection II-D is considered for constructing an optimal additive output noise with $n_{l}=5$ . In this example, $M^{\prime\prime}$ is approximated using the method of Remark II.14 with $\theta=100$ and $\vartheta=1000$ . Set $\sigma^{2}=0.1$ and $\gamma_{1}=0.2$ . Therefore, the performance degradation ratio is upper-bounded as $\rho\leq 2$ (indeed the upper bound is tight due to the nature of the optimal solution). The optimal additive input noise, in this case, is driven by the FIR filter $L(q^{-1})=0.1450+0.0799q^{-1}+0.2125q^{-2}+0.0799q^{-3}+0.1450q^{-4}$ . Using the Monte Carlo simulation, it can be shown that $\mbox{\rm tr}(\mathbb{E}\{P_{h}\})/\mbox{\rm tr}(\mathbb{E}\{P_{h}|w_{t}=0\})\approx 1.9639.$ Therefore, the system identification error has been approximately doubled at the expense of doubling the output variance. From Theorem II.13, it can be inferred that $\mbox{\rm tr}(\mathbb{E}\{P_{h}\})/\mbox{\rm tr}(\mathbb{E}\{P_{h}|w_{t}=0\})=1+(\eta^{*\top}M^{\prime\prime}\eta^{*})/\mathbb{E}\{c(r,N)\}(\gamma_{1}-\sigma^{2}).$

V Conclusions

Adding input and output noises for increasing the model identification error was considered. Optimal filters for constructing additive coloured noises were designed to maximize the identification error while maintaining the closed-performance degradation below a threshold. Differential privacy was also explored for designing output noises that preserve the privacy of the model.

-A Proof of Lemma II.7

We have $\mbox{\rm tr}(P_{h})=\mbox{\rm tr}((R^{\top}R)^{-1}R^{\top}(LL^{\top}+\sigma^{2}I_{N})R(R^{\top}R)^{-1})=\mbox{\rm tr}(L^{\top}EL)+c$ Now, note that $\mbox{\rm tr}(L^{\top}EL)=\operatorname{vec}(L)^{\top}\operatorname{vec}(EL)=\operatorname{vec}(L)^{\top}(I_{N+n_{l}-1}\otimes E)\operatorname{vec}(L)=l^{\top}Q_{l}^{\top}(I_{N+n_{l}-1}\otimes E)Q_{l}l,$ where the second step follows from [12, Lemma 4.3.1].

-B Proof of Theorem II.9

Taking the derivative of the cost function with respect to $l$ results in $\partial/\partial l\left[(l^{\top}Ml+c)^{-1}+\gamma_{2}\|l\|^{2}\right]=-2Ml/(l^{\top}Ml+c)^{2}+\gamma_{2}l.$ Setting this derivative equal to zero gives $\left(M-\gamma_{2}(l^{\top}Ml+c)^{2}I_{n_{l}}\right)l=0.$ The candidate solutions for this equation are either $l=0$ (referred to as the type-1 solution) or vectors $l$ that are parallel to $v_{i}$ with the condition that $\|l\|^{2}=1/\sqrt{\gamma\lambda_{i}}-c/\lambda_{i}$ for all $i=1,\ldots,n_{l}$ (referred to as the type-2 solutions). An eigenvalue $\lambda_{i}$ may generate a type-2 solution only if $\lambda_{i}\geq\gamma_{2}c^{2}$ (since otherwise $l$ would have a negative norm, which is not possible).

Therefore, if $\lambda_{1}<\gamma_{2}c^{2}$ , the only solution to (18) can be the type-1 solution $l=0$ (as the condition $\lambda_{i}\geq\gamma_{2}c^{2}$ cannot be satisfied for any $i$ if it cannot be satisfied for the largest eigenvalue $\lambda_{1}$ ). This is the case if the penalty on the variance of $y$ is too large and no variations can be tolerated.

If $\lambda_{i}=\gamma_{2}c^{2}$ , the two types of solution coincide.

We now verify whether type-1 and type-2 solutions correspond to global minima of the cost function in (18). Let us define $k:=(l^{\top}Ml+c)$ , and also denote the $i$ -th row of $M$ by $m_{i}^{\top}$ . Computing the Hessian of the cost function in (18) at $l$ yields $J(l)=-\frac{2}{k^{2}}M+\frac{8}{k^{3}}V(l)+2\gamma_{2}I_{n_{l}}\,,$ where $V(l)$ is a matrix such that its entry $(h,k)$ is $V_{hk}(l)=l^{\top}m_{h}m_{k}^{\top}l$ . Then $J(0)=-\frac{2}{c^{2}}M+2\gamma_{2}I_{n_{l}},$ which is positive definite only if $\lambda_{1}<\gamma_{2}c^{2}$ . This observation shows that the type-1 solution $l=0$ is only a minimum when $\lambda_{1}<\gamma_{2}c^{2}$ . Noting that for the case where $\lambda_{1}<\gamma_{2}c^{2}$ , $l=0$ is the only stationary point of the cost function, then it is a global minimum.

We now study type-2 solutions. Let us define $\alpha_{i}^{2}:=1/\sqrt{\gamma_{2}\lambda_{i}}-c/\lambda_{i}$ , so that a candidate type-2 solution can be written $l^{*}=\alpha_{i}v_{i},\,i=1,\,\ldots,\,n_{l}$ . In what follows, we first assume that $\lambda_{1}>\lambda_{2}\geq\lambda_{i}$ . We then relax this assumption at the end of the proof. For any $k=1,\,\ldots,\,n_{l}$ , we have $m_{k}^{\top}l^{*}=m_{k}^{\top}\alpha_{i}v_{i}=\lambda_{i}\alpha_{i}v_{i,k}$ , where $v_{i,k}$ is the $k$ -th entry of $v_{i}$ . Consequently $V_{hk}(l^{*})=l^{*T}m_{h}m_{k}^{\top}l^{*}=\lambda_{i}^{2}\alpha_{i}^{2}v_{i,h}v_{i,k},$ and, in matrix notation, $V(l^{*})=\lambda_{i}^{2}\alpha_{i}^{2}v_{i}v_{i}^{\top}.$ Hence, for any of these solutions, we have $J(l^{*})=-2/(\alpha_{i}^{2}\lambda_{i}+c)^{2}M+(8\alpha_{i}^{2}\lambda_{i}^{2})/(\alpha_{i}^{2}\lambda_{i}+c)^{3}v_{i}v_{i}^{\top}+2\gamma_{2}I_{n_{l}}=-2\gamma_{2}/\lambda_{i}M+8\gamma_{2}v_{i}v_{i}^{\top}-c\sqrt{\gamma_{2}^{3}}/\sqrt{\lambda}_{i}v_{i}v_{i}^{\top}+2\gamma_{2}I_{n_{l}}.$ Since $M$ is positive semidefinite, its eigenvectors form an orthonormal basis [12, p. 229]. Hence, $M$ admits the decomposition $M=\sum_{j=1}^{n_{l}}\lambda_{j}v_{j}v_{j}^{\top}$ . Consequently, we can write $J(l^{*})=\sum_{j=1}^{n_{l}}\eta_{j}v_{j}v_{j}^{\top}+2\gamma_{2}I_{n_{l}},$ where

[TABLE]

Due to the orthonormality of the $v_{j}$ , the eigenvalues of $J(l^{*})$ are then $\eta_{j}+2\gamma_{2},\,j=1,\,\ldots,\,n_{l}$ .

Consider now a candidate type-2 solution corresponding to an eigenvalue $\lambda_{i},\,i\geq 2$ . In this case, one of the eigenvalues of $J(l^{*})$ is $2\gamma_{2}\left(1-\lambda_{1}/\lambda_{i}\right)$ , which is negative under the assumption $\lambda_{1}>\lambda_{2}\geq\lambda_{i}$ . Therefore, all the candidate type-2 solution corresponding to an eigenvalue $\lambda_{i},\,i\geq 2$ , are not minimums so we must discard them. As for $\lambda_{1}$ , the set of eigenvalues $\rho_{j}$ of $J(l^{*})$ are

[TABLE]

which are all positive for $\lambda_{1}>c^{2}\gamma_{2}$ . Therefore, $J(l^{*})$ is positive definite for $l^{*}=\sqrt{1/\sqrt{\gamma_{2}\lambda_{1}}-c/\lambda_{1}}v_{1}$ and, since there are no other minimums, this corresponds to a global minimum.

Now, assume that $\lambda_{1}=\lambda_{2}=\cdots=\lambda_{j}>\lambda_{j-1}$ . Following the same steps as the proof above, we can show that none of the type-2 solutions corresponding to $\lambda_{i}$ with $j-1\leq i\leq n_{l}$ can be a minimizer (because the Hessian is indefinite for them). Similarly, we can also show that all the type-2 solutions corresponding to $\lambda_{i}$ with $1\leq i\leq j$ are at least local minimums (because the Hessian is positive definite). To show that these points are also a global minimizer, we need to prove that they have the same cost. Let $l^{*}_{i_{1}}=\sqrt{1/\sqrt{\gamma_{2}\lambda_{i_{1}}}-c/\lambda_{i_{1}}}v_{i_{1}}$ and $l^{*}_{i_{2}}=\sqrt{1/\sqrt{\gamma_{2}\lambda_{i_{2}}}-c/\lambda_{i_{2}}}v_{i_{2}}$ for any $1\leq i_{1},i_{2}\leq j$ . We have $({l^{*}_{i_{1}}}^{\top}Ml^{*}_{i_{1}}+c)^{-1}+\gamma_{2}\|l^{*}_{i_{1}}\|^{2}=(\lambda_{i_{1}}+c)^{-1}+\gamma_{2}(1/\sqrt{\gamma_{2}\lambda_{i_{1}}}-c/\lambda_{i_{1}})=(\lambda_{i_{2}}+c)^{-1}+\gamma_{2}(1/\sqrt{\gamma_{2}\lambda_{i_{2}}}-c/\lambda_{i_{2}})=({l^{*}_{i_{2}}}^{\top}Ml^{*}_{i_{2}}+c)^{-1}+\gamma_{2}\|l^{*}_{i_{2}}\|^{2},$ where the first equality follows from that $\lambda_{i_{1}}=\lambda_{i_{2}}$ .

-C Proof of Theorem III.2

It can be proved that

[TABLE]

where $\chi(\cdot)$ is a characteristic function, i.e., $\chi(y\in\mathcal{Y})=1$ if $y\in\mathcal{Y}$ and $\chi(y\in\mathcal{Y})=0$ if $y\notin\mathcal{Y}$ , and the inequality follows from $\|u-Rh^{\prime}-e\|_{1}=\|u-Rh^{\prime}-e-Rh+Rh\|_{1}\leq\|u-Rh-e\|_{1}+\|Rh^{\prime}-Rh\|_{1}.$ Integrating (29) over $e$ gives $\mathbb{P}\{y\in\mathcal{Y}|h\}\leq\exp(\|Rh^{\prime}-Rh\|_{1}/b)\mathbb{P}\{y\in\mathcal{Y}|h^{\prime}\}=\exp(\epsilon)\mathbb{P}\{y\in\mathcal{Y}|h^{\prime}\}$ .

-D Proof of Proposition III.4

If $h,h^{\prime}$ only differ in entry $j$ , $\|Rh-Rh^{\prime}\|_{1}=|h_{j}-h^{\prime}_{j}|\sum_{k=1}^{N-j}|r_{k}|,$ $1\leq j\leq n_{h}$ . Thus, $\sup_{\underline{h}\leq h_{j},h^{\prime}_{j}\leq\overline{h}}\|Rh-Rh^{\prime}\|_{1}=(\overline{h}-\underline{h})\sum_{k=1}^{N-j}|r_{k}|.$ The rest of the proof follows from that all the terms in the sum are positive (and setting $j=1$ keeps the most terms).

Bibliography12

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] C. R. Rojas, J. S. Welsh, G. C. Goodwin, and A. Feuer, “Robust optimal experiment design for system identification,” Automatica , vol. 43, no. 6, pp. 993–1008, 2007.
2[2] M. Gevers, “A personal view of the development of system identification: A 30-year journey through an exciting field,” Control Systems, IEEE , vol. 26, no. 6, pp. 93–105, 2006.
3[3] C. Dwork, “Differential privacy,” in Automata, Languages and Programming: 33rd International Colloquium, ICALP 2006, Venice, Italy, July 10-14, 2006, Proceedings, Part II (M. Bugliesi, B. Preneel, V. Sassone, and I. Wegener, eds.), pp. 1–12, Berlin, Heidelberg: Springer, 2006.
4[4] J. Le Ny and G. J. Pappas, “Differentially private filtering,” IEEE Transactions on Automatic Control , vol. 59, no. 2, pp. 341–354, 2014.
5[5] Z. Huang, Y. Wang, S. Mitra, and G. E. Dullerud, “On the cost of differential privacy in distributed control systems,” in Proceedings of the 3rd International Conference on High Confidence Networked Systems , pp. 105–114, 2014.
6[6] F. Farokhi, J. Milosevic, and H. Sandberg, “Optimal state estimation with measurements corrupted by laplace noise,” in Proceedings of the 55th Conference on Decision and Control , pp. 302–307, IEEE, 2016.
7[7] J. Le Ny and G. J. Pappas, “Privacy-preserving release of aggregate dynamic models,” in Proceedings of the 2nd ACM International Conference on High Confidence Networked Systems , pp. 49–56, 2013.
8[8] T. Söderström and P. Stoica, System identification . Prentice-Hall, 1988.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Preserving Privacy of Finite Impulse Response Systems

Abstract

I Introduction

II Optimal Additive Noise

II-A Problem Formulation

Assumption II.1

Problem II.2

Assumption II.3

Remark II.4

Problem II.5

II-A1 Additive Output Noise

Remark II.6

II-A2 Additive Input Noise

II-B Deterministic Input

II-B1 Additive Output Noise

Lemma II.7

Proof:

Theorem II.8

Proof:

Theorem II.9

Proof:

II-B2 Additive Input Noise

Lemma II.10

Proof:

Theorem II.11

Proof:

Remark II.12

II-C Extension to regularized least-squares

II-D Random Inputs

Theorem II.13

Proof:

Remark II.14** (Monte Carlo Simulation)**

III Relationship to Differential Privacy

Definition III.1

Theorem III.2

Proof:

Theorem III.3

Proof:

Proposition III.4

Proof:

Theorem III.5

Proof:

IV Numerical Examples

IV-1 Deterministic inputs

IV-2 Random inputs

V Conclusions

-A Proof of Lemma II.7

-B Proof of Theorem II.9

-C Proof of Theorem III.2

-D Proof of Proposition III.4

Remark II.14 (Monte Carlo Simulation)