Detecting Integrity Attacks on Control Systems using a Moving Target   Approach

Sean Weerakkody; Bruno Sinopoli

arXiv:1706.08182·cs.SY·June 27, 2017

Detecting Integrity Attacks on Control Systems using a Moving Target Approach

Sean Weerakkody, Bruno Sinopoli

PDF

TL;DR

This paper proposes a moving target approach with unknown time-varying dynamics and external states to detect and prevent integrity attacks on control systems, even when adversaries have extensive access.

Contribution

It introduces a novel moving target method using unknown linear time-varying dynamics and external states to enhance attack detection in control systems.

Findings

01

The approach can detect stealthy attacks with bounded performance.

02

External states improve detection of adversaries attempting system identification.

03

The method is robust against adversaries with full access to sensors and actuators.

Abstract

Maintaining the security of control systems in the presence of integrity attacks is a significant challenge. In literature, several possible attacks against control systems have been formulated including replay, false data injection, and zero dynamics attacks. The detection and prevention of these attacks may require the defender to possess a particular subset of trusted communication channels. Alternatively, these attacks can be prevented by keeping the system model secret from the adversary. In this paper, we consider an adversary who has the ability to modify and read all sensor and actuator channels. To thwart this adversary, we introduce external states dependent on the state of the control system, with linear time-varying dynamics unknown to the adversary. We also include sensors to measure these states. The presence of unknown time-varying dynamics is leveraged to detect an…

Equations136

x_{k + 1}

x_{k + 1}

y_{k}

J = T \to \infty lim \frac{1}{T + 1} E [k = 0 \sum T x_{k}^{T} W x_{k} + u_{k}^{T} U u_{k}],

J = T \to \infty lim \frac{1}{T + 1} E [k = 0 \sum T x_{k}^{T} W x_{k} + u_{k}^{T} U u_{k}],

\overset{x}{^}_{k + 1∣ k}^{r} = A \overset{x}{^}_{k ∣ k}^{r} + B u_{k},

\overset{x}{^}_{k + 1∣ k}^{r} = A \overset{x}{^}_{k ∣ k}^{r} + B u_{k},

\overset{x}{^}_{k ∣ k}^{r} = (I - K C) \overset{x}{^}_{k ∣ k - 1}^{r} + K y_{k},

K = P C^{T} (C P C^{T} + R)^{- 1},

P = A P A^{T} + Q - A P C^{T} (C P C^{T} + R)^{- 1} C P A^{T} .

u_{k}^{*} = L \overset{x}{^}_{k ∣ k}^{r}, L = - (B^{T} S B + U)^{- 1} B^{T} S A,

u_{k}^{*} = L \overset{x}{^}_{k ∣ k}^{r}, L = - (B^{T} S B + U)^{- 1} B^{T} S A,

S = A^{T} S A + W - A^{T} S B (B^{T} S B + U)^{- 1} B^{T} S A .

S = A^{T} S A + W - A^{T} S B (B^{T} S B + U)^{- 1} B^{T} S A .

g_{k} (I_{k}) H_{0} ≷ H_{1} η_{k} .

g_{k} (I_{k}) H_{0} ≷ H_{1} η_{k} .

β_{k} = \mbox P r (g_{k} (I_{k}) > η_{k} ∣ H_{1}), α = \mbox P r (g_{k} (I_{k}) > η_{k} ∣ H_{0}) .

β_{k} = \mbox P r (g_{k} (I_{k}) > η_{k} ∣ H_{1}), α = \mbox P r (g_{k} (I_{k}) > η_{k} ∣ H_{0}) .

x_{k + 1}

x_{k + 1}

y_{k}

x_{k + 1}^{a}

x_{k + 1}^{a}

s_{k}^{a}

[\tilde{x}_{k + 1} x_{k + 1}] = A_{k} [\tilde{x}_{k} x_{k}] + B_{k} u_{k} + [\tilde{w}_{k} w_{k}],

[\tilde{x}_{k + 1} x_{k + 1}] = A_{k} [\tilde{x}_{k} x_{k}] + B_{k} u_{k} + [\tilde{w}_{k} w_{k}],

A_{k} ≜ [A_{1, k} 0 A_{2, k} A], B_{k} ≜ [B_{k} B] .

A_{k} ≜ [A_{1, k} 0 A_{2, k} A], B_{k} ≜ [B_{k} B] .

[\tilde{y}_{k} y_{k}] = C_{k} [\tilde{x}_{k} x_{k}] + [\tilde{v}_{k} v_{k}], C_{k} ≜ [C_{k} 0 0 C] .

[\tilde{y}_{k} y_{k}] = C_{k} [\tilde{x}_{k} x_{k}] + [\tilde{v}_{k} v_{k}], C_{k} ≜ [C_{k} 0 0 C] .

A_{1, k}, A_{2, k}, B_{k}, C_{k + 1} \sim f_{A_{1, k}, A_{2, k}, B_{k}, C_{k + 1}} (A_{1}, A_{2}, B, C) .

A_{1, k}, A_{2, k}, B_{k}, C_{k + 1} \sim f_{A_{1, k}, A_{2, k}, B_{k}, C_{k + 1}} (A_{1}, A_{2}, B, C) .

[\tilde{w}_{k} w_{k}] \sim N (0, Q), [\tilde{v}_{k} v_{k}] \sim N (0, R),

[\tilde{w}_{k} w_{k}] \sim N (0, Q), [\tilde{v}_{k} v_{k}] \sim N (0, R),

Q = [\tilde{Q} \tilde{Q}_{12}^{T} \tilde{Q}_{12} Q] ≻ 0, R = [\tilde{R} \tilde{R}_{12}^{T} \tilde{R}_{12} R] ≻ 0.

Q = [\tilde{Q} \tilde{Q}_{12}^{T} \tilde{Q}_{12} Q] ≻ 0, R = [\tilde{R} \tilde{R}_{12}^{T} \tilde{R}_{12} R] ≻ 0.

[\hat{\tilde{x}}_{k + 1∣ k} \overset{x}{^}_{k + 1∣ k}]

[\hat{\tilde{x}}_{k + 1∣ k} \overset{x}{^}_{k + 1∣ k}]

+ B_{k} L \overset{x}{^}_{k ∣ k}^{r},

K_{k}

P_{k + 1}

l (\overset{x}{^}_{0∣0}^{r}, x_{0}, A, B, C, K, L, w_{0} \dots w_{k - 1}, v_{1} \dots v_{k}),

l (\overset{x}{^}_{0∣0}^{r}, x_{0}, A, B, C, K, L, w_{0} \dots w_{k - 1}, v_{1} \dots v_{k}),

z_{k} ≜ [\tilde{y}_{k} y_{k}] - C_{k} [\hat{\tilde{x}}_{k ∣ k - 1} \overset{x}{^}_{k ∣ k - 1}] \sim N (0, C_{k} P_{k} C_{k}^{T} + R) .

z_{k} ≜ [\tilde{y}_{k} y_{k}] - C_{k} [\hat{\tilde{x}}_{k ∣ k - 1} \overset{x}{^}_{k ∣ k - 1}] \sim N (0, C_{k} P_{k} C_{k}^{T} + R) .

g_{k} (z_{k}) = z_{k}^{T} (\overset{ˉ}{P}_{k})^{- 1} z_{k},

g_{k} (z_{k}) = z_{k}^{T} (\overset{ˉ}{P}_{k})^{- 1} z_{k},

[\tilde{x}_{k + 1} x_{k + 1}] = A_{k} [\tilde{x}_{k} x_{k}] + B_{k} (u_{k} + u_{k}^{a}) + [\tilde{w}_{k} w_{k}],

[\tilde{x}_{k + 1} x_{k + 1}] = A_{k} [\tilde{x}_{k} x_{k}] + B_{k} (u_{k} + u_{k}^{a}) + [\tilde{w}_{k} w_{k}],

[\tilde{y}_{k}^{a} y_{k}^{a}] = [\tilde{y}_{k} y_{k}] + [\tilde{s}_{k}^{a} s_{k}^{a}] .

[\tilde{y}_{k}^{a} y_{k}^{a}] = [\tilde{y}_{k} y_{k}] + [\tilde{s}_{k}^{a} s_{k}^{a}] .

I_{k}^{A}

I_{k}^{A}

I_{k}^{D}

I_{k}^{P}

\overset{x}{ˉ}_{k + 1}^{a} = A_{k} \overset{x}{ˉ}_{k}^{a} + B_{k} u_{k}^{a}, Δ \overset{y}{ˉ}_{k}^{a} = C_{k} \overset{x}{ˉ}_{k}^{a},

\overset{x}{ˉ}_{k + 1}^{a} = A_{k} \overset{x}{ˉ}_{k}^{a} + B_{k} u_{k}^{a}, Δ \overset{y}{ˉ}_{k}^{a} = C_{k} \overset{x}{ˉ}_{k}^{a},

\overset{s}{ˉ}_{k}^{a} = - E [Δ \overset{y}{ˉ}_{k}^{a} ∣ I_{k}^{A} \cup I_{k}^{P}] .

\overset{s}{ˉ}_{k}^{a} = - E [Δ \overset{y}{ˉ}_{k}^{a} ∣ I_{k}^{A} \cup I_{k}^{P}] .

[\overset{x}{ˉ}_{k + 1} \overset{x}{ˉ}_{k + 1}^{a}]

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

\IEEEoverridecommandlockouts\overrideIEEEmargins

Detecting Integrity Attacks on Control Systems using a Moving Target Approach

Sean Weerakkody Bruno Sinopoli S. Weerakkody and B. Sinopoli are with the Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA, USA 15213. Email: [email protected], [email protected]. Weerakkody is supported in part by the Department of Defense (DoD) through the National Defense Science & Engineering Graduate Fellowship (NDSEG) Program. The work by S. Weerakkody, and B. Sinopoli is supported by NSF grant CNS-1329936 CPS: Synergy: Collaborative Research: Event-Based Information Acquisition, Learning, and Control in High-Dimensional Cyber-Physical Systems

Abstract

Maintaining the security of control systems in the presence of integrity attacks is a significant challenge. In literature, several possible attacks against control systems have been formulated including replay, false data injection, and zero dynamics attacks. The detection and prevention of these attacks may require the defender to possess a particular subset of trusted communication channels. Alternatively, these attacks can be prevented by keeping the system model secret from the adversary. In this paper, we consider an adversary who has the ability to modify and read all sensor and actuator channels. To thwart this adversary, we introduce external states dependent on the state of the control system, with linear time-varying dynamics unknown to the adversary. We also include sensors to measure these states. The presence of unknown time-varying dynamics is leveraged to detect an adversary who simultaneously aims to identify the system and inject stealthy outputs. Potential attack strategies and bounds on the attacker’s performance are provided.

1 Introduction

Cyber-Physical systems (CPSs), referring to the tight interconnection of sensing, communication, and control in physical spaces, are becoming widespread in today’s society. Indeed, these systems will serve a significant role in several applications including transportation, water distribution, medical technologies, manufacturing, and of course the smart grid. Due to the proliferation of CPSs in critical infrastructures, their safety and security are of paramount importance. There have already been several powerful attacks against CPSs. One major example is Stuxnet, which targeted Supervisory Control and Data Acquisition (SCADA) systems at uranium enrichment facilities in Iran [1, 2]. Here, the adversary was able to appropriate controllers running centrifuges at the plant, and avoid detection by replaying previous measurements to the system operator. An additional example is the Maroochy Shire incident where a disgruntled employee performed an attack on a SCADA based sewage control system [3].

Previous work [4] has suggested that existing tools in cyber security are insufficient to address attacks on CPSs due to the underlying physical system. Two main classes of attacks defined by [4] are denial of service attacks where an attacker restricts the flow of information between the plant and control center, and integrity attacks where an adversary can alter control inputs and sensor outputs. An intelligent adversary can potentially cause physical damage to a system using access to control inputs while manipulating sensor measurements to avoid detection. As such, integrity attacks are the main focus of this paper.

Several integrity attacks have been investigated in the literature. For instance, [5, 6] analyze zero dynamics attacks where an adversary injects inputs into both the actuators and sensors so as to bias the state without inserting a net bias on the sensor measurements. False data injection attacks on measurements, where an adversary alters a subset of sensor measurements to induce destabilizing control inputs from the defender have also been studied. Liu et. al. [7] first studied false data injection attacks in the context of electricity grids. Furthermore, in [8], the authors consider false data injection in control systems, providing sufficient and necessary conditions for an attacker to destabilize a system while introducing a bounded bias on measurement residues. Finally, replay attacks where an adversary repeats a sequence of past measurements are analyzed in [9, 10].

The detection and prevention of integrity attacks on control systems against adversaries who are aware of the system model rely on the presence of one or more secure communication channels between the operator and the plant. For instance, [6] provides sufficient and necessary conditions for zero dynamics attacks based on the actuators and sensors in possession of the adversary. If the adversary has access to all sensors and actuators, a trivial zero dynamics attack is to subtract ones influence from the true measurements. To prevent false data injection attacks in control systems, a particular subset of measurements must be secure from the adversary [8]. Moreover, [11] proposes assigning security indices to each sensor to quantify the effort required for an adversary to introduce a successful false data injection attack. Physical watermarking, used to detect replay attacks in [9, 10] and robust attacks defined in [12], relies on the ability to inject secret noisy inputs into the control system. Also, [13] which considers the problem of robust estimation and control in the presence of integrity attacks, relies on the assumption that the attacker is only able to manipulate less than half the sensors.

In this paper, we consider the scenario where an adversary has access to all communication channels. Thus, to prevent an attack, an adversary must not be aware of the full system model. [14] considers the problem of altering system matrices to avoid zero dynamics attacks. However, in practice an adversary can use his access to both inputs and outputs to identify the system. Moreover, a malicious insider such as the attacker in the Maroochy Shire incident might be aware of the system model. Consequently, we propose introducing extraneous states correlated to the ordinary states of the system so that modification of the original states will impact the extraneous states. The extraneous states will have linear time-varying dynamics, known to the system operator and hidden from the adversary. The dynamics act as a moving target, changing fast enough so the adversary does not have adequate opportunity to identify the extraneous system. In this scenario, we propose attacks for the adversary and obtain detection bounds.

The rest of the paper is organized as follows. In Section II, we introduce our system model and control strategy. In Section III, we propose the moving target approach to detect integrity attacks on control systems. In Section IV, we summarize the attacker’s capabilities and propose two attack models. In Section V, we analyze bounds on the attacker’s performance. Section VI concludes the paper.

2 System Model

In this section, we introduce the model for our system. In particular, we assume our cyber-physical system can be modeled as a discrete time control system where

[TABLE]

Here $x_{k}\in\mathbb{R}^{n}$ is the state vector at time $k$ and $u_{k}\in\mathbb{R}^{p}$ is a collection of control inputs. A suite of sensors are used to monitor the state. Here $y_{k}\in\mathbb{R}^{m}$ is a vector of sensor measurements taken at time $k$ . $w_{k}$ is the independent and identically distributed (IID) process noise with probability distribution given by $\mathcal{N}(0,Q)$ where $Q\succ 0$ . Meanwhile, $v_{k}$ is the IID measurement noise with distribution given by $v_{k}\sim\mathcal{N}(0,R)$ where $R\succ 0$ . We assume that $(A,C)$ is detectable. Additionally, $(A,B)$ and $(A,Q^{\frac{1}{2}})$ are assumed to be stabilizable.

The set of measurements $y_{k}$ are sent to the SCADA center in order to compute the optimal control input. For our purposes, we assume that the operator wishes to minimize a quadratic function of the states and inputs as follows

[TABLE]

where $W\in\mathbb{R}^{n\times n},~{}U\in\mathbb{R}^{p\times p}$ are positive definite matrices defining the relative cost of each state and input. The optimal control input for the given cost function is a combination of a Kalman filter and a linear state feedback controller [15].

The Kalman filter computes the minimum mean squared error state estimate $\hat{x}_{k|k}^{r}$ 111The superscript $r$ is used to distinguish the ordinary state estimate from the state estimate obtained through the moving target model. given the previous set of measurements up to $y_{k}$ denoted by $y_{1:k}$ . We assume that the system has been running for a long time so that the Kalman filter has converged to a fixed gain linear estimator.

[TABLE]

The optimal control input with respect to (3) is given by

[TABLE]

and $S$ satisfies the following Riccati equation

[TABLE]

A bad data detector can be utilized to determine whether a malicious attack is occurring. Typically, the bad data detector can be written as a threshold-based detector where

[TABLE]

Here, $\mathcal{I}_{k}$ is the information available to the defender. The null hypothesis $\mathcal{H}_{0}$ is that the system is operating normally while the alternate hypothesis $\mathcal{H}_{1}$ is that the system is under attack. A more specific detector will be discussed later in the article. We furthermore define the probability of detection $\beta_{k}$ and false alarm $\alpha$ as

[TABLE]

Observe that $\alpha$ is independent of $k$ since the system is stationary under $\mathcal{H}_{0}$ . Regardless of the information available to a system operator, an attacker with knowledge of the input to output model as well as the ability to manipulate sensor measurements and control inputs, can generate undetectable attacks [16].

For instance, an adversary can simply subtract the influence he inserts through the control inputs from the system outputs as follows

[TABLE]

where $s_{k}^{a}$ is given by

[TABLE]

In this case, the attack has zero net effect on the outputs and as a result $\beta_{k}=\alpha$ .

3 The Moving Target

As discussed in the previous section, an adversary who is both aware of the system model and has access to all channels can generate undetectable attacks. In this work, we propose introducing linear time-varying dynamics, unknown to the adversary, but known to the defender, into the system. The defender can leverage his knowledge of the system to detect integrity attacks by the adversary. Moreover, by introducing time-varying dynamics, the defender limits the adversary’s ability to identify the system using his access to measurements and inputs. The time-varying dynamics act as a moving target.

3.1 Extended Model

We extend the state $x_{k}$ to include extraneous states $\tilde{x}_{k}\in\mathbb{R}^{\tilde{n}}$ as follows

[TABLE]

where

[TABLE]

Moreover, we introduce additional sensors $\tilde{y}_{k}\in\mathbb{R}^{\tilde{m}}$ to measure the extraneous states.

[TABLE]

The matrices are assumed to be IID random variables which are independent of the sensor and process noise with distribution

[TABLE]

Furthermore, we also assume that

[TABLE]

where

[TABLE]

Remark 1

While we assume the structure of the system introduced above with IID matrices $A_{1,k},A_{2,k},B_{k},C_{k+1}$ , the moving target design can still be effective in other scenarios. For instance, the dynamics need not be linear as long as the defender can accurately model the system. Moreover, the system parameters do not have to evolve at each time step, though the longer the target remains in place, the easier it is for the adversary to identify the system. In addition, the matrices $A_{1,k}$ , $A_{2,k},$ or $B_{k}$ may be sparse, as long as there exists adequate coupling between $x_{k}$ and $\tilde{x}_{k}$ .

Remark 2

The defender must be able to introduce extraneous states with time-varying dynamics correlated to the original state of the system. The extraneous states are application dependent and are to be decided by the system operator. Nonetheless, the system operator can leverage existing waste products of the system, for instance the heat dissipated by a reaction or process. The dynamics can be made time-varying by changing conditions at the plant. Alternatively, the defender can introduce dynamics into the system. For instance, the defender can introduce RLC circuits which measure the states. Time varying dynamics can be incorporated by including variable resistors or capacitors. By varying the components of the circuit according to some IID distribution at each time step, the defender can generate IID system matrices.

Remark 3

In the above formulation we assume that the defender is aware of the real time system matrices although they are random. In general, this information should not be sent over the network since doing so amounts to the existence of a secure communication channel. The secure communication channel could be leveraged to detect an attack without considering a moving target approach, for instance through physical watermarking [12]. Alternatively, we can generate pseudo random system matrices using a pseudo random number generator (PRNG). In this case, the seed of the PRNG will be known to the defender and kept hidden from the attacker.

3.2 Estimation and Detection

The presence of additional sensors allows us to improve our estimate of the state. In particular, we can incorporate an additional Kalman filter to estimate the state as follows.

[TABLE]

Observe that we use the state estimate $\hat{x}_{k|k}^{r}$ to compute the input $u_{k}^{*}$ as opposed to an estimate derived from (22). We assume the defender does not care about controlling $\tilde{x}_{k}$ . In this case, adding the moving target does not change $J$ . Such a strategy also prevents the attacker from using information from the input to learn about the system model. In fact, we have the following result.

Theorem 1

The input $u_{k}^{*}=L\hat{x}_{k|k}^{r}$ is independent from the system matrices $A_{1,k-1},A_{2,k-1},B_{k-1},C_{k}$ for all $k$ .

Proof 3.2.

The input $u_{k}^{*}$ is given by

[TABLE]

where $l$ is some deterministic function of variables which by assumption are independent from $A_{1,k-1},A_{2,k-1},B_{k-1},C_{k}$ for all $k$ . The result immediately follows.

A similar result can be obtained under attack where $u_{k}^{*}$ is conditionally independent of the system matrices $A_{1,k-1},A_{2,k-1},B_{k-1},C_{k}$ for all $k$ , given the adversary’s attack inputs.

We assume that a residue based detector is incorporated where the residue $z_{k}$ is given by

[TABLE]

We can leverage knowledge of the distribution of $z_{k}$ under normal operation to design a detector. In particular we consider a $\chi^{2}$ detector where $g_{k}$ in (10) is given by

[TABLE]

where $\bar{\mathcal{P}}_{k}=\mathcal{C}_{k}\mathcal{P}_{k}\mathcal{C}_{k}+\mathcal{R}$ . Under normal operation $g_{k}$ has a $\chi^{2}$ distribution. In general, the window for the detector can be extended to consider past measurements. In Figure 1, we include a diagram of the moving target system operating normally.

4 Attack Model

In this section we describe a near omnipotent attacker in terms of his capabilities, access to information, and potential strategies. On one hand, the adversary may acquire his knowledge and resources through a highly sophisticated attack strategy as done in Stuxnet. On the other hand, an adversary can obtain his resources through insider information and access as done in the Maroochy Shire incident.

4.1 Attack Capabilities

The attacker can insert arbitrary inputs into the system and can arbitrarily alter the sensor measurements. As a result, when under attack, the system has dynamics given by

[TABLE]

where $u_{k}^{a}$ is the attacker’s control input and $\tilde{s}_{k}^{a}$ and $s_{k}^{a}$ are the biases injected on the extraneous sensors and ordinary sensors respectively.

The attacker can read the true outputs of the system $\tilde{y}_{k},y_{k}$ and the inputs being sent by the defender to the plant $u_{k}$ for all time $k$ .

Remark 4.3.

The attacker essentially performs a man in the middle attack between the plant and system operator so that he can manipulate and read all communication channels arbitrarily. A malicious insider can do this by breaking encryption schemes. Furthermore, physical attacks can be used to change sensor measurements and control inputs. For instance, locally heating or cooling a temperature sensor would change the sensor measurements without violating the integrity or authenticity of data from a cyber perspective.

The attacker has full knowledge of the system model $\mathcal{S}\triangleq\{A,B,C,K,L,\mathcal{Q},\mathcal{R}\}$ . Moreover, the adversary knows the probability density function (pdf) of random matrices $A_{1,k},A_{2,k},B_{k},C_{k+1}$ .

Remark 4.4.

While conservative, the adversary can obtain his knowledge of the system model by observing the communication channels for an extended period of time and performing system identification. Moreover, observe that since the attacker is aware of the original system model and all outputs, he can asymptotically predict the state estimate $\hat{x}_{k|k}^{r}$ if the matrix $(A+BL)(I-KC)$ is stable [9].

Remark 4.5.

The attacker can leverage his probabilistic knowledge of the system model as well as the true outputs of the system to generate stealthy attack inputs $s_{k}^{a},\tilde{s}_{k}^{a}$ . In particular, the adversary can attempt to simultaneously identify the moving target and generate convincing counterfeit sensor outputs.

Based on the above definitions we can define the private information available to the attacker and defender at time $k$ $\mathcal{I}_{k}^{A},\mathcal{I}_{k}^{D}$ and the public information $\mathcal{I}_{k}^{P}$ available to both as

[TABLE]

In Figure 2, we include a diagram of the system under attack.

4.2 Attack Strategy

In this subsection we propose two main attack strategies. Without loss of generality we assume any attack begins at $k=0$ .

4.2.1 Attack 1: Subtract Influence

In the first attack strategy the attacker aims to estimate his influence on the control system and subtract it. Define $\bar{s}_{k}^{a}\triangleq[\tilde{s}_{k}^{a~{}T}~{}s_{k}^{a~{}T}]^{T}$ . Observe that if

[TABLE]

with initial state $\bar{x}_{0}^{a}=0$ and $\bar{s}_{k}^{a}=-\Delta\bar{y}_{k}^{a}$ , an attack is completely stealthy. As the adversary does not know the time varying matrices, we assume he computes an estimate of $\Delta\bar{y}_{k}^{a}$ and uses that to subtract his influence on the sensor measurements. Thus, we would have

[TABLE]

Remark 4.6.

Observe that the adversary can exactly subtract his influence from measurements $y_{k}$ due to his knowledge of the system model. However, the adversary should be unable to completely subtract his bias from the extraneous sensors $\tilde{y}_{k}$ .

Optimal Theoretical Estimation Define $\bar{y}_{k}^{a}\triangleq[\tilde{y}_{k}^{aT}~{}y_{k}^{aT}]^{T},~{}\bar{x}_{k}\triangleq[\tilde{x}_{k}^{T}~{}x_{k}^{T}]^{T},~{}~{}\bar{w}_{k}\triangleq[\tilde{w}_{k}^{T}~{}w_{k}^{T}]^{T}$ , $\bar{v}_{k}\triangleq[\tilde{v}_{k}^{T}~{}v_{k}^{T}]^{T}$ , and $\bar{y}_{k}\triangleq[\tilde{y}_{k}^{T}~{}y_{k}^{T}]^{T}$ . The adversary’s observations can be formulated through the following linear time-varying system,

[TABLE]

To estimate $\Delta\bar{y}_{k}^{a}$ at time $k$ , assume the adversary has access to the following distribution $f(\bar{x}_{k},\bar{x}_{k}^{a},C_{k}|\mathcal{I}_{k}^{A\cup P})$ where $\mathcal{I}_{k}^{A\cup P}=\mathcal{I}_{k}^{A}\cup\mathcal{I}_{k}^{P}$ Then we have

[TABLE]

We show that the pdf can be recursively computed at each step. Letting $\zeta_{k+1}=\{\bar{x}_{k+1},\bar{x}_{k+1}^{a},C_{k+1}\}$ we have

[TABLE]

The second equality follows from the conditional independence of $\zeta_{k+1}$ and $\bar{y}_{k}^{a},\bar{s}_{k}^{a}$ given $\bar{y}_{k}$ and $u_{k}$ . The last equality follows from Bayes rule and the conditional independence of $\bar{y}_{k+1}$ and $u_{k},u_{k}^{a}$ given $\zeta_{k+1}$ . We note that this distribution can be theoretically computed given the attacker’s information. That is, we know that

[TABLE]

Moreover, $\zeta_{k+1}$ and $\bar{y}_{k+1}$ are deterministic functions of $\zeta_{k}$ , $u_{k}$ , $u_{k}^{a}$ and random variables $A_{1,k}$ , $A_{2,k}$ , $B_{k}$ , $C_{k+1}$ , $\bar{w}_{k}$ , $\bar{v}_{k+1}$ which are independent of $\zeta_{k}$ given $\mathcal{I}_{k}^{A\cup P}$ . Thus, theoretically, $f(\zeta_{k+1}|\mathcal{I}_{k+1}^{A\cup P})$ can be recursively computed from $f(\zeta_{k}|\mathcal{I}_{k}^{A\cup P})$ .

Remark 4.7.

If the attacker subtracts his influence, he might be susceptible to a growing cancellation error if he attempts to excite the system’s unstable dynamics. Instead of subtracting his influence the attacker can instead directly estimate what the defender expects to see as summarized in the next section.

4.2.2 Attack 2: Estimate Expected Measurement

In the next strategy, the adversary aims to track the system operator’s state estimate. Using the system operator’s state estimate, the adversary attempts to generate stealthy outputs. Let $\hat{\bar{x}}_{k}=[\hat{\tilde{x}}_{k|k-1}^{T}\hat{x}_{k|k-1}^{T}]^{T}$ . The attacker’s observations and strategy can be formulated as follows

[TABLE]

The attacker wishes to track $\zeta_{k}=\{\bar{x}_{k},\hat{\bar{x}}_{k},C_{k},\mathcal{P}_{k}\}$ . The use of the preceding attack design is motivated by the ensuing result which states that the chosen attack vector minimizes a fixed quadratic function of the measurement residues.

Theorem 4.8.

Let $\Sigma\succeq 0$ be a positive semidefinite matrix.

[TABLE]

Proof 4.9.

Observe that

[TABLE]

Taking the gradient with respect to $\bar{s}_{k}^{a}$ and setting the resulting expression equal to 0, we obtain

[TABLE]

Solving gives

[TABLE]

and the result holds.

To determine $\bar{s}_{k}^{a}$ at time $k$ assume the adversary has access to the following distribution $f(\zeta_{k}|\mathcal{I}_{k}^{A\cup P})$ . As done before, the attacker can theoretically compute $\bar{s}_{k}^{a}$ by taking a conditional expectation. Additionally, similar to (38) we have

[TABLE]

Moreover, by similar analysis as in attack 1, we can demonstrate that $f(\zeta_{k+1}|\mathcal{I}_{k+1}^{A\cup P})$ can be recursively computed from $f(\zeta_{k}|\mathcal{I}_{k}^{A\cup P})$ . The main difference here is that the adversary must also estimate $\mathcal{P}_{k}$ . Note that in practice the proposed attacks are difficult to execute for an adversary since it is likely a challenge to compute the necessary distribution functions and expected values. As a result, in the next section we aim to provide bounds on the attacker’s estimation performance in terms of mean square error matrices.

5 Bounds on Attacker’s Performance

5.1 Bounds on Attacker’s State Estimation

In this section we attempt to characterize lower bounds on the error matrices associated with the states $\zeta_{k}$ defined in attack strategy 1 and 2. From there, we can attempt to characterize how well the adversary can design $\bar{s}_{k}^{a}$ to fool the bad data detector.

We leverage conditional posterior Cramer-Rao lower bounds for Bayesian sequences derived by [17]. The authors here make use of the Bayesian Cramer-Rao lower bound or Van Trees bound derived in [18] which states that for observations $y$ and states $\zeta$ the mean squared error matrix is bounded by the Fisher information as follows

[TABLE]

where the Fisher information matrix $I$ is given by

[TABLE]

Note that

[TABLE]

In [17], this result is extended to nonlinear Bayesian sequences with dynamics given by

[TABLE]

where $\omega_{k}$ and $\bar{v}_{k}$ are independent process and sensor noise respectively. In our case, we slightly adapt these results to account for the fact there is feedback in our system so that

[TABLE]

The inputs $u_{k}$ , $u_{k}^{a}$ and $\bar{s}_{k}^{a}$ are incorporated into the definition of $F_{k}$ , while uncertainty in the model $(A_{1,k},A_{2,k},B_{k},C_{k+1})$ can be incorporated in the process noise $\omega_{k}$ . It can shown that the following posterior Cramer-Rao lower bound holds

[TABLE]

where

[TABLE]

Remark 5.10.

We remark that since $F_{k}$ is defined by inputs $u_{k}$ , $u_{k}^{a}$ and $\bar{s}_{k}^{a}$ , $f_{k+1}^{c}$ is implicitly conditioned on $u_{0:k},\bar{s}_{1:k}^{a},u_{0:k}^{a}$ . Moreover, $f_{k+1}^{c}$ is defined given the adversary’s knowledge of $\mathcal{S},f(A_{1},A_{2},B,C)$ .

Observe that (51) gives us an expected lower bound for the error matrix associated with the entire state history $\zeta_{0:k+1}$ with knowledge of measurements $\bar{y}_{1:k}$ . This expectation is taken over the state history as well the measurement $\bar{y}_{k+1}$ so that $\hat{\zeta}_{0:k+1}$ is a function of the measurement $\bar{y}_{k+1}$ . Observe that unlike the traditional Cramer-Rao bound which is limited to unbiased estimators, the Bayesian Cramer-Rao bound here considers both biased and unbiased estimators $\hat{\zeta}$ .

While the lower bound given here applies to the entire state history $\zeta_{0:k+1}$ , in practice we care about estimating a lower bound on the current state $\zeta_{k+1}$ . Nonetheless, it can be easily shown that

[TABLE]

where $I^{-1}(\zeta_{k+1}|\bar{y}_{1:k})$ is the $\mbox{dim}(\zeta_{k})\times\mbox{dim}(\zeta_{k})$ lower right submatrix of $I^{-1}(\zeta_{0:k+1}|\bar{y}_{1:k})$ . In practice, computing $I^{-1}(\zeta_{k+1}|\bar{y}_{1:k})$ from $I^{-1}(\zeta_{0:k+1}|\bar{y}_{1:k})$ is impractical since it requires computing and taking the inverse of a Fisher information matrix which grows in dimension at each time step. As a result, we would like a recursion to compute $I^{-1}(\zeta_{k+1}|\bar{y}_{1:k})$ . From [17] we have the following result,

[TABLE]

where

[TABLE]

In addition,

[TABLE]

where

[TABLE]

We observe that it is still difficult to obtain matrices $E_{k}^{11},E_{k}^{12},E_{k}^{21},E_{k}^{22}$ so [17] introduces the following approximate recursion

[TABLE]

where

[TABLE]

We observe that in practice it may still be difficult to compute the exact expectations because high dimensional integration is generally involved. Nonetheless, particle filters as described in [19] can be used to approximate these expectations. Alternative approximations for the conditional posterior Cramer-Rao lower bound can be found in [20]. Unconditional bounds can be found in [21].

5.2 Bounds on Detection

The algorithm described allows us to compute an approximate lower bound on the mean square error matrix of the attacker’s state $\zeta_{k}$ for a given set of inputs $u_{0:k}^{a},\bar{s}_{1:k}^{a}$ and observation history $\bar{y}_{1:k}$ . This allows us to obtain a lower bound on the value of $g_{k}(z_{k})$ as follows.

Theorem 5.11.

Consider the special case that $\{C_{j}\}$ is known to the adversary for all $j\in\mathbb{Z}$ . Suppose an attacker attempts to estimate $\zeta_{k}=\{\bar{x}_{k},\hat{\bar{x}}_{k},\mathcal{P}_{k}\}$ as in attack strategy 2. Let $\hat{\bar{x}}_{k}^{e}(\bar{y}_{k})$ be an estimate of $\hat{\bar{x}}_{k}$ as a function of $\bar{y}_{k}$ given $\bar{y}_{1:k-1}$ and $\hat{e}_{k}=\hat{\bar{x}}_{k}-\hat{\bar{x}}_{k}^{e}(\bar{y}_{k})$ . Suppose a lower bound $Z$ on the error matrix of $\hat{\bar{x}}_{k}$ is obtained so that

[TABLE]

Then we have

[TABLE]

where $f^{*}={f(\hat{\bar{x}}_{k},\bar{y}_{k}|\mathcal{I}_{k-1}^{A\cup P},{u}_{k-1}^{a},\bar{s}_{k-1}^{a},{u}_{k-1})}$ .

Proof 5.12.

First, observe from remark 5.10

[TABLE]

We now have the following.

[TABLE]

The first two equalities follow from properties of the trace and expectation. The third equality follows from monotonicity properties of the trace function and the fact that $\bar{\mathcal{P}}_{k}^{-1}$ is constant with respect to $f^{*}$ . The fourth equality is based on the fact that given $\mathcal{C}_{k}$ , a minimizer lies in the range space of $\mathcal{C}_{k}$ . The fifth equality is due to (61). The final inequality follows from (59).

Remark 5.13.

In general, the adversary’s ability to estimate $\{\zeta_{k}\}$ is dependent on the inputs $\{u_{k}^{a}\},\{\bar{s}_{k}^{a}\}$ . For instance, the more the adversary biases the state away from its expected region of operation, the more challenging it is to perform estimation. Thus, if the system operator wishes to analyze how well an adversary can generate stealthy outputs, he must consider a particular sequence of attack inputs $u_{k}^{a},\bar{s}_{k}^{a}$ .

Remark 5.14.

In practice, it may be difficult to perform performance analysis when assuming $\mathcal{P}_{k}$ is an unknown state. However, one can still approximate a lower bound on the error matrix by assuming that the adversary has an oracle which allows him to know $\mathcal{P}_{k}$ , $\mathcal{K}_{k}$ , $I-\mathcal{K}_{k}\mathcal{C}_{k}$ .

6 Conclusion

In this paper, we have considered attacks on control systems where an adversary has access to all channels in a communication network. In order to counter such an adversary, we propose introducing time-varying dynamics into the system which are unknown to the adversary and can in turn be leveraged to detect attacks. Future work will consider sufficient conditions for the design of these matrices to prevent zero-dynamic attacks and the analysis of optimal identification techniques for the adversary.

Bibliography21

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] T. M. Chen, “Stuxnet, the real start of cyber warfare? [editor’s note],” IEEE Network , vol. 24, no. 6, pp. 2–3, 2010.
2[2] R. Langner, “To kill a centrifuge: A technical analysis of what Stuxnet’s creators tried to achieve,” Langner Communications, Tech. Rep., November 2013. [Online]. Available: www.langner.com/en/wp-content/uploads/2013/11/To-kill-a-centrifuge.pdf
3[3] J. Slay and M. Miller, “Lessons learned from the Maroochy water breach,” in Critical Infrastructure Protection . Springer US, 2008, pp. 73–82.
4[4] A. A. Cárdenas, S. Amin, and S. S. Sastry, “Secure Control: Towards Survivable Cyber-Physical Systems,” in Distributed Computing Systems Workshops, 2008. ICDCS ’08. 28th International Conference on DOI - 10.1109/ICDCS.Workshops.2008.40 . IEEE, 2008, pp. 495–500.
5[5] A. Teixeira, D. Perez, H. Sandberg, and K. H. Johannson, “Attack models and scenarios for networked control systems,” in Proceedings of the 1st international conference on High Confidence Networked Systems , Beijing, China, 2012, pp. 55–64.
6[6] F. Pasqualetti, F. Dorfler, and F. Bullo, “Attack detection and identification in cyber-physical systems,” IEEE Transactions on Automatic Control , vol. 58, no. 11, pp. 2715–2729, 2013.
7[7] Y. Liu, M. Reiter, and P. Ning, “False data injection attacks against state estimation in electric power grids,” in Proceedings of the 16th ACM conference on computer and communications security , Chicago, IL, 2009.
8[8] Y. Mo and B. Sinopoli, “False data injection attacks in cyber physical systems,” in First Workshop on Secure Control Systems , Stockholm, Sweden, April 2010.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Detecting Integrity Attacks on Control Systems using a Moving Target Approach

Abstract

1 Introduction

2 System Model

3 The Moving Target

3.1 Extended Model

Remark 1

Remark 2

Remark 3

3.2 Estimation and Detection

Theorem 1

Proof 3.2**.**

4 Attack Model

4.1 Attack Capabilities

Remark 4.3**.**

Remark 4.4**.**

Remark 4.5**.**

4.2 Attack Strategy

4.2.1 Attack 1: Subtract Influence

Remark 4.6**.**

Remark 4.7**.**

4.2.2 Attack 2: Estimate Expected Measurement

Theorem 4.8**.**

Proof 4.9**.**

5 Bounds on Attacker’s Performance

5.1 Bounds on Attacker’s State Estimation

Remark 5.10**.**

5.2 Bounds on Detection

Theorem 5.11**.**

Proof 5.12**.**

Remark 5.13**.**

Remark 5.14**.**

6 Conclusion

Proof 3.2.

Remark 4.3.

Remark 4.4.

Remark 4.5.

Remark 4.6.

Remark 4.7.

Theorem 4.8.

Proof 4.9.

Remark 5.10.

Theorem 5.11.

Proof 5.12.

Remark 5.13.

Remark 5.14.