Worst-case Guarantees for Remote Estimation of an Uncertain Source

Mukul Gagrani; Yi Ouyang; Mohammad Rasouli; Ashutosh Nayyar

arXiv:1902.03339·cs.SY·February 12, 2019

TL;DR

This paper addresses a worst-case scenario for remote estimation of an uncertain autoregressive source with bounded noise, establishing optimal open-loop communication schedules and estimation strategies under limited communication.

Contribution

It provides a complete characterization of optimal strategies for a decentralized minimax problem in remote estimation with bounded noise.

Findings

1. Optimal open-loop communication scheduling strategy identified.

2. The optimal estimator depends only on the most recent received observation.

3. Complete solution to the decentralized minimax decision problem.

Equations

$$[[X_{1},\dots,X_{n}]]=[[X_{1}]]\times\dots\times[[X_{n}]]$$

$$[[X,Y\,|\,Z]]=[[X]]\times[[Y\,|\,Z]].$$

$$\sup_{X,Y}f(X,Y)=\sup_{x\in[[X]]}\ \sup_{y\in[[Y|x]]}f(x,y).$$

$$\sup_{(X,Y)|z}f(X,Y)=\sup_{x\in[[X|z]]}\ \sup_{y\in[[Y|x,z]]}f(x,y).$$

$$S_{t+1}=f_{t+1}(S_{t},A_{t},N_{t+1}),\qquad O_{t+1}=h_{t+1}(S_{t},A_{t},N_{t+1}),$$

$$A_{t}=\eta_{t}(Q_{t}).$$

$$\inf_{\eta}\Big\{\sup_{N_{1:T}}\max_{t\in\mathcal{T}}\rho_{t}(S_{t},A_{t})\Big\}.$$

$$V_{T}^{*}(\pi_{T},s_{T}^{o}):=\inf_{a_{T}\in\mathcal{A}(s_{T}^{o})}\ \sup_{s_{T}^{h}\in\pi_{T}}\rho_{T}(s_{T},a_{T}).$$

$$V_{t}^{*}(\pi_{t},s_{t}^{o}):=\inf_{a_{t}\in\mathcal{A}(s_{t}^{o})}\Big\{\sup_{s_{t}^{h}\in\pi_{t},\,n_{t+1}\in[[N_{t+1}]]}\max\Big(\rho_{t}(s_{t},a_{t}),V^{*}_{t+1}(\Pi_{t+1},S_{t+1}^{o})\Big)\Big\}.$$

$$\Pi_{t+1}=\Big\{s_{t+1}^{h}:s_{t+1}=\ldots,\ s_{t}=(s_{t}^{h},s_{t}^{o}),\ s_{t}^{h}\in\pi_{t},\ n_{t+1}\in[[N_{t+1}]]\Big\}.$$

$$X_{t+1}=\lambda AX_{t}+N_{t+1},$$

$$E_{t+1}=\max(E_{t}-U_{t},0).$$

$$Y_{t}=h(X_{t},U_{t})=\begin{cases}X_{t}&\text{if }U_{t}=1,\\ \epsilon&\text{if }U_{t}=0,\end{cases}$$

$$U_{t}=f_{t}(X_{1:t},E_{1:t},Y_{1:t-1}),$$

$$\hat{X}_{t}=g_{t}(Y_{1:t}),$$

$$J(\mathbf{f},\mathbf{g})=\sup_{N_{1:T}}\max_{t\in\mathcal{T}}||X_{t}-\hat{X}_{t}||.$$

$$\min_{\mathbf{f},\mathbf{g}}J(\mathbf{f},\mathbf{g})$$

subject to the model equations from $X_{t+1}=\lambda AX_{t}+N_{t+1}$ through $\hat{X}_{t}=g_{t}(Y_{1:t})$ .

$$U_{t}=\Gamma_{t}(X_{t}).$$

$$\Gamma_{t}=d_{t}(Y_{1:t-1}),$$

$$\hat{J}(\mathbf{d},\mathbf{g})=\sup_{N_{1:T}}\max_{t\in\mathcal{T}}||X_{t}-\hat{X}_{t}||.$$

$$\min_{\mathbf{d},\mathbf{g}}\hat{J}(\mathbf{d},\mathbf{g})$$

subject to the source, energy and observation equations together with $U_{t}=\Gamma_{t}(X_{t})$ , $\Gamma_{t}=d_{t}(Y_{1:t-1})$ and $\hat{X}_{t}=g_{t}(Y_{1:t})$ .

$$\tilde{E}_{t}=E_{t}-\Gamma_{t}(X_{t}).$$

$$\Theta_{t}=[[X_{t}\,|\,Q_{t}]],\qquad\Pi_{t}=[[X_{t}\,|\,Q_{t+}]].$$

$$\Theta_{t+1}=\Big\{x_{t+1}:x_{t+1}=\lambda Ax_{t}+n_{t+1}\ \text{for some}\ x_{t}\in\Pi_{t}\ \text{and}\ ||n_{t+1}||\leq a_{t+1}\Big\}:=\phi_{t}(\Pi_{t})$$

$$\Pi_{t}:=\psi(\Theta_{t},\Gamma_{t},Y_{t})$$

Full text

Mukul Gagrani, Yi Ouyang, Mohammad Rasouli and Ashutosh Nayyar

This work was supported by NSF Grants ECCS 1509812, CNS 1446901 and ECCS 1750041.

Abstract

Consider a remote estimation problem where a sensor wants to communicate the state of an uncertain source to a remote estimator over a finite time horizon. The uncertain source is modeled as an autoregressive process with bounded noise. Given that the sensor has a limited communication budget, the sensor must decide when to transmit the state to the estimator who has to produce real-time estimates of the source state. In this paper, we consider the problem of finding a scheduling strategy for the sensor and an estimation strategy for the estimator to jointly minimize the worst-case maximum instantaneous estimation error over the time horizon. This leads to a decentralized minimax decision-making problem. We obtain a complete characterization of optimal strategies for this decentralized minimax problem. In particular, we show that an open loop communication scheduling strategy is optimal and the optimal estimate depends only on the most recently received sensor observation.

I Introduction

Information collection is essential for most engineering systems. In many applications, sensors are deployed to collect and send information to a base station/control center to estimate or control the state of the system. In environmental monitoring, for example, remote sensors are used to measure environmental variables such as temperature, rainfall, soil moisture, etc. The sensors collect information and transmit it to the base station through wireless communication. For a sensor with limited battery, the energy spent in communication is a significant factor determining the battery lifespan. Since battery replacement is expensive for remote sensors, it is important for sensors to adopt a transmission schedule that preserves energy while achieving a desired level of estimation accuracy. Similar scenarios of remote estimation also arise in other applications such as smart grids, networked control systems and healthcare monitoring [1, 2, 3].

The remote estimation problem with one sensor and one estimator has been studied under two different communication models:

i) Remote estimation with pull communication protocol: In this class of problems the estimator decides when to get data from the sensor. Since the estimator is the only decision-maker in the system, this protocol leads to a centralized sequential decision-making problem. Instances of such problems have been studied in [4, 5, 6, 7].

ii) Remote estimation with push communication protocol: Here, the sensor decides when to send data to the estimator. The estimator decides, at each time, what estimate to produce. This leads to a decentralized decision-making problem with the sensor and the estimator as the two decision-makers.

Computing jointly optimal scheduling and estimation strategies in a decentralized setup is difficult in general. However, several works have addressed this problem by placing restrictions on the transmission/estimation strategies and/or by making assumptions about the source statistics. For example, [8, 9] studied the problem of remote estimation under a limited number of transmissions when the state process is i.i.d. and the transmission strategy is restricted to be threshold-based. A continuous-time version of the problem is considered in [10] with a Markov state process, a limited number of transmissions and a fixed estimation strategy. The optimal communication schedule assuming a Kalman-like estimator was derived in [11]. Jointly optimal scheduling and estimation strategies were derived in [12, 13, 14] for Markov sources that satisfied certain symmetry assumptions on their probability distributions.

The uncertainties in all the aforementioned works are modeled as random variables and the objective is to minimize the expected sum cost over a finite time horizon. However, in many applications, there is no statistical model for the system variables of interest. Furthermore, guarantees on estimation accuracy at each time instant may be critical for safety-critical systems such as healthcare monitoring. For example, while monitoring the heartbeat of a patient it is desirable that the estimation error at each time is small.

In this paper, we consider an uncertain source that can be modeled as a discrete-time autoregressive process with bounded noise. The source is observed by a sensor with limited communication budget. The sensor can communicate with a remote estimator that needs to produce real-time estimates of the source state. Given such a model, we are interested in the worst-case guarantee on estimation error at any time that can be achieved under a limited communication budget. Put another way, we want to find the minimum communication budget needed to ensure that the worst-case estimation error at any time is below a given threshold. In order to address these questions, we consider a minimax formulation of the remote estimation problem. Our goal is to design a communication scheduling strategy for the sensor and an estimation strategy for the estimator to jointly minimize the worst-case instantaneous estimation cost over all realizations of the source process.

Centralized decision and control problems where the goal is to minimize a worst-case cost have long been studied in the literature. One prominent line of work has focused on developing dynamic program type approaches for minimax problems [15, 16, 17, 18, 19]. These centralized minimax dynamic programs use analogues of stochastic dynamic programming concepts such as information states and value functions. The centralized minimax dynamic program can be interpreted in terms of a zero-sum game between the controller and an adversary who selects the disturbances to maximize the cost metric [15, 16]. Dynamic games based approaches for minimax design problems were also studied in [20]. Minimax problems where the goal is to minimize the worst-case maximum instantaneous cost were studied in [21, 22, 23].

In the centralized minimax problems described above, the uncertainties are described in terms of the set of values they can take. In contrast, some minimax problems have looked at systems with stochastic uncertainties. In these problems, the parameters of the stochastic uncertainties are ambiguous. These parameters are either fixed a priori but unknown or they are chosen dynamically by an adversary. In either case, the objective of the control problem is to minimize the maximum expected cost corresponding to the worst-case choice of the unknown parameters. Examples of this line of work include [24, 25, 26, 27, 28].

Our minimax problem is most closely related to the minimax control problems studied in [15] and [22, 23]. The minimax problems in [15] and [22, 23] were centralized decision-making problems involving a single decision-maker acting over time. In contrast, our minimax problem involves two decision-makers, the sensor and the estimator, making decisions based on different information. The decentralized nature of our decision problem creates issues such as signaling, where decision-makers may communicate implicitly through their actions. A decision to not communicate by the sensor, for example, can implicitly convey some information about the source to the estimator. Such signaling effects are a key reason why the joint optimization of strategies becomes a difficult problem [12, 13]. A class of decentralized minimax control problems with partial history sharing was investigated in [29].

In order to jointly optimize the strategies for the sensor and the estimator while taking into account the signaling between them, we extend the coordinator-based approach of [30], [13], which was developed for a stochastic model and an expected cost criterion, to our minimax setting. Using this, we explicitly identify optimal communication scheduling and estimation strategies for our minimax problem.

Organization: We start with a general centralized minimax control problem in Section II and then formulate the minimax remote estimation problem in Section III. We formulate an equivalent centralized minimax control problem in Section IV and derive the optimal scheduling and estimation strategies in Section V. We conclude in Section VI.

Notation and Uncertain Variables: $X_{a:b}$ denotes the collection of variables $(X_{a},X_{a+1},\ldots,X_{b})$ . $\mathbb{I}_{A}$ denotes the indicator function of an event $A$ .

We now review the concept of uncertain variables as defined in [31]. An uncertain variable is a mapping from some underlying sample space $\Omega$ to a space of interest. We use capital letters to denote uncertain variables while small letters denote their realizations and script letters denote the spaces of all possible realizations. For example, an uncertain variable $X$ has a realization $X(\omega)=x\in\mathcal{X}$ for an outcome $\omega\in\Omega$ .

Instead of probability measures as in the case of random variables, uncertain variables can be analyzed using their ranges. The range of $X$ is defined as $\left[\left[X\right]\right]=\{X(\omega):\omega\in\Omega\}$ . Similarly, for a collection of uncertain variables $X_{1},\ldots,X_{n}$ , $\left[\left[X_{1},\ldots,X_{n}\right]\right]=\{(X_{1}(\omega),\ldots,X_{n}(\omega)):\omega\in\Omega\}$ . The conditional range of $X$ given $Y=y$ is denoted by $\left[\left[X\,\middle|\,y\right]\right]$ (or $\left[\left[X\,\middle|\,Y=y\right]\right]$ ) and is defined as $\{X(\omega):Y(\omega)=y,\omega\in\Omega\}$ . We also define the uncertain conditional range $\left[\left[X\,\middle|\,Y\right]\right]$ as an uncertain variable that takes the value $\left[\left[X\,\middle|\,y\right]\right]$ when $Y$ takes the value $y$ .

Using the ranges of uncertain variables, an analogue of statistical independence can be defined as follows [Definition 2.1 [31]].

Definition 1.

Uncertain variables $X_{1},X_{2},\dots,X_{n}$ are unrelated if

$$[[X_{1},\dots,X_{n}]]=[[X_{1}]]\times\dots\times[[X_{n}]]$$

where $\times$ is the Cartesian product.
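The range-based definitions above can be made concrete on a small finite sample space. The sketch below is illustrative only: the sample space `OMEGA`, the variables `X`, `Y`, `Z` and the helper names are hypothetical choices, not objects from the paper. It computes joint and conditional ranges and checks the product condition of Definition 1.

```python
from itertools import product

# Hypothetical finite sample space; each outcome omega is a pair (u, v).
OMEGA = list(product([0, 1], [0, 1, 2]))


def X(omega):
    return omega[0]


def Y(omega):
    return omega[1]


def Z(omega):
    return omega[0] + omega[1]  # depends on both coordinates, hence on X


def joint_range(*variables):
    """[[X1, ..., Xn]]: the set of jointly attainable realizations."""
    return {tuple(v(w) for v in variables) for w in OMEGA}


def cond_range(x_var, y_var, y):
    """[[X | Y = y]]: values taken by X on outcomes where Y equals y."""
    return {x_var(w) for w in OMEGA if y_var(w) == y}


def unrelated(*variables):
    """Definition 1: the joint range equals the product of the marginal ranges."""
    marginals = [{v(w) for w in OMEGA} for v in variables]
    return joint_range(*variables) == set(product(*marginals))


print(unrelated(X, Y))      # X and Y read different coordinates of omega
print(unrelated(X, Z))      # (X, Z) = (0, 3) is not attainable
print(cond_range(X, Z, 3))  # only omega = (1, 2) gives Z = 3
```

In this toy example `X` and `Y` are unrelated, while `X` and `Z` are not, because observing `Z = 3` pins `X` down to a single value.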

The following property comes from the definition of unrelated uncertain variables [Lemma 2.1 [31]].

Property 1.

If $X$ is unrelated to $(Y,Z)$ , that is, $\left[\left[X,Y,Z\right]\right]=\left[\left[X\right]\right]\times\left[\left[Y,Z\right]\right]$ , then

$$[[X,Y\,|\,Z]]=[[X]]\times[[Y\,|\,Z]].$$

For a function $f(x)$ , we define $\sup_{X}f(X):=\sup_{x\in\left[\left[X\right]\right]}f(x)$ to denote its supremum over the range of $X$ . Similarly $\sup_{X_{1:n}}f(X_{1:n}):=\sup_{x_{1:n}\in\left[\left[X_{1:n}\right]\right]}f(x_{1:n})$ . Also, $\sup_{X|y}f(X):=\sup_{x\in\left[\left[X\,\middle|\,y\right]\right]}f(x)$ denotes the supremum of $f(x)$ over the conditional range. For a bivariate function $f(x,y)$ , we have the following property.

Property 2.

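Let $X,Y$ be uncertain variables. Then

$$\sup_{X,Y}f(X,Y)=\sup_{x\in[[X]]}\ \sup_{y\in[[Y|x]]}f(x,y).$$

Furthermore, if $Z$ is another uncertain variable, then

$$\sup_{(X,Y)|z}f(X,Y)=\sup_{x\in[[X|z]]}\ \sup_{y\in[[Y|x,z]]}f(x,y).$$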

Note that the above property is the analogue of the tower property of conditional expectation with supremum playing the role of expectation.
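The nested-supremum identity of Property 2 can be checked numerically on a finite example. The following is a minimal sketch; the sample space, the related pair `X`, `Y` and the test function `f` are hypothetical and chosen only so that the conditional range of `Y` genuinely depends on `x`.

```python
from itertools import product

# Hypothetical sample space: omega = (u, v) with u, v in {0, 1, 2}.
OMEGA = list(product(range(3), range(3)))


def X(omega):
    return omega[0]


def Y(omega):
    return omega[0] * omega[1]  # Y is related to X: its range depends on x


def f(x, y):
    return (x - 1) ** 2 + y


def joint_sup():
    """sup over the joint range [[X, Y]]."""
    return max(f(X(w), Y(w)) for w in OMEGA)


def nested_sup():
    """sup over x in [[X]] of sup over y in [[Y | x]]."""
    best = float("-inf")
    for x in {X(w) for w in OMEGA}:
        y_given_x = {Y(w) for w in OMEGA if X(w) == x}
        best = max(best, max(f(x, y) for y in y_given_x))
    return best


print(joint_sup(), nested_sup())
```

Both computations visit exactly the attainable pairs $(x,y)$ , which is why they agree; taking the inner supremum over the unconditional range $[[Y]]$ instead would in general overestimate.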

II Minimax Control with Maximum Instantaneous Cost Objective

Consider a discrete time system with state $S_{t}\in\mathcal{S}$ and observation $O_{t}\in\mathcal{O}$ evolving according to the following dynamics:

$$S_{t+1}=f_{t+1}(S_{t},A_{t},N_{t+1}),\qquad O_{t+1}=h_{t+1}(S_{t},A_{t},N_{t+1}),$$

where $A_{t}$ is the control action, $N_{t}$ is the noise, $t\in\mathcal{T}=\{1,2,\dots,T\}$ , $S_{1}=N_{1}$ and $O_{1}=h_{1}(S_{1})$ . The noise process $N=\{N_{t},t=1,\ldots,T\}$ is a sequence of unrelated uncertain variables. We assume that the state has two components, $S_{t}=(S_{t}^{h},S_{t}^{o})$ , where $S_{t}^{h}$ is the hidden part and $S_{t}^{o}\in\mathcal{S}^{o}$ is the observable part.

At each time $t$ , the controller’s available information is $Q_{t}=(O_{1:t},S^{o}_{1:t},A_{1:t-1})$ . Note that $Q_{t}$ includes the history of observations $O_{1:t}$ , the history of observable part of the states $S^{o}_{1:t}$ and the past control actions $A_{1:t-1}$ . $\mathcal{Q}_{t}$ denotes the set of all possible values of $Q_{t}$ . The set of available control actions at $t$ , which may depend on the directly observable state $S^{o}_{t}$ , is $\mathcal{A}(S^{o}_{t})$ . Based on the available information at $t$ , the controller takes a control action according to a function $\eta_{t}:\mathcal{Q}_{t}\mapsto\mathcal{A}(S^{o}_{t})$ as

$$A_{t}=\eta_{t}(Q_{t}).$$

We call $\eta=(\eta_{1},\eta_{2},\dots,\eta_{T})$ a strategy of the controller. The instantaneous cost at time $t$ is $\rho_{t}(S_{t},A_{t})$ . The minimax control objective is to find a strategy $\eta$ that minimizes the worst-case maximum instantaneous cost. Thus, the strategy optimization problem is

$$\inf_{\eta}\Big\{\sup_{N_{1:T}}\max_{t\in\mathcal{T}}\rho_{t}(S_{t},A_{t})\Big\}.$$

Let $\Pi_{t}=[[S_{t}^{h}|Q_{t}]]$ be the conditional range of the hidden part of the state $S_{t}^{h}$ given the available information $Q_{t}$ . Let $\mathcal{B}$ denote the space of all possible $\Pi_{t}$ . Note that $S^{o}_{t}$ belongs to $Q_{t}$ , so conditional range of $S^{o}_{t}$ given $Q_{t}$ is the singleton set i.e. $[[S_{t}^{o}|Q_{t}]]=\{S^{o}_{t}\}$ .

The conditional range $\Pi_{t}$ along with $S_{t}^{o}$ can be used as an information state for decision-making in the minimax control problem. In particular, we can obtain the following dynamic programming result using arguments from [23].

Theorem 1.

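For each $t\in\mathcal{T}$ , define functions $V^{*}_{t}:\mathcal{B}\times\mathcal{S}^{o}\mapsto\mathbb{R}$ as follows:

i) For $\pi_{T}\in\mathcal{B},s_{T}^{o}\in\mathcal{S}^{o}$ ,

$$V_{T}^{*}(\pi_{T},s_{T}^{o}):=\inf_{a_{T}\in\mathcal{A}(s_{T}^{o})}\ \sup_{s_{T}^{h}\in\pi_{T}}\rho_{T}(s_{T},a_{T}).\tag{9}$$

ii) For $t<T,\pi_{t}\in\mathcal{B},s_{t}^{o}\in\mathcal{S}^{o}$ ,

$$V_{t}^{*}(\pi_{t},s_{t}^{o}):=\inf_{a_{t}\in\mathcal{A}(s_{t}^{o})}\Big\{\sup_{s_{t}^{h}\in\pi_{t},\,n_{t+1}\in[[N_{t+1}]]}\max\Big(\rho_{t}(s_{t},a_{t}),V^{*}_{t+1}(\Pi_{t+1},S_{t+1}^{o})\Big)\Big\}.\tag{10}$$

where $\Pi_{t+1}$ is given as follows,

$$\Pi_{t+1}=\Big\{s_{t+1}^{h}:s_{t+1}=\ldots,\ s_{t}=(s_{t}^{h},s_{t}^{o}),\ s_{t}^{h}\in\pi_{t},\ n_{t+1}\in[[N_{t+1}]]\Big\}.$$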

If the infimum in (9), (10) is achieved, then for each $\pi_{t}\in\mathcal{B}$ and $s_{t}^{o}\in\mathcal{S}^{o}$ the minimizing $a_{t}$ in (9)-(10) gives the optimal action at time $t$ for $t\in\mathcal{T}$ . Moreover, the optimal cost is given by $\sup_{Q_{1}}V^{*}_{1}(\Pi_{1},S_{1}^{o})$ .

Proof.

See Appendix A ∎

III Problem Formulation

Consider a communication problem between a sensor (transmitter) and an estimator (receiver) over a finite time horizon $\mathcal{T}=\{1,2,\dots,T\}$ , $T\geq 1$ . The sensor perfectly observes a discrete-time uncertain process $X_{t}\in\mathbb{R}^{n}$ which evolves according to the following dynamics

$$X_{t+1}=\lambda AX_{t}+N_{t+1},$$

where $\lambda$ is a scalar and $A$ is an orthogonal matrix. $N_{t}$ is an uncertain variable which lies in the ball of radius $a_{t}$ around the origin i.e. $||N_{t}||\leq a_{t}$ . We assume that the initial state $X_{1}=N_{1}$ . The numbers $a_{1},\ldots,a_{T}$ are finite. Since all the noise in the system is bounded, the state $X_{t}$ also remains bounded for all $t$ . Let $\mathcal{X}\subset\mathbb{R}^{n}$ denote a bounded set such that $X_{t}\in\mathcal{X}$ for $t\in\mathcal{T}$ .

The sensor can send the observed state to the estimator through a perfect channel. However, each transmission consumes one unit of the sensor’s energy, and the sensor has a limited energy budget of $K$ units with $1\leq K<T$ ; here $K$ is a fixed, known integer and not an uncertain variable. Let $E_{t}$ denote the energy available at time $t$ . We use $U_{t}$ to denote the transmission decision at time $t$ . $U_{t}$ is $1$ if the current state observation is transmitted and $0$ otherwise. Note that $U_{t}\in\mathcal{U}(E_{t})$ where $\mathcal{U}(E_{t})=\{0,1\}$ if $E_{t}>0$ and $\mathcal{U}(0)=\{0\}$ , i.e. there can be no transmission at time $t$ if $E_{t}=0$ . The energy at time $t+1$ can be written as:

$$E_{t+1}=\max(E_{t}-U_{t},0).$$

The estimator receives $Y_{t}$ at time $t$ which is given as,

$$Y_{t}=h(X_{t},U_{t})=\begin{cases}X_{t}&\text{if }U_{t}=1,\\ \epsilon&\text{if }U_{t}=0,\end{cases}$$

where $\epsilon$ denotes no transmission. The sensor makes the transmission decision at $t$ based on available information $X_{1:t},E_{1:t},Y_{1:t-1}$ ,

$$U_{t}=f_{t}(X_{1:t},E_{1:t},Y_{1:t-1}),$$

where $f_{t}$ is the transmission strategy of the sensor at time $t$ . We call the collection $\mathbf{f}=(f_{1},f_{2},\ldots,f_{T})$ the transmission strategy.

The estimator produces an estimate of the state $\hat{X}_{t}$ based on its received information $Y_{1:t}$ at time $t$ as follows:

$$\hat{X}_{t}=g_{t}(Y_{1:t}),$$

where $g_{t}$ denotes the estimation strategy at time $t$ . The collection $\mathbf{g}=(g_{1},g_{2},\ldots,g_{T})$ is referred to as the estimation strategy. The cost incurred under a transmission strategy $\mathbf{f}$ and estimation strategy $\mathbf{g}$ is the worst case maximum instantaneous distortion cost over the entire horizon, given by,

$$J(\mathbf{f},\mathbf{g})=\sup_{N_{1:T}}\max_{t\in\mathcal{T}}||X_{t}-\hat{X}_{t}||.\tag{16}$$

We can now formulate the following problem.

Problem 1.

Determine a transmission strategy $\mathbf{f}$ for the sensor and an estimation strategy $\mathbf{g}$ for the estimator which jointly minimize the cost $J(\mathbf{f},\mathbf{g})$ in (16).

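$$\min_{\mathbf{f},\mathbf{g}}J(\mathbf{f},\mathbf{g})$$

subject to the source dynamics, energy update, observation, transmission and estimation equations above.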
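To make the model of this section concrete, the sketch below simulates one noise realization for a scalar source ( $n=1$ , so the orthogonal matrix $A$ is $\pm 1$ ). Several things here are assumptions for illustration and not taken from the paper: the function name `simulate`, the use of a fixed open-loop schedule, and the estimator, which propagates the most recently received state through $\lambda A$ and outputs $0$ before anything is received. The function returns the largest error along that single trajectory; the cost $J$ is the supremum of this quantity over all admissible noise sequences.

```python
def simulate(lam, A, noise, schedule, budget):
    """Run the scalar model once and return the largest estimation error.

    noise[t-1] is the realization n_t (with x_1 = n_1), schedule[t-1] is the
    intended transmission decision u_t, and budget is the initial energy K.
    """
    c = lam * A          # scalar stand-in for lambda * A, with A in {+1, -1}
    energy = budget
    x = noise[0]         # X_1 = N_1
    estimate = 0.0       # assumed estimate before anything is received
    worst = 0.0
    for t, u in enumerate(schedule):
        if t > 0:
            x = c * x + noise[t]     # X_{t+1} = lambda A X_t + N_{t+1}
            estimate = c * estimate  # propagate the last estimate (assumption)
        u = u if energy > 0 else 0   # U_t must lie in U(E_t)
        energy = max(energy - u, 0)  # E_{t+1} = max(E_t - U_t, 0)
        y = x if u == 1 else None    # None plays the role of epsilon
        if y is not None:
            estimate = y             # a received state is known exactly
        worst = max(worst, abs(x - estimate))
    return worst


# Noise pushed to its bound a_t = 1 at every step, one transmission at t = 2.
print(simulate(1.0, 1, [1, 1, 1, 1], [0, 1, 0, 0], budget=1))
```

With $\lambda=1$ and unit noise bounds, the error grows by at most one per step and resets to zero at a transmission, so a single transmission in the middle of a horizon of four keeps this trajectory's error at or below $2$ .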

Remark 1.

Communication scheduling and remote estimation problems similar to Problem 1 have been studied in [9, 12, 13]. The key differences between the problems in [9, 12, 13] and Problem 1 are: (i) source model- [9, 12, 13] deal with a stochastic source model whereas the source model in Problem 1 is non-stochastic; (ii) objective- [9, 12, 13] deal with minimizing an expected cumulative cost over a time horizon whereas the objective in Problem 1 is to minimize the worst-case instantaneous cost. The objective in Problem 1 may be more suitable for safety critical systems.

Next, we provide a structural result which establishes that the sensor can ignore past values of the source and energy levels without losing performance.

Lemma 1.

The transmission strategy can be restricted to the form $U_{t}=f_{t}(X_{t},Y_{1:t-1})$ without any loss in performance.

Proof.

See Appendix B. ∎

Problem 1 is a minimax sequential decision-making problem with two decision-makers (the sensor and the estimator). We will adapt the common information approach of [13], developed for the stochastic remote estimation problem, to our minimax problem. This involves formulating a single-agent sequential decision-making problem from the perspective of an agent who knows the common information. In our setup, we can adopt the estimator’s perspective to formulate the single-agent problem, as done in the following section.

IV An Equivalent Problem

We now formulate a new sequential decision problem that will help us to solve Problem 1. In the new problem, we consider the model of Section III with the following modification. At the beginning of $t^{th}$ time step, the estimator selects a mapping $\Gamma_{t}:\mathcal{X}\mapsto\{0,1\}$ . $\Gamma_{t}$ will be referred to as the estimator’s prescription to the sensor. The sensor uses the prescription to evaluate $U_{t}$ as follows:

$$U_{t}=\Gamma_{t}(X_{t}).$$

The estimator selects the prescription based on its available information, that is,

$$\Gamma_{t}=d_{t}(Y_{1:t-1}),$$

where the function $d_{t}$ is referred to as the prescription strategy at time $t$ . At the end of $t^{th}$ time step, the estimator produces an estimate $\hat{X}_{t}$ as follows

$$\hat{X}_{t}=g_{t}(Y_{1:t}),$$

where $g_{t}$ is the estimation strategy at time $t$ . The cost incurred by the prescription strategy $\mathbf{d}=(d_{1},\ldots,d_{T})$ and the estimation strategy $\mathbf{g}=(g_{1},\ldots,g_{T})$ is,

$$\hat{J}(\mathbf{d},\mathbf{g})=\sup_{N_{1:T}}\max_{t\in\mathcal{T}}||X_{t}-\hat{X}_{t}||.$$

We consider the following problem,

Problem 2.

Determine a prescription strategy $\mathbf{d}$ and an estimation strategy $\mathbf{g}$ to minimize the cost $\hat{J}(\mathbf{d},\mathbf{g})$ .

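$$\min_{\mathbf{d},\mathbf{g}}\hat{J}(\mathbf{d},\mathbf{g})$$

subject to the source dynamics, energy update and observation equations of Section III, together with $U_{t}=\Gamma_{t}(X_{t})$ , $\Gamma_{t}=d_{t}(Y_{1:t-1})$ and $\hat{X}_{t}=g_{t}(Y_{1:t})$ .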

In Problem 2, the estimator is the sole decision-maker since the sensor merely evaluates the prescription at the current source state. Problem 2 can be shown to be equivalent to Problem 1 in a similar manner as in [13] for the stochastic remote estimation problem. The main idea is that for every choice of sensor strategy $\mathbf{f}$ there exists an equivalent prescription strategy $\mathbf{d}$ and vice-versa. Since this equivalence is true for every realization of the uncertain variables $N_{1:T}$ , the stochastic case argument also holds in this minimax scenario.
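The correspondence between sensor strategies and prescription strategies is essentially partial application. A minimal sketch, assuming the sensor rule has the reduced form $f_{t}(X_{t},Y_{1:t-1})$ of Lemma 1; the function names and the threshold rule used as an example are hypothetical.

```python
def to_prescription_strategy(f_t):
    """Turn a sensor rule f_t(x_t, y_hist) into d_t(y_hist) -> Gamma_t."""
    def d_t(y_hist):
        def gamma_t(x_t):
            # The prescription fixes the common information y_hist and
            # leaves only the sensor's private observation x_t free.
            return f_t(x_t, y_hist)
        return gamma_t
    return d_t


def to_sensor_strategy(d_t):
    """Recover a sensor rule from a prescription strategy."""
    def f_t(x_t, y_hist):
        return d_t(y_hist)(x_t)
    return f_t


# Hypothetical threshold rule: transmit when the state is far from the origin.
def threshold_rule(x_t, y_hist):
    return 1 if abs(x_t) > 1.0 else 0


d = to_prescription_strategy(threshold_rule)
gamma = d(())                      # prescription for an empty history
print(gamma(2.0), gamma(0.5))      # decisions for two source states
```

Because both directions produce the same $U_{t}$ for every source realization, the worst-case cost is unchanged, which is the sense in which the two problems are equivalent.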

Problem 2 can be seen as an instance of the minimax problem formulated in Section II as follows:

1. We can imagine the system operating with $2T$ decision points by splitting each time instant into two decision points: (i) At each time $t$ , before the transmission at that time, the estimator decides the prescription $\Gamma_{t}$ ; (ii) After receiving $Y_{t}$ , the estimator decides $\hat{X}_{t}$ . We denote this decision point by $t+$ (see Figure 2).

2. State: At $t$ , the state is $S_{t}=(S_{t}^{h},S_{t}^{o})=(X_{t},E_{t})$ since $E_{t}$ is observable by the estimator. At $t+$ , $S_{t+}=(S_{t+}^{h},S_{t+}^{o})=(X_{t},\tilde{E}_{t})$ where $\tilde{E}_{t}$ is the post-transmission energy given as

$$\tilde{E}_{t}=E_{t}-\Gamma_{t}(X_{t}).\tag{20}$$

3. Actions: At $t$ , action $A_{t}=\Gamma_{t}\in\mathcal{A}(E_{t})$ , where $\mathcal{A}(E_{t})$ is the collection of functions from $\mathcal{X}$ to $\mathcal{U}(E_{t})$ . Recall that $\mathcal{U}(0)=\{0\}$ and $\mathcal{U}(E_{t})=\{0,1\}$ for $E_{t}>0$ . At $t+$ , action $A_{t+}=\hat{X}_{t}\in\mathbb{R}^{n}$ .

4. Information: The information available at time $t$ to choose a prescription is $Q_{t}=\{Y_{1:t-1},\Gamma_{1:t-1},\hat{X}_{1:t-1}\}$ and at time $t+$ to generate $\hat{X}_{t}$ is $Q_{t+}=\{Y_{1:t},\Gamma_{1:t},\hat{X}_{1:t-1}\}$ .

5. Cost: The instantaneous cost at time $t$ is $\rho_{t}(S_{t},A_{t})=0$ and at time $t+$ it is $\rho_{t+}(S_{t+},A_{t+})=||X_{t}-\hat{X}_{t}||$ .

Since Problem 2 is an instance of the minimax problem of Section II, we can use Theorem 1 to conclude that the optimal strategy is a function of the conditional range of the state $(X_{t},E_{t})$ given the estimator’s information. Since $E_{t}$ is known to the estimator, we just need to define the conditional range of $X_{t}$ . For that purpose, we define $\Theta_{t}$ as the pre-transmission conditional range of $X_{t}$ and $\Pi_{t}$ as the post-transmission conditional range of $X_{t}$ at time $t$ as follows:

$$\Theta_{t}=[[X_{t}\,|\,Q_{t}]],\qquad\Pi_{t}=[[X_{t}\,|\,Q_{t+}]].$$

The following lemma describes the evolution of the sets $\Theta_{t}$ and $\Pi_{t}$ .

Lemma 2.

1. The pre-transmission conditional range $\Theta_{t+1}$ at time $t+1$ is a function of $\Pi_{t}$ , i.e. $\Theta_{t+1}=\phi_{t}(\Pi_{t})$ .

2. The post-transmission conditional range $\Pi_{t}$ is a function of $\Theta_{t},\Gamma_{t}$ and $Y_{t}$ , i.e. $\Pi_{t}=\psi(\Theta_{t},\Gamma_{t},Y_{t})$ .

Proof.

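1. Given the post-transmission conditional range $\Pi_{t}$ , $\Theta_{t+1}$ is given as

$$\Theta_{t+1}=\Big\{x_{t+1}:x_{t+1}=\lambda Ax_{t}+n_{t+1}\ \text{for some}\ x_{t}\in\Pi_{t}\ \text{and}\ ||n_{t+1}||\leq a_{t+1}\Big\}:=\phi_{t}(\Pi_{t}).$$

2. Given the pre-transmission conditional range $\Theta_{t}$ , $\Pi_{t}$ can be evaluated after receiving $Y_{t}$ as follows

$$\Pi_{t}=\begin{cases}\{x\in\Theta_{t}:\Gamma_{t}(x)=0\}&\text{if }Y_{t}=\epsilon,\\ \{Y_{t}\}&\text{if }Y_{t}\neq\epsilon,\end{cases}\ :=\psi(\Theta_{t},\Gamma_{t},Y_{t}).$$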

∎

Let $\mathcal{B}$ denote the space of all possible realizations of $\Pi_{t},\Theta_{t}$ and $\mathcal{E}=\{0,1,\ldots,K\}$ . Then, Theorem 1 can be used to write a dynamic program which characterizes the optimal estimates $\hat{X}_{t}$ and the optimal prescriptions $\Gamma_{t}$ in Problem 2 as follows,

Lemma 3.

For $t\in\mathcal{T}$ , define the functions $V_{t}:\mathcal{B}\times\mathcal{E}\mapsto\mathbb{R}$ and $W_{t}:\mathcal{B}\times\mathcal{E}\mapsto\mathbb{R}$ as follows:

(i) For $\pi_{T}\in\mathcal{B}$ and $\tilde{e}_{T}\in\mathcal{E}$ define (here $\tilde{e}_{t}$ denotes a realization of the post-transmission energy as defined in (20)),

[TABLE]

(ii) For $t\in\mathcal{T}$ , $\theta_{t}\in\mathcal{B}$ and $e_{t}\in\mathcal{E}$ define,

[TABLE]

where $y_{t}=h(x_{t},\gamma_{t}(x_{t}))$ .

(iii) For $t<T$ , $\pi_{t}\in\mathcal{B}$ and $\tilde{e}_{t}\in\mathcal{E}$ define,

[TABLE]

Suppose the infima in (23), (24) and (25) are always achieved. Then, for each $\theta_{t}\in\mathcal{B}$ and $e_{t}\in\mathcal{E}$ , the minimizing $\gamma_{t}$ in (24) gives the optimal prescription at time $t$ . Also, for each $\pi_{t}$ (or $\pi_{T}$ ) $\in\mathcal{B}$ , the minimizing $\hat{x}_{t}$ (or $\hat{x}_{T}$ ) gives the optimal estimate. Furthermore, $W_{1}([[X_{1}]],K)$ is the optimal cost for Problem 2.

Proof.

The result follows by writing the dynamic program using Theorem 1, Lemma 2 and associating the function $V_{t}$ with the value function at time $t+$ and $W_{t}$ with the value function at time $t$ . ∎
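For orientation, specializing the recursion of Theorem 1 to the two decision points $t$ and $t+$ suggests the following shape for the three definitions in Lemma 3. This is a sketch derived from Theorem 1, the cost and state description of this section, and the maps $\phi_{t}$ and $\psi$ of Lemma 2; it is an assumption about the form of (23)-(25), not a quotation of them.

```latex
\begin{align*}
V_T(\pi_T,\tilde e_T) &:= \inf_{\hat x_T\in\mathbb{R}^n}\ \sup_{x_T\in\pi_T}\|x_T-\hat x_T\|,\\
W_t(\theta_t,e_t) &:= \inf_{\gamma_t\in\mathcal{A}(e_t)}\ \sup_{x_t\in\theta_t}
   V_t\big(\psi(\theta_t,\gamma_t,y_t),\,e_t-\gamma_t(x_t)\big),
   \qquad y_t=h(x_t,\gamma_t(x_t)),\\
V_t(\pi_t,\tilde e_t) &:= \inf_{\hat x_t\in\mathbb{R}^n}\
   \max\Big(\sup_{x_t\in\pi_t}\|x_t-\hat x_t\|,\ W_{t+1}\big(\phi_t(\pi_t),\tilde e_t\big)\Big),
   \qquad t<T.
\end{align*}
```

Two modeling facts drive this shape: the prescription stage carries zero instantaneous cost, so $W_{t}$ only propagates $V_{t}$ , and the energy available at time $t+1$ equals the post-transmission energy $\tilde{e}_{t}$ .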

Note that the above dynamic program is computationally hard to solve because: i) it involves minimization over functions in (24); ii) the information state is the conditional range of the source state and thus can be an arbitrary subset of $\mathcal{X}$ . In the next section, we will analyze the dynamic program to obtain certain properties of the value functions which will help us in identifying the structure of the optimal strategies.

V Globally optimal strategies

We now proceed with solving the dynamic program of Lemma 3. We proceed in four steps.

Step 1: Nature of optimal prescriptions

We define a relation $\mathbf{Q}$ between sets which will be helpful in identifying the structure of the globally optimal prescriptions. To that end, we define the radius of a set $S\subset\mathbb{R}^{n}$ as $r^{*}(S):=\inf_{x\in\mathbb{R}^{n}}\sup_{y\in S}||y-x||$ . The following lemma gives the relation between the radius of a set $E$ and the radius of its transformation $\phi_{t}(E)$ , where $\phi_{t}$ is the map defined in part 1 of the proof of Lemma 2.

Lemma 4.

Let $E\subset\mathbb{R}^{n}$ . Then,

$$r^{*}(\phi_{t}(E))=|\lambda|\,r^{*}(E)+a_{t+1}.$$

Proof.

See Appendix C. ∎
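The radius relation used later in the proof of Lemma 6, $r^{*}(\phi_{t}(\pi_{t}))=|\lambda|r^{*}(\pi_{t})+a_{t+1}$ , is easy to sanity-check in one dimension, where sets can be taken to be intervals and the radius is half the length. The sketch below assumes a scalar source with $A\in\{+1,-1\}$ ; the helper names `radius` and `phi` are hypothetical.

```python
def radius(interval):
    """r*(S) for an interval S = [lo, hi]: half its length."""
    lo, hi = interval
    return (hi - lo) / 2.0


def phi(interval, lam, A, a_next):
    """Image of an interval under x -> lam*A*x + n with |n| <= a_next."""
    lo, hi = interval
    c = lam * A
    ends = sorted((c * lo, c * hi))   # a negative c flips the endpoints
    return (ends[0] - a_next, ends[1] + a_next)


E = (-1.0, 3.0)
lam, A, a_next = -0.5, 1, 0.25
print(radius(phi(E, lam, A, a_next)))       # radius of the propagated set
print(abs(lam) * radius(E) + a_next)        # |lambda| r*(E) + a_{t+1}
```

The check covers only intervals on the real line; the general statement for arbitrary subsets of $\mathbb{R}^{n}$ is what the appendix proof establishes.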

We now define a relation $\mathbf{Q}$ between sets and a property $\mathbf{Q}$ for functions.

Definition 2.

1. Let $G,H\subset\mathbb{R}^{n}$ be two sets. We say $G\mathbf{Q}H$ if $r^{*}(G)=r^{*}(H)$ .

2. We say that a function $f:\mathcal{B}\times\mathcal{E}\mapsto\mathbb{R}$ satisfies property $\mathbf{Q}$ if

$$f(G,e)=f(H,e)\quad\text{for all }e\in\mathcal{E}\text{ and all }G,H\in\mathcal{B}\text{ with }G\mathbf{Q}H.$$

Let $\gamma^{all}$ denote the ’always transmit’ prescription, i.e. $\gamma^{all}(x)=1,\forall x\in\mathcal{X}$ . Let $\gamma^{none}$ denote the ’never transmit’ prescription, i.e. $\gamma^{none}(x)=0,\forall x\in\mathcal{X}$ .

Lemma 5.

1. For each $t\in\mathcal{T}$ , the functions $V_{t}$ and $W_{t}$ of Lemma 3 satisfy property $\mathbf{Q}$ .

2. For each $t\in\mathcal{T}$ , either $\gamma^{all}$ or $\gamma^{none}$ is an optimal choice of prescription $\gamma_{t}$ in (24).

Proof.

See Appendix C. ∎

Consider two singleton sets $\{x^{1}_{t}\}$ and $\{x^{2}_{t}\}$ . The first part of Lemma 5 implies that $V_{t}(\{x^{1}_{t}\},e_{t})=V_{t}(\{x^{2}_{t}\},e_{t})$ because $\{x_{t}^{1}\}\mathbf{Q}\{x_{t}^{2}\}$ (both sets have radius zero). Thus, $V_{t}(\{x_{t}\},e_{t})$ does not depend on the value of $x_{t}$ and can be represented as a function of energy alone, that is, $V_{t}(\{x_{t}\},e_{t})=K_{t}(e_{t})$ . The second part of Lemma 5 implies that we can replace the infimum in (24) by a minimization over just two prescriptions, $\gamma^{all}$ and $\gamma^{none}$ . Using the above observations, we can reduce the dynamic program of Lemma 3 to the following:

[TABLE]

where (27) and (28) follow from the definition of $r^{*}(\pi_{t})$ and the dynamic program in Lemma 3; for $e_{t}>0$ ,

[TABLE]

where $K_{t}(e_{t}-1)=V_{t}(\{x_{t}\},e_{t}-1)$ for any $x_{t}$ . For $e_{t}=0$ ,

[TABLE]

Step 2: Simplified information state

We will now use property $\mathbf{Q}$ to simplify the information state of the dynamic program. Lemma 5 suggests that value functions $V_{t},W_{t}$ depend only on the radius of the conditional range. Thus, we would expect that the radius of the conditional range can act as an information state of the dynamic program. This idea is formalized in the following lemma.

Lemma 6.

Define $\tilde{V}_{t}:\mathbb{R}^{+}\times\mathcal{E}\rightarrow\mathbb{R}$ and $\tilde{W}_{t}:\mathbb{R}^{+}\times\mathcal{E}\rightarrow\mathbb{R}$ as follows:

(i) For $t=T$ , $r\in\mathbb{R}^{+}$ and $\tilde{e}\in\mathcal{E}$ ,

[TABLE]

(ii) For $t\in\mathcal{T}$ , $r\in\mathbb{R}^{+}$ and $\tilde{e}\in\mathcal{E}$ ,

[TABLE]

(iii) For $t<T$ ,

[TABLE]

Then, for $t\in\mathcal{T}$ ,

[TABLE]

Proof.

$V_{T}(\pi_{T},\tilde{e}_{T})=\tilde{V}_{T}(r^{*}(\pi_{T}),\tilde{e}_{T})$ follows from (31), (27). We then proceed by induction. We first show that (35) is true if (34) is true for $t$. (35) follows easily from (29), (30) and the induction hypothesis by noting that $K_{t}(e_{t}-1)=V_{t}(\{x\},e_{t}-1)=\tilde{V}_{t}(0,e_{t}-1)$. Next, we show that (34) is true for $t$ if (35) is true for $t+1$. Using (28) and the induction hypothesis together with the fact that $r^{*}(\phi_{t}(\pi_{t}))=|\lambda|r^{*}(\pi_{t})+a_{t+1}$ (Lemma 4), (34) can be easily established. ∎

We can further eliminate $\tilde{V}_{t}$ from (31)-(33) to obtain a recursive relation among $\tilde{W}_{t}$ given as:

[TABLE]

For $t<T$ ,

[TABLE]

The above equations can be seen as a reduced version of the dynamic program of Lemma 3 with the radius of the conditional range and the energy level as the information state. Unlike the dynamic program of Lemma 3, however, the above dynamic program is completely deterministic, that is, it does not involve maximization over any uncertain variables. In the next step, we will connect this deterministic dynamic program to a deterministic optimal control problem and use it to identify an optimal transmission strategy.
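A minimal sketch of this deterministic recursion is given below. It is written from the relations used in this section and in Appendix C, namely $\tilde{V}_{T}(r,e)=r$, $\tilde{V}_{t}(r,e)=\max\{r,\tilde{W}_{t+1}(|\lambda|r+a_{t+1},e)\}$, $\tilde{W}_{t}(r,e)=\min\{\tilde{V}_{t}(r,e),\tilde{V}_{t}(0,e-1)\}$ for $e>0$, and $\tilde{W}_{t}(r,0)=\tilde{V}_{t}(r,0)$. The function and argument names are illustrative assumptions, not notation from the paper.

```python
from functools import lru_cache


def reduced_dp_cost(lam, a, K):
    """Evaluate W~_1(a_1, K) for noise bounds a = [a_1, ..., a_T]."""
    T = len(a)
    abs_lam = abs(lam)

    @lru_cache(maxsize=None)
    def V(t, r, e):
        # Post-transmission value: current radius vs. worst future cost.
        if t == T:
            return r
        return max(r, W(t + 1, abs_lam * r + a[t], e))  # a[t] is a_{t+1}

    @lru_cache(maxsize=None)
    def W(t, r, e):
        # Pre-transmission value: stay silent, or transmit (radius resets to 0).
        if e == 0:
            return V(t, r, 0)
        return min(V(t, r, e), V(t, 0.0, e - 1))

    return W(1, float(a[0]), K)
```

The state passed between stages is only the pair (radius, remaining transmissions), which is what makes the recursion free of any maximization over noise realizations.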

Step 3: A deterministic control problem

Consider a deterministic control system with state $(X_{t}^{d},E_{t}^{d})\in\mathbb{R}^{+}\times\mathcal{E}$ and control action $U_{t}^{d}\in\mathcal{U}(E_{t}^{d})$ , where $\mathcal{U}(E_{t}^{d})=\{0,1\}$ if $E_{t}^{d}>0$ and $\mathcal{U}(0)=\{0\}$ , operating for a time horizon $T$ . The dynamics of the state are as follows:

[TABLE]

with $X_{1}^{d}=a_{1}$ and $E_{1}^{d}=K$ . The instantaneous cost is given by

[TABLE]

The deterministic control problem can be stated as follows.

Problem 3.

Determine a control sequence $U^{d}_{1:T}$ to minimize the cost

[TABLE]
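For small horizons, Problem 3 can be checked by direct enumeration. The sketch below assumes the dynamics $X^{d}_{t+1}=|\lambda|(1-U^{d}_{t})X^{d}_{t}+a_{t+1}$ and $E^{d}_{t+1}=E^{d}_{t}-U^{d}_{t}$, the instantaneous cost $(1-U^{d}_{t})X^{d}_{t}$, and a total cost equal to the maximum instantaneous cost over the horizon. These forms are inferred from the reduced dynamic program of Step 2 rather than quoted, and the helper names are hypothetical.

```python
from itertools import combinations


def schedule_cost(lam, a, transmit_times):
    """Maximum instantaneous cost of an open-loop schedule (times are 1-indexed)."""
    x = a[0]                      # X_1^d = a_1
    worst = 0.0
    for t in range(1, len(a) + 1):
        u = 1 if t in transmit_times else 0
        worst = max(worst, (1 - u) * x)
        if t < len(a):
            x = abs(lam) * (1 - u) * x + a[t]   # a[t] is a_{t+1}
    return worst


def best_schedule(lam, a, K):
    """Enumerate all schedules with at most K transmissions; return (cost, times)."""
    T = len(a)
    best = (float("inf"), ())
    for k in range(min(K, T) + 1):
        for times in combinations(range(1, T + 1), k):
            cost = schedule_cost(lam, a, set(times))
            if cost < best[0]:
                best = (cost, times)
    return best
```

Enumeration grows combinatorially in $T$, so this is only meant as a cross-check on small instances; the dynamic program is the practical route.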

We are interested in the above deterministic control problem because of the following lemma.

Lemma 7.

The optimal costs for the original problem (i.e., Problem 1), the coordinator’s problem (i.e., Problem 2) and the deterministic control problem (i.e., Problem 3) are equal. That is,

[TABLE]

Proof.

We have already discussed that Problems 1 and 2 are equivalent, so we will focus on the second equality in (38). Since the deterministic control problem is a special case of the minimax problem of Section II, we can use Theorem 1 to write the following dynamic program for it:

[TABLE]

for $t<T$ ; with $\tilde{W}^{d}_{1}(a_{1},K)$ being the optimal cost for Problem 3.

Comparing the above dynamic program with (36)-(37), it is easy to see that $\tilde{W}^{d}_{t}(x,e)=\tilde{W}_{t}(x,e),\forall x,e,t$ . From Theorem 1, the optimal cost of Problem 3 is

[TABLE]

which is the same as the optimal cost of Problem 2.

∎

Step 4: Optimal transmission and estimation strategies for Problem 1

We can now identify optimal transmission and estimation strategies for Problem 1. We start with the estimation strategy. We define $\tilde{X}_{0}=0$ and for $t\in\mathcal{T}$ ,

[TABLE]

Lemma 8.

In Problem 1 and Problem 2, the globally optimal estimation strategy is $g_{t}^{*}(Y_{1:t})=\tilde{X}_{t}$ , for $t\in\mathcal{T}$ .

Proof.

See Appendix D. ∎
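The estimator of Lemma 8 can be sketched for the scalar case ($A=1$) as follows. The sketch assumes the source model $X_{t}=\lambda X_{t-1}+N_{t}$ with $X_{0}=0$ and the update described in Appendix D: $\tilde{X}_{t}=X_{t}$ when a transmission is received and $\tilde{X}_{t}=\lambda\tilde{X}_{t-1}$ otherwise. The function name and the representation of the schedule as a set of times are illustrative choices.

```python
def run_estimator(lam, noise, transmit_times):
    """Simulate X_t = lam * X_{t-1} + N_t (X_0 = 0) and the estimator of Lemma 8.

    Returns the list of absolute estimation errors |X_t - X~_t| for t = 1..T.
    """
    x, x_tilde = 0.0, 0.0
    errors = []
    for t, n in enumerate(noise, start=1):
        x = lam * x + n
        if t in transmit_times:
            x_tilde = x              # received observation
        else:
            x_tilde = lam * x_tilde  # propagate the last estimate
        errors.append(abs(x - x_tilde))
    return errors
```

With every noise sample at its bound and of the same sign, the errors produced here coincide with the radius of the conditional range between transmissions.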

Let $U_{t}^{d*}$ be an optimal open loop control sequence for Problem 3. Since Problem 3 is an optimal control problem with deterministic dynamics, such an open loop sequence exists and can be computed via the dynamic program. We can now identify the optimal strategies for Problem 1.

Theorem 2.

Let $\mathbf{g}^{*}$ be the estimation strategy as defined in Lemma 8 and $\mathbf{f}^{*}$ be defined as follows:

[TABLE]

where $U_{t}^{d*}$ is an optimal open loop control sequence for Problem 3. Then, $(\mathbf{f}^{*},\mathbf{g^{*}})$ are globally optimal strategies for Problem 1.

Proof.

See Appendix D. ∎

Theorem 2 establishes that the globally optimal transmission strategy to minimize the worst-case instantaneous cost is an open-loop strategy that transmits at pre-determined time instants. Thus, even though the sensor has access to the state and transmission history, this information is not used by the optimal transmission strategy.

Remark 2.

We can compare the nature of optimal strategies in Theorem 2 with the optimal strategies in the stochastic remote estimation problem in [13, 12]. The optimal estimation strategy obtained in our minimax setup is identical to the one obtained in the stochastic case considered in [13, 12]. However, the optimal transmission strategy in [13, 12] is a threshold-based strategy in contrast to the deterministic strategy obtained in our setup.

V-A Homogenous noise

Consider the case when all the uncertain noise variables take values in balls of the same size, i.e., $a_{t}=a$ for all $t\in\mathcal{T}$. It turns out that transmitting at uniformly spaced intervals is optimal in this case, as made precise in the following lemma.

Lemma 9.

Define $\Delta:=\left\lceil\frac{T+1}{K+1}\right\rceil$ . Then,

1. The optimal cost for Problem 1 under the homogenous noise model is

[TABLE]

2. An optimal control sequence for Problem 1 under the homogenous noise model is given as follows:

[TABLE]

Proof.

See Appendix E. ∎
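The uniformly spaced schedule can be sketched as follows. The transmission times $t=m\Delta$, $m\in\{1,\ldots,K\}$, and the cost $a\,\frac{|\lambda|^{\Delta-1}-1}{|\lambda|-1}$ are taken from the proof in Appendix E; the cost is written here as a finite geometric sum so that $|\lambda|=1$ needs no special case. Function names are illustrative, and times beyond the horizon are simply dropped.

```python
def uniform_schedule(T, K):
    """Return (Delta, transmission times t = m * Delta, m = 1..K, with t <= T)."""
    delta = -(-(T + 1) // (K + 1))          # ceil((T + 1) / (K + 1))
    return delta, [m * delta for m in range(1, K + 1) if m * delta <= T]


def homogeneous_cost(T, K, lam, a):
    """Worst-case error a * sum_{k=0}^{Delta-2} |lam|^k under the uniform schedule."""
    delta, _ = uniform_schedule(T, K)
    return a * sum(abs(lam) ** k for k in range(delta - 1))
```

For $T=5$ and $K=3$ this gives $\Delta=2$ and transmissions at $t=2,4$ only.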

Remark 3.

In the case of homogenous noise, it is possible that the sensor does not utilize all the $K$ available transmission opportunities under the transmission strategy $f^{*}$ . For example, when $T=5,K=3$ , the sensor will transmit only twice at $t=2,4$ . Thus, the worst-case error achieved in this case would be the same even if $K=2$ . Therefore, one could also ask the following question: What is the minimum number of transmission opportunities ( $K^{*}$ ) required so that the worst-case error is at most $\epsilon$ ? $K^{*}$ can be computed as follows:

[TABLE]
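One way to obtain $K^{*}$ numerically is a direct search over $K$ using the cost of the uniform schedule from Appendix E. This is a sketch of that search under the homogenous noise model, not the expression stated in the remark; the function name and the choice to search $K\in\{0,\ldots,T\}$ are assumptions.

```python
def min_transmissions(T, lam, a, eps):
    """Smallest K in {0, ..., T} whose uniform-schedule worst-case error is <= eps."""
    for K in range(T + 1):
        delta = -(-(T + 1) // (K + 1))          # ceil((T + 1) / (K + 1))
        cost = a * sum(abs(lam) ** k for k in range(delta - 1))
        if cost <= eps:
            return K
    return None  # only reached if eps is negative
```

The search stops at the first feasible $K$ because the cost is non-increasing in $K$ (a larger budget never increases $\Delta$).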

Remark 4.

Consider the problem where the estimator requests transmissions instead of the sensor deciding when to transmit. The cost of this problem is lower bounded by the cost of Problem 1 because the sensor has more information to make the transmission decision than the estimator. Moreover, since the optimal scheduling strategy obtained for Problem 1 is an open loop strategy, it can also be implemented in this new problem. Therefore, the results obtained for Problem 1 also hold for this problem.

Remark 5.

Consider the problem where the sensor can observe the source state only $M$ times, with $M\geq K$, instead of observing the state at each time. In addition to the scheduling strategy, here the sensor must also decide when to observe the source. The cost of this problem is lower bounded by the cost of Problem 1 because the sensor has less information in this case compared to Problem 1. Also, since the optimal scheduling strategy for Problem 1 is an open loop strategy, the sensor in this problem can take observations at the fixed times when it transmits, thereby achieving the same cost as in Problem 1. Therefore, the results obtained for Problem 1 also hold for this problem.

Remark 6.

For each $t\in\mathcal{T}$ , let $\mathcal{B}_{t}$ be any set such that $\mathcal{B}_{t}$ is symmetric (i.e. if $n\in\mathcal{B}_{t}$ then $-n\in\mathcal{B}_{t}$ ) and $\sup_{n_{t}\in\mathcal{B}_{t}}||n_{t}||=a_{t}$ . It can be shown that the optimal transmission and estimation strategy remains the same if the noise $N_{t}$ lies in the set $\mathcal{B}_{t}$ .

VI Conclusion

We considered the problem of remote estimation of a non-stochastic source over a finite time horizon where the sensor has a limited communication budget. Our objective was to find jointly optimal scheduling and estimation strategies which minimize the worst-case maximum instantaneous estimation error over the time horizon. This problem is a decentralized minimax decision-making problem. Our approach started with the dynamic program (DP) for a general centralized minimax control problem. We framed our decentralized minimax problem from the estimator’s perspective and used the common information approach to write down a dynamic program. This dynamic program, however, involved minimization over functions. By identifying a key property of the value functions, we were able to characterize the globally optimal strategies. In particular, we showed that an open loop transmission strategy and a simple Kalman-like estimator are jointly optimal. We also described related problems where the same optimal strategy holds.

Appendix A Proof of Theorem 1

To prove Theorem 1, we first derive some useful properties. Recall that $N=(N_{1},N_{2},\ldots,N_{T})$ is the collection of all the noise variables in the system. Note that given the strategy $\eta$, the state $S_{r}$ and the information $Q_{r}$ can be written as functions of $N$ for $r\in\mathcal{T}$. Thus, for any function $f$ and $r\geq t$ we can write $\sup_{(S_{r},Q_{r})|q_{t}}f(S_{r},Q_{r})=\sup_{N|q_{t}}f(S_{r},Q_{r})$.

For any strategy $\eta$ , we define its “cost-to-go” function at time $t$ as

[TABLE]

which is a function of the realization $q_{t}$ of available information at time $t$ . Then it is clear that the worst case cost of strategy $\eta$ is

[TABLE]

We also define the value function of the problem at $t$ to be

[TABLE]

We have the following result.

Lemma 10.

For any strategy $\eta$ , at each time $t$ and for every realization $q_{t}$ , we have

[TABLE]

Proof.

The proof is done by induction. At $T$ we have

[TABLE]

Suppose the lemma is true at $t+1$ . Then at $t$ we have

[TABLE]

From Property 2 we get

[TABLE]

Now from (48)-(49) and the induction hypothesis we get

[TABLE]

∎

It is straightforward to see that a strategy $\eta^{*}$ achieving infimum at each stage in the definition of $V^{*}_{t}(q_{t})$ will be optimal and its cost will be $\sup_{{Q_{1}}}V^{*}_{1}(Q_{1})$ .

Let $\Theta_{t}=[[S_{t}|Q_{t}]]$ be the conditional range of the state at time $t$ . Recall that $\Pi_{t}=[[S^{h}_{t}|Q_{t}]]$ . Note that $\Pi_{t}$ and $\Theta_{t}$ are related as follows

[TABLE]

The evolution of $\Theta_{t}$ has the following feature.

Lemma 11.

There exists a function $\phi_{t}(\theta_{t},a_{t},o_{t+1},s^{o}_{t+1})$ such that

[TABLE]

Proof.

We can write $(O_{t+1},S^{o}_{t+1})=\tilde{h}_{t+1}(S_{t},A_{t},N_{t+1})$ for some function $\tilde{h}_{t+1}$ . Under any strategy $\eta$ ,

[TABLE]

where the last equality follows from Property 1 and the fact that $N_{t+1}$ is unrelated to $S_{t}$ and $Q_{t}$ . Therefore, (53) implies that $\Theta_{t+1}$ is a function of $A_{t},O_{t+1},S^{o}_{t+1}$ and $\Theta_{t}=\left[\left[S_{t}|Q_{t}\right]\right]$ . ∎

We now prove Theorem 1. It is easy to observe using (51) that $\Theta_{t}$ can be completely characterized using $\Pi_{t},S_{t}^{o}$. Thus, to prove Theorem 1 it suffices to show that the optimal value function depends only on $\Theta_{t}$.

Proof of Theorem 1.

Lemma 10 ensures that optimal costs and optimal strategies are characterized by the dynamic program

[TABLE]

Therefore, it just remains to show that the above value function at $t$ can be written as a function of $\theta_{t}$ . Then the optimal value will depend only on $\theta_{t}$ instead of the entire $q_{t}$ . This claim about the value functions is proved by induction. At $T$ , we have

[TABLE]

Suppose this claim is true at $t+1$ . From Lemma 11 and the induction hypothesis we have

[TABLE]

Since $(O_{t+1},S^{o}_{t+1})=\tilde{h}_{t+1}(S_{t},A_{t},N_{t+1})$ as in the proof of Lemma 11, the above equation can be further expressed as

[TABLE]

where the last equality follows from Property 1 since $\theta_{t}=\left[\left[S_{t}\,\middle|\,q_{t},a_{t}\right]\right]$ depends on the realization of $Q_{t},A_{t}$ and $N_{t+1}$ is unrelated to all variables before $t+1$ . Therefore, the value function at $t$ is equal to

[TABLE]

which finishes the proof of the claim. It is straightforward to see that a strategy achieving infimum at each stage will have a cost equal to $\sup_{{Q_{1}}}V^{*}_{1}(Q_{1})=\sup_{q_{1}\in\left[\left[Q_{1}\right]\right]}V^{*}_{1}(\theta_{1})$ where $\theta_{1}=\left[\left[S_{1}|q_{1}\right]\right]$ . Hence the proof is complete. ∎

Appendix B Proof of Lemma 1

Fix the estimator’s strategy to some arbitrary $\mathbf{g}$ . Define $S_{t}=(X_{t},E_{t},Y_{1:t-1})$ . Then,

[TABLE]

The instantaneous cost at time $t$ can be written as

[TABLE]

The problem of optimizing the transmission strategy is now an instance of the centralized minimax control problem discussed in Section II with $S_{t}$ as the directly observable state and $U_{t}$ as the action. Since there is no hidden state for the transmitter, the optimal transmission strategy at time $t$ is a function of the current state $S_{t}$ .

Since the above argument holds for any arbitrary estimation strategy $\mathbf{g}$ , it holds true for an optimal estimation strategy as well. Therefore, it is sufficient to consider transmission strategies of the form $U_{t}=f_{t}(S_{t})=f_{t}(X_{t},E_{t},Y_{1:t-1})$ . Moreover, since $E_{t}$ can be inferred from $Y_{1:t-1}$ , we can further restrict transmission strategies to the form $U_{t}=f_{t}(X_{t},Y_{1:t-1})$ without any loss in performance.

Appendix C Proof of Lemmas 4 and 5

Proof of Lemma 4

The proof is trivial if $\lambda=0$ , so we will focus on the case of $\lambda\neq 0$ . For a set $S$ and $x\in\mathbb{R}^{n}$ , define $r(S,x):=\sup_{z\in S}||z-x||$ . For a fixed $x$ , we can write,

[TABLE]

where we used the fact that for any vector $u$ , $||Au||=||u||$ since $A$ is an orthogonal matrix.

Let $\epsilon>0$ . Then, $\exists y_{\epsilon}\in\pi$ such that $||y_{\epsilon}-\tilde{x}||>r(\pi,\tilde{x})-\frac{\epsilon}{|\lambda|}$ . Taking $w=a_{t+1}\frac{A(y_{\epsilon}-\tilde{x})}{||y_{\epsilon}-\tilde{x}||}\operatorname{sign}(\lambda)$ and $y=y_{\epsilon}$ we get

[TABLE]

Since $\epsilon$ is arbitrary, (60) and (62) imply

[TABLE]

Using (61) and (63) we get $r(\phi_{t}(\pi),x)=|\lambda|r(\pi,\tilde{x})+a_{t+1}$ . Thus,

[TABLE]

where the second equality follows since $\frac{1}{\lambda}A^{-1}$ is invertible.

Proof of Lemma 5

We start by showing that the lemma is true for $t=T$ . Note that $V_{T}(\pi,\tilde{e})=r^{*}(\pi)$ by definition of $r^{*}(\pi)$ and $V_{T}(\pi,\tilde{e})$ . Therefore, it follows trivially that $V_{T}$ satisfies property $\mathbf{Q}$ .

Now, consider two sets $\theta$ and $\tilde{\theta}$ such that $\theta\mathbf{Q}\tilde{\theta}$ . At $t=T$ , observe that if $e_{T}>0$ , the prescription $\gamma^{all}$ achieves the infimum in (24) and the corresponding infimum value is zero. Thus, $W_{T}(\theta,e)=W_{T}(\tilde{\theta},e)=0,\forall e>0$ . If $e_{T}=0$ then the only possible choice of $\gamma_{T}$ is $\gamma^{none}$ . Observe from (22) that $\psi(\theta,\gamma^{none},\epsilon)=\theta$ , thus it follows that $W_{T}(\theta,0)=V_{T}(\theta,0)$ from (24). Since $\theta\mathbf{Q}\tilde{\theta}$ and $V_{T}$ satisfies property $\mathbf{Q}$ , we have $W_{T}(\theta,0)=W_{T}(\tilde{\theta},0)$ . Thus, $W_{T}$ satisfies property $\mathbf{Q}$ .

We now proceed by induction to prove that the lemma is true for $t<T$ . We first show that if $W_{t+1}$ satisfies property $\mathbf{Q}$ , then so does $V_{t}$ . (25) can be simplified to the following:

[TABLE]

where $r^{*}(\pi_{t})=\inf_{\hat{x}_{t}\in\mathbb{R}^{n}}\sup_{x_{t}\in\pi_{t}}||x_{t}-\hat{x}_{t}||$. Let $\pi,\tilde{\pi}$ be two sets such that $\pi\mathbf{Q}\tilde{\pi}$. Then, $r^{*}(\pi)=r^{*}(\tilde{\pi})$. Hence, the first term inside the maximization in (65) is the same for $\pi$ and $\tilde{\pi}$.

It follows from Lemma 4 that if $\pi\mathbf{Q}\tilde{\pi}$ then $\phi_{t}(\pi)\mathbf{Q}\phi_{t}(\tilde{\pi})$ . Then, $W_{t+1}(\phi_{t}(\pi),e_{t})=W_{t+1}(\phi_{t}(\tilde{\pi}),e_{t})$ follows using the induction hypothesis. Thus, both the terms in the maximization in (65) satisfy property $\mathbf{Q}$ . Therefore, $V_{t}$ also satisfies property $\mathbf{Q}$ .

Next, we show that if $V_{t}$ satisfies property $\mathbf{Q}$ then so does $W_{t}$ . Observe that if $\{x_{1}\}$ and $\{x_{2}\}$ are two singleton sets then $V_{t}(\{x_{1}\},e)=V_{t}(\{x_{2}\},e)$ since $V_{t}$ satisfies property $\mathbf{Q}$ . Thus, we may write

[TABLE]

Let $e_{t}>0$ . Define $W_{t}^{\gamma}(\theta_{t},e_{t})$ for a given prescription $\gamma$ as follows:

[TABLE]

Then, $W_{t}(\theta_{t},e_{t})=\displaystyle\inf_{\gamma}W_{t}^{\gamma}(\theta_{t},e_{t})$ . For any prescription $\gamma$ , let ${A_{\gamma,\theta_{t}}:=\{x\in\theta_{t}:\gamma(x)=0\}}$ be the set of the state values in $\theta_{t}$ which are mapped to the control action $0$. If $A_{\gamma,\theta_{t}}=\emptyset$ , then

[TABLE]

If $\theta_{t}\setminus A_{\gamma,\theta_{t}}=\emptyset$ , then

[TABLE]

If neither $A_{\gamma,\theta_{t}}$ nor $\theta_{t}\setminus A_{\gamma,\theta_{t}}$ is empty, then

[TABLE]

Also, it is easy to see that for the prescriptions $\gamma^{all}$ and $\gamma^{none}$ we have $W_{t}^{\gamma^{all}}(\theta_{t},e_{t})=K_{t}(e_{t}-1)$ and $W_{t}^{\gamma^{none}}(\theta_{t},e_{t})=V_{t}(\theta_{t},e_{t})$ respectively. Thus, it is clear that

[TABLE]

Thus, either $\gamma^{all}$ or $\gamma^{none}$ is an optimal prescription at time $t$ .

Now, if $\theta\mathbf{Q}\tilde{\theta}$ , then it follows from the induction hypothesis that $W_{t}(\theta,e_{t})=\min\{V_{t}(\theta,e_{t}),K_{t}(e_{t}-1)\}=\min\{V_{t}(\tilde{\theta},e_{t}),K_{t}(e_{t}-1)\}=W_{t}(\tilde{\theta},e_{t})$ . Similar arguments can be made if $e_{t}=0$ . Therefore, $W_{t}$ satisfies property $\mathbf{Q}$ .

Thus, by induction, $V_{t}$ and $W_{t}$ satisfy property $\mathbf{Q}$ for all $t=1,2,\ldots,T$ .

Appendix D

Proof of Lemma 8

We first show that the post-transmission conditional range $\Pi_{t}$ is a ball centered around $\tilde{X}_{t}$ under a globally optimal prescription strategy. This can be done by a simple induction argument. At $t=1$, one of the following two cases occurs:

1. If $\gamma_{1}=\gamma^{all}$, then $\tilde{X}_{1}=X_{1}$ and $\Pi_{1}=\{X_{1}\}$.

2. If $\gamma_{1}=\gamma^{none}$, then $\tilde{X}_{1}=0$ and $\Pi_{1}=\{x_{1}:||x_{1}||\leq a_{1}\}$.

Hence, the claim is true for $t=1$. Suppose the claim is true for $t$. Then, at time $t+1$, one of the following two cases occurs:

1. If $\gamma_{t+1}=\gamma^{all}$, then $\tilde{X}_{t+1}=X_{t+1}$ and $\Pi_{t+1}=\{X_{t+1}\}$.

2. If $\gamma_{t+1}=\gamma^{none}$, then $\tilde{X}_{t+1}=\lambda A\tilde{X}_{t}$. In this case, $\Pi_{t+1}=\Theta_{t+1}=\{x_{t+1}:x_{t+1}=\lambda Ax_{t}+n_{t+1},\,x_{t}\in\Pi_{t},||n_{t+1}||\leq a_{t+1}\}$, i.e., $\Pi_{t+1}$ is obtained by rotating $\Pi_{t}$ using $A$, scaling it by $\lambda$ and then adding it to a ball of radius $a_{t+1}$ centered at the origin. Using the induction hypothesis that $\Pi_{t}$ is a ball centered at $\tilde{X}_{t}$, it follows that $\Pi_{t+1}$ is a ball centered at $\tilde{X}_{t+1}=\lambda A\tilde{X}_{t}$.

Thus, $\Pi_{t}$ is a ball centered around $\tilde{X}_{t}$ for all $t$. Therefore, the infimum in (25) will be achieved by $\tilde{X}_{t}$. Hence, $\tilde{X}_{t}$ is the optimal estimate at time $t$.
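The no-transmission step of this induction can be illustrated in two dimensions. The sketch below assumes that $A$ is the planar rotation by a given angle (one example of an orthogonal matrix) and propagates a ball's center and radius as $\lambda A\tilde{X}_{t}$ and $|\lambda|r+a_{t+1}$; it then samples boundary points of the image to check that none falls outside the propagated ball. All names and the sampling scheme are illustrative.

```python
import math
import random


def propagate_ball(center, radius, lam, angle, a_next):
    """Center and radius of {lam * A x + n : x in ball, ||n|| <= a_next}.

    A is taken to be the 2-D rotation by `angle` (an orthogonal matrix).
    """
    cx, cy = center
    c, s = math.cos(angle), math.sin(angle)
    new_center = (lam * (c * cx - s * cy), lam * (s * cx + c * cy))
    return new_center, abs(lam) * radius + a_next


def max_sampled_distance(center, radius, lam, angle, a_next, samples=2000, seed=0):
    """Largest distance from the propagated center over sampled image points."""
    rng = random.Random(seed)
    (nx, ny), _ = propagate_ball(center, radius, lam, angle, a_next)
    c, s = math.cos(angle), math.sin(angle)
    worst = 0.0
    for _ in range(samples):
        p, q = rng.uniform(0, 2 * math.pi), rng.uniform(0, 2 * math.pi)
        x = (center[0] + radius * math.cos(p), center[1] + radius * math.sin(p))
        n = (a_next * math.cos(q), a_next * math.sin(q))
        y = (lam * (c * x[0] - s * x[1]) + n[0], lam * (s * x[0] + c * x[1]) + n[1])
        worst = max(worst, math.hypot(y[0] - nx, y[1] - ny))
    return worst
```

Sampling only gives a lower estimate of the true maximum distance, so the check confirms containment but does not by itself prove that the radius is attained.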

Proof of Theorem 2

We will argue that the strategies $\mathbf{f}^{*},\mathbf{g^{*}}$ achieve the globally optimal cost for Problem 1. Denote the $K$ time instants with $U^{d*}_{t}$ equal to $1$ by $1\leq t_{1}\leq\ldots\leq t_{K}\leq T$, with the convention that $t_{K+1}=T+1$, $t_{0}=0$ and $X_{0}^{d}=0$. (If $t_{i}=t_{i+1}$ for some $i$, the controller chooses control action $1$ fewer than $K$ times.)

Now, in Problem 3, if $t_{i}+1<t_{i+1}$ , the state grows in the interval $[t_{i}+1,t_{i+1}-1]$ for all $i$ and in the interval $[t_{K}+1,T]$ if $t_{K}<T$ . Therefore,

[TABLE]

Using (68) and the state dynamics we can write

[TABLE]

Now, consider the worst case instantaneous cost in Problem 1 under the strategy $\mathbf{f}^{*},\mathbf{g^{*}}$. First consider the interval $[1,t_{1}]$. If $t_{1}=1$, then the estimation error is $0$ in this interval. When $t_{1}>1$, let $1\leq t<t_{1}$; then $\hat{X}_{t}=0$ under $\mathbf{g^{*}}$. Then at time $t$, the worst case estimation error is $\sup_{N_{1:t}}||\sum_{j=0}^{t-1}\lambda^{j}A^{j}N_{t-j}||=\sum_{j=1}^{t}|\lambda|^{t-j}a_{j}$. Hence, the worst case estimation error in $[1,t_{1}]$ is $\left(\sum_{j=1}^{t_{1}-1}|\lambda|^{t_{1}-1-j}a_{j}\right)\mathbb{I}_{1<t_{1}}$. Repeating this argument, we get that the worst case estimation error in the interval $[t_{i}+1,t_{i+1}-1]$ is $\left(\sum_{j=t_{i}+1}^{t_{i+1}-1}|\lambda|^{t_{i+1}-1-j}a_{j}\right)\,\mathbb{I}_{t_{i}+1<t_{i+1}}$. The cost incurred by the pair $\mathbf{f}^{*},\mathbf{g^{*}}$ is the maximum of the worst case estimation error in each interval and thus $J(\mathbf{f}^{*},\mathbf{g^{*}})=J^{d}(U^{d*}_{1:T})$ using (69). Now, since $U^{d*}_{1:T}$ is the optimal open loop sequence, it must achieve the optimal cost for Problem 3, which is the same as the optimal cost for Problem 1 from Lemma 7. Therefore, $(\mathbf{f}^{*},\mathbf{g}^{*})$ is globally optimal.
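As a worked instance of the worst case error expression used in this proof, take the scalar case with $A=1$, $\lambda=2$, $a_{j}=1$ and no transmission up to time $t=3$ (illustrative values, not taken from the paper):

```latex
\sup_{N_{1:3}}\Big|\sum_{j=0}^{2}\lambda^{j}N_{3-j}\Big|
  \;=\; \sum_{j=1}^{3}|\lambda|^{3-j}a_{j}
  \;=\; 2^{2}\cdot 1 + 2^{1}\cdot 1 + 2^{0}\cdot 1
  \;=\; 7 .
```

The supremum is attained, for example, at $N_{1}=N_{2}=N_{3}=1$, for which $X_{3}=4+2+1=7$ while the estimate remains $0$.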

Appendix E Proof of Lemma 9

Consider some open loop sequence $U^{d}_{t}$ and let the $K$ time instants with $U^{d}_{t}$ equal to $1$ be denoted by $1\leq t_{1}\leq\ldots\leq t_{K}\leq T$, with the convention that $t_{K+1}=T+1$, $t_{0}=0$. Define $y_{i}=t_{i}-t_{i-1}$ for $1\leq i\leq K+1$. We refer to $\{y_{i}\}_{1\leq i\leq K+1}$ as the partition of the time horizon. Then, $\sum_{i=1}^{K+1}y_{i}=T+1$. Since $K<T$, $t_{i}+1<t_{i+1}$ will hold for some $i$. Then, using the proof of Theorem 2, observe that the cost incurred for a partition $\{y_{i}\}$ would be $(\max_{i}\frac{|\lambda|^{y_{i}-1}-1}{|\lambda|-1})a=(\frac{|\lambda|^{\max_{i}y_{i}-1}-1}{|\lambda|-1})a$ when $|\lambda|\neq 1$. We will show that $\max_{i}y_{i}$ is at least $\Delta$ for any partition. We first consider the case when $\frac{T+1}{K+1}$ is not an integer. Suppose $\displaystyle\max_{i}y_{i}<\left\lceil\frac{T+1}{K+1}\right\rceil$; then $y_{i}\leq\left\lfloor\frac{T+1}{K+1}\right\rfloor\,\forall i$, so that

[TABLE]

(70) gives a contradiction since $\sum_{i=1}^{K+1}y_{i}=T+1$ . For the case when $\frac{T+1}{K+1}$ is an integer, a similar contradiction can be obtained by noting that $y_{i}\leq\frac{T+1}{K+1}-1\,\forall i$ . Thus, $\displaystyle\max_{i}y_{i}\geq\left\lceil\frac{T+1}{K+1}\right\rceil=\Delta.$

Now, consider the strategy where $U_{t}^{d}=1$ when $t=m\Delta$ for some $m\in\{1,\ldots,K\}$ . Note that $l\Delta\leq T<(l+1)\Delta$ for some $1\leq l\leq K$ . It is easy to check that $\max_{i}y_{i}=\Delta$ for this strategy and hence it achieves the optimal cost. The proof for the case when $|\lambda|=1$ can be easily obtained in a similar manner.
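The pigeonhole bound $\max_{i}y_{i}\geq\Delta$ and its tightness can be cross-checked by brute force on small instances. The sketch below enumerates all schedules with exactly $K$ distinct transmission times, using the conventions $t_{0}=0$ and $t_{K+1}=T+1$ from the proof; the function name is illustrative.

```python
from itertools import combinations


def min_max_gap(T, K):
    """Minimum over schedules with K distinct times of max_i (t_i - t_{i-1})."""
    best = T + 1
    for times in combinations(range(1, T + 1), K):
        pts = (0,) + times + (T + 1,)      # conventions t_0 = 0, t_{K+1} = T + 1
        best = min(best, max(b - a for a, b in zip(pts, pts[1:])))
    return best
```

For each small $(T,K)$ with $K<T$, the returned value can be compared against $\left\lceil\frac{T+1}{K+1}\right\rceil$.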

Bibliography


[1] H. Li, L. Lai, and W. Zhang, “Communication requirement for reliable and secure state estimation and control in smart grid,” IEEE Transactions on Smart Grid, vol. 2, pp. 476–486, Sept 2011.
[2] J. P. Hespanha, P. Naghshtabrizi, and Y. Xu, “A survey of recent results in networked control systems,” Proceedings of the IEEE, vol. 95, pp. 138–162, Jan 2007.
[3] M. S. Kiran, P. Rajalakshmi, K. Bharadwaj, and A. Acharyya, “Adaptive rule engine based iot enabled remote health care data acquisition and smart transmission system,” in Internet of Things (WF-IoT), 2014 IEEE World Forum on, pp. 253–258, IEEE, 2014.
[4] M. Athans, “On the determination of optimal costly measurement strategies for linear stochastic systems,” Automatica, vol. 8, no. 4, pp. 397–412, 1972.
[5] J. S. Baras and A. Bensoussan, “Optimal sensor scheduling in nonlinear filtering of diffusion processes,” SIAM Journal on Control and Optimization, vol. 27, no. 4, pp. 786–813, 1989.
[6] W. Wu and A. Arapostathis, “Optimal sensor querying: General markovian and lqg models with controlled observations,” IEEE Transactions on Automatic Control, vol. 53, pp. 1392–1405, July 2008.
[7] M. Naghshvar and T. Javidi, “Active hypothesis testing: Sequentiality and adaptivity gains,” in 2012 46th Annual Conference on Information Sciences and Systems (CISS), pp. 1–6, March 2012.
[8] O. C. Imer and T. Basar, “Optimal estimation with limited measurements,” in Decision and Control, 2005 and 2005 European Control Conference. CDC-ECC’05. 44th IEEE Conference on, pp. 1029–1034, IEEE, 2005.
