Data-Driven Minimum-Energy Controls for Linear Systems

Giacomo Baggio; Vaibhav Katewa; and Fabio Pasqualetti

arXiv:1902.02228·math.OC·May 1, 2019

Data-Driven Minimum-Energy Controls for Linear Systems

Giacomo Baggio, Vaibhav Katewa, and Fabio Pasqualetti

PDF

Open Access

TL;DR

This paper introduces a data-driven method to compute minimum-energy controls for linear systems directly from experimental data, bypassing the need for system models and improving reliability especially for large, uncertain systems.

Contribution

It demonstrates that optimal controls can be learned exactly from data with finite experiments, challenging traditional model-based approaches.

Findings

01

Exact control learning from data without system models

02

Algorithm outperforms traditional model-based computation

03

Applicable to large, uncertain linear systems

Abstract

In this paper we study the problem of computing minimum-energy controls for linear systems from experimental data. The design of open-loop minimum-energy control inputs to steer a linear system between two different states in finite time is a classic problem in control theory, whose solution can be computed in closed form using the system matrices and its controllability Gramian. Yet, the computation of these inputs is known to be ill-conditioned, especially when the system is large, the control horizon long, and the system model uncertain. Due to these limitations, open-loop minimum-energy controls and the associated state trajectories have remained primarily of theoretical value. Surprisingly, in this paper we show that open-loop minimum-energy controls can be learned exactly from experimental data, with a finite number of control experiments over the same time horizon, without…

Equations59

x (t + 1) = A x (t) + B u (t),

x (t + 1) = A x (t) + B u (t),

\displaystyle\begin{array}[]{ll}\min\limits_{u}&\sum\limits_{t=0}^{T-1}\|u(t)\|_{2}^{2},\\[10.00002pt] \,\text{s.t.}&x(t+1)=Ax(t)+Bu(t),\\[5.0pt] &x(0)=x_{0},\ x(T)={x}_{\textup{f}}.\end{array}

\displaystyle\begin{array}[]{ll}\min\limits_{u}&\sum\limits_{t=0}^{T-1}\|u(t)\|_{2}^{2},\\[10.00002pt] \,\text{s.t.}&x(t+1)=Ax(t)+Bu(t),\\[5.0pt] &x(0)=x_{0},\ x(T)={x}_{\textup{f}}.\end{array}

W_{T} = t = 0 \sum T - 1 A^{t} B B^{T} (A^{T})^{t}

W_{T} = t = 0 \sum T - 1 A^{t} B B^{T} (A^{T})^{t}

u^{*} (t) = B^{T} (A^{T})^{T - t - 1} W_{T}^{†} (x_{f} - A^{T} x_{0}),

u^{*} (t) = B^{T} (A^{T})^{T - t - 1} W_{T}^{†} (x_{f} - A^{T} x_{0}),

x_{f} = A^{T} x_{0} + C_{T} [B A B \dots A^{T - 1} B] u,

x_{f} = A^{T} x_{0} + C_{T} [B A B \dots A^{T - 1} B] u,

u^{*} = C_{T}^{†} (x_{f} - A^{T} x_{0}) .

u^{*} = C_{T}^{†} (x_{f} - A^{T} x_{0}) .

x_{i} = A^{T} x_{0} + C_{T} u_{i} .

x_{i} = A^{T} x_{0} + C_{T} u_{i} .

X = [x_{1} \dots x_{N}], and U = [u_{1} \dots u_{N}],

X = [x_{1} \dots x_{N}], and U = [u_{1} \dots u_{N}],

\displaystyle\begin{array}[]{lcl}\alpha^{*}=&\arg\min\limits_{\alpha}&\|U\alpha\|_{2}^{2},\\[10.00002pt] &\text{s.t.}&{x}_{\textup{f}}=X\alpha,\end{array}

\displaystyle\begin{array}[]{lcl}\alpha^{*}=&\arg\min\limits_{\alpha}&\|U\alpha\|_{2}^{2},\\[10.00002pt] &\text{s.t.}&{x}_{\textup{f}}=X\alpha,\end{array}

u^{*} = (I - U K (U K)^{†}) U X^{†} x_{f},

u^{*} = (I - U K (U K)^{†}) U X^{†} x_{f},

\tilde{x}_{f}

\tilde{x}_{f}

= C_{T} U X^{†} x_{f} - = 0 because C_{T} U K = X K = 0 C_{T} U K (U K)^{†} U X^{†} x_{f} = X X^{†} x_{f},

\displaystyle\begin{array}[]{ll}C_{T}^{*}=\arg\min\limits_{C}&\|X-CU\|_{F}^{2},\end{array}

\displaystyle\begin{array}[]{ll}C_{T}^{*}=\arg\min\limits_{C}&\|X-CU\|_{F}^{2},\end{array}

(I - U K (U K)^{†}) U X^{†} x_{f} = (X U^{†})^{†} x_{f} .

(I - U K (U K)^{†}) U X^{†} x_{f} = (X U^{†})^{†} x_{f} .

(U K)^{T} P = 0 ⟹ P = P^{T} P U K = 0 \Rightarrow P U X^{†} X = P U .

(U K)^{T} P = 0 ⟹ P = P^{T} P U K = 0 \Rightarrow P U X^{†} X = P U .

X (I - U^{†} U) = 0 \Rightarrow X U^{†} U = X .

X (I - U^{†} U) = 0 \Rightarrow X U^{†} U = X .

X U^{†} (I - P) = X U^{†} U K (U K)^{†} = \eqref e q : e q_{p} f_{p} r o p_{2} X K (U K)^{†} = 0.

X U^{†} (I - P) = X U^{†} U K (U K)^{†} = \eqref e q : e q_{p} f_{p} r o p_{2} X K (U K)^{†} = 0.

U K (U K)^{†} = I - P = 0

U K (U K)^{†} = I - P = 0

\Rightarrow (I - P) (I - U U^{†}) = [(I - P) (I - U U^{†})]^{T}

\Rightarrow U U^{†} P = P U U^{†},

\displaystyle\begin{array}[]{ll}M^{*}=\arg\min\limits_{M}&\|MX-U\|_{F}^{2}.\end{array}

\displaystyle\begin{array}[]{ll}M^{*}=\arg\min\limits_{M}&\|MX-U\|_{F}^{2}.\end{array}

M^{*} = U X^{†},

M^{*} = U X^{†},

\overset{u}{^} = M^{*} x_{f} = U X^{†} x_{f} .

\overset{u}{^} = M^{*} x_{f} = U X^{†} x_{f} .

\frac{1}{N} k = 1 \sum N U_{ik} U_{j k} a.s. E [U_{i 1} U_{j 1}] = {σ^{2}, 0, if i = j, if i \neq = j,

\frac{1}{N} k = 1 \sum N U_{ik} U_{j k} a.s. E [U_{i 1} U_{j 1}] = {σ^{2}, 0, if i = j, if i \neq = j,

\frac{1}{N} U U^{T} a.s. σ^{2} I as N \to \infty.

\frac{1}{N} U U^{T} a.s. σ^{2} I as N \to \infty.

U X^{†}

U X^{†}

= f (\frac{1}{N} U U^{T}) a.s. f (σ^{2} I) = C_{T}^{†} .

\displaystyle\begin{array}[]{ll}\min\limits_{\alpha}&\|U\alpha\|_{2}^{2},\\[10.00002pt] \,\text{s.t.}&{x}_{\textup{f}}=X\alpha\ \text{ and }\ 1=\mathbbold{1}^{\mathsf{T}}\alpha,\end{array}

\displaystyle\begin{array}[]{ll}\min\limits_{\alpha}&\|U\alpha\|_{2}^{2},\\[10.00002pt] \,\text{s.t.}&{x}_{\textup{f}}=X\alpha\ \text{ and }\ 1=\mathbbold{1}^{\mathsf{T}}\alpha,\end{array}

u^{*} = (I - U K (U K)^{†}) U \overset{ˉ}{X}^{†} \overset{x}{ˉ}_{f} .

u^{*} = (I - U K (U K)^{†}) U \overset{ˉ}{X}^{†} \overset{x}{ˉ}_{f} .

Bias [\overset{u}{^}]

Bias [\overset{u}{^}]

= [\frac{1}{2 ε} u_{1} ln (\frac{u _{1} + ε}{u _{1} - ε}) - 1] x_{f},

x (t + 1) = A x (t) + B u (t), y (t) = C x (t),

x (t + 1) = A x (t) + B u (t), y (t) = C x (t),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Control Systems and Identification · Probabilistic and Robust Engineering Design

Full text

Data-Driven Minimum-Energy Controls

for Linear Systems

Giacomo Baggio, Vaibhav Katewa, and Fabio Pasqualetti This material is based upon work supported in part by ARO 71603NSYIP. Giacomo Baggio, Vaibhav Katewa and Fabio Pasqualetti are with the Department of Mechanical Engineering, University of California at Riverside, {gbaggio, vkatewa, fabiopas}@engr.ucr.edu.

Abstract

In this paper we study the problem of computing minimum-energy controls for linear systems from experimental data. The design of open-loop minimum-energy control inputs to steer a linear system between two different states in finite time is a classic problem in control theory, whose solution can be computed in closed form using the system matrices and its controllability Gramian. Yet, the computation of these inputs is known to be ill-conditioned, especially when the system is large, the control horizon long, and the system model uncertain. Due to these limitations, open-loop minimum-energy controls and the associated state trajectories have remained primarily of theoretical value. Surprisingly, in this paper we show that open-loop minimum-energy controls can be learned exactly from experimental data, with a finite number of control experiments over the same time horizon, without knowledge or estimation of the system model, and with an algorithm that is significantly more reliable than the direct model-based computation. These findings promote a new philosophy of controlling large, uncertain, linear systems where data is abundantly available.

Index Terms:

Linear systems, optimal control, statistical learning, identification for control, control of networks.

I Introduction

Consider the discrete-time linear time-invariant system

[TABLE]

where, respectively, $A\in\mathbb{R}^{n\times n}$ and $B\in\mathbb{R}^{n\times m}$ denote the system and input matrices, and $x:\mathbb{N}\rightarrow\mathbb{R}^{n}$ and $u:\mathbb{N}\rightarrow\mathbb{R}^{m}$ describe the state and input of the system. For a control horizon $T\in\mathbb{N}$ and a desired state ${x}_{\textup{f}}$ , the minimum-energy control problem asks for the input sequence $u(0),\dots,u(T-1)$ with minimum energy that steers the state from $x_{0}$ to ${x}_{\textup{f}}$ in $T$ steps, and it can be formulated as

[TABLE]

As a classic result [1], the minimization problem (5) is feasible if and only if $({x}_{\textup{f}}-A^{T}x_{0})\in\operatorname{Im}(W_{T})$ , where

[TABLE]

is the $T$ -steps controllability Gramian and $\operatorname{Im}(W_{T})$ denotes the image of the matrix $W_{T}$ . Further, the solution to (5) is

[TABLE]

where $W_{T}^{\dagger}$ is the Moore–Penrose pseudoinverse of $W_{T}$ [2].

The controllability Gramian (6) and the minimum-energy control input (7) identify fundamental control limitations for the system (1), and have been extensively used to solve design [3], sensor and actuator placement [4], and control problems [5] for systems and networks. However, besides their theoretical value, the optimal control input (7) is rarely used in practice or even computed numerically because (i) it relies on the perfect knowledge of the system dynamics, (ii) its performance is not robust to model uncertainties, and (iii) the controllability Gramian is typically ill-conditioned, especially when the system is large [5, 6]. This implies that the control sequence (7) is numerically difficult to compute, and that its implementation leads to errors [7]. To the best of our knowledge, efficient and numerically reliable methods to compute minimum-energy control inputs are still lacking.

Paper contributions. This paper features two main contributions. First, we show that minimum-energy control inputs for linear systems can be computed from data obtained from control experiments with non-minimum-energy inputs, and without knowledge or estimation of the system matrices. Thus, optimal inputs can be learned from non-optimal ones, and we provide three different expressions for doing so. Surprisingly, we also establish that a finite number of non-optimal control experiments is always sufficient to compute minimum-energy control inputs towards any reachable state. Second, we show that the data-driven computation of minimum-energy inputs is numerically as reliable as the computation of the inputs based on the exact knowledge of the system matrices, and substantially more reliable than using the closed-form expression based on the Gramian. Further, as minor contributions, we (i) derive bounds on the number of required control experiments as a function of the dimension of the system, number of control inputs, and length of the control horizon, (ii) discuss the effect of noisy data on the data-driven expressions, and (iii) extend our data-driven framework to the case of output measurements.

Our results suggest the tantalizing hypothesis that several optimal control problems can be solved efficiently and reliably using a combination of data-driven algorithms and system properties (in our setup, linearity of the dynamics), even when the system model is uncertain or unknown.

Related work. Several works investigate the problem of estimating optimal controls for linear systems from input-output data. The classic model-based approach [8] consists of (i) identifying a model of the system from the available data, and (ii) using the estimated model to design the optimal control inputs. Data-driven algorithms have been proposed in [9, 10, 11, 12] for the LQR/LQG problem. In particular, the approach pursued in these papers relies on the estimation of the Markov parameters of the system, thereby bypassing the identification step of the model-based approach. Differently from the above approaches, in this paper we focus on computing open-loop minimum-energy inputs from experimental data, without reconstructing the system matrices and where the experiments use arbitrary control inputs. To the best of our knowledge, this paper addresses a novel problem and provides new and numerically more reliable expressions for the computation of minimum-energy control inputs.

II Learning minimum-energy control inputs

In vector form, the minimum-energy control problem asks to find the minimum-norm solution to the following equation:

[TABLE]

where the vector $u\in\mathbb{R}^{mT}$ contains the control inputs over the control horizon $[0,T-1]$ , namely $u=[u(T-1)^{\mathsf{T}}\,\cdots\,\,u(0)^{\mathsf{T}}]^{\mathsf{T}}$ , and $C_{T}$ denotes the $T$ -steps controllability matrix.111To simplify the technical treatment and without compromising generality, we assume that ${x}_{\textup{f}}$ is reachable in $T$ -steps, i.e., $\!({x}_{\textup{f}}-A^{T}x_{0})\in\operatorname{Im}(C_{T})$ . Then, if the controllability matrix $C_{T}$ is known, the minimum-energy control input to reach ${x}_{\textup{f}}$ is

[TABLE]

Instead of using (8), in this paper we aim to compute minimum-energy control inputs leveraging a set of $N$ control experiments and assuming that the system matrices, and thus the controllability matrix, are not available. The $i$ -th control experiment consists of applying the input sequence $u_{i}$ to (1), and measuring the system state at time $T$ , namely $x_{i}$ , where

[TABLE]

We remark that the inputs $u_{i}$ are arbitrary and not necessarily of minimum-norm. In vector form, the available data is

[TABLE]

where $x_{i}$ is the state at time $T$ with input $u_{i}$ as in (9).222While the full state trajectory could be measured [13], here we show that measuring the final state is sufficient to compute minimum-energy inputs.

II-A Data-driven minimum-energy controls

Because we only rely on the experimental data $(X,U)$ to learn the minimum-energy control input to reach a desired state, we postulate that such input can be computed as a linear combination of the inputs $U$ . Thus, we formulate and study the following constrained minimization problem:

[TABLE]

where $\alpha\in\mathbb{R}^{N}$ is the optimization variable. As we show in Theorem II.1, a first data-driven expression for the minimum-energy control input derives from a solution to (13). We start with the expression of the minimum-energy control input for the case $x_{0}=0$ , and we postpone the general case $x_{0}\neq 0$ to Remark 2. Let $\operatorname{Im}(M)$ and $\operatorname{Ker}(M)$ denote the range-space and the null-space of the matrix $M$ , respectively. With a slight abuse of notation, we write $K~{}=~{}\operatorname{Im}(A)$ (resp. $K=\operatorname{Ker}(A)$ ) to say that $K$ is a basis of $\operatorname{Im}(A)$ (resp. $\operatorname{Ker}(A)$ ). A matrix is full row rank if the dimension of its range-space equals the number of its rows.

Theorem II.1

(Data-driven minimum-energy control inputs when $x_{0}=0$ )* If the matrix $U$ in (10) is full row rank, then, for any final state ${x}_{\textup{f}}$ , the minimum-energy input equals*

[TABLE]

where $K=\operatorname{Ker}(X)$ and $X$ is as in (10).

Proof:

We first show that (13) is feasible, and that $u^{*}=U\alpha^{*}$ . Notice that, because $U$ is full row rank, there exists $\alpha^{*}$ such that $u^{*}=U\alpha^{*}$ , where $u^{*}$ is the minimum-energy control input to reach ${x}_{\textup{f}}$ . Additionally, $\alpha^{*}$ satisfies the constraint in (13) because $X\alpha^{*}=C_{T}U\alpha^{*}=C_{T}u^{*}={x}_{\textup{f}}$ . Finally, because $u^{*}$ is unique [1], $\alpha^{*}$ is also a solution to (13), and its computation is equivalent to computing the input $u^{*}$ .

To compute $\alpha^{*}$ we solve the constraint ${x}_{\textup{f}}=X\alpha$ and substitute it in the cost function. Namely, $\alpha=X^{\dagger}{x}_{\textup{f}}-Kw$ , where $K=\operatorname{Ker}(X)$ and $w$ is an arbitrary vector. Equating to zero the derivative of the cost function with respect to $w$ , we obtain $w^{*}=(UK)^{\dagger}UX^{\dagger}{x}_{\textup{f}}.$ This implies that $\alpha^{*}=X^{\dagger}{x}_{\textup{f}}-Kw^{*}$ , from which (14) follows by letting $u^{*}=U\alpha^{*}$ . ∎

Theorem II.1 provides an expression of the minimum-energy control input, which only uses data originated from a set of control experiments, and does not require the knowledge of the system matrices. Importantly, Theorem II.1 shows that minimum-energy control inputs can be directly computed based on a number of control experiments with arbitrary, thus not minimum-energy, inputs. Further, Theorem II.1 assumes that $U$ is full row rank, which guarantees the computation of the minimum-energy input for any final state ${x}_{\textup{f}}$ . When $U$ is not full row rank but $u^{*}\in\operatorname{Im}(U)$ , the minimum-energy control input can still be computed as in Theorem II.1. Instead, when $u^{*}\not\in\operatorname{Im}(U)$ , the minimum-energy input cannot be computed as a (linear) combination of the experimental data (10). In this case, the data-driven input (14) reaches the desired final state ${x}_{\textup{f}}$ , if ${x}_{\textup{f}}\in\operatorname{Im}(X)$ , or the final state ${\tilde{x}}_{\textup{f}}\in\operatorname{Im}(X)$ that is closest to ${x}_{\textup{f}}$ , if ${x}_{\textup{f}}\not\in\operatorname{Im}(X)$ . To see this, let $u^{*}$ be as in (14) and note that

[TABLE]

which shows that ${\tilde{x}}_{\textup{f}}$ is the orthogonal projection of ${x}_{\textup{f}}$ onto $\operatorname{Im}(X)$ . This in particular implies that the error $\|{x}_{\textup{f}}-{\tilde{x}}_{\textup{f}}\|_{2}$ is non-increasing in the number of experiments $N$ , and it vanishes when the experimental data satisfies ${x}_{\textup{f}}\in\operatorname{Im}(X)$ . Finally, Theorem II.1 can also be used to quantify the number of experiments needed to compute minimum-energy inputs.

Corollary II.2

(Required number of control experiments to compute minimum-energy inputs)* Let $n$ be the dimension of the system, $m$ the number of inputs, $T$ the control horizon, and $N$ the number of control experiments. Then,*

(i)

$N\geq n$ * is necessary to compute minimum-energy control inputs towards any arbitrary final state ${x}_{\textup{f}}$ ;* 2. (ii)

$N=mT$ * is sufficient to compute minimum-energy control inputs towards any arbitrary final state ${x}_{\textup{f}}$ , provided that the inputs $u_{i}$ are linearly independent.*

Proof:

(Necessity) Assume by contradiction that the number of experiments is strictly less than $n$ . Then, $\text{Rank}(X)<n$ , and there exists ${x}_{\textup{f}}\not\in\operatorname{Im}(X)$ . Then, the minimization problem (13) is infeasible, and the minimum-energy control input cannot be computed from the inputs $U$ .

(Sufficiency) Let the experimental inputs be linearly independent. Then, $U$ is invertible and, for any ${x}_{\textup{f}}$ , there exists a solution $\alpha^{*}$ such that $u^{*}=U\alpha^{*}$ . This shows that the minimum-energy input can be computed from the data. ∎

Corollary II.2 characterizes the number of control experiments that are required to compute minimum-energy control inputs from experimental data. In particular, as few as $n$ experiments are needed, in which case the experiments must contain $n$ linearly independent minimum-energy control inputs, and as many as $mT$ experiments are sufficient, in which case the control inputs can be selected arbitrarily provided that they form a linearly independent set of vectors. This also shows that optimal control inputs can be learned from a finite number of non-optimal control inputs.

Example 1

(Data-driven control inputs when $N\leq mT$ )* We consider a two-dimensional system with matrices $A$ and $B$ as in Fig. 1(b), control horizon $T=4$ , initial state $x_{0}=[0\ 0]^{\mathsf{T}}$ , and final state ${x}_{\textup{f}}=[0\ 1]^{\mathsf{T}}$ . We vary the number of control experiments $N$ from $1$ to $4$ , where the inputs are as in Fig. 1(c). For each number of control experiments, we compute the data-driven input (14), and report the corresponding state trajectory and norm in Fig. 1(a) and Fig. 1(d), respectively. Notice that, when $N=1$ , the data-driven input does not steer the system state to ${x}_{\textup{f}}$ . Instead, for $N=2,3,4$ the state trajectory reaches ${x}_{\textup{f}}$ . Finally, the data-driven input has minimum norm only when $N=4$ . $\square$ *

Remark 1

(Geometric properties of (14))* Several geometric properties of (14) can be highlighted. First, $UK=\operatorname{Ker}(C_{T})$ when $U$ is full row rank. In fact, $C_{T}UK=XK=0$ , showing that $\operatorname{Im}(UK)\subseteq\operatorname{Ker}(C_{T})$ . Further, if $C_{T}u=0$ and $u=U\alpha$ , then, $X\alpha=C_{T}U\alpha=C_{T}u=0$ , showing that $\alpha\in\operatorname{Im}(K)$ and $\operatorname{Ker}(C_{T})\subseteq\operatorname{Im}(UK)$ . Thus, $\operatorname{Im}(UK)=\operatorname{Ker}(C_{T})$ when $U$ is full row rank. Second, $I-UK(UK)^{\dagger}$ is the orthogonal projection onto the kernel of $(UK)^{\mathsf{T}}$ and, consequently, $u^{*}=(I-UK(UK)^{\dagger})UX^{\dagger}{x}_{\textup{f}}$ is orthogonal to $\operatorname{Ker}(C_{T})$ . This is expected, because $u^{*}$ is the minimum-energy control input to reach the state ${x}_{\textup{f}}$ . $\square$ *

II-B An alternative expression of minimum-energy controls

In this subsection, we present a different optimization problem that can be used to derive an equivalent expression of the data-driven minimum-energy control input (14). Specifically, we consider the following problem, which encodes the problem of estimating the controllability matrix from data:

[TABLE]

where $\|\cdot\|_{F}$ denotes the Frobenius norm of a matrix. The above problem has a unique solution, which equals $C_{T}^{*}=XU^{{\dagger}}$ . Notice that the minimization problem (16) returns an estimate of the controllability matrix, which can be used to compute the input as $\hat{u}=(C^{*}_{T})^{{\dagger}}{x}_{\textup{f}}=(XU^{{\dagger}})^{{\dagger}}{x}_{\textup{f}}$ . We next show that $\hat{u}$ coincides with the control input (14).

Theorem II.3

(Equivalent expressions of data-driven minimum-energy inputs)* Let $X$ and $U$ be as in (10). Then,*

[TABLE]

Proof:

We show that $(XU^{{\dagger}})^{{\dagger}}=(I-UK(UK)^{\dagger})UX^{\dagger}$ . That is, we show that $(I-UK(UK)^{\dagger})UX^{\dagger}$ satisfies the four conditions [2] defining the Moore–Penrose pseudoinverse of $XU^{\dagger}$ . To this aim, let $K=I-X^{{\dagger}}X$ . Since $P=I-UK(UK)^{\dagger}$ is the orthogonal projection onto $\text{Ker}((UK)^{\mathsf{T}})$ ,

[TABLE]

Because $X=C_{T}U$ , we have $\text{Ker}(U)\subseteq\text{Ker}(X)$ . Since $I-U^{{\dagger}}U$ is the orthogonal projection onto $\text{Ker}(U)$ , we have

[TABLE]

Further, using $XK=0$ , we obtain

[TABLE]

Finally, since $I-UU^{{\dagger}}$ denotes the orthogonal projection onto $\text{Ker}(U^{\mathsf{T}})$ , and $UK(UK)^{\dagger}$ the orthogonal projection onto $\text{Im}(UK)\subseteq\text{Im}(U)\perp\text{Ker}(U^{\mathsf{T}})$ , we have

[TABLE]

where the last implication follows because $I-P$ and $I-UU^{{\dagger}}$ are symmetric. To conclude, we show that $PUX^{{\dagger}}=(XU^{{\dagger}})^{\dagger}$ by proving the four Moore–Penrose conditions [2]:

(i)

I $PUX^{{\dagger}}XU^{{\dagger}}PUX^{{\dagger}}\overset{\eqref{eq:eq_pf_prop_1}}{=}PUU^{{\dagger}}PUX^{{\dagger}}\overset{\eqref{eq:eq_pf_prop_4}}{=}P^{2}UU^{{\dagger}}\cdot UX^{{\dagger}}\-=PUX^{{\dagger}}$ ; 2. (ii)

$XU^{{\dagger}}PUX^{{\dagger}}XU^{{\dagger}}\overset{\eqref{eq:eq_pf_prop_1}}{=}XU^{{\dagger}}PUU^{{\dagger}}=XU^{{\dagger}}UU^{{\dagger}}-XU^{{\dagger}}(I-P)UU^{{\dagger}}\overset{\eqref{eq:eq_pf_prop_3}}{=}XU^{{\dagger}}$ ; 3. (iii)

$XU^{{\dagger}}PUX^{{\dagger}}=XU^{{\dagger}}UX^{{\dagger}}-XU^{{\dagger}}(I-P)UX^{{\dagger}}\hskip 2.0pt\overset{\eqref{eq:eq_pf_prop_2},\,\eqref{eq:eq_pf_prop_3}}{=}XX^{{\dagger}}=(XX^{{\dagger}})^{\mathsf{T}}$ ; 4. (iv)

$PUX^{{\dagger}}XU^{{\dagger}}\overset{\eqref{eq:eq_pf_prop_1}}{=}PUU^{{\dagger}}\overset{\eqref{eq:eq_pf_prop_4}}{=}UU^{{\dagger}}P=(PUU^{{\dagger}})^{\mathsf{T}}$ .

∎

II-C An asymptotic expression of minimum-energy controls

The minimization problem (16) reconstructs the forward controllability matrix $C_{T}$ , from which minimum-energy control inputs can be derived by subsequently computing $C_{T}^{\dagger}$ . To avoid the computation of $C_{T}^{\dagger}$ and obtain a potentially simpler expression, we next consider the problem of directly estimating $C_{T}^{\dagger}$ from the experimental data:

[TABLE]

Notice that the latter problem is equivalent to estimating the inverse map from $X$ to $U$ , and it is typically more difficult than the problem of estimating the map from $U$ to $X$ . In fact, while the forward map is unique, the inverse map is typically not.333In particular, the inverse map is not unique whenever $mT>n$ . Further, the control input $M^{*}{x}_{\textup{f}}$ obtained by solving the minimization problem (23) is not guaranteed to be of minimum norm and to steer the system to ${x}_{\textup{f}}$ , as these constraints do not appear in the minimization problem. In what follows, we say that a sequence of random matrices $\{X_{n}\}_{n\in\mathbb{N}}$ converges almost surely (a.s.) to a matrix $X$ , and denote it with $X_{n}\xrightarrow{\text{a.s.}}X$ , if $\mathrm{Pr}(\lim_{n\to\infty}X_{n}=X)=1$ .

Theorem II.4

(Asymptotically equivalent expression to (14))* Let $X$ and $U$ be as in (10). The unique solution to the minimization problem (23) is*

[TABLE]

and the corresponding control input can be written as

[TABLE]

Further, if $X$ is full row rank, then $C_{T}M^{*}{x}_{\textup{f}}={x}_{\textup{f}}$ . That is, the control $\hat{u}$ steers the system from $x_{0}=0$ to $x(T)~{}=~{}{x}_{\textup{f}}$ . Finally, if the entries of $U$ are i.i.d. random variables with zero mean and nonzero finite variance, then $UX^{\dagger}\xrightarrow{\text{a.s.}}C_{T}^{\dagger}$ as $N\to\infty$ . That is, as the number of control experiments increases, the input $\hat{u}$ converges a.s. to the optimal input $u^{*}$ .

Proof:

The expression (24) follows from the properties of the Moore–Penrose pseudoinverse. For the second claim, we note that $C_{T}\hat{u}=C_{T}UX^{{\dagger}}{x}_{\textup{f}}=XX^{{\dagger}}{x}_{\textup{f}}={x}_{\textup{f}}$ , where we have used that $X$ is full row rank and $X=C_{T}U$ . To prove the third statement, let $N\to\infty$ , and let the control experiments be chosen so that the entries of $U$ are i.i.d. random variables with zero mean and finite variance $\sigma^{2}$ . Let $U_{ij}$ denote the $(i,j)$ -th entry of $U$ , and observe that the $(i,j)$ -th entry of $\frac{1}{N}UU^{\mathsf{T}}$ equals $\frac{1}{N}\sum_{k=1}^{N}U_{ik}U_{jk}$ . Because $\{U_{ik}U_{jk}\}_{k\in\mathbb{N}}$ is an i.i.d. sequence of random variables, for all $i$ , $j\in\{1,\dots,N\}$ and, due to the Strong Law of Large Numbers [14, p. 6], when $N\to\infty$ we have

[TABLE]

where $\operatorname{\mathbb{E}}[\cdot]$ denotes the expected value operator. Then,

[TABLE]

Next, consider the function $f:\mathbb{R}^{mT\times mT}\to\mathbb{R}^{mT\times n}$ , $Y\mapsto YC_{T}^{\mathsf{T}}(C_{T}YC_{T}^{\mathsf{T}})^{{\dagger}}$ . Note that $f(Y)$ is continuous at $Y=\alpha I$ , $\alpha>0$ ,444In fact, since $\text{Rank}(C_{T}YC_{T}^{\mathsf{T}})=\text{Rank}(C_{T}C_{T}^{\mathsf{T}})$ for any positive definite $Y$ , it holds $\lim_{k\to\infty}(C_{T}Y_{k}C_{T}^{\mathsf{T}})^{\dagger}=(\alpha\,C_{T}C_{T}^{\mathsf{T}})^{\dagger}$ for any sequence of positive definite matrices $\{Y_{k}\}_{k\in\mathbb{N}}$ such that $\lim_{k\to\infty}Y_{k}=\alpha I$ [2, p. 238]. and $f(\alpha I)=C_{T}^{\mathsf{T}}(C_{T}C_{T}^{\mathsf{T}})^{\dagger}=C_{T}^{\dagger}$ [2, p. 49]. To conclude, we employ the Continuous Mapping Theorem [14, Theorem 2.3(iii)] and (26) to obtain, as $N\to\infty$ ,

[TABLE]

∎

Theorem II.4 contains a data-driven expression of the minimum-energy control input for a linear system, which does not rely on the estimation of the system matrices or the controllability matrix. As we show in the next section, the expression (25) is not only conceptually simpler than the classic Gramian-based expression of the minimum-energy control input and our other data-driven expressions (14) and (17), but it is also numerically more reliable as it requires a smaller number of operations. Yet, differently from (14) and (17), the expression (25) coincides with the minimum-energy control only asymptotically in the number of experiments, and assuming that the entries of the input matrix $U$ are zero-mean i.i.d. random variables with nonzero finite variance.

Remark 2

(Data-driven minimum-energy control inputs when $x_{0}\neq 0$ )* When $x_{0}\neq 0$ , the computation of the minimum-energy control input to reach ${x}_{\textup{f}}$ is more involved, as the unknown matrix $A$ and vector $x_{0}$ enter the relation (9).555Notice that the term $A^{T}x_{0}$ remains unknown even if the exact value of $x_{0}\neq 0$ is known. Thus knowledge of $x_{0}$ does not modify the expressions we obtain when $x_{0}\neq 0$ is treated as an unknown variable. Yet, under a mild assumption on the experimental inputs $U$ , minimum-energy inputs can still be computed with a finite number of experiments. To see this, consider the problem*

[TABLE]

Assume that the matrix $U$ is full row rank, and that there exists a vector $w$ such that $Uw=0$ and $\mathbbold{1}^{\mathsf{T}}w\neq 0$ . The first assumption guarantees that there exists $\alpha^{*}$ such that $u^{*}=U\alpha^{*}$ , and thus the computation of the minimum-energy control for any final state ${x}_{\textup{f}}$ (cf. Theorem 2.1). The second assumption ensures that there exists $\alpha^{*}$ satisfying $1=\mathds{1}^{\mathsf{T}}\alpha^{*}$ , which allows us to correctly reconstruct the term $A^{T}x_{0}$ from $X$ .666These assumptions can always be satisfied by properly designing the experimental inputs, or by running sufficiently many random experiments. In fact, let $\alpha^{*}=U^{\dagger}u^{*}+w\,{(1-\mathbbold{1}^{\mathsf{T}}U^{\dagger}u^{*})}/{(\mathbbold{1}^{\mathsf{T}}w)},$ and notice that $u^{*}=U\alpha^{*}$ , where $u^{*}$ is the minimum-energy control input to reach ${x}_{\textup{f}}$ . Further, using (9) and $1=\mathbbold{1}^{\mathsf{T}}\alpha^{*}$ , we have $X\alpha^{*}=\sum_{i=1}^{N}X_{i}\alpha_{i}^{*}=A^{T}x_{0}\sum_{i=1}^{N}\alpha_{i}^{*}+C_{T}\sum_{i=1}^{N}\alpha_{i}^{*}U_{i}=A^{T}x_{0}+C_{T}u^{*}={x}_{\textup{f}}.$ Then, similarly to the proof of Theorem II.1, a solution to (29) determines the minimum-energy input.

To solve the minimization problem (29), let $\bar{X}=[X^{\mathsf{T}}\;\mathbbold{1}]^{\mathsf{T}}$ and ${\bar{x}}_{\textup{f}}=[{x}_{\textup{f}}^{\mathsf{T}}\;1]^{\mathsf{T}}$ . Then, similarly to Theorem II.1, we obtain $\alpha^{*}=\bar{X}^{\dagger}{x}_{\textup{f}}-K(UK)^{\dagger}U\bar{X}^{\dagger}{\bar{x}}_{\textup{f}},$ where $K=\operatorname{Ker}(\bar{X})$ , and

[TABLE]

Because the matrix $U$ is required to have a nontrivial null-space, a sufficient number of linearly-independent non-optimal experiments for the computation of the minimum-energy control input to any arbitrary final state is $mT+1$ .

Finally, from the above reasoning and the proof of Theorem II.3 and Theorem II.4, the minimum-energy input (30) can be written equivalently as $u^{*}=(\bar{X}U^{\dagger})^{\dagger}{\bar{x}}_{\textup{f}}=U\bar{X}^{\dagger}{\bar{x}}_{\textup{f}}$ , where the last equality holds asymptotically for any choice of inputs satisfying the assumptions in Theorem II.4. $\square$

Remark 3

(Data-driven expressions with noisy data)* Let the measurements of the input $u_{i}$ and the final state $x_{i}$ be corrupted by noise. Let $\tilde{U}=[u_{1}+w_{1}\ \cdots\ u_{N}+w_{N}]$ and $\tilde{X}=[x_{1}+v_{1}\ \cdots\ x_{N}+v_{N}]$ be the matrices obtained by concatenating all noisy measurements. The data-driven estimates (14), (17), and (25) computed from the noisy data $(\tilde{U},\tilde{X})$ are typically biased. To see this, consider the system $x(t+1)=ax(t)+u(t)$ , $a\in\mathbb{R}$ , $x_{0}=0$ , and $T=N=1$ . In this simple case, expressions (14), (17), and (25) are equivalent and, assuming that $x_{1}+v_{1}\neq 0$ , read as $\hat{u}=\frac{u_{1}+w_{1}}{x_{1}+v_{1}}x_{\text{f}}$ . If $w_{1}$ and $v_{1}$ are independent random variables uniformly distributed in $[-\varepsilon,\varepsilon]$ , with $0<\varepsilon<|u_{1}|$ , it holds*

[TABLE]

where $\mathbb{E}_{z}[\cdot]$ denotes the expected value with respect to $z$ . It can be shown that, if $u_{1}$ and $x_{\text{f}}$ are nonzero, the previous equation vanishes only in the limit $\varepsilon\to 0$ . This implies that all data-driven expressions in this simple case are biased. When $n>1$ , a quantitative characterization of the bias (and covariance) of the data-driven expressions appears to be difficult, due to the presence of pseudoinverse operations. However, numerical simulations with i.i.d. normally distributed noise (see also Fig. 2) suggest that (i) all data-driven expressions are biased in the case of noisy measurements, (ii) the magnitude of the bias is proportional to the standard deviation $\sigma$ of the noise for (17) and (25), while it increases rapidly as $\sigma$ grows and sets to a constant value for (14). $\square$

Remark 4

(Data-driven expressions with output measurements)* Consider the system*

[TABLE]

where $C\in\mathbb{R}^{p\times n}$ , and assume that for each experimental input $u_{i}$ , $i\in\{1,\dots,N\}$ , we can measure the output of the system at time $T$ , namely, $y_{i}=Cx_{i}$ . Let $Y=[y_{1}\ \cdots\ y_{N}]\in\mathbb{R}^{p\times N}$ be the matrix concatenating all output measurements, and assume that the system is output controllable in $T$ steps. That is, the $T$ -steps output controllability matrix $C_{O,T}=[CB\ \ CAB\ \ \cdots\ \ CA^{T-1}B]$ has full row rank [15]. The minimum-energy input to reach the output $y_{\text{f}}\in\mathbb{R}^{p}$ in $T$ steps is $u^{*}=C_{O,T}^{\dagger}(y_{\text{f}}-CA^{T}x_{0})$ . All results discussed in this paper apply to the case of output control after substituting $X$ and $x_{\text{f}}$ with $Y$ and $y_{\text{f}}$ , respectively. $\square$

III Numerical analysis

What remains unclear from the previous analysis is the benefit, if any, in collecting a large number of control experiments. We next show that increasing the number of control experiments can improve the numerical reliability and accuracy of computing minimum-energy control inputs.

In Fig. 3 we compare the numerical performance of the model-based expressions of the minimum-energy controls $u^{*}=C_{T}^{\dagger}{x}_{\textup{f}}$ and $u^{*}=C_{T}^{\mathsf{T}}W_{T}^{{\dagger}}{x}_{\textup{f}}$ (Gramian-based), with our data-driven expressions in (14), (17), and (25). In particular, in Fig. 3(a)-(b) we plot the norm of the control inputs and the numerical errors in reaching the final state ${x}_{\textup{f}}$ , for all strategies and as a function of the number $N$ of control experiments. Here, we focus on a “worst-case” analysis and choose a small input dimension ( $m=2$ ), since a large value of $m$ certainly improves the conditioning of all expressions. Fig. 3(a) shows that the norm of the data-driven control inputs (14) and (17) equals its minimum value when $N\geq mT$ (as predicted by Theorems II.1 and II.3), whereas the norm of the data-driven input (25) converges to its minimum value only asymptotically (as predicted by Theorem II.4). Fig. 3(b) shows that, for sufficiently large $N$ , the final state reached by the three data-driven control strategies is almost as close to ${x}_{\textup{f}}$ as the one computed via the model-based formula $u^{*}=C_{T}^{\dagger}{x}_{\textup{f}}$ , and considerably closer to ${x}_{\textup{f}}$ than the state reached by the Gramian-based control input, with expressions (14) and (25) being the most accurate, showing that the computation of the minimum-energy control input via our data-driven expression is as reliable as the computation of the input based on the exact knowledge of the system matrices, and numerically more reliable than the model-based Gramian formula. Instead, in Fig. 3(c)-(d) we plot the norm of the control inputs obtained through the different strategies described above and their corresponding errors in the final state as a function of the system dimension $n$ . As expected, the accuracy of the Gramian-based control input deteriorates rapidly as $n$ increases. Yet, surprisingly, the data-driven expressions of the minimum-energy control inputs remain accurate for systems of considerably larger dimension. Further, the data-driven control (25) yields the smallest error in the final state among the three data-driven strategies. This could be due to the simpler form of (25), which requires the computation of only one pseudoinverse, or to the fact that the energy of (25) reaches the minimum value only asymptotically in $N$ . Finally, Fig. 3(c)-(d) show that expression (14) becomes numerically unreliable for smaller values of the system dimension compared to (17) and (25). This is likely because of the additional computations in (14).

IV Conclusion and future work

In this paper we derive data-driven expressions of open-loop minimum-energy control inputs for linear systems. Leveraging linearity of the dynamics, we show that such optimal controls can be learned from a finite number of control experiments, without knowing or reconstructing the system matrices, and where the control experiments are conducted with non-optimal and arbitrary inputs. We derive three different data-driven expressions of minimum-energy controls: while (17) appears to be the simplest exact data-driven expression, (14) constitutes a radically different and new way of computing minimum-energy controls, and highlights several geometric connections between the minimum-energy solutions and the experimental data, and (25) provides a simple way of computing a family of data-driven, sub-optimal, minimum-energy controls. We further illustrate that our data-driven expressions of the minimum-energy inputs are simpler and numerically more reliable than the classic Gramian-based expression, especially when the dimension of the system increases.

The results of this paper support the intriguing idea of combining model-based control methods with data-driven techniques, showing that this new framework has the potential to considerably increase the reliability and effectiveness of the two parts alone. This paper also creates several directions of future research, including the extension to closed-loop, noisy, and model predictive control problems.

Bibliography15

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] T. Kailath. Linear Systems . Prentice-Hall, 1980.
2[2] A. Ben-Israel and T. N. E. Greville. Generalized inverses: theory and applications , volume 15 of CMS Books in Mathematics . Springer-Verlag New York, 2nd edition, 2003.
3[3] S. Zhao and F. Pasqualetti. Networks with diagonal controllability gramians: Analysis, graphical conditions, and design algorithms. Automatica , 102:10–18, 2019.
4[4] T. H. Summers, F. L. Cortesi, and J. Lygeros. On submodularity and controllability in complex dynamical networks. IEEE Transactions on Control of Network Systems , 3(1):91–101, 2016.
5[5] F. Pasqualetti, S. Zampieri, and F. Bullo. Controllability metrics, limitations and algorithms for complex networks. IEEE Transactions on Control of Network Systems , 1(1):40–52, 2014.
6[6] D. C. Sorensen and Y. Zhou. Bounds on eigenvalue decay rates and sensitivity of solutions to Lyapunov equations. Technical Report 02-07, Rice University, Houston, TX, 2002.
7[7] J. Sun and A. E. Motter. Controllability transition and nonlocality in network control. Physical Review Letters , 110(20):208701, 2013.
8[8] M. Gevers. Identification for control: From the early achievements to the revival of experiment design. European Journal of Control , 11:1–18, 2005.