A trajectory-based framework for data-driven system analysis and control

Julian Berberich; Frank Allg\"ower

arXiv:1903.10723·cs.SY·October 27, 2020

A trajectory-based framework for data-driven system analysis and control

Julian Berberich, Frank Allg\"ower

PDF

TL;DR

This paper presents a trajectory-based data-driven framework for analyzing and controlling LTI and certain nonlinear systems, enabling system understanding and control without explicit model identification by leveraging measured trajectories and kernel methods.

Contribution

It extends behavioral system theory to classical state-space and nonlinear systems, introducing kernel methods for data-driven simulation.

Findings

01

Single measured trajectory can capture full system behavior for LTI systems.

02

Extension to nonlinear systems linear in input-output coordinates.

03

Kernel methods enable rich basis functions for data-driven simulation.

Abstract

The vector space of all input-output trajectories of a discrete-time linear time-invariant (LTI) system is spanned by time-shifts of a single measured trajectory, given that the respective input signal is persistently exciting. This fact, which was proven in the behavioral control framework, shows that a single measured trajectory can capture the full behavior of an LTI system and might therefore be used directly for system analysis and controller design, without explicitly identifying a model. In this paper, we translate the result from the behavioral context to the classical state-space control framework and we extend it to certain classes of nonlinear systems, which are linear in suitable input-output coordinates. Moreover, we show how this extension can be applied to the data-driven simulation problem, where we introduce kernel-methods to obtain a rich set of basis functions.

Equations66

H_{L}

H_{L}

x_{[a, b]} = x_{a} ⋮ x_{b} .

x_{[a, b]} = x_{a} ⋮ x_{b} .

x_{k + 1} y_{k} = A x_{k} + B u_{k}, x_{0} = \overset{x}{ˉ}, = C x_{k} + D u_{k},

x_{k + 1} y_{k} = A x_{k} + B u_{k}, x_{0} = \overset{x}{ˉ}, = C x_{k} + D u_{k},

x_{k + 1}

x_{k + 1}

y_{k}

[H_{L} (u) H_{L} (y)] α = [\overset{u}{ˉ} \overset{y}{ˉ}] .

[H_{L} (u) H_{L} (y)] α = [\overset{u}{ˉ} \overset{y}{ˉ}] .

\overset{u}{ˉ}_{[0, L - 1]}

\overset{u}{ˉ}_{[0, L - 1]}

\overset{y}{ˉ}_{[0, L - 1]}

\overset{x}{ˉ}_{[0, L - 1]} = i = 0 \sum N - L α_{i} x_{[i, L - 1 + i]},

\overset{x}{ˉ}_{[0, L - 1]} = i = 0 \sum N - L α_{i} x_{[i, L - 1 + i]},

H_{L} (u) 0 H_{L} (y) 0 0 I_{ξ - 1} \otimes H_{L - n} (u_{[n, N - 1]}) 0 I_{ξ - 1} \otimes H_{L - n} (y_{[n, N - 1]}) α^{1} ⋮ α^{ξ} =

H_{L} (u) 0 H_{L} (y) 0 0 I_{ξ - 1} \otimes H_{L - n} (u_{[n, N - 1]}) 0 I_{ξ - 1} \otimes H_{L - n} (y_{[n, N - 1]}) α^{1} ⋮ α^{ξ} =

H_{n} (u_{[L - n, N - 1]}) α^{i} = H_{n} (u_{[0, N - L + n - 1]})

H_{n} (y_{[L - n, N - 1]}) α^{i} = H_{n} (y_{[0, N - L + n - 1]})

i \in I_{[1, ξ - 1]} .

[\overset{u}{ˉ}_{[0, L - 1]}^{i} \overset{y}{ˉ}_{[0, L - 1]}^{i}] = [H_{L} (u) H_{L} (y)] α^{i}, i \in I_{[1, ξ]},

[\overset{u}{ˉ}_{[0, L - 1]}^{i} \overset{y}{ˉ}_{[0, L - 1]}^{i}] = [H_{L} (u) H_{L} (y)] α^{i}, i \in I_{[1, ξ]},

\overset{u}{ˉ}_{[0, \tilde{L} - 1]} = \overset{u}{ˉ}_{[0, L - 1]}^{1} \overset{u}{ˉ}_{[n, L - 1]}^{2} ⋮ \overset{u}{ˉ}_{[n, L - 1]}^{ξ}, \overset{y}{ˉ}_{[0, \tilde{L} - 1]} = \overset{y}{ˉ}_{[0, L - 1]}^{1} \overset{y}{ˉ}_{[n, L - 1]}^{2} ⋮ \overset{y}{ˉ}_{[n, L - 1]}^{ξ} .

\overset{u}{ˉ}_{[0, \tilde{L} - 1]} = \overset{u}{ˉ}_{[0, L - 1]}^{1} \overset{u}{ˉ}_{[n, L - 1]}^{2} ⋮ \overset{u}{ˉ}_{[n, L - 1]}^{ξ}, \overset{y}{ˉ}_{[0, \tilde{L} - 1]} = \overset{y}{ˉ}_{[0, L - 1]}^{1} \overset{y}{ˉ}_{[n, L - 1]}^{2} ⋮ \overset{y}{ˉ}_{[n, L - 1]}^{ξ} .

\overset{u}{ˉ}_{[L - n, L - 1]}^{i}

\overset{u}{ˉ}_{[L - n, L - 1]}^{i}

\overset{y}{ˉ}_{[L - n, L - 1]}^{i}

x_{k + 1} y_{k} = A x_{k} + B ψ (u_{k}), x_{0} = \overset{x}{ˉ}, = C x_{k} + D ψ (u_{k}),

x_{k + 1} y_{k} = A x_{k} + B ψ (u_{k}), x_{0} = \overset{x}{ˉ}, = C x_{k} + D ψ (u_{k}),

v_{k} = ψ_{1} (u_{k}) ⋮ ψ_{r} (u_{k}) .

v_{k} = ψ_{1} (u_{k}) ⋮ ψ_{r} (u_{k}) .

[H_{L} (v) H_{L} (y)] α = [\overset{v}{ˉ} \overset{y}{ˉ}],

[H_{L} (v) H_{L} (y)] α = [\overset{v}{ˉ} \overset{y}{ˉ}],

\overset{v}{ˉ}_{k} = ψ_{1} (\overset{u}{ˉ}_{k}) ⋮ ψ_{r} (\overset{u}{ˉ}_{k}) .

\overset{v}{ˉ}_{k} = ψ_{1} (\overset{u}{ˉ}_{k}) ⋮ ψ_{r} (\overset{u}{ˉ}_{k}) .

x_{k + 1} y_{k} = A x_{k} + \tilde{B} v_{k}, = C x_{k} + \tilde{D} v_{k},

x_{k + 1} y_{k} = A x_{k} + \tilde{B} v_{k}, = C x_{k} + \tilde{D} v_{k},

x_{k + 1} y_{k} = A x_{k} + B u_{k}, x_{0} = \overset{x}{ˉ}, = ϕ (C x_{k} + D u_{k}),

x_{k + 1} y_{k} = A x_{k} + B u_{k}, x_{0} = \overset{x}{ˉ}, = ϕ (C x_{k} + D u_{k}),

z_{k} = \tilde{ϕ}_{1} (y_{k}) ⋮ \tilde{ϕ}_{q} (y_{k}),

z_{k} = \tilde{ϕ}_{1} (y_{k}) ⋮ \tilde{ϕ}_{q} (y_{k}),

[H_{L} (u) H_{L} (z)] α = [\overset{u}{ˉ}_{[0, L - 1]} \overset{z}{ˉ}_{[0, L - 1]}],

[H_{L} (u) H_{L} (z)] α = [\overset{u}{ˉ}_{[0, L - 1]} \overset{z}{ˉ}_{[0, L - 1]}],

\overset{z}{ˉ}_{k} = \tilde{ϕ}_{1} (\overset{y}{ˉ}_{k}) ⋮ \tilde{ϕ}_{q} (\overset{y}{ˉ}_{k}) .

\overset{z}{ˉ}_{k} = \tilde{ϕ}_{1} (\overset{y}{ˉ}_{k}) ⋮ \tilde{ϕ}_{q} (\overset{y}{ˉ}_{k}) .

[H_{L} (u) H_{ν} (y_{[0, N - L + ν - 1]})] α = [\overset{u}{ˉ} \overset{y}{ˉ}_{[0, ν - 1]}] .

[H_{L} (u) H_{ν} (y_{[0, N - L + ν - 1]})] α = [\overset{u}{ˉ} \overset{y}{ˉ}_{[0, ν - 1]}] .

H_{L, ν} (u, y) : = [H_{L} (u) H_{ν} (y_{[0, N - L + ν - 1]})], \overset{w}{ˉ} : = [\overset{u}{ˉ} \overset{y}{ˉ}_{[0, ν - 1]}] .

H_{L, ν} (u, y) : = [H_{L} (u) H_{ν} (y_{[0, N - L + ν - 1]})], \overset{w}{ˉ} : = [\overset{u}{ˉ} \overset{y}{ˉ}_{[0, ν - 1]}] .

α \in R^{N - L + 1} minimize ∥ H_{L, ν} (u, y) α - \overset{w}{ˉ} ∥_{2}^{2} .

α \in R^{N - L + 1} minimize ∥ H_{L, ν} (u, y) α - \overset{w}{ˉ} ∥_{2}^{2} .

α \in R^{N - L + 1} minimize ∥ H_{L, ν} (u, y) α - \overset{w}{ˉ} ∥_{2}^{2} + λ ∥ α ∥_{2}^{2},

α \in R^{N - L + 1} minimize ∥ H_{L, ν} (u, y) α - \overset{w}{ˉ} ∥_{2}^{2} + λ ∥ α ∥_{2}^{2},

x_{k + 1} y_{k} = A x_{k} + B ψ (u_{k}), x_{0} = \overset{x}{ˉ}, = Φ (C x_{k} + D ψ (u_{k})) .

x_{k + 1} y_{k} = A x_{k} + B ψ (u_{k}), x_{0} = \overset{x}{ˉ}, = Φ (C x_{k} + D ψ (u_{k})) .

α \in R^{N - L + 1} minimize H_{L, ν} (v, z) α - [\overset{v}{ˉ} \overset{z}{ˉ}_{[0, ν - 1]}]_{2}^{2} + λ ∥ α ∥_{2}^{2} .

α \in R^{N - L + 1} minimize H_{L, ν} (v, z) α - [\overset{v}{ˉ} \overset{z}{ˉ}_{[0, ν - 1]}]_{2}^{2} + λ ∥ α ∥_{2}^{2} .

K_{ψ} (u_{k}^{1}, u_{k}^{2})

K_{ψ} (u_{k}^{1}, u_{k}^{2})

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

A trajectory-based framework for data-driven

system analysis and control

Julian Berberich, Frank Allgöwer The authors are with the Institute for Systems Theory and Automatic Control, University of Stuttgart, 70550 Stuttgart, Germany. E-mail: $\{$ julian.berberich, frank.allgower}@ist.uni-stuttgart.de, Phone: +49 711 685-{67747, 67733}, Contact for correspondence: J. Berberich. The authors thank the German Research Foundation (DFG) for support of this work within the German Excellence Strategy under grant EXC 2075, along with the Max Planck Research School (IMPRS) for Intelligent Systems for their support.

Abstract

The vector space of all input-output trajectories of a discrete-time linear time-invariant (LTI) system is spanned by time-shifts of a single measured trajectory, given that the respective input signal is persistently exciting. This fact, which was proven in the behavioral control framework, shows that a single measured trajectory can capture the full behavior of an LTI system and might therefore be used directly for system analysis and controller design, without explicitly identifying a model. In this paper, we translate the result from the behavioral context to the classical state-space control framework and we extend it to certain classes of nonlinear systems, which are linear in suitable input-output coordinates. Moreover, we show how this extension can be applied to the data-driven simulation problem, where we introduce kernel-methods to obtain a rich set of basis functions.

††publicationid: pubid:

This version has been accepted for publication in Proc. European Control Conference (ECC), 2020. Personal use of this material is permitted. Permission from EUCA must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

I Introduction

Finding rigorous and efficient ways to integrate data into control theory has been a problem of great interest for many decades. Since most of the classical contributions in control theory rely on model knowledge, the problem of finding such a model from measured data, i.e., system identification, has become a mature research field [1]. More recently, learning controllers directly from data has received increasing interest, not least due to many successful practical applications of reinforcement learning techniques [2]. However, as is thoroughly evaluated in [3], such methods typically require large amounts of data, they are often not reproducible, and their analysis rarely addresses rigorous guarantees on, e.g., stability of the closed loop. Also in the control community, several approaches for the direct design of controllers from data have been proposed. Established methods include the Virtual Reference Feedback Tuning paradigm [4] or Iterative Feedback Tuning [5]. However, fundamental problems such as the direct data-driven design of linear quadratic optimal controllers with guarantees from finite noisy data have only been considered recently [6, 7].

In this paper, we consider an alternative, unitary framework for data-driven control theory, which allows for the development of various system analysis and controller design methods based directly on measured data. This framework relies on the characterization of all trajectories of an unknown system using a single measured data trajectory. The latter problem has been solved in the context of behavioral systems theory for discrete-time linear time-invariant (LTI) systems in [8]. In the behavioral approach, a system is not defined via a differential or difference equation with inputs and outputs, but rather as the space of all system trajectories [9, 10]. Thus, it is naturally well-suited for the development of purely data-driven approaches to system analysis and control.

Recently, there have been various contributions, which use the result of [8] for direct data-driven system analysis and control. In [11, 12], a data-driven MPC scheme relying on [8] is suggested to control unknown systems. A stochastic analysis of this scheme and an application to power systems are detailed in [13] and [14], respectively. Moreover, [15] provides a first theoretical analysis of stability and robustness of a data-driven MPC scheme based on terminal equality constraints. In [16], a data-driven closed-loop parametrization under state-feedback is derived and employed to design stabilizing and LQR controllers. This approach is extended to robust design from noisy data in [17]. Further, [18] provides a general framework for analyzing data-driven problems with not persistently exciting data. Finally, data-based conditions for dissipativity are suggested in [19]. Altogether, this indicates a great potential of the work of [8] for direct data-driven analysis and control. In this paper, we consider the work of [8] in the classical control framework and extend it to certain classes of nonlinear systems. Moreover, we illustrate the usefulness of this extension via a novel kernel-based approach to nonlinear data-driven simulation.

The remainder of this paper is structured as follows. In Section III, we phrase the main theorem of [8], which uses measured data to characterize all system trajectories, in the classical control setting, and we show how this result can be improved by weaving multiple such trajectories together. In Section IV, we provide a novel extension of [8] to classes of nonlinear systems, which are linear in suitably chosen and known nonlinear coordinates. Building on these results, we solve the data-driven simulation problem for such nonlinear systems in Section V. The paper is concluded in Section VI.

II Setting

We denote the set of integers in the interval $[a,b]$ by $\mathbb{I}_{[a,b]}$ . The Kronecker product is written as $\otimes$ . For a sequence $\{x_{k}\}_{k=0}^{N-1}$ , we define the Hankel matrix

[TABLE]

For a stacked window of the sequence, we write

[TABLE]

Further, $x$ will denote either the sequence itself or the stacked vector $x_{[0,N-1]}$ containing all of its components. A key assumption for our results will be persistence of excitation of the input signal, as captured in the following standard definition.

Definition 1.

We say that a signal $\{x_{k}\}_{k=0}^{N-1}$ with $x_{k}\in\mathbb{R}^{n}$ is persistently exciting of order $L$ if $\text{rank}(H_{L}(x))=nL$ .

Note that the above definition implies $N\geq(n+1)L-1$ . This means that, for a signal to be persistently exciting, it is not sufficient that its time-shifts are linearly independent, but the signal must also be long enough. A large part of this paper deals with discrete-time multi-input multi-output LTI systems of the form

[TABLE]

where the matrices $A,B,C,D$ as well as the initial condition $\bar{x}$ are unknown and only input-output data $\{u_{k},y_{k}\}_{k=0}^{N-1}$ , which may be obtained from (1) via simulation or an experiment, is available. Throughout this paper, $n$ denotes the order of the unknown system which is only assumed to be known in terms of a potentially rough upper bound. Further, we denote the input and output dimension by $m$ and $p$ , respectively.

We will use a single trajectory to characterize all other trajectories, which might be produced from the system (1), i.e., which satisfy the following definition.

Definition 2.

We say that an input-output sequence $\{u_{k},y_{k}\}_{k=0}^{N-1}$ is a trajectory of an LTI system $G$ , if there exists an initial condition $\bar{x}\in\mathbb{R}^{n}$ as well as a state sequence $\{x_{k}\}_{k=0}^{N}$ such that

[TABLE]

for $k=0,\dots,N-1$ , where $(A,B,C,D)$ is a minimal realization of $G$ .

It follows from linearity that the set of all trajectories of an LTI system in the sense of Definition 2 is a vector space. As we will see in Section III, a basis for this vector space is formed by time-shifts of a single measured trajectory, given that the respective input signal is persistently exciting.

Throughout this paper, we make extensive use of the well-known fact that any LTI system admits a controllable and observable minimal realization. The particular choice of a specific minimal realization is however not relevant. Further, using LTI system properties, it is easy to show that any fixed window of an input-output trajectory $\{u_{k},y_{k}\}_{k=a}^{b}$ induces a unique state trajectory $\{x_{k}\}_{k=a}^{b}$ (in a given minimal realization), whenever $b-a\geq n-1$ .

III Trajectory-based representation of linear systems

In this section, we translate the main result of [8], which characterizes the trajectory space of an unknown system from measured data, to the classical state-space control framework. While the behavioral theory is naturally well-suited for such a result, we illustrate that it can also be formulated in the classical framework in an elegant way. Further, we show how a required persistence of excitation assumption can be relaxed by weaving multiple trajectories together to achieve an overall larger time horizon.

The following result is the correspondence of [8, Theorem 1] in the classical control setting and it will serve as the basis for the remainder of this paper.

Theorem 3.

Suppose $\{u_{k},y_{k}\}_{k=0}^{N-1}$ is a trajectory of an LTI system $G$ , where $u$ is persistently exciting of order $L+n$ . Then, $\{\bar{u}_{k},\bar{y}_{k}\}_{k=0}^{L-1}$ is a trajectory of $G$ if and only if there exists $\alpha\in\mathbb{R}^{N-L+1}$ such that

[TABLE]

Proof.

This is a direct application of [8, Theorem 1] to the special case of controllable state-space systems. ∎

Note that (2) is equivalent to

[TABLE]

i.e., the trajectory space is spanned by time-shifts of the measured trajectory. Similarly, it holds for the state that

[TABLE]

where $\bar{x}$ and $x$ are states corresponding to $(\bar{u},\bar{y})$ and $(u,y)$ , respectively, in the same minimal realization. The “if”-direction in Theorem 3 follows directly from the fact that $G$ is LTI, without adhering to the persistence of excitation assumption. The intuition about the “only if”-direction is sketched in the following. Take any trajectory $\{\bar{u}_{k},\bar{y}_{k}\}_{k=0}^{L-1}$ of $G$ . Clearly, $L$ degrees of freedom in the input are required to choose $\alpha\in\mathbb{R}^{N-L+1}$ such that (3) holds. Additional $n$ degrees of freedom can then be used to attain the internal initial condition $\bar{x}_{0}$ . Since $\{\bar{y}_{k}\}_{k=0}^{L-1}$ is a linear combination of $\{\bar{u}_{k}\}_{k=0}^{L-1}$ and $\bar{x}_{0}$ , this is enough to find an $\alpha$ which satisfies both (3) and (4), and thus (2). Therefore, persistence of excitation of order $L+n$ is required for the equivalence in Theorem 3.

Theorem 3 shows that all trajectories of an unknown LTI system can be constructed from a single persistently exciting trajectory. Equivalently, the vector space of all system trajectories is equal to the range of a data-dependent Hankel matrix. Thus, in a way, the measured input-output trajectory serves as a system representation on its own, without using it explicitly to identify a model. Prior knowledge of the unknown system’s order is only needed implicitly in Theorem 3 through the condition that $u$ has to be persistently exciting of order $L+n$ . Hence, if the amount of available data $N$ is significantly larger than $n$ and the input is persistently exciting of a sufficiently high order, a rough upper bound on $n$ suffices to apply Theorem 3.

As described above, persistence of excitation is necessary for the equivalence in Theorem 3. Note however that it also sets a fundamental limit on the application of Theorem 3: In order to span the space of all trajectories of length $L$ , Theorem 3 requires $N\geq(m+1)(L+n)-1$ or, equivalently, $L\leq\frac{N+1}{m+1}-n$ . Loosely speaking, if $m=1$ , $L$ can only be half as long as $N$ and, with increasing input dimension $m$ , the maximum length $L$ decreases by a factor of $\frac{1}{m+1}$ .

An intuitive solution to overcome this limitation would be to weave several, say $\xi\in\mathbb{N}$ , trajectories of length $L$ together to construct an overall trajectory of length $\xi L$ . This is however not trivial, since the internal states of the separate trajectories have to align at the intersections. In [20, Lemma 3], it is shown that two distinct input-output trajectories can be weaved together if they align over a sufficiently long window at their intersection. The following result is an extension of [20, Lemma 3] to more than two trajectories.

Proposition 4.

Suppose $\{u_{k},y_{k}\}_{k=0}^{N-1}$ is a trajectory of an LTI system $G$ , where $u$ is persistently exciting of order $L+n$ . Then, $\{\bar{u}_{k},\bar{y}_{k}\}_{k=0}^{\tilde{L}-1}$ with $\tilde{L}=\xi L+(1-\xi)n$ , $\xi\in\mathbb{N}$ , is a trajectory of $G$ if and only if there exist $\alpha^{i}\in\mathbb{R}^{N-L+1},i\in\mathbb{I}_{[1,\xi]},$ such that

[TABLE]

Proof.

If. Define $\{\bar{u}^{i}_{k},\bar{y}^{i}_{k}\}_{k=0}^{L-1}$ via

[TABLE]

and note that (6) means that $\{\bar{u}_{k},\bar{y}_{k}\}_{k=0}^{\tilde{L}-1}$ is a stacked version of the sequences $\{\bar{u}^{i}_{k},\bar{y}^{i}_{k}\}_{k=0}^{L-1}$ in the sense that

[TABLE]

According to Theorem 3, the sequences $\{\bar{u}^{i}_{k},\bar{y}^{i}_{k}\}_{k=0}^{L-1}$ are trajectories of $G$ . Further, (7) and (8) imply that, at the transitions between the separate trajectories, they align over windows of length $n$ , i.e.,

[TABLE]

Denote by $\{\bar{x}^{i}_{k}\}_{k=0}^{L-1}$ the state trajectory corresponding to $\{\bar{u}^{i}_{k},\bar{y}^{i}_{k}\}_{k=0}^{L-1}$ in some minimal realization of $G$ . The conditions (10) and (11) imply that, at the transitions between the separate trajectories, the internal states align, i.e., $\bar{x}^{i}_{L}=\bar{x}^{i+1}_{n}$ , and thus, $\{\bar{u}_{k},\bar{y}_{k}\}_{k=0}^{\tilde{L}-1}$ is a trajectory of $G$ .

Only If. Suppose $\{\bar{u}_{k},\bar{y}_{k}\}_{k=0}^{\tilde{L}-1}$ is a trajectory of $G$ . Define $\{\bar{u}^{i}_{k},\bar{y}^{i}_{k}\}_{k=0}^{L-1}$ , $i\in\mathbb{I}_{[1,\xi]}$ , according to (9)-(11) and note that any of these sequences is itself a trajectory of $G$ . Hence, it follows directly from Theorem 3 that there exist $\alpha^{i}\in\mathbb{R}^{N-L+1}$ , $i\in\mathbb{I}_{[1,\xi]},$ such that (6)-(8) hold. ∎

Proposition 4 weaves multiple trajectories $\{\bar{u}^{i}_{k},\bar{y}^{i}_{k}\}_{k=0}^{L-1}$ together to form a single, longer sequence $\{\bar{u}_{k},\bar{y}_{k}\}_{k=0}^{\tilde{L}-1}$ . To make this sequence a trajectory of $G$ , it only needs to be ensured that the shorter trajectories align over at least $n$ steps at their intersections. Note that the number of trajectories $\xi$ can be chosen arbitrarily large and thus, Proposition 4 can be used to construct trajectories of arbitrary length, using a single measured trajectory of finite length. Although we assume for notational simplicity that all trajectories contributing to the overall trajectory are of equal length, the same idea can be applied to weave trajectories of different lengths together. Further, one can straightforwardly employ measurements from multiple experiments of possibly different time horizons.

IV Trajectory-based representation of nonlinear systems

In this section, we extend Theorem 3 to certain classes of nonlinear systems. In particular, we consider the special cases of Hammerstein and Wiener systems. More generally, this allows us to extend Theorem 3 to all systems, which are linear in suitably chosen and known input-output coordinates. During the last decades, there have been many contributions to identify Hammerstein and Wiener systems from data [21, 22]. Our results can be seen as an alternative to the identification of such systems, using a single measured trajectory to represent them.

IV-A Hammerstein systems

A Hammerstein system is a nonlinear system, composed of a static nonlinearity followed by an LTI system, i.e.,

[TABLE]

with a nonlinear function $\psi:\mathbb{R}^{m}\to\mathbb{R}^{\tilde{m}}$ . In the following, we deal only with the case $\tilde{m}=1$ for notational simplicity, but the same ideas can be employed for $\tilde{m}>1$ . We assume that $\psi$ can be written as $\psi(u)=\sum_{i=1}^{r}a_{i}\psi_{i}(u)$ , with $a_{i}$ not all zero, for $r$ known basis functions $\psi_{i}$ . Further, we define the auxiliary input trajectory $\{v_{k}\}_{k=0}^{N-1}$ with components

[TABLE]

The following result uses the fact that (22) can also be viewed as a linear map from $v$ to $y$ .

Proposition 5.

Suppose $\{u_{k},y_{k}\}_{k=0}^{N-1}$ is a trajectory of a Hammerstein system (12), where $v$ from (13) is persistently exciting of order $L+n$ . Then, $\{\bar{u}_{k},\bar{y}_{k}\}_{k=0}^{L-1}$ is a trajectory of (12) if and only if there exists $\alpha\in\mathbb{R}^{N-L+1}$ such that

[TABLE]

where $\{\bar{v}_{k}\}_{k=0}^{L-1}$ is the sequence with components

[TABLE]

Proof.

Define the LTI system

[TABLE]

with input $v$ and output $y$ , with $a^{\top}=\begin{bmatrix}a_{1}&\dots&a_{r}\end{bmatrix}$ , $\tilde{B}=Ba^{\top},\tilde{D}=Da^{\top}$ , and $A,B,C,D$ from (12). Clearly, a sequence $\{\bar{u}_{k},\bar{y}_{k}\}_{k=0}^{L-1}$ is a trajectory of (12) if and only if $\{\bar{v}_{k},\bar{y}_{k}\}_{k=0}^{L-1}$ is a trajectory of (15). Further, using that $v$ is persistently exciting and (15) is controllable since the $a_{i}$ ’s are not all zero, Theorem 3 implies that $\{\bar{v}_{k},\bar{y}_{k}\}_{k=0}^{L-1}$ is a trajectory of (15) if and only if there exists $\alpha\in\mathbb{R}^{N-L+1}$ such that (14) holds, which was to be shown. ∎

For the application of Proposition 5, the basis functions $\psi_{i}$ of $\psi$ have to be known. In practice, it may be adequate to simply choose sufficiently many basis functions, thereby approximating the true ones. Note however that the number of basis functions $r$ enters into the persistence of excitation assumption on the auxiliary input $v$ . To be more precise, for $v$ to be persistently exciting of order $L+n$ , it is necessary that $N\geq(r+1)(L+n)-1$ and hence, Proposition 5 does not allow for arbitrarily many basis functions. Nevertheless, we show in Section V that, for the data-driven simulation problem, Proposition 5 leads to meaningful results even if infinitely many basis functions are chosen implicitly via a kernel function.

IV-B Wiener systems

A Wiener system consists of an LTI system followed by a static nonlinearity, i.e., it is of the form

[TABLE]

with a nonlinear function $\phi:\mathbb{R}^{\tilde{p}}\to\mathbb{R}^{p}$ . Similar to Section IV-A, we consider in the following only the case $\tilde{p}=1$ . To apply the same reasoning as for Hammerstein systems, we assume that $\phi$ is invertible and that its inverse admits a basis function decomposition as $\phi^{-1}(y)=\sum_{i=1}^{q}b_{i}\tilde{\phi}_{i}(y)$ with $q$ known basis functions $\tilde{\phi}_{i}$ . We define an auxiliary output trajectory $\{z_{k}\}_{k=0}^{N-1}$ with components

[TABLE]

which will serve as the output of an equivalent LTI system. The following result is the correspondence of Proposition 5 for the Wiener system case.

Proposition 6.

Suppose $\{u_{k},y_{k}\}_{k=0}^{N-1}$ is a trajectory of a Wiener system (16), where $u$ is persistently exciting of order $L+n$ . Then, $\{\bar{u}_{k},\bar{y}_{k}\}_{k=0}^{L-1}$ is a trajectory of (16) if there exists $\alpha\in\mathbb{R}^{N-L+1}$ such that

[TABLE]

where $\{\bar{z}_{k}\}_{k=0}^{L-1}$ is the sequence with components

[TABLE]

Proof.

This can be shown using similar arguments as in the proof of Proposition 5. Therefore, the proof is omitted. ∎

Contrary to the Hammerstein case, the above result does not pose any limit on the maximal number of basis functions we may choose. However, they represent the inverse of $\phi$ and are thus more difficult to select in applications. Further, the “only if”-direction does in general not hold for Wiener systems since the map $u\mapsto z$ is not necessarily linear.

Remark 7.

From the perspective of Koopman operator theory, there has recently been a renewed interest in viewing nonlinear systems as linear systems in lifted state coordinates [23]. In a similar fashion, Propositions 5 and 6 can be combined directly to provide trajectory-based representations of nonlinear systems, which are linear in suitable higher-dimensional input-output coordinates. Even if such coordinates do not exist or are not known, one may in practice simply choose sufficiently many basis functions to approximate the unknown nonlinear system. In Section V, we illustrate the effectiveness of this approach for the data-driven simulation problem. Note that considering systems which are linear in suitable input-output coordinates is more restrictive than dealing with systems which are linear in certain lifted state coordinates. On the other hand, in contrast to many methods related to Koopman operator theory, the present setting does not require state measurements, but only input-output data.

V Data-driven simulation

The data-driven simulation problem is concerned with the computation of an unknown system’s output resulting from the application of a given input, using no model but only a previously measured input-output trajectory. Its solution is described in the behavioral context in [24]. Loosely speaking, the idea is to fix $\bar{u}$ in (2) to first solve $\bar{u}=H_{L}(u)\alpha$ for $\alpha$ , in order to then compute the new predicted output $\bar{y}=H_{L}(y)\alpha$ . To fix a unique such output, initial conditions have to be specified [24]. Since a state-space model is not available, we consider an initial input-output trajectory over a length of at least $n$ , since this induces a unique initial state in some minimal realization. The following is the main result of [24].

Proposition 8.

Suppose $\{u_{k},y_{k}\}_{k=0}^{N-1}$ is a trajectory of a discrete-time LTI system $G$ , where $u$ is persistently exciting of order $L+n$ . Let $\{\bar{u}_{k},\bar{y}_{k}\}_{k=0}^{L-1}$ be an arbitrary trajectory of $G$ . If $\nu\geq n$ , then there exists an $\alpha\in\mathbb{R}^{N-L+1}$ to

[TABLE]

Further, it holds that $\bar{y}=H_{L}(y)\alpha$ .

Proof.

This follows directly from the corresponding result in the behavioral framework [24, Proposition 1]. ∎

The main idea of Proposition 8 is that the input $\{\bar{u}_{k}\}_{k=\nu}^{L-1}$ together with the initial trajectory $\{\bar{u}_{k},\bar{y}_{k}\}_{k=0}^{\nu-1}$ fixes a vector $\alpha$ which can be used to uniquely predict the remaining elements of $\bar{y}$ . The condition $\nu\geq n$ means that $\nu$ is an upper bound on the order $n$ of $G$ and implies that $\{\bar{u}_{k},\bar{y}_{k}\}_{k=0}^{\nu-1}$ specifies a unique initial condition for the internal state.

The practical application of Proposition 8 is illustrated in Algorithm 9. Although the classical simulation problem is commonly approached using a model, it can be solved in the proposed trajectory-based framework using a single measured input-output trajectory. Several extensions of Proposition 8 have been suggested to account for noise [25], to simulate systems in closed loop [26], and to find feedforward controllers [24], but nonlinear systems have not been addressed in the literature. Due to noise or numerical inaccuracies, the system of equations (19) can usually not be solved exactly. Instead, $\alpha$ can be computed via a simple least-squares optimization problem. Denote

[TABLE]

In practice, the system of equations (19) can be replaced by

[TABLE]

In case that $H_{L,\nu}(u,y)$ contains noisy data, a solution $\alpha$ with small norm reduces the influence of the noise on the simulation accuracy. Therefore, it is desirable to penalize the norm of $\alpha$ , leading to the regularized least-squares problem

[TABLE]

where $\lambda>0$ is a regularization parameter. As an alternative, one may consider general quadratic regularization terms $\lVert\alpha\rVert_{P}^{2}$ with $P\succ 0$ or an $\ell_{1}$ -regularization. In the following, we show how kernel methods can be employed to derive an appealing reformulation of Problem (21) for the class of nonlinear systems considered in Section IV.

Let us consider a Hammerstein-Wiener system of the form

[TABLE]

According to Propositions 5 and 6, the trajectory space of (22) is spanned by Hankel matrices containing data in the lifted coordinates $v$ and $z$ , as defined in (13) and (17). Thus, for the system class (22), the optimization problem (21) takes the form

[TABLE]

In the following, we write $\psi^{r}(u_{k})$ and $\tilde{\Phi}^{q}(y_{k})$ for the stacked inputs $v_{k}$ and outputs $z_{k}$ at time $k$ , respectively. Note that Problem (23) does not depend explicitly on these vectors, but only on their scalar product. This allows for an application of the kernel trick, which can be used to compute such inner products implicitly [27]. Define kernel functions as

[TABLE]

and note that (23) depends only on those kernels, but not explicitly on the basis functions $\psi^{r},\tilde{\Phi}^{q}$ . Thus, for the implementation, it suffices to select a kernel, which then implicitly implies a set of basis functions for the nonlinearities $\psi$ and $\tilde{\Phi}$ . For instance, if $m=1$ , a squared exponential kernel of the form

[TABLE]

for some hyperparameter $\sigma>0$ , corresponds to an infinite set of basis functions. If the set of basis functions corresponding to the chosen kernel contains all basis functions of $\psi$ and $\tilde{\Phi}$ , then the data-driven simulation problem can be solved exactly for the considered class of nonlinear systems. In fact, as we will see in the following example, the data-driven simulation problem can be solved accurately, even if the data is affected by noise and the true basis functions are only represented approximately by the chosen kernel.

Example 10.

We consider a Hammerstein system (12) with nonlinearity $\psi(u)=\sin(u)$ and the system matrices

[TABLE]

We assume that the system order $n=4$ is known, i.e., $\nu=4$ . From an open-loop simulation, a trajectory $\{u_{k},y_{k}\}_{k=0}^{N-1}$ of length $N=1000$ is collected, where the output is subject to multiplicative measurement noise with signal-to-noise ratio $5\%$ . Problem (23) with a squared exponential kernel with $\sigma=1$ is used to compute the output $\bar{y}$ resulting from a uniformly distributed random input $\bar{u}$ in the interval $[-0.3,0.3]$ of length $L=50$ with zero initial conditions. The regularization parameter is chosen as $\lambda=10$ . Figure 1 shows the resulting output estimate as well as the true output for comparison. It can be seen that the estimate is good, considering the noise level. If the regularization term is omitted, i.e., $\lambda=0$ , or a fixed number of polynomial basis functions is chosen, then the estimation accuracy deteriorates significantly, even for smaller noise levels.

VI Conclusion

This paper described a purely data-driven framework for system analysis and control. All trajectories of an unknown system can be constructed from a single measured trajectory and thus, this trajectory captures all the required information needed for analysis and controller design, without explicit identification of a model. After describing this result in the classical control framework, we extended it to certain classes of nonlinear systems and we applied this extension to the data-driven simulation problem via kernel methods. Future research should further explore applications of the nonlinear extension presented in Section IV to data-driven system analysis and control problems, as well as connections to more elaborate results from the literature on kernel methods.

Bibliography27

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] L. Ljung, System Identification: Theory for the User . Prentice-Hall, Englewood Cliffs, NJ, 1987.
2[2] R. Sutton and A. Barto, Reinforcement Learning: An Introduction . Cambridge, MA, MIT Press, 1998.
3[3] B. Recht, “A tour of reinforcement learning: The view from continuous control,” Annual Review of Control, Robotics, and Autonomous Systems , 2018.
4[4] M. C. Campi, A. Lecchini, and S. M. Savaresi, “Virtual reference feedback tuning: a direct method for the design of feedback controllers,” Automatica , vol. 38, no. 8, pp. 742–753, 2002.
5[5] H. Hjalmarsson, M. Gevers, S. Gunnarsson, and O. Lequin, “Iterative feedback tuning: theory and applications,” IEEE Control Systems Magazine , vol. 18, no. 4, pp. 26–41, 1998.
6[6] S. Dean, H. Mania, N. Matni, B. Recht, and S. Tu, “On the sample complexity of the linear quadratic regulator,” Foundations of Computational Mathematics , 2019, https://doi.org/10.1007/s 10208-019-09426-y.
7[7] J. Umenberger and T. B. Schön, “Learning convex bounds for linear quadratic control policy synthesis,” in Advances in Neural Information Processing Systems 31 , S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett, Eds. Curran Associates, Inc., 2018, pp. 9584–9595.
8[8] J. C. Willems, P. Rapisarda, I. Markovsky, and B. De Moor, “A note on persistency of excitation,” Systems & Control Letters , vol. 54, pp. 325–329, 2005.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

A trajectory-based framework for data-driven

Abstract

I Introduction

II Setting

Definition 1**.**

Definition 2**.**

III Trajectory-based representation of linear systems

Theorem 3**.**

Proof.

Proposition 4**.**

Proof.

IV Trajectory-based representation of nonlinear systems

IV-A Hammerstein systems

Proposition 5**.**

Proof.

IV-B Wiener systems

Proposition 6**.**

Proof.

Remark 7**.**

V Data-driven simulation

Proposition 8**.**

Proof.

Example 10**.**

VI Conclusion

Definition 1.

Definition 2.

Theorem 3.

Proposition 4.

Proposition 5.

Proposition 6.

Remark 7.

Proposition 8.

Example 10.