Koopman Operators for Generalized Persistence of Excitation Conditions for Nonlinear Systems
Nibodh Boddupalli, Aqib Hasnain, Sai Pushpak Nandanoori, Enoch, Yeung

TL;DR
This paper develops new identifiability conditions for nonlinear biological models using Koopman operators, linking data spectral properties to model predictability, and demonstrates their application with a synthetic gene circuit.
Contribution
It introduces a novel framework for nonlinear system identifiability based on Koopman operators and spectral data analysis, guiding experimental design for biological modeling.
Findings
Identifiability depends on dataset spectral rank in Koopman space.
Rank degeneracy leads to overfitting and poor model predictions.
Proposes criteria for designing experiments to ensure model predictability.
Abstract
It is hard to identify nonlinear biological models strictly from data, with results that are often sensitive to experimental conditions. Automated experimental workflows and liquid handling enables unprecedented throughput, as well as the capacity to generate extremely large datasets. We seek to develop generalized identifiability conditions for informing the design of automated experiments to discover predictive nonlinear biological models. For linear systems, identifiability is characterized by persistence of excitation conditions. For nonlinear systems, no such persistence of excitation conditions exist. We use the input-Koopman operator method to model nonlinear systems and derive identifiability conditions for open-loop systems initialized from a single initial condition. We show that nonlinear identifiability is intrinsically tied to the rank of a given dataset's power spectral…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Koopman Operators for Generalized Persistence of Excitation Conditions for Nonlinear Systems
Nibodh Boddupalli, Aqib Hasnain, Sai Pushpak Nandanoori, and Enoch Yeung Please address correspondence to Nibodh Boddupalli at [email protected] Boddupalli, Aqib Hasnain, and Enoch Yeung are with the Department of Mechanical Engineering and the Center for Control,Dynamical Systems, and Computation at the University of California Santa Barbara, Santa Barbara, CA 93106, USA. Sai Pushpak Nandanoori is with the Energy and Environment Directorate at the Pacific Northwest National Laboratory.
Abstract
It is hard to identify nonlinear biological models strictly from data, with results that are often sensitive to experimental conditions. Automated experimental workflows and liquid handling enables unprecedented throughput, as well as the capacity to generate extremely large datasets. We seek to develop generalized identifiability conditions for informing the design of automated experiments to discover predictive nonlinear biological models. For linear systems, identifiability is characterized by persistence of excitation conditions. For nonlinear systems, no such persistence of excitation conditions exist. We use the input-Koopman operator method to model nonlinear systems and derive identifiability conditions for open-loop systems initialized from a single initial condition. We show that nonlinear identifiability is intrinsically tied to the rank of a given dataset’s power spectral density, transformed through the lifted Koopman observable space. We illustrate these identifiability conditions with a simulated synthetic gene circuit model, the repressilator. We illustrate how rank degeneracy in datasets results in overfitted nonlinear models of the repressilator, resulting in poor predictive accuracy. Our findings provide novel experimental design criteria for discovery of globally predictive nonlinear models of biological phenomena.
I INTRODUCTION
Many physical systems exhibit phenomena with unknown governing dynamics. These systems can be high dimensional and partially modeled or completely unstudied. Self-assembling complex systems, biological networks, Internet-of-Things infrastructure models, smart cities, and social networks are all examples of dynamically evolving systems that frequently are represented by data. Identifying globally predictive models from data requires data collection strategies that capture all the modes of a system. For example, in biological network modeling and discovery, designing an informative set of experimental conditions can produce global biological models that capture multiple modes of dynamics, including invariant subspaces and multiple equilibria. Unfortunately, there are few identifiability metrics for quantifying the richness, or the informativity, of datasets of nonlinear systems. Further, a dataset may only elicit linear modes of a nonlinear system, even though nonlinear modes may lay dormant. In these scenarios, the accuracy of a model discovery algorithm is often confounded with the informativity or richness of a dataset used to train the model.
Nonlinear systems lack generalized criteria for quantifying the information content of a given dataset. Even if a nonlinear model is deemed globally identifiable, this may be under the assumption of continuous noise-free sampling (perfect data) of all states. In linear systems theory, the informativity of a dataset is characterized by its ability to persistently excite all the modes in the transfer function. These conditions, referred to as persistence of excitation conditions, prescribe rank requirements on either time-series or spectral signals. First introduced in [1], PE is “not (yet) consistently defined” [2]. There are two classical approaches to modeling persistence of excitation [1, 2] . Firstly, defining criteria or spectral properties of a control input such that it can elicit a response from the system, thereby uniquely specifying the frequency response model of the system ([3, 4]. Secondly, requiring that a control input that is non-zero in all channels, at least once throughout the time course ([5, 6]). Persistence of excitation conditions can also inform the appropriate construction of a universally exciting input signal, e.g. the construction of the Pseudo-Random Binary Sequence [7], which persistently excites all linear systems. It is thus desirable to establish a modeling framework to link classical results in linear system identification [3, 4, 6] for the treatment of nonlinear systems.
Recently, an emerging set of operator-theoretic tools have gained traction, centered on discovering linear representations of nonlinear dynamical systems in a lifted space of coordinates [8, 9, 10]. Originally derived for Hamiltonian systems [11], numerical [12] and theoretical [13] techniques for Koopman operator theory enable input [14, 15, 16, 17, 18, 19] and spectral modeling of nonlinear systems [9, 20], deep-learning based models of nonlinear phenomena [21, 22, 23], and study of chaotic and uncountable spectra arising in complex nonlinear phenomena [18, 24, 16, 8, 9]. In this work, we use the Koopman operator method (Sections II and III) to lift nonlinear systems into a linear space and thereby derive generalized persistence of excitation conditions (Section V), following a similar approach taken in linear systems theory. We illustrate how generalized PE conditions inform the design of initial conditions for simulated experiments on the three node repressilator (Section VI).
II KOOPMAN OPERATOR THEORY
In 1931, Koopman showed the existence of a coordinate transformation and a corresponding unitary operator for any Hamiltonian (non-dissipative) dynamical systems. He showed that the operator and observable could be used to represent the time evolution of the underlying Hamiltonian system as a linear time-invariant system. This idea has been developed and generalized to other classes of nonlinear systems in recent years ([8, 9, 13], etc.), in the effort to find global representations instead of local approximations.
A discrete-time nonlinear dynamical system with state at time under can be represented as:
[TABLE]
Then functions are called ”observables,” and represent a mapping from the state-space into a lifted set of coordinates. For example, may be a scalar observable comprised of nonlinearities, weighted combinations of the state, such as . The true Koopman observables are often approximated in numerical Koopman learning algorithms like extended DMD (eDMD), deepDMD, and Hankel dynamic mode decomposition using functions which define a generic basis on a Hilbert function space, e.g. radial basis functions (RBFs), Hermite polynomials, or a combination of such basis functions in deepDMD [21].
For an analytical function , we know from [14] that there exists a countably infinite or finite dimensional Koopman operator that acts linearly on the observable under function composition, satisfying the Koopman equation
[TABLE]
This equation states that the action of the Koopman operator on an observable function is equivalent to the action of the observable function on the vector field of the state.
From the above transient behaviour, one can see that this itself is a dynamical system in the space of observables whose time-evolution is governed by an infinite-dimensional operator that preserves the essence of (1). The transformation from (1) to (2) is frequently referred to as ”lifting” since the observable usually has dimension This is not always the case, but [23] shows that the observables define an expansion of the nonlinearities in the governing equations.
II-A Finite Dimensional Approximations
III Koopman Input Output Theory
Our study of the dynamical system (1), ultimately will require treating the initial condition as an input signal (via a Kronecker delta function) to the system. For clarity, we introduce the notion of an input-Koopman operator here, as well as the notion of the Koopman transfer function of a nonlinear system. Given a discrete-time nonlinear system with analytic vector field and control input , we write the dynamics of the system as:
[TABLE]
[16] showed that an observable can lift the above system to such that:
[TABLE]
For an exogenous memoryless input, [15] demonstrated that the above can be modified by splitting into components and , where is a stacked vector valued observable of all scalar valued observable functions from that do not depend on , and is a stacked vector valued observable of all remaining terms. This results in the decomposed representation
[TABLE]
where and is a stacked vector of all multivariate terms of and . This is a representation of the nonlinear system dynamics that is linear in the lifted state observable and the lifted input-state mixture observable . We thus can define the Koopman discrete time transfer function as
[TABLE]
where we have assumed that The zeros and the poles of the transfer function are defined in the usual manner, but with respect to the transformations on the state and input . A continuous time analogue of the discrete-time Koopman operator is readily derived using the Koopman generator, rather than the Koopman operator. While the discrete-time formulation is chosen in this work, a similar treatment of the sequel can be used to develop persistence of excitation conditions for continuous-time nonlinear dynamical systems.
IV Problem Statement
In classical system identification theory, the design of an input or initial condition to guarantee identifiability is framed in terms of PE. One of the merits of a Koopman operator representation is that it provides a representation of a nonlinear system in linear coordinates. This allows us to rigorously formulate the question of how to design an initial condition that guarantees identifiability of the model. Specifically, we consider two problems:
Problem 1** (Identifiability of a Nonlinear System with Fixed Initial Conditions).**
Given a discrete time-invariant autonomous nonlinear system of the form
[TABLE]
determine if the model can be identified from the continuously sampled data stream and the set of initial conditions .
This problem is difficult to solve, as it couples the problem of nonlinear function regression of and with the requirement of characterizing the richness of a set of fixed initial conditions. This problem is equivalent to the nonlinear state-space realization problem, given a single initial condition or set of initial conditions .
The other variant of this problem is where the initial conditions are design parameters in an experiment and can be set by the user. This scenario is especially common in synthetic biology where experiments are conducted with varying initial conditions of concentrations, pH, etc. and hence is of interest from a design of experiment (DoE) standpoint.
Problem 2** (Identifiability of a Nonlinear System with Designed Initial Conditions).**
Given a continuous time-invariant autonomous nonlinear system of the form and the design choice of initial conditions
[TABLE]
Find that guarantees the model can be identified from the continuously sampled data stream .
This variant of the problem presumably has more degrees of freedom, namely to be precise. However, the challenge is to relate identifiability of the model to the initial condition, which for an unknown is inherently difficult. We now use the input-Koopman framework derived in the previous sections to reformulate the problem with a linear Koopman representation. The recasting of the problem will permit extension of classical identifiability notions such as the design of PE or SR input signals or initial conditions.
V Persistence of Excitation Conditions for Nonlinear Systems with Koopman Operators
Our contribution in this paper is to formulate the problem of identifiability with fixed and designed initial conditions using the method of Koopman. Once we have formulated the problem in the Koopman operator theoretic framework, we derive computational certificates for PE for fixed or designed initial conditions. This leads to an algorithm for selection of the initial conditions of a dynamical system given In both scenarios, we treat the system’s initial condition as an input. Given a nonlinear discrete-time dynamic system of the form
[TABLE]
where and the Koopman equation for the corresponding system defines the action of an operator on a vector valued observable acting on the state , namely
[TABLE]
Then, (3) can be represented with initial state as an input:
[TABLE]
We thus model the initial condition as a Kronecker delta input to the dynamical system. Accordingly, we will abuse notation slightly and define the input of the system as , which yields the classic input-state Koopman representation for a nonlinear system
[TABLE]
Notice that the state vector is a vector observable function on which the Koopman operator acts as a linear operator to update the state. In essence, by defining an appropriate ’lifting’ or ’observable’ function, we can treat the problem as a classical linear identifiability problem, with additional caveats imposed by the presence of the nonlinearities in and . As there are multiple definitions in the literature for PE [3, 6], we chose a definition that enables relating identifiability to the initial condition of a transfer function. We pose an extension of the definition from [3] which is that of SR from [6], but in the Koopman framework.
Definition 1** (Persistence of Excitation).**
A quasi-stationary discrete time observable is said to be persistently exciting of order if the covariance matrix is positive definite:
[TABLE]
where is the auto-covariance of formulated as:
[TABLE]
and denotes the expectation operator. The above definition has been referred to as sufficient richness in [6] as well. Positive semi-definiteness of can easily be proved. By substituting the of (10) in (9) we obtain:
[TABLE]
Defining \bm{v}\triangleq\big{[}\bm{\varphi}(\bm{u}_{t+1})^{T},\cdots,\bm{\varphi}(\bm{u}_{t+N})^{T}\big{]}^{T}, the above becomes:
[TABLE]
Let we have
[TABLE]
Hence, is positive semi-definite. From the Herglotz Theorem, any function defined on integers is positive semi-definite if and only if it has a Bochner representation (can be represented as the inverse Fourier transform) of a unique Spectral Measure on a circle:
[TABLE]
If is a scalar, is its power spectrum as pointed out in [6] and [5]. Alternate definitions have been mentioned in literature but the following definition and result elucidates how PE of an initial condition for a dynamical system can be related to the Koopman transfer function.
Definition 2**.**
Given system (1), we say an initial condition , treated as a Kronecker delta signal is persistently exciting of Koopman-order if it is persistently exciting for a state-inclusive Koopman operator of order satisfying
[TABLE]
Theorem 1**.**
The initial condition is persistently exciting for the nonlinear dynamical system (1) if and only if the Fourier transform of the auto-covariance matrix
[TABLE]
has distinct frequencies where does not vanish, i.e. positive spectral lines.
Proof.
Let the dimension of the lifted space and corresponding Koopman vector valued observable be denoted as . We write the initial condition as an input to the nonlinear dynamical system (1), of the form . We suppose that is state inclusive [23]. This implies that:
[TABLE]
i.e. the PE of an input signal for the system (13) implies the PE of the system (1), since the flow and vector field of system (13) is a projection of the flow and vector field of the system (1), respectively. It suffices to demonstrate the equivalence of the PE of , up to order , to the linear independence of spectral lines for the power spectral density
We know that the spectral measure of would be a positive semi-definite, symmetric matrix of bounded measures, symmetric over all since the observable elements are all real valued. We have by the definition of the spectral power measure that
[TABLE]
Since the spectral measure is bounded almost everywhere, the monotone convergence theorem allows us to write from (12)
[TABLE]
After swapping the finite sum with limiting sums
[TABLE]
and distributing sums across the integrand, we obtain
[TABLE]
Defining filters , the above becomes:
[TABLE]
Thus, positive definiteness of only holds if and only if is positive definite. But positive definiteness of holds if and only if there are at least frequencies in which the integrand does not vanish, namely frequencies where
[TABLE]
is in the orthogonal complement of the null space of This proves the result. ∎
From our simulation studies, we found that most initial conditions are PE up to order , for their respective Koopman operator and dynamical system. If we consider the design problem of selecting an initial condition such that is PE, or a set of are PE of Koopman-order , we can express the problem in terms of the positive definiteness of adjusting the signal or more generally the timing of the input to ensure the positive definiteness of the auto-covariance matrix. Alternatively, when working with a collection of initial condition signals it is straightforward to visualize the power spectrum using the transformed signal Initial conditions can be selected or drawn randomly from the phase space until a suitable collection of initial conditions and spectral lines are identified.
VI Persistence of Excitation of Different Initial Conditions for a Repressilator Genetic Circuit Model
VI-A The Repressilator
The repressilator is a classical genetic circuit used in synthetic biology to implement circadian rhythms or synthetic oscillations. The architecture is that of a 3 node Goodwin oscillator, with three genes that produce proteins or mRNA that serve to repress the downstream or target gene’s function. Each gene represses its downstream target, with the final gene repressing the original gene to form a cycle of negative feedback. When the gain of the individual genes are balanced with respect to each other [25], the genetic circuit admits a limit cycle in the phase portrait and a single basin of attraction surrounding the origin.
There are many models of the repressilator, with varying degrees of complexity and intricacy to capture the underlying biophysical dynamics. We consider a simplified three dimensional model from the first experimental implementation of the repressilator [26], that captures the limit cycle and basin of attraction, to study the role of the initial condition in PE of the nonlinear system. Consider the model:
[TABLE]
An example set of commonly used initial concentrations is nM for LacI, -cI, TetR, mLacI, m-cI, and mTetR. We model the degradation and dilution rate of all proteins as a lump term with average kinetic rate . Fig. 1(a) (and 2(a)) shows simulations (solid lines) of the repressilator from different initial conditions. The repressilator exhibits a strongly attracting limit cycle and a single unstable equilibrium point at the origin. Several initial conditions in the phase space mapped through the observable function have low gain, specifically those within (unit ball in ). These observable functions have low gain power spectra with spectral lines on the magnitude of numerical noise. We noted that these initial conditions, when mapped through higher order polynomials, lead to overfitting due to the vanishing of the signal in higher-order terms. To illustrate, has been obtained using eDMD with third-order Hermite polynomials.
We considered training the repressilator model with initial conditions drawn from two different regions of the phase space. First, initial conditions within the unit disc centered at the unstable equilibrium point (solid lines in Fig. 1(b)) and secondly, initial conditions outside the unit disc centered at the same point (solid lines in Fig. 2(b)). We simulated using 6 initial conditions from one basin of attraction and evaluated test predictions using initial conditions from another basin. For example, we would train within the unit disc (Fig. 1(b)) or outside it (Fig. 2(b)) and evaluate prediction accuracy of the Koopman operator for trajectories initiated outside that basin of attraction (Fig. 1(c) and 2(c) respectively) using a norm based error between the predicted and simulated trajectories.
Notice the rank is greater for the power spectrum in Figure 1 than in Figure 2, which correlates with the failure to predict long-term global behavior in Figure 2. Interestingly, the rank of the power spectrum was not as high as the dimension of the list of dictionaries, indicating that the true Koopman observable space is of a lower dimension than the dimension of dictionary functions. Future work will investigate the iterative processes for identifying the minimal set of Koopman observables, their relationship to HankelDMD [27], as well as the formal design of automated biological experiments to ensure global predictive accuracy of discovered Koopman models.
VII ACKNOWLEDGMENTS
The authors would also like to thank Igor Mezic, Robert Egbert, Bassam Bamieh, Sai Pushpak, Sean Warnick, and Umesh Vaidya for stimulating conversations. Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the Defense Advanced Research Projects Agency (DARPA), the Department of Defense, or the United States Government. This work was supported partially by a Defense Advanced Research Projects Agency (DARPA) Grant No. DEAC0576RL01830 and an Institute of Collaborative Biotechnologies Grant.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] K.-J. Åström and B. Torsten, “Numerical identification of linear dynamic systems from normal operating records,” IFAC Proceedings Volumes , vol. 2, no. 2, pp. 96–111, 1965.
- 2[2] A. Padoan, G. Scarciotti, and A. Astolfi, “A geometric characterization of the persistence of excitation condition for the solutions of autonomous systems,” IEEE Transactions on Automatic Control , vol. 62, no. 11, pp. 5666–5677, 2017.
- 3[3] L. Ljung, System identification: theory for the user. Prentice Hall, 1987.
- 4[4] A. K. Tangirala, Principles of system identification: theory and practice . CRC Press, 2014.
- 5[5] S. Boyd and S. S. Sastry, “Necessary and sufficient conditions for parameter convergence in adaptive control,” Automatica , vol. 22, no. 6, pp. 629–639, 1986.
- 6[6] E.-W. Bai and S. S. Sastry, “Persistency of excitation, sufficient richness and parameter convergence in discrete time adaptive control,” Systems & control letters , vol. 6, no. 3, pp. 153–163, 1985.
- 7[7] O. Nelles, Nonlinear system identification: from classical approaches to neural networks and fuzzy models . Springer Science & Business Media, 2013.
- 8[8] I. Mezić and A. Banaszuk, “Comparison of systems with complex behavior,” Physica D: Nonlinear Phenomena , vol. 197, no. 1-2, pp. 101–133, 2004.
