Stability Analysis of Reservoir Computers Dynamics via Lyapunov Functions
Afroza Shirin, Isaac S. Klickstein, Francesco Sorrentino

TL;DR
This paper applies Lyapunov functions to analyze the nonlinear stability of reservoir computers, identifying regions of stability and linking stability to the polynomial structure of the reservoir dynamics.
Contribution
It introduces a Lyapunov-based method for stability analysis of reservoir computers and analytically determines stability regions for both continuous and discrete systems.
Findings
Training error is lower within the stability region.
Stability is influenced by the polynomial structure of the reservoir dynamics.
Nonzero coefficients for odd and even powers are important for polynomial dynamics.
Abstract
A Lyapunov design method is used to analyze the nonlinear stability of a generic reservoir computer for both the cases of continuous-time and discrete-time dynamics. Using this method, for a given nonlinear reservoir computer, a radial region of stability around a fixed point is analytically determined. We see that the training error of the reservoir computer is lower in the region where the analysis predicts global stability but is also affected by the particular choice of the individual dynamics for the reservoir systems. For the case that the dynamics is polynomial, it appears to be important for the polynomial to have nonzero coefficients corresponding to at least one odd power (e.g., linear term) and one even power (e.g., quadratic term).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Stability Analysis of Reservoir Computers Dynamics via Lyapunov Functions
Afroza Shirin
Mechanical Engineering Department, University of New Mexico, Albuquerque, NM 87131
Isaac S. Klickstein
Mechanical Engineering Department, University of New Mexico, Albuquerque, NM 87131
Francesco Sorrentino
Mechanical Engineering Department, University of New Mexico, Albuquerque, NM 87131
Abstract
A Lyapunov design method is used to analyze the nonlinear stability of a generic reservoir computer for both the cases of continuous-time and discrete-time dynamics. Using this method, for a given nonlinear reservoir computer, a radial region of stability around a fixed point is analytically determined. We see that the training error of the reservoir computer is lower in the region where the analysis predicts global stability but is also affected by the particular choice of the individual dynamics for the reservoir systems. For the case that the dynamics is polynomial, it appears to be important for the polynomial to have nonzero coefficients corresponding to at least one odd power (e.g., linear term) and one even power (e.g., quadratic term).
While nonlinearity appears to be a fundamental component of reservoir computers, not much research has been performed to analyze stability of the nonlinear dynamics of these systems. In this paper, we use a Lyapunov design method to estimate the basin of attraction of a fixed point for the dynamics of a generic reservoir computer. Our nonlinear stability analysis unveils a trade-off between the need for global stability, which is achievable by linear dynamics alone, and the need for higher-order terms of the dynamics, which could in turn compromise stability.
I Introduction
A reservoir computer (RC) is a complex nonlinear dynamical system that is used for processing and analyzing empirical data, see e.g. jaeger2001echo ; schrauwen2007overview ; natschlager2002liquid ; maass2002real ; martinenghi2012photonic ; brunner2013parallel ; nakajima2015information ; hermans2015photonic ; vinckier2015high ; duport2016fully ; larger2017high , modeling of complex dynamical systems suykens2012artificial , speech recognition crutchfield2010introduction , learning of context free and context sensitive languages rodriguez2001simple ; gers2001lstm , the reconstruction and prediction of chaotic attractors lu2018attractor ; zimmermann2018observing ; antonik2018using ; jaeger2004harnessing ; pathak2017using ; pathak2018model , image recognition jalalvand2018application , and control of robotic systems graves2004biologically ; robinson1994application ; lukovsevivcius2012reservoir .
A typical RC consists of a set of nodes coupled together to form a network. Each node of the RC evolves in time in response to an input signal that is externally provided to the reservoir. An output signal is then generated from the time evolutions of the RC nodes. In a RC, the output connections (those that connect the RC nodes to the output) are trained to produce a best fit between the output signal and a training signal related to the original input signal. On the other hand, the connections between the nodes of the reservoir are constant parameters of the system. As a result, RCs are easier to analyze than other machine learning tools for which all the connections are typically trained.
The functions of RCs mainly depend on two factors; (i) nonlinearity of the nodal dynamics which is needed to process the information in the input signal and (ii) linear memory to boost the excitability of the RC dynamics dambre2012information . Though earlier works have shown that maximizing linear memory is important in information processing jaeger2002short ; ganguli2008memory ; buonomano2009state ; white2004short , more recent works have shown that the performance of a reservoir computer is related to consideration of both factors (i) and (ii) verstraeten2007experimental ; inubushi2017reservoir ; marzen2017difference . In addition, the performance of a reservoir computer is also affected by a number of other factors, including the reservoir adjacency matrix, i.e., the strengths of the connections between the RC nodes, and the dynamic range of the input signals verstraeten2009quantification .
From linear systems theory, a dynamical system is reliable and safe when it is stabilizable around some operating point sontag2013mathematical . Previous research has used linear stability to assess the stability of RCs around this operating point marcus1989stability ; tanaka1995stability ; belair1996frustration . However, as the RC dynamics requires nonlinearity for its proper operation, a linear stability analysis of the RC dynamics around a specific point is not sufficient. This motivates us to develop a nonlinear stability analysis, based on Lyapunov functions, which we use to characterize the basin of attraction of the desired operating point. As assessing stability of the nonlinear system when forced by an external stimulus is contingent on the particular stimulus provided, we characterize nonlinear stability of the unforced RC dynamics. If the desired operating point is found to be globally asymptotically stable, then stability is independent of the particular stimulus, as long as it is bounded.
We compute a constant -radius region around the operating point such that the dynamics of the system remains bounded inside this region. This can be done by choosing the parameters of the system and the type of nonlinearity, with the goal of possibly enhancing the performance of the reservoir computer. We consider different types of nonlinear dynamics at the network nodes, e.g., polynomial, in either continuous time or discrete time. This differs from the common approach in the literature, where the nodal dynamics is chosen to have a squashing type nonlinearity (in most cases a sigmoid function, e.g., ) so that the states of the nonlinear system always remain bounded inside some region liu2001stability ; saul2000attractor ; yang2014exponential . (A squashing function is defined as a function that is monotonic and bounded within a small range. For example, the function squashes the argument to the interval .)
Few theoretical works have investigated how the underlying stability of a nonlinear RC affects its performance. Reference ganguli2008memory showed that the total memory capacity, a measure of performance, is related to the size of the network. This analysis did not consider how the underlying stability of the system is related to the total memory capacity. In Ref. verstraeten2010memory , an optimal parameter setting for the reservoir was studied, but no direct relation between the dynamic properties in terms of nonlinear stability and the reservoir performance was provided. Previous work verstraeten2009quantification found that the performance of the RC was improved when the condition number of the Jacobian of the reservoir dynamics was small. The references listed herein, and others, motivated us to perform a rigorous dynamical investigation of the nonlinear stability of RCs.
In this paper, for a given nonlinear RC, we determine the -region of stability, where is the radius around a fixed point and is the set of parameters of the reservoir. We use Lyapunov design methods haddad2011nonlinear to find the -region where a nonlinear reservoir computer is stable, safe, reliable and its performance is predictable. Lyapunov design methods are used widely in controls engineering to design controllers that achieve qualitative objectives, such as stabilizing a system or maintaining a system’s state in a desired operating range sontag2013mathematical . To the best of our knowledge, the use of a Lyapunov-based design approach with respect to the performance of a reservoir computer is novel. In Ref. perkins2002lyapunov , a Lyapunov function has been used to design the controller of a reinforcement learning system, but the paper does not show how stability of the RC affects its performance. In this article, first we assume that the input signal is normalized and scaled properly, the eigenvalues of the adjacency matrix satisfy certain constraints, and second we design the -region of stability by using a Lyapunov design method. Our nonlinear stability analysis provides insight into the effects of the RC parameters on its performance.
In Sec. II we lay out the general theory to assess the basin of attraction within which the RC dynamics is stable for both continuous-time dynamics and discrete-time dynamics. In Sec. III we investigate the relation between our stability predictions and the RC performance in terms of the computed training error. Finally, conclusions are presented in Sec. IV.
II Methods
II.1 Reservoir Computer with Continuous Time Dynamics
We consider the dynamics of a reservoir computer as continuous-time carroll2019network ,
[TABLE]
and the unforced (without input) reservoir dynamics,
[TABLE]
where denotes the state of node at time , is the nonlinear nodal dynamics at node , the adjacency matrix indicates the coupling from node to node , is an input signal and is a vector describing the coupling of input signal with node . We assume that is a (linearly) stable fixed point of the system in Eq. (2) and the input signal in Eq. (1) is normalized to have zero mean and standard deviation equal to one.
II.1.1 Lyapunov Function and -region design
We define a Lyapunov function for the unforced dynamical equation in Eq. (2),
[TABLE]
where is the phase space region that is included in a hypersphere of radius , centered at the origin. Here for and . Then for all ,
[TABLE]
where is the symmetric part of the matrix , is the largest real eigenvalue of the matrix , and is the set of parameters that completely characterize the nonlinear function . Now we introduce a quadratic upper bound to the term . Let us consider that the term is such a quadratic upper bound, that is , where is a scalar function of and . The inequality in Eq. (4e) can now be written as,
[TABLE]
According to the second Lyapunov stability theorem haddad2011nonlinear , the system in Eq. (2) is stable within if the following inequality holds,
[TABLE]
Note that for each , the term on the left hand side of Eq. (6b) depends on the dynamics and parameters of the individual nodes and the term on the right hand side of Eq. (6b) depends on the network topology. Thus Eq. (6b) effectively decouples the stability problem into two terms that can be adjusted independently of each other: the nodal dynamics and the network topology.
We now provide a definition of -region stability of a reservoir computer.
Definition 1
A nonlinear reservoir computer is -region stable if .
Note that this is a sufficient (but not necessary) condition for a reservoir to be stable inside the region . Also, if , the system is globally asymptotically stable.
We note that for any constant , the system is -region stable. We will thus attempt to find an upper bound to that is as tight as possible. To find the minimal , we define an optimization problem,
[TABLE]
As lies within the closed interval , the constraint in Eq. (7) must be satisfied along a continuum. However, we know that the constraint in Eq. (7b) achieves equality for some at the optimal solution ,
[TABLE]
or equivalently,
[TABLE]
The optimal coefficient is chosen as the term of maximum value in the set,
[TABLE]
where the four cases in Eq. (10) are the possible maxima of the ratio over a closed interval. The stability condition for the reservoir computer is
[TABLE]
From the inequality in Eq. (11) and the definition of in Eq. (10), we can find for which and the -region is determined by . We call the radius of the region . If then we say the system is globally stable (less formally we say ), while if then the system is unstable. In the next subsection, we will find the -region for the case that the nonlinear nodal dynamics is described by a polynomial.
II.1.2 Polynomial Type Nonlinearity
We now consider an example for which the reservoir computer consists of homogeneous nodes and the nodal dynamics of each node is defined by the following third-order polynomial function carroll2019network ; carroll2019mutual ,
[TABLE]
Polynomial functions are a very general way to express nonlinearity.
The dynamical equation that governs the evolution of each node is,
[TABLE]
Here, , and are the coefficients of the polynomial. In this case, the set of parameters . The origin is a fixed point for the dynamics and is linearly stable if the largest real part of the eigenvalues of the matrix is negative. Therefore, in what follows, we fix so as to ensure that the matrix is Hurwitz and then we characterize the basin of attraction as a function of the remaining parameters and .
According to Eq. (10), the scalar function can be obtained as,
[TABLE]
We note that the condition determines the radius of the sphere for which the reservoir computer is -region stable. Global stability is achieved when remains upper bounded by as .
Here, we provide an example to explain how to find the -region for a simple reservoir computer with nodes. We set , , and let the parameter vary. In Fig. 1(A), we plot versus for different values of . The solid black line is the constant-ordinate line at . For this example, we observe that is symmetric about the parameter . For , which is represented by a black dot where the curves for cross . For , reaches a constant below as grows, indicating that the basin of attraction has infinite radius. Figure 1(B) considers the case that . We see two different regions in the -plane distinguished by two colors: the red region indicates the initial conditions from which the system’s time evolution approaches the origin as time grows and the yellow region indicates the initial conditions from which the system’s time evolution does not converge to the origin, in which case the dynamics converges to either another fixed point, or a limit cycle, or any other attractor other than the origin. The black circle is the solution of , for . In Figs. 1(C) and 1(D) we plot the trajectory versus when the system is evolved from a typical initial condition from within the red and the yellow region, respectively.
II.2 Reservoir Computer with Discrete Time Dynamics
We now turn to the dynamics of a reservoir computer with discrete time dynamics,
[TABLE]
which in the unforced case becomes,
[TABLE]
where , , denotes the state of the node of the reservoir at time step , is the nonlinear nodal dynamics of node , the adjacency matrix indicates the pattern of connectivity between the network nodes, is the input signal at time step and is a vector that describes the coupling of the input signal to each one of the nodes. The input signal in Eq. (15) is normalized to have mean [math] and standard deviation equal to lu2017reservoir ; carroll2019network .
Hereafter we assume that the operating fixed point for the dynamics Eq. (16) coincides with the origin (however, this assumption can be removed; see the example that follows for the case of a sigmoid nonlinearity.)
II.2.1 Lyapunov Function and -region Design
We define a Lyapunov function
[TABLE]
where and . Here for and . Then ,
[TABLE]
We seek to find a scalar function such that which also satisfies the inequality in Eq. (18b),
[TABLE]
According to the Lyapunov stability theorem for discrete time dynamics haddad2011nonlinear , the system is stable only if,
[TABLE]
where is an eigenvalue of the matrix and . The above inequality can be written as,
[TABLE]
We see that there is both an upper bound and a lower bound for . Hence, there are two critical eigenvalues: and which are the eigenvalues closest to the positive side of the unit circle and closest to the negative side of the unit circle when moving only along the real axis, respectively. This concept is displayed graphically in Fig. 2 where and . The maximum distance the eigenvalue can shift to the right is and the maximum distance the eigenvalue can shift to the left is . Thus there exist two scalar functions denoted by and such that,
[TABLE]
An illustration is presented in Fig. 2 which shows how to find the critical eigenvalues and . In Fig. 2, several eigenvalues of some hypothetical adjacency matrix are shown inside the unit circle. For each eigenvalue, we compute and . From the table we see that is the critical eigenvalue and is the critical eigenvalue .
Now using the fact that all the nodes are homogeneous, from inequality Eq. (19), we can write,
[TABLE]
From the inequalities in Eqs. (22) and (23), it follows that,
[TABLE]
Thus, we find and such that
[TABLE]
As we want tight upper and lower bounds on , we seek to find and that solve the following two optimization problems,
[TABLE]
and
[TABLE]
A solution to the problem in Eq. (26) and to the problem in Eq. (27) must satisfy their respective constraints exactly for some and ,
[TABLE]
or equivalently,
[TABLE]
We can find as
[TABLE]
and we can find as
[TABLE]
Once we obtain and , we can find and such that and , respectively. The -region for the discrete time RC can be determined as , where .
Example: type nonlinearity
We choose the nodal dynamics to be
[TABLE]
The dynamics of node is described by,
[TABLE]
Here . We see that for , the origin is a fixed point, which is stable if all the eigenvalues of the matrix are inside the unit circle. The constant functions and can be found as,
[TABLE]
and
[TABLE]
Now if , then and if , then for any choice of .
II.2.2 Lyapunov Function and -region Design for Non-homogeneous Nodal Dynamics
One generalization of Eq. (16) is to the case of non-homogeneous nodal dynamics,
[TABLE]
Without loss of generality we retain the assumption that the above set of equations has a fixed point at the origin. In the case a fixed point exists that is different from the origin, this assumption can be removed by applying a coordinate transformation that moves the fixed point to the origin (see the example that follows for the case of a sigmoid nonlinearity). Now according to the Lyapunov function analysis described in section B.1, for each node we find scalar functions and which satisfy,
[TABLE]
We can find as
[TABLE]
and we can find as
[TABLE]
Now we define the scalar function as,
[TABLE]
and the scalar function as,
[TABLE]
Once we obtain and , we can find and such that and , respectively. Then the -region for the reservoir computer can be determined as , where .
Example: Sigmoid Nonlinearity
Here we choose the nodal dynamics to be
[TABLE]
The unforced dynamics of each node is,
[TABLE]
Here the parameters are . For this example, we see that the origin is not a fixed point. Instead a nonzero fixed point exists: , . Let and the system in Eq. (43) can be transformed to the form of Eqs. (36), where . According to Eq. (38) and (39), we find the scalar functions and as follows,
[TABLE]
and
[TABLE]
Now we define the scalar function as follows,
[TABLE]
and the scalar function as follows,
[TABLE]
Once we obtain and , we can find and such that and , respectively. Then the -region for the reservoir computer can be determined as , where .
III Results
For our numerical simulations, in both continuous-time and discrete-time, we construct the adjacency matrix as follows: (i) We set the entries of the initial matrix to be equal to , where is the Kronecker delta, . (ii) of the off-diagonal entries of the matrix are chosen randomly and set to zero. (iii) of the remaining nonzero entries of the matrix are chosen randomly and are flipped from to . (iv) Finally, the adjacency matrix is normalized so that the absolute value of the largest real part of its eigenvalues is equal to .
III.1 Training Error of a Reservoir Computer
The training error, , is used to quantify how well the training signal (, in the case of discrete dynamics) can be reconstructed from the input signal (). Lower values of indicate a better performance of the reservoir computer. In the continuous-time case, the training signal and the input signal are discretized in time and are thus treated as sequences. Before driving the reservoir computer by the input signal (), both and ( and ) are normalized to have mean equal to zero and standard deviation equal to one lu2017reservoir . However, such a choice is completely arbitrary; the amplitude of the input signal can be conveniently reduced in case one finds the (stable) reservoir dynamics to become unstable when driven by the input signal. Next, we present the procedure to compute the training error in the case of discrete-time dynamics (the procedure for the case of continuous-time dynamics is analogous.) We set the number of nodes of the reservoir to . When the reservoir is driven by the input signal , the first 2000 time steps are discarded as a transient. The next time steps from each node are recorded and combined into the matrix,
[TABLE]
The last column of the matrix is set to 1 to account for any constant offset in the fitting. We then introduce
[TABLE]
where is the fit to the training signal and is the vector of coefficients. The vector k is obtained from the minimum-norm solution to the linear least squares problem,
[TABLE]
The training error is computed as,
[TABLE]
where the symbol is computed by using the following formula,
[TABLE]
where and .
III.2 Results for Continuous-Time Dynamics
We now consider continuous-time and use the general polynomial function for the individual nodal dynamics,
[TABLE]
of which Eq. (13) is an example. We keep constant () as we are mainly interested in the effects of the nonlinear terms. In what follows we will study the effects of varying different pairs of coefficients, , of the polynomial (53) (in addition to setting .) For different cases we will indicate the pairs of coefficients we are focusing on, with the understanding that all the remaining coefficients are set to zero. We see that carefully choosing the form of the polynomial (53) is important and that setting certain coefficients to zero can dramatically worsen the performance of the reservoir, even when this is not directly predicted by the stability analysis.
Figure 3 provides a visual assessment of the training error of the reservoir computer in terms of the parameters and . First we construct the matrix as described at the beginning of this section. For the input signal we choose the signal from a Lorenz chaotic attractor, while the training signal is the Lorenz signal pathak2017using . The input signal and the training signal are normalized to have mean equal to zero and standard deviation equal to one. In Fig. 3 we plot the training error as a function of the parameters and . The color represents variations in log scale of the training error from dark blue (small) to dark red (large). The solid black curve represents the boundary between sets of parameters such that the RC is globally stable (below the black line) versus sets of parameters such that is finite (above the black curve). In other words, the region under the black curve determines the part of the parameter space for which the reservoir computer is globally stable and the RC is successful in performing the computation (though the training error may vary by different orders of magnitude as the parameters change.) On the other hand, above the black curve, it is difficult to assess how the system behaves. For example, the system could be globally stable, or locally stable, in which case depending on the particular choice of the input, the system dynamics might approach a different attractor or might be driven to . Another interesting observation is the presence of a tiny triangular region above the black curve where the reservoir computer performs well. The white region is the area of the parameter space for which the system goes to when it is driven by the input signal and the reservoir computer fails in performing the computation. For the RC dynamics diverges independent of the choice of . We also notice that the training error is symmetric about the -axis and the training error is very high (almost equals to 1) at , which indicates that the reservoir computer fails to capture and transfer the information from input to output if the quadratic term is absent from the nodal dynamics. Note that this seems to indicate there are two distinct requirements for the reservoir to work properly: (i) it needs to operate in the area of global stability and (ii) the coefficient needs to be nonzero. We observe a similar behavior in Fig. 4 where the input signal is the - component and the training signal is the -component of the Duffing chaotic attractor chang2017chaotic . Figure 7 shows in further detail the results in Fig. 3 for while preserving the nodal dynamics, the adjacency matrix , etc. Figure 7(A) is a plot of versus the parameter when and the black line is the constant-ordinate line at of the symmetric part of the matrix . In Fig. 7(B), we plot the inverse of the radius of the -region () versus the parameter for the particular case of . Here indicates that the system is globally stable and indicates that the system is unstable. Intermediate values of indicate that the system is -radius stable. In Fig. 7(C), we present a box plot of the training error () versus the parameter (). In this simulation, we consider 100 different realizations of the matrix . The training signal is the -component and the input signal is the -component of the Lorenz attractor.
We run some additional simulations to investigate the importance of carefully choosing the nonzero coefficients of the polynomial (53) on the reservoir performance. These are shown in Figs. 5 and 6. The case shown in Fig. 5 is that of a polynomial with linear, cubic and quartic order terms (but no quadratic term.) Similarly to the case when linear, quadratic and cubic terms were present, we do not see the distinct boundary between sets of parameters such that the RC is globally stable (below the black line in . 3) versus sets of parameters such that is finite (above the black curve in Fig. 3). In Fig. 5 we plot the training error as a function of the parameters and . The color represents variations in log scale of the training error from dark blue (small) to dark red (large). The solid black curves represent the level curves for different values of computed from in the parameter space . We observe that the reservoir computer performs well for those values of and for which , except in the case in which , which is analogous to what previously shown in Fig. 3 for . Moreover, we see for a case in which the polynomial had first order, third order, and fifth order terms but no even power terms that the training error was always very high (shown in Fig. 6). We wish to emphasize that while global stability could be achieved by a proper choice of the parameter alone (with all the other , , equal zero), our simulations show the importance of setting certain coefficients , , in the polynomial (53) to be non-zero and in particular it appears to be important to have nonzero coefficients corresponding to at least one odd power and one even power.
III.3 Results for Discrete-Time Dynamics with Sigmoid Nonlinearity
For the numerical simulations of a reservoir computer with discrete-time dynamics, we set the nodal dynamics as in Eq. (15) with . We choose the matrix as we described at the beginning of the section. We set the input signal and training signal from the Lorenz chaotic attractor and compute the training error as described in Sec. III.A. We consider 100 realizations of the matrix but keep the input and the training signal unchanged. We compute the fixed point of Eq. (43) and the scalar functions and by following the Eqs. (44)-(47). In Fig. 8, we plot the training error in log scale as a function of the parameters and . The solid black curve represents as a function of and . The dashed black curve represents as a function of and . The region between the two black curves determines the part of the parameter space for which the reservoir computer is globally stable and the reservoir computer is successful in performing the computation, while the training error may vary by different orders of magnitude as the parameter changes. On the other hand outside of this region, it is hard to assess how the system behaves. For example, the system could still be globally stable, or locally stable, in which case depending on the particular choice of input, the system dynamics might approach a different attractor or might be driven to . Figure 9 displays the results in Fig. 8 in further detail for the cross-section . Figure 9(A) shows (magenta) and (green) as functions of for constant . The black solid line is the constant-ordinate line at and the dashed black line is the constant-ordinate line at . For , the system is globally stable and the reservoir computer successfully performs the computation. In Fig. 9(B), we plot the training error () versus the parameter for the particular case of . We notice that when , the training error is a bit high but the performance of the RC is consistence, and when the RC performs very well.
IV Conclusion and Discussion
In this paper we have used the Lyapunov design method to analyze the nonlinear stability of a generic reservoir computer for both continuous-time and discrete-time dynamics. Our analysis presented in the paper is simple, input independent and yet is able to predict with a certain efficacy the actual performance of the reservoir in terms of computed training error, see e.g. Fig. 3 and 4. Making the analysis input dependent presents the major drawback that the input may not be known a priori and one usually wants the reservoir to be able to process different input signals. Our approach allows computation of the -radial region of stability about a desired fixed point, where is the radius of the stability region and is a set of parameters for the nodes’ individual dynamics. For each our approach decouples the effects of the individual nodal dynamics from those of the network topology.
For the case of continuous-time dynamics, we have considered a general form of polynomial nonlinearity. We have derived a scalar function which determines the region within the parameter space for which the system is globally stable and the reservoir performance is typically enhanced. Moreover, we have found that the particular type of nonlinearity matters. It is known from the literature that a RC requires nonlinearity to perform well, see e.g., jaeger2004harnessing . Here we have found additional evidence that (i) the performance of a reservoir typically worsens when one of the coefficients in the polynomial is set to zero and (ii) it is usually important to have nonzero coefficients corresponding to at least one odd power (e.g., linear) and one even power (e.g., quadratic or quartic order.) These observations hold for both cases that the input and training signals are generated by the Lorenz and the Duffing attractors.
For the case of discrete-time dynamics, a sigmoid function is used for the nodal dynamics. In this case, two scalar functions and determine the region on the parameter space for which the system is globally stable and the reservoir performance is enhanced.
Our plots in Figs. 3, 4, and 8 show a remarkable connection between the area of global stability in the parameter space predicted by the analysis and the performance of the RC as measured by the training error. In particular it appears that the training error worsens considerably when a change of the parameters causes loss of global stability. Our nonlinear stability analysis unveils a trade-off between the need for global stability, which is achievable by linear dynamics alone and the need for higher-order terms in the dynamics, which could in turn compromise stability. While fundamental insight into the exact role the nodal dynamics and the adjacency matrix have on the performance of the reservoir is an open area of research, the manual adjustment of the parameters of the reservoir computer is important to determine its dynamic regime. Our nonlinear stability analysis allows us to find the region within the parameter space for which satisfactory sets of parameters may be selected. One is able to then perform a brute search from within this region for adequate sets of parameters. Moreover, by reducing the parameter space to only a finite region, this analysis empowers us to design an optimization problem to find the optimal set of parameters to maximize the performance of a reservoir computer.
Acknowledgments
This work was supported by the National Science Foundation through Grant No. 1727948, the Office of Naval Research through Grant No. N00014-16-1-2637, and the Defense Threat Reduction Agency through Grant No. HDTRA1-12-1-0020.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Herbert Jaeger. The “echo state” approach to analysing and training recurrent neural networks-with an erratum note. Bonn, Germany: German National Research Center for Information Technology GMD Technical Report , 148(34):13, 2001.
- 2[2] Benjamin Schrauwen, David Verstraeten, and Jan Van Campenhout. An overview of reservoir computing: theory, applications and implementations. In Proceedings of the 15th european symposium on artificial neural networks. p. 471-482 2007 , pages 471–482, 2007.
- 3[3] Thomas Natschläger, Wolfgang Maass, and Henry Markram. The” liquid computer”: A novel strategy for real-time computing on time series. Special issue on Foundations of Information Processing of TELEMATIK , 8(ARTICLE):39–43, 2002.
- 4[4] Wolfgang Maass, Thomas Natschläger, and Henry Markram. Real-time computing without stable states: A new framework for neural computation based on perturbations. Neural computation , 14(11):2531–2560, 2002.
- 5[5] Romain Martinenghi, Sergei Rybalko, Maxime Jacquot, Yanne K Chembo, and Laurent Larger. Photonic nonlinear transient computing with multiple-delay wavelength dynamics. Physical review letters , 108(24):244101, 2012.
- 6[6] Daniel Brunner, Miguel C Soriano, Claudio R Mirasso, and Ingo Fischer. Parallel photonic information processing at gigabyte per second data rates using transient states. Nature communications , 4:1364, 2013.
- 7[7] Kohei Nakajima, Helmut Hauser, Tao Li, and Rolf Pfeifer. Information processing via physical soft body. Scientific reports , 5:10487, 2015.
- 8[8] Michiel Hermans, Miguel C Soriano, Joni Dambre, Peter Bienstman, and Ingo Fischer. Photonic delay systems as machine learning implementations. Journal of Machine Learning Research , 2015.
