Euclidean Contractivity of Neural Networks with Symmetric Weights
Veronica Centorrino, Anand Gokhale, Alexander Davydov, Giovanni Russo, and Francesco Bullo

TL;DR
This paper analyzes the stability of certain neural network models with symmetric weights using contraction theory, providing conditions for Euclidean contractivity and applying results to quadratic optimization.
Contribution
It introduces new algebraic results and sufficient conditions for Euclidean contractivity in neural networks with symmetric weights, including non-smooth activations.
Findings
Contraction rates are log-optimal for most symmetric matrices.
Provided stability conditions for Hopfield and firing-rate networks.
Applied contraction analysis to optimize quadratic problems with box constraints.
Abstract
This paper investigates stability conditions of continuous-time Hopfield and firing-rate neural networks by leveraging contraction theory. First, we present a number of useful general algebraic results on matrix polytopes and products of symmetric matrices. Then, we give sufficient conditions for strong and weak Euclidean contractivity, i.e., contractivity with respect to the norm, of both models with symmetric weights and (possibly) non-smooth activation functions. Our contraction analysis leads to contraction rates which are log-optimal in almost all symmetric synaptic matrices. Finally, we use our results to propose a firing-rate neural network model to solve a quadratic optimization problem with box constraints.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Control and Stability of Dynamical Systems · Model Reduction and Neural Networks
\includeversion
tac \excludeversionarxiv
Euclidean Contractivity of Neural Networks
with Symmetric Weights
Veronica Centorrinoa, Anand Gokhaleb, Alexander Davydovb,
Giovanni Russoc and Francesco Bullob This work was in part supported by AFOSR project FA9550-21-1-0203. The authors thank Dr. Leo Kozachkov for insightful comments.aVeronica Centorrino is with Scuola Superiore Meridionale, University of Naples Federico II, Italy. [email protected].bAnand Gokhale, Alexander Davydov, and Francesco Bullo are with the Center for Control, Dynamical Systems, and Computation, UC Santa Barbara, Santa Barbara, CA 93106 USA. [email protected], [email protected], [email protected].cGiovanni Russo is with the Department of Information and Electric Engineering and Applied Mathematics, University of Salerno, Italy. [email protected].
Abstract
This paper investigates stability conditions of continuous-time Hopfield and firing-rate neural networks by leveraging contraction theory. First, we present a number of useful general algebraic results on matrix polytopes and products of symmetric matrices. Then, we give sufficient conditions for strong and weak Euclidean contractivity, i.e., contractivity with respect to the norm, of both models with symmetric weights and (possibly) non-smooth activation functions. Our contraction analysis leads to contraction rates which are log-optimal in almost all symmetric synaptic matrices. Finally, we use our results to propose a firing-rate neural network model to solve a quadratic optimization problem with box constraints.
I Introduction
Continuous-time recurrent neural networks (RNNs) are dynamical models widely studied in computational neuroscience and machine learning. Recent interest has focused on establishing the contractivity properties of RNNs. Contracting dynamics are robustly stable, feature computationally friendly methods for equilibrium computation, and enjoy many other properties. Motivated by optimization [18, 2] and neuroscientific applications [16], [8, Chapter 17], this paper focuses on symmetric synaptic interactions.
While a comprehensive contractivity analysis with respect to and norms was recently presented in [5], the corresponding analysis with respect to weighted Euclidean norms is not complete yet. A recent breakthrough in this direction was obtained by [11]; this work extends and complements these results (a detailed comparison is offered below).
Two common models of RNNs are the firing-rate neural network (FNN) and Hopfield neural network (HNN); the main difference being the order by which the activation function acts. Under mild assumptions, FNNs are positive systems and, arguably, more biologically-plausible. HNNs are relevant in optimization and machine learning [18, 2, 16, 21]. For certain synaptic matrices and initial conditions, FNN and HNN are known to be equivalent via an appropriate change of coordinates and input transformation [14]. However, the understanding of this partial correspondence is not complete and, as we will show below, their contractivity properties are not exactly coincident.
Related literature
RNNs naturally emerge when modelling neural processes [8]. Critical questions when studying RNNs are related to finding conditions that guarantee stability and robustness of the network. For example, sufficient conditions for the stability of HNNs are given in [7] based on the use of Lyapunov diagonally stable matrices. Stability and robustness can be simultaneously established using contraction theory. Indeed, contracting systems exhibit highly ordered transient and asymptotic behaviors that appear to be convenient in the context of RNNs. For example: (i) initial conditions are exponentially forgotten [13]; (ii) time-invariant dynamics admits a unique globally exponential stable equilibrium [13]; (iii) contraction ensures entrainment to periodic inputs [17] and (iv) enjoy highly robust behavior, such as input-to-state stability [20]. (v) Moreover, efficient numerical algorithms can be devised for numerical integration and fixed point computation of contracting systems [10]. Recently, non-Euclidean contractivity of RNNs is studied in [5] and in [4], where stability properties of HNN and FNN with dynamic synapses undergoing Hebbian learning are proposed. Euclidean contractivity is studied in [12] to analyze the stability of RNNs with dynamic synapses and in [11], where a number of contractivity conditions are proposed. Finally, the design of norms minimizing the logarithmic norm is reviewed in [3, Section 2.7].
Contributions:
our main results are a set of sufficient conditions characterizing strong and weak infinitesimal contractivity properties (see Section II for the definitions) of FNNs and HNNs with symmetric weights and possibly non-smooth activation functions. We also establish a lower bound on the contraction rate and, remarkably, demonstrate that the bound is log-optimal in almost all symmetric weight matrices. One of the main benefits of our approach to the study of FNNs and HNNs is that, with just a single condition, it ensures global exponential convergence, along with all the other useful properties of contracting systems. The main results leverage a number of general algebraic results, which are interesting per se and are also a contribution of this paper. With these algebraic results, we: (i) determine a weighted norm for matrix polytopes which is log-optimal for almost all synaptic matrices; (ii) give a lower bound on the spectral abscissa of matrix polytopes; (iii) provide optimal and log-optimal norms for the product of symmetric matrices. Finally, we leverage our sufficient conditions for contractivity to propose a FNN solving certain quadratic optimization problems with box constraints.
Our results for strong infinitesimal contractivity of the FNN and HNN models with symmetric weights are based on and generalize [11, Theorem 2]. Specifically, (i) we provide the explicit expression of the matrix weights for which the models are contracting. The matrices we find are different for the two models, highlighting the importance of choosing the appropriate model based on the properties being studied; (ii) we address the weak contractivity case, i.e., when the contraction rate is [math], making it applicable for, e.g., systems that enjoy conservation or invariance properties; (iii) we handle weakly increasing and (iv) locally Lipschitz activation functions, allowing us to consider common activation functions such as the rectified linear unit (ReLU) and soft thresholding functions.
II Mathematical Preliminaries
We denote by the function if , if . Given , we define to be the diagonal matrix with diagonal entries equal to . Vector inequalities of the form are entrywise. We let , be the all-ones and all-zeros vectors, respectively, be the identity matrix, and be the set of real symmetric matrices. For , let , and denote the spectrum, spectral radius and the spectral abscissa of , respectively; here denotes the real part of . For , let and denote its minimum and maximum eigenvalue, respectively. Given , we write (resp. ) if is positive semidefinite (resp. definite). The Moore–Penrose inverse of is the unique matrix such that , , with , . Finally, whenever it is clear from the context, we omit to specify the dependence of functions on time .
II-A Norms and induced norms
Let denote both a norm on and its corresponding induced matrix norm on . Given the logarithmic norm (log-norm) induced by is
[TABLE]
Specifically, the Euclidean vector norm, matrix norm, and log-norm are, respectively: , , and .
For an norm, , and for an invertible matrix , the -weighted norm is defined as . The corresponding log-norm is . Specifically, the weighted Euclidean vector norm, matrix norm, and log-norm are, respectively: , , and .
For two invertible matrices , , it holds
[TABLE]
Given , with open and connected, we denote by the one-sided Lipschitz constant of . For continuously differentiable and convex set it holds
[TABLE]
where is the Jacobian of with respect to . We write to specify that the one-sided Lipschitz constant is computed with respect to a -weighted norm. Specifically, for the weighted Euclidean norm we have:
[TABLE]
We refer to [3] for a recent review of those tools.
II-B Contraction theory for dynamical systems
We start with the following
Definition 1**.**
Given a norm, a function , with -invariant, open and convex, and a constant ( referred as contraction rate, is strongly (weakly) infinitesimally contracting on if
[TABLE]
or, equivalently for differentiable vector fields, if
[TABLE]
One of the main benefits of contraction theory is that, with just a single condition, it ensures global exponential convergence, along with other useful properties, as highlighted in the introduction Section.
The next result [5, Theorem 16] allows using condition (2) for locally Lipschitz function, for which, by Rademacher’s theorem, exists almost everywhere (a.e.) in .
Theorem 1**.**
Consider a norm, a function locally Lipschitz on open and convex set. Then for every the following statements are equivalent:
- (i)
, for all , 2. (ii)
, for a.e. and .
II-C Hopfield and firing-rate continuous-time neural networks
We are interested in the following continuous-time FNN and HNN models defined, respectively, as:
[TABLE]
where: , are neural activation vectors, is a nonlinear and diagonal activation function, i.e., for , , where . is the synaptic matrix, with being the synaptic weight from neuron to neuron . Finally, , are the external stimuli in the FNN and HNN, respectively. The models (3) and (4) assume homogeneous dissipation rates; we leave the heterogeneous case to future work.
Remark 2**.**
When the activation function is non-negative the positive orthant is forward-invariant for in (3) and is interpreted as a firing-rate. Instead, in (4) is sign indefinite and is interpreted as a membrane potential.
III Main Results
This section presents the main results of the paper. Namely, we study Euclidean contractivity properties of continuous-time RNNs with symmetric weights.
First, we give algebraic results on weighted norms of certain matrix polytopes. Then, we use those results to give sufficient conditions for the strong infinitesimal contractivity of the FNN and the HNN with symmetric weights with respect to weighted Euclidean norms.
Assumption 1** (Symmetric synaptic weights).**
The synaptic matrix is symmetric.
Under Assumption 1, the eigenvalues of are real, and . Moreover, can be decomposed as
[TABLE]
where is the orthogonal matrix whose columns are the eigenvectors of , and is diagonal with being the vector of the eigenvalues of .
Given , we define by
[TABLE]
We illustrate in Figure 1. For our derivations, it is useful to introduce the shorthand notation . Also, we introduce
[TABLE]
and, when is invertible, is defined as
[TABLE]
Remark 3**.**
The matrix defined in (8) can be written as , where we use the notation , with defined by
[TABLE]
III-A *Results on the
Euclidean log-norm of matrix polytopes*
First, we give the following definition for polytopes.
Definition 2** (Log-optimal and log--optimal norms for matrix polytopes).**
Given , consider the polytope
[TABLE]
and a scalar . We say that the norm is
- (i)
logarithmically optimal (log-optimal) for * if*
[TABLE] 2. (ii)
logarithmically -optimal (log--optimal) for * if*
[TABLE]
We are specifically interested in the matrix polytopes defined as and . Namely, in Theorem 5 we give algebraic results on the Euclidean log-norm of matrices in and (the proof is in Section IV, together with a number of instrumental results).
Remark 4**.**
It is always possible to rewrite and in the form of Definition 2. In fact, let be the vertices defined by where is the binary vector with entries either [math] or (note that there are such binary vectors). Then the set is exactly the set . To prove this, note that the vertices of the convex set are the vectors . Therefore, given there exist , , with such that . Thus,
[TABLE]
The same reasoning holds for .
Theorem 5** (Euclidean log-norm of matrix polytopes).**
Given a symmetric synaptic matrix (Assumption 1), the following statements holds:
- (i)
if , then , with defined in (7), is log-optimal for , i.e.,
[TABLE]
In addition, if is invertible, then , with defined in (8), is log-optimal for , i.e.,
[TABLE] 2. (ii)
if , then for each the norm , with defined in (7), is log -optimal for , i.e.,
[TABLE] 3. (iii)
if , then is log-optimal for and , i.e.,
[TABLE]
Remark 6**.**
Theorem 5 applies to polytopes of the form and of the form , for all . This follows from the log-norm translation property, i.e., for all .
III-B Contractivity of recurrent neural networks
Next, we consider the neural network dynamics for the FNN in (3) and for the HNN in (4).
Assumption 2** (Slope-restricted activation function).**
The activation function is Lipschitz and slope restricted in , i.e.,
[TABLE]
Assumption 2 ensures that for almost all . Many common activation functions including ReLU, and sigmoid, satisfy Assumption 2, possibly after rescaling. In fact, Assumption 2 can be relaxed for larger classes of coupling by restricting the slope to , where . By defining and our following results still hold for this general case, with replaced by . We assume to simplify the notation.
III-B1 Contractivity of firing rate neural networks
We now provide an upper bound on the one-sided Lipschitz constant and sufficient conditions for the Euclidean contractivity of FNNs with symmetric weights.
Theorem 7** (Euclidean one-sided Lipschitz constant of the FNN).**
Consider the FNN (3) satisfying Assumptions 1, 2:
- (i)
if , then
[TABLE]
with defined in (7); 2. (ii)
if , then
[TABLE]
with defined in (7); 3. (iii)
if , then
[TABLE]
Proof.
Regarding part (i) note that for almost all we have
[TABLE]
where the last equality follows by the log-norm translation property and part (i) in Theorem 5. The proof follows by applying Theorem 1. Parts (ii) and (iii) can be proved similarly, using parts (ii) and (iii) in Theorem 5.∎
Remark 8**.**
Under further assumptions on the synaptic matrix and the activation function, some inequalities in Theorem 7 are tight – see Appendix B.
The next result follows from Theorem 7.
Corollary 9** (Euclidean contractivity of the FNN).**
Under the same assumptions and notations as in Theorem 7,
- (i)
if , then the FNN is weakly infinitesimally contracting with respect to ; 2. (ii)
if , then the FNN is strongly infinitesimally contracting with rate with respect to ; 3. (iii)
if , then for any the FNN is strongly infinitesimally contracting with rate with respect to ; 4. (iv)
if , then the FNN is strongly infinitesimally contracting with rate with respect to .
III-B2 Contractivity of Hopfield neural networks
We first provide an upper bound on the Euclidean one-sided Lipschitz constant and sufficient conditions for the contractivity of HNNs with non-singular symmetric synaptic matrix. Then, we give sufficient conditions for the contractivity with singular symmetric synapses. This latter result is proven in Section IV: differently from our analysis on FNNs, it requires a distinct mathematical approach.
Theorem 10** (Euclidean one-sided Lipschitz constant of the HNN with non-singular symmetric weights).**
Consider the HNN (4) satisfying Assumptions 1, 2 with non-singular weight matrix ,
- (i)
if , then
[TABLE]
with defined in (8); 2. (ii)
if , then
[TABLE]
Proof.
Regarding part (i), note that for almost all we have
[TABLE]
where the last equality follows by the log-norm translation property and part (i) in Theorem 5. The proof then follows by applying Theorem 1. Part (ii) can be proved similarly, using part (iii) in Theorem 5. ∎
Remark 11**.**
Following the same reasoning as in Appendix B, under the same assumptions of Theorem 10, if the activation function satisfies , and , then the inequalities in Theorem 10 are tight.
Corollary 12** (Euclidean contractivity of the HNN with non-singular symmetric weights).**
Under the same assumptions and notations as in Theorem 10,
- (i)
if , then the HNN is weakly infinitesimally contracting with respect to ; 2. (ii)
if , then the HNN is strongly infinitesimally contracting with rate with respect to ; 3. (iii)
if , then the HNN is strongly infinitesimally contracting with rate with respect to .
Finally, we give sufficient infinitesimal contractivity conditions of the HNN with singular symmetric synapses (see Section IV for the proof).
Theorem 13** (Contractivity of the HNN with singular symmetric weights).**
Consider the HNN (4) satisfying Assumptions 1, 2 with having kernel , and such that . Then, for each the HNN is strongly infinitesimally contracting with rate .
Remark 14**.**
If , then the FNN (3) and the HNN (4) are contracting with rate 1. As a consequence of Corollaries 9, 12 and Theorem 13, when coupling is added to the networks, they remain (strongly) contracting as long as . Note that the entries of are allowed to be large, so as the activation function and this allows to have different types of coupling as long as the matrix is Hurwitz.
IV Proofs and Additional Results
We now present additional algebraic results on matrix polytopes and symmetric matrices, and the proofs of Theorems 5 and 13. First, we give a technical result for the spectral abscissa of matrix polytopes.
Lemma 15** (Lower bound on spectral abscissa of polytope of matrices).**
For any , we have
[TABLE]
Proof.
First, note that the spectral abscissa is a continuous function and that the set is compact, hence the maximum is well defined. To prove (10) we compute:
[TABLE]
The same calculation applies to prove inequality (11). ∎
We now give the proof of Theorem 5. To enhance clarity we prove its parts case by case. Lemma 16 and parts (i) and (ii) in Theorem 5, are based upon and extend the treatment in [11, Theorem 2] – see our statement of contributions.
Lemma 16** (Splitting upper-bounded symmetric matrices).**
Consider satisfying Assumptions 1. Assume , for some and let and be defined in (6) and (7), respectively. Then,
[TABLE]
Proof.
By definition of the function , for all , , it holds
[TABLE]
In fact, we have
[TABLE]
Equation (13) implies . Equality (12) follows by multiplying by and to the left and to the right, respectively, with defined in (5). ∎
First, we prove part (i), i.e., the log-optimality of the norm and, when is invertible, of for multiplicatively-scaled matrices with positive maximum eigenvalue.
Proof of part (i).
First, we prove that is log-optimal for and . To this purpose, define
[TABLE]
Lemma 16 implies . Next, pick satisfying , so that is diagonal and invertible. Then
[TABLE]
Since , we can apply the Schur complement to this LMI to conclude that
[TABLE]
Setting for arbitrary , the inequality (15) implies
[TABLE]
In summary, we have established that the weak LMI (14) (independent of ) implies the weak LMI (16) for all . Here, by weak LMI, we mean to state that the linear matrix inequality is not strict. It is known [9, Theorem 6.3.5] that the eigenvalues of a symmetric matrix are continuous functions of the matrix entries. Therefore, the LMI (16) holds also for . Finally, note that the LMI (16) is equivalent to the condition for all , therefore
[TABLE]
Moreover, it is well known [6] that for every log-norm and every matrix it holds . Specifically in our case:
[TABLE]
The proof then follows from (10), after noticing that in this case .
Next, assume that is invertible. We need to prove that is log-optimal for and that it holds . We have
[TABLE]
where the last equality follows from the log-optimality of for . The proof again follows from (10). ∎
The proof of part (ii) of Theorem 5, i.e., the log-optimality of the weighted norm for multiplicatively-scaled negative semidefinite matrices, follows the same reasoning as that of part (i) by considering instead of . Hence, we omit it here for brevity.
Finally, we prove part (iii), i.e., the log-optimality of for multiplicatively-scaled negative definite matrices. To do so, we give the following algebraic result.
Lemma 17** (Optimal norms for products of symmetric matrices).**
Let and where , , with . Then, for each ,
- (i)
* is real and has the same number of negative, zero, and positive eigenvalues as ;* 2. (ii)
the norm is optimal for the matrix , i.e., ; 3. (iii)
the norm is log-optimal for , i.e., .
Proof.
Let . is similar to , hence is real. Part (i) then follows from Sylvester’s law of inertia, noting that is congruent to . Regarding part (ii), we compute
[TABLE]
where the last equality follows from the fact that has the same eigenvectors as and real eigenvalues equal to the square of the real eigenvalues of . Finally, to prove part (iii) we compute
[TABLE]
This concludes the proof of part (ii). The proof for is a straightforward adaptation. ∎
Proof of part (iii).
Pick satisfying and consider the matrices and . Lemma 17 with and , implies that the spectrum of the product matrices and is real and has the same number of negative, zero, positive eigenvalues as . Therefore,
[TABLE]
Maximizing over we get part (iii). ∎
Finally, we give the proof of Theorem 13.
Proof of Theorem 13.
Let be the number of non-zero eigenvalues of . Without loss of generality, we reorder the elements in and , so that and , where is the eigenvector of .
Next, let , , , , and define , , so that .
We have . Therefore, given we can always define and . We note that implies , , , and . Also,
[TABLE]
Moreover, we have
[TABLE]
In fact, from Corollary 9 we know:
[TABLE]
By multiplying by and to the left and to the right, respectively, we get
[TABLE]
Thus, . Next, by multiplying (4) by and we obtain the interconnected system:
[TABLE]
thus,
[TABLE]
Equation (21) is always contracting with respect to any norm in the subspace with , being . For system (22) we define , where , with
[TABLE]
Next, we note that the matrix Q_{\textup{H\parallel}}:=U_{\parallel}^{\top}Q_{\textup{F},\alpha(W)}{W}^{\dagger}U_{\parallel}=\theta_{\parallel}\Lambda_{\parallel}^{-1} and that . Thus, we have
[TABLE]
Thus system (22) is strongly infinitesimally contracting in with respect to \|\cdot\|_{Q_{\textup{H\parallel}}} with rate .
Finally, we note that at fixed and , the map is Lipschitz with constant . In fact, , we get
[TABLE]
We can now construct the gain matrix (30)
[TABLE]
The eigenvalues of are . The fact that implies . In turn, since by assumptions , we have . Thus is Hurwitz and . By applying Theorem 21, for each we have that the HNN is strongly infinitesimally contracting with rate . This concludes the proof. ∎
V Using Euclidean contractivity to solve quadratic optimization problems
We now apply the previous results to propose a firing-rate neural network solving certain quadratic optimization problems with box constraints. By utilizing Corollary 9, we ensure global exponential convergence of our dynamic, along with all the other properties of contracting systems.
Given , an input , and the quadratic optimization problem with box constraints is
[TABLE]
Note that is strongly convex and the constraints are convex, thus (24) admits a unique global optimal solution.
We propose the following FNN model to solve (24). Given a single-layered neural network of neurons, the state evolves according to
[TABLE]
with output . The activation function , illustrated in Figure 2, is defined as , where is
[TABLE]
To simplify the notation, whenever it is clear from the context, we use the same symbol for both the scalar and vector forms of the saturation function.
Remark 18**.**
The function satisfies Assumption (2). Almost everywhere, its partial derivative is defined by
[TABLE]
Next, we use Corollary 9 to give sufficient conditions for the strong infinitesimal contractivity of (25). Then, we show that the equilibrium of (25) is the optimal solution of (24).
Lemma 19** (Strong infinitesimal contractivity).**
Let in (25). The FNN (25) is strongly infinitesimally contracting with rate with respect to thee norm , where
- (i)
if , then and , with defined in (7); 2. (ii)
if , then for any , and , with defined in (7); 3. (iii)
if , then and .
Proof.
The thesis follows by applying Corollary 9 noticing that implies , thus , and satisfies Assumption 2. ∎
An immediate consequence of Lemma 19 is that (25) admits a unique equilibrium point. Next, we prove that this equilibrium point is the optimal solution of (24).
Lemma 20**.**
The vector is the global minimum for (24) if and only if is the equilibrium point of (25).
Proof.
Let be a global minimum for (24), thus . Then it follows from the KKT conditions that, for all ,
[TABLE]
Note that is an equilibrium of (25) if, for all , we have
[TABLE]
If , let . By definition of it holds . Moreover, from the KKT conditions (27), and being monotonically non-decreasing, we get the reverse inequality. Thus verifies (28). Similarly it can be proved that (28) holds for , and .
Vice versa, let be an equilibrium of (24), i.e., (28) holds. If , then (28) implies By definition of we get , thus , and , which implies . Similarly, if , then , while if , then and . This ends the proof since we have shown that the KKT conditions (27) hold for all .∎
VI Conclusion
We presented sharp conditions for strong and weak Euclidean contractivity of Hopfield and firing-rate neural networks with symmetric weights together with a number of general algebraic results. Specifically, we analyzed the Euclidean log-norm of matrix polytopes, proposing norms that are log-optimal for almost all matrices, and provided optimal and log-optimal norms for the product of symmetric matrices. We considered networks with (possibly) non-smooth activation functions, which allows us to consider common activation functions such as ReLU and the soft thresholding function. Finally, to demonstrate the practical implications of our results, we proposed a FNN to solve quadratic optimization problems with box constraints.
As future work, it would be useful to (i) extend our results to arbitrary synaptic matrices (as opposed to only symmetric) and heterogeneous dissipation matrices, (ii) establish higher-order contractivity properties [19] and consider stochastic models [1], and (iii) apply these results to neuroscience and machine learning problems. For example, we plan to study sparse reconstruction networks (inspired by [16]) and implicit learning models (e.g., see [15]).
Appendix A Interconnected systems
In this section, we briefly review the theory of contracting interconnected systems, that we used to prove Theorem 13. We refer to [3] for a recent and more detailed review.
Given positive integers such that , consider the decomposition , a local norm on , for each , with associated log-norm . Consider the interconnection of dynamical systems
[TABLE]
where , and denote the vector without the component . We recall the following results that will be useful for our analysis.
Theorem 21** (Contractivity of interconnected system).**
Consider the interconnected system in (29). Assume
- (1)
(contractivity-at-each-node) at fixed and , each function is strongly infinitesimally contracting with rate with respect to . 2. (2)
(Lipschitz interconnections) at fixed and , each function is Lipschitz with Lipschitz constant .
Define the gain matrix
[TABLE]
If is Hurwitz, then the interconnected system is strongly infinitesimally contracting with respect to and with rate , where , , and .
Appendix B Justification for Remark 8
Lemma 22**.**
Given the FNN (3) with symmetric (Assumption 1) and invertible synaptic matrix , Lipschitz and slope restricted in (Assumption 2) activation function satisfying and ,
- (i)
if , then
[TABLE]
with defined in (7); 2. (ii)
if , then
[TABLE]
Proof.
The proof of both parts follows by applying Theorem (7) and noticing that under the above assumptions for any log-norm it holds the reverse inequality
[TABLE]
To prove (31), let be the function defined by where is the measure zero set of points in where is not differentiable. It is well-known that for any closed and bounded set , . Then, since is bounded, the closure of satisfies
[TABLE]
Letting be the measure zero points in where is not differentiable, we compute
[TABLE]
We justify the above (in)equalities as follows. Equality (33) holds because is invertible. Inequality (36) holds because of the condition (32). Finally, equality (37) follows because is a convex function of its argument and the maximum value of a convex function over a polytope occurs at one of its vertices.
In particular, for the respective choice of norm in parts (i) and (ii), the result is proved in view of Theorem 5 and the translation property for log-norms. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Z. Aminzare. Stochastic logarithmic Lipschitz constants: A tool to analyze contractivity of stochastic differential equations. IEEE Control Systems Letters , 6:2311–2316, 2022. doi:10.1109/LCSYS.2022.3148945 . · doi ↗
- 2[2] A. Bouzerdoum and T. R. Pattison. Neural network for quadratic optimization with bound constraints. IEEE Transactions on Neural Networks , 4(2):293–304, 1993. doi:10.1109/72.207617 . · doi ↗
- 3[3] F. Bullo. Contraction Theory for Dynamical Systems . Kindle Direct Publishing, 1.1 edition, 2023, ISBN 979-8836646806. URL: http://motion.me.ucsb.edu/book-ctds .
- 4[4] V. Centorrino, F. Bullo, and G. Russo. Modelling and contractivity of neural-synaptic networks with Hebbian learning. Automatica , July 2022. Submitted. doi:10.48550/ar Xiv.2204.05382 . · doi ↗
- 5[5] A. Davydov, A. V. Proskurnikov, and F. Bullo. Non-Euclidean contraction analysis of continuous-time neural networks. IEEE Transactions on Automatic Control , September 2022. Submitted. doi:10.48550/ar Xiv.2110.08298 . · doi ↗
- 6[6] C. A. Desoer and M. Vidyasagar. Feedback Systems: Input-Output Properties . Academic Press, 1975, ISBN 978-0-12-212050-3. doi:10.1137/1.9780898719055 . · doi ↗
- 7[7] M. Forti and A. Tesi. New conditions for global stability of neural networks with application to linear and quadratic programming problems. IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications , 42(7):354–366, 1995. doi:10.1109/81.401145 . · doi ↗
- 8[8] W. Gerstner, W. M. Kistler, R. Naud, and L. Paninski. Neuronal Dynamics: From Single Neurons To Networks and Models of Cognition . Cambridge University Press, 2014, ISBN 9781107635197. URL: https://neuronaldynamics.epfl.ch .
