Positive solutions for large random linear systems
Pierre Bizeul (ENS Paris Saclay), Jamal Najim (LIGM)

TL;DR
This paper analyzes the positivity and stability of solutions to large random linear systems with Gaussian entries, revealing a sharp phase transition at a critical scaling factor and implications for biological community models.
Contribution
It establishes a precise phase transition threshold for positivity in large Gaussian linear systems and links this to the stability of associated Lotka-Volterra models.
Findings
Phase transition at lpha* s pprox n
Below threshold, solutions have negative components with high probability
Above threshold, solutions are positive and the system is stable
Abstract
Consider a large linear system where is a matrix with independent real standard Gaussian entries, is a vector of ones and with unknown the vector satisfyingWe investigate the (componentwise) positivity of the solution depending on the scaling factor as the dimension goes to . We prove that there is a sharp phase transition at the threshold : below the threshold (), has negative components with probability tending to 1 while above (), all the vector's components are eventually positive with probability tending to 1. At the critical scaling , we provide a heuristics…
| 0.33 | 0.27 | 0.23 | 0.21 | 0.19 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Positive solutions for large random linear systems
Pierre Bizeul, Jamal Najim
Abstract.
Consider a large linear system where is a matrix with independent real standard Gaussian entries, is a vector of ones and with unknown the vector satisfying
[TABLE]
We investigate the (componentwise) positivity of the solution depending on the scaling factor as the dimension goes to . We prove that there is a sharp phase transition at the threshold : below the threshold (), has negative components with probability tending to 1 while above (), all the vector’s components are eventually positive with probability tending to 1. At the critical scaling , we provide a heuristics to evaluate the probability that is positive.
Such linear systems arise as solutions at equilibrium of large Lotka-Volterra systems of differential equations, widely used to describe large biological communities with interactions such as foodwebs for instance.
In the domaine of positivity of the solution , that is when , we establish that the Lotka-Volterra system of differential equations whose solution at equilibrium is precisely is stable in the sense that its jacobian
[TABLE]
has all its eigenvalues with negative real part with probability tending to one.
Our results shed a new light and complement the understanding of feasibility and stability issues for large biological communities with interaction.
Key words and phrases:
Linear systems; large random matrices; Gaussian concentration; Lotka-Volterra equations.
2010 Mathematics Subject Classification:
Primary 15B52, 60G70, Secondary 60B20, 92D40
1. Introduction
Denote by a matrix with independent Gaussian entries and by a positive sequence. We are interested in the componentwise positivity of the vector , solution of the linear system
[TABLE]
where is the vector with components 1.
It is well-known since Geman [7] that the spectral radius of almost surely (a.s.) converges to 1, so that matrix is eventually invertible as long as . In this case, vector where writes
[TABLE]
where is the canonical vector and denotes the transconjugate of matrix (or simply its transpose if is real).
The positivity of the ’s is a key issue in the study of Large Lotka-Volterra systems, widely used in mathematical biology and ecology to model populations with interactions.
Consider for instance a given foodweb and denote by the vector of abundances of the various species within the foodweb at time . A standard way to connect the various abundances is via a Lotka-Volterra (LV) system of equations that writes
[TABLE]
where the interactions can be modeled as random in the absence of any prior information. Here, the ’s are assumed to be i.i.d. . At the equilibrium , the abundance vector is solution of (1.1) and a key issue is the existence of a feasible solution, that is a solution where all the ’s are positive. Dougoud et al. [5], based on Geman and Hwang [8], proved that a feasible solution is very unlikely to exist if is a constant. In fact, the CLT proved in [8] asserts that for any fixed number of components
[TABLE]
where (resp. ) stands for the convergence in distribution (resp. in probability) and where . As an important consequence, vectors with positive components will become extremely rare since
[TABLE]
In this article, we consider a growing scaling factor and study the positivity of ’s components in relation with . We find that there exists a critical threshold
[TABLE]
below which feasible solutions exist with vanishing probability and above which feasible solutions are more and more likely to exist. More precisely, we prove the following:
Theorem 1.1** (Feasibility).**
Let and denote by . Let be the solution of (1.1).
- (1)
If there exists such that eventually then
[TABLE] 2. (2)
If there exists such that eventually then
[TABLE]
Proof of Theorem 1.1 is based on an analysis of the order of magnitude of the extreme values of the ’s, which relies on Gaussian concentration of Lipschitz functionals whose argument is matrix .
Remark 1.2*.*
In Figure 1, we illustrate the transition toward feasibility depending on the scaling . For , we plot the proportion of feasible solutions obtained after 500 simulations. The transition occurs at the optimal scaling corresponding to .
Remark 1.3*.*
Notice that the convergence of to zero is extremely slow, as shown in Table 1, and could easily be mistaken with some constant scaling where .
To complement the picture, we provide the following heuristics at the critical scaling :
[TABLE]
Aside from the question of feasibility arises the question of stability : for a complex system, how likely a perturbation of the solution at equilibrium will return to the equilibrium? Gardner and Ashby [6] considered stability issues of complex systems connected at random. Based on the circular law for large matrices with i.i.d. entries, May [13] provided a complexity/stability criterion and motivated the systematic use of large random matrix theory in the study of foodwebs, see for instance Allesina et al. [1]. Recently, Stone [14] and Gibbs et al. [9] revisited the relation between feasibility and stability.
We complement the information of Theorem 1.1 by adressing the question of stability in the context of a Lotka-Volterra system (1.2) and prove that under the first condition of the theorem feasibility and stability occur simultaneously.
Recall that the solution at equilibrium is stable if the Jacobian matrix of the Lotka-Volterra system evaluated at , that is
[TABLE]
has all its eigenvalues with negative real part.
Theorem 1.4** (Stability).**
Let be the solution of (1.1). Denote by and assume that . Denote by the spectrum of and let . Then
[TABLE]
Moreover,
[TABLE]
Proof of Theorem 1.4 relies on standard perturbation results from linear algebra and on Theorem 1.1.
Organization of the paper
Proof of Theorem 1.1 is provided in Section 2. Theorem 1.4 is proved in Section 3. In Section 4, elements to bear out heuristics (1.3) are provided. We also formulate some concluding remarks for non-homogeneous linear systems where vector is replaced by a positive vector and briefly mention possible extensions to non-Gaussian entries.
Acknowlegments
JN thanks Christian Mazza for introducing him to the study of large LV systems in theoretical ecology. The authors thank François Massol and Olivier Guédon for fruitful discussions.
2. Positive solutions: proof of Theorem 1.1
We will use the following notations for the various norms at stake: if is a vector then stands for its euclidian norm; if is a matrix then stands for its spectral norm and for its Frobenius norm. Let be a function from or to then .
2.1. Some preparation and strategy of the proof
Denote by the resolvent and by the largest singular value of a given matrix . Then it is well known that almost surely (see for instance [3, Chapter 5]) hence . In particular, the solution
[TABLE]
with the identity, is uniquely defined almost surely. In order to study the minimum of ’s components, we partially unfold the above resolvent (in the sequel, we will simply denote instead of ) and write:
[TABLE]
where
[TABLE]
Notice in particular that the ’s are i.i.d. standard Gaussian. Before focusing on the analysis of the remaining term , we recall standard results for extreme values of Gaussian random variables.
Extreme values of Gaussian random variables
Consider the sequence of standard Gaussian i.i.d. random variables and let
[TABLE]
Denote by the cumulative distribution of a Gumbel distributed random variable.
Then the following results are standard, see for instance [12, Theorem 1.5.3]: for all
[TABLE]
Strategy of the proof
Eq. (2.1) immediatly yields
[TABLE]
We rewrite the first equation as
[TABLE]
where we have used the fact that . Similarly,
[TABLE]
The theorem will then follow from the following lemma.
Lemma 2.1**.**
The following convergence holds
[TABLE]
Proof of Lemma 2.1 requires a careful analysis of the order of magnitude of the extreme values of the remaining term . It is postponed to Section 2.3.
2.2. Lipschitz property and tightness of
Let be a smooth function with values
[TABLE]
and strictly decreasing from to zero as goes from to . Recall that is the largest singular value of the normalized matrix and denote by
[TABLE]
Notice that (as a by-product of the a.s. convergence of to ).
Instead of directly working with we introduce the truncated quantity
[TABLE]
For a given matrix , we may consider its hermitized matrix defined as
[TABLE]
Recall that the singular values of together with their opposites are the eigenvalues of .
We prove hereafter that as a function of the entries of matrix , the function is lipschitz.
Lemma 2.2**.**
Let be given by (2.7), then the function is Lipschitz, i.e.
[TABLE]
where is the Frobenius norm and is a constant independent from and .
Proof.
Notice that and for , which implies that one may consider the bound in the following computations, for or its derivatives would be zero otherwise. Recall the definition of the resolvent then which yields from which we deduce that
[TABLE]
for large enough.
We first consider a matrix such that has simple spectrum (i.e. with distinct eigenvalues, each with multiplicity 1). We denote by and prove that the vector satisfies
[TABLE]
To lighten the notations, we may drop the dependence of in . We begin by computing
[TABLE]
Straightforward computations yield
[TABLE]
It remains to compute . Recall that has a simple spectrum and notice that is differentiable. In fact, since is simple, it is a simple root of the characteristic polynomial. In particular, it is not a root of its derivative and one can use the implicit function theorem to conclude. Let and be respectively the left and right normalized singular vectors associated to . Then
[TABLE]
moreover is (up to a scaling factor) the unique eigenvector of since has multiplicity one by assumption. We can now apply [10, Theorem 6.3.12] to compute ’s derivative:
[TABLE]
(recall that all the considered vectors are real).
We first handle the term .
[TABLE]
We now handle the term .
[TABLE]
The term can be handled similarly and one can prove
[TABLE]
Gathering all these estimates, we finally obtain the desired bound:
[TABLE]
where neither depends on nor on .
Having proved a local estimate over for each matrix such that has simple spectrum, we now establish the Lipschitz estimate (2.8) for two such matrices .
Let such that and have simple spectrum and consider for . Notice first that the continuity of the eigenvalues implies that there exists sufficiently small such that has a simple spectrum for and . To go beyond and prove that has simple spectrum for the entire interval except maybe for a finite number of points, we rely on the argument in Kato [11, Chapter 2.1] which states that apart from a finite number of ’s:
[TABLE]
the number of eigenvalues of remains constant for and . Since has simple spectrum for , it has simple spectrum for all .
We can now proceed:
[TABLE]
By iterating this process, we obtain
[TABLE]
hence the Lipschitz property along the segment for and with simple spectrum.
The general property follows by density of such matrices in the set of matrices and by continuity of . Let be given and and be such that and have simple spectrum then:
[TABLE]
Proof of Lemma 2.2 is completed. ∎
We now use concentration arguments to obtain a bound on .
Proposition 2.3**.**
Let be the constant obtained in Lemma 2.2, then
[TABLE]
Proof.
By applying Tsirelson-Ibragimov-Sudakov inequality [4, Theorem 5.5] to with the Lipschitz estimate obtained in Lemma 2.2, we obtain the following exponential estimate:
[TABLE]
for all . We can now estimate the expectation of the maximum (we drop the dependence in ).
[TABLE]
Hence for
[TABLE]
Optimizing in , we obtain and , which is the desired estimate. ∎
Proposition 2.4**.**
The following estimate holds111Notice that the proof does not rely on the fact that the entries are Gaussian. In particular, we did not use the integration by part formula , only valid for .:
[TABLE]
uniformly in .
Proof.
Given an almost surely differentiable function , we shall use the following Taylor expansion:
[TABLE]
We have
[TABLE]
Notice that does not depend on anymore, hence is independent from this random variable. In particular . We denote by a function evaluated at , i.e. . We have
[TABLE]
Recall that , where has been computed in (2.10). Denote by and and recall that and . We have
[TABLE]
Hence . Now
[TABLE]
Hence . Finally
[TABLE]
Hence .
We have finally proven that uniformly in , which concludes the proof of the lemma. ∎
We are now in position to prove Lemma 2.1.
2.3. Proof of Lemma 2.1
We first establish the convergence for . Notice that the r.v. is nonnegative hence by Markov inequality,
[TABLE]
Now since the random variables are exchangeable, and
[TABLE]
by Proposition 2.3. This implies that
[TABLE]
We now prove that
[TABLE]
By Proposition 2.4, hence . Applying Poincaré’s inequality to the Lipschitz functional (cf. Lemma 2.2), we can bound ’s variance by and obtain
[TABLE]
This yields (2.12). Combining (2.11) and (2.12) finally yields:
[TABLE]
In order to obtain the result for the untilded quantities, we write
[TABLE]
One proves the second assertion similarly, which concludes the proof of Lemma 2.1.
3. Stability: proof of Theorem 1.4
In order to study the stability of large Lotka-Volterra systems, we are led to study the matrix
[TABLE]
We first establish the following estimates
[TABLE]
The first estimate immediatly follows from (2.6) together with Lemma 2.1. From ’s decomposition (2.1) we have
[TABLE]
where the last inequality follows from Lemma 2.1 and the fact that .
We now compare the spectra of matrices and by relying on Bauer and Fike’s theorem [10, Theorem 6.3.2]: for every , there exists a component of vector such that
[TABLE]
where follows from the second estimate in (3.1) and from the spectral norm estimate. Notice that the majorization above is uniform for . The first part of the theorem is proved. Finally,
[TABLE]
The estimate (1.5) finally follows from the first estimate in (3.1).
4. Heuristics at critical scaling, non-homogeneous systems and non-gaussian entries
4.1. A heuristics at the critical scaling
We provide here a heuristics to compute the probability that a solution is feasible at critical scaling .
Heuristics 4.1**.**
The probability that a solution is feasible at the critical scaling is asymptotically given by
[TABLE]
In Figure 2, we compare the heuristics with results from simulations.
Arguments.
Consider
[TABLE]
Following Geman and Hwang [8, Lemma A.1], one could prove that and are asymptotically independent centered Gaussian random variables, each with variance one. We thus approximate the quantity by a Gaussian random variable with distribution and set
[TABLE]
where the ’s are i.i.d. . Denote by then
[TABLE]
Recall that standard extreme value convergence results for Gaussian i.i.d. random variables yield
[TABLE]
where is defined in (2.3). Denote by then
[TABLE]
Notice that
[TABLE]
Hence
[TABLE]
We finally end up with the announced approximation
[TABLE]
Remark 4.1*.*
A rougher approximation would have been to set with and to drop the next term in the heuristics but this would have resulted in the following approximation
[TABLE]
which is worst than , as illustrated in Figure 2.
Approximation in (4.3) may look doubtful, especially because the convergence (4.2) is used for growing . Since it is well-known that convergence in distribution might not capture the convergence of the tails, one may want to switch to the regime of large deviations. We rely on computations made by Vivo [15] to confirm that the approximation is legitimate.
The following large deviations estimate is provided in [15, Eq. (52)]:
[TABLE]
which yields the approximation
[TABLE]
On the other hand, by classical extreme value theory,
[TABLE]
Now, in order to extend the validity of (4.5) for , we consider simultaneously the approximation (4.4) for and (4.5) for , that is
[TABLE]
Equating both exponentials yields
[TABLE]
This gives us the following rule of thumb: one may apply (4.5) if . This condition is fulfilled for .
∎
4.2. Positivity for a non-homogeneous linear system
The results developed so far for the system (1.1) extend to a non-homogeneous (NH) linear system where is replaced by a deterministic vector with slight modifications. In particular, we identify a regime where feasibility and stability occur simultaneously.
Denote by a deterministic vector with positive components and consider the linear system
[TABLE]
Introduce the notations
[TABLE]
Assume that there exist independent from such that eventually
[TABLE]
Then
Theorem 4.2** (Feasibility - NH case).**
Let and denote by . Let be the solution of (4.6).
- (1)
If there exists such that eventually then 2. (2)
If there exists such that eventually then
Remark 4.3*.*
Contrary to the homogeneous system where there is a sharp transition at , the situation is not as clean-cut here and there is a buffer zone
[TABLE]
in which the study of the feasibility is not clear. This buffer zone is illustrated in Figure 3.
In Figure 3, we illustrate the transition toward feasibility for a non-homogeneous system (4.6) in the case where deterministic vector is equally distributed over , i.e.
[TABLE]
We introduce the quantities
[TABLE]
As one may notice, the transition region is wider than in the homogeneous case.
Elements of proof.
We have
[TABLE]
where the ’s are i.i.d. . One can check by carefully reading the proof of Lemma 2.1 that the conclusions of the lemma apply to . In particular, one may check that Proposition 2.4 holds uniformly in in the non-homogeneous case. Denote by , then
[TABLE]
The first statement of the theorem follows. Similarly,
[TABLE]
Proof of Theorem 4.2 is completed. ∎
A non homogeneous system (4.6) is associated to the following Lotka-Volterra system
[TABLE]
for whose jacobian at equilibrium is still given by (1.4).
Theorem 4.4** (Stability - NH case).**
Let be the solution of (4.6) and assume that
[TABLE]
Denote by the spectrum of . Then for every ,
[TABLE]
4.3. Beyond the Gaussian case
The results presented so far heavily rely on the Gaussianity of the entries. A closer look at ’s components reveals that Gaussianity plays an important role at three levels:
[TABLE]
- (1)
Gaussian entries immediatly imply that the ’s are independent standard Gaussian random variables, for which the study of the extrema is standard.
In the case where the entries are not Gaussian any more, the ’s are no longer Gaussian but this issue can easily be circumvented since by the CLT the ’s converge in distribution to a standard Gaussian. The extreme value study of such families of ’s has been carried out in [2, Propositions 2 & 3].
- (2)
The study of the extreme values of in this article relies on the sub-Gaussiannity of which is a consequence of Gaussian concentration for Lipschitz functionals. 2. (3)
Poincaré’s inequality is used to prove that goes to zero in probability, which is crucial to establish Lemma 2.1.
If the distribution of the entries is strongly log-concave in the sense of [16, Eq. (3.48)], then [16, Theorem 3.16] yields the sub-Gaussiannity of together with Poincaré’s inequality. In particular, Theorems 1.1 and 1.4 hold verbatim for entries i.i.d., centered with variance one and whose distribution is strongly log-concave.
The case of bounded and/or discrete entries is not covered and remains open although the simulations (see Figure 4) indicate that a similar phase transition occurs.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Allesina and S. Tang. The stability–complexity relationship at age 40: a random matrix perspective. Population Ecology , 57(1):63–75, 2015.
- 2[2] C. W. Anderson, S. G. Coles, and J. Hüsler. Maxima of poisson-like variables and related triangular arrays. The Annals of Applied Probability , pages 953–971, 1997.
- 3[3] Z. D. Bai and J. W. Silverstein. Spectral analysis of large dimensional random matrices . Springer Series in Statistics. Springer, New York, second edition, 2010.
- 4[4] S. Boucheron, G. Lugosi, and P. Massart. Concentration Inequalities: A Nonasymptotic Theory of Independence . Oxford University Press, 2013.
- 5[5] M. Dougoud, L. Vinckenbosch, R. P. Rohr, L.-F. Bersier, and C. Mazza. The feasibility of equilibria in large ecosystems: A primary but neglected concept in the complexity-stability debate. P Lo S computational biology , 14(2):e 1005988, 2018.
- 6[6] M. R. Gardner and W. R. Ashby. Connectance of large dynamic (cybernetic) systems: critical values for stability. Nature , 228(5273):784, 1970.
- 7[7] S. Geman. The spectral radius of large random matrices. Ann. Probab. , 14(4):1318–1328, 1986.
- 8[8] S. Geman and C.-R. Hwang. A chaos hypothesis for some large systems of random equations. Z. Wahrsch. Verw. Gebiete , 60(3):291–314, 1982.
