Positive solutions for large random linear systems

Pierre Bizeul (ENS Paris Saclay); Jamal Najim (LIGM)

arXiv:1904.04559·math.PR·April 10, 2019·ICASSP

Positive solutions for large random linear systems

Pierre Bizeul (ENS Paris Saclay), Jamal Najim (LIGM)

PDF

TL;DR

This paper analyzes the positivity and stability of solutions to large random linear systems with Gaussian entries, revealing a sharp phase transition at a critical scaling factor and implications for biological community models.

Contribution

It establishes a precise phase transition threshold for positivity in large Gaussian linear systems and links this to the stability of associated Lotka-Volterra models.

Findings

01

Phase transition at lpha* s pprox n

02

Below threshold, solutions have negative components with high probability

03

Above threshold, solutions are positive and the system is stable

Abstract

Consider a large linear system where $A_{n}$ is a $n \times n$ matrix with independent real standard Gaussian entries, $1_{n}$ is a $n \times 1$ vector of ones and with unknown the $n \times 1$ vector $x_{n}$ satisfying $x_{n} = 1_{n} + \frac{1}{α _{n} n} A_{n} x_{n} .$ We investigate the (componentwise) positivity of the solution $x_{n}$ depending on the scaling factor $α_{n}$ as the dimension $n$ goes to $\infty$ . We prove that there is a sharp phase transition at the threshold $α_{n}^{*} = 2 lo g n$ : below the threshold ( $α_{n} ≪ 2 lo g n$ ), $x_{n}$ has negative components with probability tending to 1 while above ( $α_{n} ≫ 2 lo g n$ ), all the vector's components are eventually positive with probability tending to 1. At the critical scaling $α_{n}^{*}$ , we provide a heuristics…

Tables1

Table 1. Table 1 . The quantity 1 α n ∗ = 1 2 log ⁡ n 1 superscript subscript 𝛼 𝑛 1 2 𝑛 \frac{1}{\alpha_{n}^{*}}=\frac{1}{\sqrt{2\log n}} vanishes extremely slowly as n 𝑛 n increases.

$n$	$10^{2}$	$10^{3}$	$10^{4}$	$10^{5}$	$10^{6}$
$\frac{1}{α_{n}^{*}}$	0.33	0.27	0.23	0.21	0.19

Equations188

x_{n} = 1_{n} + \frac{1}{α _{n} n} A_{n} x_{n} .

x_{n} = 1_{n} + \frac{1}{α _{n} n} A_{n} x_{n} .

J (x_{n}) = diag (x_{n}) (- I_{n} + \frac{A _{n}}{α _{n} n})

J (x_{n}) = diag (x_{n}) (- I_{n} + \frac{A _{n}}{α _{n} n})

x_{n} = 1_{n} + \frac{1}{α _{n} n} A_{n} x_{n},

x_{n} = 1_{n} + \frac{1}{α _{n} n} A_{n} x_{n},

x_{n} = (I_{n} - \frac{A _{n}}{α _{n} n})^{- 1} 1_{n} with x_{k} = e_{k}^{*} (I_{n} - \frac{A _{n}}{α _{n} n})^{- 1} 1_{n},

x_{n} = (I_{n} - \frac{A _{n}}{α _{n} n})^{- 1} 1_{n} with x_{k} = e_{k}^{*} (I_{n} - \frac{A _{n}}{α _{n} n})^{- 1} 1_{n},

\frac{d x _{k} ( t )}{d t} = x_{k} (t) 1 - x_{k} (t) + \frac{1}{α _{n} n} ℓ \in [n] \sum A_{k ℓ} x_{ℓ} (t) for k \in [n],

\frac{d x _{k} ( t )}{d t} = x_{k} (t) 1 - x_{k} (t) + \frac{1}{α _{n} n} ℓ \in [n] \sum A_{k ℓ} x_{ℓ} (t) for k \in [n],

(x_{k} - 1)_{k \in [M]} D n \to \infty Z \sim N (0, σ_{α}^{2} I_{M}),

(x_{k} - 1)_{k \in [M]} D n \to \infty Z \sim N (0, σ_{α}^{2} I_{M}),

P {x_{k} > 0, k \in [M]} n \to \infty (\int_{σ_{α}^{- 1}}^{\infty} \frac{e ^{- x^{2} /2}}{2 π} d x)^{M} \Rightarrow P {x_{k} > 0, k \in [n]} n \to \infty 0 .

P {x_{k} > 0, k \in [M]} n \to \infty (\int_{σ_{α}^{- 1}}^{\infty} \frac{e ^{- x^{2} /2}}{2 π} d x)^{M} \Rightarrow P {x_{k} > 0, k \in [n]} n \to \infty 0 .

α_{n}^{*} = 2 lo g n

α_{n}^{*} = 2 lo g n

P {k \in [n] min x_{k} > 0} n \to \infty 0 .

P {k \in [n] min x_{k} > 0} n \to \infty 0 .

P {k \in [n] min x_{k} > 0} n \to \infty 1 .

P {k \in [n] min x_{k} > 0} n \to \infty 1 .

P {k \in [n] min x_{k} > 0} \approx 1 - \frac{e}{4 π lo g n} + \frac{e}{8 π lo g n} as n \to \infty .

P {k \in [n] min x_{k} > 0} \approx 1 - \frac{e}{4 π lo g n} + \frac{e}{8 π lo g n} as n \to \infty .

J (x_{n}) = diag (x_{n}) (- I_{n} + \frac{A _{n}}{α _{n} n})

J (x_{n}) = diag (x_{n}) (- I_{n} + \frac{A _{n}}{α _{n} n})

λ \in S_{n} max k \in [n] min ∣ λ + x_{k} ∣ P n \to \infty 0 .

λ \in S_{n} max k \in [n] min ∣ λ + x_{k} ∣ P n \to \infty 0 .

λ \in S_{n} max Re (λ) \leq - (1 - ℓ^{+}) + o_{P} (1) .

λ \in S_{n} max Re (λ) \leq - (1 - ℓ^{+}) + o_{P} (1) .

x_{n} = (x_{k})_{k \in [n]} = (I_{n} - \frac{A _{n}}{α _{n} n})^{- 1} 1_{n} = Q_{n} 1_{n},

x_{n} = (x_{k})_{k \in [n]} = (I_{n} - \frac{A _{n}}{α _{n} n})^{- 1} 1_{n} = Q_{n} 1_{n},

x_{k}

x_{k}

Z_{k} = e_{k}^{*} (n^{- 1/2} A) 1 = \frac{1}{n} i = 1 \sum n A_{k i} and R_{k} = e_{k}^{*} (n^{- 1/2} A)^{2} Q 1 .

Z_{k} = e_{k}^{*} (n^{- 1/2} A) 1 = \frac{1}{n} i = 1 \sum n A_{k i} and R_{k} = e_{k}^{*} (n^{- 1/2} A)^{2} Q 1 .

M_{n} = k \in [n] max Z_{k}, \overset{ˇ}{M}_{n} = k \in [n] min Z_{k}, α_{n}^{*} = 2 lo g n and β_{n}^{*} = α_{n}^{*} - \frac{1}{2 α _{n}^{*}} lo g (4 π lo g n) .

M_{n} = k \in [n] max Z_{k}, \overset{ˇ}{M}_{n} = k \in [n] min Z_{k}, α_{n}^{*} = 2 lo g n and β_{n}^{*} = α_{n}^{*} - \frac{1}{2 α _{n}^{*}} lo g (4 π lo g n) .

P {α_{n}^{*} (M_{n} - β_{n}^{*}) \leq x}

P {α_{n}^{*} (M_{n} - β_{n}^{*}) \leq x}

P {α_{n}^{*} (\overset{ˇ}{M}_{n} + β_{n}^{*}) \geq - x}

\left\{\begin{array}[]{lcl}\min_{k\in[n]}x_{k}&\geq&1+\frac{1}{\alpha}\check{M}+\frac{1}{\alpha^{2}}\min_{k\in[n]}R_{k}\ ,\\ \min_{k\in[n]}x_{k}&\leq&1+\frac{1}{\alpha}\check{M}+\frac{1}{\alpha^{2}}\max_{k\in[n]}R_{k}\,.\end{array}\right.

\left\{\begin{array}[]{lcl}\min_{k\in[n]}x_{k}&\geq&1+\frac{1}{\alpha}\check{M}+\frac{1}{\alpha^{2}}\min_{k\in[n]}R_{k}\ ,\\ \min_{k\in[n]}x_{k}&\leq&1+\frac{1}{\alpha}\check{M}+\frac{1}{\alpha^{2}}\max_{k\in[n]}R_{k}\,.\end{array}\right.

k \in [n] min x_{k} \geq 1 + \frac{α _{n}^{*}}{α _{n}} (\frac{M ˇ + β _{n}^{*}}{α _{n}^{*}} - \frac{β _{n}^{*}}{α _{n}^{*}} + \frac{min _{k \in [n]} R _{k}}{α _{n}^{*} α _{n}}) = 1 + \frac{α _{n}^{*}}{α _{n}} (- 1 + o_{P} (1) + \frac{min _{k \in [n]} R _{k}}{α _{n}^{*} α _{n}}),

k \in [n] min x_{k} \geq 1 + \frac{α _{n}^{*}}{α _{n}} (\frac{M ˇ + β _{n}^{*}}{α _{n}^{*}} - \frac{β _{n}^{*}}{α _{n}^{*}} + \frac{min _{k \in [n]} R _{k}}{α _{n}^{*} α _{n}}) = 1 + \frac{α _{n}^{*}}{α _{n}} (- 1 + o_{P} (1) + \frac{min _{k \in [n]} R _{k}}{α _{n}^{*} α _{n}}),

k \in [n] min x_{k} \leq 1 + \frac{α _{n}^{*}}{α _{n}} (- 1 + o_{P} (1) + \frac{max _{k \in [n]} R _{k}}{α _{n}^{*} α _{n}}) .

k \in [n] min x_{k} \leq 1 + \frac{α _{n}^{*}}{α _{n}} (- 1 + o_{P} (1) + \frac{max _{k \in [n]} R _{k}}{α _{n}^{*} α _{n}}) .

\frac{max _{k \in [n]} R _{k}}{α _{n} 2 lo g n} P n \to \infty 0 and \frac{min _{k \in [n]} R _{k}}{α _{n} 2 lo g n} P n \to \infty 0 .

\frac{max _{k \in [n]} R _{k}}{α _{n} 2 lo g n} P n \to \infty 0 and \frac{min _{k \in [n]} R _{k}}{α _{n} 2 lo g n} P n \to \infty 0 .

φ (x) = {10 if x \in [0, 2 + η] if x \geq 3,

φ (x) = {10 if x \in [0, 2 + η] if x \geq 3,

φ_{n} := φ (s_{n}) = φ (s (n^{- 1/2} A)) .

φ_{n} := φ (s_{n}) = φ (s (n^{- 1/2} A)) .

R_{k} = φ_{n} R_{k} .

R_{k} = φ_{n} R_{k} .

{\mathcal{H}}(A)=\left(\begin{array}[]{cc}0&A\\ A^{*}&0\end{array}\right)\,.

{\mathcal{H}}(A)=\left(\begin{array}[]{cc}0&A\\ A^{*}&0\end{array}\right)\,.

R_{k} (A) - R_{k} (B) \leq K ∥ A - B ∥_{F},

R_{k} (A) - R_{k} (B) \leq K ∥ A - B ∥_{F},

φ_{n} ∥ Q ∥ \leq φ_{n} (1 - \frac{1}{α} n^{- \frac{1}{2}} A)^{- 1} \leq \frac{1}{1 - 3 α ^{- 1}} \leq 3

φ_{n} ∥ Q ∥ \leq φ_{n} (1 - \frac{1}{α} n^{- \frac{1}{2}} A)^{- 1} \leq \frac{1}{1 - 3 α ^{- 1}} \leq 3

∥\nabla R_{k} (A) ∥ = ij \sum \partial_{ij} R_{k} (A)^{2} \leq K .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Positive solutions for large random linear systems

Pierre Bizeul, Jamal Najim

Abstract.

Consider a large linear system where $A_{n}$ is a $n\times n$ matrix with independent real standard Gaussian entries, $\boldsymbol{1}_{n}$ is a $n\times 1$ vector of ones and with unknown the $n\times 1$ vector $\boldsymbol{x}_{n}$ satisfying

[TABLE]

We investigate the (componentwise) positivity of the solution $\boldsymbol{x}_{n}$ depending on the scaling factor $\alpha_{n}$ as the dimension $n$ goes to $\infty$ . We prove that there is a sharp phase transition at the threshold $\alpha^{*}_{n}=\sqrt{2\log n}$ : below the threshold ( $\alpha_{n}\ll\sqrt{2\log n}$ ), $\boldsymbol{x}_{n}$ has negative components with probability tending to 1 while above ( $\alpha_{n}\gg\sqrt{2\log n}$ ), all the vector’s components are eventually positive with probability tending to 1. At the critical scaling $\alpha^{*}_{n}$ , we provide a heuristics to evaluate the probability that $\boldsymbol{x}_{n}$ is positive.

Such linear systems arise as solutions at equilibrium of large Lotka-Volterra systems of differential equations, widely used to describe large biological communities with interactions such as foodwebs for instance.

In the domaine of positivity of the solution $\boldsymbol{x}_{n}$ , that is when $\alpha_{n}\gg\sqrt{2\log n}$ , we establish that the Lotka-Volterra system of differential equations whose solution at equilibrium is precisely $\boldsymbol{x}_{n}$ is stable in the sense that its jacobian

[TABLE]

has all its eigenvalues with negative real part with probability tending to one.

Our results shed a new light and complement the understanding of feasibility and stability issues for large biological communities with interaction.

Key words and phrases:

Linear systems; large random matrices; Gaussian concentration; Lotka-Volterra equations.

2010 Mathematics Subject Classification:

Primary 15B52, 60G70, Secondary 60B20, 92D40

1. Introduction

Denote by $A_{n}$ a $n\times n$ matrix with independent Gaussian ${\mathcal{N}}(0,1)$ entries and by $\alpha_{n}$ a positive sequence. We are interested in the componentwise positivity of the $n\times 1$ vector $\boldsymbol{x}_{n}$ , solution of the linear system

[TABLE]

where $\boldsymbol{1}_{n}$ is the $n\times 1$ vector with components 1.

It is well-known since Geman [7] that the spectral radius of $\frac{A_{n}}{\sqrt{n}}$ almost surely (a.s.) converges to 1, so that matrix $\left(I_{n}-\frac{A_{n}}{\alpha_{n}\sqrt{n}}\right)$ is eventually invertible as long as $\alpha_{n}\gg 1$ . In this case, vector $\boldsymbol{x}_{n}=(x_{k})_{k\in[n]}$ where $[n]=\{1,\cdots,n\}$ writes

[TABLE]

where $\boldsymbol{e}_{k}$ is the $n\times 1$ canonical vector and $B^{*}$ denotes the transconjugate of matrix $B$ (or simply its transpose if $B$ is real).

The positivity of the $x_{k}$ ’s is a key issue in the study of Large Lotka-Volterra systems, widely used in mathematical biology and ecology to model populations with interactions.

Consider for instance a given foodweb and denote by $\boldsymbol{x}_{n}(t)=(x_{k}(t))_{k\in[n]}$ the vector of abundances of the various species within the foodweb at time $t$ . A standard way to connect the various abundances is via a Lotka-Volterra (LV) system of equations that writes

[TABLE]

where the interactions $(A_{k\ell})$ can be modeled as random in the absence of any prior information. Here, the $A_{k\ell}$ ’s are assumed to be i.i.d. ${\mathcal{N}}(0,1)$ . At the equilibrium $\frac{d\boldsymbol{x}_{n}}{dt}=0$ , the abundance vector $\boldsymbol{x}_{n}$ is solution of (1.1) and a key issue is the existence of a feasible solution, that is a solution $\boldsymbol{x}_{n}$ where all the $x_{k}$ ’s are positive. Dougoud et al. [5], based on Geman and Hwang [8], proved that a feasible solution is very unlikely to exist if $\alpha_{n}\equiv\alpha$ is a constant. In fact, the CLT proved in [8] asserts that for any fixed number $M$ of components

[TABLE]

where $\xrightarrow[]{\mathcal{D}}$ (resp. $\xrightarrow[]{\mathcal{P}}$ ) stands for the convergence in distribution (resp. in probability) and where $\sigma_{\alpha}^{2}={\mathcal{O}}(1)$ . As an important consequence, vectors $\boldsymbol{x}_{n}$ with positive components will become extremely rare since

[TABLE]

In this article, we consider a growing scaling factor $\alpha_{n}\to\infty$ and study the positivity of $\boldsymbol{x}_{n}$ ’s components in relation with $\alpha_{n}$ . We find that there exists a critical threshold

[TABLE]

below which feasible solutions exist with vanishing probability and above which feasible solutions are more and more likely to exist. More precisely, we prove the following:

Theorem 1.1 (Feasibility).

Let $\alpha_{n}\xrightarrow[n\to\infty]{}\infty$ and denote by $\alpha_{n}^{*}=\sqrt{2\log n}$ . Let $\boldsymbol{x}_{n}=(x_{k})_{k\in[n]}$ be the solution of (1.1).

(1)

If there exists $\varepsilon>0$ such that eventually $\alpha_{n}\leq(1-\varepsilon)\alpha_{n}^{*}$ then

[TABLE] 2. (2)

If there exists $\varepsilon>0$ such that eventually $\alpha_{n}\geq(1+\varepsilon)\alpha_{n}^{*}$ then

[TABLE]

Proof of Theorem 1.1 is based on an analysis of the order of magnitude of the extreme values of the $x_{k}$ ’s, which relies on Gaussian concentration of Lipschitz functionals whose argument is matrix $A_{n}$ .

*Remark 1.2**.*

In Figure 1, we illustrate the transition toward feasibility depending on the scaling $\alpha_{N}(\kappa)=\kappa\sqrt{\log(N)}$ . For $\kappa\in[0.5,2.5]$ , we plot the proportion of feasible solutions $\boldsymbol{x}_{N}(\kappa)$ obtained after 500 simulations. The transition occurs at the optimal scaling $\alpha_{N}^{*}=\sqrt{2\log(N)}$ corresponding to $\kappa=\sqrt{2}$ .

*Remark 1.3**.*

Notice that the convergence of $\frac{1}{\alpha_{n}^{*}}$ to zero is extremely slow, as shown in Table 1, and could easily be mistaken with some constant scaling $\sigma<1$ where $\sigma=\frac{1}{\alpha_{n}^{*}}$ .

To complement the picture, we provide the following heuristics at the critical scaling $\alpha_{n}^{*}=\sqrt{2\log n}$ :

[TABLE]

Aside from the question of feasibility arises the question of stability : for a complex system, how likely a perturbation of the solution $\boldsymbol{x}_{n}$ at equilibrium will return to the equilibrium? Gardner and Ashby [6] considered stability issues of complex systems connected at random. Based on the circular law for large matrices with i.i.d. entries, May [13] provided a complexity/stability criterion and motivated the systematic use of large random matrix theory in the study of foodwebs, see for instance Allesina et al. [1]. Recently, Stone [14] and Gibbs et al. [9] revisited the relation between feasibility and stability.

We complement the information of Theorem 1.1 by adressing the question of stability in the context of a Lotka-Volterra system (1.2) and prove that under the first condition of the theorem feasibility and stability occur simultaneously.

Recall that the solution at equilibrium $\boldsymbol{x}_{n}$ is stable if the Jacobian matrix ${\mathcal{J}}$ of the Lotka-Volterra system evaluated at $\boldsymbol{x}_{n}$ , that is

[TABLE]

has all its eigenvalues with negative real part.

Theorem 1.4 (Stability).

Let $\boldsymbol{x}_{n}=(x_{k})_{k\in[n]}$ be the solution of (1.1). Denote by $\boldsymbol{\ell}^{+}=\limsup_{n\to\infty}\frac{\sqrt{2\log n}}{\alpha_{n}}$ and assume that $\boldsymbol{\ell}^{+}<1$ . Denote by ${\mathcal{S}}_{n}$ the spectrum of ${\mathcal{J}}(\boldsymbol{x}_{n})$ and let $\lambda\in{\mathcal{S}}_{n}$ . Then

[TABLE]

Moreover,

[TABLE]

Proof of Theorem 1.4 relies on standard perturbation results from linear algebra and on Theorem 1.1.

Organization of the paper

Proof of Theorem 1.1 is provided in Section 2. Theorem 1.4 is proved in Section 3. In Section 4, elements to bear out heuristics (1.3) are provided. We also formulate some concluding remarks for non-homogeneous linear systems where vector $\boldsymbol{1}_{n}$ is replaced by a positive vector $\boldsymbol{r}_{n}$ and briefly mention possible extensions to non-Gaussian entries.

Acknowlegments

JN thanks Christian Mazza for introducing him to the study of large LV systems in theoretical ecology. The authors thank François Massol and Olivier Guédon for fruitful discussions.

2. Positive solutions: proof of Theorem 1.1

We will use the following notations for the various norms at stake: if $\boldsymbol{v}$ is a vector then $\|\boldsymbol{v}\|$ stands for its euclidian norm; if $A$ is a matrix then $\|A\|$ stands for its spectral norm and $\|A\|_{F}=\sqrt{\sum_{ij}|A_{ij}|^{2}}$ for its Frobenius norm. Let $\varphi$ be a function from $\Sigma=\mathbb{R}$ or $\mathbb{C}$ to $\mathbb{C}$ then $\|\varphi\|_{\infty}=\sup_{x\in\Sigma}|\varphi(x)|$ .

2.1. Some preparation and strategy of the proof

Denote by $Q_{n}=\left(I_{n}-\frac{A_{n}}{\alpha_{n}\sqrt{n}}\right)^{-1}$ the resolvent and by $s(B)$ the largest singular value of a given matrix $B$ . Then it is well known that almost surely $s_{n}:=s(n^{-1/2}A_{n})\xrightarrow[n\to\infty]{}2$ (see for instance [3, Chapter 5]) hence $s\left(\frac{1}{\alpha_{n}\sqrt{n}}A_{n}\right)\xrightarrow[n\to\infty]{}0$ . In particular, the solution

[TABLE]

with $I_{n}$ the $n\times n$ identity, is uniquely defined almost surely. In order to study the minimum of $\boldsymbol{x}_{n}$ ’s components, we partially unfold the above resolvent (in the sequel, we will simply denote $A,\alpha,\boldsymbol{1},Q$ instead of $A_{n},\alpha_{n},\boldsymbol{1}_{n},Q_{n}$ ) and write:

[TABLE]

where

[TABLE]

Notice in particular that the $Z_{k}$ ’s are i.i.d. standard Gaussian. Before focusing on the analysis of the remaining term $R_{k}$ , we recall standard results for extreme values of Gaussian random variables.

Extreme values of Gaussian random variables

Consider the sequence $(Z_{k})$ of standard Gaussian i.i.d. random variables and let

[TABLE]

Denote by $G(x)=e^{-e^{-x}}$ the cumulative distribution of a Gumbel distributed random variable.

Then the following results are standard, see for instance [12, Theorem 1.5.3]: for all $x\in\mathbb{R}$

[TABLE]

Strategy of the proof

Eq. (2.1) immediatly yields

[TABLE]

We rewrite the first equation as

[TABLE]

where we have used the fact that $(\alpha^{*}_{n})^{-1}(\check{M}+\beta^{*}_{n})=o_{P}(1)$ . Similarly,

[TABLE]

The theorem will then follow from the following lemma.

Lemma 2.1.

The following convergence holds

[TABLE]

Proof of Lemma 2.1 requires a careful analysis of the order of magnitude of the extreme values of the remaining term $(R_{k})_{k\in[n]}$ . It is postponed to Section 2.3.

2.2. Lipschitz property and tightness of $R_{k}(A)$

Let $\varphi:\mathbb{R}^{+}\to[0,1]$ be a smooth function with values

[TABLE]

and strictly decreasing from $1$ to zero as $x$ goes from $2+\eta$ to $3$ . Recall that $s_{n}=s(n^{-1/2}A)$ is the largest singular value of the normalized matrix $n^{-1/2}A$ and denote by

[TABLE]

Notice that $\mathbb{P}\{\varphi_{n}<1\}=\mathbb{P}\{s_{n}>2+\eta\}\xrightarrow[n\to\infty]{}0$ (as a by-product of the a.s. convergence of $s_{n}$ to $2$ ).

Instead of directly working with $R_{k}$ we introduce the truncated quantity

[TABLE]

For a given $n\times n$ matrix $A$ , we may consider its $2n\times 2n$ hermitized matrix ${\mathcal{H}}(A)$ defined as

[TABLE]

Recall that the singular values of $A$ together with their opposites are the eigenvalues of ${\mathcal{H}}(A)$ .

We prove hereafter that as a function of the entries of matrix $A$ , the function $A\mapsto\widetilde{R}_{k}(A)$ is lipschitz.

Lemma 2.2.

Let $\widetilde{R}_{k}$ be given by (2.7), then the function $A\mapsto{\widetilde{R}}_{k}(A)$ is Lipschitz, i.e.

[TABLE]

where $\|A\|_{F}$ is the Frobenius norm and $K$ is a constant independent from $k$ and $n$ .

Proof.

Notice that $\varphi(s_{n})=0$ and $\varphi^{\prime}(s_{n})=0$ for $s_{n}\geq 3$ , which implies that one may consider the bound $s_{n}\leq 3$ in the following computations, for $\widetilde{R}_{k}$ or its derivatives would be zero otherwise. Recall the definition of the resolvent $Q=\left(I-\frac{A}{\alpha\sqrt{n}}\right)^{-1}$ then $Q^{-1}Q=I$ which yields $Q=I+\frac{A}{\alpha\sqrt{n}}Q$ from which we deduce that

[TABLE]

for $n$ large enough.

We first consider a matrix $A$ such that ${\mathcal{H}}(A)$ has simple spectrum (i.e. with $2n$ distinct eigenvalues, each with multiplicity 1). We denote by $\partial_{ij}=\frac{\partial\ }{\partial A_{ij}}$ and prove that the vector $\nabla{\widetilde{R}}_{k}(A)=\left(\partial_{ij}\widetilde{R}(A),\ i,j\in[n]\right)$ satisfies

[TABLE]

To lighten the notations, we may drop the dependence of $\widetilde{R}_{k}$ in $A$ . We begin by computing

[TABLE]

Straightforward computations yield

[TABLE]

It remains to compute $\partial_{ij}\varphi_{n}=\varphi^{\prime}(s_{n})\partial_{ij}s_{n}$ . Recall that ${\mathcal{H}}(A)$ has a simple spectrum and notice that $A\mapsto s_{n}(A)$ is differentiable. In fact, since $s_{n}$ is simple, it is a simple root of the characteristic polynomial. In particular, it is not a root of its derivative and one can use the implicit function theorem to conclude. Let $\boldsymbol{u}$ and $\boldsymbol{v}$ be respectively the left and right normalized singular vectors associated to $s(A)$ . Then

[TABLE]

moreover $\boldsymbol{w}$ is (up to a scaling factor) the unique eigenvector of $s(A)$ since $s(A)$ has multiplicity one by assumption. We can now apply [10, Theorem 6.3.12] to compute $s_{n}$ ’s derivative:

[TABLE]

(recall that all the considered vectors are real).

We first handle the term $T_{1,ij}$ .

[TABLE]

We now handle the term $T_{2,ij}$ .

[TABLE]

The term $T_{3,ij}$ can be handled similarly and one can prove

[TABLE]

Gathering all these estimates, we finally obtain the desired bound:

[TABLE]

where $K$ neither depends on $k$ nor on $n$ .

Having proved a local estimate over $\|\nabla\widetilde{R}_{k}(A)\|$ for each matrix $A$ such that ${\mathcal{H}}(A)$ has simple spectrum, we now establish the Lipschitz estimate (2.8) for two such matrices $A,B$ .

Let $A,B$ such that ${\mathcal{H}}(A)$ and ${\mathcal{H}}(B)$ have simple spectrum and consider $A_{t}=(1-t)A+tB$ for $t\in[0,1]$ . Notice first that the continuity of the eigenvalues implies that there exists $\delta>0$ sufficiently small such that ${\mathcal{H}}(A_{t})$ has a simple spectrum for $t\leq\delta$ and $t\geq 1-\delta$ . To go beyond $[0,\delta)\cup(1-\delta,1]$ and prove that ${\mathcal{H}}(A_{t})$ has simple spectrum for the entire interval $[0,1]$ except maybe for a finite number of points, we rely on the argument in Kato [11, Chapter 2.1] which states that apart from a finite number of $t_{\ell}$ ’s:

[TABLE]

the number of eigenvalues of ${\mathcal{H}}(A_{t})$ remains constant for $t\in[0,1]$ and $t\neq t_{\ell},\ell\in[L]$ . Since ${\mathcal{H}}(A_{t})$ has simple spectrum for $t\in[0,\delta)\cup(1-\delta,1]$ , it has simple spectrum for all $t\notin\{t_{\ell},\ell\in[L]\}$ .

We can now proceed:

[TABLE]

By iterating this process, we obtain

[TABLE]

hence the Lipschitz property along the segment $[A,B]$ for ${\mathcal{H}}(A)$ and ${\mathcal{H}}(B)$ with simple spectrum.

The general property follows by density of such matrices in the set of $n\times n$ matrices and by continuity of $A\mapsto\widetilde{R}_{k}(A)$ . Let $A,B$ be given and $A_{\varepsilon}\to A$ and $B_{\varepsilon}\to B$ be such that ${\mathcal{H}}(A_{\varepsilon})$ and ${\mathcal{H}}(B_{\varepsilon})$ have simple spectrum then:

[TABLE]

Proof of Lemma 2.2 is completed. ∎

We now use concentration arguments to obtain a bound on $\operatorname{\mathbb{E}}\max_{k\in[n]}({\widetilde{R}}_{k}-\operatorname{\mathbb{E}}{\widetilde{R}}_{k})$ .

Proposition 2.3.

Let $K$ be the constant obtained in Lemma 2.2, then

[TABLE]

Proof.

By applying Tsirelson-Ibragimov-Sudakov inequality [4, Theorem 5.5] to ${\widetilde{R}}_{k}(A)$ with the Lipschitz estimate obtained in Lemma 2.2, we obtain the following exponential estimate:

[TABLE]

for all $\lambda\in\mathbb{R}$ . We can now estimate the expectation of the maximum (we drop the dependence in $A$ ).

[TABLE]

Hence for $\lambda>0$

[TABLE]

Optimizing in $\lambda$ , we obtain $\lambda^{*}=\frac{\sqrt{2\log n}}{K}$ and $\Phi(\lambda^{*})=K\sqrt{2\log n}$ , which is the desired estimate. ∎

Proposition 2.4.

The following estimate holds111Notice that the proof does not rely on the fact that the entries are Gaussian. In particular, we did not use the integration by part formula $\mathbb{E}Xf(X)=\mathbb{E}f^{\prime}(X)$ , only valid for $X\sim{\mathcal{N}}(0,1)$ .:

[TABLE]

uniformly in $k\in[n]$ .

Proof.

Given an almost surely differentiable function $\Psi:\mathbb{R}\to\mathbb{R}$ , we shall use the following Taylor expansion:

[TABLE]

We have

[TABLE]

Notice that $\Psi_{i\ell}(0)$ does not depend on $A_{ki}$ anymore, hence is independent from this random variable. In particular $\mathbb{E}A_{ki}\Psi_{i\ell}(0)=0$ . We denote by $\underline{F}$ a function $F$ evaluated at $sA_{ki}$ , i.e. $\underline{F}=F(sA_{ki})$ . We have

[TABLE]

Recall that $\partial_{ij}\varphi_{n}=\varphi^{\prime}(s_{n})\partial_{ij}s_{n}$ , where $\partial_{ij}s_{n}$ has been computed in (2.10). Denote by $u_{k}=\boldsymbol{u}^{*}\boldsymbol{e}_{k}$ and $v_{i}=\boldsymbol{e}_{i}^{*}\boldsymbol{v}$ and recall that $\|\boldsymbol{v}\|=1$ and $|u_{k}|,|v_{i}|\leq 1$ . We have

[TABLE]

Hence $T_{1}={\mathcal{O}}\left(\alpha\right)$ . Now

[TABLE]

Hence $T_{2}={\mathcal{O}}(\alpha)$ . Finally

[TABLE]

Hence $T_{3}={\mathcal{O}}(1)$ .

We have finally proven that $\mathbb{E}\widetilde{R}_{k}(A_{n})={\mathcal{O}}\left(\alpha\right)$ uniformly in $k$ , which concludes the proof of the lemma. ∎

We are now in position to prove Lemma 2.1.

2.3. Proof of Lemma 2.1

We first establish the convergence for $\max_{k\in[n]}{\widetilde{R}}_{k}(A)-\widetilde{R}_{1}(A)$ . Notice that the r.v. $\max_{k\in[n]}{\widetilde{R}}_{k}(A)-\widetilde{R}_{1}(A)$ is nonnegative hence by Markov inequality,

[TABLE]

Now since the random variables $\widetilde{R}_{1}(A),\dots,\widetilde{R}_{n}(A)$ are exchangeable, $\max_{k\in[n]}\mathbb{E}{\widetilde{R}}_{k}(A)=\operatorname{\mathbb{E}}\widetilde{R}_{1}(A)$ and

[TABLE]

by Proposition 2.3. This implies that

[TABLE]

We now prove that

[TABLE]

By Proposition 2.4, $\mathbb{E}\widetilde{R}_{1}(A)={\mathcal{O}}(\alpha)$ hence $\mathbb{E}\widetilde{R}_{1}(A)/(\alpha\sqrt{2\log(n)})\to 0$ . Applying Poincaré’s inequality to the Lipschitz functional $A\mapsto\widetilde{R}_{1}(A)$ (cf. Lemma 2.2), we can bound $\widetilde{R}_{1}(A)$ ’s variance by $L^{2}$ and obtain

[TABLE]

This yields (2.12). Combining (2.11) and (2.12) finally yields:

[TABLE]

In order to obtain the result for the untilded quantities, we write

[TABLE]

One proves the second assertion similarly, which concludes the proof of Lemma 2.1.

3. Stability: proof of Theorem 1.4

In order to study the stability of large Lotka-Volterra systems, we are led to study the matrix

[TABLE]

We first establish the following estimates

[TABLE]

The first estimate immediatly follows from (2.6) together with Lemma 2.1. From $x_{k}$ ’s decomposition (2.1) we have

[TABLE]

where the last inequality follows from Lemma 2.1 and the fact that $\left(\alpha^{*}_{n}\right)^{-1}(M_{n}-\beta^{*}_{n})\xrightarrow[]{\mathcal{P}}0$ .

We now compare the spectra of matrices ${\mathcal{D}}(\boldsymbol{x}_{n})=-\mathrm{diag}(\boldsymbol{x}_{n})$ and ${\mathcal{J}}(\boldsymbol{x}_{n})$ by relying on Bauer and Fike’s theorem [10, Theorem 6.3.2]: for every $\lambda\in{\mathcal{S}}_{n}$ , there exists a component $x_{k}$ of vector $\boldsymbol{x}_{n}$ such that

[TABLE]

where $(a)$ follows from the second estimate in (3.1) and from the spectral norm estimate. Notice that the majorization above is uniform for $\lambda\in{\mathcal{S}}_{n}$ . The first part of the theorem is proved. Finally,

[TABLE]

The estimate (1.5) finally follows from the first estimate in (3.1).

4. Heuristics at critical scaling, non-homogeneous systems and non-gaussian entries

4.1. A heuristics at the critical scaling

We provide here a heuristics to compute the probability that a solution $\boldsymbol{x}_{n}$ is feasible at critical scaling $\alpha_{n}^{*}=\sqrt{2\log n}$ .

Heuristics 4.1.

The probability that a solution is feasible at the critical scaling $\alpha_{n}^{*}$ is asymptotically given by

[TABLE]

In Figure 2, we compare the heuristics with results from simulations.

Arguments.

Consider

[TABLE]

Following Geman and Hwang [8, Lemma A.1], one could prove that $Z_{k}$ and $R_{k}$ are asymptotically independent centered Gaussian random variables, each with variance one. We thus approximate the quantity $Z_{k}+\frac{R_{k}}{\alpha_{n}^{*}}$ by a Gaussian random variable with distribution ${\mathcal{N}}\left(0,1+\frac{1}{(\alpha_{n}^{*})^{2}}\right)$ and set

[TABLE]

where the $U_{k}$ ’s are i.i.d. ${\mathcal{N}}(0,1)$ . Denote by $\check{M}^{U}_{n}=\min_{k\in[n]}U_{k}$ then

[TABLE]

Recall that standard extreme value convergence results for Gaussian i.i.d. random variables yield

[TABLE]

where $\beta_{n}^{*}$ is defined in (2.3). Denote by $\Theta(\alpha)=\sqrt{1+\alpha^{-2}}$ then

[TABLE]

Notice that

[TABLE]

Hence

[TABLE]

We finally end up with the announced approximation

[TABLE]

*Remark 4.1**.*

A rougher approximation would have been to set $x_{k}\approx 1+\frac{Z_{k}}{\alpha_{n}^{*}}$ with $Z_{k}\sim{\mathcal{N}}(0,1)$ and to drop the next term $\frac{R_{k}}{(\alpha_{n}^{*})^{2}}$ in the heuristics but this would have resulted in the following approximation

[TABLE]

which is worst than $H_{1}(n)$ , as illustrated in Figure 2.

Approximation $(a)$ in (4.3) may look doubtful, especially because the convergence (4.2) is used for growing $x\sim\log(\log n)$ . Since it is well-known that convergence in distribution might not capture the convergence of the tails, one may want to switch to the regime of large deviations. We rely on computations made by Vivo [15] to confirm that the approximation $(a)$ is legitimate.

The following large deviations estimate is provided in [15, Eq. (52)]:

[TABLE]

which yields the approximation

[TABLE]

On the other hand, by classical extreme value theory,

[TABLE]

Now, in order to extend the validity of (4.5) for $x\gg 1$ , we consider simultaneously the approximation (4.4) for $\xi\sim 1$ and (4.5) for $x\gg 1$ , that is

[TABLE]

Equating both exponentials yields

[TABLE]

This gives us the following rule of thumb: one may apply (4.5) if $1\ll x\ll\log n$ . This condition is fulfilled for $x\sim\log(\log n)$ .

∎

4.2. Positivity for a non-homogeneous linear system

The results developed so far for the system (1.1) extend to a non-homogeneous (NH) linear system where $\boldsymbol{1}_{n}$ is replaced by a deterministic $n\times 1$ vector $\boldsymbol{r}_{n}$ with slight modifications. In particular, we identify a regime where feasibility and stability occur simultaneously.

Denote by $\boldsymbol{r}_{n}=(r_{k})$ a $n\times 1$ deterministic vector with positive components and consider the linear system

[TABLE]

Introduce the notations

[TABLE]

Assume that there exist $\rho_{\min},\rho_{\max}$ independent from $n$ such that eventually

[TABLE]

Then

Theorem 4.2 (Feasibility - NH case).

Let $\alpha_{n}\xrightarrow[n\to\infty]{}\infty$ and denote by $\alpha_{n}^{*}=\sqrt{2\log n}$ . Let $\boldsymbol{x}_{n}=(x_{k})_{k\in[n]}$ be the solution of (4.6).

(1)

If there exists $\varepsilon>0$ such that eventually $\alpha_{n}\leq(1-\varepsilon)\frac{\alpha_{n}^{*}\sigma_{\boldsymbol{r}}(n)}{r_{\max}(n)}$ then $\mathbb{P}\left\{\min_{k\in[n]}x_{k}>0\right\}\xrightarrow[n\to\infty]{}0\,.$ 2. (2)

If there exists $\varepsilon>0$ such that eventually $\alpha_{n}\geq(1+\varepsilon)\frac{\alpha_{n}^{*}\sigma_{\boldsymbol{r}}(n)}{r_{\min}(n)}$ then $\mathbb{P}\left\{\min_{k\in[n]}x_{k}>0\right\}\xrightarrow[n\to\infty]{}1\,.$

*Remark 4.3**.*

Contrary to the homogeneous system where there is a sharp transition at $\alpha^{*}_{n}=\sqrt{2\log(n)}$ , the situation is not as clean-cut here and there is a buffer zone

[TABLE]

in which the study of the feasibility is not clear. This buffer zone is illustrated in Figure 3.

In Figure 3, we illustrate the transition toward feasibility for a non-homogeneous system (4.6) in the case where deterministic vector $\boldsymbol{r}_{N}$ is equally distributed over $[1,3]$ , i.e.

[TABLE]

We introduce the quantities

[TABLE]

As one may notice, the transition region is wider than in the homogeneous case.

Elements of proof.

We have

[TABLE]

where the $U_{k}$ ’s are i.i.d. ${\mathcal{N}}(0,1)$ . One can check by carefully reading the proof of Lemma 2.1 that the conclusions of the lemma apply to $R^{\boldsymbol{r}}_{k}$ . In particular, one may check that Proposition 2.4 holds uniformly in $k\in[n]$ in the non-homogeneous case. Denote by $\check{M}=\min_{k\in[n]}U_{k}$ , then

[TABLE]

The first statement of the theorem follows. Similarly,

[TABLE]

Proof of Theorem 4.2 is completed. ∎

A non homogeneous system (4.6) is associated to the following Lotka-Volterra system

[TABLE]

for $k\in[n]$ whose jacobian at equilibrium is still given by (1.4).

Theorem 4.4 (Stability - NH case).

Let $\boldsymbol{x}_{n}=(x_{k})_{k\in[n]}$ be the solution of (4.6) and assume that

[TABLE]

Denote by ${\mathcal{S}}_{n}$ the spectrum of ${\mathcal{J}}(\boldsymbol{x}_{n})$ . Then for every $\lambda\in{\mathcal{S}}_{n}$ ,

[TABLE]

4.3. Beyond the Gaussian case

The results presented so far heavily rely on the Gaussianity of the entries. A closer look at $\boldsymbol{x}_{n}$ ’s components reveals that Gaussianity plays an important role at three levels:

[TABLE]

(1)

Gaussian entries immediatly imply that the $Z_{k}$ ’s are independent standard Gaussian random variables, for which the study of the extrema is standard.

In the case where the entries are not Gaussian any more, the $Z_{k}$ ’s are no longer Gaussian but this issue can easily be circumvented since by the CLT the $Z_{k}$ ’s converge in distribution to a standard Gaussian. The extreme value study of such families of $Z_{k}$ ’s has been carried out in [2, Propositions 2 & 3].

(2)

The study of the extreme values of $(R_{k},k\in[n])$ in this article relies on the sub-Gaussiannity of $\widetilde{R}_{k}(A)$ which is a consequence of Gaussian concentration for Lipschitz functionals. 2. (3)

Poincaré’s inequality is used to prove that $\widetilde{R}_{1}(A)/(\alpha\sqrt{2\log(n)})$ goes to zero in probability, which is crucial to establish Lemma 2.1.

If the distribution of the entries is strongly log-concave in the sense of [16, Eq. (3.48)], then [16, Theorem 3.16] yields the sub-Gaussiannity of $\widetilde{R}_{1}(A)$ together with Poincaré’s inequality. In particular, Theorems 1.1 and 1.4 hold verbatim for entries $(A_{ij})$ i.i.d., centered with variance one and whose distribution is strongly log-concave.

The case of bounded and/or discrete entries is not covered and remains open although the simulations (see Figure 4) indicate that a similar phase transition occurs.

Bibliography16

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] S. Allesina and S. Tang. The stability–complexity relationship at age 40: a random matrix perspective. Population Ecology , 57(1):63–75, 2015.
2[2] C. W. Anderson, S. G. Coles, and J. Hüsler. Maxima of poisson-like variables and related triangular arrays. The Annals of Applied Probability , pages 953–971, 1997.
3[3] Z. D. Bai and J. W. Silverstein. Spectral analysis of large dimensional random matrices . Springer Series in Statistics. Springer, New York, second edition, 2010.
4[4] S. Boucheron, G. Lugosi, and P. Massart. Concentration Inequalities: A Nonasymptotic Theory of Independence . Oxford University Press, 2013.
5[5] M. Dougoud, L. Vinckenbosch, R. P. Rohr, L.-F. Bersier, and C. Mazza. The feasibility of equilibria in large ecosystems: A primary but neglected concept in the complexity-stability debate. P Lo S computational biology , 14(2):e 1005988, 2018.
6[6] M. R. Gardner and W. R. Ashby. Connectance of large dynamic (cybernetic) systems: critical values for stability. Nature , 228(5273):784, 1970.
7[7] S. Geman. The spectral radius of large random matrices. Ann. Probab. , 14(4):1318–1328, 1986.
8[8] S. Geman and C.-R. Hwang. A chaos hypothesis for some large systems of random equations. Z. Wahrsch. Verw. Gebiete , 60(3):291–314, 1982.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Positive solutions for large random linear systems

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

Theorem 1.1** (Feasibility).**

Remark 1.2*.*

Remark 1.3*.*

Theorem 1.4** (Stability).**

Organization of the paper

Acknowlegments

2. Positive solutions: proof of Theorem 1.1

2.1. Some preparation and strategy of the proof

Extreme values of Gaussian random variables

Strategy of the proof

Lemma 2.1**.**

2.2. Lipschitz property and tightness of Rk(A)R_{k}(A)Rk​(A)

Lemma 2.2**.**

Proof.

Proposition 2.3**.**

Proof.

Proposition 2.4**.**

Proof.

2.3. Proof of Lemma 2.1

3. Stability: proof of Theorem 1.4

4. Heuristics at critical scaling, non-homogeneous systems and non-gaussian entries

4.1. A heuristics at the critical scaling

Heuristics 4.1**.**

Arguments.

Remark 4.1*.*

4.2. Positivity for a non-homogeneous linear system

Theorem 4.2** (Feasibility - NH case).**

Remark 4.3*.*

Elements of proof.

Theorem 4.4** (Stability - NH case).**

4.3. Beyond the Gaussian case

Theorem 1.1 (Feasibility).

*Remark 1.2**.*

*Remark 1.3**.*

Theorem 1.4 (Stability).

Lemma 2.1.

2.2. Lipschitz property and tightness of $R_{k}(A)$

Lemma 2.2.

Proposition 2.3.

Proposition 2.4.

Heuristics 4.1.

*Remark 4.1**.*

Theorem 4.2 (Feasibility - NH case).

*Remark 4.3**.*

Theorem 4.4 (Stability - NH case).