On non-negative solutions to large systems of random linear equations

Stefan Landmann; Andreas Engel

arXiv:1904.06977·cond-mat.dis-nn·June 24, 2020

On non-negative solutions to large systems of random linear equations

Stefan Landmann, Andreas Engel

PDF

TL;DR

This paper investigates the conditions under which large random linear systems have non-negative solutions, identifying a sharp threshold based on the ratio of unknowns to equations, with implications for various models.

Contribution

It analytically determines the non-negative solution threshold for large random systems and connects it to known models like perceptron storage and resource competition.

Findings

01

Identifies a sharp transition point for solution existence.

02

Derives the threshold as a function of statistical properties.

03

Validates results with numerical simulations.

Abstract

Systems of random linear equations may or may not have solutions with all components being non-negative. The question is, e.g., of relevance when the unknowns are concentrations or population sizes. In the present paper we show that if such systems are large the transition between these two possibilities occurs at a sharp value of the ratio between the number of unknowns and the number of equations. We analytically determine this threshold as a function of the statistical properties of the random parameters and show its agreement with numerical simulations. We also make contact with two special cases that have been studied before: the storage problem of a perceptron and the resource competition model of MacArthur.

Figures5

Click any figure to enlarge with its caption.

Equations84

⟨ a_{μ i} ⟩ = A, ⟨(a_{μ i} - A)^{2} ⟩ = σ^{2} .

⟨ a_{μ i} ⟩ = A, ⟨(a_{μ i} - A)^{2} ⟩ = σ^{2} .

⟨ b_{i} ⟩ = B, ⟨(b_{i} - B)^{2} ⟩ = \frac{γ ^{2}}{N} .

⟨ b_{i} ⟩ = B, ⟨(b_{i} - B)^{2} ⟩ = \frac{γ ^{2}}{N} .

\overset{a}{^}^{T} x = b,

\overset{a}{^}^{T} x = b,

c_{1} a_{1} + c_{2} a_{2} + \dots + c_{α N} a_{α N}

c_{1} a_{1} + c_{2} a_{2} + \dots + c_{α N} a_{α N}

\overset{a}{^}^{T} x = b with x \geq 0,

\overset{a}{^}^{T} x = b with x \geq 0,

\overset{a}{^} y \geq 0 and b \cdot y < 0.

\overset{a}{^} y \geq 0 and b \cdot y < 0.

∥ y ∥^{2} = i \sum y_{i}^{2} = N

∥ y ∥^{2} = i \sum y_{i}^{2} = N

Ω (\overset{a}{^}, b) := \frac{\int _{- \infty}^{\infty} \prod _{i} d y _{i} δ ( \sum _{i} y _{i}^{2} - N ) Θ ( - \frac{1}{N} \sum _{i} b _{i} y _{i} ) \prod _{μ} Θ ( \frac{1}{N} \sum _{i} a _{μ i} y _{i} )}{\int _{- \infty}^{\infty} \prod _{i} d y _{i} δ ( \sum _{i} y _{i}^{2} - N )} .

Ω (\overset{a}{^}, b) := \frac{\int _{- \infty}^{\infty} \prod _{i} d y _{i} δ ( \sum _{i} y _{i}^{2} - N ) Θ ( - \frac{1}{N} \sum _{i} b _{i} y _{i} ) \prod _{μ} Θ ( \frac{1}{N} \sum _{i} a _{μ i} y _{i} )}{\int _{- \infty}^{\infty} \prod _{i} d y _{i} δ ( \sum _{i} y _{i}^{2} - N )} .

Θ (x) = {10 x \geq 0 x < 0 .

Θ (x) = {10 x \geq 0 x < 0 .

Ω_{typ} ≃ exp (⟨ lo g Ω ⟩)

Ω_{typ} ≃ exp (⟨ lo g Ω ⟩)

S (α, A, B, σ, γ) := N \to \infty lim \frac{1}{N} ⟨ lo g Ω (\overset{a}{^}, b)⟩ .

S (α, A, B, σ, γ) := N \to \infty lim \frac{1}{N} ⟨ lo g Ω (\overset{a}{^}, b)⟩ .

⟨ lo g Ω ⟩ = n \to 0 lim \frac{⟨ Ω ^{n} ⟩ - 1}{n} .

⟨ lo g Ω ⟩ = n \to 0 lim \frac{⟨ Ω ^{n} ⟩ - 1}{n} .

Ω^{n}

Ω^{n}

\int_{- \infty}^{\infty} i, a \prod \frac{d y _{i}^{a}}{2 π e} a \prod δ (i \sum (y_{i}^{a})^{2} - N) μ, a \prod Θ (\frac{1}{N} i \sum a_{μ i} y_{i}^{a}) a \prod Θ (- \frac{1}{N} i \sum b_{i} y_{i}^{a}),

a \prod δ (i \sum (y_{i}^{a})^{2} - N) = \int a \prod \frac{d E ^{a}}{4 π} exp (\frac{i}{2} a \sum E^{a} (i \sum (y_{i}^{a})^{2} - N)),

a \prod δ (i \sum (y_{i}^{a})^{2} - N) = \int a \prod \frac{d E ^{a}}{4 π} exp (\frac{i}{2} a \sum E^{a} (i \sum (y_{i}^{a})^{2} - N)),

\displaystyle\prod_{a}\Theta\left(-\frac{1}{\sqrt{N}}\sum_{i}b_{i}y^{a}_{i}\right)=\int^{\infty}_{0}\prod_{a}d\eta^{a}\int\prod_{a}\frac{d\hat{\eta}^{a}}{2\pi/N}\text{exp}\left(iN\sum_{a}\hat{\eta}^{a}\Big{(}\eta^{a}+\frac{1}{\sqrt{N}}b_{i}\sum_{i}y^{a}_{i}\Big{)}\right),

\displaystyle\prod_{\mu,a}\Theta\left(\frac{1}{\sqrt{N}}\sum_{i}a_{\mu i}y^{a}_{i}\right)=\prod_{\mu}\int^{\infty}_{0}\prod_{a}d\vartheta^{a}_{\mu}\int\prod_{a}\frac{d\hat{\vartheta}^{a}_{\mu}}{2\pi}\text{exp}\left(i\sum_{a}\hat{\vartheta}^{a}_{\mu}\Big{(}\vartheta^{a}_{\mu}-\frac{1}{\sqrt{N}}\sum_{i}a_{\mu i}y^{a}_{i}\Big{)}\right),

i, μ \prod ⟨ exp (- \frac{i}{N} a \sum a_{μ i} \hat{ϑ}_{μ}^{a} y_{i}^{a}) ⟩

i, μ \prod ⟨ exp (- \frac{i}{N} a \sum a_{μ i} \hat{ϑ}_{μ}^{a} y_{i}^{a}) ⟩

i \prod ⟨ exp (i N b_{i} a \sum \overset{η}{^}^{a} y_{i}^{a}) ⟩

m^{a} = \frac{1}{N} i \sum y_{i}^{a} and q^{ab} = \frac{1}{N} i \sum y_{i}^{a} y_{i}^{b} for a < b .

m^{a} = \frac{1}{N} i \sum y_{i}^{a} and q^{ab} = \frac{1}{N} i \sum y_{i}^{a} y_{i}^{b} for a < b .

⟨ Ω^{n} ⟩ = \int a \prod

⟨ Ω^{n} ⟩ = \int a \prod

\displaystyle\quad\text{exp}\Big{(}N\Big{[}\frac{i}{\sqrt{N}}\sum_{a}m^{a}\hat{m}^{a}+i\sum_{a<b}q^{ab}\hat{q}^{ab}-\frac{i}{2}\sum_{a}E^{a}+i\sum_{a}\hat{\eta}^{a}\eta^{a}+iB\sum_{a}\hat{\eta}^{a}m^{a}

\displaystyle\quad-\frac{\gamma^{2}}{2}\sum_{a}(\hat{\eta}^{a})^{2}-\gamma^{2}\sum_{a<b}\hat{\eta}^{a}\hat{\eta}^{b}q^{ab}+\alpha G_{E}(m^{a},q^{ab})+G_{S}(E^{a},\hat{m}^{a},\hat{q}^{ab})\Big{]}\Big{)},

G_{E} (m^{a}, q^{ab}) =

G_{E} (m^{a}, q^{ab}) =

\displaystyle\times\quad\text{exp}\Big{(}i\sum_{a}\hat{\vartheta}^{a}\vartheta^{a}-iA\sum_{a}\hat{\vartheta}^{a}m^{a}-\frac{\sigma^{2}}{2}\sum_{a}(\hat{\vartheta}^{a})^{2}-\sigma^{2}\sum_{a<b}\hat{\vartheta}^{a}\hat{\vartheta}^{b}q^{ab}\Big{)},

G_{S}(E^{a},\hat{m}^{a},\hat{q}^{ab})=\log\int\prod_{a}\frac{dy^{a}}{\sqrt{2\pi e}}\,\text{exp}\Big{(}\frac{i}{2}\sum_{a}E^{a}(y^{a})^{2}-i\sum_{a}\hat{m}^{a}y^{a}-i\sum_{a<b}\hat{q}^{ab}y^{a}y^{b}\Big{)}.

G_{S}(E^{a},\hat{m}^{a},\hat{q}^{ab})=\log\int\prod_{a}\frac{dy^{a}}{\sqrt{2\pi e}}\,\text{exp}\Big{(}\frac{i}{2}\sum_{a}E^{a}(y^{a})^{2}-i\sum_{a}\hat{m}^{a}y^{a}-i\sum_{a<b}\hat{q}^{ab}y^{a}y^{b}\Big{)}.

m^{a} =

m^{a} =

q^{ab} =

\displaystyle G_{E}(m,q)=n\int Dt\,\log H\Big{(}\frac{\sqrt{q}\,t-\kappa}{\sqrt{1-q}}\Big{)}+O(n^{2}).

\displaystyle G_{E}(m,q)=n\int Dt\,\log H\Big{(}\frac{\sqrt{q}\,t-\kappa}{\sqrt{1-q}}\Big{)}+O(n^{2}).

H (x) := \int_{x}^{\infty} D t and κ := \frac{m A}{σ}

H (x) := \int_{x}^{\infty} D t and κ := \frac{m A}{σ}

\displaystyle G_{S}(E,\hat{m},\hat{q})=-\frac{n}{2}\Big{(}1+\log(E+\hat{q})\Big{)}+\frac{n}{2}\frac{\hat{m}^{2}+\hat{q}}{E+\hat{q}}+O(n^{2}).

\displaystyle G_{S}(E,\hat{m},\hat{q})=-\frac{n}{2}\Big{(}1+\log(E+\hat{q})\Big{)}+\frac{n}{2}\frac{\hat{m}^{2}+\hat{q}}{E+\hat{q}}+O(n^{2}).

ζ := (\frac{σ B}{γ A})^{2}

ζ := (\frac{σ B}{γ A})^{2}

\displaystyle S(\alpha,\zeta)=\mathrm{extr}_{q,\kappa}\Big{[}\frac{1}{2}\log(1-q)+\frac{q}{2(1-q)}-\frac{\zeta}{2}\frac{\kappa^{2}}{1-q}+\alpha\int Dt\,\log H\Big{(}\frac{\sqrt{q}\,t-\kappa}{\sqrt{1-q}}\Big{)}\Big{]}.

\displaystyle S(\alpha,\zeta)=\mathrm{extr}_{q,\kappa}\Big{[}\frac{1}{2}\log(1-q)+\frac{q}{2(1-q)}-\frac{\zeta}{2}\frac{\kappa^{2}}{1-q}+\alpha\int Dt\,\log H\Big{(}\frac{\sqrt{q}\,t-\kappa}{\sqrt{1-q}}\Big{)}\Big{]}.

\log H\Big{(}\frac{\sqrt{q}\,t-\kappa}{\sqrt{1-q}}\Big{)}\sim\begin{cases}-\frac{(t-\kappa)^{2}}{2(1-q)}&\text{if}\quad t>\kappa,\\ 0&\text{otherwise,}\end{cases}

\log H\Big{(}\frac{\sqrt{q}\,t-\kappa}{\sqrt{1-q}}\Big{)}\sim\begin{cases}-\frac{(t-\kappa)^{2}}{2(1-q)}&\text{if}\quad t>\kappa,\\ 0&\text{otherwise,}\end{cases}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

On non-negative solutions to large systems of random linear equations

Stefan Landmann

Institute of Physics, Carl von Ossietzky University of Oldenburg, D-26111 Oldenburg, Germany

[email protected]

Andreas Engel

Abstract

Systems of random linear equations may or may not have solutions with all components being non-negative. The question is, e.g., of relevance when the unknowns are concentrations or population sizes. In the present paper we show that if such systems are large the transition between these two possibilities occurs at a sharp value of the ratio between the number of unknowns and the number of equations. We analytically determine this threshold as a function of the statistical properties of the random parameters and show its agreement with numerical simulations. We also make contact with two special cases that have been studied before: the storage problem of a perceptron and the resource competition model of MacArthur.

keywords:

Disordered Systems , Replica Theory , Linear Algebra

1 Introduction

Large systems of linear equations show up in many areas of physics. In particular they occur when investigating the stability of stationary states in systems with many degrees of freedom. Quite often, e.g. when considering chemical networks [1, 2] or ecological models [3, 4] only non-negative solutions are admissible since concentrations have to be positive or zero.

The interactions in such large systems are as a rule rather complex and not known in detail. On the other hand, macroscopic properties are believed not to depend on all the microscopic specifics. In situations like these models using random parameters have been extremely successful [5]. A paradigmatic case is provided by spin-glass theory [6, 7], but also problems from computational complexity [8], information theory [9] and artificial neural networks [10] have been analyzed along these lines. In these approaches emphasis is on self-averaging properties of the systems under consideration. In the thermodynamic limit their probability distributions become sharp and their averages, which are analytically accessible, characterize each typical individual realization of the randomness.

In the present paper we investigate the existence of non-negative solutions to large sets of random linear equations consisting of $N$ equations for $\alpha N$ unknowns. We are interested in the limit $N\to\infty$ with $\alpha$ staying constant. As we will show there is a sharp transition at a critical threshold $\alpha_{c}$ from a phase in which typically no non-negative solutions exist to one where such solutions can be found. The value of $\alpha_{c}$ is self-averaging, i.e., it depends only on the parameters characterizing the distributions of the random variables involved and not on their particular realization. Using the replica-technique we determine $\alpha_{c}$ analytically and find very good agreement with results from numerical simulations.

The paper is organized as follows. In the next section we define the problem and map it to the determination of a random volume in $N$ -dimensional space. In section 3 we determine the typical value of this volume by a replica calculation. Section 4 discusses the transition and determines the critical line. In section 5 we consider some limiting cases and make contact with related results from the literature. In section 6 we speculate about the behavior of the system away from criticality and finally section 7 contains our conclusions.

2 Problem and notation

We consider an $\alpha N\times N$ random matrix $\hat{a}$ with entries $a_{\mu i},\,\mu=1\dots\alpha N,\,i=1\dots N,$ drawn independently from a Gaussian distribution with mean $A$ and variance $\sigma^{2}$ ,

[TABLE]

Moreover we choose a random vector $\mathbf{b}$ composed of independent Gaussian components $b_{i},\,i=1,\dots,N$ , with

[TABLE]

Here and in the following the brackets $\langle\dots\rangle$ denote the average over the $a_{\mu i}$ and $b_{i}$ .

We want to know whether the $N$ linear equations

[TABLE]

for the $\alpha N$ unknowns $x_{\mu}$ possess a solution $\mathbf{x}$ with all components being non-negative, $x_{\mu}\geq 0,\;\mu=1,..,\alpha N$ , which we write as $\mathbf{x\geq 0}$ .

The answer to this question clearly depends on the realization of the random numbers $a_{\mu i}$ and $b_{i}$ . In the limit $N\to\infty$ with $\alpha$ and $\gamma$ staying constant, however, there is a critical value $\alpha_{c}$ of $\alpha$ such that for $\alpha<\alpha_{c}$ there is no such solution for almost all realization of the random parameters whereas above this threshold typically one can be found. Our aim is to determine $\alpha_{c}$ as a function of $A,\sigma,B$ and $\gamma$ .

It is advantageous to map the problem onto an equivalent one. Let us consider the row vectors $\mathbf{a}_{\mu}$ of the matrix $\hat{a}$ . All their linear combinations

[TABLE]

with non-negative coefficients $c_{\mu}$ form what is called the non-negative cone of the vectors $\mathbf{a}_{\mu}$ . If $\mathbf{b}$ belongs to this cone, cf. Fig. 1 left, (3) has a non-negative solution $\mathbf{x}$ . If $\mathbf{b}$ lies outside the cone as shown in Fig. 1 right no such solution exists. In this case there must be a hyperplane separating $\mathbf{b}$ from the non-negative cone. Denoting the normal of this hyperplane by $\mathbf{y}$ we therefore either have

[TABLE]

or there exists a vector $\mathbf{y}$ with

[TABLE]

This duality is known as Farkas’ Lemma [11]. In what follows we will investigate the problem given by Eq. (6). To this end we determine the typical volume of solutions $\mathbf{y}$ to this problem. If it is positive, (6) has solutions and consequently (5) has none. If the volume is zero no separating hyperplane exists, $\mathbf{b}$ is part of the non-negative cone and (5) can be solved. The transition between both cases occurs when the volume shrinks to zero.

3 Typical volume of solutions to the dual problem

With each solution $\mathbf{y}$ to (6) also $\lambda\mathbf{y}$ with positive $\lambda$ is a solution. In order to eliminate this trivial degeneracy it is convenient to impose the spherical constraint

[TABLE]

on the solution vectors $\mathbf{y}$ . Our central quantity of interest is the fractional volume

[TABLE]

It comprises all normalized vectors $\mathbf{y}$ obeying the inequalities (6), cf. Fig. 2. In (8) $\Theta(x)$ denotes the Heaviside function

[TABLE]

The expression for the fractional volume is rather similar to that for the version space in neural network models [12] and can be analyzed along similar lines. In particular, since $\Omega$ involves the product of many independent random terms its logarithm is assumed to be self-averaging, i.e. in the large $N$ limit the typical value of $\Omega$ is given by

[TABLE]

rather than by $\langle\Omega\rangle$ . Therefore, in order to characterize the solution space of (6) we need to determine the averaged intensive entropy

[TABLE]

This may be done using the replica trick [5] which is based on the identity

[TABLE]

$\langle\Omega^{n}\rangle$ is first determined for $n\in\mathbb{N}$ and the result needs to be continued in a meaningful way to real $n$ in order to perform the limit $n\to 0$ . For $n\in\mathbb{N}$ we have

[TABLE]

where the replica index $a$ runs from 1 to $n$ and the denominator $\sqrt{2\pi e}$ accounts for the normalization (7). Using standard techniques [10] we replace the $\delta$ - and $\Theta$ -functions by their integral representations:

[TABLE]

and perform the averages over $a_{\mu i}$ and $b_{i}$

[TABLE]

The integrals over the $y_{i}$ , the auxiliary variables $\vartheta^{a}_{\mu},\,\eta^{a}$ and their conjugates $\hat{\vartheta}^{a}_{\mu},\,\hat{\eta}^{a}$ can be decoupled by introducing the order parameters

[TABLE]

The expression for the $n$ -th power of the fractional volume then acquires the form

[TABLE]

with the auxiliary functions

[TABLE]

and

[TABLE]

To determine the entropy (11) we only need the asymptotics of this expression for ${N\to\infty}$ so that the integrals in (17) may be calculated by the saddle-point method. The term $\frac{1}{\sqrt{N}}\sum_{a}m^{a}\hat{m}^{a}$ can be neglected in this limit and will be dropped.

The solution space of Eq. (6) is connected, we therefore assume a replica-symmetric saddle-point and use the ansätze

[TABLE]

Keeping in mind that for the final limit $n\to 0$ only terms up to order $n$ are needed the expressions for $G_{E}$ and $G_{S}$ simplify considerably. We find

[TABLE]

Here the Gaussian measure $Dt:=dt/\sqrt{2\pi}\,e^{-t^{2}/2}$ was introduced and the abbreviations

[TABLE]

were used. Similar manipulations yield for $G_{S}$

[TABLE]

Also the other terms in (17) can be simplified using (20) and (21). As a consequence the saddle-point equations for the order parameters $E,\hat{m},\hat{q},\eta$ become algebraic and these parameters can be eliminated. Introducing

[TABLE]

we finally get

[TABLE]

The saddle-point values of $q$ and $\kappa$ have to be determined numerically. Note that the final expression (26) for the averaged entropy depends on the parameters $A,\sigma,B$ and $\gamma$ characterizing the distributions of $a_{\mu i}$ and $b_{i}$ only through the combination $\zeta$ that gives the ratio between the relative variances of $a_{\mu i}$ and $b_{i}$ .

4 The transition point

At the transition point $\alpha_{c}$ the typical volume $\Omega_{\mathrm{typ}}$ becomes zero, i.e. the average entropy tends to minus infinity. It is possible to extract this point from the numerical extremalization in (26). However, the analysis can be simplified by observing that with $\Omega$ shrinking to zero the typical overlap $q$ between two different solutions $\mathbf{y}$ as defined in (16) tends to one. To leading order in $1/(1-q)$ we get

[TABLE]

implying

[TABLE]

Keeping only the most divergent terms near the transition we thus find

[TABLE]

The saddle-point equation with respect to $q$ gives

[TABLE]

whereas the one with respect to $\kappa$ results in

[TABLE]

Together with (30) this yields

[TABLE]

Eqs. (30) and (32) give a parametric description of the transition line $\alpha_{c}(\zeta)$ . It is shown in Fig. 3 together with results from numerical solutions of Eq. (3). There is good agreement between the analytical prediction and simulation results.

5 Special cases

The basic problems (5) and (6) respectively are rather general. Some special cases have been studied previously in different settings.

5.1 Relation to the storage problem of a perceptron

From the numerical analysis of (26) one finds that for large $\zeta$ the saddle-point value of $\kappa$ tends to zero such that also $\zeta\kappa^{2}\to 0$ . In this limit we hence find

[TABLE]

which coincides with Gardner’s famous expression describing the storage problem of a perceptron [12]. In this problem one is given $\alpha N$ random input patterns $\boldsymbol{\xi}_{\mu}$ and corresponding outputs $\lambda_{\mu}$ and has to find a synaptic vector $\mathbf{J}\in\mathbb{R}^{N}$ such that

[TABLE]

for all $\mu=1,\dots,\alpha N$ . For more details see [10].

There are two situations that make the equivalence between this storage problem for the perceptron and our dual problem (6) for $\zeta\to\infty$ particularly clear. Firstly, let us consider the case $\gamma=0$ such that $b_{i}=B$ for all $i$ . We then decompose $\hat{a}=\hat{A}+\delta\,\hat{a}$ where the entries of $\hat{A}$ are constant $A_{\mu i}=A$ while the entries of $\delta\hat{a}$ are i.i.d. random gaussian variables with zero mean and variance $\sigma^{2}$ . Then, (6) requires

[TABLE]

The condition $\sum_{i}y_{i}<0$ restricts all $\mathbf{y}$ to lie on the ’negative’ half of the $N$ -sphere. From high-dimensional geometry it is known that for $N\rightarrow\infty$ the volume spanned by the vectors $\mathbf{y}$ on the $N$ -sphere is tightly concentrated around the equator. Therefore, it may be assumed that $\sum_{i}y_{i}=0^{-}$ without reducing the fractional volume 111This is substantiated by the fact that the saddle-point value of $m$ as defined in (16) is zero in this case.. Effectively the conditions therefore become

[TABLE]

In this form they are equivalent to the storage problem for $\alpha N+1$ patterns. For $N\to\infty$ the influence of the additional pattern can be neglected. The critical value of $\alpha$ in the storage problem is known to be $\alpha_{c}=2$ [12]. Figure 4 shows results from a finite size analysis of $\alpha_{c}$ for the system (3) with $\gamma=0$ . For $N\rightarrow\infty$ we obtain $\alpha_{c}=2.001\pm 0.004$ corroborating the equivalence between the two problems in the large $N$ limit.

Secondly, our analysis reduces to the storage problem of the perceptron if $A=0$ . In this case Eq.(6) acquires the form

[TABLE]

Clearly, the first condition is equivalent to the storage problem for $\alpha N$ patterns. To show that the second one does not reduce the fractional volume $\Omega$ for large $N$ we split it into $B\sum_{i}y_{i}<0$ and $\delta\mathbf{b}\cdot\mathbf{y}<0$ and require these two constraints separately. Even this more severe restriction is only equivalent to two additional patterns, $\alpha N\to\alpha N+2$ , which in the limit $N\to\infty$ is again irrelevant. Figure 4 shows that also in this case the finite size analysis of $\alpha_{c}$ is in perfect agreement with the result from the storage problem of the perceptron.

5.2 MacArthur model of resource competition

A classical model to describe biodiversity in ecological systems considers a number $\alpha N$ of species that compete for a number $N$ of different resources [13]. The species differ in their ability to live on the various resources which is described by a matrix of metabolic strategies $\hat{a}$ . The elements $a_{\mu i}$ of $\hat{a}$ characterize the ease with which species $\mu$ may consume resource $i$ . Resources are provided by certain fixed influxes $b_{i}$ from the environment. For sufficiently large $\alpha$ there is a stationary state of the system with species concentrations $x_{\mu}$ in which the overall consumption of each resource $i$ is exactly balanced by its influx:

[TABLE]

Since moreover all $x_{\mu}$ must be either positive (if the species survives) or zero (if the species got extinct) the vector $\mathbf{x}$ has to fulfill Eq. (5).

The MacArthur model has been studied extensively for systems with only a few species and correspondingly few resources. On the other hand, realistic biological systems may involve hundreds of species and resources. In a very interesting recent paper [14] a random version of MacArthur’s model was analyzed in the limit $N\to\infty$ . To this end the elements of the matrix of metabolic strategies were chosen independently of each other as either one with probability $p$ or zero with probability $1-p$ . Large values of $p$ hence describe communities of universalists that can live from many different resources, small values of $p$ stand for systems consisting mostly of specialists. The resource influx was modelled as a Gaussian random vector with elements of average one and variance $\Delta^{2}/N$ . Interestingly, in this variant of the MacArthur model a transition was found from a so-called vulnerable phase at low potential biodiversity (small $\alpha$ ) to a so-called shielded phase at large $\alpha$ . In the vulnerable phase the species are unable to use all available resources properly, Eq. (38) cannot be fulfilled, less than $N$ species survive, and variations in the resource influxes have a direct impact on each species. Contrarily, in the shielded phase the species form a collective field that guards them from changes in the external conditions, the maximal possible number $N$ of species survives [15], and Eq. (38) is satisfied. This phase is a striking example for the emergence of cooperation in a purely competitive system [16]. The transition found in [14] is equivalent to the one discussed in the present paper for $A=p$ , $\sigma^{2}=p(1-p)$ , $B=1$ and $\gamma=\Delta$ . For more details see [17].

6 Making contact away from criticality

The Farkas Lemma connects the two dual problems (5) and (6): whenever there is a solution to one there is none to the other. Therefore the threshold values $\alpha_{c}$ for both problems coincide. However, from the solution space analysis described in section 3 more details may be extracted. For $\alpha<\alpha_{c}$ , e.g., there are many different solutions to problem (6). The order parameter $q$ which can be determined from the numerical extremalization in (26) is then strictly smaller than one and provides a measure of the variability between different solutions to (6). At the same time the dual problem (5) is unsolvable as demonstrated by a non-zero residual norm

[TABLE]

This residual norm in some sense quantifies how far away problem (5) is from its threshold of solvability. It is tempting to speculate about a connection between $q$ and $r$ . If $q$ is rather small there are many different solutions to (6). Accordingly, problem (5) should be ’far from being solvable’ corresponding to a large value of $r$ . For $q$ near to one, on the other hand, there is not much freedom left for different solutions of (5) and small values of $r$ are likely. We have tested this conjecture with the help of numerical simulations. The results shown in Fig. 5 indeed suggest a monotonous relation between $q$ and $r$ .

Equivalent considerations apply to the region $\alpha>\alpha_{c}$ . Here (6) is unsolvable and its solution space is empty. By a slight generalization of the techniques used in section 3 it is possible to determine the minimal number of violated inequalities in (6) [18]. It is an open question whether this number may be related to the variability between different solutions of (5) for $\alpha>\alpha_{c}$ .

7 Conclusion

In the present paper we investigated under which conditions a large set of $\alpha N$ random linear equations for $N$ unknowns typically possesses a non-negative solution. This is a rather general question with relevance for a number of different problems in statistical mechanics. It is also an active area of research in mathematics on its own [19, 20].

With the help of Farkas’ lemma we mapped the problem onto the determination of the typical size of a random volume in high-dimensional space which could be characterized analytically using methods from the statistical physics of disordered systems. Our main result is a sharp transition in the limit $N\to\infty$ that separates a phase where the linear system typically has no non-negative solution to one in which such a solution can be found with probability one. The analytical result for the transition line agrees very well with the outcome of numerical simulations. Special cases of the transition have been discussed previously in the literature in connection with the storage capacity of a large perceptron and with a random variant of MacArthurs resource competition model.

In order to keep the calculations simple we assumed Gaussian distributions for the random parameters. For the elements of the coefficient matrix this assumption is not crucial. As in related mean-field models any distribution of these elements with finite moments would give similar results depending only on their first two cumulants. This is, e.g., exemplified by the application to MacArthurs model. The statistical properties of the inhomogeneity vector are more subtle and similar generalizations to different distributions deserve further investigation.

The dual problem related to the typical size of a random volume in $N$ -space can also be studied for values of $\alpha$ different from the threshold $\alpha_{c}$ . It were rather interesting if this knowledge could be used to quantitatively characterize the original problem away from criticality, i.e. deeply in the solvable or unsolvable phase. We presented preliminary numerical evidence that this may be possible but more work is needed to substantiate this connection.

Acknowledgement: Financial support from the German Science Foundation under project EN 278/10-1 is gratefully acknowledged. A.E. is deeply grateful for numerous inspiring discussions and genial conversations with Chris van den Broeck he enjoyed over many years.

Bibliography20

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] M. Polettini, M. Esposito, Irreversible thermodynamics of open chemical networks. I. Emergent cycles and broken conservation laws, The Journal of chemical physics 141 (2014) 024117.
2[2] J. Schnakenberg, Simple chemical reaction systems with limit cycle behaviour, Journal of Theoretical Biology 81 (3) (1979) 389–400.
3[3] R. M. May, Will a large complex system be stable?, Nature 238 (1972) 413.
4[4] M. Eigen, P. Schuster, The hypercycle, Naturwissenschaften 65 (1) (1978) 7–41.
5[5] M. Mézard, G. Parisi, M. Virasoro, Spin glass theory and beyond, World Scientific Publishing Company, 1987.
6[6] S. F. Edwards, P. W. Anderson, Theory of spin glasses, Journal of Physics F: Metal Physics 5 (5) (1975) 965.
7[7] D. Sherrington, S. Kirkpatrick, Solvable model of a spin-glass, Physical Review Letters 35 (26) (1975) 1792.
8[8] R. Monasson, R. Zecchina, S. Kirkpatrick, B. Selman, L. Troyansky, Determining computational complexity from characteristic ‘phase transitions’, Nature 400 (1999) 133–137.