Loop-erased partitioning of a graph: mean-field analysis

Luca Avena; Alexandre Gaudilliere; Paolo Milanesi; Matteo Quattropani

arXiv:1906.03858·math.PR·July 15, 2020


TL;DR

This paper analyzes a graph partitioning method based on loop-erased random walks, revealing phase transitions in community detection and providing insights into the macroscopic structure of complex networks.

Contribution

It introduces a novel loop-erased random walk-based partitioning approach and characterizes its phase transition behavior in community detection.

Findings

01. Derived an interaction potential for vertex pairs based on non-membership probability.

02. Computed the potential and its scaling limits on complete and non-homogeneous graphs.

03. Identified a phase transition in community detectability depending on parameters.

Abstract

We consider a random partition of the vertex set of an arbitrary graph that can be sampled using loop-erased random walks stopped at a random independent exponential time of parameter $q > 0$, which we see as a tuning parameter. The related random blocks tend to cluster nodes visited by the random walk on time scale $1/q$. We explore the emerging macroscopic structure by analyzing 2-point correlations. To this aim, we define an interaction potential between pairs of vertices as the probability that they do not belong to the same block of the random partition. This interaction potential can be seen as an affinity measure for “densely connected nodes” and captures well-separated regions in network models presenting non-homogeneous landscapes. In this spirit, we compute this potential and its scaling limits on a complete graph and on a non-homogeneous weighted version with community…

Equations

$$\mathcal{L}=\mathcal{A}-\mathcal{D},$$

$$\mu_{q}(\Pi_{m})=\frac{q^{m}\times\sum_{F:\Pi(F)=\Pi_{m}}w(F)}{Z(q)},\qquad\Pi_{m}\in\mathcal{P}(V),$$

$$Z(q):=\sum_{F\in\mathcal{F}}q^{|F|}w(F)=\det[qI-\mathcal{L}],$$

$$\mu_{q}(\Pi_{m})=\frac{q^{m}\times\prod_{i=1}^{m}n_{i}^{n_{i}-1}}{q(q+N)^{N-1}},$$

$$\{B_{q}(x)\neq B_{q}(y)\}:=\{x\text{ and }y\text{ are in different blocks of }\Pi_{q}\},$$

$$U_{q}(x,y):=\mathbb{P}\left(B_{q}(x)\neq B_{q}(y)\right)=\sum_{\gamma}\mathbb{P}_{x}^{LE_{q}}(\Gamma=\gamma)\,\mathbb{P}_{y}(\tau_{\gamma}>\tau_{q})$$

$$\overline{U}_{q}(x,y):=\mathbb{E}\left[\frac{\mathbf{1}_{\{B_{q}(x)\neq B_{q}(y)\}}}{\mu(B_{q}(x))\,\mu(B_{q}(y))}\right],$$

$$U_{q}^{(N)}(x,y)=U_{q}^{(N)}=\sum_{h=1}^{N-1}\frac{q}{q+Nw}\left(\frac{Nw}{q+Nw}\right)^{h-1}\prod_{k=2}^{h}\left(1-\frac{k}{N}\right),$$

$$U_{q}:=\lim_{N\to\infty}U_{q}^{(N)}=\sqrt{2\pi}\,z\,e^{\frac{z^{2}}{2}}\,\mathbb{P}(Z>z),$$

$$\xi_{q,N}:=\frac{q}{q+N(w_{1}+w_{2})}$$

$$\tilde{P}=\begin{pmatrix}p&1-p\\1-p&p\end{pmatrix},\qquad p=\frac{w_{1}}{w_{1}+w_{2}}.$$

$$U_{q}^{(N)}(x,y)=U_{q}^{(N)}(\star):=\sum_{n\geq 1}\mathbb{P}(T_{q}=n)\sum_{k=1}^{n}\tilde{\mathbb{P}}_{\underline{1}}(\ell(n)=k)\,N^{-n+1}\hat{f}(n,k)\,\theta(n,k)\,P_{\star}^{\dagger}(n,k)$$

$$\hat{f}(n,k)=(N-2)_{k-1}(N-1)_{n-k},\qquad\theta(n,k)=\frac{(q-\lambda_{1}(n,k))(q-\lambda_{2}(n,k))}{q(q+2Nw_{2})}$$

$$\lambda_{i}(n,k)=-\frac{1}{2}\left[w_{1}n+w_{2}N+(-1)^{i}\sqrt{w_{1}^{2}(2k-n)^{2}+4(N-k)(N-k)w_{2}^{2}}\right],$$

$$P_{\star}^{\dagger}(n,k)=\frac{q\left(q+k_{\star}(w_{1}-w_{2})+w_{2}N\right)}{[q+kw_{1}][q+(n-k)w_{1}]+Nw_{2}(2q+nw_{1})+w_{2}^{2}[Nn-k(n-k)]}\times\eta_{\star}$$

$$k_{\star}:=\begin{cases}k,&\text{if }\star=out,\\n-k,&\text{if }\star=in,\end{cases}\qquad\eta_{\star}=\begin{cases}(N-1)(N-n+k-1),&\text{if }\star=out,\\N(N-k-1),&\text{if }\star=in.\end{cases}$$

$$\mathbb{P}\left(|\Pi_{q}|=cN^{\alpha\wedge 1}(1\pm o(1))\right)=1-o(1).$$

$$\overline{U}_{q}(x,y)=N^{2}\left[K_{q}(x,x)K_{q}(y,y)-K_{q}(x,y)K_{q}(y,x)\right],$$

$$K_{q}(x,y):=q(q-\mathcal{L})^{-1}(x,y)=\mathbb{P}_{x}(X(\tau_{q})=y)$$

$$\overline{U}_{q}(\star)\sim\begin{cases}4q^{2}+8q&\text{if }\star=in,\\4q^{2}+8q+4&\text{if }\star=out.\end{cases}$$

$$\overline{U}_{q}(in)\sim\overline{U}_{q}(out)\sim\begin{cases}4q(q+1)&\text{if }\alpha\leq 0\text{ and }\beta<1-\alpha,\\N^{\max\{2,2\alpha\}}&\text{if }\alpha>0.\end{cases}$$

$$\mathcal{L}=\mathcal{A}-\mathcal{D},\quad\text{with }\mathcal{A}=w(\mathbf{1}\mathbf{1}^{\prime}-I)\text{ and with }\mathcal{D}=(N-1)wI.$$

$$P=I-L=\frac{1}{N}\mathbf{1}\mathbf{1}^{\prime}$$

$$\mathcal{A}=\begin{pmatrix}w(\mathbf{1}\mathbf{1}^{\prime}-I)&q\mathbf{1}\\\mathbf{0}^{\prime}&0\end{pmatrix},$$

$$\mathcal{L}=\mathcal{A}-\mathcal{D},\qquad\mathcal{D}=\begin{pmatrix}[(N-1)w+q]I&\mathbf{0}\\\mathbf{0}^{\prime}&0\end{pmatrix}.$$

$$L=\frac{1}{Nw+q}\mathcal{L}=\begin{pmatrix}\frac{w}{Nw+q}\mathbf{1}\mathbf{1}^{\prime}-I&\frac{q}{Nw+q}\mathbf{1}\\\mathbf{0}^{\prime}&0\end{pmatrix}$$

$$P=I+L=\begin{pmatrix}\frac{w}{Nw+q}\mathbf{1}\mathbf{1}^{\prime}&\frac{q}{Nw+q}\mathbf{1}\\\mathbf{0}^{\prime}&1\end{pmatrix}=\begin{pmatrix}(1-r)\frac{1}{N}\mathbf{1}\mathbf{1}^{\prime}&r\mathbf{1}\\\mathbf{0}^{\prime}&1\end{pmatrix},$$

$$r:=\frac{q}{Nw+q}.$$

$$\mathbb{P}\left(|H_{n+1}|=h\,\middle|\,|H_{n}|\geq h,\ T_{q}>n+1\right)=\frac{1}{N}.$$

P (Z (0) = \cdot) =


Taxonomy

Topics: Theoretical and Computational Physics · Complex Network Analysis Techniques · Stochastic processes and statistical mechanics

Full text

Loop-erased partitioning of a graph: mean-field analysis

Luca Avena‡, Alexandre Gaudillière⋆, Paolo Milanesi§ and Matteo Quattropani∗

‡ Leiden University, Mathematical Institute, Niels Bohrweg 1, 2333 CA Leiden, The Netherlands.

⋆ Aix-Marseille Université, CNRS, Centrale Marseille, I2M UMR CNRS 7373, 39 rue Joliot Curie, 13453 Marseille Cedex 13, France.

§ Aix-Marseille Université, CNRS, Centrale Marseille, I2M UMR CNRS 7373, 39 rue Joliot Curie, 13453 Marseille Cedex 13, France.

∗ Dipartimento di Matematica e Fisica, Università di Roma Tre, Largo S. Leonardo Murialdo 1, 00146 Roma, Italy.

Abstract.

We consider a random partition of the vertex set of an arbitrary graph that can be sampled using loop-erased random walks stopped at a random independent exponential time of parameter $q>0$, which we see as a tuning parameter. The related random blocks tend to cluster nodes visited by the random walk on time scale $1/q$. We explore the emerging macroscopic structure by analyzing 2-point correlations. To this aim, we define an interaction potential between pairs of vertices as the probability that they do not belong to the same block of the random partition. This interaction potential can be seen as an affinity measure for “densely connected nodes” and captures well-separated regions in network models presenting non-homogeneous landscapes. In this spirit, we compute this potential and its scaling limits on a complete graph and on a non-homogeneous weighted version with community structures. For the latter geometry we show a phase transition for “community detectability” as a function of the tuning parameter and the edge weights.

Key words and phrases:

Discrete Laplacian, random partitions, loop-erased random walk, Wilson’s algorithm, spanning rooted forests

2010 Mathematics Subject Classification:

05C81, 05C85, 60J10, 60J27, 60J28

1. Intro: Loop-erasure and random partitioning

Consider an arbitrary simple undirected weighted connected graph $G=(V,E,w)$ on $N=|V|$ vertices where $E=\{e=(x,y):x,y\in V\}$ stands for the edge set and $w:E\rightarrow[0,\infty)$ is a given edge-weight function. We call the Random Walk (RW) associated to $G$ the continuous-time Markov chain $X=(X_{t})_{t\geq 0}$ with state space $V$ and the discrete Laplacian as infinitesimal generator, i.e., the $N\times N$ matrix:

$$\mathcal{L}=\mathcal{A}-\mathcal{D},$$

where for any $x,y\in[N]:=\{1,2,\dots,N\}$, $\mathcal{A}(x,y)=w(x,y)\mathbf{1}_{\{x\neq y\}}$ is the weighted adjacency matrix and $\mathcal{D}(x,y)=\mathbf{1}_{\{x=y\}}\sum_{z\in[N]\setminus\{x\}}w(x,z)$ is the diagonal matrix guaranteeing that the entries of each row in $\mathcal{L}$ sum to $0$.

The goal of this paper is to explore the following probability measure on the set of partitions $\mathcal{P}(V)$ of the vertex set $V$ .

Definition 1 (Loop-erased partitioning).

Given $G=(V,E,w)$ , fix a positive parameter $q>0$ . We call loop-erased a partition of $V$ in $m\leq N$ blocks sampled according to the following probability measure:

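$$\mu_{q}(\Pi_{m})=\frac{q^{m}\times\sum_{F:\Pi(F)=\Pi_{m}}w(F)}{Z(q)},\qquad\Pi_{m}\in\mathcal{P}(V),\tag{1.2}$$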

where the sum is over spanning rooted forests $F$ ’s of $G$ , $\Pi(F)$ stands for the partition of $V$ induced by a forest $F$ , $w(F):=\prod_{e\in F}w(e)$ for the forest weight, and $Z(q)$ is a normalizing constant. We denote by $\Pi_{q}$ a random variable in $\mathcal{P}(V)$ with law $\mu_{q}$ .

In the above definition a spanning rooted forest of a graph is a collection of rooted trees spanning its vertex set. Denoting by $\mathcal{F}$ the set of spanning rooted forests of $G$ , we note that—due to the matrix tree theorem—the normalizing constant in Eq. 1.2 can be expressed as the characteristic polynomial of the matrix $\mathcal{L}$ evaluated at $q$ , i.e.

$$Z(q):=\sum_{F\in\mathcal{F}}q^{|F|}w(F)=\det[qI-\mathcal{L}],$$

where $|F|$ denotes the number of trees in $F\in\mathcal{F}$ . Furthermore, the number of blocks in $\Pi_{q}$ , denoted by $|\Pi_{q}|$ , is distributed as the sum of $N$ independent Bernoulli random variables with success probabilities $\frac{q}{q+\lambda_{i}}$ , for $i\leq N$ , with $\lambda_{i}$ ’s being the eigenvalues of $-\mathcal{L}$ . We refer the reader to [5, Prop. 2.1] for a proof of these statements.
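The Bernoulli representation of $|\Pi_{q}|$ can be turned into a short computation. The following is a minimal sketch, not code from the paper: the function names are hypothetical, and the spectrum used for the complete graph (eigenvalue $0$ once and $Nw$ with multiplicity $N-1$ for $-\mathcal{L}$) is the standard one for a complete graph with constant weight $w$.

```python
def block_count_pmf(eigenvalues, q):
    """Law of |Pi_q| as a sum of independent Bernoulli(q / (q + lam)) variables.

    eigenvalues: eigenvalues of -L (non-negative numbers); q: positive parameter.
    Returns a list pmf with pmf[m] = P(|Pi_q| = m).
    """
    pmf = [1.0]
    for lam in eigenvalues:
        p = q / (q + lam)
        new = [0.0] * (len(pmf) + 1)
        for m, mass in enumerate(pmf):
            new[m] += mass * (1.0 - p)   # this Bernoulli variable fails
            new[m + 1] += mass * p       # this Bernoulli variable succeeds
        pmf = new
    return pmf


def expected_blocks(eigenvalues, q):
    """Mean number of blocks, by linearity of expectation."""
    return sum(q / (q + lam) for lam in eigenvalues)


def complete_graph_spectrum(N, w=1.0):
    """Eigenvalues of -L on the complete graph with constant weight w."""
    return [0.0] + [N * w] * (N - 1)


# Example: complete graph on 10 vertices, unit weights, q = 2.
print(expected_blocks(complete_graph_spectrum(10), q=2.0))
```

The eigenvalue $0$ contributes a Bernoulli variable with success probability one, which reflects the fact that a partition always has at least one block.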

1.1. Tuning parameter and underlying geometry.

The first factor $q^{m}$ in Eq. 1.2 favors partitions having many small blocks as $q$ grows, while as $q$ vanishes the measure degenerates into a one-block partition. The second, combinatorial factor takes into account the underlying geometry; for example, in the unweighted case (i.e. constant edge weights $w\equiv 1$) it counts how many rooted forests are compatible with a given partition. In the simple setup of an unweighted complete graph on $N$ vertices, the measure in Definition 1 reduces to

$$\mu_{q}(\Pi_{m})=\frac{q^{m}\times\prod_{i=1}^{m}n_{i}^{n_{i}-1}}{q(q+N)^{N-1}},\tag{1.3}$$

for a partition $\Pi_{m}=\{B_{1},\ldots,B_{m}\}\in\mathcal{P}(V)$ constituted of $m$ blocks with sizes $|B_{i}|=:n_{i}$ , $i\leq m$ such that $\sum_{i\leq m}n_{i}=N$ . In particular, we see in this setup that this second factor favors partitions with a few “fat” blocks. Notice that Eq. 1.3 holds true because, by Cayley’s formula, $n_{i}^{n_{i}-2}$ unrooted trees can cover block $B_{i}$ , and since we are dealing with rooted trees, an extra volume factor $n_{i}$ for the possible roots is needed. In general, the competition between these two factors depends on the delicate interplay among the tuning parameter $q$ , the underlying geometry and the weight function $w$ .
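As a sanity check of the normalization, here is a small worked example for $N=3$. It assumes Eq. 1.3 in the form $\mu_{q}(\Pi_{m})=q^{m}\prod_{i}n_{i}^{n_{i}-1}/\big(q(q+N)^{N-1}\big)$ and simply sums the numerators over the five partitions of a three-element vertex set.

```latex
\begin{align*}
\text{one block } \{1,2,3\}: &\quad q\cdot 3^{2} = 9q,\\
\text{three partitions with block sizes } (2,1): &\quad 3\cdot q^{2}\cdot 2^{1}\cdot 1^{0} = 6q^{2},\\
\text{three singletons}: &\quad q^{3}\cdot 1^{0}\cdot 1^{0}\cdot 1^{0} = q^{3},\\
\text{total}: &\quad q^{3}+6q^{2}+9q = q(q+3)^{2}.
\end{align*}
```

The total coincides with the denominator $q(q+N)^{N-1}$ at $N=3$, so the five weights sum to one, as they should.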

1.2. Sampling algorithm and Loop-Erased RW (LERW)

An attractive feature of this measure is that there exists a simple exact sampling algorithm, originally due to Wilson [22] and based on the associated LERW killed at random times. The LERW with killing is the process obtained by running the RW $X$, erasing cycles as soon as they appear, and stopping the evolving self-avoiding trajectory at an independent random time $\tau_{q}$ with exponential law of parameter $q$.

The algorithm can be described as follows:

(1) Pick an arbitrary vertex in $V$ and run a LERW up to an independent time $\tau_{q}\overset{d}{\sim}\exp(q)$. Call $\gamma_{1}$ the obtained self-avoiding trajectory.

(2) Pick an arbitrary vertex in $V$ that does not belong to $\gamma_{1}$. Run a LERW until $\min\{\tau_{q},\tau_{\gamma_{1}}\}$, $\tau_{\gamma_{1}}$ being the first time the RW hits a vertex in $\gamma_{1}$. Call $\gamma_{2}$ the union of $\gamma_{1}$ and the new self-avoiding trajectory obtained in this step.

(3) Iterate step (2) with $\gamma_{\ell+1}$ in place of $\gamma_{\ell}$ until exhaustion of the vertex set $V$.

In step (2) we note that if the killing occurs before $\tau_{\gamma_{1}}$ , then $\gamma_{2}$ is a rooted forest in $G$ , else $\gamma_{2}$ is a rooted tree.

When the above algorithm stops, it produces a spanning rooted forest $F\in\mathcal{F}$ , where the roots are the points where the involved LERWs were killed along the algorithm steps. The resulting forest $F$ on $G$ induces the partition $\Pi(F)$ of the vertex set $V$ , where each block is identified by vertices belonging to the same tree. It can be shown that the probability to obtain a given rooted spanning forest $F$ is proportional to $q$ to the power of the number of trees, times the forest weight $w(F)$ . It then follows that the induced partition is distributed as $\Pi_{q}$ in Definition 1. We refer the reader to [5] for the proof of the latter and for more detailed aspects of this algorithm, including dynamical variants. In the sequel we will denote by $\mathbb{P}$ a probability measure on an abstract probability space sufficiently rich for the randomness required by this algorithm.
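The sampling procedure of this section can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. It simulates the embedded jump chain rather than the continuous-time walk: at a vertex with total incident weight $d$, the walk is killed before its next jump with probability $q/(q+d)$, by memorylessness of $\tau_{q}$. Loop-erasure is done implicitly by overwriting a successor pointer at each visit. The function name, the adjacency-dictionary input format and the `rng` argument are hypothetical choices made for the example.

```python
import random


def sample_loop_erased_partition(weights, q, rng=None):
    """Sample a partition of the vertex set via Wilson's algorithm with killing.

    weights: dict mapping each vertex to a dict {neighbor: positive weight}
             (hypothetical input format; assumed symmetric).
    q:       positive killing rate.
    Returns a list of blocks (sets of vertices), one per tree of the forest.
    """
    rng = rng or random.Random()
    parent = {}  # vertex -> successor towards its root; None marks a root

    for start in weights:
        if start in parent:
            continue
        # Walk phase: run the killed walk from `start` until it is killed or
        # hits the forest built so far. Overwriting successor[u] at every
        # visit of u erases loops implicitly.
        successor = {}
        u = start
        while u not in parent:
            neighbors = list(weights[u])
            rates = [weights[u][v] for v in neighbors]
            total = sum(rates)
            if rng.random() < q / (q + total):
                successor[u] = None  # killed at u: u becomes a root
                break
            u_next = rng.choices(neighbors, weights=rates)[0]
            successor[u] = u_next
            u = u_next
        # Retrace phase: attach the loop-erased path to the forest.
        u = start
        while u is not None and u not in parent:
            parent[u] = successor[u]
            u = successor[u]

    # Group vertices by the root of their tree.
    blocks = {}
    for v in parent:
        root = v
        while parent[root] is not None:
            root = parent[root]
        blocks.setdefault(root, set()).add(v)
    return list(blocks.values())


# Example: unweighted complete graph on 6 vertices, q = 2.
N = 6
complete = {x: {y: 1.0 for y in range(N) if y != x} for x in range(N)}
print(sample_loop_erased_partition(complete, q=2.0, rng=random.Random(0)))
```

Averaging the indicator that two fixed vertices land in different blocks over many such samples gives a Monte Carlo estimate of the two-point quantities studied below. Very small values of $q$ make the first walk long, since it only stops when killed.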

1.3. Partition detecting “metastable landscapes”.

Wilson’s sampling algorithm described above shows that the resulting partition tends to cluster in the same block (tree) points that can be visited by the RW with high probability up to time $\tau_{q}$. In this sense the loop-erased partitioning tends to capture metastable-like regions (blocks), namely, regions of points from which it is difficult for the RW to escape on time scale $1/q$. This makes the probability $\mu_{q}$ an interesting measure for randomized clustering procedures; see in this direction [2] and [3, Sec. 5]. Yet, a priori it is not clear how strong and stable this feature of capturing “metastable landscapes” is, since it heavily depends on the underlying geometry (weighted adjacency matrix) and the choice of the killing parameter $q$. The aim of this paper is to start making this heuristic precise by analyzing 2-point correlations associated to $\mu_{q}$ on the simplest dense informative geometries.

1.4. Two-point correlations

For a pair of distinct vertices $x,y\in V$ , consider the event that these vertices belong to different blocks in $\Pi_{q}$ . That is, the event

$$\{B_{q}(x)\neq B_{q}(y)\}:=\{x\text{ and }y\text{ are in different blocks of }\Pi_{q}\},$$

where $B_{q}(z)$ stands for the block in $\Pi_{q}$ containing $z\in V$. The probability of this event induces a 2-point correlation function which turns out to be analyzable by means of LERW explorations, and it encodes relevant information on what the resulting partition looks like on the underlying graph as a function of the parameters. Here is the formal definition together with an operative characterization.

Definition 2 (Pairwise LEP-interaction potential).

For given $q>0$ and $G$ , and any pair $x,y\in V$ , we call pairwise LEP-interaction potential the following probability:

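$$U_{q}(x,y):=\mathbb{P}\left(B_{q}(x)\neq B_{q}(y)\right)=\sum_{\gamma}\mathbb{P}_{x}^{LE_{q}}(\Gamma=\gamma)\,\mathbb{P}_{y}(\tau_{\gamma}>\tau_{q})\tag{1.4}$$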

where $\mathbb{P}_{x}^{LE_{q}}$ and $\mathbb{P}_{x}$ stand for the laws of the LERW killed at rate $q$ and of the RW, respectively, starting from $x\in V$ , and the above sum runs over all possible self-avoiding paths $\gamma$ ’s starting at $x$ .

The representation in Eq. 1.4 is a consequence of Wilson’s sampling procedure described in Section 1.2 and it holds true since, remarkably, in steps (1) and (2) of the algorithm the starting points can be chosen arbitrarily.

Furthermore, we notice that, as for any generic random partition of $V$, such an interaction potential defines a distance on the vertex set. This specific metric $U_{q}(x,y)$ can be interpreted as an affinity measure capturing how densely connected vertices $x$ and $y$ are in the graph $G$, which provides a further motivation to analyze it.

Still, the observable captured by $U_{q}(x,y)$ is not the only one inducing a natural notion of 2-point correlations associated to $\Pi_{q}$ . For example, if we express the LEP-potential in Definition 2 as an expectation, i.e. $U_{q}(x,y)=\mathbb{E}\left[\mathbf{1}_{\{B_{q}(x)\neq B_{q}(y)\}}\right]$ , we may think of normalizing it with the masses of the related blocks and obtain another natural 2-point correlation function. This is captured in the following definition.

Definition 3 (Pairwise RW-interaction potential).

For given $q>0$ and $G$ , and any pair $x,y\in V$ , we call pairwise RW-interaction potential the following correlation function:

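$$\overline{U}_{q}(x,y):=\mathbb{E}\left[\frac{\mathbf{1}_{\{B_{q}(x)\neq B_{q}(y)\}}}{\mu(B_{q}(x))\,\mu(B_{q}(y))}\right],$$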

where $\mu(\cdot)$ is the uniform measure on $V$ .

As we will see, the functional $\overline{U}_{q}$ is actually much simpler to analyze, but it captures less insightful information on the underlying graph structure. Further, unlike $U_{q}$, it is neither a probability nor a metric, and it does not allow one to derive a description of the macroscopic structure of $\Pi_{q}$. In a sense, the latter is not surprising: in fact (see Lemma 1) this alternative correlation function can be expressed in terms of the RW Green’s kernel alone, without any need to introduce the LEP $\Pi_{q}$. Note in particular that the uniform measure $\mu$ in Definition 3 corresponds to the invariant measure of the RW $X$.

1.5. Related literature

Several properties of the forest measure associated to the loop-erased partitioning have been derived in the recent [5, 6]. Based on these results, in [3, Prop. 6] and [4, Sect. 5.2], the authors proposed an approach making use of the loop-erased partitioning and so-called intertwining dualities to describe the evolution of local equilibria of a finite state space Markov chain.

As mentioned before, this sampling method based on LERW is originally due to Wilson [22] and shows that the measure considered herein is intimately related to the well-known Uniform Spanning Tree (UST) measure. Actually the measure on spanning rooted forests mentioned in Section 1.2 can be seen as a generalized version of the UST measure which is recovered by taking $q\downarrow 0$ when $w\equiv 1$ . Therefore the results presented in this manuscript are along the line of the flourishing literature on statistical properties of the UST and LERW, see e.g. [1, 7, 8, 9, 11, 18, 12, 14, 15, 19, 20, 21].

A detailed exact and asymptotic analysis of observables related to Wilson’s algorithm on a complete graph has been pursued in [16]. The derivation of our results is in this spirit, although we deal with the additional randomness given by the presence of the killing parameter, which in turn makes the combinatorics more involved.

We further mention that in dense geometries, the UST has been studied from the perspective of the continuous random tree topology on the complete graph [1], with respect to local weak convergence, still on the complete graph [9], and more recently on growing expanders admitting a limiting graphon [10]. These other interesting lines of investigation could also be naturally considered for the forest measure in Section 1.2, but we will not pursue these approaches in this work.

1.6. Paper overview

Our main theorems are presented in Section 2 and identify the LEP-potential in Definition 2 and its asymptotics on a complete graph (Theorem 1) and on a non-homogeneous complete graph with two communities (Theorems 2 and 3). Some consequences for the macroscopic emergent partition $\Pi_{q}$ on these mean-field models are derived in Corollary 1. The last result, Proposition 1, concerns the asymptotic detectability related to the other 2-point correlation function in Definition 3. The concluding Sections 3 and 4 are devoted to the proofs for the complete graph and the community model, respectively.

1.7. Basic standard notation

In what follows we will use the following standard asymptotic notation. For given positive sequences $f(N)$ and $g(N)$ , we write:

• $f(N)=o(g(N))$ if $\lim_{N\to\infty}\frac{f(N)}{g(N)}=0$.

• $f(N)=O(g(N))$ if $\limsup_{N\to\infty}\frac{f(N)}{g(N)}<\infty$.

• $f(N)=\omega(g(N))$ if $\lim_{N\to\infty}\frac{f(N)}{g(N)}=\infty$.

• $f(N)=\Omega(g(N))$ if $\liminf_{N\to\infty}\frac{f(N)}{g(N)}>0$.

• $f(N)=\Theta(g(N))$ if $0<\liminf_{N\to\infty}\frac{f(N)}{g(N)}\leq\limsup_{N\to\infty}\frac{f(N)}{g(N)}<\infty$.

• $f(N)\sim g(N)$ if $\lim_{N\to\infty}\frac{f(N)}{g(N)}=1$.

For $k\leq n\in\mathbb{N}$ we will denote by $(n)_{k}:=n(n-1)(n-2)\cdots(n-k)$ the descending factorial. Furthermore, we denote by $I$ the identity matrix, and by $\mathbf{1}$ and $\mathbf{1}^{\prime}$ the column and row vectors of all $1$’s, respectively, where the dimensions will be clear from the context. We will write $A^{Tr}$ for the transpose of a matrix $A$.

2. Results: correlations and emerging partition on mean-field models

Our first result characterizes the LEP-potential in absence of geometry for finite $N$ , and shows that this probability is asymptotically non-degenerate at scale $\sqrt{N}$ :

Theorem 1.

(Mean-field LEP-potential and limiting law) Fix $q>0$ and let $\mathcal{K}_{N}$ be a complete graph on $N\geq 1$ vertices with constant edge weight $w>0$. Then, for all $x\neq y\in[N]$,

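$$U_{q}^{(N)}(x,y)=U_{q}^{(N)}=\sum_{h=1}^{N-1}\frac{q}{q+Nw}\left(\frac{Nw}{q+Nw}\right)^{h-1}\prod_{k=2}^{h}\left(1-\frac{k}{N}\right).\tag{2.1}$$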

Furthermore, if $q=z\cdot w\sqrt{N}$ , for fixed $z,w>0$ , then

$$U_{q}:=\lim_{N\to\infty}U_{q}^{(N)}=\sqrt{2\pi}\,z\,e^{\frac{z^{2}}{2}}\,\mathbb{P}(Z>z),$$

with $Z$ being a standard Gaussian random variable.

Notice that the critical scale $\sqrt{N}$ is the typical length of a LERW path with no killing and, as can be derived from the results in [16], is the typical length of the first branch of Wilson’s algorithm on the complete graph when $q=O(\sqrt{N})$.
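The finite-$N$ formula and its scaling limit can be compared numerically. The sketch below assumes Eq. 2.1 in the form $U^{(N)}_{q}=\sum_{h=1}^{N-1}\frac{q}{q+Nw}\big(\frac{Nw}{q+Nw}\big)^{h-1}\prod_{k=2}^{h}(1-\frac{k}{N})$ and the limit $\sqrt{2\pi}\,z\,e^{z^{2}/2}\,\mathbb{P}(Z>z)$ for $q=z\cdot w\sqrt{N}$. The function names are hypothetical, and the Gaussian tail is computed through the complementary error function.

```python
import math


def lep_potential_complete(N, q, w=1.0):
    """Finite-N LEP-potential on the complete graph, summed term by term."""
    r = q / (q + N * w)          # killing probability per discrete step
    total = 0.0
    survive = 1.0                # (1 - r)^(h - 1)
    no_return = 1.0              # prod_{k=2}^{h} (1 - k/N); empty product at h = 1
    for h in range(1, N):
        if h >= 2:
            no_return *= 1.0 - h / N
        total += r * survive * no_return
        survive *= 1.0 - r
    return total


def lep_potential_limit(z):
    """Limit sqrt(2*pi) * z * exp(z^2/2) * P(Z > z), Z standard Gaussian."""
    gaussian_tail = 0.5 * math.erfc(z / math.sqrt(2.0))
    return math.sqrt(2.0 * math.pi) * z * math.exp(z * z / 2.0) * gaussian_tail


# Example: q = z * w * sqrt(N) with z = 1, w = 1.
N, z = 10**5, 1.0
print(lep_potential_complete(N, z * math.sqrt(N)), lep_potential_limit(z))
```

For $N=2$ and $w=1$ the sum reduces to the single term $q/(q+2)$, which can be checked by hand from the two-vertex graph. For very large $z$ the product $e^{z^{2}/2}\,\mathbb{P}(Z>z)$ should be evaluated more carefully than in this sketch to avoid overflow.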

Our second result is the analogue of Eq. 2.1 when every vertex is still accessible from any other, but the edge weights are non-homogeneous and give rise to a community structure. In this sense we will informally refer to this network as a mean-field-community model. Formally, for given positive reals $w_{1}$ and $w_{2}$, we denote by $\mathcal{K}_{2N}(w_{1},w_{2})$ the graph $G$ with $V=[2N]$, and $w(e)=w_{1}$ if $e=(x,y)$ is such that either $x,y\in[N]$ or $x,y\in[2N]\setminus[N]$, and $w(e)=w_{2}$ otherwise. Thus, the weight $w_{1}$ measures the pairwise connection intensity within the same community, while $w_{2}$ measures the intensity between pairs of nodes belonging to different communities. Given the symmetry of the model, we will use the notation $U^{(N)}_{q}(out)$ to refer to the potential $U^{(N)}_{q}(x,y)$ for $x$ and $y$ in different communities. Conversely, we set $U^{(N)}_{q}(in)$ for the potential associated to two nodes belonging to the same community.

Theorem 2.

(LEP-potential for mean-field-community model) Fix $q,w_{1},w_{2}>0$ and consider a two-community-graph $\mathcal{K}_{2N}(w_{1},w_{2})$. Let $T_{q}\geq 1$ be a geometric random variable with success parameter

$$\xi_{q,N}:=\frac{q}{q+N(w_{1}+w_{2})}$$

and let $\tilde{X}=\left(\tilde{X}_{n}\right)_{n\in\mathbb{N}_{0}}$ be a discrete-time Markov chain with state space $\{\underline{1},\underline{2}\}$ and transition matrix

$$\tilde{P}=\begin{pmatrix}p&1-p\\1-p&p\end{pmatrix},\qquad p=\frac{w_{1}}{w_{1}+w_{2}}.$$

Denote by $\ell(n)=\sum_{m<n}\mathbf{1}_{\left\{\tilde{X}_{m}=\underline{1}\right\}}$ the corresponding local time in state $\underline{1}$ up to time $n$ and by $\tilde{\mathbb{P}}_{\underline{1}}$ the corresponding path measure starting from $\underline{1}$.

For $x\in[N]$, set $\star=in$ if $y\in[N]$, and $\star=out$ if $y\in[2N]\setminus[N]$; then

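$$U_{q}^{(N)}(x,y)=U_{q}^{(N)}(\star):=\sum_{n\geq 1}\mathbb{P}(T_{q}=n)\sum_{k=1}^{n}\tilde{\mathbb{P}}_{\underline{1}}(\ell(n)=k)\,N^{-n+1}\hat{f}(n,k)\,\theta(n,k)\,P_{\star}^{\dagger}(n,k)\tag{2.3}$$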

where

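$$\hat{f}(n,k)=(N-2)_{k-1}(N-1)_{n-k},\qquad\theta(n,k)=\frac{(q-\lambda_{1}(n,k))(q-\lambda_{2}(n,k))}{q(q+2Nw_{2})}$$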

with, for $i=1,2$ ,

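$$\lambda_{i}(n,k)=-\frac{1}{2}\left[w_{1}n+w_{2}N+(-1)^{i}\sqrt{w_{1}^{2}(2k-n)^{2}+4(N-k)(N-k)w_{2}^{2}}\right],$$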

and

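$$P_{\star}^{\dagger}(n,k)=\frac{q\left(q+k_{\star}(w_{1}-w_{2})+w_{2}N\right)}{[q+kw_{1}][q+(n-k)w_{1}]+Nw_{2}(2q+nw_{1})+w_{2}^{2}[Nn-k(n-k)]}\times\eta_{\star}$$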

with

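$$k_{\star}:=\begin{cases}k,&\text{if }\star=out,\\n-k,&\text{if }\star=in,\end{cases}\qquad\eta_{\star}=\begin{cases}(N-1)(N-n+k-1),&\text{if }\star=out,\\N(N-k-1),&\text{if }\star=in.\end{cases}$$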

The above theorem is saying that the pairwise LEP-potential can be seen as the double-expectation of the function $g_{\star}(n,k)=N^{-n+1}\left(\hat{f}\theta P^{\dagger}_{\star}\right)(n,k)$ in Eq. 2.3 with respect to the geometric time $T_{q}$ and to the local time of the coarse-grained RW $\tilde{X}$ . As can be seen in the proof, the analysis of this model can be in fact reduced to the study of such a coarse-grained RW jumping between the two “lumped communities” up to the independent random time $T_{q}$ . The function $g_{\star}$ is the crucial combinatorial term encoding in the different parameter regimes the most likely trajectories for such a stopped two-state macroscopic walk $\tilde{X}$ .

Remark 1.

(Extensions to many communities of arbitrary sizes and weights) The formula in Eq. 2.3 can also be derived for the general model with an arbitrary number of communities of variable compatible sizes and arbitrary weights within and among communities. The corresponding statement and proof are more involved, but they follow exactly the same scheme as the equal-size two-community case captured in the above theorem. We refer the reader interested in such an extension to [17].

The next theorem gives the limit of the LEP-potential computed in Theorem 2; the resulting scenario is summarized in the phase diagram in Fig. 1.

Theorem 3.

(Detectability and phase diagram for two communities) Under the assumptions of Theorem 2, set $w_{1}=1$, $w_{2}=N^{-\beta}$ and $q=N^{\alpha}$ for some $\alpha\in\mathbb{R},\ \beta\in\mathbb{R}^{+}$. Then:

(a) if $1-\beta<\alpha=\frac{1}{2}$, $\lim_{N\to\infty}U^{(N)}_{q}(out)=1$ and $\lim_{N\to\infty}U^{(N)}_{q}(in)=\varepsilon_{0}(\beta)\in(0,1)$.

(b) if $1-\beta<\alpha<\frac{1}{2}$, $\lim_{N\to\infty}U^{(N)}_{q}(out)=1$ and $\lim_{N\to\infty}U^{(N)}_{q}(in)=0$.

(c) if $\alpha=1-\beta<\frac{1}{2}$, $\lim_{N\to\infty}U^{(N)}_{q}(out)=\varepsilon_{2}(\alpha,\beta)\in(0,1)$ and $\lim_{N\to\infty}U^{(N)}_{q}(in)=0$.

(d) if $\alpha<\min\{\frac{1}{2},1-\beta\}$, $\lim_{N\to\infty}U^{(N)}_{q}(\star)=0$, $\star\in\{in,out\}$.

(e) if $\alpha=\frac{1}{2}<1-\beta$, $\lim_{N\to\infty}U^{(N)}_{q}(\star)=\varepsilon_{1}(\alpha,\beta)\in(0,1)$, $\star\in\{in,out\}$.

(f) if $\alpha>\frac{1}{2}$, $\lim_{N\to\infty}U^{(N)}_{q}(\star)=1$, $\star\in\{in,out\}$.
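The case distinction in items (a)–(f) can be encoded as a small lookup, which may help when reading the phase diagram. This is only a sketch of the statement: the function name and the return format are hypothetical, the unspecified constants $\varepsilon_{0},\varepsilon_{1},\varepsilon_{2}$ are reported as the string `"in (0,1)"`, and exact rationals are used so that the boundary cases are compared without rounding.

```python
from fractions import Fraction


def lep_phase(alpha, beta):
    """Return (region, limit_out, limit_in) following items (a)-(f).

    alpha, beta: exponents in q = N^alpha and w_2 = N^(-beta), beta >= 0.
    Limits are 0, 1, or "in (0,1)" when only non-degeneracy is stated.
    Returns None at alpha = 1/2 = 1 - beta, which the items do not cover.
    """
    half = Fraction(1, 2)
    gap = 1 - beta               # the threshold 1 - beta appearing in the items
    nondegenerate = "in (0,1)"
    if alpha > half:
        return ("f", 1, 1)
    if alpha == half:
        if gap < alpha:
            return ("a", 1, nondegenerate)
        if alpha < gap:
            return ("e", nondegenerate, nondegenerate)
        return None
    if gap < alpha:
        return ("b", 1, 0)
    if alpha == gap:
        return ("c", nondegenerate, 0)
    return ("d", 0, 0)


# Example: q = N^(3/10), w_2 = N^(-9/10).
print(lep_phase(Fraction(3, 10), Fraction(9, 10)))
```

The in and out limits differ exactly in regions (a), (b) and (c), which are the regions where the potential separates the two communities.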

Remark 2.

(Anticommunities for negative $\beta$) The above theorem is stated for arbitrary $\alpha\in\mathbb{R}$ and $\beta>0$. We notice that while for $\beta=0$ we are back to the complete graph with constant weight 1, for $\beta<0$ it would be more appropriate to speak about “anticommunities” rather than communities. In fact, in this case, at every step the RW prefers to change community rather than staying in its original one. Thus, it is somewhat artificial to see what the loop-erased partitioning captures. This is the reason why the plot in Fig. 1 is restricted to $\beta\geq 0$. However, the theorem still remains valid for negative $\beta$ and, not surprisingly, the difference between the in and out potentials turns out to be zero.

The next statement collects some simple consequences, deduced from these two-point LEP-potential, on the macroscopic structure of $\Pi_{q}$ . We recall that $|\Pi_{q}|$ stands for the number of blocks in the random partition $\Pi_{q}$ .

Corollary 1.

(Macroscopic emergent structure) Under the assumptions of Theorem 3, the following scenarios hold true. If $\beta>0$, there exists $c>0$ depending only on $\alpha$ and $\beta$ s.t.

$$\mathbb{P}\left(|\Pi_{q}|=cN^{\alpha\wedge 1}(1\pm o(1))\right)=1-o(1).$$

Moreover:

(a) if $1-\beta<\alpha=\frac{1}{2}$ then whp there are two blocks of linear size s.t. each block has a fraction $(1-o(1))$ of vertices from the same community.

(b) if $1-\beta<\alpha<\frac{1}{2}$ then whp there are two blocks of size $N(1-o(1))$ s.t. each block has a fraction $(1-o(1))$ of vertices from the same community.

(c) if $\alpha=1-\beta<\frac{1}{2}$ then whp there is at least a block of linear size.

(d) if $\alpha<\min\{\frac{1}{2},1-\beta\}$ then whp there is one block of size $2N(1-o(1))$.

(e) if $\alpha=\frac{1}{2}<1-\beta$ then whp there is at least a block of linear size.

(f) if $\alpha>\frac{1}{2}$ then whp blocks of linear size do not exist.

Theorem 3 says that the LEP–potential contains sufficient information to detect the underlying communities in a parametric region where the ratio $w_{1}/w_{2}=N^{\beta}$ of the in and out weights is bigger than $\sqrt{N}$. This suggests that estimating the probabilities in Definition 2 could be a valuable method to design a community detection algorithm for well-separated regions. Nonetheless, there can be other observables associated to $\Pi_{q}$ which perform better, meaning e.g. that they can be used for detection beyond regions (a)–(c) in Fig. 1. However, it is not the scope of this paper to explore the practical applications and implications of this loop-erased partitioning in the context of community detection. For this reason we will omit complexity and other algorithmic considerations. As already mentioned, our main goal is rather to start understanding analytically the measure $\mu_{q}$ and its emergent structure.

Our last result, Proposition 1, is the analogue of Theorem 3 for the RW-potential in Definition 3 and shows that this other potential gives essentially no insight on the emergent partition and that very little can be detected from it. To state the result, we first give in the next lemma a characterization of the RW-potential which reveals that this other 2-body interaction is in fact determined only by the RW flow in the graph rather than by the LEP–measure.

Lemma 1.

(RW–potential independent of LEP structure) For any arbitrary graph $G$ on $N$ vertices, the pairwise correlation function in Definition 3 admits the following representation:

$$\overline{U}_{q}(x,y)=N^{2}\left[K_{q}(x,x)K_{q}(y,y)-K_{q}(x,y)K_{q}(y,x)\right],$$

where

$$K_{q}(x,y):=q(q-\mathcal{L})^{-1}(x,y)=\mathbb{P}_{x}(X(\tau_{q})=y)$$

is, up to the factor $q$ , the Green’s kernel of the RW $X$ stopped at an independent exponentially distributed time $\tau_{q}$ , with rate $q$ .
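Since the lemma reduces the RW-potential to a matrix inverse, it can be evaluated directly on small graphs. The sketch below assumes the representation $\overline{U}_{q}(x,y)=N^{2}[K_{q}(x,x)K_{q}(y,y)-K_{q}(x,y)K_{q}(y,x)]$ with $K_{q}=q(qI-\mathcal{L})^{-1}$ and $\mathcal{L}=\mathcal{A}-\mathcal{D}$. The function names and the dense list-of-lists weight matrix are hypothetical choices, and a plain Gauss-Jordan elimination is used only to keep the example free of external libraries.

```python
def invert(matrix):
    """Invert a square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(matrix)
    aug = [[float(v) for v in row] + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot_row = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot_row] = aug[pivot_row], aug[col]
        pivot = aug[col][col]
        aug[col] = [v / pivot for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0.0:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def rw_potential(W, q, x, y):
    """RW-potential of the pair (x, y) from the Green's kernel K_q.

    W: symmetric weight matrix (list of lists, diagonal ignored); q > 0.
    """
    n = len(W)
    # Entries of q*I - L, with L = A - D the discrete Laplacian.
    M = [[q + sum(W[i][k] for k in range(n) if k != i) if i == j else -W[i][j]
          for j in range(n)] for i in range(n)]
    inverse = invert(M)

    def K(a, b):
        return q * inverse[a][b]

    return n * n * (K(x, x) * K(y, y) - K(x, y) * K(y, x))


# Example: complete graph on 5 vertices with weight 2 and q = 3.
N, w, q = 5, 2.0, 3.0
W = [[0.0 if i == j else w for j in range(N)] for i in range(N)]
print(rw_potential(W, q, 0, 1))
```

On the complete graph with constant weight $w$, a rank-one inversion gives $K_{q}(x,x)=\frac{q+w}{q+Nw}$ and $K_{q}(x,y)=\frac{w}{q+Nw}$ for $x\neq y$, so the output can be compared with $N^{2}q(q+2w)/(q+Nw)^{2}$. This closed form is a side computation for checking the sketch, not a statement from the paper.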

We can now state the detectability captured by this RW–potential in the mean-field-community model. As for the LEP-potential we adapt the notation $\overline{U}_{q}(in/out)$ to distinguish between pairs within the same community or not.

Proposition 1.

(Detectability via RW–potential) Consider the two–community–graph $\mathcal{K}_{2N}(w_{1},w_{2})$ with $w_{1}=1$, $w_{2}=N^{-\beta}$ and $q=\Theta(N^{\alpha})$. Then, if $\alpha\leq 0$ and $\beta>1-\alpha$,

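$$\overline{U}_{q}(\star)\sim\begin{cases}4q^{2}+8q&\text{if }\star=in,\\4q^{2}+8q+4&\text{if }\star=out.\end{cases}$$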

On the other hand:

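$$\overline{U}_{q}(in)\sim\overline{U}_{q}(out)\sim\begin{cases}4q(q+1)&\text{if }\alpha\leq 0\text{ and }\beta<1-\alpha,\\N^{\max\{2,2\alpha\}}&\text{if }\alpha>0.\end{cases}$$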

As anticipated, this last statement shows that this RW-potential is less informative than the LEP one. In particular, the detectable parametric region is narrower and corresponds to the triangle for $\alpha\leq 0$ in the detectable region depicted in Fig. 1.

3. Proofs of Theorem 1: homogeneous complete graph

Proof of Eq. 2.1

For convenience, we consider a discretization of the continuous time Markov process with generator

[TABLE]

Set $L=\frac{1}{Nw}\mathcal{L}$ , so that $L=I-\frac{1}{N}\mathbf{1}\mathbf{1}^{\prime}$ and the associated transition matrix is given by

[TABLE]

If we consider the killing as an absorbing state within the state space of the Markov chain extended from $V$ to $V\bigcup\{\Delta\}$ , $\Delta$ denoting this absorbing state, we get the adjacency matrix

[TABLE]

and generator

[TABLE]

We can then normalize it by setting

[TABLE]

and get a discrete RW with transition matrix given by

[TABLE]

where

[TABLE]

It should be clear that a sample of a LE-path starting at a given vertex can be obtained as the output of the following procedure:

•

With probability $r$ the discrete process reaches the absorbing state. In particular we set $T_{q}$ for a geometric random variable of parameter $q/(Nw+q)$ .

•

With probability $1-r$ the LERW moves according to the law $P(v,\cdot)$ where $v$ is the last reached node.

•

We call $H_{n}$ the set of vertices covered by the LE-path up to time $n$ . Then, if at time $n+1$ the transition $X_{n}\to X_{n+1}$ takes place and the vertex $X_{n+1}\not\in H_{n}$ , then $H_{n+1}=H_{n}\cup\{X_{n+1}\}$ . Conditioning on $|H_{n}|$ , the latter event occurs with probability $\frac{N-|H_{n}|}{N}$ . Conversely, if $X_{n+1}\in H_{n}$ , then we remove from $H_{n}$ all the vertices that have been visited by the LERW since its last visit to $X_{n+1}$ . As a consequence, the quantity $|H_{n}|$ decreases. One can then compute that the reductions occur with law

[TABLE]
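The procedure above can be sketched as a short simulation. This is a minimal illustration under our reading of the discretized chain: at each step the walk is killed with probability $r=q/(Nw+q)$ and otherwise jumps to a uniformly chosen vertex of $\mathcal{K}_{N}$ , possibly the current one. The function name `killed_lerw_complete` and its arguments are hypothetical.

```python
import random


def killed_lerw_complete(n, w, q, start=0, rng=None):
    """Sample a killed loop-erased path on the complete graph with n vertices.

    Requires q > 0, otherwise the walk is never killed and the loop does not end.
    """
    rng = rng or random.Random()
    r = q / (n * w + q)           # killing probability per discrete step
    path = [start]                # current loop-erased path
    index = {start: 0}            # position of each covered vertex along the path
    while rng.random() >= r:      # the walk survives with probability 1 - r
        v = rng.randrange(n)      # uniform jump, the current vertex included
        if v in index:
            # Loop closed at v: erase everything visited after v.
            cut = index[v] + 1
            for u in path[cut:]:
                del index[u]
            del path[cut:]
        else:
            index[v] = len(path)
            path.append(v)
    return path


path = killed_lerw_complete(50, 1.0, 5.0, start=3, rng=random.Random(0))
print(len(path), path)
```

The length of the returned list plays the role of $|H_{n}|$ at the killing time; a jump onto the current endpoint leaves the path unchanged, in line with the bear being allowed to select the step it stands on.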

It is easier to study the quantity $|H_{n}|$ through the following metaphor. We interpret $|H_{n}|$ as the height from which a bear falls down while moving on a stair of height $n$ . In particular, we will assume that

•

The bear starts with probability 1 from the first stair.

•

At each time the bear selects a step of the stair uniformly at random, including the step he currently stands on.

•

If the choice made by the bear is a lower step (or the current one), he moves to that step.

•

If he chooses an upper step, then he walks in the upper direction by a single step.

•

Before doing each step, there is a probability $r$ as in Eq. 3.7 that the bear “falls down”.

Let us next fix $q=0$ , that is, $r=0$ , so that we can study the bear’s dynamic independently of his falling. By setting $Z(n)$ for the position of the bear at time $n\in\mathbb{N}$ , we get

[TABLE]

The latter implies that at time $n=h$ we reached the ergodic measure over the first $h$ steps of the stair, while at time $n=N$ the probability measure is exactly the ergodic one.

It is interesting to notice that an easier expression can be written for the cumulative distribution of the variable $Z(n)$ , i.e.

[TABLE]
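The bear dynamics without falling can be checked exactly on a small stair. The sketch below uses our own indexing, which may differ from the paper's by a shift: the bear stands on step 1 at time 0, and from step $h$ it moves to a uniformly chosen step $j\leq h$ with probability $1/N$ each, or to step $h+1$ with probability $(N-h)/N$ . Under this assumption a one-step argument gives $\mathbb{P}(Z(n+1)>h)=\mathbb{P}(Z(n)>h-1)\,\frac{N-h}{N}$ , hence the product form $\mathbb{P}(Z(n)>h)=\prod_{j=1}^{h}(1-j/N)$ for $h\leq n$ , which the code verifies.

```python
from fractions import Fraction


def bear_distribution(N, n):
    """Exact law of Z(n) on steps 1..N (index 0 unused), starting from Z(0) = 1."""
    dist = [Fraction(0)] * (N + 1)
    dist[1] = Fraction(1)
    for _ in range(n):
        new = [Fraction(0)] * (N + 1)
        for h in range(1, N + 1):
            p = dist[h]
            if p == 0:
                continue
            for j in range(1, h + 1):        # a lower step or the current one is chosen
                new[j] += p / N
            if h < N:                        # an upper step is chosen: climb by one
                new[h + 1] += p * (N - h) / N
        dist = new
    return dist


def tail_product(N, h):
    """Candidate closed form for P(Z(n) > h), valid for h <= n."""
    out = Fraction(1)
    for j in range(1, h + 1):
        out *= Fraction(N - j, N)
    return out


N = 7
for n in range(N + 2):
    dist = bear_distribution(N, n)
    for h in range(min(n, N) + 1):
        assert sum(dist[h + 1:]) == tail_product(N, h)
print(bear_distribution(N, N - 1) == bear_distribution(N, N))
```

With this indexing the law stops changing from time $N-1$ on, which matches the statement that the ergodic measure is reached exactly after a number of steps of order $N$ .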

Next, calling $T^{-}$ the time immediately before the bear falls, we get

[TABLE]

which gives us the distribution of the last step of the bear before his fall. Recall that this is equivalent to the length of the original LERW starting at $x\in\mathcal{K}_{N}$ , when the walk is stopped at an exponential time of rate $q$ . Hence, we are now left to compute the probability that another walker, starting at $y\not=x$ , is killed before it hits the previously sampled LERW.

Thanks to the bear metaphor, for the size of the LE-trajectory we get:

[TABLE]

and by explicit computation, setting $T_{\Gamma}$ for the first hitting time of the LE-path $\Gamma$ ,

[TABLE]

∎

Proof of Eq. 2.2

Let

[TABLE]

and notice that if $q=x\sqrt{N}$ , with $x,w=\Theta(1)$ , then

[TABLE]

Call

[TABLE]

in order to rewrite

[TABLE]

and notice that the first term in the latter sum is the probability that the geometric random variable $T_{q}\overset{d}{\sim}Geom\left(\frac{\xi_{q}}{N}\right)$ assumes value $k$ . Moreover it trivially holds that

[TABLE]

Hence,

[TABLE]

Let us approximate $\ln f(k+1,N)$ at the first order as follows

[TABLE]

Next, set $Y\,\overset{d}{\sim}\,exp(x)$ and $Z\,\overset{d}{\sim}\,\mathcal{N}(0,1)$ , notice that $\mathbb{E}[e^{-\frac{Y^{2}}{2}}]=\sqrt{2\pi}xe^{\frac{x^{2}}{2}}\mathbb{P}(Z>x)$ and that

[TABLE]

since $T_{q}/\sqrt{N}$ converges in distribution to $Y$ as $N$ diverges. In view of the latter together with Eq. 3.22, we can estimate

[TABLE]

where the last inequality holds true by choosing any $\delta\in\left(\frac{1}{2},\frac{2}{3}\right)$ which in particular guarantees that $c_{N}(k)=o(1)$ . ∎
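The Gaussian identity used in the proof, read with the exponent $-Y^{2}/2$ (the expectation of $e^{+Y^{2}/2}$ is infinite for an exponential variable), follows by completing the square: $\int_{0}^{\infty}xe^{-xy}e^{-y^{2}/2}\,dy=xe^{x^{2}/2}\int_{x}^{\infty}e^{-u^{2}/2}\,du$ . The sketch below checks it numerically; the quadrature scheme, the truncation at 40 and the function names are our own choices.

```python
import math


def lhs(x, upper=40.0, steps=200_000):
    """E[exp(-Y^2 / 2)] for Y ~ Exp(x), by the composite trapezoidal rule on [0, upper]."""
    h = upper / steps

    def f(y):
        return x * math.exp(-x * y - 0.5 * y * y)

    total = 0.5 * (f(0.0) + f(upper))
    for i in range(1, steps):
        total += f(i * h)
    return total * h


def rhs(x):
    """sqrt(2 pi) * x * exp(x^2 / 2) * P(Z > x) for a standard Gaussian Z."""
    tail = 0.5 * math.erfc(x / math.sqrt(2.0))
    return math.sqrt(2.0 * math.pi) * x * math.exp(0.5 * x * x) * tail


for x in (0.5, 1.0, 2.0):
    print(x, lhs(x), rhs(x))
```

The two columns agree up to the quadrature error, and the common value lies strictly between 0 and 1 for every $x>0$ .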

4. Proofs for mean-field-communities

4.1. Proof of Theorem 2

We use here the same line of argument used in the proof of Theorem 1. We will consider the process having state space $V=V_{1}\sqcup V_{2}$ , where

[TABLE]

and generator

[TABLE]

We will specialize later on the case $N_{1}=N_{2}=N$ .

We now consider a killed LERW $\Gamma$ , and we denote by $\Gamma_{i}$ the set of points of the $i$ -th community belonging to $\Gamma$ , i.e.,

[TABLE]

We can write

[TABLE]

and we assume, without loss of generality, that $x\in V_{1}$ ; then, by conditioning, we get for $y\neq x$ with $y\in V_{j}$ , $j=1,2$

[TABLE]

$T_{\Gamma}$ being the hitting time of $\Gamma$ .

The LERW starting from $x$

A result due to Marchal [13] provides the following explicit expression for the probability of a loop erased trajectory:

[TABLE]

By looking closely at the latter formula we distinguish two parts: a product over the weights of the edges of the path and an algebraic part containing the ratio of two determinants which encodes the “loop-erased” feature of the process. In particular we notice that the former contains all the details about the trajectory, while the latter only depends on the number of points visited in each community. Let $j_{1}$ (respectively, $j_{2}$ ) be the number of jumps from the first community to the second (from the second to the first, respectively) along the LE-path. We have

[TABLE]

where

•

The first binomial coefficient stands for the choice of the $k_{1}-1$ points of $\Gamma_{1}$ other than $x$ (which must belong to it) among the $N_{1}-1$ points of the first community other than $x$ . In the second community we can choose any $k_{2}$ vertices among the $N_{2}-1$ vertices of the second community other than $y$ .

•

The factorials stand for the possible orderings of the nodes covered in each community. Notice that the path on the first community must start at $x$ .

•

We sum over all the possible numbers of jumps from the first community to the second, $j_{1}$ , and from the second to the first, $j_{2}$ (notice that $j_{2}$ must be equal to $j_{1}$ or one smaller).

•

For any choice made in the previous three terms we obtain a path whose probability is given by the Marchal formula.

In the case in which we condition on having both $x$ and $y$ in the same (first, say) community we have

[TABLE]

Namely, only the first combinatorial term changes.

The ratio of determinants

In our mean-field setup, the terms in Eq. 4.6 and Eq. 4.7 coming from Eq. 4.5 can be explicitly computed. We consider here the two communities case, i.e. $V=V_{1}\sqcup V_{2}$ , where the communities possibly have different sizes, $|V_{1}|=N_{1}$ and $|V_{2}|=N_{2}$ . Now, consider the matrix obtained by erasing $k_{1}$ ( $k_{2}$ ) rows and corresponding columns in the first community (the second one, respectively) in $-\mathcal{L}$ . We are left with a square matrix made of two square blocks on the diagonal of size $N_{1}-k_{1}=:K_{1}$ (respectively $N_{2}-k_{2}=:K_{2}$ ). We will denote this matrix by

[TABLE]

where the elements on the diagonal are given by

[TABLE]

We want to find $K_{1}+K_{2}$ solutions of the problem

[TABLE]

First we consider eigenvectors of the form $v=(x_{1},x_{1},...,x_{1},x_{2},...,x_{2})^{Tr}$ , where the upper component has length $K_{1}$ and the lower one has length $K_{2}$ . If we write explicitly Eq. 4.10 we get the following linear system:

[TABLE]

from which we get two eigenvalues, which we will refer to as $\lambda_{1}$ and $\lambda_{2}$ .

Then we consider $v=(x_{1},x_{2},...,x_{K_{1}},0,...,0)^{Tr}$ ; with this choice we are left with the system

[TABLE]

and we have to find $K_{1}-1$ eigenvalues that are associated with eigenvectors orthogonal to constants. By direct computation, $A_{1}$ has eigenvalue $\lambda_{1}^{\prime}:=(N_{1}w_{1}+N_{2}w_{2})$ with multiplicity $K_{1}-1$ . With the opposite choice, namely $v=(0,...,0,x_{1},...,x_{K_{2}})^{Tr}$ , we get

[TABLE]

Namely, there is an eigenvalue $\lambda_{2}^{\prime}:=(N_{2}w_{1}+N_{1}w_{2})$ with multiplicity $K_{2}-1$ . So the spectrum of $M$ is

[TABLE]

with multiplicity denoted by $\mu_{M}(\cdot)$ :

[TABLE]
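The spectral structure just described can be verified exactly on a small instance. The sketch below builds the matrix obtained from $-\mathcal{L}$ by deleting $k_{1}$ and $k_{2}$ rows and columns, and compares its determinant with the product $(\lambda_{1}^{\prime})^{K_{1}-1}(\lambda_{2}^{\prime})^{K_{2}-1}\lambda_{1}\lambda_{2}$ . The $2\times 2$ system acting on block-constant vectors is our own derivation from the block structure, which we take to be the content of Eq. 4.11; all function names are hypothetical.

```python
from fractions import Fraction


def reduced_laplacian(n1, n2, k1, k2, w1, w2):
    """-L of the two-community graph with k1 (k2) rows and columns removed in block 1 (2)."""
    K1, K2 = n1 - k1, n2 - k2
    d1 = (n1 - 1) * w1 + n2 * w2          # diagonal entry for a vertex of community 1
    d2 = (n2 - 1) * w1 + n1 * w2          # diagonal entry for a vertex of community 2
    size = K1 + K2
    M = [[Fraction(0)] * size for _ in range(size)]
    for i in range(size):
        for j in range(size):
            if i == j:
                M[i][j] = Fraction(d1 if i < K1 else d2)
            else:
                same_block = (i < K1) == (j < K1)
                M[i][j] = Fraction(-w1 if same_block else -w2)
    return M


def det(M):
    """Exact determinant by Gaussian elimination over the rationals."""
    A = [row[:] for row in M]
    n, result = len(A), Fraction(1)
    for c in range(n):
        pivot = next((r for r in range(c, n) if A[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            A[c], A[pivot] = A[pivot], A[c]
            result = -result
        result *= A[c][c]
        for r in range(c + 1, n):
            factor = A[r][c] / A[c][c]
            for cc in range(c, n):
                A[r][cc] -= factor * A[c][cc]
    return result


def det_from_spectrum(n1, n2, k1, k2, w1, w2):
    """Determinant predicted by the spectrum: bulk eigenvalues times the 2x2 determinant."""
    K1, K2 = n1 - k1, n2 - k2
    lam1p = Fraction(n1 * w1 + n2 * w2)   # multiplicity K1 - 1
    lam2p = Fraction(n2 * w1 + n1 * w2)   # multiplicity K2 - 1
    # Action of the reduced matrix on block-constant vectors (x1,...,x1,x2,...,x2).
    a, b = k1 * w1 + n2 * w2, -K2 * w2
    c, d = -K1 * w2, k2 * w1 + n1 * w2
    return lam1p ** (K1 - 1) * lam2p ** (K2 - 1) * (a * d - b * c)


args = (4, 5, 1, 2, Fraction(1), Fraction(1, 3))
print(det(reduced_laplacian(*args)), det_from_spectrum(*args))
```

Working with `Fraction` weights keeps the comparison exact, so the two printed numbers coincide rather than agreeing only up to rounding.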

Therefore, we can see that the ratio of determinants in Eq. 4.6 and Eq. 4.7 can be written explicitly. Indeed, at the denominator we have

[TABLE]

while at the numerator we are left with

[TABLE]

where

[TABLE]

while $\lambda_{1}$ and $\lambda_{2}$ are the two solutions of the system in Eq. 4.11. In particular, if we specialize in the case $N_{1}=N_{2}=N$ we can conclude that the ratio of determinants is given by

[TABLE]

where we defined

[TABLE]

and

[TABLE]

The path starting from $y$

Now we have to consider the second path starting from $y$ which decides the root at which $y$ will be connected in the forest generated by the algorithm. The latter corresponds to the second factor in Eq. 4.4. Notice that it is sufficient to consider such path in the simpler fashion, i.e. without erasing the loops, since we are only concerned with the absorption of the walker: either in $\gamma$ or killed at rate $q$ . Moreover, we can exploit again the symmetry of the model to reduce it to a Markov chain $\bar{X}$ with state space $\{\bar{1},\bar{2},\bar{3},\bar{4}\}$ corresponding to the sets $\left\{V_{1}\setminus\gamma_{1},V_{2}\setminus\gamma_{2},\gamma_{1}\sqcup\gamma_{2},\Delta\right\}$ , where $\Delta$ is again the absorbing state, i.e., the “state-independent” exponential killing. We will assume that

[TABLE]

Hence, the transition matrix we are interested in is given by

[TABLE]

where

[TABLE]

with

[TABLE]

The states represent:

( $\bar{1}$ )

nodes of the $1^{st}$ community that have not been covered by the LE-path started at $x$ .

( $\bar{2}$ )

nodes of the $2^{nd}$ community that have not been covered by the LE-path started at $x$ .

( $\bar{3}$ )

nodes of both communities that have been covered by the LE-path started at $x$ .

( $\bar{4}$ )

the absorbing state $\Delta$ .

Calling $T_{abs}$ the hitting time of the absorbing set $\left\{\bar{3},\bar{4}\right\}$ , we want to compute the probability that the process $\bar{X}$ is absorbed in the state $\bar{4}$ and not in $\bar{3}$ . In terms of our original process, this means that the process is killed before hitting the LE-path starting at $x$ . By direct computation

[TABLE]

Notice that the first component of the vector $P^{\dagger}\in\mathbb{R}^{2}$ corresponds to the intra-community case $\left\{x,y\right\}\subset V_{i}$ for some $i$ , i.e., $U^{(N)}_{q}(in)$ , while the second one corresponds to the inter-community case, namely $U^{(N)}_{q}(out)$ .
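The absorption problem for the lumped chain $\bar{X}$ reduces to a $2\times 2$ linear system. The sketch below is an illustration under our own parametrization: we use the jump chain of the continuous-time walk (self-loops excluded), which may differ from the normalization of the paper but yields the same absorption probabilities. It assumes $x\in V_{1}$ and at least one uncovered vertex in each community; the function name and arguments are hypothetical.

```python
from fractions import Fraction


def kill_before_hit(N1, N2, k1, k2, w1, w2, q):
    """Probability of being killed before hitting the LE-path gamma.

    Returns (h1, h2) for a walker starting from an uncovered vertex of
    community 1 or 2; gamma covers k1 vertices of community 1 and k2 of community 2.
    Pass Fraction arguments for exact output.
    """
    K1, K2 = N1 - k1, N2 - k2
    # Total jump rate out of a vertex of community 1, killing included.
    T1 = (N1 - 1) * w1 + N2 * w2 + q
    p11, p12, p1_kill = (K1 - 1) * w1 / T1, K2 * w2 / T1, q / T1
    # Same quantities for a vertex of community 2.
    T2 = (N2 - 1) * w1 + N1 * w2 + q
    p22, p21, p2_kill = (K2 - 1) * w1 / T2, K1 * w2 / T2, q / T2
    # Solve (I - Q) h = kill, with Q = [[p11, p12], [p21, p22]], by Cramer's rule.
    a, b, c, d = 1 - p11, -p12, -p21, 1 - p22
    determinant = a * d - b * c
    h1 = (p1_kill * d - b * p2_kill) / determinant
    h2 = (a * p2_kill - c * p1_kill) / determinant
    return h1, h2


# Uniform weights: the answer must be q / (q + |gamma| w), whatever the community.
print(kill_before_hit(5, 5, 2, 1, Fraction(1, 3), Fraction(1, 3), Fraction(2)))
```

In the uniform case $w_{1}=w_{2}=w$ the two components coincide and equal $q/(q+(k_{1}+k_{2})w)$ , which is a convenient sanity check; with $w_{2}\ll w_{1}$ and $\gamma$ contained in the first community, the second component is much closer to one than the first.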

If we now use the assumption that $N_{1}=N_{2}=N$ , the steps above allow us to write the following formulas

[TABLE]

where

[TABLE]

$\theta(k_{1},k_{2})$ as in Eq. 4.19 and

[TABLE]

By direct computation we see that

[TABLE]

where

[TABLE]

Local time interpretation

Now consider the part of the formula concerning the jumps among the two communities of the killed-LE-path starting at $x$ , i.e.

[TABLE]

The latter can be thought of as a function of a Markov Chain $(\tilde{X}_{n})_{n\in\mathbb{N}}$ on the state space $\left\{\underline{1},\underline{2}\right\}$ , with transition matrix

[TABLE]

where the $\underline{i}$ -th state stands for the $i$ -th community. Indeed, we can rewrite Eq. 4.32 as

[TABLE]

with $\ell$ being the local time as in the statement of Theorem 2.

Geometric smoothing

From the previous steps we get the following expression

[TABLE]

Next, we would like to make a geometric term appear, as in the complete and uniform case of Theorem 1. Notice that multiplying and dividing by $N^{k_{1}+k_{2}-1}$ one obtains

[TABLE]

we can then define

[TABLE]

in order to obtain

[TABLE]

and

[TABLE]

where $T_{q}$ is an independent random variable with law $Geom\left(\xi_{q,N}\right)$ .

Conclusions

One can ideally divide the formulas in Eqs. 4.38 and 4.39 into five terms, namely

(1)

The entropic term

[TABLE]

which was already present in the complete and uniform case, Eq. 2.1. Indeed

[TABLE]

(2)

The term related to the spectrum of the size 2 matrix presented in Eq. 4.11, i.e.

[TABLE]

which is the same in both the in- and out-community cases. It can be rewritten as the ratio between two parabolas in $q$ , i.e.,

[TABLE]

(3)

The term related to the geometric random variable of parameter $\xi_{q,N}$ , which was also present in the case of the uniform graph, Eq. 2.1.

(4)

The term related to the local times of the 2-state Markov chain $\tilde{P}$ , in Eq. 4.33.

(5)

The term related to the absorption probability, i.e., to the quantity $P^{\dagger}$ , see Eq. 4.25, as a function of the process $\bar{P}$ presented in Eq. 4.21.

It is worth noticing that the $P^{\dagger}$ above is slightly different from the $P^{\dagger}_{\star}$ in the statement of Theorem 2 which contains the extra factor $\eta_{\star}$ . At this point by setting

[TABLE]

we can write

[TABLE]

and

[TABLE]

which is equivalent to the statement in Theorem 2. ∎

4.2. Proof of Theorem 3

Proofs of (a) and (b): $1-\beta<\alpha<(=)\frac{1}{2}$ (detectability)

As expressed in the following lemma, in this regime the RW is confined to its starting community for its entire lifetime.

Lemma 2 (RW is confined to its community up to dying).

Let $1>\alpha>1-\beta$ and for $x\in[2N]$ , consider the event

[TABLE]

where $T_{x}^{out}$ is the first time in which the RW moves out of the community in which $x$ lies.

Then, as $N\to\infty$ ,

[TABLE]

Proof.

Let $Z$ be a r.v. that can assume values in the set $\{Out,In,\Delta\}$ with probabilities:

[TABLE]

Let $(Z_{n})_{n\in\mathbb{N}}$ be a sequence of i.i.d. r.v.s with the same law as $Z$ and notice that

[TABLE]

Therefore

[TABLE]

from which the claim follows. ∎
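The mechanism behind Lemma 2 can be illustrated with a one-line computation. In the sketch below we assume, as our own reading of the i.i.d. scheme above, that at each trial the walk is killed at rate $q=N^{\alpha}$ and leaves its community at total rate $Nw_{2}=N^{1-\beta}$ , so that the probability of being killed before ever changing community is $q/(q+Nw_{2})$ . The function name is hypothetical.

```python
def confinement_probability(N, alpha, beta):
    """P(killed before leaving the starting community) = q / (q + N * w2)."""
    q = N ** alpha                  # killing rate
    exit_rate = N * N ** (-beta)    # N vertices in the other community, each with weight w2
    return q / (q + exit_rate)


# alpha > 1 - beta: confinement becomes overwhelmingly likely as N grows.
for N in (10**3, 10**6, 10**12):
    print(N, confinement_probability(N, alpha=0.3, beta=0.9))
```

The expression equals $1/(1+N^{1-\beta-\alpha})$ , which tends to one exactly when $\alpha>1-\beta$ and to zero when $\alpha<1-\beta$ .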

In view of the decomposition in Eq. 1.4 and the above lemma, we can write for any $x\neq y$

[TABLE]

Let us first consider $U^{(N)}_{q}(out)$ . In this case, by Lemma 2, for any $\alpha\leq 1/2$ and uniformly in $\gamma$ , we have that

[TABLE]

As a consequence $\mathbb{P}_{y}(T_{\gamma}>T_{q}|E_{x}^{c})\geq 1-o(1)$ , and by plugging this estimate in Eq. 4.48, we get $U^{(N)}_{q}(out)\to 1$ .

Concerning $U^{(N)}_{q}(in)$ , one has to notice that, for every LERW $\gamma$ starting from $x$ and ending at the absorbing state, we can consider the event

[TABLE]

Once more, uniformly in $\gamma$ , we get by Lemma 2 that

[TABLE]

Thus, for $x,y\in[N]$ , by Eq. 4.48, we can estimate

[TABLE]

Notice that, under such conditioning, the sum can be read as the probability that two vertices in a complete graph with $N$ vertices end up in two different trees. Therefore, this reduces to Eq. 2.2, which in turn gives $U^{(N)}_{q}(in)\to 0$ for $\alpha<1/2$ and $U^{(N)}_{q}(in)\to\varepsilon_{0}(\alpha)$ otherwise. ∎

Proof of (f) : $\alpha>\frac{1}{2}$ (high killing region)

We will only show that $U^{(N)}_{q}(in)\to 1$ ; this suffices since, e.g. by direct computation, one can check that $U^{(N)}_{q}(out)\geq U^{(N)}_{q}(in)$ .

Observe first that, since $\alpha>\frac{1}{2}$ , the length of the loop-erased path $\Gamma$ must be “small” with high probability. In particular we can bound

[TABLE]

hence

[TABLE]

∎

We next prove the remaining items in Theorem 3, for which we will implement a similar strategy that we now explain. In all remaining regimes we need to show that $U^{(N)}_{q}(\star)$ , $\star\in\{in,out\}$ , either vanishes or stays bounded away from zero. To this aim, we will use the representation in Eq. 2.3.

Depending on the parameter regimes, we will split the sum over $t$ in different pieces to be treated according to the asymptotic behavior of the involved factors. To simplify the exposition we will restrict in what follows to the positive quadrant $\alpha,\beta>0$ . We stress however that, as the reader can check, the following estimates hold true and actually converge faster even outside of the positive quadrant.

Let us start with a few observations. We notice that $\hat{f}(n,k)\leq 1$ for every choice of $k,N,n$ , moreover $\hat{f}(t,n)=0$ if $n\geq N$ . Furthermore, for each $N$ ,

[TABLE]

and while estimating the involved factors, the behavior of the product $\left(\hat{f}\theta P_{\star}^{\dagger}\right)(n,k)$ will be crucial; for it we can in general observe the following facts.

(A)

For any $\varepsilon>0$ , if $n>N^{1/2+\varepsilon}$ , then it follows from Eq. 3.23 that $N\mapsto\hat{f}_{N}$ decays to zero, uniformly in $k$ , faster than any polynomial as $N\to\infty$ . For such $n$ ’s , since $N\mapsto\theta_{N}P^{\dagger}_{\star}$ is polynomially bounded (uniformly in $n,k$ ), the contribution in Eq. 2.3 of such terms can be neglected.

(B)

Whenever we consider $n$ ’s for which $\theta P^{\dagger}_{\star}=o(1)$ , because of Eq. 4.49 and the uniform control on $\hat{f}$ , the contribution of such terms in Eq. 2.3 can also be neglected.

(C)

For $n$ ’s for which neither Item A nor Item B holds, we will estimate the asymptotics of such part of the sum by controlling the mass of the geometric time $T_{q}$ against $\theta P^{\dagger}_{\star}$ , and in the most delicate cases (on the separation lines in Fig. 1), taking into account the behavior of the local time too.

We are now ready to treat the remaining parameter regimes using such facts.

Proof of (d): $\alpha<\min\{\frac{1}{2},1-\beta\}$ (changing-communities before dying)

In this regime, the overall picture resembles the phenomenology of the complete graph. In particular, the RW will manage to change community before being killed and, up to the killing time scale, it will forget its starting community. Moreover, with high probability a single tree of size $2N(1-o(1))$ will be formed, so that any two given points $x,y$ will end up in the same tree with high probability, independently of their communities.

To prove the claim notice that, uniformly in $n,k$ ,

[TABLE]

As a consequence the asymptotics of $U^{(N)}_{q}(\star)$ will be independent of $\star$ . To show that such a limit is zero we argue as follows. Within this parameter region:

[TABLE]

which together with Eq. 4.50 leads to

[TABLE]

We can now plug in this asymptotic representation of $\theta P^{\dagger}_{\star}$ in Eq. 2.3, and separately treat the four resulting terms.

For the first term, namely the sum in Eq. 2.3 with $\theta P^{\dagger}_{I}$ in place of $\theta P^{\dagger}_{\star}$ , we split the sum in $n$ into two parts at $N^{\alpha+\varepsilon}$ , for small $\varepsilon>0$ , and show that they both go to zero, by using Item C and Item B, respectively. In fact, with this “cut” we see that:

[TABLE]

Analogously, for the second term we split the sum over $n$ into two parts at $N^{1/2+\varepsilon}$ , with small $\varepsilon>0$ . Using Item C for the first part and Item A for the second one, we see that

[TABLE]

For the third term we need to split the corresponding sum into three parts at $T_{1}:=N^{1-\beta-\varepsilon}$ and $T_{2}:=N^{1/2+\varepsilon}$ , which will be controlled by Item B, Item C and Item A, respectively. That is

[TABLE]

Finally, for the last term, we split the sum at $N^{1/2+\varepsilon}$ . Indeed we see that: on the one hand, for $n\leq N^{1/2+\varepsilon}$ , we can use Item C since

[TABLE]

On the other hand, for $n\geq N^{1/2+\varepsilon}$ , we can argue as in Item A. Hence,

[TABLE]

∎

Proofs of (c) and (e) (high-entropy separating lines)

We start by proving (e), i.e.

[TABLE]

We start by noting that, under our assumptions on $\alpha$ and $\beta$ , we have that

[TABLE]

and

[TABLE]

We are going to split the sum over $n$ in Eq. 2.3 in three parts:

•

$n\leq N^{\frac{1}{2}-\varepsilon}$ . For such $n$ ’s we have that the product $\theta P_{\star}^{\dagger}(n,k)$ is of order $1$ . Hence we can neglect this part by using Item C together with the estimate

[TABLE]

•

$n>N^{\frac{1}{2}+\varepsilon}$ . Also this part can be neglected thanks to the argument of Item A.

•

$N^{\frac{1}{2}-\varepsilon}<n\leq N^{\frac{1}{2}+\varepsilon}$ . This is the delicate non-vanishing part. We start by noticing that, due to Eq. 4.67 and Eq. 4.68, the leading term in $\theta P_{\star}^{\dagger}$ does not involve $k_{\star}$ , so that —at first order— $U^{(N)}_{q}(in)$ must equal $U^{(N)}_{q}(out)$ . In order to show that the latter two are asymptotically bounded away from zero, we fix $c\in(0,1)$ and consider

[TABLE]

Moreover, thanks to Eq. 4.71 we can easily deduce that the limit is strictly smaller than $\frac{1}{2}$ .

We conclude by giving the proof of (c), i.e., we are going to show that

[TABLE]

Observe that, under our assumptions on $\alpha$ and $\beta$ , we have that

[TABLE]

and

[TABLE]

hence, their product behaves asymptotically as

[TABLE]

To evaluate the asymptotic behavior of $U^{(N)}_{q}(\star)$ , we split the sum over $n$ in Eq. 2.3 in three pieces:

•

$n\leq N^{\alpha+\varepsilon}$ : where, thanks to Eq. 4.75, we know that $\theta P_{\star}^{\dagger}(n,k)=O(N^{\varepsilon})$ . We argue as in Item C, obtaining

[TABLE]

•

$n>N^{\frac{1}{2}+\varepsilon}$ : in this case we can argue as in Item A.

•

$N^{\alpha+\varepsilon}<n\leq N^{\frac{1}{2}+\varepsilon}$ : in this case we have to distinguish between $U^{(N)}_{q}(in)$ and $U^{(N)}_{q}(out)$ .

Consider first $U^{(N)}_{q}(in)$ . We call $E_{n}$ the following event concerning the Markov chain $(\tilde{X}_{n})_{n\in\mathbb{N}}$

[TABLE]

Notice that if $N^{\alpha+\varepsilon}<n\leq N^{\frac{1}{2}+\varepsilon}$ then the event $E_{n}^{c}$ occurs with high probability. Hence, for any choice of $n\in[1,N]$ and $k\in[1,n]$ we can write

[TABLE]

$\delta_{k,n}$ being the Kronecker delta. Hence

[TABLE]

Concerning $U^{(N)}_{q}(out)$ , it is easy to get a lower bound via a soft argument by considering the events

[TABLE]

Indeed,

[TABLE]

Finally, we are left to show that $U^{(N)}_{q}(out)$ is asymptotically bounded away from $1$ . We consider the further split

[TABLE]

Focusing on the first sum in the latter display, thanks to Eq. 4.75, we have that

[TABLE]

Concerning the second sum, we have

[TABLE]

∎

4.3. Proof of Corollary 1

Let $0=\lambda_{0}\leq\lambda_{1}\leq\dots\leq\lambda_{2N-1}$ be the eigenvalues of $-\mathcal{L}$ . As shown in [5, Prop. 2.1], the number of blocks of the induced partition, $|\Pi_{q}|$ , is distributed as the sum of $2N$ independent Bernoulli random variables with success probabilities $\frac{q}{q+\lambda_{i}}$ . That is

[TABLE]

In case of the two-communities model we have

[TABLE]

Therefore

[TABLE]

where

[TABLE]

Hence

[TABLE]

Moreover, we can prove the concentration result claimed in the first part of the statement by using the multiplicative version of the Chernoff bound on the sum of $Y_{i}$ ’s. Indeed, denoting by

[TABLE]

we have that

[TABLE]

and since

[TABLE]

we can deduce the concentration of $|\Pi_{q}|$ .

Notice also that the second part of the statement is a trivial consequence of the detectability result of Theorem 3. ∎
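The Bernoulli representation used in this proof makes the first two moments of $|\Pi_{q}|$ easy to evaluate. The sketch below assumes the spectrum of $-\mathcal{L}$ on $\mathcal{K}_{2N}(w_{1},w_{2})$ to be $0$ (simple), $2Nw_{2}$ (simple, eigenvector $\pm 1$ on the two communities) and $N(w_{1}+w_{2})$ (multiplicity $2N-2$ ). This is our own direct computation and should be compared with the display above; function names are hypothetical.

```python
from fractions import Fraction


def two_community_spectrum(N, w1, w2):
    """Eigenvalues of -L on K_{2N}(w1, w2) as (value, multiplicity) pairs."""
    return [(Fraction(0), 1), (2 * N * w2, 1), (N * (w1 + w2), 2 * N - 2)]


def block_count_moments(N, w1, w2, q):
    """Mean and variance of |Pi_q| as a sum of independent Bernoulli(q / (q + lambda_i))."""
    mean = var = Fraction(0)
    for lam, mult in two_community_spectrum(N, w1, w2):
        p = q / (q + lam)
        mean += mult * p
        var += mult * p * (1 - p)
    return mean, var


mean, var = block_count_moments(3, Fraction(1), Fraction(1, 2), Fraction(1))
print(mean, var)
```

Since the variance of a sum of independent Bernoulli variables never exceeds its mean, a Chebyshev or Chernoff argument gives concentration as soon as the mean diverges, which is the regime addressed by the multiplicative Chernoff bound in the proof.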

4.4. Proof of Lemma 1

In this proof we will consider the probability measure $\nu_{q}$ on the space of rooted spanning forests studied in [5], namely,

[TABLE]

where we denoted by $\rho(F)$ the set of roots of $F\in\mathcal{F}$ . As mentioned in Section 1.2, we stress that the measure in Definition 1 can be obtained by projecting this forest measure $\nu_{q}(\cdot)$ on the set of partitions.

Call $\mathcal{B}_{q}$ the $\sigma$ -field generated by the block structure $\Pi_{q}$ of the random forest $F$ . By [5, Proposition 6.4], we have

[TABLE]

Now we notice that by Definition 3 and the tower property,

[TABLE]

We can now invoke [5, Theorem 3.4], stating that the set of roots is a determinantal process with kernel $K_{q}$ . As a consequence we obtain that

[TABLE]

and the claim readily follows. ∎

4.5. Proof of Proposition 1

We consider here the discrete time version of the process $X$ as presented in Theorem 1, see (3.6). As a warm-up, we start by computing the potential in the complete graph with unitary weights. In this case,

[TABLE]

where

[TABLE]

Therefore,

[TABLE]

From which:

[TABLE]

Thus, in order to have a non-degenerate potential on $\mathcal{K}_{N}$ , we need to take $q=\Theta(1)$ .∎

We next move to the mean-field-community model $\mathcal{K}_{2N}(w_{1},w_{2})$ with $w_{1}=1$ , $w_{2}=N^{-\beta},\beta>0$ and arbitrary $q$ . The corresponding discrete-time RW is killed at an independent geometric time $T_{q}\overset{d}{\sim}Geom(r_{q})$ with

[TABLE]

Denoting by $J_{t}$ the random variable that counts the number of times, up to time $t$ , in which this random walk changes community, we notice that:

[TABLE]

that is, conditioning on $T_{q}=t+1$ , $J_{t}$ has binomial distribution $Bin(t,c)$ with success parameter

[TABLE]

We are now in a position to compute the probability that $x$ is absorbed in some $y$ . Without loss of generality we assume $x\in[N]$ , so that $y\in[N]$ and $y\in[2N]\setminus[N]$ determine the $in-$ and $out-$ potential, respectively.

Thus:

[TABLE]

where the last identity is due to the fact that the sum in Eq. 4.95 is a probability and hence bounded above by $1$ .

(high killing) When $q=N^{\alpha}$ , with $\alpha>0$ , $r_{q}=\omega\left(N^{-1}\right)$ , thus the $O\left(N^{-1}\right)$ term in Eq. 4.96 is negligible, and $\overline{U}_{q}(in/out)\sim N^{2}r_{q}^{2}$ . In particular, the potential diverges as $N^{2}$ or $N^{2\alpha}$ depending on $\alpha\geq 1$ or $\alpha<1$ , respectively.

(order one killing) In the regime $q=O(1)$ , the $O(N^{-1})$ term in Eq. 4.96 is no longer negligible and needs to be analyzed further. Let us first consider the sub-regime $q=\Theta(1)$ .

Notice that, when $t=\Theta(1/r_{q})$ ,

[TABLE]

Clearly, $\mathbb{E}(\text{Bin}(t,c))=o(1)$ implies that $\mathbb{P}\left(\text{Bin}(t,c)\in 2\mathbb{N}_{0}\right)=1+o(1)$ , while if $\mathbb{E}(\text{Bin}(t,c))=\omega(1)$ then $\mathbb{P}\left(\text{Bin}(t,c)\in 2\mathbb{N}_{0}\right)=\frac{1}{2}+o(1).$ From which, if $\beta>1$ , then

[TABLE]

while, for $\beta<1$ :

[TABLE]

where in Eqs. 4.98 and 4.99 we used the fact that, in order to compute the first order, it is sufficient to restrict the sum over $t$ to the values on the scale $\Theta(1/r_{q})$ . By Eq. 4.95 and the above estimates, we conclude that, for $\beta>1$ :

[TABLE]

and $K_{q}(x,x)\sim\frac{q+1}{N}$ , which together with Definition 3 lead to:

[TABLE]

On the other hand, for $\beta<1$ , the estimate in Eq. 4.99 shows that, regardless of the community of $y$ , $K_{q}(x,y)\sim(\delta_{x,y}q+1/2)/N$ . Thus the $in-$ and $out-$ potentials are asymptotically equivalent. In particular, $\overline{U}_{q}(in/out)\sim 4q^{2}+4q$ .
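The parity facts used in the order-one-killing regime follow from the closed form $\mathbb{P}(\text{Bin}(t,c)\in 2\mathbb{N}_{0})=\frac{1}{2}\left(1+(1-2c)^{t}\right)$ , obtained by averaging the binomial expansions of $((1-c)+c)^{t}$ and $((1-c)-c)^{t}$ . The sketch below checks the identity exactly and evaluates the two limiting regimes; the function names and numerical values are our own illustrative choices.

```python
from fractions import Fraction
from math import comb


def even_probability_direct(t, c):
    """P(Bin(t, c) is even), summing the binomial mass function over even values."""
    return sum(comb(t, j) * c**j * (1 - c) ** (t - j) for j in range(0, t + 1, 2))


def even_probability_closed(t, c):
    """Closed form (1 + (1 - 2c)^t) / 2."""
    return (1 + (1 - 2 * c) ** t) / 2


c = Fraction(1, 7)
assert even_probability_direct(9, c) == even_probability_closed(9, c)

print(even_probability_closed(10**6, 1e-9))   # mean t*c = 1e-3: close to 1
print(even_probability_closed(10**6, 1e-3))   # mean t*c = 1e3: close to 1/2
```

When $tc=o(1)$ the factor $(1-2c)^{t}$ tends to one and the parity probability tends to one, while for $tc=\omega(1)$ it tends to zero and the probability tends to $\frac{1}{2}$ , in agreement with the dichotomy between $\beta>1$ and $\beta<1$ .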

(vanishing killing) It remains to analyze the case when $q=N^{\alpha}$ for some $\alpha<0$ . In this case, we have that

[TABLE]

We can then argue as in the case $q=\Theta(1)$ but distinguishing between $\beta$ being bigger or smaller than $1-\alpha$ . In particular, due to Eq. 4.102, when $\beta<1-\alpha$ the resulting $in-$ and $out-$ potentials are asymptotically equivalent and decay as $N^{\alpha}$ . On the other hand, for $\beta>1-\alpha>1$ , $r_{q}\sim N^{\alpha-1}$ , which together with Eq. 4.102 and Eq. 4.95 lead to the estimates: $K_{q}(x,x)\sim r_{q}+N^{-1}\sim N^{-1}$ , $K_{q}(x,y)\sim N^{-1}$ for $y\in[N]\setminus\{x\}$ and $K_{q}(x,y)=o(N^{-1})$ for pairs $(x,y)$ in different communities. By plugging these estimates in Lemma 1 the statement follows. ∎

Acknowledgments

L. Avena was supported by NWO Gravitation Grant 024.002.003-NETWORKS. M. Quattropani was partially supported by the INdAM-GNAMPA Project 2019 “Markov chains and games on networks”. Part of this work started during the preparation of the master thesis [17] and the authors are thankful to Diego Garlaschelli for acting as co-supervisor of this thesis project.

Bibliography

[1] D. Aldous, The Continuum Random Tree. I. Ann. Probab. 19, 1–28 (1991).
[2] L. Avena, F. Castell, A. Gaudillière and C. Mélot, Intertwining wavelets or multiresolution analysis on graphs through random forests, ACHA DOI:10.1016/j.acha.2018.09.006 (2018).
[3] L. Avena, F. Castell, A. Gaudillière and C. Mélot, Random Forests and Networks Analysis, J. Stat. Phys. 173, 985–1027 (2018).
[4] L. Avena, F. Castell, A. Gaudillière and C. Mélot, Approximate and exact solutions of intertwining equations through random spanning forests, arXiv:1702.05992 (2017).
[5] L. Avena and A. Gaudillière, Two applications of random spanning forests, J. Theor. Probab. 31, 1975–2004 (2018).
[6] L. Avena and A. Gaudillière, A proof of the transfer-current theorem in absence of reversibility, Stat. Probab. Lett. 142, 17–22 (2018).
[7] I. Benjamini and G. Kozma, Loop-erased random walk on a torus in dimensions 4 and above, Comm. Math. Phys. 259, 257–286 (2005).
[8] R. Burton and R. Pemantle, Local characteristics, entropy and limit theorems for spanning trees and domino tilings via transfer-impedances, Ann. Probab. 21, 1329–1371 (1993).
