Phase transitions for edge-reinforced random walks on the half-line
Jiro Akahori, Andrea Collevecchio, Masato Takei

TL;DR
This paper investigates phase transitions in edge-reinforced random walks on the half-line, highlighting how reinforcement strength and initial weights influence the process's behavior, extending previous foundational results.
Contribution
It provides a comprehensive description of phase transitions in heterogeneous edge-reinforced random walks, completing and extending prior theoretical work.
Findings
Identification of different behavioral regimes
Description of phase transition conditions
Extension of previous theoretical results
Abstract
We study the behaviour of a class of edge-reinforced random walks {on }, with heterogeneous initial weights, where each edge weight can be updated only when the edge is traversed from left to right. We provide a description for different behaviours of this process and describe phase transitions that arise as trade-offs between the strength of the reinforcement and that of the initial weights. Our result aims to complete the ones given by Davis~\cite{Davis89, Davis90}, Takeshima~\cite{Takeshima00, Takeshima01} and Vervoort~\cite{Vervoort00}.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and statistical mechanics · Theoretical and Computational Physics · Diffusion and Search Dynamics
Phase transitions for edge-reinforced random walks on the half-line
Jiro Akahori
Jiro Akahori
Department of Mathematical Sciences, Ritsumeikan University
,
Andrea Collevecchio
Andrea Collevecchio
School of Mathematical Sciences, Monash University, Melbourne
and
Masato Takei
Masato Takei
Department of Applied Mathematics, Faculty of Engineering, Yokohama National University
Abstract.
We study the behaviour of a class of edge-reinforced random walks on , with heterogeneous initial weights, where each edge weight can be updated only when the edge is traversed from left to right. We provide a description for different behaviours of this process and describe phase transitions that arise as trade-offs between the strength of the reinforcement and that of the initial weights. Our result aims to complete the ones given by Davis [3, 4], Takeshima [9, 10] and Vervoort [11].
Key words and phrases:
Self-interacting random walks, Reinforced random walks
1. Introduction
Reinforced random walks (RRW) have been extensively studied in the past 30 years. The canonical model is the one introduced by Coppersmith and Diaconis [2], called Linearly Edge-Reinforced Random Walk (LERRW) which can be described as follows. Consider a graph which is locally finite and to each edge assign initial weight one. These weights are updated depending on the behaviour of the process. LERRW takes values on the vertices of , at each step it jumps to vertices which are neighbors of the present one, say . The probability to pick a particular neighbor is proportional to the weight of the edge connecting that vertex to . Each time the process traverses an edge, its weight is increased by one. When is a tree, then LERRW is a random walk in an i.i.d. environment. In general, it can be represented as a mixture of Markov chains (see Merkl and Rolles [6]). The mixing measure is connected with models, which in turn are used to explain the phenomena of Anderson localization. For more information about this connection and details about models, see for example [7] and its bibliography.
Our goal is to study a large class of edge-reinforced walks on , inspired by the work of Davis [3, 4]. We allow heterogeneous initial weights on the edges, and a reinforcement that is different from linear. A theorem of Vervoort [11] establishes an interesting recurrence criterion for a large class of RRW with general initial weights. The reinforcement scheme of these processes is characterised by the fact that there is a chance that an edge increases its weight when traversed from left to right (see Theorem V below for a precise statement). Hence, we focus our attention on the case where the reinforcement can happen only when the process traverses an edge from right to left. Intuitively, this class of processes is the ‘most’ transient and has an interesting phase transition in terms of the initial weights and the reinforcement.
We provide a general phase diagram for edge-reinforced random walks which take values on the vertices of and heterogenous initial conditions. This includes a description of phase transitions that are trade-offs between the strength of the reinforcement and that of the initial weights. We use a martingale approach, a theorem of Austin (see [1]) and the so-called Rubin construction (see [4]) combined with Cramér-type bounds. Some of the methods used in the proofs are close in spirit to the ones proposed by Davis in [3, 4].
1.1. Edge-reinforced random walks on the half-line
We define the edge-reinforced random walk (ERRW), denoted by , as follows. This process takes values on the vertices of and at each step it jumps to one of the nearest neighbors. Denote by the non-oriented edge connecting and . In contrast, we use to denote the oriented edge connecting to . Define
[TABLE]
that is the number of traversals of the edge by time . For each , let be a non-decreasing sequence of positive numbers, called the reinforcement scheme at . For each , the weights at time are defined by
[TABLE]
and the transition probability is given by
[TABLE]
Here we set for all , which implies a reflection at the origin.
We say that the path is recurrent if every point is visited infinitely often, and transient if every point is visited only finitely many times. Finally, if the set of points that visits infinitely often is finite, then we say that localizes. Takeshima [9] proved that ERRW on can be either recurrent, transient or it localizes. Notice there are cases where
[TABLE]
In fact, if we set for all , and for all and , we have the following. The process does not visit , i.e. only visits the vertices [math] and , with probability
[TABLE]
Moreover, with positive probability the process drifts away to infinity. In fact, it behaves like a transient Markov chain on the sites with .
For and , define
[TABLE]
It is well known that if for all and , i.e. if there is no reinforcement at all, then is recurrent a.s. if , and is transient a.s. otherwise. In fact, can be associated to the effective resistance of the network, which characterizes the behaviour of the relative Markov chain (see [5]). We say the ERRW is initially recurrent (resp. initially transient) if (resp. ).
Davis [3] proved that initially recurrent reinforced random walks are not necessarily recurrent.
Theorem D** (Davis [3]).**
Consider the ERRW on .
- (i)
If , then is either recurrent or it localizes on a single edge.
- (ii)
There exists a reinforcement scheme such that and and is transient with positive probability.
The known phases can be summarised in the following table, where we combined Theorem D with other results by Davis [4], Sellke [8] and Takeshima [9, 10].
TABLE I
[TABLE]
The question marks in Table I indicate what is left open in general. In this paper we partially fill these gaps for a general class of reinforcement, which we call Factor Type Reinforcement (FTR), and which is of the form
[TABLE]
where is a positive non-decreasing sequence with . Furthermore, Vervoort proved the following result (Theorem 8.2.2 in [11]). For the sake of completeness we include the proof in the Appendix.
Theorem V** (Vervoort [11]).**
Suppose that , and suppose that has FTR. Then, the process is recurrent a.s. if either i) is bounded or ii) for all , and for some .
By virtue of Theorem V, we can focus on the case where is unbounded and an edge can be reinforced only when the process traverses from right to left, i.e. when for all .
2. Main results
Definition 1**.**
The sequence is called down-only type (DT) if , if it is non-decreasing, and .
Our main result is the following.
Theorem 2**.**
Let be a reinforced random walk with FTR, and suppose that is DT. Let and , for all , with and .
If , then is recurrent a.s..
- 2)
If and , then is transient a.s..
- 3)
If , then is transient a.s..
- 4)
If , then localizes on a single edge a.s..
As highlighted in the next example, if we perturb a single reinforcement even slightly, we can witness a transition from recurrence to transience.
Example 3**.**
Let , for all . Let , for all . For , define the family of functions
[TABLE]
and for all . Define the family of reinforced random walks with FTR where . Each of these processes, in virtue of Theorem V, is recurrent. On the other hand, what is somewhat surprising is that the process with FTR is transient, in virtue of Theorem 2, part 2).
Remark 4**.**
We emphasize the fact that outside the intervals and the behaviour of the process is known from previous results (see Table I and Figure 1). Moreover, the proofs of Theorem 2 parts 3) and 4) cover more general cases, as stated in Propositions 7 and 8. In principle parts 1) and 2) can also be adapted to more general reinforcements, e.g. with a slowly varying factor. To be more precise, our proofs rely on some integral estimations of series. In this context, reinforcements which are power functions are easy to deal with and give explicit estimates. On the other hand, the method itself covers more general cases.
3. Proof of Theorem 2
Let and
[TABLE]
The process is in general a non-negative supermartingale and will play a major role in our proofs. In fact, in virtue of our assumption that is DT, we have that is indeed a martingale (see Lemma 3.0 in [4] for details).
We use the following 0-1 law (see Sellke [8] or Takeshima [10] for a proof).
Theorem S** (Sellke’s 0-1 law).**
Consider the ERRW on . If for all , then is either recurrent a.s. or transient a.s..
3.1. Proof of Theorem 2 part 1)
Since Theorem V part i) covers the case (that is ), hereafter we assume that . For , let
[TABLE]
Define the event E:=\big{\{}\mbox{\tau=+\infty\displaystyle\lim_{n\to\infty}X_{n}=+\infty}\big{\}}, which implies transience. We reason by contradiction and suppose that . On , we have that for any , there exists a such that for all . This implies that for all , we have
[TABLE]
By taking limits, we have that on
[TABLE]
On the other hand, is a non-negative martingale. Combining Doob’s convergence theorem (see [12]) with (3.1), we have that
[TABLE]
At the same time, as is a non-negative martingale, we can apply Austin’s theorem (see [1]), which says
[TABLE]
Since
[TABLE]
for and , we have
[TABLE]
which implies
[TABLE]
Let
[TABLE]
so that
[TABLE]
By Hölder’s inequality,
[TABLE]
On the other hand, if , then we have and
[TABLE]
This gives a contradiction, and proves that . The result follows by Sellke’s 0-1 law (see Theorem S).
3.2. Proof of Theorem 2 part 2)
In this Section we prove that is transient, a.s., under the assumptions of part 2). We reason by contradiction. Suppose that is recurrent a.s. (see Theorem S). We prove that is bounded in , which implies that is uniformly integrable and , which in turn implies transience. In fact, for , there exists a constant such that
[TABLE]
In the last step, we used Jensen’s inequality, as the map is concave, for . Similarly, for , we have
[TABLE]
In virtue of (3.2), in order to prove Theorem 2 part 2), it is enough to prove that there exists such that
[TABLE]
for all . In order to see why (3.4) is sufficient for our purposes, simply notice that is equivalent to the condition of the Theorem. The remaining part of this Section is devoted to prove (3.4). In particular, Lemma 6 below is the key result for our goal.
Definition 5**.**
Fix and set . Consider a generalised Pólya urn, which initially contains one white and one black ball. The reinforcement scheme for white balls is , for . The reinforcement scheme for black balls is , for . In other words, if the composition of the urn at stage is white balls and black balls, then the probability to pick a white ball at the next stage is . At each stage a ball is picked, and returned to the urn together with another ball of the same colour. Denote by the measure describing this model, and by the expected value with respect to . Denote by , with , the composition of the urn by time , with . Denote by the sequence under the measure .
Lemma 6**.**
Assume that and . Let for . Define There exists a constant such that for any , we have
[TABLE]
Proof.
Consider Rubin’s embedding (see the Appendix of [4]), which is shortly described as follows. Let and be two independent sequences of independent exponentials with parameter one. Set, for each ,
[TABLE]
The variables can be used to generate a Pólya urn process with the features of Definition 5. In this context, for if and only if by the time the urn contains white balls, it contains at least black ones. Let
[TABLE]
where denote the smallest integer larger or equal to . Fix a sequence , which will be specified later. For we have
[TABLE]
Call the product of the first two terms and the third term . We have
[TABLE]
The first inequality follows from an elementary bound
[TABLE]
The third inequality uses an integral comparison. We can obtain the fourth and fifth inequalities by noting that and , respectively. On the other hand,
[TABLE]
By choosing , we can see that is bounded by a positive constant . Thus, we have
[TABLE]
Suppose that is large enough to imply . Letting and noting that ,
[TABLE]
Hence we have
[TABLE]
In the last inequality, we used the fact that for , we have
[TABLE]
The previous inequality can be proved via the mean value theorem applied to the function , defined for . Finally we have
[TABLE]
for all large . By choosing a large , we obtain
[TABLE]
for all .
We can use a collection of independent generalized Pólya urns (), where has distribution , to generate a reinforced random walk . In this context, the jumps from vertex are modelled using the urn . Each time the process is at , we pick a ball from the urn and observe its color. If it is black the walk moves to , and moves to otherwise. Recall that is the total number of jumps from to before time . As we assume that is recurrent a.s., the variable is -measurable. Therefore is independent of . Using Lemma 6 with the urn , with , we have
[TABLE]
As , we have that . By taking the expected value of both sides in (3.7) and using Jensen’s inequality, we have that
[TABLE]
Proof of Theorem 2 part 2).
Consider a sequence satisfying
[TABLE]
Notice that the sequence satisfies (3.9), and is finite in virtue of the definition of . Hence (3.4) is proved once we prove that
[TABLE]
for some positive constant , and all . We prove this by induction. Of course it is true for , as we can choose simply large enough. Suppose it is true for . Using (3.9), we have that
[TABLE]
Hence
[TABLE]
Set . Notice that as , the larger is, the smaller becomes, approaching zero in the limit. Using , the right-hand side of (3.12) can be bounded as follows:
[TABLE]
We can choose smaller than 1 (i.e. large enough). Hence
[TABLE]
3.3. Proof of Theorem 2 part 3)
We prove a more general result, and the proof is closely related to the one given by Davis [3].
Proposition 7**.**
Suppose that has FTR and is DT. Assume that and . If and , then is transient a.s..
Proof.
For each ,
[TABLE]
Thus we have
[TABLE]
with probability one. By the orthogonality of martingale increments, we have
[TABLE]
for any , which shows that is an -bounded martingale. We have
[TABLE]
which implies
[TABLE]
This together with Sellke’s 0-1 law (see Theorem S) shows that is transient.
3.4. Proof of Theorem 2 part 4)
We provide a proof for a more general result, which includes initially transient cases.
Proposition 8**.**
Suppose that has FTR, and there exists a constant such that
[TABLE]
If , then localizes on a single edge a.s..
Proof.
For each , we define
[TABLE]
that is the event that the process never jumps from to . The -th time the process visits vertex , the conditional probability that it jumps to is
[TABLE]
Then, we have
[TABLE]
This shows that . The second Borel-Cantelli lemma implies that \mathbb{P}(\mbox{E_{x}x's})=1 and \mathbb{P}(\mbox{\mathbf{X} is of finite range})=1. In fact we have \mathbb{P}(\mbox{\mathbf{X} is localized to a single edge})=1 by an application of Rubin’s theorem (see Corollary 3.6 in [9]).
4. Appendix
Proof of Theorem V.
Let E:=\left\{\mbox{\tau=+\infty\displaystyle\lim_{n\to\infty}X_{n}=+\infty}\right\}. By Theorem S, implies that is recurrent a.s.. Recall the definition of from (1.1). On the event ,
[TABLE]
Suppose that for all . Then we have
[TABLE]
By Doob’s convergence theorem, cannot be positive.
Next we assume ii). Define
[TABLE]
The process is a nonnegative martingale. (see Lemma 3.0 in [4] for details). We rewrite
[TABLE]
Fix and suppose that for some , we have
[TABLE]
Assume that , and let . Then we have
[TABLE]
and
[TABLE]
This shows that cannot be positive.
Acknowledgement**.**
A.C. is grateful to Yokohama National University for its hospitality, and he was supported by ARC grant DP180100613 and Australian Research Council Centre of Excellence for Mathematical and Statistical Frontiers (ACEMS) CE140100049. M.T. is partially supported by JSPS Grant-in-Aid for Young Scientists (B) No. 16K21039. The authors thank Ben Amiet for reading the manuscript and helping to produce Figure 1. They also thank two anonymous referees for detailed comments. Finally they thank Amanoya for offering a very nice environment, where part of this research was carried.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Austin, D. G. (1966). A sample function property of martingales. Ann. Math. Statist. 37 , 1396–1397.
- 2[2] Coppersmith, D. and Diaconis, P. (1986). Random walks with reinforcement. Unpublished manuscript.
- 3[3] Davis, B. (1989). Loss of recurrence in reinforced random walk. Almost Everywhere Convergence 179–188, Academic Press.
- 4[4] Davis, B. (1990). Reinforced random walk. Probab. Theory Relat. Fields 84 , 203–229.
- 5[5] Lyons, R. and Peres, Y. (2016). Probability on trees and networks. Cambridge University Press, New York . Pages xvi+699. Available at http://pages.iu.edu/~rdlyons/.
- 6[6] Merkl, F. and Rolles, S. W. W. (2007). A random environment for linearly edge-reinforced random walks on infinite graphs, Probab. Theory Relat. Fields , 138 , 157–176.
- 7[7] Sabot, C. and Tarrès, P. (2015). Edge-reinforced random walk, vertex-reinforced jump process and the supersymmetric hyperbolic sigma model. J. Eur. Math. Soc. 17 , 2353–2378.
- 8[8] Sellke, T. (1994). Reinforced random walks on the d 𝑑 d -dimensional integer lattice. Technical Report, Department of Statistics, Purdue University #94-26 . / (2008). Markov Processes Relat. Fields 14 , 291–308.
