The hierarchical Cannings process in random environment

Andreas Greven; Frank den Hollander; Anton Klimovsky

arXiv:1703.03061·math.PR·March 10, 2017

The hierarchical Cannings process in random environment

Andreas Greven, Frank den Hollander, Anton Klimovsky

PDF

TL;DR

This paper extends the hierarchical Cannings process to a random environment, analyzing conditions for coexistence versus clustering, and explores the impact of randomness on the process's scaling behavior and cluster formation.

Contribution

It introduces a quenched random environment version of the hierarchical Cannings process and provides a full scaling analysis with universality classes for clustering behavior.

Findings

01

Necessary and sufficient condition for coexistence versus clustering.

02

In the random environment, the diffusion coefficient is smaller, slowing cluster growth.

03

Five universality classes of cluster formation are identified.

Abstract

In an earlier paper, we introduced and studied a system of hierarchically interacting measure-valued random processes which describes a large population of individuals carrying types and living in colonies labelled by the hierarchical group of order $N$ . The individuals are subject to migration, resampling on all hierarchical scales simultaneously. Upon resampling, a random positive fraction of the population in a block of colonies inherits the type of a random single individual in that block, which is why we refer to our system as the hierarchical Cannings process. In the present paper, we study a version of the hierarchical Cannings process in random environment, namely, the resampling measures controlling the change of type of individuals in different blocks are chosen randomly with a given mean and are kept fixed in time (= the quenched setting). We give a necessary and sufficient…

Equations487

\Omega_{N}=\Big{\{}\eta=(\eta^{l})_{l\in\mathbb{N}_{0}}\in\{0,1,\ldots,N-1\}^{\mathbb{N}_{0}}\colon\,\sum_{l\in\mathbb{N}_{0}}\eta^{l}<\infty\Big{\}},\qquad N\in\mathbb{N}\backslash\{1\},

\Omega_{N}=\Big{\{}\eta=(\eta^{l})_{l\in\mathbb{N}_{0}}\in\{0,1,\ldots,N-1\}^{\mathbb{N}_{0}}\colon\,\sum_{l\in\mathbb{N}_{0}}\eta^{l}<\infty\Big{\}},\qquad N\in\mathbb{N}\backslash\{1\},

d_{Ω_{N}} (η, ζ) = d_{Ω_{N}} (0, η - ζ) = min {k \in N_{0} : η^{l} = ζ^{l} \forall l \geq k}, η, ζ \in Ω_{N} .

d_{Ω_{N}} (η, ζ) = d_{Ω_{N}} (0, η - ζ) = min {k \in N_{0} : η^{l} = ζ^{l} \forall l \geq k}, η, ζ \in Ω_{N} .

B_{k} (η) = {ζ \in Ω_{N} : d_{Ω_{N}} (η, ζ) \leq k}, η \in Ω_{N}, k \in N_{0}

B_{k} (η) = {ζ \in Ω_{N} : d_{Ω_{N}} (η, ζ) \leq k}, η \in Ω_{N}, k \in N_{0}

\underline{c} = (c_{k})_{k \in N_{0}} \in (0, \infty)^{N_{0}},

\underline{c} = (c_{k})_{k \in N_{0}} \in (0, \infty)^{N_{0}},

a^{(N)} (η, ζ) = k \geq d_{Ω_{N}} (η, ζ) \sum \frac{c _{k - 1}}{N ^{2 k - 1}}, η, ζ \in Ω_{N}, η \neq = ζ, a^{(N)} (η, η) = 0.

a^{(N)} (η, ζ) = k \geq d_{Ω_{N}} (η, ζ) \sum \frac{c _{k - 1}}{N ^{2 k - 1}}, η, ζ \in Ω_{N}, η \neq = ζ, a^{(N)} (η, η) = 0.

\gamma(N)\left\{\begin{array}[]{ll}<0,&\quad c<1\text{ (strongly recurrent)},\\ =0,&\quad c=1\text{ (critically recurrent)},\\ >0,&\quad c>1\text{ (transient)}.\end{array}\right.

\gamma(N)\left\{\begin{array}[]{ll}<0,&\quad c<1\text{ (strongly recurrent)},\\ =0,&\quad c=1\text{ (critically recurrent)},\\ >0,&\quad c>1\text{ (transient)}.\end{array}\right.

k \to \infty lim sup \frac{1}{k} lo g c_{k} < lo g N .

k \to \infty lim sup \frac{1}{k} lo g c_{k} < lo g N .

\underline{\Lambda}=\big{(}\Lambda_{k})_{k\in\mathbb{N}_{0}}\in\mathcal{M}_{f}([0,1])^{\mathbb{N}_{0}},

\underline{\Lambda}=\big{(}\Lambda_{k})_{k\in\mathbb{N}_{0}}\in\mathcal{M}_{f}([0,1])^{\mathbb{N}_{0}},

Λ_{0} ({0}) = 0, \int_{(0, 1]} \frac{Λ _{0} ( d r )}{r} = \infty,

Λ_{0} ({0}) = 0, \int_{(0, 1]} \frac{Λ _{0} ( d r )}{r} = \infty,

Λ_{k} ({0}) = 0, \int_{(0, 1]} \frac{Λ _{k} ( d r )}{r ^{2}} < \infty. k \in N,

Λ_{k} ({0}) = 0, \int_{(0, 1]} \frac{Λ _{k} ( d r )}{r ^{2}} < \infty. k \in N,

λ_{k} = Λ_{k} ((0, 1]), λ_{k}^{*} = Λ_{k}^{*} ((0, 1]), k \in N_{0},

λ_{k} = Λ_{k} ((0, 1]), λ_{k}^{*} = Λ_{k}^{*} ((0, 1]), k \in N_{0},

\underline{λ} = (λ_{k})_{k \in N_{0}} \in (0, \infty)^{N_{0}} .

\underline{λ} = (λ_{k})_{k \in N_{0}} \in (0, \infty)^{N_{0}} .

k \to \infty lim sup \frac{1}{k} lo g λ_{k}^{*} < lo g N .

k \to \infty lim sup \frac{1}{k} lo g λ_{k}^{*} < lo g N .

\displaystyle F(x)=\int_{E^{n}}\left(\bigotimes_{m=1}^{n}x_{\eta_{m}}\big{(}\mathrm{d}u^{m}\big{)}\right)f\big{(}u^{1},\ldots,u^{n}\big{)},\quad x=(x_{\eta})_{\eta\in\Omega_{N}}\in\mathcal{P}(E)^{\Omega_{N}},

\displaystyle F(x)=\int_{E^{n}}\left(\bigotimes_{m=1}^{n}x_{\eta_{m}}\big{(}\mathrm{d}u^{m}\big{)}\right)f\big{(}u^{1},\ldots,u^{n}\big{)},\quad x=(x_{\eta})_{\eta\in\Omega_{N}}\in\mathcal{P}(E)^{\Omega_{N}},

n \in N, f \in C_{b} (E^{n}, R), η_{1}, \dots, η_{n} \in Ω_{N} .

L^{(\Omega_{N})}\colon\,\mathcal{F}\to C_{\mathrm{b}}\big{(}\mathcal{P}(E)^{\Omega_{N}},\mathbb{R}\big{)}

L^{(\Omega_{N})}\colon\,\mathcal{F}\to C_{\mathrm{b}}\big{(}\mathcal{P}(E)^{\Omega_{N}},\mathbb{R}\big{)}

L^{(Ω_{N})} = L_{mig}^{(Ω_{N})} + L_{res}^{(Ω_{N})} .

L^{(Ω_{N})} = L_{mig}^{(Ω_{N})} + L_{res}^{(Ω_{N})} .

(L_{mig}^{(Ω_{N})} F) (x) = η, ζ \in Ω_{N} \sum a^{(N)} (η, ζ) \int_{E} (x_{ζ} - x_{η}) (d a) \frac{\partial F ( x )}{\partial x _{η}} [δ_{a}]

(L_{mig}^{(Ω_{N})} F) (x) = η, ζ \in Ω_{N} \sum a^{(N)} (η, ζ) \int_{E} (x_{ζ} - x_{η}) (d a) \frac{\partial F ( x )}{\partial x _{η}} [δ_{a}]

(L_{res}^{(Ω_{N})} F) (x)

(L_{res}^{(Ω_{N})} F) (x)

+ η \in Ω_{N} \sum (L_{η}^{d_{0}} F) (x),

y_{η, k} = N^{- k} ζ \in B_{k} (η) \sum x_{ζ}

y_{η, k} = N^{- k} ζ \in B_{k} (η) \sum x_{ζ}

\Big{[}\big{(}\Phi_{r,a,B_{k}(\eta)}\big{)}(x)\Big{]}_{\zeta}=\begin{cases}(1-r)y_{\eta,k}+r\delta_{a},&\zeta\in B_{k}(\eta),\\ x_{\zeta},&\zeta\in\Omega_{N}\backslash B_{k}(\eta),\end{cases}\quad r\in[0,1],\,a\in E,\,k\in\mathbb{N}_{0},\,\eta\in\Omega_{N},

\Big{[}\big{(}\Phi_{r,a,B_{k}(\eta)}\big{)}(x)\Big{]}_{\zeta}=\begin{cases}(1-r)y_{\eta,k}+r\delta_{a},&\zeta\in B_{k}(\eta),\\ x_{\zeta},&\zeta\in\Omega_{N}\backslash B_{k}(\eta),\end{cases}\quad r\in[0,1],\,a\in E,\,k\in\mathbb{N}_{0},\,\eta\in\Omega_{N},

(L_{η}^{d_{0}} F) (x) = d_{0} \int_{E} \int_{E} Q_{x_{η}} (d u, d v) \frac{\partial ^{2} F ( x )}{\partial x _{η}^{2}} [δ_{u}, δ_{v}]

(L_{η}^{d_{0}} F) (x) = d_{0} \int_{E} \int_{E} Q_{x_{η}} (d u, d v) \frac{\partial ^{2} F ( x )}{\partial x _{η}^{2}} [δ_{u}, δ_{v}]

Q_{y} (d u, d v) = y (d u) δ_{u} (d v) - y (d u) y (d v), y \in P (E),

Q_{y} (d u, d v) = y (d u) δ_{u} (d v) - y (d u) y (d v), y \in P (E),

\frac{\partial ^{2} F ( x )}{\partial x _{η}^{2}} [δ_{u}, δ_{v}] = \frac{\partial}{\partial x _{η}} (\frac{\partial F ( x )}{\partial x _{η}} [δ_{u}]) [δ_{v}], u, v \in E .

\frac{\partial ^{2} F ( x )}{\partial x _{η}^{2}} [δ_{u}, δ_{v}] = \frac{\partial}{\partial x _{η}} (\frac{\partial F ( x )}{\partial x _{η}} [δ_{u}]) [δ_{v}], u, v \in E .

\int_{E} y_{η, k} (d a) [F (Φ_{r, a, B_{k} (η)} (x)) - F (x)] = F (y_{η, k}) - F (x) + O (r^{2}), r ↓ 0.

\int_{E} y_{η, k} (d a) [F (Φ_{r, a, B_{k} (η)} (x)) - F (x)] = F (y_{η, k}) - F (x) + O (r^{2}), r ↓ 0.

X^{(Ω_{N})} = (X^{(Ω_{N})} (t))_{t \geq 0},

X^{(Ω_{N})} = (X^{(Ω_{N})} (t))_{t \geq 0},

Ω_{N}^{T} = k \in N_{0} ⋃ Ω_{N}^{(k)} with Ω_{N}^{(k)} = Ω_{N} / B_{k} (0),

Ω_{N}^{T} = k \in N_{0} ⋃ Ω_{N}^{(k)} with Ω_{N}^{(k)} = Ω_{N} / B_{k} (0),

∣ ξ ∣ = the height of ξ (counting from the leaves),

∣ ξ ∣ = the height of ξ (counting from the leaves),

B_{∣ ξ ∣} (ξ)

B_{∣ ξ ∣} (ξ)

MC_{k} (η)

MC_{k} (η)

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

The hierarchical Cannings process in random environment

A. Greven1, F. den Hollander2, A. Klimovsky3

(March 18, 2024)

Abstract

In an earlier paper, we introduced and studied a system of hierarchically interacting measure-valued random processes that arises as the continuum limit of a large population of individuals carrying different types. Individuals live in colonies labelled by the hierarchical group of order $N$ , and are subject to migration and resampling on all hierarchical scales simultaneously. The resampling mechanism is such that a random positive fraction of the population in a block of colonies inherits the type of a random single individual in that block, which is why we refer to our system as the hierarchical Cannings process. Before resampling in a block takes place, all individuals in that block are relocated uniformly, which we call reshuffling.

In the present paper, we study a version of the hierarchical Cannings process in random environment, namely, the resampling measures controlling the change of type of individuals in different blocks are chosen randomly with a given mean and are kept fixed in time, i.e., we work in the quenched setting. We give a necessary and sufficient condition under which a multi-type equilibrium is approached (= coexistence) as opposed to a mono-type equilibrium (= clustering). Moreover, in the hierarchical mean-field limit $N\to\infty$ , with the help of a renormalization analysis we obtain a full picture of the space-time scaling behaviour of block averages on all hierarchical scales simultaneously. We show that the $k$ -block averages are distributed as the superposition of a Fleming-Viot diffusion with a deterministic volatility constant $d_{k}$ and a Cannings process with a random jump rate, both depending on $k$ . In the random environment $d_{k}$ turns out to be smaller than in the homogeneous environment of the same mean. We investigate how $d_{k}$ scales with $k$ . This leads to five universality classes of cluster formation in the mono-type regime. We find that if clustering occurs, then the random environment slows down the growth of the clusters, i.e., enhances the diversity of types. In some universality classes the growth of the clusters depends on the realisation of the random environment.

Keywords: Hierarchical Cannings process, random environment, migration, block reshuffling, block resampling, block coalescence, hierarchical mean field limit, random Möbius transformations.

MSC 2010: Primary 60J25, 60K35; Secondary 60G57, 60J60, 60J75, 82C28, 92D25.

Acknowledgements: AG was supported by the Deutsche Forschungsgemeinschaft (grant DFG-GR 876/15-2), FdH was supported by the European Research Council (Advanced Grant VARIS-267356) and by the Netherlands Organization for Scientific Research (Gravitation Grant NETWORKS-024.002.003), AK was supported by the Netherlands Organization for Scientific Research (grant 613.000.913). The authors are grateful to Evgeny Verbitskiy for help with the renormalization analysis.

1) Department Mathematik, Universität Erlangen-Nürnberg, Cauerstrasse 11, D-91058 Erlangen, Germany

[email protected]

2) Mathematisch Instituut, Universiteit Leiden, P.O. Box 9512, NL-2300RA Leiden, The Netherlands

[email protected]

3) Fakultät für Mathematik, Universität Duisburg-Essen, Thea-Leymann-Strasse 9, D-45127 Essen, Germany

[email protected]

1 Introduction
1.1 Motivation and goal
1.2 Summary of the main results
1.3 Outline
2 The model
2.1 The hierarchical Cannings process
2.1.1 The hierarchical group of order $N$
2.1.2 Block migration
2.1.3 Block reshuffling-resampling
2.1.4 The generator and the martingale problem
2.2 The hierarchical Cannings process in random environment
2.2.1 The random environment on the full tree
2.2.2 The generator in random environment
3 Main theorems
3.1 Results for fixed $N$
3.1.1 Well-posedness of the martingale problem
3.1.2 Dichotomy: coexistence versus clustering
3.2 Results for $N\to\infty$
3.2.1 McKean-Vlasov process
3.2.2 Random environment for $N=\infty$
3.2.3 Renormalization via block averages
3.2.4 Dichotomy for the interaction chain
3.2.5 Scaling of the volatility
3.2.6 Cluster formation
3.3 Summary of the effects of the random environment
4 Existence, uniqueness, duality and equilibrium
4.1 The spatial coalescent in random environment
4.2 Dualities
4.3 Well-posedness of the martingale problems and equilibria
4.4 Consequences for the Cannings process
5 Dichotomy: coexistence versus clustering
5.1 Mean hazard
5.2 Zero-one law
6 Multi-scale analysis
6.1 The mean-field finite-system scheme
6.2 The hierarchical mean-field limit
6.2.1 The $2$ -level system on 3 time scales
6.2.2 The $k$ -level system on $k+1$ time scales
6.2.3 The infinite-level system on infinitely many time scales
6.3 Dichotomy in the hierarchical mean-field limit
7 The orbit of the renormalization transformations
7.1 Random environment lowers the volatility
7.2 Scaling of the volatility: polynomial coefficients
7.3 Scaling of the volatility: exponential coefficients
8 Identification of the universality classes of cluster formation
8.1 Random cluster size
8.2 Random cluster order

1 Introduction

1.1 Motivation and goal

Two models play a central role in the world of stochastic multi-type population dynamics:

(1)

The Moran model and its limit for large populations, the Fleming-Viot measure-valued diffusion.

(2)

The Cannings model and its limit for large populations, the Cannings measure-valued jump process (also called the generalized Fleming-Viot process).

The Cannings model accounts for situations in which resampling is such that a random positive fraction of the population in the next generation inherits the type of a random single individual in the current generation, even in the infinite population limit (see Cannings [Can74], [Can75]). In order to describe a setting where this effect has a geographical structure, i.e., where migration of individuals is allowed as well, different models have been proposed in Limic and Sturm [LS06], Blath, Etheridge and Meredith [BEM07], Barton, Etheridge and Véber [BEV10], Berestycki, Etheridge and Véber [BEV13], and Greven, den Hollander, Kliem and Klimovsky [GdHKK14]. The behaviour of these models has been studied in detail and its dependence on the geographic space is fairly well understood.

The type space is typically chosen to be a compact Polish space $E$ . In [GdHKK14], we focused on the case where the geographic space is the hierarchical group $\Omega_{N}$ of order $N$ , since this allowed us to carry out a full renormalization analysis. In the hierarchical mean-field limit $N\to\infty$ , the migration can be chosen in such a way that it approximates migration on the geographic space $\mathbb{Z}^{2}$ , a possibility that was exploited by Sawyer and Felsenstein [SF83] (see also Dawson, Gorostiza and Wakolbinger [DGW04]).

We analyze the model introduced in [GdHKK14], but add the effect that the Cannings resampling mechanism is controlled by catastrophic events on a small time scale, for which it is appropriate to assume that the rate of occurrence has a spatially inhomogeneous structure. This leads us to consider spatial Cannings models with block resampling in random environment, i.e., both the form and the overall rate of the block resampling mechanism depend on the geographic location.

Remark 1.1.

In a catastrophic event, a part of the population is killed in a large spatial area and is subsequently replenished via a rapid recolonization, resulting in a bottleneck effect consisting of compression and subsequent expansion of the descendants of a single ancestor. The mechanisms behind such events are functions of the background environment, which is inhomogeneous in space but constant in time. It would be interesting to derive our continuum model (defined in Section 2.2) from an individual-based model with two time scales: the catastrophic events happen on a fast time scale, while the migration and resampling happen on a slow time scale. Moreover, in our individual-based model we do reshuffling before resampling, which must be motivated likewise. Carrying out the details of such a derivation would merit a paper in its own right.* $\square$ *

The goal of the present paper is three-fold:

(1)

Construction of the hierarchical Cannings process in random environment via a well-posed martingale problem and derivation of a duality relation with a hierarchical spatial coalescent in random environment.

(2)

Analysis of the longtime behaviour, in particular, the dichotomy between a multi-type equilibrium and a mono-type equilibrium.

(3)

Scaling analysis of a collection of renormalized processes obtained by looking at the evolution of blocks averages on successive space-time scales in the hierarchical mean-field limit and the consequences for universality classes of the mono-type cluster formation.

We are particularly interested in new effects caused by the random environment.

The mathematical tools we will exploit are the duality of the hierarchical Canning process in random environment with a hierarchical spatial coalescent in random environment, and the scaling of the block averages towards a mean-field process in random environment called the McKean-Vlasov process. This in turn will lead us to study two independent hierarchical random walks in the same random environment, and to analyze the orbit of iterations of non-linear transformations arising from random Möbius transformations that link the behaviour on successive hierarchical scales.

1.2 Summary of the main results

In an earlier paper, we introduced and studied a system of hierarchically interacting measure-valued random processes that arises as the continuum limit of a large population of individuals subject to migration, reshuffling and resampling [GdHKK14]. More precisely, individuals live in colonies labelled by $\Omega_{N}$ , the hierarchical group of order $N$ , and are subject to migration based on a sequence of migration coefficients $\underline{c}=(c_{k})_{k\in\mathbb{N}_{0}}$ and to resampling based on a sequence of resampling measures $\underline{\Lambda}=(\Lambda_{k})_{k\in\mathbb{N}_{0}}$ , both acting on blocks of colonies (= macro-colonies) on all hierarchical scales $k\in\mathbb{N}_{0}$ simultaneously. The resampling mechanism is such that a random positive fraction of the population in a block of colonies inherits the type of a random single individual in that block, even in the infinite population limit, which is why we refer to our system as the hierarchical Cannings process. Before resampling in a block takes place, all individuals in that block are relocated uniformly. This relocation is called reshuffling and means that resampling is done in a locally “panmictic” manner.

In the present paper, we study a version of the hierarchical Cannings process in random environment, namely, the resampling measures in different blocks are chosen randomly with mean $\underline{\Lambda}$ and are kept fixed in time, i.e., we consider the quenched version of the system. We construct the hierarchical Cannings process in random environment via a well-posed martingale problem, and establish duality with a system of coalescing hierarchical random walks with block coalescence in random environment. We study the long-time behaviour of the process, in particular, we give a necessary and sufficient condition on $\underline{c}$ and $\underline{\Lambda}$ under which almost sure convergence to a multi-type equilibrium occurs (= coexistence), as opposed to a mono-type equilibrium (= clustering). The equilibrium depends on the environment, but the condition on $\underline{c}$ and $\underline{\Lambda}$ for its occurrence does not.

To obtain more detailed information on the evolution of the system, we consider the hierarchical mean-field limit $N\to\infty$ . In this limit, with the help of a renormalization analysis, we obtain a full picture of the space-time scaling behaviour on all hierarchical scales simultaneously. Our main result is that, on each hierarchical scale $k\in\mathbb{N}_{0}$ , the $k$ -block averages on time scale $N^{k}$ converge to a random process that is a superposition of a Cannings process with a resampling measure equal to the associated $k$ -block resampling measure (which depends on the environment) and an additional Fleming-Viot process with volatility $d_{k}$ , reflecting the macroscopic impact of the lower-order resampling and of the drift of strength $c_{k}$ towards the limiting $(k+1)$ -block average (which is constant on the limiting time scale). It turns out that $d_{k}$ is a function of $c_{l}$ and $\Lambda_{l}$ for all $0\leq l<k$ , and of the law of the random environment. Thus, it is through the volatility that the renormalization manifests itself.

We show that the random environment makes the system less volatile, i.e., $d_{k}$ is strictly smaller than its corresponding value for the homogenous system where the resampling measures are replaced by their mean. We investigate how $d_{k}$ scales as $k\to\infty$ , which leads to various different cases depending on the choice of $\underline{c}$ and $\underline{\Lambda}$ . We find that if migration and resampling occur with comparable rates on all hierarchical scales, then the lower volatility persists in the limit as $k\to\infty$ . The renormalization transformation connecting $d_{k+1}$ to $d_{k}$ turns out to be a non-linear transformation arising from a random Möbius transformation. The scaling behaviour of the iterates of these transformations is studied in detail. We find that if clustering occurs, then the random environment slows down the growth of the clusters, i.e., enhances the diversity of types. We find five universality classes of cluster formation in the regime of clustering. These are linked to the different cases of scaling behaviour of $d_{k}$ . We find that if the growth of the clusters is rapid, then the rate of growth depends on the realisation of the environment, while if the growth is slow, then the effect of the environment averages out. The latter happens e.g. in the critical regime where the system is barely clustering.

1.3 Outline

Sections 2–5 deal with the model for finite $N$ , while Sections 6–8 deal with the hierarchical mean-field limit $N\to\infty$ . In Section 2 we define the hierarchical Cannings process and its dual. In Section 3 we state our main theorems and summarize the effects of the random environment. Section 4 contains the proof of existence and uniqueness of the hierarchical Cannings process and its dual, and establishes convergence to an equilibrium. Section 5 proves the dichotomy between coexistence (multi-type equilibrium) versus clustering (mono-type equilibrium), and provides the parameter range for both. Section 6 contains the multi-scale analysis for the evolution of block averages on successive space-time scales in the hierarchical mean-field limit, proves the dichotomy in that limit, and identifies the renormalization transformations connecting the successive scales. Section 7 analyzes the orbit of the iterations of these transformations and identifies various different cases for the scaling of the volatility of the block averages. Section 8 links these cases to the universality classes of cluster formation.

2 The model

In this section, we define the hierarchical Cannings process in random environment and construct its dual: a spatial coalescent in random environment. We begin in Section 2.1 by recalling the process without random environment introduced in [GdHKK14]. In Section 2.2 we explain how the random environment is added.

2.1 The hierarchical Cannings process

In Sections 2.1.1–2.1.3, we recall the definition of the hierarchical Canning process given in [GdHKK14]. In Section 2.1.4 we add the random environment and indicate how the definition needs to be modified.

2.1.1 The hierarchical group of order $N$

The hierarchical group $\Omega_{N}$ of order $N$ is the set

[TABLE]

endowed with the addition operation $+$ defined by $(\eta+\zeta)^{l}=\eta^{l}+\zeta^{l}\textrm{ (mod$ N $)}$ , $l\in\mathbb{N}_{0}$ . In other words, $\Omega_{N}$ is the direct sum of the cyclical group of order $N$ (a fact that is important for the application of Fourier analysis). The group $\Omega_{N}$ is equipped with the ultrametric distance $d_{\Omega_{N}}(\cdot,\cdot)$ defined by

[TABLE]

Let

[TABLE]

denote the $k$ -block around $\eta$ (i.e., the ball of hierarchical radius $k$ around $\eta$ ), which we think of as a macro-colony. The geometry of $\Omega_{N}$ is explained in Fig. 1.

In what follows, we consider a system of individuals organized in colonies labelled by $\Omega_{N}$ . Initially each colony has $M$ individuals, each carrying a type drawn from a Polish type space $E$ that is compact. Subsequently, individuals are subject to block migration (Section 2.1.2) and block reshuffling-resampling (Section 2.1.3). In the continuum-mass limit $M\to\infty$ , the evolution converges to the hierarchical Cannings process (Section 2.1.4).

2.1.2 Block migration

We introduce migration on $\Omega_{N}$ through a random walk kernel. For that purpose, we introduce a sequence of migration rates

[TABLE]

and we let the individuals migrate as follows:

•

Each individual, for every $k\in\mathbb{N}$ , chooses at rate $c_{k-1}/N^{k-1}$ the block of radius $k$ around its present location and jumps to a location chosen uniformly at random in that block.

The transition kernel of the random walk thus performed by the individuals is

[TABLE]

Remark 2.1.

The behaviour of the random walk in (2.5) is known in great detail. Dawson, Gorostiza and Wakolbinger [DGW05] showed that it is recurrent if and only if $\sum_{k\in\mathbb{N}_{0}}(1/c_{k})=\infty$ . They introduced the concept of degree of recurrence/transience $\gamma_{N}$ [DGW05, Definition 2.1.1], which in the special case where $c_{k}=c^{k}$ equals $\gamma(N)=\log c/\log(N/c)$ . Note that

[TABLE]

This is the same as for simple random walk on $\mathbb{Z}^{d}$ with Hausdorff dimension $d=d(N)=(2\log N)/\log(N/c)$ (when we allow for a continuum of dimensions). In particular, $d=d(N)=2$ for $c=1$ .**

Throughout the paper, we assume that

[TABLE]

This guarantees that the total migration rate per individual is finite.

2.1.3 Block reshuffling-resampling

The idea of the Cannings resampling mechanism is to allow reproduction with an offspring that is of a size comparable to the whole population. Since we have introduced a spatial structure, we now allow, on all hierarchical levels $k$ simultaneously, a reproduction event where each individual treats the $k$ -block around its present location as a macro-colony and uses it for its resampling. More precisely, we choose a sequence of resampling measures

[TABLE]

where $\mathcal{M}_{f}([0,1])$ denotes the set of finite non-negative measures on $[0,1]$ , satisfying

[TABLE]

and

[TABLE]

Let $\Lambda^{*}_{k}(\mathrm{d}r)=\Lambda_{k}(\mathrm{d}r)/r^{2}$ , $r\in(0,1]$ . Set

[TABLE]

and assume that

[TABLE]

We let individuals reshuffle-resample by carrying out the following two steps at once:

•

For every $\eta\in\Omega_{N}$ and $k\in\mathbb{N}_{0}$ , choose the block $B_{k}(\eta)$ at rate $1/N^{2k}$ .

•

First, each individual in $B_{k}(\eta)$ independently is moved to a uniformly random location in $B_{k}(\eta)$ , i.e., a reshuffling takes place (see Fig. 2). After that, $r$ is drawn according to the intensity measure $\Lambda^{*}_{k}$ and $a$ is drawn according to the current type distribution in $B_{k}(\eta)$ , and each of the individuals in $B_{k}(\eta)$ independently is replaced by an individual of type $a$ with probability $r$ .

Note that the reshuffling-resampling affects all the individuals in a macro-colony simultaneously and in the same manner. The reshuffling-resampling occurs at all levels $k\in\mathbb{N}_{0}$ , at a rate that is fastest in single colonies and gets slower as the level $k$ of the macro-colony increases. 111Because the reshuffling is done first, the resampling always acts on a uniformly distributed state (“panmictic resampling”). Reshuffling is a parallel update affecting all individuals in a macro-colony simultaneously. Therefore it cannot be seen as a migration of individuals equipped with independent clocks.

The first conditions in (2.9) and (2.10) make the resampling a jump process. Later we will add in diffusion by hand. The second condition in (2.9) guarantees that the population has a well-defined genealogy and that after a positive finite time most of the population at a site descends from a finite number of ancestors (see Pitman [Pit99]). The second condition in (2.10) is needed to guarantee that in finite time a macro-colony is affected by finitely many reshuffling-resampling events, otherwise the resampling cannot be properly defined.

Throughout the paper, we assume that

[TABLE]

Note that each of the $N^{k}$ colonies in a $k$ -block can trigger reshuffling-resampling in that block, and for each colony the block is chosen at rate $N^{-2k}$ . Therefore, (2.13) guarantees that the total resampling rate per individual is bounded.

2.1.4 The generator and the martingale problem

We are now ready to formally define the hierarchical Cannings process in terms of a martingale problem. The process arises as the continuum-mass limit of the individual-based model described in Sections 2.1.1–2.1.3. Namely, in each colony of size $M$ , instead of recording the numbers of individuals of a given type we record the empirical distribution of the types and pass to the limit $M\to\infty$ .

Let $\mathcal{P}(E)$ denote the set of probability measure on $E$ equipped with the topology of weak convergence. We equip the set $\mathcal{P}(E)^{\Omega_{N}}$ with the product topology to get a state space that is Polish. Let $\mathcal{F}\subset C_{\mathrm{b}}\big{(}\mathcal{P}(E)^{\Omega_{N}},\mathbb{R}\big{)}$ be the algebra of functions of the form

[TABLE]

The linear operator for the martingale problem

[TABLE]

has two parts,

[TABLE]

The migration operator is given by

[TABLE]

and the reshuffling-resampling operator by

[TABLE]

where

[TABLE]

is the $k$ -block average of the components of $x$ in $B_{k}(\eta)$ , $\Phi_{r,a,B_{k}(\eta)}\colon\,\mathcal{P}(E)^{\Omega_{N}}\to\mathcal{P}(E)^{\Omega_{N}}$ is the reshuffling-resampling map acting as

[TABLE]

and $L^{d_{0}}_{\eta}$ is the Fleming-Viot diffusion operator with volatility $d_{0}\geq 0$ , acting on the colony $x_{\eta}$ , given by

[TABLE]

with

[TABLE]

the Fleming-Viot diffusion coefficient, and

[TABLE]

Remark 2.2.

Note that the right-hand side of (2.18) is well-defined because of assumption (2.10). Indeed, by Taylor-expanding the inner integral in (2.18) in powers of $r$ , we get

[TABLE]

To have a well-defined resampling operator (2.18), the expression in (2.24) must be integrable with respect to $\Lambda^{*}_{k}(\mathrm{d}r)$ , which is equivalent to assumption (2.10).* $\square$ *

The following proposition was proved in [GdHKK14].

Proposition 2.3 (Hierarchical martingale problem).

For every $x\in\mathcal{P}(E)^{\Omega_{N}}$ , the martingale problem for $(L^{(\Omega_{N})},\mathcal{F},\delta_{x})$ is well-posed. 222As part of the definition of the martingale problem, we always require that the solution has càdlàg paths and is adapted to the natural filtration. The unique solution is a strong Markov process with the Feller property. $\square$

The Markov process arising as the solution of this martingale problem is denoted by

[TABLE]

and is referred to as the $C_{N}^{\underline{c},\underline{\Lambda}}$ -process on $\Omega_{N}$ . Proposition 2.3 does not actually need the second condition in (2.9).

This condition will be needed only later.

2.2 The hierarchical Cannings process in random environment

Our task in this section is to modify the first term in the right-hand side of (2.18) so as to include the effect of a random environment on the Cannings resampling mechanism. Section 2.2.1 defines the random environment, Section 2.2.2 the modified generator.

2.2.1 The random environment on the full tree

Recall that $\Omega_{N}$ is the set of leaves of the tree in Fig. 1. To introduce the random environment, we need to consider the full tree, i.e.,

[TABLE]

where $\Omega_{N}/B_{k}(0)$ denotes the quotient group of $\Omega_{N}$ modulo $B_{k}(0)$ , which can be identified with the layer of the tree situated at height $k$ above the leaves. Indeed, because $d_{\Omega_{N}}$ is an ultrametric distance (recall (2.2)), for each $k\in\mathbb{N}_{0}$ the set $\Omega_{N}$ decomposes into disjoint balls of radius $k$ , which can be labelled by the set $\Omega^{(k)}_{N}$ . For $\xi\in\Omega^{\mathbb{T}}_{N}$ , we write

[TABLE]

i.e., $|\xi|=k$ when $\xi\in\Omega^{(k)}_{N}$ for $k\in\mathbb{N}_{0}$ , and we define

[TABLE]

to be the set of sites in $\Omega_{N}$ that lie below $\xi$ (see Fig. 3). We can define the distance on the layer $\Omega^{(k)}_{N}$ as the graph distance to the most recent common ancestor, and the distance on the full tree $\Omega^{\mathbb{T}}_{N}$ as the largest of the two graph distances to the most recent common ancestor (recall Fig. 1). The latter will be denoted by $d_{\Omega^{\mathbb{T}}_{N}}$ . We write

[TABLE]

to denote the vertex in $\Omega^{\mathbb{T}}_{N}$ at height $k\in\mathbb{N}_{0}$ above $\eta\in\Omega_{N}$ . This site carries the rate for the random walk on $\Omega_{N}$ to become uniformly distributed on the $k-$ ball around $\eta$ . Since $\Omega^{\mathbb{T}}_{N}$ is isomorphic to $\Omega_{N}\times\mathbb{N}_{0}$ , we sometimes write $\mathrm{MC}_{k}(\eta)=\xi=(\eta,k)$ .

We want to make the reshuffling-resampling spatially random. To that end, we let

[TABLE]

be a random field of $\mathcal{M}_{f}([0,1])$ -valued resampling measures indexed by the tree.

•

Throughout the paper, we use the symbol $\omega$ to denote the random environment and the symbol $\mathbb{P}$ to denote the law of $\omega$ .

In what follows, we assume that $\Lambda^{\xi}(\omega)$ is of the form

[TABLE]

where $\underline{\lambda}=(\lambda_{k})_{k\in\mathbb{N}_{0}}$ is a deterministic sequence in $(0,\infty)$ (playing the role of modulation coefficients) and

[TABLE]

is a random field of $\mathcal{M}_{f}([0,1])$ -valued resampling measures that is stationary under translations in $\Omega^{\mathbb{T}}_{N}$ (i.e., translations sideways and up in Fig. 3), and satisfies the conditions in (2.9) when $\xi=0$ and the conditions in (2.10) when $\xi\neq 0$ .

Abbreviate

[TABLE]

which is the total mass of $\chi^{\xi}(\omega)$ . Clearly,

[TABLE]

is a random field of $(0,\infty)$ -valued total masses that is also stationary under translations in $\Omega^{\mathbb{T}}_{N}$ . Throughout the paper, we assume that

[TABLE]

and that the sigma-algebra at infinity associated with (2.33), defined by

[TABLE]

is trivial, i.e., all its events have probability 0 or 1 under the law $\mathbb{P}$ . For one of the theorems below we need to strengthen (2.35) to

[TABLE]

2.2.2 The generator in random environment

Throughout the sequel, we use the symbols $\eta,\zeta$ to denote elements of $\Omega_{N}$ and the symbol $\xi$ to denote elements of $\Omega^{\mathbb{T}}_{N}$ .

In random environment, we keep the definitions in Section 2.1.4, but we replace the reshuffling-resampling operator $L^{(\Omega_{N})}_{\mathrm{res}}$ in (2.18) by

[TABLE]

with $(\Lambda^{\xi}(\omega))^{*}(\mathrm{d}r)=\Lambda^{\xi}(\omega)(\mathrm{d}r)/r^{2}$ , $r\in(0,1]$ , where $y_{\xi}\in\mathcal{P}(E)$ is given by

[TABLE]

and $\Phi_{r,a,B_{|\xi|}(\xi)}\colon\,\mathcal{P}(E)^{\Omega_{N}}\to\mathcal{P}(E)^{\Omega_{N}}$ is the reshuffling-resampling map acting as

[TABLE]

The difference between (2.18) and (2.38) is that the resampling in blocks occurs according to the resampling measure associated with the center of the block, labelled by $\Omega^{\mathbb{T}}_{N}$ . The full generator is

[TABLE]

with $L^{(\Omega_{N})}_{\mathrm{mig}}$ the migration operator in (2.17).

3 Main theorems

In Section 3.1 we present results for fixed $N$ , in Section 3.2 for $N\to\infty$ , the hierarchical mean-field limit. In Section 3.3 we summarize the effects of the random environment. Throughout the paper, the environment $\omega$ is fixed and we use the symbol $\mathcal{L}[W]$ to denote the law of a random variable $W$ .

3.1 Results for fixed $N$

Section 3.1.1 establishes the well-posedness of the martingale problem, Section 3.1.2 the convergence to an equilibrium that depends on $\omega$ .

3.1.1 Well-posedness of the martingale problem

We begin by establishing that the martingale problem characterizes the process uniquely and specifies a strong Markov process.

Theorem 3.1 (Well-posedness of the martingale problem).

Fix $N\in\mathbb{N}\backslash\{1\}$ . For $\mathbb{P}$ -a.s. $\omega$ and every $x\in\mathcal{P}(E)^{\Omega_{N}}$ , the $(L^{(\Omega_{N})}(\omega),\mathcal{F},\delta_{x})$ -martingale problem is well-posed. The unique solution is a strong Markov process with the Feller property. $\square$

The Markov process arising as the solution of the martingale problem is denoted by

[TABLE]

and is referred to as the hierarchical Cannings process on $\Omega_{N}$ in the environment $\omega$ . Theorem 3.1 does not actually need the second condition in (2.9). This condition will be needed only later.

3.1.2 Dichotomy: coexistence versus clustering

We next show that the law of our process converges to a limit law that depends on $\omega$ .

Theorem 3.2 (Equilibrium).

Fix $N\in\mathbb{N}\backslash\{1\}$ . Suppose that, under the law $\mathbb{P}$ , the law of the initial state $X^{(\Omega_{N})}(\omega;0)$ is stationary and ergodic under translations in $\Omega_{N}$ , with mean single-coordinate measure

[TABLE]

Then, for $\mathbb{P}$ -a.e. $\omega$ , there exists an equilibrium measure $\nu^{N}_{\theta}(\omega)\in\mathcal{P}(\mathcal{P}(E)^{\Omega_{N}})$ , arising as

[TABLE]

satisfying

[TABLE]

Moreover, under the law $\mathbb{P}$ , $\nu^{N}_{\theta}(\omega)$ is stationary and ergodic under translations in $\Omega_{N}$ . $\square$

Note that $\nu^{N}_{\theta}(\omega)$ depends on $\omega$ even though its mean single-coordinate measure $\theta$ (which is determined by the initial state) does not. The proof of Theorem 3.2 is based on a computation with the dual hierarchical Cannings process, which allows us to control second moments. As we will see in Section 5, in random environment this computation is delicate because it involves two random walks in the same environment, and the difference of these two random walks is not a random walk itself, like in the average environment.

Using the stationarity and ergodicity of $\nu^{N}_{\theta}$ , we next identify the parameter regime for which $\nu^{N}_{\theta}(\omega)$ is a multi-type equilibrium (= coexistence given $\omega$ ), i.e.,

[TABLE]

respectively, a mono-type equilibrium (= clustering given $\omega$ ), i.e.,

[TABLE]

The two regimes are complementary. In the latter regime the system grows mono-type clusters that eventually cover all finite subsets of $\Omega_{N}$ (where types may or may not change infinitely often).

Theorem 3.3 (Dichotomy for finite $N$ ).

Fix $N\in\mathbb{N}\backslash\{1\}$ and assume (2.37).

(a)

Let $\mathcal{C}_{N}=\{\omega\colon\,\text{in$ \omega $coexistence occurs}\}$ . Then $\mathbb{P}(\mathcal{C}_{N})\in\{0,1\}$ .

(b)

$\mathbb{P}(\mathcal{C}_{N})=1$ * if and only if*

[TABLE]

$\square$ **

Cox and Klenke [CK00] give a criterion in the clustering regime for when the type at a given site changes infinitely often. For interacting Fleming-Viot processes they show that this happens as soon as $\theta$ is not a $\delta$ -measure. Because of the reasoning in Section 5, we therefore get the following.

Corollary 3.4 (Change of types).

In the clustering regime, if $\theta\neq\delta_{u}$ for some $u\in E$ , then at every site the type changes infinitely often. $\square$

3.2 Results for $N\to\infty$

Our remaining theorems capture the space-time scaling behaviour of our process in the hierarchical mean-field limit $N\to\infty$ . In this limit, the degree of recurrence/transience $\gamma(N)$ tends to [math] while the Hausdorff dimension $d(N)$ in the metric $\eta\mapsto e^{|\eta|}$ tends to $2$ (recall Remark 2.1), so that our process becomes near-critical.

In Section 3.2.1 we introduce a key process, called the McKean-Vlasov process, which naturally arises in this limit. In Section 3.2.2 we define the random environment for $N=\infty$ . In Section 3.2.3 we look at the block averages on successive space-time scales and show that as $N\to\infty$ these converge to a sequence of McKean-Vlasov processes with renormalized volatilities. In Section 3.2.5 we identify the scaling behaviour of the volatility on hierarchical scale $k$ in the limit as $k\to\infty$ , which leads to various different cases as a function of $\underline{c}$ and $\underline{\Lambda}$ . In Section 3.2.4 we identify the parameter regimes that correspond to coexistence, respectively, clustering. In Section 3.2.6 we link the different cases of scaling to five universality classes of cluster formation.

3.2.1 McKean-Vlasov process

We need some definitions and basic facts about the McKean-Vlasov process from [GdHKK14].

Let $\mathcal{F}\subseteq C_{\mathrm{b}}(\mathcal{P}(E),\mathbb{R})$ be the algebra of functions $F$ of the form

[TABLE]

For $c,d\in[0,\infty)$ , $\Lambda\in\mathcal{M}_{f}([0,1])$ subject to (2.9) and $\theta\in\mathcal{P}(E)$ , let $L_{\theta}^{c,d,\Lambda}\colon\,\mathcal{F}\to C_{\mathrm{b}}(\mathcal{P}(E),\mathbb{R})$ be the linear operator

[TABLE]

acting on $F\in\mathcal{F}$ as (recall (2.22))

[TABLE]

The three parts of $L_{\theta}^{c,d,\Lambda}$ correspond to: (1) a drift towards $\theta$ of strength $c$ (“immigration-emigration”); (2) a Fleming-Viot diffusion with volatility $d$ (“Moran resampling”); (3) a Cannings process with resampling measure $\Lambda$ (“Cannings resampling”). This model arises as the $M\to\infty$ limit of an individual-based model with $M$ individuals at a single site, with immigration at rate $c$ from a constant source with type distribution $\theta\in\mathcal{P}(E)$ , emigration at rate $c$ to a cemetery state, diffusive resampling at rate $d$ , and $\Lambda$ -resampling.

The following proposition was proved in [GdHKK14].

Proposition 3.5 (McKean-Vlasov martingale problem).

**

(a)

For every $y\in\mathcal{P}(E)$ , the martingale problem for $(L_{\theta}^{c,d,\Lambda},\mathcal{F},\delta_{y})$ is well-posed. The unique solution is a strong Markov process with the Feller property.

(b)

For every $c\in(0,\infty)$ , the solution from (a) is ergodic in time with unique equilibrium measure $\nu_{\theta}^{c,d,\Lambda}$ . For $c=0$ , the solution from (a) is not ergodic in time, and $\nu_{\theta}^{0,d,\Lambda}$ is the unique equilibrium measure obtained as the $t\to\infty$ limit with initial state $y=\theta$ .

(c)

For $c>0$ ,

[TABLE]

$\square$ **

Denote by

[TABLE]

the solution of the martingale problem in Proposition 3.5 for the special choice $y=\theta$ . This is called the McKean-Vlasov process with parameters $c,d,\Lambda$ and initial state $\theta$ .

3.2.2 Random environment for $N=\infty$

In order to be able to pass to the limit $N\to\infty$ , we need to define a random environment for $N=\infty$ in which all the random environments for finite $N$ are embedded. To that end, define $\Omega_{\infty}=\oplus_{\mathbb{N}}\mathbb{N}$ , and let (recall (2.26))

[TABLE]

Note that for any $N\in\mathbb{N}$ there is a natural embedding of $\Omega^{\mathbb{T}}_{N}$ into $\Omega^{\mathbb{T}}_{\infty}$ . Similarly as in Section 2.2.1, we let

[TABLE]

be a random field of $\mathcal{M}_{f}([0,1])$ -valued resampling measures index by the full tree, where $\omega$ again denotes the random environment. We retain the symbol $\mathbb{P}$ for the law of of $\omega$ . As in (2.31)–(2.36), we assume that $\Lambda^{\xi}(\omega)=\lambda_{|\xi|}\chi^{\xi}(\omega)$ where, under the law $\mathbb{P}$ , $\{\chi^{\xi}(\omega)\colon\,\xi\in\Omega^{\mathbb{T}}_{\infty}\}$ is stationary under translations in $\Omega^{\mathbb{T}}_{\infty}$ , and is such that the total masses $\rho^{\xi}(\omega)=\chi^{\xi}(\omega)((0,1])$ have first moment equal to $1$ , second moment finite, and a trivial sigma-algebra at infinity. For any $N\in\mathbb{N}$ , the natural restriction of the random field in (3.14) equals the random field in (2.30).

3.2.3 Renormalization via block averages

For each $k\in\mathbb{N}_{0}$ , we look at the $k$ -block averages defined by (recall Fig. 1)

[TABLE]

which constitute a renormalization of space where the component $\eta$ is replaced by the average of the components in $B_{k}(\eta)$ . After a corresponding renormalization of time where $t$ is replaced by $tN^{k}$ , i.e., $t$ is the associated macroscopic time variable, we obtain a renormalized interacting system

[TABLE]

which is constant in $B_{k}(\eta)$ and can be viewed as an interacting system indexed by the set $\Omega^{(k)}_{N}$ (see Fig. 1). This provides us with a sequence of renormalized interacting systems, which for fixed $N$ are not Markov.

The key ingredient to study the $N\to\infty$ limit of (3.16) is the following. Let $\underline{d}=(d_{k})_{k\in\mathbb{N}_{0}}$ be the sequence of volatility constants defined recursively as

[TABLE]

where

[TABLE]

$\rho$ is the $(0,\infty)$ -valued random variable whose law $\mathcal{L}_{\rho}$ is the same as that of $\rho^{0}(\omega)$ under $\mathbb{P}$ (recall (2.35–2.36)), and $\mathbb{E}_{\mathcal{L}_{\rho}}$ is expectation w.r.t. $\mathcal{L}_{\rho}$ . For fixed $\underline{c}$ , $\underline{\Lambda}$ and $d_{0}$ , the recursion in (3.17) determines $\underline{d}$ . The right-hand side is the average of a random Möbius transformation that depends on $\rho$ . Recall that $\rho$ has mean 1.

Heuristics behind the recursion formula for the volatilities.

In order to understand the recursion formula in (3.17), we consider the 1-block around the origin [math] on time scale $Nt$ and let $N\to\infty$ . Note that, in this limit, the time scales for the jumps to different levels separate (recall (2.5)), so that we can focus on each of the time scales separately.

If we randomly draw two lineages from the 1-block and ask whether they have a common ancestor some time back (so that they are of the same type), then we get exactly the event that generates the variance of the 1-block average (otherwise the lineages and their types would be independent and would have an asymptotically vanishing contribution to the variance). The fact that the lineages behave like a spatial coalescent follows from the duality introduced in Section 2. The lineages have to meet in order to have a common ancestor, which takes them a time of order $Nt$ . Note that triples of lineages have a negligible probability to meet at times of order $Nt$ in the limit of $N\to\infty$ .

If the lineages meet, then they may coalesce. This happens at rate

[TABLE]

when they both sit at $\eta\in\Omega$ . However, they may also move before they coalesce, i.e., make a migration jump away, which happens with probability $2c_{0}/(2c_{0}+\lambda^{(\eta,0)}(\omega))$ . Hence the effective coalescence rate is $\lambda^{(\eta,0)}(\omega)[2c_{0}/(2c_{0}+\lambda^{(\eta,0)}(\omega))]$ . Since the vertex where the lineages meet is uniformly distributed over the 1-block, the average rate is given by

[TABLE]

where we use that $\lambda^{(\eta,0)}(\omega)$ has the same distribution as $\lambda_{0}\rho(\omega)$ (recall (2.31)–(2.33)). If we would have a diffusive part as well, at constant rate $2d_{0}$ , then the lineages would coalesce at the same rate but with $\lambda(\omega)$ replaced by $2d_{0}+\lambda(\omega)$ . Since the volatility turns out to be equal to this rate, we get the recursion formula

[TABLE]

By the same reasoning for $k$ -blocks on time scale $tN^{k}$ , we get a heuristic explanation for the recursion formula in (3.17).

Our next theorem states that for each $k\in\mathbb{N}_{0}$ the $k$ -block averages in the limit as $N\to\infty$ evolve according to the McKean-Vlasov process defined in Section 3.2.1 with certain $k$ -dependent parameters.

Theorem 3.6 (Hierarchical mean-field limit and renormalization).

Suppose that for each $N$ the random field $X^{(\Omega_{N})}(\omega;0)$ is the restriction to $\Omega_{N}$ of a random field $X(\omega)$ indexed by $\Omega_{\infty}=\bigoplus_{\mathbb{N}}\mathbb{N}$ that is i.i.d. with single-component mean $\theta\in\mathcal{P}(E)$ . Then, for $\mathbb{P}$ -a.e. $\omega$ and every $k\in\mathbb{N}$ and $\eta\in\Omega_{\infty}$ ,

[TABLE]

where

[TABLE]

i.e., the label of the block (= macro-colony) of radius $k$ in $\Omega_{\infty}$ around $\eta\in\Omega_{\infty}$ (see Fig. 1). The same is true for $k=0$ when the initial condition for the McKean-Vlasov process in the right-hand side of (3.22) is $Z_{\theta}^{c_{0},d_{0},\Lambda^{\eta}(\omega)}(0)=X^{(\Omega_{N})}(\omega;0)$ instead of $Z_{\theta}^{c_{0},d_{0},\Lambda^{\eta}(\omega)}(0)=\theta$ . $\square$

Note that among the parameters $c_{k},d_{k},\Lambda^{\mathrm{MC}_{k}(\eta)}(\omega)$ of the limiting McKean-Vlasov process, the volatility $d_{k}$ is the result of a self-averaging with respect to the random environment up to and including level $k$ , as exemplified by (3.17). It is through this recursion relation that the renormalization manifests itself.

Our next theorem looks at successive block averages simultaneously.

Theorem 3.7 (Multi-scale analysis and the interaction chain).

Let $(t_{N})_{N\in\mathbb{N}}$ be such that

[TABLE]

Then, for $\mathbb{P}$ -a.e. $\omega$ , every $j\in\mathbb{N}$ and every $\eta\in\Omega_{\infty}$ ,

[TABLE]

where $M^{(j)}_{\eta}(\omega)=(M^{(j)}_{\eta,k}(\omega))_{k=-(j+1),-j,\ldots,0}$ is the time-inhomogeneous Markov chain with initial state

[TABLE]

and transition kernel from time $-(k+1)$ to $-k$ given by

[TABLE]

$\square$ **

The right-hand side of (3.24) describes the large space-time scaling behaviour of our hierarchical Cannings process.

Definition 3.8 (Interaction chain).

$M^{(j)}_{\eta}(\omega)$ * is called the interaction chain at level $j$ at location $\eta\in\Omega_{\infty}$ given $\omega$ . $\square$ *

Remark 3.9.

Theorem 3.7 only specifies the limiting distribution of the one-dimensional spatial marginals, i.e., the single interaction chains. Similarly as in Dawson, Greven and Vaillancourt [DGV95, Section 0e], it is possible to also specify the joint distribution of the interaction chains, which can be viewed as a field of Markov chains indexed by $\Omega^{\mathbb{T}}_{\infty}$ .* $\square$ *

An important characteristic of $M^{(j)}_{\eta}$ is the variance of $M^{(j)}_{\eta,0}$ , calculated as

[TABLE]

This shows that a key ingredient for $M_{\eta}^{(j)}$ is the sequence of volatilities $\underline{d}=(d_{k})_{k\in\mathbb{N}_{0}}$ and the way this sequence grows or decays. How is this affected by the randomness of the environment?

Our next theorem shows that the volatility $d_{k}$ in the random environment can be sandwiched between the volatility $d^{0}_{k}$ in the zero environment ( $\mathcal{L}_{\rho}=\delta_{0}$ , i.e., the system without resampling) and the volatility $d^{1}_{k}$ in the average environment ( $\mathcal{L}_{\rho}=\delta_{1}$ , i.e., the system with average resampling).

Theorem 3.10 (Randomness lowers volatility).

If $d^{0}_{0}=d_{0}=d^{1}_{0}$ , then $d^{0}_{k}<d_{k}<d^{1}_{k}$ for all $k\in\mathbb{N}$ . $\square$

3.2.4 Dichotomy for the interaction chain

How are the qualitative properties of the Cannings process for large $N$ reflected in the interaction chain? What about the dichotomy clustering versus coexistence? Before answering these questions we need to first establish the existence of the entrance law of the interaction chain from level $\infty$ , which we will obtain from the level $j$ interaction chain as limit $j\to\infty$ . With this object, we can address the question of coexistence versus clustering.

Proposition 3.11 (Entrance law of interaction chain exists).

The limit as $j\to\infty$ of $M^{(j)}_{\eta}$ exists. $\square$

The object corresponding to the equilibrium of the stochastic system for finite $N$ in the hierarchical mean-field limit $N\to\infty$ is the field of entrance laws of the interaction chain from level $\infty$ (recall Remark 3.9), in particular, its marginal law $\Pi_{\eta}\nu_{\theta}(\omega)$ at level [math] in $\eta$ , which is element of $\mathcal{P}(\mathcal{P}(E)$ .

Definition 3.12 (Entrance law of interaction chain).

For $\mathbb{P}$ -a.e. $\omega$ and all $\eta\in\Omega_{\infty}$ ,

[TABLE]

where $\nu_{\theta}(\omega)\in\mathcal{P}(\mathcal{P}(E))^{\Omega_{\infty}}$ is the entrance law from level $\infty$ of the (tree-indexed) interaction chain at level [math], and $\Pi_{\eta}\nu_{\theta}(\omega)$ denotes the projection of $\nu_{\theta}(\omega)$ on $\eta$ . $\square$

Our next theorem is indeed the analogue of Theorem 3.3 for $N\to\infty$ . In this limit, coexistence and clustering in $\omega$ are defined for $(M^{(\infty)}_{\eta,0})_{\eta\in\Omega_{N}}$ in the same way as in (3.5)–(3.6).

Theorem 3.13 (Dichotomy for $N=\infty$ ).

**

(a)

Let $\mathcal{C}=\{\omega\colon\,\text{in$ \omega $coexistence occurs}\}$ . Then $\mathbb{P}(\mathcal{C})\in\{0,1\}$ .

(b)

$\mathbb{P}(\mathcal{C})=1$ * if and only if*

[TABLE]

$\square$ **

Note that condition (3.30) is the limit of condition (3.7) as $N\to\infty$ . In fact, the two conditions are equivalent when the following weak regularity condition holds:

[TABLE]

An important question is whether the equilibrium measure $\nu_{\theta}(\omega)$ is the limit as $N\to\infty$ of the equilibrium measure $\nu_{\theta}^{N}(\omega)$ (recall (3.3)). The answer is yes. We only prove the following.

Corollary 3.14 (Hierarchical mean field limit of equilibrium).

For $\mathbb{P}$ -a.s. all $\omega$ and all $\eta\in\Omega_{\infty}$ ,

[TABLE]

$\square$ **

3.2.5 Scaling of the volatility

We are interested in the behaviour of the variance of the interaction chain $M^{(j)}$ as $j\to\infty$ , since this allows us to identify universality classes for the scaling behaviour of our stochastic system. From the variance formula, we see that $(d_{k})_{k\in\mathbb{N}_{0}}$ is the key input, and so we study this sequence first. Note that the variance formula really depends on the ratios $d_{k}/c_{k},\mu_{k}/d_{k}$ , which we encounter below.

Our next two theorems identify the scaling behaviour of $d_{k}$ as $k\to\infty$ in the regime of clustering. The first theorem considers the case of polynomial coefficients, i.e.,

[TABLE]

with $a,b\in\mathbb{R}$ and $L_{c},L_{\mu}$ slowly varying at infinity. In what follows, we assume that

[TABLE]

exist and write $K_{k}$ and $L_{k}$ for the respective sequences. There are five cases according to the values of $K$ and $L$ . Four of these, labelled (a)–(d), we can analyze in detail. For the remaining case, labelled (e), see Remark (3.16). For cases (c)–(d), we need extra regularity conditions on $L_{c},L_{\mu}$ in (3.33), for which we refer the reader to [GdHKK14, Eqs. (1.79)–(1.81)].

Theorem 3.15 (Scaling of the Fleming-Viot volatility: polynomial coefficients).

Under the polynomial scaling assumptions (3.33)–(3.34), the following cases apply:

(a)

If $K=\infty$ , then $\lim_{k\to\infty}d_{k}/c_{k}=1$ .

(b)

If $K\in(0,\infty)$ , then $\lim_{k\to\infty}d_{k}/c_{k}=M$ with $M\in(0,1)$ the unique solution of the equation

[TABLE]

(c)

If $K=0$ and $L=\infty$ , then $\lim_{k\to\infty}d_{k}/\sqrt{c_{k}\mu_{k}}=1$ .

(d)

If $K=0$ , $L\in[0,\infty)$ and $a\in(-\infty,1)$ , then $\lim_{k\to\infty}\sigma_{k}d_{k}=M$ with $\sigma_{k}=\sum_{l=0}^{k-1}(1/c_{l})$ and $M\in[1,\infty)$ given by

[TABLE]

$\square$ **

Remark 3.16.

It is straightforward to check with the help of (3.33)–(3.34) that all four cases (a)-(d) correspond to choices of $\underline{c}$ and $\underline{\lambda}$ for which clustering holds, i.e., the sum in (3.30) diverges (note that $\lim_{k\to\infty}\sigma_{k}=\infty$ in case (d)). However, they are not exhaustive: there is a fifth case (e), corresponding to $K=0$ , $L\in[0,\infty)$ , $a=1$ and $\lim_{k\to\infty}\sigma_{k}=\infty$ , for which we have no scaling result. This case lies at the border of the clustering regime. An example is $c_{k}\sim k(\log k)^{\gamma}$ , $\gamma\in(-\infty,1]$ , and $\mu_{k}=k^{-2}c_{k}$ , which we were able to handle in the deterministic model in [GdHKK14], but cannot handle in the random model treated here.* $\square$ *

The second theorem considers the case of exponential coefficients, i.e.,

[TABLE]

with $c,\mu\in(0,\infty)$ and $\bar{c}_{k},\bar{\mu}_{k}$ satisfying (3.33) with exponents $a,b$ . We further assume that

[TABLE]

exists.

Theorem 3.17 (Scaling of the Fleming-Viot volatility: exponential coefficients).

Under the exponential scaling assumptions in (3.37)–(3.38), the following cases apply (cf. Theorem 3.15):

(A)

[Like Case (a)]* $c<\mu$ , or $c=\mu$ and $\bar{K}=\infty$ : $\lim_{k\to\infty}d_{k}/c_{k}=1/c$ .*

(B)

[Like Case (b)]* $c=\mu$ and $\bar{K}\in(0,\infty)$ : $\lim_{k\to\infty}d_{k}/c_{k}=\bar{M}/c$ with $\bar{M}\in(0,1)$ the unique solution of the equation*

[TABLE]

(C)

The case $\bar{K}=0$ , with $c=\mu$ or $c>\mu$ , splits into three sub-cases:

(C1)

[Like Case (b)]* $c=\mu<1$ , $\bar{K}=0$ : $\lim_{k\to\infty}d_{k}/c_{k}=(1-c)/c$ .*

(C2)

[Like Case (c)]* $c=\mu>1$ , $\bar{K}=0$ , $\sum_{k\in\mathbb{N}_{0}}\bar{K}_{k}=\infty$ : 333In [GdHKK14] the condition $\sum_{k\in\mathbb{N}_{0}}\bar{K}_{k}=\infty$ was mistakenly omitted. $\lim_{k\to\infty}d_{k}/\mu_{k}=1/(\mu-1)$ .*

(C3)

[Like Case (d)]* $1>c>\mu$ and $\bar{K}=0$ , or $1=c>\mu$ , $\bar{K}=0$ and $a\in(-\infty,1)$ : $\lim_{k\to\infty}\sigma_{k}d_{k}=1$ .*

$\square$ **

The same observation as in Remark 3.16 applies. Again, the critical case $a=1$ is missing in (C3).

3.2.6 Cluster formation

Within the clustering regime it is of interest to study the size of the mono-type regions as a function of time, i.e., how fast the clusters where one type prevails grow.

This question has been addressed for other population models. For the voter model on $\mathbb{Z}^{2}$ , Cox and Griffith [CG86] showed that the radii of the clusters with opinion “all 1” or “all 0” scale as $t^{\alpha/2}$ with $\alpha\in[0,1)$ , i.e., clusters occur on all scales $\alpha\in[0,1)$ . For the model of hierarchically interacting Fleming-Viot diffusions with $c_{k}\equiv 1$ (= critically recurrent migration), Fleischmann and Greven [FG94] showed that, for all $N\in\mathbb{N}\setminus\{1\}$ and all $\eta\in\Omega_{N}$ ,

[TABLE]

in the sense of finite-dimensional distributions, where $(Y(t))_{t\in[0,\infty)}$ is the standard Fleming-Viot diffusion on $\mathcal{P}(E)$ . A similar behaviour occurs for other models, e.g. branching models as shown in Dawson and Greven [DG96].

The advantage of the hierarchical group is that we can analyze the cluster formation as a function of $N$ and let $N\to\infty$ to approach the critically recurrent case (recall Remark 2.1). We can do this by using the interaction chain. In [GdHKK14] we analysed the Cannings model in the limit as $N\to\infty$ , namely, we proved that for some level scaling function $k\colon\,\mathbb{N}_{0}\to\mathbb{N}_{0}$ , satisfying $0\leq k(j)\leq j+1$ and $\lim_{j\to\infty}k(j)=\infty$ , we obtained a non-trivial clustering limiting law (henceforth we pick $\eta=0$ and drop it from the notation)

[TABLE]

for some $M^{\infty}\in\mathcal{P}(E)$ satisfying $E[M^{\infty}]=E[X_{0}^{(\Omega_{N})}(0)]=\theta\in\mathcal{P}(E)$ that is not of the form $M^{\infty}=\delta_{U}$ for some possibly random $U\in E$ . We will do the same in the random environment $\omega$ , namely, our aim is to show that for $\mathbb{P}$ -a.e. $\omega$

[TABLE]

for some $M^{\infty}(\omega)\in\mathcal{P}(E)$ satisfying $\mathbb{E}[M^{\infty}(\omega)]=\theta$ that is not of the form $M^{\infty}(\omega)=\delta_{U(\omega)}$ for some possibly random $U(\omega)\in E$ .

As in Dawson and Greven [DG93, DGV95, DG96], and similarly as in (3.40), in order to obtain the profile of cluster formation it is necessary to consider a whole family of scalings $k_{\alpha}\colon\,\mathbb{N}_{0}\to\mathbb{N}_{0}$ , $\alpha\in I$ , with $I=\mathbb{N}_{0}$ , $I=[0,\infty)$ or $I=[0,1)$ , and with $j\mapsto k_{\alpha}(j)$ non-decreasing, $0\leq k_{\alpha}(j)\leq j+1$ and $\lim_{j\to\infty}k_{\alpha}(j)=\infty$ , such that

[TABLE]

for some non-constant Markov process $M^{*}=(M^{*}_{\alpha}(\omega))_{\alpha\in I}$ on $\mathcal{P}(E)$ that preserves the mean $\theta$ . The convergence in (3.43) is in the weak topology on the product space of $\mathcal{P}(E)$ and the space of the environment.

There are five universality classes of clustering behaviour (see [DG96]):

Definition 3.18 (Clustering classes).

**

(I)

Concentrated clustering* ( $M^{\ast}$ is a Markov chain):*

(I1)

$k_{\alpha}(j)=0\vee(j+1-\alpha)$ , $\alpha\in\mathbb{N}_{0}$ , $M^{*}$ is trapped after one step.

(I2)

$k_{\alpha}(j)=0\vee(j+1-\alpha)$ , $\alpha\in\mathbb{N}_{0}$ , $M^{*}$ is not trapped.

(II)

Diffusive clustering* ( $M^{\ast}$ is a diffusion process):*

(II1)

Fast clustering*: $k_{\alpha}(j)=0\vee\lfloor j+1-\alpha h(j)\rfloor$ , $\alpha\in[0,\infty)$ , where $h\colon\,\mathbb{N}_{0}\to[0,\infty)$ is such that $\lim_{j\to\infty}h(j)=\infty$ and $\lim_{j\to\infty}h(j)/j=0$ .*

(II2)

Moderate clustering*: $k_{\alpha}(j)=\lfloor(1-\alpha)(j+1)\rfloor$ , $\alpha\in[0,1)$ .*

(II3)

Slow clustering*: $\lim_{j\to\infty}k_{\alpha}(j)/j=0$ , $\alpha\in[0,1)$ .*

(The terminology is slightly different from [GdHKK14].) The volume of the clusters at time $t$ in these five universality classes (arranged in decreasing order of magnitude) equals, respectively, $N^{t}$ , $ZN^{t}$ , $N^{t-o(t)}$ , $N^{Zt}$ , $N^{o(t)}$ , with $Z\in(0,1)$ some random variable. Note that slow clustering borders with the regime of coexistence (= no clustering).

Recall (a)-(d) in Theorem 3.15 and (A)-(C) in Theorem 3.17. Recall that, under the law $\mathbb{P}$ , the law of the initial state $X^{(\Omega_{N})}(\omega;0)$ is stationary and ergodic under translations in $\Omega_{N}^{\mathbb{T}}$ , with mean single-coordinate measure

[TABLE]

The interaction chain on level $j$ , arising in the scaling limit $N\to\infty$ , starts in $\theta$ . Below this is also assumed for the scaling limit $j\to\infty$ .

Theorem 3.19 (Cluster formation).

Fix $N\in\mathbb{N}\backslash\{1\}$ . The five universality classes in the regime of clustering, linked to the different cases of scaling behaviour of $(d_{k})_{k\in\mathbb{N}}$ , are as follows:

$\bullet$

(a), (A):* The scaling in regime (I1) yields (3.43) with $I=\mathbb{N}_{0}$ . The scaling limit $M^{*}$ is the time-homogeneous Markov chain on $\mathcal{P}(E)$ starting in $\theta$ with transition kernel $K(\theta,\cdot)$ given by*

[TABLE]

which satisfies $K_{\alpha}=K$ , for all $\alpha\in\mathbb{N}$ .

$\bullet$

(b), (B), (C1), (C3)[first subcase]:* The scaling in regime (I2) yields (3.43) with $I=\mathbb{N}_{0}$ . The scaling limit $M^{*}$ is the time-inhomogeneous Markov chain on $\mathcal{P}(E)$ in random environment $(\chi_{\alpha})_{\alpha\in\mathbb{N}_{0}}$ starting in $\theta$ with transition kernels $\{K_{\alpha}(\theta,\cdot)(\omega)\}_{\alpha\in\mathbb{N}_{0}}$ given by (recall (3.11))*

[TABLE]

with

[TABLE]

In the last two cases, the random environment does not affect the scaling limit, and the scaling is the same as for the homogeneous environment with the same mean. In the first two cases the measure-valued process $(\chi_{\alpha}(\omega))_{\alpha\in\mathbb{N}_{0}}$ in (3.46) is constructed by extending the one-sided stationary random environment $(\chi^{(\eta,k)}(\omega))_{\eta\in\Omega_{N},\,k\in\mathbb{N}_{0}}$ introduced in (2.32) to a two-sided stationary random environment $(\chi^{(\eta,k)}(\omega))_{\eta\in\Omega_{N},\,k\in\mathbb{Z}}$ , and defining

[TABLE]

which by stationarity does not depend on $\eta$ . Furthermore, $\chi_{\alpha}(\omega)$ is an $\mathcal{M}_{f}([0,1])$ -valued resampling measure with $\mathbb{E}[\chi_{\alpha}(\omega)]=\bar{\chi}_{\alpha}$ satisfying $\bar{\chi}_{\alpha}((0,1])=1$ .

$\bullet$

(c), (C2)[subcase $\lim_{k\to\infty}k\bar{K}_{k}=\infty$ ]:* The scaling in regime (II1) yields (3.43) with $I=[0,\infty)$ . The scaling limit $M^{*}$ is the time-changed standard Fleming-Viot process*

[TABLE]

with

–

(c):* $\ell(\alpha)=\alpha$ , $h(j)=1/\sqrt{K_{j}}$ .*

–

(C2)[ $\lim_{k\to\infty}k\bar{K}_{k}=\infty$ ]:* $\ell(\alpha)=\frac{\mu}{\mu-1}\alpha$ , $h(j)=1/K_{j}$ .*

$\bullet$

(d), (C2)[subcase $\lim_{k\to\infty}k\bar{K}_{k}=\bar{N}$ ], (C3)[second subcase]:* The scaling in regime (II2) yields (3.43) with $I=[0,1)$ . The scaling limit $M^{*}$ is the time-changed standard Fleming-Viot process 444[GdHKK14] contains a typo: there the time scaling $1/(1-\alpha)^{R}$ was wrongly written as $1/(1-\alpha^{R})$ .*

[TABLE]

with

–

(d):* $R=M(1-a)$ .*

–

(C2)[subcase $\lim_{k\to\infty}k\bar{K}_{k}=\bar{N}$ ]:* $R=\bar{N}\frac{\mu}{\mu-1}$ .*

–

(C3)[second subcase]:* $R=1-a$ .*

For reasons explained in Remark 8.2, in cases (c), (d), (C2), (C3)[second subcase] only convergence in $\mathbb{P}$ -probability and not $\mathbb{P}$ -a.s. is obtained. $\square$

Remark 3.20.

We expect that also slow clustering occurs, namely, in the case (e) that was not treated in Theorems 3.15 and 3.17 (recall Remark 3.16).* $\square$ *

3.3 Summary of the effects of the random environment

1.

Theorem 3.1 says that the hierarchical Cannings process in random environment is well-defined for $\mathbb{P}$ -a.e. $\omega$ , while Theorem 3.2 shows that it converges to an $\omega$ -dependent equilibrium that preserves the single-component mean.

2.

Theorem 3.6 (mono-scale) and Theorem 3.7 (multi-scale) identify the behaviour of the $k$ -block averages in the limit as $N\to\infty$ in terms of the McKean-Vlasov process with parameters that depend on $\omega$ and $k$ . The volatility $d_{k}$ depends on the parameters $c_{l},\lambda_{l}$ , $0\leq l<k$ , and on the law $\mathcal{L}_{\rho}$ via the recursion relation in (3.17), which is a randomized version of the recursion relation in [GdHKK14].

3.

Theorems 3.3 and 3.13 show that the dichotomy “coexistence versus clustering” is not affected by the random environment: the same conditions apply to the homogeneous hierarchical Cannings process studied in [GdHKK14]. Apparently, for the nature of the equilibrium only the large-scale properties of the random environment matter. Since the resampling measures are stationary under translations with total masses whose sigma-algebra at infinity is trivial, only the average medium behaviour is relevant. The proof of the dichotomy in Theorem 3.3 requires assumption (2.37) rather than assumption (2.35). We believe this strengthening to be redundant, but a proof would require considerable extra work.

4.

Theorem 3.10 shows that the effect of the random environment is to lower the volatility parameter $d_{k}$ on every hierarchical scale $k$ compared to the average environment. The intuition behind this is that the random environment causes fluctuations in the resampling, which in turn reduce the clustering. The sandwich between the volatilities for the zero environment and the average environment is useful to control the scaling.

5.

Theorem 3.15 (polynomial coefficients) and Theorem 3.17 (exponential coefficients) show that for Cases (b) and (B), where migration and resampling occur at comparable rates, the phenomenon of lower volatility $d_{k}$ in random environment persists in the limit as $k\to\infty$ : even though the scaling of $d_{k}$ as $k\to\infty$ is the same as for the average environment, it has a different prefactor (e.g. $M$ solving (3.35) is strictly smaller than $M^{*}$ solving (3.35) with $\mathcal{L}_{\rho}$ replaced by $\delta_{1}$ , as is easily shown by applying Jensen’s inequality). For all other cases both the scaling and the prefactor are the same as for the average environment.

6.

Theorem 3.19 shows that for Cases (b), (B), (C1), (C3)[first subcase] the scaling of the clusters in the random environment changes compared to that in the average environment: the random environment is visible even in the scaling limit. The effect of the random environment is to slow down the growth of the clusters, i.e., to enhance the diversity of types. For all other cases the scaling of the clusters is the same as for the average environment.

4 Existence, uniqueness, duality and equilibrium

In this section, we prove Theorems 3.1–3.2. In Section 4.1 we construct the dual process with the help of a graphical representation based on Poisson random measures. In Section 4.2 we exhibit the duality. In Section 4.3 we establish the existence and uniqueness of the dual process and show the existence of its equilibrium. In Section 4.4 we use these results to prove Theorems 3.1–3.2. Theorems 4.1–4.4 below do not need a separate proof: this is verbatim the same as the proof for the homogeneous environment given in [GdHKK14].

4.1 The spatial coalescent in random environment

In this section we introduce a hierarchical coalescent process in random environment that will serve as a dual to the hierarchical Cannings process in random environment.

The coalescent is a Markov process taking values in the set of partitions of $\mathbb{N}$ labelled by the points of a geographical space. We recall the basic objects and notations, and refer to [GdHKK14, Section 2] for details.

Let $G$ be a discrete geographical space. Our target geographical space is $G=\Omega_{N}$ . This will be approximated by a sequence of geographical spaces

[TABLE]

which are to be thought of as a sequence of blocks filling $\Omega_{N}$ . We will also need to consider the mean-field geographical space

[TABLE]

where $\{\ast\}$ is a cemetery location. The state space of the spatial coalescent is the set of $G$ -labelled partitions defined as

[TABLE]

where $n\in\mathbb{N}$ and

[TABLE]

We equip the set $\Pi_{G,n}$ with the discrete topology. Let $a$ be a random walk transition kernel on $G$ . When $G=\Omega_{N}$ we use the hierarchical random walk kernel $a=a^{(N)}$ in (2.5), when $G=G_{N,K}$ we use the same hierarchical random walk kernel but with $c_{k}=0$ for $k>K$ , and when $G=\{0,\ast\}$ we use the random walk kernel with $a(0,\ast)=c$ , $a(\ast,0)=0$ .

Given the random environment $\omega$ (recall Section 2.2.1), the spatial coalescent in random environment is the Markov process on state space $\Pi_{G,n}$ with the following dynamics:

•

[Migration] Each partition block performs an independent random walk on $G$ with random walk kernel $a^{*}$ , where $a^{*}(g_{1},g_{2})=a(g_{2},g_{1})$ , $g_{1},g_{2}\in G$ , is the conjugate random walk kernel.

•

[Local coalescence] Independently at each location $g\in G$ , the $l$ -tuples of the partition elements at $g$ coalesce into a single partition element at $g$ at rate

[TABLE]

where $b$ is the current total number of partition elements and $\Lambda^{[g]}(\omega)$ is the resampling measure at $g$ in environment $\omega$ .

•

[Non-local coalescence with reshuffling] In the case $G=\Omega_{N}$ , independently at each location $g\in B_{|\xi|}(\xi)$ , $\xi\in\Omega^{\mathbb{T}}_{N}$ , the $l$ -tuples of the partition elements in $B_{|\xi|}(\xi)$ coalesce into a single partition element at $g$ at rate

[TABLE]

Subsequently, all the partition elements located in $B_{|\xi|}(\xi)$ are uniformly reshuffled, i.e., all the partition elements in $B_{|\xi|}(\xi)$ get a new location that is drawn uniformly from $B_{|\xi|}(\xi)$ .

Note that in the case $G=\Omega_{N}$ the partition elements of the coalescent perform a hierarchical random walk on $\Omega_{N}$ in the environment $\omega$ with migration coefficients given by (recall (2.5))

[TABLE]

where $\mathrm{MC}_{k}(\eta)$ is the unique site at height $k$ above $\eta\in\Omega_{N}$ and $\lambda^{\mathrm{MC}_{k+1}(\eta)}(\omega)=\Lambda^{\mathrm{MC}_{k+1}(\eta)}(\omega)$ $((0,1])$ (recall the notation introduced in Section 2.2 and see Fig. 3). The extra term in the right-hand side of (4.7) comes from the reshuffling that takes place prior to the resampling.

The coalescence rate of two partition elements in $B_{|\xi|}(\xi)$ equals (recall (2.33))

[TABLE]

We specify the spatial coalescent as a Markov process on $\Pi_{G}=\cup_{n\in\mathbb{N}}\Pi_{G,n}$ by providing its generator. To that end, we need a space of test functions on $\Pi_{G}$ . Namely, let $\mathcal{C}_{G}$ be the algebra of bounded continuous functions $F\colon\,\Pi_{G}\to\mathbb{R}$ such that for all $F\in\mathcal{C}_{G}$ there exists an $n=n(F)\in\mathbb{N}$ and a bounded function

[TABLE]

with the property that $F(\cdot)=F_{n}(\cdot|_{n})$ . Consider the linear operator $L^{(G)*}(\omega)\colon\mathcal{C}_{G}\to\mathcal{C}_{G}$ defined as

[TABLE]

where the operators $L^{(G)*}_{\mathrm{mig}},L^{(G)*}_{\mathrm{coal}}(\omega)\colon\,\mathcal{C}_{G}\to\mathcal{C}_{G}$ are defined for $\pi_{G}\in\Pi_{G}$ and $F\in\mathcal{C}_{G}$ as

[TABLE]

and

[TABLE]

Here, the migration map $\textrm{mig}_{g\to f,i}(\pi_{G}|_{n})$ changes the spatial coordinate of the $i$ -th partition block from $g$ to $f$ (if such a partition element exists), the coalescence map $\textrm{coal}_{J,g}(\pi_{G,n})$ coalesces the partition blocks with indices in $J$ and location $g$ (if any) into one block, while the reshuffling map $\textrm{resh}_{B_{|\xi|}(\xi),U_{B_{|\xi|}(\xi)}}$ independently relocates each partition element located in $B_{|\xi|}(\xi)$ to a new location in $B_{|\xi|}(\xi)$ that is randomly chosen.

Theorem 4.1 (Existence and uniqueness).

For every $\pi\in\Pi_{G}$ the $(L^{(G)*}(\omega),\mathcal{C}_{G},\delta_{\pi})$ -martingale problem is well-posed. $\square$

We denote the solution of the $(L^{(G)*}(\omega),\mathcal{C}^{(G)},\delta_{\pi})$ -martingale problem by

[TABLE]

For every $n\in\mathbb{N}$ , when restricted to $\Pi_{G,n}$ , $\mathfrak{C}^{(G)}(\omega)$ becomes a strong Markov process $\mathfrak{C}_{n}^{(G)}(\omega)$ with the Feller property.

4.2 Dualities

Consider the map

[TABLE]

where $n\in\mathbb{N}$ , $\phi\in C_{\rm b}(\mathcal{P}(E)^{n})$ , $x=(x_{\eta})_{\eta\in G}\in\mathcal{P}(E)^{G}$ , $\pi_{G,n}\in\Pi_{G,n}$ , $b=b(\pi_{G,n})=|\pi_{G,n}|$ .

Theorem 4.2 (Duality).

Fix $N\in\mathbb{N}\backslash\{1\}$ . For each of the choices $G$ in (4.1) and (4.2),

[TABLE]

for all $n\in\mathbb{N}$ and $\phi\in C_{\rm b}(\mathcal{P}(E)^{n})$ , where the same $\omega$ is used on both sides. $\square$

This theorem is a consequence of the generator relation

[TABLE]

This relation has been verified for the homogeneous model in [GdHKK14], but here works the same.

4.3 Well-posedness of the martingale problems and equilibria

Theorem 3.1 can be formulated for geographic spaces that are countable Abelian groups, in particular, the hierarchical group and the Euclidean lattice. For us the following generalization suffices.

Theorem 4.3 (Well-posedness).

For each of the choices $G$ in (4.1) and (4.2) the following holds: For $\mathbb{P}$ -a.e. $\omega$ and every $\pi\in\Pi_{G}$ , the $(L^{(G)}(\omega),\mathcal{C}^{(G)},\delta_{\pi})$ -martingale problem is well-posed. $\square$

Theorem 4.4 (Equilibrium).

Fix $N\in\mathbb{N}\backslash\{1\}$ . Fix $n\in\mathbb{N}$ and start the $\mathfrak{C}^{(\Omega_{N})}(\omega)$ -process in a labelled partition $\{(\pi_{i},\eta_{i})\}_{i=1}^{n}$ , where $\{\pi_{i}\}_{i=1}^{n}$ form a partition of $\mathbb{N}$ and $\{\eta_{i}\}_{i=1}^{n}$ represent the labels. If $x$ is a random state with mean $\theta\in\mathcal{P}(E)$ whose law is invariant and ergodic under translations, then

[TABLE]

for all $\phi\in C_{\rm b}(\mathcal{P}(E)^{n})$ . $\square$

In order to prove Theorem 4.4, we follow the argument in [DGV95, Section 3]). The partition-valued process converges to a limiting partition. If the locations of the partition elements would follow a homogeneous random walk, then the key to the argument would be the averaging property one can prove via Fouries analysis

[TABLE]

where $p_{t}(\cdot,\cdot)$ is the time- $t$ transition kernel of the random walk on $\Omega_{N}$ and $\nu(\cdot)$ is the Haar measure on $\Omega_{N}$ (see Evans and Fleischmann [EF96]). We need to show that the same holds for our random walk in random environment $\omega$ , which goes as follows.

Place a Poisson clock at every $\xi\in\Omega_{N}^{\mathbb{T}}\backslash\Omega_{N}$ . Let the clock at $\xi$ ring at rate

[TABLE]

At any moment of time let the random walk look at the ancestral line above its current position (see Fig. 3) and redistribute itself uniformly over the block around its current position whose hierarchical label corresponds to the height of the first clock on that ancestral line that rings. The resulting random walk is the same as the hierarchical random walk in environment $\omega$ with migration coefficients given by (4.7).

Next, let $K_{t}(\eta)$ be the highest hierarchical level at which prior to time $t$ a Poisson clock that lies on the ancestral line above $\eta$ has rang. Then at time $t$ the random walk starting from $\eta$ is uniformly distributed on the $K_{t}(\eta)$ -block around $\eta$ . Hence we have

[TABLE]

where $p_{t}^{\omega}(\cdot,\cdot)$ is the time- $t$ transition kernel of the random walk in $\omega$ . Fix $\eta\in\Omega_{N}$ and $f\in C_{b}(\Omega_{N})$ . The first term between the square brackets in (4.20) tends to $\int_{\Omega_{N}}f(\xi)\nu(\mathrm{d}\xi)$ as $k\to\infty$ . The second term is bounded from above by $\|f\|_{\infty}\,p_{t}^{\omega}(\eta,\Omega_{N}\backslash B_{k}(\eta))$ , which tends to zero as $k\to\infty$ . Finally, since all Poisson clocks ring at a strictly positive rate, we have

[TABLE]

It therefore follows that the right-hand side of (4.20) tends to the right-hand side of (4.18) as $t\to\infty$ .

4.4 Consequences for the Cannings process

The claims in Theorems 3.1–3.2 follow from Theorems 4.3–4.4. As argued in [GdHKK14], the proof follows the strategy for the two-type case given in Evans [Eva97, Theorem 4.1], which says that for spatial coalescent processes well-posedness and existence carry over from the dual process to the original process.

We next prove Corollary 3.14.

Proof.

We analyze both $\nu_{\theta}^{N}$ and $\nu_{\theta}$ with the help of duality relations and show that the dual representation of the former converges to the dual representation of the latter.

Step 1: $\nu_{\theta}$ . We have to construct a dual process for the entrance law of a Markov chain, namely, the interaction chain running from level $\infty$ down to level [math]. We consider the process that is dual to the interaction chain at level $j$ . This dual process is a discrete-time Markov chain whose transition kernel we can determine, for fixed $j$ and in the limit as $j\to\infty$ , via an explicit construction. This dual Markov chain is a spatial coalescent on $\{0,1,\ldots,j\}$ , or on $\mathbb{N}$ when we consider all $j$ simultaneously and are interested in its limit state as $j\to\infty$ .

We first focus on the dual transition kernel at one particular level. In the interaction chain this is defined via the equilibrium of the McKean-Vlasov process. How did this equilibrium arise? We consider a mean-field system of size $N^{k}$ with parameters $c_{k}$ , $d_{k}$ , $\Lambda_{k}$ and take the mean-field dual started in $n$ individuals at mutual distance $k$ . This dual is shown to converge, in the limit as $N\to\infty$ and on time scale $t_{N}N^{k}$ with $t_{N}\to\infty$ and $t_{N}=o(N)$ , to a limiting process that is a coalescent on the geographic space $k\cup\{\bigtriangleup\}$ , with $\bigtriangleup$ a cemetery state, such that the process jumps from $k$ to $\bigtriangleup$ at rate $c_{k}$ and does Kingman coalescence at rate $d_{k}$ and $\Lambda$ -coalescence according to $\Lambda_{k}(\mathrm{MC}_{k}(0))(\omega)$ . This limiting process is run for infinite time to obtain the dual transition kernel at the $k$ -th step. This partition at $\bigtriangleup$ is used as input for the next step of the dual with label $k+1$ . Altogether this procedure defines the full Markov chain, i.e., the new site and the new partition element. We denote the path of the dual Markov chain by

[TABLE]

(Recall that partitions are ordered and hence the limiting state exists.) The line of argument is the same for each $k$ . A detailed argument can be found in [GdHKK14, Corollary 2.12].

We need an explicit description as an $\mathbb{N}$ -marked partition-valued process, namely, the above mentioned random walk, moving one step to the right on $\mathbb{N}$ , doing Kingman coalescence at rate $d_{k}$ and $\Lambda$ -coalescence according to $\Lambda_{k}(\mathrm{MC}_{k}(\eta))$ in state $k$ , provided the rate- $c_{k}$ clock does not ring first.

The dual chain after $j$ steps gives the expression $E_{\theta}[\langle M^{(j)}_{0},f\rangle^{n}](\omega)$ , which in the limit as $j\to\infty$ equals $\int^{1}_{0}\Pi_{\eta}\nu_{\theta}(\omega)(\mathrm{d}x)\,\langle x_{\eta},f\rangle^{n}$ by the definition of $\nu_{\theta}(\omega)$ . The dual expectation is the expression $E[\langle\theta,f\rangle^{\mid\Pi_{\infty}^{\infty}\mid}](\omega)$ . It therefore suffices to show that the latter is obtained from the dual representation of $\nu^{N}_{\theta}(\omega)$ as $N\to\infty$ .

Remark 4.5.

What is the dual counterpart of Theorem 3.6? The connection between the renormalized system and the interaction chain on the level of the dual is as follows. Consider the dual process for the $j$ -level hierarchical system for finite $N$ , starting with $n$ partition elements at one site and letting $t\to\infty$ and $N\to\infty$ in the following way. Consider time scales $(t^{k}_{N})_{k\in\mathbb{N}_{0}}$ with $t^{k}_{N}/N^{k+1}\to 0$ and $t^{k}_{N}/N^{k}\to\infty$ as $N\to\infty$ . Then the coalescent reaches a partition $\Pi^{k+1}_{\infty}$ , with the remaining partition elements in $B_{k}$ uniformly distributed. After that move to the next time scale. Finally, first take $N\to\infty$ and then take

[TABLE]

By our scaling result in (3.25), this object gives us the dual process of the interaction chain at level $j$ .* $\square$ *

Step 2: $\nu^{N}_{\theta}$ . We return to the representation of $\nu^{N}_{\theta}$ , respectively, its marginal law at level [math]. The convergence of the dual chain for the Cannings process on $\Omega_{N}$ , and its limit as $t\to\infty$ followed by $N\to\infty$ to the dual chain of the interaction chain, will follow from the fact that the partitions become successively finer and hence converge to a limit partition, and the fact that the time scales for the random walk to reach distance $k$ separate as $N\to\infty$ . Since the monomials are convergence determining, this will yield the claim.

We have to show that the coalescent on $\Omega_{N}$ , starting with $n$ individuals at site [math], converges to a limit process as $t\to\infty$ , which we can investigate in the limit as $N\to\infty$ . We need to show that this process has the property that, when we consider the times where the coalescent makes jumps to the next larger block, we get an embedded Markov chain with index in $\mathbb{N}_{0}$ , describing the successive maximal jump sizes and values in partitions. The corresponding partition converges to $\Pi^{N}_{\infty}$ as the index $k$ tends to infinity. The claim is that, as $N\to\infty$ , this Markov chain converges to a birth process in the first component, which moves one step to the right, with Kingman-coalescence at rate $d_{k}$ and $\Lambda$ -coalescence according to $\Lambda_{k}(\mathrm{MC}_{k}(0))(\omega)$ , provided the rate- $c_{k}$ clock does not ring first. This gives $\Pi^{\infty}_{k}$ . As $k\to\infty$ we get (4.23). This is done in Dawson and Greven [DG96] for the Kingman coalescent, but the necessary modification is straightforward. ∎

5 Dichotomy: coexistence versus clustering

In this section, we prove Theorem 3.3. The question is whether $\mathfrak{C}^{(G)}(\omega)$ , the hierarchical coalescent in the environment $\omega$ defined in (4.13), converges to a single labelled partition element as $t\to\infty$ with probability one. To answer this question, we have to investigate whether two tagged partition elements coalesce with probability one or not. Recall that, by the projective property of the coalescent, we may focus on the subsystem of just two dual individuals, because this translates into the same dichotomy for $\mathfrak{C}_{n}^{(G)}(\omega)$ for any $n\in\mathbb{N}$ , and hence for the entrance law starting from $n$ partition elements. However, there is additional reshuffling at all higher levels, which is triggered by a corresponding block-coalescence event. Therefore, we need to consider two coalescing random walks with slightly adapted migration coefficients, lacking in particular the random walk property.

Recall the notation introduced in Sections 2.1.1–2.1.3 and 2.2.1. Recall that $\mathrm{MC}_{k}(\eta)$ is the unique site at height $k$ above $\eta\in\Omega_{N}$ (see Fig. 3). Consider two independent copies

[TABLE]

of the hierarchical random walk on $\Omega_{N}$ in the environment $\omega$ with migration coefficients given by (4.7) and coalescence rates given by (4.8). Write $P^{\omega},P^{\omega,\prime}$ for the marginal laws of $Y(\omega),Y^{\prime}(\omega)$ and $\bar{P}^{\omega}=P^{\omega}\times P^{\omega,\prime}$ for the joint law of the pair $\bar{Y}(\omega)=(Y(\omega),Y^{\prime}(\omega))$ . Consider the time- $t$ accumulated hazard for coalescence:

[TABLE]

where we use that $\mathrm{MC}_{k}(\eta)=\mathrm{MC}_{k}(\eta^{\prime})$ when $d_{\Omega_{N}}(\eta,\eta^{\prime})\leq k$ . The rate $N^{-2k}$ to choose a $k$ -block for the coalescence is multiplied by $N^{k}$ because all partition elements in that block can trigger a coalescence event, which explains the factor $N^{-k}$ in (5.2).

Let $\lim_{t\to\infty}H_{N}(\omega;t)=H_{N}(\omega;\infty)$ . We have coalescence of the two random walks (“common ancestor”) with probability 1 when $H_{N}(\omega;\infty)=\infty$ $\bar{P}^{\omega}$ -a.s., but separation of the two random walks (“different ancestors”) with positive probability when $H_{N}(\omega;\infty)<\infty$ $\bar{P}^{\omega}$ -a.s. In Section 5.1 we identify the dichotomy for the mean hazard $\bar{E}^{\omega}[H_{N}(\omega;\infty)]$ combining Fouries analysis with potential theory of reversible Markov chains to handle the fact that our migration is no longer a random walk. In Section 5.2 we use a zero-one law to show that the same dichotomy holds for the hazard $H_{N}(\omega;\infty)$ .

5.1 Mean hazard

Lemma 5.1.

For every $N\in\mathbb{N}\backslash\{1\}$ and $\mathbb{P}$ -a.e. $\omega$ ,

[TABLE]

$\square$ **

Proof.

Write

[TABLE]

to denote the time- $t$ transition kernel. Let

[TABLE]

denote the Green function for $\bar{Y}(\omega)$ . Then (5.2) gives

[TABLE]

(recall (2.28)). The proof comes in two steps. In Step 1, we pretend that the $\omega$ -dependent term in the right-hand side of (4.7) is replaced by its mean, i.e., the two hierarchical random walks are homogenous with migration coefficients $\bar{c}_{k}$ given by

[TABLE]

and show that the same dichotomy as in (5.3) holds. In Step 2, we explain why this replacement does not affect the dichotomy. The Green function of the two homogeneous hierarchical random walks will be denoted by $G((0,0),(\eta,\eta^{\prime}))$ .

Step 1.

In what follows, we use the explicit form of the transition kernel $p_{t}(\eta,\zeta)$ , $\eta,\zeta\in\Omega_{N}$ , for the homogeneous hierarchical random walk computed in Dawson, Gorostiza and Wakolbinger [DGW05] with the help of Fourier analysis. Namely,

[TABLE]

where

[TABLE]

and

[TABLE]

with

[TABLE]

where $D(N)$ is the normalizing constant such that $\sum_{j\in\mathbb{N}}r_{j}(N)=1$ . Note that the expressions in (5.10)–(5.11) simplify considerably in the limit as $N\to\infty$ , namely, the term with $i=j$ dominates and

[TABLE]

Also note that, because of (2.7) and (2.13), the following holds:

[TABLE]

To compute the sum in (5.6), we need to distinguish two cases: (1) $\xi=0^{k}\in\Omega_{N}^{(k)}$ , the unique site in $\Omega_{N}^{\mathbb{T}}$ at height $k$ above $0\in\Omega_{N}$ ; (2) $\xi\in\Omega_{N}^{(k)}\backslash\{0^{k}\}$ .

(1)

$\xi=0^{k}$ . Write

[TABLE]

where $\eta^{(p)}$ is any site in $\Omega_{N}$ such that $d_{\Omega_{N}}(0,\eta^{(p)})=p$ , and

[TABLE]

With the help of (5.8) we obtain

[TABLE]

Inserting (5.9) and (5.12), we get

[TABLE]

where the asymptotics comes from the terms with $m=p+1$ and $n=q+1$ . Combining (5.14–5.17), we obtain

[TABLE]

where the asymptotics comes from the term with $p=k$ .

(2)

$\xi\in\Omega_{N}^{(k)}\backslash\{0^{k}\}$ . Now $p_{t}(0,\eta)$ is the same for all $\eta\in B_{|\xi|}(\xi)$ , and so we have

[TABLE]

where we use (5.16–5.17) with $p=q=k$ , and $d_{\Omega^{(k)}_{N}}$ denotes the distance within $\Omega_{N}^{(k)}$ .

Combining (4.8), (5.6), (5.18)–(5.19), we arrive at

[TABLE]

where we abbreviate

[TABLE]

Now, by (2.35) we have, for some $C<\infty$ ,

[TABLE]

Because $\{\rho^{\xi}(\omega)\colon\,\xi\in\Omega_{N}^{\mathbb{T}}\}$ is stationary, ergodic and tail trivial (recall (2.36)), it follows from a standard second-moment estimate that the sum in the right-hand of (5.20) is infinite if and only if its expectation w.r.t. $\mathbb{P}$ is infinite. Since

[TABLE]

we get the claim in (5.3) for the hierarchical random walk with homogeneous migration coefficients $\bar{c}_{k}(N)$ defined in (5.7) (the factor $\tfrac{1}{2}$ is harmless for the convergence or divergence of the right-hand side of (5.23)).

Step 2.

It remains to show that the same dichotomy holds for the coefficients in (4.7) rather than (5.7). We start with the observation that the hierarchical random walk in random environment is symmetric and therefore is reversible with respect to the Haar measure on $\Omega_{N}$ . We have the representation (see Bovier and den Hollander [BH15, Chapter 7])

[TABLE]

where $a^{\omega}((a,b))=\sum_{(c,d)}a^{\omega}((a,b),(c,d))$ is the total rate at which the random walk jumps out of $(a,b)$ , and

[TABLE]

are the first hitting time, respectively, the first return time of $(a,b)$ . The point of (5.24) is that both the numerator and the denominator can be controlled with the help of the Dirichlet Principle, as follows.

Let

[TABLE]

be the Dirichlet form associated with the two random walks in random environment. By classical potential theory, the escape probability in the denominator of (5.24) is given by the capacity of the pair $(\eta,\eta^{\prime})$ and $\infty$ ,

[TABLE]

where $f(\infty)=0$ stands for $\lim_{(\eta,\eta^{\prime})\to\infty}f((\eta,\eta^{\prime}))=0$ with $(\eta,\eta^{\prime})\to\infty$ short hand for $d_{\Omega_{N}}(0,\eta)+d_{\Omega_{N}}(0,\eta^{\prime})\to\infty$ (recall (2.2)). The hitting probability in the numerator of (5.24) can also be expressed in terms of capacities after we use a renewal argument. Write

[TABLE]

We have

[TABLE]

Moreover,

[TABLE]

The first term equals $\mathrm{cap}^{\omega}((0,0),\infty)$ , while the second term is bounded from above by $P^{\omega}_{(0,0)}(\tau_{(\eta,\eta^{\prime})}<\infty)$ , which tends to zero as $(\eta,\eta^{\prime})\to\infty$ when $G^{\omega}<\infty$ , i.e., when the random walk in random environment is transient. Below we will show that, under assumption (2.37), $G^{\omega}<\infty$ * if and only if $G<\infty$ *.

We are now ready to explain why the estimates in Step 1 carry over. The transition rates of the random walk in random environment are given by

[TABLE]

where $a^{\omega,(N)}$ is the transition kernel in (2.5), but with $c_{k}$ replaced by $c_{k}(\omega)(N,\eta)$ in (4.7):

[TABLE]

By (4.8), we have $\lambda^{\mathrm{MC}_{k}(\eta)}(\omega)=\lambda_{k}\rho^{\mathrm{MC}_{k}(\eta)}(\omega)$ . Assumption (2.37) implies $\delta\leq\lambda^{\mathrm{MC}_{k}(\eta)}(\omega)/\lambda_{k}$ $\leq\delta^{-1}$ for all $k\in\mathbb{N}_{0}$ , $\eta\in\Omega_{N}$ and $\mathbb{P}$ -a.e. $\omega$ , which in turn implies

[TABLE]

where $a$ is the transition kernel in (2.5), but with $c_{k}$ replaced by $\bar{c}_{k}(N)$ in (5.7). Inserting these bounds into the formulas for the capacities in (5.27) and (5.29), and recalling (5.24), we see that

[TABLE]

This shows that the Green function for the random walk in random environment is comparable to the Green function of the homogeneous random walk. Hence the argument in Step 1 carries over.

Note that $P^{\omega}_{(0,0)}(\hat{\tau}_{(0,0)}=\infty)$ in (5.30) is comparable to $P_{(0,0)}(\hat{\tau}_{(0,0)}=\infty)$ , which is a strictly positive constant when $G<\infty$ . Consequently, by the observation made below (5.30), also $P^{\omega}_{(0,0)}(\tau_{(\eta,\eta^{\prime})}=\hat{\tau}_{(0,0)}=\infty)$ in (5.30) is comparable to $P_{(0,0)}(\tau_{(\eta,\eta^{\prime})}=\hat{\tau}_{(0,0)}=\infty)$ when $G,G^{\omega}<\infty$ .

It remains to show that, under assumption (2.37), $G^{\omega}<\infty$ if and only if $G<\infty$ . This is easy. Indeed, if $G<\infty$ , then $P_{(0,0)}(\hat{\tau}_{(0,0)}=\infty)>0$ , hence $P^{\omega}_{(0,0)}(\hat{\tau}_{(0,0)}=\infty)>0$ , and hence $G^{\omega}<\infty$ by (5.24) because $P^{\omega}_{(0,0)}(\tau_{(\eta,\eta^{\prime})}<\infty)\leq 1$ . Conversely, if $G=\infty$ , then $P_{(0,0)}(\hat{\tau}_{(0,0)}=\infty)=0$ , hence $P^{\omega}_{(0,0)}(\hat{\tau}_{(0,0)}=\infty)=0$ , and hence $G^{\omega}=\infty$ by (5.24) because $P^{\omega}_{(0,0)}(\tau_{(\eta,\eta^{\prime})}<\infty)>0$ . $\square$ ∎

5.2 Zero-one law

To conclude the proof of the dichotomy in Theorem 3.3, we use the following zero-one law.

Lemma 5.2 (Zero-one law).

For every $N\in\mathbb{N}\backslash\{1\}$ and $\mathbb{P}$ -a.e. $\omega$ , $H_{N}(\omega;\infty)=\infty$ if and only if $\bar{E}^{\omega}[H_{N}(\omega;\infty)]=\infty$ . $\square$

Proof.

The proof comes in five steps.

Step 1.

For $M,N\in\mathbb{N}$ , let $H_{N}^{(M)}(\omega;\infty)$ denote the truncation of $H_{N}(\omega;\infty)$ obtained by setting $\lambda_{k}=0$ for $k>M$ (no resampling in blocks of hierarchical size larger than $M$ ). The key to the proof is the following second-moment estimate:

[TABLE]

Before proving (5.35), we complete the proof of Theorem 3.3. By Cauchy-Schwarz, for any non-negative random variable $V$ we have

[TABLE]

Picking $V=H_{N}^{(M)}(\omega;\infty)/\bar{E}^{\omega}[H_{N}^{(M)}(\omega;\infty)]$ in (5.36) and using (5.35), we obtain

[TABLE]

Since $H_{N}^{(M)}(\omega;\infty)\leq H_{N}(\omega;\infty)$ and the lower bound in (5.37) is uniform in $M$ and $N$ , it follows that if $\bar{E}^{\omega}[H_{N}(\omega;\infty)]=\lim_{M\to\infty}\bar{E}^{\omega}[H_{N}^{(M)}(\omega;\infty)]=\infty$ , then $\bar{P}^{\omega}(H_{N}(\omega;\infty)=\infty)\geq 1/C$ . By (5.2), $\{\omega\colon\,H_{N}(\omega;\infty)=\infty\}$ is an element of the sigma-algebra at infinity defined in (2.36), which is trivial. The latter event therefore has probability either 0 or 1, and since it has positive probability we get the claim.

Step 2.

Write out (recall (5.2))

[TABLE]

In what follows, we consider the hierarchical random walk with homogeneous migration coefficients $\bar{c}_{k}$ defined in (5.7). In Step 4 we incorporate the $\omega$ -dependence.

Use symmetry to replace $\sum_{k,l=0}^{M}$ by $2\sum_{k,l=0}^{M}1_{\{k<l\}}+\sum_{k,l=0}^{M}1_{\{k=l\}}$ . Due to the ultrametricity of the hierarchical distance and the isotropy of the hierarchical random walk, we have $G((\eta,\eta^{\prime}),(\zeta,\zeta^{\prime}))=G((0,0),(\zeta,\zeta^{\prime}))$ for all $\eta,\eta^{\prime}\in B_{|\xi|}(\xi)$ in the following three cases (where $\xi<\xi^{\prime}$ means that $\xi^{\prime}$ is an ancestor of $\xi$ ):

[TABLE]

Therefore we have

[TABLE]

with $R$ a correction term given by

[TABLE]

If $R$ would be absent from (5.39), then we would have proved (5.35) with $C=2$ . Thus, it remains to show that $R$ can only raise the constant. We will do this by showing that $R\leq O(N^{-2})\,\bar{E}[H_{N}^{(M)}(\omega;\infty)]^{2}$ as $N\to\infty$ , uniformly in $M$ , and by appealing to the observation made in (5.13).

Step 3.

By translation invariance, $G((\eta,\eta^{\prime}),(\zeta,\zeta^{\prime}))=G((0,0),(\zeta-\eta,\zeta^{\prime}-\eta^{\prime}))$ . By isotropy, $\sum_{\zeta,\zeta^{\prime}\in B_{|\xi|}(\xi)}G((0,0),(\zeta-\eta,\zeta^{\prime}-\eta^{\prime}))=\sum_{\zeta,\zeta^{\prime}\in B_{|\xi|}(\xi)}G((0,0),(\zeta,\zeta^{\prime}))$ for all $\eta,\eta^{\prime}\in B_{|\xi|}(\xi)$ . Hence, in the first sum in (5.40) the term with $\zeta,\zeta^{\prime}\in B_{|\xi|}(\xi)$ vanishes, while the second sum in (5.40) vanishes altogether, and so $R$ simplifies to

[TABLE]

By isotropy, $\sum_{\zeta\in B_{|\xi|}(\xi)}G((0,0),(\zeta-\eta,\zeta^{\prime}-\eta^{\prime}))=\sum_{\zeta\in B_{|\xi|}(\xi)}G((0,0),(\zeta,\zeta^{\prime}-\zeta))$ for all $\eta,\eta^{\prime}\in B_{|\xi|}(\xi)$ when $\zeta^{\prime}\in B_{|\xi^{\prime}|}(\xi^{\prime})\backslash B_{|\xi|}(\xi)$ , and so $R$ simplifies further to

[TABLE]

If $0\in B_{|\xi^{\prime}|}(\xi^{\prime})$ , then $B_{l}(0)=B_{|\xi^{\prime}|}(\xi^{\prime})$ , in which case the term between brackets equals

[TABLE]

If also $0\in B_{|\xi|}(\xi)\subset B_{|\xi^{\prime}|}(\xi^{\prime})$ , then also $B_{k}(0)=B_{|\xi|}(\xi)$ , in which case the latter difference vanishes. Hence we obtain the bound

[TABLE]

The sums over $\eta,\eta^{\prime}$ and $\zeta,\bar{\zeta}$ can be computed with the help of (5.16). Recalling (5.17)–(5.19), we obtain

[TABLE]

with $d(\xi)=d_{\Omega_{N}^{(k)}}(0^{k},\xi)$ and

[TABLE]

with $d^{\prime}(\xi^{\prime})=d_{\Omega_{N}^{(l)}}(0^{l},\xi^{\prime})$ and $d^{\prime\prime}(\bar{\zeta})=d_{\Omega_{N}}(0,\bar{\zeta})$ . Here we use that $\xi\neq 0^{k}$ when $0\nleq\xi$ and $\xi^{\prime}\neq 0^{l}$ when $0\nleq\xi^{\prime}$ , and also that $l+d^{\prime}(\xi^{\prime})>d^{\prime\prime}(\bar{\zeta})$ for all $\bar{\zeta}\in B_{l}(0)$ . Inserting (5.45)–(5.46) into (5.44), we get

[TABLE]

Step 4.

If $0\nleq\xi$ , then $d(\xi)\in\mathbb{N}$ . Hence the first part of (5.47) equals $8\,[1+o(1)]$ times

[TABLE]

where we recall (4.8) and write $\xi^{l-k}$ to denote the ancestor of $\xi$ at height $l$ . Because $\{\rho^{\xi}(\omega)\colon\,\xi\in\Omega_{N}^{\mathbb{T}}\}$ is stationary, ergodic and tail trivial (recall (2.36)), the last sum scales as $\sim N^{d}\mathbb{E}[\rho^{0^{k}}(\omega)\rho^{0^{l}}(\omega)]$ , where the expectation is finite because of (2.35). Hence (5.48) is

[TABLE]

The last sum scales as $\sim 1/N\bar{c}_{k+1}(N)^{2}$ , and so (5.48) is

[TABLE]

where the equality follows from (5.20) with $k,l$ truncated at $M$ .

If $0\nleq\xi^{\prime}$ , then $d^{\prime}(\xi^{\prime})\in\mathbb{N}$ and $d(\xi)=l-k+d^{\prime}(\xi^{\prime})$ . Hence the second part of (5.47) equals $8\,[1+o(1)]$ times

[TABLE]

The last sum is $\leq C[1+o(1)]N^{l-k+d^{\prime}}$ . Hence (5.51) is

[TABLE]

The last sum scales as $\sim 1/N\bar{c}_{l+1}(N)\bar{c}_{l}(N)$ , and so (5.51) is

[TABLE]

Step 5.

We can again use (5.34) to show that the proof carries over to the random walk in random environment. ∎

Lemmas 5.1–5.2 combine to yield Theorem 3.3 (recall the discussion at the beginning of this section).

6 Multi-scale analysis

In this section we prove Theorem 3.6. We first consider a mean-field system, i.e., the geographic space is $G=\{1,\ldots,N\}$ with $N\to\infty$ . In Section 6.1 we look at this system on time scale $t$ (on which the single components evolve) and on time scale $Nt$ (on which the block average evolves). In Section 6.2 we use the results to analyze the system on $\Omega_{N}$ as $N\to\infty$ . Our general strategy runs parallel to that in [GdHKK14] for the homogeneous model. We only point out which new issues arise. Thus, this section is not autonomous, the principal steps of the arguments are given but not all formulas are repeated, and for an understanding of the fine details the reader must check the relevant passages in [GdHKK14].

6.1 The mean-field finite-system scheme

As geographic space and transition kernel we take

[TABLE]

As migration rate we take $c_{0}$ , and as resampling measures

[TABLE]

with total masses $\rho^{i}=\chi^{i}((0,1])$ . We assume that $(\chi^{i})_{i\in\mathbb{N}}$ is stationary and ergodic such that $\varrho^{i}$ has mean $1$ . We also allow a component with Fleming-Viot resampling at rate $d_{0}$ . The corresponding stochastic system is denoted by $(Z^{(N)}(t))_{t\geq 0}$ with $Z^{(N)}(t)=(Z_{1}^{(N)}(t),\ldots,Z^{(N)}_{N}(t))$ .

We consider time scales $t$ and $Nt$ for the components, respectively, the block average:

[TABLE]

Theorem 6.1 ([Mean-field finite-system scheme).

Suppose that the initial state is i.i.d. with mean measure $\theta\in\mathcal{P}(E)$ . Then

[TABLE]

and

[TABLE]

where $(Z_{\theta}^{c,d,\Lambda}(t))_{t\geq 0}$ is the McKean-Vlasov process defined in Section 3.2.1. $\square$

Proof.

We follow [GdHKK14, Section 6]. The proof of (6.4) carries over in a straightforward way. In the proof of (6.5) a new issue arises: the increasing process of the limit process incorporates an additional averaging over the random environment controlling the resampling for the single components. This is handled as follows.

Calculate the generator for a polynomial of $\bar{Z}^{(N)}(t)$ , namely, a function $F$ of the form

[TABLE]

applied to a $z\in E$ of the form $z=\frac{1}{N}\sum_{i=1}^{N}z_{i}$ . This expression can be expanded in terms of sums of products of monomials of single components. The action of the generator was calculated and analysed in [GdHKK14, Section 6]. We can argue in the same way with the following changes. In the action of the generator, integrals are taken with respect to the random sequence of resampling measures $(\Lambda^{i})_{i\in\Omega}$ rather than a fixed resampling measure $\Lambda$ . This entails that for the block average we get a sum of terms where the random sequence $(\rho^{i})_{i\in\Omega}$ appears as weights. This in turn requires us to change the definition of the set of configurations on which the generator converges in the limit as $N\to\infty$ (see [GdHKK14, Eq. (6.41)–(6.42)]) as follows.

Let $\mathbb{B}^{\ast}$ be the set of $\underline{x}=(x_{i})_{i\in\Omega}\in\mathcal{P}(E)^{\mathbb{N}}$ with

[TABLE]

where

[TABLE]

In order to calculate the sum of the resampling operators as in [GdHKK14, Eq. (6.46)], we have to account for the presence of $\chi^{i}$ , $i\in\Omega$ , and invoke the law of large numbers for the expression in the variance formula, namely, $2c_{0}/(2c_{0}+\lambda_{0}\rho^{i}+2d_{0})$ , $i\in\Omega$ . We write the latter as

[TABLE]

The expressions appearing in the generator, which are averages of local functions of the configuration and their shifts to any of the $N$ locations, result in the same expression as the one we obtain by using (6.9) averaged over $i\in\Omega$ . In the limit as $N\to\infty$ this leads to the recursion formula in (3.17) for $k=0$ . With these changes, the argument runs as in the case of the homogeneous environment. ∎

6.2 The hierarchical mean-field limit

In this section we prove the results claimed in Section 3.2.3. The strategy of the proof is to approximate our system with infinitely many hierarchies of components and time scales by systems with finitely many hierarchies of components and time scales, uniformly in $N$ . The latter are analyzed by using the multiscale analysis of the mean-field system. In Section 6.2.1 we consider 2-level systems with $N^{2}$ components, in Section 6.2.2 $k$ -level systems with $N^{k}$ components, and in Section 6.2.3 we pass to the limit $k\to\infty$ of infinitely many hierarchies. Along the way we make frequent reference to Dawson, Greven and Vaillancourt [DGV95] and the work on the homogeneous version of the model in [GdHKK14].

6.2.1 The $2$ -level system on 3 time scales

The geographic space is $G_{N,2}=\{0,1,\ldots,N-1\}^{2}=G_{N,1}^{2}$ . We pick $d_{0}>0$ , $c_{0},c_{1},\mu_{0},\mu_{1}>0$ and $c_{k},\mu_{k}=0$ for $k\geq 2$ . We choose the random environment that is obtained by restricting the random environment of Section 2 to the subtree corresponding to the 2-block around 0. We show that, on time scales $t$ and $Nt$ , we obtain the same limiting objects as described in Section 6.1, but with additional volatility and block resampling.

For the 1-block averages we use the notation

[TABLE]

and for the 2-block average (= total average)

[TABLE]

Proposition 6.2 ([Two-level rescaling).

Under the above assumptions,

[TABLE]

with

[TABLE]

$\square$ **

To prove the above results in the homogeneous environment, we used uniform estimates for higher-order perturbations of generators. These no longer hold in the random environment, due to the unboundedness of the random resampling rates $\rho^{(\cdot,\eta)}(\omega)$ . (There is no problem under assumption (2.37), and the proof carries over from [GdHKK14].)

To handle this problem we first consider the system where the coefficients $\lambda^{\mathrm{MC}_{k}(\cdot,\eta)}(\omega)$ , $k=1,2$ , are truncated at level $M<\infty$ . For this system we show, with the help of a coupling argument, that on time scale $N^{k}t$ , $k=1,2$ , and averaged over the random environment and the dynamics, the effect of the truncation goes to zero as $M\to\infty$ . The same holds for the limiting objects, so that we get the claim by using the existence of the expectation in combination with the stationarity of $\omega$ .

To get tightness of the approximating sequence of processes, as in [GdHKK14, Eq. (7.52), p. 117], we use the fact that the laws conditioned on the environment $\omega$ of the averages in (6.10)–(6.11) are tight. To prove the latter, we use the criterion of Joffe and Metivier in the form as given in Dawson [Daw93, p. 55], observing that $\chi^{\mathrm{MC}_{k}(\cdot,\eta)}(\omega)$ , $\eta\in G_{N,1}$ , $k=1,2$ , are integrable uniformly in $N$ . To check the criterion, we observe that we can code the information on the random environment into the initial condition of the process. With this observation, the proof works as for the homogeneous environment.

6.2.2 The $k$ -level system on $k+1$ time scales

The reasoning addresses the same points raised above and runs otherwise exactly as in [GdHKK14, Section 7.2].

6.2.3 The infinite-level system on infinitely many time scales

The problem is again the extension of the uniform perturbation arguments, which have to be adapted to guarantee that cutting off higher hierarchical levels leads to an approximation by finite systems, for which we can apply the reasoning in the previous section, on the relevant time scales. To get the necessary arguments and estimates we refer the reader to the material in [GdHKK14, Sections 8.1–8.2].

The argument used for the homogeneous environment to obtain uniforms bounds does not apply because the perturbation of the migration and the resampling coming from the hierarchical levels $\geq k+1$ is unbounded. However, the perturbation terms can be stochastically bounded by a random variable that has a finite expectation over the random environment. Again, it suffices to show with the help of a coupling argument that the stochastic dynamics with $k$ hierarchical levels approximates the infinite stochastic dynamics on time scales $tN^{l}$ with $0\leq l\leq k$ . Apart from that the argument is the same.

6.3 Dichotomy in the hierarchical mean-field limit

In this section we prove Theorem 3.13. First, we argue that the entrance law exists, a fact that was established in Dawson, Greven and Vaillancourt [DGV95][Section 6(a), Proposition 6.2] for the Fleming-Viot model, based on a variance estimate and the convergence of the sum in the coexistence criterion. The argument from that paper carries over despite the $\omega$ -dependence of the transition kernels of the interaction chain (read this of from (6.15) and (6.17) below).

Next, we argue that the dichotomy holds. Here, we again follow the strategy for the homogeneous environment by calculating the variance of $\langle M^{(j)}_{\eta,0},f\rangle$ for every $\eta\in\Omega_{\infty}$ and $f\in C_{b}(E,\mathbb{R})$ and showing that as $j\to\infty$ this variance converges to zero, respectively, remains positive, depending on whether the sum in (3.30) is infinite or finite.

The variance formula reads

[TABLE]

Consequently, by iteration,

[TABLE]

where $\underline{d}=(d_{k})_{k\in\mathbb{N}_{0}}$ is determined by the recursion relation in (3.17). Taking logarithms, we see that the product tends to a positive limit as $j\to\infty$ if and only if

[TABLE]

By assumptions (2.35)–(2.36), the sum converges $\omega$ -a.s. if and only if

[TABLE]

Indeed, the variance of the sum in (6.16) equals the variance of the $\rho$ -field times $\sum_{k\in\mathbb{N}_{0}}(\frac{\mu_{k}}{c_{k}})^{2}$ , and the latter is bounded from above by the square of the average of the sum. As shown in [GdHKK14, Theorem 3.7(c)], the criterion in (6.17) is the same as the criterion in (3.30).

7 The orbit of the renormalization transformations

In Section 7.1 we show the ordering in Theorem 3.10. In Sections 7.2–7.3, we derive the scaling behaviour in Theorems 3.15–3.17.

7.1 Random environment lowers the volatility

Proof of Theorem 3.10.

Recall the notation introduced in Section 2.2. Fix $\underline{c}$ and $\underline{\lambda}$ . Let $\underline{d}$ be the solution of the recursion relation in (3.17). Let $\underline{d}^{0},\underline{d}^{1}$ be the solutions when $\mathcal{L}_{\rho}$ is replaced by $\delta_{0},\delta_{1}$ (recall that $\rho$ has mean 1 under $\mathcal{L}_{\rho}$ ). As initial values take $d^{0}_{0}\leq d_{0}\leq d^{1}_{0}$ . We use induction on $k$ to show that $d^{0}_{k}<d_{k}<d^{1}_{k}$ for all $k\in\mathbb{N}$ .

Define (see Fig. 4)

[TABLE]

Because $a\mapsto c_{k}(\mu_{k}a+x)/[c_{k}+(\mu_{k}a+x)]$ is strictly increasing and strictly concave on $[0,\infty)$ for all $x\in[0,\infty)$ , it follows that $f^{0}_{k}(x)<f_{k}(x)<f^{1}_{k}(x)$ for all $x\in[0,\infty)$ . Hence, if $d^{0}_{k}\leq d_{k}\leq d^{1}_{k}$ , then $d^{0}_{k+1}=f^{0}_{k}(d^{0}_{k})<f_{k}(d^{0}_{k})\leq f_{k}(d_{k})=d_{k+1}$ and $d_{k+1}=f_{k}(d_{k})<f^{1}_{k}(d_{k})\leq f^{1}_{k}(d^{*}_{k})=d^{1}_{k+1}$ . ∎

The same argument proves the claim made in Section 3.3 that $M<M^{*}$ for the fixed points of (3.35) (random environment) and its analogue with $\mathcal{L}_{\rho}$ replaced by $\delta_{1}$ (average environment).

7.2 Scaling of the volatility: polynomial coefficients

Proof of Theorem 3.15.

We look at each of the four parameter regimes separately. Recall (3.33)–(3.34).

(a) Let $K_{k}=\mu_{k}/c_{k-1}$ , $R_{k}=c_{k}/c_{k-1}$ and $\mho_{k}=d_{k}/c_{k-1}$ . Rewrite (3.17) as

[TABLE]

Since $g_{k}$ is non-decreasing on $[0,\infty)$ , we have the sandwich

[TABLE]

We are in the regime where $\lim_{k\to\infty}K_{k}=K=\infty$ and $\lim_{k\to\infty}R_{k}=R=1$ . Hence $\lim_{k\to\infty}g_{k}(0)=1$ , and so (7.3) yields $\lim_{k\to\infty}d_{k}/c_{k}=\lim_{k\to\infty}\mho_{k}/R_{k}=1/R=1$ .

(b) Again use (7.2). We are in the regime where $\lim_{k\to\infty}K_{k}=K\in(0,\infty)$ and $\lim_{k\to\infty}R_{k}=R=1$ . Hence, we see that $g_{k}$ converges point-wise to $g$ given by

[TABLE]

Both $g$ and $g_{k}$ are strictly increasing and strictly concave on $[0,\infty)$ , with $g([0,\infty])\subseteq[0,1]$ and $g_{k}([0,\infty])\subseteq[0,1]$ , with unique attracting fixed points $M\in(0,1)$ and $M_{k}\in(0,1)$ , and with $M$ the solution of (3.35). To show that $\lim_{k\to\infty}\mho_{k}=M$ , we need two facts.

Lemma 7.1.

Let $s_{k}=\sup_{x\in[0,1]}|g_{k}(x)-g(x)|$ . Then $\lim_{k\to\infty}s_{k}=0$ . $\square$

Proof.

Estimate

[TABLE]

This gives

[TABLE]

Let $k\to\infty$ to get the claim. ∎

Lemma 7.2.

Function $g$ is a strict contraction around $M$ , i.e., there exists a $\beta\in(0,1)$ such that $\sup_{x\in[0,\infty)}(g(x)-M)/(x-M)=\beta$ . $\square$

Proof.

Consider the linear function $L(x)=g(0)+[1-\frac{g(0)}{M}]x$ , $x\in[0,\infty)$ , which satisfies $L(0)=g(0)$ and $L(M)=M=g(M)$ (see Fig. 5). Note that $g\geq L$ on $[0,M]$ while $g\leq L$ on $[M,\infty)$ . Hence, we have

[TABLE]

Since $g(0)>0$ , we get the claim with $\beta=1-\frac{g(0)}{M}$ . ∎

We can now complete the proof as follows. Let $\Delta_{k}=|\mho_{k}-M|$ . Then

[TABLE]

Iteration yields

[TABLE]

It follows from Lemma 7.1–7.2 that $\lim_{k\to\infty}\Delta_{k}=0$ . Hence $\lim_{k\to\infty}d_{k}/c_{k}=\lim_{k\to\infty}\mho_{k}/R_{k}=M/R=M$ .

(c–d) Like in Case (a), the scaling turns out to be the same as for the average environment. The proof is based on a comparison between the recursions for the random environment and the average environment (last two items in (7.1)). The key idea is the following lemma, which can be viewed as a stability property.

Lemma 7.3.

Let $d_{0}=d^{1}_{0}$ . Then, the solution of the recursion $d_{k+1}=f_{k}(d_{k})$ , $k\in\mathbb{N}_{0}$ , is the same as the solution of the recursion $d^{1}_{k+1}=f^{1}_{k}(d^{1}_{k})$ , $k\in\mathbb{N}_{0}$ , when in the latter recursion the coefficient $\mu_{k}$ is replaced by $\mu_{k}r_{k}$ with

[TABLE]

$\square$ **

Proof.

Check that

[TABLE]

and use induction on $k$ . ∎

Since $\rho\mapsto c_{k}/[c_{k}(1+K_{k}\rho)+d_{k}]$ is non-increasing, we have $N_{k}\leq D_{k}\mathbb{E}_{\mathcal{L}_{\rho}}[\rho]=D_{k}$ , and so $r_{k}\leq 1$ . The following result shows that $r_{k}$ tends to 1 as $k\to\infty$ in Cases (c) and (d).

Lemma 7.4.

If $\lim_{k\to\infty}K_{k}=K=0$ , then $\lim_{k\to\infty}r_{k}=1$ . $\square$

Proof.

For any $C\in(0,\infty)$ , we may estimate

[TABLE]

Since $\lim_{k\to\infty}K_{k}=0$ , we have $\lim_{k\to\infty}(c_{k}+d_{k})/[c_{k}(1+K_{k}C)+d_{k}]=1$ , and hence

[TABLE]

Now let $C\to\infty$ and use that $\lim_{C\to\infty}\mathbb{E}_{\mathcal{L}_{\rho}}[\rho\,1_{\{\rho\leq C\}}]=\mathbb{E}_{\mathcal{L}_{\rho}}[\rho]=1$ by monotone convergence. ∎

Lemma 7.3 implies that the scaling of $d_{k}$ is the same as the scaling of $d^{1}_{k}$ after $\mu_{k}$ is replaced by $\mu_{k}r_{k}$ . But the latter scaling was derived in [GdHKK14], and a glance at the results for Cases (c) and (d) obtained there shows that the scaling is unaffected by the extra factor $r_{k}$ because of Lemma 7.4. ∎

A technical remark is in order, for which we refer the reader to [GdHKK14, Section 11.3]. We have assumed that $k\mapsto\mu_{k}$ is regularly varying at infinity (recall (3.33)). Because $\lim_{k\to\infty}r_{k}=1$ , also $k\mapsto r_{k}\mu_{k}$ is regularly varying at infinity. Therefore, $(r_{k}\mu_{k})_{k\in\mathbb{N}_{0}}$ can be approximated from above and from below by sequences that have the same scaling behaviour but are smoothly varying, i.e., for all $n\in\mathbb{N}$ their $n$ -th order discrete differences are regularly varying as well. This approximation is harmless because the maps $\underline{c}\mapsto\underline{d}$ and $\underline{\mu}\mapsto\underline{d}$ are component-wise non-decreasing (a fact that is immediate from (3.17)), and so the approximating sequences provide a sandwich for the scaling. Now, if the tail exponent of $r_{k}\mu_{k}$ is non-integer, i.e., $b\notin\mathbb{N}$ in (3.33), then for all $n\in\mathbb{N}$ the $n$ -th order discrete differences are asymptotically monotone. This observation is important because it implies that certain sequences arising in [GdHKK14, Section 11.3] have summable variation, a property that is crucial for the proof of the scaling. If the tail exponent is integer, i.e., $b\in\mathbb{N}$ in (3.33), then the asymptotic monotonicity still holds for all $n\leq b$ , which turns out to be enough for the argument.

The extra regularity conditions on $L_{c},L_{\mu}$ in (3.33), which are stated in [GdHKK14, Eqs. (1.79)–(1.81)], need no modification: $(r_{k}\mu_{k})_{k\in\mathbb{N}_{0}}$ has the same slowly varying function $L_{\mu}$ as $(\mu_{k})_{k\in\mathbb{N}_{0}}$ .

7.3 Scaling of the volatility: exponential coefficients

Proof of Theorem 3.17.

We look at each of the five parameter regimes (= universality classes) separately. Recall (3.37–3.38).

(A)

Use (7.2). We are in the regime where $\lim_{k\to\infty}K_{k}=K=\infty$ and $\lim_{k\to\infty}R_{k}=c$ . The same argument as in the proof of Case (a) yields $\lim_{k\to\infty}d_{k}/c_{k}=\lim_{k\to\infty}\mho_{k}/R_{k}=1/c$ .

(B)

Let $\bar{K}_{k}=\bar{\mu}_{k}/\bar{c}_{k-1}$ and $\bar{R}_{k}=\bar{c}_{k}/\bar{c}_{k-1}$ . Then $K_{k}=c\bar{K}_{k}$ and $R_{k}=c\bar{R}_{k}$ by (3.37), and so (7.2) becomes

[TABLE]

We are in the regime where $\lim_{k\to\infty}\bar{K}_{k}=\bar{K}\in(0,\infty)$ and $\lim_{k\to\infty}\bar{R}_{k}=\bar{R}=1$ . The same argument as in Case (b) therefore yields $\lim_{k\to\infty}d_{k}/c_{k}=\lim_{k\to\infty}\mho_{k}/R_{k}=\bar{M}/c\bar{R}=\bar{M}/c$ with $\bar{M}$ the unique attracting fixed point of

[TABLE]

which is the analogue of (7.4).

(C1)

This case is the same as Case (B), but with $\bar{K}=0$ . The analogue of (7.15) reads $\bar{g}(x)=x/(c+x)$ . Since $\bar{g}$ has $\bar{M}=1-c\in(0,1)$ as unique attracting fixed point, we can copy the proof of Case (b) to get $\lim_{k\to\infty}d_{k}/c_{k}=\lim_{k\to\infty}\mho_{k}/R_{k}=(1-c)/c\bar{R}=(1-c)/c$ . Note: In the proof of Case (b) we used that $g(0)>0$ , which fails here. However, even when $d_{0}=0$ , the iterates $d_{k}$ , $k\in\mathbb{N}$ , are bounded away from [math] because the attracting fixed points of $f_{k}$ , $k\in\mathbb{N}$ , are bounded away from [math]. Hence we may restrict the entire argument to $[\epsilon,1]$ for some $\epsilon>0$ instead of $[0,1]$ , and use that $g(\epsilon)>0$ (recall Fig. 5).

(C2)

This case is like Case (c). Since $\bar{K}=0$ , we can copy the proof of Case (c) and show that the same scaling holds as in the average environment.

(C3)

This case is like Case (d). Since $\bar{K}=0$ , we can copy the proof of Case (d) and show that the same scaling holds as in the average environment. ∎

8 Identification of the universality classes of cluster formation

In this section we prove Theorem 3.19. In Section 8.1 we deal with cases (a), (A) and (b), (B), (C1), in Section 8.2 with cases (c), (C2) and (d), (C3). The strategy of proof is the same as for the homogeneous environment, except at a few points where the random environment comes into play seriously. We focus on the necessary modifications. Like Section 6, this section is not completely autonomous, and for an understanding of the fine details the reader must check the relevant passages in [GdHKK14].

Before we begin we recall why we may choose the starting configuration to be identically equal to $\theta$ , the mean of the starting configuration. The initial state and the environment of our Cannings process are such (recall Theorem 3.6) that the scaling limit in (3.22) yields on average $\theta$ on level $j+1$ .

8.1 Random cluster size

Proof of cases (b), (B), (C1), (C3)[first subcase].

In Step 1 we give the proof for an i.i.d. random environment. In Step 2 we extend the proof to a stationary and ergodic random environment.

Step 1.

We consider the set $\mathcal{M}_{f}([0,1])\times\mathcal{P}(E)$ , describing the environment and the state of a block. If the random environment is i.i.d., then the sequence

[TABLE]

is a time-inhomogeneous Markov chain. Let $(K^{\ast,(j)}_{\alpha})_{\alpha\in\mathbb{N}_{0}}$ be its sequence of transition kernels. (We suppress the index $\eta$ from $M^{(j)}_{\eta,-(j+1-\alpha)}$ because its law is the same for all $\eta\in\Omega_{N}$ .) It suffices to prove three properties:

(1)

The sequence of transition kernels $(K^{\ast,(j)}_{\alpha})_{\alpha\in\mathbb{N}_{0}}$ converges as $j\to\infty$ to the sequence $(K^{\ast,\infty}_{\alpha})_{\alpha\in\mathbb{N}_{0}}$ of transition kernels given by

[TABLE]

where $\widetilde{M},\widetilde{K}$ are defined in (3.47) and $(\chi_{\alpha}(\omega))_{\alpha\in\mathbb{N}_{0}}$ in (3.48).

(2)

The map

[TABLE]

is continuous.

(3)

The map

[TABLE]

is continuous.

Items (1) and (3) imply the convergence of the process in (8.1), while item (2) is needed in the proof item (1).

Proof of (1)–(3).

Here a key is the duality relation for the McKean-Vlasov limit process. This duality arises as a special case of our duality relation by choosing a suitable geographic space. This coalescent is obtained by taking as space $\{0,\ast\}$ , where the rates for all transitions in $\ast$ are zero (cemetery) state and jumps occur from [math] to $\ast$ at rate $c$ . Kingman coalescence occurs at rate $d$ and the $\Lambda$ -coalescence is given via $\Lambda$ (all as long as we are in [math]). For a detailed discussion, see [GdHKK14, Section 4].

With the help of duality we can identify the equilibrium measure $\nu_{\theta}^{c,d,K\chi}$ by using a measure-determining sequence of test functions. The parameters $c,d,\chi$ enter via the rate of jump to the cemetery state (parameter $c$ ), the rate of pairwise coalescence (parameter $d$ ), and the rate of coalescence (parameter $\chi$ ). In the latter, the ratio $\chi/\chi((0,1])$ determines the probability for partition elements to coalesce in groups ( $\Lambda$ -coalescence). In this equilibrium representation, the coalescent has run for infinite time.

(1) With $\mathcal{L}$ acting on $\chi_{j+1-\alpha}(\omega)$ , we have

[TABLE]

From Theorems (3.15) and (3.17), we know that $d_{j+1-\alpha}/c_{j+1-\alpha}$ and $K_{j+1-\alpha}$ converge to $\widetilde{M}$ and $\widetilde{K}$ as $j\to\infty$ . The point is to show for every $\alpha\in\mathbb{N}_{0}$ the equilibrium measure in the right-hand side converges as $j\to\infty$ . By the stationarity of the random environment, the law of $\chi_{j+1-\alpha}(\omega)$ is independent of $j$ . Hence (2) and (3) yield the claim.

(2) The continuity in (8.3) can be deduced from the dual representation in the McKean-Vlasov limit dynamic, in particular, from the fact that the coalescent has run for infinite time, and depends continuously on the migration rate $c$ and the Kingman coalescence rate $d$ , respectively, the rates for the $\Lambda$ -coalescence. The coalescent has a monotone decreasing number of partition elements off the cemetery where all rates are zero and reaches the cemetery state after a finite time. This means we have a Markov chain hitting a trap in finite time and therefore depends continuously on the finitely many involved jump-rates.

(3) The continuity in (8.4) is deduced from the dual representation. We have to show that the dual expectation depends continuously on $\theta$ , which goes as follows. First note that the monomials $\{\,\langle\cdot\,,f\rangle^{\ell}\colon\,f\in C_{b}(E,\mathbb{R}),\,\ell\in\mathbb{N}\}$ are measure-determining on $(E,\mathcal{B})$ . The dual expectation is a finite sum over terms arising from partition elements that are coalescing before jumping to the cemetery state. If $\ell$ partition elements remain, then the $\theta$ -dependence is via $\langle\theta,f\rangle^{\ell}$ , which is a continuous function of $\theta$ . ∎

Step 2.

To deal with a stationary and ergodic random environment, we condition on the sequence $(\chi_{\alpha})_{\alpha\in\mathbb{N}_{0}}$ . This leads to a sequence of Markov chains in random environment, indexed by $j$ , for which the result in (1) holds, as explained above. After that we argue that (1)–(3) again imply the claim, because of the stationarity and the fact that we need only consider finite $\alpha$ .

Next, we consider the finite-dimensional laws of the Markov chain in random environment conditional on $(\chi_{\alpha})_{\alpha\in\mathbb{N}_{0}}$ and we verify the appropriate versions of (1)–(3). To this end, we extend the duality to a space-time duality and obtain an expression for the mixed space-time moments in terms of triples of parameters

[TABLE]

with $L$ being the order of the marginal distribution we consider.

In the space-time dual, we work with frozen partition elements which are activated (then once and forever) at a present time. Namely we add partition elements marked by a label in $[0,\infty]$ , which indicates from which time on the mechanisms of the coalescent are activated. Before this time, the partition element neither moves nor coalesces. This allows us to characterize the finite-dimensional marginals of the forward process. Suppose that we want to study the finite-dimensional distributions associated with times $0\leq t_{1}<t_{2}<t_{3}\ldots<t_{n}<t$ . Then we take individuals marked with $0,t-t_{n},t-t_{n-1},\ldots,t-t_{1}$ , consider the test functions in the duality relations for the time horizon $t_{1},t_{2},\ldots,t_{n},t$ , and form the product. The duality relation holds again. Compare with Greven, Sun and Winter [GSW16, Corollary 1.20].

In this setting, (1)–(3) turn into claims about the expectation of the duality expression under the law of the space-time coalescent, after which the argument proceeds as above. ∎

Proof of cases (a), (A).

The limiting transition kernel of the rescaled interaction chain for a given environment degenerates to a transition kernel concentrated on the traps. We have

[TABLE]

We must therefore show that

[TABLE]

Taking the dual representation, we see that as $K\to\infty$ the rate of the $\Lambda$ -coalescence tends to infinity, implying that the coalescent converges before it jumps, and coalesces into a single partition element. The duality relation says that the original McKean-Vlasov process is in a mono-type equilibrium, where the type is chosen at random according to $\theta$ . The claim now follows because for $K(\theta,\cdot)=\int_{E}\theta(\mathrm{d}u)\delta_{\delta_{u}}(\cdot)$ the state $\delta_{u}$ is a trap, so that the limiting Markov chain is constant for every $\alpha\neq 0$ , the constant being chosen according to $\theta$ for every realization of the random environment. ∎

8.2 Random cluster order

Proof of cases (c), (C2) and (d), (C3)[second subcase].

In cases (c) and (d), averaging takes place via a law of large numbers and the situation is similar to the homogeneous environment, for which the results in Theorem 3.19 are of the same type, and it is only the formula for $d$ that changes.

The claim is that the interaction chain, which is a space-time rescaled Markov chain and a measure-valued square-integrable martingale, converges to a limit that is a measure-valued diffusion and a square-integrable martingale. In [DGV95, Section 6(b)], it was pointed out how, for the case of the Fleming-Viot process, this convergence reduces to the study of the process of conditioned variances along the path, which in turn reduces to showing the following asymptotic relations for these objects. Pick $\alpha_{1},\alpha_{2}\in I$ with $\alpha_{2}<\alpha_{1}$ , and suppose that $\lim_{j\to\infty}k_{\alpha}(j)/j=\beta(\alpha)$ with $0\leq\beta(\alpha_{1})<\beta(\alpha_{2})\leq 1$ . If the scaled Markov chain is such that

[TABLE]

with $\beta(\alpha)=1-\alpha$ , then by applying the transformation $\beta(\alpha)=\mathrm{e}^{-s}$ the right-hand side turns into the expression $(1-e^{-(s_{1}-s_{2})})\,\operatorname{Var}_{\theta}(f)$ . Since this scales like $(s_{1}-s_{2})\,\operatorname{Var}_{\theta}(f)$ for $s_{1}\downarrow s_{2}$ , we see that the standard Fleming-Viot process $Y(s)_{s\geq 0}$ appears as the scaling limit. Since $s=\log(1/(1-\alpha))$ , we get the time-scaled Fleming-Viot process $Y(\log(1/(1-\alpha))_{\alpha\in[0,1)}$ (see [DGV95, Section 6]).

With suitable time transformations, we can also handle the other forms of scaling $j\to k_{\alpha}(j)$ in Definition 3.18. Namely, we have to identify the function $F(\alpha_{1},\alpha_{2})$ appearing in front of $\operatorname{Var}_{\theta}(f)$ and find the transformation $\alpha=L(s)$ such that

[TABLE]

so that again the standard Fleming-Viot process $(Y(s))_{s\geq 0}$ appears as the scaling limit. Since $s=L^{-1}(\alpha)$ , we get the time-scaled Fleming-Viot process $(Y(L^{-1}(\alpha))_{\alpha\in I}$ .

It was pointed out in [GdHKK14, Section 9.3] how (8.9) is established for the homogeneous hierarchical Cannings process by using the scaling analysis of the coefficients $\underline{d}=(d_{k})_{k\in\mathbb{N}_{0}}$ . In our case, we need to work with a random sequence $(\mu_{k}\rho_{k}(\omega))_{k\in\mathbb{N}_{0}}$ instead of $(\mu_{k})_{k\in\mathbb{N}_{0}}$ , where $\rho_{k}(\omega)$ arises from the term $\Lambda=\Lambda^{(\eta,k)}((0,1])(\omega)$ in the following variance formula

[TABLE]

We thus have to see whether the product (with $\rho_{k}(\omega)=\rho^{\mathrm{MC}_{k}(0)}(\omega)$ )

[TABLE]

appearing in the expression for the variance in (8.9), does indeed exhibit averaging based on the tail triviality of the random sequence $(\rho_{k}(\omega))_{k\in\mathbb{N}_{0}}$ (see [GdHKK14, Eq. (8.14)]).

To that end, we abbreviate

[TABLE]

consider the relation

[TABLE]

and analyse its behaviour as $j\to\infty$ for appropriate choices of $j_{1}=j_{1}(j)$ and $j_{2}=j_{2}(j)$ . We must show that, for $\mathbb{P}$ almost all $\omega$ , (8.14) behave asymptotically like the right-hand side of (8.9), and we must identify the associated $F$ , $\Delta F$ and $L$ .

In order to decide how the product scales as $j_{2}-j_{1}\to\infty$ , we take logarithms to turn this into the question whether the sum

[TABLE]

has a certain scaling behaviour, and we link this to the scaling behaviour of $\mu_{k}/c_{k}$ and $d_{k}/c_{k}$ for $k\to\infty$ (which we know from Theorems 3.15 and 3.17) to derive the relevant asymptotics. We have to show that this asymptotics does not depend on $\omega$ and is equal to that with $\rho_{k}(\omega)$ replaced by its mean $1$ . To achieve the latter, we use the stationarity of $(\rho_{k}(\omega))_{k\in\mathbb{N}_{0}}$ , plus the fact that it has bounded and decaying covariances (recall (2.35)–(2.36)). The key is the following lemma.

Lemma 8.1.

Define $S(j_{1},j_{2})(\omega)=\sum_{k=j_{1}}^{j_{2}}m_{k}(\omega)$ . Then,

[TABLE]

Proof.

Define

[TABLE]

Then

[TABLE]

With the help of Chebyshev’s inequality we see that it suffices to show that

[TABLE]

We have $\mathbb{C}\mathrm{ov}[\rho_{k}(\omega),\rho_{l}(\omega)]=C_{|k-l|}$ with $\lim_{m\to\infty}C_{m}=0$ . Since, by our assumptions on $(c_{k})_{k\in\mathbb{N}_{0}}$ and $(\mu_{k})_{k\in\mathbb{N}_{0}}$ , we have

[TABLE]

the claim follows. ∎

Remark 8.2.

The role of Lemma 8.1 is to show that the same clustering behaviour occurs in the random environment as in the homogeneous environment. We are only able to prove convergence in $\mathbb{P}$ -probability and not $\mathbb{P}$ -a.s. In the prefactor in the right-hand side of (8.14) weighted averages over $j_{1},j_{2}$ -dependent sliding windows of the random environment appear, which would need to be shown to converge $\mathbb{P}$ -a.s. It is unclear how to do this, even for an i.i.d. random environment.**

Lemma 8.1 implies that the term between square brackets in (8.14) scales like

[TABLE]

where we use that $\lim_{l\to\infty}(\mu_{l}+d_{l})/c_{l}=0$ in all cases of interest. In the remainder of the proof, we pick $j_{1}=k_{\alpha_{1}}(j)$ and $j_{2}=k_{\alpha_{2}}(j)$ with $\alpha_{2}<\alpha_{1}$ , with $k_{\alpha}(j)$ as in Definition 3.18, and compute the limit of (8.21) as $j\to\infty$ . We omit writing $\lfloor\cdot\rfloor$ at places where labels are obviously integer. We determine $k_{\alpha}$ and identify $F$ , $L$ (recall the discussion leading up to (8.10)) for the different cases, in the order (c), (C2), (d), (C3). Recall that $K_{k}=\frac{\mu_{k}}{c_{k}}$ and $\bar{K}_{k}=\frac{\bar{\mu}_{k}}{\bar{c}_{k}}$ .

Case (c). Pick $k_{\alpha}(j)=j+1-\alpha h(j)$ with $h(j)=1/\sqrt{K_{j}}$ , and insert $d_{k}\sim\sqrt{c_{k}\mu_{k}}=c_{k}\sqrt{K_{k}}$ and $d_{k+1}\sim d_{k}$ , to obtain that (8.21) scales like

[TABLE]

Putting $x=(j+1-k)\sqrt{K_{j}}$ , and using that $\lim_{k\to\infty}K_{k}=0$ , $\lim_{k\to\infty}k^{2}K_{k}=\infty$ and $K_{k}\sim K_{l}\sim K_{j}$ uniformly in $k,l$ in both sums, we get

[TABLE]

Pick $\alpha=L(s)=s$ . Then $\Delta F\equiv 1$ . Since $s=L^{-1}(\alpha)=\alpha$ , this proves the claim.

Case (C2)[subcase $\lim_{k\to\infty}k\bar{K}_{k}=\infty$ ]. Pick $k_{\alpha}(j)=j+1-\alpha h(j)$ with $h(j)=1/\bar{K}_{j}$ , and insert $d_{k}\sim\mu_{k}/(\mu-1)=\bar{K}_{k}c_{k}/(\mu-1)$ and $d_{k+1}\sim\mu d_{k}$ , to obtain that (8.21) scales like

[TABLE]

Putting $x=(j+1-k)\bar{K}_{j}$ , and using that $\lim_{k\to\infty}\bar{K}_{k}=0$ , $\lim_{k\to\infty}k\bar{K}_{k}=\infty$ and $\bar{K}_{k}\sim\bar{K}_{l}\sim\bar{K}_{j}$ uniformly in $k,l$ in both sums, we get

[TABLE]

Pick $\alpha=L(s)=\frac{\mu-1}{\mu}s$ . Then $\Delta F\equiv 1$ . Since $s=L^{-1}(\alpha)=\frac{\mu}{\mu-1}\alpha$ , this proves the claim.

Case (d). Pick $k_{\alpha}(j)=(1-\alpha)(j+1)$ , and insert $d_{k}\sim M/\sigma_{k}$ , $\sigma_{k}c_{k}\sim k/(1-a)$ and $d_{k+1}\sim d_{k}$ , to obtain that (8.21) scales like

[TABLE]

Putting $x=(j+1-k)/(j+1)$ , and using that $\lim_{k\to\infty}k^{2}K_{k}=0$ , we get

[TABLE]

Pick $\alpha=L(s)=1-e^{-s/R}$ with $R=M(1-a)$ . Then $\Delta F\equiv 1$ . Since $s=L^{-1}(\alpha)=\log(1/(1-\alpha)^{R})$ we get the claim.

Case (C2)[subcase $\lim_{k\to\infty}k\bar{K}_{k}=\bar{N}$ ]. This is the same as case (d) with $M(1-a)$ replaced by $\bar{N}\frac{\mu}{\mu-1}$ .

Case (C3)[second subcase]. This is the same as case (d) with $M$ replaced by 1. ∎

Bibliography22

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[BEM 07] J. Blath, A. Etheridge, and M. Meredith. Coexistence in locally regulated competing populations and survival of branching annihilating random walk. Ann. Appl. Probab. , 17:1474–1507, 2007.
2[BEV 10] N.H. Barton, A.M. Etheridge, and A. Véber. A new model for evolution in a spatial continuum. Electron. J. Probab. , 15:paper no. 7, 162–216, 2010.
3[BEV 13] N. Berestycki, A.M. Etheridge, and A. Véber. Large scale behaviour of the spatial Λ Λ \Lambda -Fleming–Viot process. Ann. Inst. Henri Poincaré Probab. Stat. , 49:374–401, 2013.
4[BH 15] A. Bovier and F. den Hollander. Metastability - A Potential-Theoretic Approach , volume 351 of Grundlagen der mathematischen Wissenschaften . Springer-Verlag, New York, 2015.
5[Can 74] C. Cannings. The latent roots of certain Markov chains arising in genetics: a new approach. I. Haploid models. Adv. Appl. Prob. , 6:260–290, 1974.
6[Can 75] C. Cannings. The latent roots of certain Markov chains arising in genetics: a new approach. II. Further haploid models. Adv. Appl. Prob. , 7:264–282, 1975.
7[CG 86] J.T. Cox and D. Griffeath. Diffusive clustering in the two-dimensional voter model. Ann. Probab. , 14:347–370, 1986.
8[CK 00] J.T. Cox and A. Klenke. Recurrence and ergodicity of interacting particle systems. Probab. Th. Relat. Fields , 116:239–255, 2000.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

The hierarchical Cannings process in random environment

Abstract

Contents

1 Introduction

1.1 Motivation and goal

Remark 1.1**.**

1.2 Summary of the main results

1.3 Outline

2 The model

2.1 The hierarchical Cannings process

2.1.1 The hierarchical group of order NNN

2.1.2 Block migration

Remark 2.1**.**

2.1.3 Block reshuffling-resampling

2.1.4 The generator and the martingale problem

Remark 2.2**.**

Proposition 2.3** **(Hierarchical martingale problem).

2.2 The hierarchical Cannings process in random environment

2.2.1 The random environment on the full tree

2.2.2 The generator in random environment

3 Main theorems

3.1 Results for fixed NNN

3.1.1 Well-posedness of the martingale problem

Theorem 3.1** **(Well-posedness of the martingale problem).

3.1.2 Dichotomy: coexistence versus clustering

Theorem 3.2** **(Equilibrium).

Theorem 3.3** **(Dichotomy for finite NNN).

Corollary 3.4** **(Change of types).

3.2 Results for N→∞N\to\inftyN→∞

3.2.1 McKean-Vlasov process

Proposition 3.5** **(McKean-Vlasov martingale problem).

3.2.2 Random environment for N=∞N=\inftyN=∞

3.2.3 Renormalization via block averages

Heuristics behind the recursion formula for the volatilities.

Theorem 3.6** **(Hierarchical mean-field limit and renormalization).

Theorem 3.7** **(Multi-scale analysis and the interaction chain).

Definition 3.8** **(Interaction chain).

Remark 3.9**.**

Theorem 3.10** **(Randomness lowers volatility).

3.2.4 Dichotomy for the interaction chain

Proposition 3.11** **(Entrance law of interaction chain exists).

Definition 3.12** **(Entrance law of interaction chain).

Theorem 3.13** **(Dichotomy for N=∞N=\inftyN=∞).

Corollary 3.14** **(Hierarchical mean field limit of equilibrium).

3.2.5 Scaling of the volatility

Theorem 3.15** **(Scaling of the Fleming-Viot volatility: polynomial coefficients).

Remark 3.16**.**

Theorem 3.17** **(Scaling of the Fleming-Viot volatility: exponential coefficients).

3.2.6 Cluster formation

Definition 3.18** **(Clustering classes).

Theorem 3.19** **(Cluster formation).

Remark 3.20**.**

3.3 Summary of the effects of the random environment

1.

2.

3.

4.

5.

6.

4 Existence, uniqueness, duality and equilibrium

4.1 The spatial coalescent in random environment

Theorem 4.1** **(Existence and uniqueness).

4.2 Dualities

Theorem 4.2** **(Duality).

4.3 Well-posedness of the martingale problems and equilibria

Theorem 4.3** **(Well-posedness).

Theorem 4.4** **(Equilibrium).

4.4 Consequences for the Cannings process

Proof.

Remark 4.5**.**

5 Dichotomy: coexistence versus clustering

5.1 Mean hazard

Lemma 5.1**.**

Proof.

Remark 1.1.

2.1.1 The hierarchical group of order $N$

Remark 2.1.

Remark 2.2.

Proposition 2.3 (Hierarchical martingale problem).

3.1 Results for fixed $N$

Theorem 3.1 (Well-posedness of the martingale problem).

Theorem 3.2 (Equilibrium).

Theorem 3.3 (Dichotomy for finite $N$ ).

Corollary 3.4 (Change of types).

3.2 Results for $N\to\infty$

Proposition 3.5 (McKean-Vlasov martingale problem).

3.2.2 Random environment for $N=\infty$

Theorem 3.6 (Hierarchical mean-field limit and renormalization).

Theorem 3.7 (Multi-scale analysis and the interaction chain).

Definition 3.8 (Interaction chain).

Remark 3.9.

Theorem 3.10 (Randomness lowers volatility).

Proposition 3.11 (Entrance law of interaction chain exists).

Definition 3.12 (Entrance law of interaction chain).

Theorem 3.13 (Dichotomy for $N=\infty$ ).

Corollary 3.14 (Hierarchical mean field limit of equilibrium).

Theorem 3.15 (Scaling of the Fleming-Viot volatility: polynomial coefficients).

Remark 3.16.

Theorem 3.17 (Scaling of the Fleming-Viot volatility: exponential coefficients).

Definition 3.18 (Clustering classes).

Theorem 3.19 (Cluster formation).

Remark 3.20.

Theorem 4.1 (Existence and uniqueness).

Theorem 4.2 (Duality).

Theorem 4.3 (Well-posedness).

Theorem 4.4 (Equilibrium).

Remark 4.5.

Lemma 5.1.

Lemma 5.2 (Zero-one law).

Theorem 6.1 ([Mean-field finite-system scheme).

6.2.1 The $2$ -level system on 3 time scales

Proposition 6.2 ([Two-level rescaling).

6.2.2 The $k$ -level system on $k+1$ time scales

Lemma 7.1.

Lemma 7.2.

Lemma 7.3.

Lemma 7.4.

Lemma 8.1.

Remark 8.2.