The Widom-Rowlinson model: Mesoscopic fluctuations for the critical droplet

Frank den Hollander; Sabine Jansen; Roman Kotecký; Elena Pulvirenti

arXiv:1907.00453·math-ph·March 16, 2026


TL;DR

This paper rigorously analyzes the mesoscopic surface fluctuations of the critical droplet in a two-dimensional Widom-Rowlinson particle system at low temperature, advancing understanding of phase separation in continuum models.

Contribution

It provides the first detailed mathematical analysis of surface fluctuations in a continuum particle system undergoing condensation.

Findings

01

The critical droplet is close to a disc of a certain deterministic radius.

02

Results establish a foundation for non-equilibrium Widom-Rowlinson models.

03

The results lead to a correction term in the Arrhenius formula for the average vapour-liquid crossover time in a non-equilibrium version of the model.

Abstract

We study the critical droplet for a close-to-equilibrium Widom-Rowlinson model of interacting particles, represented by disks of radius $1$ , in the two-dimensional plane at low temperature. The critical droplet is the set of macroscopic states that correspond to saddle points for the passage from a low-density supersaturated vapour to a stable high-density liquid. We analyse the mesoscopic fluctuations of the surface of the critical droplet, which turns out to be the set of particle configurations that are close to a disk of a certain deterministic radius. Our results represent the first detailed rigorous analysis of the surface fluctuations of a continuum interacting particle system exhibiting condensation and, as such, constitute a fundamental step in the study of phase separation from the perspective of stochastic geometry. At the same time, our results serve as a basis for the study…


Equations

$$ {\operatorname{dist}}(x,y)=\inf_{k\in{\mathbb{Z}}^{2}}|x-y+kL|,\qquad x,y\in{\mathbb{R}}^{2}. $$

$$ \Gamma=\{\gamma\subset{\mathbb{T}}\colon\,N(\gamma)\in{\mathbb{N}}_{0}\}, $$

$$ h(\gamma)=\bigcup_{x\in\gamma}B_{2}(x), $$

$$ E(\gamma)=V(\gamma)-V_{0}N(\gamma)=\Bigl|\bigcup_{x\in\gamma}B_{2}(x)\Bigr|-\sum_{x\in\gamma}|B_{2}(x)|, $$

$$ H(\gamma)=E(\gamma)-\lambda N(\gamma),\qquad\gamma\in\Gamma. $$

$$ \mu_{\beta}(\mathrm{d}\gamma)=\frac{1}{\Xi_{\beta}}\,\mathrm{e}^{-\beta H(\gamma)}\,{\mathbb{Q}}(\mathrm{d}\gamma),\qquad\gamma\in\Gamma, $$

$$ \Xi_{\beta}=\int_{\Gamma}{\mathbb{Q}}(\mathrm{d}\gamma)\,\mathrm{e}^{-\beta H(\gamma)}. $$

$$ z_{t}(\beta)=\beta\,\mathrm{e}^{-\beta V_{0}},\qquad\beta>\beta_{c}\in(0,\infty) $$

$$ z=\kappa z_{t}(\beta),\qquad\kappa\in(1,\infty),\qquad\beta\to\infty $$

$$ \mu_{\beta}(\mathrm{d}\gamma)=\frac{1}{\Xi_{\beta}}\,(\kappa\beta)^{N(\gamma)}\,\mathrm{e}^{-\beta V(\gamma)}\,{\mathbb{Q}}(\mathrm{d}\gamma),\qquad\gamma\in\Gamma, $$

$$ \frac{\mathrm{d}\mu_{\beta}}{\mathrm{d}\mathsf{P}_{\kappa\beta}}(\gamma)=\frac{\exp(-\beta V(\gamma))}{\int_{\Gamma}\exp(-\beta V)\,\mathrm{d}\mathsf{P}_{\kappa\beta}},\qquad\gamma\in\Gamma. $$

$$ \Phi_{\kappa}(R)=\pi R^{2}-\kappa\pi(R-2)^{2},\quad R\in[2,\infty),\qquad R_{c}(\kappa)=\frac{2\kappa}{\kappa-1}. $$

$$ \Phi(\kappa)=\Phi_{\kappa}(R_{c}(\kappa))=\frac{4\pi\kappa}{\kappa-1},\qquad\Psi(\kappa)=\frac{s\kappa^{2/3}}{\kappa-1},\quad s\in{\mathbb{R}}, $$

$$ I(\kappa,\beta;C)=\Xi_{\beta}\,\mu_{\beta}\Bigl(|V(\gamma)-\pi R_{c}(\kappa)^{2}|\leq C\delta(\beta)\Bigr). $$

$$ I(\kappa,\beta;C)=\mathrm{e}^{-\beta\Phi(\kappa)+\beta^{1/3}\Psi(\kappa)+o(\beta^{1/3})}. $$

$$ \Phi(\kappa)=-(\kappa-1)\pi(R_{c}(\kappa)-2)^{2}+\bigl[\pi R_{c}(\kappa)^{2}-\pi(R_{c}(\kappa)-2)^{2}\bigr], $$

$$ \Xi_{\beta}=\mathrm{e}^{-\beta(1-\kappa)|{\mathbb{T}}|+o(1)}. $$

$$ d_{\mathrm{H}}(F_{1},F_{2})=\min\Bigl\{\varepsilon\geq 0\colon\,F_{1}\subset F_{2}+\varepsilon B_{1}(0),\,F_{2}\subset F_{1}+\varepsilon B_{1}(0)\Bigr\},\qquad F_{1},F_{2}\neq\emptyset, $$

$$ \mathcal{S}_{{\mathbb{T}}}=\{S\subset{\mathbb{T}}\colon\,\exists\,F\text{ such that }h(F)=S\}, $$

$$ J(S)=|S|-\kappa|S^{-}|,\qquad S\in\mathcal{S}, $$

$$ I(S)=J(S)-\inf_{\mathcal{S}}J. $$

$$ \mu_{\beta}\bigl(h(\gamma)\approx S\bigr)\approx\exp\bigl(-\beta I(S)\bigr),\qquad\beta\to\infty. $$

$$ \min\bigl\{|S|-\kappa|S^{-}|\colon\,S\in\mathcal{S},\,|S|=\pi R^{2}\bigr\}=\pi R^{2}-\kappa\pi(R-2)^{2} $$

$$ \bigl(|S|-\kappa|S^{-}|\bigr)-\bigl(\pi R^{2}-\kappa\pi(R-2)^{2}\bigr)\leq\pi\kappa\varepsilon\quad\text{ with }\quad|S|=\pi R^{2}, $$

$$ d_{\mathrm{H}}(\partial S,\partial B_{R})\leq\tfrac{3}{2}R\varepsilon, $$

$$ I(B_{R})=\Phi_{\kappa}(R)-(1-\kappa)|{\mathbb{T}}|, $$

$$ I^{*}(A)=\inf\{I(S)\colon\,|S|=A\},\qquad A\in[0,\infty). $$

$$ \mu_{\beta}\bigl(V(\gamma)\approx A\bigr)\approx\exp\bigl(-\beta I^{*}(A)\bigr). $$

$$ I^{*}(\pi R^{2})=I(B_{R}). $$


Taxonomy

Topics: Stochastic processes and statistical mechanics · Theoretical and Computational Physics · Complex Systems and Time Series Analysis

Full text

The Widom-Rowlinson model: Mesoscopic fluctuations for the critical droplet

Frank den Hollander (Mathematical Institute, Leiden University, P.O. Box 9512, 2300 RA Leiden, The Netherlands)

Sabine Jansen (Mathematisches Institut, Ludwig-Maximilians-Universität, Theresienstrasse 39, 80333 München, Germany)

Roman Kotecký (Mathematics Institute, University of Warwick, Coventry CV4 7AL, United Kingdom, and Center for Theoretical Study, Charles University, Prague, Czech Republic)

Elena Pulvirenti (Institut für Angewandte Mathematik, Rheinische Friedrich-Wilhelms-Universität, Endenicher Allee 60, 53115 Bonn, Germany)

Abstract

We study the critical droplet for a close-to-equilibrium Widom-Rowlinson model of interacting particles in the two-dimensional continuum at low temperature. The critical droplet is the set of macroscopic states that correspond to saddle points for the passage from a low-density vapour to a high-density liquid. We first show that the critical droplet is close to a disc of a certain deterministic radius. After that we analyse the mesoscopic fluctuations of the surface of the critical droplet. These are built on microscopic fluctuations of the particles in the boundary layer, and provide a rigorous foundation for what in physics literature is known as capillary waves.

Our results represent the first detailed analysis of the surface fluctuations down to mesoscopic and microscopic precision. As such they constitute a fundamental step in the study of phase separation in continuum interacting particle systems from the perspective of stochastic geometry. At the same time, they serve as a basis for the study of a non-equilibrium version of the Widom-Rowlinson model, to be analysed elsewhere, where they lead to a correction term in the Arrhenius formula for the average vapour-liquid crossover time.

AMS 2010 subject classifications. 60J45, 60J60, 60K35; 82C21, 82C22, 82C27.

Key words and phrases. Critical droplet, surface fluctuations, large deviations, moderate deviations, isoperimetric inequalities.

Acknowledgment. FdH was supported by NWO Gravitation Grant 024.002.003-NETWORKS, FdH and EP by ERC Advanced Grant 267356-VARIS, and RK by GAČR Grant 16-15238S. EP was supported by the German Research Foundation in the Collaborative Research Centre 1060 “The Mathematics of Emergent Effects”. The authors acknowledge support by The Leverhulme Trust through International Network Grant Laplacians, Random Walks, Bose Gas, Quantum Spin Systems. SJ thanks Christoph Thäle for fruitful discussions.

1 Introduction, background and motivation
1.1 The Widom-Rowlinson model
1.2 A key target: the critical droplet
1.3 Outline
2 Main theorems
2.1 Large deviation principles and isoperimetric inequalities
2.2 Near the critical droplet: moderate deviations
2.3 Context
3 Proof of large deviation principles and isoperimetric inequalities
3.1 Properties of admissible sets
3.2 Minimisers of the shape rate function and their stability
3.3 Large deviation principle for Widom-Rowlinson
4 Heuristics for moderate deviations
4.1 Reduction to a surface integral
4.2 Approximation of the surface term
4.3 Orders of magnitude
4.4 Global scaling: auxiliary random processes
4.5 Local scaling: effective interface model
5 Stochastic geometry: approximation of geometric functionals
5.1 A priori estimates on boundary points
5.2 Locality for boundary determination
5.3 Volume and surface approximation
5.4 Geometric centre of a droplet
6 Stochastic geometry: representation of probabilities as surface integrals
6.1 Only disc-shaped droplets matter
6.2 Integration over halo shapes
6.3 From surface integral to auxiliary random variables
7 Asymptotics of surface integrals: preparations
7.1 Moderate deviations for the angular point process
7.2 Properties of mean-centred Brownian bridge
7.3 Discretisation errors
8 Asymptotics of surface integrals: proof of moderate deviations
8.1 Upper bound
8.2 Lower bound

1 Introduction, background and motivation

Section 1.1 introduces the equilibrium Widom-Rowlinson model. Section 1.2 states a key target in this model, namely, a detailed description of the fluctuations of the so-called critical droplet, i.e., the saddle point in the set of droplets with minimal free energy connecting the vapour state and the liquid state. In [21] we define and analyse a dynamic version of the Widom-Rowlinson model, for which this critical droplet appears as the gate for the metastable transition from the vapour state to the liquid state. Understanding the fluctuations of the critical droplet is crucial for the computation of the metastable crossover time in the dynamic model. The main goal of the present paper is a proof of the key target subject to three conditions, whose proofs are given in [22]. A weaker version of the key target is proved without the three conditions, in order to make the present paper self-contained. Section 1.3 contains an outline of the remainder of the paper.

1.1 The Widom-Rowlinson model

The Widom-Rowlinson model is an interacting particle system in ${\mathbb{R}}^{2}$ where the particles are discs with an attractive interaction. It was introduced in Widom and Rowlinson [38] to model liquid-vapour phase transitions, and is one of the rare models in the continuum for which a phase transition has been established rigorously. In the present paper we place the particles on a finite torus in ${\mathbb{R}}^{2}$ .

Fix $L\in(4,\infty)$ and let ${\mathbb{T}}={\mathbb{T}}_{L}={\mathbb{R}}^{2}/(L{\mathbb{Z}})^{2}$ be the torus of side-length $L$ . We can identify ${\mathbb{T}}$ with the set $[-\tfrac{1}{2}L,\tfrac{1}{2}L)^{2}$ after we redefine the distance by

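$$ {\operatorname{dist}}(x,y)=\inf_{k\in{\mathbb{Z}}^{2}}|x-y+kL|,\qquad x,y\in{\mathbb{R}}^{2}. \tag{1.1} $$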

The set $\Gamma=\Gamma_{\mathbb{T}}$ of finite particle configurations in ${\mathbb{T}}$ is

$$ \Gamma=\{\gamma\subset{\mathbb{T}}\colon\,N(\gamma)\in{\mathbb{N}}_{0}\}, \tag{1.2} $$

where $N(\gamma)$ denotes the cardinality of $\gamma$ , i.e., particles are viewed as non-coinciding points that are indistinguishable. The halo of a configuration $\gamma\in\Gamma$ is defined as (see Fig. 1)

$$ h(\gamma)=\bigcup_{x\in\gamma}B_{2}(x), \tag{1.3} $$

where $B_{2}(x)$ is the closed disc of radius $2$ centred at $x\in{{\mathbb{T}}}$ . (The reason why we choose radius $2$ instead of radius $1$ is explained in [21]: the one-species model arises as the projection of a two-species model.) The energy $E(\gamma)$ of a configuration $\gamma\in\Gamma$ is defined as

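$$ E(\gamma)=V(\gamma)-V_{0}N(\gamma)=\Bigl|\bigcup_{x\in\gamma}B_{2}(x)\Bigr|-\sum_{x\in\gamma}|B_{2}(x)|, \tag{1.4} $$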

where $V(\gamma)=|h(\gamma)|$ and $V_{0}=|B_{2}(0)|=4\pi$ . The energy vanishes when the discs do not overlap, and reaches its minimal value when the discs coincide. Since $0\geq E(\gamma)\geq-V_{0}[N(\gamma)-1]$ , the interaction is attractive and stable (Ruelle [29, Section 3.2]).
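As a sanity check on the bounds $0\geq E(\gamma)\geq-V_{0}[N(\gamma)-1]$, the following Python sketch evaluates the energy of a two-particle configuration in closed form. It is only an illustration under stated assumptions: the two particles are taken close enough, relative to $L$, that the periodic wrap-around of the torus plays no role, and the names `lens_area` and `two_particle_energy` are ours, not the paper's.

```python
import math

RADIUS = 2.0                    # halo discs B_2(x) have radius 2
V0 = math.pi * RADIUS ** 2      # V_0 = |B_2(0)| = 4*pi


def lens_area(d: float, r: float = RADIUS) -> float:
    """Area of the intersection of two discs of radius r with centres d apart."""
    if d >= 2 * r:
        return 0.0              # the discs do not overlap
    # Standard formula for the lens cut out by two equal circles.
    return 2 * r * r * math.acos(d / (2 * r)) - 0.5 * d * math.sqrt(4 * r * r - d * d)


def two_particle_energy(d: float) -> float:
    """E(gamma) = V(gamma) - V_0 * N(gamma) for N = 2 particles at distance d."""
    halo_volume = 2 * V0 - lens_area(d)   # inclusion-exclusion for the union
    return halo_volume - 2 * V0
```

For two particles the energy is minus the overlap area of the two halo discs, so it equals $-V_{0}$ when the centres coincide and $0$ once their distance is at least $4$.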

We define the grand-canonical Hamiltonian $H=H_{{{\mathbb{T}}},\lambda}$ in ${{\mathbb{T}}}$ with chemical potential $\lambda$ as

$$ H(\gamma)=E(\gamma)-\lambda N(\gamma),\qquad\gamma\in\Gamma. \tag{1.5} $$

The grand-canonical Gibbs measure $\mu_{\beta}=\mu_{{{\mathbb{T}}},\lambda,\beta}$ is the probability measure on $\Gamma$ defined by

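$$ \mu_{\beta}(\mathrm{d}\gamma)=\frac{1}{\Xi_{\beta}}\,\mathrm{e}^{-\beta H(\gamma)}\,{\mathbb{Q}}(\mathrm{d}\gamma),\qquad\gamma\in\Gamma, \tag{1.6} $$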

where $\beta\in(0,\infty)$ is the inverse temperature, ${\mathbb{Q}}$ is the law of the homogeneous Poisson point process on ${{\mathbb{T}}}$ with intensity $1$ , and $\Xi_{\beta}=\Xi_{{{\mathbb{T}}},\lambda,\beta}$ is the normalisation

$$ \Xi_{\beta}=\int_{\Gamma}{\mathbb{Q}}(\mathrm{d}\gamma)\,\mathrm{e}^{-\beta H(\gamma)}. \tag{1.7} $$

Write $z=\mathrm{e}^{\beta\lambda}$ to denote the chemical activity. In the thermodynamic limit, i.e., when $L\to\infty$ , a phase transition occurs at $z=z_{t}(\beta)$ with (see Fig. 2)

$$ z_{t}(\beta)=\beta\,\mathrm{e}^{-\beta V_{0}},\qquad\beta>\beta_{c}\in(0,\infty) \tag{1.8} $$

(Ruelle [30], Chayes, Chayes and Kotecký [8]). No closed form expression is known for the critical inverse temperature $\beta_{c}$ . We place ourselves in the metastable regime

$$ z=\kappa z_{t}(\beta),\qquad\kappa\in(1,\infty),\qquad\beta\to\infty. \tag{1.9} $$

In other words, we choose $z$ to lie in the liquid phase region, above the phase coexistence line in Fig. 2 representing the phase transition in the thermodynamic limit, and we let $\beta\to\infty$ and $z\downarrow 0$ in such a way that we keep close to the phase coexistence line by a fixed factor $\kappa$ . For this choice the Gibbs measure in (1.6) becomes

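$$ \mu_{\beta}(\mathrm{d}\gamma)=\frac{1}{\Xi_{\beta}}\,(\kappa\beta)^{N(\gamma)}\,\mathrm{e}^{-\beta V(\gamma)}\,{\mathbb{Q}}(\mathrm{d}\gamma),\qquad\gamma\in\Gamma, \tag{1.10} $$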

so that large particle numbers are favoured while large halos are disfavoured.

Let $\Pi_{\kappa\beta}$ be the homogeneous Poisson point process on $\mathbb{T}$ with intensity $\kappa\beta$ . If $\mathsf{P}_{\kappa\beta}$ denotes the law of $\Pi_{\kappa\beta}$ , then $\mu_{\beta}$ is absolutely continuous with respect to $\mathsf{P}_{\kappa\beta}$ with Radon-Nikodym derivative

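$$ \frac{\mathrm{d}\mu_{\beta}}{\mathrm{d}\mathsf{P}_{\kappa\beta}}(\gamma)=\frac{\exp(-\beta V(\gamma))}{\int_{\Gamma}\exp(-\beta V)\,\mathrm{d}\mathsf{P}_{\kappa\beta}},\qquad\gamma\in\Gamma. \tag{1.11} $$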

1.2 A key target: the critical droplet

For $\kappa\in(1,\infty)$ , abbreviate (see Fig. 3),

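$$ \Phi_{\kappa}(R)=\pi R^{2}-\kappa\pi(R-2)^{2},\quad R\in[2,\infty),\qquad R_{c}(\kappa)=\frac{2\kappa}{\kappa-1}. \tag{1.12} $$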

Throughout the paper, $\kappa\in(1,\infty)$ and $\tfrac{1}{2}L>R_{c}(\kappa)$ are fixed (recall that $L$ is the linear size of the torus ${\mathbb{T}}={\mathbb{T}}_{L}$ ). Define

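$$ \Phi(\kappa)=\Phi_{\kappa}(R_{c}(\kappa))=\frac{4\pi\kappa}{\kappa-1},\qquad\Psi(\kappa)=\frac{s\kappa^{2/3}}{\kappa-1},\quad s\in{\mathbb{R}}, \tag{1.13} $$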

where $s$ is a constant that will be identified below and that does not depend on $\kappa$ .

Fix $C\in(0,\infty)$ , abbreviate $\delta(\beta)=\beta^{-2/3}$ , and define

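$$ I(\kappa,\beta;C)=\Xi_{\beta}\,\mu_{\beta}\Bigl(|V(\gamma)-\pi R_{c}(\kappa)^{2}|\leq C\delta(\beta)\Bigr). \tag{1.14} $$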

As shown in [21], $I(\kappa,\beta;C)$ appears as the leading order term in a computation of the Dirichlet form associated with a dynamic version of the Widom-Rowlinson model. In this dynamic version, the special role of the critical disc $B_{R_{c}(\kappa)}(0)$ becomes apparent through the fact that the set $\{\gamma\in\Gamma\colon\,|V(\gamma)-\pi R_{c}(\kappa)^{2}|\leq C\delta(\beta)\}$ forms the gate for the metastable transition from the gas phase (‘ ${\mathbb{T}}$ empty’) to the liquid phase (‘ ${\mathbb{T}}$ full’) in the metastable regime (1.9). The main ingredient in [21] is the following sharp asymptotics.

TARGET: For $C$ large enough and $\beta\to\infty$ ,

$$ I(\kappa,\beta;C)=\mathrm{e}^{-\beta\Phi(\kappa)+\beta^{1/3}\Psi(\kappa)+o(\beta^{1/3})}. \tag{1.15} $$

The above target is a statement about a restricted equilibrium: it provides a sharp estimate for the probability that the halo has a volume that lies inside an interval of width $C\beta^{-2/3}$ around the volume of the critical disc $B_{R_{c}(\kappa)}(0)$. By writing

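$$ \Phi(\kappa)=-(\kappa-1)\pi(R_{c}(\kappa)-2)^{2}+\bigl[\pi R_{c}(\kappa)^{2}-\pi(R_{c}(\kappa)-2)^{2}\bigr], \tag{1.16} $$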

we see that we may think of $\Phi(\kappa)$ as (a leading order approximation of) the free energy of the critical droplet, consisting of the bulk free energy and the surface tension, and of $\Psi(\kappa)$ as (a leading order approximation of) the entropy associated with the fluctuations of the surface of the critical droplet, which plays the role of a correction term to the free energy. We will see that there are order $\beta$ discs inside the critical droplet and order $\beta^{1/3}$ discs touching its boundary (see Fig. 4). We remark that $\beta$ is to be viewed as a dimensionless quantity, i.e., the inverse temperature divided by some unit of energy. Otherwise, its fractional powers would not make sense.
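The formulas for $R_{c}(\kappa)$ and $\Phi(\kappa)$ can be checked numerically. The Python sketch below evaluates $\Phi_{\kappa}(R)=\pi R^{2}-\kappa\pi(R-2)^{2}$, its maximiser $R_{c}(\kappa)=2\kappa/(\kappa-1)$, and the two-term splitting of $\Phi(\kappa)$ written above. The function names are ours, and labelling the first term `bulk` and the bracket `surface` is our reading of the sentence about bulk free energy and surface tension, not notation from the paper.

```python
import math


def phi(kappa: float, R: float) -> float:
    """Phi_kappa(R) = pi R^2 - kappa pi (R - 2)^2 for R >= 2."""
    return math.pi * R ** 2 - kappa * math.pi * (R - 2) ** 2


def critical_radius(kappa: float) -> float:
    """R_c(kappa) = 2 kappa / (kappa - 1), the zero of the derivative of Phi_kappa."""
    return 2 * kappa / (kappa - 1)


def critical_free_energy(kappa: float) -> float:
    """Phi(kappa) = 4 pi kappa / (kappa - 1)."""
    return 4 * math.pi * kappa / (kappa - 1)


def bulk_and_surface(kappa: float) -> tuple:
    """The two terms whose sum is Phi(kappa) in the splitting above."""
    rc = critical_radius(kappa)
    bulk = -(kappa - 1) * math.pi * (rc - 2) ** 2
    surface = math.pi * rc ** 2 - math.pi * (rc - 2) ** 2
    return bulk, surface
```

For $\kappa=2$ this gives $R_{c}=4$ and $\Phi(2)=8\pi$, split as $-4\pi+12\pi$. Since $\Phi_{\kappa}$ is a concave quadratic in $R$ when $\kappa>1$, the critical radius is a maximum of $\Phi_{\kappa}$, in line with $\Phi^{\prime}_{\kappa}(R_{c}(\kappa))=0$.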

The goal of the present paper is to provide a proof of (1.15), subject to three conditions that are settled in [22]. To make the present paper self-contained, we also prove a rough asymptotics that does not require these conditions, but still provides the correct order of magnitude for the entropy term, with bounds on the constant $s$ in (1.13). Along the way we will see that

$$ \Xi_{\beta}=\mathrm{e}^{-\beta(1-\kappa)|{\mathbb{T}}|+o(1)}. \tag{1.17} $$

Since $\Psi(\kappa)$ does not depend on $C$ , we may view (1.15) as a weak large deviation principle for the random variable $\beta^{2/3}|V-\pi R_{c}^{2}(\kappa)|$ under the Gibbs measure in (1.10). The rate is $\beta^{1/3}$ and the rate function is degenerate, being equal to the constant $\Psi(\kappa)$ . This degeneracy reflects the fact that the radius of the critical droplet is close to $R_{c}(\kappa)$ , for which $\Phi^{\prime}_{\kappa}(R_{c}(\kappa))=0$ .

1.3 Outline

In Section 2 we present our main theorems: a large deviation principle for the halo shape and the halo volume, and weak moderate deviations for the halo volume close to the critical droplet. These theorems are the main input for the analysis of the dynamic Widom-Rowlinson model in [21]. Section 3 proves the two large deviation principles, as well as certain isoperimetric inequalities that play a crucial role throughout the paper. In Section 4 we provide the heuristics behind the proof of the main theorems, which is carried out in Sections 5–8. Section 5 focusses on approximations of certain key geometric functionals, which are crucial for the analysis of the moderate deviations. Section 6 represents moderate deviation probabilities in terms of geometric surface integrals and introduces auxiliary random processes that are needed for the description of the fluctuations of the surface of the critical droplet. Section 7 contains various preparations involving exponential functionals of the auxiliary random variables. Section 8 uses these preparations, in combination with the geometric properties derived in Sections 5–7, to prove the moderate deviations for the halo volume close to the critical droplet.

The results in Sections 3–8 lead to a description of the mesoscopic fluctuations of the surface of the critical droplet in terms of a certain constrained Brownian bridge, and quantify the cost of moderate deviations for the surface free energy of droplets. The proof relies on three conditions involving the microscopic fluctuations of the surface of the critical droplet, whose proofs are given in [22]. To make the present paper self-contained, we also prove a rough moderate deviation estimate that does not need the three conditions.

2 Main theorems

This section formulates and discusses our main theorems. In Section 2.1 we state large deviation principles for the halo shape (Theorem 2.1) and the halo volume (Theorem 2.3), and show that the corresponding rate functions are linked via an isoperimetric inequality (Theorem 2.2). In Section 2.2 we formulate a conjecture (Conjecture 2.4) about moderate deviations for the halo volume, and state a sharp asymptotics that settles a version of this conjecture for volumes that are close to the volume of the critical droplet (Theorem 2.5). This sharp asymptotics settles the target formulated in Section 1.2, subject to three conditions (Conditions (C1)–(C3) below), whose proofs are given in [22]. In order to make our paper self-contained, we also prove a rougher asymptotics (Theorem 2.6), which does not require the conditions and still provides the correct order of the correction term, with explicit bounds on the constant. In Section 2.3 we place our results in a broader context and explain why they open up a new window in the area of stochastic geometry for interacting particle systems.

For background on large deviation theory, see e.g. Dembo and Zeitouni [11] or den Hollander [20].

2.1 Large deviation principles and isoperimetric inequalities

Admissible sets.

Let $\mathcal{F}_{\mathbb{T}}$ be the family of non-empty closed (and hence compact) subsets of the torus ${\mathbb{T}}$ . We equip $\mathcal{F}_{\mathbb{T}}$ with the Hausdorff metric

[TABLE]

where ${\operatorname{dist}}(x,F)=\min_{y\in F}{\operatorname{dist}}(x,y)$ . This turns $\mathcal{F}_{{\mathbb{T}}}$ into a compact metric space (Matheron [27, Propositions 12.2.1, 1.4.1, 1.4.4], Schneider and Weil [31, Theorems 12.2.1, 12.3.3]). Let $\mathcal{S}\subset\mathcal{F}_{{\mathbb{T}}}$ be the collection of all sets that are $({{\mathbb{T}}}$ -)admissible, i.e.,

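$$ \mathcal{S}_{{\mathbb{T}}}=\{S\subset{\mathbb{T}}\colon\,\exists\,F\text{ such that }h(F)=S\}, \tag{2.2} $$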

where $h(F)=\cup_{x\in F}B_{2}(x)$ is the halo of $F$ . In Section 3.1 we will see that there is a unique maximal $F$ such that $h(F)=S$ , which we denote by $S^{-}$ and which equals $S^{-}=\{x\in S\colon\,B_{2}(x)\subset S\}$ .

Obviously, not every closed set is admissible. For example, when we form $2$ -halos we round off corners, and so a shape with sharp corners cannot be in $\mathcal{S}$ . Also note that $S^{-}\neq\emptyset$ whenever $S$ is admissible: $S$ necessarily contains at least one disc $B_{2}(x)$ with $x\in S$ . In the following, we typically omit the subscript referring to the torus ${\mathbb{T}}$ .
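To make the Hausdorff metric on the torus concrete, here is a minimal Python sketch for finite point sets. It combines the periodic distance on ${\mathbb{T}}$ from Section 1.1 with the usual max-min form of the Hausdorff distance, which for compact sets agrees with the smallest $\varepsilon$ such that each set lies in the $\varepsilon$-neighbourhood of the other. Restricting to finite sets is our simplification, and the names `torus_dist` and `hausdorff` are illustrative.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float]


def torus_dist(x: Point, y: Point, L: float) -> float:
    """Periodic distance on the torus of side L: inf over k in Z^2 of |x - y + k L|."""
    total = 0.0
    for a, b in zip(x, y):
        delta = abs(a - b) % L           # coordinate difference reduced mod L
        delta = min(delta, L - delta)    # shortest way around the circle
        total += delta * delta
    return math.sqrt(total)


def hausdorff(F1: Sequence[Point], F2: Sequence[Point], L: float) -> float:
    """Hausdorff distance between two non-empty finite subsets of the torus."""
    def one_sided(A: Sequence[Point], B: Sequence[Point]) -> float:
        # Largest distance from a point of A to the set B.
        return max(min(torus_dist(a, b, L) for b in B) for a in A)

    return max(one_sided(F1, F2), one_sided(F2, F1))
```

The infimum over $k\in{\mathbb{Z}}^{2}$ decouples into the two coordinates, which is why each coordinate difference can be reduced separately.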

Large deviation principles.

Define

$$ J(S)=|S|-\kappa|S^{-}|,\qquad S\in\mathcal{S}, \tag{2.3} $$

and

$$ I(S)=J(S)-\inf_{\mathcal{S}}J. \tag{2.4} $$

We view the halo $h(\gamma)$ as a random variable with values in the space $\mathcal{S}$ , topologized with the Hausdorff distance. Note that $\inf_{\mathcal{S}}J=(1-\kappa)|\mathbb{T}|$ because $\kappa\in(1,\infty)$ .

Theorem 2.1 (Large deviation principle for the halo shape).

The family of probability measures $(\mu_{\beta}(h(\gamma)\in\cdot\,))_{\beta\geq 1}$ satisfies the LDP on $\mathcal{S}$ with speed $\beta$ and with good rate function $I$ .

Informally, Theorem 2.1 says that

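$$ \mu_{\beta}\bigl(h(\gamma)\approx S\bigr)\approx\exp\bigl(-\beta I(S)\bigr),\qquad\beta\to\infty. \tag{2.5} $$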

The contraction principle suggests a large deviation principle for the halo volume. To formulate this, we first state a minimisation problem. The condition $R\in(2,\tfrac{1}{2}L)$ below ensures that the effect of the periodic boundary conditions on the torus ${\mathbb{T}}$ is not felt.

Theorem 2.2 (Minimisers of rate function for halo volume).


(1)

For every $R\in(2,\frac{1}{2}L)$ ,

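$$ \min\bigl\{|S|-\kappa|S^{-}|\colon\,S\in\mathcal{S},\,|S|=\pi R^{2}\bigr\}=\pi R^{2}-\kappa\pi(R-2)^{2} \tag{2.6} $$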

and the minimisers are the discs of radius $R$ .

(2)

The minimisers are stable in the following sense: There exists an $\varepsilon_{0}>0$ such that if $0<\varepsilon<\varepsilon_{0}$ and $S\in\mathcal{S}$ satisfies

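$$ \bigl(|S|-\kappa|S^{-}|\bigr)-\bigl(\pi R^{2}-\kappa\pi(R-2)^{2}\bigr)\leq\pi\kappa\varepsilon\quad\text{ with }\quad|S|=\pi R^{2}, \tag{2.7} $$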

then $S^{-}$ is connected with connected complement (simply connected as a subset of ${\mathbb{R}}^{2}$ ), and

$$ d_{\mathrm{H}}(\partial S,\partial B_{R})\leq\tfrac{3}{2}R\varepsilon, \tag{2.8} $$

where $d_{\text{\rm{H}}}$ denotes the Hausdorff distance.

Theorem 2.2 is a powerful tool because it shows that the near-minimisers of the halo rate function are close to a disc and have no holes inside. In particular, it tells us that

$$ I(B_{R})=\Phi_{\kappa}(R)-(1-\kappa)|{\mathbb{T}}|, \tag{2.9} $$

and allows us to describe the large deviations of the halo volume.

Theorem 2.3 (Large deviation principle for the halo volume).

The family of probability measures $(\mu_{\beta}(V(\gamma)\in\cdot\,))_{\beta\geq 1}$ satisfies the LDP on $[0,\infty)$ with speed $\beta$ and with good rate function $I^{*}$ given by

$$ I^{*}(A)=\inf\{I(S)\colon\,|S|=A\},\qquad A\in[0,\infty). \tag{2.10} $$

Informally, Theorem 2.3 says that

$$ \mu_{\beta}\bigl(V(\gamma)\approx A\bigr)\approx\exp\bigl(-\beta I^{*}(A)\bigr). \tag{2.11} $$

For every $R\in(2,\frac{1}{2}L)$ , we have

$$ I^{*}(\pi R^{2})=I(B_{R}). \tag{2.12} $$

2.2 Near the critical droplet: moderate deviations

Fluctuations of the halo volume.

The function $R\mapsto I(B_{R})$ is maximal at

[TABLE]

We assume that $\tfrac{1}{2}L>R_{c}$. In the dynamic Widom-Rowlinson model that we introduce and analyse in [21], $R_{c}$ plays the role of the radius of the critical droplet for the metastable crossover from an empty torus to a full torus in the limit as $\beta\to\infty$. We therefore zoom in on a neighborhood of the critical droplet. The large deviation principle yields the statement

[TABLE]

for $\varepsilon>0$ fixed. We would like to take $\varepsilon=\varepsilon(\beta)\downarrow 0$ , for which we need a stronger property.

Conjecture 2.4 (Weak moderate deviations for the halo volume).

There exists a function $\Psi_{R}\colon\,{\mathbb{R}}\to{\mathbb{R}}$ such that

[TABLE]

Conjecture 2.4 has the flavor of a weak large deviation principle on scale $\beta^{-2/3}$ with speed $\beta^{1/3}$ . For $R=R_{c}$ we expect the function $\Psi_{R_{c}}$ to be constant. In Theorem 2.5 below we establish a version of this claim, with $\Psi_{R_{c}}\equiv\Psi(\kappa)$ , the entropy defined in (1.13). In what follows we first state a theorem (Theorem 2.5 below) that relies on three conditions involving an effective interface model whose proof is given in [22]. Afterwards we state a weaker version (Theorem 2.6 below) that provides upper and lower bounds, whose proof is fully completed in the present paper. To state these theorems we need some additional notation.

Notation.

Let $S=h(\gamma)$ be the halo of some configuration $\gamma$ . The boundary of $S$ consists of a union of circular arcs that are disjoint except for their endpoints. We call the centres $z_{1},\ldots,z_{n}$ of these circles the boundary points of $S$ and we say that $z=(z_{1},\ldots,z_{n})$ is a connected outer contour if there exists a halo $S$ with a simply connected $2$ -interior $S^{-}$ having exactly these boundary points (see Fig. 5). Let

[TABLE]

Later on we parametrize the boundary points $z_{1},\ldots,z_{n}$ of an approximately disc-shaped droplet in polar coordinates. We will see that, roughly, we may think of the angular coordinates $t_{1},\ldots,t_{n}$ as the points of an angular point process, and of the radii $r_{1},\ldots,r_{n}$ as the values of a Gaussian process evaluated at those random angles. To make this picture more precise, we need to introduce auxiliary processes.

Let $(W_{t})_{t\geq 0}$ be standard Brownian motion starting in $0$, and let

[TABLE]

be standard Brownian bridge on $[0,2\pi]$ . Consider the process

[TABLE]

called the mean-centred Brownian bridge (Deheuvels [10]). Set

[TABLE]

Let

[TABLE]

Thus, $N$ is a Poisson random variable with parameter $2\pi\lambda(\beta)=2\pi G_{\kappa}\beta^{1/3}$ . We assume that $(B_{t})_{t\in[0,2\pi]}$ and $\mathcal{T}$ are defined on a common probability space $(\Omega,\mathscr{F},{\mathbb{P}})$ and they are independent. Conditional on the event $\{N=n\}$ , we may write $\mathcal{T}=\{T_{i}\}_{i=1}^{n}$ with $0\leq T_{1}<\cdots<T_{n}<2\pi$ , and define

[TABLE]

Note that $\Theta_{i}\geq 0$ , $1\leq i\leq n$ , and $\sum_{i=1}^{n}\Theta_{i}=2\pi$ . For $m\in{\mathbb{R}}$ , set

[TABLE]

with

[TABLE]

Later we will see that the natural reference measure for the angles $t_{i}$ is not the Poisson process $\mathcal{T}$ but a periodic version of a renewal process, which is conveniently constructed by tilting the distribution of the Poisson process $\mathcal{T}$ . The precise expression for the exponential tilt will be derived later. Set

[TABLE]

and consider the tilted probability measure $\widehat{\mathbb{P}}$ on $(\Omega,\mathscr{F},{\mathbb{P}})$ defined by

[TABLE]

Finally let $\tau_{*}\in{\mathbb{R}}$ be the unique solution to the equation

[TABLE]

The change of variables $s=u^{3}/24$ together with $\int_{0}^{\infty}s^{-1/2}\mathrm{e}^{-s}\mathrm{d}s=\Gamma(\frac{1}{2})=\sqrt{\pi}$ yields

[TABLE]

and so $\tau_{*}>0$ . In the sequel we use the notation $\overline{u_{i}}=\tfrac{1}{2}(u_{i}+u_{i+1})$ , $1\leq i\leq N$ .
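The angular point process $\mathcal{T}$ and the gaps $\Theta_{i}$ can be simulated with a few lines of Python. This is a sketch under assumptions, not the paper's construction: we read $\Theta_{i}$ as the gap between consecutive angles, with the last gap closing the circle, which is consistent with $\Theta_{i}\geq 0$ and $\sum_{i=1}^{n}\Theta_{i}=2\pi$. The argument `intensity` stands in for $\lambda(\beta)=G_{\kappa}\beta^{1/3}$, whose constant $G_{\kappa}$ is defined in a display not reproduced here, and the exponential tilt is not applied.

```python
import math
import random
from typing import List

TWO_PI = 2 * math.pi


def sample_angles(intensity: float, rng: random.Random) -> List[float]:
    """Points of a homogeneous Poisson process on [0, 2*pi) with the given intensity."""
    angles: List[float] = []
    t = rng.expovariate(intensity)       # exponential waiting times between points
    while t < TWO_PI:
        angles.append(t)
        t += rng.expovariate(intensity)
    return angles                        # increasing; the count is Poisson(2*pi*intensity)


def periodic_gaps(angles: List[float]) -> List[float]:
    """Gaps between consecutive angles (assumed Theta_i), wrapping around the circle."""
    if not angles:
        return []
    gaps = [angles[i + 1] - angles[i] for i in range(len(angles) - 1)]
    gaps.append(angles[0] + TWO_PI - angles[-1])   # last gap closes the circle
    return gaps
```

A typical call is `periodic_gaps(sample_angles(5.0, random.Random(0)))`; the returned gaps are non-negative and sum to $2\pi$ up to floating-point rounding.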

Conditions.

Our main theorem builds on three conditions whose validity is proven in [22]:

(C1)

The limit

[TABLE]

exists and is of the form

[TABLE]

for some $\tau_{**}>0$ that does not depend on $\kappa$.

(C2)

The change from $Z^{(0)}$ to $Z^{(m)}$ does not affect (C1) when $m$ is not too large:

[TABLE]

(C3)

Let

[TABLE]

and

[TABLE]

Then, for $\delta>0$ sufficiently small,

[TABLE]

uniformly in $|m|=O(\beta^{1/6})$ .

Condition (C1) comes from the fact that for each of the $N\asymp\beta^{1/3}$ boundary points there is a constraint in terms of the two neighbouring boundary points that must be satisfied in order for the corresponding 2-disc to touch the boundary of the critical droplet. The constant $\tau_{**}$ is related to the free energy of an effective interface model. Condition (C2) says that the constraint imposed by condition (C1) is not affected by small dilations of the critical droplet, and implies that the free energy of the effective interface model is Lipschitz under small perturbations. Condition (C3) says that the first Fourier coefficient of the surface of the critical droplet is small. The term $\chi$ represents an energetic and entropic reward for the droplet boundary to fluctuate away from $\partial B_{R_{c}}$ . We require that this reward – which may be thought of as a background potential in the effective interface model – does not affect the microscopic free energy of the droplet.

Main theorem: sharp asymptotics.

We are now ready to formulate our main theorem.

Theorem 2.5 (Moderate deviations).

Suppose that conditions (C1)–(C3) hold. Then, for $C$ large enough and $\beta\to\infty$ ,

[TABLE]

where

[TABLE]

In view of (1.14) and (1.17), Theorem 2.5 settles the target in (1.15) subject to conditions (C1)–(C3).

Rough asymptotics.

Without conditions (C1)–(C3) we can still prove the following rougher asymptotics, which makes our paper self-contained.

Theorem 2.6 (Moderate deviation bounds).

For $C$ large enough,

[TABLE]

with $c\in[0,\infty)$ some constant (that may depend on $\kappa$ ).

Theorem 2.5 provides a full description of the fluctuations of the surface of the critical droplet in the Widom-Rowlinson model in the metastable regime (1.9). It makes fully precise and rigorous the heuristic arguments for capillary waves that are put forward in Stillinger and Weeks [37]. In the proof of Theorem 2.5 in Sections 5–8 we will see that the boundary points are given by (2.23), with $(B_{t})_{t\in[0,2\pi]}$ the mean-centred Brownian bridge, $\mathcal{T}$ the Poisson point process on $[0,2\pi]$ with intensity $2\pi G_{\kappa}\beta^{1/3}$ tilted via the probability measure in (2.25), and $|m|=O(\beta^{-1/6})$ (which is negligible). We will also see that the effect of centring of the droplet is that the discrete Fourier coefficients defined in (2.31) are asymptotically vanishing.

2.3 Context

In this section we place our main theorems in the broader context of stochastic geometry and continuum statistical physics. For general overviews we refer the reader to Chiu, Stoyan, Kendall and Mecke [9], respectively, Georgii, Häggström and Maes [17].

Large and moderate deviations for confined point processes.

There is a rich literature on limit laws for high-intensity point processes and extremal points, and also on the high-intensity Widom-Rowlinson model. Let us explain why Theorem 2.5 is considerably more involved than the theorems encountered in that literature. We first review two papers in the context of stochastic geometry that are close in spirit to our results and discuss the differences. The results from the point process literature are valid in higher dimensions $d\geq 2$ as well, but for simplicity we present the summary for $d=2$ only.

Consider a disc of fixed radius $R>2$ and a homogeneous Poisson point process with intensity $\alpha$ in $B_{R-2}(0)$. The set $B_{R}(0)\setminus h(\Pi_{\alpha})$ is called the vacant set, and the volume $|B_{R}(0)\setminus h(\Pi_{\alpha})|$ is called the defect volume. A point $x\in\mathrm{\Pi}_{\alpha}$ is extremal when $h(\Pi_{\alpha}\setminus x)\subsetneq h(\Pi_{\alpha})$. Write $\xi(x,\Pi_{\alpha})$ for the indicator that $x$ is extremal in $\Pi_{\alpha}$. (The problem has also been studied for general compact sets $K$ instead of $B_{2}(0)$, in which case the points are called $K$-maximal.) The following results are available in the limit as $\alpha\to\infty$:

(I) Schreiber [33] focusses on a Boolean model that is a variation on the Widom-Rowlinson model, in which to each point of the Poisson point process a random closed set called grain is attached. Grains are deterministic compact convex smooth sets satisfying certain conditions, namely, they are contained in $B_{R}(0)$ , are twice differentiable on the boundary, and are parametrized by the point closest to the boundary of $B_{R}(0)$ . The following results are proved in [33]:

(i) A law of large numbers for the defect volume:

[TABLE]

(ii) A moderate deviation estimate: if $E_{\alpha}$ is the expected defect volume, then for all $\eta>0$ and some $\hat{I}(\eta)>0$ ,

[TABLE]

The limit $\lim_{\eta\to\infty}\frac{1}{\eta}\widehat{I}(\eta)\in{\mathbb{R}}$ exists. An LDP is proved for a point process whose law is a Gibbsian modification of a Poisson point process with a hard-core Hamiltonian, which describes the one-color process in the two-color Widom-Rowlinson model. This result is close to our LDP, the main difference being that the minimization is much easier than in our model.

(II) Schreiber [34] considers a homogeneous Poisson point process with intensity $\alpha$ restricted to $[0,1]^{d}$ (which by abuse of notation we again call $\Pi_{\alpha}$ ) and proves the following: $h_{r}(\Pi_{\alpha})=\cup_{x\in\Pi_{\alpha}}B_{r}(x)$ satisfies the full LDP on $L^{1}([0,1]^{d})$ with rate $r\alpha$ and with good rate function $\mathcal{P}$ , the so-called Caccioppoli perimeter. However, this is proved in a specific limit for $\alpha$ and $r=r(\alpha)$ jointly, namely,

[TABLE]

(i.e., the large-volume limit and the high-intensity limit are taken simultaneously), while in our setting the volume of the system and the radius of the discs are kept fixed. Furthermore, the parameters of the model are taken to be on the coexistence line. In our case this would amount to taking the limit $\kappa\downarrow 1$ instead of fixing $\kappa\in(1,\infty)$ arbitrarily.

In Section 3 we give a full description of large deviations for arbitrary droplets, and in Sections 5–8 of moderate deviations for near-critical droplets. The fluctuations that we consider are two-sided, i.e., the droplets are not confined to an ambient disk $B_{R}(0)$ but rather live on a torus.

Interface literature.

Another popular concept in stochastic geometry is that of a random convex polytope and its relation with the paraboloid growth process (see Schreiber and Yukich [35], Calka, Schreiber and Yukich [7]). The former is the convex hull of $K\cap\Pi_{\alpha}$ (with $K$ a smooth convex set in $\mathbb{R}^{d}$ ); the latter is a growth model for interfaces that is studied because it provides information on the asymptotic behaviour as $\alpha\to\infty$ of geometric functionals of random polytopes, in particular, on the distribution of $K$ -maximal points. While we only marginally touch on these concepts in the present paper, they play an important role in [22], where we discuss the microscopic fluctuations of the surface of the critical droplet. There we prove that, upon rescaling of the random variables describing the boundary of the critical droplet, the effective microscopic model is given by a modification of the paraboloid hull process, which is connected to the paraboloid growth process. In [22], the conditions (C1)–(C3) in Theorem 2.5 are formulated in the context of paraboloid hull processes, and are proven there.

The literature on stochastic interfaces is large, especially for phase boundaries separating two phases. In statistical mechanics interface analyses have been successfully carried out for discrete systems, such as the Ising model. Higuchi, Murai and Wang [19] adapted to the two-dimensional Widom-Rowlinson model what was done in Higuchi [18] for the two-dimensional Ising model. As for the Ising model, the interface is well approximated by that of the so-called Solid-on-Solid model. The results in [19] concern limiting properties of the continuous random processes that model the fluctuations of the interface in the direction orthogonal to the line connecting the two end points of the phase boundary. Diffusive scaling of the interface is shown, with the Brownian bridge appearing as the limit. However, the results in [19] are again only for parameters on the coexistence line, which in our case amounts to taking the limit $\kappa\downarrow 1$ .

Further variations on the Widom-Rowlinson model.

Several further variations on the Widom-Rowlinson model have been considered in the literature. One is the so-called area-interaction point process considered in Baddeley and van Lieshout [4], where the probability density of a point configuration depends on the area of $h(\Pi_{\alpha})$ through a parameter $\gamma$ . When $\gamma=1$ , the Widom-Rowlinson model is recovered. Another is the so-called quermass-interaction process introduced in Kendall, van Lieshout and Baddeley [25], where a Boolean model interacting via a linear combination of Minkowski functionals generalizes the above area-interaction. In both these papers the problem of well-posedness of the processes is addressed via a proof of integrability and stability in the sense of Ruelle [29], but no results in terms of large deviations are obtained.

Another generalization is the multi-color Widom-Rowlinson model (see Chayes, Chayes and Kotecký [8]): $q$ colors are considered and random radii $r_{i}$ , $1\leq i\leq q$ , are attached to each color. In Dereudre and Houdebert [12] the case with non-integrable random radii is studied, and a different type of phase transition is proved.

3 Proof of large deviation principles and isoperimetric inequalities

In this section we prove Theorems 2.1–2.3. Section 3.1 takes a closer look at the properties of admissible sets (Lemmas 3.1–3.2). Section 3.2 gives the proof of Theorem 2.2. The proof requires two isoperimetric inequalities (Lemmas 3.3–3.4), which are analogues of the classical isoperimetric and Bonnesen inequalities, and imply that the minimisers of $I$ in (2.4) are discs and that the difference of $I$ with its minimum can be quantified in terms of the Hausdorff distance to these discs. Section 3.3 proves the large deviation principle for the centres of the 2-discs in the Widom-Rowlinson model (Proposition 3.5), and uses this to prove Theorems 2.1 and 2.3.

3.1 Properties of admissible sets

Write

$$F^{+}=h(F)=\bigcup_{x\in F}B_{2}(x) \tag{3.1}$$

for the $2$ -halo of $F\in\mathcal{F}_{{\mathbb{T}}}$ (Minkowski addition) and

$$F^{-}=\{x\in{\mathbb{T}}\colon\,B_{2}(x)\subset F\} \tag{3.2}$$

for the 2-interior of $F\in\mathcal{F}_{{\mathbb{T}}}$ (Minkowski subtraction). In integral geometry, the sets $F^{+}$ and $F^{-}$ are called the dilation and the erosion of $F$ , respectively. Note that the erosion and subsequent dilation of a set $F$ is contained in $F$ , i.e.,

$$(F^{-})^{+}\subset F \tag{3.3}$$

(Matheron [27, Section 1.5]). See Lemma 3.1(1) below.

Note that $(F^{-})^{+}$ is not necessarily equal to $F$ . We use $\mathcal{S}\subset\mathcal{F}_{{\mathbb{T}}}$ to denote the collection of all sets for which equality holds and call them ( ${{\mathbb{T}}}$ -)admissible (open with respect to $B_{2}(0)$ in the terminology of integral geometry), i.e.,

$$\mathcal{S}_{{\mathbb{T}}}=\{F\in\mathcal{F}_{{\mathbb{T}}}\colon\,(F^{-})^{+}=F\}. \tag{3.4}$$

In the following, we typically omit the subscript referring to the torus ${\mathbb{T}}$ , by writing $\mathcal{F}_{{\mathbb{T}}}=\mathcal{F}$ , $\mathcal{S}_{{\mathbb{T}}}=\mathcal{S}$ . There is another useful characterisation of admissible sets: $S\in\mathcal{S}$ if and only if it is the 2-halo of some $F\in\mathcal{F}$ , $S=F^{+}$ (see Lemma 3.1(2) below). Obviously, not every closed set is admissible. For example, when we form 2-halos we round off corners, and so a shape with sharp corners cannot be in $\mathcal{S}$ . Also note that $S^{-}\neq\emptyset$ whenever $S$ is admissible: $S$ necessarily contains at least one disc $B_{2}(x)$ with $x\in S$ .
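The roles of erosion, dilation and admissibility can be illustrated on a pixel grid. The following is a minimal sketch under simplifying assumptions that are not part of the paper: a flat integer grid replaces the torus, a discrete disc of pixel radius 4 stands in for $B_{2}(0)$ , and the helper names `disc_offsets`, `dilate` and `erode` are hypothetical. It shows that the opening $(F^{-})^{+}$ of a square stays inside the square, loses the sharp corner, and is itself admissible in the discrete sense.

```python
# Sketch: erosion F^-, dilation F^+ and opening (F^-)^+ on a pixel grid.
# Assumptions (not from the paper): flat grid instead of the torus,
# a discrete disc of pixel radius 4 standing in for the disc B_2(0).

def disc_offsets(radius):
    """Integer offsets (dx, dy) forming a discrete disc of the given pixel radius."""
    r = int(radius)
    return [(dx, dy) for dx in range(-r, r + 1) for dy in range(-r, r + 1)
            if dx * dx + dy * dy <= radius * radius]

def dilate(points, offsets):
    """Minkowski addition: union of the discs centred at the given points."""
    return {(x + dx, y + dy) for (x, y) in points for (dx, dy) in offsets}

def erode(points, offsets):
    """Minkowski subtraction: centres whose whole disc lies inside the set."""
    return {(x, y) for (x, y) in points
            if all((x + dx, y + dy) in points for (dx, dy) in offsets)}

offsets = disc_offsets(4)
square = {(x, y) for x in range(20) for y in range(20)}   # a shape with sharp corners
opened = dilate(erode(square, offsets), offsets)          # discrete analogue of (F^-)^+

print(opened <= square)       # the opening is contained in the original set
print((0, 0) in opened)       # the corner pixel has been rounded off
print(dilate(erode(opened, offsets), offsets) == opened)  # the opening is admissible
```

The last check reflects the characterisation of admissible sets as halos: the opened set is the halo of the eroded square, so opening it again changes nothing.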

In this section we summarise some known properties of admissible sets in a setting that will be needed later. The proofs of these properties rely on various sources. Below we only quote appropriate references, and when instructive we supply a short proof.

A key property is that for any set $S\in\mathcal{S}$ such that $S^{-}$ is connected and $S$ is simply connected, the set $S^{-}$ is of reach at least 2. Recall that the reach of a set $F\in\mathcal{F}$ is

[TABLE]

Lemma 3.1.

(1) If $F\in\mathcal{F}$ , then $(F^{-})^{+}\subset F$ .

(2) $S\in\mathcal{S}$ if and only if $S$ is the $2$ -halo of some $F\in\mathcal{F}$ , i.e.,

[TABLE]

(3) Both $F\mapsto F^{+}=h(F)$ and $F\mapsto|F^{+}|=|h(F)|$ are continuous with respect to the Hausdorff metric.

(4) If $S\in\mathcal{S}$ and $S^{-}$ is connected, then also $S$ is connected.

(5) If $F\in\mathcal{F}$ is convex, then $F^{+}$ and $F^{-}$ are convex and $F=(F^{+})^{-}$ . If $F_{1},F_{2}\in\mathcal{F}$ are convex and $F_{1}^{+}=F_{2}^{+}$ , then also $F_{1}=F_{2}$ .

(6) The set $\mathcal{S}$ is the closure in $\mathcal{F}$ of the set $\mathcal{S}^{\textrm{fin}}\subset\mathcal{S}$ , where $\mathcal{S}^{\textrm{fin}}$ consists of all $S$ of the form $S=h(\gamma)$ with $\gamma\subset{\mathbb{T}}$ finite.

(7) If $S\in\mathcal{S}$ , then $\operatorname{reach}(S^{-})\geq 2$ , provided the following condition is satisfied:

[TABLE]

(8) For any $S\in\mathcal{S}$ such that $\operatorname{reach}(S^{-})>0$ , the boundary $\partial S^{-}$ is $1$ -rectifiable. If $S\in\mathcal{S}^{\textrm{fin}}$ , then the boundary $\partial S^{-}$ is Lipschitz.

Proof.

We indicate the proper references to the literature. Part (7) is delicate.

(1) (Matheron [27, Chapter 1.5]) Note that $x\in(F^{-})^{+}$ is equivalent to $B_{2}(x)\cap F^{-}\neq\emptyset$ , which is equivalent to the existence of a $z\in B_{2}(x)\cap F^{-}$ . Hence, $x\in B_{2}(z)$ since $z\in B_{2}(x)$ and $B_{2}(z)\subset F$ since $z\in F^{-}$ . In [27], sets $F$ such that $(F^{-})^{+}=F$ are called open w.r.t. $B_{2}(0)$ (rather than admissible).

(2) (Matheron [27, Chapter 1.5]). For any $S\in\mathcal{S}$ , we have $S=F^{+}$ with $F=S^{-}$ . On the other hand, if $S=F^{+}$ , then $((F^{+})^{-})^{+}\supset F^{+}$ since $(F^{+})^{-}\supset F$ and hence $(S^{-})^{+}\supset S$ . The inclusion $((F^{+})^{-})^{+}\subset F^{+}$ , which amounts to $(S^{-})^{+}\subset S$ , was proven in (1).

(3) For the first claim, see Matheron [27, Proposition 1.5.1] and Schneider and Weil [31, Theorem 12.3.5]; for the second claim, see Kampf [24, Lemma 9].

(4) See Matheron [27, Chapter 1.5].

(5) See Matheron [27, Proposition 1.5.3]. In [27], sets $F$ such that $(F^{+})^{-}=F$ are called closed w.r.t. $B_{2}(0)$ .

(6) This follows from the fact that finite sets are dense in $\mathcal{F}$ in combination with the first claim in (3).

(7) Use Federer [13, Theorem 5.9], which assures that the $\operatorname{reach}$ is conserved under taking limits of sets with respect to the Hausdorff metric. It therefore suffices to consider $S\in\mathcal{S}^{\textrm{fin}}$ with $S=h(\gamma)$ , where the finite set $\gamma$ is sufficiently dense so that condition (C) is satisfied for $S=h(\gamma)$ .

We will prove that, for such $\gamma$ , $\operatorname{reach}(h(\gamma)^{-})\geq 2$ . First, observe that the boundaries $\partial h(\gamma)$ and $\partial h(\gamma)^{-}$ are unions of circular arcs (of radius 2). Given that $h(\gamma)$ satisfies condition (C), the set $h(\gamma)\setminus h(\gamma)^{-}$ splits into connected components, each bordered by two Jordan curves: one connected component of the boundary $\partial h(\gamma)$ and one connected component of the boundary $\partial h(\gamma)^{-}$ . For each component of $\partial h(\gamma)$ , we label the centres of the arc circles in such a way that two consecutive arcs belong to two consecutive centres, with periodic boundary condition. The associated component of $\partial h(\gamma)^{-}$ is a union of circular arcs passing through the centres. The centre of the arc circle connecting two consecutive centres is the point on the boundary $\partial h(\gamma)$ in the intersection of the arcs with these centres.

Let us assume that $\operatorname{reach}(h(\gamma)^{-})<2$ . Then there exist a point $x\in h(\gamma)\setminus h(\gamma)^{-}$ and two distinct points $y_{1},y_{2}$ in a connected component $\sigma$ of the boundary $\partial h(\gamma)^{-}$ such that ${\operatorname{dist}}(x,h(\gamma)^{-})={\operatorname{dist}}(x,y_{1})={\operatorname{dist}}(x,y_{2})=r<2$ . (To belong to two distinct components of $\partial h(\gamma)^{-}$ contradicts assumption (C).) Given that ${\operatorname{dist}}(x,h(\gamma)^{-})=r$ , the interior $B_{r}(x)^{0}$ of the disc $B_{r}(x)$ does not contain any point from $h(\gamma)^{-}$ , i.e., $B_{r}(x)^{0}\cap h(\gamma)^{-}=\emptyset$ . In addition, there are at most finitely many points of $h(\gamma)^{-}$ in $\partial B_{r}(x)$ , all of which belong to $\partial h(\gamma)^{-}$ .

The admissibility of $h(\gamma)$ means that every disc of radius 2 with a centre on $\partial h(\gamma)^{-}$ is fully contained in $h(\gamma)$ . We will draw a contradiction with this statement from the fact that a Jordan curve $\sigma$ avoiding $B_{r}(x)^{0}$ and containing two distinct points on its boundary necessarily indents too sharply to be consistent with admissibility of $h(\gamma)$ and condition (C). To show the contradiction, consider a line $\ell$ through the point $x$ that separates the points $y_{1}$ and $y_{2}$ into opposite half planes determined by $\ell$ . Without loss of generality, we may assume that the points $\{y,y^{\prime}\}=\ell\cap\partial B_{r}(x)$ do not belong to $h(\gamma)^{-}$ . As a result, both $B_{2}(y)$ and $B_{2}(y^{\prime})$ contain points not in $h(\gamma)$ or, since $B_{2}(y_{1})\cup B_{2}(y_{2})\subset h(\gamma)$ , there exist points $w\in B_{2}(y)\setminus(B_{2}(y_{1})\cup B_{2}(y_{2}))$ and $w^{\prime}\in B_{2}(y^{\prime})\setminus(B_{2}(y_{1})\cup B_{2}(y_{2}))$ such that $w,w^{\prime}\not\in h(\gamma)$ . Now, $B_{2}(w)$ and $B_{2}(w^{\prime})$ cannot contain any point from $h(\gamma)^{-}$ , and hence the Jordan curve $\sigma$ must avoid $B_{r}(x)^{0}\cup B_{2}(w)\cup B_{2}(w^{\prime})\cup\{y,y^{\prime}\}$ with $w,w^{\prime}\not\in h(\gamma)$ , which yields the contradiction with condition (C).

Indeed, if $\sigma$ is the outer boundary of the set $h(\gamma)^{-}$ with a single outer component of ${\mathbb{T}}\setminus h(\gamma)^{-}$ , then in contradiction with condition (C) this component contains two components of ${\mathbb{T}}\setminus h(\gamma)$ , since the points $w$ and $w^{\prime}$ belong to different components of ${\mathbb{T}}\setminus(h(\gamma)^{-}\cup B_{2}(y_{1})\cup B_{2}(y_{2}))$ . Otherwise, the Jordan curve $\sigma$ is the inner boundary of the set $h(\gamma)^{-}$ and surrounds the set $B_{r}(x)^{0}\cup B_{2}(w)\cup B_{2}(w^{\prime})\cup\{y,y^{\prime}\}$ . However, the region encircled by $\sigma$ contains two different components of ${\mathbb{T}}\setminus h(\gamma)$ , one containing $w$ and the other containing $w^{\prime}$ .

(8) For the first claim, see Ambrosio, Colesanti and Villa [2, Proposition 3]. For the second claim, it suffices to note that for $S\in\mathcal{S}^{\textrm{fin}}$ the boundary $\partial S^{-}$ is a finite union of arcs. ∎

We will also need the Steiner formula for sets of positive reach (Federer [13]).

Lemma 3.2.

Let $S\in\mathcal{S}$ be an admissible set with $S^{-}$ of reach at least $2$ . Then

$$|S|=|S^{-}|+2\,\mathcal{S}\mathcal{M}(S^{-})+4\pi\,\chi(S^{-}), \tag{3.7}$$

where $\mathcal{S}\mathcal{M}(S^{-})$ is the outer Minkowski content of $S^{-}$ and $\chi(S^{-})$ is the Euler-Poincaré characteristic of $S^{-}$ (= the number of connected components minus the number of holes). If the boundary $\partial S^{-}$ is Lipschitz, then $\mathcal{S}\mathcal{M}(S^{-})=\mathcal{H}^{1}(\partial S^{-})$ , where $\mathcal{H}^{1}$ is the $1$ -dimensional Hausdorff measure.

Proof.

Reformulating the Steiner formula for sets of positive reach as defined by Federer [13, Theorem 5.5, Theorem 5.19], we get, for $S\subset{\mathbb{R}}^{2}$ and $S\in\mathcal{S}$ ,

[TABLE]

for any $0<r<2$ and by continuity also for $r=2$ . For continuity of the left-hand side, see Sz.-Nagy [28]. The last claim is the same as Ambrosio, Colesanti and Villa [2, Corollary 1]. ∎
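As a sanity check, the Steiner formula can be tested on shapes whose halo area is known in closed form. The sketch below assumes the classical planar form of the formula for a set of reach at least $\rho$ , namely halo area $=$ area $+\rho\,\cdot$ perimeter $+\pi\rho^{2}\chi$ , specialised to $\rho=2$ ; the function name and the three test shapes (disc, stadium, annulus) are illustrative choices, not taken from the paper.

```python
import math

def steiner_halo_area(area, perimeter, euler_char, rho=2.0):
    """Classical planar Steiner formula for a set of reach >= rho (assumed form)."""
    return area + rho * perimeter + math.pi * rho ** 2 * euler_char

r, ell = 3.0, 1.5        # disc radius / stadium radius and length of the straight part
r_in, r_out = 3.0, 5.0   # annulus radii; the hole radius r_in >= 2 keeps the reach >= 2

checks = [
    # (name, value from the Steiner formula, halo area computed directly)
    ("disc",
     steiner_halo_area(math.pi * r ** 2, 2 * math.pi * r, 1),
     math.pi * (r + 2) ** 2),
    ("stadium",
     steiner_halo_area(math.pi * r ** 2 + 2 * r * ell, 2 * math.pi * r + 2 * ell, 1),
     math.pi * (r + 2) ** 2 + 2 * (r + 2) * ell),
    ("annulus",  # one component, one hole: Euler-Poincare characteristic 0
     steiner_halo_area(math.pi * (r_out ** 2 - r_in ** 2), 2 * math.pi * (r_out + r_in), 0),
     math.pi * ((r_out + 2) ** 2 - (r_in - 2) ** 2)),
]

for name, from_formula, direct in checks:
    print(name, math.isclose(from_formula, direct))
```

The annulus is included because it exercises the term with $\chi(S^{-})=0$ , which the disc and the stadium do not.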

3.2 Minimisers of the shape rate function and their stability

In this section we prove Theorem 2.2.

(1) The proof relies on the Brunn-Minkowski inequality and on Lemma 3.3 below, which provides three reformulations of the isoperimetric inequality in (2.6).

Lemma 3.3.

Let $S\in\mathcal{F}$ . If $R>2$ and $|S|=\pi R^{2}$ , then the following three statements are equivalent:

(a)

$|S|-\kappa|S^{-}|\geq\pi R^{2}-\kappa\pi(R-2)^{2}$ .

(b)

$16\pi|S|\leq(|S\setminus S^{-}|+4\pi)^{2}$ .

(c)

$16\pi|S^{-}|\leq(|S\setminus S^{-}|-4\pi)^{2}$ .

Moreover, equality holds in (a), (b), and (c) simultaneously, or in none.

Proof.

The equivalence of (b) and (c) is an immediate consequence of the fact that $|S|=|S^{-}|+|S\setminus S^{-}|$ . For the equivalence of (a) and (b), we observe that $|S|-\kappa|S^{-}|=\kappa|S\setminus S^{-}|-(\kappa-1)|S|$ and therefore, with $|S|=\pi R^{2}$ , (a) is equivalent to

$$|S\setminus S^{-}|\geq 4\pi(R-1). \tag{3.9}$$

We add $4\pi$ to both sides and take the square to find that (a) is equivalent to (b). ∎
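The three reformulations in Lemma 3.3 can be checked numerically. The sketch below is an illustration only: the function `lemma_3_3_slacks` is a hypothetical helper, the value $\kappa=2$ is arbitrary, and the stadium (the $2$ -halo of a smaller stadium, viewed in the plane rather than on the torus) is an assumed test shape. For a disc all three slacks vanish, while for the stadium they are strictly positive.

```python
import math

def lemma_3_3_slacks(area_S, area_S_minus, kappa):
    """Return the slack (>= 0 when the inequality holds) in (a), (b), (c) of Lemma 3.3."""
    R = math.sqrt(area_S / math.pi)        # radius defined through |S| = pi R^2
    shell = area_S - area_S_minus          # |S \ S^-|
    slack_a = (area_S - kappa * area_S_minus) \
        - (math.pi * R ** 2 - kappa * math.pi * (R - 2) ** 2)
    slack_b = (shell + 4 * math.pi) ** 2 - 16 * math.pi * area_S
    slack_c = (shell - 4 * math.pi) ** 2 - 16 * math.pi * area_S_minus
    return slack_a, slack_b, slack_c

# Disc: S = B_5(0), S^- = B_3(0); equality is expected in all three statements.
disc = lemma_3_3_slacks(math.pi * 25, math.pi * 9, kappa=2.0)

# Stadium: S^- has radius r and straight part ell, S is its 2-halo.
r, ell = 1.0, 2.0
stadium = lemma_3_3_slacks(
    math.pi * (r + 2) ** 2 + 2 * (r + 2) * ell,   # |S|
    math.pi * r ** 2 + 2 * r * ell,               # |S^-|
    kappa=2.0,
)

print(disc)
print(stadium)
```

Since $|S|=|S^{-}|+|S\setminus S^{-}|$ , the slacks in (b) and (c) coincide exactly, which the stadium output also reflects.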

Proof of Theorem 2.2.

Armed with Lemma 3.3, we employ the Brunn-Minkowski inequality

$$|F+B|^{1/2}\geq|F|^{1/2}+|B|^{1/2}, \tag{3.10}$$

which is valid for any non-empty measurable $F$ , $B$ and $F+B$ (see Lusternik [26] and Federer [14, 3.2.41]). Indeed, (3.10) with $B=B_{2}(0)$ implies

$$|F^{+}|^{1/2}\geq|F|^{1/2}+2\sqrt{\pi} \tag{3.11}$$

and yields inequality (c) with $F=S^{-}$ and $F^{+}=S$ , and thus also (2.6) by (a). Equality in (3.11) occurs only if $F=S^{-}$ is a disc or a point (see Burago and Zalgaller [3, Section 8.2.1]).
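For the reader's convenience, the passage from the Brunn-Minkowski inequality to statement (c) of Lemma 3.3 can be spelled out as follows. This is a sketch of the intermediate algebra, using only $|B_{2}(0)|=4\pi$ , $F=S^{-}$ , $F^{+}=S$ and $|S\setminus S^{-}|=|S|-|S^{-}|$ :

```latex
\begin{align*}
|S|^{1/2} &\geq |S^{-}|^{1/2} + |B_{2}(0)|^{1/2} = |S^{-}|^{1/2} + 2\sqrt{\pi}, \\
|S| &\geq |S^{-}| + 4\sqrt{\pi}\,|S^{-}|^{1/2} + 4\pi, \\
|S\setminus S^{-}| - 4\pi &\geq 4\sqrt{\pi\,|S^{-}|} \;\geq\; 0, \\
\bigl(|S\setminus S^{-}| - 4\pi\bigr)^{2} &\geq 16\pi\,|S^{-}|.
\end{align*}
```

The third line shows that the left-hand side is non-negative, which is what makes the squaring in the last step legitimate.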

(2) We first prove that if $S$ is close to a minimiser, then necessarily $S^{-}$ is connected and simply connected.

Lemma 3.4.

There exists a function $\varepsilon\mapsto\xi(\varepsilon)$ satisfying $\lim_{\varepsilon\downarrow 0}\xi(\varepsilon)=0$ such that if $S\in\mathcal{S}$ satisfies (2.7) with $R-2\geq\frac{\xi(\varepsilon)}{1-\xi(\varepsilon)}$ , then

[TABLE]

for sufficiently small $\varepsilon$ , and $S^{-}$ is connected and simply connected.

Proof.

We use the claim about the stability of the Brunn-Minkowski inequality, first proven qualitatively by Christ [6] and then quantitatively by Figalli and Jerison [16]. Actually, for our purposes the qualitative version [6] suffices, since we are only using it as a springboard for more accurate bounds to be considered later.

Adapted to our setting, the claim is that there exists a positive function $\widetilde{\xi}(\delta)$ with $\lim_{\delta\to 0}\widetilde{\xi}(\delta)=0$ , such that if

[TABLE]

with sufficiently small $\delta$ , then there exists a compact convex set $K\subset{\mathbb{T}}$ such that

[TABLE]

In the same vein, a (properly scaled and shifted) disc $D$ satisfies

[TABLE]

To verify the assumption in (3.13), we rewrite (2.7) as

[TABLE]

and use an equivalent formulation of (3.13), namely,

[TABLE]

Indeed, (3.16) with $\lvert S\rvert=\pi R^{2}$ implies that the left-hand side of (3.17) is bounded by $\pi\varepsilon$ , which, with the choice $\delta=\sqrt{\varepsilon}$ , is bounded by the right-hand side of (3.17), and thus (3.13) holds as well.

Note that, in view of the convexity of $K^{-}$ , the condition in (3.15) implies that $K^{-}\subset(D^{-}+B_{\tilde{\xi}(\sqrt{\varepsilon}\,)^{2}}(0))$ . Indeed, suppose without loss of generality that the centre of the disc $D$ is at the origin and write $r+2$ for its radius, so that $D^{-}=B_{r}(0)$ . If $x\in K^{-}\setminus D^{-}$ , then $K^{-}$ contains the union of $D^{-}$ and the wedge bordered by $\partial D^{-}$ and the tangents through $x$ to $D^{-}$ . The volume of this wedge is $r(\sqrt{x^{2}-r^{2}}-\arctan(\frac{\sqrt{x^{2}-r^{2}}}{r}))$ . Asymptotically, for small $\lvert x\rvert-r$ , this equals $\frac{1+r^{2}}{\sqrt{r}}\sqrt{\lvert x\rvert-r}$ and exceeds $\pi R^{2}\tilde{\xi}$ once $\lvert x\rvert>r(1+\tilde{\xi}^{2})$ . Thus, $K^{-}\subset B_{r+\tilde{\xi}}(0)$ and

[TABLE]

For an admissible $S\in\mathcal{S}$ , we can significantly strengthen the second claim in (3.14). We can argue that $S\supset B_{r+2-s}(0)=B_{r+2}(0)\ominus B_{s}(0)$ with $s=(\tfrac{1}{2}\pi R^{2}\tilde{\xi}(\sqrt{\varepsilon}\,))^{2/3}$ . Indeed, if $x\in({\mathbb{T}}\setminus S)\cap B_{r+2-s}(0)$ , then $B_{2}(x)\cap B_{r-s}(0)\cap S^{-}=\emptyset$ , while $\lvert B_{2}(x)\cap K^{-}\rvert\geq\lvert B_{2}(x)\cap B_{r}(0)\rvert\geq\frac{8}{3}s^{3/2}=\frac{4}{3}\pi R^{2}\tilde{\xi}(\sqrt{\varepsilon}\,)$ (see [21, Eq. (D.8)]), in contradiction with the second inequality in (3.14). Combining the present claim with (3.18), we get

[TABLE]

Given that $\lvert S\rvert=\pi R^{2}$ , this implies

[TABLE]

yielding, for sufficiently small $\varepsilon$ , the statement in (3.12) with

[TABLE]

Using that

[TABLE]

we will prove connectedness and simple connectedness of the set $S^{-}$ by showing that every segment $\ell_{e}=\{te\colon\,t\in[0,R+\xi]\}$ in the direction of any unit vector $e\in{\mathbb{R}}^{2}$ intersects the set $S^{-}$ in a closed interval: $S^{-}\cap\ell_{e}=\{te\colon\,t\in[0,T(e)]\}$ with $T(e)\in[R-2-\xi,R-2+\xi]$ . If the contrary were true, then there would be a direction $e$ and two points $x,y\in\{te\colon\,t\in[R-2-\xi,R-2+\xi]\}\subset\ell_{e}$ with $\lvert x\rvert<\lvert y\rvert$ such that $x\not\in S^{-}$ and $y\in S^{-}$ . The fact that $y\in S^{-}$ implies that $B_{R-\xi}\cup B_{2}(y)\subset S$ , while $x\not\in S^{-}$ implies that $B_{2}(x)\cap S^{\rm{c}}\neq\emptyset$ , and hence, in view of the preceding inclusion, $B_{2}(x)\cap(B_{R-\xi}\cup B_{2}(y))^{\rm{c}}\neq\emptyset$ . However, this cannot happen when $B_{2}(x)\setminus B_{2}(y)\subset B_{R-\xi}$ , and this inclusion is exactly what holds when $R-2\geq\xi/(1-\xi)$ . Indeed, $B_{2}(x)\setminus B_{2}(y)\subset B_{R-\xi}$ once $\partial B_{2}(y)\cap\ell\subset B_{R-\xi}(0)$ , where $\ell$ is the line through $y$ orthogonal to $\ell_{e}$ . For $z\in\partial B_{2}(y)\cap\ell$ we have

[TABLE]

when $\frac{R-2}{R-1}\geq\xi$ , which is equivalent to $R-2\geq\frac{\xi}{1-\xi}$ . ∎

To finish the proof of Theorem 2.2, we use that $S^{-}$ is connected and simply connected once (2.7) is satisfied and $R-2\geq\frac{\xi(\varepsilon)}{1-\xi(\varepsilon)}$ . For the latter, as in the proof of Lemma 3.1(7), we may assume that $S\in\mathcal{S}^{\textrm{fin}}$ . For fixed $R\in(2,\tfrac{1}{2}L)$ we choose $\varepsilon_{0}$ such that $R-2\geq\frac{\xi(\varepsilon)}{1-\xi(\varepsilon)}$ for any $0<\varepsilon\leq\varepsilon_{0}$ . Using now that $\operatorname{reach}(S^{-})\geq 2$ according to Lemma 3.1(7), we will rely on the Bonnesen inequality, which is more precise than the provisional claim in (3.12) with the bound $\xi(\varepsilon)$ whose dependence on $\varepsilon$ is not explicitly specified. For connected and simply connected $S^{-}$ , its boundary $\partial S^{-}$ is a Jordan curve and according to the Bonnesen inequality the difference of the radii $r_{\mathrm{out}}(\partial S^{-})$ and $r_{\mathrm{in}}(\partial S^{-})$ of the outer and the inner circle of the curve $\partial S^{-}$ can be bounded in terms of the isoperimetric defect $\mathcal{H}^{1}(\partial S^{-})^{2}-4\pi\lvert S^{-}\rvert$ :

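$$\pi^{2}\bigl(r_{\mathrm{out}}(\partial S^{-})-r_{\mathrm{in}}(\partial S^{-})\bigr)^{2}\leq\mathcal{H}^{1}(\partial S^{-})^{2}-4\pi\lvert S^{-}\rvert. \tag{3.24}$$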

Assuming that $|S|=\pi R^{2}$ , we can write

[TABLE]

with the help of the Steiner formula (Lemma 3.2) and reformulate the isoperimetric defect as

[TABLE]

If the left-hand side of (3.25) is $\leq\pi\kappa\varepsilon$ , then the right-hand side of (3.26) is bounded from above by

[TABLE]

It follows from (3.24) that

[TABLE]

Since $r_{\mathrm{out}}(\partial S)-r_{\mathrm{in}}(\partial S)=r_{\mathrm{out}}(\partial S^{-})-r_{\mathrm{in}}(\partial S^{-})$ , we get the claim in (2.8). ∎

3.3 Large deviation principle for Widom-Rowlinson

In this section we prove Theorems 2.1 and 2.3.

Recall the Gibbs measure $\mu_{\beta}$ at inverse temperature $\beta$ and activity $z=\kappa\,z_{t}(\beta)$ . This is a probability measure on the space $\Gamma$ of particle configurations, which we may view as a subset of $\mathcal{F}$ equipped with the Hausdorff topology. By a slight abuse of notation, we identify $\mu_{\beta}$ on $\Gamma$ with the measure on $\mathcal{F}$ supported on $\Gamma$ . Theorem 2.1 builds on the large deviation principle for the Gibbs measure $\mu_{\beta}$ itself, i.e., the large deviation principle for the set of particle locations.

The LDP for $\mu_{\beta}$ is summarised in the following proposition (recall that a rate function is called good when it is lower semi-continuous and has compact level sets). This proposition is in the spirit of Schreiber [32], [33, Theorem 1]. The latter is stated in a slightly different setting, but the main ideas of the proof carry over.

Proposition 3.5 (Large deviation principle for Widom-Rowlinson).

The family of probability measures $(\mu_{\beta})_{\beta\geq 1}$ on $\mathcal{F}$ , supported on $\Gamma\subset\mathcal{F}$ , satisfies the LDP with rate $\beta$ and good rate function $I^{\mathrm{WR}}$ given by

[TABLE]

Proof.

Let $\Pi_{\kappa\beta}$ be the homogeneous Poisson point process on ${\mathbb{T}}$ with intensity $\kappa\beta$ . We may view $\Pi_{\kappa\beta}$ as a random variable on a probability space $(\Omega,\mathcal{B},{\mathbb{P}})$ taking values in $\mathcal{F}$ . Since ${\mathbb{P}}(\Pi_{\kappa\beta}\subset F)={\mathbb{P}}(\Pi_{\kappa\beta}\cap({{\mathbb{T}}}\setminus F)=\emptyset)=\mathrm{e}^{-\kappa\beta|{{\mathbb{T}}}\setminus F|}$ , $F\in\mathcal{F}$ , it follows that the family $(\Pi_{\kappa\beta})_{\beta\geq 1}$ satisfies the large deviation principle with rate $\beta$ and good rate function $I(F)=\kappa|{{\mathbb{T}}}\setminus F|$ , $F\in\mathcal{F}$ . Note that $F\mapsto|F|$ is upper semi-continuous (Schneider and Weil [31, Theorem 12.3.5]), but not continuous: sets $F$ of positive measure can be approximated by finite sets, which have measure zero. It follows that $F\mapsto I(F)=\kappa(|{{\mathbb{T}}}|-|F|)$ is lower semi-continuous, but not continuous. Nevertheless, the map $F\mapsto|F^{+}|=|h(F)|$ is continuous with respect to the Hausdorff metric (see Lemma 3.1(3)). Therefore, since

[TABLE]

the claim follows from the LDP for the Poisson point process $(\Pi_{\kappa\beta})_{\beta\geq 1}$ and Varadhan’s lemma. ∎
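The void probability ${\mathbb{P}}(\Pi_{\kappa\beta}\subset F)=\mathrm{e}^{-\kappa\beta|{\mathbb{T}}\setminus F|}$ that drives the proof can be checked by simulation. The following Monte Carlo sketch uses illustrative values that are not from the paper: a unit torus, intensity $3$ in place of $\kappa\beta$ , and a square hole of area $0.25$ in place of ${\mathbb{T}}\setminus F$ . The helper names are hypothetical, and the Poisson sampler is Knuth's multiplication method, which is adequate only for small means.

```python
import math
import random

def sample_poisson(mean, rng):
    """Knuth's multiplication method for a Poisson variate (fine for small means)."""
    threshold = math.exp(-mean)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count

def void_probability_mc(intensity, side, hole, trials, rng):
    """Fraction of Poisson samples on [0, side)^2 with no point in the rectangle `hole`."""
    x0, x1, y0, y1 = hole
    empty = 0
    for _ in range(trials):
        n = sample_poisson(intensity * side * side, rng)
        points = [(rng.uniform(0, side), rng.uniform(0, side)) for _ in range(n)]
        if not any(x0 <= x < x1 and y0 <= y < y1 for x, y in points):
            empty += 1
    return empty / trials

rng = random.Random(0)
intensity, side = 3.0, 1.0          # stand-ins for kappa * beta and the torus side
hole = (0.0, 0.5, 0.0, 0.5)         # plays the role of T \ F, area 0.25
estimate = void_probability_mc(intensity, side, hole, trials=20000, rng=rng)
exact = math.exp(-intensity * 0.25)

print(estimate, exact)
```

The exponential decay in the area of the hole is what produces the rate function $\kappa|{\mathbb{T}}\setminus F|$ at speed $\beta$ .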

With the help of Proposition 3.5, the proof of Theorems 2.1 and 2.3 becomes straightforward via the contraction principle.

Proof of Theorem 2.1.

As mentioned above, the map $\mathcal{F}\to\mathcal{S}$ defined by $F\mapsto S=F^{+}$ is continuous with respect to the Hausdorff metric. Proposition 3.5 and the contraction principle therefore imply that the LDP for the law of $h(\gamma)$ under $\mu_{\beta}$ holds with rate $\beta$ and good rate function $I$ given by

[TABLE]

We show that $J(S)=|S|-\kappa|S^{-}|$ . Indeed, if $F^{+}=S$ , then $F\subset S^{-}$ and $|F^{+}|-\kappa|F|\geq|S|-\kappa|S^{-}|$ yielding $J(S)\geq|S|-\kappa|S^{-}|$ . On the other hand, taking $F=S^{-}$ with $F^{+}=(S^{-})^{+}=S$ in view of admissibility of $S$ , we get $J(S)\leq|F^{+}|-\kappa|F|=|S|-\kappa|S^{-}|$ . ∎

Proof of Theorem 2.3.

Theorem 2.3 follows from Proposition 3.5, the continuity of the map $F\mapsto|F^{+}|=|h(F)|$ , and the contraction principle. The rate function $I^{*}$ is given by

[TABLE]

In the difference of the two infima, the first infimum (when $A=\pi R^{2}$ with $R\in(2,\frac{1}{2}L)$ ) is equal to $\pi R^{2}-\kappa\pi(R-2)^{2}$ by Theorem 2.2. For the second infimum, we note that

[TABLE]

with equality for $S=\mathbb{T}$ . ∎

4 Heuristics for moderate deviations

In this section we provide the main ideas behind the proof of Theorems 2.5–2.6 in Sections 5–8. Guidance is needed because the proof is long and intricate. In Section 4.1 we explain how the moderate deviation probability for the halo volume can be expressed in terms of a certain surface integral. In Section 4.2 we explain how the weight in this surface integral can be approximated in terms of the polar coordinates of the boundary points. In Section 4.3 we provide a quick guess of what the orders of magnitude of the angles and the radii of the boundary points are as $\beta\to\infty$ . In Section 4.4 we introduce auxiliary random processes that allow us to transform the surface integral into an expectation of a certain exponential functional, capturing the global (= mesoscopic) scaling of the boundary of the critical droplet. In Section 4.5 we perform a further change of variable to rewrite the expectation in terms of an effective interface model, capturing the local (= microscopic) scaling of the boundary of the critical droplet.

4.1 Reduction to a surface integral

The starting point for the proof of Theorem 2.5 is the following. Because of the large deviation principles in Theorems 2.1 and 2.3 and the quantitative isoperimetric inequality in Theorem 2.2, the dominant contribution to the event $|V(\gamma)-\pi R_{c}^{2}|\leq C\beta^{-2/3}$ should come from approximately disc-shaped halos (“droplets”). Consider the event

[TABLE]

that $h(\gamma)$ is close to a disc $B_{R_{c}}(x)$ , without any holes. Because of the translation invariance of the model, we may focus on $\mathcal{D}_{\varepsilon}(0)$ .

For $\gamma\in\mathcal{D}_{\varepsilon}(0)$ , the boundary $\partial h(\gamma)$ of the droplet is a union of circular arcs centred at points $z_{1},\ldots,z_{n}\in\gamma$ , called boundary points. Each boundary point $x\in\{z_{1},\ldots,z_{n}\}$ is extremal in the sense that $h(\gamma\setminus x)\subsetneq h(\gamma)$ . We call a collection of points $\{z_{1},\ldots,z_{n}\}$ a connected outer contour if there exists a halo $S$ with a simply connected $2$ -interior $S^{-}$ having exactly these boundary points. The halo $S$ , if it exists, is unique, and we denote it by $S(z)$ . The set of connected outer contours is denoted by $\mathcal{O}$ .

For $\gamma\in\mathcal{D}_{\varepsilon}(0)$ , both $h(\gamma)$ and $V(\gamma)$ are uniquely determined by the boundary points, since $h(\gamma)=S(z)$ and $V(\gamma)=|S(z)|$ . Therefore it makes sense to compute probabilities by conditioning on the boundary points. Abbreviate

[TABLE]

We will see that the following is true: for some constant $c>0$ and for all measurable sets $A\subset\mathcal{S}$ , as $\beta\to\infty$ ,

[TABLE]

where $\mathcal{D}^{\prime}_{\varepsilon}(0)$ is the collection of connected outer contours for which $d_{\mathrm{H}}(\partial S(z),\partial B_{R_{c}}(0))\leq\varepsilon$ , and $I^{*}$ is the rate function defined in (2.10). In view of this geometric constraint, the only contributions to the right-hand side of (4.3) are from connected outer contours $z\in\mathcal{O}$ that lie in an annulus: $\bigl||z_{i}|-(R_{c}-2)\bigr|\leq\varepsilon$ , $1\leq i\leq n$ . Hence we may think of (4.3) as a surface integral.

4.2 Approximation of the surface term

In view of (4.3), our next task is to evaluate $\Delta(z)$ . Let us choose polar coordinates for the boundary points and write

[TABLE]

Upon relabelling the centres, we may without loss of generality assume that $0\leq t_{1}\leq\cdots\leq t_{n}<2\pi$ . We set $t_{n+1}=t_{1}+2\pi$ and $r_{n+1}=r_{1}$ , and define angular increments

[TABLE]

Note that $\theta_{i}\geq 0$ and $\sum_{i=1}^{n}\theta_{i}=2\pi$ . The volume of the shape $S(z)$ with boundary points $z=(z_{1},\ldots,z_{n})$ admits an expansion

[TABLE]

Also,

[TABLE]
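The polar-coordinate bookkeeping above (sorting the angles, closing the cycle with an extra angle $2\pi$ beyond the first one, and taking successive differences) can be sketched as follows. The definition $\theta_{i}=t_{i+1}-t_{i}$ is an assumption consistent with $\theta_{i}\geq 0$ and $\sum_{i}\theta_{i}=2\pi$ ; the function name and the sample points are hypothetical.

```python
import math

def angular_increments(points):
    """Sort boundary points by polar angle and return the increments theta_i.

    Assumes theta_i = t_{i+1} - t_i with the cyclic convention t_{n+1} = t_1 + 2*pi.
    """
    angles = sorted(math.atan2(y, x) % (2 * math.pi) for x, y in points)
    closed = angles + [angles[0] + 2 * math.pi]
    return [b - a for a, b in zip(closed, closed[1:])]

# Hypothetical boundary points close to a circle of radius 3, given in no particular order.
raw_angles = [4.0, 0.3, 2.2, 5.9, 1.1]
radii = [3.02, 2.97, 3.00, 3.05, 2.99]
points = [(r * math.cos(t), r * math.sin(t)) for r, t in zip(radii, raw_angles)]

theta = angular_increments(points)
print(theta)
print(math.isclose(sum(theta), 2 * math.pi))
```

The increments depend only on the angles, so perturbing the radii leaves them unchanged; the radial information enters separately through the $r_{i}$ .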

The previous formulas are valid for general radius $R$ . Next we specialize to $R=R_{c}$ . With

[TABLE]

and using that $\kappa-1=2/(R_{c}-2)$ , we get

[TABLE]

with

[TABLE]
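As a consistency check on the identity $\kappa-1=2/(R_{c}-2)$ used above, one can recover it from the disc value $\pi R^{2}-\kappa\pi(R-2)^{2}$ appearing in Theorem 2.2. The computation below assumes, as is not restated in this section, that $R_{c}$ is the stationary point of this expression as a function of $R$ :

```latex
\begin{align*}
\frac{\mathrm{d}}{\mathrm{d}R}\bigl[\pi R^{2}-\kappa\pi(R-2)^{2}\bigr]
  &= 2\pi R-2\kappa\pi(R-2) = 0 \\
  &\Longleftrightarrow\quad R_{c}=\frac{2\kappa}{\kappa-1}
  \quad\Longleftrightarrow\quad R_{c}-2=\frac{2}{\kappa-1}.
\end{align*}
```

The second derivative equals $2\pi(1-\kappa)<0$ for $\kappa>1$ , so under this assumption the stationary point is a maximum in $R$ , in line with the interpretation of the critical droplet as a saddle point.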

4.3 Orders of magnitude

Neglecting higher order terms in $\Delta(z)$ in (4.9), we see that the weight $\exp(-\beta\Delta(z))$ involves several terms. The factor $\exp(-\beta C_{1}\theta_{i}^{3})$ suggests that the typical angular increment $\theta_{i}$ is of order $\beta^{-1/3}$ , and the typical number of boundary points therefore is of order $\beta^{1/3}$ . The factor

[TABLE]

suggests that $\rho_{i+1}-\rho_{i}$ is approximately normal with variance proportional to $\theta_{i}/\beta$ . Hence, we expect that the radial increment $\rho_{i+1}-\rho_{i}$ is typically of order $\beta^{-2/3}$ . Combining these observations, we may expect that

[TABLE]

which explains the exponent $\beta^{1/3}$ in Theorem 2.5. Furthermore, it will be natural to think of $\rho_{i}$ as

[TABLE]

with $m$ some unknown mean value, and $(B_{t})_{t\geq 0}$ the mean-centred Brownian bridge from (2.18). The consistency with the guessed order of magnitude of the radial increment is guaranteed by the fact that $B_{t_{i+1}}-B_{t_{i}}\approx\sqrt{\theta_{i}}\approx\beta^{-1/6}$ and the observation that $\beta^{-1/2-1/6}=\beta^{-2/3}$ . Finally, we note that

[TABLE]

which should not contribute at the scale $\beta^{1/3}$ we are interested in (unless $m$ is large). Nevertheless, we will need to treat this term carefully because, for the mean-centred Brownian bridge we have

[TABLE]

and extra arguments will be needed to cure this divergence.
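The exponent bookkeeping in the heuristics above can be checked mechanically. The following is a minimal sketch, not part of the argument: each quantity "of order $\beta^{e}$" is represented by its exponent $e$, and the variable names (`theta`, `radial_inc`, and so on) are ours, introduced only for illustration.

```python
from fractions import Fraction as F

# A quantity "of order beta**e" is stored as its exponent e.
theta = F(-1, 3)                  # angular increment: beta * theta**3 = O(1)
n_points = -theta                 # number of boundary points ~ 2*pi / theta

# Radial increment: Gaussian with variance ~ theta / beta, so sd ~ sqrt(theta / beta).
radial_inc = (theta - 1) / 2

# Brownian picture: rho_i ~ beta**(-1/2) * B_{t_i}, increments of B ~ sqrt(theta).
bridge_inc = theta / 2
brownian_radial_inc = F(-1, 2) + bridge_inc


def check_scaling():
    """Return all exponents so that their consistency can be inspected."""
    return {
        "beta_theta_cubed": 1 + 3 * theta,
        "theta": theta,
        "n_points": n_points,
        "radial_inc": radial_inc,
        "bridge_inc": bridge_inc,
        "brownian_radial_inc": brownian_radial_inc,
    }
```

Both routes to the radial increment give the exponent $-2/3$, which is the consistency noted in the text.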

For later purpose, let us also have a closer look at the volume constraint $|V(\gamma)-\pi R_{c}^{2}|\leq C\beta^{-2/3}$ . If we substitute the expression (4.6) for $V(\gamma)$ and neglect higher order terms, then the volume constraint becomes

[TABLE]

Making a few leaps of faith, we may approximate

[TABLE]

using that $\int_{0}^{2\pi}B_{t}\mathrm{d}t=0$ for the mean-centred Brownian bridge. From the considerations above we should expect the sum overall to be of order $\beta^{-2/3}$ . Hence $[(\kappa-1)\beta]^{-1/2}m$ should also be of order $\beta^{-2/3}$ , i.e., $|m|=O(\beta^{-1/6})$ . Later we will only prove that $|m|=O(\beta^{1/6})$ , but this will turn out to be enough for our purpose.

4.4 Global scaling: auxiliary random processes

If we substitute the approximation (4.9) for $\Delta(z)$ into the surface integral in (4.3) and drop error terms and indicators, we are naturally led to the investigation of expressions of the type

[TABLE]

where $f$ is some non-negative test function, and we recall (4.4) and (4.8) (by convention the summand with $n=0$ equals $1$ ). The Gaussian term (see also (4.11)) is conveniently expressed with the heat kernel

[TABLE]

and we have

[TABLE]

Let us approximate $R_{c}-2+\rho_{i}\approx R_{c}-2$ , drop the term $\sum_{i}{\bar{\rho}_{i}}^{2}\theta_{i}$ , and change variables as $x_{i}=\sqrt{(\kappa-1)\beta}\,\rho_{i}$ . Then the integral in (4.18) becomes

[TABLE]

With the help of

[TABLE]

and $C_{1}=G_{\kappa}^{3}/24$ , the expression (4.21) is rewritten as

[TABLE]

This expression motivates the auxiliary processes and the expressions $Y_{0}$ and $Y_{1}$ introduced before Theorem 2.5. Moreover, $\beta$ and $\kappa$ only enter in the combination $\beta^{1/3}G_{\kappa}$ (except possibly in the test function $f$ ), which explains the scaling of the surface corrections in Theorem 2.5.

The picture that emerges of the droplet boundary is that its deviation from $\partial B_{R_{c}}(0)$ should be of the order of $\beta^{-1/2}$ , and that the boundary points are obtained by selecting points of a Gaussian curve according to an angular point process. The process $\mathcal{T}$ has a number of points of order $\beta^{1/3}$ . We may call this picture global since it describes the overall shape of the droplet – the Gaussian process $(B_{t})_{t\in[0,2\pi]}$ does not see the microscopic details.
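To make the global picture concrete, here is a rough simulation sketch. It is a cartoon under explicit assumptions that are ours and not the paper's: the angular point process $\mathcal{T}$ is replaced by a homogeneous Poisson process of intensity $\beta^{1/3}$ per unit angle (the actual process is the one introduced before Theorem 2.5), the mean value $m$ is set to $0$, and the mean-centred Brownian bridge is approximated by a random walk on a uniform grid. The function name `sample_global_picture` is hypothetical. The critical radius is obtained from $R_{c}=\kappa(R_{c}-2)$.

```python
import math
import random


def sample_global_picture(beta, kappa, grid=2048, seed=0):
    """Cartoon sample of boundary points (angles, radii) near the critical disc."""
    rng = random.Random(seed)
    r_c = 2.0 * kappa / (kappa - 1.0)          # solves R_c = kappa * (R_c - 2)
    two_pi = 2.0 * math.pi
    dt = two_pi / grid

    # Random-walk approximation of Brownian motion on the grid k * dt.
    walk = [0.0]
    for _ in range(grid):
        walk.append(walk[-1] + rng.gauss(0.0, math.sqrt(dt)))
    # Pin the endpoint to get a bridge, then subtract the time average.
    bridge = [w - (k / grid) * walk[-1] for k, w in enumerate(walk)]
    mean = sum(bridge[:-1]) / grid
    bridge = [b - mean for b in bridge]

    # ASSUMPTION: Poisson stand-in for the angular process (exponential gaps).
    intensity = beta ** (1.0 / 3.0)
    angles, t = [], rng.expovariate(intensity)
    while t < two_pi:
        angles.append(t)
        t += rng.expovariate(intensity)

    # Radial deviations: scaled bridge, linearly interpolated at the angles.
    scale = 1.0 / math.sqrt((kappa - 1.0) * beta)
    radii = []
    for a in angles:
        k = min(int(a / dt), grid - 1)
        frac = a / dt - k
        b = (1.0 - frac) * bridge[k] + frac * bridge[k + 1]
        radii.append((r_c - 2.0) + scale * b)
    return angles, radii, r_c
```

For large $\beta$ the sampled radii stay within a distance of order $\beta^{-1/2}$ of $R_{c}-2$, while the number of sampled angles grows like $\beta^{1/3}$.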

4.5 Local scaling: effective interface model

Equation (4.18) suggests one last change of variables, namely, set

[TABLE]

Note that

[TABLE]

Then (4.21) becomes

[TABLE]

or, equivalently,

[TABLE]

Let us finally return to the term $\sum_{i=1}^{n}\bar{\rho}_{i}^{2}\theta_{i}$ that we had dropped from (4.18). By (4.25), we have

[TABLE]

Taking this term into account, we see that (4.27) should be replaced by the more accurate integral

[TABLE]

We may view the exponential, together with an additional indicator for the boundary points, as the Boltzmann weight for an effective interface model, which is studied in detail in [22]. The term $\frac{1}{2G_{\kappa}^{2}\beta^{2/3}}\sum_{i=1}^{n}\bar{\varphi}_{i}^{2}\vartheta_{i}$ plays the role of a background potential.

5 Stochastic geometry: approximation of geometric functionals

This section collects a number of geometric facts that will be needed for the moderate deviations of the halo volume. In Section 5.1 we prove a number of a priori estimates on the radial and the angular coordinates of the boundary points, i.e., the centres of the discs that lie at the boundary of the critical droplet (Lemma 5.1, Proposition 5.2, Definition 5.3 and Corollary 5.4). These estimates play a crucial role for the arguments in Sections 6–8. In Section 5.2 we show that the set of boundary points allows for a local characterisation, in the sense that whether or not a 2-disc touches the boundary of the critical droplet only depends on the centres of the two neighbouring 2-discs (Definition 5.5, Lemma 5.6 and Proposition 5.7). In Section 5.3 we derive an approximation for the volume and the surface of halos that are close to a critical disc, in terms of certain sums involving the radial and the angular coordinates of the boundary points (Proposition 5.8). In Section 5.4 we do the same for the geometric centre of halos that are close to a critical disc (Proposition 5.9 and Corollary 5.10). The a priori estimates in Section 5.1 allow us to control the approximations derived in Sections 5.3–5.4.

5.1 A priori estimates on boundary points

Theorem 2.2 can be applied to sets of the form $S=h(\gamma)$ , the halo of the configuration $\gamma$ . In particular, the condition that the boundary $\partial S$ is close to a disc $B_{R}$ with $R>2$ is a strong restriction on the geometry of the boundary points $(z_{1},\ldots,z_{n})$ (recall Fig. 5).

In this section we collect several a priori estimates and constraints that follow from the fact that $S=h(\gamma)$ has a simply connected 2-interior $S^{-}$ and $d_{\text{\rm{H}}}(\partial S,\partial B_{R})\leq\varepsilon$ . Recall the notions of boundary points and connected outer contour $\mathcal{O}$ introduced in Section 4.1, and the polar coordinates $(r_{i},t_{i})_{i=1}^{n}$ and angular increments $\theta_{i}$ from (4.4) and (4.5). We write $\ell_{z_{i}}$ or $\ell_{i}$ to denote the ray from the origin passing through the point $z_{i}$ , and $A_{R,\varepsilon}$ and $A_{R-2,\varepsilon}$ to denote the $\varepsilon$ -annuli defined as the closures of $B_{R+\varepsilon}(0)\setminus B_{R-\varepsilon}(0)$ and $B_{R-2+\varepsilon}(0)\setminus B_{R-2-\varepsilon}(0)$ , respectively.

First we show that the intersections of the boundary circles in $\partial S$ follow the order of the corresponding boundary points:

Lemma 5.1.

Fix $R>2$ . If $z\in{\mathcal{O}}$ and $d_{\text{{\rm H}}}(\partial S(z),\partial B_{R}(0))\leq\varepsilon$ , then as $\varepsilon\downarrow 0$ :

(a)

$z_{i}\in A_{R-2,\varepsilon}$ for all $1\leq i\leq n$ .

(b)

The distance between any two points $x,x^{\prime}\in A_{R-2,\varepsilon}$ such that $\partial B_{2}(x)\cap\partial B_{2}(x^{\prime})\cap A_{R,\varepsilon}\neq\emptyset$ satisfies

[TABLE]

and the angle $\theta_{xx^{\prime}}$ between the rays $\ell_{x}$ and $\ell_{x^{\prime}}$ satisfies

[TABLE]

(c)

For every $1\leq i\leq n$ there exists a unique $v_{i}\in\partial B_{2}(z_{i})\cap\partial B_{2}(z_{i+1})$ such that $v_{i}\in A_{R,\varepsilon}$ .

(d)

The boundary $\partial S(z)$ consists of the union of closed arcs of the circles $\partial B_{2}(z_{i})$ between the points $v_{i}$ and $v_{i+1}$ , $1\leq i\leq n$ , contained in $A_{R,\varepsilon}$ (with $v_{n+1}=v_{1}$ ).

Proof.

The proof is based on a number of geometric observations.

(a) This claim is immediate from the fact that $d_{\text{{\rm H}}}(\partial S(z),\partial B_{R}(0))\leq\varepsilon$ and that each $z_{i}$ is a boundary point with $B_{2}(z_{i})\subset B_{R+\varepsilon}(0)$ and $\partial B_{2}(z_{i})\cap A_{R,\varepsilon}\neq\emptyset$ .

(b) Let $v\in\partial B_{2}(x)\cap\partial B_{2}(x^{\prime})\cap A_{R,\varepsilon}$ . To get the bound in (5.1), we note that the maximal distance between $x$ and $x^{\prime}$ and the maximal angle $\theta$ consistent with the conditions $v\in A_{R,\varepsilon}$ and $x,x^{\prime}\in A_{R-2,\varepsilon}$ occur when $v\in\partial B_{R-\varepsilon}(0)$ and $x,x^{\prime}\in\partial B_{R-2+\varepsilon}(0)$ , i.e., when the ray $0\,v$ is orthogonal to the segment $x\,x^{\prime}$ . Assume, without loss of generality, that $v=(R-\varepsilon,0)$ and $x,x^{\prime}=(x_{0},\pm y_{0})$ with $x_{0}^{2}+y_{0}^{2}=(R-2+\varepsilon)^{2}$ . Then the condition $v\in\partial B_{2}(x)\cap\partial B_{2}(x^{\prime})$ reads

[TABLE]

which, in combination with the equation $x_{0}^{2}+y_{0}^{2}=(R-2+\varepsilon)^{2}$ , yields $x_{0}=\frac{R(R-2)-2\varepsilon+\varepsilon^{2}}{R-\varepsilon}$ and implies

[TABLE]

which settles (5.1). For the corresponding angle, we have $\lvert\tan\frac{\theta}{2}\rvert\leq\frac{y_{0}}{x_{0}}$ , and hence $\lvert\theta\rvert\leq 2\arctan(\frac{y_{0}}{x_{0}})$ , which settles (5.2).
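As a sanity check of the extremal configuration in part (b), the closed form for $x_{0}$ can be tested numerically. The sketch below is ours (the function name `extremal_pair` is hypothetical). It places $v=(R-\varepsilon,0)$ and $x,x^{\prime}=(x_{0},\pm y_{0})$ on $\partial B_{R-2+\varepsilon}(0)$ and returns the resulting coordinates. Expanding $y_{0}^{2}=(R-2+\varepsilon)^{2}-x_{0}^{2}$ gives $y_{0}^{2}=8\varepsilon(R-2)/R+O(\varepsilon^{2})$, which is where the $O(\sqrt{\varepsilon}\,)$ scale comes from.

```python
import math


def extremal_pair(R, eps):
    """Return (x0, y0) for x, x' = (x0, +-y0) on the circle of radius R-2+eps
    such that both radius-2 circles pass through v = (R - eps, 0)."""
    x0 = (R * (R - 2.0) - 2.0 * eps + eps ** 2) / (R - eps)
    y0 = math.sqrt((R - 2.0 + eps) ** 2 - x0 ** 2)
    return x0, y0


def angle_bound(R, eps):
    """Upper bound 2 * arctan(y0 / x0) on the angle between the two rays."""
    x0, y0 = extremal_pair(R, eps)
    return 2.0 * math.atan(y0 / x0)
```

Calling `angle_bound(R, eps)` for decreasing `eps` shows the bound shrinking proportionally to $\sqrt{\varepsilon}$.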

(c) Let $\{v_{i},v_{i}^{\prime}\}=\partial B_{2}(z_{i})\cap\partial B_{2}(z_{i+1})$ and consider the boundary piece $\partial(B_{2}(z_{i})\cup B_{2}(z_{i+1}))$ . This consists of two arcs, $C_{i}\subset\partial B_{2}(z_{i})$ and $C_{i+1}\subset\partial B_{2}(z_{i+1})$ , both ending in the points $v_{i}$ and $v_{i}^{\prime}$ . A necessary condition for both $z_{i}$ and $z_{i+1}$ to be boundary points is that both arcs $C_{i}$ and $C_{i+1}$ intersect the annulus $A_{R,\varepsilon}$ . If, in addition, $v_{i},v_{i}^{\prime}\notin A_{R,\varepsilon}$ , then we get a contradiction with the assumption that both $z_{i}$ and $z_{i+1}$ are boundary points. Indeed, consider the line $\ell$ through $v_{i}^{\prime}$ and $v_{i}$ (and through $\overline{z_{i}}=\tfrac{1}{2}(z_{i}+z_{i+1})$ ) and the intersection point $p_{1}=\ell\cap\partial B_{R-\varepsilon}(0)$ as shown in Fig. 6.

There exists $j\neq i,i+1$ such that $p_{1}\in B_{2}(z_{j})$ . Otherwise there would be a gap in the boundary $\partial S(z)$ along $\ell\cap A_{R,\varepsilon}$ . Assuming, without loss of generality, that the line segment $z_{i}\,z_{j}$ intersects the ray $\ell_{i+1}$ (in view of (5.1), this segment cannot intersect both $\ell_{i+1}$ and $\ell_{i-1}$ ), we conclude that if $p_{1}\in B_{2}(z_{j})$ , then also $p_{2}\in B_{2}(z_{j})$ , where $p_{2}$ is the reflection of $p_{1}$ with respect to the ray $\ell_{i+1}$ . But then also $\partial B_{2}(z_{i+1})\cap A_{R,\varepsilon}\subset B_{2}(z_{j})$ , which is in contradiction with the fact that $z_{i+1}$ is a boundary point. Note that there is a severe restriction on the position of the point $z_{j}$ : it has to be contained in $B_{2}(p_{1})$ . The allowed region is shown in Fig. 6 in a darker shade. Thus, necessarily, $v_{i}\in A_{R,\varepsilon}$ , while $v_{i}^{\prime}\in B_{R-2-\varepsilon}$ , because $v_{i}^{\prime}$ is a reflection of $v_{i}$ with respect to $\overline{z_{i}}\in A_{R-2,\varepsilon}$ .

(d) If the arc of the circle $\partial B_{2}(z_{i})$ between the points $v_{i}$ and $v_{i+1}$ is intersected by a circle $\partial B_{2}(z_{j})$ for some $j\notin\{i-1,i,i+1\}$ , then necessarily $\{v_{i},v_{i+1}\}\cap B_{2}(z_{j})\neq\emptyset$ . Similarly as above, assuming that the line segment $z_{i}\,z_{j}$ intersects the ray $\ell_{i+1}$ and knowing that $v_{i}\in A_{R,\varepsilon}\cap B_{2}(z_{j})$ , we get that also its reflection with respect to the ray $\ell_{i+1}$ belongs to $B_{2}(z_{j})$ , which implies that $(\partial B_{2}(z_{i+1})\setminus B_{2}(z_{i}))\cap A_{R,\varepsilon}\subset B_{2}(z_{j})$ , in contradiction with the fact that $z_{i+1}$ is a boundary point. ∎

Let

[TABLE]

and note that $r_{i+1}-r_{i}=\rho_{i+1}-\rho_{i}$ . Abbreviate

[TABLE]

Proposition 5.2 (A priori estimates for angular and radial coordinates).

Fix $R>2$ . If $z\in\mathcal{O}$ and $d_{\text{\rm{H}}}(\partial S(z),\partial B_{R_{c}}(0))\leq\varepsilon$ , then as $\varepsilon\downarrow 0$ ,

[TABLE]

Proof.

The first estimate in (5.7) is a trivial consequence of $d_{\text{\rm{H}}}(\partial S(z),\partial B_{R}(0))\leq\varepsilon$ . The second estimate is the bound (5.2) in Lemma 5.1(b). The fourth estimate is a consequence of the second estimate. Indeed, if $\max_{1\leq i\leq n}\theta_{i}=O(\sqrt{\varepsilon}\,)$ , then $n^{-1}\leq(2\pi)^{-1}\max_{1\leq i\leq n}\theta_{i}=O(\sqrt{\varepsilon}\,)$ .

The third estimate is slightly more involved. Omit the index $i$ , similarly as in the proof of Lemma 5.1(c), and consider points $z,z^{\prime}\in A_{R-2,\varepsilon}$ with polar coordinates $z=(r,0)$ and $z^{\prime}=(r^{\prime},\theta)$ , with $\theta>0$ , $\theta=O(\sqrt{\varepsilon}\,)$ , $r^{\prime}=r-\delta$ , $\delta>0$ . Our aim is to evaluate the fraction $\frac{\delta}{\theta}$ .

To maximise $\delta$ for fixed $\theta$ , we assume that $r=R-2+\varepsilon$ and that the point $z^{\prime}$ is chosen so that a point $v$ in the intersection $\partial B_{2}(z)\cap\partial B_{2}(z^{\prime})$ lies on the inner boundary of $A_{R,\varepsilon}$ , i.e., $\lvert v\rvert=R-\varepsilon$ . We need to find $v=(x,y)\in\partial B_{R-\varepsilon}(0)\cap\partial B_{2}(z)$ satisfying the equations

[TABLE]

which yield

[TABLE]

The fact that $v\in\partial B_{2}(z^{\prime})$ implies that $\delta$ determining the position of the point $z^{\prime}$ satisfies the equation

[TABLE]

Solving for $\delta=\delta(R,\varepsilon,\theta)$ and expanding into powers in $\varepsilon$ and $\theta$ , we obtain

[TABLE]

where we drop two negative terms. ∎

In the sequel we will need six sums involving $\theta_{i},\rho_{i}$ , which we collect here.

Definition 5.3.

Fix $R>2$ . Recall (4.4) and (5.5)–(5.6). Define

[TABLE]

These expressions will appear in the expansions in Propositions 5.8 and 5.9 below. The following estimates will play a crucial role in the sequel. Note that $y_{1}(z),y_{2}(z),y_{3}(z)$ are non-negative, while $y_{4}(z),y_{5}(z),y_{6}(z)$ are not necessarily so.

Corollary 5.4 (A priori estimates for sums in approximations).

Fix $R>2$ . If $z\in\mathcal{O}$ and $d_{\text{{\rm H}}}(\partial S(z),\partial B_{R_{c}}(0))\leq\varepsilon$ , then as $\varepsilon\downarrow 0$ ,

[TABLE]

Proof.

Using the bounds (5.7), we estimate

[TABLE]

∎

5.2 Locality for boundary determination

In this section we present a crucial property of the boundary points, namely, their location is constrained only by the location of the two neighbouring boundary points (Proposition 5.7 below). This property will be used in the proof of the lower bound in Theorem 2.6 carried out in Section 8.2 (see, in particular, the proof of Lemma 8.3). It also plays an important role in [22].

Definition 5.5.


(a)

Let $z=(z_{i})_{1\leq i\leq n}=((r_{i}\cos t_{i},r_{i}\sin t_{i}))_{1\leq i\leq n}$ be a sequence of points in $A_{R-2,\varepsilon}$ in polar coordinates, ordered by angle, i.e., $t_{1}<\cdots<t_{n}$ (and with $z_{n+1}=z_{1}$ ). Suppose that for each pair $z_{i},z_{i+1}$ there exists a $v_{i}\in\partial B_{2}(z_{i})\cap\partial B_{2}(z_{i+1})\cap A_{R,\varepsilon}$ . Note that this condition implies that $S(z)$ is well defined ( $S(z)$ is obtained by filling the inner part) and that $\partial S(z)\subset A_{R,\epsilon}$ . However, the condition does not necessarily imply that $z\in{\mathcal{O}}$ , because possibly only a subset of $z$ contributes to $\partial S(z)$ .

(b)

For any $1\leq i,j\leq n$ , let $v_{i,j}\in\partial B_{2}(z_{i})\cap\partial B_{2}(z_{j})$ be such that $\lvert v_{i,j}\rvert=\max\{\lvert v\rvert\colon\,v\in\partial B_{2}(z_{i})\cap\partial B_{2}(z_{j})\}$ (if there is a tie, then take the intersection with minimal angle in polar coordinates). In polar coordinates, $v_{i,j}=(r_{i,j}\cos t_{i,j},r_{i,j}\sin t_{i,j})$ , $r_{i,j}\in[R-\epsilon,R+\epsilon]$ . Then:

(i)

A point $z_{i}$ is extremal in $z$ if $\partial B_{2}(z_{i})\cap\partial S(z)\neq\emptyset$ .

(ii)

A sequence $z$ is a set of boundary points, i.e., $z\in{\mathcal{O}}$ , if each $z_{i}$ , $1\leq i\leq n$ , is extremal.

(iii)

A triplet $(z_{i},z_{j},z_{k})$ with $t_{i}<t_{j}<t_{k}$ is called an extremal triplet if $(B_{2}(z_{j})\setminus B_{R-\epsilon}(0))\setminus(B_{2}(z_{i})\cup B_{2}(z_{k}))\neq\emptyset$ .

We need the following lemma.

Lemma 5.6.

Let $R>2+\frac{\epsilon}{1-\epsilon}$ . Consider two points $x,x^{\prime}\in A_{R-2,\varepsilon}$ . Set $\{v,v^{\prime}\}=\partial B_{2}(x)\cap\partial B_{2}(x^{\prime})$ and suppose that $\{v,v^{\prime}\}\cap A_{R,\varepsilon}\neq\emptyset$ . Then the following hold:

(i)

Exactly one of the vectors $\{v,v^{\prime}\}=\partial B_{2}(x)\cap\partial B_{2}(x^{\prime})$ , say $v$ , is in $A_{R,\varepsilon}$ . The other is in the interior of the ball $B_{R-\epsilon}(0)$ .

(ii)

Let $x=(r\cos t,r\sin t)$ , $x^{\prime}=(r^{\prime}\cos t^{\prime},r^{\prime}\sin t^{\prime})$ and assume that $t<t^{\prime}$ . Let $H$ be the halfplane with the boundary consisting of the line $vv^{\prime}$ containing the point $x^{\prime}$ . Then $B_{2}(x)\cap H\subset B_{2}(x^{\prime})\cap H$ .

(iii)

A triplet $(z_{i},z_{j},z_{k})$ with $v_{j,k}\in A_{R,\varepsilon}$ is extremal if and only if $t_{i,j}<t_{j,k}$ .

Proof.

(i) Choosing a position of $v$ in $A_{R,\varepsilon}$ , we see that the points $x$ and $x^{\prime}$ necessarily lie on the arc $\partial B_{2}(v)\cap A_{R-2,\varepsilon}$ . Thus, the barycentre $s=\frac{1}{2}(x+x^{\prime})$ , which is the centre of symmetry of the union $B_{2}(x)\cup B_{2}(x^{\prime})$ , is contained in $\partial B_{2}(v)\cap A_{R-2,\varepsilon}$ . The point $v^{\prime}$ is symmetric to $v$ with respect to the barycentre $s$ : $v^{\prime}=s-(v-s)$ . Suppose, without loss of generality, that $v=(0,y)$ with $y\in[R-\epsilon,R+\epsilon]$ . To show that $v^{\prime}\in B_{R-\epsilon}(0)$ , consider the most extremal case $y=R-\epsilon$ when $\lvert v\rvert=R-\epsilon$ . Indeed, for $y>R-\epsilon$ we could shift the points $x$ and $x^{\prime}$ , and thus also $v$ and $v^{\prime}$ , by the vector $u=(0,R-\epsilon-y)$ . The shifted $x+u,x^{\prime}+u$ lead to the shifted $v+u$ and $v^{\prime}+u$ . Notice also that necessarily $x+u,x^{\prime}+u\in A_{R-2,\epsilon}$ since $x+u,x^{\prime}+u\in\partial B_{2}(v)\cap A_{R-2,\epsilon}+u\subset\partial B_{2}(v+u)\cap A_{R-2,\epsilon}$ in view of the fact that the point $v+u-2=(0,R-2-\epsilon)\in A_{R-2,\epsilon}$ . Now, if $v^{\prime}+u\in B_{R-\epsilon}(0)$ , then necessarily also $v^{\prime}\in B_{R-\epsilon}(0)$ . Consider, in addition, the “most dangerous case” when $s$ is asymptotically approaching the extremal point $B_{2}(v)\cap\partial B_{R-2+\epsilon}(0)$ . Consider the tangent line from $v$ to the disc $B_{R-2+\epsilon}(0)$ touching in the point $\tau$ . This tangent line intersects the circle $\partial B_{R-\epsilon}(0)$ in $v$ and in a point $\tilde{v}$ symmetric to $v$ with respect to $\tau$ . Clearly, if the distance $\ell$ from $v$ to $\tau$ is larger than $2$ , then the point $v^{\prime}$ on the line $vs$ falls short of $\partial B_{R-\epsilon}(0)$ and we get the claim. To show that $\ell>2$ , we compute it from the right-angled triangle $v\tau 0$ :

[TABLE]

which yields $\ell>2$ if and only if $R>2+\frac{\epsilon}{1-\epsilon}$ .
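The threshold on $R$ can be double-checked numerically. The sketch below assumes that the computation referred to is Pythagoras in the triangle $v\tau 0$ with $\lvert v\rvert=R-\epsilon$ and $\lvert\tau\rvert=R-2+\epsilon$, so that $\ell^{2}=(R-\epsilon)^{2}-(R-2+\epsilon)^{2}=4(1-\epsilon)(R-1)$. The function names are ours.

```python
import math


def tangent_length(R, eps):
    """Distance from v (|v| = R - eps) to the tangency point tau on the
    circle of radius R - 2 + eps, via Pythagoras in the triangle v tau 0."""
    return math.sqrt((R - eps) ** 2 - (R - 2.0 + eps) ** 2)


def threshold(eps):
    """Radius above which the tangent length exceeds 2 (for 0 <= eps < 1)."""
    return 2.0 + eps / (1.0 - eps)
```

At `R = threshold(eps)` the tangent length equals $2$ exactly, and it is increasing in $R$, which reproduces the condition $R>2+\frac{\epsilon}{1-\epsilon}$.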

(ii) The claim immediately follows by inspecting the union $B_{2}(x)\cup B_{2}(x^{\prime})$ with the intersection points $\{v,v^{\prime}\}=\partial B_{2}(x)\cap\partial B_{2}(x^{\prime})$ and the symmetry with respect to the barycentre $s$ .

(iii) Just observe that the condition $t_{i,j}=t_{j,k}$ means that the circle $\partial B_{2}(z_{j})$ is touching the set $\partial B_{2}(z_{i})\cup\partial B_{2}(z_{k})$ in the point $v_{i,k}$ . ∎

Proposition 5.7 (Local characterisation of sets of boundary points).

Let $R>2+\frac{\epsilon}{1-\epsilon}$ and $z=((r_{i}\cos t_{i},r_{i}\sin t_{i}))_{1\leq i\leq n}$ be a sequence of points in $A_{R-2,\varepsilon}$ , ordered by angle. Then the following two conditions are equivalent:

(i) The set $z$ is a set of boundary points, $z\in{\mathcal{O}}$ .

(ii) Every triplet $(z_{j-1},z_{j},z_{j+1})$ , $1\leq j\leq n$ , is extremal.

Proof.

(i) $\implies$ (ii):

If (ii) does not hold, then there exists a $j$ such that the triplet $(z_{j-1},z_{j},z_{j+1})$ is not extremal. According to Lemma 5.6(iii), this implies that $z_{j}$ is not extremal in $z$ and the condition (i) is not satisfied.

(ii) $\implies$ (i):

If (i) does not hold, then there exists either $k>j$ or $i<j$ such that either $v_{j-1,j}\in B_{2}(z_{k})\cap A_{R,\varepsilon}$ or $v_{j-1,j}\in B_{2}(z_{i})\cap A_{R,\varepsilon}$ . Consider the former case. We will show that, necessarily, one of the triplets

[TABLE]

is not extremal, which breaks (ii). Indeed, if all these triplets were extremal, then we would have $t_{j-1,j}<t_{j,j+1}<t_{j+1,j+2}<\dots<t_{k-2,k-1}<t_{k-1,k}$ . Just observe that $t_{j-1,j}<t_{j,j+1}$ because the triplet $(z_{j-1},z_{j},z_{j+1})$ is extremal, $t_{j,j+1}<t_{j+1,j+2}$ because the triplet $(z_{j},z_{j+1},z_{j+2})$ is extremal, etc. Now, given that $t_{k-1}<t_{k}$ and the fact that the arcs $\partial B_{2}(z_{k})\cap A_{R,\varepsilon}$ and $\partial B_{2}(z_{k-1})\cap A_{R,\varepsilon}$ intersect only once at $v_{k-1,k}$ , all points $x=(t,\varphi)\in B_{2}(z_{k})\cap A_{R,\varepsilon}$ with $t<t_{k-1,k}$ belong to $B_{2}(z_{k-1})$ . On the other hand, the point $v_{j-1,j}=\partial B_{2}(z_{j-1})\cap\partial B_{2}(z_{j})\cap A_{R,\varepsilon}$ does not belong to $B_{2}(z_{j+1}),B_{2}(z_{j+2}),\dots,B_{2}(z_{k-1}),B_{2}(z_{k})$ . This is in contradiction with the condition that $v_{j-1,j}\in B_{2}(z_{k})$ . ∎

Proposition 5.7 shows that $1_{{\mathcal{O}}\cap{\mathcal{D}}^{\prime}_{\varepsilon}(0)}(z)$ is a product of indicators involving triples of successive boundary points. This means that the constraint given by ${\mathcal{O}}\cap{\mathcal{D}}^{\prime}_{\varepsilon}(0)$ is nearest-neighbour only. This simplifying fact will play an important role in the analysis of the effective interface model in [22].
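The nearest-neighbour structure lends itself to a simple algorithmic test. The following sketch is ours and rests on stated assumptions. It takes centres already ordered by angle, computes for each consecutive pair the intersection point of the two radius-2 circles that lies farthest from the origin (the point $v_{i,j}$ of Definition 5.5(b), ignoring ties), and checks the criterion of Lemma 5.6(iii) cyclically. Comparing angles modulo $2\pi$ with increments required to lie in $(0,\pi)$ is an implementation choice, and membership in the annuli $A_{R-2,\varepsilon}$ and $A_{R,\varepsilon}$ is assumed to have been verified separately.

```python
import math


def outer_intersection_angle(p, q):
    """Polar angle of the intersection point of the radius-2 circles centred
    at p and q that is farthest from the origin (requires 0 < |p - q| <= 4)."""
    mx, my = (p[0] + q[0]) / 2.0, (p[1] + q[1]) / 2.0
    dx, dy = q[0] - p[0], q[1] - p[1]
    d = math.hypot(dx, dy)
    h = math.sqrt(4.0 - (d / 2.0) ** 2)       # half-distance between the two intersections
    cands = [(mx - h * dy / d, my + h * dx / d),
             (mx + h * dy / d, my - h * dx / d)]
    vx, vy = max(cands, key=lambda v: math.hypot(v[0], v[1]))
    return math.atan2(vy, vx)


def all_triplets_extremal(z):
    """Check t_{j-1,j} < t_{j,j+1} for every cyclic triplet of centres z."""
    n = len(z)
    angles = [outer_intersection_angle(z[i], z[(i + 1) % n]) for i in range(n)]
    for j in range(n):
        # angles[j - 1] wraps to the last pair when j == 0.
        inc = (angles[j] - angles[j - 1]) % (2.0 * math.pi)
        if not (0.0 < inc < math.pi):
            return False
    return True
```

The check visits each triplet once, so its cost is linear in the number of centres. For example, twelve centres equally spaced on a circle of radius $3$ pass the test, whereas pulling one of them in to radius $1.5$ hides its disc behind its two neighbours and the test fails.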

5.3 Volume and surface approximation

In this section we derive approximations of two key quantities:

$\bullet$

$(|S(z)|-\kappa|S^{-}(z)|)-(\pi R_{c}^{2}-\kappa\pi(R_{c}-2)^{2})$ , the volume of the sausage minus the volume of the critical annulus (see Fig. 5).

$\bullet$

$|S(z)|-\pi R_{c}^{2}$ , the volume of the halo minus the volume of the critical disc.

For fixed $\kappa\in(1,\infty)$ , we write $R_{c}=R_{c}(\kappa)$ and introduce three explicit constants

[TABLE]

Proposition 5.8 (Volume and surface approximation).

If $z\in\mathcal{O}$ and $d_{\text{\rm{H}}}(\partial S(z),\partial B_{R_{c}}(0))\leq\varepsilon$ , then as $\varepsilon\downarrow 0$ ,

[TABLE]

where $C_{k}^{\varepsilon}=[1+O(\varepsilon)]\,C_{k}$ , $k=1,3$ .

Proof.

The proof comes in 5 steps.

1. The halo $S(z)$ is a disjoint union of $n$ sets of area $V_{i}$ labelled by the boundary points $z_{i}$ , $|S(z)|=\sum_{i=1}^{n}V_{i}$ . Each of these sets is in its turn a disjoint union of 3 subsets: the triangle $0\,z_{i}\,z_{i+1}$ of area $V_{i}^{(1)}$ , the isosceles triangle $z_{i}\,z_{i+1}\,v_{i}$ of area $V_{i}^{(2)}$ , and the boundary wedge $z_{i}\,v_{i-1}\,v_{i}$ of area $V_{i}^{(3)}$ (see Fig. 8). The areas are easily expressed in terms of the corresponding vertex angles, namely,

[TABLE]

with $\varphi_{i}$ denoting the angle between the line segments $v_{i}\,z_{i}$ and $v_{i}\,z_{i+1}$ touching at the point $v_{i}$ , and $\alpha_{i}$ denoting the angle between the line segments $z_{i}\,v_{i-1}$ and $z_{i}\,v_{i}$ at the point $z_{i}$ . Similarly, we have $|S^{-}(z)|=\sum_{i=1}^{n}V^{-}_{i}$ , with

[TABLE]

where $B_{i}$ is the area of the circular segment bounded by the line segment $z_{i}\,z_{i+1}$ and the arc of the circle $\partial B_{2}(v_{i})$ subtending the angle $\varphi_{i}$ .

It is easy to show that

[TABLE]

Indeed, let us introduce angles $\beta_{i}$ and $\gamma_{i}$ at the point $z_{i}$ between the line $\ell_{i}$ and the edges $z_{i}\,v_{i-1}$ and $z_{i}\,v_{i}$ , respectively (see Fig. 9). The angle $\beta_{i}$ is taken to be positive in the anticlockwise direction from $\ell_{i}$ , and $\gamma_{i}$ in the clockwise direction. Note that the angle $\beta_{i+1}$ in Fig. 9 is actually negative, while all the other angles $\beta_{j}$ and $\gamma_{j}$ are positive. On the one hand, $\beta_{i}+\gamma_{i}=\alpha_{i}$ for $1\leq i\leq n$ , and in the case of the point $z_{i+1}$ in Fig. 9, $\alpha_{i+1}=\beta_{i+1}+\gamma_{i+1}=\gamma_{i+1}-\lvert\beta_{i+1}\rvert$ . Nevertheless, $\alpha_{i}=\beta_{i}+\gamma_{i}>0$ , which is the condition that $z_{i}$ is a boundary point. On the other hand,

[TABLE]

It suffices to consider the triangles $0\,z_{i}\,z_{i+1}$ , whose angles are $\theta_{i}$ , $\pi-(\frac{\pi-\varphi_{i}}{2}+\gamma_{i})$ (the complement of the angle $\frac{\pi-\varphi_{i}}{2}+\gamma_{i}$ between the edge $z_{i}\,z_{i+1}$ and the line $\ell_{i}$ obtained by adding at $z_{i}$ the angle $\gamma_{i}$ to the angle $\frac{\pi-\varphi_{i}}{2}$ of the isosceles triangle $z_{i}\,z_{i+1}\,v_{i}$ ), and the angle $\pi-(\frac{\pi-\varphi_{i}}{2}+\beta_{i+1})$ at $z_{i+1}$ . This yields

[TABLE]

which implies $\varphi_{i}+\theta_{i}=\beta_{i+1}+\gamma_{i}$ . Note that this reasoning remains valid for negative $\beta_{i}$ or $\gamma_{i}$ (check the point $z_{i+1}$ in Fig. 9). Combining (5.22) with the equation $\alpha_{i}=\beta_{i}+\gamma_{i}$ , we get

[TABLE]

As a result, we can replace the sum in $|S(z)|=\sum_{i=1}^{n}V_{i}$ by the sum $\sum_{i=1}^{n}V_{i}^{\prime}$ with

[TABLE]
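The decomposition of Step 1 can be tested in the simplest case of a regular configuration, $r_{i}\equiv r$ and $\theta_{i}\equiv 2\pi/n$. The sketch below is ours and uses only elementary formulas, which we assume agree with the displays above: $\tfrac{1}{2}r^{2}\sin\theta$ for the triangle at the origin, $2\sin\varphi$ for the isosceles triangle with legs $2$ and apex angle $\varphi$, and $2\alpha$ for the wedge of radius $2$ and angle $\alpha$, with $\alpha=\varphi+\theta$ in the regular case. The result is compared with a direct polar integration of the halo, whose radial function along a ray at angle $s$ from $\ell_{i}$ is $r\cos s+\sqrt{4-r^{2}\sin^{2}s}$.

```python
import math


def regular_halo_area_by_pieces(r, n):
    """Area of the filled halo of n centres equally spaced on a circle of
    radius r, summed over triangles and wedges (requires r*sin(pi/n) <= 2)."""
    theta = 2.0 * math.pi / n
    u = 2.0 * r * math.sin(theta / 2.0)       # chord length |z_i z_{i+1}|
    phi = 2.0 * math.asin(u / 4.0)            # apex angle at v_i (legs of length 2)
    alpha = phi + theta                       # wedge angle at z_i in the regular case
    triangle_origin = 0.5 * r * r * math.sin(theta)
    triangle_apex = 2.0 * math.sin(phi)
    wedge = 2.0 * alpha
    return n * (triangle_origin + triangle_apex + wedge)


def regular_halo_area_by_polar(r, n, steps=2000):
    """Same area from n * (1/2) * integral of radius(s)**2 over |s| <= theta/2,
    evaluated with Simpson's rule (steps must be even)."""
    theta = 2.0 * math.pi / n

    def radius(s):
        return r * math.cos(s) + math.sqrt(4.0 - (r * math.sin(s)) ** 2)

    a, h = -theta / 2.0, theta / steps
    total = radius(a) ** 2 + radius(-a) ** 2
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * radius(a + k * h) ** 2
    return n * 0.5 * total * h / 3.0
```

The two functions agree to numerical precision, and for large $n$ both approach $\pi(r+2)^{2}$, the area of the disc of radius $r+2$.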

2. Next, we split the volume of the sausage

[TABLE]

and rewrite the two terms as

[TABLE]

where

[TABLE]

Thus, to compute the two quantities in (5.18) we need to compute $\sum_{i=1}^{n}(I_{i}^{(1)}-\kappa I_{i}^{(3)})+\sum_{i=1}^{n}(I_{i}^{(2)}-\kappa I_{i}^{(4)})$ and $\sum_{i=1}^{n}I_{i}^{(1)}+\sum_{i=1}^{n}I_{i}^{(2)}$ . Our aim now is to expand all the relevant terms in powers of $\theta_{i}$ and $\rho_{i+1}-\rho_{i}$ .

3. For the terms $I_{i}^{(2)}$ and $I_{i}^{(4)}$ , we use (5.5) to get the identities

[TABLE]

where again $\overline{\rho_{i}}=\frac{1}{2}(\rho_{i}+\rho_{i+1})$ . Since $R_{c}=\kappa(R_{c}-2)$ and $C_{3}=\tfrac{1}{2}(\kappa-1)$ (recall (5.17)), this in turn yields

[TABLE]

which accounts for the third term in the right-hand side of the first line of (5.18). Also, recalling the notation $C_{2}=R_{c}$ (recall (5.17)), we see that the first equality in (5.29) reads

[TABLE]

which accounts for the last two terms in the right-hand side of the second line of (5.18).

4. The terms $I_{i}^{(1)}$ and $I_{i}^{(3)}$ are a bit more elaborate and require an expansion of some terms. Beginning with $I_{i}^{(3)}$ , we use (5.20) to rewrite the definition (5.28) as

[TABLE]

Then, using $r_{i}r_{i+1}=(\overline{r_{i}}-\tfrac{1}{2}(\rho_{i+1}-\rho_{i}))(\overline{r_{i}}+\tfrac{1}{2}(\rho_{i+1}-\rho_{i}))=\overline{r_{i}}^{2}-(\tfrac{\rho_{i+1}-\rho_{i}}{2})^{2}$ , we write the difference between the first and the third term in (5.32) as

[TABLE]

With the shorthand notation

[TABLE]

for the length of the line segment $z_{i}\,z_{i+1}$ , we express this in terms of the angle $\varphi_{i}$ in the isosceles triangle $z_{i}\,z_{i+1}\,v_{i}$ ,

[TABLE]

Inverting (5.35), we get

[TABLE]

and thus

[TABLE]

which together with (5.32) and (5.33) implies

[TABLE]

Furthermore, substituting $V_{i}$ from (5.25) into $I_{i}^{(1)}$ defined in (5.28) and comparing it with $I_{i}^{(3)}$ in (5.32), we get

[TABLE]

By (4.4) and (5.34), we have

[TABLE]

Approximating $u_{i}$ by $2\overline{r_{i}}\sin(\tfrac{\theta_{i}}{2})$ , we express the error $E_{i}$ as

[TABLE]

Whenever $z\in\mathcal{O}$ and $d_{\text{\rm{H}}}(\partial S(z),\partial B_{R_{c}}(0))\leq\varepsilon$ , we can use Proposition 5.2, which implies that $\overline{r_{i}}=R_{c}-2+O(\varepsilon)$ , $\max_{1\leq i\leq n}|\rho_{i}|=O(\varepsilon)$ , $\max_{1\leq i\leq n}\theta_{i}=O(\sqrt{\varepsilon}\,)$ , and $\max_{1\leq i\leq n}|\rho_{i+1}-\rho_{i}|/\theta_{i}=O(\sqrt{\varepsilon})$ . With the help of these bounds, we get

[TABLE]

and thus

[TABLE]

For $u_{i}$ as determined by (5.41), where we keep explicitly the terms up to order $O(\varepsilon^{3/2})$ , we get

[TABLE]

and thus also

[TABLE]

Using the last two equations jointly with (5.36), we get

[TABLE]

Combining this with (5.38) and (5.39), and absorbing the term $\bigl{(}\rho_{i+1}-\rho_{i}\bigr{)}^{2}\theta_{i}$ into $\frac{(\rho_{i+1}-\rho_{i})^{2}}{\theta_{i}}O(\varepsilon)$ , we get

[TABLE]

and

[TABLE]

5. Using that $R_{c}=\kappa(R_{c}-2)$ , $\overline{r_{i}}=R_{c}-2+O(\varepsilon)$ , and applying Proposition 5.2 once more while referring to the definitions in (5.17), we arrive at

[TABLE]

and

[TABLE]

Recalling (5.26)–(5.27), inserting (5.30) and (5.50), and summing over $i$ , we find the first expansion in (5.18). Recalling (5.27), inserting (5.31) and (5.49), and summing over $i$ , we find the second expansion in (5.18). ∎

5.4 Geometric centre of a droplet

For $z\in\mathcal{O}$ , we define the geometric centre of the halo shape $S(z)$ as

[TABLE]

where $\overline{z_{i}}=\tfrac{1}{2}(z_{i}+z_{i+1})$ . The centre $\mathcal{C}(z)$ may be thought of as the barycentre of the boundary $\partial P(z)$ of the polygon $P(z)$ obtained by joining consecutive boundary points $z_{i}$ and $z_{i+1}$ by a line segment of length $u_{i}$ (see Fig. 7), and $\sum_{i=1}^{n}u_{i}$ is the perimeter of $P(z)$ . This notion of centre is adopted for mathematical convenience: the principal feature that we need is that, to leading order, the centre is given by the discretized Fourier coefficients $\frac{1}{\pi}y_{5}$ and $\frac{1}{\pi}y_{6}$ . Other definitions of centre might work equally well, but we stick with the above.
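For illustration, here is a minimal sketch of this centre, assuming that the definition is the perimeter-weighted average of the edge midpoints, $\mathcal{C}(z)=\sum_{i}u_{i}\overline{z_{i}}\,/\sum_{i}u_{i}$, which is what the description as the barycentre of $\partial P(z)$ amounts to. The function name `geometric_centre` is ours.

```python
import math


def geometric_centre(z):
    """Barycentre of the boundary of the polygon with vertices z (in order):
    the average of the edge midpoints weighted by the edge lengths u_i."""
    n = len(z)
    sx = sy = perimeter = 0.0
    for i in range(n):
        (x0, y0), (x1, y1) = z[i], z[(i + 1) % n]
        u = math.hypot(x1 - x0, y1 - y0)      # edge length u_i
        sx += u * 0.5 * (x0 + x1)             # u_i times the midpoint of the edge
        sy += u * 0.5 * (y0 + y1)
        perimeter += u
    return sx / perimeter, sy / perimeter
```

By construction this centre is equivariant under translations: shifting all boundary points by a vector shifts the centre by the same vector, and a regular polygon centred at the origin has centre $0$.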

We will need the fact that the centre $\mathcal{C}(z)$ is not too far from the origin.

Proposition 5.9 (Centre approximation).

Let $P(z)$ be the polygon associated with $z$ . If $z\in\mathcal{O}$ and $d_{\text{{\rm H}}}(\partial S(z),\partial B_{R_{c}}(0))\leq\varepsilon$ , then as $\varepsilon\downarrow 0$ ,

[TABLE]

with (recall Definition 5.3)

[TABLE]

Proof.

We give the proof for $\Sigma_{1}$ only. The proof for $\Sigma_{2}$ is similar. Recall that, in polar coordinates, $z_{i}=(r_{i}\cos t_{i},r_{i}\sin t_{i})$ and $t_{i+1}-t_{i}=\theta_{i}$ . We begin by writing

[TABLE]

where in the left-hand side we have simply subtracted 0. Substituting $\tau=t-\bar{t}_{i}$ , we get

[TABLE]

Note that in passing to the third line we used that $\int_{-\theta_{i}/2}^{\theta_{i}/2}\sin\tau\,d\tau=0$ . Consequently,

[TABLE]

Next, define

[TABLE]

Use (5.44) to write

[TABLE]

Substituting $r_{i}=(R_{c}-2)+\rho_{i}$ (recall (5.5)), we can write

[TABLE]

and insert this expansion into (5.58). We estimate each of the four terms in (5.59) separately:

(1) The term with $(R_{c}-2)^{2}\overline{\cos t_{i}}$ can be estimated via (5.56) and gives $O(y_{1}(z))+O(y_{2}(z))$ .

(2) The term with $(R_{c}-2)\overline{\rho_{i}\cos t_{i}}$ gives $(R_{c}-2)y_{5}(z)+O(\varepsilon)[O(y_{1}(z))+O(y_{2}(z))]$ , where we use the a priori estimates in Proposition 5.2, which imply $|\overline{\rho_{i}\cos t_{i}}|=O(\varepsilon)$ .

(3) The term with $(R_{c}-2)\overline{\rho_{i}}\,\overline{\cos t_{i}}$ gives

[TABLE]

Indeed, observe that

[TABLE]

and use the same estimate as in (2) plus the bound

[TABLE]

(4) For the term with $\overline{\rho_{i}}\,\overline{\rho_{i}\cos t_{i}}$ , note that

[TABLE]

where we use $|\rho_{i}|\leq\varepsilon$ from the a priori estimates. With the bound (5.63), we can check that overall the term with $\overline{\rho_{i}}\,\overline{\rho_{i}\cos t_{i}}$ gives rise to

[TABLE]

Collect terms to get

[TABLE]

Finally, use (5.44) to write

[TABLE]

A similar substitution gives

[TABLE]

where we used that $\sum_{i=1}^{n}\theta_{i}=2\pi$ . Combine (5.51), (5.57), (5.65) and (5.67) to get the claim. ∎

An immediate consequence of Corollary 5.4 and Propositions 5.8–5.9 is the following.

Corollary 5.10 (A priori estimates volume, surface and centre).

If $z\in\mathcal{O}$ and $d_{\text{{\rm H}}}(\partial S(z),\partial B_{R_{c}}(0))\leq\varepsilon$ , then as $\varepsilon\downarrow 0$ ,

[TABLE]

and

[TABLE]

6 Stochastic geometry: representation of probabilities as surface integrals

In Section 6.1 we prove that the contribution to the free energy coming from halos that are not close to a critical disc either in volume or in Hausdorff distance is negligible (Lemma 6.1), and that the centre of the critical disc can be placed at the origin (Lemma 6.2). In Section 6.2 we show how to rewrite the integral over halo shapes in (1.14) representing the free energy of the critical droplet, by tracking the boundary points (Lemma 6.3 and Corollary 6.4). In Section 6.3 we introduce auxiliary random variables, list some of their properties (Lemma 6.5), and rewrite the integral over halo shapes as an expectation of a certain exponential functional over these auxiliary random variables (Proposition 6.6). The latter will serve as the starting point for the analysis in Sections 7–8.

6.1 Only disc-shaped droplets matter

For $\delta,\varepsilon>0$ , define the events

[TABLE]

i.e., the events where the halo is $\delta$ -close to a critical disc in volume and $\varepsilon$ -close to a critical disc in Hausdorff distance. From Theorem 2.2 we know that, for $\delta,\varepsilon$ small enough, on the event ${\mathcal{V}}_{\delta}\cap{\mathcal{D}}_{\varepsilon}$ , $h(\gamma)^{-}$ is connected and simply connected. First we check that we need not worry about ${\mathcal{V}}_{\delta}\cap{\mathcal{D}}_{\varepsilon}^{c}$ with ${\mathcal{D}}_{\varepsilon}^{c}=\Gamma\setminus{\mathcal{D}}_{\varepsilon}$ . Put

[TABLE]

Lemma 6.1.

For every $C\in(0,\infty)$ and all $\varepsilon>0$ small enough, there exists an $\eta(\varepsilon)>0$ (independent of $C$) such that, as $\beta\to\infty$ ,

[TABLE]

Proof.

Fix $C\in(0,\infty)$ and $\varepsilon>0$ . Note that ${\mathcal{V}}_{C\beta^{-2/3}}\subset{\mathcal{V}}_{\varepsilon}$ for $\beta$ large enough. Since ${\mathcal{V}}_{\varepsilon}\cap{\mathcal{D}}_{\varepsilon}^{c}$ is closed, we can use the large deviation principle for the halo shape and the halo volume, derived in Theorems 2.1 and 2.3, to bound

[TABLE]

In view of (2.3) and (2.4), we are left with the minimisation problem

[TABLE]

If $|S|=\pi R^{2}$ , then on the event ${\mathcal{V}}_{\varepsilon}$ we have $|R^{2}-R_{c}^{2}|\leq\frac{\varepsilon}{\pi}$ . Let $\eta$ be such that $\varepsilon=\sqrt{4R\eta+\eta^{2}}$ . Then, by (2.7)–(2.8) in Theorem 2.2, on the event $\mathcal{D}_{\varepsilon}^{c}$ we have $|S|-\kappa|S^{-}|\geq(\pi R^{2}-\kappa\pi(R-2)^{2})+2\pi\kappa\eta$ . But $(\pi R^{2}-\kappa\pi(R-2)^{2})-(\pi R_{c}^{2}-\kappa\pi(R_{c}-2)^{2})=-\pi(\kappa-1)(R-R_{c})^{2}$ , while $\eta=\frac{\varepsilon^{2}}{4R_{c}}+O(\varepsilon^{3})$ and $(R-R_{c})^{2}=[\frac{\varepsilon}{2\pi R_{c}}+O(\varepsilon^{2})]^{2}$ for $\varepsilon\downarrow 0$ . Therefore we get

[TABLE]

where we use that $R_{c}=2\kappa/(\kappa-1)$ . Consequently, (6.4) yields

[TABLE]

with $\eta(\varepsilon)>0$ for $\varepsilon$ small enough. ∎
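The quadratic identity used in the proof above can be checked numerically. The following is a minimal sketch, assuming (as in the proof) that for a disc of radius $R$ the functional $|S|-\kappa|S^{-}|$ equals $\pi R^{2}-\kappa\pi(R-2)^{2}$ and that $R_{c}=2\kappa/(\kappa-1)$; the function names and the sample values of $\kappa$ and $R$ are illustrative only.

```python
import math


def r_critical(kappa: float) -> float:
    """Critical radius R_c = 2*kappa/(kappa - 1), as recalled in the proof."""
    return 2.0 * kappa / (kappa - 1.0)


def phi(radius: float, kappa: float) -> float:
    """pi*R^2 - kappa*pi*(R-2)^2, i.e. |S| - kappa*|S^-| for a disc of radius R."""
    return math.pi * radius**2 - kappa * math.pi * (radius - 2.0) ** 2


def identity_gap(radius: float, kappa: float) -> float:
    """Absolute difference between the two sides of the quadratic identity."""
    rc = r_critical(kappa)
    lhs = phi(radius, kappa) - phi(rc, kappa)
    rhs = -math.pi * (kappa - 1.0) * (radius - rc) ** 2
    return abs(lhs - rhs)


# Hypothetical sample values (kappa > 1); the identity should hold for all of them.
for kappa in (1.5, 2.0, 5.0):
    for radius in (2.5, 4.0, 7.0):
        assert identity_gap(radius, kappa) < 1e-9
```

The identity is exact because $R\mapsto\pi R^{2}-\kappa\pi(R-2)^{2}$ is a quadratic polynomial with leading coefficient $-\pi(\kappa-1)$ and stationary point $R_{c}$.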

Next we check that on the event $\gamma\in\mathcal{D}_{\varepsilon}=\bigcup_{x\in\mathbb{T}}\mathcal{D}_{\varepsilon}(x)$ , we need only consider droplets that are close to $B_{R_{c}}(0)$ . Here we exploit the translation invariance of the system. For $\delta,\varepsilon>0$ and $x\in{\mathbb{T}}$ , define the event

[TABLE]

i.e., the centre $\mathcal{C}(z)=(\Sigma_{1}(z),\Sigma_{2}(z))$ of the halo shape $S(z)$ is $\delta$ -close to $x$ (see the beginning of Section 5.4).

Lemma 6.2.

For every $C\in(0,\infty)$ , every $\varepsilon>0$ , suitable $k=k(\varepsilon,C)>0$ and $\varepsilon^{\prime}=\varepsilon^{\prime}(\varepsilon,C)$ , and all $\beta$ sufficiently large,

[TABLE]

Proof.

If $\partial h(\gamma)$ is $\varepsilon$ -close in Hausdorff distance to the boundary of a disc of radius $R_{c}$ , then by Corollary 5.4 the centre $\mathcal{C}(z)$ of $h(\gamma)$ is $(C^{\prime}\varepsilon)$ -close to the centre of that disc for some $C^{\prime}\in(0,\infty)$ . Let $G_{\delta}\subset{{\mathbb{T}}}$ be the grid of linear spacing $\delta$ . It follows that

[TABLE]

Because the torus is periodic, every indicator contributes the same. By picking $\delta=k\beta^{-2/3}$ with $k=C/2$ and $\varepsilon^{\prime}=(1+C^{\prime})\varepsilon+C\delta(\beta)/2$ we deduce from (6.10) that

[TABLE]

Clearly, $|G_{k\beta^{-2/3}}|=O(\beta^{4/3})\leq\exp(o(\beta^{1/3}))$ , hence the proof is complete. ∎
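The final counting step can be illustrated numerically. The sketch below assumes a square torus of hypothetical side length `side` and a grid of spacing $\delta=k\beta^{-2/3}$, so that the number of grid points is $(\text{side}/\delta)^{2}$; it shows that the logarithm of this number is small compared to $\beta^{1/3}$.

```python
import math


def log_grid_count(beta: float, k: float = 1.0, side: float = 10.0) -> float:
    """log of (side/delta)^2 with delta = k * beta^(-2/3); `side` is a hypothetical torus size."""
    delta = k * beta ** (-2.0 / 3.0)
    return 2.0 * math.log(side / delta)


def ratio(beta: float, k: float = 1.0, side: float = 10.0) -> float:
    """log|G_delta| divided by beta^(1/3); tends to 0 as beta grows."""
    return log_grid_count(beta, k, side) / beta ** (1.0 / 3.0)


betas = [1e3, 1e6, 1e9, 1e12]
ratios = [ratio(b) for b in betas]
```

Since $\log|G_{\delta}|$ grows like $\tfrac{4}{3}\log\beta$, the ratios decrease to zero, which is the content of the bound $O(\beta^{4/3})\leq\exp(o(\beta^{1/3}))$.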

Lemmas 6.1 and 6.2 leave us with the task of bounding $\mu_{\beta}(\mathcal{V}_{C\beta^{-2/3}}\cap\mathcal{D}_{\varepsilon^{\prime}}(0)\cap\mathcal{C}_{k\beta^{-2/3}}(0))$ from above. For the lower bound, it will be enough to bound $\mu_{\beta}(\mathcal{V}_{C\beta^{-2/3}}\cap\mathcal{D}_{\varepsilon}(0))$ from below.

6.2 Integration over halo shapes

Lemma 6.3.

Let $\Pi_{\alpha}$ be a Poisson point process of intensity $\alpha$ in $\mathbb{T}$ . There exists a constant $c>0$ such that, for $\varepsilon>0$ small enough and every bounded test function $f\colon\,\mathcal{S}\to{\mathbb{R}}$ , as $\alpha\to\infty$ ,

[TABLE]

Proof.

For $z\in{\mathcal{O}}$ , define

[TABLE]

Clearly,

[TABLE]

where

[TABLE]

is the probability that the halo has a hole. We argue as follows. Any configuration $\gamma$ with halo $h(\gamma)=S(z)$ must be of the form

[TABLE]

where $y$ represents the set of interior points of the configuration, i.e., $B_{2}(y_{i})\cap\partial S(z)=\emptyset$ for $1\leq i\leq{\ell}-n$ . Therefore, for every bounded test function $f$ ,

[TABLE]

We get the claim in (6.12), provided we show that there exists a $c>0$ such that, for $\varepsilon$ small enough and uniformly in $z\in{\mathcal{C}}^{\prime}_{C\delta(\beta)}(0)\cap{\mathcal{D}}^{\prime}_{\varepsilon}(0)$ ,

[TABLE]

The proof of the latter goes as follows. Let $\Pi^{S(z)}_{\alpha}$ be the Poisson point process on $S(z)^{-}$ with intensity $\alpha$ . Then

[TABLE]

In order to bound the last expression in (6.19), we discretize. As in the proof of Lemma 6.2, we consider the grid $G_{\delta}\subset\mathbb{T}$ of linear spacing $\delta$ . For every $y\in\mathbb{T}$ there exists an $x\in G_{\delta}$ such that $B_{2}(y)\supset B_{2-\delta}(x)$ . Therefore

[TABLE]

It is easy to see that there exists a $c=c(R_{c})>0$ such that, for $\varepsilon$ small enough,

[TABLE]

Analogously, for all $x\in G_{\delta}\cap S(z)^{-}$ we have $|B_{2-\delta}(x)\cap S(z)^{-}|\geq c-O(\delta)$ . Combining (6.19)–(6.21), we get

[TABLE]

Choosing $\delta=c\alpha^{-2/3}$ , we get the claim in (6.18). ∎

We next observe that

[TABLE]

where we recall that $\Xi_{\beta}=\mathrm{e}^{-\beta(1-\kappa)|\mathbb{T}|+o(1)}$ . Applying Lemma 6.3 with $\alpha=\kappa\beta$ and with test functions of the form

[TABLE]

and abbreviating

[TABLE]

we easily deduce (4.3). Specializing the choice of the indicator $\mathbf{1}_{A}$ , we obtain the following corollary. Define the analogue of (6.1) and (6.8) for connected outer contours rather than configurations,

[TABLE]

Corollary 6.4 (Representation as surface integral).

There exists a $c>0$ such that for every $C\in(0,\infty)$ and all $\varepsilon>0$ small enough, as $\beta\to\infty$ ,

[TABLE]

with

[TABLE]

6.3 From surface integral to auxiliary random variables

The following notation allows us to avoid indicators that order variables. For $(t,\rho)\in[0,2\pi)^{n}\times{\mathbb{R}}^{n}$ with $t=(t_{1},\ldots,t_{n})$ pairwise distinct, let $\sigma\in\mathfrak{S}_{n}$ (the set of permutations of $\{1,\ldots,n\}$ ) be such that $0\leq t_{\sigma(1)}<\cdots<t_{\sigma(n)}<2\pi$ . We abbreviate $(t_{(i)},r_{(i)})=(t_{\sigma(i)},r_{\sigma(i)})$ and $(t_{(n+1)},r_{(n+1)})=(t_{(1)}+2\pi,r_{(1)})$ . The reordering depends on the vector $t$ , but for simplicity we suppress the $t$ -dependence from $r_{(i)}$ and $t_{(i)}$ .

The following lemma can be viewed as a relation between measures on the space of marked point processes.

Lemma 6.5.

For every non-negative test function $f$ on $([0,\infty)\times[0,2\pi])^{n}$ with $f(\emptyset)=0$ ,

[TABLE]

Proof.

We revisit the computations from Section 4.4. Put

[TABLE]

and set $\theta_{i}=t_{(i+1)}-t_{(i)}$ , where $t_{(n+1)}=t_{(1)}+2\pi$ , and $\bar{f}_{t}(x)=f(\{(r_{i},t_{i})\}_{i=1}^{n})$ . Let $P_{\theta}(x-y)$ be the heat kernel, see (4.19). Proceeding as in Section 4.4, we see that the left-hand side of (6.29) is equal to

[TABLE]

We first evaluate the inner integral for fixed $n\in{\mathbb{N}}$ and $t\in[0,2\pi)^{n}$ with $t_{1}<\cdots<t_{n}$ , so that $(t_{(i)},r_{(i)})=(t_{i},r_{i})$ and $x_{n+1}=x_{1}$ . Change variables as $x\to(x_{1},x^{\prime}_{2},\ldots,x^{\prime}_{n})$ with $x^{\prime}_{i}=x_{i}-x_{1}$ . The inner integral becomes

[TABLE]

where $x^{\prime}_{1}=x^{\prime}_{n+1}=0$ . From the semi-group property of the heat kernel, we get

[TABLE]

Next we note

[TABLE]

(think of $x^{\prime}_{i}=\widetilde{W}_{t_{i}}$ ). Furthermore, for every non-negative test function $g$ on path space,

[TABLE]

where we have changed variables $m=x_{1}+M$ with $M=\frac{1}{2\pi}\int_{0}^{2\pi}\mathrm{d}t\,\widetilde{W}_{t}$ . It follows that

[TABLE]

This holds as well when the $t_{i}$ ’s are pairwise distinct but not necessarily labeled in increasing order. The case when $t_{i}=t_{j}$ for some $i\neq j$ has Lebesgue measure zero and need not be considered. Denote the value in (6.36) by $g(t)$ . Then (6.31) reads

[TABLE]

With the help of (2.20)–(2.21), this expression in turn is equal to

[TABLE]

and the proof is readily concluded. ∎

The integral over $m$ corresponds to a freedom of choice in the average height of the surface of the critical droplet with respect to the critical radius $R_{c}$ , and constitutes a fine tuning of the volume. We will later see that the integral is dominated by values of $m$ that are at most of order $\beta^{1/6}$ .

Remember the process $Z^{(m)}=(Z_{i}^{(m)})_{i=1}^{N}$ from (2.22) and (2.23) and the random variables $\widehat{Y}_{0}$ and $\widehat{Y}_{1}$ from (2.24). Further define (recall the definition of $C_{1}$ in (5.17))

[TABLE]

Proposition 6.6 (Representation of key surface integrals).

The integrals in (6.28) equal

[TABLE]

where $C_{1}=\frac{\kappa^{2}}{6(\kappa-1)^{3}}$ .

Proof.

Return to (6.28) and recall (5.17), Proposition 5.8 and (4.2)–(6.26). First we rewrite the expression in (6.28) in terms of polar coordinates $z_{i}=(r_{i}\cos t_{i},r_{i}\sin t_{i})$ , $1\leq i\leq N$ , i.e., $\int_{{\mathbb{T}}^{N}}\mathrm{d}z$ becomes $\int_{{\mathbb{R}}^{N}}\mathrm{d}r\int_{[0,2\pi)^{N}}\mathrm{d}t\,\prod_{i=1}^{N}r_{i}$ . The latter product becomes

[TABLE]

where the last equality uses the constraint ${\mathcal{C}}^{\prime}_{\delta(\beta)}(0)\cap{\mathcal{D}}^{\prime}_{\varepsilon}(0)$ together with the a priori estimate from Proposition 5.2. In this way we obtain the factor $(R_{c}-2)^{N}$ needed for Lemma 6.5, together with an error term $\exp[O(\varepsilon)N]$ , which shows up as the term $O(\varepsilon)N$ in (6.41). The factor $\mathrm{e}^{2\pi G_{\kappa}\beta^{1/3}}$ is needed to compensate for the exponent in the Poisson distribution of $N$ (recall (2.19)).

The Gaussian density in Lemma 6.5 is obtained from $\exp[-\beta\Delta(z)]$ , more precisely, from the second term in the expansion of $\Delta(z)$ in the first line of (5.18), up to an error term $O(\varepsilon)$ in the constant $C_{3}^{\varepsilon}=C_{3}\,[1+O(\varepsilon)]$ , which shows up as the term $O(\varepsilon)Y_{2}$ in (6.41). Hence Lemma 6.5 is applicable. The first and the third term in the expansion of $\Delta(z)$ in the first line of (5.18) give rise to the term $-\beta C_{1}^{\varepsilon}Y_{1}+\tfrac{1}{2}Y_{3}^{(m)}$ (recall (6.30)). The factor $\prod_{i=1}^{N}\sqrt{2\pi G_{\kappa}\beta^{1/3}\Theta_{i}}$ in (6.29) gives rise to $Y_{0}$ . The indicator is inherited from the original expression for the integral. ∎

In what follows we abbreviate

[TABLE]

7 Asymptotics of surface integrals: preparations

Our primary task for proving Theorems 2.5–2.6 in Section 8 is the evaluation of the key surface integrals $\mathcal{I}^{\mathrm{UB}}$ and $\mathcal{I}^{\mathrm{LB}}$ in (6.41). In this section we collect some properties of the auxiliary random variables appearing in Section 6.3 that will help us to estimate these integrals. This requires various approximation arguments, including control of exponential moments and discretisation errors.

In Section 7.1 we look at moderate deviations for the angular process and compute the leading order contribution to the key surface integrals (Proposition 7.1 and Lemma 7.2). In Section 7.2 we analyse the radial process, which is controlled by the mean-centred Brownian bridge introduced in (2.17)–(2.18) (Lemma 7.3), and estimate two exponential moments involving the latter (Lemmas 7.4–7.5). In Section 7.3 we focus on discretisation errors that arise because the mean-centred Brownian bridge is only observed along the angular process (Lemmas 7.6–7.9).

7.1 Moderate deviations for the angular point process

The key technical result of this section is the following. Remember $\widehat{Y}_{0},\widehat{Y}_{1}$ from (2.24) and $Y_{0},Y_{1}$ from (6.39), $G_{\kappa}$ and $\lambda(\beta)=G_{\kappa}\beta^{1/3}$ from (2.19) and $\tau_{*}$ from (2.26).

Proposition 7.1 (Leading order prefactor).

[TABLE]

Proof.

The proof builds on an underlying renewal structure. Define the probability density

[TABLE]

where $\tau_{*}$ is given in (2.26). A close look at the relevant expressions in polar coordinates reveals that

[TABLE]

where $\theta_{n}=2\pi-\sum_{i=1}^{n-1}\theta_{i}$ . (The factor $\frac{2\pi}{n}$ represents the number of ways to rotate a configuration in such a way that the origin falls within an interval of average length $\frac{2\pi}{n}$ , and is similar to a factor appearing in the definition of stationary renewal processes.) Rewrite (7.3) with $n$ -fold convolutions of $q_{*}$ as

[TABLE]

If the factor $\frac{2\pi}{n}$ were absent, then the sum over $n$ would correspond to the probability for a renewal process with interarrival distribution $q_{*}$ to have a renewal point at time $2\pi G_{\kappa}\beta^{1/3}$ given that it has a renewal point at time $0$. For large time intervals standard renewal theory tells us that the renewal probability converges to the inverse of the expected interarrival time, which is finite. Therefore we may expect the inner sum to converge and the overall expression to behave like a constant times $G_{\kappa}\beta^{1/3}\,\exp(-2\pi G_{\kappa}\beta^{1/3}(1-\tau_{*}))$ . Remember that $\tau_{*}>0$ , so the contribution from $n=0$ is negligible.

For the proof of the upper bound in (7.1), we bound the sum over $n$ in (7.4) by

[TABLE]

The quantity $\mathcal{R}(\ell)$ solves the renewal equation

[TABLE]

It follows from [15, Theorem 2, Chapter XI.3] and the smoothness of $\ell\mapsto\mathcal{R}(\ell)$ that

[TABLE]

and hence $\lim_{\ell\to\infty}\frac{1}{\ell}\log\mathcal{R}(\ell)=0$ . Combining this with (7.4) and recalling $\tau_{*}>0$ , we get

[TABLE]

For the proof of the lower bound in (7.1), we drop all except one term from the sum in (7.4), i.e.,

[TABLE]

This inequality holds for every $n\in{\mathbb{N}}$ , and a proper choice will be made later. Let $(X_{i})_{i\in{\mathbb{N}}}$ be i.i.d. random variables with probability density function $q_{*}$ . Then $\mathbb{E}[X_{1}]=\mu_{*}$ with $\mu_{*}=\int_{0}^{\infty}\mathrm{d}u\,u\,q_{*}(u)$ , and $(\sum_{i=1}^{n}X_{i}-n\mu_{*})/\sqrt{n}$ has probability density function

[TABLE]

Put differently,

[TABLE]

By the local central limit theorem for i.i.d. random variables with densities (see [23, Chapter 4.5]), we have

[TABLE]

with $\sigma^{2}$ the variance of $X_{1}$ . We now choose $n=n(\beta)=\lfloor 2\pi G_{\kappa}\beta^{1/3}/\mu_{*}\rfloor$ . Then $2\pi G_{\kappa}\beta^{1/3}=n(\beta)\mu_{*}+O(1)$ and

[TABLE]

Consequently,

[TABLE]

and so (7.9) gives

[TABLE]

∎
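The renewal-theoretic heuristic behind the proof of Proposition 7.1 (renewal probabilities converge to the inverse of the mean interarrival time) can be illustrated in a discrete-time analogue. This is only a sketch: the paper works with the continuous density $q_{*}$, whereas the interarrival law `f` below is a hypothetical aperiodic distribution on $\{1,2,3\}$ chosen for illustration.

```python
def renewal_sequence(f, n_max):
    """Solve u_0 = 1, u_n = sum_{k=1}^{n} f[k] * u_{n-k} (discrete renewal equation).

    f[k] is the probability that an interarrival time equals k, with f[0] = 0.
    """
    u = [1.0] + [0.0] * n_max
    for n in range(1, n_max + 1):
        u[n] = sum(f[k] * u[n - k] for k in range(1, min(n, len(f) - 1) + 1))
    return u


# Hypothetical interarrival law: P(X=1)=0.2, P(X=2)=0.5, P(X=3)=0.3.
f = [0.0, 0.2, 0.5, 0.3]
mean_interarrival = sum(k * p for k, p in enumerate(f))
u = renewal_sequence(f, 200)
limit_gap = abs(u[200] - 1.0 / mean_interarrival)
```

Here `u[n]` is the probability of a renewal at time `n` given a renewal at time `0`, and `limit_gap` measures the distance to the limiting value $1/\mathbb{E}[X_{1}]$ predicted by the renewal theorem.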

Proposition 7.1 is complemented by the following lemma, which will help us take care of small perturbations.

Lemma 7.2.

As $\delta\downarrow 0$ ,

[TABLE]

Proof.

Let $c\geq 0$ be an arbitrary constant that does not depend on $\beta$ . We write $-c\min(1,C_{1})\delta\leq O(\delta)\leq c\min(1,C_{1})\delta$ . Then

[TABLE]

The asymptotic behavior of the second term in the difference is given by Proposition 7.1. For the first term, let $\tau_{*}(c\delta)$ be the solution of

[TABLE]

Thus, $\tau_{*}(0)=\tau_{*}$ . For sufficiently small $\delta$ , the solution exists, is unique, and satisfies

[TABLE]

Arguments analogous to the proof of Proposition 7.1 show that

[TABLE]

Hence (7.17) yields

[TABLE]

A similar argument shows that

[TABLE]

∎

We close this section with the following observation, which will not be needed in the sequel but is nonetheless instructive. Let $\mathcal{P}(0,\infty)$ denote the space of probability measures on $(0,\infty)$ , equipped with the weak topology, and put

[TABLE]

Define

[TABLE]

For $(x,\mu)\in\mathcal{M}$ with $x>0$ , define

[TABLE]

where $\mathrm{EXP}(1)$ is the exponential distribution with parameter $1$ , and $H(\mu\mid\mathrm{EXP}(1))$ is the relative entropy of $\mu$ with respect to $\mathrm{EXP}(1)$ . For $(0,\mu)\in\mathcal{M}$ , define

[TABLE]

Then the family

[TABLE]

satisfies the weak LDP on $\mathcal{M}$ with rate $2\pi G_{\kappa}\beta^{1/3}$ and lower semi-continuous rate function $I_{\mathcal{T}}$ . (Not all level sets of $I_{\mathcal{T}}$ are compact.)

7.2 Properties of mean-centred Brownian bridge

Covariance of mean-centred Brownian bridge. The following lemma clarifies the nature of the process $(B_{t})_{t\in[0,2\pi]}$ and will be used repeatedly later on.

Lemma 7.3.

(a)

$(B_{t})_{t\in[0,2\pi]}$ is a Gaussian process with mean ${\mathbb{E}}[B_{t}]=0$ and covariance ${\mathbb{E}}[B_{t}B_{s}]=k(t-s)$ , where

[TABLE]

(b)

For every continuous function $f\colon\,[0,2\pi]\to{\mathbb{R}}$ ,

[TABLE]

with $\langle\cdot,\cdot\rangle$ the scalar product in $L^{2}([0,2\pi])$ , and $g(t)=(Kf)(t)=\int_{0}^{2\pi}\mathrm{d}s\,k(t-s)f(s)$ the solution of $-g^{\prime\prime}(t)=f(t)-\frac{1}{2\pi}\int_{0}^{2\pi}\mathrm{d}s\,f(s)$ , $g(2\pi)=g(0)$ and $\int_{0}^{2\pi}\mathrm{d}s\,g(s)=0$ .

Proof.

(a) $(B_{t})_{t\in[0,2\pi]}$ is a linear transformation of the Gaussian process $(W_{t})_{t\in[0,2\pi]}$ and therefore is itself Gaussian. The mean-zero property of $B_{t}$ is inherited from $W_{t}$ . The elementary computation of the covariance is similar to Deheuvels [10, Lemma 2.1]. We provide the details to identify constants. Set $M=\frac{1}{2\pi}\int_{0}^{2\pi}\mathrm{d}s\,B_{s}$ . Since ${\mathbb{E}}[W_{t}W_{s}]=\min(s,t)$ we have, for $s\leq t$ ,

[TABLE]

and hence

[TABLE]

By symmetry, the identity also holds for $t\leq s$ .

(b) (7.29) follows from standard arguments for Gaussian processes. The kernel $k\colon\,[-2\pi,2\pi]\to{\mathbb{R}}$ satisfies

[TABLE]

and is twice differentiable with second derivative $1/(2\pi)$ , except at $t=0$ where the first derivative jumps from $+\frac{1}{2}$ to $-\frac{1}{2}$ . Let $g=Kf$ . Then $g$ has mean zero and satisfies $g(2\pi)=g(0)$ . Furthermore,

[TABLE]

∎

For later purpose we record the variance of the increments, namely,

[TABLE]

Thus, for small time increments we recover the variance of standard Brownian increments. We also record the covariance of two distinct increments, namely, for $h,u\geq 0$ and $t+h\leq s$ ,

[TABLE]

Thus, two distinct increments are not independent. However, for $h,u\downarrow 0$ the covariance is negligible compared to the variance of the individual increments (since $hu=o(h)+o(u)$ ). This will be needed in Lemma 7.5 below.
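The properties of the kernel recorded above can be checked numerically. The sketch below assumes the closed form $k(t)=\frac{(|t|-\pi)^{2}}{4\pi}-\frac{\pi}{12}$ for $|t|\leq 2\pi$, which is the periodic mean-zero Green function of $-\mathrm{d}^{2}/\mathrm{d}t^{2}$ on $[0,2\pi]$ and is our reading of Lemma 7.3, not a formula quoted from it. Under this assumption the increments have variance $h(1-\frac{h}{2\pi})$ and distinct increments have covariance $-\frac{hu}{2\pi}$.

```python
import math


def k_closed(t: float) -> float:
    """Assumed closed form of the covariance kernel for |t| <= 2*pi."""
    return (abs(t) - math.pi) ** 2 / (4.0 * math.pi) - math.pi / 12.0


def k_fourier(t: float, terms: int = 20000) -> float:
    """Truncated cosine series sum_{j>=1} cos(j*t) / (pi * j^2)."""
    return sum(math.cos(j * t) / (math.pi * j * j) for j in range(1, terms + 1))


def increment_variance(h: float) -> float:
    """Var(B_{t+h} - B_t) = 2 * (k(0) - k(h))."""
    return 2.0 * (k_closed(0.0) - k_closed(h))


def increment_covariance(t: float, h: float, s: float, u: float) -> float:
    """Cov(B_{t+h} - B_t, B_{s+u} - B_s) for t + h <= s and s + u <= t + 2*pi."""
    return (k_closed(t + h - s - u) - k_closed(t + h - s)
            - k_closed(t - s - u) + k_closed(t - s))
```

For small $h$ the variance $h(1-\frac{h}{2\pi})$ is close to $h$, the variance of a standard Brownian increment, while the covariance of distinct increments is of the smaller order $hu$.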

Exponential moments for the mean-centred Brownian bridge. For $k\in{\mathbb{N}}$ , define the random variables

[TABLE]

In view of Lemma 7.3, these random variables are i.i.d. standard normal. They represent the Fourier coefficients of $B$ , i.e.,

[TABLE]

where the series converges in $L^{2}(0,2\pi)$ ${\mathbb{P}}$ -a.s. The expansion in (7.37) is the Karhunen-Loève expansion of the Gaussian process $B$ (see Alexanderian [1], Deheuvels [10], and references therein).

Lemma 7.4.

For every $-\infty<s<1$ ,

[TABLE]

For every $-\infty<s<4$ ,

[TABLE]

Proof.

Note that (7.37) implies

[TABLE]

Since $A_{k}$ , $A_{k}^{*}$ are i.i.d. standard normal, the claim follows from the identity ${\mathbb{E}}[\exp(\tfrac{1}{2}uX^{2})]=(1-u)^{-1/2}$ when $X$ is standard normal and $u<1$ . Apply this identity with $u=s/k^{2}$ , $k\in{\mathbb{N}}$ to get (7.38). The proof of (7.39) is similar. ∎
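The two ingredients of this proof can be checked numerically. The sketch below verifies the Gaussian identity ${\mathbb{E}}[\exp(\tfrac{1}{2}uX^{2})]=(1-u)^{-1/2}$ by quadrature, and compares the infinite products $\prod_{k\geq 1}(1-s/k^{2})$ and $\prod_{k\geq 2}(1-s/k^{2})$ with Euler's sine product. That the moments in Lemma 7.4 are inverses of such products is our assumption, based on each $k$ contributing the two independent normals $A_{k}$ and $A_{k}^{*}$; the function names and truncation levels are illustrative.

```python
import math


def gaussian_quadratic_moment(u: float, half_width: float = 12.0, steps: int = 200000) -> float:
    """Midpoint-rule value of E[exp(u * X^2 / 2)] for X standard normal, u < 1."""
    dx = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps):
        x = -half_width + (i + 0.5) * dx
        total += math.exp(-0.5 * (1.0 - u) * x * x)
    return total * dx / math.sqrt(2.0 * math.pi)


def truncated_product(s: float, k_min: int = 1, terms: int = 100000) -> float:
    """prod_{k=k_min}^{terms} (1 - s / k^2)."""
    prod = 1.0
    for k in range(k_min, terms + 1):
        prod *= 1.0 - s / (k * k)
    return prod


def sine_product(s: float) -> float:
    """Euler's formula: prod_{k>=1} (1 - s/k^2) = sin(pi*sqrt(s)) / (pi*sqrt(s)), s > 0."""
    root = math.pi * math.sqrt(s)
    return math.sin(root) / root
```

For $0<s<1$ the full product is positive, and for $s<4$ with $s\neq 1$ the product over $k\geq 2$ equals `sine_product(s) / (1 - s)`, which is why removing the first Fourier mode extends the admissible range of $s$ from $1$ to $4$.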

The next lemma allows us to subsume the error term $O(\varepsilon)Y_{2}$ into the error term $O(\varepsilon)N$ appearing in (6.41).

Lemma 7.5.

Let $0\leq t_{1}<\cdots<t_{n}<2\pi$ , $t_{n+1}=t_{1}+2\pi$ , and $\theta_{i}=t_{i+1}-t_{i}$ . For every $s\in(0,1)$ ,

[TABLE]

Proof.

Define $L_{i}=(B_{t_{i+1}}-B_{t_{i}})/\sqrt{\theta_{i}}$ , $1\leq i\leq n$ . It follows from (7.34) and (7.35) that $(L_{i})_{1\leq i\leq n}$ is a Gaussian vector with covariance matrix $C=(C_{ij})_{1\leq i,j\leq n}$ given by

[TABLE]

Hence $C=\mathrm{id}-P$ , where $P$ is the orthogonal projection onto the linear span of $(\sqrt{\theta_{i}/2\pi})_{i=1}^{n}$ in ${\mathbb{R}}^{n}$ . Thus, $C$ is the orthogonal projection onto the $(n-1)$ -dimensional hyperplane defined by $\{\ell=(\ell_{i})_{i=1}^{n}\colon\,\sum_{i=1}^{n}\ell_{i}\sqrt{\theta_{i}}=0\}$ . Using orthonormal coordinates on that hyperplane, we find that

[TABLE]

which settles the claim. ∎
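The projection structure used in this proof can be verified on a small example. The sketch below assumes the covariance matrix has entries $C_{ij}=\delta_{ij}-\sqrt{\theta_{i}\theta_{j}}/2\pi$, which is what the description $C=\mathrm{id}-P$ implies; the gaps `thetas` are hypothetical values summing to $2\pi$.

```python
import math


def increment_covariance_matrix(thetas):
    """C_ij = delta_ij - sqrt(theta_i * theta_j) / (2*pi)."""
    n = len(thetas)
    return [[(1.0 if i == j else 0.0)
             - math.sqrt(thetas[i] * thetas[j]) / (2.0 * math.pi)
             for j in range(n)] for i in range(n)]


def matmul(a, b):
    """Plain product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


# Hypothetical angular gaps, normalised so that they sum to 2*pi.
weights = [1.0, 2.0, 3.0, 4.0]
thetas = [2.0 * math.pi * w / sum(weights) for w in weights]
C = increment_covariance_matrix(thetas)
C_squared = matmul(C, C)
trace = sum(C[i][i] for i in range(len(thetas)))
# C annihilates the vector (sqrt(theta_i))_i, which spans the range of P.
kernel_image = [sum(C[i][j] * math.sqrt(thetas[j]) for j in range(len(thetas)))
                for i in range(len(thetas))]
```

Since $C$ is idempotent with trace $n-1$, the sum $\sum_{i}L_{i}^{2}$ is distributed as a sum of $n-1$ independent squared standard normals, which is what makes the exponential moment explicit.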

7.3 Discretisation errors

Deterministic discretisation errors.

Later we need to bound the error in the approximations

[TABLE]

and discretisation errors for Fourier coefficients. Lemmas 7.7 and 7.8 treat the errors as random variables and provide bounds for their exponential moments. The proofs build on bounds for deterministic discretisation errors, which we provide first.

Let $\mathcal{H}$ be the space of absolutely continuous functions $\tau\colon\,[0,2\pi]\to{\mathbb{R}}$ with square-integrable derivative, satisfying $\tau(2\pi)=\tau(0)$ and $\int_{0}^{2\pi}\tau(s)\,\mathrm{d}s=0$ . Let $\tau\in{\mathcal{H}}$ . Note that $|\tau(t)-\tau(s)|=|\int_{s}^{t}\dot{\tau}(s^{\prime})\mathrm{d}s^{\prime}|\leq\|\dot{\tau}\|_{2}\sqrt{2\pi}$ , and hence

[TABLE]

Set

[TABLE]

and consider the sums

[TABLE]

Lemma 7.6.

Suppose that $\tau\in{\mathcal{H}}$ and put $\varepsilon_{n}=\sum_{i=1}^{n}\theta_{i}^{3}=y_{2}(z)$ . Then

[TABLE]

Proof.

Note that

[TABLE]

and

[TABLE]

It therefore follows that

[TABLE]

which is the inequality for $\Lambda_{1}$ .

Let $f\colon\,{\mathbb{R}}\to{\mathbb{R}}$ be absolutely continuous and $2\pi$ -periodic. Suppose for simplicity that $t_{1}=0$ (otherwise replace integrals over $[0,2\pi]$ by integrals over $[t_{1},t_{n+1}]=[t_{1},t_{1}+2\pi]$ ). By partial integration we have

[TABLE]

from which we get, by Cauchy-Schwarz,

[TABLE]

The bound on $\Lambda_{2}$ is obtained from (7.53) by picking $f(s)=\tau(s)$ and using that $\int_{0}^{2\pi}\mathrm{d}s\,\tau(s)=0$ . The bounds on $\Lambda_{3}$ and $\Lambda_{4}$ are obtained from (7.53) by picking $f(s)=\cos s$ and $f(s)=\sin s$ , respectively, and using that $\int_{0}^{2\pi}\mathrm{d}s\,\cos s=\int_{0}^{2\pi}\mathrm{d}s\,\sin s=0$ . ∎
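The mechanism behind these bounds can be tested numerically. The sketch below uses the estimate $|\sum_{i}\theta_{i}f(t_{i})-\int_{0}^{2\pi}f|\leq\|f^{\prime}\|_{2}\sqrt{\varepsilon_{n}/3}$, which follows from integration by parts and Cauchy-Schwarz; this is our own derivation of a bound of the type used above and need not coincide with the constants in (7.53). The random partitions are hypothetical test data with $t_{1}=0$.

```python
import math
import random


def random_partition(n: int, seed: int):
    """Points 0 = t_1 < ... < t_n < 2*pi (hypothetical test data, fixed seed)."""
    rng = random.Random(seed)
    return [0.0] + sorted(rng.uniform(0.0, 2.0 * math.pi) for _ in range(n - 1))


def gaps(ts):
    """theta_i = t_{i+1} - t_i with t_{n+1} = t_1 + 2*pi."""
    extended = ts + [ts[0] + 2.0 * math.pi]
    return [extended[i + 1] - extended[i] for i in range(len(ts))]


def riemann_error(f, ts):
    """|sum_i theta_i * f(t_i)| for a function f with zero integral over [0, 2*pi]."""
    return abs(sum(theta * f(t) for t, theta in zip(ts, gaps(ts))))


def cauchy_schwarz_bound(ts, derivative_l2_norm: float) -> float:
    """||f'||_2 * sqrt(eps_n / 3) with eps_n = sum_i theta_i^3."""
    eps_n = sum(theta**3 for theta in gaps(ts))
    return derivative_l2_norm * math.sqrt(eps_n / 3.0)
```

For $f=\cos$ and $f=\sin$ the derivative has $L^{2}([0,2\pi])$ norm $\sqrt{\pi}$, so the discretisation error of the first Fourier modes is controlled by $\sqrt{\pi\varepsilon_{n}/3}$.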

Random discretisation errors.

The estimates of deterministic discretisation errors in Lemma 7.6 can be used to derive estimates of exponential moments of random discretisation errors, which are needed later.

Lemma 7.7 (Discretised mean).

Put $\varepsilon_{n}=\sum_{i=1}^{n}\theta_{i}^{3}$ . Then, for every $s\in{\mathbb{R}}$ ,

[TABLE]

Proof.

Write

[TABLE]

where

[TABLE]

is a probability measure on $[0,2\pi]$ . Apply (7.29) to get

[TABLE]

with

[TABLE]

The proof proceeds in two approximation steps. First, use (7.32) to write

[TABLE]

Substitute this into (7.58) to get

[TABLE]

Next, use (7.32) to write

[TABLE]

Substitute this into (7.60) to get

[TABLE]

Finally, note that $|2u^{\prime}-t_{i}-t_{i+1}|\leq\theta_{i}$ , $|t_{j+1}-2u^{\prime\prime}-t_{j}|\leq\theta_{j}$ and $\|\ddot{k}\|_{\infty}=\tfrac{1}{2}$ , to estimate

[TABLE]

Combine this with (7.55) and (7.57), and note that $G\geq 0$ , to get the claim in (7.54). ∎

Lemma 7.8.

Put $\varepsilon_{n}=\sum_{i=1}^{n}\theta_{i}^{3}$ . Then, for every $s\in{\mathbb{R}}$ with $|s|<1/\sqrt{2\pi\varepsilon_{n}}$ ,

[TABLE]

Proof.

The proof uses a determinant formula for exponential moments of quadratic functionals in abstract Wiener spaces (see (7.73) below).

We view $B$ as a random variable taking values in the Banach space $E$ of mean-zero, $2\pi$ -periodic, continuous functions $f\colon\,{\mathbb{R}}\to{\mathbb{R}}$ , equipped with the supremum norm $\|\cdot\|_{\infty}$ . The scalar product $\langle f,g\rangle_{\mathcal{H}}=\int_{0}^{2\pi}\mathrm{d}t\,f^{\prime}(t)g^{\prime}(t)$ turns ${\mathcal{H}}\subset E$ into a real Hilbert space. For $\mu$ a finite signed measure on $[0,2\pi]$ with total mass zero, define $(-\Delta)^{-1}\mu\in{\mathcal{H}}$ by

[TABLE]

with $k(t-t^{\prime})$ the Green function from Lemma 7.3. Following the proof of Lemma 7.3(b), we can check that $g=(-\Delta)^{-1}\mu$ is in ${\mathcal{H}}$ and satisfies $-g^{\prime\prime}=\mu$ in the distributional sense:

[TABLE]

When $f\in E\setminus{\mathcal{H}}$ , we define $\langle g,f\rangle_{\mathcal{H}}$ as $\mu(f)$ , so that (7.66) remains true. A slight adaptation of the proof of Lemma 7.3(b) then shows that

[TABLE]

Now consider the continuous quadratic form

[TABLE]

The restriction of $Q$ to ${\mathcal{H}}$ is represented by a bounded symmetric operator $\tilde{Q}\colon\,{\mathcal{H}}\to{\mathcal{H}}$ that is uniquely defined by the requirement that

[TABLE]

From (7.45) and the first line of (7.48) we know that

[TABLE]

Hence $\|\tilde{Q}\|\leq\sqrt{2\pi\varepsilon_{n}}$ and so, for all $s\in{\mathbb{R}}$ with $|s|<1/\sqrt{2\pi\varepsilon_{n}}$ , the operator $\mathrm{id}-s\tilde{Q}$ is positive definite and invertible in ${\mathcal{H}}$ , with bounded inverse.

Next, we check that $\tilde{Q}$ is a trace-class operator (Simon [36]). From (7.68) we see that $\tilde{Q}$ is a difference of two terms. The first term corresponds to the finite sum in (7.68), has finite rank, and is therefore trivially trace class. The second term is $(-\Delta)^{-1}$ . Indeed, for $f\in{\mathcal{H}}$ , setting $F=(-\Delta)^{-1}f\in{\mathcal{H}}$ and integrating by parts, we have

[TABLE]

Consider the orthonormal basis of ${\mathcal{H}}$ consisting of the vectors

[TABLE]

Then $(-\Delta)^{-1}e_{k}=\frac{1}{k^{2}}e_{k}$ and $(-\Delta)^{-1}e^{*}_{k}=\frac{1}{k^{2}}e^{*}_{k}$ . Thus, $(-\Delta)^{-1}$ is trace class with trace $2\sum_{k=1}^{\infty}1/k^{2}<\infty$ , and $\tilde{Q}$ , as a difference of two trace class operators, is itself trace class. It follows from general results on abstract Wiener spaces (Chiang, Chow and Lee [5]) that, for every $s\in{\mathbb{R}}$ with $|s|<(2\pi\varepsilon_{n})^{-1/2}$ , the exponential moments of $Q(B,B)$ are given by

[TABLE]

where the determinant is a Fredholm determinant. Indeed, if $\lambda_{1}\geq\lambda_{2}\geq\cdots$ are the eigenvalues of $\tilde{Q}$ , then $\max_{j\in{\mathbb{N}}}|\lambda_{j}|=\|\tilde{Q}\|=O(\sqrt{\varepsilon_{n}})$ , and the Fredholm determinant is given by

[TABLE]

Thus, it remains to estimate the trace of $\tilde{Q}$ . To that aim we apply the first line of (7.48) for $\tau(t)=e_{k}(t)$ . Since $\|e_{k}\|_{2}=1/k$ and $\|\dot{e}_{k}\|_{\infty}=1/\sqrt{\pi}$ , this gives

[TABLE]

On the other hand, clearly $Q(e_{k},e_{k})\leq 4\pi\|e_{k}\|_{\infty}^{2}=4/k^{2}$ . Similar bounds hold for $Q(e^{*}_{k},e^{*}_{k})$ . Therefore

[TABLE]

The proof is concluded with (7.73) and (7.74). ∎
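The passage from the Fredholm determinant to a trace estimate rests on an elementary inequality, illustrated below for a diagonal operator. This is a sketch under stated assumptions: the eigenvalues `eigs` are hypothetical, and the bound $-\log(1-x)\leq x+x^{2}$ for $|x|\leq\tfrac{1}{2}$ is one convenient way to compare $\log\det(\mathrm{id}-s\tilde{Q})^{-1/2}$ with $\tfrac{s}{2}\mathrm{Tr}\,\tilde{Q}$, not necessarily the estimate used in (7.74). The last function checks the value $2\sum_{k\geq 1}1/k^{2}=\pi^{2}/3$ of the trace of $(-\Delta)^{-1}$.

```python
import math


def log_exp_moment(s: float, eigenvalues) -> float:
    """log det(id - s*Q)^(-1/2) = -(1/2) * sum_j log(1 - s*lambda_j) for diagonal Q."""
    return -0.5 * sum(math.log(1.0 - s * lam) for lam in eigenvalues)


def trace_bound(s: float, eigenvalues) -> float:
    """(s/2)*tr(Q) + (s^2/2)*sum_j lambda_j^2, valid when |s*lambda_j| <= 1/2 for all j."""
    return 0.5 * s * sum(eigenvalues) + 0.5 * s * s * sum(lam * lam for lam in eigenvalues)


def inverse_laplacian_trace(terms: int = 100000) -> float:
    """Partial sum of 2 * sum_{k>=1} 1/k^2, the trace of (-Delta)^{-1} on H."""
    return 2.0 * sum(1.0 / (k * k) for k in range(1, terms + 1))


# Hypothetical eigenvalues of both signs with |lambda_j| <= 0.1.
eigs = [((-1) ** j) * 0.1 / (j * j) for j in range(1, 200)]
```

With $|s|\leq 4$ every product $s\lambda_{j}$ stays in $[-\tfrac{1}{2},\tfrac{1}{2}]$, so the log-moment is dominated by the trace term plus a quadratic correction.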

Fourier coefficients.

Define the discretized Fourier coefficients

[TABLE]

By a slight abuse of notation we shall use the same letter $D_{1}$ , $D_{1}^{*}$ for the random variable obtained by substituting $t_{i}\to T_{i}$ , $\theta_{i}\to\Theta_{i}$ .

Lemma 7.9.

Put $\varepsilon_{n}=\sum_{i=1}^{n}\theta_{i}^{3}$ . Then, for every $|s|<1/8\sqrt{2\pi\varepsilon_{n}/3}$ ,

[TABLE]

A similar bound holds for $A_{1}^{*2}-{D_{1}^{*}}^{2}$ .

Proof.

The proof is similar to that of Lemma 7.8. Let

[TABLE]

and let $\tilde{Q}\colon\,{\mathcal{H}}\to{\mathcal{H}}$ be the associated symmetric operator. Then $\tilde{Q}$ has finite rank and is therefore trace-class. Using the identity $|x^{2}-y^{2}|=|x+y|\,|x-y|$ , $x,y\in{\mathbb{R}}$ , in combination with (7.45) and the second line of (7.48), we see that for all $h\in{\mathcal{H}}$ ,

[TABLE]

Consequently, $\|\tilde{Q}\|\leq 8\sqrt{2\pi\varepsilon_{n}/3}$ , $\mathrm{id}-s\tilde{Q}$ is positive definite for all $|s|<1/8\sqrt{2\pi\varepsilon_{n}/3}$ , and

[TABLE]

and similarly for $e^{*}_{k}$ . The proof is concluded in the same way as that of Lemma 7.8. ∎

8 Asymptotics of surface integrals: proof of moderate deviations

In this section we collect the preparations in Sections 5–7 and prove Theorems 2.5–2.6. Section 8.1 proves an upper bound for the key surface integral $I^{\mathrm{UB}}(\kappa,\beta;C,\varepsilon)$ (Proposition 8.1), which together with conditions (C1) and (C3) in Section 2.2 proves the desired upper bound in Theorem 2.5. The proof consists of a sequence of steps involving decomposition, elimination and separation of terms. A particularly delicate point is how to control the integral over $m$ in (6.41): we show that only small values of $m$ contribute, namely, $|m|=O(\beta^{1/6})$ . Section 8.2 proves a lower bound for the key surface integral $I^{\mathrm{LB}}(\kappa,\beta;C,\varepsilon)$ (Proposition 8.2), which together with conditions (C1) and (C2) in Section 2.2 proves the desired lower bound in Theorem 2.5. In Sections 8.1–8.2 we also prove Theorem 2.6. The proof does not need conditions (C1)–(C3), but has non-matching upper and lower bounds. The proof of the lower bound relies on a rough version of (C1) that can be proved easily.

8.1 Upper bound

We prove the following:

Proposition 8.1 (Upper bound key integral).

For every $C\in(0,\infty)$ and all $\varepsilon>0$ sufficiently small, there exists a $c=c(C,\varepsilon)>0$ such that

[TABLE]

where $\chi=\sum_{i=1}^{N}\Theta_{i}\overline{B_{T_{i}}}^{2}-D_{1}^{2}-{D_{1}^{*}}^{2}$ and $D_{1},D_{1}^{*}$ are defined in (7.77).

Before we embark on the proof of the proposition, we use it to complete the proofs of the upper bounds in Theorems 2.5 and 2.6.

Proof of the upper bound in Theorem 2.5.

Proposition 8.1, when combined with conditions (C1) and (C3) in Theorem 2.5, yields

[TABLE]

with $\tau_{**}$ given in (2.29). Combining with Corollary 6.4, we get

[TABLE]

Combining with Lemma 6.2, we find

[TABLE]

Contributions from $\mathcal{V}_{C\beta^{-2/3}}\cap\mathcal{D}_{\varepsilon}^{c}$ are bounded by Lemma 6.1 and are negligible. The upper bound in Theorem 2.5 follows after letting $\varepsilon\downarrow 0$ with $C$ fixed. ∎

Proof of the upper bound in Theorem 2.6.

Without the conditions (C1)–(C3) from Theorem 2.5, we estimate the right-hand side of (8.1) by dropping the indicator. We decompose $\chi$ into three parts

[TABLE]

with

[TABLE]

We first look at conditional expectations. Note that $Y_{1}$ depends on $N$ and $\Theta_{i}$ , $1\leq i\leq N$ , alone. A repeated application of the Cauchy-Schwarz inequality yields

[TABLE]

By Lemma 7.8, for every $s\in(-1,1)$ with $|s|<1/\sqrt{2\pi Y_{1}}$ ,

[TABLE]

By Lemma 7.9, for every $|s|\leq 1/\sqrt{2\pi Y_{1}}$ ,

[TABLE]

A similar estimate holds for $A_{1}^{*2}-D_{1}^{*2}$ . Via Cauchy-Schwarz, it follows that

[TABLE]

as long as $2|s|\leq 1/\sqrt{2\pi Y_{1}}$ . We would like to apply the estimates in (8.8) and (8.10) with $s=3[1+O(\varepsilon)]$ . From the a priori estimates in Corollary 5.4 we know that $Y_{1}=O(\varepsilon)$ . Hence $1/\sqrt{2\pi Y_{1}}\geq c^{\prime}/\sqrt{\varepsilon}$ for some $c^{\prime}>0$ . Thus, $|s|\leq c^{\prime}/\sqrt{4\varepsilon}$ is sufficient to ensure the condition $2|s|<1/\sqrt{2\pi Y_{1}}$ . Therefore we see that $s=3[1+O(\varepsilon)]$ satisfies the bound $2|s|\leq 1/\sqrt{2\pi Y_{1}}$ , so that (8.8) and (8.10) are valid.

In order to get rid of $Y_{1}$ in the right-hand side of (8.10), we use two estimates: the a priori estimate $Y_{1}=O(\varepsilon)$ and the bound $Y_{1}\geq N(2\pi/N)^{3}$ , which gives $\log(1/Y_{1})=O(\log N)=O(N)$ . Therefore

[TABLE]

A similar bound holds for $E_{1}$ . We now estimate the term with $E_{2}$ . The tilt by $\mathrm{e}^{Y_{0}-\beta C_{1}Y_{1}}$ affects the angular point process only, so it is still true under $\widehat{\mathbb{P}}$ that the distribution of $(B_{t})_{t\in[0,2\pi]}$ is a mean-centred Brownian bridge independent from $N$ and the $\Theta_{i}$ , $1\leq i\leq N$ . Since $s=3[1+O(\varepsilon)]<4$ , by picking $\varepsilon$ small enough, we get from Lemma 7.4 that

[TABLE]

Note that this upper bound does not grow with $\beta$ . Hence, combining (8.8), (8.12) and (8.11), inserting into (8.7) and taking expectations, we find

[TABLE]

Dividing by $\beta^{1/3}$ and taking the limit $\beta\to\infty$ , we obtain via Lemma 7.2 that

[TABLE]

From here on the upper bound is proven in the same way as in Theorem 2.5. ∎

Next we turn to the proof of Proposition 8.1. The representation (6.41) for the key integral and the asymptotics from Proposition 7.1 leave us with the task of bounding

[TABLE]

The main idea is the following. The error terms $O(\varepsilon)(Y_{1}+Y_{3}+N)$ should be negligible, and therefore our primary concern is $\frac{1}{2}Y_{3}^{(m)}$ in the exponential. In order to deal with this term, we approximate

[TABLE]

(recall that $\int_{0}^{2\pi}B_{t}\mathrm{d}t=0$ ). The resulting expression is problematic because

[TABLE]

To cure the divergence, we use the geometric constraints. Roughly speaking, we show that the volume constraint imposes that the only relevant contributions are from $|m|=O(\beta^{1/6})$ . In addition, we show that the centring constraint imposes that the Fourier coefficients $D_{1}$ and $D_{1}^{*}$ are negligible, so that we may replace $\sum_{i=1}^{N}\overline{B_{T_{i}}}^{2}\Theta_{i}$ by $\chi=\sum_{i=1}^{N}\overline{B_{T_{i}}}^{2}\Theta_{i}-D_{1}^{2}-{D_{1}^{*}}^{2}$ . This helps because of (2.33) in condition (C3).

The proof comes in 8 steps.

1. Decomposition of $Y_{3}^{(m)}$ . As a preliminary step we decompose $Y_{3}^{(m)}$ as

[TABLE]

with (recall that $\overline{B_{T_{i}}}=\frac{1}{2}(B_{T_{i}}+B_{T_{i+1}})$ and $\chi$ is defined in (2.32))

[TABLE]

To check (8.18), we apply the variance formula

[TABLE]

with $n=N$ , $p_{i}=\frac{\Theta_{i}}{2\pi}$ and $x_{i}=m+\overline{B_{T_{i}}}$ , respectively, $x_{i}=\overline{B_{T_{i}}}$ , to get

[TABLE]

The claim in (8.18) now follows from the relation

[TABLE]

Note that $\mathcal{E}_{1}^{(m)}$ , $\mathcal{E}_{2}$ , $\mathcal{E}_{3}$ are non-negative, while $\chi$ is not necessarily so. The terms $\mathcal{E}_{1}^{(m)}$ and $\mathcal{E}_{2}$ will be taken care of in Steps 2 and 4 via the volume and centring constraints. The term $\chi$ will be taken care of via (2.32) in condition (C3). The non-negative term $\mathcal{E}_{3}$ can simply be dropped for the upper bound.
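The decomposition in Step 1 rests on a variance identity for the weights $p_{i}=\Theta_{i}/(2\pi)$. The following sketch checks numerically one standard form of such an identity, $\sum_{i}p_{i}x_{i}^{2}-(\sum_{i}p_{i}x_{i})^{2}=\sum_{i}p_{i}(x_{i}-\bar{x})^{2}$ with $\bar{x}=\sum_{i}p_{i}x_{i}$. Whether (8.20) is stated in exactly this form is an assumption. The random angular increments and the Gaussian values standing in for $m+\overline{B_{T_{i}}}$ are hypothetical test data, not the processes of the paper.

```python
import math
import random

def weighted_variance_two_ways(p, x):
    """Return (sum p x^2 - (sum p x)^2, sum p (x - mean)^2) for weights p summing to 1."""
    mean = sum(pi * xi for pi, xi in zip(p, x))
    raw = sum(pi * xi * xi for pi, xi in zip(p, x)) - mean * mean
    centred = sum(pi * (xi - mean) ** 2 for pi, xi in zip(p, x))
    return raw, centred

def random_angular_increments(n, rng):
    """Hypothetical stand-in for Theta_1..Theta_N: positive increments summing to 2*pi."""
    gaps = [rng.expovariate(1.0) for _ in range(n)]
    total = sum(gaps)
    return [2.0 * math.pi * g / total for g in gaps]

rng = random.Random(0)
theta = random_angular_increments(50, rng)
p = [t / (2.0 * math.pi) for t in theta]       # p_i = Theta_i / (2*pi)
m = 0.3                                        # arbitrary shift
b = [rng.gauss(0.0, 1.0) for _ in theta]       # stand-in for the averaged bridge values
x = [m + bi for bi in b]                       # x_i = m + b_i

raw, centred = weighted_variance_two_ways(p, x)
raw_unshifted, _ = weighted_variance_two_ways(p, b)
print(raw, centred, raw_unshifted)
```

The last value illustrates that the weighted variance does not depend on the shift $m$, which is why applying the formula with $x_{i}=m+\overline{B_{T_{i}}}$ and with $x_{i}=\overline{B_{T_{i}}}$ gives comparable expressions.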

2. Elimination of $\mathcal{E}_{1}^{(m)}$ with the help of the volume constraint. Next we exploit the volume constraint and the a priori estimates to get rid of $\mathcal{E}_{1}^{(m)}$ . By Proposition 5.8, recalling the definitions in Proposition 6.6, we have

[TABLE]

where we abbreviate

[TABLE]

By the triangle inequality, on the event that $||S(Z^{(m)})|-\pi R_{c}^{2}|\leq C\beta^{-2/3}$ we have

[TABLE]

An elementary computation based again on the variance formula in (8.20) gives

[TABLE]

Consequently, (8.25) becomes

[TABLE]

From the a priori estimate in Proposition 5.2 we have $\rho_{i}=O(\varepsilon)$ and

[TABLE]

Therefore

[TABLE]

which together with (8.27) gives

[TABLE]

Combine the estimates in (8.28) and (8.30) to obtain

[TABLE]

Insert this estimate into (8.18) and drop the term $\mathcal{E}_{3}$ , to find

[TABLE]

3. Estimation of $m^{2}$ . Next we estimate $m^{2}$ , which will be needed later. Write

[TABLE]

and use $(a-b)^{2}\leq 2(a^{2}+b^{2})$ , to estimate

[TABLE]

Up to a multiplicative constant, the first term is equal to $\mathcal{E}_{1}^{(m)}$ , which has been estimated in Step 2. For the second term we use Cauchy-Schwarz and (8.22). Hence

[TABLE]

4. Elimination of $\mathcal{E}_{2}$ with the help of the centring constraint. Next we exploit the centring constraint and the a priori estimates to get rid of $\mathcal{E}_{2}=D_{1}^{2}+{D_{1}^{*}}^{2}$ . We estimate $D_{1}^{2}$ only, since ${D_{1}^{*}}^{2}$ can be treated analogously. We have

[TABLE]

Hence

[TABLE]

In the first sum, we use the a priori estimate $\rho_{i}=([\kappa-1]\beta)^{-1/2}(m+B_{T_{i}})=O(\varepsilon)$ . In the second sum, we use that by (5.56) and the a priori estimate $\theta_{i}=O(\sqrt{\varepsilon}\,)$ we have $\sum_{i=1}^{N}\overline{\cos T_{i}}\,\Theta_{i}=O(Y_{1})=O(\varepsilon)$ . Hence

[TABLE]

The term $m^{2}$ appearing in the right-hand side has been estimated in (8.35). For the first term, we exploit the centring constraint. Define

[TABLE]

which has already been estimated in (8.30). By Proposition 5.9, recalling Definition 5.3 and the notation in Proposition 6.6, we have $\mathcal{C}(Z^{(m)})=(\Sigma_{1},\Sigma_{2})$ with

[TABLE]

where

[TABLE]

By (8.40), on the event that $|\Sigma_{1}|\leq C\beta^{-2/3}$ we have

[TABLE]

Multiply both sides by $\beta$ and combine with (8.38), to get

[TABLE]

A similar estimate holds for ${D_{1}^{*}}^{2}$ . Combine (8.43) with the bounds in (8.28), (8.30), (8.32) and (8.35) for $Y_{3}^{(m)}$ , $|Y_{4}^{(m)}|$ and $m^{2}$ , to obtain

[TABLE]

We subtract $O(\varepsilon)\,\mathcal{E}_{2}$ on both sides and multiply by $[1-O(\varepsilon)]^{-1}=1+O(\varepsilon)$ , to conclude that we may drop the term $O(\varepsilon)\,\mathcal{E}_{2}$ from (8.44). Finally, we insert the estimate thus obtained into the bound (8.32) for $Y_{3}^{(m)}$ , to find

[TABLE]

5. Only small $m$ contribute. From (8.35) and (8.44) we get

[TABLE]

We may think of $M^{\varepsilon}$ as a random variable that is typically of order $\beta^{1/3}$ , so that $m$ is typically of order $\beta^{1/6}$ (at most). However, we need to estimate the contribution of the event that $M^{\varepsilon}$ is much larger than its typical order of magnitude. Fix $C^{\prime}>0$ (to be chosen later). By (8.46), we have

[TABLE]

In the last line we first use the bound on $Y_{3}^{(m)}$ from (8.45) and then drop the indicator on $m$ . The exponential in the last line is independent of $m$ . We bound $\mathbf{1}_{\{C^{\prime}\beta^{1/3}/\varepsilon<m^{2}\leq M^{\varepsilon}\}}\leq\mathbf{1}_{\{M^{\varepsilon}>C^{\prime}\beta^{1/3}/\varepsilon\}}\mathbf{1}_{\{m^{2}\leq M^{\varepsilon}\}}$ and perform the integral over $m$ , to find that we can further bound (8.47) by

[TABLE]

We can get rid of $\sqrt{M^{\varepsilon}}$ via the inequality $x\leq\mathrm{e}^{x-1}$ , $x\in{\mathbb{R}}$ , with $x=\varepsilon M^{\varepsilon}$ , which yields $\sqrt{M^{\varepsilon}}\leq\frac{1}{\sqrt{\varepsilon\mathrm{e}}}\exp(\frac{1}{2}\varepsilon\,M^{\varepsilon})$ . Because of (8.18), the term $\frac{1}{2}\varepsilon M^{\varepsilon}$ can be absorbed into the exponential. We can get rid of the indicator of the event $\{M^{\varepsilon}>C^{\prime}\beta^{1/3}/\varepsilon\}$ by estimating $\mathbf{1}_{\{M^{\varepsilon}>C^{\prime}\beta^{1/3}/\varepsilon\}}\leq\exp(-C^{\prime}\beta^{1/3}+\varepsilon M^{\varepsilon})$ and again absorbing the term $\varepsilon M^{\varepsilon}$ into the exponential. Thus (8.48) is bounded by

[TABLE]

Using Hölder’s inequality and Lemmas 7.2, 7.4 and 7.5, we get

[TABLE]

for some $k(\varepsilon)<\infty$ . The details are similar to Steps 6–7 below and therefore are omitted. Hence, given $\varepsilon>0$ we can make (8.49) arbitrarily small by making $C^{\prime}$ sufficiently large. Altogether we obtain the following statement of exponential tightness: For every $C^{\prime\prime}>0$ there exists a $C^{\prime}=C^{\prime}(\varepsilon,C,C^{\prime\prime})>0$ such that

[TABLE]

Hence we need only estimate contributions coming from $|m|\leq\sqrt{C^{\prime}/\varepsilon}\,\beta^{1/6}$ .
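The two elementary bounds used in Step 5 to remove $\sqrt{M^{\varepsilon}}$ and the indicator can be checked numerically. The sketch below evaluates both on a small grid; the grid values and the threshold, which stands in for $C^{\prime}\beta^{1/3}/\varepsilon$, are arbitrary illustrative choices.

```python
import math

def sqrt_bound(M, eps):
    """(eps*e)^(-1/2) * exp(eps*M/2), obtained from x <= e^(x-1) with x = eps*M."""
    return math.exp(0.5 * eps * M) / math.sqrt(eps * math.e)

def indicator_bound(M, eps, threshold):
    """exp(eps*(M - threshold)), which dominates the indicator of {M > threshold}."""
    return math.exp(eps * (M - threshold))

values_M = [0.0, 0.5, 1.0, 10.0, 100.0, 1000.0]
values_eps = [0.01, 0.1, 0.5]
threshold = 50.0   # plays the role of C' * beta^(1/3) / eps
tol = 1.0 + 1e-12  # guards against rounding in the equality case eps*M = 1

sqrt_ok = all(math.sqrt(M) <= sqrt_bound(M, eps) * tol
              for M in values_M for eps in values_eps)
indicator_ok = all((1.0 if M > threshold else 0.0) <= indicator_bound(M, eps, threshold)
                   for M in values_M for eps in values_eps)
print(sqrt_ok, indicator_ok)
```

With `threshold` equal to $C^{\prime}\beta^{1/3}/\varepsilon$, the quantity `eps * (M - threshold)` is exactly $-C^{\prime}\beta^{1/3}+\varepsilon M^{\varepsilon}$, the exponent appearing in the text.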

6. Separation of terms with the Hölder inequality. By (8.45) and the Hölder inequality, we have for all $c>0$ and $p,q\geq 1$ with $p^{-1}+q^{-1}=1$ ,

[TABLE]

(We have dropped the indicator in the second term, because it will not be needed.) We will want to choose $p$ close to $1$ , which makes $q$ large and potentially dangerous for the second term in (8.52). It will turn out that a good choice is

[TABLE]

for which $p=1+O(\sqrt{\varepsilon}\,)$ .
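The trade-off described in Step 6 can be illustrated with a small numerical sketch of Hölder's inequality $\mathbb{E}[XY]\leq\mathbb{E}[X^{p}]^{1/p}\,\mathbb{E}[Y^{q}]^{1/q}$ for an empirical measure. The choice $p=1+\sqrt{\varepsilon}$ and the lognormal samples are illustrative assumptions and are not the values from (8.53). The point is that $Y=\mathrm{e}^{\varepsilon Z}$ raised to the power $q\approx 1/\sqrt{\varepsilon}$ still only carries an exponent of order $\sqrt{\varepsilon}$.

```python
import math
import random

def holder_sides(xs, ys, p):
    """Return (E[XY], E[X^p]^(1/p) * E[Y^q]^(1/q), q) for the empirical measure, 1/p + 1/q = 1."""
    q = p / (p - 1.0)
    n = len(xs)
    lhs = sum(x * y for x, y in zip(xs, ys)) / n
    rhs = (sum(x ** p for x in xs) / n) ** (1.0 / p) \
        * (sum(y ** q for y in ys) / n) ** (1.0 / q)
    return lhs, rhs, q

rng = random.Random(1)
eps = 0.01
p = 1.0 + math.sqrt(eps)   # illustrative: p close to 1, so q is large
xs = [math.exp(rng.gauss(0.0, 1.0)) for _ in range(2000)]        # main term (positive)
ys = [math.exp(eps * rng.gauss(0.0, 1.0)) for _ in range(2000)]  # error term e^{eps * Z}

lhs, rhs, q = holder_sides(xs, ys, p)
print(f"p = {p:.3f}, q = {q:.1f}, q*eps = {q * eps:.3f}, lhs = {lhs:.4f}, rhs = {rhs:.4f}")
```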

7. Estimation of the second term in (8.52). Note that $Y_{1}$ depends on $N$ and $\Theta_{i}$ , $1\leq i\leq N$ , alone. The tilt by $\mathrm{e}^{Y_{0}-\beta C_{1}Y_{1}}$ in the definition of $\widehat{\mathbb{P}}$ affects the angular point process only, so it is still true under $\widehat{\mathbb{P}}$ that the distribution of $(B_{t})_{t\in[0,2\pi]}$ is a mean-centred Brownian bridge independent from $N$ and the $\Theta_{i}$ , $1\leq i\leq N$ . Therefore, by (6.40) and Lemma 7.5, for every $s$ such that $s\in(-1,1)$ ,

[TABLE]

Applying this identity with $s=2qO(\varepsilon)=O(\sqrt{\varepsilon}\,)$ (which falls in $(-1,1)$ for $\varepsilon$ sufficiently small), we get

[TABLE]

Multiplying both sides by $\mathrm{e}^{qO(\varepsilon)(\beta Y_{1}+N)}$ and taking expectations, we find

[TABLE]

It now follows from Lemma 7.2 with $\delta=q\varepsilon=c\sqrt{\varepsilon}$ that

[TABLE]

Hence the second term in (8.52) is negligible.

8. Conclusion. Combine (8.51)–(8.52) and (8.57) to get

[TABLE]

for suitable $c=c(\varepsilon,C)>0$ . Together with the representation (6.41) for the key integral and Proposition 7.1, this completes the proof of Proposition 8.1.

8.2 Lower bound

The proof of the lower bound in Theorem 2.5 builds on the following key proposition.

Proposition 8.2.

For all $C\in(0,\infty)$ sufficiently large and all $\varepsilon>0$ sufficiently small,

[TABLE]

Proof of the lower bound in Theorem 2.5.

The lower bound in Theorem 2.5 follows from Corollary 6.4, Proposition 8.2 and conditions (C1) and (C2) in Section 2.2. ∎

Proof of the lower bound in Theorem 2.6.

For the proof of the lower bound in Theorem 2.6, we work directly with the surface integrals and skip the auxiliary random variables. By Corollary 6.4, we have

[TABLE]

where $n\in{\mathbb{N}}$ is arbitrary, $z=(z_{1},\ldots,z_{n})$ and $z_{i}=(r_{i}\cos t_{i},r_{i}\sin t_{i})$ . We pick $n=\lfloor k\beta^{1/3}\rfloor$ with $k>0$ some constant, write $r_{i}=R_{c}-2+\rho_{i}$ , and restrict the integral to the domain

[TABLE]

with $\varepsilon_{1}\in(0,\frac{1}{3})$ and $\varepsilon_{2}>0$ .

Lemma 8.3.

Let $z_{i}\in M$ , $i=1,\dots,n$ , and let $\varepsilon_{1}\leq 1/3$ and $\varepsilon_{2}\leq\frac{R_{c}-2}{2k^{2}}$ . Then $z\in{\mathcal{O}}\cap{\mathcal{D}}^{\prime}_{\varepsilon}(0)$ for sufficiently large $\beta$ .

Proof.

First, we prove that $z\in{\mathcal{O}}$ . Employing Proposition 5.7, we need to show that every triplet $(z_{i-1},z_{i},z_{i+1})$ is extremal. In fact, we will show that the half-line $\ell_{i}$ starting at the origin and passing through $z_{i}$ intersects the circle $\partial B_{2}(z_{i})$ in a point $p\notin B_{2}(z_{i-1})\cup B_{2}(z_{i+1})$ . Clearly, it suffices to show that $p\notin B_{2}(z_{i+1})$ .

Consider the most extremal case: $\rho_{i}$ attains the minimum allowed value $\rho_{i}=-\varepsilon_{2}\beta^{-2/3}$ and $\rho_{i+1}$ attains the maximum allowed value $\rho_{i+1}=\varepsilon_{2}\beta^{-2/3}$ . Also, take the minimal angle $\alpha$ between the half-lines $\ell_{i}$ and $\ell_{i+1}$ : $\alpha=\frac{2\pi}{n}(1-\frac{2}{3})$ . Without loss of generality, we may assume that $z_{i}=(0,R_{c}-2-\varepsilon_{2}\beta^{-2/3})$ and $z_{i+1}=(r\sin\alpha,r\cos\alpha)$ , where $r=R_{c}-2+\varepsilon_{2}\beta^{-2/3}$ . Let $s$ be the barycentre $s=\frac{1}{2}(z_{i}+z_{i+1})$ , and let $\ell$ be the half-line beginning at $s$ , orthogonal to the segment $(z_{i},z_{i+1})$ , and containing the points $(x,y)$ with $y>R_{c}$ (see Fig. 10). The point $v_{i}\in\partial B_{2}(z_{i})\cap\partial B_{2}(z_{i+1})$ belongs to $\ell$ . If the half-line $\ell$ does not intersect the positive $y$ -axis, i.e., $\ell\cap\{(0,y)\colon y\geq 0\}=\emptyset$ , then $p=(0,R_{c}-\varepsilon_{2}\beta^{-2/3})\notin B_{2}(z_{i+1})$ .

To show that the half-line $\ell$ does not intersect the positive $y$ -axis, it suffices to show that $R_{c}-2-\varepsilon_{2}\beta^{-2/3}\geq s_{2}$ , where $s_{2}=\frac{1}{2}(R_{c}-2-\varepsilon_{2}\beta^{-2/3}+(R_{c}-2+\varepsilon_{2}\beta^{-2/3})\cos\alpha)$ is the second coordinate of the barycentre $s$ . This leads to the condition $(R_{c}-2)(1-\cos\alpha)\geq\varepsilon_{2}\beta^{-2/3}(1+\cos\alpha)$ . Given that

[TABLE]

we get the sufficient condition $\varepsilon_{2}\leq\frac{R_{c}-2}{2k^{2}}$ , as claimed.
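The extremal configuration can be checked numerically. The sketch below builds $z_{i}$, $z_{i+1}$ and $p$ as described above, tests the condition $(R_{c}-2)(1-\cos\alpha)\geq\varepsilon_{2}\beta^{-2/3}(1+\cos\alpha)$, and computes the distance from $p$ to $z_{i+1}$, which has to be at least $2$ for $p\notin B_{2}(z_{i+1})$. The values $R_{c}=5$, $k=1$ and $\beta=10^{6}$ are hypothetical and serve only as an illustration.

```python
import math

def extremal_configuration(R_c, k, beta, eps2):
    """Return (condition, dist) for the extremal pair z_i, z_{i+1} in the proof of Lemma 8.3."""
    n = math.floor(k * beta ** (1.0 / 3.0))             # n = floor(k * beta^(1/3))
    alpha = (2.0 * math.pi / n) * (1.0 - 2.0 / 3.0)     # minimal angle between half-lines
    delta = eps2 * beta ** (-2.0 / 3.0)                 # maximal radial deviation
    r = R_c - 2.0 + delta
    z_next = (r * math.sin(alpha), r * math.cos(alpha)) # z_{i+1}
    p = (0.0, R_c - delta)                              # point of the circle around z_i on l_i
    condition = (R_c - 2.0) * (1.0 - math.cos(alpha)) >= delta * (1.0 + math.cos(alpha))
    dist = math.hypot(p[0] - z_next[0], p[1] - z_next[1])
    return condition, dist

R_c, k, beta = 5.0, 1.0, 1.0e6          # hypothetical parameter values
eps2 = (R_c - 2.0) / (2.0 * k * k)      # largest eps_2 allowed in Lemma 8.3
condition, dist = extremal_configuration(R_c, k, beta, eps2)
print(condition, dist)
```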

For the proof that $z\in{\mathcal{D}}^{\prime}_{\varepsilon}(0)$ , we refer to the beginning of the proof of Lemma 8.4, where we show that if $\alpha\leq\frac{2\sqrt{2}}{R_{c}(R_{c}-2)}\sqrt{\varepsilon}$ , then the intersection point $v_{i}$ on the boundary belongs to $A_{R_{c},\varepsilon}$ . The above bound on $\alpha$ is clearly satisfied once $\beta$ is sufficiently large. ∎

Returning to the proof of the lower bound in Theorem 2.6, we check that the volume constraint is satisfied as well (recall the orders of magnitude of the relevant quantities from Definition 5.3). Note that for $(t,\rho)\in M$ the angular increments are bounded as

[TABLE]

and $\frac{2\pi}{n}=[1+o(1)]\,2\pi k^{-1}\beta^{-1/3}$ , as $\beta\to\infty$ . On $M$ ,

[TABLE]

Choosing $k$ large enough and $\varepsilon_{2}$ small enough, we see that each of the $y_{i}$ ’s is of order at most $C^{\prime}\,\beta^{-2/3}$ , where $C^{\prime}$ can be made arbitrarily small compared to the given constant $C>0$ . By Proposition 5.8, the volume constraint is therefore satisfied for sufficiently large $\beta$ . Thus, we may drop the indicators from the last line of (8.60). Proposition 5.8 and the bounds in (8.63) also yield the estimate

[TABLE]

for some constant $C^{\prime}>0$ depending on $k$ and $\varepsilon_{2}$ . We deduce that

[TABLE]

and

[TABLE]

The right-hand side can be written as $2\pi G_{\kappa}(\tau_{*}-c)$ for some $c\geq 0$ , since we have already proven the upper bound in Theorem 2.6, and the upper bound must be larger than the lower bound. This completes the proof of Theorem 2.6. ∎

The proof of Proposition 8.2 comes in 4 steps. We again start from (6.41). For the lower bound, we can simply drop the non-negative term $Y_{3}^{(m)}$ and restrict the integral over $m$ to $|m|\leq\beta^{-1/6}$ .

1. Separation of terms with the reverse Hölder inequality.

We separate the exponential from the indicator $\Upsilon^{\mathrm{LB}^{\prime}}_{\beta,\varepsilon}$ (defined in (6.43)) with the help of the reverse Hölder inequality with $p\in(1,\infty)$ ,

[TABLE]

We choose

[TABLE]
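For orientation, the reverse Hölder inequality in its standard form reads $\mathbb{E}[fg]\geq\mathbb{E}[f^{1/p}]^{p}\,\mathbb{E}[g^{-1/(p-1)}]^{-(p-1)}$ for $f\geq 0$, $g>0$ and $p\in(1,\infty)$. The sketch below checks it for an empirical measure, with $f$ an indicator and $g$ an exponential, mirroring the separation in Step 1. The samples and the choice $p=1+\sqrt{\varepsilon}$ are illustrative assumptions, not the choice made in the text.

```python
import math
import random

def reverse_holder_sides(fs, gs, p):
    """Return (E[f g], E[f^(1/p)]^p * E[g^(-1/(p-1))]^(-(p-1))) for the empirical measure."""
    n = len(fs)
    lhs = sum(f * g for f, g in zip(fs, gs)) / n
    first = (sum(f ** (1.0 / p) for f in fs) / n) ** p
    second = (sum(g ** (-1.0 / (p - 1.0)) for g in gs) / n) ** (-(p - 1.0))
    return lhs, first * second

rng = random.Random(2)
eps = 0.01
p = 1.0 + math.sqrt(eps)   # illustrative choice with p - 1 of order sqrt(eps)
fs = [1.0 if rng.random() < 0.3 else 0.0 for _ in range(2000)]    # indicator of an event
gs = [math.exp(-eps * rng.gauss(0.0, 1.0)) for _ in range(2000)]  # exponential error term

lhs, rhs = reverse_holder_sides(fs, gs, p)
print(f"E[fg] = {lhs:.4f} >= {rhs:.4f}")
```

For an indicator $f$, the factor $\mathbb{E}[f^{1/p}]^{p}$ is the probability of the event raised to the power $p$, so taking $p$ close to $1$ loses little on the main term.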

2. Estimation of the second term in (8.68).

Proceeding as in the proof of (8.57), we can again use (8.54) with $s=-2(p-1)^{-1}O(\varepsilon)=-O(\sqrt{\varepsilon}\,)$ to estimate, as in (8.55),

[TABLE]

Taking expectations, we obtain

[TABLE]

Applying Lemma 7.2 with $\delta=\frac{1}{p-1}O(\varepsilon)=O(\sqrt{\varepsilon}\,)$ , we conclude that

[TABLE]

3. Estimate of the first term in (8.68).

Estimate

[TABLE]

We want to show that the last two probabilities are negligible. It suffices to show the following.

Lemma 8.4.

For some $\varepsilon_{0}>0$ small enough and uniformly on $|m|\leq\beta^{-1/6}$ :

(a)

$\lim_{\beta\to\infty}\frac{1}{\beta^{1/3}}\log\widehat{\mathbb{P}}\bigl{(}Z^{(m)}\notin{\mathcal{D}}^{\prime}_{\varepsilon}(0)\bigr{)}=-\infty$ for every $0<\varepsilon\leq\varepsilon_{0}$ .

(b)

$\lim_{C\to\infty}\sup_{0<\varepsilon\leq\varepsilon_{0}}\limsup_{\beta\to\infty}\frac{1}{\beta^{1/3}}\log\widehat{\mathbb{P}}\bigl{(}Z^{(m)}\in{\mathcal{O}}\cap{\mathcal{D}}^{\prime}_{\varepsilon}(0),\,Z^{(m)}\notin{\mathcal{V}}^{\prime}_{C\beta^{-2/3}}\bigr{)}=-\infty$ .

Proof.

(a) First, let us show that

[TABLE]

In polar coordinates, $Z_{i}^{(m)}=(r_{i}^{(m)}\cos T_{i},r_{i}^{(m)}\sin T_{i})$ with $\lvert r_{i}^{(m)}-(R_{c}-2)\rvert\leq\frac{1}{2}\varepsilon$ . To show that $Z^{(m)}\in{\mathcal{O}}\cap{\mathcal{D}}^{\prime}_{\varepsilon}(0)$ , it clearly suffices to show that, for any $i$ , the boundary point $v_{i}\in\partial B_{2}(Z_{i}^{(m)})\cap\partial B_{2}(Z_{i+1}^{(m)})$ belongs to $A_{R_{c},\varepsilon}$ . The most extremal case occurs when $\lvert r_{i}^{(m)}\rvert=\lvert r_{i+1}^{(m)}\rvert=r=R_{c}-2-\frac{1}{2}\varepsilon$ and $\Theta_{i}=\frac{2\sqrt{2}}{R_{c}(R_{c}-2)}\sqrt{\varepsilon}$ . Assuming, without loss of generality, that $Z_{i}^{(m)}=(0,r)$ and $Z_{i+1}^{(m)}=(r\sin\Theta_{i},r\cos\Theta_{i})$ , we find $v_{i}$ as the intersection of the line $\{(x,y):x=y\tan(\Theta_{i}/2)\}$ (the axis of symmetry between the points $Z_{i}^{(m)}$ and $Z_{i+1}^{(m)}$ ) and the circle $\partial B_{2}((0,r))$ . We get

[TABLE]

For the last inequality, notice that

[TABLE]

using the assumption $\Theta_{i}\leq\frac{2\sqrt{2}}{R_{c}(R_{c}-2)}\sqrt{\varepsilon}$ and the fact that $r+2<R_{c}$ .

With the help of the proven inclusion, we may estimate

[TABLE]

Abbreviate $k=2\sqrt{2}/[R_{c}(R_{c}-2)]$ . Then the first term in the right-hand side of (8.77) can be estimated as

[TABLE]

In the conditional expectation we estimate

[TABLE]

With the help of the inequality $N\leq\frac{1}{\delta\mathrm{e}}\,\mathrm{e}^{N\delta}$ we deduce that, for every $\delta>0$ ,

[TABLE]

We already know from Proposition 7.1 that the denominator equals $\exp(-[1+o(1)]\,c\beta^{1/3})$ for some constant $c>0$ as $\beta\to\infty$ . Arguments entirely analogous to those in the proof of Proposition 7.1 and Lemma 7.2 show that the same holds for the numerator. It follows that

[TABLE]

for some constant $c^{\prime}>0$ as $\beta\to\infty$ , and so we get the claim with a margin.

As to the second term in the right-hand side of (8.77), since the tilting only affects the angular process and not the radial process, we have

[TABLE]

where we use that the law of $(B_{t})_{t\in[0,2\pi]}$ is invariant under shifts. Moreover, recalling (2.17)–(2.18), we have

[TABLE]

where we use that $\frac{1}{2\pi}\int_{0}^{2\pi}\mathrm{d}t\,\widetilde{W}_{t}=\frac{1}{2\pi}\int_{0}^{2\pi}\mathrm{d}s\,W_{s}-\tfrac{1}{2}W_{2\pi}$ and

[TABLE]

But (see [11, Lemma 5.2.1])

[TABLE]

and so we get the claim with a margin.

(b) On the event $\{Z^{(m)}\in{\mathcal{O}}\cap{\mathcal{D}}^{\prime}_{\varepsilon}(0)\}$ we can use the expansion of the volume in Proposition 5.8 in the form given in (8.23), with $Y_{4}^{(m)}$ given in (8.39). We have

[TABLE]

Recall that $Y_{1},Y_{2},Y_{3}^{(m)}$ are non-negative, while $Y_{4}^{(m)}$ is not necessarily so. It follows that

[TABLE]

The four probabilities on the right-hand side of (8.87) are estimated with the help of large deviations, Markov’s inequality and the results from Section 7. The first probability is bounded by

[TABLE]

Using Lemma 7.2, we see that for $s\downarrow 0$ ,

[TABLE]

and therefore, for some $\varepsilon_{1}>0$ ,

[TABLE]

The other three probabilities in (8.87) are treated in a similar way, and so it suffices to show that, for $s\in{\mathbb{R}}$ with $|s|$ small enough,

[TABLE]

First term.

Use (8.54) to compute

[TABLE]

Taking the expectation, we have

[TABLE]

and therefore, using Lemma 7.2, we see that for $s\downarrow 0$ ,

[TABLE]

Second term.

Note that $Y_{3}^{(m)}=\sum_{i=1}^{N}(m+\overline{B_{T_{i}}})^{2}\Theta_{i}\leq\sum_{i=1}^{N}2(m^{2}+\overline{B_{T_{i}}}^{2})\Theta_{i}=4\pi m^{2}+2Y_{3}^{(0)}$ . The term $4\pi m^{2}=O(\beta^{-1/3})$ is harmless. Write

[TABLE]

Estimate

[TABLE]

By Lemma 7.8,

[TABLE]

Since $Y_{1}\geq N(2\pi/N)^{3}$ , we have $\log(1/Y_{1})=O(\log N)=O(N)$ . Since $Y_{1}=O(\varepsilon)$ , this gives

[TABLE]

where we use Lemma 7.2. By Lemma 7.4,

[TABLE]

which is finite for $s<\frac{1}{4}$ .

Third term.

Note that $Y_{4}^{(m)}=\sum_{i=1}^{N}(m+\overline{B_{T_{i}}})\Theta_{i}=2\pi m+Y_{4}^{(0)}$ . The term $2\pi m=O(\beta^{-1/6})$ is again harmless. We have

[TABLE]

Use Lemma 7.7 to bound this from above by

[TABLE]

Now use (8.94) to get

[TABLE]

This completes the proof of (8.91). ∎
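The two elementary relations used for the second and third terms, $Y_{3}^{(m)}\leq 4\pi m^{2}+2Y_{3}^{(0)}$ and $Y_{4}^{(m)}=2\pi m+Y_{4}^{(0)}$, only require that the angular increments sum to $2\pi$. The sketch below checks both on hypothetical data: random increments $\Theta_{i}$ and Gaussian values standing in for $\overline{B_{T_{i}}}$, with an arbitrary large $\beta$ and $m$ at the boundary value $\beta^{-1/6}$.

```python
import math
import random

def y3(m, b_bar, theta):
    """Y_3^{(m)} = sum_i (m + b_i)^2 * Theta_i."""
    return sum((m + b) ** 2 * t for b, t in zip(b_bar, theta))

def y4(m, b_bar, theta):
    """Y_4^{(m)} = sum_i (m + b_i) * Theta_i."""
    return sum((m + b) * t for b, t in zip(b_bar, theta))

rng = random.Random(3)
n = 40
gaps = [rng.expovariate(1.0) for _ in range(n)]
theta = [2.0 * math.pi * g / sum(gaps) for g in gaps]   # increments summing to 2*pi
b_bar = [rng.gauss(0.0, 1.0) for _ in range(n)]         # stand-in for the averaged bridge values
beta = 1.0e6                                            # hypothetical large beta
m = beta ** (-1.0 / 6.0)                                # boundary of the range |m| <= beta^(-1/6)

y3_m, y3_0 = y3(m, b_bar, theta), y3(0.0, b_bar, theta)
y4_m, y4_0 = y4(m, b_bar, theta), y4(0.0, b_bar, theta)
print(y3_m, 4.0 * math.pi * m * m + 2.0 * y3_0)
print(y4_m, 2.0 * math.pi * m + y4_0)
```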

4. Conclusion.

Combining (8.68), (8.72), (8.73) and Lemma 8.4, and choosing $C$ large enough, we get

[TABLE]

Proposition 8.2 follows with (6.41) and Proposition 7.1.

Bibliography

[1] A. Alexanderian, A brief note on the Karhunen-Loève expansion, [arXiv:1509.07526v2], unpublished.
[2] L. Ambrosio, A. Colesanti and E. Villa, Outer Minkowski content for some classes of closed sets, Math. Ann. 342 (2008) 727–748.
[3] Y.D. Burago and V.A. Zalgaller, Geometric Inequalities, Grundlehren der mathematischen Wissenschaften 285, Springer, 1988.
[4] A.J. Baddeley and M.N.M. van Lieshout, Area-interaction point processes, Ann. Inst. Statist. Math. 47 (1995) 601–619.
[5] T.S. Chiang, Y. Chow and Y.J. Lee, Exact formulas of certain functional integrals on Wiener spaces, Stoch. Rep. 59 (1994) 211–223.
[6] M. Christ, Near equality in the two-dimensional Brunn-Minkowski inequality, [arXiv.org, 2012], unpublished.
[7] P. Calka, T. Schreiber and J.E. Yukich, Brownian limits, local limits and variance asymptotics for convex hulls in the ball, Ann. Probab. 41 (2013) 50–108.
[8] J.T. Chayes, L. Chayes and R. Kotecký, The analysis of the Widom-Rowlinson model by stochastic geometric methods, Commun. Math. Phys. 172 (1995) 551–569.
