Double jump phase transition in a soliton cellular automaton

Lionel Levine; Hanbaek Lyu; John Pike

arXiv:1706.05621·math.PR·August 13, 2020

Double jump phase transition in a soliton cellular automaton

Lionel Levine, Hanbaek Lyu, John Pike

PDF

TL;DR

This paper analyzes a soliton cellular automaton with random initial states, revealing phase transitions in soliton sizes and uncovering a condensation phenomenon, with implications for permutation subsequences and connections to stochastic processes.

Contribution

It provides new constructions of Young diagrams for the automaton, establishes limit theorems for soliton sizes, and uncovers a phase transition and condensation phenomena.

Findings

01

Number of solitons scales linearly with system size n.

02

Longest soliton length scales as log n, √n, or n depending on p.

03

Condensation occurs in the supercritical regime for p > 1/2.

Abstract

In this paper, we consider the soliton cellular automaton introduced in [Takahashi 1990] with a random initial configuration. We give multiple constructions of a Young diagram describing various statistics of the system in terms of familiar objects like birth-and-death chains and Galton-Watson forests. Using these ideas, we establish limit theorems showing that if the first $n$ boxes are occupied independently with probability $p \in (0, 1)$ , then the number of solitons is of order $n$ for all $p$ , and the length of the longest soliton is of order $lo g n$ for $p < 1/2$ , order $n$ for $p = 1/2$ , and order $n$ for $p > 1/2$ . Additionally, we uncover a condensation phenomenon in the supercritical regime: For each fixed $j \geq 1$ , the top $j$ soliton lengths have the same order as the longest for $p \leq 1/2$ , whereas all but the longest have order at most $lo g n$ for $p > 1/2$ . As an…

Figures13

Click any figure to enlarge with its caption.

Equations396

\begin{array}[]{*{2}{r|lllllllllllllllllllllllll@{\ }}}s=0&&0&1&1&0&1&1&1&0&0&0&1&0&0&0&0&0&0&0&0&0&0&0&\ldots\\[3.0pt] 1&&0&0&0&1&0&0&0&1&1&1&0&1&1&0&0&0&0&0&0&0&0&0&\ldots\\[3.0pt] 2&&0&0&0&0&1&0&0&0&0&0&1&0&0&1&1&1&1&0&0&0&0&0&\ldots\\[3.0pt] 3&&0&0&0&0&0&1&0&0&0&0&0&1&0&0&0&0&0&1&1&1&1&0&\ldots\\ \end{array}

\begin{array}[]{*{2}{r|lllllllllllllllllllllllll@{\ }}}s=0&&0&1&1&0&1&1&1&0&0&0&1&0&0&0&0&0&0&0&0&0&0&0&\ldots\\[3.0pt] 1&&0&0&0&1&0&0&0&1&1&1&0&1&1&0&0&0&0&0&0&0&0&0&\ldots\\[3.0pt] 2&&0&0&0&0&1&0&0&0&0&0&1&0&0&1&1&1&1&0&0&0&0&0&\ldots\\[3.0pt] 3&&0&0&0&0&0&1&0&0&0&0&0&1&0&0&0&0&0&1&1&1&1&0&\ldots\\ \end{array}

X^{p} (k) = 1 {ξ_{k} = 1},

X^{p} (k) = 1 {ξ_{k} = 1},

\frac{\rho_{i}(n)}{n}\rightarrow\mathbb{P}\Big{\{}\max_{0\leq k\leq\varsigma}S_{k}=i\Big{\}}>0\;\;\;\text{ a.s.}\;\;\;\text{as $\;n\rightarrow\infty$}.

\frac{\rho_{i}(n)}{n}\rightarrow\mathbb{P}\Big{\{}\max_{0\leq k\leq\varsigma}S_{k}=i\Big{\}}>0\;\;\;\text{ a.s.}\;\;\;\text{as $\;n\rightarrow\infty$}.

\frac{ρ _{1} ( n ) - n p ( 1 - p )}{n p ( 1 - p ) [ 1 - 3 p ( 1 - p )]} \Rightarrow Z

\frac{ρ _{1} ( n ) - n p ( 1 - p )}{n p ( 1 - p ) [ 1 - 3 p ( 1 - p )]} \Rightarrow Z

E_{b} (f) (t) = f (t) - b \land t \leq s \leq b \lor t min f (s),

E_{b} (f) (t) = f (t) - b \land t \leq s \leq b \lor t min f (s),

exp (- θ^{- x}) k = 0 \sum j - 1 \frac{θ ^{- k (x + 1)}}{k !}

exp (- θ^{- x}) k = 0 \sum j - 1 \frac{θ ^{- k (x + 1)}}{k !}

\leq n \to \infty lim sup P {λ_{j} (n) \leq x + μ_{n}} \leq exp (- θ^{- (x + 1)}) k = 0 \sum j - 1 \frac{θ ^{- k x}}{k !} .

n^{- 1/2} [λ_{1} (n), λ_{2} (n), \dots, λ_{j} (n)] \Rightarrow [max ∣ B ∣, max E (∣ B ∣), \dots, max E^{j - 1} (∣ B ∣)],

n^{- 1/2} [λ_{1} (n), λ_{2} (n), \dots, λ_{j} (n)] \Rightarrow [max ∣ B ∣, max E (∣ B ∣), \dots, max E^{j - 1} (∣ B ∣)],

n \to \infty lim n^{- k /2} E [(λ_{j} (n))^{k}] = E [(max E^{j - 1} (∣ B ∣))^{k}] .

n \to \infty lim n^{- k /2} E [(λ_{j} (n))^{k}] = E [(max E^{j - 1} (∣ B ∣))^{k}] .

\frac{λ _{1} ( n ) - ( 2 p - 1 ) n}{2 p ( 1 - p ) n} \Rightarrow Z \sim N (0, 1) .

\frac{λ _{1} ( n ) - ( 2 p - 1 ) n}{2 p ( 1 - p ) n} \Rightarrow Z \sim N (0, 1) .

P {∣ λ_{1} (n) - (2 p - 1) n ∣ \geq x} \leq c exp (- x^{2} / (8 n)),

P {∣ λ_{1} (n) - (2 p - 1) n ∣ \geq x} \leq c exp (- x^{2} / (8 n)),

(exp (- θ^{- \frac{x}{2}}) k = 0 \sum j - 1 \frac{θ ^{- k (\frac{x}{2} + 1)}}{k !}) - c θ^{\frac{x}{8}}

(exp (- θ^{- \frac{x}{2}}) k = 0 \sum j - 1 \frac{θ ^{- k (\frac{x}{2} + 1)}}{k !}) - c θ^{\frac{x}{8}}

\leq n \to \infty lim sup P {λ_{j} (n) \leq x + \overset{μ}{^}_{n}} \leq (exp (- θ^{- (\frac{3 x}{2} + 1)}) k = 0 \sum j - 1 \frac{θ ^{- \frac{3 k x}{2}}}{k !}) + c θ^{\frac{x}{8}} .

E [ρ_{1} (Σ^{n})] = (n + 1) /2, E [λ_{1} (Σ^{n})] = π n + O (1) .

E [ρ_{1} (Σ^{n})] = (n + 1) /2, E [λ_{1} (Σ^{n})] = π n + O (1) .

[ρ_{1} (Σ^{n}), ρ_{2} (Σ^{n}), \dots, ρ_{i} (Σ^{n})] =_{d} [# of leaves in T_{1}^{n}, # of leaves in T_{2}^{n}, \dots, # of leaves in T_{i}^{n}] .

[ρ_{1} (Σ^{n}), ρ_{2} (Σ^{n}), \dots, ρ_{i} (Σ^{n})] =_{d} [# of leaves in T_{1}^{n}, # of leaves in T_{2}^{n}, \dots, # of leaves in T_{i}^{n}] .

\frac{\rho_{i}(\Sigma^{n})}{2n}\rightarrow\mathbb{P}\Big{\{}\max_{0\leq k\leq\varsigma}S_{k}=i\Big{\}}>0\;\;\;\text{ a.s.}\;\;\;\text{as $\;n\rightarrow\infty$}.

\frac{\rho_{i}(\Sigma^{n})}{2n}\rightarrow\mathbb{P}\Big{\{}\max_{0\leq k\leq\varsigma}S_{k}=i\Big{\}}>0\;\;\;\text{ a.s.}\;\;\;\text{as $\;n\rightarrow\infty$}.

n^{- 1/2} [λ_{1} (Σ^{n}), λ_{2} (Σ^{n}), \dots, λ_{j} (Σ^{n})] \Rightarrow 2 [max B^{ex}, max E (B^{ex}), \dots, max E^{j - 1} (B^{ex})] .

n^{- 1/2} [λ_{1} (Σ^{n}), λ_{2} (Σ^{n}), \dots, λ_{j} (Σ^{n})] \Rightarrow 2 [max B^{ex}, max E (B^{ex}), \dots, max E^{j - 1} (B^{ex})] .

n \to \infty lim n^{- k /2} E [(λ_{j} (Σ^{n}))^{k}] = 2^{k /2} E [(max E^{j - 1} (B^{e x}))^{k}] .

n \to \infty lim n^{- k /2} E [(λ_{j} (Σ^{n}))^{k}] = 2^{k /2} E [(max E^{j - 1} (B^{e x}))^{k}] .

Γ (X)_{k + 1} - Γ (X)_{k} = ⎩ ⎨ ⎧ + 1 - 1 0 if X (k + 1) = 1 if X (k + 1) = 0 and Γ (X)_{k} \geq 1 if X (k + 1) = 0 and Γ (X)_{k} = 0

Γ (X)_{k + 1} - Γ (X)_{k} = ⎩ ⎨ ⎧ + 1 - 1 0 if X (k + 1) = 1 if X (k + 1) = 0 and Γ (X)_{k} \geq 1 if X (k + 1) = 0 and Γ (X)_{k} = 0

X_{s+1}(k+1)=\mathbf{1}\big{\{}\Gamma(X_{s})_{k+1}-\Gamma(X_{s})_{k}=-1\big{\}}

X_{s+1}(k+1)=\mathbf{1}\big{\{}\Gamma(X_{s})_{k+1}-\Gamma(X_{s})_{k}=-1\big{\}}

H (Γ)_{k} = {Γ_{k} - 1 Γ_{k} if k is contained a hill interval of Γ otherwise

H (Γ)_{k} = {Γ_{k} - 1 Γ_{k} if k is contained a hill interval of Γ otherwise

ρ (Γ) \geq ρ (H (Γ)) \geq ρ (H^{2} (Γ)) \geq \dots \geq ρ (H^{m a x Γ} (Γ)) = 0.

ρ (Γ) \geq ρ (H (Γ)) \geq ρ (H^{2} (Γ)) \geq \dots \geq ρ (H^{m a x Γ} (Γ)) = 0.

λ_{j} (Γ) = max E^{j - 1} (Γ), 1 \leq j \leq ρ (Γ) .

λ_{j} (Γ) = max E^{j - 1} (Γ), 1 \leq j \leq ρ (Γ) .

λ_{j} (X_{0}) = max E^{j - 1} (Γ (X_{0})) .

λ_{j} (X_{0}) = max E^{j - 1} (Γ (X_{0})) .

\Big{|}\max\mathcal{E}^{j-1}(f)-\max\mathcal{E}^{j-1}(g)\Big{|}\leq 2\lVert f-g\rVert_{\infty}.

\Big{|}\max\mathcal{E}^{j-1}(f)-\max\mathcal{E}^{j-1}(g)\Big{|}\leq 2\lVert f-g\rVert_{\infty}.

(a, Γ_{a}) \sim adj (b, Γ_{b}) ⟺ ∣ a - b ∣ = 1 and Γ_{a}, Γ_{b} not both 0.

(a, Γ_{a}) \sim adj (b, Γ_{b}) ⟺ ∣ a - b ∣ = 1 and Γ_{a}, Γ_{b} not both 0.

(a, Γ_{a}) \sim (b, Γ_{b}) ⟺ 0 < Γ_{a} = Γ_{b} \leq Γ_{x} for all x \in [a, b],

(a, Γ_{a}) \sim (b, Γ_{b}) ⟺ 0 < Γ_{a} = Γ_{b} \leq Γ_{x} for all x \in [a, b],

\Big{\lVert}\mathcal{L}^{i-1}\big{(}\mathfrak{F}(\Gamma)\big{)}\Big{\rVert}=\Big{\lVert}\mathfrak{F}\big{(}\mathcal{E}^{i-1}(\Gamma)\big{)}\Big{\rVert}=\max\mathcal{E}^{i-1}(\Gamma)=\lambda_{i}(\Gamma).\qed

\Big{\lVert}\mathcal{L}^{i-1}\big{(}\mathfrak{F}(\Gamma)\big{)}\Big{\rVert}=\Big{\lVert}\mathfrak{F}\big{(}\mathcal{E}^{i-1}(\Gamma)\big{)}\Big{\rVert}=\max\mathcal{E}^{i-1}(\Gamma)=\lambda_{i}(\Gamma).\qed

\Gamma(X^{n,p})(x)=H(x)\mathbf{1}_{[0,n]}(x)+\max\big{(}0,H(n)-x+n\big{)}\mathbf{1}_{[n,\infty)}(x).

\Gamma(X^{n,p})(x)=H(x)\mathbf{1}_{[0,n]}(x)+\max\big{(}0,H(n)-x+n\big{)}\mathbf{1}_{[n,\infty)}(x).

H_{k} = S_{k} - 0 \leq r \leq k min S_{r} .

H_{k} = S_{k} - 0 \leq r \leq k min S_{r} .

Z_{k + 1} = {ζ_{1}^{k + 1} + \dots + ζ_{Z_{k}}^{k + 1} 0 if Z_{k} > 0 if Z_{k} = 0 .

Z_{k + 1} = {ζ_{1}^{k + 1} + \dots + ζ_{Z_{k}}^{k + 1} 0 if Z_{k} > 0 if Z_{k} = 0 .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Double jump phase transition in a soliton cellular automaton

Lionel Levine

Lionel Levine, Department of Mathematics, Cornell University, Ithaca, NY 14853.

[email protected]

,

Hanbaek Lyu

Hanbaek Lyu, Department of Mathematics, University of California, Los Angeles, CA 90095.

[email protected]

and

John Pike

John Pike, Department of Mathematics, Bridgewater State University, Bridgewater, MA 02324.

[email protected]

(Date: March 13, 2024)

Abstract.

In this paper, we consider the soliton cellular automaton introduced in [25] with a random initial configuration. We give multiple constructions of a Young diagram describing various statistics of the system in terms of familiar objects like birth-and-death chains and Galton-Watson forests. Using these ideas, we establish limit theorems showing that if the first $n$ boxes are occupied independently with probability $p\in(0,1)$ , then the number of solitons is of order $n$ for all $p$ , and the length of the longest soliton is of order $\log n$ for $p<1/2$ , order $\sqrt{n}$ for $p=1/2$ , and order $n$ for $p>1/2$ . Additionally, we uncover a condensation phenomenon in the supercritical regime: For each fixed $j\geq 1$ , the top $j$ soliton lengths have the same order as the longest for $p\leq 1/2$ , whereas all but the longest have order $\log n$ for $p>1/2$ . As an application, we obtain scaling limits for the lengths of the $k^{\text{th}}$ longest increasing and decreasing subsequences in a random stack-sortable permutation of length $n$ in terms of random walks and Brownian excursions.

Key words and phrases:

box-ball system, phase transition, condensation, excursion operator, birth-death chain, Motzkin path, Galton-Watson forest, Brownian motion, random stack-sortable permutation

2010 Mathematics Subject Classification:

37K40, 60F05

1. Introduction

In 1990, Takahashi and Satsuma proposed a $1+1$ dimensional cellular automaton of filter type called the soliton cellular automaton, also known as the box-ball system [17, 25]. It is defined as a discrete-time dynamical system $\left(X_{s}\right)_{s\geq 0}$ whose states are binary sequences $X_{s}:\mathbb{N}\rightarrow\{0,1\}$ with finitely many $1$ ’s. We may think of the states as configurations of balls in boxes where box $k$ contains a ball at stage $s$ if $X_{s}(k)=1$ and is empty if $X_{s}(k)=0$ . The update rule $X_{s}\mapsto X_{s+1}$ is defined as follows: At the beginning of stage $s$ , each ball has been moved a total of $s$ times. To reach stage $s+1$ , successively move the leftmost ball which has been moved a total of $s$ times to the first empty box on its right, continuing until all balls have been moved. Alternatively, at each stage $s\geq 0$ a ‘carrier’ starts at the origin and sweeps rightward to infinity. Each time she encounters an occupied box, she pushes the ball to the top of her stack. Each time she encounters an empty box and her stack is nonempty, she pops any ball from her stack into the box. In keeping with this picture, we will refer to the stages of the box-ball system as sweeps henceforth.

As a concrete example, the system initially having balls in boxes $2,3,5,6,7,11$ evolves through sweep $s=3$ as

[TABLE]

In this model, a (non-interacting) soliton of length $k$ is defined to be a string of $k$ consecutive $1$ ’s followed by $k$ consecutive [math]’s. During one sweep, such a soliton travels to the right at speed $k$ . The physical interpretation is that of a traveling wave with velocity equal to its wavelength. If a $k$ -soliton precedes a $j$ -soliton with $j<k$ , then the two will eventually collide, resulting in interference. The subsequent states of the system depend on the congruence class of their initial distance modulo their relative speed, $k-j$ , but solitons are never created or destroyed in the course of these interactions. The case of three or more interacting solitons can be described similarly [25]. It is easy to see that since we have finitely many balls initially, after some finite time the system consists of non-interacting solitons whose lengths are nondecreasing from left to right. We will call such a configuration stable. This final macrostate of the system can be encoded in the Young diagram $\Lambda(X_{0})$ having $j^{\text{th}}$ column equal in length to the $j^{\text{th}}$ longest soliton.

In this paper, we start the soliton cellular automaton from a random initial configuration and study the limiting shape of the resulting Young diagram. We have two parameters, $n\in\mathbb{N}$ and $p\in(0,1)$ . Let $X^{n,p}$ be a random coloring of $\mathbb{N}$ so that each site in $[1,n]$ is $1$ with probability $p$ and [math] with probability $1-p$ , independently of all others, and all sites in $(n,\infty)$ are [math]. Let $\Lambda^{n,p}=\Lambda(X^{n,p})$ be the corresponding random Young diagram and denote the lengths of its $i^{\text{th}}$ row and $j^{\text{th}}$ column by $\rho_{i}(n)$ and $\lambda_{j}(n)$ , respectively. (Thus $\lambda_{j}(n)$ gives the length of the $j^{\text{th}}$ longest soliton and $\rho_{i}(n)$ the number of solitons of length at least $i$ .) We are going to show that each fixed row has order $n$ for all values of $p$ , but the column lengths vary drastically according to whether $p$ is less than, equal to, or greater than $1/2$ . The asymptotics of the rows and columns of $\Lambda^{n,p}$ are summarized in the following table, for which Theorem 1 proves the $\rho$ entries, and Theorem 2 proves the $\lambda$ entries. For the precise meaning of the Landau notation employed, see Subsection 1.2.

Erdős and Rényi coined the term double jump to describe the emergence of a giant component in the sparse random graph with $n$ vertices, each pair independently joined by an edge with probability $c/n$ , where $c>0$ is a parameter. The analogy between random graph components and box-ball solitons becomes apparent if we take $c=p/(1-p)$ . Then with high probability, all connected components of the Erdős-Rényi graph are of size $O(\log n)$ for $p<1/2$ ; components of size $\Theta(n^{2/3})$ emerge at $p=1/2$ ; and for $p>1/2$ , the largest component is of size $\Theta(n)$ while all the rest have size $O(\log{n})$ [8]. Except for the exponent $2/3$ (which becomes $1/2$ ) this is exactly the behavior of the soliton lengths in the box-ball system as summarized in the last two columns of Table 1.

1.1. Related work

There have been some exciting recent developments involving the box-ball system with a bi-infinite random initial configuration. A central question is to understand the invariant measures on $\{0,1\}^{\mathbb{Z}}$ under the box-ball dynamics. Ferrari, Nguyen, Rolla, and Wang [9] showed that the Bernoulli product measure with density $p<1/2$ is invariant and provided a recipe for constructing additional invariant measures based on a soliton decomposition of box-ball configurations. Croydon, Kato, Sasada, and Tsujimoto [4] found sufficient conditions for invariance using Pitman’s $2M-X$ transformation and considered extending the box-ball system from $\mathbb{Z}$ to $\mathbb{R}$ . See the references for more details.

1.2. Notation

We adopt the notation $\mathbb{R}^{+}=[0,\infty)$ , $\mathbb{N}=\{1,2,3,\ldots\}$ , and $\mathbb{N}_{0}=\mathbb{N}\cup\{0\}$ throughout. We employ the Landau notation $O(\cdot),\,\Omega(\cdot),\,\Theta(\cdot)$ in the sense of stochastic boundedness. That is, given $\{a_{n}\}_{n=1}^{\infty}\subseteq\mathbb{R}^{+}$ and a sequence $\{W_{n}\}_{n=1}^{\infty}$ of nonnegative random variables, we say that $W_{n}=O(a_{n})$ if for every $\varepsilon>0$ , there is a $C\in(0,\infty)$ such that $\mathbb{P}\{W_{n}>Ca_{n}\}<\varepsilon$ for all $n$ . We say that $W_{n}=\Omega(a_{n})$ if for every $\varepsilon>0$ , there is a $c\in(0,\infty)$ such that $\mathbb{P}\{W_{n}<ca_{n}\}<\varepsilon$ for all $n$ , and we say $W_{n}=\Theta(a_{n})$ if $W_{n}=O(a_{n})$ and $W_{n}=\Omega(a_{n})$ . The constants $c,C$ may depend on $p$ and $\varepsilon$ but not $n$ .

1.3. Main results

Fix $p\in(0,1)$ , and let $\xi_{1},\xi_{2},\ldots$ be a sequence of i.i.d. random variables with law $\mathbb{P}\{\xi_{1}=1\}=p$ and $\mathbb{P}\{\xi_{1}=-1\}=1-p$ . Define $X^{p}\in\{0,1\}^{\mathbb{N}}$ by

[TABLE]

and for each $n\in\mathbb{N}$ , set $X^{n,p}=X^{p}\mathbf{1}_{[1,n]}$ . The interpretation is that $X^{n,p}$ corresponds to an arrangement of balls in boxes where boxes $1,\ldots,n$ are each occupied independently with probability $p$ , and boxes $n+1,n+2,\ldots$ are empty.

For each fixed $n\geq 1$ and $p\in(0,1)$ , we consider the box-ball system $(X_{s})_{s\geq 0}$ with the random initial configuration $X_{0}=X^{n,p}$ . Recall that the soliton lengths are denoted by $\lambda_{1}(n)\geq\lambda_{2}(n)\geq\cdots$ . This information can be summarized by the Young diagram $\Lambda^{n,p}$ whose $j^{\text{th}}$ column has length $\lambda_{j}(n)$ . The length of its $i^{\text{th}}$ row, $\rho_{i}(n)$ , equals the number of solitons in the system having length at least $i$ . In particular, $\rho_{1}(n)$ gives the total number of solitons.

Many properties of this Young diagram can be described in terms of the simple random walk $\left\{S_{k}\right\}_{k=0}^{\infty}$ defined by $S_{0}=0$ and $S_{k}=\xi_{1}+\cdots+\xi_{k}$ . Our first result shows that the $i$ longest rows are of order $n$ for any $p\in(0,1)$ .

Theorem 1.

Let $X^{n,p}$ and $S_{k}$ be as above. Then the following statements hold.

(i)

(SLLN for rows) Let $\varsigma=\inf\{k>0\hskip 0.50003pt:\hskip 0.50003ptS_{k}=0\}\in\mathbb{N}\cup\{\infty\}$ be the first return time of $S_{k}$ to [math]. Then for any fixed $i\geq 1$ ,

[TABLE]

(ii)

(CLT for the first row)

[TABLE]

where $Z\sim\mathcal{N}(0,1)$ , the standard normal distribution.

Denote by $C(\mathbb{R})$ the space of continuous functions $f:\mathbb{R}\rightarrow\mathbb{R}$ endowed with the topology of uniform convergence on compact sets, and let $C_{0}^{+}(\mathbb{R})$ be the subspace of $C(\mathbb{R})$ consisting of nonnegative compactly supported functions $f$ such that $f\equiv 0$ on $(-\infty,0]$ . For any closed interval $I\subseteq\mathbb{R}$ containing [math], denote by $C(I)$ and $C_{0}^{+}(I)$ the space of restrictions $f|_{I}$ where $f\in C(\mathbb{R})$ and $f\in C_{0}^{+}(\mathbb{R})$ , respectively. For $b\in I$ , define the operator $\mathcal{E}_{b}:C(I)\rightarrow C(I)$ by

[TABLE]

where $y\wedge z=\min(y,z)$ and $y\vee z=\max(y,z)$ . We call $b$ the pivot of $\mathcal{E}_{b}$ . Define $\mathtt{m}:C_{0}^{+}(I)\rightarrow\mathbb{R}^{+}$ by $\mathtt{m}(g)=\sup\{x\in I\hskip 0.50003pt:\hskip 0.50003ptg(x)=\max(g)\}$ , the location of the rightmost global maximum of $g$ . Finally, define the excursion operator $\mathcal{E}$ on $C_{0}^{+}(I)$ by $\mathcal{E}(g)=\mathcal{E}_{\mathtt{m}(g)}(g)$ . See Figure 4 for an illustration.

We now state the main result of the paper.

Theorem 2.

Let $X^{n,p}$ be as above and set $\theta=(1-p)/p$ . Let $\lambda_{j}(n)$ denote the $j^{\text{th}}$ longest soliton length.

(i)

(Subcritical phase) For $p<1/2$ , $\lambda_{j}(n)$ is concentrated around $\mu_{n}:=\log_{\theta}\left(\frac{(1-2p)^{2}}{1-p}n\right)$ for each fixed $j\geq 1$ in the sense that for all $x\in\mathbb{R}$ ,

[TABLE]

In particular, $\lambda_{j}(n)=\Theta(\log n)$ .

(ii)

(Critical phase) For $p=1/2$ , let $B=\{B_{t}\}_{0\leq t\leq 1}$ be a standard Brownian motion on $[0,1]$ . Then for each fixed $j\geq 1$ ,

[TABLE]

In particular, $\lambda_{j}(n)=\Theta(\sqrt{n})$ .

Furthermore, for any integers $j,k\geq 1$ ,

[TABLE]

(iii)

(Supercritical phase) For $p>1/2$ ,

[TABLE]

Furthermore, there exists a constant $c=c(p)>0$ such that

[TABLE]

and for all $j\geq 2$ , $\lambda_{j}(n)$ is concentrated around $\hat{\mu}_{n}:=\log_{\theta^{-1}}\left(\frac{(1-2p)^{2}}{p}n\right)$ in the sense that for all $x\in\mathbb{R}$ ,

[TABLE]

In particular, $\lambda_{1}(n)=\Theta(n)$ and $\lambda_{j}(n)=\Theta(\log n)$ if $j\geq 2$ .

We call the statement in Theorem 2 (iii) a condensation phenomenon because in the supercritical regime, a linear number of balls condense into the longest soliton while the next $j$ longest solitons each have $\Theta(\log n)$ balls.

The methods that we develop in this paper to study the box-ball system yield several interesting results on lengths of monotone subsequences in random pattern avoiding permutations. The study of statistics involving longest increasing or decreasing subsequences in different types of random permutations has a long history and rich connections to many other fields [22]. In the context of the box-ball system, the class of $312$ -avoiding permutations arises naturally, and we are able to generalize some classical results on such permutations in multiple directions.

For each $n\in\mathbb{N}$ , let $\mathfrak{S}_{n}$ be the set of all permutations on $\{1,2,\ldots,n\}$ . Given two permutations $\sigma\in\mathfrak{S}_{n}$ and $\tau\in\mathfrak{S}_{k}$ with $1<k\leq n$ , we say that $\sigma$ is $\tau$ -avoiding if no subsequence of $\sigma$ has the same relative order as $\tau$ . (For example, a permutation is $312$ -avoiding if there is no subsequence of the form $z,x,y$ with $x<y<z$ .) Denote by $\mathfrak{S}^{\tau}_{n}$ the set of all $\tau$ -avoiding permutations in $\mathfrak{S}_{n}$ . Note that $\sigma$ is $\tau$ -avoiding if and only if $\sigma^{-1}$ is $\tau^{-1}$ -avoiding. (In particular, $\sigma$ is 231-avoiding if and only if $\sigma^{-1}$ is 312-avoiding.) Given a permutation $\sigma\in\mathfrak{S}_{n}$ , define integers $\lambda_{1},\ldots,\lambda_{k}$ (resp. $\rho_{1},\ldots,\rho_{k}$ ) recursively so that $\lambda_{1}(\sigma)+\cdots+\lambda_{k}(\sigma)$ (resp. $\rho_{1}(\sigma)+\cdots+\rho_{k}(\sigma)$ ) equals the length of the longest subsequence in $\sigma$ obtained by taking a disjoint union of $k$ decreasing (resp. increasing) subsequences.

In a classic work [23], Rotem studied properties of 231-avoiding permutations chosen uniformly at random among all such permutations of a given length. He showed that if $\Sigma^{n}$ is a permutation in $\mathfrak{S}^{231}_{n}$ chosen uniformly at random, then

[TABLE]

Our next theorem is an extension of the above result both to the higher moments and to ‘subsequent’ longest increasing and decreasing subsequences of $\Sigma^{n}$ .

Theorem 3.

Let $\Sigma^{n}$ be a uniformly chosen random $312$ - (or $231$ -) avoiding permutation of length $n$ .

(i)

Suppose that $T^{n}_{1},T^{n}_{2},\ldots,T^{n}_{i}$ is a sequence of rooted trees where $T^{n}_{1}$ is chosen uniformly at random among all rooted plane trees on $n+1$ nodes, and for $r\geq 1$ , $T^{n}_{r+1}$ is obtained from $T^{n}_{r}$ by deleting all leaves. Then

[TABLE]

(ii)

Let $\{S_{k}\}_{k=0}^{\infty}$ be a simple symmetric random walk with $S_{0}=0$ and let $\varsigma=\inf\{k>0\hskip 0.50003pt:\hskip 0.50003ptS_{k}=0\}$ be the time of its first return to [math]. Then for any fixed $i\geq 1$ ,

[TABLE]

(iii)

Let $B^{\text{ex}}=(B_{t}^{\text{ex}})_{0\leq t\leq 1}$ be a standard Brownian excursion on $[0,1]$ . Then for each fixed $j\geq 1$ ,

[TABLE]

Furthermore, for any integers $j,k\geq 1$ ,

[TABLE]

We remark that given a $312$ -avoiding permuatation $\sigma$ , we can actually interpret $\lambda_{k}(\sigma)$ as the length of the longest decreasing subsequence after successively deleting an arbitrary longest decreasing subsequence $k-1$ times. For the rows, we can interpret $\rho_{k}$ similarly but the longest increasing subsequence we delete at each step must be a special one; see Proposition 8.1. Note that such an interpretation is not valid for general permutations.

1.4. Outline and organization

Broadly speaking, we proceed by observing correspondences between various combinatorial objects related to box-ball configurations, such as Motzkin paths, rooted forests, and $312$ -avoiding permutations; see Figure 1. We can then interpret the rows and columns of the Young diagram associated with a box-ball configuration in terms of these objects (Table 2). This allows us to reformulate the original soliton problem in other languages and vice versa.

For us, Motzkin paths provide the most useful framework, especially in the random setting. This is because the random box-ball configuration $X^{n,p}$ can be viewed as the increment sequence of the first $n$ steps of a simple random walk driven by the $\text{Bernoulli}(p)$ measure. The corresponding random ( $h$ -restricted) Motzkin path is the same simple random walk except that downstrokes at height [math] are censored. The problem then essentially boils down to studying properties of the excursions of such censored random walks. The results for random Motzkin paths can then be translated back to solitons or permutations.

This paper is organized as follows: In Section 2, we describe relations between box-ball configurations, Motzkin paths, and rooted forests, and show how to construct the Young diagram from these objects. In Section 3, we discuss a correspondence between random box-ball configurations, a birth-and-death chain, and a Galton-Watson forest. We prove Theorem 1 in Section 4, and the proof of Theorem 2 is given in Sections 5, 6, and 7. In Section 8, we discuss a connection between box-ball configurations and pattern-avoiding permutations and prove Theorem 3. Finally, in Appendix A, we prove the three lemmas stated in Subsection 2.2 along with some results concerning 312-avoiding permutations.

2. Constructing the time-invariant Young Diagram

In this section, we establish some important statements about the Young diagram which will be used crucially in later sections.

2.1. Motzkin paths

We begin with a bijection between box-ball states and a class of lattice paths we call $h$ -restricted Motzkin, a minor variant of the bijection with Dyck paths in [26]. A function $f:\mathbb{R}^{+}\rightarrow\mathbb{R}$ is a lattice path if $f$ is the linear interpolation of some function $\gamma:\mathbb{N}_{0}\rightarrow\mathbb{Z}$ . A lattice path $f$ is called Motzkin if it is nonnegative, compactly supported, and consists only of $(1,1)$ , $(1,-1)$ , and $(1,0)$ steps (which we refer to as ‘upstrokes,’ ‘downstrokes,’ and ‘ $h$ -strokes,’ respectively). We say that a Motzkin path is $h$ -restricted if its $h$ -strokes occur only on the $x$ -axis. Finally, if $\Gamma$ is a Motzkin path, we write $\Gamma_{k}$ for $\Gamma(k)$ , $k\geq 0$ .

The aforementioned bijection maps a (compactly supported) configuration $X:\mathbb{N}\rightarrow\{0,1\}$ to the $h$ -restricted Motzkin path $\Gamma(X)$ defined by linear interpolation of its values on $\mathbb{N}_{0}^{2}$ , which are given recursively by $\Gamma(X)_{0}=0$ and

[TABLE]

for all $k\geq 0$ . The inverse map from paths to configurations proceeds by writing a [math] for each downstroke or $h$ -stroke and a $1$ for each upstroke. See Figure 2 for an illustration.

The shape of this path tells us how to evolve the system by a single sweep: A ball is picked up at each upstroke and deposited at each downstroke. Specifically, label the balls $1,\ldots,m$ from left to right. (This labeling applies only to states, not the system as a whole. In subsequent sweeps, the label of a particular ball may change.) Then the $j^{\text{th}}$ upstroke occurs at the site where the carrier picks up the ball labeled $j$ . The site at which she deposits ball $j$ is determined by drawing a horizontal line from the center of the $j^{\text{th}}$ upstroke to the first downstroke on its right. From this description, we see that the height of the path at any site equals the number of balls in the carrier’s stack after she visits that site. When the sweep is completed, the new state of the system corresponds to the unique path formed by converting each downstroke to an upstroke and then adding $h$ -strokes and downstrokes so that it is $h$ -restricted Motzkin.

Formally, the box-ball state $X_{s+1}$ is given in terms of the Motzkin path $\Gamma(X_{s})$ by

[TABLE]

where $\mathbf{1}$ is the indicator function.

2.2. Hill-flattening and excursion operators

We now describe two methods of constructing a Young diagram $\Lambda(\Gamma)$ associated with a (not necessarily $h$ -restricted) Motzkin path $\Gamma$ . As usual, we denote the $i^{\text{th}}$ row and $j^{\text{th}}$ column by $\rho_{i}(\Gamma)$ and $\lambda_{j}(\Gamma)$ .

First we give the row-wise construction using the hill-flattening operator $\mathcal{H}$ defined on the set of all Motzkin paths. To begin, we say that an interval $[a,b]$ with $a,b\in\mathbb{N}_{0}$ and $a\leq b$ is a hill interval of the Motzkin path $\Gamma$ if for every $c\in[a,b]$ , $\Gamma_{a-1}=\Gamma_{c}-1=\Gamma_{b+1}$ . We write $\mathcal{I}(\Gamma)$ for the collection of all hill intervals of $\Gamma$ , and denote the number of hill intervals by $\rho(\Gamma)=|\mathcal{I}(\Gamma)|$ . The hill-flattening operator $\mathcal{H}$ is then defined by

[TABLE]

for $k\in\mathbb{N}_{0}$ .

A hill of $\Gamma$ is the graph of $\Gamma$ over $[a-1,b+1]$ with $[a,b]$ a hill interval. Thus hills consist of a single upstroke, followed by zero or more $h$ -strokes, followed by a single downstroke. Call a hill with no $h$ -strokes a peak. Then the hill-flattening operator $\mathcal{H}$ , when applied to $\Gamma$ , flattens each hill of $\Gamma$ by replacing the upstroke and downstroke with $h$ -strokes and then lowering any intermediate $h$ -strokes so that the path remains connected.

Note that each application of the hill-flattening operator decreases the maximum height of the Motzkin path by $1$ and never increases the number of hills, so

[TABLE]

We define the Young diagram $\Lambda(\Gamma)$ associated to the Motzkin path $\Gamma$ as having $i^{\text{th}}$ row of length $\rho_{i}(\Gamma)=\rho(\mathcal{H}^{i-1}(\Gamma))$ for $1\leq i\leq\max\Gamma$ . Here repeated applications of $\mathcal{H}$ are denoted by $\mathcal{H}^{j+1}(f)=\mathcal{H}\big{(}\mathcal{H}^{j}(f)\big{)}$ with $\mathcal{H}^{0}$ the identity operator. In particular, given a box-ball configuration $X:\mathbb{N}_{0}\rightarrow\{0,1\}$ of finite support, we can construct the Young diagram $\Lambda(\Gamma(X))$ . See Figure 3 for an illustration.

Now consider a box-ball system $(X_{s})_{s\geq 0}$ started from a configuration $X_{0}:\mathbb{N}_{0}\rightarrow\{0,1\}$ . The following lemma says that for each $s\geq 0$ , the corresponding Young diagram $\Lambda(\Gamma(X_{s}))$ is independent of $s$ and its column lengths correspond to the lengths of the solitons.

Lemma 2.1.

$\Lambda(\Gamma(X_{s}))=\Lambda(\Gamma(X_{s+1}))$ * for all $s\geq 0$ . Moreover, $\Lambda(\Gamma(X_{0}))=\Lambda(X_{0})$ .*

Next, we give the column-wise construction of $\Lambda(\Gamma)$ . The key observation is that the $j^{\text{th}}$ longest column length, which we denote by $\lambda_{j}$ , is obtained by successively applying the excursion operator to $\Gamma$ $j-1$ times and then taking a maximum.

Lemma 2.2.

Let $\Gamma$ be a Motzkin path and let $\lambda_{j}(\Gamma)$ denote the length of the $j^{\text{th}}$ column of $\Lambda(\Gamma)$ . Then

[TABLE]

In particular, if $(X_{s})_{s\in\mathbb{N}_{0}}$ is a finitely supported box-ball system with initial configuration $X_{0}:\mathbb{N}\rightarrow\{0,1\}$ , then

[TABLE]

We relegate the proofs of these lemmas, along with that of Lemma 2.3 below, to Appendix A in order to maintain the flow of the paper.

Lemma 2.2 gives the following column-wise construction of $\Lambda(\Gamma)$ . Let $\mathtt{m}=\mathtt{m}(\Gamma)$ be the location of the rightmost global maximum of $\Gamma$ , and set $\lambda_{1}(\Gamma)=\Gamma_{\mathtt{m}}$ , the maximum height of $\Gamma$ . To find $\lambda_{2}(\Gamma)$ , one first computes $\mathcal{E}(\Gamma)$ by traversing $\Gamma$ to the left and right of $\mathtt{m}$ as follows: Starting with height [math] at $\mathtt{m}$ , move to the left, remaining at height [math] until the first local minimum, and then record the sequence of strokes until the original lattice path returns to the height of this minimum. Then repeat the process, staying at height [math] until encountering a local minimum and then recording the path of the second such excursion. Continue to the beginning of the path and then repeat the procedure moving to the right from $\mathtt{m}$ . The resulting path precisely records all ‘subexcursions’ which are not subsumed by the maximum $(\mathtt{m},\Gamma_{\mathtt{m}})$ . $\lambda_{2}(\Gamma)$ , the length of the second column of $\Lambda(\Gamma)$ , is equal to the maximum of $\mathcal{E}(\Gamma)$ . Continuing in this fashion gives $\lambda_{j}(\Gamma)=\max\mathcal{E}^{j-1}(\Gamma)$ for all $j\geq 1$ .

In light of Lemma 2.2, it is natural to call $\max\mathcal{E}^{j-1}$ the $j^{\text{th}}$ column length functional. A crucial advantage of extracting the column length $\lambda_{j}$ from the functional $\max\mathcal{E}^{j-1}$ is that this operation is continuous with respect to the topology of $C_{0}^{+}(\mathbb{R}^{+})$ as stated in the lemma below. This enables us to take various scaling limits of the system.

Lemma 2.3.

For any interval $I\subseteq\mathbb{R}^{+}$ , functions $f,g\in C_{0}^{+}(I)$ , and $j\geq 1$ ,

[TABLE]

Remark 2.4 (Depth process with drains).

In private communication with Jim Pitman, we learned that an operator equivalent to $\mathcal{E}_{b}$ was used in studying Brownian paths and continuum random trees. In our context, given a Motzkin path $\Gamma$ , flip it upside down and consider it as a bucket filled to the top with water. Given $b\in\mathbb{R}^{+}$ , poke a hole at point $(b,-\Gamma(b))$ . This will drain some of the water, and $-\mathcal{E}_{b}(\Gamma)(x)$ gives the water level at each $x\in\mathbb{R}^{+}$ . For instance, the red path in Figure 4 can be obtained from the black one in this way with drain at $b=\mathtt{m}(\Gamma)$ . A similar procedure can be defined with multiple drains. This operation was applied to Brownian paths to study, for example, the line-breaking construction of the continuum random tree in a Brownian excursion [1]; sampling bridges, meanders, and excursions at independent uniform times [18]; and developments in the tree setting with different metaphors such as “forest growth” and “bead crushing” [19, 20]. **

2.3. Rooted forests

In this subsection, we develop an alternative perspective for constructing the Young diagram from an associated rooted forest. The idea is to collapse a Motzkin path to a rooted forest by horizontal identification. Intuitively, one paints the underside of the graph of each excursion with glue and then compresses it horizontally to obtain a tree. Then the original Motzkin path can be viewed as the contour process (or Harris walk in the random setting) of the rooted forest so constructed. This point of view will be especially useful for thinking about arguments in Section 7.

To begin, recall that a rooted forest is a sequence of vertex-disjoint plane trees $\{T_{i}\}_{i\geq 1}$ such that each $T_{i}$ is rooted at a vertex $\mathtt{r}_{i}\in V(T_{i})$ . The level of a vertex $v\in T_{i}$ is defined as $\ell(v)=d(v,\mathtt{r}_{i})$ where $d$ is the graph distance. Given a Motzkin path $\Gamma$ , we define a rooted forest $\mathfrak{F}(\Gamma)$ as follows: Let $G(\Gamma)=(V,E)$ be the graph with vertex set $V=\left\{(k,\Gamma_{k})\right\}_{k\in\mathbb{N}_{0}}\subset\mathbb{N}_{0}^{2}$ and adjacency relation

[TABLE]

In words, $G(\Gamma)$ is obtained from $\Gamma$ by removing the $h$ -strokes at [math] but retaining all vertices. Clearly each component of $G(\Gamma)$ is isomorphic to a path beginning and ending at height [math], and there are only finitely many such paths since $\Gamma$ has finite support. Arranging the components from left to right so that their vertex labels are increasing, let $P_{i}$ denote the $i^{\text{th}}$ component from the left. Define an equivalence relation $\sim$ on the vertex set of $G(\Gamma)$ by

[TABLE]

and write $T_{i}=P_{i}/\mathord{\sim}$ for the resulting rooted tree; see Figure 5. The rooted forest associated with $\Gamma$ is $\mathfrak{F}(\Gamma)=\{T_{i}\}_{i\geq 1}$ .

We can recover $\Gamma$ from $\mathfrak{F}(\Gamma)$ by keeping track of the levels of the vertices explored in depth-first search. This exploration process begins at the root of $T_{1}$ and visits nodes from bottom to top and from left to right in such a way that it backtracks to the parent of the current node only if there is no child left to visit. After exhausting all nodes in $T_{1}$ , the explorer moves to the second tree $T_{2}$ , and so on.

More concretely, let $\iota:\mathbb{N}_{0}\rightarrow V(\mathfrak{F})$ be the function which maps $k$ to the location of the depth-first search at step $k$ so that $\iota(0)=\mathtt{r}_{1}$ , $\iota(k+1)$ is the leftmost unvisited child of $\iota(k)$ if such a child exists, and $\iota(k+1)$ is the parent of $\iota(k)$ if its children have all been visited. (Here the parent of $\mathtt{r}_{i}$ is taken to be $\mathtt{r}_{i+1}$ .) The depth-first-search ordering of the vertices of $\mathfrak{F}$ is given by $u\prec v$ if $\min\{k:\iota(k)=u\}<\min\{k:\iota(k)=v\}$ . Finally, the contour process on $\mathfrak{F}$ is the function $H(\mathfrak{F}):\mathbb{N}_{0}\rightarrow\mathbb{N}_{0}$ which maps $k$ to the level of $\iota(k)$ in $\mathfrak{F}$ . By construction, $H(\mathfrak{F})(k)=\Gamma_{k}$ for every $k\in\mathbb{N}_{0}$ .

Now we discuss how to compute the Young diagram $\Lambda(\Gamma)$ from the corresponding rooted forest $\mathfrak{F}(\Gamma)$ . In the previous subsection, we constructed the diagram from the Motzkin path via successive applications of the hill-flattening and excursion operators. In terms of rooted forests, these operators can be interpreted in terms of ‘trimming’ and ‘lopping.’ Namely, let $\Upsilon_{0}$ be the collection of all rooted forests with finitely many vertices and consider the trimming operator $\mathcal{T}:\Upsilon_{0}\rightarrow\Upsilon_{0}$ which deletes all leaves of the input forest; see Figure 5.

Next, the lopping operator $\mathcal{L}:\Upsilon_{0}\rightarrow\Upsilon_{0}$ is defined as follows: Given a rooted forest $\mathfrak{F}=\{T_{i}\}\in\Upsilon_{0}$ , find the rightmost node of maximal level, say $v_{\mathtt{m}}\in V(T_{k})$ . Set $q=\iota^{-1}(v_{\mathtt{m}})$ and let $\gamma$ be the unique path from $\mathtt{r}_{k}$ to $v_{\mathtt{m}}$ . Now let $\mathfrak{F}_{1}$ and $\mathfrak{F}_{2}$ be the rooted forests induced from $\mathfrak{F}$ such that $V(\mathfrak{F}_{1})=\iota([1,q])$ and $V(\mathfrak{F}_{2})=\iota([q,\infty))$ . Then $\mathcal{L}(\mathfrak{F})$ is obtained by first deleting all edges contained in the copies of $\gamma$ from $\mathfrak{F}_{1}$ and $\mathfrak{F}_{2}$ , and then taking the union of the resulting rooted forests with components ordered according to the depth-first search; see Figure 6.

The following proposition shows that these operators are compatible with each other and gives a way to construct the Young diagram $\Lambda(\Gamma)$ from $\mathfrak{F}(\Gamma)$ .

Proposition 2.5.

For each Motzkin path $\Gamma$ , we have the following:

(i)

$\mathfrak{F}\big{(}\mathcal{H}(\Gamma)\big{)}=\mathcal{T}\big{(}\mathfrak{F}(\Gamma)\big{)}$ .

(ii)

$\mathfrak{F}\big{(}\mathcal{E}(\Gamma)\big{)}=\mathcal{L}\big{(}\mathfrak{F}(\Gamma)\big{)}$ .

(iii)

For each $1\leq i\leq\max\Gamma$ , $\rho_{i}=\text{$ # $of leaves in$ \mathcal{T}^{i-1}(\mathfrak{F}(\Gamma)) $}$ .

(iv)

For each $1\leq j\leq\rho(\Gamma)$ , $\lambda_{j}=\text{ maximal level of nodes in$ \mathcal{L}^{j-1}(\mathfrak{F}(\Gamma)) $}$ .

Proof.

For (i), note that leaves in the forest correspond to hills in the path, so applying $\mathcal{H}$ to $\Gamma$ results in the forest obtained by applying $\mathcal{T}$ to $\mathfrak{F}(\Gamma)$ . For (ii), observe that $\mathcal{E}$ only affects the rightmost excursion of maximal height in $\Gamma$ , $\mathcal{L}$ only affects the rightmost tree of maximal height in $\mathfrak{F}(\Gamma)$ , and the ‘bushes’ growing off of the ‘trunk’ of this tree correspond precisely to the subexcursions in the corresponding path component which are not subsumed by the maximum.

Now assertion (i) shows that $\mathfrak{F}(\mathcal{H}^{i-1}(\Gamma))=\mathcal{T}^{i-1}(\mathfrak{F}(\Gamma))$ for all $1\leq i\leq\max\Gamma$ , and $\rho_{i}$ is the number of hill intervals of $\mathcal{H}^{i-1}(\Gamma)$ , which equals the number of leaves in $\mathfrak{F}(\mathcal{H}^{i-1}(\Gamma))=\mathcal{T}^{i-1}(\mathfrak{F}(\Gamma))$ , and (iii) follows. Finally, given a rooted forest $\mathfrak{F}$ , denote by $\lVert\mathfrak{F}\rVert$ the maximal level of nodes in $\mathfrak{F}$ . Then $\lVert\mathfrak{F}(\Gamma)\rVert=\max\Gamma$ , so (ii) implies

[TABLE]

We remark that Proposition 2.5 (iv) holds if we replace the lopping operator $\mathcal{L}$ by the much simpler one which simply contracts the rightmost longest path into a single root. However, for this contraction operator Proposition 2.5 (ii) no longer holds.

3. Random box-ball system and Harris walk

In this section, we describe stochastic objects corresponding to the random box-ball system introduced in Subsection 1.3.

3.1. Harris walks

Fix $p\in(0,1)$ , and let $\xi_{1},\xi_{2},\ldots$ be i.i.d. with $\mathbb{P}\{\xi_{1}=1\}=p$ and $\mathbb{P}\{\xi_{1}=-1\}=1-p$ . Let $X^{p},X^{n,p}\in\{0,1\}^{\mathbb{N}}$ be as in Subsection 1.3, and let $\{S_{k}\}_{k=0}^{\infty}$ be the associated random walk, where $S_{0}=0$ and $S_{k}=\xi_{1}+\cdots+\xi_{k}$ . The Harris walk $\{H_{k}\}_{k=0}^{\infty}$ associated with $X^{p}$ is defined by $H_{0}=0$ and $H_{k}=\left(H_{k-1}+\xi_{k}\right)\vee 0$ for $k\geq 1$ . In other words, $H_{k}$ is a simple random walk with increments $\xi_{j}$ , except that downsteps at [math] are censored.

This defines an irreducible and aperiodic birth-and-death chain on $\mathbb{N}_{0}$ with transition probabilities $P(x,x+1)=p$ , and $P(x,\left(x-1\right)\vee 0)=1-p$ . One readily verifies that the chain is reversible with respect to the measure $\mu(x)=\theta^{-x}$ where $\theta=(1-p)/p$ . Note that the sum $\sum_{k\geq 1}\theta^{k}$ converges if and only if $p>1/2$ , so the chain is transient for these values of $p$ and recurrent for $p\leq 1/2$ . It is null recurrent when $p=1/2$ since then $\sum_{k\geq 1}\theta^{-k}=\infty$ , and it is positive recurrent for $p<1/2$ as the latter sum converges in this case. (See [12] for background on recurrence criteria for birth-and-death chains.) In the ergodic regime, $p<1/2$ , we can normalize $\mu$ to obtain the stationary distribution $\pi(x)=[(1-2p)/(1-p)]\theta^{-x}$ .

Now the random Motzkin path $\Gamma(X^{n,p})$ is given by the trajectory of the Harris walk up to time $n$ , completed by appending downstrokes at the end until the height reaches [math] and appending $h$ -strokes thereafter. More precisely, if we define $H:\mathbb{R}^{+}\rightarrow\mathbb{R}^{+}$ to be the linear interpolation of the Harris walk, $H(t)=H_{\lfloor t\rfloor}+(t-\lfloor t\rfloor)(H_{\lfloor t+1\rfloor}-H_{\lfloor t\rfloor})$ , then we have

[TABLE]

Moreover, an easy induction argument shows that for all $k\in\mathbb{N}_{0}$ ,

[TABLE]

Thus if $S:\mathbb{R}^{+}\rightarrow\mathbb{R}$ is the linear interpolation of the random walk $\{S_{k}\}_{k=0}^{\infty}$ , then $H=\mathcal{E}_{0}(S)$ . This observation also shows that, marginally, $H_{k}=_{d}\max_{0\leq r\leq k}S_{r}$ .

3.2. Galton-Watson forests

Following the procedure outlined in Subsection 2.3, one can construct a random rooted forest $\mathfrak{F}(X^{n,p})=\mathfrak{F}\big{(}\Gamma(X^{n,p})\big{)}$ from the trajectory of the truncated Harris walk $\Gamma(X^{n,p})$ , and it turns out that $\mathfrak{F}(X^{n,p})$ has the same law as the sub-forest of a Galton-Watson forest with mean offspring number $p/(1-p)$ consisting of the first $n$ nodes revealed by depth-first search.

To be precise, let $\{\zeta^{k}_{j}\}_{j,k\geq 1}$ be an array of i.i.d. $\mathbb{N}_{0}$ -valued random variables, and define the sequence $\{Z_{k}\}_{k\geq 0}$ by $Z_{0}=1$ and

[TABLE]

The interpretation is that $Z_{k}$ is the population size in the $k^{\text{th}}$ generation of a species in which individuals survive for a single generation and produce an i.i.d. number of offspring before dying. $\zeta^{k+1}_{j}$ is the number of offspring of the $j^{\text{th}}$ individual in generation $k$ , and the common law of the $\zeta$ ’s is called the offspring distribution. The family tree $T$ for this population is known as a Galton-Watson tree. We will be interested in Galton-Watson trees with geometric offspring distribution

[TABLE]

which is the number of independent $\text{Bern}(p)$ trials preceding the first failure. Observe that $\mathbb{E}[\zeta^{k}_{j}]=p/(1-p)$ , so $T$ is subcritical if $0<p<1/2$ , critical if $p=1/2$ , and supercritical if $1/2<p<1$ . The law of a Galton-Watson tree with $\text{Geom}(1-p)$ offspring distribution will be denoted by $\mathtt{GWT}(p)$ .

We call a sequence of i.i.d. Galton Watson trees $\mathfrak{F}_{GW}=\{T_{i}\}_{i\geq 1}$ a Galton-Watson forest, and write $\mathtt{GWF}(p)$ for the law of a forest of i.i.d. $\mathtt{GWT}(p)$ trees. It is well known that for $0<p\leq 1/2$ , each component $T_{i}$ is finite with full probability [7, Ch. 5.3.4], so the depth-first-search visits all nodes in the forest. However, for $p>1/2$ , each component has a positive probability of being infinite, so almost surely there exists an index $I<\infty$ such that $|T_{i}|<\infty$ for all $i<I$ and $|T_{I}|=\infty$ . Thus for $p>1/2$ , the depth-first-search cannot pass beyond the leftmost infinite branch in $T_{I}$ ; see Figure 8.

Now let $\mathfrak{F}_{p}\sim\mathtt{GWF}(p)$ , write $\mathfrak{F}_{n,p}$ for the vertex-induced subforest of $\mathfrak{F}_{p}$ on the nodes $\iota([1,n])\subseteq V(\mathfrak{F}_{p})$ which are visited by the depth-first-search in the first $n$ steps, and write $\mathtt{GWF}(n,p)$ for the law of $\mathcal{F}_{n,p}$ .

Proposition 3.1.

$\mathfrak{F}(X^{n,p})\sim\mathtt{GWF}(n,p)$ .

Proof.

Let $\Gamma=\Gamma(X^{p})$ and $\mathfrak{F}=\mathfrak{F}(\Gamma)$ . Denote by $Z_{v}$ the number of children of node $v\in V(\mathfrak{F})$ . We will show that the $Z_{v}$ ’s are i.i.d. and have the law of the number of independent $\text{Bern}(p)$ trials before the first failure. This will imply that the Harris walk $\{H_{k}\}_{k=0}^{\infty}$ is distributed as the contour process of $\mathfrak{F}_{p}$ . Then the relation between $\Gamma(X^{n,p})$ and $H$ from the previous subsection yields the assertion.

Let $\mathfrak{F}(X^{p})=\{T_{i}\}_{i\geq 1}$ and fix a node $v\in V(T_{i})$ for some $i\geq 1$ . Let $P_{i}$ be the path component in $G(\Gamma)$ which is collapsed to $T_{i}$ via the equivalence relation $\sim$ . Note that the number of nodes in $P_{i}\setminus\{v\}$ which are identified with $v$ equals the number of children of $v$ . Let $x=(a_{0},\Gamma_{a_{0}})$ be such a vertex of $P_{i}$ with $a_{0}$ minimal. If $\Gamma_{a_{0}+1}-\Gamma_{a_{0}}=\xi_{a_{0}+1}$ is $1$ , then the depth-first search finds the first child of $v$ ; otherwise, $v$ is childless and the search moves to its parent or to the root of next tree $T_{i+1}$ depending on whether $\Gamma_{a_{0}}\geq 1$ or $\Gamma_{a_{0}}=0$ . If $\xi_{a_{0}+1}=1$ , then let $a_{1}=\min\{k\geq a_{0}\hskip 0.50003pt:\hskip 0.50003pt\Gamma_{k}=\Gamma_{a_{0}}\}$ be the first return time to level $\Gamma_{a_{0}}$ after $a_{0}$ . ( $a_{1}$ may be infinite if $p>1/2$ .) As before, the depth-first search finds the second child of $v$ if and only if $\xi_{a_{1}+1}=1$ . Continuing thusly, we see that $Z_{v}$ has a $\text{Geom}(1-p)$ distribution, and the proof is complete. ∎

Proposition 3.1 allows us to describe the joint distribution of the first $i$ rows or the first $j$ columns in the random box-ball system started at $X^{n,p}$ in terms of Galton-Watson Forests.

Corollary 3.2.

Suppose that $\mathfrak{F}\sim\mathtt{GWF}(n,p)$ . For each $i\geq 1$ , let $\mathfrak{l}_{i}$ and $\mathfrak{h}_{i}$ be the number of leaves in $\mathcal{T}^{i-1}(\mathfrak{F})$ and the maximum height of $\mathcal{L}^{i-1}(\mathfrak{F})$ , respectively. Then for any $1\leq i\leq\max(\Gamma(X^{n,p}))$ and $1\leq j\leq\rho(\Gamma(X^{n,p}))$ , we have

[TABLE]

and

[TABLE]

4. Asymptotics for the rows

In this section, we prove our first main result, Theorem 1. From the construction described in Subsection 2.2, we have that $\rho_{1}(n)$ , the length of the first row of $\Lambda^{n,p}$ , equals the number of peaks in $\Gamma(X^{n,p})$ , which equals the number of $1\,0$ patterns in $X^{n,p}$ . In general, $\rho_{i}(n)$ is the number of subexcursions of height $i$ in the Harris walk $\{H_{k}\}_{k=0}^{n}$ , and these can also be understood in terms of certain binary patterns in the initial configuration.

We begin with a proof of the $i=1$ case of Theorem 1 using arguments from renewal theory. Strong laws for the other rows can be deduced similarly by considering analogous (delayed) renewal processes, but we will find it more convenient to pursue an alternative approach that will be of use in Section 8.

Proof of Theorem 1 for $\boldsymbol{i=1}$ .

First observe that the number of solitons in $X^{n,p}$ is equal to the number of $1\,0$ patterns, so $\rho_{1}(n)=\mathbf{1}\{\xi_{n}=1\}+N_{10}(n)$ where $N_{10}(n)$ is the number of $1\,0$ patterns in the first $n$ terms. Because of the scaling, it suffices to prove that $N_{10}(n)=\sum_{k=1}^{n-1}\mathbf{1}\big{\{}\xi_{k}=1,\xi_{k+1}=-1\big{\}}$ satisfies the asserted limit theorems.

Now $N_{10}(n)$ counts occurrences of ‘head, tail’ patterns in a sequence of independent coin flips, which we view as a renewal process. Let $T_{10}$ be distributed as the inter-event times in this process. Then the elementary renewal theorem gives $\mathbb{E}[N_{10}(n)]/n\rightarrow 1/\mathbb{E}[T_{10}]$ . Since $\mathbb{E}[N_{10}(n)]=(n-1)p(1-p)$ , it follows from the strong law for renewal processes that

[TABLE]

Renewal theory also shows that $N_{10}(n)$ converges weakly to a standard normal random variable when appropriately normalized [3]. To compute the variance, we write $W_{k}=\mathbf{1}\big{\{}\xi_{k}=1,\xi_{k+1}=-1\big{\}}$ and observe that $\mathbb{E}[W_{k}]=p(1-p)$ , $\mathbb{E}[W_{k}W_{k+1}]=0$ , and $\mathbb{E}[W_{k}W_{\ell}]=p^{2}(1-p)^{2}$ when $|k-\ell|>1$ , hence

[TABLE]

so

[TABLE]

The second part of the theorem follows upon invoking Slutsky’s theorem to simplify the expression $(N_{10}(n)-\mathbb{E}[N_{10}(n)])/\operatorname{\textnormal{Var}}(N_{10}(n))^{1/2}$ . ∎

Remark 4.1.

The normal convergence of $\rho_{1}(n)$ can also be established using Stein’s method for sums of locally dependent random variables (see [2, Ch. 9]). Though this approach is more involved, it has the upshot of supplying a Berry-Esseen rate of order $O(n^{-1/2})$ . One can show that a central limit theorem also holds for the other row lengths by a similar renewal theory argument, but the corresponding variance computations are not as straightforward. **

To treat the $i>1$ case, we need to establish some more terminology and a useful lemma. Let $\gamma:\mathbb{R}^{+}\rightarrow\mathbb{R}$ be any nearest neighbor lattice path (so $|\gamma_{k+1}-\gamma_{k}|\in\{-1,0,1\}$ for all $k\in\mathbb{N}_{0}$ ). We say that $\gamma$ has a subexcursion of height $h$ on the interval $[r,t]$ if $\gamma_{r}=\gamma_{t}<\gamma_{s}$ for all $s\in(r,t)$ and $\max_{r<s<t}\gamma_{s}-\gamma_{r}=h$ . Such a subexcursion is said to begin at $r$ and end at $t$ .

Let $\{S_{k}\}_{k=0}^{\infty}$ be the simple random walk with increment distribution $\mathbb{P}\{S_{k+1}-S_{k}=1\}=1-\mathbb{P}\{S_{k+1}-S_{k}=-1\}=p$ . For each $i\geq 1$ and $\ell\geq 0$ , define the indicator random variable

[TABLE]

and let $\tau_{\ell}^{i}$ be the length of the subexcursion of $S_{k}$ beginning at $k=\ell$ , conditional on $J_{\ell}^{i}=1$ . Note that the distribution of $\tau_{\ell}^{i}$ does not depend on $\ell$ by the Markov property of $S_{k}$ , so we may drop the subscript when notationally convenient. Moreover, due to the negative drift of $S_{k}$ for $0<p<1/2$ , it is not hard to see that $\tau^{i}$ has an exponential tail.

The following lemma establishes a polynomial tail bound for the sum of centered indicators and thereby a strong law for the number of subexcursions of fixed height in the interval $[0,n]$ . This bound (with $m=4$ and $\varepsilon=1/\log n$ ) will also be used in in the proof of Theorem 3 in Section 8.

Lemma 4.2.

Let $\varsigma=\inf\{k>0\hskip 0.50003pt:\hskip 0.50003ptS_{k}=0\}$ be the first return time of $S_{k}$ to zero. Fix $i\geq 1$ and $\varepsilon>0$ . Set $\mu_{i}=\mathbb{P}\{\text{$ \max_{0\leq k\leq\varsigma}S_{k}=i $}\}$ . Then for each fixed $m\geq 2$ , there exists a constant $C=C(m,i,p)>0$ such that for each $n\geq 1$ ,

[TABLE]

With the above lemma (proved at the end of this section), it is easy to deduce Theorem 1.

Proof of Theorem 1 for $\boldsymbol{i\geq 1}$ .

The hill-flattening procedure produces a unique column of length at least $i$ for each such subexcursion, so $\rho_{i}(n)$ is the number of height $i$ subexcursions of $H$ on $[0,n]$ . Since the Harris walk $H_{k}$ and the associated simple random walk $S_{k}$ over $[0,n]$ share subexcursions of positive height, we may regard $\rho_{i}(n)$ as the number of subexcursions of $S_{k}$ occuring on $[0,n]$ . Furthermore, we can approximate $\rho_{i}(n)$ by $N_{i}(n):=\sum_{\ell=0}^{n}J_{\ell}^{i}$ since the two only differ when $H$ has a subexcursion of height at least $i$ beginning at or after $n-i$ , hence $|N_{i}(n)-\rho_{i}(n)|\leq 1$ . Therefore, the assertion follows from Lemma 4.2 with $m=3$ , $\varepsilon=1/\log n$ and the first Borel-Cantelli lemma. ∎

Our proof of Lemma 4.2 is based on joint moment estimates of the random variables $J_{\ell}^{i}-\mu_{i}$ . Before undertaking this task, we give some preliminary calculations and remarks to set the stage. Fix integers $i\geq 0$ and $0\leq\ell_{1}<\ell_{2}<\ell_{3}$ . Clearly $\mathbb{E}[J_{\ell_{1}}^{i}-\mu_{i}]=0$ , and we compute

[TABLE]

where we used the fact that $J_{\ell_{2}}^{i}$ is independent of $\big{\{}J_{\ell_{1}}^{i}=1\big{\}}$ if the excursion starting at $\ell_{1}$ ends at or before $\ell_{2}$ , and $J_{\ell_{2}}^{i}=0$ otherwise. Arguing analogously, we find that

[TABLE]

This shows that $J_{1}^{i},J_{2}^{i},\ldots,J_{n}^{i}$ are not negatively associated for $n\geq 3$ , so an immediate Chernoff-Hoeffding type bound is not applicable in our case.

Now in order to prove Lemma 4.2, we need to estimate the joint central moments of the random variables $J_{\ell}^{i}$ . For the sake of readability, this is split up into two propositions. Here and henceforth, the empty product is understood to equal one.

Proposition 4.3.

Fix integers $r\geq 2$ , $0\leq\ell_{1}<\ell_{2}<\cdots<\ell_{r}$ , and $\alpha_{1},\ldots,\alpha_{r}>0$ . Then

[TABLE]

Proof.

Write $\beta_{s}=\prod_{k=2}^{s-1}(-\mu_{i})^{\alpha_{k}}$ . Casing out according to whether $J^{i}_{\ell_{1}}$ is [math] or $1$ , we see that $(J^{i}_{\ell_{1}}-\mu_{i})^{\alpha_{1}}=c_{1}J^{i}_{\ell_{1}}+d_{1}$ where $d_{1}=(-\mu_{i})^{\alpha_{1}}$ and $c_{1}=(1-\mu_{i})^{\alpha_{1}}-(-\mu_{i})^{\alpha_{1}}$ . Since $\mu_{i}\in[0,1]$ , a little calculus shows that $\left|c_{1}\right|,\left|d_{1}\right|\leq 1$ . Now the strong Markov property for $S_{k}$ implies that for any $s\geq 2$ , $J^{i}_{\ell_{s}}$ is independent of $J^{i}_{\ell_{1}}$ if the excursion starting at $\ell_{1}$ ends at a site less than or equal to $\ell_{s}$ ; otherwise $J^{i}_{\ell_{s}}=0$ . By partitioning according to the length $\tau_{\ell_{1}}^{i}$ of the first excursion we compute

[TABLE]

Since $|c_{1}|,|\mu_{i}|,|\beta_{s}|\leq 1$ , $\mathbb{P}\big{\{}\tau^{i}\in(\ell_{s-1}-\ell_{1},\ell_{s}-\ell_{1}]\big{\}}\leq\mathbb{P}\big{\{}\tau^{i}>\ell_{s-1}-\ell_{1}\big{\}}$ , and $c_{1}\mu_{i}+d_{1}=\mathbb{E}\big{[}(J^{i}_{\ell_{1}}-\mu_{i})^{\alpha_{1}}\big{]}$ , the triangle inequality yields

[TABLE]

The key intuition for the next step is that each linear factor $(J^{i}_{\ell_{k}}-\mu_{k})$ effectively decreases the ‘degrees of freedom’ by at least a half. This idea is codified in the following proposition.

Proposition 4.4.

Fix integers $r\geq 2$ , $0\leq\ell_{1}<\ell_{2}<\cdots<\ell_{r}$ , and $\alpha_{1},\ldots,\alpha_{r}\geq 1$ . If $I:=\{1\leq k<r\hskip 0.50003pt:\hskip 0.50003pt\alpha_{k}=1\}$ is nonempty, then there exist constants $C_{r},D>0$ such that

[TABLE]

Proof.

Since the length $\tau^{i}$ of a subexcursion of height $i$ in $S_{k}$ has exponential tail, we may choose constants $D,D_{0}>0$ such that

[TABLE]

for all $t\geq 0$ .

Also, the exponential is nonnegative, so it’s enough to establish the inequality when the outer sum on the right-hand side is taken over a subset of those $I_{0}\subseteq[1,r)$ with cardinality at least half that of $I$ . Thus, for instance, we may dispense with the $I=\{1\}$ case by showing that the expectation on the left is bounded by a constant multiple of $\exp\left(-D(\ell_{2}-\ell_{1})\right)$ . This is an immediate consequence of Proposition 4.3 and Equation (12) since $\mathbb{E}[J^{i}_{\ell_{1}}-\mu_{i}]=0$ , $\left|\mathbb{E}\big{[}\prod_{k=s}^{r}(J_{\ell_{k}}-\mu_{i})^{\alpha_{k}}\big{]}\right|\leq 1$ , and $\mathbb{P}\big{\{}\tau^{i}>\ell_{s}-\ell_{1}\big{\}}\leq\mathbb{P}\big{\{}\tau^{i}>\ell_{2}-\ell_{1}\big{\}}$ for $s\geq 2$ .

We now proceed by induction on $r$ . The base case follows from the previous observation as the assumption that $I\neq\emptyset$ implies $I=\{1\}$ when $r=2$ . For the induction step, let $r\geq 3$ . Denote by $B_{1}$ and $B_{2}$ the first and second term in the right-hand side of the displayed inequality in Proposition 4.3, and let $K$ denote the sum over $I_{0}$ in the right-hand side of the displayed inequality in Proposition 4.4. By Proposition 4.3, it suffices to show that both $B_{1}$ and $B_{2}$ can be bounded by some constant times $K$ .

For the bound on $B_{2}$ , note that the induction hypothesis gives

[TABLE]

If $\alpha_{1}\geq 2$ , then $I\subseteq[2,r)$ , so we have $B_{2}\leq 2C_{r-1}K$ . Otherwise $\alpha_{1}=1$ and we are assuming $I\neq\{1\}$ , so the induction hypothesis and Equation (12) imply

[TABLE]

If we set $I_{0}:=\{1\}\cup I_{0}^{\prime}$ for each $I_{0}^{\prime}\subseteq[2,r)$ in the above summation, then $2|I_{0}|\geq 2+|I\cap[2,r)|=2+|I|-1>|I|$ . Moreover, the exponential terms can be written as $\exp(-D(\sum_{j\in I_{0}}\ell_{j+1}-\ell_{j}))$ . Accordingly, we have that $B_{2}\leq C_{r-1}K$ .

Next, we show that $B_{1}$ can be bounded by some constant times $K$ . Writing $m_{1}=\max(I)$ , we see that $|I\cap[s+1,r)|\geq 1$ for $s<m_{1}$ , so it follows from the inductive hypothesis, Equation (12), and the fact that all central moments are bounded in absolute value by one that

[TABLE]

with the convention that the empty sum is zero. For the first term, we view its inner sum as ranging over all $I_{0}\subseteq[0,r)$ with $I_{0}:=[1,s)\cup I_{0}^{\prime}$ . Note that $2|I_{0}|=2(s-1)+2|I_{0}^{\prime}|\geq 2(s-1)+|I\cap[s+1,r)|\geq|I|$ since $s\geq 2$ . Furthermore, the sum of the $\ell_{j+1}-\ell_{j}$ terms over $j\in[1,s)$ is exactly $\ell_{s}-\ell_{1}$ . Thus the first term above is at most some constant times $K$ . Finally, taking $m_{2}=\min(m_{1},r-|I|+1)\geq 2$ , we see that the second term is bounded by $D_{0}\sum_{s=m_{2}}^{r}\exp\left(-D(\ell_{s}-\ell_{s-1})\right)$ , which is a single summand in $K$ and so less than $K$ . This completes the inductive step and the proof. ∎

We are now ready to prove Lemma 4.2.

Proof of Lemma 4.2.

Fix $m\in\mathbb{N}$ , and use Chebyshev’s inequality and the linearity of expectation to write

[TABLE]

Our goal is to show that the right-hand side of the above inequality is $O(t^{-2m}n^{m+1})$ . Then letting $t=\varepsilon n$ gives the assertion. (The Landau notation is in terms of $n\rightarrow\infty$ throughout this proof.) We first observe that it suffices to bound the contribution from expectations involving at least $m+1$ distinct $\ell_{k}$ ’s as there are $O(n^{m})$ summands involving fewer and each is $O(1)$ .

Fix $m<r\leq 2m$ and let $\alpha_{1},\ldots,\alpha_{r}$ be positive integers such that $\sum_{k=1}^{r}\alpha_{k}=2m$ . Write $r=u+w$ where $w=\sum_{k=1}^{r}\mathbf{1}\{\alpha_{k}=1\}$ , and let $I=\{1\leq k<r\hskip 0.50003pt:\hskip 0.50003pt\alpha_{k}=1\}$ as in the preceding proposition. Since there are $O(1)$ choices for the $r$ ’s and $\alpha_{k}$ ’s, we need only to demonstrate the existence of a constant $C_{1}=C_{1}(r,i,p)>0$ such that

[TABLE]

for all $n\geq 1$ .

Note that $w\geq 2$ so that $|I|\geq 1$ and Proposition 4.4 applies. Thus we will be done upon showing that for each subset $I_{0}\subseteq[1,r)$ such that $2|I_{0}|\geq|I|$ , there exists a constant $C_{2}=C_{2}(i,p)>0$ such that

[TABLE]

for all $n\geq 1$ . (There are $O(1)$ subsets $I_{0}$ in the sum from Proposition 4.4.)

To verify Equation (15), first observe that if $\ell_{j+1}-\ell_{j}>n^{1/2m}$ for some $j\in I_{0}$ , then the corresponding summand is of order $O(\exp(-Dn^{1/2m}))$ . As there are $O(n^{2m})$ choices, the contribution from such terms is of order $O(1)$ . Accordingly, it suffices to show that there are $O(n^{m+1})$ sequences $0\leq\ell_{1}<\cdots<\ell_{r}\leq n$ not verifying this condition. To this end, let $L$ be the set of maps $\ell:[r]\rightarrow[n]\cup\{0\}$ such that $\ell(j+1)-\ell(j)\leq n^{1/2m}$ for all $j\in I_{0}$ , and let $G=([r],E)$ be the graph with vertex set $[r]$ and edge set $E=\big{\{}\{j,j+1\}\hskip 0.50003pt:\hskip 0.50003ptj\in I_{0}\big{\}}$ . Then $G$ contains at most $r-|E|=r-|I_{0}|$ connected components, say $P_{1},\ldots,P_{N}$ where $P_{i}$ is a path consisting of vertices $\{j_{i},j_{i}+1,\ldots,j_{i}+s_{i}-1\}$ , $s_{i}=|P_{i}|$ . Now for any $\ell\in L$ and $1\leq i\leq N$ , there at most $n^{1+s_{i}/2m}$ possible choices for $\ell(P_{i})$ — $n$ for $\ell(j_{i})$ and $n^{1/2m}$ for each of the $s_{i}-1$ successive vertices. Since $N\leq r-|I_{0}|$ and $\sum_{i=1}^{N}s_{i}=r\leq 2m$ , this gives

[TABLE]

The assertion then follows since

[TABLE]

where we have used the fact that $2|I_{0}|\geq|I|$ and $2u+w\leq\sum_{k=1}^{r}\alpha_{i}=2m$ . ∎

5. Top soliton lengths in the subcritical regime

In this section, we prove Theorem 2 (i). Fix $p\in(0,1/2)$ and let $\left\{H_{k}\right\}_{k=0}^{\infty}$ denote the Harris walk associated with the random box-ball configuration $X^{p}$ . The main insight is that the $j^{\text{th}}$ longest soliton length, $\lambda_{j}(n)$ , is asymptotically equal to the $j^{\text{th}}$ largest excursion height of $H_{k}$ over the interval $[0,n]$ , which we denote by $h_{j}(n)$ (Lemma 5.2). This allows us to obtain limit theorems for the $\lambda_{j}(n)$ in terms of the $h_{j}(n)$ (Lemma 5.1).

Before getting into the details, we discuss the main issue in comparing soliton lengths with excursion heights. Clearly $\lambda_{j}(n)\geq h_{j}(n)$ due to the hill-flattening construction of the invariant Young diagram (Lemma 2.1). For $j=1$ , we also have $\lambda_{1}(n)=h_{1}(n)$ since $\lambda_{1}(n)$ equals the maximum height of the Harris walk over $[0,n]$ by Lemma 2.2. However, this identity does not hold for $j\geq 2$ . Indeed, Lemma 2.2 shows that $\lambda_{2}(n)=\max\mathcal{E}(\Gamma(X^{n,p}))$ , the maximum excursion height of the modified Motzkin path $\mathcal{E}(\Gamma(X^{n,p}))$ . While all but the highest excursion of $\Gamma(X^{n,p})$ are preserved after applying the excursion operator $\mathcal{E}$ , it might be the case that there is a large subexcursion within the highest excursion which dominates the contribution from the second highest excursion of $\Gamma(X^{n,p})$ . In Subsection 5.3, we show that this is not the case asymptotically.

5.1. Overview and main results

We begin by stating the main results of this section and using them to prove Theorem 2 (i). Our first step is to obtain limit theorems for the $h_{j}(n)$ (which will be defined more carefully in the following subsection).

Lemma 5.1.

Set $\theta=(1-p)/p$ , $\sigma=(1-2p)/(1-p)$ , and $\mu_{n}=\log_{\theta}\left((1-2p)\sigma n\right)$ . Let $h_{j}(n)$ be the $j^{\text{th}}$ largest excursion height of the associate Harris walk over $[0,n]$ . Then for any nondecreasing real sequence $\{x_{n}\}_{n\geq 1}$ ,

[TABLE]

and

[TABLE]

Next, we show that the soliton lengths and excursion heights are essentially the same objects.

Lemma 5.2.

Fix $p\in(0,1/2)$ . Then for each $j\geq 1$ ,

[TABLE]

It is then straightforward to derive the main result for soliton lengths in the subcritical regime.

Proof of Theorem

2 (i).

Fix $j\geq 1$ , $x\in\mathbb{R}$ , and let $\mu_{n}=\log_{\theta}\left((1-2p)\sigma n\right)$ . Since $\lambda_{j}(n)\geq h_{j}(n)$ , we have

[TABLE]

Hence Lemma 5.1 shows

[TABLE]

For the other inequality, we have

[TABLE]

so Lemmas 5.1 and 5.2 show that

[TABLE]

5.2. Excursion heights

This subsection is devoted to proving Lemma 5.1. Roughly speaking, we proceed by showing that the Harris walk has $\Theta(n)$ excursions by time $n$ . By relating the excursion heights to a gambler’s ruin problem, we argue that their distribution has an exponential tail. Taking the maximum over the $\Theta(n)$ excursions shows that the law of $h_{1}(n)$ is approximated by a Gumbel distribution after scaling appropriately. The other order statistics are handled similarly.

To begin, set $\tau_{1}=0$ and for $k>1$ , define $\tau_{k}=\inf\{j>\tau_{k-1}\hskip 0.50003pt:\hskip 0.50003ptH_{j}=0\}$ to be the time of the $k^{\text{th}}$ visit to [math]. Thus $\tau_{k}$ is the beginning of the $k^{\text{th}}$ excursion above the $x$ -axis, and $\tau_{k+1}$ is the end of the $k^{\text{th}}$ such excursion. (In this section, if the random walk stays at [math], this counts as an excursion of height [math].) Let

[TABLE]

be the maximum height of the $k^{\text{th}}$ excursion. The strong Markov property ensures that $h_{1},h_{2},\ldots$ are i.i.d. $\mathbb{N}_{0}$ -valued random variables. To compute their distribution function, $F(x)=\mathbb{P}\{h_{1}\leq x\}$ , we observe that $\mathbb{P}\{h_{1}=0\}=1-p$ and $\mathbb{P}\{h_{1}\leq x\}=\mathbb{P}\{1\leq h_{1}\leq x\}+\mathbb{P}\{h_{1}=0\}$ for $x\geq 1$ . In order for the event $\left\{1\leq h_{1}\leq x\right\}$ to occur, the random walk must begin with an upstep and then visit zero before visiting $x+1$ . The latter occurs with the ‘gambler’s ruin’ probability that a simple random walker, started at the origin and moving right with probability $p$ , hits $-1$ before hitting $x$ , which is given by $\big{(}\theta^{x}-1\big{)}/\big{(}\theta^{x}-\theta^{-1}\big{)}$ [7, Ch. 5.7]. Putting all of this together shows that $F(x)=(1-p)+p\big{(}\theta^{x}-1\big{)}/\big{(}\theta^{x}-\theta^{-1}\big{)}$ for all $x\in\mathbb{N}_{0}$ . After a bit of rearranging, we get

[TABLE]

Now let $h_{1:m},\ldots,h_{m:m}$ denote the (reversed) order statistics of $h_{1},\ldots,h_{m}$ so that $h_{1:m}\geq\cdots\geq h_{m:m}$ and $\{h_{1:m},\ldots,h_{m:m}\}=\{h_{1},\ldots,h_{m}\}$ as multisets. Then

[TABLE]

In particular, the maximum $h_{1:m}$ has distribution function

[TABLE]

Write $M_{n}=\sup\{k\hskip 0.50003pt:\hskip 0.50003pt\tau_{k+1}\leq n\}$ for the number of excursions completed by time $n$ and let $r_{n}=\max\{\sum_{i=\tau_{M_{n}+1}}^{r}\xi_{i}\hskip 0.50003pt:\hskip 0.50003pt\tau_{M_{n}+1}\leq r\leq n\}$ be the maximum height attained after the last complete excursion. The excursion heights $h_{1}(n)\geq h_{2}(n)\geq\cdots\geq h_{M_{n}+1}(n)$ are the (reversed) order statistics for $h_{1},\ldots,h_{M_{n}},r_{n}$ . We begin by showing that $M_{n}$ is sharply concentrated around its mean so that we can essentially treat it as a deterministic sequence.

Proposition 5.3.

If $M_{n}$ is the number of excursions of $H$ completed by time $n$ , then

[TABLE]

Proof.

We may write $M_{n}=\sum_{k=1}^{n}\mathbf{1}\{H_{k}=0\}$ , the number of visits to [math] in $[1,n]$ . Since the Harris walk is ergodic with stationary distribution $\pi(x)=[(1-2p)/(1-p)]\theta^{-x}$ for $p<1/2$ , we can apply the Markov chain ergodic theorem to obtain

[TABLE]

The next ingredient in our argument is a simple stochastic monotonicity result.

Proposition 5.4.

Set $\sigma=(1-2p)/(1-p)$ , $p\in(0,1/2)$ . For any real sequence $\{x_{n}\}_{n\geq 1}$ and any positive integer $j$ , we have that for all $\varepsilon>0$ ,

[TABLE]

and

[TABLE]

Proof.

Define

[TABLE]

and

[TABLE]

It follows from Proposition 5.3 that there is an a.s. finite $N$ such that

[TABLE]

with probability one for all $n\geq N$ . Because $r_{n}\leq h_{M_{n}+1}$ and the probability that $h_{M_{n}+1}$ is among the $j$ largest of $h_{1},\ldots,h_{M_{n}+1}$ goes to zero as $n\rightarrow\infty$ , we see that for any $\varepsilon>0$ ,

[TABLE]

when $n$ is sufficiently large, hence

[TABLE]

and

[TABLE]

The desired assertion follows by noting that $M_{N^{-}(n,\varepsilon)}=\lfloor(\sigma-\varepsilon)n\rfloor$ and $M_{N^{+}(n,\varepsilon)}=\lceil(\sigma+\varepsilon)n\rceil$ a.s. since [math] is a recurrent state of $\{H_{k}\}$ . ∎

We are now in a position to prove the main result of this subsection.

Proof of Lemma

5.1.

First, we claim that for any sequence $\{b_{n}\}_{n\geq 1}$ with $\lim_{n\rightarrow\infty}b_{n}/n=c>0$ and any nondecreasing sequence $\{y_{n}\}_{n\geq 1}$ , we have

[TABLE]

Indeed,

[TABLE]

Since $\theta>1$ and $\{y_{n}\}_{n\geq 1}$ is nondecreasing, $\theta^{-y_{n}}$ is bounded. The claim follows since a Taylor expansion of the log term shows that

[TABLE]

Now fix $\varepsilon>0$ and a nondecreasing sequence $\{x_{n}\}_{n\geq 1}$ . Recall that for any deterministic sequence of integers $\{b_{n}\}_{n\geq 1}$ ,

[TABLE]

when $x_{n}+\mu_{n}\geq 0$ . (Since $\{x_{n}\}_{n\geq 1}$ is nondecreasing and $\mu_{n}=\log_{\theta}\left((1-2p)\sigma n\right)\nearrow\infty$ , this restriction is satisfied for all large $n$ .)

Writing $\nu_{n}=(x_{n}+\mu_{n})-\lfloor x_{n}+\mu_{n}\rfloor$ , we have

[TABLE]

so, since $\theta>1$ and $1-\nu_{n}\in(0,1]$ , we see that

[TABLE]

Set $b_{n}=\lfloor(\sigma-\varepsilon)n\rfloor$ and note that $\lim_{n\rightarrow\infty}b_{n}^{-k}\binom{b_{n}}{k}=\frac{1}{k!}$ . Then the above estimates and show that for all sufficiently large $n$ ,

[TABLE]

Thus Equation (19) with $y_{n}=x_{n}+1$ and $b_{n}=\lfloor(\sigma-\varepsilon)n\rfloor$ gives

[TABLE]

By taking $y_{n}=x_{n}$ and $b^{\prime}_{n}=\lceil(\sigma+\varepsilon)n\rceil$ , a similar argument shows that

[TABLE]

Letting $\varepsilon\searrow 0$ and applying Proposition 5.3 completes the proof. ∎

Remark 5.5.

Because we are taking the maximum of a random number of excursions, the sequence $\{h_{j}(n)-\mu_{n}\}_{n\geq 1}$ does not have a weak limit (and thus neither do the normalized subcritical soliton lengths). To see this, we first recall that

[TABLE]

for any real sequence $\{b_{n}\}_{n\geq 1}$ and any $x\in\mathbb{R}$ . Now fix $x>\mu_{1}$ , write $\nu_{n}=(x+\mu_{n})-\lfloor x+\mu_{n}\rfloor$ , and choose subsequences $\{\nu_{n_{k}}\}_{k\geq 1}$ and $\{\nu_{n_{\ell}}\}_{\ell\geq 1}$ such that $\nu_{n_{k}}\leq\frac{1}{3}$ and $\nu_{n_{\ell}}\geq\frac{2}{3}$ for all $k,\ell\in\mathbb{N}$ . This is possible since $\mu_{n}=\log_{\theta}\big{(}(1-2p)\sigma\big{)}+\log_{\theta}(n)$ and the fractional part of $\log_{\theta}(n)$ is dense in $[0,1]$ .**

Since

[TABLE]

$\theta>1$ , $1-\nu_{\ell}\in[0,1/3]$ , and $1-\nu_{k}\in[2/3,1]$ , we have the following analogues of Equation (22):

[TABLE]

and

[TABLE]

Repeating the last part of the proof of Lemma 5.1 (and restricting attention to $h_{1}(n)=\lambda_{1}(n)$ to simplify notation) shows that

[TABLE]

and

[TABLE]

In particular,

[TABLE]

Since the sequence $\big{\{}\lambda_{1}(n)-\mu_{n}\big{\}}$ is tight by Lemma 5.1, both $\big{\{}\lambda_{1}(n_{k})-\mu_{n_{k}}\big{\}}$ and $\big{\{}\lambda_{1}(n_{\ell})-\mu_{n_{\ell}}\big{\}}$ have subsequential weak limits. As Inequality (26) implies that the limiting distribution functions disagree at $x$ , it follows that $\big{\{}\lambda_{1}(n)-\mu_{n}\big{\}}$ does not converge weakly.**

5.3. Subexcursions within an excursion

Given an excursion $\gamma$ of $H$ with length $\varsigma$ and rightmost global maximum at $(m^{\ast},h)$ , define $a_{\ell}=\max\{t\leq m^{\ast}\hskip 0.50003pt:\hskip 0.50003pt\gamma(t)=\ell\}$ and $b_{\ell}=\min\{t\geq m^{\ast}\hskip 0.50003pt:\hskip 0.50003pt\gamma(t)=\ell\}$ for $\ell=0,\ldots,h$ . Write $\gamma_{a,\ell}=\gamma|_{[a_{\ell-1},a_{\ell}]}-\ell$ and $\gamma_{b,\ell}=\gamma|_{[b_{\ell},b_{\ell-1}]}-\ell$ . These paths correspond to the portions of $\gamma$ which, moving away from $m^{\ast}$ , begin at the point where $\gamma$ first descends to height $\ell$ and end where $\gamma$ first descends to height $\ell-1$ , except that they are shifted down by $\ell$ ; see Figure 7.

Set $\widetilde{\gamma}_{a,\ell}=\gamma_{a,\ell}\vee 0$ , $\widetilde{\gamma}_{b,\ell}=\gamma_{b,\ell}\vee 0$ (which has the effect of changing the downstroke furthest from $m^{\ast}$ to an $h$ -stroke) and define

[TABLE]

Then $\mathcal{E}(\gamma)$ is the concatenation of $\widetilde{\gamma}_{a,1},\ldots,\widetilde{\gamma}_{a,h},\widetilde{\gamma}_{b,h},\ldots,\widetilde{\gamma}_{b,1}$ , so $\max\mathcal{E}(\gamma)=\max_{1\leq k\leq 2h}\vartheta(k)$ .

Now let $m_{\ast}$ denote the leftmost global maximum of $\gamma$ and set $c_{\ell}=\min\{t\geq m_{\ast}\hskip 0.50003pt:\hskip 0.50003pt\gamma(t)=\ell\}$ , $\gamma_{c,\ell}=\gamma|_{[c_{\ell},c_{\ell-1}]}-\ell$ , $\widetilde{\gamma}_{c,\ell}=\gamma_{c,\ell}\vee 0$ . Define $\omega(k)=\max\widetilde{\gamma}_{c,k}=\max\gamma_{c,k}$ for $k=1,\ldots,h$ . We first observe that $\max\{\omega(1),\ldots,\omega(h)\}\geq\max\{\vartheta(h+1),\ldots,\vartheta(2h)\}$ . To see that this is so, let $j=\min_{m_{\ast}\leq t\leq m^{\ast}}\gamma(t)$ . Then $b_{k}=c_{k}$ for all $k<j$ , hence $\omega(k)=\max_{c_{k}\leq t\leq c_{k-1}}\gamma(t)-k\geq\max_{b_{k}\leq t\leq b_{k-1}}\gamma(t)-k=\vartheta(h+k)$ for all $k\leq j$ because $c_{\ell}\leq b_{\ell}$ for all $\ell$ . On the other hand, since $c_{j}\leq m^{\ast}<c_{j-1}$ , $\omega(j)=h-j>h-k\geq\vartheta(h+k)$ for all $k>j$ . It follows that $\max\mathcal{E}(\gamma)=\max\{\vartheta(1),\ldots,\vartheta(2h)\}\leq\max\{\vartheta(1),\ldots,\vartheta(h),\omega(1),\ldots,\omega(h)\}$ .

Next we observe that $\gamma$ is symmetric about $\varsigma/2$ in distribution. This is because, conditional on the excursion length, the law of $\gamma$ depends only on the number of up and down steps. Accordingly, $\mathcal{E}\gamma|_{[0,m^{\ast}]}$ and $\mathcal{E}\gamma|_{[m_{\ast},\varsigma]}$ have the same distribution, so $\max\{\vartheta(1),\ldots,\vartheta(h)\}=_{d}\max\{\omega(1),\ldots,\omega(h)\}$ , and thus

[TABLE]

To treat the latter probability, note that given $\{h=r\}$ , $m_{\ast},c_{r-1},\ldots,c_{0}$ are stopping times with respect to the natural filtration, so $\omega(1),\ldots,\omega(r)$ are independent by the strong Markov property. Also, each $\omega(k)$ is stochastically dominated by the random variable $Y$ which gives the maximum value taken by a simple random walker started at [math] and moving right with probability $p$ before hitting $-1$ (as the path $\gamma_{c,k}$ is constrained to be at height at most $h-k$ whereas the random walker’s path has no such restriction). We conclude that on the event $\{h\leq r\}$ ,

[TABLE]

where $G(x)=\mathbb{P}\{Y\leq x\}$ is the gambler’s ruin probability [7, Ch. 5.7]

[TABLE]

Proof of Lemma

5.2.

Fix $j\geq 1$ and let $\varepsilon>0$ be given. Lemma 5.1 implies that there exist $\delta>0$ , $N\in\mathbb{N}$ such that for each $n\geq N$ , the event

[TABLE]

has probability at least $1-\varepsilon$ . Write $H^{(k,n)}$ for the $k^{\text{th}}$ highest excursion of $H|_{[0,n]}$ , so that $h_{k}(n)=\max H^{(k,n)}$ . As each application of the excursion operator affects only one excursion, $\lambda_{1}(n),\ldots,\lambda_{j}(n)$ are the $j$ largest values among $\mathcal{E}^{i-1}\left(H^{(k,n)}\right)$ as $i$ and $k$ range over $\{1,\ldots,j\}$ . On $E_{j,\delta,n}$ , these coincide with $h_{1}(n),\ldots,h_{j}(n)$ when $\mathcal{E}\big{(}H^{(k,n)}\big{)}\leq 2\delta\log_{\theta}(n)$ for $k=1,\ldots,j$ . Since

[TABLE]

for $x>0$ , Equation (27) implies

[TABLE]

Consequently,

[TABLE]

and the claim follows since $\varepsilon$ is arbitrary. ∎

6. Top soliton lengths at criticality

In this section we observe that when $p=1/2$ , the (suitably scaled) Harris walk converges weakly to a reflected Brownian motion at the process level. In fact, this weak convergence can be strengthened to “polynomial convergence” by appealing to a result from Drmota [6]. This enables us to deduce scaling limits for the top soliton lengths.

Recall that $C([0,1])$ denotes the space of continuous functions $f:[0,1]\rightarrow\mathbb{R}$ equipped with the supremum norm. We say a continuous functional $F:C([0,1])\rightarrow\mathbb{R}$ is of polynomial growth if there exists $r\geq 1$ such that $|F(\gamma)|\leq\lVert\gamma\rVert_{\infty}^{r}$ for all $\gamma\in C([0,1])$ .

Theorem 6.1 (Theorem 9 of [6]).

Suppose that a sequence of stochastic processes $x_{n}(t)$ defined on $C([0,1])$ converges weakly to $x(t)$ . Furthermore suppose that there exists $s_{0}\in[0,1]$ such that for all $r\geq 0$ ,

[TABLE]

and that for every $\alpha>1$ , there exists $\beta>0$ and $C>0$ with

[TABLE]

If $F:C([0,1])\rightarrow\mathbb{R}$ is any continuous functional of polynomial growth, then

[TABLE]

We show the following polynomial convergence of Harris walk to the reflected Brownian motion.

Theorem 6.2.

Let $\{B(t)\hskip 0.50003pt:\hskip 0.50003pt0\leq t\leq 1\}$ be a standard Brownian motion and define $H^{n}(t)=H(nt)/\sqrt{n}$ for $0\leq t\leq 1$ . Then for $p=1/2$ ,

[TABLE]

Furthermore, if $F:C([0,1])\rightarrow\mathbb{R}$ is any continuous functional of polynomial growth, then

[TABLE]

Proof.

Since the rescaled Harris walk $H^{n}(t)$ is uniformly bounded by $n^{-\frac{1}{2}}|S^{n}(t)|$ , which has moments of all orders and satisfies the Hölder criterion in Theorem 6.1, we only need to show that $H^{n}$ converges weakly to $|B|$ . To this end, recall from Subsection 1.3 that the linear interpolation of the $p=1/2$ Harris walk is given by $H(t)=\mathcal{E}_{0}(S)(t)=S(t)-\min_{0\leq r\leq t}S(r)$ , where $S$ is the linear interpolation of symmetric simple random walk.

Donsker’s Theorem shows that after scaling diffusively, $S(t)$ converges weakly to a standard Brownian motion in the space $C([0,1])$ . That is, writing $S^{n}(t)=S(nt)/\sqrt{n}$ , we have

[TABLE]

for every bounded and continuous functional $F:C([0,1])\rightarrow\mathbb{R}$ .

A direct computation shows that for any fixed $b\in[0,1]$ , $\mathcal{E}_{b}$ is ( $2$ -Lipschitz) continuous and satisfies $\mathcal{E}_{b}(cf)=c\mathcal{E}_{b}(f)$ for all $b,c\geq 0$ (see Proposition A.6 (i) in Subsection A.3), so for every bounded and continuous $G:C([0,1])\rightarrow\mathbb{R}$ ,

[TABLE]

hence $H^{n}$ converges weakly to $\mathcal{E}_{0}(B)$ . As

[TABLE]

Lévy’s $M-B$ theorem (see [16, Ch. 2.3]) implies $\mathcal{E}_{0}(B)=_{d}|B|$ and the proof is complete. ∎

Now we can use the Lipschitz continuity of column length functionals $\max\mathcal{E}^{j-1}$ to obtain Theorem 2 (ii).

Proof of Theorem

2 (ii)..

First recall that the Motzkin path $\Gamma=\Gamma(X^{n,1/2})$ agrees with the Harris walk $H$ on $[0,n]$ , and has only downstrokes until it reaches height [math] on $[n,\infty)$ , hence all of its peaks are contained in $[0,n]$ . Recall also that the excursion operator deletes the peak at the rightmost maximum and preserves all the other peaks. Thus by Lemma 2.2, we have

[TABLE]

Lemma 2.3 in Section 2 shows that the column length functionals $\max\mathcal{E}^{j-1}:C([0,1])\rightarrow\mathbb{R}$ are Lipschitz, so taking powers gives continuous functionals of polynomial growth, and the claimed convergence follows from Theorem 6.2. A stronger version of the second part of the assertion (concerning orders of column lengths) is shown in Theorem 6.4 below. ∎

To establish the order of the other top soliton lengths, we appeal to known results about the marginal densities of the ranked maxima of $|B|$ over all excursions. To state our conclusions precisely, note that the continuity of $B$ ensures that the random subset $\{t\hskip 0.50003pt:\hskip 0.50003ptB(t)\neq 0\}$ of $[0,1]$ is a countable union of maximal disjoint intervals, called the excursion intervals of $B$ . We call an excursion interval $(a,b)$ complete if $B(a)=B(b)=0$ , and incomplete otherwise. All of the excursion intervals are complete except possibly the last one $(g(t),1]$ , where $g(t)=\sup\{0\leq t\leq 1\hskip 0.50003pt:\hskip 0.50003ptB(t)=0\}$ is the last zero of $B$ .

Let $\mathtt{h}_{1}\geq\mathtt{h}_{2}\geq\cdots>0$ be the ranked sequence of values $\sup_{t\in(a,b)}|B_{t}|$ as $(a,b)$ ranges over all excursion intervals of $B$ . The marginal distributions of the ranked heights over excursions in the reflected Brownian bridge were first obtained by Pitman and Yor [21]. Lagnoux, Mercier, and Vallois [15] pointed out that the probability that the maximum of reflected Brownian motion is obtained during the last incomplete excursion is approximately $0.3069$ . Csaki and Hu [5] obtained the following explicit expressions for the marginal densities of ranked maxima of reflected Brownian motion over all excursions, including the final meander:

Theorem 6.3.

For each $j\geq 1$ and $y>0$ ,

[TABLE]

where $\Phi(\cdot)$ is the standard normal distribution function.

Accordingly, Theorem 6.2 and Lemma 2.2 imply

Theorem 6.4.

At criticality, we have that for each $x>0$

[TABLE]

Furthermore,

[TABLE]

In particular, for any $j\geq 1$ , $\lambda_{j}(n)=\Theta(\sqrt{n})$ .

Remark 6.5.

One might wonder whether the top soliton lengths agree with the top excursion heights as in the subcritical phase. This would imply that the right-hand side of (28) gives the limiting distribution of $\lambda_{j}(n)/\sqrt{n}$ for all $j\geq 1$ . However, we conjecture that this is not the case for $p=1/2$ . This is because the random variable $Y$ appearing in the proof of Lemma 5.2 would then have distribution function $G(x)=1-1/(x+2)$ [7, Ch. 4.1], and one cannot find $x_{n}\in O(\sqrt{n})$ , $r_{n}\in\Omega(\sqrt{n})$ with $1-\big{(}1-\frac{1}{x_{n}+2}\big{)}^{r_{n}}\rightarrow 0$ .**

7. Top soliton lengths in the supercritical regime

In this section, we fix $p\in(1/2,1)$ and prove Theorem 2 (iii). The intuition is the following. According to Proposition 3.1, the top soliton lengths are encoded in the first $n$ nodes of a Galton-Watson forest $\mathfrak{F}=(T_{i})_{i\geq 1}\sim\mathtt{GWF}(p)$ . Since the offspring distribution has mean $p/(1-p)>1$ in the supercritical regime, the random index $I=\min\{i\hskip 0.50003pt:\hskip 0.50003pt|T_{i}|=\infty\,\}$ is almost surely finite. For $n$ large, about $np$ nodes of the infinite component $T_{I}$ will be exposed by the Harris walk, which climbs up along the ‘leftmost’ infinite branch in $T_{I}$ . Hence $\lambda_{1}(n)$ should behave like the maximum of a random walk with positive drift, and $\lambda_{2}$ will be the maximum height of the first few finite components $T_{1},\ldots,T_{I-1}$ together with the ‘bushes’ attached to the infinite branch in $T_{I}$ . We prove the $\lambda_{1}(n)$ assertion by approximating $\{H_{k}\}$ by $\{S_{k}\}$ . For subsequent soliton lengths, we appeal to a duality argument: A backward Harris walk started at the highest node will encounter a subcritical Galton-Watson forest, so $\lambda_{2}$ for density $p$ should behave as $\lambda_{1}$ for density $1-p$ ; see Figure 8.

7.1. Duality and proof of Theorem 2 (iii)

To make the above sketch rigorous, we introduce the notion of flip and dual configurations, which will be used to provide a coupling between the random box-ball configurations $X^{n,p}$ and $X^{n,1-p}$ .

Given a random box-ball configuration $X^{n,p}=X^{p}\mathbf{1}_{[1,n]}$ , define the associated box-ball configurations $\widetilde{X}^{n,p},\hat{X}^{n,p}:\mathbb{N}\rightarrow\{0,1\}$ (which we call the flip and dual) by

[TABLE]

For each $j\geq 1$ , denote $\lambda_{j}(n)=\lambda_{j}(X^{n,p})$ and $\widehat{\lambda}_{j}(n)=\lambda_{j}(\widehat{X}^{n,p})$ .

For $1/2<p<1$ , it is easy to see from the postive drift that $\lambda_{1}(n)=\max_{1\leq k\leq n}H_{k}\approx\max_{1\leq k\leq n}S_{k}\approx S_{n}$ , where $\{S_{k}\}_{k\geq 0}$ and $\{H_{k}\}_{k\geq 0}$ denote the random walk and Harris walk associated with $X^{p}$ . For the subsequent soliton lengths, we establish a duality with corresponding soliton lengths in an appropriate subcritical configuration.

Lemma 7.1.

Fix $\varepsilon>0$ , $j\in\mathbb{N}$ , and $\theta=(1-p)/p<1$ . Then there exists a constant $c=c(p)>1$ such that for each $n,x\geq 1$ ,

[TABLE]

and

[TABLE]

It is straightforward to deduce Theorem 2 from the above lemma.

Proof of Theorem 2 (iii)..

First, we may write

[TABLE]

The first term on the right-hand side converges in probability to zero by Lemma 7.1, and the second term converges in distribution to a standard normal by the usual central limit theorem, so the first part of the assertion follows from Slutsky’s theorem.

The concentration inequality for $\lambda_{1}(n)$ is a consequence of Lemma 7.1 and Hoeffding’s inequality applied to the associated random walk $S_{n}$ :

[TABLE]

for a suitable constant $C$ .

Now let $\hat{\mu}_{n}=\log_{\theta^{-1}}\left(\frac{(1-2p)^{2}}{p}n\right)$ . (This is the $\mu_{n}$ term from Section 5 but with $p$ and $1-p$ switched since we are now working in the supercritical regime.) Then for $j\geq 1$ fixed, Lemma 7.1 implies

[TABLE]

The lower bound is established similarly:

[TABLE]

and the assertion then follows from Theorem 2 (i). ∎

7.2. Proof of Lemma 7.1

We now prove Lemma 7.1, establishing a duality principle between the super- and sub-critical box ball systems. Positive drift ensures that $S$ and $H$ are not too different, so the first claim seems reasonable since $S$ should attain its maximum over $[0,n]$ near $n$ . To explain why the second claim is true, let $\widehat{H}\in C_{0}^{+}(\mathbb{R}^{+})$ be the Harris walk for the dual configuration so that $\widehat{\lambda}_{1}(n)=\max\widehat{H}$ . Now $H$ and $\widehat{H}$ are coupled in such a way that the latter is a time-reversal of $\mathcal{E}_{n}(S)$ , which is approximated by $\mathcal{E}_{n}(H)$ . Thus it all boils down to showing that the path $\mathcal{E}_{n}(H)$ pivoted at $n$ is close to $\mathcal{E}(H)=\mathcal{E}_{\mathtt{m}}(H)$ , pivoted at the actual location $\mathtt{m}=\mathtt{m}(H)$ of the rightmost maximum of $H$ . But again positive drift ensures that $H$ attains its maximum near the end. Continuity of the column length functionals can then be used to show that the two paths must be close to each other in an appropriate sense.

We begin by introducing the following random variable:

[TABLE]

Also, let $\widetilde{S}_{k}$ and $\widetilde{H}_{k}$ be the random walk and Harris walk associated with the flip $\widetilde{X}^{n,p}$ . Observe that $\widetilde{X}^{n,p}$ has the same law as $X^{n,1-p}$ , and for each $1\leq k\leq n$ , we have $\widetilde{S}_{k}=(-\xi_{1})+\cdots+(-\xi_{k})=-S_{k}$ and

[TABLE]

In the following proposition, we show that the maximum of the Harris walk $\{H_{k}\}_{0\leq k\leq n}$ on the interval $[0,n]$ is exponentially concentrated around its last value $H_{n}$ .

Proposition 7.2.

Fix $1/2<p<1$ and let $\widehat{\theta}=p/(1-p)$ . Then for any $n,x\geq 1$ ,

[TABLE]

Proof.

To show the first inequality, let $\mathtt{a}=\mathtt{a}(X^{n,p})$ be the location of the leftmost global minimum of the random walk $\{S_{k}\}_{0\leq k\leq n}$ . Then for any $k\geq\mathtt{a}$ ,

[TABLE]

It follows that

[TABLE]

Now $\widetilde{H}_{k}$ gives the height of the subcritical Harris walk which moves up with probability $1-p$ , so writing $\mathtt{b}$ for the beginning of the excursion interval containing $n$ , Equation (16) shows that

[TABLE]

Proposition 7.3.

Fix $1/2<p<1$ and $\widehat{\theta}=p/(1-p)>1$ . Let $R$ and $\widetilde{H}_{n}$ be as defined at (41) and (42). Then there exists a constant $c=c(p)>0$ such that for all $n,x\geq 1$ ,

[TABLE]

Proof.

Casing out according to the value of $\xi_{1}=\mathbf{1}\big{\{}X^{p}(1)=1\big{\}}-\mathbf{1}\big{\{}X^{p}(1)=0\big{\}}$ shows that for any integer $k\geq 1$ , $\mathbb{P}\big{\{}R\leq k\big{\}}=p\mathbb{P}\big{\{}R\leq k+1\big{\}}+(1-p)\mathbb{P}\big{\{}R\leq k-1\big{\}}$ , hence

[TABLE]

so $\mathbb{P}\big{\{}R=k+1\big{\}}=\widehat{\theta}^{-1}\mathbb{P}\big{\{}R=k\big{\}}$ . It follows that

[TABLE]

for each $x\in\mathbb{N}$ . Thus Proposition 7.2 implies that there is a $c=c(p)>0$ such that

[TABLE]

for all $x\geq 2$ . ∎

We are now ready to prove Lemma 7.1.

Proof of Lemma 7.1..

Fix $n\geq j$ and let $R$ and $\widetilde{H}_{n}$ be as defined at (41) and (42), respectively. According to the exponential bound in Proposition 7.3, it suffices to show the following inequalities:

[TABLE]

Note that the first inequality in (46) follows from Lemma 2.2 and the triangle inequality upon observing that

[TABLE]

To establish the second inequality, let $n^{\ast}:=\mathtt{m}(S\mathbf{1}_{[0,n]})$ denote the rightmost maximum of $S$ on $[0,n]$ , and define the sequence of random variables $\{\check{S}_{k}\}_{0\leq k\leq n}$ by $\check{S}_{k}=S_{k}$ for all $k\neq n$ and $\check{S}_{n}=S_{n^{\ast}}$ . As usual, let $\check{S}$ denote the linear interpolation of $\{\check{S}_{k}\}$ . By construction, $\lVert\check{S}-S\rVert_{\infty}=\widetilde{H}_{n}$ . Also, observe that $\mathcal{E}_{n}(S)(n)=0=\mathcal{E}(\check{S})(n)$ , and for $0\leq j<n$ , writing $m_{j}=\min(S_{j},\ldots,S_{n-1})$ , we have $\mathcal{E}_{n}(S)(j)=S_{j}-\min(m_{j},S_{n})$ and $\mathcal{E}(\check{S})(j)=S_{j}-\min(m_{j},S_{n^{\ast}})=S_{j}-m_{j}$ . If $\min(m_{j},S_{n})=S_{n}$ , then $m_{j}=S_{n}+\mathbf{1}\{S_{n}<m_{j}\}$ . It follows that

[TABLE]

Writing $\widehat{S}_{k}=-(S_{n}-S_{n-k})$ for the random walk associated with the dual configuration, we see that the Harris walk $\widehat{H}_{k}$ can be written as

[TABLE]

for all $0\leq k\leq n$ . As $S_{n}<m_{n-k}$ implies $\widetilde{H}_{n}=\lVert\check{S}-S\rVert_{\infty}\geq 1$ , we have

[TABLE]

for all $k\geq 1$ . Since the functional $\max\mathcal{E}^{j-1}$ is invariant under time reversal, the above observation together with Lemmas 2.2 and 2.3 yields

[TABLE]

Finally, the triangle inequality, Lemma 2.2, and Lemma 2.3 give

[TABLE]

8. Random 312-avoiding permutations

In this section, we discuss some relations between box-ball systems and 312-avoiding permutations and prove Theorem 3.

Recall that for a given permutation $\sigma$ , one can use the Robinson-Schensted algorithm (see [24, Ch. 3.1]) to obtain a pair of standard Young tableaux with common shape $\mathtt{RS}(\sigma)$ . Greene’s theorem [10] relates the sum of the lengths of the first $k$ rows (resp. columns) of the Young diagram $\mathtt{RS}(\sigma)$ to the length of a longest subsequence in $\sigma$ which can be obtained by taking the union of $k$ increasing (resp. decreasing) subsequences. In Proposition 8.1, we show that if $\sigma$ is $312$ -avoiding, then a ‘naive’ version of Greene’s theorem holds: We can subsequently delete longest increasing/decreasing subsequences to obtain subsequent row/column lengths of $\mathtt{RS}(\sigma)$ . Hence, roughly speaking, Theorem 3 gives the asymptotics of the ‘ $k^{\text{th}}$ longest’ increasing/decreasing subsequences of a random $312$ - (or $231$ -) avoiding permutation.

For a precise statement, we introduce some notation. Given two finite sequences $\alpha$ , $\beta$ of positive integers, denote by $\alpha\setminus\beta$ the sequence obtained by deleting all elements in $\beta$ from $\alpha$ . Denote by $\alpha_{+}$ (resp. $\alpha_{-}$ ) an arbitrary longest increasing (resp. decreasing) subsequence of $\alpha$ . Furthermore, let $\alpha_{+}^{\ast}$ (resp. $\alpha_{-}^{\ast}$ ) be the unique longest increasing (resp. decreasing) subsequence in $\alpha$ such that the sum of all numbers used in $(\alpha_{+}^{\ast})^{-1}$ (resp. $(\alpha_{-}^{\ast})^{-1}$ ) is as small (resp. large) as possible. This ensures that $\sigma_{+}^{\ast}$ (resp. $\sigma_{-}^{\ast}$ ) is the ‘leftmost’ (resp. ‘rightmost’) longest increasing (resp. decreasing) subsequence in $\sigma$ . For instance, if $\sigma=146532$ , then both $146$ and $145$ are longest increasing subsequences, where the former is $\sigma_{+}^{\ast}$ . The following proposition is proved in Appendix A.4.

Proposition 8.1.

Let $\tau$ be a 312-avoiding permutation and fix arbitrary $\tau_{-}$ . Then $\mathtt{RS}(\tau\setminus\tau_{-})$ is obtained from $\mathtt{RS}(\tau)$ by deleting its first column. Moreover, $\mathtt{RS}(\tau\setminus\tau_{+}^{\ast})$ is obtained from $\mathtt{RS}(\tau)$ by deleting its first row.

In order to prove Theorem 3, we begin by explaining (an equivalent version of) the construction of the time-invariant Young diagram introduced in [26], which was built upon a connection between box-ball configurations and $312$ -avoiding permutations. The first step is to map a box-ball configuration $X_{0}$ of $m$ balls to a $312$ -avoiding permutation $\sigma=\sigma(X_{0})\in\mathfrak{S}_{m}^{312}$ using the pushing and popping stack operations from [13, Ch. 2.2.1]. To do so, label the balls $1,\ldots,m$ from left to right so that the $i^{\text{th}}$ ball gets label $i$ . Then the one-line notation for $\sigma$ gives the left to right labels of the balls after a single update $X_{0}\mapsto X_{1}$ . That is, we push the symbol $1$ onto an empty stack at the first ball and then, advancing to the right, pop the top of the stack off for storage at each empty box and push $k$ onto the stack upon encountering the $k^{\text{th}}$ ball. See Figure 10 for an illustration.

To get a Young diagram from this stack-representable permutation $\sigma(X_{0})$ , one applies the Robinson-Schensted algorithm to obtain a pair of standard Young tableaux, and records their common shape as $\mathtt{RS}(\sigma(X_{0}))$ . It was shown in [26] that $\mathtt{RS}(\sigma(X_{s}))$ is invariant in $s\geq 0$ and its $j^{\text{th}}$ column length is the $j^{\text{th}}$ longest soliton length in the system. Thus, by Lemma 2.1, this construction gives the same Young diagram which was obtained by hill-flattening operations applied to the Motzkin path.

Proposition 8.2.

Let $X_{0}:\mathbb{N}_{0}\rightarrow\{0,1\}$ be a finitely supported box-ball configuration. Then

[TABLE]

The following proposition (proved in Appendix A.4) shows that there is a bijection between $312$ -avoiding permutations of length $n$ and Dyck paths of length $2n$ which ‘factors through’ box-ball configurations in a natural way. Let $\mathfrak{S}_{n}^{312}$ be the set of all $312$ -avoiding permutations of length $n$ and let $\textup{Dyck}_{2n}$ be the set of all Dyck paths of length $2n$ —that is, lattice paths from $(0,0)$ to $(2n,0)$ consisting only of upstrokes and downstrokes and never dipping below the horizontal axis.

Proposition 8.3.

**

(i)

There exists a bijection $\varphi:\textup{Dyck}_{2n}\rightarrow\mathfrak{S}_{n}^{312}$ .

(ii)

For each $\tau\in\mathfrak{S}_{n}^{312}$ and $\digamma\in\textup{Dyck}_{2n}$ such that $\varphi(\digamma)=\tau$ , there is a box-ball configuration $X_{0}$ such that $\tau=\sigma(X_{0})$ and $\digamma=\Gamma(X_{0})$ .

We now prove Theorem 3 using similar ideas from the proof of Theorem 1 together with some known results on random Dyck paths and random walk excursions.

Proof of Theorem 3.

Recall that $\sigma\in\mathfrak{S}_{n}^{312}$ if and only if $\sigma^{-1}\in\mathfrak{S}_{n}^{231}$ , so the map $\sigma\mapsto\sigma^{-1}$ preserves the uniform distribution on the sets $\mathfrak{S}_{n}^{312}$ and $\mathfrak{S}_{n}^{231}$ . Moreover, $\mathtt{RS}(\sigma)=\mathtt{RS}(\sigma^{-1})$ . Hence it suffices to prove the assertion only for the 312-avoiding permutations.

Let $\digamma$ be a Dyck path of length $2n$ and let $\tau=\varphi(\digamma)$ be the corresponding $312$ -avoiding permutation. Proposition 8.3 enables us to choose a box-ball configuration $X_{0}$ such that $\tau=\sigma(X_{0})$ and $\digamma=\Gamma(X_{0})$ , and Proposition 8.2 implies that $\mathtt{RS}(\tau)=\Lambda(\digamma)$ . If we denote by $\Sigma^{n}$ and $\digamma^{n}$ uniformly random elements of $\mathfrak{S}^{312}_{n}$ and $\textup{Dyck}_{2n}$ , this yields

[TABLE]

Now the contour process described in Subsection 2.3 gives a bijection between Dyck paths of length $2n$ and rooted plane trees with $n+1$ nodes, so part (i) of Theorem 3 follows from (47) and Proposition 2.5.

Part (iii) also follows easily from known results. Indeed, it is well known that under diffusive scaling the random walk excursion converges weakly to a standard Brownian excursion [1]. Moreover, by Theorem 6.1, the convergence is polynomial. Thus (iii) follows from (47) and Lemmas 2.2 and 2.3.

Lastly, we establish the strong law for $\rho_{i}(\digamma^{n})$ stated in part (ii). To begin, fix $i\geq 1$ , and let $\{S_{k}\}_{k\geq 0}$ be a simple symmetric random walk with $S_{0}=0$ . We may view the uniformly random Dyck path $\digamma^{n}$ of length $2n$ as the trajectory of $S_{k}$ over the interval $[0,2n]$ conditioned to stay non-negative and satisfy $S_{2n}=0$ . By (47) and the hill-flattening procedure, $\rho_{i}(\digamma^{n})$ equals the number of subexcursions of $\digamma^{n}$ of height $i$ . Recall the definitions of $\mu_{i}$ and $J_{\ell}^{i}$ given in Lemma 4.2 and above the same lemma, respectively. Let $N_{i}(n)=\sum_{\ell=0}^{n}J_{\ell}^{i}$ . Then $N_{i}(2n)=\rho_{i}(\digamma^{n})$ , so for all $n\geq 1$ and $\varepsilon<1/2n$ ,

[TABLE]

It is well known that the number of Dyck paths of length $2n$ is the $n^{\text{th}}$ Catalan number $\frac{1}{n+1}\binom{2n}{n}$ , so by Stirling’s approximation, $\mathbb{P}\left\{\text{$ S_{k} $is a Dyck path over$ [0,2n] $}\right\}\sim n^{-3/2}/\sqrt{\pi}$ . Now by Lemma 4.2 with $m=4$ and $\varepsilon=\varepsilon(n)=1/\log n\searrow 0$ , we get

[TABLE]

In particular, these probabilities are summable, so the first Borel-Cantelli lemma implies $\rho_{i}(\digamma^{n})/2n\rightarrow\mu_{i}$ a.s. as $n\rightarrow\infty$ . ∎

A Proofs of combinatorial lemmas

In this appendix, we provide proofs of Lemmas 2.1, 2.2, and 2.3, and Propositions 8.1 and 8.3.

A.1. Time invariance of the Young diagram

Our proof of Lemma 2.1 is similar to the argument from [26], which is formulated in terms of Dyck words intead of Motzkin paths. The argument is simplified by Proposition A.1.

To begin, recall that given a box-ball configuration $X_{s}$ of finite support, the associated lattice path $\Gamma(X_{s})$ is constructed by reading $X_{s}$ from left to right: Starting at height [math], increase by $1$ every time a $1$ is encountered, decrease by $1$ whenever a [math] is encountered at positive height, and remain at height [math] otherwise. A simple but useful observation is that reading $X_{s}$ from right to left produces the lattice path $\Gamma(X_{s-1})$ . More precisely, let $(X_{s})_{s\geq 0}$ be a box-ball system started from a finitely supported configuration $X_{0}$ . For each $s\geq 0$ , let $r_{s}=\max\{k\geq 0\hskip 0.50003pt:\hskip 0.50003ptX_{s}(k)=1\}$ be the location of the rightmost 1 at time $s$ . Construct a (backward) lattice path $\reflectbox{$ \vec{\reflectbox{ $\Gamma$ }} $}(X_{s}):\mathbb{N}_{0}\rightarrow\mathbb{N}_{0}$ by $\reflectbox{$ \vec{\reflectbox{ $\Gamma$ }} $}(X_{s})_{k}=0$ for $k\geq r_{s}$ and

[TABLE]

for $0\leq k<r_{s}$ . See Figure A.1 for an illustration. In this appendix, we denote the ordinary lattice path $\Gamma$ by $\vec{\Gamma}$ to emphasize the reading direction.

Proposition A.1.

$\displaystyle{\reflectbox{$ \vec{\reflectbox{ $\Gamma$ }} $}(X_{s+1})=\vec{\Gamma}(X_{s})}$ * for all $s\geq 0$ .*

Proof.

Fix $s\geq 0$ , and observe that both paths are [math] on $[r_{s+1},\infty)$ , so the assertion holds on this interval. Now suppose the paths agree on $[k+1,\infty)$ for some $k<r_{s+1}$ . We must show that $\reflectbox{$ \vec{\reflectbox{ $\Gamma$ }} $}(X_{s+1})_{k}=\vec{\Gamma}(X_{s})_{k}$ .

The definition of the box-ball dynamics shows that $X_{s+1}(k+1)=1$ if and only if $\vec{\Gamma}(X_{s})_{k}-1=\vec{\Gamma}(X_{s})_{k+1}$ , hence

[TABLE]

The inductive hypothesis implies

[TABLE]

and

[TABLE]

To facilitate the proof of Lemma 2.1, it is convenient to reformulate the procedure for building Young diagrams row by row: Rather than flatten hills, we contract peaks by deleting the upstroke-downstroke pair and then identifying the endpoints so that the path remains connected. The number of hills after flattening is the same as the number of peaks after contracting, so everything is exactly same as before. The advantage here is that if one begins with an $h$ -restricted Motzkin path, then the hills are always peaks and the Motzkin paths are always $h$ -restricted. Moreover, the contraction operation can be understood in terms of the environment as deleting $1\,0$ patterns.

Proof of Lemma 2.1..

The second part of the assertion clearly holds for all stable box-ball configurations $X_{0}:\mathbb{N}\rightarrow\{0,1\}$ of finite support. Since the system always stabilizes, the second part follows from the time invariance as stated in the first part.

Now let $(X_{s})_{s\geq 0}$ be as before. To show the time invariance of $\Lambda(X_{s})$ , recall that the construction of $\Lambda(X_{s})$ begins by counting the number of peaks in the path corresponding $X_{s}=X_{s}^{(0)}$ . This is equal to the number of $1\,0$ patterns, which is equal to the number of $1$ -strings, which is equal to the number of $0\,1$ patterns. The length of the first row of $\Lambda(X_{s})$ is given by this number. The peaks are then contracted by deleting the $1\,0$ patterns from $X_{s}$ to obtain $X_{s}^{(1)}$ and the process is repeated with $\Gamma(X_{s}^{(1)})$ . At each step, the $1$ -strings are counted, the diagram is updated, and the $1\,0$ patterns are deleted, continuing until the path consists only of $h$ -strokes.

The key insights are that the number of $1$ strings is the same regardless of whether the environment is read from left to right or conversely, and that the number of $1$ -strings after $1\,0$ patterns are deleted is the same as the number of $1$ strings after $0\,1$ patterns are deleted. In the first case, each $1$ string either decreases in length by $1$ (possibly disappearing), or it merges with the string on its right. In the second, each string either decreases in length by $1$ or merges with the string on its left.

Now for any fixed $s\geq 0$ , $\vec{\Gamma}(X_{s})$ and $\vec{\Gamma}(X_{s+1})$ can be read off from $X_{s+1}$ by proceeding from right to left and from left to right, respectively. The update rule for the former is to count $1$ -strings and then delete $1\,0$ patterns, and the update rule for the latter is to count $1$ -strings and then delete $0\,1$ patterns. By the previous observations, both result in the same final Young diagram.

At this point, it remains only to show that soliton lengths are given by the column lengths of the Young diagram $\Lambda(X_{0})$ . To see that this is so, observe that the path $\Gamma(X_{\tau})$ , which corresponds to the first stable configuration, consists of a series of single peaks of nondecreasing height, each as tall as the length of the associated soliton. As each flattening step reduces the height of the peaks by $1$ , we see that the number of rows of $\Lambda(X_{\tau})$ having length at least $\ell$ corresponds to the number of solitons of length at least $\ell$ . Therefore, the columns of $\Lambda(X_{\tau})$ encode the soliton lengths, so the same is true of $\Lambda(X_{0})$ by invariance. ∎

A.2. Extracting column lengths with excursion operators

In this subsection, we prove Lemma 2.2. The key observation is that the hill-flattening and excursion operators commute on the space of Motzkin paths.

To begin, we need to establish a couple of technical results. First, for any interval $I\subseteq\mathbb{R}$ , recall that $C_{0}^{+}(I)$ denotes the space of continuous functions $I\rightarrow[0,\infty)$ with compact support. For any $f\in C_{0}^{+}(I)$ , we denote by $\textup{supp}^{+}(f)$ the open set $\{x\in I\,:\,f(x)>0\}$ , which is a finite disjoint union of open intervals. Accordingly, we may write $\textup{supp}^{+}(f)=\bigsqcup_{i=1}^{n}(c_{i},d_{i})$ , where $d_{i}<c_{j}$ if $i<j$ . We call $J_{i}:=(c_{i},d_{i})$ the $i^{\text{th}}$ excursion interval of $f$ . Recall that $\mathcal{I}(\Gamma)$ denotes the set of hill intervals of $\Gamma$ (see the beginning of Subsection 2.2).

Proposition A.2.

Fix a Motzkin path $\Gamma$ and let $x\in\mathbb{N}$ be contained in a hill interval $I_{x}$ of $\,\Gamma$ . Denote $\textup{supp}^{+}(\mathcal{E}_{x}(\Gamma))=\bigsqcup_{i=1}^{n}J_{i}$ as above. Then $\Gamma-\mathcal{E}_{x}(\Gamma)$ is constant on each $J_{i}$ . In addition, $\mathcal{I}(\mathcal{E}_{x}(\Gamma))=\mathcal{I}(\Gamma)\setminus\{I_{x}\}$ and $\max\mathcal{E}^{j-1}(\Gamma)\geq 1$ for all $1\leq j\leq\rho(\Gamma)$ .

Proof.

To establish the first part, write $M=\Gamma_{x}\geq 0$ , and define integers $a_{0}<a_{1}<\cdots<a_{M-1}<a_{M}=x=b_{M}<b_{M-1}<\cdots<b_{1}<b_{0}$ by

[TABLE]

for each $0\leq i\leq M$ . In words, they are the first locations where $\Gamma$ has height $i$ when moving to the left and right from $x$ ; see Figure 4. To simplify notation, we set $a_{-1}=0$ and $b_{-1}=\infty$ . Now $\Gamma_{y}-\mathcal{E}_{x}(\Gamma)_{y}=\min\big{\{}\Gamma_{z}\hskip 0.50003pt:\hskip 0.50003ptx\wedge y\leq z\leq x\vee y\big{\}}$ , so on $\mathbb{N}_{0}$

[TABLE]

It follows that $\mathcal{E}_{x}(\Gamma)$ vanishes at the $a_{i}$ ’s and $b_{i}$ ’s, and differs from $\Gamma$ by a constant on $(a_{M-1},b_{M-1})$ and each interval of the form $(a_{k-1},a_{k}]$ or $[b_{k},b_{k-1})$ , $0\leq k\leq M-1$ . $J_{i}$ is the $i^{\text{th}}$ such interval (from left to right) where $\mathcal{E}_{x}(\Gamma)$ is not constant. This shows the first part of the assertion.

The preceding argument also implies that $\mathcal{I}(\mathcal{E}_{x}(\Gamma))\subseteq\mathcal{I}(\Gamma)$ . In addition, $\mathcal{E}_{x}(\Gamma)=0$ on $[a_{M-1},b_{M-1}]$ and $I_{x}\subseteq(a_{M-1},b_{M-1})$ , so $I_{x}$ is not a hill interval of $\mathcal{E}_{x}(\Gamma)$ . Finally, the definition of the $a$ and $b$ terms ensures that if $J\in\mathcal{I}(\Gamma)\setminus\{I_{x}\}$ , then either $J\subseteq(a_{i-1},a_{i}]$ or $J\subseteq[b_{i},b_{i-1})$ for some $0\leq i\leq M-1$ . Since $\mathcal{E}_{x}(\Gamma)$ is a vertical translate of $\Gamma$ on these intervals, $J$ must be a hill interval of $\mathcal{E}_{x}(\Gamma)$ . This shows $\mathcal{I}(\mathcal{E}_{x}(\Gamma))=\mathcal{I}(\Gamma)\setminus\{I_{x}\}$ .

Lastly, taking $x=\mathtt{m}$ in the first part gives $\mathcal{I}(\mathcal{E}(\Gamma))=\mathcal{I}(\Gamma)\setminus\{I_{\mathtt{m}}\}$ , and the second part of the second assertion follows from the first since each application of $\mathcal{E}$ removes a single hill interval and the height of a Motzkin path is at least one while hill intervals remain. ∎

Proposition A.3.

For any interval $I\subseteq\mathbb{R}^{+}$ , $f\in C_{0}^{+}(I)$ , $x,y\in I$ , if $f$ is constant on the interval $[x,y]\subseteq I$ , then $\mathcal{E}_{x}(f)=\mathcal{E}_{y}(f)$ .

Proof.

Casing out according to whether $t<x$ , $x\leq t\leq y$ , or $t>y$ shows that

[TABLE]

Proposition A.4.

For any Motzkin path $\Gamma$ and any $x\in\mathbb{N}$ contained in a hill interval of $\Gamma$ , $\mathcal{E}_{x}\circ\mathcal{H}(\Gamma)=\mathcal{H}\circ\mathcal{E}_{x}(\Gamma)$ . In particular, $\mathcal{E}\circ\mathcal{H}(\Gamma)=\mathcal{H}\circ\mathcal{E}(\Gamma)$ .

Proof.

Let $\mathtt{m}=\mathtt{m}(\Gamma)$ and $\mathtt{m}^{\ast}=\mathtt{m}(\mathcal{H}(\Gamma))$ . Note that $\mathtt{m}<\mathtt{m}^{\ast}$ and that $\mathcal{H}(\Gamma)$ is constant on $[\mathtt{m},\mathtt{m}^{\ast}]$ . This holds for any Motzkin path $\Gamma$ . Thus by Proposition A.3 with $I=\mathbb{R}^{+}$ , it suffices to prove the first part. To this end, we first note that for any $k\in\mathbb{N}_{0}$ ,

[TABLE]

where $I_{x}$ denotes the hill interval of $\Gamma$ containing $x$ . Indeed, $\mathcal{H}(\Gamma)=\Gamma-1$ on $I_{x}$ , so the left-hand side is $1$ for all $k\in I_{x}$ . Now fix $k\notin I_{x}$ , and let $x_{\ast}$ be the location of the leftmost minimum of $\Gamma$ over the interval $[k\wedge x,k\vee x]$ . Then $x_{\ast}$ is an integer which is not contained in any hill interval of $\Gamma$ , so $\mathcal{H}(\Gamma)_{x_{\ast}}=\Gamma_{x_{\ast}}$ . Moreover, $x_{\ast}$ minimizes $\mathcal{H}(\Gamma)$ on $[k\wedge x,k\vee x]$ since the only integer points with $\mathcal{H}(\Gamma)_{y}<\Gamma_{y}$ are those contained in a hill interval of $\Gamma$ , in which case $\Gamma_{y}\geq\Gamma_{x_{\ast}}+1$ . This shows that the left-hand side is [math] for $k\notin I_{x}$ as desired.

In conjunction with Proposition A.2, we have

[TABLE]

∎

Now we prove Lemma 2.2.

Proof of Lemma 2.2.

Let $\Gamma$ be a Motzkin path and write $\lambda_{j}$ for the length of the $j^{\text{th}}$ column of $\Lambda(\Gamma)$ for each $1\leq j\leq\rho(\Gamma)$ . We show

[TABLE]

by induction on $\max\,\Gamma$ . If the maximum is zero, then the assertion is trivial, so we may assume that it holds for all Motzkin paths with maximum less than $M\in\mathbb{N}$ . Now fix a path $\Gamma$ with $\max\,\Gamma=M$ . The inductive hypothesis implies that the assertion holds for $\mathcal{H}(\Gamma)$ since it has maximum $M-1\geq 0$ . Moreover, $\Lambda(\mathcal{H}(\Gamma))$ is obtained by deleting the first row of $\Lambda(\Gamma)$ . Thus by Proposition A.4, we have

[TABLE]

where the final equality used the second part of Proposition A.2 to ensure $\max\mathcal{E}^{j-1}(\Gamma)\geq 1$ for any $1\leq j\leq\rho(\Gamma)$ . ∎

Remark A.5.

An easy modification of Proposition A.4 and applying the same proof of Lemma 2.2 shows that the excursion operator $\mathcal{E}=\mathcal{E}_{\mathtt{m}}$ in the statement of Lemma 2.2 could be replaced by $\mathcal{E}_{\mathtt{m}^{\ast}}$ , where the pivot $\mathtt{m}^{\ast}=\mathtt{m}^{\ast}(\Gamma)$ is chosen to be an arbitrary element in the set $\{x\geq 0\,:\,\Gamma(x)=\max\Gamma\}$ where the Motzkin path $\Gamma$ achieves its maximum. **

A.3. Regularity of the column length functionals

In this subsection we prove Lemma 2.3, establishing Lipschitz continuity of the ‘column length functionals’ $\max\mathcal{E}^{j-1}(\cdot)$ . The general strategy is to show that the column length functionals satisfy a Lipschitz condition on Motzkin paths and then extend the result to arbitrary functions in $C_{0}^{+}(\mathbb{R}^{+})$ by an approximation argument. We begin by establishing some preparatory results.

Proposition A.6.

**

(i)

Fix an interval $I\subseteq\mathbb{R}^{+}$ , a point $b\in I$ , and functions $f,g\in C_{0}^{+}(I)$ . Then

[TABLE]

(ii)

For any Motzkin paths $f,g\in C_{0}^{+}(\mathbb{R}^{+})$ ,

[TABLE]

Proof.

For (i), the triangle inequality gives

[TABLE]

since the minima of two functions over a given interval can differ by no more than their maximum difference over the interval.

For (ii), observe that the maximum distance between Motzkin paths is necessarily $\mathbb{N}_{0}$ -valued and the claim is clearly true if $f=g$ , so we may assume that $\left\|\mathcal{H}(f)-\mathcal{H}(g)\right\|_{\infty}\geq 1$ . Let

[TABLE]

and assume without loss of generality that $\mathcal{H}(f)_{x^{\ast}}>\mathcal{H}(g)_{x^{\ast}}$ . If $x^{\ast}$ is not in a hill interval of $g$ , then $g(x^{\ast})=\mathcal{H}(g)_{x^{\ast}}<\mathcal{H}(f)_{x^{\ast}}\leq f(x^{\ast})$ , so

[TABLE]

If $x^{\ast}$ is in a hill interval of both $f$ and $g$ , then

[TABLE]

Finally, suppose that $x^{\ast}$ is in a hill interval $[a,b]$ of $g$ but is not in any hill interval of $f$ . Then $g$ is constant on $[a,b]$ , so our choice of $x^{\ast}$ implies that $f(x^{\ast})\geq f(y)$ for all $y\in[a,b]$ . By considering whether or not $x^{\ast}<b$ , we see that we must have $f(x^{\ast}+1)=f(x^{\ast})-1$ . A similar consideration of whether $f(x^{\ast})=f(y)$ for all $a\leq y\leq x^{\ast}$ leads to the contradiction that $x^{\ast}$ is in a hill interval of $f$ . ∎

To state our next result, we say that a function $\varphi:\mathbb{R}\rightarrow\mathbb{R}$ is an affine scaling if $\varphi(x)=ax+b$ for some $a>0$ , $b\in\mathbb{R}$ . The set of all affine scalings forms a group under composition. Given $f\in C_{0}^{+}(\mathbb{R})$ and an affine scaling $\varphi$ , we write $\varphi^{\ast}(f)$ for the function $f\circ\varphi^{-1}$ . A function $\Gamma:\mathbb{R}\rightarrow\mathbb{R}^{+}$ is an extended Motzkin path if $\Gamma(n)=0$ for all $n\leq 0$ and $\Gamma|_{[0,\infty)}$ is a Motzkin path.

Proposition A.7.

For any $f_{1},f_{2}\in C_{0}^{+}(\mathbb{R})$ which are not identically zero and any $\varepsilon>0$ , there exist affine scalings $\varphi,\psi$ and extended Motzkin paths $\Gamma_{1},\Gamma_{2}$ such that $\psi(0)=0$ and for $i=1,2$ , the function $\bar{f_{i}}=\psi\circ\varphi^{\ast}(\Gamma_{i})\in C_{0}^{+}(\mathbb{R})$ satisfies

[TABLE]

Proof.

By hypothesis, $\mathtt{m}(f_{1}),\mathtt{m}(f_{2})\in(0,\infty)$ . Also, the $f_{i}$ ’s are uniformly continuous, so there is some $\delta>0$ such that $|x-y|<\delta$ implies $|f_{1}(x)-f_{1}(y)|,|f_{2}(x)-f_{2}(y)|<\varepsilon/4$ . Set $\mathtt{s}=|\mathtt{m}(f_{1})-\mathtt{m}(f_{2})|+\mathbf{1}\{\mathtt{m}(f_{1})=\mathtt{m}(f_{2})\}$ and choose $N$ large enough that $\Delta:=\mathtt{s}/2^{N}<\delta$ . Define the lattice

[TABLE]

Note that $\mathtt{m}(f_{1}),\mathtt{m}(f_{2})\in\mathfrak{L}$ . Set $a=2\Delta/\varepsilon$ , $\mathfrak{L}^{+}=\mathfrak{L}\cap[0,\infty)$ , and let $\ell_{0}$ denote the smallest element of $\mathfrak{L}^{+}$ . Observe that $0\leq\ell_{0}<\Delta$ by construction.

For $i=1,2$ , define the function $\gamma_{i}:\mathfrak{L}\rightarrow\mathfrak{L}^{+}$ by

[TABLE]

Note that $af_{i}$ changes by no more than $\Delta/2$ when the argument changes by no more than $\Delta$ . In conjunction with the fact that $f_{i}\equiv 0$ on $(-\infty,0]$ , $f_{i}\geq 0$ , and $\ell_{0}\in[0,\Delta)$ , this implies that $\gamma_{i}$ is an extended Motzkin path on $\mathfrak{L}$ . That is, $\gamma_{i}(\ell)=\ell_{0}$ for all $\ell\in\mathfrak{L}\cap(-\infty,\ell_{0}]$ and for each $\ell,\ell^{\prime}\in\mathfrak{L}$ with $|\ell-\ell^{\prime}|=\Delta$ , we have $\gamma_{i}(\ell)\geq\ell_{0}$ and $|\gamma_{i}(\ell)-\gamma_{i}(\ell^{\prime})|\in\{0,\Delta\}$ .

Let $\varphi(x)=\Delta\cdot x+\ell_{0}$ . Then $\varphi$ is an affine scaling which maps $\mathbb{Z}$ bijectively to $\mathfrak{L}$ . Also define the affine scaling $\sigma(x)=(x-\ell_{0})/a$ . By a slight abuse of notation, we will henceforth let $\gamma_{i}$ denote its extension to $\mathbb{R}$ by linear interpolation. Let $\Gamma_{i}\in C_{0}^{+}(\mathbb{R})$ be the extended Motzkin path defined by $\Gamma_{i}=\varphi^{-1}\circ\gamma_{i}\circ\varphi$ . Now define

[TABLE]

where $\psi(x)=\sigma\circ\varphi(x)=\frac{\varepsilon}{2}x$ . Then $\psi(0)=\sigma(\ell_{0})=0$ and $\mathtt{m}(\bar{f}_{i})=\mathtt{m}(\gamma_{i})=\mathtt{m}(f_{i})$ . For $x\in\mathfrak{L}$ , a direct computation gives $|f_{i}(x)-\bar{f}_{i}(x)|<\varepsilon/2$ . For $x\notin\mathfrak{L}$ , writing $\ell_{x}$ for the nearest lattice point to $x$ gives

[TABLE]

hence $\lVert f_{i}-\bar{f}_{i}\rVert_{\infty}<\varepsilon$ as desired. ∎

We are now ready to prove Lemma 2.3.

Proof of Lemma 2.3..

Fix $j\geq 1$ . To begin, we observe that it is enough to show the assertion for $I=\mathbb{R}$ . Indeed, for any $I\subseteq\mathbb{R}$ and any $h\in C_{0}^{+}(I)$ , we can define a function $\tilde{h}\in C_{0}^{+}(\mathbb{R})$ which equals $h$ on $I$ and drops linearly to zero on $[b,b+1]$ where $b$ is the rightmost boundary point of $I$ . This construction ensures that $\max\mathcal{E}^{j-1}(h)=\max\mathcal{E}^{j-1}(\tilde{h})$ and $\lVert h_{1}-h_{2}\rVert_{\infty}=\lVert\tilde{h}_{1}-\tilde{h}_{2}\rVert_{\infty}$ .

Next we show that the result holds if the graphs of $f$ and $g$ are (extended) Motzkin paths by induction on $m=\max\mathcal{E}^{j-1}(f)+\max\mathcal{E}^{j-1}(g)$ . The assertion is trivial when $j=1$ or $m=0$ . If $\max\mathcal{E}^{j-1}(f)\geq 1$ and $\max\mathcal{E}^{j-1}(g)=0$ , write $\mathtt{m}_{j}:=\mathtt{m}(\mathcal{E}^{j-1}(f))$ . Let $J=[a,b]$ be the excursion interval of $\mathcal{E}^{j-1}(\Gamma)$ which contains $\mathtt{m}_{j}$ . By Proposition A.2, $\Gamma-\mathcal{E}^{j-1}(\Gamma)$ is constant on the excursion intervals of $\mathcal{E}^{j-1}(\Gamma)$ . Hence we get

[TABLE]

As $f,g\geq 0$ , consideration of whether or not $g(\mathtt{m}_{j})\geq g(a)$ shows that

[TABLE]

By symmetry, the result also holds when $m\geq 1$ and $\max\mathcal{E}^{j-1}(f)=0$ , so we may assume that both $\max\mathcal{E}^{j-1}(f)$ and $\max\mathcal{E}^{j-1}(g)$ are at least 1. As the maxima are necessarily attained on hill intervals, Proposition A.4, the inductive hypothesis, and part (ii) of Proposition A.6 imply

[TABLE]

This completes the proof for Motzkin paths.

Now we show the assertion for $f,g\in C_{0}^{+}(\mathbb{R})$ by induction on $j\geq 1$ . The base case is tautological. For the inductive step, choose $\psi,\varphi,\Gamma_{1},\Gamma_{2},\bar{f}$ , $\bar{g}$ as in Proposition A.7 with $f_{1}=f,f_{2}=g$ . Then by the choice of $\bar{f}$ , Proposition A.4, the inductive hypothesis, and Proposition A.6 (i), we have

[TABLE]

and similarly for $g$ . Also, since $\psi(0)=0$ , the triangle inequality gives

[TABLE]

Lastly, observe that the functional $\max\mathcal{E}^{k}$ satisfies

[TABLE]

Thus in conjunction with the assertion for the Motzkin paths, we obtain

[TABLE]

Letting $\varepsilon\searrow 0$ completes the inductive step and the proof. ∎

A.4. Statistics of 312-avoiding permutations

In this subsection, we provide proofs of Propositions 8.3 and 8.1.

Recall that for each $n\geq 1$ and permutation $\tau\in\mathfrak{S}_{3}$ of length 3, we denote by $\mathfrak{S}_{n}^{\tau}$ the set of all $\tau$ -avoiding permutations of length $n$ . Also recall that $\text{Dyck}_{2n}$ denotes the set of all Dyck paths of length $2n$ . Note that a permutation $\sigma$ is $312$ -avoiding iff its inverse $\sigma^{-1}$ is $231$ -avoiding. Also, if we denote by

$\vec{\reflectbox{$ \sigma $}}$

the reversal of $\sigma$ obtained by reading $\sigma$ from right to left, then $\sigma$ is $231$ -avoiding iff its reversal

$\vec{\reflectbox{$ \sigma $}}$

is $132$ -avoiding.

There are a number of bijections between $\tau$ -avoiding permutations and Dyck paths in the literature. For instance, Krattenthaler [14] obtained a bijection $\mathfrak{S}_{n}^{132}\rightarrow\text{Dyck}_{2n}$ , and later Hoffman, Rizzolo, and Silvken [11] used a bijection $\text{Dyck}_{2n}\rightarrow\mathfrak{S}_{n}^{231}$ to study random $231$ -avoiding permutations in terms of random walks and Brownian excursions. In fact, the inverse of the latter bijection is the conjugation of the former by reversals of permutations and Dyck paths, where the reversal of a Dyck path is its left-right mirror image. In the forthcoming proof of Proposition 8.3, we will make use of the bijection $\text{Dyck}_{2n}\rightarrow\mathfrak{S}_{n}^{231}$ mentioned above, which we give below in a slightly more general version.

For a given $h$ -restricted Motzkin path $\Gamma$ , we define a permutation $\sigma(\Gamma)$ as follows: Let $v_{k}$ be the location of the $k^{\text{th}}$ upstroke of $\Gamma$ . (Thus if $\Gamma=\Gamma(X_{0})$ , then $v_{k}$ is the location of the $k^{\text{th}}$ ball in $X_{0}$ ). Then we define a $231$ -avoiding permutation $\sigma(\Gamma)$ by

[TABLE]

When restricted to Dyck paths, this map $\digamma\mapsto\sigma(\digamma)$ is shown to be a bijection between $\textup{Dyck}_{2n}$ and $\mathfrak{S}^{231}_{n}$ in [11, Thm. 4.3].

Remark A.8.

For a given rooted forest $\mathfrak{F}$ , a permutation $\sigma(\mathfrak{F})$ can be defined similarly: Let $v_{k}$ be the $k^{\text{th}}$ non-root node in $\mathfrak{F}$ according to the depth-first order and define

[TABLE]

Note that the maps (A.2) and (A.1) yield the same permutation for corresponding rooted forest and its contour process. Namely, let $\Gamma$ be the $h$ -restricted Motzkin path which is a contour process of $\mathfrak{F}$ . Then $\sigma(\Gamma)=\sigma(\mathfrak{F})$ ; see Figure A.3 for an illustration. **

Proof of Proposition 8.3..

We define a map $\varphi:\textup{Dyck}_{2n}\rightarrow\mathfrak{S}^{312}_{n}$ by

[TABLE]

where the first map is given by (A.1). As a composition of two bijections, $\varphi$ is a bijection from $\textup{Dyck}_{2n}$ to $\mathfrak{S}^{312}_{n}$ . This shows (i).

To show (ii), fix $\digamma\in\textup{Dyck}_{2n}$ and let $X_{0}$ be the box-ball configuration obtained from $\digamma$ by

[TABLE]

for all $i\geq 0$ . It then suffices to show that

[TABLE]

To this end, label the balls $1,\ldots,n$ from left to right, and recall the push-pop stack construction $X_{0}\mapsto\sigma(X_{0})$ described in Section 8. Fix a label $1\leq k\leq n$ . We are going to track the trajectory of ball $k$ during the push-pop stack construction. Using the notation from Equation (A.1), let ball $k$ be at site $v_{k}$ . Note that $\digamma_{v_{k}}$ equals the number of balls in the stack after ball $k$ is pushed onto it. Hence the number of balls which have been popped off in previous steps equals $k-\digamma_{v_{k}}$ . Next, while the stack sweeps sites to the right of $v_{k}$ , balls with larger labels will be pushed on and popped off until ball $k$ is finally deposited. This happens precisely when $\digamma$ first hits height $\digamma_{v_{k}}-1$ after location $v_{k}$ . Accordingly, the number of balls that are deposited during the period when ball $k$ is in the stack equals the height of the subexcursion of $\digamma$ started at $v_{k}$ , which equals to half of the duration of this excursion. Thus

[TABLE]

Therefore, $\sigma(\digamma)(k)$ , which is one more than the above quantity, is the position of $k$ in $\sigma(X_{0})$ as desired. ∎

Proof of Proposition 8.1..

Before we begin, recall the definition of the longest ‘leftmost’ increasing and ‘rightmost’ decreasing subsequences $\tau_{+}^{\ast}$ and $\tau_{-}^{\ast}$ given above the statement of Proposition 8.1.

We first show the assertion for $\tau_{+}^{\ast}$ . By induction on the length of the permutation, we suppose that the assertion holds for all $312$ -avoiding permutations of length less than $n$ for some $n\geq 3$ , and fix a $312$ -avoiding permutation $\tau$ of length $n$ . (The result is true by inspection when $n=3$ .) Using Proposition 8.3, choose a box-ball configuration $X_{0}$ and a Dyck path $\digamma$ such that $\tau=\sigma(X_{0})$ and $\digamma=\Gamma(X_{0})$ .

By Greene’s theorem ([10]), we know that the length of the first row of $\mathtt{RS}(\tau)$ equals the length of any longest increasing subsequence in $\tau$ . Since $\mathtt{RS}(\tau)=\Lambda(\digamma)$ , we see that the length of the longest increasing subsequence of $\tau$ equals the number of peaks in $\digamma$ .

Let $X_{0}^{\prime}$ be the box-ball configuration obtained from $X_{0}$ by deleting all $1\,0$ patterns from $X_{0}$ , as in the proof of Lemma 2.1, and let $\Gamma^{\prime}=\Gamma(X_{0}^{\prime})$ and $\tau^{\prime}=\sigma(X_{0}^{\prime})$ be the $h$ -restricted Motzkin path and $312$ -avoiding permutation constructed from $X_{0}^{\prime}$ (see the commutative diagram (A.4)). It is easy to see that $\Gamma^{\prime}$ can be directly obtained from $\Gamma$ by first applying the hill-flattening operator $\mathcal{H}$ and then contracting new $h$ -strokes which are not at height [math].

On the other hand, let $L$ be the number of $1\,0$ patterns in $X_{0}$ , which is the same as the number of peaks in $\Gamma$ . When reading $X_{0}$ from left to right, let $\ell_{i}$ be the label of the ball that corresponds to the ‘1’ in the $i^{\text{th}}$ $1\,0$ pattern. Then $\tilde{\tau}:=\ell_{1}\ell_{2}\cdots\ell_{L}$ is an increasing subsequence in $\sigma$ satisfying $\tau^{\prime}=\tau\setminus\tilde{\tau}$ . Moreover, Greene’s theorem shows that this is a longest increasing subsequence. By an easy induction argument, one sees that $\ell_{i+1}$ is the first number to the right in $\sigma$ that exceeds $\ell_{i}$ . Thus by definition, $\tilde{\tau}=\tau_{+}^{\ast}$ , is the ‘leftmost’ longest increasing subsequence in $\sigma$ . From the construction it is clear that $\tau^{\prime}=\tau\setminus\tau^{\ast}_{+}$ .

[TABLE]

To complete the argument, recall that $\Lambda(\Gamma^{\prime})$ is obtained from $\Lambda(\Gamma)$ by deleting the first row. Since $\mathtt{RS}(\tau)=\Lambda(\Gamma)$ and $\mathtt{RS}(\tau^{\prime})=\Lambda(\Gamma^{\prime})$ by Proposition 8.2, we have that $\mathtt{RS}(\tau^{\prime})$ is obtained from $\mathtt{RS}(\tau)$ by deleting its first row. Since $\tau^{\prime}$ can be obtained from $\tau$ by deleting a longest increasing subsequence, the inductive hypothesis applied to $\tau^{\prime}$ completes the proof.

Next, we show the assertion for the columns. Let $\tau$ , $X_{0}$ , $\Gamma$ be as before. To begin, observe that in the stack construction of $\tau$ from $X_{0}$ , every decreasing subsequence in $\tau$ is generated by the balls that occupy the stack at the same time. (For instance, in Figure 10, the decreasing subsequence $432$ in $\sigma=14632$ is generated by the balls in the stack on top of the ball of label 4.) Thus, every longest decreasing subsequence in $\tau$ is generated by the balls in the stack where $\Gamma$ achieves its maximum.

Let $\mathtt{m}^{\ast}$ be any location where $\Gamma$ attains its global maximum. During the stack operation to construct $\tau$ from $X_{0}$ , let $\bar{\tau}=\ell_{1}\ell_{2}\cdots\ell_{M}$ be the decreasing sequence consisting of the numbers in the stack after pushing all the balls over the interval $[1,\mathtt{m}^{\ast}]$ . This is a longest decreasing subsequence in $\tau$ . Denote $\tau^{\dagger}=\tau\setminus\tau^{\ast}_{-}$ . Let $X^{\dagger}_{0}$ be the box-ball configuration that is obtained by converting $1$ ’s that correspond to balls with labels in $\bar{\tau}$ to [math]’s. Then observe that $\sigma(X^{\dagger}_{0})=\tau^{\dagger}$ and $\Gamma^{\dagger}=\mathcal{E}_{\mathtt{m}^{\ast}}(\Gamma)$ , where $\mathcal{E}_{\mathtt{m}^{\ast}}$ is the excursion operator pivoted at location $\mathtt{m}^{\ast}$ instead of the rightmost one $\mathtt{m}$ . According to Lemma 2.2 and the following remark, $\Lambda(\mathcal{E}_{\mathtt{m}^{\ast}}(\Gamma))$ is obtained by deleting the first column of $\Lambda(\Gamma)$ . Since $\Lambda(\mathcal{E}_{\mathtt{m}^{\ast}}(\Gamma))=\mathtt{RS}(\tau)$ and $\Lambda(\Gamma)=\mathtt{RS}(\tau^{\dagger})$ , the assertion follows. ∎

Acknowledgments

We thank Karthik Karnik, Thomas Lam, Yuval Peres, Pavlo Pylyavskyy, and Mikaeel Yunus for inspiring conversations.

Bibliography26

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] David Aldous. The continuum random tree. III. Ann. Probab. , 21(1):248–289, 1993.
2[2] Louis H. Y. Chen, Larry Goldstein, and Qi-Man Shao. Normal approximation by Stein’s method . Probability and its Applications (New York). Springer, Heidelberg, 2011.
3[3] David Roxbee Cox. Renewal theory . Methuen & Co. Ltd., London; John Wiley & Sons, Inc., New York, 1962.
4[4] David A Croydon, Tsuyoshi Kato, Makiko Sasada, and Satoshi Tsujimoto. Dynamics of the box-ball system with random initial conditions via pitman’s transformation. ar Xiv preprint ar Xiv:1806.02147 , 2018.
5[5] Endre Csáki and Yueyun Hu. Lengths and heights of random walk excursions. In Discrete random walks (Paris, 2003) , Discrete Math. Theor. Comput. Sci. Proc., AC, pages 45–52. Assoc. Discrete Math. Theor. Comput. Sci., Nancy, 2003.
6[6] Michael Drmota. Stochastic analysis of tree-like data structures. Proc. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. , 460(2041):271–307, 2004. Stochastic analysis with applications to mathematical finance.
7[7] Rick Durrett. Probability: theory and examples . Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, Cambridge, fourth edition, 2010.
8[8] Paul Erdős and Alfred Rényi. On the evolution of random graphs. Magyar Tud. Akad. Mat. Kutató Int. Közl. , 5:17–61, 1960.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Double jump phase transition in a soliton cellular automaton

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

1.1. Related work

1.2. Notation

1.3. Main results

Theorem 1**.**

Theorem 2**.**

Theorem 3**.**

1.4. Outline and organization

2. Constructing the time-invariant Young Diagram

2.1. Motzkin paths

2.2. Hill-flattening and excursion operators

Lemma 2.1**.**

Lemma 2.2**.**

Lemma 2.3**.**

Remark 2.4** (Depth process with drains).**

2.3. Rooted forests

Proposition 2.5**.**

Proof.

3. Random box-ball system and Harris walk

3.1. Harris walks

3.2. Galton-Watson forests

Proposition 3.1**.**

Proof.

Corollary 3.2**.**

4. Asymptotics for the rows

Proof of Theorem 1 for i=1\boldsymbol{i=1}i=1.

Remark 4.1**.**

Lemma 4.2**.**

Proof of Theorem 1 for i≥1\boldsymbol{i\geq 1}i≥1.

Proposition 4.3**.**

Proof.

Proposition 4.4**.**

Proof.

Proof of Lemma 4.2.

5. Top soliton lengths in the subcritical regime

5.1. Overview and main results

Lemma 5.1**.**

Lemma 5.2**.**

Proof of Theorem

5.2. Excursion heights

Proposition 5.3**.**

Proof.

Proposition 5.4**.**

Proof.

Proof of Lemma

Remark 5.5**.**

5.3. Subexcursions within an excursion

Proof of Lemma

6. Top soliton lengths at criticality

Theorem 6.1** (Theorem 9 of [6]).**

Theorem 6.2**.**

Proof.

Proof of Theorem

Theorem 6.3**.**

Theorem 6.4**.**

Remark 6.5**.**

7. Top soliton lengths in the supercritical regime

7.1. Duality and proof of Theorem 2 (iii)

Lemma 7.1**.**

Proof of Theorem 2 (iii)..

7.2. Proof of Lemma 7.1

Proposition 7.2**.**

Proof.

Proposition 7.3**.**

Proof.

Proof of Lemma 7.1..

8. Random 312-avoiding permutations

Proposition 8.1**.**

Proposition 8.2**.**

Proposition 8.3**.**

Theorem 1.

Theorem 2.

Theorem 3.

Lemma 2.1.

Lemma 2.2.

Lemma 2.3.

Remark 2.4 (Depth process with drains).

Proposition 2.5.

Proposition 3.1.

Corollary 3.2.

Proof of Theorem 1 for $\boldsymbol{i=1}$ .

Remark 4.1.

Lemma 4.2.

Proof of Theorem 1 for $\boldsymbol{i\geq 1}$ .

Proposition 4.3.

Proposition 4.4.

Lemma 5.1.

Lemma 5.2.

Proposition 5.3.

Proposition 5.4.

Remark 5.5.

Theorem 6.1 (Theorem 9 of [6]).

Theorem 6.2.

Theorem 6.3.

Theorem 6.4.

Remark 6.5.

Lemma 7.1.

Proposition 7.2.

Proposition 7.3.

Proposition 8.1.

Proposition 8.2.

Proposition 8.3.

Proposition A.1.

Proposition A.2.

Proposition A.3.

Proposition A.4.

Remark A.5.

Proposition A.6.

Proposition A.7.

Remark A.8.