Box-ball system: soliton and tree decomposition of excursions

Pablo A Ferrari; Davide Gabrielli

arXiv:1906.06405·math.PR·May 5, 2020

Open Access

TL;DR

This paper reviews the combinatorial properties of solitons in the Box-Ball system, introduces a new tree-based soliton decomposition, and explores its probabilistic implications for random walk excursions and geometric branching processes.

Contribution

It proposes a novel soliton decomposition based on tree branch decomposition, linking combinatorial and probabilistic structures in the Box-Ball system.

Findings

01. Soliton decomposition corresponds to a branch decomposition of excursion trees.

02. Random walk excursions with Bernoulli-distributed configurations have independent geometric soliton vectors.

03. The branch decomposition shares properties with the soliton decomposition, leading to a geometric branching process.

Abstract

We review combinatorial properties of solitons of the Box-Ball system introduced by Takahashi and Satsuma in 1990. Starting with several definitions of the system, we describe ways to identify solitons and review a proof of the conservation of the solitons under the dynamics. Ferrari, Nguyen, Rolla and Wang 2018 proposed a soliton decomposition of a configuration into a family of vectors, one for each soliton size. Based on this decomposition, the authors have proposed a family of measures on the set of excursions which induces invariant distributions for the Box-Ball System. In this paper, we propose a new soliton decomposition which is equivalent to a branch decomposition of the tree associated to the excursion, see Le Gall 2005. A ball configuration distributed as independent Bernoulli variables of parameter $\lambda<1/2$ is in correspondence with a simple random walk with negative…

Equations

((() () (() ()) () () ()) (()))

$$\xi(i)-\xi(i-1)=2\eta(i)-1$$

$$|\mathcal{E}_{n}|=\frac{1}{n+1}\binom{2n}{n};$$

$$r(0,\xi),\qquad r(k+1,\xi)-r(k,\xi)$$

$$\xi(r(k,\xi)+i)=-k+\varepsilon_{k}(i),\quad i\in\{0,\dots,2n(\varepsilon_{k})\},\ k\in\mathbb{Z}.$$

$$\varepsilon_{k}(i)=\xi(r(k,\xi)+i)-(-k),\quad i\in\{0,1,\dots,r(k+1,\xi)-r(k,\xi)-1\}.$$

$$\mathcal{X}_{a}:=\Big\{\eta\in\{0,1\}^{\mathbb{Z}}:\lim_{n\to\pm\infty}\frac{\sum_{j=0}^{n}\eta(j)}{n+1}=a\Big\},\quad a\in[0,1],$$

$$\underbrace{\bullet\bullet\cdots\bullet}_{|i|\ \text{balls}}\ \underbrace{\circ\circ\cdots\circ}_{|i|\ \text{empty boxes}}$$

$$T\eta(x)=\begin{cases}0&\text{if } x \text{ is a record of } W\eta\\ 1-\eta(x)&\text{otherwise}.\end{cases}$$

$$T\xi(x)=\min_{y\leq x}\xi(y)-\Big[\xi(x)-\min_{y\leq x}\xi(y)\Big],$$

$$x_{k}(i):=\#\{k\text{-solitons attached to } k\text{-slot number } i\}.$$

$$x_{k}=(x_{k}(0),\dots,x_{k}(s_{k}-1))\in\mathbb{N}^{s_{k}}$$

$$s_{k}=s_{k}(x)=1+2\sum_{i=k+1}^{M}(i-k)\,n_{i}.$$

$$s_{k}(i)=\begin{cases}s_{k}^{\diamond}(i)+k&\text{if } s_{k}(i) \text{ belongs to the head of a soliton}^{\diamond};\\ s_{k}^{\diamond}(i)&\text{if } s_{k}(i) \text{ belongs to the tail of a soliton}^{\diamond}.\end{cases}$$

$$r_{i}(\eta)=r_{i}(T\eta)\ \text{ for any } i.$$

$$r_{i}(\eta)=r_{i}^{*}(T\eta),\quad\forall i.$$

$$r_{i}(\eta)=r_{i}^{*}(\eta),\quad\forall i.$$

$$\yng(8,2,1,1).$$

$$r_{i}=\sum_{m=i}^{M}n_{m},\qquad n_{i}=r_{i}-r_{i+1}$$

$$\yng(1,1,1,1)\ \yng(1,1)\ \yng(1)\ \yng(1)\ \yng(1)\ \yng(1)\ \yng(1)\ \yng(1)$$

$$\yng(3,1,1)\ \yng(2)\ \yng(1,1)$$

$$\yng(1,1,1)\ \yng(1)\ \yng(1)$$

$$\yng(6,2,1)$$

$$\yng(4,3,1,1).$$

Taxonomy

Topics: Stochastic processes and statistical mechanics · Random Matrices and Applications · Bayesian Methods and Mixture Models

Full text

Box-ball system: soliton and tree decomposition of excursions

Pablo A. Ferrari

Universidad de Buenos Aires

Davide Gabrielli

Università di L’Aquila

Abstract

We review combinatorial properties of solitons of the Box-Ball system introduced by Takahashi and Satsuma in 1990 [17]. Starting with several definitions of the system, we describe ways to identify solitons and review a proof of the conservation of the solitons under the dynamics. Ferrari, Nguyen, Rolla and Wang 2018 [7] proposed a soliton decomposition of a configuration into a family of vectors, one for each soliton size. Based on this decomposition, the authors [6] propose a family of measures on the set of excursions which induces invariant distributions for the Box-Ball System. Furthermore, we propose a new soliton decomposition which is equivalent to a branch decomposition of the tree associated to the excursion, see Le Gall [12]. A ball configuration distributed as independent Bernoulli variables of parameter $\lambda<1/2$ is in correspondence with a simple random walk with negative drift $2\lambda-1$ and infinitely many excursions over the local minima. In this case the soliton decomposition of the walk consists of independent doubly infinite vectors of iid geometric random variables [6]. We show that this property is shared by the branch decomposition of the excursion trees of the random walk and discuss a corresponding construction of a Geometric branching process with independent but not identically distributed Geometric random variables.

Keywords: Box-Ball system, solitons, excursions, planar trees.

AMS 2010 Subject Classification: 37B15, 37K40, 60C05, 82C23.

1 Introduction
2 Preliminaries and notation
2.1 Box-Ball System
2.2 Walk representation and excursions
2.3 BBS with infinitely many balls and on the ring
3 Conserved quantities and solitons
3.1 Runs
3.2 Takahashi-Satsuma soliton decomposition
3.3 Slot diagrams
3.4 Head-Tail soliton decomposition
3.5 Attaching solitons
3.6 Conserved quantities
3.7 Young diagrams
4 Trees, excursions and slot diagrams
4.1 Tree representation of excursions
4.2 Trees and pairing algorithm
4.3 Branch identification of planar trees
4.4 Tree-induced soliton decomposition of excursions
4.5 Slot diagrams of planar trees
4.6 From paths to trees
5 Soliton distribution
5.1 A distribution on the set of excursions
5.2 Branch distribution of the random walk excursion tree
5.2.1 Geometric branching processes
5.3 Soliton decomposition of product measures in $\{0,1\}^{\mathbb{Z}}$
Acknowledgments

1 Introduction

The Box-Ball System (BBS) is a cellular automaton introduced by Takahashi and Satsuma [17] describing the deterministic evolution of a finite number of balls on the infinite lattice $\mathbb{Z}$. A ball configuration $\eta$ is an element of $\{0,1\}^{\mathbb{Z}}$, where $\eta(i)=1$ indicates that there is a ball at box $i\in\mathbb{Z}$. For ball configurations with a finite number of balls, the dynamics is as follows. A carrier starts with zero load to the left of the occupied boxes, visits the boxes successively from left to right and at each box proceeds as follows: (a) if the box is occupied, the carrier increases its load by one and the box becomes empty; (b) if the box is empty and the carrier load is positive, the carrier decreases its load by one and the box becomes occupied. This mechanism is illustrated with an example in Figure 1.

In §2.1 we describe alternative equivalent descriptions of the dynamics. We denote by $T\eta$ the configuration obtained after the carrier has visited all boxes in $\eta$, and by $T^{t}\eta$ the configuration obtained after iterating this procedure $t$ times, for a positive integer $t$. The dynamics can be defined for suitable configurations with infinitely many balls satisfying that “there are more empty boxes than occupied boxes”, and it conserves the set of configurations with density of balls less than $1/2$ [7]; see details in §2.1.

The main motivation of [17] was to identify objects conserved by the dynamics that they called basic sequences, later called solitons by [14]; we follow this nomenclature. The Box-Ball system has been proposed as a discrete model with the same behavior as the Korteweg-de Vries equation [18], an integrable partial differential equation having solitonic behavior.

Given a ball configuration $\eta$, a $k$-soliton consists of $k$ occupied boxes denoted $\mathtt{h}_{1},\dots,\mathtt{h}_{k}\in\mathbb{Z}$ and $k$ empty boxes denoted $\mathtt{t}_{1},\dots,\mathtt{t}_{k}\in\mathbb{Z}$. Takahashi and Satsuma showed that if $\eta$ has a finite number of occupied boxes, then every occupied box belongs to some soliton, and proposed an algorithm to identify solitons. We explain their algorithm in detail in §3.2; for the moment we give the simplest example of a $k$-soliton. Let $\eta$ have only $k$ balls occupying $k$ successive boxes; then the (only) $k$-soliton of $\eta$ consists of the $k$ successive occupied boxes $\mathtt{h}_{1},\dots,\mathtt{h}_{k}$ and the $k$ successive empty boxes $\mathtt{t}_{1},\dots,\mathtt{t}_{k}\in\mathbb{Z}$ given by $\mathtt{t}_{j}=\mathtt{h}_{j}+k$.

For the $\eta$ just described, the configuration $T\eta$ has balls in boxes $\mathtt{t}_{1},\dots,\mathtt{t}_{k}$ and no balls in the other boxes, implying that $\eta^{\prime}=T\eta$ has a $k$-soliton consisting of occupied boxes $\mathtt{h}_{1}^{\prime},\dots,\mathtt{h}_{k}^{\prime}$ with $\mathtt{h}_{j}^{\prime}=\mathtt{t}_{j}$ and empty boxes $\mathtt{t}_{j}^{\prime}=\mathtt{t}_{j}+k$. Hence, in one step, an isolated $k$-soliton preserves its shape and moves $k$ steps forward. Iterating the evolution $t$ times, we conclude that, if there are no other balls in the system, a $k$-soliton moves $kt$ boxes forward, that is, it travels at speed $k$. Since solitons of different sizes $k$ have different speeds, they “collide”, that is, the order and the positions of the $\mathtt{h}_{i}$ and $\mathtt{t}_{i}$ of each soliton change. In fact, solitons can be identified for configurations with a finite number of balls [17] and for suitable configurations $\eta$ with infinitely many balls [7]; moreover, solitons are conserved by the dynamics [17, 7]; we explain this in detail in §3. Given a suitable configuration $\eta$, a $k$-soliton always consists of $k$ occupied boxes and $k$ empty boxes, but they are not necessarily consecutive and the empty boxes of the soliton may precede the occupied ones. In any case, different solitons occupy disjoint sets of boxes. The trajectory of each soliton can be identified along time [7]. When the distribution of the initial ball configuration is translation invariant and invariant for the dynamics, the asymptotic soliton speeds satisfy a system of linear equations [7], which is a feature of several other integrable systems [1].

A ball configuration can be mapped to a walk indexed by boxes that jumps one unit up at occupied boxes and one unit down at empty boxes. If the configuration has density less than $\frac{1}{2}$, then the walk has down records and finite excursions consisting of the pieces of configuration between two consecutive records. A ball configuration can be codified as a set of infinite vectors, based on the concept of slots [7]. Given a ball configuration with identified solitons, there are boxes called $k$-slots satisfying that any $k$-soliton is strictly in between two successive $k$-slots; in this case we say that the $k$-soliton is attached to the left $k$-slot. To be more precise, the set of $k$-slots of $\eta$ consists of the records and the boxes of the form $\mathtt{h}_{j}$ or $\mathtt{t}_{j}$, for any $j>k$, belonging to any bigger soliton. Taking a ball configuration with a record at the origin, the $k$-slots are enumerated and then, for each integer $i$, the $i$-th coordinate of the $k$-component of the configuration is the number of $k$-solitons attached to the $i$-th $k$-slot. The obtained components can be composed again to recover the initial configuration $\eta$. A large part of the paper is dedicated to a complete explanation of these constructions.

It is useful to perform the decomposition in each excursion to obtain what [6] calls a slot diagram, a combinatorial object that encodes the structure of the excursion. We also discuss the relationship between the soliton decomposition of an excursion and other combinatorial objects such as the excursion tree [10, 4, 12, 5, 13], Catalan numbers [5] and Dyck and Motzkin paths [5, 14].

A notable property proven by [7] is that the $k$ -component of the configuration $T\eta$ is a shift of the $k$ -component of $\eta$ , the amount shifted depending on the $m$ -components for $m>k$ . As a consequence, [7] prove that measures with independent and translation invariant soliton components are invariant for the dynamics. In [6] a special class of these measures is studied in detail. The papers [2, 3] show families of invariant measures for the BBS based on reversible Markov chains on $\{0,1\}$ .

The Box-Ball system is closely related to several remarkable combinatorial constructions (see for example [8, 9, 14, 15, 19, 20]); we illustrate some of them. Sometimes, instead of giving formal proofs and detailed descriptions, we adopt a more informal point of view and illustrate the different constructions through explanatory examples.

The paper is organized as follows.

In §2 we fix the notation and review several different equivalent definitions of the dynamics. We start by considering the simple case of a finite number of balls. Then, following [7], we introduce the walk representation and give a definition of the dynamics in the general case for configurations of balls whose walk representation can be cut into infinitely many finite excursions.

In §3 we discuss some conserved quantities of the dynamics, the identification of the solitons, a codification of the conserved quantities in terms of Young diagrams and define the slot diagrams from [6].

In §4 we recall the construction of the excursion tree and propose a new soliton decomposition of the excursion based on the tree. We introduce a branch decomposition of the tree and conclude that its slot diagram coincides with the one discussed in §3. The contents of this section are new.

In §5 we review results from [6] related to the distribution of the soliton decomposition of excursions. In particular, the soliton decomposition of a simple random walk consists of independent doubly infinite vectors of iid geometric random variables. We also discuss an application to branching processes.

2 Preliminaries and notation

2.1 Box-Ball System

The Box-Ball System (BBS) [17] is a discrete-time cellular automaton. We start by considering a finite number of balls evolving on the infinite lattice $\mathbb{Z}$. The elements of $\mathbb{Z}$ are called boxes. A configuration of balls is codified by $\eta\in\{0,1\}^{\mathbb{Z}}$, that is, by a doubly infinite sequence of $1$'s and $0$'s, corresponding respectively to the boxes occupied by balls and the empty boxes. Pictorially, a ball will be denoted by $\bullet$ and an empty box by $\circ$.

There are several equivalent ways of defining the evolution. We denote by $T:\{0,1\}^{\mathbb{Z}}\to\{0,1\}^{\mathbb{Z}}$ the operator defining the evolution in one single step. This means that the configuration $\eta$ evolves in a single step into the configuration $T\eta$. In the following definitions we consider configurations having only a finite number of $1$'s.

The equivalence among all the definitions is simple. The different definitions are however related to different classic combinatorial constructions and illustrate the evolution from different perspectives.

First definition: We define the dynamics through a pairing between the balls and some empty boxes. Consider a ball configuration $\eta$ containing only a finite number of balls. The evolution is defined iteratively. At the first step we consider the balls that have an empty box in the nearest neighbor lattice site to the right, that is, local configurations of the type $\bullet\circ$, and we pair the two boxes by drawing a line. Remove all the pairs created and continue following the same rule with the configuration obtained after the deletion of the paired boxes. This procedure will stop after a finite number of iterations because there are only a finite number of balls. See Fig. 2, where we assumed that there are no balls outside the window and the lines connect balls with the corresponding paired empty boxes. The evolved configuration of balls, denoted $T\eta$, is obtained by transporting every ball along the lines to the corresponding paired empty box. Note that the lines pairing balls and empty boxes can be drawn without intersections in the upper half plane.
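The pairing rule can be sketched in a few lines of Python. This is only an illustration under our own assumptions: the configuration is a finite window given as a list of 0/1 with enough empty boxes on the right, and the name `pairing_step` is hypothetical, not notation from the paper.

```python
def pairing_step(eta):
    """Apply T through the pairing rule on a finite window (list of 0/1).

    In each round, every ball whose right neighbour (among the boxes not
    yet paired) is empty gets paired with it; paired boxes are then
    removed and the rule is repeated. Returns (T_eta, pairs), where
    pairs is a list of (ball_box, empty_box) index pairs.
    """
    remaining = list(range(len(eta)))      # indices of boxes not yet paired
    pairs = []
    while True:
        new_pairs, keep = [], []
        j = 0
        while j < len(remaining):
            a = remaining[j]
            if (j + 1 < len(remaining)
                    and eta[a] == 1 and eta[remaining[j + 1]] == 0):
                new_pairs.append((a, remaining[j + 1]))
                j += 2                     # both boxes leave the configuration
            else:
                keep.append(a)
                j += 1
        if not new_pairs:
            break
        pairs.extend(new_pairs)
        remaining = keep
    if any(eta[a] == 1 for a in remaining):
        raise ValueError("window too short: some ball is left unpaired")
    out = [0] * len(eta)
    for _, empty_box in pairs:             # transport each ball along its line
        out[empty_box] = 1
    return out, pairs


eta = [0, 1, 1, 0, 1, 0, 0, 0]
t_eta, pairs = pairing_step(eta)
```

For this small window the first round pairs boxes (2, 3) and (4, 5), and the second round pairs (1, 6), so the lines are nested and do not cross.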

Second definition [17]: This is the original definition of the model. Consider an empty carrier that starts to the left of the leftmost ball and visits the boxes one after another moving from left to right. The carrier can transport an arbitrarily large number of balls. When visiting box $i$, the carrier picks up the ball if $\eta(i)=1$; the number of balls transported by the carrier therefore increases by one and site $i$ is updated to be empty: $T\eta(i)=0$. If instead $\eta(i)=0$ and the carrier contains at least one ball, then it deposits one ball in the box, giving $T\eta(i)=1$. After visiting a finite number of boxes the carrier will remain empty and will no longer change the configuration, see Figure 1. The final configuration $T\eta$ is the same as the one obtained by the previous construction.
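A minimal sketch of the carrier rule, assuming a finite window encoded as a list of 0/1 that is long enough for the carrier to end empty. The function name `carrier_step` and the returned load history are our own illustrative choices.

```python
def carrier_step(eta):
    """One BBS step with the carrier moving from left to right.

    Returns (T_eta, loads), where loads[i] is the carrier load right
    after visiting box i.
    """
    load = 0
    out, loads = [], []
    for box in eta:
        if box == 1:            # occupied box: pick the ball up
            load += 1
            out.append(0)
        elif load > 0:          # empty box, loaded carrier: drop one ball
            load -= 1
            out.append(1)
        else:                   # empty box, empty carrier: nothing happens
            out.append(0)
        loads.append(load)
    if load != 0:
        raise ValueError("window too short: carrier still holds balls")
    return out, loads


# Three adjacent balls: the block moves three boxes to the right.
eta = [0, 1, 1, 1, 0, 0, 0, 0, 0, 0]
t_eta, loads = carrier_step(eta)
```

In this example the load rises to 3 over the occupied boxes and returns to 0 over the next three empty boxes, after which the carrier no longer changes anything.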

Third definition: Dyck words (after Walther von Dyck). Substitute any ball with an open parenthesis and any empty box with a closed one. The sequence of Fig. 1 becomes for example

((() () (() ()) () () ()) (()))

and outside this window there are only closing parentheses. According to the usual algebraic rules we can pair any open parenthesis with the corresponding closing one. Recalling that open parentheses correspond to balls, we move each ball from the position of the open parenthesis to the position of the corresponding closing one.
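The parenthesis matching can be sketched with a stack. The helper names `to_parentheses` and `dyck_step` and the finite-window encoding (a list of 0/1) are assumptions made for this illustration.

```python
def to_parentheses(eta):
    """Ball -> '(' and empty box -> ')'."""
    return "".join("(" if b == 1 else ")" for b in eta)


def dyck_step(eta):
    """Apply T by matching parentheses on a finite window.

    Every '(' is matched with its closing ')' and the ball is moved to
    the position of that ')'. Unmatched ')' are boxes that stay empty.
    """
    word = to_parentheses(eta)
    stack = []                  # positions of currently unmatched '('
    out = [0] * len(eta)
    for i, ch in enumerate(word):
        if ch == "(":
            stack.append(i)
        elif stack:             # this ')' closes the most recent '('
            stack.pop()
            out[i] = 1
    if stack:
        raise ValueError("window too short: some '(' is never closed")
    return out


eta = [0, 1, 1, 0, 1, 0, 0, 0]
word = to_parentheses(eta)
t_eta = dyck_step(eta)
```

Here `word` is `")(()()))"`: the first and the last parenthesis are unmatched closing ones, and the three matched closing parentheses give the new ball positions.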

Fourth definition: As a first step we duplicate each ball. After this operation each occupied box contains exactly 2 balls: the original one and its clone. We select an arbitrary occupied box and move the cloned ball to the first empty box to the right. Then we select, again arbitrarily, another box containing two balls and do the same. We continue in an arbitrary order until there are no more boxes containing more than one ball. At this point we remove the original balls and keep just the cloned ones. The configuration of balls that we obtain does not depend on the arbitrary order that we followed and coincides with $T\eta$.

Fifth definition: Start from the leftmost ball and move it to the nearest empty box to its right. Then do the same with the second leftmost ball (according to the original order). Proceed in this way until every ball has been moved exactly once. This is a particular case of the fourth definition: it corresponds to moving the balls in the order given by their initial positions.
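The order independence claimed in the fourth definition can be checked numerically on a small example. The sketch below is ours: `clone_step` is a hypothetical name, the window is a finite list of 0/1, and the default order (left to right) corresponds to the fifth definition.

```python
import random


def clone_step(eta, order=None):
    """Apply T by releasing one clone per ball, in the given order.

    order lists the occupied boxes in the order in which their clones
    are moved; by default balls are processed from left to right.
    """
    n = len(eta)
    originals = [i for i, b in enumerate(eta) if b == 1]
    if order is None:
        order = originals
    if sorted(order) != originals:
        raise ValueError("order must list every occupied box exactly once")
    occupied = set(originals)   # boxes holding at least one ball
    clones = set()
    for src in order:
        j = src + 1
        while j < n and j in occupied:
            j += 1              # skip boxes that already hold a ball
        if j >= n:
            raise ValueError("window too short")
        occupied.add(j)
        clones.add(j)
    # Remove the originals and keep only the clones.
    return [1 if i in clones else 0 for i in range(n)]


eta = [0, 1, 1, 0, 1, 0, 0, 0, 1, 1, 1, 0, 0, 0, 0, 0]
reference = clone_step(eta)     # left-to-right order (fifth definition)

rng = random.Random(0)
balls = [i for i, b in enumerate(eta) if b == 1]
all_equal = True
for _ in range(200):
    order = balls[:]
    rng.shuffle(order)
    if clone_step(eta, order) != reference:
        all_equal = False
```

A random test of this kind is of course not a proof; it only illustrates the statement on one configuration.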

Our viewpoint will be to consider all the balls indistinguishable and from this perspective all the above definitions are equivalent. If we are instead interested in the motion of a tagged ball then we can have different evolutions according to the different definitions given above.

The construction can be naturally generalized to a class of configurations with infinitely many balls or to configurations of balls on a ring. This can be done under suitable assumptions on the configuration $\eta$ [2, 7]. We will briefly discuss this issue following the approach of [7], but to do this we need some notation and definitions.

2.2 Walk representation and excursions

A function $\xi:\mathbb{Z}\to\mathbb{Z}$ satisfying $|\xi(i)-\xi(i-1)|=1$ is called a walk. We map a ball configuration $\eta$ to a walk $\xi=W\eta$ defined up to a global additive constant by

$$\xi(i)-\xi(i-1)=2\eta(i)-1.\tag{1}$$

The constant is fixed for example by choosing $\xi(0)=0$ . Essentially the map between ball configurations and walks is fixed by the correspondence $\bullet\longleftrightarrow\diagup$ and $\circ\longleftrightarrow\diagdown$ , where $\bullet$ represents a ball, $\circ$ an empty box and $\diagup,\diagdown$ pieces of walk to be glued together continuously. The map $W$ is invertible (when the additive constant is fixed) and the configuration of balls $\eta=W^{-1}\xi$ can be recovered using (1). We remark that there are several walks that are projected to the same configuration of balls and all of them differ by a global additive constant. This means that $W$ is a bijection only if the arbitrary additive constant is fixed and this will be always done in such a way that $\xi(0)=0$ .
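For a finite configuration $\eta(1),\dots,\eta(m)$ the map $W$ and its inverse amount to a cumulative sum and a difference, using the increment rule $\xi(i)-\xi(i-1)=2\eta(i)-1$ with $\xi(0)=0$. The function names below are illustrative assumptions.

```python
from itertools import accumulate


def walk_from_config(eta):
    """W: list of 0/1 of length m -> walk heights xi(0), ..., xi(m), xi(0) = 0."""
    return list(accumulate((2 * b - 1 for b in eta), initial=0))


def config_from_walk(xi):
    """W^{-1}: recover eta(i) = (xi(i) - xi(i-1) + 1) / 2 for i = 1, ..., m."""
    return [(xi[i] - xi[i - 1] + 1) // 2 for i in range(1, len(xi))]


eta = [1, 1, 0, 1, 0, 0, 0]
xi = walk_from_config(eta)
back = config_from_walk(xi)
```

Adding a constant to `xi` leaves `config_from_walk(xi)` unchanged, which is the finite-window version of the remark that walks differing by a global constant project to the same configuration.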

We call $i\in\mathbb{Z}$ a (minimum) record for the walk $\xi$ if $\xi(i)<\xi(i^{\prime})$ for any $i^{\prime}<i$. The hitting time of $-j$ for the walk $\xi$ is a record denoted $r(j,\xi)$. We call excursion of a walk the piece of trajectory between two successive records. A pictorial perspective on the decomposition of the walk into records and disjoint excursions is the following. Think of the walk as a physical profile and imagine that the sun is rising on the left, so that the light comes horizontally from the left. The parts of the profile that are illuminated correspond to the records, while the disjoint parts in the shadow are the different excursions.
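On a finite piece of walk the records can be found with a running minimum. In the sketch below the names `records` and `hitting_time` are ours, and we adopt the convention (an assumption needed only because the window is finite) that the first point of the window counts as a record.

```python
def records(xi):
    """Indices i with xi[i] strictly below every earlier value in the window."""
    recs, low = [], None
    for i, level in enumerate(xi):
        if low is None or level < low:
            recs.append(i)
            low = level
    return recs


def hitting_time(xi, j):
    """First index at which the walk reaches level -j, or None if it never does."""
    for i, level in enumerate(xi):
        if level == -j:
            return i
    return None


xi = [0, 1, 2, 1, 2, 1, 0, -1, -2, -1, -2, -3]
recs = records(xi)                      # the "illuminated" points
hits = [hitting_time(xi, j) for j in range(len(recs))]
```

For a walk started at 0 the two lists coincide: record number $j$ is the hitting time of level $-j$.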

We call a finite walk a finite trajectory of a random walk. More precisely, a finite walk $\xi=(\xi(i))_{i\in[0,k]}$, $k\in\mathbb{N}$, is an element of $\mathbb{Z}^{[0,k]}$ such that $|\xi(i)-\xi(i-1)|=1$. Again we always fix $\xi(0)=0$ and, as before, there is a bijection $W$ between finite walks and finite configurations of balls, i.e. elements $\eta\in\{0,1\}^{k}$ for some $k\in\mathbb{N}$. We use the same notation $\xi$ for finite and infinite walks and $\eta$ for finite and infinite configurations of balls. It will be clear from the context whether the walk/configuration is finite or infinite.

We introduce the set $\mathcal{E}$ of finite soft excursions between records 0 and 1. An element $\varepsilon\in\mathcal{E}$ is a finite walk that starts and ends at zero, it is always non-negative and it has length $2n(\varepsilon)$ . More precisely $\varepsilon=\Big{(}\varepsilon(0),\dots,\varepsilon(2n(\varepsilon))\Big{)}$ with the constraints $|\varepsilon(i)-\varepsilon(i-1)|=1$ , $\varepsilon(i)\geq 0$ and $\varepsilon(0)=\varepsilon(2n(\varepsilon))=0$ . The empty excursion $\emptyset$ is also an element of $\mathcal{E}$ with $n(\emptyset)=0$ . We call $\mathcal{E}_{n}$ the set of soft finite excursions of length $2n$ so that $\mathcal{E}=\cup_{n=0}^{+\infty}\mathcal{E}_{n}$ .

Using the same correspondence as before between walks and configurations of balls we can associate a finite configuration of balls $\big{(}\eta(1),\dots,\eta(2n(\varepsilon))\big{)}=W^{-1}\varepsilon$ to the finite excursion $\varepsilon$. If $\eta=W^{-1}\varepsilon$, then we have $\sum_{i=1}^{2n(\varepsilon)}(2\eta(i)-1)=0$, but obviously not every configuration of balls satisfying this constraint generates a soft excursion by the transformation $W$. It is well known [16] that the number of excursions of length $2n$ is given by

$$|\mathcal{E}_{n}|=\frac{1}{n+1}\binom{2n}{n};\tag{2}$$

the right hand side is the Catalan number $C_{n}$ .
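The Catalan count can be confirmed by brute force for small $n$: enumerate all $\pm 1$ step sequences of length $2n$ and keep those that stay non-negative and return to zero. The helper names are illustrative assumptions.

```python
from itertools import product
from math import comb


def is_soft_excursion(steps):
    """True if the partial sums stay >= 0 and the total is 0."""
    level = 0
    for s in steps:
        level += s
        if level < 0:
            return False
    return level == 0


def count_excursions(n):
    """Number of soft excursions of length 2n, by exhaustive enumeration."""
    return sum(is_soft_excursion(steps)
               for steps in product((1, -1), repeat=2 * n))


counts = [count_excursions(n) for n in range(7)]
catalan = [comb(2 * n, n) // (n + 1) for n in range(7)]
```

The enumeration visits $4^{n}$ sequences, so it is only meant as a check for small $n$; the case $n=0$ counts the empty excursion.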

We denote by $\mathcal{E}^{o}\subset\mathcal{E}$ the set of strict excursions. An element $\varepsilon\in\mathcal{E}^{o}$ is an excursion that satisfies the strict inequality $\varepsilon(i)>0$ when $i\neq 0,2n(\varepsilon)$ . Likewise we call $\mathcal{E}^{o}_{n}$ the strict excursions of length $2n$ .

There is a simple bijection between $\mathcal{E}_{n}$ and $\mathcal{E}^{o}_{n+1}$ . This is obtained by considering an element $\varepsilon\in\mathcal{E}_{n}$ and adding a $\diagup$ at the beginning and a $\diagdown$ at the end. The result is an element of $\mathcal{E}^{o}_{n+1}$ . The converse map is obtained removing a $\diagup$ at the beginning and a $\diagdown$ at the end of an element of $\mathcal{E}^{o}_{n+1}$ obtaining an element of $\mathcal{E}_{n}$ . This can be easily shown to be a bijection. In particular we deduce by (2) that $|\mathcal{E}^{o}_{n}|=\frac{1}{n}\binom{2(n-1)}{n-1}$ .
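The bijection can be written directly on height sequences: lift the soft excursion by one and add a zero at each end. The names `wrap`, `unwrap` and `is_strict` are hypothetical, and an excursion is encoded as the list of its heights $\varepsilon(0),\dots,\varepsilon(2n)$.

```python
def wrap(excursion):
    """Soft excursion of length 2n -> strict excursion of length 2n + 2."""
    return [0] + [h + 1 for h in excursion] + [0]


def unwrap(strict):
    """Inverse map: drop the first up step and the last down step."""
    return [h - 1 for h in strict[1:-1]]


def is_strict(excursion):
    """Nearest-neighbour path from 0 to 0, strictly positive in between."""
    nearest = all(abs(a - b) == 1 for a, b in zip(excursion, excursion[1:]))
    inside = all(h > 0 for h in excursion[1:-1])
    return nearest and inside and excursion[0] == 0 and excursion[-1] == 0


soft = [0, 1, 0, 1, 0]          # returns to zero in the middle, so not strict
strict = wrap(soft)
```

The empty excursion, encoded as `[0]`, is sent to `[0, 1, 0]`, the only strict excursion of length 2.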

Concatenating excursions

Given a finite soft excursion $\varepsilon$ we call $\tilde{\varepsilon}$ the finite walk $\left(\tilde{\varepsilon}(i)\right)_{i=0}^{2n(\varepsilon)+1}$ such that $\tilde{\varepsilon}(i)=\varepsilon(i)$ when $0\leq i\leq 2n(\varepsilon)$ and $\tilde{\varepsilon}(2n(\varepsilon)+1)=-1$. This corresponds essentially to adding a $\diagdown$ at the end of the soft excursion. Given two such finite walks $\tilde{\varepsilon}_{0}$ and $\tilde{\varepsilon}_{1}$ we introduce their concatenation $\tilde{\varepsilon}_{0}\star\tilde{\varepsilon}_{1}$. This is a finite walk such that $\left[\tilde{\varepsilon}_{0}\star\tilde{\varepsilon}_{1}\right](i)=\tilde{\varepsilon}_{0}(i)$ when $0\leq i\leq 2n(\varepsilon_{0})+1$ and $\left[\tilde{\varepsilon}_{0}\star\tilde{\varepsilon}_{1}\right](i)=\tilde{\varepsilon}_{1}(i-2n(\varepsilon_{0})-1)-1$ if $2n(\varepsilon_{0})+1<i\leq 2(n(\varepsilon_{0})+n(\varepsilon_{1}))+2$. Essentially this operation corresponds to gluing the graphs of the walks one after the other continuously. Iterating this operation we can similarly define the concatenation of a finite number of finite walks $\tilde{\varepsilon}_{0}\star\tilde{\varepsilon}_{1}\star\dots\star\tilde{\varepsilon}_{k}$. Likewise we consider an infinite walk $\left(\tilde{\varepsilon}_{i}\right)_{i\in\mathbb{Z}}^{\star}$ obtained by a doubly infinite concatenation of finite walks. Informally this is obtained by concatenating the graphs continuously as before, with the condition that $\left(\tilde{\varepsilon}_{i}\right)_{i\in\mathbb{Z}}^{\star}(j)=\tilde{\varepsilon}_{0}(j)$ for $0\leq j\leq 2n(\varepsilon_{0})+1$.

Formally the walk $\xi$ is defined in terms of a family of excursions $(\varepsilon_{j})_{j\in\mathbb{Z}}$ as follows. First fix the position of the records of the walk $\xi$ iteratively by

$$r(0,\xi)=0,\qquad r(k+1,\xi)-r(k,\xi)=2n(\varepsilon_{k})+1,\tag{3}$$

so that the number of boxes between records $k$ and $k+1$ is the size of excursion $k$ . Now complete the definition by inserting excursion $k$ between those records:

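$$\xi(r(k,\xi)+i)=-k+\varepsilon_{k}(i),\quad i\in\{0,\dots,2n(\varepsilon_{k})\},\ k\in\mathbb{Z}.\tag{4}$$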

The resulting walk $\xi$ attains the level $-k$ for the first time at position $r(k,\xi)$. In particular, $\xi$ has infinitely many records, one for each element of $\mathbb{Z}$. When this happens we say for short that the walk has all the records. Clearly a similar concatenation procedure can be performed for any collection of finite walks and not just for excursions. We do not give the straightforward details.

Conversely, if we have a walk $\xi$ with all the records and such that record 0 is at 0 and record $k$ is at $r(k,\xi)$ , then for each $k\in\mathbb{Z}$ we can define the excursion $\varepsilon_{k}=\varepsilon_{k}[\xi]$ by

$$\varepsilon_{k}(i)=\xi(r(k,\xi)+i)-(-k),\quad i\in\{0,1,\dots,r(k+1,\xi)-r(k,\xi)-1\};$$

then we have that $\left(\tilde{\varepsilon}_{i}\right)_{i\in\mathbb{Z}}^{\star}$ coincides with the original walk $\xi$ .

We proved therefore that an infinite walk is obtained by an infinite concatenation of finite soft excursions separated by a $\diagdown$ if and only if it has all the records.
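A finite version of this correspondence can be sketched as follows. The functions `concatenate` and `decompose` are our own names; excursions are lists of heights, only finitely many excursions $\varepsilon_{0},\dots,\varepsilon_{m-1}$ are glued, and the spacing between consecutive records is taken to be $2n(\varepsilon_{k})+1$ (an excursion of length $2n(\varepsilon_{k})$ followed by one down step), which is how we read the construction above.

```python
def concatenate(excursions):
    """Glue soft excursions, each followed by one down step.

    Returns (xi, recs): the walk heights starting at xi[0] = 0 and the
    record positions r(0), ..., r(m).
    """
    xi, recs = [], []
    for k, eps in enumerate(excursions):
        recs.append(len(xi))                # record k sits at level -k
        xi.extend(-k + h for h in eps)      # excursion k shifted down by k
    recs.append(len(xi))
    xi.append(-len(excursions))             # final down step: record m
    return xi, recs


def decompose(xi):
    """Recover the excursions between successive records of a finite walk."""
    recs, low = [], None
    for i, h in enumerate(xi):
        if low is None or h < low:
            recs.append(i)
            low = h
    return [[xi[i] - xi[a] for i in range(a, b)]
            for a, b in zip(recs, recs[1:])]


excs = [[0, 1, 2, 1, 0], [0], [0, 1, 0, 1, 0]]   # [0] is the empty excursion
xi, recs = concatenate(excs)
recovered = decompose(xi)
```

The round trip returns the original family, including the empty excursion between two adjacent records.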

The set of configurations with density $a$ is defined by

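$$\mathcal{X}_{a}:=\Big\{\eta\in\{0,1\}^{\mathbb{Z}}:\lim_{n\to\pm\infty}\frac{\sum_{j=0}^{n}\eta(j)}{n+1}=a\Big\},\qquad a\in[0,1],\tag{5}$$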

and we call $\mathcal{X}:=\cup_{a<1/2}\mathcal{X}_{a}$ the set of configurations with some density below $\frac{1}{2}$. Consider $\eta\in\mathcal{X}$ and let $\xi=W\eta$. Since the walk $\xi$ is a nearest neighbor walk with negative drift, it can assume any given value $k\in\mathbb{Z}$ only a finite number of times; therefore the walk has all the records, and hence any element of $W\mathcal{X}$ can be seen as a concatenation of infinitely many finite excursions.

The converse statement is however in general not true. It is possible to construct walks concatenating finite excursions that belong to $\mathcal{X}_{1/2}$ or also such that the limits involved in the definition (5) do not exist.

An example for the first case is a concatenation $\left(\tilde{\varepsilon}_{i}\right)^{\star}_{i\in\mathbb{Z}}$ where the walk $\tilde{\varepsilon}_{i}$ is obtained by adding a $\diagdown$ to the excursion $\varepsilon_{i}$ that has length $2^{|i|+1}$ and is composed of an alternating sequence of $\diagup$ and $\diagdown$.

An example for which the limits do not exist is when the excursion $\varepsilon_{i}$ is formed by a sequence of $2^{|i|}$ pieces of the type $\diagup$ followed by the same number of $\diagdown$.
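Both examples can be explored numerically. The sketch below is a rough illustration under our own simplifications: we only look to the right of the origin, truncate to the excursions $i=0,\dots,m$, and the function names are hypothetical. It computes the fraction of occupied boxes among the first $n+1$ boxes.

```python
from itertools import accumulate


def prefix_density(eta):
    """Fraction of occupied boxes among the first n + 1 boxes, for each n."""
    return [s / (n + 1) for n, s in enumerate(accumulate(eta))]


def example_alternating(m):
    """Excursion i: 2^i copies of (ball, empty), then one empty box."""
    eta = []
    for i in range(m + 1):
        eta += [1, 0] * (2 ** i) + [0]
    return eta


def example_blocks(m):
    """Excursion i: 2^i balls, then 2^i empty boxes, then one empty box."""
    eta = []
    for i in range(m + 1):
        eta += [1] * (2 ** i) + [0] * (2 ** i) + [0]
    return eta


m = 15
d1 = prefix_density(example_alternating(m))
d2 = prefix_density(example_blocks(m))
# Index of the last ball of the final block (top of the last excursion).
last_peak = len(d2) - 1 - (2 ** m + 1)
```

In the first example the prefix densities approach $1/2$. In the second they are close to $1/2$ at the end of each excursion but close to $2/3$ at the top of each excursion, so they cannot converge.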

2.3 BBS with infinitely many balls and on the ring

We can now generalize the definition of the dynamics to infinite configurations of balls. This can be done in a natural way under suitable assumptions. In particular the dynamics can be defined for configurations of balls whose corresponding walk has all the records. We have already shown that in this case the walk is a suitable horizontal translation of the concatenation $\left(\tilde{\varepsilon}_{i}\right)^{\star}_{i\in\mathbb{Z}}$ of infinitely many finite excursions with a $\diagdown$ appended at the end. We also define the action of the operator $T$ on configurations of balls on a ring with $N$ sites containing $k\leq\frac{N}{2}$ balls. In both cases the basic idea is that we can define the action of the evolution operator $T$ on each single excursion of the decomposition of an associated walk.

We discuss this issue using the first definition of the dynamics. Similar arguments can be given also for the other definitions. The basic fact is that, when the walk of an infinite configuration of balls has all the records then in the pairing procedure all the lines joining balls and empty boxes can be constructed locally. More precisely drawing a vertical line going through a record $r(k,\eta)\in\mathbb{Z}$ we have that there are no lines of the construction that cross this vertical line. All the balls belonging to an excursion are paired to empty boxes belonging to the same excursion.

Therefore, if the walk representation $W\eta$ of an infinite configuration of balls $\eta$ is the concatenation of infinitely many finite excursions separated by records, then $T\eta$ can be naturally defined, using the first definition. More precisely the operative definition of $T$ is the following. Consider an excursion of the walk and consider the balls that are in the corresponding lattice sites. Erase all the other balls of the configuration. In this way we obtain a configuration with a finite number of balls and we can apply the original first definition. All the balls will be paired with boxes belonging to lattice sites of the excursion. We do this for all the excursions of the walk. In this way we obtain the configuration $T\eta$ . There are no overlaps since all the constructions stay inside the disjoint excursions of $W\eta$ .
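The locality of the construction can be checked on a finite window: applying the dynamics excursion by excursion gives the same result as one global sweep, and both agree with the rule that record boxes stay empty while every other box is flipped. The sketch uses the carrier rule for convenience rather than the pairing; all function names are our own, and boxes are indexed from 0 inside the window.

```python
def carrier(eta):
    """One BBS step on a finite window via the carrier rule."""
    load, out = 0, []
    for box in eta:
        if box == 1:
            load += 1
            out.append(0)
        elif load > 0:
            load -= 1
            out.append(1)
        else:
            out.append(0)
    if load != 0:
        raise ValueError("window ends in the middle of an excursion")
    return out


def record_boxes(eta):
    """Boxes at which the walk reaches a new strict minimum."""
    level, low, recs = 0, 0, []
    for j, b in enumerate(eta):
        level += 2 * b - 1
        if level < low:
            low = level
            recs.append(j)
    return recs


def step_by_excursions(eta):
    """Apply the carrier separately to the boxes between successive records."""
    out = [0] * len(eta)
    start = 0
    for r in record_boxes(eta) + [len(eta)]:
        out[start:r] = carrier(eta[start:r])   # one (possibly empty) excursion
        start = r + 1                          # the record box stays empty
    return out


def step_by_flip(eta):
    """Record boxes stay empty; every other box is flipped."""
    recs = set(record_boxes(eta))
    return [0 if j in recs else 1 - b for j, b in enumerate(eta)]


eta = [0, 1, 1, 0, 1, 0, 0, 0, 1, 0, 0, 1, 1, 0, 0, 0]
recs = record_boxes(eta)
```

In this window the records split the configuration into three non-empty excursions, and the three ways of computing $T\eta$ coincide.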

The example of Fig. 2 corresponds to a configuration of balls having one single non empty excursion. The example of Fig. 4 corresponds instead to a configuration having 3 non empty excursions that are surrounded by rectangles. The lines constructed for any excursion are naturally divided into blocks. These blocks correspond exactly to the natural subdivision of any excursion into the concatenation of strict excursions. Each block has a maximal line surrounding all the others and there are no other lines surrounding the maximal ones. This means that balls and empty boxes corresponding to a strict excursion are paired among themselves and therefore the evolution of the balls of each strict excursion is determined independently of what happens outside. In Fig. 4 the leftmost excursion is the concatenation of 2 strict excursions and correspondingly there are 2 maximal lines inside the rectangle. The same happens to the central excursion while the rightmost excursion has only one maximal line so that the excursion is strict.

We observe (see [2] for more details) that there are configurations $\eta$ such that $T\eta$ is well defined but $T(T\eta)$ is not. For simplicity we use a configuration $\eta$ built as the concatenation of infinitely many strict excursions, but we could equally start from a configuration with infinitely many excursions separated by records. The configuration that we consider is $\eta=(\varepsilon_{i})_{i\in\mathbb{Z}}^{*}$ (this concatenation is defined in the same way as the one given for excursions with a $\diagdown$ appended at the end), where the excursions $\varepsilon_{i}$ with $i\geq 0$ are all obtained from ball configurations of the form $\bullet\circ$ while the excursion $\varepsilon_{i}$ with $i<0$ is of the form

[TABLE]

By our previous arguments it is possible to implement the transformation $T$ since we can operate separately on each strict excursion. As the reader can easily see it is not possible to define $T(T\eta)$ . This is because the configuration $T\eta$ has no records and hence cannot be divided into finite disjoint excursions.

It is important to see that if $\eta\in\mathcal{X}$ then we can define $T^{k}\eta$ for any $k$ . This is because it can be easily shown that $T$ maps elements of $\mathcal{X}$ to elements of $\mathcal{X}$ . Indeed, the example just illustrated does not belong to $\mathcal{X}$ .

A similar discussion can be carried out using the other definitions in §2.1. Let us consider the second definition. In the case of a finite number of balls we imagine the carrier starting empty just to the left of the first ball. In the case of infinitely many balls we can instead consider the carrier starting empty at a record and moving to the right. The carrier transforms the configuration of balls of the first excursion that it meets. After this it reaches a new record box and is therefore empty again. The carrier can then proceed afresh to the second excursion, and so on. Equivalently, the transformation $T$ can be performed by infinitely many carriers, one for each finite excursion. Each starts empty to the left of its excursion and ends empty to the right of it. The evolved configuration $T\eta$ can therefore be computed locally, restricting to each single excursion.
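The carrier description can be sketched in a few lines of Python. This is a minimal illustration under our own assumptions: the name `carrier_step`, the encoding of $\eta$ as a 0/1 list whose first entry is to the right of a record, and the finite padding with empty boxes on the right are not part of the original construction. The function also reports the empty boxes that the carrier visits while empty, which under these assumptions are the records met along the way.

```python
def carrier_step(eta):
    """Apply T once to a finite 0/1 list using the carrier rule.

    Returns (new_configuration, records), where `records` lists the
    empty boxes visited while the carrier itself is empty.
    """
    n_balls = sum(eta)
    # Pad with empty boxes so every carried ball can be dropped.
    cfg = list(eta) + [0] * n_balls
    out, records = [], []
    load = 0  # number of balls currently in the carrier
    for x, ball in enumerate(cfg):
        if ball:              # occupied box: pick the ball up
            load += 1
            out.append(0)
        elif load > 0:        # empty box, loaded carrier: drop one ball
            load -= 1
            out.append(1)
        else:                 # empty box, empty carrier: nothing happens
            out.append(0)
            records.append(x)
    return out, records
```

For instance, `carrier_step([1, 0, 0, 1, 1, 0, 0])` returns the configuration `[0, 1, 0, 0, 0, 1, 1, 0, 0, 0]` together with the boxes `[2, 7, 8, 9]`: the carrier is empty at box 2, between the two excursions, so each excursion is updated without reference to the other.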

The dynamics on a ring can be defined simply by associating to each configuration on the ring an infinite periodic configuration on $\mathbb{Z}$ . When the number of balls is strictly less than $N/2$ the corresponding walk has all the records and we can perform the construction discussed above on each excursion independently. The evolved configuration is again periodic and can be interpreted as a ball configuration on the ring. This does not hold when the number of balls is exactly $N/2$ , since in this case the associated infinite walk has no records. However, the dynamics in this case consists simply in flipping the value of each box: empty boxes become full while full boxes become empty.
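A hedged Python sketch of the ring dynamics follows. It does not build the periodic configuration on $\mathbb{Z}$ explicitly. Instead, as an assumption of this sketch, the carrier travels around the ring twice: the first lap only determines the load with which the carrier crosses the starting box (with fewer than $N/2$ balls the carrier becomes empty at some record during that lap, after which its load agrees with that of the periodic system), and the second lap performs the update. The name `ring_step` and the list encoding are ours.

```python
def ring_step(eta):
    """One step of the Box-Ball dynamics on a ring, eta a 0/1 list."""
    n, k = len(eta), sum(eta)
    if 2 * k > n:
        raise ValueError("at most N/2 balls are allowed")
    if 2 * k == n:
        # Exactly N/2 balls: no records, every box is flipped.
        return [1 - b for b in eta]
    load = 0
    # First lap: only learn the carrier load at the starting box.
    for b in eta:
        if b:
            load += 1
        elif load > 0:
            load -= 1
    # Second lap: perform the update with the correct initial load.
    out = []
    for b in eta:
        if b:
            load += 1
            out.append(0)
        elif load > 0:
            load -= 1
            out.append(1)
        else:
            out.append(0)
    return out
```

For example, `ring_step([0, 0, 0, 1, 1])` gives `[1, 1, 0, 0, 0]`: the two balls at the end of the ring wrap around to its first two boxes.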

Sixth definition: Using the walk representation of a configuration of balls it is possible to give another equivalent definition of the Box-Ball dynamics. Since we know that the evolution operator $T$ acts independently on each excursion, let us consider a configuration of balls $\eta$ on $\mathbb{Z}$ having just a finite number of balls and such that the corresponding walk has one single excursion. The updating rule of the evolution $T$ consists in flipping the graph of the excursion as in Fig. 5. When there is more than one excursion the same symmetry operation has to be performed on each single excursion. The configuration $T\eta$ is recovered by applying $W^{-1}$ to the new walk obtained. This dynamics was already proposed by Le Gall [12].

Seventh definition: Here we write in formulas the construction of the previous definition. These formulas apply directly to infinite configurations of balls having all the records. The first simple and general formula that summarizes the evolution is

[TABLE]

The second formula is the following. For a walk $\xi$ having all the records, the curve $\min_{y\leq x}\xi(y)$ is well defined. The operator $T$ essentially reflects the walk $\xi$ with respect to this curve. We have

[TABLE]

where we denote by $T\xi$ the walk corresponding to $T\eta$ when $\xi=W\eta$ , i.e. $T\xi:=WT\eta$ .
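As an illustration of the reflection picture, the following Python sketch reflects a finite piece of walk with respect to its running minimum and reads the new configuration off the increments. It assumes the normalization $\xi(0)=0$, boxes numbered $1,\dots,n$, empty boxes to the left of box 1, and the increment convention $\xi(i)-\xi(i-1)=2\eta(i)-1$. Under these assumptions the reflected walk is taken to be $2\min_{y\leq x}\xi(y)-\xi(x)$; this choice of additive constant is ours and may differ from the normalization used in the displayed formula. All helper names are hypothetical.

```python
def walk(eta):
    """Walk with xi(0) = 0 and xi(i) - xi(i-1) = 2*eta(i) - 1."""
    xi = [0]
    for ball in eta:
        xi.append(xi[-1] + 2 * ball - 1)
    return xi


def reflect(xi):
    """Reflect the walk with respect to its running minimum."""
    out = []
    running_min = xi[0]
    for height in xi:
        running_min = min(running_min, height)
        out.append(2 * running_min - height)
    return out


def unwalk(xi):
    """Inverse of `walk`: recover the 0/1 configuration from increments."""
    return [(xi[i] - xi[i - 1] + 1) // 2 for i in range(1, len(xi))]


def step_by_reflection(eta):
    """T eta computed as W^{-1} of the reflected walk."""
    return unwalk(reflect(walk(eta)))
```

With enough empty boxes on the right, `step_by_reflection([1, 1, 0, 1, 0, 0, 0, 0, 0])` returns `[0, 0, 1, 0, 1, 1, 0, 0, 0]`: inside the excursion every box is flipped, while the record boxes to its right stay empty.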

3 Conserved quantities and solitons

In this section we discuss how to identify the solitons that are traveling through the system. We obtain different combinatorial structures and discuss the relationship among them. Solitons are conserved quantities of the system.

3.1 Runs

Given a configuration of balls $\eta$ , a zero-run is a maximal integer interval of empty boxes and a one-run is a maximal integer interval of boxes occupied by balls; we call run any element of the union of these two sets. The runs of $\eta$ form a partition of the lattice $\mathbb{Z}$ . In statistical mechanics a run is usually called a cluster.

More precisely the finite interval $[x,y]\subseteq\mathbb{Z}$ is a run if $\eta(z_{1})=\eta(z_{2})$ for any $z_{1},z_{2}\in[x,y]$ and moreover $\eta(x-1)\neq\eta(x)$ and $\eta(y+1)\neq\eta(y)$ . A run can be a zero run or a one run depending if the boxes are respectively empty or occupied. A run can be also semi-infinite or infinite. We can have therefore runs of the form $(-\infty,x]$ or $[x,+\infty)$ or even $(-\infty,+\infty)$ . In the first case we have $\eta(z_{1})=\eta(z_{2})$ for any $z_{1},z_{2}\leq x$ and $\eta(x+1)\neq\eta(x)$ , in the second case we have $\eta(z_{1})=\eta(z_{2})$ for any $z_{1},z_{2}\geq x$ and $\eta(x-1)\neq\eta(x)$ while in the last case we have $\eta(z_{1})=\eta(z_{2})$ for any $z_{1},z_{2}$ .

Given the run $[x,y]$ we call $y-x+1$ , the number of boxes it contains, its size. The size of a semi-infinite or infinite run is $+\infty$ .

Any configuration of balls generates a partition of $\mathbb{Z}$ into disjoint runs alternating between zero-runs and one-runs. A configuration containing a finite nonzero number of balls induces a finite collection of runs, the leftmost and the rightmost of which are semi-infinite zero-runs. The configuration with no balls has only a doubly infinite zero-run. In this situation it is possible to implement the following algorithm.

3.2 Takahashi-Satsuma soliton decomposition

The following is a small variation of the original algorithm.

Start with a ball configuration $\eta$ with a finite number of balls.

1) If there is just one single infinite zero-run then stop, otherwise go to the next step.

2) Search for the leftmost among the smallest runs; assume that the run size is $k$ . The $k$ boxes belonging to the run and the first $k$ boxes belonging to the nearest neighbor run to its right (whose size is necessarily not smaller than $k$ ) identify a soliton $\gamma$ ; call $\mathtt{h}_{1}(\gamma)<\dots<\mathtt{h}_{k}(\gamma)$ the positions of the occupied boxes of $\gamma$ and $\mathtt{t}_{1}(\gamma)<\dots<\mathtt{t}_{k}(\gamma)$ the positions of the empty boxes of $\gamma$ .

3) Ignore the boxes of the identified solitons, update the runs gluing together the remaining boxes and go to step 1.

Since there is a finite number of balls, the algorithm stops after a finite number of iterations and identifies a finite number of solitons. This algorithm is called TS decomposition. For instance, the TS decomposition of the excursion of Fig. 6 is given in Fig. 7.

The soliton decomposition of a ball configuration with infinitely many balls and infinitely many records is done performing the above algorithm on each single excursion.
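The three steps of the TS decomposition can be sketched in Python as follows. This is an illustrative implementation under our own conventions, not code from the original references: boxes are indexed from 0, the two semi-infinite zero-runs are represented by the zero-runs at either end of a list padded with empty boxes, and the function name `ts_decomposition` is hypothetical.

```python
from itertools import groupby


def ts_decomposition(eta):
    """TS soliton decomposition of a finite 0/1 list.

    Returns a list of (k, heads, tails) in the order of identification,
    where heads are the occupied boxes and tails the empty boxes.
    """
    n_balls = sum(eta)
    # Finite stand-in for the right semi-infinite zero-run.
    cfg = list(eta) + [0] * n_balls
    active = list(range(len(cfg)))      # boxes not yet assigned
    solitons = []
    while any(cfg[p] for p in active):
        # Runs of the remaining boxes, glued together.
        runs = [(v, list(g)) for v, g in groupby(active, key=lambda p: cfg[p])]
        last = len(runs) - 1
        # Zero-runs at either end play the role of the infinite runs.
        finite = [j for j, (v, _) in enumerate(runs)
                  if not (v == 0 and j in (0, last))]
        k = min(len(runs[j][1]) for j in finite)
        j = next(j for j in finite if len(runs[j][1]) == k)  # leftmost
        # The run itself plus the first k boxes of its right neighbor.
        boxes = runs[j][1] + runs[j + 1][1][:k]
        heads = sorted(p for p in boxes if cfg[p] == 1)
        tails = sorted(p for p in boxes if cfg[p] == 0)
        solitons.append((k, heads, tails))
        removed = set(boxes)
        active = [p for p in active if p not in removed]
    return solitons
```

On the excursion `[1, 1, 0, 1, 0, 0]` the sketch first identifies a 1-soliton with tail at box 2 and head at box 3, and then a 2-soliton with head `[0, 1]` and tail `[4, 5]`.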

Each soliton $\gamma$ is composed by two disjoint sets of the same cardinality, the set of occupied boxes $\mathtt{h}(\gamma)$ , called the head and the set of empty boxes $\mathtt{t}(\gamma)$ called the tail. They satisfy $\gamma=\mathtt{t}(\gamma)\dot{\cup}\mathtt{h}(\gamma)$ . If $|\mathtt{t}(\gamma)|=|\mathtt{h}(\gamma)|=k$ we call $\gamma$ a $k$ -soliton and write $\mathtt{t}(\gamma)=(\mathtt{t}_{1}(\gamma),\dots,\mathtt{t}_{k}(\gamma))$ with $\mathtt{t}_{i}(\gamma)<\mathtt{t}_{i+1}(\gamma)$ and $\mathtt{h}(\gamma)=(\mathtt{h}_{1}(\gamma),\dots,\mathtt{h}_{k}(\gamma))$ with $\mathtt{h}_{i}(\gamma)<\mathtt{h}_{i+1}(\gamma)$ . Note that either $\mathtt{h}_{i}(\gamma)<\mathtt{t}_{j}(\gamma)$ for any $i,j$ or $\mathtt{h}_{i}(\gamma)>\mathtt{t}_{j}(\gamma)$ for any $i,j$ and that the walk has the same height at $\mathtt{h}_{i}(\gamma)$ and $\mathtt{t}_{i}(\gamma)$ : $\xi(\mathtt{h}_{i}(\gamma))=\xi(\mathtt{t}_{i}(\gamma))$ , for this reason we say that the head and tail of a soliton are paired.

We have the following key definition.

Definition 1.

We say that a box $i$ is a $k$ -slot if either $i$ is a record or $i$ belongs to $\{\mathtt{t}_{\ell}(\gamma),\mathtt{h}_{\ell}(\gamma)\}$ for some $\ell>k$ for some $m$ -soliton $\gamma$ with $m>k$ .

The set of $k$ -slots contains the set of $m$ -slots for all $m>k$ . We illustrate in Fig. 8 the slots induced by the soliton decomposition of the excursion in Fig. 7, obtaining one 4-slot (at the record), 5 3-slots, 11 2-slots and 21 1-slots. According to Definition 1, the number of $k$ -slots is $s_{k}:=1+\sum_{\ell>k}2(\ell-k)n_{\ell}$ , where $n_{\ell}$ is the number of $\ell$ -solitons in the excursion. The $1$ in the above formula corresponds to the record on the left, which is a slot of any order.
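The slot count can be checked numerically with a short Python sketch. It uses the count $1+\sum_{\ell>k}2(\ell-k)n_{\ell}$ that follows from Definition 1, since each $\ell$-soliton with $\ell>k$ contributes $2(\ell-k)$ boxes. The soliton numbers $n_{2}=2$, $n_{3}=1$, $n_{4}=2$ used in the example are an assumption chosen so as to reproduce the slot counts quoted for Fig. 8; the value of $n_{1}$ does not enter. The function name is hypothetical.

```python
def slot_counts(n):
    """Number of k-slots for k = 1..M, given n = {size: number of solitons}."""
    top = max(n)
    return {
        k: 1 + sum(2 * (size - k) * n.get(size, 0)
                   for size in range(k + 1, top + 1))
        for k in range(1, top + 1)
    }


# Assumed soliton numbers, consistent with the counts quoted in the text.
counts = slot_counts({2: 2, 3: 1, 4: 2})
```

Here `counts` equals `{1: 21, 2: 11, 3: 5, 4: 1}`, matching the 21, 11, 5 and 1 slots of orders 1 to 4.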

For each $k$ we enumerate the $k$ -slots in the excursion starting with 0 for the $k$ -slot in the record preceding the excursion. We say that a $k$ -soliton $\gamma$ is attached to the $k$ -slot number $i$ if the boxes occupied by $\gamma$ are contained in the segment with extremes the $i$ th and $(i+1)$ th $k$ -slots in the excursion. We define

$$x_{k}(i):=\text{number of }k\text{-solitons attached to the }k\text{-slot number }i.\qquad(8)$$

3.3 Slot diagrams

We define a combinatorial family of objects called slot diagrams that according to [7] is in bijection with $\mathcal{E}$ , see also §4 below.

A generic slot diagram is denoted by $x=\left(x_{k}:1\leq k\leq M\right)$ , where $M=M(x)$ is a non negative integer,

[TABLE]

and $s_{k}$ is a non-negative integer. We say that $x_{k}(j)$ is the number of $k$ -solitons attached to the $k$ -slot number $j$ . We denote $n_{k}:=\sum_{i=0}^{s_{k}-1}x_{k}(i)$ , the number of $k$ -solitons in $x$ . A precise definition is the following.

We say that $x$ is a slot diagram if

• There exists a non negative integer number $M=M(x)$ such that $s_{M}=1$ and $x_{m}(0)=0$ for $m>M$ . Hence, we can ignore soliton sizes above $M$ and denote $x=\left(x_{k}:1\leq k\leq M\right)$ .

• For any $k$ , the number of $k$ -slots $s_{k}$ is determined by $(x_{\ell}:\ell>k)$ via the formula

$$s_{k}=1+\sum_{\ell>k}2(\ell-k)\,n_{\ell}.\qquad(9)$$

Consider now the soliton decomposition of an excursion and the corresponding vectors defined by (8), and define $M=\min\{k\geq 0:\ x_{k^{\prime}}(0)=0\text{ for all }k^{\prime}>k\}$ . Then the family of vectors $(x_{k}:k\leq M)$ forms a slot diagram; if $M=0$ the slot diagram is empty and corresponds to an empty excursion. The slot diagram of the excursion in Fig. 8 is as follows. We have $M=4$ and

[TABLE]

For example the vector $x_{3}$ has just $x_{3}(1)=1\neq 0$ since there is just one 3-soliton and its support (the boxes corresponding to the green part of the walk in Fig. 8) is contained between the 3-slots number 1 and number 2. Recall that the $k$ -slot located at the record to the left of the excursion is numbered 0 for all $k$ , so the 3-slot number 1 is the second green box from the left in Fig. 8.

3.4 Head-Tail soliton decomposition

We propose another decomposition, called HT soliton decomposition.

Start with a ball configuration $\eta$ with a single excursion.

1) If there is just one single (infinite) run then stop, otherwise go to the next step.

2) Search for the leftmost among the smallest runs. If the run contains 1’s, then pair the boxes belonging to the run with the first boxes (with zeroes) belonging to the nearest neighbor run to its right. If the run contains 0’s, then pair the boxes with the nearest boxes (with ones) to the left of the run. The set of paired boxes and their contents identifies a soliton $\gamma$ .

3) Ignore the boxes of the identified solitons, update the runs gluing together the remaining boxes and go to step 1.

The HT soliton decomposition of the excursion in Fig. 6 is given in Fig. 9. The name of the decomposition comes from the fact that the head of each soliton is to the left of its tail in all cases. We call soliton*⋄* any soliton identified by the HT decomposition. We will see that this decomposition arises naturally in terms of a tree associated to the excursion.
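The HT rule differs from the TS rule only in how a smallest zero-run is paired, as the following Python sketch illustrates. As before this is a hedged illustration with our own conventions: boxes are indexed from 0, the input is a 0/1 list containing a single excursion, the right semi-infinite zero-run is replaced by a finite padding, and the name `ht_decomposition` is hypothetical.

```python
from itertools import groupby


def ht_decomposition(eta):
    """HT soliton decomposition of a finite 0/1 list (single excursion).

    Returns a list of (k, heads, tails) in the order of identification.
    """
    n_balls = sum(eta)
    cfg = list(eta) + [0] * n_balls     # stand-in for the infinite zero-run
    active = list(range(len(cfg)))
    solitons = []
    while any(cfg[p] for p in active):
        runs = [(v, list(g)) for v, g in groupby(active, key=lambda p: cfg[p])]
        last = len(runs) - 1
        finite = [j for j, (v, _) in enumerate(runs)
                  if not (v == 0 and j in (0, last))]
        k = min(len(runs[j][1]) for j in finite)
        j = next(j for j in finite if len(runs[j][1]) == k)  # leftmost
        value, boxes = runs[j]
        if value == 1:
            # Run of balls: pair with the first k empty boxes to the right.
            heads, tails = boxes, runs[j + 1][1][:k]
        else:
            # Run of empty boxes: pair with the nearest k balls to the left.
            heads, tails = runs[j - 1][1][-k:], boxes
        solitons.append((k, heads, tails))
        removed = set(heads) | set(tails)
        active = [p for p in active if p not in removed]
    return solitons
```

On `[1, 1, 0, 1, 0, 0]` the sketch returns a 1-soliton*⋄* with head at box 1 and tail at box 2, followed by a 2-soliton*⋄* with head `[0, 3]` and tail `[4, 5]`; in both cases the head lies to the left of the tail.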

We say that a box $i$ is a $k$ -slot*⋄* if either $i$ is a record or $i\in\{\mathtt{h}_{\ell}(\gamma^{\diamond}),\mathtt{t}_{m-\ell+1}(\gamma^{\diamond})\}$ for some $\ell\in\{1,\dots,m-k\}$ and some $m$ -soliton*⋄* $\gamma^{\diamond}$ with $m>k$ ; for example, if $\gamma^{\diamond}$ is a 4-soliton*⋄*, then $\mathtt{h}_{1}(\gamma^{\diamond})$ and $\mathtt{t}_{4}(\gamma^{\diamond})$ are 3-slots*⋄*. See the upper part of Fig. 10. Observe that, as before, the set of $k$ -slots*⋄* is contained in the set of $\ell$ -slots*⋄* for any $\ell<k$ .

As before we say that a $k$ -soliton*⋄* is attached to $k$ -slot*⋄* number $i$ if the boxes of the soliton*⋄* are strictly between $k$ -slots*⋄* $i$ and $i+1$ . If we enumerate the $k$ -slots*⋄* of the excursion starting with 0 for the $k$ -slot*⋄* at record 0, we can again define $x_{k}^{\diamond}(i):=$ number of $k$ -solitons*⋄* attached to $k$ -slot*⋄* number $i$ . This produces a slot diagram $x^{\diamond}$ associated to the excursion. We denote $x^{\diamond}[\varepsilon]$ the slot diagram produced by the HT soliton decomposition of the excursion $\varepsilon$ .

See Fig. 10 for a comparison of the slots induced by the HT soliton decomposition and the TS soliton decomposition.

The next result says that the slot diagrams produced by both decompositions are identical. Observe that a slot diagram gives information about the number of solitons and about their combinatorial arrangement, so that it completely encodes the corresponding excursion.

Theorem 2.

The slot diagram of the Head-Tail decomposition of an excursion $\varepsilon\in\mathcal{E}$ coincides with the slot diagram of the Takahashi-Satsuma decomposition of $\varepsilon$ . That is, $x[\varepsilon]=x^{\diamond}[\varepsilon]$ .

Proof.

Let $\varepsilon$ be an excursion and denote $x[\varepsilon]$ and $x^{\diamond}[\varepsilon]$ the TS and HT slot diagrams of $\varepsilon$ , respectively; let $m$ and $m^{\diamond}$ be the maximal soliton size in each representation. Let $\mathtt{s}_{k}(i)$ and $\mathtt{s}^{\diamond}_{k}(i)$ be the position of the $i$ -th $k$ -slot in the TS and HT decompositions of $\varepsilon$ , respectively.

Assume $\varepsilon$ has neither $\ell$ -solitons nor $\ell$ -solitons*⋄* for all $\ell\leq k$ . Then, $\mathtt{s}_{k}(0)=\mathtt{s}^{\diamond}_{k}(0)=0$ and for $0<i<s_{k}$ , we will show that

[TABLE]

which implies the theorem. We prove (10) by induction. If $\varepsilon$ has only $m$ -solitons, then (10) holds for any $k<m$ by definition. Assume (10) holds if $\varepsilon$ is an excursion with no $\ell$ -solitons for $\ell\leq k$ . Now attach a $k$ -soliton*⋄* $\gamma^{\diamond}$ to $\mathtt{s}^{\diamond}_{k}(i)$ and a $k$ -soliton $\gamma$ to $\mathtt{s}_{k}(i)$ .

We have 2 cases:

(1) $\mathtt{s}_{k}(i)$ is the record or belongs to the tail of an $m$ -soliton $\alpha$ with $m$ bigger than $k$ . In this case $\mathtt{s}_{k}^{\diamond}(i)$ is also the record or belongs to the tail of an $m$ -soliton*⋄* $\alpha^{\diamond}$ , and $\gamma$ is attached to the same place as $\gamma^{\diamond}$ ; hence the attachment does not affect the distances between $\ell$ -slots and $\ell$ -slots*⋄* in the excursion for $\ell\leq k$ (indeed, they coincide in the record and in the tails of $\alpha$ and $\alpha^{\diamond}$ ). On the other hand, the $\ell$ -slots carried by $\gamma$ and the $\ell$ -slots*⋄* carried by $\gamma^{\diamond}$ satisfy (10).

(2) $\mathtt{s}_{k}(i)$ is in the head of $\alpha$ . In this case necessarily $\mathtt{s}^{\diamond}_{k}(i)$ is in the head of $\alpha^{\diamond}$ by the inductive hypothesis, and $\mathtt{s}_{k}(i)=\mathtt{s}^{\diamond}_{k}(i)+k$ . We now consider 2 cases:

(2a) $k$ -slots. The attachments of $\gamma$ to $\mathtt{s}_{k}(i)$ and of $\gamma^{\diamond}$ to $\mathtt{s}_{k}^{\diamond}(i)$ do not change the distance between $k$ -slots and $k$ -slots*⋄*: either $\mathtt{s}_{k}(j)<\mathtt{s}_{k}(i)$ and $\mathtt{s}^{\diamond}_{k}(j)<\mathtt{s}^{\diamond}_{k}(i)$ , in which case the insertions do not change their positions, or both slots are translated by $2k$ , the number of boxes occupied by a $k$ -soliton. We conclude that (10) is satisfied by $k$ -slots and $k$ -slots*⋄* after the attachments.

(2b) $\ell$ -slots for $\ell<k$ . Take an $\ell<k$ and an $\ell$ -slot $\mathtt{s}_{\ell}(j)$ in the head of $\alpha$ . If $\mathtt{s}^{\diamond}_{\ell}(j)<\mathtt{s}^{\diamond}_{k}(i)$ and $\mathtt{s}_{\ell}(j)<\mathtt{s}_{k}(i)$ , neither will be displaced, so (10) is satisfied for $\ell$ -slots to the left of $\mathtt{s}_{k}(i)$ . On the other hand, if $\mathtt{s}^{\diamond}_{\ell}(j)>\mathtt{s}^{\diamond}_{k}(i)$ , then $\mathtt{s}^{\diamond}_{\ell}(j)$ keeps its place after the attachment of $\gamma^{\diamond}$ and $\mathtt{s}_{\ell}(j)$ is to the left of the attachment, hence they satisfy (10) after the attachments (this is the case of the 4th violet $1$ -slot and $1$ -slot*⋄*).

We have proved that if the slot and slot*⋄* diagrams of an excursion with no $\ell$ -solitons for $\ell\leq k$ coincide, then they coincide after attaching $k$ -solitons and $k$ -solitons*⋄*. ∎

3.5 Attaching solitons

In the previous subsection we discussed the decomposition of a configuration into elementary solitons/solitons*⋄* and how to codify each single excursion using a slot diagram that takes care of the combinatorial arrangement of the solitons/solitons*⋄* into the available slots/slots*⋄*. In this subsection we discuss the reverse construction: given a slot diagram, we illustrate how to construct the corresponding excursion. The procedure is particularly simple and natural in the case of the HT decomposition.

The basic idea is illustrated in Fig. 12, which was obtained from Fig. 9 by drawing horizontal lines from the leftmost point in the graph of the excursion associated to the head to the rightmost point associated to the tail of each soliton*⋄*. These lines cut the epigraph of the excursion into disjoint regions that we color with the corresponding color of the boundary. We imagine each colored region as a physical two dimensional object glued recursively to generate the interface. Indeed we will show that the excursion can be obtained as the final boundary of a region built by adding, one after the other in a Tetris-like construction, upward oriented triangles having elastic diagonal sides.

It is convenient to represent the walk associated to a ball configuration in $\mathcal{X}$ as follows. We transform each down oriented step $\diagdown$ associated to a record into a horizontal line $\frac{\ \ }{\ \ }$ at height $0$ . The parts of the walk associated to the excursions are vertically shifted to level 0, remaining concatenated one after the other by a horizontal line of length equal to the number of records separating the excursions in the walk. The walk is therefore represented by infinitely many pieces of horizontal lines at the zero level (the sea level) separated by infinitely many finite excursions (mountain profiles). This is the construction associated to the Harris walk (see for example [14]). See Fig. 13 for an example with three excursions where we implemented also the same coloring of Fig. 12.

We discuss how to generate one single excursion from a slot diagram using the HT decomposition. We represent an isolated $k$ -soliton*⋄* as a right-angle isosceles triangle having hypotenuse of size $2k$ . The triangle is oriented in such a way that the hypotenuse is horizontal and the triangle points upward, see Fig. 14.

The basic mechanism of attaching solitons*⋄* is illustrated in Fig. 15. In the first drawing, top left, we represent a 4-soliton*⋄* as an upward oriented triangle and draw below it the corresponding slots*⋄*. The leftmost slot*⋄* corresponds to a record located just to the left of the excursion. Colors are as before: violet=1, red=2, green=3, blue=4. In the drawing number $i$ with $i=0,\dots,6$ we attach one 1-soliton*⋄* to the 1-slot*⋄* number $i$ . This corresponds to attaching a triangle with horizontal hypotenuse of size 2 at the position of the corresponding slot*⋄*.

The figure is exhaustive and represents all the possible ways of attaching the 1-soliton*⋄*. The precise rules and the change of the positions of the slots*⋄* during the attaching procedure that generates an excursion are illustrated using as an example the following slot diagram

[TABLE]

We construct now the excursion that corresponds to this slot diagram. We do this using the HT decomposition since it is simpler but the TS decomposition gives as a result the same excursion. First we observe that the maximal soliton*⋄* size in (3.5) is $4$ and there is just one maximal soliton*⋄*.

We start therefore with drawing 1 of Fig. 16 where we have a blue 4-soliton*⋄* represented by an upward oriented triangle. Below it we represent also the $\ell$ -slots*⋄* for $\ell<4$ ; the leftmost $\ell$ -slot*⋄* is always located in the record just to the left of the excursion. Since there are no 3-solitons*⋄* we do not have to add green triangles having hypotenuse of size 6. We proceed therefore to attach the 2-solitons*⋄*, represented as upward oriented triangles with hypotenuse of size 4. We have two of them and we have to attach them to the 2-slots*⋄* number 1 and 3. We label as $\ell$ -slot*⋄* number zero the one associated to the record and number the other ones increasingly from left to right. There are 5 2-slots*⋄* in drawing 1 of Fig. 16 (they are the piles of colored squares containing a red one). We start by attaching a 2-soliton*⋄* to the 2-slot*⋄* number 1. This means that the left corner of the red triangle has to be attached to the boundary of the colored region at the intersection of the boundary with the dashed line just to the right of the 2-slot*⋄* number 1. Since the bottom edge of the triangles is rigid, the blue diagonal side deforms in order to have a perfect gluing. This is illustrated in drawing 2 of Fig. 16. Note that the slots*⋄* in correspondence with the shifted diagonal sides of the blue triangle are shifted accordingly. Moreover, new 1-slots*⋄* are created in correspondence with some red diagonal sides. The same gluing procedure is carried out with a second red triangle at the 2-slot*⋄* number 3, and this is shown in drawing 3 of Fig. 16. Note that we perform these two gluing operations one after the other to better illustrate the rules, but they can be done simultaneously or in the reverse order; the final result is the same. This is because attaching a $k$ -soliton*⋄* generates new $j$ -slots*⋄* only for $j<k$ . Finally we have to attach a 1-soliton*⋄*, that is a violet triangle, to the 1-slot*⋄* number 3, and this is shown in the final drawing 4 of Fig. 16.

3.6 Conserved quantities

We discuss a way to identify conserved quantities using the first definition of the dynamics in §2.1 applied to a finite excursion. Recall that the basic step consists in pairing all neighboring boxes of type 10 by drawing a line from the 1 to the 0 and then removing the paired boxes before iterating, see Fig. 2. Call $r_{i}$ the number of lines drawn in the $i$ -th iteration of the construction. We have $r_{1}\geq r_{2}\geq\dots\geq r_{M}$ , where $M$ is the number of iterations necessary to pair all the balls. In the example of Fig. 2 we have $M=4$ and $r_{1}=8,r_{2}=2,r_{3}=r_{4}=1$ .

Proposition 3 (Yoshihara, Yura, Tokihiro [19]).

The numbers $r_{i}$ are invariant for the dynamics. That is,

$$r_{i}(T\eta)=r_{i}(\eta)\quad\text{for all }i.\qquad(12)$$

Proof.

We present a simplified version of the argument given in [19]. The basic property that we use is the reversibility of the dynamics. Introduce the evolution $T^{*}$ , defined exactly as the original dynamics apart from the fact that balls move to the left instead of to the right. The reversibility of the dynamics is encoded by the relation $T^{*}T\eta=\eta$ . This fact follows from the definition: looking at Fig. 2, the configuration $T\eta$ is obtained just by coloring black the white boxes and white the black ones. The evolution $T^{*}$ is obtained by pairing balls with empty boxes to the left. The lines associated to $T^{*}$ for the configuration $T\eta$ are exactly the same as those already drawn. The only difference is that the balls are now transported from right to left along these lines. Denote by $r^{*}_{i}$ the number of lines drawn at iteration number $i$ for the evolution $T^{*}$ . Since the lines used are the same we have

$$r^{*}_{i}(T\eta)=r_{i}(\eta)\quad\text{for all }i.\qquad(13)$$

Now evolve the original configuration $\eta$ according to $T^{*}$ . In Fig. 17 we draw above the lines corresponding to the evolution $T$ and below those corresponding to $T^{*}$ . We want now to show that

$$r^{*}_{i}(\eta)=r_{i}(\eta)\quad\text{for all }i.\qquad(14)$$

Recall that a run is a maximal sequence of consecutive empty or full boxes. In the configuration $\eta$ of our example there are two semi-infinite empty runs and, between them, an alternating sequence of 8 full and 7 empty finite runs.

The first step is to show that $r_{1}(\eta)=r^{*}_{1}(\eta)$ . This is simple because both numbers coincide with the number of full runs in the configuration $\eta$ . The second step of the algorithm consists in erasing the rightmost ball of every occupied run and the leftmost empty box of every empty run for $T$ , while the leftmost ball of every occupied run and the rightmost empty box of every empty run are erased for $T^{*}$ . Observe that $r_{2}(\eta)$ coincides with the number of full runs in the configuration obtained by removing the balls and the empty boxes paired in the first step. This configuration is obtained from $\eta$ by decreasing by one the size of every finite run; if in $\eta$ there are some runs of size 1, they disappear. The same happens when computing $r^{*}_{2}(\eta)$ . Since we are only interested in the sizes of the alternating sequence of empty and full runs, erasing on the left or on the right is irrelevant. We deduce $r_{2}(\eta)=r^{*}_{2}(\eta)$ since both coincide with the number of finite occupied runs of two configurations having the same sequence of run sizes. Iterating this argument we deduce (14). Now, using (13) and (14) we deduce (12). ∎
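The conservation of the numbers $r_{i}$ can be checked on small examples with the following Python sketch. It is an illustration rather than a proof: the function names, the 0/1 list encoding and the padding with empty boxes on the right are our assumptions, and $T$ is computed with the carrier rule of the second definition.

```python
def pairing_rows(eta):
    """Numbers r_1, r_2, ... of lines drawn at each pairing iteration."""
    cur = list(eta) + [0] * sum(eta)    # room to pair every ball
    rows = []
    while 1 in cur:
        # All adjacent pairs of type 10 among the remaining boxes.
        paired = [i for i in range(len(cur) - 1)
                  if cur[i] == 1 and cur[i + 1] == 0]
        rows.append(len(paired))
        drop = set(paired) | {i + 1 for i in paired}
        cur = [b for i, b in enumerate(cur) if i not in drop]
    return rows


def box_ball_step(eta):
    """T computed with the carrier rule on a padded finite list."""
    cfg = list(eta) + [0] * sum(eta)
    out, load = [], 0
    for ball in cfg:
        if ball:
            load += 1
            out.append(0)
        elif load > 0:
            load -= 1
            out.append(1)
        else:
            out.append(0)
    return out


eta = [1, 1, 1, 0, 0, 1, 0, 1, 1]
rows_before = pairing_rows(eta)
rows_after = pairing_rows(box_ball_step(eta))
```

In this example both `rows_before` and `rows_after` equal `[3, 2, 1]`, in agreement with Proposition 3.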

3.7 Young diagrams

We discuss now a generalization of the conservation property (12) to the case of infinite configurations and the relation with the conservation of the solitons. Since the numbers $r_{i}$ are monotone, it is natural to represent them using a Young diagram, [11]. A Young diagram is a diagram of left-justified rows of boxes where any row is not longer than the row on top of it. We can fix for example the number $r_{i}$ representing the length of the row number $i$ from the top. The number of iterations $M$ corresponds to the number of rows. The Young diagram associated to the example in Fig. 2 is therefore

[TABLE]

This diagram can be naturally codified by the numbers $r_{i}$ , representing the sizes of the rows, as $(8,2,1,1)$ . Another way of codifying a Young diagram is by the sizes of the columns. This gives another Young diagram that is called the conjugate diagram and it is obtained by reflecting the diagram across the diagonal. The same diagram (15) can therefore be codified as $[4,2,1,1,1,1,1,1]$ . Finally another equivalent codification can be given specifying the numbers $n_{1},n_{2},\dots,n_{M}$ of columns of length respectively $1,2,\dots,M$ . For the Young diagram above we have for example $n_{1}=6,n_{2}=1,n_{3}=0,n_{4}=1$ . The numbers $r_{i}$ and $n_{i}$ give alternative and equivalent coding of the diagram and are related by

$$n_{i}=r_{i}-r_{i+1},\qquad i=1,\dots,M,\qquad(16)$$

where we set $r_{M+1}:=0$ .
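The three codings of a Young diagram discussed above can be converted into each other with a few lines of Python. The sketch below is only an illustration; the function names are ours, and the relation used between the two codings is $n_{i}=r_{i}-r_{i+1}$ with $r_{M+1}=0$, which is the one consistent with the example $(8,2,1,1)$.

```python
def columns_from_rows(rows):
    """Column lengths (conjugate diagram) from row lengths."""
    if not rows:
        return []
    return [sum(1 for r in rows if r > j) for j in range(rows[0])]


def soliton_counts(rows):
    """n_i = r_i - r_{i+1}, with r_{M+1} = 0, for i = 1..M."""
    ext = list(rows) + [0]
    return [ext[i] - ext[i + 1] for i in range(len(rows))]


def rows_from_counts(counts):
    """Inverse map: r_i is the number of columns of length at least i."""
    return [sum(counts[i:]) for i in range(len(counts))]


rows = [8, 2, 1, 1]                  # row lengths of the example of Fig. 2
columns = columns_from_rows(rows)    # column lengths
counts = soliton_counts(rows)        # n_1, ..., n_4
```

For the row lengths $(8,2,1,1)$ this gives the columns `[4, 2, 1, 1, 1, 1, 1, 1]` and the counts `[6, 1, 0, 1]`, as in the text.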

The number $n_{i}$ can be interpreted as the number of solitons of length $i$ . Take for example the diagram (15) and cut it into vertical slices obtaining

[TABLE]

The original Young diagram can be reconstructed by gluing together the columns in decreasing order from left to right and justifying all of them to the top. Each column of height $k$ in (17) represents a $k$ -soliton of the dynamics. We do not give a formal proof of this statement; it can however easily be obtained from the construction in §4.2. We will show indeed that the soliton decomposition can be naturally done using trees codifying excursions. In §4.2 we show how the trees can be constructed using the lines of the first definition in §2.1, getting directly the relationship between the Young diagrams and the solitons. According to this, the configuration $\eta$ whose associated Young diagram is (15), obtained by gluing together again the columns in (17), contains one 4-soliton, one 2-soliton and 6 1-solitons.

The Young diagram contains only some information about the configuration of balls, i.e. the map that associates to $\eta$ its Young diagram is not invertible, and for example there are several configurations of balls giving (15) as a result. The one in Fig. 2 is just one of them. Essentially the Young diagram contains just the information concerning the numbers of solitons contained in the configuration but not the way in which they are combinatorially organized.

In the example discussed above we worked with a configuration of balls having one single non trivial finite excursion. Consider now a finite configuration $\eta$ whose walk representation contains more than one excursion. Our argument on the conservation of the numbers $r_{i}$ proves that the global Young diagram associated to the whole configuration is invariant under the dynamics. Let us consider however the single excursions separately. Recall that two different excursions are separated by empty boxes from which no lines exit. For example in Fig. 4 there are 3 excursions, which we surrounded by rectangles to distinguish them.

We construct for each excursion separately the corresponding Young diagram. For the example of Fig. 4 the three Young diagrams are

[TABLE]

By definition the global Young diagram that is preserved by the dynamics is the one having as length of the first row (the number $r_{1}$ ) the sum of the lengths of the first rows of the three diagrams, as length of the second row (the parameter $r_{2}$ ) the sum of the length of the second rows of all the Young diagrams and so on. This means that the global Young diagram is obtained suitably joining together the single Young diagrams. In particular the gluing procedure is the following. We have to split the columns of each single diagram then put all the columns together and glue them together as explained before, i.e. arranging them in decreasing order from left to right and justifying all of them to the top. For example the first Young diagram on the left in (18) is split into

[TABLE]

For the second diagram in (18) we have two columns of size 1 $\yng(1)\ \yng(1)$ while for the third one we have one single column of size 2 $\yng(1,1)$ .

The global Young diagram for the example of Fig. 4 is therefore

[TABLE]

The number $n_{i}$ of columns of length $i$ in the global diagram is obtained as the sum of the number of columns of size $i$ on the single diagrams. Also the numbers $r_{i}$ are obtained summing the corresponding row lengths on each single group (with the usual convention that a Young diagram with $M$ rows has $r_{j}=0$ for $j>M$ ).
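The gluing rule can be sketched in a few lines of Python. This is only an illustration under our own conventions: a Young diagram is represented by its list of row lengths (top to bottom), the function names `columns`, `rows_from_columns` and `glue` are ours, and the three input diagrams are hypothetical, not those of Fig. 4.

```python
def columns(rows):
    """Column lengths (left to right) of a Young diagram given by its row lengths."""
    if not rows:
        return []
    return [sum(1 for r in rows if r > j) for j in range(rows[0])]


def rows_from_columns(cols):
    """Row lengths of the diagram obtained by top-justifying the columns in decreasing order."""
    cols = sorted(cols, reverse=True)
    if not cols:
        return []
    return [sum(1 for c in cols if c > i) for i in range(cols[0])]


def glue(diagrams):
    """Split every diagram into columns, pool the columns, and reassemble one diagram."""
    pooled = [c for rows in diagrams for c in columns(rows)]
    return rows_from_columns(pooled)


# Hypothetical example: three single-excursion diagrams given by row lengths.
single = [[3, 1], [2], [1, 1]]
print(glue(single))  # [6, 2]
```

In this example the first row of the glued diagram has length $3+2+1=6$ and the second row has length $1+0+1=2$, in agreement with the row-sum description of $r_{1}$ and $r_{2}$.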

The shapes of the single diagrams in (18) are not invariant under the dynamics. Even the number of such diagrams is not conserved, since during the evolution the number of excursions may change. What is conserved is the total number of columns of each given size. More precisely, given a configuration $\eta$ we can construct the Young diagram of each excursion and then cut the diagrams into single columns. The configuration of balls $T\eta$ will have different excursions with different Young diagrams, but these are obtained by recombining into separate Young diagrams the same columns obtained for the configuration $\eta$ . The Box-Ball dynamics preserves the number of columns of size $k$ for each $k$ . Indeed, this is nothing else than a different identification of the traveling solitons, again by the construction in §4.2.

If $\eta$ is an infinite configuration with a walk having all the records, we can construct a Young diagram for each excursion. Cutting the diagrams along the columns we obtain the solitons contained in the excursion.

*Slot diagrams and Young diagrams.* Since a slot diagram describes the number of solitons per slot, we can associate a Young diagram to a slot diagram $x$ as follows: $M(x)$ is the number of rows and $n_{k}$ is the number of columns of length $k$ . The diagram is constructed by gluing $n_{M}$ columns of length $M$ , then $n_{M-1}$ columns of length $M-1$ , and so on up to $n_{1}$ columns of length $1$ . For example, the Young diagram associated to the slot diagram (3.5) is given by

[TABLE]

4 Trees, excursions and slot diagrams

In this section we provide an alternative decomposition of an excursion using a bijection between soft excursions and planar trees. The construction is a slight variant of the classical bijection between strict excursions and planar rooted trees, see [12, 5, 13]. There are several ways of codifying planar trees (see for example [13]). We will try to use a direct pictorial approach, introducing as little algebraic notation as possible.

4.1 Tree representation of excursions

In this subsection we summarize classical results mapping finite trees to excursions, see for instance §1.1 of the lecture notes of Le Gall [13]. Start with the graph of a soft excursion as in Fig. 18. Draw horizontal lines corresponding to the integer values of the height. The region below the graph of the excursion is cut into disjoint components by the horizontal lines. Associate one node to each connected component. The root is the node corresponding to the bottom region. The tree is obtained by drawing an edge between nodes whose associated components share a piece of a horizontal line. The construction is illustrated in Fig. 18 where the root is drawn as a $\bullet$ while the other nodes as a $\circ$ .

The tree that we obtain is rooted since there is a distinguished vertex and it is planar. This means that it is embedded on the plane where the graph of the excursion is drawn. A consequence of this specific embedding is that every vertex different from the root has an edge incoming from below and all the other edges are ordered from left to right going clockwise.

In [13] the map from the planar tree to the excursion is given in terms of a Dyck path. The excursion gives the distance to the root of a vehicle that travels around the tree at speed one edge per unit of time. The reverse bijection amounts to gluing the edges face to face below the excursion (in order to recover the edges), as in Fig. 19.
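The contour description can be made concrete with a short Python sketch. The conventions are our own: an excursion is a list of steps $+1$ and $-1$, a planar tree is a nested list in which each node is the list of its children from left to right, and the function names are illustrative.

```python
def excursion_to_tree(steps):
    """Build the planar rooted tree of an excursion given as a list of +1/-1 steps.

    Every up-step opens a new child of the current node; every down-step
    returns to the parent. The root is the outermost list.
    """
    root = []
    stack = [root]
    for s in steps:
        if s == 1:
            child = []
            stack[-1].append(child)
            stack.append(child)
        else:
            if len(stack) == 1:
                raise ValueError("path goes below its starting level")
            stack.pop()
    if len(stack) != 1:
        raise ValueError("path does not return to its starting level")
    return root


def tree_to_excursion(tree):
    """Contour walk: go down to each child (+1), explore it, come back (-1)."""
    steps = []
    for child in tree:
        steps.append(1)
        steps.extend(tree_to_excursion(child))
        steps.append(-1)
    return steps


steps = [1, 1, -1, 1, -1, -1, 1, -1]
tree = excursion_to_tree(steps)
print(tree)                              # [[[], []], []]
print(tree_to_excursion(tree) == steps)  # True
```

The round trip at the end illustrates, on one small example, that the two maps are inverse to each other.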

4.2 Trees and pairing algorithm

The tree associated to an excursion can be constructed using the pairing definition of the dynamics of Fig. 2. As before, draw dashed horizontal lines in correspondence of the integer heights that cut the epigraph of the excursion into disjoint regions. Pair the opposite diagonal faces of each region, connected by dashed double arrows in Fig. 19. Since the left face is of type $\diagup$ and the right one is of type $\diagdown$ , corresponding respectively to balls and empty boxes, we obtain exactly the pairing of the first definition of the dynamics. Indeed, the pairings of the first iteration of the first definition of the dynamics coincide exactly with the pairing of the two opposite diagonal sides near each local maximum. Then remove the paired objects and iterate to obtain a proof.

We construct the planar tree associating the root to the unbounded upper region of the upper half plane and one node to each pairing line. Nodes associated to maximal lines are linked to the root. Consider a node A associated to a maximal line. Node B associated to another line is connected to A if: 1) the line associated to B is surrounded by the line associated to A and 2) removing the maximal line associated to A the line associated to B becomes maximal. The tree is constructed after a finite iteration of this algorithm, see Fig. 20 where the planar tree is red and downside oriented.
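A minimal Python sketch of this construction, assuming that a single excursion is encoded as a list of 0/1 values (1 for a ball, 0 for an empty box) and that each pairing line is named by the position of its ball; the function name `pair_and_nest` is ours. A stack reproduces the iterated pairing: each empty box is paired with the most recent unpaired ball, and the line surrounding a new ball is the one on top of the stack.

```python
def pair_and_nest(eta):
    """Pair balls with empty boxes and record which pairing line surrounds which.

    Returns (pairs, parent): pairs[ball] is the position of the paired empty box,
    parent[ball] is the ball of the innermost surrounding line, or None when the
    line is maximal (its node is linked to the root).
    """
    pairs, parent, stack = {}, {}, []
    for i, x in enumerate(eta):
        if x == 1:
            parent[i] = stack[-1] if stack else None
            stack.append(i)
        else:
            if not stack:
                raise ValueError("empty box with no ball to pair: not an excursion")
            pairs[stack.pop()] = i
    if stack:
        raise ValueError("unpaired balls: not an excursion")
    return pairs, parent


pairs, parent = pair_and_nest([1, 1, 0, 1, 0, 0, 1, 0])
print(sorted(pairs.items()))  # [(0, 5), (1, 2), (3, 4), (6, 7)]
print(parent)                 # {0: None, 1: 0, 3: 0, 6: None}
```

Here the lines starting at positions 0 and 6 are maximal, and the lines starting at positions 1 and 3 become maximal once the line starting at 0 is removed.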

4.3 Branch identification of planar trees

We discuss a natural branch decomposition of a rooted planar tree that is in correspondence with the soliton decompositions previously discussed.

We give 3 equivalent algorithms to identify the branches of a planar rooted tree.

Branch identification I

Step 1. Let $A_{1}$ be the set of the leaves (nodes with only one neighbor). Associate a distinct color and the generation number 1 to each leaf. The root is black, a color not allowed for the other nodes.

Step $\ell$ . Let $A_{\ell-1}$ be the set of numbered and colored nodes after $\ell-1$ steps. Let $\mathtt{N}_{\ell}$ be the set of nodes with all offsprings in $A_{\ell-1}$ . To each $\mathtt{n}\in\mathtt{N}_{\ell}$ give the color of the rightmost neighbor among those with bigger generation number, say $g$ , and give generation number $g+1$ to $\mathtt{n}$ . Stop when all nodes are colored.

In Fig. 21 give a distinct color to each leaf (we have for simplicity repeated colors in the picture). In each step to each not-yet-colored node with all offsprings already colored give the color of the rightmost maximal offspring. After coloring all nodes, identify the color of branches of the same size (knowing the result, we have started with those colors already identified).

A $k$ -branch is a one-dimensional path with $k$ nodes all of the same color and $k$ edges, one of which is incident to a node of a different color. In Fig.21 we have colored the tree produced by the excursion in Fig.18 and have identified 2 violet 1-branches, 2 red 2-branches, 1 green 3-branch and 2 blue 4-branches (for simplicity we used a simplified convention for color, see the caption for the explanation).
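The generation rule can be sketched in Python as follows. We assume that a planar tree is given as nested lists (each node is the list of its children from left to right, the outermost list being the root) and that "rightmost" means last in this order; the function names are ours. Only the branch sizes are computed, not the colors.

```python
def generation(node):
    """Number of nodes on the longest path from this node down to a leaf."""
    return 1 + max((generation(c) for c in node), default=0)


def branch_sizes(root):
    """Sizes of the branches of a planar rooted tree, in decreasing order.

    A node passes its color to the rightmost child of maximal generation;
    every other child, and every child of the root, starts a new branch
    whose size is its generation number.
    """
    sizes = []

    def walk(node, starts_branch):
        if starts_branch:
            sizes.append(generation(node))
        if node:
            gens = [generation(c) for c in node]
            cont = max(i for i, g in enumerate(gens) if g == max(gens))
            for i, child in enumerate(node):
                walk(child, starts_branch=(i != cont))

    for child in root:
        walk(child, True)
    return sorted(sizes, reverse=True)


# Tree with 6 non-root nodes: one 3-branch, one 2-branch and one 1-branch.
print(branch_sizes([[[], [[]]], [[]]]))  # [3, 2, 1]
```

The sizes always add up to the number of non-root nodes, since every such node belongs to exactly one branch.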

Branch identification II

Step 0. Enumerate the colors. In our example we use violet for 1-branches, red for 2-branches, green for 3-branches and blue for 4-branches.

Step 1. Paint all leaves with color 1, violet.

Step $\ell$ . Update those nodes with all offsprings entering into nodes already colored during steps 1 up to $\ell-1$ . Give color $\ell$ to updating nodes and change to color $\ell$ those nodes belonging to the rightmost offspring path of size $\ell$ starting from each updating node. See Fig. 22.

In Fig. 22 we give color 1 (violet in this case) to each leaf. In step $2$ (a) give color $2$ (red) to all nodes having all offsprings already colored and (b) change to color $2$ each already colored node belonging to the rightmost offspring path with $2$ nodes starting at each updating node. In step 3 use color green and in step 4 use color blue. The final branch decomposition is the same as in Fig.21.

Branch identification III

Step 0. Orient the tree toward the root. Consider the oriented paths starting from the leaves of the tree. Remove the root but not the edges incident to the root.

Step 1: Search for the maximal directed paths starting from the leaves. If two or more of them share at least one edge, select just the rightmost path among those. Observe that the last edge is incident only to one node. A selected path with $k$ nodes is named $k$ -branch. Remove the selected branches.

Step 2. If all paths have been removed, then stop. Otherwise go to step 1.

The tree is oriented just to define the procedure. The branches selected and removed constitute the branch decomposition of the tree. In Figure 23 we apply this procedure to the same example of the previous procedures. The result is the same.

In Fig. 23 the first square represents the first iteration. There are 3 paths of length 4 sharing the left edge incident to the root and two paths of length 4 sharing 3 edges. The rightmost path of each group is identified as a 4-branch and colored blue. The second iteration identifies one 3-branch in green; the third iteration identifies two 2-branches and the fourth iteration identifies two 1-branches. Putting the colored branches back in their original position we obtain the last picture of Fig. 22.
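The peeling procedure of Branch identification III can be sketched in Python under our own conventions: the tree is a nested list (each node is the list of its children from left to right, the outermost list being the root), nodes are numbered in depth-first order, and "rightmost" means last among the children. The function name `peel_branches` is illustrative.

```python
def peel_branches(root):
    """Peel off the longest rightmost paths, iteration by iteration.

    Returns the branches as lists of node ids, from the top node to the leaf.
    """
    kids = {}
    next_id = 0

    def index(node):
        nonlocal next_id
        me = next_id
        next_id += 1
        kids[me] = [index(c) for c in node]
        return me

    # "tops" are the nodes whose parent is the root or has already been removed
    tops = [index(c) for c in root]

    def height(v):
        return 1 + max((height(c) for c in kids[v]), default=0)

    branches = []
    while tops:
        longest = max(height(t) for t in tops)
        new_tops = []
        for t in tops:
            if height(t) < longest:
                new_tops.append(t)      # not maximal yet: wait for a later iteration
                continue
            path, v = [], t
            while True:
                path.append(v)
                if not kids[v]:
                    break
                hs = [height(c) for c in kids[v]]
                k = max(i for i, h in enumerate(hs) if h == max(hs))
                # the children left aside become tops of the remaining pieces
                new_tops.extend(c for i, c in enumerate(kids[v]) if i != k)
                v = kids[v][k]
            branches.append(path)
        tops = new_tops
    return branches


print(peel_branches([[[], [[]]], [[]]]))  # [[0, 2, 3], [4, 5], [1]]
```

In this small example the first iteration removes a 3-branch, the second a 2-branch and the third a 1-branch.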

4.4 Tree-induced soliton decomposition of excursions

We now take the tree produced by an excursion, as illustrated in Fig. 18, use any algorithm of §4.3 to identify its branches and use the colored tree to identify solitons, as follows. Put the colored tree back into the excursion and color the diagonal boundaries of the region associated to each node with the color of the node. Each $k$ -branch is then associated to $k$ empty and $k$ occupied boxes with the same color; we call those boxes and their content a $k$ -soliton*. We use the * to indicate solitons and slots in the tree-induced decomposition. In this case all solitons* are oriented up, that is, the head of each soliton* is to the left of its tail. See Fig. 24.
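The whole pipeline, from an excursion to the boxes of each soliton*, can be sketched in Python. The assumptions are ours: the excursion is a list of 0/1 values (1 for a ball, 0 for an empty box), a node of the tree is named by the position of its ball, "rightmost" means the child with the largest position, and branches are labeled by integers instead of colors.

```python
def tree_solitons(eta):
    """Label every box of an excursion with the branch its node belongs to.

    Returns (labels, sizes): labels[i] is the branch index of box i and
    sizes[b] is the size k of branch b, so branch b is a k-soliton*.
    """
    ROOT = -1
    kids = {ROOT: []}   # node -> children, left to right
    close = {}          # ball position -> position of the paired empty box
    stack = [ROOT]
    for i, x in enumerate(eta):
        if x == 1:
            kids[i] = []
            kids[stack[-1]].append(i)
            stack.append(i)
        else:
            if len(stack) == 1:
                raise ValueError("not an excursion")
            close[stack.pop()] = i
    if len(stack) != 1:
        raise ValueError("not an excursion")

    # generation numbers: children sit to the right of their parent
    gen = {}
    for v in sorted(kids, reverse=True):
        if v != ROOT:
            gen[v] = 1 + max((gen[c] for c in kids[v]), default=0)

    label, sizes = {}, []
    for v in sorted(kids):      # ROOT first, then parents before children
        cs = kids[v]
        if not cs:
            continue
        top = max(gen[c] for c in cs)
        cont = max(c for c in cs if gen[c] == top)  # rightmost child of maximal generation
        for c in cs:
            if v != ROOT and c == cont:
                label[c] = label[v]     # the branch continues through c
            else:
                label[c] = len(sizes)   # c starts a new branch
                sizes.append(gen[c])

    labels = [None] * len(eta)
    for v, b in label.items():
        labels[v] = labels[close[v]] = b
    return labels, sizes


labels, sizes = tree_solitons([1, 1, 0, 1, 1, 0, 0, 0, 1, 1, 0, 0])
print(labels)  # [0, 2, 2, 0, 0, 0, 0, 0, 1, 1, 1, 1]
print(sizes)   # [3, 2, 1]
```

In the example, branch 0 is a 3-soliton* occupying 3 balls and 3 empty boxes, branch 1 is a 2-soliton* and branch 2 is a 1-soliton*; in each of them the balls lie to the left of the empty boxes.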

Proposition 4 (HT and tree decomposition).

Given any excursion $\varepsilon$ , the HT soliton decomposition of $\varepsilon$ coincides with the tree decomposition of $\varepsilon$ .

Proof.

This proposition is a consequence of Proposition 5 below, given in terms of the slot diagrams of both objects. ∎

4.5 Slot diagrams of planar trees

Think of each node of a tree as a geometric object. More precisely, identify each node with a circumference that is exactly the boundary of the associated colored region, as in Figure 25. Each edge incident to the node is now a segment intersecting the circumference; different edges intersect it at different points, called incident points. By convention, we assume that there is a segment incident to the root from below. The arcs of the circumference with extremes in the incident points and with no incident point in the interior are called slots*. We will describe a procedure to attach new branches to slots*. We use the same symbol $*$ for the solitons of the previous section and for the slots here, since there is a direct correspondence between the solitons* and the slot* diagram for the branches of the tree.

We say that a node of a tree has $k$ generations if it is colored in iteration number $k$ of the algorithm Branch identification II. This is equivalent to saying that the maximal path from the node to a leaf, moving always in the opposite direction with respect to the root, has $k$ nodes, including the node and the leaf.

Slots identification of trees I

Consider a colored tree with maximal branch of size $m$ . Declare the whole circumference of the root of the tree as the $m$ -slot* number 0; recall there is an incident edge to this node from below. Attach the $m$ -branches to the unique $m$ -slot*. Proceed then iteratively for $k<m$ . Assume that the tree has no $\ell$ -branches for $\ell\leq k$ and call a slot* $\mathtt{s}$ a $k$ -slot* if one of the following conditions holds: (a) $\mathtt{s}$ belongs to the root, (b) $\mathtt{s}$ belongs to a node with more than $k$ generations, (c) $\mathtt{s}$ belongs to a node with $k$ generations and every path with $k$ nodes that contains a leaf and is incident to the node is incident to the right of $\mathtt{s}$ . The $k$ -slots* are numbered from left to right, starting with $k$ -slot* $0$ at the left side of the node associated to the record. See Fig. 25.

Slot diagram of a tree

The slot diagram of the tree is a collection of vectors

[TABLE]

where $m$ is the length of the longest path in the tree and

[TABLE]

In particular the slot* diagram of Fig.25 is given by $m=4$ and

[TABLE]

using the slot enumeration in Fig.25. We illustrate this slot diagram in Fig. 26.

Slots identification of trees II

A reverse way to find the slot* diagram of a colored tree with identified slots* is the following. Remove the $1$ -branches, keeping track of the index of the 1-slot* each branch was attached to. Assume we have removed the $\ell$ -branches for $\ell<k$ . Then remove the $k$ -branches, keeping track of the $k$ -slot* number associated to each removed $k$ -branch. The slot* diagram associated to the tree consists of the removed branches and their associated slot* numbers. See Fig. 27.

Fig. 27. Upper-left: a colored tree. Upper-right: erasing 1-branches in the tree, we identify and enumerate $1$ -slots*. Lower-left, erasing 1-branches and 2-branches, we identify and enumerate 2-slots*. Lower-right: in a tree with 4-branches we identify and enumerate 3-slots*. The node associated to the record, in black, has one $3$ -slot* for each arc.

4.6 From paths to trees

We illustrate now the reverse operation. Start with the slot diagram obtained in Fig.26. Put the root. Let $m$ be the biggest size of the branches in the slot diagram. Attach the $m$ -branches to the root. Then successively for $k=m-1,\dots,1$ attach the $k$ branches to the associated $k$ -slot in the tree. The result is illustrated in Fig.27 looking at it backwards: In rectangle 4 we attach 2 4-branches to 4-slot 0 and indicate the place and number of each 3-slot; in rectangle 3 we attach one 3-branch to 3-slot number 1 and so on.

Proposition 5.

Given a finite excursion $\varepsilon$ we have $x^{\diamond}[\varepsilon]=x^{*}[\varepsilon]$ .

Sketch proof.

We give a sketch of the proof showing the basic idea. Consider an arbitrary slot diagram $x$ . We are going to show that the excursion $\varepsilon$ characterized by $x^{\diamond}[\varepsilon]=x$ and the excursion $\varepsilon^{\prime}$ characterized by $x^{*}[\varepsilon^{\prime}]=x$ are the same, i.e. $\varepsilon=\varepsilon^{\prime}$ . This implies the statement of the Proposition. Recall that we have constructed the excursion associated to $x^{\diamond}[\varepsilon]$ iteratively in §3.5 gluing one after the other some special triangles. We just showed instead that to construct $x^{*}[\varepsilon^{\prime}]$ we have to glue recursively the branches like the ones in Fig. 26 glued in Fig. 27 (recall that the gluing procedure has to be followed in the reverse order).

Since $x$ is the same, both procedures deal with the same number of $k$ -triangles and $k$ -branches to be attached to the same slots. The proof is therefore based on the correspondence between the two different procedures once we fix the basic correspondence of Fig. 28 between the two basic building blocks.

Considering the example of §3.5 we show in Fig. 29 the construction of the tree associated to the excursion $\varepsilon^{\prime}$ such that $x^{*}[\varepsilon^{\prime}]=x$ where $x$ is the slot diagram (3.5). This figure has to be compared with Fig. 16 where we constructed the excursion $\varepsilon$ such that $x^{\diamond}[\varepsilon]=x$ where $x$ is again (3.5). In Fig. 29, for simplicity, we draw just the slots* useful for the attachments. Looking carefully at the two constructions in parallel, the reader can see that at each step the excursion is the same and the allocation of the slots is again the same. A long formal proof could be given following this strategy.

5 Soliton distribution

We report in §5.1 a family of distributions on the set of excursions proposed by the authors [6] based on the slot decomposition of the excursions. In particular, the slot diagram of the excursion of a random walk satisfies that given the $m$ -components for $m>k$ , the distribution of the $k$ -component is a vector of independent Geometric random variables; the size of the vector is a function of the bigger components. As a consequence, we obtain that the distribution of the $k$ -branches of the tree associated to the excursion of the random walk given the $m$ -branches, for $m>k$ , is a vector of independent geometric random variables.

Theorem 7 considers a random ball configuration consisting of iid Bernoulli variables of parameter $\lambda<\frac{1}{2}$ , conditioned to have a record at the origin, and shows that its components are independent and that the $k$ -component consists of iid geometric random variables.

Since the measure is given in terms of the number of solitons and slots of the excursion, and those numbers are the same in all the slot diagrams we have introduced, we just work with a generic slot diagram.

5.1 A distribution on the set of excursions

Let $n_{k}(\varepsilon)$ be the number of $k$ -solitons in the excursion $\varepsilon$ and for $\alpha=(\alpha_{k})_{k\geq 1}\in[0,1)^{\mathbb{N}}$ define

[TABLE]

with the convention $0^{0}=1$ . Define

[TABLE]

This set has a complex structure since the expression (25) is difficult to handle. For $\alpha\in\mathcal{A}$ define the probability measure $\nu_{\alpha}$ on $\mathcal{E}$ by

[TABLE]

For $q\in(0,1]^{\mathbb{N}}$ define the operator $A:q\mapsto\alpha$ by

[TABLE]

Reciprocally, define the operator $Q:\alpha\mapsto q$ by

[TABLE]

Let

[TABLE]

The next result gives an expression of $\nu_{\alpha}(\varepsilon)$ in terms of the slot diagram of $\varepsilon$ .

Theorem 6 (Ferrari and Gabrielli [6]).

(a)

Let $q\in\mathcal{Q}$ , $\alpha=Aq$ and $\nu_{\alpha}$ given by (27). Then, $\alpha\in\mathcal{A}$ and

[TABLE]

where $n_{k}$ and $s_{k}$ are the numbers of $k$ -solitons and $k$ -slots of $\varepsilon$ , respectively.

(b)

The map $A:\mathcal{Q}\to\mathcal{A}$ is a bijection with $Q=A^{-1}$ .

The proof of (a) given below shows that if $q\in\mathcal{Q}$ then $Aq\in\mathcal{A}$ with $Z_{Aq}=(\prod_{k\geq 1}q_{k})^{-1}$ . On the other hand, to complete the proof of (b) it suffices to show that $Q\alpha\in\mathcal{Q}$ . The proof of this fact is more involved and can be found in [6].

If we denote $x_{k}^{\infty}=(x_{k},x_{k+1},\dots)$ , the expression (32) is equivalent to the following (with the convention $q_{0}:=0$ to take care of the empty excursion).

[TABLE]

where we abuse notation writing $x_{m}$ as “the set of excursions $\varepsilon$ whose $m$ -component in $x[\varepsilon]$ is $x_{m}$ ”, and so on. Recall that $n_{k}$ is the number of $k$ -solitons of $x$ and $s_{k}$ is the number of $k$ -slots of $x$ , a function of $x_{k+1}^{\infty}$ .

Formulas (33) to (35) give a recipe to construct the slot diagram of a random excursion with law $\nu_{\alpha}$ : first choose a maximal soliton-size $m$ with probability (33) and use (34) to determine the number of maximal solitons $x_{m}(0)$ (a Geometric $(q_{m})$ random variable conditioned to be strictly positive). Then we use (35) to construct iteratively the lower components. In particular, (35) says that under the measure $\nu_{\alpha}$ and conditioned on $x_{k+1}^{\infty}$ , the variables $\left(x_{k}(0),\dots x_{k}(s_{k}-1)\right)$ are i.i.d. Geometric $(q_{k})$ .
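The recipe can be sketched as a small sampler in Python. Several points are our assumptions and not part of the statements above: the parameters are truncated to sizes $k\leq K$ (as if $q_{k}=1$ for $k>K$ ), Geometric $(q)$ is taken to mean $\mathbb{P}(X=n)=q(1-q)^{n}$ for $n\geq 0$ , and the number of $k$ -slots is computed as $s_{k}=1+\sum_{m>k}2(m-k)n_{m}$ , which is the slot count as we recall it from [6, 7]. Sampling every component from the top, with a single slot as long as no soliton has appeared, is our reading of (33) and (34): the largest $k$ with $x_{k}(0)>0$ plays the role of $m$ . The function names are illustrative.

```python
import random


def geometric(q, rng):
    """Number of failures before the first success of probability q, 0 < q <= 1."""
    n = 0
    while rng.random() >= q:
        n += 1
    return n


def sample_slot_diagram(q, rng):
    """Sample a slot diagram x = {k: [x_k(0), ..., x_k(s_k - 1)]} for k = K, ..., 1.

    q[k-1] is the parameter attached to soliton size k; sizes above K = len(q)
    are excluded (truncation assumption).
    """
    K = len(q)
    x, n = {}, {}
    for k in range(K, 0, -1):
        # assumed slot count: each m-soliton with m > k carries 2(m-k) k-slots
        s_k = 1 + sum(2 * (m - k) * n[m] for m in range(k + 1, K + 1))
        x[k] = [geometric(q[k - 1], rng) for _ in range(s_k)]
        n[k] = sum(x[k])
    return x


rng = random.Random(2020)
x = sample_slot_diagram([0.6, 0.7, 0.9], rng)
for k in sorted(x, reverse=True):
    print(k, x[k])
```

The length of each sampled component depends only on the components of larger size, which is the conditional structure described by (35).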

Proof of Theorem 6 (a).

Using formula (9), we have

[TABLE]

because $Z_{\alpha}=\big{(}\prod_{n\geq 1}q_{n}\big{)}^{-1}<\infty$ since $q\in\mathcal{Q}$ . ∎

5.2 Branch distribution of the random walk excursion tree

For $\lambda\leq\frac{1}{2}$ define $\alpha=\alpha(\lambda)$ by

[TABLE]

Then $\alpha(\lambda)\in\mathcal{A}$ and $\nu_{\alpha(\lambda)}$ is the law of the excursion of a simple random walk that has probability $\lambda$ to jump up and $1-\lambda$ to jump down. A computation using the Catalan numbers shows that

[TABLE]

On the other hand, the probability that the random walk performs a fixed excursion of length $2n$ is $\lambda^{n}(1-\lambda)^{n+1}$ , where the extra $(1-\lambda)$ is the probability that the walk jumps down after the $2n$ steps of the excursion. This gives (41) with no computations.
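As a numerical sanity check of the weight $\lambda^{n}(1-\lambda)^{n+1}$ , one can verify that these weights sum to one over all excursions when $\lambda<\frac{1}{2}$ . The sketch below uses the standard fact that the number of excursions of length $2n$ is the Catalan number $C_{n}$ ; the function name and the truncation level `n_max` are our choices.

```python
def excursion_mass(lam, n_max=2000):
    """Partial sum over n <= n_max of C_n * lam**n * (1 - lam)**(n + 1)."""
    x = lam * (1.0 - lam)
    term = 1.0 - lam          # n = 0: the empty excursion
    total = term
    for n in range(n_max):
        # Catalan ratio: C_{n+1} / C_n = 2 (2n + 1) / (n + 2)
        term *= 2.0 * (2 * n + 1) / (n + 2) * x
        total += term
    return total


print(abs(excursion_mass(0.3) - 1.0) < 1e-9)  # True
```

The same conclusion follows from the Catalan generating function $\sum_{n\geq 0}C_{n}x^{n}=\frac{1-\sqrt{1-4x}}{2x}$ evaluated at $x=\lambda(1-\lambda)$ , where $\sqrt{1-4x}=1-2\lambda$ for $\lambda\leq\frac{1}{2}$ , so the sum equals $\frac{1}{1-\lambda}$ .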

In terms of the branches of the tree associated to the excursion, one chooses the size $m$ of the largest branch of the tree with (33) and uses (34) to decide how many maximal branches are attached to the root of the tree. Then one identifies the $(m-1)$ -slots and proceeds iteratively using (35) to attach the branches of lower size. Given the branches of size bigger than $k$ already present in the tree, the numbers of $k$ -branches per $k$ -slot $\left(x_{k}(0),\dots x_{k}(s_{k}-1)\right)$ are i.i.d. Geometric $(q_{k})$ , with $q_{k}$ given iteratively by

[TABLE]

5.2.1 Geometric branching processes

Let $\rho$ be a probability measure on $\mathbb{N}\cup\{0\}$ . A branching process with offspring distribution $\rho$ is a random growing tree defined as follows. Let $\left(X_{i}^{j}\right)_{i,j\in\mathbb{N}}$ be a double indexed sequence of i.i.d. random variables having law $\rho$ .

At initial time zero the tree is constituted by one single vertex, the root. At time 1 there are $X_{1}^{1}$ individuals on the first generation, all of them are generated by the root and are drawn as vertices connected to the root. Give to them an arbitrary order from left to right embedding the tree on a plane. At time 2 each individual of the first generation produces independently a number of new vertices with distribution $\rho$ . More precisely the number of vertices produced by the individual number $i$ of the first generation is $X_{2}^{i}$ . Every such new vertex is connected by an edge to the parent vertex of the previous generation with an arbitrary order from left to right given by the embedding. Continue iteratively in this way with $X_{j}^{i}$ being the number of vertices of the generation $j$ produced by the individual number $i$ of the generation $j-1$ .

If $\sum_{k=0}^{+\infty}k\rho(k)<1$ the branching process is called subcritical and the above procedure produces a.e. a finite random planar tree. See [12] for more details, references, and the relation with the law of the corresponding excursions.

Let us consider the case when $\rho$ is the law of a geometric random variable Geometric( $1-\lambda$ ), i.e. $\rho(k)=\mathbb{P}(X^{i}_{j}=k)=(1-\lambda)\lambda^{k}$ , $k=0,1,\dots$ . In this case the probability of any given finite tree is $(1-\lambda)^{|V|}\lambda^{|V|-1}$ , where $|V|$ is the number of vertices, including the root. Since $|V|-1=n$ , where $2n$ is the length of the corresponding excursion (the correspondence is described in Section 4.1), this probability equals $\lambda^{n}(1-\lambda)^{n+1}$ , so the law of the excursion coincides with the law of the excursion of a simple random walk having probability $\lambda$ of jumping up and probability $1-\lambda$ of jumping down (see [12] for more details).
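The product form of the tree probability can be checked with a short Python sketch. The encoding is our own: a planar tree is a nested list in which each node is the list of its children, the outermost list being the root. The sampler explores the tree with an explicit stack and stops with an error beyond `max_vertices`, a safety bound that we introduce only for the illustration.

```python
import random


def tree_probability(tree, lam):
    """Product over all vertices of (1 - lam) * lam**(number of children)."""
    p = (1 - lam) * lam ** len(tree)
    for child in tree:
        p *= tree_probability(child, lam)
    return p


def count_vertices(tree):
    """Number of vertices, root included."""
    return 1 + sum(count_vertices(c) for c in tree)


def sample_tree(lam, rng, max_vertices=10_000):
    """Sample a branching process tree with Geometric(1 - lam) offspring law."""
    root = []
    todo = [root]
    size = 1
    while todo:
        node = todo.pop()
        k = 0
        while rng.random() < lam:   # P(k children) = (1 - lam) * lam**k
            k += 1
        for _ in range(k):
            child = []
            node.append(child)
            todo.append(child)
        size += k
        if size > max_vertices:
            raise RuntimeError("tree too large for this illustration")
    return root


lam = 0.3
tree = [[[], [[]]], [[]]]           # 7 vertices, root included
V = count_vertices(tree)
print(abs(tree_probability(tree, lam) - (1 - lam) ** V * lam ** (V - 1)) < 1e-12)  # True
print(count_vertices(sample_tree(lam, random.Random(1))))
```

The first check compares the vertex-by-vertex product with the closed form $(1-\lambda)^{|V|}\lambda^{|V|-1}$ ; it works because the numbers of children add up to $|V|-1$ .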

Using the result discussed in this paper we obtain therefore an alternative procedure of construction of a geometric branching process using independent but not identically distributed geometric random variables.

Consider the parameters $(q_{k})_{k\in\mathbb{N}}$ defined as in (42). The law of the maximal generation $M$ of the branching process is given by the right hand side of (33), i.e.

[TABLE]

Once the maximal generation has been fixed we attach, directly to the root, a number of maximal branches of size $M=m$ according to the distribution given on the right hand side of (34), i.e. a Geometric( $q_{m}$ ) conditioned to be positive. See for example Figure 25 where $m=4$ and we attach two 4-branches directly to the root in frame number 2.

We proceed now iteratively. Suppose that all the branches of size bigger than $k$ have been attached. Consider all the $k$ -slots* of the tree and attach to each of them an independent Geometric( $q_{k}$ ) number of $k$ -branches (see for example frame 3 of Figure 25 where we attach 3-branches and frame 4 where we attach 2-branches).

The final random tree obtained this way is a branching process with offspring law $\rho$ given by Geometric( $1-\lambda$ ).

5.3 Soliton decomposition of product measures in $\{0,1\}^{\mathbb{Z}}$

Forest of trees associated to configurations with infinitely many balls

Consider a configuration $\eta$ (with possibly infinitely many balls) and assume the walk $\xi=W\eta$ has a record at the origin and all records, that is $r(i,\xi)\in\mathbb{Z}$ for all $i\in\mathbb{Z}$ . Let $(\varepsilon^{i})_{i\in\mathbb{Z}}$ be the excursion decomposition of $\xi$ . Associating to each excursion the corresponding tree, we finish with a forest of trees each associated with an excursion, and sharing the slot diagrams of the excursion. See Fig. 31 for the trees associated to the ball configuration in Fig. 4.

Soliton decomposition of configurations with infinitely many balls

For the same walk $\xi$ with excursion components $(\varepsilon^{i})_{i\in\mathbb{Z}}$ , consider $(x^{i})_{i\in\mathbb{Z}}$ , the set of slot diagrams associated to those excursions. Recall $\varepsilon^{i}$ is the excursion between Record $i$ and Record $i+1$ .

We define the vector $\zeta\in\big{(}(\mathbb{N}\cup\{0\})^{\mathbb{Z}}\big{)}^{\mathbb{N}}$ obtained by concatenation of the $k$ -components of $x^{i}$ as follows:

[TABLE]

The components of $\zeta$ are $\zeta_{k}(j)\in\mathbb{N}\cup\{0\}$ with $k\in\mathbb{N}$ and $j\in\mathbb{Z}$ . For example, Fig. 31 contains a piece of $\xi$ between Record $-1$ and Record 6. The excursions $\varepsilon^{-1},\varepsilon^{1},\varepsilon^{3},\varepsilon^{4}$ are empty, so $s^{i}_{k}=1$ for $k\geq 1$ and $x^{i}_{k}(0)=0$ for $k\geq 0$ and $i=-1,1,3,4$ . The corresponding slot diagrams are

[TABLE]

So that the maximal soliton number in the slot diagram $i$ is $m^{i}=0$ for $i\in\{-1,1,3,4\}$ , $m^{0}=3$ , $m^{2}=2$ and $m^{5}=2$ .

Young diagram. To better explain graphically the definitions (43) and construct the piece of configuration $\zeta$ corresponding to the above excursions, we associate a Young diagram to each slot diagram, as follows: for each soliton size $k$ on the slot diagram $x$ pile one row of size $s_{k}$ for $k\leq m$ and one row of length 1 for all $k>m$ . We finish with an infinite column at slot 0 and all $k$ -slots of the same number piled on the same column. Taking the vertical coordinate as $k$ and the horizontal coordinate as $j$ , in box $(j,k)$ put $x_{k}(j)$ . Fig. 32 shows the Young diagrams corresponding to (5.3).

Once we have the slot diagrams of the excursions of $\xi$ as decorated Young tableaux, to obtain $\zeta$ it suffices to glue the rows of the same height into a unique row justified by column 0, as in the Fig. 33.
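The gluing of rows can be sketched in Python for the excursions with index $i\geq 0$ ; the rows coming from negative indices would be glued to the left in the same way. The conventions are our own: a slot diagram is a dictionary mapping each size $k\leq m$ to the list $(x_{k}(0),\dots,x_{k}(s_{k}-1))$ , a missing size $k>m$ stands for a row of length 1 containing 0, and sizes are truncated at a level `K` chosen for the illustration. The input diagrams are hypothetical, not those of Fig. 31.

```python
def concatenate(diagrams, K):
    """Glue, for each size k <= K, the k-rows of consecutive slot diagrams.

    diagrams: list of slot diagrams for excursions 0, 1, 2, ...
    Returns zeta = {k: list}, whose entry at position 0 is x_k(0) of excursion 0.
    """
    zeta = {k: [] for k in range(1, K + 1)}
    for x in diagrams:
        for k in range(1, K + 1):
            # sizes above the maximal soliton size contribute one empty slot
            zeta[k].extend(x.get(k, [0]))
    return zeta


# Hypothetical diagrams: one 2-soliton carrying two 1-solitons, an empty
# excursion, and an excursion made of three 1-solitons.
diagrams = [{2: [1], 1: [0, 2, 0]}, {}, {1: [3]}]
print(concatenate(diagrams, K=3))
# {1: [0, 2, 0, 0, 3], 2: [1, 0, 0], 3: [0, 0, 0]}
```

Each excursion contributes exactly one entry to every row of size larger than its maximal soliton size, which is why the rows above $k=2$ contain only zeros in this example.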

We call $\mathcal{A}^{+}$ the set of $\alpha$ such that the mean excursion size under $\nu_{\alpha}$ is finite:

[TABLE]

By definition we have $\mathcal{A}^{+}\subseteq\mathcal{A}$ . We define also

[TABLE]

Theorem 7 (From independent solitons to independent iid geometrics [6]).

If $\alpha\in\mathcal{A}^{+}$ and $(\varepsilon^{i})_{i\in\mathbb{Z}}$ are iid excursions with distribution $\nu_{\alpha}$ , then $(\zeta_{k})_{k\in\mathbb{N}}$ , with $\zeta_{k}\in(\mathbb{N}\cup\{0\})^{\mathbb{Z}}$ as defined in (43), is a family of independent configurations and for each $k$ , $(\zeta_{k}(j))_{j\in\mathbb{Z}}$ are iid random variables with distribution Geometric( $q_{k}$ ), where $A^{-1}\alpha=q\in\mathcal{Q}^{+}$ .

Proof.

A part of the proof is a direct consequence of (33)-(34)-(35). For the proof of the remaining statements see [6]. ∎

Acknowledgments

We thank Leo Rolla for many fruitful discussions and Jean François Le Gall for pointing out relevant references on excursion trees.

This project started when PAF was visiting GSSI at L’Aquila in 2016. He is grateful for the hospitality and support. Part of this project was developed during the stay of the authors at the Institut Henri Poincaré - Centre Émile Borel during the trimester Stochastic Dynamics Out of Equilibrium. We thank this institution for hospitality and support.

Bibliography

[1] Cao, X., Bulchandani, V. B., and Spohn, H. The GGE averaged currents of the classical Toda chain. arXiv:1905.04548 (2019).
[2] Croydon, D. A., Kato, T., Sasada, M., and Tsujimoto, S. Dynamics of the box-ball system with random initial conditions via Pitman’s transformation. arXiv:1806.02147 (2018).
[3] Croydon, D. A., and Sasada, M. Invariant measures for the box-ball system based on stationary Markov chains and periodic Gibbs measures. arXiv:1905.00186 (2019).
[4] Dwass, M. Branching processes in simple random walk. Proc. Amer. Math. Soc. 51 (1975), 270–274.
[5] Evans, S. N. Probability and real trees, vol. 1920 of Lecture Notes in Mathematics. Springer, Berlin, 2008. Lectures from the 35th Summer School on Probability Theory held in Saint-Flour, July 6–23, 2005.
[6] Ferrari, P. A., and Gabrielli, D. BBS invariant measures with independent soliton components. arXiv:1812.02437 (2018).
[7] Ferrari, P. A., Nguyen, C., Rolla, L., and Wang, M. Soliton decomposition of the box-ball system. arXiv:1806.02798 (2018).
[8] Inoue, R., Kuniba, A., and Takagi, T. Integrable structure of box–ball systems: crystal, Bethe ansatz, ultradiscretization and tropical geometry. Journal of Physics A: Mathematical and Theoretical 45, 7 (2012), 073001.
