Random sorting networks: local statistics via random matrix laws
Vadim Gorin, Mustazee Rahman

TL;DR
This paper establishes the local statistical behavior of random sorting networks using random matrix theory, revealing universal laws governing swap timings and gaps through determinantal processes and eigenvalue distributions.
Contribution
It introduces a local limit law for the swap process in random sorting networks, connecting it to eigenvalue distributions of the antisymmetric GUE and universal random matrix gap laws.
Findings
The local limit of the swap process is described by a determinantal point process.
The first swap time distribution matches the closest eigenvalue distribution in antisymmetric GUE.
Bulk gaps between swaps follow the Gaudin-Mehta law, a universal eigenvalue gap distribution.
Abstract
This paper finds the bulk local limit of the swap process of uniformly random sorting networks. The limit object is defined through a deterministic procedure, a local version of the Edelman-Greene algorithm, applied to a two dimensional determinantal point process with explicit kernel. The latter describes the asymptotic joint law near of the eigenvalues of the corners in the antisymmetric Gaussian Unitary Ensemble. In particular, the limiting law of the first time a given swap appears in a random sorting network is identified with the limiting distribution of the closest to eigenvalue in the antisymmetric GUE. Moreover, the asymptotic gap, in the bulk, between appearances of a given swap is the Gaudin-Mehta law -- the limiting universal distribution for gaps between eigenvalues of real symmetric random matrices. The proofs rely on the determinantal structure and a double…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Random sorting networks: local statistics via random matrix laws
Vadim Gorin
Department of Mathematics, MIT, Cambridge, MA, USA and Institute for Information Transmission Problems of Russian Academy of Sciences, Moscow, Russia.
and
Mustazee Rahman
Department of Mathematics, MIT, Cambridge, MA, USA.
Abstract.
This paper finds the bulk local limit of the swap process of uniformly random sorting networks. The limit object is defined through a deterministic procedure, a local version of the Edelman-Greene algorithm, applied to a two dimensional determinantal point process with explicit kernel. The latter describes the asymptotic joint law near [math] of the eigenvalues of the corners in the antisymmetric Gaussian Unitary Ensemble. In particular, the limiting law of the first time a given swap appears in a random sorting network is identified with the limiting distribution of the closest to [math] eigenvalue in the antisymmetric GUE. Moreover, the asymptotic gap, in the bulk, between appearances of a given swap is the Gaudin-Mehta law – the limiting universal distribution for gaps between eigenvalues of real symmetric random matrices.
The proofs rely on the determinantal structure and a double contour integral representation for the kernel of random Poissonized Young tableaux of arbitrary shape.
Key words and phrases:
Sorting network, reduced decomposition, Gaudin-Mehta law, GUE corners, Young tableau, determinantal point process
1. Introduction
1.1. Overview
The main object of this article is the uniformly random sorting network, as introduced by Angel, Holroyd, Romik, and Virág in [AHRV]. Let denote the symmetric group and denote the transposition between and for . The are called adjacent swaps. Let denote the reverse permutation of . A sorting network of is a sequence of permutations of shortest length with the property that for every ,
[TABLE]
In other words, the permutations change by swapping adjacent labels at each step and must go from the identity to the reverse in the shortest number of swaps. The number of adjacent swaps required in any sorting network of is . See Figure 1 for an example of a sorting network in the wiring diagram representation. We identify a sorting network of by its sequence of swaps
[TABLE]
where denotes the adjacent swap .
A random sorting network of is a sorting network of chosen uniformly at random. Computer simulations were used to conjecture many beautiful asymptotic properties of random sorting networks. See [AHRV] for an account of these statements and the first rigorous results, and also [ADHV, AGH, AH, AHR, DVi, Ko, RVV, R] for other asymptotic theorems. The proofs of the conjectures from [AHRV] were recently announced in [D].
In many examples, random combinatorial structures built out of symmetric groups are known to exhibit the same asymptotic behavior as random matrices. The most famous result of this sort due to Baik–Deift–Johansson [BDJ] identifies the fluctuations of longest increasing subsequences of random permutations with the fluctuations of largest eigenvalue of random Hermitian matrices. Its further upgrades, [BOO, J, O] link fluctuations of several first rows of the Young diagram distributed according to the Plancherel measure for symmetric groups to those of several largest eigenvalues. A connection also exists for “bulk” (i.e. not largest) rows and eigenvalues, see [BOO].
On the other hand, up to now no such connections were known for random sorting networks. In the present article we find such a connection. It exists for a sort of local limit of random sorting networks. Indeed, we find the bulk local limit of random sorting networks, by proving that it is given by a simple, local, deterministic algorithm (the local Edelman-Greene algorithm) applied to a specific random point process on . In turn, we describe this point process by showing that its correlation functions have determinantal form and provide explicit expressions for the corresponding correlation kernel. The very same point process appeared in the work of Forrester–Nordenstam [FN] (see also Deffoseux [De]) as the hard edge limit of antisymmetric GUE–corners process; it describes the asymptotic distribution of the closest to 0 eigenvalues of the corners of large antisymmetric matrix with i.i.d. (modulo symmetry) Gaussian entries of mean [math].
A corollary of our theorem is that the rescaled, asymptotic distribution of the first time when the swap between and appears, for , is the same as the rescaled, asymptotic distribution of the closest to [math] eigenvalue of an antisymmetric-GUE random matrix. Another corollary is that within the bulk, the asymptotic gap between appearances of the aforementioned swap is described by the Gaudin–Mehta law — the asymptotic universal distribution of the gap between eigenvalues of real symmetric random matrices in the bulk. Complete statements are given in the next section.
In an independent and parallel work, Angel, Dauvergne, Holroyd, and Virág [ADHV] also study the bulk local limit of random sorting networks. Their approach is very different from ours. We deduce explicit formulas for the prelimit local structure of random sorting networks, and then analyze the asymptotic of these formulas in the spirit of Integrable Probability, see [BG, BP] and also [Ro]. On the other hand, [ADHV] argue probabilistically, analyzing a Markov chain (whose transition probabilities are expressed through the hook formula for dimensions) for sampling random Young tableaux. The connection to random matrices remains invisible in the results of [ADHV]. It would be interesting to match these two approaches, but it has not been done so far.
1.2. Bulk limit of random sorting networks.
We now describe our main result. Informally, we study the asymptotics of the point process , , in a window of finite height and order width, so that the number of points in the window remains finite; see Figure 1. Here are swaps of a random sorting network of .
In [AHRV] it is proven that the point process is stationary with respect to the second coordinate. Therefore, it suffices to study windows adjacent to [math] in second coordinate, which we do.
The limiting object is a point process on defined by a two–step procedure. First, we introduce an auxiliary point process on through its correlation functions.
Definition 1.1**.**
* is the (unique) determinantal point process on with correlation kernel*
[TABLE]
We refer to [B] and Section 2.4 for more detailed discussions of determinantal point processes. We note that the particles of on adjacent lines and almost surely interlace, see Figure 2. The point process has appeared in the random matrix literature before in [FN], [De]. In more details, let be an infinite random matrix with rows and columns indexed by , and whose entries are independent and identically distributed, real-valued, standard Gaussians. Let . The top–left corner of almost surely has non-zero eigenvalues of the form
[TABLE]
where . Forrester and Nordenstam prove that is the weak limit of the point process , , , as .
Particle configurations of are obtained from by a deterministic procedure, which is a local version of the well-known Edelman–Greene bijection [EG] between staircase shaped tableaux and sorting networks. In the following we describe this procedure. A rigorous definition of the procedure utilizes properties of that are not immediate from Definition 1.1. We provide the rigorous construction in Section 6.3 where the description is given in the language of Young tableau, which is the more standard setup for defining the Edelman-Greene bijection.
Local Edelman-Greene algorithm
Fix a configuration of and suppose that we want to define the positions of all particles of inside the rectangle with . Then almost surely there are two integers and such that has no particles on the segments and . The particles of outside are further ignored.
We now define a particle configuration – the restriction of onto – through an iterative procedure. Start by declaring , and setting to be the restriction of onto . Repeat the following until is empty:
- (1)
Let be an element of with smallest second coordinate. The parity of will be even. Add to , i.e., redefine . 2. (2)
Define the sliding path as a unique collection of points in (of maximal length) such that
- •
,
- •
and for ,
- •
For each , the only points of in the rectangle are and .
In other words, is the point in , which is closest to . See Figure 2 for an illustration. 3. (3)
Remove the points ,…, from and replace them by points , , …, . 4. (4)
Go back to Step (1), unless is empty.
The first coordinates of the particles of will be integral; this follows from the interlacing property of the particles of , which is preserved throughout the steps of the procedure.
One immediate property of the just defined map is that the position of the first particle of in the ray almost surely coincides with the position of the first particle of in the ray . Therefore, the joint law of the positions of the first particles of in the rays , for , can be explicitly evaluated as a Fredholm determinant. See Corollary 1.3 for the case and [B] for general statements.
We also show that is invariant under translations and reflections of the first (–valued) coordinate, ergodic with respect to translations of the first coordinate, and stationary in the second (–valued) coordinate; see Proposition 6.4.
We are ready to formulate the main result.
Theorem 1.2** (Local random sorting network).**
Fix , and let be swaps of a random sorting network of . Define the point process of rescaled swaps near the point through
[TABLE]
Then as , the point process converges weakly to .
It is proven in [AHRV, Theorem 2] that the global scaling limit of the space-time swap process of random sorting networks is the product of the semicircle law and Lebesgue measure. The scaling of Theorem 1.2 is consistent with the semicircle result.
We emphasize that Theorem 1.2 states both that converges and the limit is obtained by applying the localized Edelman-Greene algorithm to . Theorem 1.2 does not cover the case , where the asymptotic behavior changes. It is plausible that the methods of the present article can be adapted to this remaining case, but we not address it here; see [R] for another approach to case.
Theorem 1.2 implies that the first swap times in random sorting networks converge to a one–dimensional marginal of ; the distribution of the latter can be expressed as a Fredholm determinant. Figure 3 shows the approximate sample distribution of the rescaled first swap time and (6.2) shows the tail asymptotics.
Corollary 1.3** (First swap law).**
. Let be the first time the swap interchanging with appears in a random sorting network of . The following convergence in law holds:
[TABLE]
where
[TABLE]
Connection to the Gaudin-Mehta law
A further consequence deals with the limiting law of the gap between swaps on the same horizontal line in random sorting networks. Fix . Given a random sorting network of , let be the distance between and the closest to its right swap interchanging with . Let be the analogous distance to the closest to its left swap.
Due to stationarity of random sorting networks, the joint law and is given by
[TABLE]
Indeed, due to stationarity, both sides of (1.2) give the probability of the event that there are no swaps in the interval after the appropriate re-centerings. Equation 1.2 shows that the law of , and hence, of the gap , is determined by the law of the first swap time . In particular, their limiting law after rescaling by is uniquely determined from the distribution function (1.1).
This is connected to the scaling limit of the point process of eigenvalues of GOE random matrices in the bulk. The scaling limit of the eigenvalues of GOE random matrices near [math] is stationary. Let and be, respectively, the closest to [math] negative and closest to [math] positive point in the limit process. If the matrices are scaled so that the mean eigenvalue gap near [math] is , then (1.1) is the distribution function of . In other words, (1.1) is the asymptotic probability to see no eigenvalues in an interval of length for large GOE random matrices, normalized so that the mean eigenvalue gap around the interval is ; see e.g. [G], [Dy], [F, (8.139) and (9.81)]. The gap between points, , has its law determined from that of according to (1.2). This is the celebrated Gaudin–Mehta law, originally put forward by Wigner as a model for the gap between energy levels in heavy nuclei and later found in numerous systems. We arrive at the following corollary.
Corollary 1.4** (Gap law).**
For and , let be the distance in a random sorting network of between the two swaps interchanging with : the one closest from the left to time and the one closest from the right to . Then, the distributional limit
[TABLE]
is the Gaudin–Mehta law, i.e. the asymptotic gap in the bulk between eigenvalues of real symmetric random matrices with mean gap one.
The proof of Theorem 1.2 builds upon two ideas. The first one (which is also used in most of the rigorous results on sorting networks) is to reduce the study of random sorting networks to uniformly random staircase shaped standard Young tableaux via the Edelman–Greene bijection [EG] (see also [HY]). Our observation is that if we Poissonize uniformly random standard Young tableaux (of arbitrary shape!), then the result can be described by a determinantal point process with an explicit correlation kernel written as a double contour integral. We further show that the Poissonization does not change the local statistics, and therefore, the limit theorem is reduced to the asymptotic analysis of the aforementioned double contour integral, which we perform.
Our results on the correlations and limiting behavior of random standard Young tableaux might be of independent interest, and so we present them in the next section.
1.3. Random Standard Young Tableaux
A partition is a sequence of non-negative integers such that . The length of , denoted , is the number of positive and the size of is .
We identify a partition with a Young diagram (YD), which is the set of lattice points
[TABLE]
The points of the Young diagram are its cells and we say the Young diagram has shape . Given a pair of YDs and , we write if the cells of are contained within the cells of . If the containment is strict then . If then denotes the cells of that are not in . A standard Young tableau (SYT) of shape is an insertion of the numbers into the cells of such that they strictly increase along the rows (from left to right) and also along the columns (from bottom to top). The numbers within a SYT are its entries. The set of SYTs of shape is in bijection with the set of increasing sequences of YDs
[TABLE]
such that the entry is inserted into the singleton cell of .
A staircase shaped SYT of length (or also rows) is a SYT of shape , which we denote . The Edelman–Greene bijection [EG] gives a one-to-one correspondence between staircase shaped SYTs and sorting networks; see Section 6.1 for the details. This is the reason for our interest in SYTs.
A Poissonized Young tableau (PYT) of shape is an insertion of distinct real numbers from the interval into the cells of such that they strictly increase along the rows and along the columns. Note that if we replace the entries of a PYT by their relative ranks then we get a SYT. The set of PYTs of shape is in bijection with the set of increasing sequences of YDs indicating the times of jumps:
[TABLE]
such that the entry is inserted in the singleton cell of . These increasing sequences of Young diagrams with labels were discussed in [BO] in the connection to the Young bouquet; see also [N].
We would like to identify a PYT with a collection of non-intersecting paths. For that we first map a Young diagram to a countable particle configuration . This procedure can be viewed as projecting the boundary of the Young diagram in Russian notation onto a horizontal line, see Figure 4. The empty Young diagram corresponds to .
Give a PYT, for each consider the countable particle configuration corresponding to the Young diagram filled with the entries in the PYT. The trajectories of particles then form a collection of paths, making jumps to the right at the times indexed by the entries of the tableau (equivalently, labels in (1.4)). Let us draw a cross at a point , , , if a particle jumps from to at time ; see Figure 5. Although there are infinitely many particles, the only ones that move are the particles that correspond to the rows of with positive size.
Theorem 1.5** (Poissonized tableaux).**
Given a finite Young diagram , consider the point process of jumps of a uniformly random Poissonized Young tableau of shape . is a determinantal point process on with correlation kernel as follows. For and ,
[TABLE]
The contours and are as shown in Figure 6. Both are counter-clockwise, encloses only the integers in the respective half open intervals and , and arranged such that remains uniformly bounded away from 0 along the contours.
Remark 1.6*.*
When or equals 1, is to be understood in the limit as or tends to 1. The contours and may also be replaced by unbounded contours and with bounded imaginary parts, respectively.
The proof of Theorem 1.5 is through a limit transition in the correlation kernel of [P] (see also [DM]) for the uniformly random Gelfand–Tsetlin patterns; the proof is in Section 3. Such a limit transition can be viewed as a degeneration of the combinatorial structures related to the representation theory of the unitary groups to those related to the symmetric groups , see [BO] for a discussion.
Let us emphasize that the very same procedure can be used to identify a uniformly random SYT with a point process of jumps, however, the resulting process will not be determinantal – this is why we need to pass from SYTs to PYTs.
1.4. From Poissonized tableaux to local statistics
We close the introduction with an outline of the argument that takes us from Poissonized tableaux to local statistics of sorting networks. By the nature of the Edelman-Greene bijection, the swaps of a sorting network of near time 0 are determined by the location of the largest entries of an SYT of shape . These entries reside within unit order distance of the edge of , which consist of the cells for . As a result, the first step to deriving local statistics of random sorting networks is to derive the statistics of a uniformly random PYT of shape near its edge.
Let denote a uniformly random PYT of shape . We are interested in the statistics of the entries of that lie within the following windows. A window is parameterized by a center (corresponding to the center of the swaps of a sorting network), a length , and an entry height . The window then consist of entries that satisfy and . In other words, roughly the largest entries of with row indices in the interval .
We study the statistics of in a window in terms of its associated process of jumps , rescaled accordingly. For each integer , let be an integer having the same parity as and such that as . Consider the rescaled process of jumps
[TABLE]
Theorem 4.1 and Proposition 4.3 together imply that converges weakly to the point process from Definition 1.1. It is the building block for the proof of Theorem 1.2.
In Section 5 we construct the local staircase shaped tableau by using the local jump process . We prove in Theorem 5.2 that it provides the limiting statistics of in local windows. Using a de-Poissonization argument we conclude in Theorem 5.3 that uniformly random staircase shaped SYTs also converge within local windows to . Although Poissonization is not important for the local limit, it is important for the proof.
In Section 6 we prove Theorem 1.2. First, we give a proof of Corollary 1.3 in Section 6.2. In Section 6.3 we define the local version of the Edelman-Greene algorithm that maps to . In Section 6.4 we complete the proof of Theorem 1.2 and conclude with some statistical properties of .
Acknowledgements
We would like to thank Balint Virág, who brought our attention to the problem of identifying the local limit of sorting networks. We are grateful to Alexei Borodin, Percy Deift, and Igor Krasovsky for valuable discussions and references. We would also like to thank an anonymous referee for an exceptionally careful reading of the paper and some helpful feedback.
V. Gorin’s research was partially supported by NSF grants DMS-1407562, DMS-1664619, by a Sloan Research Fellowship, by The Foundation Sciences Mathématiques de Paris, and by NEC Corporation Fund for Research in Computers and Communications. M. Rahman’s research was partially supported by an NSERC PDF award.
2. Preliminaries
This section presents basic facts about Young tableaux, Poissonization and determinantal point processes. Some material from the Introduction is repeated for convenience.
2.1. Gelfand-Tsetlin patterns
A semi-standard Young tableau of shape , where are integers, is an insertion of numbers from into the cells of the YD such that the entries weakly increase along each row and strictly increase along every column. It is important to emphasize that while for the definitions of Young diagrams and standard Young tableaux the value of is not important, here the object substantially depends on . Semi-standard Young tableaux (SSYTs) are in bijection with interlacing particle systems, often known as Gelfand-Tsetlin patterns (or schemes).
A Gelfand-Tsetlin pattern (GTP) with rows is a triangular array of non-negative integers with row containing entries . The array satisfies the following order and interlacing constraints.
[TABLE]
Let denote the -th row of the GTP. Each row corresponds to a YD due to the order constraints. The interlacing conditions ensure that , and in fact, is a horizontal strip which means that the cells in any row of are to the left of the cells in the previous row. Figure 7 provides an example.
The set of GTPs with a fixed top row is in bijection with the set of SSYTs of shape . Indeed, given a GTP with top row , such a tableaux is obtained by inserting the value into the cells of for every (set ). If is empty then is not inserted. In the reverse direction, given a SSYT of shape , a GTP with top row is obtained by setting to be the YD consisting of the cells of with entries and removing trailing zero rows to ensure that has entries.
A GTP may also be represented as an interlacing particle system on as follows. Given a GTP with rows, the particle system associated to it has rows of particles, with particles on row being placed on the horizontal line of the plane, and the position of the -th particle on row is
[TABLE]
The transformation makes the order constraints strict and the interlacing constraints semi-strict:
[TABLE]
The jumps of an interlacing particle system with rows is a set of points in , defined inductively from the top row to the bottom as follows. Given two consecutive rows and , the jumps on row consist of particles at the positions
[TABLE]
In other words, the jumps of row are placed on the horizontal line and fill out integers in the intervals for every . Note that may determined from its top row and set of jumps.
2.2. Poissonized Young tableaux
For a YD , let denote the set of functions from the cells of into . Let denote the set of all functions that satisfy the following tableau constraints.
[TABLE]
The Poissonized tableau (PYT) of shape is an element of . The Poissonized staircase shaped tableau of size is an element of .
Let denote a uniformly random element of . Then is related to a uniformly random SYT of shape in the following way. First, the entries of are distinct with probability 1. Given that, consider the random SYT obtained by inserting into the cell that contains the -th smallest element of . Then is a uniformly random SYT of shape . In the other direction, can be generated by first sampling , then independently sampling a uniformly random , and setting to be the -th smallest entry of .
Throughout the paper, denotes a uniformly random element of and denotes a uniformly random SYT of shape .
2.3. Jumps of Poissonized tableaux and local limit
Any can be represented as an interlacing particles system with a fixed top row in the following manner. Consider and let
[TABLE]
The tableau constraints (2.1) ensure that is a YD for every . Recall that a YD can be made to have an infinite number of rows by appending rows of size 0 after the last positive row. Encode as particle configuration on by placing a particle at position
[TABLE]
This is an infinite particle configuration on such that and for (shown in Figure 4). Let be the particle configuration associated to via (2.2) and let be the particle system on with a particle at position if and only if .
The particle system viewed in reverse time, i.e., from to , can be interpreted as an ensemble of non-intersecting and non-increasing paths , for . Let be the -valued path starting from and decreasing an integer unit at the times . If some of the entries are equal then decreases by the number of consecutive equal entries. The paths should be left continuous so that the jumps occur immediately after the jump times. The path decreases by units with final position . Due to the columns of being non-decreasing – condition (2) of (2.1) – the paths are non-intersecting: for every and . Figure 5 shows the paths associated to a staircase shaped PYT.
The jumps of consists of points such that
- (1)
is a discontinuity point of , i.e., equals some entry of on row . 2. (2)
in an integer in the interval \big{[}p(i,(1-t)+),\,p(i,1-t)\big{]}, where .
The paths can be reconstructed from their jumps and initial positions. The jumps of , and also of , is the (possibly) multiset of defined by
[TABLE]
may be a multiset because two adjacent paths may jump at the same time by amounts that causes some of their jumps to coincide. The coinciding jumps has be counted with multiplicity. However, if the entries of are distinct then is a simple set. The tableau can be reconstructed from its jumps and the initial position of the paths.
Let denote the jumps of a uniformly random element of . is simple almost surely since has distinct entries almost surely. Theorem 1.5 asserts that is a determinantal point process on .
2.4. Determinantal point processes
We describe some basic notions about point processes; for a thorough introduction see [B, DVe]. Let be a locally compact Polish space. A discrete subset of is a countable multiset of with no accumulation points. By identifying with the measure , the space of discrete subsets can be given the topology of weak convergence of Borel measures on . This means that if for every compact subset , , where cardinality is taken with multiplicity.
A discrete set is simple if every point in it has multiplicity one. A point process on is a Borel-measurable random discrete set of . All point processes considered in this paper will be simple almost surely.
Throughout the paper we denote to be counting measure on and to be Lebesgue measure on a measurable subset . Also, denotes the product of measures and , and denotes the -fold product of .
A determinantal point process on is a simple point process for which there is a correlation kernel , and a Radon measure on , called the reference measure, with the following property. For every continuous of compact support,
[TABLE]
Expectations of the form given by the l.h.s. of (2.4) determine the law of under mild conditions on [Le]. This will be the case in this paper as the correlation kernels we consider will be continuous. If is discrete then it is customary to take the reference measure to be counting measure. In this case is determinantal if for every finite ,
[TABLE]
Remark 2.1*.*
The correlation kernel of a determinantal point process is not unique. If is a determinantal point process with correlation kernel then may be replaced by , for any non-vanishing function , without changing determinants on the r.h.s. of (2.4). Thus the new kernel determines the same process. This observation will be used multiple times.
The determinantal point processes that we consider will be on spaces of the form , or , or , with reference measures being, respectively, counting measure, and . The following lemma records some facts that will be used in deriving weak limits of determinantal point processes. We do not include the proof as it is rather standard; see [DVe, Le].
Lemma 2.2**.**
I) Let be a determinantal point process on with correlation kernel . For and , let
[TABLE]
Suppose that uniformly on compact subsets of . Then the point process
[TABLE]
restricted to converges weakly to a determinantal point process whose reference measure is and whose correlation kernel is .
II) Let be a determinantal point process on with reference measure and correlation kernel . For and , define a point process on by
[TABLE]
The correlation kernel of with reference measure is
[TABLE]
If uniformly on compact subsets of then converges weakly to a determinantal point process with reference measure and correlation kernel .
Extend to a point process on without additional points. Then converges weakly to on if the points of do not accumulate at the boundary in the sense that for every ,
[TABLE]
3. Determinantal representation of Poissonized tableaux
3.1. Determinantal representation of discrete interlacing particle systems
In order to prove Theorem 1.5 we use a determinantal description of discrete interlacing particle systems due to Petrov. This is the main tool behind the proof.
Let be a fixed particle configuration on . Here we abuse notation from Section 2.1 to have the s be integers instead of half-integers. Let be the uniform measure on all interlacing particle systems or, equivalently, GTPs as described in Section 2.1, with fixed top row . Let be the point process of jumps of an interlacing particle system sampled according to , where the jumps are as described in Section 2.1.
Petrov [P, Theorem 5.1] proves that is a determinantal point process on with an explicit correlation kernel. According to the notation there, particles live on but we have translated particle systems by so that the jumps are integral. In particular, in the notation of [P, Theorem 5.1], one has , and the variables take integer values. In [P, Section 6.1] it is explained that the point process of jumps, , is also determinantal on and its correlation kernel is given in terms of the correlation kernel of in [P, Theorem 6.1] (up to the translation by ).
In particular, [P, Theorem 6.1] proves that the correlation kernel of the jumps is
[TABLE]
where is the kernel presented in [P, Theorem 5.1]. The discussion there is in terms of lozenge tilings of polygonal domains using three types of lozenges as depicted in Figure 8. It is proved that the positions of any of the three types of lozenges in such a uniformly random tiling is a determinantal point process. The jumps of an interlacing particle system are given by the positions of the lozenges of the rightmost type from Figure 8, where as the particles themselves are given by the positions of lozenges of the leftmost type. Jumps occur when a lozenge of the leftmost type is glued along its bottom diagonal to a lozenge of the rightmost type; see [P, Figure 3] for such a tiling. By Remark 2.1, defines the same determinantal point process as , and we will use the latter kernel.
Some notation is needed in order to express the kernel for the point process of jumps. For integers and , let denote a closed, counter-clockwise contour on that encloses only the integers if , and empty otherwise. Throughout the paper, all contours intersect the real line at points which have distance at least from the integers. This ensures that the integrands of all contour integrals will be a uniform distance away from their poles.
For and an integer , let
[TABLE]
Theorem 3.1** ([P, Theorem 5.1]).**
The process of jumps, , of a uniformly random interlacing particle system with fixed top row \nu=\big{(}\nu(M,1)+\frac{1}{2}>\nu(M,2)+\frac{1}{2}>\cdots>\nu(M,M)+\frac{1}{2}\big{)} is a determinantal point process on with correlation kernel as follows. For and ,
[TABLE]
The contour contains without intersecting it.
3.2. Proof of Theorem 1.5
Let be a Young diagram with at most rows of positive length, that is, . For , consider semi-standard Young tableaux of shape
[TABLE]
The effect of adding zero rows is to allow the entries in the non-zero rows of to be between to . The law of a uniformly random PYT of shape is the weak limit of a uniformly random semi-standard Young tableau of shape , as , after the entries are rescaled onto the interval . Indeed, the law of a uniformly random PYT of shape can be approximated by the uniform distribution on points that satisfy the tableau constraints (2.1), with each for some , and the column constraints being strict.
The top row of particle systems associated to semi-standard Young tableaux of shape under the bijection described in Section 2.1 is
[TABLE]
Due to the approximation scheme above, and the bijection between semi-standard Young tableaux and interlacing particle systems discussed in Section 2.2, the process of jumps, , of a uniformly random PYT of shape is the weak limit of the process of jumps, , of a uniformly random interlacing particle system with top row after these jumps are rescaled onto . Thus, we derive a determinantal description of the rescaled jumps of in the large limit. Since almost surely contains no jumps on the boundary , it suffices to derive the determinantal description with restricted to .
Let denote the kernel from Theorem 3.1 for the process . As explained in Remark 2.1, the kernel determines the same determinantal point process. By Lemma 2.2, in order to prove the theorem it suffices to show that
[TABLE]
uniformly over compacts subsets of and .
We begin by deforming the contours in the double contour integral from (3.1) that defines . This will simplify the representation of for taking the large limit. Deform the -contour, , by pushing it leftward past the -contour, , so that it encloses the consecutive integers . The deformation results in picking up residues at and also at the consecutive integers , if .
Let denote the integrand of the double contour integral from (3.1) but without the factor of . Note that . Calculating the residues at and leaving the remaining residues as a contour integral provides the following representation of .
[TABLE]
The following lemma simplifies .
Lemma 3.2**.**
For ,
[TABLE]
Proof.
The integral is evaluated in [P, Lemma 6.2] by summing over residues at and evaluating the resulting sum in closed form via a hypergeometric identity. We have
[TABLE]
Therefore,
[TABLE]
If then the above is 0 because both terms in the difference are equal to . Hence, in non zero only if and and equals what is given in the statement of the lemma. ∎
Now consider the expression (II) from (3.3). Observe that the zeroes of are at consecutive integers . On the other hand, the polynomial also has zeroes at for . Therefore, the only poles of in the variable are at the integers so long as . For fixed and , say, the condition is satisfied for all large . Thus, the contour integral over may be shortened to for all large if remains fixed and .
Having done so, (II) becomes the following integral after changing variables and :
[TABLE]
By a slight abuse of notation, let henceforth denote the integrand of (3.4) without the factor . The following lemma provides the asymptotic form of .
Lemma 3.3**.**
Fix integers and . Suppose and and remain bounded and have distance at least from the integers. Then for and ,
[TABLE]
The big O term is uniform over so long as the stated assumptions hold and remain in compact subsets of . The function is as stated in Theorem 1.5.
Proof.
We will use the following identity:
[TABLE]
We will also use Stirling’s approximation of the Gamma function in the following form:
[TABLE]
The big O term is uniform in so long as is bounded and bounded away from negative integers. Using these two properties, if and then
[TABLE]
Now consider the term
[TABLE]
Applying (3.6) to and gives
[TABLE]
Substituting in and then gives
[TABLE]
Combining (3.7) with (3.8) provides the desired conclusion of the lemma. ∎
We now prove that converges to . Suppose are fixed and for and for some . Recall that is given in (3.3). For all sufficiently large values of , Lemma 3.2 and then the identity (3.5) followed by the estimate (3.6) show that of (3.3) equals
[TABLE]
Now we consider (II) in the form given in (3.4) following the change of variables. Lemma 3.3 implies that as ,
[TABLE]
The convergence is uniform over compact subsets of and so long as and are uniformly bounded away from the integers. For all large values of the contours of integration of become free of , namely, and . The contours may also be arranged such that they remain bounded away from the integers and throughout, say. This implies that as , converges to
[TABLE]
We have thus concluded that converges to the kernel given in Theorem 1.5. Moreover, the estimates show that the converge is uniform over compact subsets of . Indeed, so long at and , the error term in the convergence is of order , by Lemma 3.3, because the double contours eventually become free of and the integrand converges uniformly over the contours. Part (I) of Lemma 2.2 now implies that the rescaled process of jumps, , converges weakly on to a determinantal point process with kernel as given in Theorem 1.5. ∎
4. Bulk local limit of the jumps of Poissonized staircase shaped tableaux
In this section we prove that the point process from (1.5) converges weakly to the point process from Definition 1.1. This is done in a two-step procedure. First, we prove in Theorem 4.1 that the limit of is a determinantal point process whose kernel is given in terms of a double contour integral. Second, we identifying this kernel with the one from Definition 1.1 in Proposition 4.3.
Theorem 4.1**.**
The point process from (1.5) converges weakly to a limiting determinantal point process on . The correlation kernel of with respect to reference measure is given as follows. For and ,
[TABLE]
The contours and are unbounded, contain the non-negative integers but remain uniformly bounded away from them and are arranged such that remains uniformly bounded away from 0. They may also be arranged such that their imaginary parts remain bounded and contains .
The value of when or equals 0 is to be understood in the sense of the limit as or tends to 0.
Proof.
The proof proceeds in two steps, each verifying the conditions of part (II) of Lemma 2.2. First, we will show that the correlation kernel of converges to uniformly on compact subsets of and . Then we will argue that points of do not accumulate on the boundary as .
Let . Part (II) of Lemma 2.2 and Theorem 1.5 imply that the correlation kernel of with reference measure is
[TABLE]
The kernel determines the same point process by Remark 2.1. Using part (II) of Lemma 2.2 it suffices to show that converges uniformly over compact subsets of to as in order to deduce convergence of to on .
Let denote the function , where is as in Theorem 1.5. Then
[TABLE]
Substitute in for the variables and for the variables in . Then,
[TABLE]
Using the formula for from (4.2) and applying the identity
[TABLE]
The estimate for from (3.6) along with the observation that implies the following asymptotic bevaviour as . The symbol denotes a multiplicative term where the big O error is uniform over in compact subsets of .
[TABLE]
Therefore,
[TABLE]
Substituting in for and for in this estimate gives
[TABLE]
The error in the above estimate vanishes as so long as and the lie in compact subsets of their respective domains. We also have that
[TABLE]
Since is assumed to be even we conclude from (4.8) that so long and are bounded and remain uniformly bounded away from the integers then
[TABLE]
The above displays the pointwise limit of the part of the integrand from (4.3) that depends on . In order to interchange the pointwise limit with the contour integral we must show that the integrand is bounded uniformly over by a function that is integrable over the contours and , also uniformly over . Then we may apply the dominated convergence theorem.
Towards this end suppose is such that (1) , (2) is uniformly bounded over , say by 100, and (3) remains bounded away from by distance at least . In this case Stirling approximation to the Gamma function implies that modulus of the ratio of the l.h.s. of (4.6) to its r.h.s. is bounded above and below by exponential factors of uniform rate in . The same holds for the ratio of the l.h.s. of (4.7) to its r.h.s. That is, for some constant ,
[TABLE]
Throughout the following denotes a constant that is free of but its value may change from line to line. We combine the estimates above with the equation for from (4.5) and observe that there is a such that due to the assumptions on . This in turn implies that there is a such that
[TABLE]
The contours and can certainly be arranged such that for fixed and the variables and satisfy the aforementioned assumptions (1)–(3) uniformly over . Thus, we get the following uniform estimate over with and :
[TABLE]
The contours may also be arranged such that , say. Then the modulus of the integrand of the double contour integral from (4.3) satisfies
[TABLE]
Stirling’s approximation implies that if and remains bounded then
[TABLE]
Applying (4.4) with also gives . Note that so long as remains uniformly bounded away from the integers. Combining this with (4.11) shows that the r.h.s. of (4.10) is integrable over unbounded double contours and as long as the contours are arranged such that remain uniformly bounded away from the integers, have uniformly bounded imaginary parts, and remains uniformly bounded away from 0. Thus, the limit (4.9), upper bound (4.10) and the dominated convergence theorem implies that as
[TABLE]
Furthermore, our estimates show that the convergence is uniform over in compact subsets of and in compact subsets of . (In fact, when some the integral contributes only through residues at the origin. Lemma 4.2 computes the limit as .) Comparing the limit integrand with the one presented in (4.1) we observe that the proof of the kernel convergence will be complete once it is shown that
[TABLE]
From (4.4), . Also, . Finally, double angle trigonometric formulae imply . Substituting these equations into the l.h.s. of the above verifies the equality with the r.h.s.
To complete the proof of convergence of to on it is enough to show, using Lemma 2.2, that for every ,
[TABLE]
In other words, points do not accumulate at the boundary in the limit. From relation (2.4) for determinantal point processes and (4.3) we get
[TABLE]
The quantity above is of the form . Arguing exactly as in the derivation of the limit kernel, , where is given by the double contour integral in the definition of from (4.1) but with an additional factor of in the denominator of the integrand. Indeed, , which is the expected number of points of on . The quantity remains uniformly bounded near since, as , the contribution to the integral that defines comes from the residues at and these residues do not depend on . (See Lemma 4.2 where is derived analogously.) Consequently, as and thus the condition from (4.12) holds. ∎
4.1. Integral representation of the edge kernel
We begin with an auxiliary lemma.
Lemma 4.2**.**
Let and be contours as in the statement of Theorem 4.1 and let be as in (4.1). For let
[TABLE]
Then,
[TABLE]
Moreover, is continuously differentiable on and can be differentiated by interchanging differentiation with the contour integration.
Proof.
The integrand of is continuously differentiable in . Observe that the contours of integration contain no singularities of the integrand, and in fact, are arranged to be a positive distance from all zeroes of and in the denominator. The estimate for from (4.11) shows that the derivative of the integrand in the variable is absolutely integrable over the contours as long as lies in a compact subset of . Consequently, by the dominated convergence theorem, is continuously differentiable and the derivative may be interchanged with integration.
Let us now consider the limiting value of as . Decomposing as , and similarly for , gives
[TABLE]
These four contours may also be arranged such that unless both and . Recall that
[TABLE]
Thus, the integrand of converges to 0 as so long as and . So each of the double contour integrals above except for the first has a limit value of 0 as (the limit operation may be interchanged with integration as argued above). To complete the proof it suffices to calculate
[TABLE]
The integral above is evaluated via residues at and . If then (4.13) equals
[TABLE]
This is the limit value of in the statement of the lemma for .
Now consider the limit (4.13) in the case . As the contour can be arranged to be contained inside , the integral in equals the residue of the integrand at the only possible pole at . This equals
[TABLE]
If is odd then the above equals 0. Otherwise, and the integral of the above over is given by its residue at the pole (note that since is odd). The residue equals
[TABLE]
Thus, if then (4.13) equals and this completes the proof. ∎
Proposition 4.3**.**
The kernel has the following form.
[TABLE]
Proof.
For let
[TABLE]
where is as defined in Lemma 4.2. By Lemma 4.2, is continuous differentiable on and the function may be differentiated by interchanging differentiation with integration. Differentiating and clearing common powers of gives
[TABLE]
The contour integrals can be evaluated by summing over residues of . Inside the contour , the function has simple poles at integers such that . In other words, has to have the same parity as . Let . The residues of the integral come from integers for . Since , summing over the residues gives
[TABLE]
Consequently,
[TABLE]
Multiplying (4.14) by then implies that
[TABLE]
For , the r.h.s. of (4.15) is integrable over in . Moreover, as because , and the latter limit is finite whereas as . Therefore, (4.15) implies
[TABLE]
Next, consider the case . Now the relation from (4.15) should be integrated from 1 to , which is convergent since . The formula follows so long as . Rather than derive this limit we take a slightly indirect approach by considering the limit of near . For define
[TABLE]
Upon differentiating it follows readily that satisfies the same differential equation as displayed in (4.14). Therefore, for some constant . In order to identify as zero it suffices to show that . Since as , due to , both and have the same limit as . Thus, utilizing Lemma 4.2, showing amounts to proving
[TABLE]
The limit of can be found using L’Hôspital’s rule, which shows that
[TABLE]
We conclude that for , and in particular that , as required.
Finally, consider the case . The r.h.s. of (4.15) is continuous for in because one of or has a zero at depending upon the parity of . From Lemma 4.2, . Therefore, (4.15) implies that
[TABLE]
We now express (4.17) as an integral over as given in the proposition. First, note may be modified on the measure zero set consisting of such that without changing determinants in (2.4), and thus, this does not affect the law of . We will modify the kernel on this zero set after the following calculations to get the form given in the proposition.
Observe that if . Using trigonometric formulae the integrand of (4.17) becomes
[TABLE]
Using the fact that , we get that for ,
[TABLE]
Using (4.19) and the representation of the integrand in (4.18) we infer that if is even then
[TABLE]
This shows that (4.17) equals the expression given in the statement of the proposition for and even except for the additive term . By modifying on the zero set we may ignore this term.
For being odd we argue in the same manner as above to infer that (4.17) equals
[TABLE]
Once again, we modify on the zero set to ignore the additive term and get the expression of the kernel given in the proposition. ∎
4.2. Statistical properties of
This section derives certain properties of , namely, Proposition 4.4, Proposition 4.6, Lemma 4.8 and Lemma 4.9, that will be used to derive the local limit of staircase shaped tableaux and of sorting networks.
Proposition 4.4**.**
The process has the following statistical properties.
I) Translation and reflection invariance: For any integer the translated process
[TABLE]
and the reflected process
[TABLE]
have the same law as .
II) One dimensional marginals: For any and ,
[TABLE]
Therefore, is a determinantal point process with reference measure and correlation kernel (4.20).
Proof.
Part I) From Lemma 2.2 the correlation kernel of equals . The integral representation of in Proposition 4.3 implies that
[TABLE]
upon observing that , which implies that the integrands do not change after the kernel is transformed. Consequently, the translated point process has the same law as the original. Similarly, the correlation kernel for the reflected process is . The latter kernel defines the same determinantal point process as in law. Part II) Proposition 4.3 gives that
[TABLE]
The fact hat is determinantal with kernel as stipulated follows from the relation (2.4) for determinantal point processes. ∎
Lemma 4.5**.**
There is a universal constant such that for and ,
[TABLE]
Proof.
Throughout this argument denotes a universal constant whose value may change from line to line. We begin with the case . From the integral representation of we see that if then
[TABLE]
Similarly, if then and
[TABLE]
Combining these bounds we deduce that if then
[TABLE]
Now we consider decay in the -variables, assuming that . Define as
[TABLE]
Then . Using that , and the formula for , we observe that there is a such that
[TABLE]
Applying integration by parts to the integral form of gives, for ,
[TABLE]
Now the triangle inequality and (4.25) imply that if then
[TABLE]
If , then we use the bound (4.24) to reach the same conclusion as above. An entirely analogous bound holds when because then , and is integrable over . Therefore, for ,
[TABLE]
Combining (4.24) with (4.26) implies the required inequality (4.21) for .
The case requires some care. The representation (4.18) for the integrand of gives
[TABLE]
Integration by parts and the triangle inequality imply that for ,
[TABLE]
For , we have
[TABLE]
Together, these bounds imply that for ,
[TABLE]
Separating (4.27) naturally into two integrals and applying (4.28) implies that
[TABLE]
This establishes (4.21) for the case and completes the proof. ∎
Spatial ergodicity of
For , denote by the translation that maps for . So is invariant under the action of every by Proposition 4.4. An event associated to is invariant if for every , , where and is the action of on a sample outcome of . The invariant sigma-algebra of is the sigma-algebra consisting of all the invariant events.
Proposition 4.6**.**
* is ergodic w.r.t. spatial translations in that if then .*
Proof.
For , let be the sigma-algebra generated by the points of restricted to . For , let
[TABLE]
For , suppose is continuous and compactly supported. Let
[TABLE]
Now suppose are continuous and compactly supported such that there are disjoint subsets with and . This implies that if and , then for every . We first show that in this case
[TABLE]
where is the universal constant from Lemma 4.5 and is the -norm of with respect to .
Indeed, the assumption on the supports of and imply from (2.4) that
[TABLE]
Let us expand the determinant of the matrix above as a sum over all permutations. We break up the permutations into two types: permutations that map the subsets and into themselves and those that do not. When summed over permutations of the first type the integral above equals . For a permutation of the second type, observe that there are two indices and , with and , such that and . Then for , Lemma 4.5 gives
[TABLE]
For all other indices we have . Consequently, the term involving contributes at most in absolute value to the determinant above for every and . Since there are such permutations , we conclude that
[TABLE]
Let be the sigma-algebra generated by the random variables , where is continuous and compactly supported. The bound (4.29) implies that if and are disjoint, is -measurable and is -measurable, then,
[TABLE]
The bound in (4.30) implies ergodicity of as follows. Let . Given , we may choose an event , for some and , such that . Since is invariant under , we have that for every . Therefore by the triangle inequality,
[TABLE]
Due to invariance of this implies that .
Set for an integer . Then . We now apply (4.30) with and , observing that . Since by translation invaraince, we infer that
[TABLE]
Since , we conclude that
[TABLE]
Letting followed by shows that , as required. ∎
Remark 4.7*.*
The proof above may be used to deduce that is in fact space-time mixing.
Lemma 4.8**.**
Almost surely, has an unbounded collection of points on every line . In fact, the following holds. Let N_{x}(t)=\#\big{(}\pazocal{X}_{\mathrm{edge}}\cap(\{x\}\times[0,t])\big{)}. For every , the sequence in probability as .
Proof.
Fix . Using part (II) of Proposition 4.4 we see that for any interval ,
[TABLE]
Observe from (4.31) that as .
From (4.20) we see that is symmetric in the variables and . Thus,
[TABLE]
From the relation (2.4) for determinantal point processes we have that
[TABLE]
This inequality implies that . Since , Chebyshev’s inequality implies that for any ,
[TABLE]
This provides the claimed convergence in probability.
Convergence in probability implies that there is a sequence of times such that almost surely as . This in turn implies that there is an unbounded collection of points of on almost surely. An union bound over provides the claim in the lemma. ∎
Lemma 4.9**.**
The following event occurs almost surely. For every there exists a doubly infinite sequence of integers , , such that contains no points on each of the segments .
Proof.
By monotocity and an union (or rather intersection) bound over rational values of , it suffices to show that the event occurs almost surely for every fixed . Given a fixed , let be the indicator of the event that has no points on . It suffices to show that almost surely infinitely many of the s equal 1 for . Then, reflection invariance of and another union bound imply that almost surely a doubly infinite collection of the s are equal to 1, as required.
Due to translation invariance of the sequence , , is stationary in that has the same law as . It is also ergodic by Proposition 4.6. Therefore, by the Ergodic Theorem,
[TABLE]
The probability that is the probability that has no points in . This is strictly positive by (6.2) below. As a result, an infinite number of the s equal 1 whenever the limit in the above holds. ∎
5. The local staircase shaped tableau
The local staircase shaped tableau, henceforth, local tableau, is a random function on
[TABLE]
Figure 9 provides an illustration. The rows and columns of are given by the diagonal lines
[TABLE]
In order to define tableaux on and their convergence, we first explain the topology on since tableau entries will take values there (we allow the value ). The topology on is the usual topology on extended in the natural way by stipulating that a sequence converges to if its values diverge to , possibly stabilizing to the value . In this case we will say that the sequence grows to . For example, grows to , as does , as well as .
A tableau is a function such that it satisfies the tableau constraints
[TABLE]
The YD embeds into via . This is a rotation that puts row of on row of ; see Figure 9. In this manner any PYT of shape embeds as a tableau by setting
[TABLE]
By an abuse of notation we denote by .
We say that a sequence of tableaux converges if there is a tableau such that, in the aforementioned topology on , for every . Note we stipulate that a limit of tableaux remain a tableau.
A random tableau is a Borel probability measure on tableaux with respect to the topology above. Convergence of a sequence of random tableaux means weak convergence with respect to this topology.
5.1. Bulk local limit of staircase shaped tableaux
Section 2.3 describes how PYTs of a given shape are in bijection with ensembles of non-increasing and non-intersecting paths whose initial positions are given in terms of the shape. We describe the bijection explicitly for tableaux defined on as it will be useful in the proof of the local limit theorem.
Consider an ensemble of paths , for and , that satisfy the following.
[TABLE]
For a tableau , paths satisfying (5.4) are obtained by setting for , and for ,
[TABLE]
In other words, is left continuous and decreases by integer units at times indexed by row of . The paths are non-intersecting due to the columns of being non-decreasing. Indeed, , where is the number of entries of on row with value at most . Due to the columns being non-decreasing, , and thus, . The jumps of the paths form a discrete set due to the rows and columns of growing to . When a row entry equals then the corresponding path jumps only a finite number of times.
Let denote the jumps for the ensemble of paths associated to a tableau on . The jumps can be read off from in the following manner. For every , the jumps on the line are the entries of whose cells have first coordinate in . More precisely, if is the -th smallest point of on then
[TABLE]
If there are less than points on , there is no such and the tableau entry above equals .
To see this, observe that is the time when the path starting at initial position jumps for the -th time. Indeed, this is the -th path starting at or to the right of position and it hits position after jump number . The first jumps on are the times when the first paths starting at or to the right of position hits position . Also, when there is no such it means that the path starting from has exhausted its jumps and it does not get to position .
Let denote the map from tableaux defined on to jumps of paths satisfying (5.4). This map is invertible with the inverse given by the relation (5.5). Namely, is the \big{[}(y+1+\mathbf{1}_{\{x\,even\}})/2\big{]}–th smallest jump of on with the convention that if no such jump exists. Let denote the inverse map.
Lemma 5.1**.**
The set of jumps of paths satisfying (5.4) is closed in the topology on discrete subsets of . The map is a homeomorphism from the set of tableaux on to the set of jumps of paths satisfying (5.4), with inverse given by .
Proof.
The maps and are inverses by design. We must show that they are continuous. We begin with continuity of . Let be a sequence of tableaux such that converges to a tableau . Let and . Recall from Section 2.4 that convergence of to requires that for every and , the -th smallest point of on must converge to the corresponding point of , while accounting for the case that there may be less than points.
Let . Then, by (5.5), is the -th smallest point of on and similarly for . There are two cases: or . In the former case, the -th smallest point of on is eventually finite and the same for . Moreover, we have convergence of these points since . In the latter case, given any bounded subset of , the -th smallest point of eventually escapes the set or is non-existent due to . This is as required since has no -th smallest point on . This proves that is continuous.
Now we show that the set of jumps of paths satisfying (5.4) is a closed set in the space of discrete subsets of , as well as that that is continuous. Suppose is a sequence of such jumps sets and that it converges to a discrete subset .
Let . First, we show that converges for every . Indeed, is the -th smallest point of on for . Therefore, convergence of to implies that must converge to some function . Note that if and only if has less than points on .
The function is a tableau because the tableau inequalities from (5.2) continue to hold in the entry-wise limit, and the rows and columns will grow to due to being a discrete set. Thus, consider . By the first part of the proof, . But then, because limits of discrete subsets of are unique. This shows both the closure property of sets of jumps for paths satisfying (5.4) and the continuity of . ∎
We now state the local limit theorem for Poissonized staircase shaped tableaux in the bulk. For , let . For a PYT having shape , embed it as a tableau on according to (5.3) and consider the rescaled tableau defined by
[TABLE]
Let be a uniformly random PYT of shape and denote by the random tableau associated to by (5.6).
Theorem 5.2**.**
The sequence of random Poissonized tableaux converges weakly to a random tableau . Moreover, the law of is .
Proof.
Observe that for every . With this choice of , the jump process associated to has law from (1.5) because these jumps are simply the jumps of rescaled onto as in (1.5). Theorem 4.1 asserts that converges weakly to . Due to being a weak limit of the jumps of ensembles of paths satisfying (5.4), is also almost surely the jumps of such an ensemble of paths by the closure property given in Lemma 5.1. The continuity of then implies that converges weakly to . Therefore, converges weakly to a random tableau having the law of . ∎
Bulk local limit theorem for random staircase shaped SYT
Recall denotes a uniformly random SYT of shape and . Consider the rescaled tableau
[TABLE]
Theorem 5.3**.**
The random tableau converges to in the bulk local limit, that is, under the embedding and rescaling from (5.6).
Proof.
Consider the following coupling between and . Given , independently sample according to the order statistics of i.i.d. random variables distributed uniformly on . Insert the entry into the cell of that contains entry of . The resulting tableau has the law of . Using this coupling, and due to the manner the scaling from (5.6) is defined, it suffices to show the following in order to conclude that converges to by way of Theorem 5.2.
Fix an and consider any such that . Let be the entry of inside under the embedding from (5.6). Then as , we need to show that
[TABLE]
The number is random, its distribution depends on as well as and .
In order to establish (5.8) we will use the following auxiliary fact, which is a byproduct of [AHRV, Theorem 11]. There is a number of order as , such that with probability tending to 1 as , every entry of within the cells of under the embedding (5.6) has value at least . As a consequence, with probability tending to 1. We write
[TABLE]
For a fixed deterministic , has a Beta distribution with parameters and , which has mean and variance
[TABLE]
Since the s are independent of , employing the bound above for and summing over the probabilities of give
[TABLE]
The latter quantity is of order as . Since , we conclude that both terms on the right hand side of (5.9) tend to 0 as . ∎
Statistical properties of the local staircase shaped tableau
The set can be made into a directed graph by putting directed edges from each vertex to the vertices and . The automorphisms of this graph consists of translations , for , given by , as well as a reflection given by . Tableaux are preserved by these automorphisms.
A random tableau is translation invariant if has the same law as for every translation . The random tableau is reflection invariant if has the same law as . The translation invariant sigma-algebra of is the sigma-algebra of events that remain invariant under every translation:
[TABLE]
(Recall that .) We say is ergodic under translations if is the trivial sigma-algebra.
Proposition 5.4**.**
The local tableau has the following statistical properties.
- (1)
Almost surely, is finite for every and the entries of are all distinct. 2. (2)
The law of is both translation and reflection invariant. 3. (3)
* is ergodic under translations.* 4. (4)
Almost surely, for every there are infinitely many positive and negative such that .
Proof.
Almost surely, has an infinite and unbounded collection of points on every line by Lemma 4.8. Also, almost surely, does not contain two points of the form and with . To see this, observe from the relation (2.4) for determinantal point processes that the expected number of such pairs of points in is 0 due to the set of such pairs having measure zero with respect to the measure \big{(}\#\mathbb{Z}\otimes\pazocal{L}(\mathbb{R}_{\geq 0})\big{)}^{\otimes 2}. When both these properties hold, satisfies (1).
The law of is invariant under translations because for every translation , the tableau is constructed from the jump process , which has the same law of by Proposition 4.4. Similarly, reflection invariance of follows from reflection invariance of . This establishes (2).
The ergodicty of under translations follows from the ergodicity of under translations (Proposition 4.6). This is because a translation invariant event for is the image of a translation invariant event for under the map . Finally, (4) is the statement of Lemma 4.9. ∎
6. Random sorting networks
6.1. Sorting networks, Young tableaux and Edelman-Greene bijection
Stanley [S] enumerated the number of sorting networks of , which equals
[TABLE]
Following Stanley, Edelman and Greene [EG] provided an explicit bijection between sorting networks and staircase shaped SYT. An account of further combinatorial developments may be found in [Ga, HY]. We describe the part of the Edelman-Greene bijection that maps staircase shaped tableaux to sorting networks. The inverse map is a modification of the RSK algorithm; we do not describe it here since it is not used in the paper. See [EG] or [AHRV, Section 4] for a full description of the bijection.
Recall that a sorting network of is identified by its sequence of adjacent swaps , where . For the rest of the paper we will use to denote . For , we adopt the convention that if .
The Schützenberger operator
Let denote the cell containing the maximum entry of a SYT . The Schützenberger operator is a bijection defined as follows. Given , construct the sliding path of cells iteratively in the following manner. Set and . Then set
[TABLE]
Let where for , , and for all other cells . Figure 10 provides an illustration.
The Edelman-Greene map is defined by
[TABLE]
where is the -th iterate of . Edelman and Greene [EG, Theorem 5.4] proved that indeed maps to sorting networks and that is has an inverse.
6.2. First swap times of random sorting networks: proof of Corollary 1.3
Let be the first time the adjacent swap appears in a sorting network of . According to the Edelman-Greene bijection, this time is recorded in the entry of . Thus,
[TABLE]
In terms of the rescaled tableau from (5.7) we have that
[TABLE]
This implies the following for – the first time an adjacent swap between and appears in a random sorting network of :
[TABLE]
Here, is the tableau rescaled and embedded into according to (5.6). Theorem 5.3 implies that converges weakly to . Since , we conclude that the rescaled from above converges weakly to . Thus, has the law of .
Now we explain how to get the distribution function of given in (1.1). Observe that the event is the event . The probability of the latter (often known as “gap probability”) has the representation given by (1.1), which is the Fredholm determinant of over . This is a well-known property of determinantal point processes under the condition that the kernel be of trace class [DVe]. The kernel is of trace class on simply because .
The asymptotic behaviour of the distribution function of is well-known:
[TABLE]
The formula (6.2) has a history. In theoretical physics literature, the leading term in (6.2) was first studied in [dCM], while the full expansion was given in [Dy]. The complete mathematical treatment was developed in [DIZ, Kr, E1, DIKZ, E2]; the present form of (6.2) is given in the last reference.
We will only need the simple corollary of (6.2) that for every .
6.3. Edelman-Greene algorithm on the local tableau
The procedure described here is the same as the one given in the Introduction except that it is in the language of tableaux instead of their jumps. In order to define the Edelman-Greene algorithm on the local tableau we first introduce some concepts that allow us to define Edelman-Greene algorithm on tableaux defined on .
A directed path from to is a sequence of cells of such that for every . The cells of can be partially ordered as follows: if there is a directed path from to . Recall that is a directed graph with edges from to . It can also be thought of as an undirected graph by forgetting the direction of the edges. A connected subset of is a connected subgraph of in the undirected sense.
A Young diagram (YD) of is a connected subset that is downward closed in the partial order, that is, if and then . For example, is a YD of . The boundary of , , consists of cells such that there is a directed edge from some cell to . The peaks of consists of the maximal cells of in the partial order.
Let be a tableau as in (5.2). A sub-tableau is the restriction of to a YD ; we say is the support of the sub-tableau. Let . We take the support of to be the support of . Observe that is a countable disjoint union of sub-tableaux of , say . Indeed, the support of the s are the connected components of the subgraph spanned by cells such that . We will call the s the clusters of . The tableau is EG-admissible if all the entries of are distinct and every cluster is supported on a YD of finite size.
Edelman-Greene algorithm on a finite tableau
Let be a YD of of finite size and a tableau such that all its entries are distinct. The Edelman-Greene map takes as input and outputs a triple , where , and is a sub-tableau.
The sliding path of is a directed path defined by
- (1)
. 2. (2)
. 3. (3)
peak of obtained when both belong to .
Let and define by
[TABLE]
The cell must be on the bottom level of and has the form for some . Set . The output is , and empty if is the empty tableau.
The Edelman-Greene algorithm on outputs a discrete subset , denoted the swaps of . Let , for , be defined iteratively by and for . Then,
[TABLE]
If the cell contains the -th smallest entry of then its entry is removed during the -th iteration of the algorithm. We will say that the entry at exits at time from row . We will also say that originates from cell .
Edelman-Greene algorithm on an admissible tableau
Let be the clusters of an EG-admissible tableau . Observe that for , the swaps of and exit from mutually disjoint rows. Thus, the swap sets are row-wise mutually disjoint. The swaps of are defined as
[TABLE]
The local tableau is not EG-admissible. In order to define swaps for the local tableau we cut off large entries so that it becomes EG-admissible, and then process the tableau in a graded manner. For this to be successful, the EG algorithm ought to be consistent in the sense that running it on a tableau, and then restricting to swaps that originate from a sub-tableau, must produce the same outcome as the algorithm applied to the sub-tableau. This is not always the case and the following explains when it may be so.
Given two tableaux and , we say if the following criteria hold.
- (1)
for every . 2. (2)
For every , and , if belongs to the same cluster of as then .
Lemma 6.1**.**
Let , and suppose that is EG-admissible. Then, applying the EG algorithm to and restricting to the swaps that originate from the cells of produces the same outcome as applying the EG algorithm to . In particular, .
Proof.
Observe that the clusters of are contained within the clusters of . The EG algorithm acts independently on each cluster of is a row-wise disjoint manner. Fix a particular cluster of , and suppose that the clusters of that are contained inside are . It suffices to prove that the EG algorithm applied to , and then restricted to the swaps that originate from , produces the same outcome as the algorithm applied to each individual .
Let and . The assumption is that each entry of is larger than every entry of . Therefore, the EG algorithm applied to will process every entry of before it ever processes an entry from the complement. When some entry from enters a cell of some during the first steps, the algorithm treats that entry as if it were . Since agrees with on , the EG algorithm will output the swaps of during the first steps, and then output the remaining swaps of . This is what was claimed. ∎
A tableau is graded EG-admissible if all of its finite-valued entries are distinct and, if for every , the sub-tableau
[TABLE]
Observe that whenever . Lemma 6.1 thus implies that . Therefore, for a graded EG-admissible tableau , we may define
[TABLE]
Lemma 6.2**.**
Suppose a sequence of tableaux , and also that every and are graded EG-admissible. Then for every integer and , there is a finite YD that contains the cluster of in for every .
Proof.
This follows from a diagonalization argument, more precisely, König’s infinity lemma, which states that every infinite connected graph with finite vertex degrees contains an infinite path.
Suppose for the sake of a contradiction that the conclusion of the lemma fails. Let denote the cluster of in . Call a cell bad if there is a undirected path in from to that is contained in infinitely many of the clusters . Consider the connected component of in that is spanned by the subgraph of bad vertices. If the component is finite then there is a finite YD that contains the component. This implies that for all sufficiently large , every cell of lies outside because any path from to a cell outside must pass through . Therefore, for all large . Since every is graded EG-admissible, this means that there is a finite YD that contains every , which is a contradiction.
Therefore, the connected component of spanned by the bad vertices is infinite. Since every vertex of has degree at most 4, König’s lemma provides an infinite path of (distinct) bad vertices starting from . By definition of being bad, for every , there is a path from to that is contained in some infinite subsequence of the clusters with as . Let be the length of this path. Observe that with because the distance from to in must tend to infinity due to every vertex having degree at most 4.
The YD formed by the cells of that are at or below the cells on the path from to must be contained in every cluster . Since converges to , this implies that for every . Since , we deduce that is infinite. However, this is a contradiction to being graded EG-admissible. ∎
Theorem 6.3**.**
Suppose a sequence of tableaux , and that every as well as is graded EG-admissible. Then as discrete subsets of .
Proof.
A compact subset of is a finite, disjoint union of sets of the from for and compact . Therefore, we must show that for every such and ,
[TABLE]
Fix a such that .
Suppose is a graded EG-admissible tableau. The swaps of on are the entries of that exit from row . Let denote the cluster of in . By Lemma 6.1, the swaps of on are completely determined by running the EG algorithm on . We deduce from Lemma 6.2 that there is a finite YD such that
[TABLE]
Since , we conclude that the following must occur for all sufficiently large .
- (1)
The order of the entries of on stabilizes to the order of the entries of on . 2. (2)
For every , if then .
Once condition (1) holds then, due to , a swap from lies on if and only if there is a cell such that and, when the EG algorithm is applied to restricted to , the entry at cell exits from row . The same conclusion holds for swaps of on . This property along with condition (2) implies that
[TABLE]
This completes the proof. ∎
6.4. Completing the proof of Theorem 1.2
Theorem 1.2 will follow from Theorem 6.3 once we prove that the local tableau is graded EG-admissible almost surely. To this end, first observe that the entries of are finite and distinct by part (1) of Proposition 5.4. We must show that, almost surely, the clusters of are finite for every .
By part (4) of Proposition 5.4, the local tableau satisfies the following almost surely: for every and , there are integers such that and . When this property holds the tableau constraints imply that the cluster of containing must be contained within cells whose row and column indices are both between and . The set of such cells is finite, and so the cluster of every bottom level cell in is finite. Now if then cell belongs to the same cluster as in since the row entries are non-decreasing. This implies that, almost surely, is EG-admissible for every , as required.
Finally, we complete the proof. The law of is that of the Edelman-Greene algorithm applied to the rescaled uniformly random staircase shaped tableau from (5.7). Theorem 5.3 asserts that converges weakly to as a tableau embedded in . By Skorokhod’s representation theorem, there exists random tableaux and defined on a common probability space such that has the law of , has the law of , and almost surely.
The tableaux and are graded EG-admissible almost surely. Theorem 6.3 then implies that converges to almost surely. This means that , which has the law of , converges weakly to , which is the law of . ∎
We conclude with some statistical properties of the local swap process.
Proposition 6.4**.**
The process has the following properties.
- (1)
* is invariant under translations and reflection of the -coordinate.* 2. (2)
* is stationary in time in that for every , the process has the same law as (shifted) .* 3. (3)
* is ergodic under translations of the -coordinate in that the sigma-algebra*
[TABLE]
Remark 6.5*.*
We believe that is also ergodic in the time coordinate. However, the proof of this is more challenging and, therefore, we leave it as a conjecture.
Proof.
We have that in law. Applying a -automorphism to is the same as first applying its analogue to (the maps and ), and then applying the EG algorithm to the resulting tableau. Thus, the invariance of under -automorphisms follows from the corresponding invariance of stated in Proposition 5.4.
Time stationarity of is a consequence of the stationarity of finite random sorting networks [AHRV, Theorem 1(i)], as we explain. If is the sequence of swaps of a random sorting network of , then has the same law as .
The ergodicity of under -translations is a consequence of the ergodicty of under translations (part 3 of Proposition 5.4). Indeed, a translation invariant event for is the image of a translation invariant event of under the EG algorithm. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AGH] O. Angel, V. Gorin, A. E. Holroyd, A pattern theorem for random sorting networks. Electron. J. Probab. , 17 (99) (2012), pp. 1–16. ar Xiv:1110.0160.
- 2[ADHV] O. Angel, D. Dauvergne, A. E. Holroyd, B. Virág, The Local Limit of Random Sorting Networks, to appear in Ann. Inst. Henri Poincaré Probab. Stat. , ar Xiv:1702.08368.
- 3[AH] O. Angel, A. E. Holroyd, Random subnetworks of random sorting networks. Electron. J. Combin. , 17 :paper 23, 2010, ar Xiv:0911.2519.
- 4[AHR] O. Angel, A. Holroyd, D. Romik, The oriented swap process, Ann. Probab. 37 (2009), pp. 1970–1998. ar Xiv:0806.2222.
- 5[AHRV] O. Angel, A. Holroyd, D. Romik, B. Virág, Random sorting networks, Adv. Math. 215 (2) (2007), pp. 839–864, ar Xiv:0609538.
- 6[BDJ] J. Baik, P. Deift, K. Johansson, On the distribution of the length of the longest increasing subsequence of random permutations, J. Amer. Math. Soc. 12 no. 4 (1999), pp. 1119–1178. ar Xiv:math/9810105.
- 7[B] A. Borodin, Determinantal point processes, Oxford Handbook of Random Matrix Theory , Oxford University Press, 2011, ar Xiv:0911.1153.
- 8[BG] A. Borodin, V. Gorin, Lectures on Integrable probability. In: Probability and Statistical Physics in St. Petersburg, Proceedings of Symposia in Pure Mathematics, Vol. 91, 155–214. AMS 2016. ar Xiv:1212.3351.
