Universal limits of substitution-closed permutation classes
Fr\'ed\'erique Bassino, Mathilde Bouvel, Valentin F\'eray, Lucas, Gerin, Micka\"el Maazoun, Adeline Pierrot

TL;DR
This paper investigates the limiting behavior of uniform random permutations within substitution-closed classes, revealing a universal limit under certain conditions and identifying two other regimes with distinct limiting objects.
Contribution
It introduces a framework linking permutation class generating series to permuton limits, including a universal limit and regimes related to stable trees, using singularity analysis.
Findings
The limit depends on the generating series of simple permutations.
Under mild conditions, the limit is a deformation of the Brownian separable permuton.
Two other regimes with different limiting objects are identified.
Abstract
We consider uniform random permutations in proper substitution-closed classes and study their limiting behavior in the sense of permutons. The limit depends on the generating series of the simple permutations in the class. Under a mild sufficient condition, the limit is an elementary one-parameter deformation of the limit of uniform separable permutations, previously identified as the Brownian separable permuton. This limiting object is therefore in some sense universal. We identify two other regimes with different limiting objects. The first one is degenerate; the second one is nontrivial and related to stable trees. These results are obtained thanks to a characterization of the convergence of random permutons through the convergence of their expected pattern densities. The limit of expected pattern densities is then computed by using the substitution tree encoding of permutations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Universal limits of substitution-closed permutation classes
Frédérique Bassino
Université Paris 13, Sorbonne Paris Cité, LIPN, CNRS UMR 7030, F-93430 Villetaneuse, France
,
Mathilde Bouvel
,
Valentin Féray
Institut für Mathematik, Universität Zürich, Winterthurerstr. 190, CH-8057 Zürich, Switzerland
,
Lucas Gerin
CMAP, École Polytechnique, CNRS, Route de Saclay, F-91128 Palaiseau Cedex, France
,
Mickaël Maazoun
École Normale Supérieure de Lyon, UMPA UMR 5669 CNRS, 46 allée d’Italie, F-69364 Lyon Cedex 07, France
and
Adeline Pierrot
LRI, Université Paris-Sud, Bat. 650 Ada Lovelace, F-91405 Orsay Cedex, France
Abstract.
We consider uniform random permutations in proper substitution-closed classes and study their limiting behavior in the sense of permutons.
The limit depends on the generating series of the simple permutations in the class. Under a mild sufficient condition, the limit is an elementary one-parameter deformation of the limit of uniform separable permutations, previously identified as the Brownian separable permuton. This limiting object is therefore in some sense universal. We identify two other regimes with different limiting objects. The first one is degenerate; the second one is nontrivial and related to stable trees.
These results are obtained thanks to a characterization of the convergence of random permutons through the convergence of their expected pattern densities. The limit of expected pattern densities is then computed by using the substitution tree encoding of permutations and performing singularity analysis on the tree series.
Key words and phrases:
permutation patterns, Brownian excursion, permutons
2010 Mathematics Subject Classification:
60C05,05A05
Contents
-
4.1 Generating functions of -canonical trees (possibly with marked leaves)
-
4.2 Generating function counting trees with marked leaves inducing a given tree
-
5.1 Definition of the biased Brownian separable permuton and statement of the theorem
-
5.2 Asymptotics of the generating function of trees with no or one marked leaf
-
5.3 Asymptotics of the generating function of marked trees with a given induced tree
-
6.3 Hypothesis and convergence of uniform random simple permutations
1. Introduction
The aim of this paper is to study the asymptotic behavior of a permutation of large size, picked uniformly at random in a substitution-closed permutation class generated by a given (finite or infinite) family of simple permutations satisfying additional conditions. We first give a few definitions necessary to present the recent literature on related problems, and to state our results.
1.1. Permutation classes and their limit
For any positive integer , the set of permutations of is denoted by . We write permutations of in one-line notation as . For a permutation in , the size of is denoted by .
For , and of cardinality , let be the permutation of induced by . For example for and we have
[TABLE]
since the values in the subsequence are in the same relative order as in the permutation . A permutation is a pattern involved (or contained) in , and the subsequence is an occurrence of in . When a pattern has no occurrence in , we say that avoids . The pattern containment relation defines a partial order on : we write if is a pattern of .
A permutation class is a family of permutations that is downward closed for , i.e. for any and any pattern , it holds that . For every set of patterns, we denote by the set of all permutations that avoid every pattern in . Clearly, for all , is a permutation class. Conversely (see for instance [20, Paragraph 5.1.2]), every class of permutations can be defined by a set of excluded patterns. Moreover, for any given , we can define uniquely a set such that : it is enough to impose that is chosen minimal (for set inclusion) among all such that . This (which happens to be an antichain) is called the basis of the class . The basis of a permutation class may be finite or infinite (see [20, Paragraph 7.2.3]).
One in many ways permutation classes can be studied is by looking at the features of a typical large permutation in the class. A particularly interesting characteristic is the frequency of occurrence of a pattern , especially when it is considered for all simultaneously. Denote by the number of occurrences of a pattern in and by the pattern density of in . More formally
[TABLE]
where is randomly and uniformly chosen among the subsets of with elements. The study of the asymptotics of , where is a uniform random permutation of size in a permutation class and is a fixed pattern, has been carried out in several cases.
- •
The behavior of for various classes and fixed was investigated by Bóna [18, 19], Homberger [35], Chang, Eu and Fu [24] and Rudolf [55].
- •
Janson, Nakamura and Zeilberger [39] considered higher moments and joint moments, rigourously when , and also empirically for various classes . A bit later, Janson [38] has given for the joint limit in distribution of the random variables , properly rescaled as to yield a nontrivial limit.
A parallel line of work consists in studying the asymptotic shape of the diagram of . The diagram of a permutation is the set of points in the Cartesian plane. To gain understanding of typical large permutations in , one can investigate the geometric properties of the diagram of as , possibly after rescaling this diagram so that it fits into a unit square.
- •
Madras and Liu [43], Atapour and Madras [10] and Madras and Pehlivan [44] considered the asymptotic shape of when for small patterns .
- •
In parallel, Miner and Pak [48] described very precisely the asymptotic shape of , when for the patterns in . These shapes are related to Brownian excursion, as explained by Hoffman, Rizzolo and Slivken [33, 34].
- •
Bevan describes the limit shape of permutations in so-called connected monotone grid classes [15, Chapter 6].
These two points of view may seem different, but they are in fact tightly bound together. Indeed, as we shall see in Section 2, it follows from results of [36] that the convergence of pattern densities characterizes the convergence of the diagrams, seen as permutons. This important property was actually the main motivation for the introduction of permutons in [36].
1.2. The permuton viewpoint
A permuton is a probability measure on the unit square with uniform marginals, i.e. its pushforwards by the projections on the axes are both the Lebesgue measure on . Permutons generalize permutation diagrams in the following sense: to every permutation , we associate the permuton with density
[TABLE]
Note that it amounts to replacing every point in the diagram of (normalized to the unit square) by a square of the form , which has mass uniformly distributed.
Permutons were first considered by Hoppen, Kohayakawa, Moreira, Rath and Sampaio in [36], with the point of view of characterizing permutation sequences with convergent pattern densities. The name permuton and the measure point of view were given afterwards by Glebov, Grzesik, Klimošová and Král [31]. This recently introduced concept has already been the subject of many articles, including:
- •
results on the set of possible pattern densities of permutons [31, 32, 40];
- •
a large deviation principle in the space of permutons, giving access to the analysis of random permutations with fixed pattern densities [40];
- •
the description of the limiting distribution of the number of fixed points (and more generally of cycles of a given length) for “equi-continuous” sequences of permutations with a limiting permuton [50];
- •
central limit theorems and refinements for pattern occurrences in random permutation models associated to permutons [29];
- •
the permuton convergence of some exponentially tilded models of random permutations [49];
- •
a study of permuton-valued processes, in the context of random sorting networks [53].
In the context of this article, the theory of permutons is a nice framework to state scaling limit results for sequences of (random) permutations. Indeed, the space of permutons is equipped with the topology of weak convergence of measures, which makes it a compact metric space. This allows one to define convergent sequences of permutations: we say that converges to when weakly. Accordingly, for a sequence of random permutations, we will consider the convergence in distribution of the associated random measures in the weak topology. The limiting object is then a random permuton.
By definition, convergence to a permuton encodes the first-order asymptotics of the shape of a sequence of permutations. As we shall see in Section 2, it also encodes the first-order asymptotics of pattern densities: a sequence of random permutations converges in distribution to a random permuton if and only if the sequences of random variables converge in distribution, jointly for all . Moreover, for any pattern , the limit distribution of the density of can be expressed as a function of the limit permuton.
Our previous article [12] studies the limit of the class of separable permutations, in terms of pattern densities and permutons.
Theorem 1.1**.**
Let be a uniform random separable permutation of size . There exists a random permuton , called the Brownian separable permuton, such that converges in distribution to .
The result of [12] is more precise, and describes the asymptotic joint distribution of the random variables as a measurable functional of a signed Brownian excursion. This object is a normalized Brownian excursion whose strict local minima are decorated with an i.i.d. sequence of balanced signs in . In another paper [42], the fifth author gives a direct construction of from this signed Brownian excursion. In particular, is not equal almost surely to a given permuton; in this regard, separable permutations behave differently from all other classes analysed so far in the literature (which converge to a deterministic permuton).
The class of separable permutations is the smallest nontrivial substitution-closed class, as defined in the next subsection. The present paper aims at showing a convergence result similar to Theorem 1.1 for other substitution-closed classes. We will see that in many cases the limit belongs to a one-parameter family of deformations of the Brownian separable permuton: the biased Brownian separable permuton of parameter is obtained from a biased signed Brownian excursion (defined similarly to the signed Brownian excursion but with each sign having probability of being a ). Simulations of the biased Brownian separable permuton are given in Fig. 1. A precise definition will be given in Section 5 (Eq. 20).
Finally, we mention that although permutons are a very nice, natural, and powerful way of studying “limits of permutation classes” (in particular because they unify many earlier results, as explained above), this approach has its weaknesses. Most importantly, it gives no information beyond the first order. For instance, the results of [48, 33] describe the canoe shape of large permutations in classes avoiding one pattern of length three. As the width of the canoe is a , one only sees the diagonal (or antidiagonal) in the permuton limit, but has no information about the fluctuations around this limit.
1.3. Substitution-closed classes
Definition 1.2**.**
*Let be a permutation of size , and let be other permutations. The substitution of in is the permutation of size obtained by replacing each by a sequence of integers isomorphic to while keeping the relative order induced by between these subsequences.
This permutation is denoted by . We sometimes refer to as the skeleton of the substitution.*
When is (resp. ), for any value of , we rather write (resp. ) instead of . Note that the specific value of does not appear in this notation, but can be recovered counting the number of permutations which are substituted in (resp. ).
Examples of substitution (see Fig. 2 below) are conveniently presented representing permutations by their diagrams: the diagram of is obtained by blowing up each point of onto a square containing the diagram of .
By definition of permutation classes, if for some permutation class , then . The converse is not always true.
Definition 1.3**.**
A permutation class is substitution-closed if, for every in , .
The focus of this paper will be substitution-closed classes. To study such classes it is essential to observe that any permutation has a canonical decomposition using substitutions, which can be encoded in a tree. This decomposition is canonical in the same sense as the decomposition of integers into products of primes. In this analogy, simple permutations play the role of prime numbers and the substitution plays the role of the product. We first give a simple definition: a permutation is -indecomposable (resp. -indecomposable) if it cannot be written as (resp. ), (or equivalently, if there is no such that can be written as (resp. )).
Definition 1.4**.**
A simple permutation is a permutation of size that does not map any nontrivial interval (i.e. a range in containing at least two and at most elements) onto an interval.
For instance, is not simple as it maps onto . The smallest simple permutations are and (there is no simple permutation of size ).
Remark: Usually in the literature, the definition of a simple permutation requires only and not , so that and are considered to be simple. However in our work, and do not play the same role as the other simple permutations, that is why we do not consider them to be simple.
Theorem 1.5** (Decomposition of permutations, Proposition 2 in [1]).**
Every permutation of size can be uniquely decomposed as either:
- •
, where is simple (of size ),
- •
, where and are -indecomposable,
- •
, where and are -indecomposable.
This decomposition theorem can be applied recursively inside the permutations appearing in the items above, until we reach permutations of size . Doing so, a permutation can be naturally encoded by a rooted planar tree, whose internal nodes are labeled by the skeletons of the substitutions that are considered along the recursive decomposition process, and whose leaves correspond to the elements of . This construction provides a one-to-one correspondence between permutations and canonical trees (defined below) that maps the size to the number of leaves.
Definition 1.6**.**
A canonical tree is a rooted planar tree whose internal nodes carry labels satisfying the following constraints.
- •
Internal nodes are labeled by , or by a simple permutation.
- •
A node labeled by has degree111Throughout the paper, by degree of a node in a tree, we mean the number of its children (which is sometimes called arity in other works). Note that it is different from the graph-degree: for us, the edge to the parent (if it exists) is not counted in the degree.* , nodes labeled by and have degree at least .*
- •
A child of a node labeled by (resp. ) cannot be labeled by (resp. ).
Canonical trees are known in the literature under several names: decomposition trees, substitution trees,…We choose the term canonical because we consider many variants of substitution trees in this paper, but only these canonical ones provide a one-to-one correspondence with permutations.
The representation of permutations by their canonical trees is essential in the study of substitution-closed classes. The reason is that, for any such class , the set of canonical trees of permutations in can be easily described.
Proposition 1.7**.**
Let be a substitution-closed permutation class, and assume222Otherwise, or or and these cases are trivial. that . Denote by the set of simple permutations in . The set of canonical trees encoding permutations of is the set of all canonical trees built on the set of nodes .
Proof.
First, if a canonical tree contains a node labeled by a simple permutation , then the corresponding permutation contains the pattern , and hence . Second, by induction, all canonical trees built on encode permutations of , because is substitution-closed. If necessary, details can be found in [1, Lemma 11]. ∎
For instance, the class of separable permutations studied in [12] corresponds to the set of all canonical trees built on , i.e., to . It is therefore the smallest nontrivial substitution-closed class.
Observation 1.8*.*
Let be any substitution-closed permutation class, and let be the set of simple permutations in . Because is a class, it holds that for all , if is a simple permutation such that , then . Whenever a set of simple permutations satisfies this property, we say that is downward-closed (implicitly: for and among the set of simple permutations).
Thanks to their encoding by families of trees, it can be proved that substitution-closed permutation classes (possibly, satisfying additional constraints) share a common behavior. For example, the canonical tree representation of their elements imply that all substitution-closed classes with finitely many simple permutations have an algebraic generating function [1, Corollary 14]. (This is actually easy, the main contribution of [1] being to generalize this algebraicity result to all classes containing a finite number of simple permutations, again using canonical trees as a key tool.) Our work illustrates this universality paradigm in probability theory: we prove that the biased Brownian separable permuton is the limiting permuton of many substitution-closed classes (see Theorem 1.10 and to a lesser extent Theorem 7.8).
1.4. Our results: Universality
Let be a (finite or infinite) set of simple permutations. We denote by the set of permutations of size whose canonical trees use only nodes , and , and we define . From Proposition 1.7 and Observation 1.8, every substitution-closed permutation class containing and can be written as for a downward-closed set of simple permutations (which is just the set of simple permutations in ).
Remark 1.9*.*
For a generic (not necessarily downward-closed) set of simple permutations, is a family of permutations more general than a substitution-closed permutation class. The results that we obtain apply not only to permutation classes but also to such sets of permutations.
Note however that our work does not consider substitution-closed sets of permutations not containing either or (as mentioned above, a permutation class not containing one of these two permutations is necessary trivial, but there might be interesting such substitution-closed sets). In principle, such sets of permutations could also be studied by the approach developed in this paper, but we prefer to leave such cases outside of our study. Indeed, to cover them, it would require to re-do all computations, modifying the combinatorial equations that we start from (see Proposition 12 p. 12) and all equations that follow, so as not to allow the nodes labeled and/or .
We are interested in the asymptotic behavior of a uniform permutation in which we describe in terms of permutons. Let
[TABLE]
be the generating function of and let be the radius of convergence of .
Theorem 1.10** (Main Theorem: the standard case).**
Let be a set of simple permutations such that
[TABLE]
For every , let be a uniform permutation in , and let be the random permuton associated with . The sequence tends in distribution in the weak convergence topology to the biased Brownian separable permuton whose parameter is given in (21) p. 21.
An important point in Theorem 1.10 is that the limiting object depends on only through the parameter . It turns out that only depends on the number of occurrences of the patterns and in the elements of . We illustrate this universality of the limiting object on Fig. 3, by showing large uniform random permutations in two different substitution-closed classes: the first one has a finite set of simple permutations , while the second is the substitution closure of , which contains infinitely many simple permutations and satisfies (H1) (as we will explain below). Although this is hard to see on the picture, the corresponding values of the biaised parameter are different, namely .5 and around .6 respectively (see Section 5, Examples 5.3 and 5.5).
In the following, to lighten the notation, we write . Note that may be .
The case when Condition (H1) of Theorem 1.10 is not satisfied is discussed in the next section. When Condition (H1) is satisfied the case is called standard because there are natural and easy sufficient conditions to ensure this case (that are given below). Moreover, this case includes most sets studied so far in the literature on permutation classes, to our knowledge. This gives a fairly precise (and positive) answer to an important question raised in our previous article [12]: is the Brownian separable permuton universal (in the sense that it describes the limit of a large family of substitution-closed classes)?
We now give several cases in which Condition (H1) of Theorem 1.10 is satisfied.
- •
If is a generating function with radius of convergence , (H1) is satisfied. Indeed, the condition implies , and is nonnegative since (like ) is a series with nonnegative coefficients. In particular, the situation where covers the cases where there are finitely many simple permutations in the class (then is a polynomial and ), and more generally where (i.e. the number of simple permutations of size grows subexponentially).
- •
If is divergent at , (H1) is trivially verified. In particular, this happens when is a rational generating function, or when has a square root singularity at .
In the literature, there are quite a few examples of permutations classes whose set of simple permutations has been enumerated. We can therefore ask whether Condition (H1) applies to them. In most examples we could find, it is indeed satisfied, and this follows from the discussion above. We record these examples here.
- •
Classes with finitely many simple permutations have attracted a fair amount of attention, see [1] and subsequently [13, 21, 23].
- •
Several families of simple permutations with a bounded number of elements of each size have appeared in the literature: the family of exceptional simple permutations (also called simple parallel alternations in [22]), the family of wedge simple permutations (see also [22]), the families of oscillations and quasi-oscillations (see [14]), and the families of simple permutations contained in the following three classes: , and – see [5].
- •
The family of simple pin-permutations has a rational generating function – see [14].
- •
The generating function is also rational when is the set of simple permutations contained in several permutation classes defined by the avoidance of two patterns of size , namely – see [51], and – see [2], – see[3], – see [6], and – see [5].
- •
The set of simple permutations of the class enumerated in [7] is also rational.
- •
We come back to the above example, where is the substitution of . This class has been studied in [11], where an explicit basis of avoided patterns is given. In this case, is the set of simple permutations avoiding , whose generating function is computed in [8]: it has a square-root singularity at , which proves that (H1) is fulfilled.
In addition to verifying Condition (H1), we have computed the numerical value of the parameter for some of the above-mentioned sets of simple permutations; see Examples 5.3, 5.4 and 5.5 (p. 5.3).
Notably absent from the above list is the class , enumerated in [56, 17]. Since the avoided pattern, , is simple, this class is substitution-closed. Its generating series behaves as around its dominant singularity . This prevents the set of simple permutations in this class to satisfy Condition (H1); compare with Proposition 5.8.
1.5. Our results: Beyond universality
When , for the two remaining cases and , the asymptotic behavior of is qualitatively different, and the results require slight additional hypotheses and notation. As a consequence, for the moment we only briefly describe these behaviors, the results being stated with full rigor later.
- •
Case . This is a degenerate case.
We first show in Theorem 6.9 that, with a small additional assumption which will be called , the sequence of random permutons converges. If uniform simple permutations in have a limit (in the sense of permutons), we show that the limit of permutations in is the same (see Proposition 6.10 and the subsequent comment). This explains the terminology “degenerate”: all permutations in the class (or set) are close to the simple ones, and the “composite” structure of permutations does not appear in the limit.
- •
Case . This critical case is more subtle.
We again need to assume the above mentioned hypothesis . According to the behavior of near , the limiting permuton of can either be the (biased) Brownian separable permuton (Theorem 7.8) or belong to a new family of stable permutons (Theorem 7.6). Finite substructures of stable permutons are connected to those of the random stable tree (see [28]), which explains the terminology. Two simulations are presented in Fig. 4.
We believe that the above-mentioned class belongs to the degenerate regime. Indeed, in the critical regime, the singularity exponent of the class should be smaller than , and cannot be , as for . Since there is no direct description of the simple permutations in , it seems however out of reach to prove our hypothesis for this specific class. We are therefore unable to describe its limiting permuton and let this open for further research. We refer to [45, Fig. 7] for simulations of uniform random permutations in this class.
Remark 1.11*.*
The variety of behaviors that we observe can be informally understood in terms of trees. We have seen in Section 1.3 that permutations in can be encoded by trees. Taking a uniform element in , we can prove that the corresponding tree is a multi-type Galton-Watson tree conditioned on having leaves. This link with conditioned Galton-Watson trees is not used in this paper, but may give intuition on our results.
In the standard and critical case, these Galton-Watson trees are critical. It is therefore not surprising to see two different limiting behaviors. When the law of reproduction has finite variance, we get one behavior related to the Brownian excursion and the Brownian continuum random tree (that we describe as the universal one). When the law of reproduction has infinite variance, we get the behavior related to stable trees. In the standard case, the law of reproduction always has finite variance.
On the contrary, in the degenerate case, the underlying Galton-Watson tree model is subcritical. At the limit, such trees conditioned to being large have one internal node of very high degree ([37, Theorem 7.1, case (ii)]). This node corresponds to a large simple permutation in the tree encoding a uniform random permutation in . It is therefore not surprising that is asymptotically close to a uniform simple permutation in .
Remark 1.12*.*
The reader may have noticed that all cases where we describe the asymptotic behavior of are such that .
Observe that it is always the case for proper permutation classes (i.e., permutation classes different from ). Indeed, from the Marcus-Tardos Theorem [46], the number of permutations of size in a proper class is at most , for some constant . For the class , we however do have , since there are asymptotically simple permutations of size [4, Theorem 5]. In this case, the sequence of permutons associated with a uniform permutation in converges in distribution to the uniform measure on . The situation where may happen as well for sets where is not downward-closed, but we leave these cases open.
1.6. Limits of proportions of pattern occurrences
Let us change our approach and discuss in this section the asymptotic behavior (as ) of the proportion of occurrences of a fixed pattern in (as done in [18, 19, 24, 35, 38, 39, 55] for uniform random permutations in various classes). Since most examples fit in that regime, we focus here on the standard case (when (H1) is satisfied).
As mentioned in Section 1.2 and explained in more details in Section 2, the convergence of towards implies the (joint) convergence in distribution
[TABLE]
The limiting random variables have been studied in [12, Section 9] (for ): in particular, is non-deterministic if and only if is separable of size at least 2 and it is possible to compute their moment algorithmically. These results are easily extended to the general case . Therefore, for separable patterns , (2) establishes the convergence of to a non-deterministic limit. Since these are bounded variables, their (joint) moments also converge to the (joint) moments of the limiting vector, which can be computed algorithmically (even if in practice only low order moments can be effectively computed; see the discussion in [12, Section 9]). Note that all these limiting moments are trivially nonzero, since these are moments of nondeterministic nonnegative random variables.
For nonseparable patterns however, the situation is different: (2) only entails the convergence of to [math]. Indeed, if is nonseparable, the limiting quantity is identically [math] (this is a consequence of [12, Proposition 9.1] when , the result being easily extended to ).
We can go further and ask whether \big{(}\operatorname{\widetilde{occ}}(\pi,\bm{\sigma}_{n})\big{)}_{n} has a limit in distribution with some appropriate normalization. We therefore investigate the moments of . In Section 5.6, we define some permutation statistics (see Eq. 33) and show that under the hypothesis (H1) we have the following asymptotic behavior333We say that the sequence behaves as if there are such that for every ..
Proposition 5.13. * For each and , we have . *
Proposition 5.13 also holds for separable patterns : in that case =0 and we have . No news here, since the moments of have nonzero limits, as previously explained. For nonseparable patterns, is positive and measures in some sense how nonseparable is. Note that the order of magnitude of is independent of , which implies that there is a set of probability on which the variables stays bounded away from [math] (see Corollary 5.15). This event of small probability contributes to the asymptotic behavior of moments, and thus the method of moments is inappropriate to find a limiting distribution for some appropriate normalization of . Finding such a limiting distribution is therefore left as an open question.
1.7. Outline of the proof
In our previous paper [12] (i.e. when the family of simple permutations is ), the proof of the convergence to the Brownian separable permuton strongly relied on a connection to Galton-Watson trees conditioned on having a given number of leaves. This allowed us to use fine results by Kortchemski [41] or Pitman and Rizzolo [52] on such conditioned random tree models.
For a general family , generalizing this approach would require delicate results on the asymptotic behavior of conditioned multitype Galton-Watson trees. Moreover, there are several other steps in the main proofs of [12], in particular the subtree exchangeability argument, that are not easily adapted.
The strategy developed in the present paper is different. We strongly use the framework of permutons. Indeed, we first show that to establish the convergence in distribution of to some random permuton , it is enough to prove the convergence of \Big{(}\mathbb{E}\left[\operatorname{\widetilde{occ}}(\pi,\bm{\sigma}_{n})\right]\Big{)}_{n} for every pattern (see Theorem 2.5). By definition, if and ,
[TABLE]
The asymptotic behavior of the numerator and denominator is then obtained with analytic combinatorics, which allows us to transfer from the behavior of a generating series near its singularity to the asymptotic behavior of its coefficients. This goes in three steps.
Step 1: Enumeration. We compute (or characterize by an implicit equation) some generating series. For instance to estimate the denominator of (3) we consider . We readily use the size-preserving bijection between and the class of -canonical trees, counted by the number of leaves. Hence the generating function we want to compute is the same as that of , denoted .
Using again the encoding of permutations by trees, the numerator can be described as a number of trees with marked leaves and some conditions on the tree induced by these marked leaves. Obtaining generating functions for such combinatorial classes is possible, and needs the introduction of several intermediary functions which count trees with various contraints, and possibly one marked leaf. This is detailed in Section 4.
Step 2: Singularity analysis. Then we want to know the singular behavior of the generating functions we computed so far. As it turns out, the singular behavior of some intermediate function drives the singular behavior of all the other series. The function is characterized by the implicit equation
[TABLE]
where is a known analytic function with radius of convergence that involves and some rational functions (see Eq. 23 p. 23). Hence the behavior of depends on whether there is a point inside the disk of convergence where , because around such a critical point, the equation (4) is not invertible. Since is a series with positive integer coefficients, it suffices to check the sign of , which can easily be translated in terms of the function . This is where the sign of appears, leading to the three different cases. More precisely444In this informal description, we left out some conditions on the singularity of that appear in the critical and degenerate cases.
- •
The standard case is equivalent to . In this case there is a unique critical point . As a result, the radius of convergence of is , and the analyticity of around implies that has a singularity of exponent (Proposition 5.8). Such a behavior is sometimes called branch point in the literature: is analytic at but the equation (4) has two solutions (called branches) near and one cannot find an analytic solution in a neighbourhood of . The solution is therefore singular at .
- •
The degenerate case is equivalent to . In this case there is no critical point in the disk nor at its boundary. As a result, the unique dominant singularity of is the point where reaches the singularity of . Moreover, has a bounded derivative at its singularity, and so has exponent , which is the same as the exponent of (Lemma 6.3).
- •
The critical case is equivalent to . In this case there is no critical point inside the disk , but the singularity of is a critical point. Once again the radius of convergence of is , but has no first derivative at its singularity. Here the exponent of the singularity of depends on that of the singularity of , and belongs to (see Lemma 7.3).
Once we have found the asymptotic behavior of , we should analyze the tree series found in Step 1. It is purely routine from an analytic point of view, but involves some combinatorial arguments, regarding the encoding of permutations by substitution trees.
Step 3: Transfer. Finally we use a transfer theorem of analytic combinatorics (Theorem A.3) to translate the singularity exponents we found in Step 2 into a limiting behavior for (3). Informally, a square-root singularity, which is the same as in "usual" families of trees, will lead to the Brownian separable permuton. A singularity of exponent in will lead to the -stable tree, where is the inverse of the exponent. A singularity of exponent will invariably lead to the degenerate case.
1.8. Organization of the paper
The paper is organized as follows (see also Fig. 5).
- •
Section 2 is devoted to proving useful results on the convergence of random permutons. The proofs heavily rely on previous estimates for deterministic permutons [36]. We believe that these general results regarding random permutons are interesting on their own, therefore these are presented in a self-contained way.
- •
In Sections 3 and 4, we prove nonasymptotic enumeration results for the number of permutations encoded by some given families of (decorated) trees. The main result is Proposition 4.5, which is the first step towards the estimation of .
- •
In Sections 5, 6, 7 we prove our main results: the convergence of the sequence of permutons. As already mentioned, the quantitative behavior depends on the family , more precisely on the sign of :
- –
Section 5 is devoted to the standard case . We show in Theorem 5.2 the convergence to the biased Brownian separable permuton.
- –
In Section 6, we consider the degenerate case .
- –
In Section 7, we consider the critical case . This case itself is divided into two subcases, according to whether the exponent (defined in Definition 6.6) is smaller (Section 7.1) or greater (Section 7.2) than .
- •
We postpone to Appendix A many useful results of complex analysis.
- •
Finally, Appendix B discusses how Figs. 1, 4 and 3 have been obtained.
2. Convergence of random permutons
In this section, we first recall the terminology of (deterministic) permutons, introduced in [36]. We also adapt their results to obtain criteria for the convergence in distribution of random permutons.
Notation. * Since this section involves many different probability spaces, we use a superscript on (and similarly on expectation symbols ) to record the source of randomness. In the case where the event (or the function ) depends on two random variables and , we interpret (or ) as the conditional probability (expectation) with respect to . *
2.1. Deterministic permutons and extracted permutations
Recall from Section 1.2 that a permuton is a probability measure on the unit square with uniform marginals. To a permutation of size , we can associate the permuton which is essentially the (normalized) diagram of , where each dot has been replaced with a small square of dimension carrying a mass .
Let be the set of permutons. We need to equip with a topology. We say that a sequence of (deterministic) permutons converges weakly to (simply denoted ) if
[TABLE]
for every bounded and continuous function . With this topology, is compact and metrizable by a metric which has been introduced in [36] (see Lemmas 2.5 and 5.3 in [36]):
[TABLE]
Since is compact, Prokhorov’s theorem ensures that the space of probability distributions on is compact (for convergences of measure, we refer to [16]).
Recall from Section 1.1 that for and , we have
[TABLE]
where is randomly and uniformly chosen among the subsets of with elements. The random permutation is called the induced subpermutation (of size ) in . We will define the pattern density of a pattern in a permuton by analogy with this formula.
Take a sequence of random points in , independently with common distribution . Because has uniform marginals and the ’s (resp. ’s) are independent, it holds that the ’s (resp. ’s) are almost surely distinct. We denote by the -ordered sample of , i.e. the unique reordering of the sequence such that . Then the values are in the same relative order as the values of a unique permutation, that we denote . Since the points are taken at random, is a random permutation of size . We call it the induced subpermutation (of size ) in . Then we set
[TABLE]
Rewriting this probability in an integral form, we get immediately:
[TABLE]
which identifies as a measurable function on the space of permutons.
In the following, as we consider a random permuton , we need to construct a finite sequence of points , which are independent with common distribution conditionally on . This is possible up to considering a new probability space where the joint distribution of is characterized as follows: for every positive measurable functional ,
[TABLE]
In this new probability space, we call the vector , and we use the notation , to highlight the two levels of randomness.
We end this section by the following two estimates, proved in [36].
Lemma 2.1** (Occurrences in a permutation and its associated permuton [36, Lemma 3.5]).**
If and , then
[TABLE]
Lemma 2.2** (Approximation of a permuton by a permutation [36, Lemma 4.2]).**
There is a such that if , for any permuton ,
[TABLE]
2.2. Random permutons and convergence in distribution
We now consider a sequence of random permutations (with of size ). An example of interest for the present paper is when, for each , is a uniform random permutation of size in a given class . Another example are the random permutations constructed above from a given random permuton . In the case where is deterministic, these correspond to the -random permutations from [36], used to prove that each permuton is the limit of some permutation sequence.
Taking independently from , we have for every of size :
[TABLE]
Similar, for a random permuton , we have
[TABLE]
This is a consequence of (6) above, applied to and combined with (5). The same argument may be applied to
[TABLE]
yielding a randomized version of Lemma 2.2.
Lemma 2.3** (Approximation of a random permuton by a random permutation).**
There is a such that if , for any random permuton ,
[TABLE]
This result has an important consequence for the distribution of random permutons.
Proposition 2.4** (Subpermutations characterize the distribution of ).**
Let , be two random permutons. If there exists such that for and every of size we have
[TABLE]
then .
Proof.
We need to prove that for every bounded and continuous function . Fix . It holds that
[TABLE]
where denotes a sequence of independent points with common distribution , conditionally on . The second term in the above display is zero by assumption. Moreover, from Lemma 2.3 the first and third terms go to zero when . ∎
Our main theorem in this section deals with the convergence of sequences of random permutations to a random permuton. It generalizes the result of [36] which states that deterministic permuton convergence is characterized by convergence of pattern densities. We extend their proof to the case of random sequences, where permuton convergence in distribution is characterized by convergence of average pattern densities, or equivalently of the induced subpermutations of any (fixed) size.
Theorem 2.5**.**
For any , let be a random permutation of size . Moreover, for any fixed , let be a uniform random subset of with elements, independent of . The following assertions are equivalent.
- (a)
* converges in distribution for the weak topology to some random permuton .* 2. (b)
The random infinite vector \big{(}\operatorname{\widetilde{occ}}(\pi,\bm{\sigma}_{n})\big{)}_{\pi\in\mathfrak{S}} converges in distribution in the product topology to some random infinite vector . 3. (c)
For every in , there is a such that
[TABLE] 4. (d)
For every , the sequence \big{(}\operatorname{pat}_{{\bm{I}}_{n,k}}(\bm{\sigma}_{n})\big{)}_{n} of random permutations converges in distribution to some random permutation .
Whenever these assertions are verified, we have and for every ,
[TABLE]
Observation 2.6*.*
In item (c) above, it is enough to consider all of size at least . Indeed, for , the statement is trivial, since is identically .
**Proof of (a)(b). ** Let be a finite sequence of patterns. By [36, Lemma 5.3], the map is continuous. Therefore, implies
[TABLE]
Using Lemma 2.1, one can replace each by in the above convergence. This proves the convergence in distribution of all induced permutations \big{(}\operatorname{\widetilde{occ}}(\pi_{i},{\bm{\sigma}_{n}})\big{)}_{1\leq i\leq k}, and hence of \big{(}\operatorname{\widetilde{occ}}(\pi,\bm{\sigma}_{n})\big{)}_{\pi\in\mathfrak{S}} in the product topology (see for instance [16, ex. 2.4 p. 19]).
**Proof of (b)(c). ** If , as takes values in , we have
[TABLE]
**Proof of (c)(d). ** Fix and consider the sequence
[TABLE]
which converges if (c) holds (the equality comes from Eq. 7). Since is a random variable taking its values in the finite set , this says exactly that the sequence \big{(}\operatorname{pat}_{{\bm{I}}_{n,k}}(\bm{\sigma}_{n})\big{)}_{n} converges in distribution.
Proof of (d)(a). Consider a sequence of random permutations satisfying (d), i.e. for every , there is a random permutation such that . Put differently, for every pattern of size , we have
[TABLE]
From Lemma 2.1 and Eq. 7, we get
[TABLE]
Set . Then, using Eq. 8, for every , we have
[TABLE]
In other words, . Since takes its values in a finite set of permutons, this implies
[TABLE]
Let be a bounded continuous functional. It holds that
[TABLE]
The first term can be bounded by introducing the modulus of continuity of , which is defined as . Since is compact, it goes to [math] when goes to [math]. Hence,
[TABLE]
As for the second term, for large enough, Lemma 2.3 yields
[TABLE]
Putting things together, we obtain
[TABLE]
Assume that has a subsequence converging in distribution to a random permuton . Taking the limit when of (10) along this subsequence, we get
[TABLE]
(Recall indeed that converges to in distribution.) The right-hand side tends to [math] when tends to infinity, which proves that converges to in distribution as well.
Therefore, all converging subsequences of converge to the same limit , which is the limit of . Thanks to the compactness of the space of probability distributions on , this is enough to conclude that has indeed a limit. Item (a) is proved.
**Proof of additional statements. ** Assume that (a)–(d) hold. That follows from the proof of (a)(b). Fix any integer , and any permutation of size . The above equality in distribution implies . That is clear from the proof of (b)(c). The equality follows from the proof of (c)(d). Finally, comes from Eq. 8. ∎
Remark 2.7*.*
In some sense, Theorem 2.5 can be seen as an analogue of a theorem of Aldous for random trees [9, Theorem 18]. Both in permutations and trees, there is a natural way to construct a smaller structure from elements of a big structure (induced subpermutations or subtrees). The goal is then to reduce the convergence of the big structure to the convergence, for each , of the induced substructures. For trees, we need an extra tightness assumption (that the family of trees is “leaf-tight” in Aldous’ terminology). In our case, since the space of permutons is compact, we do not need such an assumption.
We finish this section by a comment on the existence of random permutons with prescribed induced subpermutations.
Definition 2.8**.**
A family of random permutations is consistent if
- i)
for every , , 2. ii)
for every , if is a uniform subset of of size , independent of , then .
It turns out that consistent family of random permutations and random permutons are essentially equivalent:
Proposition 2.9**.**
If is a random permuton, then the family defined by is consistent. Conversely, for every consistent family of random permutations , there exists a random permuton whose distribution is uniquely determined, such that . In that case, .
Proof.
Set . The first assertion follows from the following coupled construction of and : is a uniform random subset of , chosen independently of it. It follows that , for some random subset of . By construction, the distribution of is uniform and independent of . Hence the consistency follows.
The converse is immediate, by applying the implication (d)(a) and the last assertion of Theorem 2.5 to the sequence . Consistency ensures that we get the prescribed induced subpermutations, and uniqueness in distribution follows by Proposition 2.4. ∎
3. Coding permutations by trees
3.1. Substitution trees
As seen in Section 1.3 (Theorem 1.5), any permutation can be recursively decomposed using substitutions in a canonical way and this decomposition can be encoded in a canonical tree. However, if we do not impose conditions on and the ’s (as done in Theorem 1.5), a permutation may be represented in many ways as a substitution , where the ’s themselves may be further decomposed using substitutions. Such decompositions can be recorded in substitution trees.
Definition 3.1**.**
*A rooted planar tree is either a leaf, or consists of a root node with an ordered -tuple of subtrees attached to the root, which are themselves rooted planar trees.
In our context, the size of a tree is its number of leaves. It is denoted , whereas denotes the number of nodes of (including both leaves and internal nodes).*
Internal vertices of all trees considered in this paper have degree at least . It is natural (and also convenient for counting purposes in Section 4) to consider that the single leaf of the tree of size is also its root (and is therefore also denoted ).
Since we work with planar trees, we can label their leaves canonically with the integers from to : the leaf labeled by is the th leaf met in the depth-first traversal of which choses left before right. A subset of the set of leaves of a tree is therefore canonically represented by a subset of .
Definition 3.2**.**
A substitution tree of size is a labeled rooted planar tree with leaves, where any internal node with children is labeled by a permutation of size . Internal nodes with only one child are forbidden.
Internal nodes labeled by the ascending permutation or the descending permutation (for some ) will play a particular role. Therefore we replace every such label with a (for ascending permutations) or a (for descending permutations). Since the size of a label corresponds to the number of children and since there is exactly one ascending (resp. descending) permutation of each size, there is no loss of information in this replacement. Internal nodes labeled or are called linear nodes, the other nodes being called nonlinear. Among nonlinear nodes, the ones labeled by simple permutations are called simple nodes.
An example of substitution tree is shown in Fig. 6, left.
Definition 3.3**.**
Let be a substitution tree. We define inductively the permutation associated with :
- •
if is just a leaf, then ;
- •
if the root of has children with corresponding subtrees (from left to right), and is labeled with the permutation , then is the permutation obtained as the substitution of in :
[TABLE]
Fig. 6 illustrates this construction. When , when say that is a tree that encodes , or a tree associated with . Nonsimple permutations are encoded by several trees . However, if we restrict ourselves to canonical trees (which are particular cases of substitution trees; see Definition 1.6), we have uniqueness. Indeed, from Theorem 1.5, to any permutation we can associate uniquely a canonical tree such that .
The remaining of Section 3.1 is devoted to the proof of simple combinatorial lemmas on the structure of the set of substitution trees associated with a given permutation . These lemmas are useful in Section 5.
We first make the following observation. Take a substitution tree of some permutation with a marked node labeled by . Consider also a substitution tree of . Then replacing by the tree yields a new substitution tree of the same permutation . (When doing this replacement the subtrees attached to are glued on the leaves of , respecting their order, see Fig. 7.) This operation will be referred to as the inflation of with .
Conversely, consider a connected set of internal nodes in a substitution tree of . From this set we build a substitution tree whose set of internal nodes is , the ancestor-descendant relation in is inherited from the one in , and we add leaves so that the degree of each node of is the same in than in . We denote . Then merging all nodes in into a single node labeled by turns into a new substitution tree of the same permutation . We call this a merge operation. For example, the tree of Fig. 7 can be obtained from the tree of the same figure by merging the nodes labeled and .
We now consider a last family of substitution trees. An expanded tree is a substitution tree where nonlinear nodes are labeled by simple permutations, while linear nodes are required to be binary.
Lemma 3.4**.**
Any expanded tree of is obtained from its canonical tree by inflating all nodes labeled by (resp. ) with binary trees whose internal nodes are all labeled by (resp. )
Proof.
Let be an expanded tree of . Consider, if any, two adjacent linear nodes of with the same label (either both or both ) and merge them. Note that the resulting node will still have label or . We repeat this operation until there is no adjacent linear nodes with the same label. Nonlinear nodes in the resulting tree are all labeled by simple permutations: it is the case in (by definition of expanded trees) and we did not create any new nonlinear nodes. Therefore satisfy all conditions of canonical trees (see Definition 1.6). By uniqueness, is the canonical tree of . Reversing the merge operations, can be obtained from by inflating its linear nodes, which proves the proposition. ∎
We recall a fact well-known to combinatorialists: the number of complete binary trees (i.e. plane rooted trees, whose internal vertices have all degree ) with leaves is . Therefore each linear node of degree of the canonical tree, can be inflated with a binary tree in ways. We therefore get the following interesting corollary, regarding the number and properties of expanded trees.
Corollary 3.5**.**
Let be a permutation and (resp. ) be the degrees of the nodes labeled (resp. ) in the canonical tree of . Then
- •
the number of expanded trees of is , where we denote by the -th Catalan number, which counts complete binary trees with leaves.
- •
each expanded tree of has nodes labeled and nodes labeled .
- •
the labels of the nonlinear nodes in any expanded tree of are the same as in its canonical tree.
Lemma 3.6**.**
Any substitution tree of can be obtained from some expanded tree of by merge operations.
Proof.
The proof is similar to that of Lemma 3.4. Starting from any substitution tree of and inflating every node that is neither simple nor binary by an expanded tree encoding its label, we get an expanded tree. Reversing these inflation operations, we can obtain any substitution tree from some expanded tree of , using only merge operations. ∎
3.2. Induced trees
Since permutations are encoded by trees and since we are interested in patterns in permutations, we consider an analogue of patterns in trees: this leads to the notion of induced trees.
Definition 3.7** (First common ancestor).**
Let be a tree, and and be two nodes (internal nodes or leaves) of . The first common ancestor of and is the node furthest away from the root that appears on both paths from to and from to in .
The following simple observation allows to read the relative order of and in any substitution tree encoding .
Observation 3.8*.*
Let be two leaves of a substitution tree and . Let be the first common ancestor of in and be the permutation labeling . We define (resp. ) such that the -th (resp. -th) child of is an ancestor of (resp. ).
Then if and only if .
Definition 3.9** (Induced tree).**
Let be a substitution tree, and let be a subset of the leaves of . The tree induced by is the substitution tree of size defined as follows. The tree structure of is given by:
- •
the leaves of are the leaves of labeled by elements of ;
- •
the internal nodes of are the nodes of that are first common ancestors of two (or more) leaves in ;
- •
the ancestor-descendant relation in is inherited from the one in ;
- •
the order between the children of an internal node of is inherited from .
The label of an internal node of is defined as follows:
- •
if is labeled by a permutation in , the label of in is given by the pattern of induced by the children of having a descendant that belongs to (or equivalently, to ).
A detailed example of the induced tree construction is given in Fig. 8.
Note that if has label in , it has also label in . Indeed, nodes correspond to increasing permutations and all patterns of increasing permutations are increasing permutations. The same holds with . The converse is however not true: a node can be linear in but nonlinear in ( e.g. the bottommost green node in Fig. 8).
Observation 3.10*.*
By definition, for any substitution tree with leaves and subset of , is a substitution tree. However, if is a canonical tree, is a substitution tree which is not necessarily canonical (see for example Fig. 8).
An important feature of induced trees is the following, which follows from 3.8 and is illustrated in Fig. 9.
Lemma 3.11**.**
Let be a substitution tree with leaves, and be a subset of . We have
[TABLE]
As a consequence of this formula, counting the total number of occurrences of a given pattern in some family of permutations can be reduced to counting the total number of induced trees equal to a given in the corresponding family of canonical trees. This is precisely the goal of the next section.
4. Exact enumeration of various families of trees
Let be a fixed family of simple permutations. Recall that its generating function is
[TABLE]
where is the number of permutations of size in . An -canonical tree is any canonical tree whose simple nodes carry labels in . We denote by the combinatorial class of -canonical trees, the size of a tree being its number of leaves. Recall that is by definition the set of permutations whose canonical tree is in . Since canonical trees encode permutations in a unique way, defines a size-preserving bijection between and . Both have therefore the same generating function which we denote by
[TABLE]
In Section 4.1 below, we explain how to compute starting from the datum . We then study families of -canonical trees with one marked leaf, with constraints on the root and/or on the marked leaf. These are building blocks for Section 4.2, where we consider the family of -canonical trees with marked leaves, inducing a given tree .
4.1. Generating functions of -canonical trees (possibly with marked leaves)
In order to compute in terms of , we need to introduce the auxiliary family (resp. ) of -canonical trees with a root (always denoted ) that is not labeled (resp. ), and its generating function (resp. ):
[TABLE]
Note that replacing all labels by and by defines an involution on -canonical trees. This implies in particular and will be used to get other similar identities below.
Proposition 4.1**.**
Together with the condition , the generating function is determined by the following implicit equation
[TABLE]
The main series is then simply given in terms of by
[TABLE]
Proof.
A tree of is either a leaf, or a root labeled and a sequence of at least two trees in , or a root labeled and a sequence of at least two trees in , or a root labeled by and a sequence of unconstrained trees. Therefore
[TABLE]
Similarly,
[TABLE]
By combining these two equations we get , that is Eq. 12. Substituting it back in Eq. 13 gives Eq. 11.
Observe, that under the assumption , Eq. 11 allows one to compute inductively the coefficients of . Hence is uniquely determined by and Eq. 11, as claimed. ∎
We now consider trees with a marked leaf. As before, subscripts indicate a constraint on the root. The generating function of trees with a marked leaf counted by their number of unmarked leaves is obtained by differentiating the generating function of trees without marked leaf: , , . Indeed,
[TABLE]
Accordingly, we denote by , and the families of trees counted by , and .
Consistently, we use superscripts when we consider families of trees with a marked leaf that satisfies an additional constraint, and similarly for their generating function. We say that a leaf is -replaceable (resp. -replaceable) if it may be replaced by a tree whose root is labeled (resp. ) without violating the definition of canonical trees (see the third item in Definition 1.6). In other words, its parent (if it exists) should be labeled by or by a simple permutation (resp. by or by a simple permutation). We then denote (resp. ) the families of trees in with a -replaceable marked leaf (resp. a -replaceable marked leaf). Similar definitions hold for , , and .
As for , we take the convention that and all generating functions with superscript count trees according to the number of unmarked leaves. By definition, has constant coefficient (corresponding to the tree consisting of a single leaf). We however take the convention that has constant coefficient [math]: in other words, the single leaf is excluded from the family (intuitively, a single leaf cannot be replaced by a tree with root labeled , since the trees in should not have a root labeled ).
Proposition 4.2**.**
The generating functions , and are given by the following formulas:
[TABLE]
where and are given by Eq. 12 and .
Quantities with a minus superscript are obtained by symmetry: , and .
Proof.
Consider a tree in . As explained above, and we distinguish cases according to the label of the root of , which may be either or a simple permutation.
- i)
The root of is labeled (see left of Fig. 10). Then can be decomposed as a tree in (which may be a single leaf) and a nonempty pair of sequences of unmarked trees in . 2. ii)
The root of is labeled by a simple permutation of size (see right of Fig. 10). Then can be decomposed as a -uple of unconstrained trees, with one of them having a -replaceable marked leaf.
Therefore we have
[TABLE]
where counts nonempty pairs of sequences of unmarked trees in (since ).
Similarly, we have
[TABLE]
The above three equations form a system with three indeterminates: , and ( and are known thanks to Eq. 12). Solving this system gives Eqs. 14, 15 and 16.
The symmetry argument giving , and consist as before in exchanging and labels in -canonical trees. ∎
4.2. Generating function counting trees with marked leaves inducing a given tree
To enumerate trees with marked leaves inducing a given tree, we introduce another kind of generating functions. Recall that for any permutations and , is the number of occurrences of in . For a permutation , we set
[TABLE]
Observation 4.3*.*
For and any fixed , . Therefore is related to the -th derivative of by . This implies that the radius of convergence of each is at least , the radius of convergence of .
Fix a substitution tree with leaves. Let us call the family of -canonical trees with marked leaves such that these leaves induce :
[TABLE]
We define the size of an object as the number of leaves in (both marked and unmarked). The corresponding generating series is denoted
Let . As noted after the definition of induced trees, a nonlinear node of has to come from a nonlinear node of , whereas a linear node of may come from a linear or a nonlinear node of . In order to ease the enumeration, we partition according to the set of nodes of coming from nonlinear nodes of (that is, simple nodes of since is canonical).
More formally, let be the set of internal nodes of a tree . With each , we associate the set of the first common ancestors of in . From the definition of induced tree, a node in corresponds to a unique node in , that we denote . For , let
[TABLE]
Clearly is nonempty if and only if contains every nonlinear node of . An example of a marked tree with the corresponding pair is shown on Fig. 11. In pictures, we will always circle nodes in and the corresponding nodes in .
Definition 4.4**.**
A decorated tree is a pair where
- •
* is a substitution tree;*
- •
* is a subset of that contains all nonlinear nodes.*
Therefore we have the following decomposition:
[TABLE]
Let be a decorated tree. We consider the generating function of , the size of being its number of leaves (both marked and unmarked):
[TABLE]
To compute , we introduce some notation. For every internal node of , let
- •
be the permutation labeling ,
- •
be its number of children which are leaves or in ,
- •
be its number of children which are not in and are labeled by ,
- •
be its number of children which are not in and are labeled by ,
- •
be its total number of children.
We also set the type of root to be if the root of is in , and (resp. ) if the root is not in and labeled (resp. ).
Proposition 4.5** (Enumeration of trees with marked leaves inducing a given decorated tree).**
Let be a decorated tree and be its number of leaves. Then
[TABLE]
where
[TABLE]
Proof.
The proof is based on a decomposition of marked -canonical trees of followed by a study of the series depending on the type of the node in .
**First step: Decomposing a tree in .
**We fix a decorated tree with leaves and a marked -canonical tree . We want to decompose into subtrees, one for each internal node of plus one attached to the root of . Recall that is the correspondence between the internal nodes of and the set of first common ancestors of leaves in .
For every internal node of , let be the subtree of defined as follows.
- •
The root of is .
- •
The nodes of are descendants of .
- •
A descendant of in belongs to if and only if its first proper ancestor in is (proper meaning different from the node itself).
Moreover we define as the subtree of rooted at the root of and containing the nodes of having no proper ancestor in ; stands for “bottom” and is used here as a symbol, not as a variable. (If maps the root of to the root of , then is reduced to a leaf.)
A schematic representation of the trees and is given in Fig. 12.
By definition, a node of that is not in belongs to exactly one . On the contrary, if is in , then is the root of and is a leaf of another , where is the parent of . (If is the root of , then there is no such , and is a leaf of .)
By construction of and , their leaves are either leaves of or belong to . We mark the leaves that belong to or that are marked leaves of . In this way, the trees and that we have constructed are marked trees.
The following properties are straightforward to check.
- i)
The tree is an -canonical tree with marked leaves. 2. ii)
The root of is nonlinear if and only if . 3. iii)
The root of is if and only if and is labeled . 4. iv)
The root of is if and only if and is labeled . 5. v)
The marked leaves of belong to subtrees coming from distinct children of the root of . The pattern induced by the position of those children on the permutation labeling the root of is . (For example, in Fig. 11, four marked leaves are branched on the node labeled at positions . This implies that the corresponding node in is labeled with .) 6. vi)
Let be the -th child of in . If , and its label is a (resp. a ), then the -th marked leaf of must be -replaceable (resp. -replaceable).
The combinatorial class of trees satisfying properties i) to vi) will be denoted .
In addition, we observe that is an -canonical tree with one marked leaf; moreover, if the root of is not in and is labeled by (resp. ), then the marked leaf of must be -replaceable (resp. -replaceable).
This yields a map
[TABLE]
We claim that this map is a bijection, and that the inverse map is obtained as follows. Let us be given and a collection of trees , one for each internal node of . We first take and glue on it, the root of replacing the marked leaf of . We then proceed inductively: if has already been glued and is the -th child of , we glue on , by replacing the -th marked leaf of with the root of . This yields a tree with marked leaves, denoted . This tree is -canonical because of items i), iii), iv) and vi) of the definition of : we only glue trees with root (resp. ) on -replaceable leaves (resp. -replaceable leaves). By construction, the marked leaves induce a tree having the same structure as and item v) of the definition of ensures that the labels in the induced tree and in do match. Because of item ii), the tree is indeed in . We have therefore constructed a map from to . By construction, this map indeed inverts , and is a bijection.
Let be the generating function of the combinatorial class , counted by the number of unmarked leaves. If verifies (19), then (18) follows from the fact that is a bijection. Note indeed that the factor in (18) comes from the fact that we count marked leaves in the series in the left-hand side and but not in the series in the right hand side (the bijection leaves the number of unmarked leaves invariant).
We are left to show that the generating function verifies (19).
**Second step (i): Computing when .
**Recall that the marked leaves of any tree belong to subtrees coming from distinct children of the root of . Since , the elements of can be uniquely decomposed as follows (this decomposition is illustrated on Fig. 13).
- i)
The root of should be labeled by a simple permutation in ; among the children of the root, are marked (corresponding to the subtrees containing a marked leaf) and the pattern of corresponding to the positions of these marked children should be . (In Fig. 13, , the marked leaves are the first, third and fourth subtrees, and the pattern of corresponding to positions is indeed ) 2. ii)
We glue unmarked -canonical trees with arbitrary roots on the unmarked children of . (In Fig. 13, we have only one such tree, which is glued on the second child of the root); 3. iii)
We glue -canonical trees with one leaf marked and an arbitrary root on the marked children of . In addition,
- •
for of these trees, there is no constraint on the marked leaf. (In Fig. 13, the trees glued on the third and fourth children of the root are unconstrained trees with a marked leaf.)
- •
For (resp. ) of these trees, the marked leaf must be -replaceable (resp. -replaceable). (In Fig. 13, we must glue a tree with a -replaceable marked leaf on the first child of the root.)
The generating functions of the first two steps can be computed as
[TABLE]
where is defined in Eq. 17 p. 17. Indeed, once the label of the root is chosen, counts the number of ways to mark children of the root in step i), and comes from step ii). Step iii) yields an additional factor . This proves the formula (19) in the case where is in .
**Second step (ii): Computing when .
**When is not in and labeled by , the elements of the class can be uniquely decomposed as follows (this decomposition is illustrated on Fig. 14).
- i)
The root is labeled by . 2. ii)
We attach to the root -canonical trees whose root is not labeled by , each with one marked leaf. In addition,
- •
for of these trees, there is no constraint on the marked leaf. (In Fig. 14, the two right-most nonhatched trees attached to the root are trees with an unconstraint marked leaf.)
- •
for (resp. ) of these trees, the marked leaf must be -replaceable (resp. -replaceable). (In Fig. 14, the left-most nonhatched tree attached to the root should have a replaceable marked leaf.) 3. iii)
Between and around these trees, we attach possibly empty sequences of unmarked -canonical trees whose root is not labeled by . (In Fig. 14, each of these sequences is represented by a hatched blob.)
Item i) does not involve any choice. Choices in item ii) are counted by , while item iii) yields a factor . This proves the formula (19) in the case where is not in and labeled by .
The case when is not in and is labeled by follows by symmetry. This ends the proof of the combinatorial identity (19) and therefore of Proposition 4.5. ∎
5. Asymptotic analysis: The standard case
Let be a set of simple permutations. The goal of this section is to precisely state, and then prove, Theorem 1.10 (p.1.10): the convergence to the biased Brownian separable permuton of uniform random permutations in when satisfies Condition (H1).
5.1. Definition of the biased Brownian separable permuton and statement of the theorem
The (unbiased) Brownian separable permuton was defined in [12]. Because the biased Brownian separable permuton is a one-parameter deformation of it, it is useful to first recall some facts about the substitution trees encoding separable permutations and the (unbiased) Brownian separable permuton.
As noted in Section 1.3, the canonical trees (called decomposition trees in [12]) of separable permutations are those whose internal nodes are all labeled or . If we consider more generally substitution trees, the following implication still holds: if is a substitution tree whose nodes are labeled or , then is a separable permutation.
Recall from Section 3.1 that an expanded tree is a substitution tree where nonlinear nodes are labeled by simple permutations, while linear nodes are required to be binary. In the case of separable permutations, we do not have simple nodes, so that expanded tree are binary trees labeled with and . These are also referred to as separation trees in the literature. Fig. 15 shows a separable permutations together with two separation trees associated with it.
For any separable permutation , we denote by its number of separation trees. If is not separable, we set . It is shown in [12, Prop. 9.1] that the Brownian separable permuton satisfies the following property: for any and any ,
[TABLE]
where, as before, we denote by the -th Catalan number, which counts complete binary trees with leaves. In other words, the random permutation of size extracted from is distributed like the permutation encoded by a uniform complete binary tree with leaves, and hence internal nodes, whose signs are chosen uniformly and independently in . In light of Proposition 2.4 and Eq. 8 (p. 8), this characterizes the law of the Brownian separable permuton among random permutons.
The biased Brownian separable permuton of parameter has a similar characterization, except that the signs are now chosen with a bias. For a separable , let (resp. ) be the number of internal nodes labeled (resp. ) in a separation tree of . Even if this is not relevant for the present paper, let us observe that (resp. ) is simply the number of ascents (resp. descents) of 555 To see this, observe that each internal node of a separation tree is the first common ancestor of exactly one pair of consecutive leaves (the right-most leaf of its left subtree and the left-most leaf of its right subtree). This two consecutive leaves, corresponding to consecutive elements of the permutation, form an ascent (resp. a descent) if and only if is labeled by (resp. ).. In particular, and do not depend on the choice of a separation tree (this is also a particular case of Corollary 3.5).
Definition 5.1**.**
The biased Brownian separable permuton of parameter is the random permuton characterized by the following relations: for all and all ,
[TABLE]
(Note that the right-hand side is zero if is not separable.)
Several remarks are in order.
- •
For , we get the unbiased Brownian separable permuton.
- •
This characterization of is equivalent to the following: for every ,
[TABLE]
where is a uniform binary planar tree with leaves, where each internal node is labeled (resp. ) with probability (resp. ), independently from each other.
- •
The existence of is not immediate from this definition, but according to Proposition 2.9, it suffices to show that forms a consistent family of random permutations. This is indeed the case, and follows from the fact that a uniform induced subtree of of size is distributed like (this is, e.g., a consequence of Rémy’s algorithm to generate uniform random binary trees [54]).
- •
The definition of , and the above argument justifying its existence, are not constructive. For an explicit construction of starting from a Brownian excursion, see [42].
- •
Knowing a priori that such a permuton exists is not necessary for the proof of our main theorem. Indeed, we will prove that the quantity converges to the right-hand side of Eq. 20 (for all patterns ), where is a uniform permutation in , and the parameter depends on . From Theorem 2.5, this implies the existence of a random permuton satisfying (20) (only for the relevant value of ; not for all ) and the convergence in distribution of to .
Now, we have all the necessary definitions to make explicit the parameter of the statement of Theorem 1.10, that we restate in a full version. (Recall also the definition of from Eq. 17, p. 17.)
Theorem 5.2**.**
Let be a set of simple permutations such that
[TABLE]
For every , let be a uniform permutation in , and let be the random permuton associated with . The sequence tends in distribution in the weak convergence topology to the biased Brownian separable permuton of parameter , where
[TABLE]
and is the unique point in such that .
Since is a power series with nonnegative coefficients, is increasing and continuous on (as the sum of two increasing and continuous functions). It therefore takes all values from to some positive number (possibly ) exactly once. This entails the existence and uniqueness of .
Example 5.3*.*
In many cases , and then and is the unbiased Brownian separable permuton. This is the case with separable permutations (), with or , and with any set of simple permutations stable by taking reverse or complement, like the one considered in the introduction .
Example 5.4*.*
When is the family of increasing oscillations (see for instance [14]), we can compute
[TABLE]
We get through numerical approximation and deduce .
Example 5.5*.*
Taking to be the family of simple permutations in , we are interested in the class which is the substitution-closure of . In this case, [8] gives
[TABLE]
We get through numerical approximation . It seems hard to compute the generating series , but we can locate its value at by exhaustively computing the number of inversions of each permutation in up to a certain order , and controlling the rest of the series using the fact that a permutation of size in cannot have more than inversions666Permutations avoiding consist of two increasing subsequences. The number of inversions of of size is therefore at most . The claim follows.. Performing this with yields .
The remainder of Section 5 is devoted to the proof of Theorem 5.2, using generating functions from Section 4 and methods of analytic combinatorics. More precisely, using Theorem 2.5 we are interested in the limit of , which we will express in terms of probability that a tree with marked leaves induces a given subtree. This probability itself will be expressed as the ratio of the coefficients of the generating function of trees with marked leaves inducing a given subtree and of the generating function of trees without marked leaves. We begin with the study of the asymptotics of these generating functions.
Notation: throughout the article, the class , or equivalently its set of simple permutations , are considered as fixed, and so is the pattern or the tree of which we are studying the proportion of occurrences (and therefore their size ). Constants in asymptotic expansions, including the ones in , and symbols, may therefore depend on these objects.
5.2. Asymptotics of the generating function of trees with no or one marked leaf
[TABLE]
where
[TABLE]
We denote by the radius of convergence of . Note that . We will also use repeatedly the inverse equation . In the following, to lighten the notation, we write . Note that may be .
Observation 5.6*.*
Differentiating Eq. 23, we get
[TABLE]
In particular, it follows that the condition (H1) is equivalent to and .
Observation 5.7*.*
Since is analytic at [math] with nonnegative coefficients, the same holds for . Moreover, the series expansion of is
[TABLE]
In particular it is aperiodic, in the sense given in Section A.1.
Proposition 5.8** (Asymptotics of the generating function of -canonical trees with no marked leaf).**
Assume that (H1) holds, and recall that is defined by . There is a unique such that , and we have . The generating functions and have the same radius of convergence and have a unique dominant singularity777For the reader who is not familiar with complex analysis, all useful definitions and results are given in Appendix A. In particular, ”near ” means ”in a -neighborhood of ”, where ”-neighborhood” is defined in Definition A.2. The formal definition of (unique) dominant singularity is given in Eq. 57 p.57. in . Their asymptotic expansions near are:
[TABLE]
where and . In particular, and are convergent at and
[TABLE]
This type of behavior with a square-root dominant singularity is classical for series defined by an implicit equation (such as , which is characterized by ), that belong to the smooth implicit function schema [30, Def. VII.4]. This schema is defined by the existence of a solution to some characteristic equation, which in our case reduces to hypothesis (H1), as explained in Section 1.7. Our result is a special case of [21, Th. 1], a general result for equations of the form . Implicit equations of this form characterize generating functions of weighted trees counted by their number of leaves, and are also considered in [52, Prop. 8].
Proof.
From 5.7, is strictly increasing in the real interval . Together with the fact that and the assumption (see 5.6), this proves the existence and uniqueness of such that .
Setting in , which is given by (24), we have . It follows that . By uniqueness of , we conclude .
We now consider the expansion of and deduce afterwards the one of . From Eq. 22, we have . Then Theorem 1 in [21] gives that is analytic at [math] and has a unique dominant singularity of exponent in , with the expansion given in Eq. 26: .
The next step is to justify that . That follows from (since ). Moreover, since has nonnegative coefficients and no constant term, we have , so that .
Finally, we look at the series (see Eq. 12). Observe that . Consequently, the dominant singularity of is the same as , i.e. , and is still unique – indeed this singularity is reached before that the denominator vanishes; more formally, this is a particular case of subcritical composition (Lemma A.6). The asymptotic expansion of near is obtained through the following computation:
[TABLE]
Proposition 5.9** (Asymptotics of the generating function of -canonical trees with marked leaves).**
All generating functions , , , , , , , and have a unique dominant singularity in . They diverge at the singularity and behave as , where the constant is given in the table below:
[TABLE]
with (recall that was defined in Proposition 5.8).
Note that the table is in fact a rank matrix. Namely, passing from a root different from (resp. ) to a nonconditioned root always adds a factor , independently of the condition on the marked leaf. Similarly, removing the leaf condition always yields the same factor independently of the conditions on roots.
Proof.
By singular differentiation (see Theorem A.4) of Eqs. 25 and 26, we have (near )
[TABLE]
Since , we have obtained the constants in the first line of the table. We now turn to the two last lines. In order to use Eqs. 14, 15 and 16 p.14, we first compute the expansions of all intermediate quantities appearing in these formulas. From Eq. 26, we obtain the following expansion near :
[TABLE]
We turn to the expansion of . Putting in (which is given by (24)) and using Eq. 12, we have
[TABLE]
Recall that . Therefore, the composition is subcritical (see Lemma A.6). This implies that has a unique dominant singularity at , and plugging in the asymptotic expansion (26) of at , we obtain
[TABLE]
where we used the equalities (by definition of ) and (by definition of ). Combining Eq. 26 and Section 5.2 into Eq. 28, we obtain, after simplification:
[TABLE]
The expansion of then follows from Eq. 27 and Eq. 30:
[TABLE]
We can now derive the expansions of our generating functions, using Eqs. 14, 15 and 16. First,
[TABLE]
Then,
[TABLE]
Since takes value at , the series has the same first-order expansion:
[TABLE]
By symmetry, we have , and , and this completes the proof of the proposition. ∎
5.3. Asymptotics of the generating function of marked trees with a given induced tree
We recall some notation introduced in Section 4.2. Let be a substitution tree with leaves and edges.
Let (resp. , ) be the set of nonlinear nodes (resp. nodes labeled , ) in . Recall that, for , is the degree of and the permutation labeling , and that is the set of -canonical trees with marked leaves such that these leaves induce . Denote by the generating function of (where the size is the number of leaves, both marked and unmarked).
Proposition 5.10**.**
The series has a unique dominant singularity in , with the asymptotic expansion , where the constant is
[TABLE]
Proof.
By definition, , where the sum runs over sets such that is a decorated tree. We start from the formula for , which is given by Proposition 4.5. From Proposition 5.9, the nine series of trees with one marked leaf all have unique dominant singularities in . This is also the case for the functions and by subcritical composition (see Lemma A.6). Indeed, from 4.3 the radius of convergence of is at least and from Proposition 5.8, and are convergent at with and . As a consequence, has a unique dominant singularity in (see Lemma A.5).
For exact asymptotics near , note that and converge respectively to and at (see Proposition 5.8), while the nine series behave as , where the constants are given in Proposition 5.9. We thus get using the notation of Proposition 4.5:
[TABLE]
The asymptotic behavior of near is then obtained by multiplying the above expressions. The formula can be simplified by observing that and , and we obtain:
[TABLE]
To write the second line, we have used that
[TABLE]
Now we have that is the sum of over sets such that is a decorated tree. By definition, such can be written as for some (the notation , introduced right before the proposition, is the set of nonlinear nodes of ). This change of variables leads to
[TABLE]
We first observe that since , the last factor simplifies as . The proposition then follows by writing the sum of products on the second line as a product of sums. ∎
5.4. Probability of tree patterns
Recall that is the set of -canonical trees (ie canonical trees of permutations in ). We take a uniform random tree with leaves in and mark of its leaves, also chosen uniformly at random. We denote by the tree induced by the marked leaves.
Proposition 5.11**.**
Let , and let be any substitution tree with leaves. Then
[TABLE]
where is given by Eq. 32, and is the number of edges of .
Proof.
Directly from the definition, we have:
[TABLE]
The Transfer Theorem (Theorem A.3) gives us the asymptotic behavior of and from the asymptotic expansions in Proposition 5.8 (Eq. 25) and Proposition 5.10. Deriving the result from there is a routine exercise. ∎
5.5. Back to permutations
Let be a permutation of size . Recall from Section 3.1 that an expanded tree is a substitution tree where nonlinear nodes are labeled by simple permutations, while linear nodes are required to be binary. As in Corollary 3.5, we denote the number of expanded trees of . We know (see Corollary 3.5) that they have all the same number of linear nodes labeled (resp. ), this number being denoted by (resp. ) and they all contain the same simple nodes, whose labels will be denoted as .
We introduce the default of binarity of the permutation :
[TABLE]
Observe that if and only if is separable.
Finally, to state the next proposition, we also need to introduce the quantities
[TABLE]
Note that this is the same as in Theorem 5.2.
Proposition 5.12**.**
Let with and let be a uniform random permutation in . With notation as above, we have
[TABLE]
Proof.
We denote by a uniform random -element subset of and by a uniform random -canonical tree with leaves. It holds that . As a consequence of Eqs. 7 and 3.11, we have
[TABLE]
After plugging in the estimate of Proposition 5.11, we get
[TABLE]
where is the default of binarity of the tree .
We claim that if is a substitution tree of , then with equality if and only if is an expanded tree. Indeed,
[TABLE]
Moreover from Lemma 3.6 any substitution tree can be obtained from an expanded tree of by merging some internal nodes along edges connecting them and such merges always increase (strictly) the considered sum, which proves the claim.
It follows that, in the sum of Eq. 34, only expanded trees appear asymptotically. Moreover, and the constant does not depend on the choice of an expanded tree of . As a result, we get where
[TABLE]
Differentiating Eq. 24 p.24 and using yield the identity . Moreover, since (see 4.3), this gives us
[TABLE]
Finally, after collecting everything together, we get
[TABLE]
where the last equality above has been obtained using that, for any expanded tree of , we have and . This allows us to simplify Eq. 35 and yields the desired value of . ∎
We can now conclude the proof of Theorem 5.2. Let be a uniform random permutation in . Our goal is to show that converges to the biased Brownian separable permuton of parameter . Let be any permutation of size . As a consequence of Theorem 2.5 (with 2.6) and Eq. 20, we just have to show that
[TABLE]
Assume first that is not separable. In this case, we have . It also holds that , and Proposition 5.12 implies that .
Assume on the contrary that is separable. In this case, , and . Therefore, from Proposition 5.12 we get that
[TABLE]
where we have used the identity coming from the duplication formula of the Gamma function. This concludes the proof.∎
5.6. Occurrences of nonseparable patterns
Since tends to [math] whenever is a nonseparable pattern and the random variable takes only nonnegative values, tends to [math] in probability.
Here, we discuss more precisely the asymptotic behavior of in this case. The first result gives the order of magnitude of its moments; we then present a consequence for the random variable itself.
Proposition 5.13**.**
For and , .
Remark 5.14*.*
This result does consider separable patterns , but in this case it is a direct consequence of our main theorem. Indeed, according to Theorem 2.5, Theorem 5.2 entails convergence in distribution of to , jointly for all , and hence of all moments and mixed moments (since those random variables are bounded by ). Namely, we have . This limiting value is positive if and only if is separable, and can be computed exactly by adapting the method exposed in [12, Section 9.1].
Proof.
By definition, , where we use for the size of the pattern , as usual. Consequently,
[TABLE]
We split the sum according to the different possible values of and . Denoting the set of possible ordered covers of by sets of size , this gives
[TABLE]
Let us now remark that the unique increasing bijection between and induces a bijection between and . Let denote the image of by this bijection. Clearly,
[TABLE]
The sum can now be decomposed according to the different values of yielding
[TABLE]
Since the summation index sets do not depend on , it is enough to consider each summand separately to get the asymptotics. From Proposition 5.12, the summand is of order .
Whenever Eq. 36 holds, is a pattern of . As a consequence, an expanded tree of must have a substitution tree of as an induced tree. Since the default of binarity may only decrease when taking induced trees, this implies that . Since additionally , we deduce that which gives .
To prove that , it is then enough to find one summand, which grows as for large . This is achieved considering the summand indexed by
[TABLE]
Indeed in this case, , so that , which concludes the proof of the proposition. ∎
Corollary 5.15**.**
For and small enough, , where the constant in the symbol depends on .
Proof.
The upper bound is an immediate consequence of Markov’s inequality. For the lower bound, let be a random variable in , we have
[TABLE]
The corollary follows by taking and small enough. ∎
6. Asymptotic analysis: The degenerate case
In this section, we are interested in the case where the generating function of simple permutations in satisfies the following condition.
Definition 6.1** (Hypothesis ).**
The generating function of a family of simple permutations is said to satisfy hypothesis if meets the following conditions at its radius of convergence :
- i)
* is convergent at and*
[TABLE] 2. ii)
* has a dominant singularity of exponent in .*
Item ii means that, around the singularity , one has
[TABLE]
for some analytic function and constant . We refer to Section A.4 for a precise definition. Clearly, under , it holds that . Note also that the assumption is redundant with the convergence of at .
6.1. Asymptotic behavior of the main series
As in Section 5.2, the first step is to derive the asymptotic behavior of all generating functions for marked trees around their common dominant singularity. In this section, we will not compute constants explicitly, but only focus on the singularity exponent. Indeed, keeping track only of singularity exponents is here sufficient to determine the limiting permuton.
The function is defined in (23) by:
[TABLE]
Lemma 6.2**.**
Assume that satisfies hypothesis . Then has a unique dominant singularity of exponent in . Moreover, is convergent at and .
Proof.
The first assertion follows from Lemma A.6 (Supercritical case), using also that . The convergence of at follows from that of at . Finally, the inequality is a straightforward computation from (37) (recall that is given in (24)). ∎
Recall that from Eq. 11 (p. 11) is implicitly defined by the equation
[TABLE]
As explained in Section 1.7, the condition implies that the singularity of is not a branch point, but is inferred from the singularity of .
Lemma 6.3**.**
Assume that satisfies hypothesis . Then there is a unique such that . Moreover, is the radius of convergence of and has a unique dominant singularity of exponent in .
The proof is rather technical and postponed to Section A.6.
Now since from Eq. 12: , at the singularity of , the denominator of is , hence the singularity of is inherited from the one of . More precisely, from Lemma A.6 (Subcritical case), we have the following corollary.
Corollary 6.4**.**
Assume that satisfies hypothesis . Then has a unique dominant singularity of exponent in , with .
We now turn to the behavior of generating function of trees with one marked leaf.
Lemma 6.5**.**
Assume that satisfies hypothesis . Then each of the nine generating functions , , …, has a unique dominant singularity of exponent in .
Proof.
For (resp. ) this follows immediately from Corollary 6.4 (resp. Lemma 6.3) by singular differentiation (Lemma A.7).
Recall that is explicitly given in Proposition 4.2 as
[TABLE]
where . We examine and to determine the exponent of the singularity of at its radius of convergence (which we will prove to be ).
Since , again by subcritical composition, Lemma 6.3 gives that has a unique dominant singularity of exponent in . Moreover, .
As for we need to analyse and and determine whether the composition is critical, supercritical or subcritical (see Lemma A.6).
- •
By singular differentiation (Lemma A.7), it follows from that has a dominant singularity of exponent in . In addition, by .
- •
Recall that from Corollary 6.4 has a unique dominant singularity of exponent in , with . The composition is therefore critical.
- •
Moreover, is aperiodic since it counts a superset of the separable permutations.
From Lemma A.6 (Critical case-A), we obtain that has a unique dominant singularity of exponent in , and therefore so does (using Lemma A.5).
In , the value of the series is less than . Therefore, by subcritical composition, has a unique dominant singularity of exponent in . Since and have unique dominant singularities in of respective exponents and , the other cases follow by Lemma A.5, using the formulas given in Proposition 4.2. ∎
6.2. Probability of given patterns
Recall that the function was defined in Eq. 17 by .
Unlike in the previous section, the functions will appear in the asymptotic behaviors, thus we need some additional assumptions on them. First, as noticed in 4.3, we have
[TABLE]
which, under , has a dominant singularity of exponent in (see Lemma A.7, about singular differentiation). The following hypothesis is thus reasonable.
Definition 6.6** (Hypothesis ).**
Let have a dominant singularity of exponent in . The family of simple permutations satisfies the hypothesis if, for each pattern of size , the corresponding series has a dominant singularity of exponent at least in .
Let be a substitution tree with leaves. Recall that is the set of internal nodes of , and that for any node , denotes its number of children. Recall also from Section 4.2 (p. 4.2) that is the family of canonical trees with marked leaves which induce a tree equal to , and that is its generating function.
Combining Proposition 4.5 and the above results, we obtain the following.
Proposition 6.7**.**
For any substitution tree of size , assuming and , the series has a unique dominant singularity of exponent at least in , where
- •
, if there is at least one node such that ;
- •
* otherwise888Note for future reference that the two expressions for are equal when there is a unique such that ..*
Proof.
First recall that can be decomposed as a union , where are subsets of which contain all nonlinear nodes (see Section 4.1). It is therefore enough to prove that each series has a unique dominant singularity in with at least the desired exponent.
Recall from Proposition 4.5 the following formula for :
[TABLE]
where each is given by (19) and depends on the type of . Using hypothesis , Corollary 6.4 and Lemma A.6 (Critical case-A), we know that has a dominant singularity of exponent at least in , and, from Lemma 6.5, that it is the term with the lowest exponent and the only possibly divergent term arising in . It then follows from Lemma A.5 that has a dominant singularity of exponent at least in .
The series is then the product of the ’s and of some convergent series. The result of the proposition is obtained from Lemma A.5: has a dominant singularity of exponent at least in , if this sum is nonempty (i.e., when one of the series is divergent) and of the smallest singularity exponent among them, that is otherwise, i.e., if all factors are convergent. ∎
As in Section 5.4 (p. 5.4), we take a uniform random tree with leaves in with marked leaves (chosen also uniformly at random). Denote as before by the tree induced by the marked leaves.
Corollary 6.8**.**
Assume and . For any substitution tree with leaves, the probability tends to [math], unless has only one internal node.
Proof.
As in Section 5.4, we use the formula
[TABLE]
From Theorem A.3 and Proposition 6.7 (and using the notation herein defined), we get that
[TABLE]
for some constant , possibly equal to [math]. On the other hand, Theorem A.3 and Corollary 6.4 imply that
[TABLE]
for some constant . Putting everything together, we obtain
[TABLE]
where .
For any subset of the internal nodes of a tree with leaves, we claim that the following inequality holds: . It is clear when . For , we decompose as a root with subtrees . The chosen set of nodes of determines a set of nodes in each tree that has leaves. Assume that for all . Then, we have
[TABLE]
and with the observation that , this proves our claim. Set ; if , one has
[TABLE]
which is negative for (indeed, by ). When or , we have (see also Footnote 8) and, therefore, is negative unless has exactly one internal node (which is then of degree ). ∎
It is now straightforward to translate this result in terms of the probability to find a given pattern in a random permutation in the set . As recalled in Section A.4, the hypothesis is equivalent to the following: for every and every permutation of size , there exists an analytic function and a constant (possibly equal to [math]) such that, on an -neighborhood of , it holds that
[TABLE]
The quantities are involved in the statement of the following theorem.
Theorem 6.9**.**
Let be a uniform random permutation in . We assume hypotheses and . Then, for any and for any ,
[TABLE]
Consequently, there exists a random permuton with
[TABLE]
such that tends to in distribution.
Proof.
The starting point is the same as in the standard case (Proposition 5.12). As before we denote by a random uniform -element subset of , and is the tree of size induced by uniform leaves in a uniform canonical tree of size in . From Lemma 3.11, we have
[TABLE]
where the sum runs over all substitution trees encoding . Denote by the substitution tree with only one internal node labeled by . When tends to infinity, using Corollary 6.8, we know that every term in the above sum vanishes, but the term corresponding to :
[TABLE]
Now we can compute directly from Proposition 4.5 that
[TABLE]
The first term has a dominant singularity of exponent at least , while the dominant singularity exponent of the second term is at least . We therefore focus on the second term and get by an easy computation that, around , we have
[TABLE]
and thus
[TABLE]
where and are analytic functions. Then we can apply the Transfer Theorem (Theorem A.3) to and (as in the proof of Corollary 6.8) and obtain that is proportional to for . Since the left-hand side of Eq. 43 sums to one (when summed over , for a fixed ), this proves Eq. 41.
The rest of the statement follows immediately, using Theorem 2.5 (with 2.6). ∎
6.3. Hypothesis and convergence of uniform random simple permutations
We may wish to replace the hypothesis with a less technical hypothesis, such as the convergence of a random simple permutation in our set to the random permuton uniquely determined by Eq. 42. We show here that, though not equivalent, these hypotheses are strongly related. Remark that we do not assume here, so that the existence of cannot be inferred from Theorem 6.9 and needs a separate proof.
Proposition 6.10**.**
Suppose that has a dominant singularity of exponent and assume condition . Then there exists a permuton such that
[TABLE]
where the are given by Eq. 40 (which holds under hypothesis ).
Let be a uniform random permutation of size in . If converges in distribution, then its limit is . Conversely, if we assume that and all series have a unique dominant singularity, then converges in distribution (and the limit must be , using the first part of the proposition).
Before giving the proof, let us do the following observation. If both and are satisfied, then we can apply both Theorems 6.9 and 6.10. By comparing Eqs. 42 and 44, we have in distribution (recall that the distribution of a random permuton is determined by its expected pattern densities, see Proposition 2.4). In particular, assuming that a uniform random simple permutation in the class converges in distribution to some random permuton, then this random permuton is , that is the limit of a uniform random permutation in the class. This justifies a claim in the introduction.
Proof.
We start with the existence of . For every , let be a random permutation in such that . By Proposition 2.9, we only need to show that forms a consistent family. Let , then for ,
[TABLE]
On the other hand, the following combinatorial identity can be derived from the definition of the :
[TABLE]
Indeed, the left-hand side is the series of simple permutations in whose entries are partitoned into a set of marked entries forming a pattern and a set of marked entries. The right-hand side counts the same object, according to the pattern formed by all the marked entries. To distinguish the marked entries of the first set from the ones of the second set, we need to specify a subpattern inside the pattern , which explains the factor .
We now differentiate both sides of Eq. 46 times so that , and replace all series with their asymptotic estimates obtained thanks to hypothesis , Eq. 40 and singular differentiation (Theorem A.4) 999For and , we denote by the falling factorial .
[TABLE]
As only the singular parts diverge, taking the limit in allows to identify the constants, yielding . Plugging this back in Eq. 45 yields
[TABLE]
As probabilities sum to , we get , proving the consistency of .
As noticed in the proof of Theorem 6.9, the convergence of to is equivalent to the following: for any fixed and any , the limit \lim_{n\to\infty}\mathbb{E}\big{[}\operatorname{\widetilde{occ}}(\pi,\bm{\alpha}_{n})\big{]} exists and is proportional to for . Furthermore, by consistency, we only need to show it for large . Directly from the definitions, we have
[TABLE]
For the proof of the direct implication, we assume that converges in distribution. From Theorem 2.5, this means that \mathbb{E}\big{[}\operatorname{\widetilde{occ}}(\pi,\bm{\alpha}_{n})\big{]} has a limit for every of size . Then when goes to infinity,
[TABLE]
As a consequence, for any fixed and , there exists polynomials , such that for any real in ,
[TABLE]
Hypothesis implies that in we have for some analytic function . Also has a dominant singularity of exponent in so for some analytic function and constant . Plugging these asymptotic estimates into (48) yields
[TABLE]
Let be such that , so that the singular parts are the only diverging quantities when . After taking the limit we get for every and hence equality. We have proven that is proportional to for large , as desired.
For the converse, we start from Eq. 47. Theorem A.3 (which we can apply because of the hypotheses on and ) gives the following asymptotic behavior when :
[TABLE]
For fixed , the limit of the right-hand side is proportional to , which concludes the proof of the proposition. ∎
7. Asymptotic analysis: The critical case
The goal of this section is to describe the limiting permuton of a uniform permutation in a substitution-closed class , whose set of simple permutations satisfies the following hypothesis.
Definition 7.1** (Hypothesis (H3)).**
A family of simple permutations is said to satisfy hypothesis if the generating function meets the following conditions at its radius of convergence :
- •
* has a dominant singularity of exponent in ;*
- •
* is convergent at and*
[TABLE]
This hypothesis implies the following behavior of near its singularity.
Lemma 7.2**.**
Assume that satisfies hypothesis and set, as before,
[TABLE]
Then has a unique dominant singularity of exponent in . Moreover, is convergent at and .
Proof.
The statement on the singularity exponent follows from Lemma A.6 (Supercritical case). The statement on is a simple computation from (H3) and Eq. 24. ∎
In the following, we denote . The behavior of is given in the following lemma, whose proof is postponed to Section A.7. Note that the exponent is different from the one observed in Section 6 (the singularity comes here from a mixture of a branch point and of the singularity of , as explained in Section 1.7).
Lemma 7.3**.**
Assume that satisfies hypothesis . Then there is a unique such that . Moreover, is the radius of convergence of and has a unique dominant singularity of exponent in .
From here, the strategy is similar as the previous sections, we therefore skip unnecessary details. As in the previous section, we deduce immediately the asymptotic behavior of all generating functions of marked trees:
Corollary 7.4**.**
Assume that satisfies hypothesis and define as above. Then has a unique dominant singularity of exponent in , with . Moreover each of the generating functions , , …, has a unique dominant singularity of exponent in .
To go further, we have to assume hypothesis , i.e. that, for any of size , the series has a unique dominant singularity of exponent at least in . The composition is then critical, and from Lemma A.6 (Critical case-B), has a unique singularity of exponent at least in .
By a straightforward computation from Proposition 4.5, we get the following singularity for the series .
Lemma 7.5**.**
Fix a tree and a subset of its internal nodes. Then the series has a singularity of exponent at least
[TABLE]
The behavior of the uniform permutation in now depends on whether is smaller or larger than .
7.1. The case .
Theorem 7.6**.**
Let be a family of simple permutations verifying hypothesis and , with . We consider the permuton as in Proposition 6.10 and denote for
[TABLE]
Let also
[TABLE]
be the probability distribution of the induced subtree with leaves in the -stable tree (see [28, Thm 3.3.3]).
If is a uniform permutation in , then
[TABLE]
As a consequence, converges in distribution to a random permuton, whose average pattern densities are determined by Eq. 49 and depend only on and . We call it the -stable permuton driven by .
A construction of the -stable permuton driven by for every and random permuton is given in Lemma B.2.
Remark 7.7*.*
In this case, all possible patterns, in particular nonseparable ones, appear with positive probability in the limit (as long as they appear with positive probability in a uniform simple permutation in the class). More precisely, the proof will show the following: random leaves in a uniform canonical tree induce substitution trees with arbitrary large node degrees, and the first common ancestors of those leaves are all simple permutations with probability tending to 1.
Proof.
We start from the estimate of Lemma 7.5. In the present case and is always negative. If we fix and apply Theorem A.3 to and , we get
[TABLE]
with
[TABLE]
where the last identity uses . Therefore unless , i.e. all internal nodes of are in . As a consequence, if is a uniform canonical tree of size and is a uniform subset of the leaves of length ,
[TABLE]
Now we focus on the case . The series has a dominant singularity of exponent (see Eq. 50). We are left with identifying the constant in its singular expansion. In what follows, we will always denote by the constant in front of the singular part of the expansion of the analytic function , i.e. , with the radius of convergence of and analytic at . In particular, given that and , we have the following expansions (recall ):
[TABLE]
Using Proposition 4.5, we can find the singular expansion of :
[TABLE]
From the Transfer Theorem (Theorem A.3) applied to and we deduce
[TABLE]
The recursive property of the Gamma function gives . Furthermore, by definition of , relation (39) and singular differentiation of , we get
[TABLE]
This allows us to rewrite (51) as
[TABLE]
Now the proof of Lemma 7.3 (see Eq. 74) yields . We have also the following relations between the various constants (see Eqs. 65 and 64 in the appendix):
[TABLE]
From there we deduce . Therefore Eq. 52 rewrites
[TABLE]
Summing over trees with gives the theorem. ∎
7.2. The case .
Theorem 7.8**.**
Let be a family of simple permutations verifying hypotheses and , with . If is a uniform permutation in , then converges in distribution to the biased Brownian separable permuton of parameter , where
[TABLE]
Remark 7.9*.*
While the limiting permuton in this case is independent of and is the same as in the standard case, the fine details of this convergence might be different. In particular, if is a nonseparable pattern, the order of magnitude of depends on and is in general bigger than in the standard case – compare Eq. 55 to Proposition 5.13.
Proof.
Let be a decorated tree with leaves. Once again, applying Theorem A.3 to and leads to
[TABLE]
But in this case,
[TABLE]
The above inequality is justified as follows:
[TABLE]
The first inequality is an equality if and only if for all (recall that ). In the second part, the equality case occurs when for all in , either or . This implies that if is not binary, Eq. 55 is a strict inequality (regardless of ) and
[TABLE]
We can show that, up to replacing with and with , the estimates of the singular parts of ,,, , …, in Propositions 5.8 and 5.9 still hold. Indeed, the proofs can be transposed verbatim up to replacing some calls to the standard case of Lemma A.6 with the critical case.
Proposition 5.10 does not however hold in its generality anymore, because is not necessarily convergent at for every . It is nonetheless convergent when , since the singularity of is in . This is enough to show that Proposition 5.10 still holds for binary trees. Moreover nonbinary trees still disappear in the limit according to Eq. 56. This allows us to conclude as in Section 5. ∎
Appendix A Complex analysis toolbox
A.1. Aperiodicity and Daffodil Lemma
To study the asymptotic behavior of combinatorial generating functions, it is important to locate dominant singularities. The following lemma is useful to this purpose.
Recall that a function analytic at [math] is aperiodic if there do not exist two integers and and a function analytic at [math] such that .
Lemma A.1** (Daffodil Lemma).**
Let be a generating function (with nonnegative coefficients) analytic in . If is aperiodic, then for and . (The case can only be considered if .)
This lemma can be found in [30, Lemma IV.1, p. 266]. Note that this reference does not consider the case of on the circle of convergence, i.e. (although this case is used later in the book, e.g. in the proof of Theorem VI.6, p. 405); the proof of the lemma in this case is similar to . The complete statement of Daffodil Lemma in [30] also deals with cases where the function is periodic, but we do not need these cases in our work.
A.2. Transfer theorem
We start by defining the notion of -domain. We use for the principal determination of the argument of in taking its values in .
Definition A.2** (-domain and -neighborhood).**
A domain is a -domain at if there exist two real numbers and such that
[TABLE]
By extension, for a complex number , a domain is a -domain at if it the image by the mapping of a -domain at . A -neighborhood of is the intersection of a neighborhood of and a -domain at .
We will make use of the following family of -neighborhoods: for , , , set .
When a function is analytic on a -domain at some , the asymptotic behavior of its coefficients is closely related to the behavior of the function near the singularity . The following theorem is a corollary of [30, Theorem VI.3 p. 390].
Theorem A.3** (Transfer Theorem).**
Let be a function analytic on a -domain at , be an arbitrary real number in and a constant possibly equal to [math].
Suppose when tends to in . Then the coefficient of in satisfies
[TABLE]
A.3. Singular differentiation
The next result is also useful to us.
Theorem A.4** (Singular differentiation).**
Let be an analytic function in a -neighborhood of with the following singular expansion near its singularity
[TABLE]
where .
Then, for each , the -th derivative is analytic in some -domain at and
[TABLE]
We refer the reader to [30, Theorem VI.8 p. 419] for a proof of this theorem (this reference considers functions defined on a -domain, but the proof still works with functions defined on a -neighborhood).
A.4. Exponents of dominant singularity
In this section, we introduce some compact terminology and easy lemmas to keep track of the exponent of the singularities and of the shape of the domain of analycity without computing the functions explicitly.
Recall that the radius of convergence of an analytic function is the modulus of the singularities closest to the origin, called dominant singularities. Recall also that for series with positive real coefficients, by Pringsheim’s theorem [30, Th. IV.6 p. 240], is necessarily a dominant singularity. This justifies the following definition:
Let be a real, which is not an integer. We say that a series with radius of convergence has a dominant singularity of exponent in (resp. of exponent at least ) if has an analytic continuation on a -neighborhood of and, on , we have
[TABLE]
where is an analytic function on a neighbourhood of (called the analytic part), and a nonzero constant (resp. any constant); is sometimes referred to as the singular part.
If furthermore, has no other singularity on the disk of convergence, we say that it has a unique dominant singularity of exponent (resp. at least ) in . Since we assumed that has an analytic continuation on a -neighborhood of , by a standard compactness argument, this is equivalent to say that can be extended to a -domain in .
We make the following observation. According to the value of , we may move (part of) in the error term and write Eq. 57 in a simpler form, still on a -neighborhood of .
- •
For , so .
- •
For , considering the constant term is the Taylor series expansion of we find that .
- •
Similarly, for , we obtain
[TABLE]
in which the third dominant term (after the constant and the linear term) depends on how compares with . But in each case, we have
[TABLE]
where .
We now record a few easy lemmas to manipulate these notions. First consider the stability by product.
Lemma A.5**.**
Let and be series with nonnegative coefficients and the same radius of convergence . Assume they have each a dominant singularity of exponent and respectively in . Then has a dominant singularity in of exponent defined by
- •
* if both and are negative;*
- •
* otherwise.*
Moreover, if both and have unique dominant singularities, so has .
Proof.
The proof is easy. The analytic function can be extended to the intersection of the domain of and . The exponent of the singular expansion around is obtained by multiplying singular expansion of and : note that, if is negative, the series is divergent and the singular part is the dominant part around . On the opposite, when is positive, the dominant part of the expansion is the value of the analytic part at point , which is always positive, since the series has nonnegative coefficients. The same holds of course for , which explains the case distinction in the lemma. ∎
We now consider the composition . We should differentiate cases where , or (called sometimes supercritical, subcritical and critical cases [30, Sec.VI.9]).
Lemma A.6** (Dominant singularity of ).**
*Let and be series with nonnegative coefficients with radii of convergence in .
***Supercritical case: *** Assume that ( may be finite or infinite).
Call the unique positive number with .*
We assume that has a dominant singularity of exponent in . Then:
- i)
* has also a dominant singularity of exponent in .* 2. ii)
Moreover, if is aperiodic, then the dominant singularity of is unique.
*Subcritical case: *** Assume that .
We assume that has a dominant singularity of exponent in . Then:
- i)
* has also a dominant singularity of exponent in .* 2. ii)
Moreover, if the dominant singularity of is unique, then the dominant singularity of is unique.
*Critical case-A: *** Assume that .
We assume that and both have a dominant singularity of respective exponents and . Suppose furthermore . Then:
- i)
* has also a dominant singularity of exponent in .* 2. ii)
Moreover, if is aperiodic, then the dominant singularity of is unique.
Critical case-B:* Assume again that . We assume that and both have a dominant singularity of respective exponents and . Suppose furthermore . Then:*
- i)
* has a dominant singularity of exponent in .* 2. ii)
Moreover, if is aperiodic, then the singularity is unique.
Proof.
**Supercritical case: ** It is clear that is analytic around any and has nonnegative coefficients, hence it has radius of convergence at least .
To show that is defined in a -neighborhood of , we show that is included in . This follows easily from the fact that is analytic in and has a derivative which is a positive real number.
When is close to , plugging in the expansion (57) of we obtain
[TABLE]
The first term is analytic at . Since and is differentiable at we have
[TABLE]
Combining these two expansions yields
[TABLE]
which proves i).
Item ii) is also easy. In the case where we assume aperiodic, we need Lemma A.1, which ensures that for , .
Subcritical case. Most arguments are similar to the ones of the supercritical case. Therefore we only explain the differences in the singular expansion of . Using the singular expansion (57) of , we have
[TABLE]
Since , the exponent is positive and the term tends to [math] at . Both and tend to as , so that, by standard calculus arguments, we have
[TABLE]
Since and are analytic at and respectively, this expansion is of the desired form.
Critical case-A. As above, we focus on the expansion of . Since , is differentiable at and Eq. 61 still holds. The difference is that is not analytic anymore. Namely, when is close to ,
[TABLE]
Then
[TABLE]
Since is analytic at , the exponent of the dominant singularity of is . Note that the singular terms cannot cancel each other since when the constants have the same sign.
Critical case-B. Again, we focus on the singular expansion of . Now, since , is not differentiable at . Instead of (60) we have
[TABLE]
Eq. (61) becomes
[TABLE]
(In this case, must be negative, as can be observed by writing the transfer theorem for the coefficients of which are non-negative by assumption.) As for , (63) still holds. We obtain
[TABLE]
We conclude that the exponent of the dominant singularity is . ∎
We note that the above proof also yields the constant appearing in the singular expansion of . The two following particular cases were used in Section 7, p. 53 (in particular, we assume in the discussion below that hypothesis holds).
- •
We take and . The composition is , and is analytic at . Since diverges at its singularity and has a finite radius of convergence, the composition is supercritical. Extracting the constant from (61), we get
[TABLE]
since in this case , and .
- •
We take and . The composition is (Eq. 12). Since , (here we use Hypothesis (H3) and is defined in Lemma 7.3) the composition is subcritical. Extracting the constant from (62), we get
[TABLE]
Finally, we state the following result, which follows from Theorem A.4.
Lemma A.7** (Singular differentiation).**
If has a (unique) dominant singularity of exponent (at least) in , then its -th derivative has a (unique) dominant singularity of exponent (at least) in .
A.5. An analytic implicit function theorem
The following theorem allows to locate the dominant singularity of series defined by an implicit equation.
Lemma A.8** (Analytic Implicit Functions).**
Let be a bivariate function analytic at , we denote . If and , then there exists a unique function analytic in a neighbourhood of such that and .
We refer the reader to [30, Lemma VII.2, p. 469] for a proof of this result.
A.6. Proof of Lemma 6.3
Let be the radius of convergence of . If , then by intermediate value theorem, we know that there exists as in the lemma.
We will prove this by contradiction. Assume . We apply Lemma A.8. The bivariate function we consider is . It vanishes at and the derivative with respect to at that point is nonzero since
[TABLE]
(Recall that this last inequality is equivalent to (37) in Hypothesis (H2).)
Therefore, has an analytic continuation on a neighborhood of . Since it has positive coefficients, by Pringsheim’s theorem [30, Th. IV.6 p. 240], this is in contradiction with the fact that is the radius of convergence of .
We have therefore proved that there exists such that . Note that it implies the relation .
We now consider around . Equation (38) defining can be rewritten as , where
[TABLE]
Since has a dominant singularity of exponent in , Equation (58), together with elementary computations, yield the following: for in a -neighborhood of ,
[TABLE]
We now use Picard’s method of successive approximants to show the existence and analycity of on a -neighborhood of . We refer to [30, Appendix B.5 p. 753] for a synthetic description of the method in the case where is analytic in ; we have to adapt it carefully to our setting.
Define and whenever is in . We have . Also, Theorem A.4 of singular differentiation applied to Eq. 66 implies that
[TABLE]
Therefore101010There is a slight subtlety here: we would like to apply the classical inequality , but this is not possible since the domain is not convex. Note however that a -neighborhood is always a quasi-convex set, in the sense that we can always find a path between and whose length is smaller than , where depends on the angle defining but not on and . Therefore the following weaker inequality holds: , which is good enough for our purpose (the constant disappears in the symbol). for , if and are defined and lie in , we have
[TABLE]
where . Fix . Up to reducing the radius of , we can therefore assume that
[TABLE]
Thus, if is in for every , then is defined and we have
[TABLE]
If we take small enough, the argument of is close to the one of . Furthermore if the modulus of is small so is the one of . This ensures the existence of a -neighborhood of (not depending on and ), such that for and , is in as long as it is defined. In particular, is also defined and by immediate induction, all are defined and analytic on .
Eq. 67 also implies that converges locally uniformly on . The limit is the unique solution in of the fixed point equation (the uniqueness of the solution comes from the fact that for every , is a contraction for in ). This limit is therefore an analytic continuation of to . Note also that from Eq. 68, the following estimate holds on :
[TABLE]
Using the expansion given in Eq. 57 of around , we have for ,
[TABLE]
As is analytic at with a nonzero derivative , it can be inverted analytically around by an analytic function and hence
[TABLE]
As , it follows from the Taylor expansion of up to exponent that has a singularity of exponent exactly in . In particular has a singularity of exponent in and hence .
We now prove that has no singularity with , except . By a classical compactness argument (see e.g. [26, end of proof of Theorem 2.19]), this implies that is analytic on a -domain at .
Take such a singularity. Since has nonnegative coefficients, the triangular inequality gives and since is aperiodic, from Lemma A.1 we have a strict inequality unless . Therefore, if and , we have and we can apply Lemma A.8 as above as in the second paragraph of this proof to argue that cannot be a singularity. ∎
A.7. Proof of Lemma 7.3
As in the proof of Lemma 6.3, the existence of and the fact that the convergence radius of is at least is straightforward. The key point is to prove that has an analytic continuation to a -neighborhood of .
By assumption, is analytic on a -neighborhood of , and the following approximation holds:
[TABLE]
where as before, ; is or depending on whether is smaller or bigger than ; and is an analytic function on tending to [math] in .
Fix in a neighborhood of , whose parameters and will be made precise later. The equation then rewrites as
[TABLE]
or, as a fixed point equation for
[TABLE]
We again use Picard’s method of successive approximants to find an analytic solution for (69), which will be the analytic continuation of that we are looking for. For , set and, whenever lies in , set . In particular,
[TABLE]
Since , we have . We choose the parameters defining the -neighborhood of to be and . In this way, if is in , then then lives in , for some .
We define an intermediate -neighborhood . This ensures that we have a constant , depending only on and , such that the circle of center and radius is contained in for every and in for every (cf. Fig. 16).
Consider the partial derivative
[TABLE]
We take in the domain . The quantity can now be evaluated through a contour integral on :
[TABLE]
This yields the inequality
[TABLE]
Plugging this back in Eq. 70, we get, for in
[TABLE]
Now we shall find a domain where we have enough control on as to guarantee the stability of the iterates. A subtlety here is that this control is impossible near . So we need to consider a domain around , hence that depends on . For every , we have , so the disk
[TABLE]
is included in . For in , we have
[TABLE]
which implies after plugging back into Eq. 71
[TABLE]
By possibly reducing the radius of , we can make as small as wanted: for any in ,
[TABLE]
Similarly,
[TABLE]
can be made smaller than by reducing . In particular, is in .
For , assume that lie in . Then for each , using (72),
[TABLE]
Since lies in , the next term is defined and
[TABLE]
In particular, also lies in and an immediate induction shows that this is indeed the case for all .
By (73), the series is uniformly bounded by a geometric series and converges towards an analytic function on . The limit is a solution of and is the analytic continuation of that we were looking for.
A small modification of the above argument shows that, when , or equivalently , tends to [math], the quotient
[TABLE]
also tends to [math]. This proves that
[TABLE]
The proof that has no other singularities than on the circle of convergence is similar to that of Lemma 6.3. ∎
Appendix B On the simulations given in the introduction
In this appendix, we explain how the simulations in Figs. 1 and 4 have been obtained.
B.1. Biased Brownian permuton
Fix in and consider a uniform binary planar tree with leaves, where each internal node is labeled (resp. ) with probability (resp. ), independently from each other. As mentioned in Section 5.1, forms a consistent family of random permutations. Therefore, from Proposition 2.9, we have the following lemma (using the notation defined in Section 2.1)
Lemma B.1**.**
There exists a random permuton , whose induced subpermutation are the ( i.e. for all , ) and we have the convergence in distribution
[TABLE]
By definition, is the biased Brownian separable permuton with parameter (indeed implies Eq. 20 of Eq. 20). This lemma is not needed to prove the results of this paper, but was used in the simulations. The three pictures in Fig. 1 p. 1 are obtained by drawing the diagram of a random permutation distributed as , for and , and , and .
B.2. Stable permutons
Fix . For every the following probability distribution on unlabeled plane trees with leaves was introduced in [28, Thm 3.3.3] and is the distribution of the induced subtree with leaves in the -stable tree:
[TABLE]
Now if we fix the distribution of a random permuton , for every , we build a random substitution tree as follows: the tree is chosen according to , and conditional on that choice, all internal nodes are independently labeled by a permutation distributed like (the notation is defined in Section 2.1).
Now we can define the permutations . This family of permutations is consistent: we omit the proof of this fact, which follows from the consistency of the family (Proposition 2.9) and from Marchal’s algorithm [47] to generate trees of distribution . We deduce the following lemma.
Lemma B.2**.**
For every and random permuton , there exists a random permuton , whose induced subpermutations are the ( i.e. for all , ) and we have the convergence in distribution
[TABLE]
We call the -stable permuton driven by .
Once again, this lemma is used in our simulations. The pictures in Fig. 4 are the rescaled diagrams of realizations of for and , where we have taken to be the (nonrandom) uniform measure on .
B.3. Simulations of permutations in classes
The uniform random permutations in substitution-closed classes shown on Fig. 3 have been obtained through a Boltzmann sampler. Obtaining such a sampler is routine from the equations on the generating series given in Eq. 12 [27]. The only input that we need is a Boltzmann sampler for the set of simple permutations in the class. In the case of a finite set , this is trivial. For the set of simple permutations in , we built a Boltzmann sampler for the whole class , which has an easy recursive structure, and we run it until the output is simple (this happens with probability bigger than when the parameter of the Boltzmann sampler is close to the radius of convergence of ). Code is available on request.
Acknowledgements
The authors are grateful to the referees for suggestions which improved the presentation of the paper.
MB and VF are partially supported by the Swiss National Science Foundation, under grants number 200021-172536 and 200020-172515. LG’s research is supported by ANR grants GRAAL (ANR-14-CE25-0014) and PPPP (ANR-16-CE40-0016).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. H. Albert, M. D. Atkinson. Simple permutations and pattern restricted permutations. Discrete Mathematics , 300(1):1–15, 2005.
- 2[2] M. H. Albert, M. D. Atkinson, R. Brignall. The enumeration of three pattern classes using monotone grid classes. Electronic Journal of Combinatorics , 19(3) #P 20, 2012.
- 3[3] M. H. Albert, M. D. Atkinson, R. Brignall. The enumeration of permutations avoiding 2143 2143 2143 and 4231 4231 4231 . Pure Mathematics and Applications , 22(2):87–98, 2011.
- 4[4] M.H.Albert, M.D. Atkinson, M. Klazar. The enumeration of simple permutations. Journal of Integer Sequences vol.6 (2003), article 03.4.4.
- 5[5] M. H. Albert, M. D. Atkinson, V. Vatter. Inflations of geometric grid classes: three case studies. Australasian Journal of Combinatorics , vol. 58 (2014), p. 27-47.
- 6[6] M. H. Albert, M. D. Atkinson, V. Vatter. Counting 1324 , 4231 1324 4231 1324,4231 -Avoiding Permutations. Electronic Journal of Combinatorics , 16(1) #R 136, 2009.
- 7[7] M. H. Albert, R. Brignall. Enumerating indices of Schubert varieties defined by inclusions. Journal of Combinatorial Theory, Series A , 123(1): 154–168, 2014.
- 8[8] M. H. Albert, V. Vatter. Generating and Enumerating 321 321 321 -Avoiding and Skew-Merged Simple Permutations. Electronic Journal of Combinatorics , 20(2) #P 44, 2013.
