The asymptotics of the partition of the cube into Weyl simplices, and an encoding of a Bernoulli scheme
A.Vershik

TL;DR
This paper introduces a combinatorial encoding method for continuous symbolic dynamical systems, demonstrating that partitioning the infinite-dimensional cube into Weyl simplices effectively distinguishes almost all points, advancing understanding of symbolic dynamics.
Contribution
It presents a novel combinatorial encoding approach that transforms the infinite-dimensional cube into a path space, showing that the partition into Weyl simplices distinguishes almost all points.
Findings
Partition into Weyl simplices is almost surely distinguishable.
The encoding maps the shift to a transfer transformation.
The method applies to Bernoulli schemes and graded graphs.
A. M. Vershik, St. Petersburg Department of Steklov Institute of Mathematics, St. Petersburg State University, and Institute for Information Transmission Problems.
Partially supported by the RFBR grant 17-01-00433.
(02.02.2019)
Abstract
We suggest a combinatorial method of encoding continuous symbolic dynamical systems. A continuous phase space, the infinite-dimensional cube, turns into the path space of a tree, and the shift is mapped to a transformation which was called a “transfer.” The central problem is that of distinguishability: does the encoding separate almost all points of the space? The main result says that the partition of the cube into Weyl simplices satisfies this property.
Keywords: combinatorial encoding, transfer, Bernoulli scheme, graded graph.
Contents
3 The main example: encoding a Bernoulli scheme by Weyl simplices and the triangular compactum
3.2 The triangular compactum of paths in the tree of i-permutations
3.4 Transfer for the tree of i-permutations and the triangular compactum
4.7 An analog of the partition into Weyl simplices for an arbitrary Bernoulli scheme
1 Introduction
In the theory of dynamical systems and, more generally, measure theory, the following question often proves fruitful: find a cover of a given measure space by a space endowed with a product measure (i.e., a product of independent variables) for which a given transformation (or a group of transformations) is a homomorphic image of the shift (respectively, are homomorphic images of the shifts) in the covering space. If such a cover (homomorphism) exists, and even is an isomorphism, then we obtain important information on the original object.
On the other hand, the inverse problem is no less important: how can one economically encode a sequence of independent identically distributed continuous random variables (i.e., a Bernoulli scheme) using finite codes? That is, can one replace a continuous scheme by a locally finite one, and how can this be done?
In both cases, it is important to find a method of economical encoding of a continuous scheme and Bernoulli shift, or even a more general system.
We suggest a general method of combinatorial encoding of a Bernoulli scheme, and consider the simplest nontrivial example of such an encoding, using the partition of the cube into Weyl simplices. This example is related to the simplest tree (that of permutations) and produces an isomorphism between the classical Bernoulli scheme with a continuous set of states and a new type of measure-preserving transformation called a transfer. This is a nonstationary Markov shift acting in the Cantor-like space of paths in a graded graph (in the simplest case, in the path space of a tree).
In this context, objects of ergodic theory become related to the combinatorial theory of graded graphs and, consequently, to combinatorics and representation theory.
The main problem arising here is the distinguishability problem: is the encoding faithful, i.e., does it separate almost all points of the Bernoulli scheme? In other words, is the encoded system isomorphic to the original one or is it only a homomorphic image of this system? In the example with Weyl simplices, this problem can be stated very simply: can one recover a realization of a Bernoulli scheme with state space $[0,1]$ from all pairwise inequalities between its coordinates? Quite paradoxically, the answer (see Section 3) is positive: this can be done with probability $1$. Actually, this question is related to the theory of equidistributed sequences, but the extensive literature on the subject seems to contain no mention of this fact.
A much more complicated example is related to the Young graph and the RSK correspondence. It was initiated by the old paper [3] and was resolved, also positively, in the recent papers [5, 6]. In this case, the encoding uses $Q$-tableaux of the RSK correspondence. The proofs in [5, 6] are based on a thorough analysis of the theorem on the limit shape of Young diagrams and the study of the Schützenberger transformation.
We will return to this example in another paper, where we will apply a general method of resolving the distinguishability problem, which consists in proving certain laws of large numbers “along” realizations. These laws can be quite complicated. For example, our proof is based on a new theorem on the limit shape for $P$-tableaux; however, the approach itself, which is mentioned in the present paper too, is universal.
It is worth noting that our notion of transfer (see Sections 4 and 5) is a far-reaching generalization of the Schützenberger transformation (jeu de taquin) and seems to be important for the general theory of graphs and transformations with an invariant measure. Describing all invariant measures for a transfer and studying its properties form a new and interesting area of the theory of dynamical systems and the combinatorics of graphs.
The main technique here is a combinatorial method of studying increasing invariant sequences of finite measurable partitions of Lebesgue spaces, which are, according to V. A. Rokhlin, separable complete measure spaces, or, in other words, spaces isomorphic mod 0 to an interval with the Lebesgue measure (if the original measure is continuous). In the author’s opinion, the most important problems in measure theory and its applications to different fields of mathematics are related to the geometry and combinatorics of $\sigma$-algebras (= measurable partitions) and their sequences. Here we are interested in infinite increasing sequences; the combinatorics underlying the theory of such sequences is the study of properties of infinite trees and graded graphs. Another class of sequences of $\sigma$-algebras is that of decreasing sequences, or filtrations (see [9]); the corresponding theories are closely related, but strongly different.
In Section 2, we define a combinatorial encoding and state the main problems. In particular, we define a frame, i.e., a tree endowed with a translation which is a combinatorial invariant of an increasing sequence of partitions. We state and discuss the distinguishability problem, and also discuss numerical invariants of exhausting sequences, for which the distinguishability problem has a positive answer. The main Section 3 introduces an encoding of a Bernoulli scheme by Weyl simplices (= intersections of Weyl chambers with the unit cube); it is used to establish an isomorphism between a continuous Bernoulli shift and the transfer of a triangular compactum (= the path space of the tree of i-permutations). Section 4 contains various problems related to this example and its generalizations. In particular, we state a problem on different compactifications of the infinite symmetric group, one being the compactum of virtual permutations and another one being the main example of this paper. In Section 5, we give a general definition of transfer for a graded graph and describe problems related to this notion.
2 Classical and combinatorial encoding of transformations
2.1 Classical encoding and generators
We begin by recalling the method of encoding endomorphisms and automorphisms or, more generally, arbitrary actions of groups and semigroups with an invariant measure used in symbolic dynamics, when one defines an action of the group by shifts in the space of functions on the group with a shift-invariant measure.
For simplicity, we consider a Bernoulli endomorphism (= one-sided Bernoulli shift) $S$, which is a transformation of the space $I^\infty$, where $I=[0,1]$, with an invariant product measure $m^\infty$, where $m$ is the Lebesgue measure on $I$. The usual method of encoding endomorphisms (and automorphisms) in ergodic theory and information theory is to choose an $I$-valued measurable function $f$ on the space $I^\infty$ of trajectories of the process (“symbolic space”) and study the family of all its shifts $\{f(S^n x)\}_{n\ge 0}$. If these shifts (regarded as functions on $I^\infty$) separate the points of $I^\infty$, i.e., the product of the partitions of $I^\infty$ into the preimages of points corresponding to the shifts of $f$ is the partition into singletons, then the partition into the level sets of the original function (and the function itself) is called a generator. In this case, we obtain a new isomorphic model of the shift in the same space $I^\infty$, but, in general, with another invariant measure different from $m^\infty$. To check whether a given function $f$ is a generator is a difficult and instructive problem even for a Bernoulli scheme, and more so in the general case. But since the product of partitions is a partition invariant under the shift, we always have a well-defined quotient endomorphism, which may or may not be isomorphic to the original endomorphism. This classical method of encoding endo- and automorphisms is studied in many papers both in ergodic theory and information theory.
2.2 Combinatorial encoding
We suggest encoding an endomorphism (in particular, a Bernoulli shift) in another way: instead of classical codes (functions or partitions), we will construct a shift-invariant increasing sequence of finite measurable partitions on a measure space (in particular, on the infinite-dimensional cube with a product measure).
In more detail, let $I=[0,1]$ and consider the infinite-dimensional cube $I^\infty$ with the Lebesgue measure $m^\infty$, where $m$ is the Lebesgue measure on the interval $[0,1]$. By $S$ we denote the one-sided shift, i.e., the endomorphism $(Sx)_n=x_{n+1}$ of the space $I^\infty$ with the invariant measure $m^\infty$.
We consider arbitrary increasing sequences of finite cylinder partitions $\{\xi_n\}$, $n=1,2,\dots$, of the infinite-dimensional cube $I^\infty$.
A finite partition consists of finitely many (measurable) sets of positive measure, called its elements. A sequence of finite partitions is increasing (notation: $\xi_n\prec\xi_{n+1}$) if for every $n$ every element of $\xi_{n+1}$ is a subset of some element of $\xi_n$, and every element of $\xi_n$ is the union of all elements of $\xi_{n+1}$ belonging to it. To exclude degenerate cases, we will assume that every element of $\xi_n$ contains at least two elements of $\xi_{n+1}$.
The invariance of a sequence of partitions under the shift means that the images of (almost) any two points belonging to the same element of $\xi_{n+1}$ belong to the same element of $\xi_n$, for $n=1,2,\dots$.
Besides, we require $\xi_n$ to be cylinder partitions, with $\xi_n$ being a partition into cylinders whose bases are subsets of the finite-dimensional cube $I^n$, which is the projection of the infinite-dimensional cube to the first $n$ coordinates. The more general case where $\xi_n$ is a cylinder partition with respect to the cube $I^{k_n}$, where $\{k_n\}$ is an increasing sequence with $k_n\to\infty$, is no different from this one.
Hence, describing a sequence of partitions of the type under consideration reduces to describing a sequence of coherent partitions of finite-dimensional cubes, which we will denote by the same symbols $\xi_n$.
We will assume that $\xi_1$ is the trivial partition (whose unique element is a set of full measure); its base is the trivial partition of the first cube, i.e., the interval $I^1=[0,1]$.
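Assume that we have already constructed an increasing chain of partitions $\xi_1\prec\xi_2\prec\dots\prec\xi_n$ invariant under finitely many shifts in which the elements of $\xi_k$ are cylinder sets with bases in $I^k$, for $k\le n$. Consider the cube $I^{n+1}$. For the base of the next partition $\xi_{n+1}$, we can take an arbitrary partition of the cube $I^{n+1}$ that is a refinement of the product of two partitions $\xi_n'$ and $\xi_n''$ defined as follows: take two projections of the cube $I^{n+1}$ to the cube $I^n$, the first one, $\pi'$, onto the coordinates $x_1,\dots,x_n$, and the second one, $\pi''$, onto the coordinates $x_2,\dots,x_{n+1}$; then $\xi_n'$ and $\xi_n''$ are the preimages of the partition $\xi_n$ of $I^n$ under these projections:
$$\xi_n'=(\pi')^{-1}\xi_n,\qquad \xi_n''=(\pi'')^{-1}\xi_n,\qquad \xi_{n+1}\succ\xi_n'\vee\xi_n''.$$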
Continue this process; the fact that the resulting sequence of partitions is increasing and invariant immediately follows from construction. Thus, we have obtained an infinite increasing shift-invariant sequence of finite cylinder partitions. It is not difficult to see that this procedure allows one to construct an arbitrary monotone invariant sequence of finite partitions. One can say that with this method of encoding, the continuity of the state space “escapes to infinity.” This phenomenon is worth a more detailed analysis.
Two sequences $\{\xi_n\}$ and $\{\xi_n'\}$ of the type under consideration are said to be metrically isomorphic if there exists an invertible measurable transformation preserving the measure and sending one sequence to the other one.
Note that the described procedure can be applied to an arbitrary Lebesgue space and a measure-preserving transformation of this space defined in symbolic form, i.e., with a generator fixed beforehand.
2.3 The frame as a combinatorial invariant of an encoding
With an increasing invariant sequence $\{\xi_n\}$ of finite cylinder partitions of the space $(I^\infty,m^\infty)$ we associate a most important combinatorial object: an infinite tree $T$ with an additional structure introduced below. The vertices of the $n$th level of $T$ correspond to the elements of the partition $\xi_n$; let $C$ be one of these elements; then it is joined by an edge with the vertex corresponding to the element $C'$ of the previous partition $\xi_{n-1}$ that contains $C$; we assume that this edge is directed from $C'$ to $C$. Thus, we have defined an infinite, locally finite tree corresponding to a sequence of partitions. Such a tree can be defined for every increasing sequence of finite partitions of every Lebesgue space.
But we also have the following map defined on the vertices of $T$: since the sequence of partitions is shift-invariant, to every element $C\in\xi_n$ for $n>1$ there corresponds a unique element $C''\in\xi_{n-1}$, in general different from $C'$, that is the image of all points of $C$ under the shift $S$; take the edge connecting the vertices corresponding to the elements $C$ and $C''$ and direct it from $C$ to $C''$. Thus, our tree is endowed with a bijection from the set of all its vertices to the set of all vertices except the first one, and this bijection preserves the partial order, i.e., sends a pair of vertices that constitute an edge to a similar pair at the previous level. We will call this map a translation and denote it by $\tau$. Using $\tau$, we define a map from the set of all infinite paths in $T$ to itself; namely, given a path $(v_1,v_2,\dots)$, the vertices of its image are the translations of its vertices: $v_n'=\tau(v_{n+1})$, $n=1,2,\dots$. This map will be called a transfer on the path space of the tree, and a tree for which a transfer is defined will be called a tree with a transfer.
Finally, recall that every element of every partition has a measure, a positive number from the interval $(0,1]$, and the sum of these numbers over each level of the tree is equal to $1$. It is more convenient to fix the conditional measure on the elements of the quotient partition $\xi_{n+1}/\xi_n$, i.e., fix a probability vector for each element $C\in\xi_n$.
Definition 1**.**
The frame of a combinatorial encoding of the space $(I^\infty,m^\infty)$ with the shift $S$, i.e., of an increasing invariant sequence of finite cylinder partitions $\{\xi_n\}$, is the tree with a transfer defined above endowed with a coherent system of probability vectors on its levels. In the case most interesting for our purposes, the conditional measures are uniform and determined by the numbers of elements of $\xi_{n+1}$ contained in each element of $\xi_n$.
The frame, being a graded tree with a transfer endowed with a system of measures, is a combinatorial (finite) invariant of an increasing sequence of partitions of the space $(I^\infty,m^\infty)$. On the other hand, every graded tree with a transfer and a system of measures can be realized as the frame of an increasing sequence of invariant finite measurable partitions of a Lebesgue space with a measure-preserving transformation (which can be different from $(I^\infty,m^\infty)$ with the shift $S$).
We will say that two sequences of partitions (or two encodings) are combinatorially isomorphic if their frames are isomorphic as graded trees with a transfer and a system of measures.
Clearly, two metrically isomorphic sequences of partitions are combinatorially isomorphic; however, the converse is not true, since the behavior of the sequences at infinity can be different (see the distinguishability problem below).
For the tree of Weyl simplices of type $A$, the frame and the corresponding transfer are considered in Section 3.
2.4 Distinguishability problem
Consider an infinite increasing shift-invariant sequence $\{\xi_n\}$ of finite cylinder partitions of the space $(I^\infty,m^\infty)$. Recall that the limit of an increasing sequence of finite partitions $\xi_n$, $n=1,2,\dots$, is their product $\bigvee_n\xi_n$, i.e., the measurable partition whose elements are all nonempty intersections of sequences of elements of $\xi_n$.
The fundamental question is whether the limiting partition, i.e., the product of partitions, coincides with the partition into singletons, i.e., whether it separates almost all, with respect to the measure $m^\infty$, points of the space $I^\infty$. If the answer is positive, this means that our encoding loses no information. In this case, the sequence $\{\xi_n\}$ will be called exhausting. It is more conventional to say that such a sequence is a basis of the measure space, since the $\sigma$-algebra spanned by all elements of all partitions of such a sequence is dense in the full $\sigma$-algebra.
In the classical encoding, a partition (or a function defining it) is called a generator if the product of the shifts of the original partition (into the level sets of the function) coincides with the partition into singletons. In the language of information theory, this means that this encoding loses no information.
Both in the classical and combinatorial cases, the same question arises: does the product of some set of partitions coincide with the partition into singletons? The crucial difference is that in the combinatorial encoding we consider a limit of finite partitions, which allows us to use combinatorial tools for solving the problem.
Another difference is in the realization of the quotient by the limiting partition. As we will see, in contrast to the classical case, where the quotient space is realized as the same symbolic cube with a new measure and a shift, in the combinatorial encoding it is realized as the path space of a locally finite graded graph, or, in other words, as a quasi-stationary (see below) topological chain with finite sets of states, and the quotient endomorphism is realized as a generalized shift, called a transfer.
Of course, the combinatorial encoding in the form described above applies not only to a Bernoulli endomorphism with a continuous set of states, but also to an arbitrary stationary measure in ; moreover, one can start with an automorphism of an abstract Lebesgue space.
Let us state the main problem once again in the most general form.
Problem 1** (distinguishability problem).**
In what cases does a sequence of partitions $\{\xi_n\}$ separate the points of the space $I^\infty$, or is exhausting, or, in other words, when does it solve the distinguishability problem? More formally: in what cases does the product $\bigvee_n\xi_n$ coincide with the partition into singletons (traditionally denoted by $\varepsilon$)?
The term “distinguishability” comes from the fact that the condition introduced above means that almost any two trajectories (from a set of full measure) fall into different elements of the partition $\xi_n$ for sufficiently large $n$.
Distinguishability is equivalent to the fact that almost every trajectory of the shift can be uniquely recovered from the countable set of elements of the partitions containing it. Since the encoding is shift-invariant, it suffices to recover only the first coordinate; all the other coordinates can then be recovered using the shift. The partitions are finite, so the positive answer to the distinguishability problem reduces the study of a continuous Bernoulli scheme to that of a countable encoding, i.e., to the study of a sequence of coordinates each taking finitely many values. Recall that a continuous Bernoulli scheme cannot have a finite generator (in the classical sense), so our construction essentially extends the possibilities of encoding.
2.5 Entropy estimates
Now we will consider numerical characteristics of the combinatorial encoding in the cases where the distinguishability problem has a positive answer.
Denote by the number of elements in the partition and by the number of elements of the partition lying in an element . Recall that , and let ; we have .
It follows from the definition of the sequence that . On the other hand, since the entropy is infinite, none of the partitions is a generator, whence . In fact, our aim is to construct a “diagonal” refinement of the family of sequences .
If $\bigvee_n\xi_n$ is the partition into singletons, then every finite partition can be approximated in the entropy metric (see [4]) by a partition that is measurable with respect to $\xi_n$ for sufficiently large $n$; hence, approximating a sequence that approaches a continuous generator and using the invariance of $\{\xi_n\}$, we conclude that
[TABLE]
In particular, if for every $n$ the partition $\xi_n$ is homogeneous, i.e., all its elements have the same measure, then
[TABLE]
Of course, the distinguishability problem will have a positive answer if we allow the number of elements of $\xi_n$ to grow too rapidly; for instance, it suffices to let this number be $n^n$ (a grid with step $1/n$ along all axes of the cube $I^n$); the distinguishability is obvious, since in this case all coordinates are approximated separately. The question is: how slowly can the number of elements of $\xi_n$ grow provided that the distinguishability holds?
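For a rough sense of scale, the following sketch compares the two growth rates that appear in this paper: a grid with step $1/n$ along each of the $n$ axes of the cube (which has $n^n$ cells) and the partition into Weyl simplices of Section 3 (which has $n!$ elements). The normalization "logarithm of the number of elements divided by $n$" is our own choice for illustration and is not claimed to be the exact quantity in the estimates of this section; the function names are hypothetical.

```python
import math


def grid_rate(n):
    """log(n**n) / n for the grid with step 1/n in the n-dimensional cube."""
    return math.log(n)


def weyl_rate(n):
    """log(n!) / n for the partition of the n-dimensional cube into Weyl simplices."""
    return math.lgamma(n + 1) / n  # lgamma(n + 1) equals log(n!)


for n in (10, 100, 1000):
    print(n, round(grid_rate(n), 4), round(weyl_rate(n), 4))
```

By Stirling's formula the two rates differ by approximately $1$ for large $n$, so in this normalization the Weyl partition grows only slightly more slowly than the obvious grid.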
On the other hand, we will give a lower entropy estimate on the number of elements of $\xi_n$, which provides a necessary condition for a positive answer to the distinguishability problem in terms of the growth of this number and can be proved by entropy considerations.
Proposition 1**.**
If for a sequence $\{\xi_n\}$ the distinguishability problem has a positive answer, i.e., $\bigvee_n\xi_n=\varepsilon$, then
[TABLE]
and the rate of convergence can be arbitrarily small.
But one can give both a direct estimate and a direct construction of a required sequence of partitions.
In the first nontrivial example (the “Weyl simplices”, see Section 3 below), where and , the estimate is as follows: . In the second example (the RSK correspondence), the growth is and , where is the Euler function (the number of partitions).
In the formal sense, the distinguishability problem in a combinatorial formulation reduces to a purely computational problem, namely, checking that almost all conditional measures on the elements of the partitions converge in some metric to $\delta$-measures, which in other terms is equivalent to some (nonlinear) law of large numbers. From this viewpoint, the combinatorial encoding is better adapted to the proof of distinguishability than the classical encoding to the proof that some partition is a generator, since in the former case we deal with a limit of finite partitions.
3 The main example: encoding a Bernoulli scheme by Weyl simplices and the triangular compactum
In this paper, we will study the simplest nontrivial example of a combinatorial encoding, namely, the combinatorial encoding of a Bernoulli scheme by Weyl simplices. From the abstract point of view, a special feature of the case under consideration, in terms of a notion introduced above, is that the frame of this encoding is a homogeneous tree, in which the number of outgoing edges is the same for all vertices of every level. In this example, i.e., in the “tree of i-permutations” (see footnote 2), this number is equal to $n+1$ for vertices of level $n$, and the number of vertices at this level, as well as the number of paths leading to it, is equal to $n!$.
Footnote 2: We introduce the term “i-permutation” (coming from “image of a permutation”) to emphasize the difference, often neglected, between a permutation as an element of the symmetric group and as an ordering of objects. If we fix some order on these objects, then an i-permutation is the image of this order under the action of the corresponding permutation.
The main problem is to find the asymptotic behavior of the collection of Weyl chambers (more exactly, Weyl simplices) of type $A$, as well as to establish links to related problems.
In fact, this example can be generalized, with the same proofs, to Bernoulli schemes with arbitrary state spaces. However, for definiteness, we will speak mainly of the interval $[0,1]$ with the Lebesgue measure.
3.1 The partition of the cube into Weyl simplices
We will define an increasing shift-invariant sequence $\{\xi_n\}$ of finite cylinder partitions of the cube $I^\infty$; namely, $\xi_n$ is the cylinder partition of $I^\infty$ whose base is the partition of the finite-dimensional cube $I^n$ into open Weyl simplices, by which we mean the intersections of open Weyl chambers in the Cartan subalgebra $\mathbb{R}^n$ with the unit cube (see footnote 3).
Footnote 3: We assume that in $\mathbb{R}^n$ a correspondence is fixed between the Weyl chambers and the i-permutations of the set $\{1,\dots,n\}$ (in other words, a root system is chosen).
We will consider the set of full measure in $I^\infty$ consisting of the vectors with pairwise distinct coordinates. The frame of this sequence of partitions is shown in Fig. 1. This tree can also be called the tree of i-permutations indexing the Weyl simplices (and chambers), or the factorial homogeneous tree.
The set of vertices of level $n$ consists of all i-permutations of $n$ symbols, which should not be confused with elements of the symmetric group $\mathfrak{S}_n$. An edge joins an i-permutation $\sigma$ of level $n$ and an i-permutation $\sigma'$ of level $n-1$ if $\sigma'$ is obtained from $\sigma$ by removing the element $n$.
By definition, the translation $\tau$ (see Section 2.3) associates with every vertex of level $n$ (for $n>1$) a vertex of level $n-1$ following the rule according to which the simplex of sequences starting from a vector $(x_1,\dots,x_n)$ changes when we remove, after application of the shift, the first coordinate $x_1$, that is, pass to the vector $(x_2,\dots,x_n)$. Recall that the i-permutation corresponding to the simplex containing $(x_1,\dots,x_n)$ is given by the formula
$$\sigma(x_1,\dots,x_n)=(i_1,\dots,i_n),\qquad\text{where } x_{i_1}<x_{i_2}<\dots<x_{i_n}.$$
Proposition 2**.**
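The i-permutation corresponding to the simplex containing $(x_2,\dots,x_n)$ is obtained from the i-permutation $(i_1,\dots,i_n)$ of the simplex containing $(x_1,\dots,x_n)$ by deleting the symbol $1$ and decreasing every remaining symbol by $1$:
$$\tau(i_1,\dots,i_n)=(i_k-1)_{k:\,i_k\ne 1}.$$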
The proof immediately follows from definitions. Thus, we have defined a translation, which is a map from the set of i-permutations of length $n$ to the set of i-permutations of length $n-1$. In the next section, where we compute the transfer for this graph, we use this map and interpret it in a slightly different way.
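The two operations on i-permutations can be checked on small cases. The sketch below assumes one particular convention, namely that the i-permutation of a vector lists the indices $1,\dots,n$ in increasing order of the corresponding coordinates; the paper's displayed formulas are lost in this copy, so this convention and the function names are assumptions made for illustration.

```python
from itertools import permutations


def i_permutation(x):
    """Indices 1..n listed in increasing order of the values x_1..x_n.

    Assumed convention: the word (i_1, ..., i_n) with x_{i_1} < ... < x_{i_n}.
    """
    return tuple(sorted(range(1, len(x) + 1), key=lambda i: x[i - 1]))


def smaller(sigma):
    """Tree edge: delete the largest symbol n from the word."""
    n = len(sigma)
    return tuple(s for s in sigma if s != n)


def translation(sigma):
    """Translation: delete the symbol 1 and decrease the remaining symbols by 1."""
    return tuple(s - 1 for s in sigma if s != 1)


# Check both rules against their geometric meaning on every ordering of 4 values.
for values in permutations((0.1, 0.3, 0.6, 0.8)):
    sigma = i_permutation(values)
    assert smaller(sigma) == i_permutation(values[:-1])     # forget x_n
    assert translation(sigma) == i_permutation(values[1:])  # shift: forget x_1

example = i_permutation((0.6, 0.1, 0.8, 0.3))
print(example, smaller(example), translation(example))
```

Under this convention, removing the symbol $n$ corresponds to forgetting the last coordinate (the tree edge), while removing the symbol $1$ and renumbering corresponds to applying the shift (the translation).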
Using this rule, we construct the frame corresponding to the tree of Weyl simplices, see Fig. 1.
The first transition (removing $n$) is called passing to the smaller i-permutation, and it is natural to say that the translation is passing to the previous i-permutation.
It is appropriate at this point to mention the difference between the notions introduced above and the theory of virtual permutations, see [2]: the operation that in [2] and related papers is called passing to the derivative permutation also consists in removing $n$, but from a permutation rather than an i-permutation (for example, the derivative permutation of (2413) is (231), while the smaller i-permutation is (213)). That is why the projective limits with respect to the operations of taking the derivative permutation and taking the previous i-permutation are different spaces: in the first case, this is the compactum of virtual permutations, while in the second case, we obtain a space whose nature is not quite clear.
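The example in the previous paragraph can be reproduced mechanically. In the sketch below, a word such as 2413 is read in two ways: as the one-line notation of a permutation (so that $1\mapsto 2$, $2\mapsto 4$, $3\mapsto 1$, $4\mapsto 3$), from whose cycle the symbol $n$ is removed, and as an i-permutation, from which the symbol $n$ is simply deleted. Reading the word in one-line notation is our assumption; it is the reading under which the numbers quoted in the text come out.

```python
def derivative_permutation(word):
    """Remove n from the cycle structure of the permutation i -> word[i - 1]."""
    n = len(word)
    image = {i: word[i - 1] for i in range(1, n + 1)}
    preimage_of_n = next(i for i in image if image[i] == n)
    if preimage_of_n != n:
        # Splice n out of its cycle: the element sent to n is now sent where n went.
        image[preimage_of_n] = image[n]
    del image[n]
    return tuple(image[i] for i in range(1, n))


def smaller_i_permutation(word):
    """Delete the symbol n from the word, keeping the order of the other symbols."""
    n = len(word)
    return tuple(s for s in word if s != n)


print(derivative_permutation((2, 4, 1, 3)), smaller_i_permutation((2, 4, 1, 3)))
```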
Thus, a combinatorial setting for the partition into Weyl simplices is ready. We emphasize the importance of the notion of translation, and hence that of transfer.
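To illustrate how the translation induces the transfer on paths, here is a minimal sketch under the same assumed convention as before (an i-permutation lists indices in increasing order of the coordinates). A finite path is the sequence of i-permutations of the prefixes of a vector, and the transfer drops the first vertex and translates the rest. All names are hypothetical.

```python
def i_permutation(x):
    """Indices 1..n in increasing order of the values (assumed convention)."""
    return tuple(sorted(range(1, len(x) + 1), key=lambda i: x[i - 1]))


def translation(sigma):
    """Delete the symbol 1 and decrease the remaining symbols by 1."""
    return tuple(s - 1 for s in sigma if s != 1)


def path(x):
    """Finite path in the tree: i-permutations of the prefixes (x_1..x_n)."""
    return [i_permutation(x[:n]) for n in range(1, len(x) + 1)]


def transfer(p):
    """Image path: the n-th vertex is the translation of the (n+1)-th vertex."""
    return [translation(v) for v in p[1:]]


x = (0.6, 0.1, 0.8, 0.3, 0.5)
# The transfer of the path of x is the path of the shifted vector.
assert transfer(path(x)) == path(x[1:])
print(transfer(path(x)))
```

The final assertion is the finite-dimensional form of the statement that the encoding maps the shift to the transfer.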
An important property of the sequence of partitions of the cube into Weyl simplices follows from the fact that the cube can be represented as the direct product
$$I^n=\mathfrak{S}_n\times\Delta_n \pmod 0,$$
where $\mathfrak{S}_n$ is the symmetric group acting on $I^n$ by permutations of coordinates and $\Delta_n$ is the standard convex open simplex: $\Delta_n=\{x\in I^n: 0<x_1<x_2<\dots<x_n<1\}$. The above decomposition is a decomposition of measure spaces, with the normalized Lebesgue measures on $I^n$ and $\Delta_n$ and the uniform measure on $\mathfrak{S}_n$. Thus, with the partition $\xi_n$ we can associate its independent complement, which is the partition into the orbits of $\mathfrak{S}_n$. This direct product decomposition of the cube can be lifted to the infinite-dimensional cube $I^\infty$ with the Lebesgue measure; namely, the independent complement to the cylinder partition $\xi_n$ is the partition of the cube $I^\infty$ into the orbits of $\mathfrak{S}_n$, which is no longer a cylinder partition.
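The direct product decomposition can be made concrete: a vector with distinct coordinates is split into the ordering of its coordinates and the sorted vector, and it is recovered from this pair. This is only a sketch; the function names and the use of 0-based positions are our choices.

```python
def decompose(x):
    """Split x into (order, point): point is x sorted increasingly and
    order[r] is the 0-based position in x of the r-th smallest coordinate."""
    order = tuple(sorted(range(len(x)), key=lambda i: x[i]))
    point = tuple(x[i] for i in order)
    return order, point


def recompose(order, point):
    """Inverse of decompose: put the r-th smallest value back at position order[r]."""
    x = [None] * len(point)
    for rank, position in enumerate(order):
        x[position] = point[rank]
    return tuple(x)


x = (0.6, 0.1, 0.8, 0.3)
order, point = decompose(x)
assert recompose(order, point) == x
print(order, point)
```

The first component records which Weyl simplex contains the vector, and the second is a point of the standard simplex of increasing vectors.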
3.2 The triangular compactum of paths in the tree of i-permutations
Consider the “triangular” compactum, by which we mean the space of all infinite sequences of positive integers in which the $n$th coordinate takes values in the set $\mathbf{n}=\{1,\dots,n\}$:
$$\mathcal{M}=\prod_{n=1}^{\infty}\mathbf{n}=\{(a_1,a_2,\dots): a_n\in\{1,\dots,n\},\ n=1,2,\dots\}.$$
This compactum can be regarded as the set of all paths in the $\mathbb{N}$-graded graph $\Gamma$ whose $n$th level is the set $\mathbf{n}$ and all pairs of vertices of neighboring levels are adjacent. In other words, two neighboring levels in this graph form a complete bipartite graph. The space of all infinite paths of the $\mathbb{N}$-graded graph $\Gamma$, denoted by $T(\Gamma)$, is $\mathcal{M}$. But the same compactum is, obviously, the path space of the tree of Weyl simplices.
The compactum $\mathcal{M}$ resembles the compactum of so-called virtual permutations (see [2]), since both are compactifications of the infinite symmetric group, though in different senses; nevertheless, the relation to virtual permutations and representations of the group $\mathfrak{S}_\infty$ is very important and will be discussed below. Recall that the compactum of virtual permutations was defined as the projective limit of the symmetric groups $\mathfrak{S}_n$ with respect to the maps
$$p_n:\mathfrak{S}_{n+1}\to\mathfrak{S}_n,$$
where $p_n$ is the operation of deleting the last symbol from a permutation; $p_n$ is not a group homomorphism, but it commutes with the right and left actions of $\mathfrak{S}_n$. Hence, on the projective limit space we have right and left actions of the infinite symmetric group $\mathfrak{S}_\infty$. Of course, a virtual permutation is an infinite path in the graph of permutations (the graph of Weyl simplices), or a point of the triangular compactum.
One can also define actions of the symmetric group on the triangular compactum, but here we do not discuss this issue. However, an analog of the Haar measure is defined on the compactum of virtual permutations and on $\mathcal{M}$ in the same way: it is the measure
$$\mu=\prod_{n=1}^{\infty}\mu_n,$$
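where $\mu_n$ is the uniform measure on the set $\mathbf{n}=\{1,\dots,n\}$. This is the unique measure invariant under the left and right actions of the group $\mathfrak{S}_\infty$. Besides, there is a one-parameter family of measures $\mu^\theta=\prod_n\mu_n^\theta$, $\theta>0$, defined as follows: $\mu_n^\theta(n)=\theta/(\theta+n-1)$, $\mu_n^\theta(k)=1/(\theta+n-1)$ for $k<n$. For $\theta=1$, we have $\mu^1=\mu$. All measures $\mu^\theta$ are invariant under the diagonal action.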
3.3 An isomorphism between the cube and the triangular compactum. The positive answer to the distinguishability problem
Weyl simplices can be used to define a simple but important isomorphism between the measure spaces $(I^\infty,m^\infty)$ and $(\mathcal{M},\mu)$.
Consider the measurable map $\Phi$ from the infinite-dimensional cube $I^\infty$ to the path space $\mathcal{M}$ defined as follows: for $x=(x_1,x_2,\dots)\in I^\infty$,
$$\Phi(x)=(a_1,a_2,\dots),\qquad a_n=\#\{i\le n: x_i\ge x_n\},$$
that is, the $n$th coordinate of the image is equal to the number of coordinates of the preimage with indices at most $n$ whose values are not less than the value of the $n$th coordinate of the preimage.
We may assume that the map $\Phi$ is defined only on the set of sequences with pairwise distinct coordinates, which has full measure in $I^\infty$, and is not defined on the remaining set (of zero measure). This is a cylinder map, i.e., the images and preimages of cylinder sets coincide with cylinders in the corresponding spaces.
Consider in more detail the map $\Phi_n$ for a finite-dimensional cube $I^n$. Obviously, the $\Phi_n$-preimage of a point of the space $\prod_{k=1}^{n}\mathbf{k}$ is an open Weyl simplex, i.e., the intersection of an open Weyl chamber with the unit cube: this is the open simplex of all vectors of the unit cube with a fixed collection of pairwise inequalities between their (distinct) coordinates. Thus, the finite-dimensional level sets of the map $\Phi_n$ divide (mod 0) the unit cube into the Weyl simplices.
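The verbal description of the map (the $n$th code symbol counts the coordinates with index at most $n$ whose values are not less than the $n$th coordinate) translates directly into code. The sketch below, with a hypothetical function name, also checks for $n=4$ that the finite code depends only on the relative order of the coordinates and takes each of the $4!$ possible values exactly once.

```python
from itertools import permutations, product


def encode(x):
    """n-th code symbol: the number of i <= n with x_i >= x_n (a value in 1..n)."""
    return tuple(
        sum(1 for xi in x[:n] if xi >= x[n - 1])
        for n in range(1, len(x) + 1)
    )


print(encode((0.6, 0.1, 0.8, 0.3)))

# Every ordering of 4 distinct values gets its own code, and all codes occur.
values = (0.1, 0.3, 0.6, 0.8)
codes = {encode(p) for p in permutations(values)}
assert len(codes) == 24
assert codes == set(product(*(range(1, k + 1) for k in range(1, 5))))
```

So the level sets of the finite code are exactly the Weyl simplices, which is the finite-dimensional statement made above.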
Theorem 1.
The map is a measure-preserving isomorphism between the spaces (infinite-dimensional cube) and (triangular compactum). It sends the sequence of partitions of the space defined above to the sequence of complete cylinder partitions of the compactum .
To make it clear, is the partition of into the classes of sequences of positive numbers in which the first coordinates coincide, .
One can see from the structure of the map that the values of the measures and on cylinder sets agree; indeed, both the measure of a finite-dimensional Weyl simplex of order and the measure of a point in a finite -fragment of are equal to , which implies that the measures of preimages and images coincide, i.e., the map is defined almost everywhere in , is surjective, and preserves the measure.
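The agreement of measures on cylinders can be checked numerically for small dimensions. The following is only a Monte Carlo sketch, assuming the verbal definition of the encoding given above; the names `encode` and `simplex_frequencies`, the sample size, and the seed are illustrative choices. Each open Weyl simplex in the n-dimensional cube has Lebesgue measure 1/n!, so each code should appear with a frequency close to 1/n!.

```python
import random
from collections import Counter
from math import factorial

def encode(x):
    """Entry k (1-based) is the number of indices i <= k with x_i >= x_k."""
    return [sum(1 for i in range(k + 1) if x[i] >= x[k]) for k in range(len(x))]

def simplex_frequencies(n, samples, seed=0):
    """Empirical frequency of each code among uniform random points of [0,1]^n."""
    rng = random.Random(seed)
    counts = Counter(
        tuple(encode([rng.random() for _ in range(n)])) for _ in range(samples)
    )
    return {code: count / samples for code, count in counts.items()}

n = 3
freqs = simplex_frequencies(n, 60000)
target = 1 / factorial(n)
for code in sorted(freqs):
    print(code, round(freqs[code], 4), "target", round(target, 4))
```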
It remains to prove that is injective, i.e., separates almost all points of the preimage . This simple but remarkable statement is worth highlighting.
Lemma 1.
The limiting partition of the infinite-dimensional cube (the limit of the partitions into open Weyl simplices) coincides (with respect to the Lebesgue measure) with the partition into singletons. In other words, the distinguishability problem for the partition into Weyl simplices has a positive answer. Therefore, the map is an isomorphism of measure spaces. In more detail, there exists a set of full Lebesgue measure in such that for any two points and of this set there exist indices and for which the corresponding coordinates satisfy the opposite inequalities:
[TABLE]
In a somewhat paradoxical form, the lemma can be stated as follows: almost every (with respect to the Lebesgue measure) infinite sequence of points from the interval can be uniquely recovered from the list of pairwise inequalities between its coordinates.
Or, even more paradoxically: almost every infinite-dimensional Weyl simplex consists of a single point. (Footnote 4: If, instead of the cubes we consider the spaces with the standard infinite-dimensional Gaussian measures, then our statement looks as follows: almost every infinite-dimensional Weyl chamber consists of a single ray.)
Proof.
Assume that two sequences have the same inequalities for all pairs of coordinates but differ in at least one coordinate: . Since the coordinates are independent, it follows that with probability there exists a number such that
[TABLE]
which implies a contradiction: , but . ∎
In fact, the proof uses not the independence of coordinates, but the equidistribution of almost every sequence , which follows from the pointwise ergodic theorem; hence, we can replace the Lebesgue measure with any measure in for which the coordinates are equidistributed.
Let us sketch another, more conceptual argument, which is applicable in a much more general situation.
To prove the distinguishability of Bernoulli realizations (trajectories) for a sequence of partitions means to prove the following: for every measurable function (it suffices to consider only cylinder functions), its average over the conditional measure on an element of the partition approaches for large . This, in turn, is equivalent to the fact that the projection of the conditional measure on the element to the finite-dimensional simplex whose cylinder hull contains is close to the -measure at the barycenter of this simplex. But this is true by the equidistribution of almost all trajectories and their fragments. Indeed, the equidistribution implies that the projections of long fragments of the sequence concentrate near the barycenter, since the interval between the coordinates should be filled uniformly. These considerations suffice to recover this proof of the theorem.
In contrast to the proof given above, the last argument does not use specific features of the partition into Weyl simplices; it is universal, since everything reduces to equidistribution or, more generally, to a law of large numbers. Elsewhere, we will apply the same argument to obtain a new proof of the Romik–Sniady theorem [5, 6] (which, in our terms, is a theorem on the distinguishability of the encoding via -tableaux in the RSK correspondence).
3.4 Transfer for the tree of i-permutations and the triangular compactum
Now we must write the image of the shift under the isomorphism in terms of sequences , i.e., find the corresponding transformation of the triangular compactum :
[TABLE]
This is exactly the transfer of the triangular compactum regarded as the path space of the graph . To find it, we will use the formula we have obtained for the translation.
Denote , and let be the number of coordinates in that are less than . Clearly, each is equal either to (if ), or to (if ). It is convenient to use the following terminology.
For each finite fragment of a path in the image (i.e., in the compactum ), we define marked positions by induction as follows. The first position is marked by definition. Assume that the number of marked positions among the first coordinates is equal to ; then the position is marked if and only if , i.e., does not exceed the number of previously marked positions.
Theorem 2.
The formula for is as follows: where
[TABLE]
In other words, the th coordinate of the image either coincides with the th coordinate of the preimage, or is less by one, depending on whether the number of coordinates less than the first one increases by when we add the th coordinate in the preimage.
In short, the transfer sends a virtual permutation to a new virtual permutation in which the th position is occupied either by the number that occupied the th position in the original permutation, or by this number decreased by , depending on its value.
Besides, the following relations hold:
[TABLE]
The proof immediately follows from the previous formulas and considerations.
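The rule of marked positions can be tested on finite fragments. The sketch below is an assumption-laden illustration: it derives the transfer directly from the verbal definition of the encoding (entry k counts the indices i ≤ k with x_i ≥ x_k) rather than from the displayed formula of Theorem 2, and the names `encode` and `transfer` are hypothetical. The counter `marked` equals the number of coordinates seen so far that are not less than the first one.

```python
import random

def encode(x):
    """Entry k (1-based) is the number of indices i <= k with x_i >= x_k."""
    return [sum(1 for i in range(k + 1) if x[i] >= x[k]) for k in range(len(x))]

def transfer(t):
    """Code of the shifted sequence, computed from the code t alone."""
    marked = 1                      # the first position is marked by definition
    image = []
    for value in t[1:]:
        if value <= marked:         # marked position: coordinate exceeds the first one
            image.append(value)     # the entry is kept unchanged
            marked += 1
        else:                       # unmarked: the first coordinate was counted
            image.append(value - 1) # so dropping it decreases the entry by one
    return image

# Compare with encoding the shifted sequence directly.
rng = random.Random(1)
samples = [[rng.random() for _ in range(12)] for _ in range(200)]
agree = all(transfer(encode(x)) == encode(x[1:]) for x in samples)
print(agree)
```

On a fragment of length n the transfer produces a fragment of length n − 1, which reflects the fact that it lowers the level of every edge of the path by one.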
Let us turn to the inversion formula. The formulas for directly imply the following theorem.
Theorem 3.
For almost all trajectories with respect to the measure ,
[TABLE]
In the same way we can find the other coordinates , . This and other similar formulas can be regarded as inversion formulas for the isomorphism .
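An inversion of this kind can be illustrated numerically. The sketch below does not reproduce the displayed formula of Theorem 3; it assumes only the verbal definition of the encoding and the equidistribution argument from the proof of Lemma 1: the fraction of marked positions among the first n tends to 1 − x_1. The names `encode` and `recover_first_coordinate`, the fragment length, and the seed are illustrative.

```python
import random

def encode(x):
    """Entry k (1-based) is the number of indices i <= k with x_i >= x_k."""
    return [sum(1 for i in range(k + 1) if x[i] >= x[k]) for k in range(len(x))]

def recover_first_coordinate(t):
    """Estimate the first coordinate from a finite fragment of the code only."""
    marked = 1                  # number of coordinates not less than the first one
    for value in t[1:]:
        if value <= marked:
            marked += 1
    return 1.0 - marked / len(t)

rng = random.Random(0)
x = [rng.random() for _ in range(2000)]
estimate = recover_first_coordinate(encode(x))
print(x[0], estimate)
```

The estimate uses nothing but the code, that is, nothing but the pairwise inequalities between coordinates, and its error decreases as the fragment grows.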
Thus, we have completely described an isomorphism between the triples
[TABLE]
The action of the operator on the space of virtual permutations with the Haar measure (which is isomorphic to a Bernoulli action) is of interest. One can prove the Bernoulli property for this operator directly (i.e., without using the isomorphism ), and also present a Bernoulli generator, which is “expelled to infinity,” as shown by the formula from Theorem 3. Here we see a remote analogy with the boson-fermion correspondence in a combinatorial version. The operator is exactly the transfer defined in the previous section.
The map also establishes the above-mentioned isomorphism between the sequence of partitions of the space and the sequence of partitions of the space .
Let us summarize our considerations.
Theorem 4.
The one-sided Bernoulli shift on the space is metrically isomorphic to the transfer defined on the triangular compactum (the compactum of virtual permutations) with the Haar measure. An isomorphism is established by the encoding of the cube with the system of Weyl simplices, which form an exhausting increasing sequence of finite partitions.
Another model of the triangular compactum is the path space of the infinite homogeneous tree of permutations with the transfer and the uniform central measure.
A nontrivial property of the isomorphism is that it reverses the direction of time: the first coordinate (as well as other cylinder functions on the cube) is mapped by this isomorphism to an infinitely remote limiting function on the triangular compactum, which is not a cylinder function. Some cylinder functions do remain cylinder functions, but their order increases. The continuity of the original system “escapes” to infinity.
4 Comments and remarks
Several comments are in order related to the above example.
4.1 More on the distinguishability problem
Using the combinatorial encoding, we have obtained an example of a realization of a Bernoulli endomorphism as a shift in the path space of a graph or, in other words, as a shift in the space of a nonstationary chain. This realization is a special case of the notion of transfer.
Another special, but much more complicated case of transfer is the Schützenberger transformation. It arises when one considers a covering by a Bernoulli scheme of the space of infinite standard Young tableaux with the Plancherel measure, which was (in the general case of ergodic central measures) suggested in [3]. A deep analysis and a proof of the fact that this covering is an isomorphism (rather than only a homomorphism) between the Bernoulli shift and the Schützenberger transformation, i.e., in our terms, a proof of distinguishability, was recently given by D. Romik and P. Sniady [5, 6]. Their analysis is based on the study of the Schützenberger transformation from the viewpoint of what I have called the nerve of a tableau using limit shape techniques, which allow one to obtain a complicated inversion formula. We will consider the distinguishability problem in the general setting, in particular, for the RSK correspondence, in another paper.
4.2 The list of isomorphisms
Let us enumerate isomorphic spaces with an invariant measure and a transfer:
- the path space of the factorial tree;
- the path space of the graph ;
- the triangular compactum with the Lebesgue measure;
- the space with the Bernoulli endomorphism.
This list can be extended by other graphs and their path spaces, for instance, by the Young graph and its path space (i.e., the space of infinite standard Young tableaux) with the Plancherel measure.
4.3 The relation to matrix distributions
Consider the space of all infinite symmetric matrices with zeros on the principal diagonal, and the product measure on it such that all coordinates , , are independent and have the distribution .
Theorem 5.
Let a map be defined by the formula
[TABLE]
It is a metric isomorphism between the above spaces endowed with the measures and , respectively.
The fact that is an isomorphism of measure spaces follows from the lemma proved above saying that the limit of the partitions coincides with the partition into singletons: .
For completeness, observe that the measure on the space of matrices is the matrix distribution (in the sense of [8]) of the following measurable function on the unit square endowed with the Lebesgue measure: . It is invariant under the simultaneous action of the infinite symmetric group on the rows and columns of matrices.
In fact, above we have established a not quite obvious isomorphism between the space of such matrices endowed with the measure and the space of virtual permutations endowed with the Lebesgue measure . The measure has interesting properties; it has also appeared in other contexts (universality), see [1].
4.4 Other compactifications of symmetric groups
The notion of translation of i-permutations suggests the idea of considering the projective limit of the symmetric groups with respect to this operation. We will assume that it applies to permutations rather than i-permutations, i.e., under the identification of i-permutations with permutations, maps onto . This map resembles the operation of taking the derivative permutation, but is more complicated. Namely, we remove the first element (the image of ), and then decrease the coordinates that are greater than the removed one by , leaving the other coordinates unchanged. The projective limit of the symmetric groups with respect to these operations should be regarded as a completion of the infinite symmetric group.
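The operation on permutations described in this paragraph can be written out for a finite permutation in one-line notation. This is a minimal sketch under the assumption that the decrement is by one; the name `drop_first` is hypothetical.

```python
def drop_first(perm):
    """Remove the first element (the image of 1) and decrease by one every
    remaining entry that is greater than the removed one."""
    first, rest = perm[0], perm[1:]
    return [v - 1 if v > first else v for v in rest]

result = drop_first([3, 1, 4, 2])
print(result)  # [1, 3, 2]
```

The result is again a permutation, now of one element fewer, so the operation maps each symmetric group onto the previous one.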
It seems that apart from this operation and that of taking the derivative permutation (see [2]), there is a whole series of reasonable operations and the corresponding projective limits. For example, one can delete in the same way as is deleted when taking the derivative permutation; this also results in an interesting object, a new extension of the infinite symmetric group different from the space of virtual permutations. Here is a more detailed description of the map that deletes from a permutation and decreases all numbers by :
[TABLE]
Obviously, this map is defined as a map (but not a homomorphism) from the group to itself such that every element has countably many preimages. It seems that this “nonhomomorphism” of the group to itself has every right to exist.
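A finite version of this map can be sketched as follows. The extracted text has lost the symbols, so the sketch assumes that the deleted element is 1 and that all remaining numbers are decreased by 1; the names `delete_one` and `preimages` are hypothetical. For a permutation of n − 1 elements there are n preimages among permutations of n elements, one for each position at which 1 can be inserted; in the infinite symmetric group the number of available positions is countable.

```python
def delete_one(perm):
    """Delete the entry 1 and decrease all remaining entries by one."""
    return [v - 1 for v in perm if v != 1]

def preimages(perm):
    """All permutations of one more element that delete_one maps to perm."""
    lifted = [v + 1 for v in perm]
    return [lifted[:i] + [1] + lifted[i:] for i in range(len(perm) + 1)]

print(delete_one([3, 1, 4, 2]))  # [2, 3, 1]
print(preimages([2, 1]))         # [[1, 3, 2], [3, 1, 2], [3, 2, 1]]
```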
4.5 Relation to C*-algebras
From the viewpoint of the general theory, the example considered above corresponds to the Glimm C*-algebra . Indeed, the spectrum of a finite-dimensional diagonal subalgebra of this graded algebra is exactly a finite fragment of the triangular compactum, and the whole compactum is the spectrum of the Gelfand–Tsetlin algebra for this C*-algebra. The unique central measure on the triangular compactum is the Haar measure.
4.6 Representations of the infinite symmetric group
The established isomorphism between the space of virtual permutations and the infinite-dimensional cube allows one to obtain new models of representations of the infinite symmetric group. It is more important that the result obtained above establishes an asymptotic isomorphism between the -space of the infinite symmetric group and a space of -functions on the Cartan subalgebra of a Lie algebra of type .
For every , we define an embedding of the space into the space of -functions on the distinguished Cartan subalgebra of the Lie group with respect to the Lebesgue measure on the unit cube in the chosen coordinates. (Footnote 5: We can just as well consider the -space with respect to the standard Gaussian measure on corresponding to the Killing form.)
An embedding is defined by the formula
[TABLE]
where is the -function at an element and is the characteristic function of the Weyl simplex . This correspondence can be extended by linearity to the whole group algebra of , and we obtain a map from this algebra to the space of functions on the Cartan subalgebra, which will be denoted by the same symbol .
Proposition 3.
The embedding is an isometry with respect to the norms of the spaces and .
Note that for every the image of the space under the isometry is a proper finite-dimensional subspace in .
It is natural that the -space on the Weyl group can be embedded into the space of functions on the Cartan algebra, but it is by no means obvious that in the limit the image coincides with the whole space.
It turns out that this is true by the positive answer to the distinguishability problem: in the limit we obtain an isometry. In more detail, we assume that is, in a natural sense, a subspace in , and the spaces of -functions on the Cartan subalgebras also constitute an inductive family, since we have embeddings of groups. Hence we can consider the limit of the isometries .
Corollary 1.
The limit of the isometries exists and is an isometry between the spaces and . It follows that the regular representation of the group can be realized, in a natural way, in the -space over a Bernoulli scheme as the limit of actions of finite Weyl groups on Cartan subalgebras.
This realization and its generalizations should be studied further. As we have already mentioned, such a realization of a Bernoulli endomorphism exists for a whole series of graphs, and each of them corresponds to a realization of the representation of the infinite symmetric group which in [10] was called the basic representation.
4.7 An analog of the partition into Weyl simplices for an arbitrary Bernoulli scheme
We conclude this section by showing that the method of encoding via Weyl simplices can be used to encode an arbitrary sequence of independent variables, and the linear order on the interval can be replaced by an arbitrary measurable ordering on a set of full measure.
Consider an arbitrary Lebesgue space and divide its square into a measurable set and its complement satisfying the following condition: for every pair of points there exists a set of positive measure such that for every one of the pairs lies in and the other one lies in .
Theorem 6.
Consider a Bernoulli scheme with the state space and assume that we have chosen a set of measure satisfying the condition stated above. Construct an increasing invariant sequence of finite measurable partitions as follows: an element of the partition is a cylinder with base consisting of all collections , , for which every pair for lies in a fixed one of the sets or . Then
1. The sequence has the same frame as the sequence of Weyl simplices and is metrically isomorphic to it. In particular,
2. The distinguishability problem for this sequence of partitions has a positive answer.
The proof is exactly the same as for Weyl simplices. Thus, from the metric point of view, the linear order on the interval has no specific features.
4.8 Distinguishability of random matrices
The statement of the distinguishability problem and its analysis suggested above can be used in a large number of problems similar to those considered in the paper. Let us give an example from the theory of random Gaussian matrices (GOE). Consider the space of all infinite symmetric real matrices endowed with the standard Gaussian measure . Here we consider the measure space instead of . Define a (no longer increasing) sequence of (no longer finite) measurable partitions of this space: an element of is the set of all matrices for which the principal minors have the same spectrum. The parameter of is an -vector of eigenvalues of the corresponding submatrix.
We obtain the distinguishability problem for the sequence : does there exist a set of matrices of full measure that can be uniquely recovered from the collection of parameters ? Cf. Lemma 1 on the distinguishability of points by the collection of pairwise inequalities.
5 Graded graphs, transfer, and quasi-stationary processes
5.1 Why not only trees
The combinatorial encoding of a sequence of random variables involves the study of increasing invariant sequences of finite partitions, and the combinatorial counterpart of the problem reduces to the study of trees with a transfer. However, in some natural settings of the distinguishability problem, the original object is not necessarily an increasing sequence of partitions.
Assume that we have a sequence of finite partitions of a Lebesgue space . If we regard the set of all elements of all partitions as the set of vertices of a graph to be constructed, and assume that an element of does not necessarily lie in one element of , but may intersect several elements of this partition, then we obtain a graph with edges connecting elements of neighboring partitions (i.e., vertices of neighboring levels) and if and only if the measure of their intersection is positive. Thus, we obtain a graded graph in which for each vertex there is a probability vector on the set of edges entering this vertex (i.e., an equipped graph in the terminology of [9]). The tree defined in Section 2 as the frame of an increasing sequence is a special case of this construction.
If we pass from the partitions to their products , then we obtain an increasing sequence of partitions , which reduces the problem to the case of a tree considered above; moreover, in terms of the graph constructed above, this tree is nothing else than the tree of finite paths in the original graph leading from the initial vertex to all the other vertices, and the graph itself is a natural quotient of this tree.
However, this reduction does not at all mean that there is no benefit from considering the graph itself, which can be illustrated by the example of encoding a sequence of independent variables by the Young graph and the RSK correspondence, as in [3, 6, 5]. In other words, the distinguishability problem can be analyzed in terms of the graph itself without passing to the tree of paths.
5.2 Definition of transfer for a graded graph
Let us sketch the main idea of applying the theory of graded graphs to ergodic problems of the type under consideration. The key role here is played by the notion of transfer in the path space of a graded graph which we introduce below.
Consider an arbitrary Bratteli diagram, i.e., an -graded locally finite graph (or even a multigraph) . An infinite tree is an example of such a graph. A path in is an infinite maximal sequence of edges in which the beginning of each edge coincides with the end of the previous edge. Denote the space of all paths by ; this is a Cantor-like compactum in the inverse limit topology. A transformation
[TABLE]
is called a transfer if it is continuous, decreases the level of each edge by , and satisfies the following locality (Markov) property: for every , the rule according to which an edge between levels and in a path is mapped to an edge between levels and in the image of this path depends only on the fragment of the path between levels and . This means that a transfer is determined by a set of local rules for the translation of an edge to the previous level.
For stationary graphs, in which the sets of vertices of all levels (except the first one) are isomorphic and these isomorphisms are fixed, the translation rule depends on nothing: an edge connecting vertices and of levels and goes to the edge connecting the vertices and of levels and identified with the vertices and , respectively. In this case, the transfer is an ordinary shift.
For trees regarded as graded graphs, the definition of transfer coincides with that from Section 2.3.
Definition 2.
A graded graph is said to be quasi-stationary if a “transfer” operation, corresponding to some translation of edges in the sense described above, is defined on its path space. The path space of a quasi-stationary graph, regarded as a topological Markov compactum, will be called a quasi-stationary Markov compactum.
Thus, we have described a new type of realizations of automorphisms and endomorphisms with infinite entropy as transfers on quasi-stationary Markov compacta.
According to our definition, a transfer is a shift of sequences of edges, and not of sequences of vertices as in the stationary case. Thus, this notion opens new possibilities for realizations of transformations.
A transfer defines an additional structure on the graph and, in general, is not uniquely determined by the graph itself, though in some cases there exists a distinguished transfer. For example, in the important special case of graphs in which every -interval contains either one or two intermediate vertices (this is the case for Hasse diagrams of arbitrary distributive lattices, in particular, for the Young graph and other examples), a translation of edges between adjacent vertices is determined in a natural way by the very structure of the graph, see Fig. 2.
Proposition 4.
For the Young graph, a transfer on the path space (i.e., on the space of infinite standard Young tableaux) is defined automatically in the sense described above, since the Young graph is the Hasse diagram of the distributive lattice of finite ideals of the lattice . In this case, it coincides with the well-known Schützenberger transformation, which is a special case of transfer.
The proof follows from a detailed analysis of the definition of transfer. (For the Schützenberger transformation, see [7, 3, 6, 5].)
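For orientation, one step of the Schützenberger transformation on a finite standard Young tableau can be sketched as follows. This is the standard jeu de taquin description (remove the entry 1, slide the smaller neighbor into the hole until the hole reaches a corner, then decrease all entries by one), not a derivation from the definition of transfer given above; the function name `schutzenberger` and the representation of a tableau as a list of rows are illustrative assumptions.

```python
def schutzenberger(tableau):
    """One jeu de taquin step on a standard Young tableau given as a list of rows."""
    rows = [list(r) for r in tableau]
    i, j = 0, 0                                  # the hole left by the entry 1
    while True:
        right = rows[i][j + 1] if j + 1 < len(rows[i]) else None
        below = rows[i + 1][j] if i + 1 < len(rows) and j < len(rows[i + 1]) else None
        if right is None and below is None:
            break                                # the hole has reached a corner
        if below is None or (right is not None and right < below):
            rows[i][j] = right                   # slide the right neighbor in
            j += 1
        else:
            rows[i][j] = below                   # slide the lower neighbor in
            i += 1
    del rows[i][j]                               # remove the corner cell
    return [[v - 1 for v in r] for r in rows if r]

print(schutzenberger([[1, 2, 4], [3, 5]]))  # [[1, 3], [2, 4]]
print(schutzenberger([[1, 3], [2]]))        # [[1, 2]]
```

On an infinite tableau the sliding path never terminates, and the resulting map acts on the whole path space of the Young graph; the finite version above loses one cell at each step.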
If a transfer is defined on the path space of a graded graph, then this space should be regarded as a nonstationary (or quasi-stationary) Markov chain, meaning that the transfer is an analog of the shift. If we have a central measure on the path space that is invariant under the transfer, then we obtain a quasi-stationary Markov chain with an invariant measure. Hence the theory of transfer becomes part of ergodic theory, as a nonconventional realization of measure-preserving transformations. A more detailed exposition of the theory of transfer will be presented elsewhere.
The author is grateful to P. P. Nikitin for preparing the figures, to P. B. Zatitskii for reviewing the literature, and to G. M. Zukerman for his creative attitude to the text.
Translated by N. V. Tsilevich.
- [1] E. Glasner and B. Weiss, The universal minimal system for the group of homeomorphisms of the Cantor set, Fund. Math. 176 (2003), 277–289.
- [2] S. Kerov, G. Olshanski, and A. Vershik, Harmonic analysis on the infinite symmetric group, Inv. Math. 158, No. 3 (2004), 551–642.
- [3] S. Kerov and A. Vershik, The characters of the infinite symmetric group and probability properties of the Robinson–Schensted–Knuth algorithm, SIAM J. Alg. Discr. Methods 7, No. 1 (1986), 14–123.
- [4] V. A. Rokhlin, Lectures on the entropy theory of measure-preserving transformations, Russian Math. Surveys 22, No. 5 (1967), 1–52.
- [5] D. Romik and P. Sniady, Jeu de taquin dynamics on infinite Young tableaux and second class particles, Ann. Probab. 43, No. 2 (2015), 682–737.
- [6] P. Sniady, Robinson–Schensted–Knuth algorithm, jeu de taquin and Kerov–Vershik measures on infinite tableaux, SIAM J. Discrete Math. 28, No. 2 (2014), 598–630.
- [7] R. Stanley, Enumerative Combinatorics, Vol. 2, Cambridge University Press, 1999.
- [8] A. M. Vershik, Classification of measurable functions of several arguments, and invariantly distributed random matrices, Funct. Anal. Appl. 36, No. 2 (2002), 93–105.
