Measure Preserving Diffeomorphisms of the Torus are Unclassifiable
Matthew Foreman, Benjamin Weiss

TL;DR
This paper proves that classifying measure-preserving diffeomorphisms of the torus up to isomorphism is impossible due to the complexity of their classification problem, which is shown to be a complete analytic set.
Contribution
It demonstrates that the isomorphism problem for ergodic diffeomorphisms of the torus is unclassifiable, extending the understanding of complexity in ergodic theory.
Findings
The set of isomorphic pairs of ergodic diffeomorphisms is a complete analytic set.
The classification problem is not Borel, indicating high complexity.
The result applies to smooth transformations on the 2-torus.
Abstract
The isomorphism problem in ergodic theory was formulated by von Neumann in 1932 in his pioneering paper Zur Operatorenmethode in der klassischen Mechanik (Ann. of Math. (2), 33(3):587--642, 1932). The problem has been solved for some classes of transformations that have special properties, such as the collection of transformations with discrete spectrum or Bernoulli shifts. This paper shows that a general classification is impossible (even in concrete settings) by showing that the collection of pairs of ergodic, Lebesgue measure preserving diffeomorphisms of the 2-torus that are isomorphic is a complete analytic set in the - topology (and hence not Borel).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical Dynamics and Fractals · Computability, Logic, AI Algorithms · Advanced Topology and Set Theory
Measure Preserving Diffeomorphisms of the Torus
are Unclassifiable
Matthew Foreman and Benjamin Weiss
Abstract
In 1932 von Neumann proposed classifying the statistical behavior of differentiable systems. In modern language this is interpreted as classifying diffeomorphisms of compact manifolds up to measure isomorphism. This paper proves that this is impossible in a rigorous sense.
Contents
1 Introduction
The isomorphism problem in ergodic theory was formulated by von Neumann in 1932 in his pioneering paper [23].111Two measure preserving transformations (abbreviated to ‘MPTs’ in the paper) and are isomorphic if there is an invertible measurable mapping between the corresponding measure spaces which commutes with the actions of and . The problem has been solved for some classes of transformations that have special properties. Halmos and von Neumann [15] used the unitary operators defined by Koopman to completely characterize ergodic measure preserving transformations with pure point spectrum. They showed that these are exactly the transformations that can be realized as translations on compact groups. Another notable success in solving this problem was the classification of Bernoulli shifts using the notion of entropy introduced by Kolmogorov.
Starting in the late 1990’s a different type of result began to appear: anti-classification results that demonstrate in a rigorous way that classification is not possible. This type of theorem requires a precise definition of what a classification is. Informally, a classification is a method of determining isomorphism between transformations by computing (in a liberal sense) other invariants for which equivalence is easy to determine.
The key words here are method and computing. For negative theorems, the more liberal a notion one takes for these words, the stronger the theorem. One natural way of what a computation is uses the Borel/non-Borel distinction. Saying a set or function is Borel is a loose way of saying that membership in or the computation of can be done using a countable (possibly transfinite) protocol whose basic input is membership in open sets. Saying that or is not Borel is saying that determining membership in or computing cannot be done with any countable amount of resources. (See [6] for an elementary discussion and a comparison with the more strict notion of recursive computation, which requires inherently finite resources.)
In the context of classification problems, saying that an equivalence relation on a space is not Borel is saying that there is no countable amount of initial information and no countable, potentially transfinite, protocol based on this information for determining, for arbitrary whether . Any such method must inherently use uncountable resources.222Many well known classification theorems have as immediate corollaries that the resulting equivalence relation is Borel. An example of this is the Spectral Theorem, which has a consequence that the relation of Unitary Conjugacy for normal operators is a Borel equivalence relation.
An example of a positive theorem in the context of ergodic theory is due to Halmos ([14]) who showed that the collection of ergodic measure preserving transformations is a dense set in the space of all measure preserving transformations of endowed with the weak topology. Moreover he showed that the set of weakly mixing transformations is also a dense .333Relatively straightforward arguments show that the set of strongly mixing transformation is a first category set. See [5].
The first anti-classification result in the area is due to Beleznay and Foreman [3] who showed that the class of measure distal transformations used in early ergodic theoretic proofs of Szemeredi’s theorem is not a Borel set. Later Hjorth [16] introduced the notion of turbulence and showed that there is no Borel way of attaching algebraic invariants to ergodic transformations that completely determine isomorphism. Foreman and Weiss [11] improved this result by showing that the conjugacy action of the measure preserving transformations is turbulent–hence no generic class can have a complete set of algebraic invariants.
In considering the isomorphism relation as a collection of pairs of measure preserving transformations, Hjorth ([17]) showed that is not a Borel set. However the pairs of transformations he used to demonstrate this were inherently non-ergodic, leaving open the essential problem:
Is isomorphism of ergodic measure preserving transformations Borel?
This question was answered in the negative by Foreman, Rudolph and Weiss in [8]. This answer can be interpreted as saying that determining isomorphism between ergodic transformations is inaccessible to countable methods that use countable amounts of information.
In the same foundational paper from 1932 von Neumann expressed the likelihood that any abstract MPT is isomorphic to a continuous MPT and perhaps even to a differentiable one. This brief remark eventually gave rise to one of the yet outstanding problems in smooth dynamics, namely:
Does every ergodic MPT with finite entropy have a smooth model? 444In [23] on page 590, “Vermutlich kann sogar zu jeder allgemeinen Strömung eine isomorphe stetige Strömung gefunden werden [footnote 13], vielleicht sogar eine stetig-differentiierbare, oder gar eine mechanische. Footnote 13: Der Verfasser hofft, hierfür demnächst einen Beweis anzugeben.”
By a smooth model it is meant an isomorphic copy of the MPT which is given by smooth diffeomorphism of a compact manifold preserving a measure equivalent to the volume element. Soon after entropy was introduced, A. G. Kushnirenko showed that such a diffeomorphism must have finite entropy, and up to now this is the only restriction that is known. The current paper is the culmination of a series whose purpose is to show that the variety of ergodic transformations that have smooth models is rich enough so that the abstract isomorphism relation, when restricted to these smooth systems, is as complicated as the general isomorphism problem for ergodic measure preserving systems. We show that even when restricting to diffeomorphisms of the 2-torus that preserve Lebesgue measure this is the case. The formal statement of our solution to the isomorphism problem is:
Theorem 1
If is either the torus , the disk or the annulus then the measure-isomorphism relation among pairs of measure preserving -diffeomorphisms of is not a Borel set with respect to the -topology.
Thus the isomorphism problem is impossible even for diffeomorphisms of compact surfaces.
How does one prove a result such as Theorem 1? The main tool is the idea of a reduction (see [6] and Section 4.6). A function reduces to if and only for all :
[TABLE]
If and are completely metrizable spaces and is a Borel function then is a method of reducing the question of membership in to membership in . Thus if is not Borel then cannot be either.
In the current context, the -topology on the smooth transformations refines the weak topology. Thus, by Halmos’ result quoted earlier, on the torus (disc etc.), the ergodic transformations are still a -set. (However the famous KAM theory shows that the ergodic transformations are no longer dense.) In particular the -topology induces a metrizable complete and perfect topology on the measure preserving diffeomorphisms of . If is a manifold with supporting a measure we denote the space of , -measure preserving diffeomorphisms of with the notation . Elements of are also members of the group MPT of -measure preserving transformations. For the centralizer of in MPT is denoted .
If is perfect and completely metrizable, a set is analytic if and only is the continuous image of a Borel set. A is complete analytic if and only if every analytic set can be reduced to . It is a classical fact that complete analytic sets are not Borel.
The proof of Theorem 1 uses a well-known example of a complete analytic set. The underlying space is the space and is the collection of ill-founded trees; those that have infinite branches. A precise statement of the main result of the paper:
Theorem 2
There is a continuous function , taking values among the ergodic transformations, such that for , if :
* has an infinite branch if and only if , and* 2. 2.
* has two distinct infinite branches if and only if*
[TABLE]
Corollary 3
- •
* is complete analytic.*
- •
* is complete analytic.*
Since the map is a continuous mapping of to and reduces to , it follows that:
Corollary 4
* and are ergodic diffeomorphisms of and are isomorphic is a complete analytic set and hence not Borel.*
We note that the problem of finding even one measure preserving transformation not isomorphic to its inverse is difficult. This was not done until Anzai in [2]. In Math Review MR0047742, Halmos said, “By constructing an example of the type described in the title the author solves (negatively) a problem proposed by the reviewer and von Neumann [Ann. of Math. (2) 43, 332?350 (1942): MR0006617]”.
More fine-grained information is now known and will be published elsewhere. For example, Foreman, in unpublished work, showed that the problem of “isomorphism of countable graphs” is Borel reducible to the isomorphism problem for ergodic measure preserving transformations.
The techniques of this paper also have foundational interest. A close analysis of our construction shows that the problem of whether is isomorphic to its inverse is “-hard.” (See [7]). This enables one to prove that truth or falsity of various open problems like the Riemann hypothesis is equivalent to the question of is isomorphic or not to its inverse for a specific measure preserving diffeomorphism of the torus given by our construction. Another consequence is the existence of a different diffeomorphism such that the question of whether is isomorphic to its inverse is independent of ZFC, the usual axioms for mathematics.
Here are two problems that remain open:
Problem 1
In contrast to [11], where the authors were able to show that the equivalence relation of isomorphism on abstract ergodic measure preserving transformations is turbulent, this remains open for ergodic diffeomorphisms of a compact manifold.
Problem 2
The problem of classifying diffeomorphisms of compact surfaces up to topological conjugacy remains largely open. Work of the first author with A Gorodetski shows that the isomorphism relation itself is not Borel, but for a very specific type of diffeomorphisms of manifolds of dimension 5 and above. It is not know, for example for topologically minimal transformations.
We owe a substantial debt to everyone who has helped us with this project. Jean-Paul Thouvenot brought the Anosov-Katok technique to our attention and suggested using it to solve the von Neumann problem. Philipp Kunde aided us by reading the paper and providing comments and corrections. Others include Eli Glasner, Anton Gorodetski, Alekos Kechris, and Anatole Katok.
We particularly want to acknowledge the contribution of the late Dan Rudolph, who helped pioneer these ideas and was a co-author in [8], contributing techniques fundamental to this paper.
2 An Outline of the Argument
This section gives an outline of the argument for Theorem 2. It uses the main results from our earlier papers: A Symbolic Representation of Anosov-Katok Systems([13]) and From Odometers to Circular Systems: A Global Structure Theorem ([12]) which we briefly summarize. In [13], the Anosov-Katok technique of Approximation by Conjugacy is used to give a new symbolic representation for a class of measure preserving diffeomorphisms that are extensions of the rotations by certain Liouvillean . These are called strongly uniform Circular Systems.555In a forthcoming paper we show how to drop the “strongly uniform” assumption.
In [12] two classes of symbolic systems are defined. The first, called Odometer Based systems, contains representatives of every finite entropy measure preserving transformation with an odometer factor. The second class is the collection of Circular Systems. These classes are made into categories by taking as morphisms synchronous and anti-synchronous factor maps. The main result is that there is a functorial isomorphism between between these categories that takes strongly uniform systems to strongly uniform systems.
Since the main construction in [8] uses Odometer Based systems this map enables us to adapt that construction to the smooth setting. However in order to prove our main result we still have to take into account potential isomorphisms of Circular Systems that are neither synchronous nor anti-synchronous. It is to deal with this difficulty that we analyze what we call the displacement function.
To each arising as a rotation factor of a circular system one can associate a displacement function (Section 7.1) and use it to associate the set of central values, a subgroup of the unit circle. Its significance is the following:
(Theorem 84) If is central then there is an such that the rotation factor of is rotation by .
- 2.
(Theorem 90) If is built from sufficiently random words,666i.e. satisfies the Timing Assumptions. and , then the canonical rotation factor of is rotation by a central value.
- 3.
It follows that if there is a and , then there is a synchronous such that .
- 4.
(Theorem 92) The analogous results relating isomorphisms between and with central values is proved, allowing us to conclude that if is isomorphic to then there is an anti-synchronous isomorphism between and .
- 5.
The previous two items are the content of Theorem 93, which says that for satisfying the Timing Assumptions, to decide whether or it suffices to consider anti-synchronous and synchronous isomorphisms.
In [8] a continuous function from the space of Trees to the strongly uniform odometer based transformations is constructed that:
- •
reduces the set of ill-founded trees to the transformations that are isomorphic to their inverses (and if then this is witnessed by an anti-synchronous isomorphism) and
- •
reduces the set of trees with two infinite branches to the transformations whose centralizer is different from the powers of (and if the centralizer contains an exotic element, it contains a synchronous exotic element).
Moreover, in the second case, there is a synchronous element of the centralizer with a specific piece of evidence that it is not the identity (it moves a -equivalence class).
Composing one concludes that :
- •
reduces the set of ill-founded trees to collection of circular systems that are isomorphic to their inverses and
- •
reduces the set of trees with two infinite branches to the circular systems whose centralizer is different from the closure of the powers of .
Continuously realizing the circular systems by (as in [13]) completes the proof that:
- •
The collection of ergodic measure preserving diffeomorphisms of the torus that are isomorphic to their inverses is complete analytic. Consequently the set of pairs of ergodic conjugate measure preserving diffeomorphisms is a complete analytic set.
- •
The collection of ergodic measure preserving diffeomorphisms whose centralizer is different from the closure of the powers of is complete analytic.
Figure 1 illustrates .
The next two sections review basic facts in ergodic theory and descriptive set theory, define odometer based and circular systems and review their properties and the facts shown in [13] and [12].
The analysis of the displacement function and the associated central values, which are a subgroup of the circle canonically associated to the Liouvillean , is carried out in sections 5-7. Finally the proof of the main theorems are given in section 8 modulo certain properties which impose some additional conditions on the parameters of the construction in [8]. These are verified in section 9 and in section 10 we spell out the dependencies between the various parameters and show that they can be realized.
3 Numerical Requirements
The proof of Theorem 2 uses a construction with many interconnecting pieces, most of which are built by taking limits. This results in a large number of related sequences of variables, each having their own requirements and the estimates for the different pieces must be compatible.
The least interesting part of this paper is is verifying the consistency of the numerical requirements. Sorting these requirements out is completely independent of the rest of the paper. For this reason, we list the numerical requirements in section 11.1, and then give an argument for their consistency. We also note the specific requirement by number in the text as they are posited and used.
Contributing to the complexity of the situation is that many of the relationships between the variables come from internal arguments of the general form “taking small enough you can guarantee that ”, with various variables in place of and . The exact relationship between and is not clear from the argument, but there is a requirement of the form “ is small as a function of .” A typical example of this is Sublemma 99 which says that, as a function of , if is take sufficiently small then an involved inequality involving and holds.
Complicating this task further is the fact that the construction in this paper depends on the construction in [8], which has its own numerical requirements. For a reader tracking the correspondence, in the appendix, we include a table for translating between the notation in this paper and the notation in [8]
The variables
Here is a list of variable sequences that have to be chosen during the construction:
[TABLE]
Some of these variables have clear relationships that are externally determined. The main construction is of a function that has a tree as in input. That tree directly determines a sequence of parameters, such as and that are not chosen during the construction. (In section 11, we call these exogenous variables.) These parameters determine some of the numerical requirements.
Example 5
The words in the collection are built by a sequence of substitutions into equivalence classes of the relations , where for the collection of heights on nodes in the given tree at stage . These substitution instances are closed under a sequence of actions of the groups . The number and dimension of the actions is also determined by the tree. Thus is determined by the exogenous variables , , and the internally chosen variable . In this particular example, It is possible to give a completely explicit formula for in terms and vice versa.777 for numbers and determined exogenously.
However that would be uninformative. What we need to see is that if is large then is and vice versa and that each determines the other. This is the only relevant information for determining the consistency of the numerical requirements. We have thus eliminated one variable.
It would perhaps be more conventional to define all of the variables in advance, write down the list of inequalities and then show they are consistent. However the examples above illustrate the difficulties with this. The inequalities are intimately intertwined with the details of the construction and are completely enigmatic without that context. For this reason we note the Numerical Requirements one by one as they accumulate and collect them in section 11.1. We then proceed to show that they are consistent by the method we describe next. A reader with a preference for the conventional presentation is advised to skip directly to section 11, read the reconciliation and then return to read the rest of the paper.
What could possibly go wrong?
The only potential issue is that that there may be a situation where the requirements are circular: for example, might have to be small as a function of , small as a function of and small as a function of . In symbols
[TABLE]
So if you choose first, then then you might find that your choice of was inadequate. Indeed, because there is a cycle in the dependency diagram there is no variable you can choose first and be certain of consistency.
Method for showing consistency
In section 11 we analyze the dependencies and draw a dependency diagram giving the order of choice. Since that diagram is cycle free, all of the variables can be chosen to satisfy the accumulated requirements.
4 Preliminaries
The reader is referred to standard texts such as [22], [24] or [21]. Facts that are not standard and are simply cited here are proved in [12], [13] and [8].
4.1 Measure Spaces
We will call separable non-atomic probability spaces standard measure spaces and denote them where is the Boolean algebra of measurable subsets of and is a countably additive, non-atomic measure defined on . Maharam and von Neumann proved that every standard measure space is isomorphic to where is Lebesgue measure and is the algebra of Lebesgue measurable sets.
If and are measure spaces, an isomorphism between and is a bijection such that is measure preserving and both and are measurable. We will ignore sets of measure zero when discussing isomorphisms; i.e. we allow the domain and range of to be subsets of and of measure one.
A measure preserving system is an object where is a measure isomorphism. A factor map between two measure preserving systems and is a measurable, measure preserving function such that . A factor map is an isomorphism between systems iff is a measure isomorphism.
Let be measure preserving, be a measurable space, a measurable map and be a measurable map such that . Then we can define a measure by setting . This measure makes a factor map from to .
4.2 Presentations of Measure Preserving Systems
Measure preserving systems occur naturally in many guises with diverse topologies. As far as is known, the Borel/non-Borel distinction for dynamical properties is the same in each of these presentations and many of the presentations have the same generic classes. (See the forthcoming paper [9] which gives a precise condition for this.)
Here is a review the properties of the types of presentations relevant to this paper, which are: abstract invertible preserving systems, smooth transformations preserving volume elements and symbolic systems.
4.2.1 Abstract Measure Preserving systems
Since every standard measure space is isomorphic to the unit interval with Lebesgue measure, every invertible measure preserving transformation of a standard measure space is isomorphic to an invertible Lebesgue measure preserving transformation on the unit interval.
In accordance with the conventions of [5] we denote the group of measure preserving transformations of by MPT.888Recently several authors have adopted the notation for the same space. Two measure preserving transformations are identified if they are equal on sets of full measure.
Two measure preserving transformations are isomorphic if and only if they are conjugate in the group MPT and we will use isomorphic and conjugate as synonyms. However some caution is order. If is a manifold, is a smooth measure preserving transformation and is an arbitrary measure preserving transformation from to , then is unlikely to be smooth. Thus, the equivalence relation of isomorphism of diffeomorphisms is not given by an action of the group of measure preserving transformations in an obvious way.
Given a measure space and a measure preserving transformation , define the centralizer of to be the collection of measure preserving such that . This group is denoted . Note that this is the centralizer in the group of measure preserving transformations. In the case that is a manifold and is a diffeomorphism, differs from the centralizer of inside the group of diffeomorphisms.
To each invertible measure preserving transformation , associate a unitary operator by defining . In this way MPT can be identified with a closed subgroup of the unitary operators on with respect to the weak operator topology999Which coincides with the strong operator topology in this case. on the space of unitary transformations. This makes MPT into a Polish group. We will call this the weak topology on MPT. Halmos ([14]) showed that the ergodic transformations, which we denote , is a dense set in MPT. In particular the weak topology makes into a Polish subspace of MPT.
There is another topology on the collection of measure preserving transformations of to for measure spaces and . If are measure preserving transformations, the uniform distance between and is defined to be:
[TABLE]
This topology refines the weak topology and is a complete, but not a separable topology.
4.2.2 Diffeomorphisms
Let be a -smooth compact finite dimensional manifold and be a standard measure on determined by a smooth volume element. For each there is a Polish topology on the -times differentiable homeomorphisms of , the -topology. If is , then the -topology is the coarsest topology refining the -topology for each . It is also a Polish topology and a sequence of -diffeomorphisms converges in the -topology if and only if it converges in the -topology for each .
The collection of -preserving diffeomorphisms forms a closed nowhere dense set in the -topology on the -diffeomorphisms, and as such, inherits a Polish topology.101010One can also consider the space of measure preserving homeomorphisms with the topology, which behaves in some ways similarly. We will denote this space by .
Viewing as an abstract measure space one can also consider the space of abstract -preserving transformations on with the weak topology. In [4] it is shown that the collection of a.e.-equivalence classes of smooth transformations form a -set in MPT(M), and hence the collection has the Property of Baire.
4.2.3 Symbolic Systems
Let be a countable or finite alphabet endowed with the discrete topology. Then can be given the product topology, which makes it into a separable, totally disconnected space that is compact if is finite.
Notation: If is a finite sequence of elements of , then we denote the cylinder set based at in by writing . If we abbreviate this and write . Explicitly: . The collection of cylinder sets form a base for the product topology on .
Let be finite sequences of elements of having length . Given intervals and in of length we can view and as functions having domain and respectively. We will say that and are located at and . We will say that is shifted by relative to iff is the shift of the interval by . We say that is the -shift of iff and are the same words and is the shift of the interval by .
The shift map:
[TABLE]
defined by setting is a homeomorphism. If is a shift-invariant Borel measure then the resulting measure preserving system is called a symbolic system. The closed support of is a shift-invariant closed subset of called a symbolic shift or sub-shift.
Symbolic shifts are often described intrinsically by giving a collection of words that constitute a clopen basis for the support of an invariant measure. Fix a language , and a sequence of collections of words with the properties that:
for each all of the words in have the same length , 2. 2.
each occurs at least once as a subword of every , 3. 3.
there is a summable sequence of positive numbers such that for each , every word can be uniquely parsed into segments
[TABLE]
such that each , and for this parsing
[TABLE]
The segments in condition 1 are called the spacer or boundary portions of .
Definition 6
A sequence satisfying properties 1.)-3.) will be called a construction sequence.
If is a collection of words in an alphabet , we will say that is uniquely readable if and only if whenever and then either:
- •
and or
- •
and .
Equation 1 of clause 3 implies that each is uniquely readable. We will need unique readability to parse elements of , the symbolic shift associated with the construction sequence.
Definition 7
Let be the collection of such that every finite contiguous subword of occurs inside some belonging to some . Then is a closed shift-invariant subset of that is compact if is finite.
The symbolic shifts built from construction sequences coincide with transformations built by cut-and-stack constructions.
Notation: For a word we will write for the length of .
Here is a natural set of measure one for the relevant measures:
Definition 8
Suppose that is a construction sequence for a symbolic system with each uniquely readable. Let be the collection such that there are sequences of natural numbers , going to infinity such that for all there is an .
Note that is a dense shift-invariant set.
Lemma 9
[13]** Fix a construction sequence for a symbolic system in a finite language. Then:
* is the smallest shift-invariant closed subset of such that for all , and , has non-empty intersection with the basic open interval .* 2. 2.
Suppose that there is a unique invariant measure on , then is ergodic. 3. 3.
(See **[12]**) If is an invariant measure on concentrating on , then for -almost every there is an for all , there are such that .
Example 10
Let be a construction sequence. Then is uniform* if there is a summable sequence of positive numbers and , where such that for each all words and if is the number of such that *
[TABLE]
It is shown in [13] that uniform construction sequences are uniquely ergodic. A special case of uniformity is strong uniformity: when each occurs exactly the same number of times in each . This property holds for the circular systems considered in [13] and that are used for the proof of the main theorem of this paper (Theorem 2).
4.2.4 Locations
Let be a uniquely readable construction sequence and let be a shift invariant measure on . For and each either lies in a well-defined subword of belonging to or in a spacer of a subword of belonging to some . By Lemma 9 for -almost all and for all large enough there is a unique with such that .
Definition 11
Let and suppose that for some . Define to be the unique with with this property. We will call the interval the principal -block of , and its principal -subword. The sequence of ’s will be called the location sequence of .
Thus is saying that is the symbol in the principal -subword of containing [math]. We can view the principal -subword of as being located on an interval inside the principal -subword. Counting from the beginning of the principal -subword, the position is located at the position in .
Remark 12
It follows immediately from the definitions that if is well-defined and , the position of the word occurring in the principal -block of is in the position inside the principal -block of .
Lemma 13
[12]** Suppose that and and for all , and have the same principal -subwords. Then .
Thus an element of is determined by knowing any tail of the sequence together with a tail of the principal subwords of .
Remark 14
Here are some consequences of Lemma 13:
Given a sequence with , if we specify which occurrence of in is the principal occurrence, then determines an completely up to a shift with . 2. 2.
A sequence and sequence of words comes from an infinite word if both and go to infinity and that the position in is in the position in a subword of identical to .
Caveat*: just because is the location sequence of some and is the sequence of principal subwords of some , it does not follow that there is an with location sequence and sequence of subwords .* 3. 3.
If have the same principal -subwords and for all large enough , then .
4.2.5 A note on inverses of symbolic shifts
We define operators we label , and apply them in several contexts
Definition 15
If is in , define the reverse of by setting . For , define:
[TABLE]
If is a word, let to be the reverse of sitting on the same interval. Explicitly, if is the word then and . If is a collection of words, is the collection of reverses of the words in .
If is an arbitrary symbolic shift then its inverse is . It will be convenient to have all of the shifts go in the same direction, thus:
Proposition 16
The map sending to is a canonical isomorphism between and .
The notation stands for the system and for the system .
4.3 Generic Points
Let be a measure preserving transformation from to , where is a compact separable topology, and is a standard measure. Then a point is generic for if and only if for all ,
[TABLE]
The Ergodic Theorem tells us that for a given and ergodic equation (4) holds for a set of -measure one. Intersecting over a countable dense set of gives a set of -measure one of generic points. For symbolic systems the generic points are those such that the -measure of all basic open intervals is equal to the density of such that occurs in at .
4.4 Stationary Codes and -Distance
In this section we briefly review a standard idea, that of a stationary code. A reader unfamiliar with this material who is interested in the proofs of the facts cited here should see [22]
Definition 17
Suppose that is a countable language. A code of length is a function (where is the interval of integers starting at and ending at ).
Given a code , the stationary code determined by is the function where, given
[TABLE]
Let be a symbolic system. Given two codes and (not necessarily of the same length), define and . Then is a semi-metric on the collection of codes. The following is a consequence of the Borel-Cantelli lemma.
Lemma 18
Suppose that is a sequence of codes such that . Then there is a shift-invariant Borel map such that for -almost all , .
A shift-invariant Borel map , determines a factor of by setting . Hence a convergent sequence of stationary codes determines a factor of .
Let and be codes. Define to be
[TABLE]
More generally define the metric on by setting
[TABLE]
For , we set
[TABLE]
provided this limit exists.
To compute distances between codes we will use the following application of the Ergodic Theorem.
Lemma 19
Suppose that is ergodic. Let and be codes. Then for almost all :
[TABLE]
The next proposition is used to study alleged isomorphisms between measure preserving transformations. We again refer the reader to [22] for a proof.
Proposition 20
Suppose that and are symbolic systems and is a factor map. Let . Then there is a code such that for almost all ,
[TABLE]
To show that equation 5 cannot hold (and hence show that is not a factor of ) we will want to view as limits of -images of large blocks of the form with . There is an ambiguity in doing this: if the code has length it does not make sense to apply it to for or . However if is quite large with respect to , then filling in the values for arbitrarily as ranges over these initial and final intervals makes a negligible difference to the -distances of the result. In particular if then for all large enough , we have
[TABLE]
no matter how we fill in the ambiguous portion.
The general phenomenon of ambiguity or disagreement at the beginning and end of large intervals is referred to by the phrase end effects. Because the end effects are usually negligible on large intervals we will often neglect them when computing distances.
The next proposition is standard:
Proposition 21
Suppose that is an ergodic symbolic system and is a sequence of functions from that commute with the shift. Then the following are equivalent:
The sequence converges to in the weak topology. 2. 2.
. 3. 3.
For -almost all . 4. 4.
For some -generic , for all we can find an for all , for all large enough , the distance .
We finish with a remark that we will use in several places:
Remark 22
If and are words in a language defined on an interval and with , then .
4.5 Rotations of the circle
Many of the arguments in this paper are based on an understanding of rational approximations to rotations of the circle. It is usually convenient to adopt additive notation and work on the unit interval , but this introduces ambiguities. Fix an . We use the symbol in two ways. The first way is that
[TABLE]
by rotating the circle by radians. The second, equivalent, way is that
[TABLE]
and is given by the formula
[TABLE]
We note in both cases that we are really concerned with .
4.6 Descriptive Set Theory Basics
Let and be Polish spaces and .111111The ideas in section are just summaries, they are exposited in [5] and [19]. A function reduces to if and only if for all :
[TABLE]
For this definition to have content there must be some definability restriction on . The relevant restrictions for this paper are either that is a Borel function (i.e. the inverse image of an open set is Borel) or that is a continuous function (i.e. the inverse image of an open set is open). The latter is clearly a stronger condition. If is Borel and is a Borel reduction, then is clearly Borel. Taking the contrapositive, if is not Borel then is not. If is Borel (resp. continously) reducible to we will write (resp. ). Both and are clearly pre-partial-orderings.121212The reader should be aware that this is a different notion than the notion of a reduction of equivalence relations.
If is a collection of pairs and , then is -complete for Borel reductions (resp. continuous reductions) if and only if every is Borel reducible (resp. continuously reducible) to . Being complete is interpreted as being at least as complicated as each set in .
For this to be useful there must be examples of sets that are not Borel. If is a Polish space and , then is analytic () if and only if it the continuous image of a Borel subset of a Polish space. This is equivalent to there being a Polish space and a Borel set such that is the projection to the -axis of .
Correcting a famous mistake of Lebesgue, Suslin proved that there are analytic sets that are not Borel. It follows immediately that complete analytic sets are not Borel. This paper uses a canonical example of such a set.
Let be an enumeration of , the finite sequences of natural numbers. Using this enumeration subsets can be identified with functions \mbox{\Large{\chi}}_{S}:{\mathbb{N}}\to\{0,1\}.
A tree is a set such that if and is an initial segment of , then . The set \{\mbox{\Large{\chi}}_{\mathcal{T}}:{\mathcal{T}} is a tree is a closed subset of , hence a Polish space with the induced topology. We call the resulting space . (In the sequel we will not always distinguish between and \mbox{\Large{\chi}}_{\mathcal{T}}.)
Because the topology on the space of trees is the “finite information” topology, inherited from the product topology on , the following characterizes continuous maps defined on .
Proposition 23
Let be a topological space and . Then is continuous if and only if for all open and all with there is an for all :
if , then .
An infinite branch through is a function such that for all . A tree is ill-founded if and only if it has an infinite branch.
The following theorem is classical; proofs can be found in [19], [20].
Fact 24
Let be the space of trees. Then:
The collection of ill-founded trees is a complete analytic subset of . 2. 2.
The collection of trees that have at least two distinct infinite branches is a complete analytic subset of .
The main results of this paper (Theorem 2 and Corollary 3) are proved by reducing the sets mentioned in Theorem 24 to conjugate pairs of diffeomorphisms and concluding that the sets of conjugate pairs is complete analytic–so not Borel.
5 Odometer and Circular Systems
Two types of symbolic shifts play central roles for the proofs of the main theorem, the odometer based and the circular systems. Most of the material in this section appears in [12] in more detail and is reviewed here without proof.
5.1 Odometer Based Systems
We now define the class of Odometer Based Systems. In a sequel to this paper ([10]), we prove that these are exactly the finite entropy transformations that have non-trivial odometer factors. We recall the definition of an odometer transformation. Let be a sequence of natural numbers greater than or equal to 2. Let
[TABLE]
be the -adic integers. Then naturally has a compact abelian group structure and hence carries a Haar measure . The set becomes a measure preserving system by defining to be addition by 1 in the -adic integers. Concretely, this is the map that “adds one to and carries right”. Then is an invertible transformation that preserves the Haar measure on . Let .
The following results are standard:
Lemma 25
Let be an odometer system. Then:
* is ergodic.* 2. 2.
The map is an isomorphism between and . 3. 3.
Odometer maps are transformations with discrete spectrum and the eigenvalues of the associated linear operator are the roots of unity ().
Any natural number can be uniquely written as:
[TABLE]
for some sequence of natural numbers with .
Lemma 26
Suppose that is a sequence of natural numbers with and . Then there is a unique element such that for each .
We now define the collection of symbolic systems that have odometer maps as their timing mechanism. This timing mechanism can be used to parse typical elements of the symbolic system.
Definition 27
Let be a uniquely readable construction sequence with the properties that and for all for some . The associated symbolic system will be called an odometer based system.
Thus odometer based systems are those built from construction sequences such that the words in are concatenations of words in of a fixed length . The words in all have length and the words in equation 1 are all the empty words.
Equivalently, an odometer based transformation is one that can be built by a cut-and-stack construction using no spacers. An easy consequence of the definition is that for odometer based systems, for all and for all , exists.131313 is defined in Definition 8.
The next lemma justifies the terminology.
Lemma 28
Let be an odometer based system with each . Then there is a canonical factor map
[TABLE]
where is the odometer system determined by .
For each , for all is defined and both and go to infinity. By Lemma 26, the sequence defines a unique element in . It is easily checked that intertwines and .
Heuristically, the odometer transformation parses the sequences in by indicating where the words constituting begin and end. Shifting by one unit shifts this parsing by one. We can understand elements of as being an element of the odometer with words in filled in inductively.
The following remark is useful when studying the canonical factor of the inverse of an odometer based system.
Remark 29
If is the canonical factor map, then the function is also factor map from to (i.e. with the operation “”). If is the construction sequence for , then is a construction sequence for . If is the canonical isomorphism given by Proposition 16, then Lemma 25 tells us that the projection of to a map is given by .
The following is proved in [12]:
Proposition 30
Let be an odometer based system and suppose that is a shift invariant measure. Then concentrates on .
5.2 Circular Systems
We now define circular systems. In [13] it is shown that the strongly uniform circular systems give symbolic characterizations of certain smooth diffeomorphisms defined by the Anosov-Katok method of conjugacies.
These systems are called circular because they are related to the behavior of rotations by a convergent sequence of rationals . The rational rotation by permutes the intervals of the circle cyclically in a manner that the interval occurs in position (mod ).141414We assume that and are relatively prime and the exponent indicates the multiplicative inverse modulo . The operation which we are about to describe models the relationship between rotations by and when is very close to .
Let be positive natural numbers with relatively prime. For , setting with . It is easy to verify that:
[TABLE]
For notational convenience later we set .
Let be a non-empty set such that neither nor belongs to and be words in . Define:151515We use and powers for repeated concatenation of words.
[TABLE]
We note that the product symbol is repeated concatenation as is the exponent. If is a word then is the empty string, , and so forth. The formula in equation 7 is a concatenation of words, each of which is itself, a concatenation of words. The words inside the parenthesis in equation 7 start with ’s, followed by concatenating many ’s, followed by concatenating many ’s. Written with parenthesis:
[TABLE]
Informally, the term, can be written as a block of ’s followed by concatenated with itself times, followed by a block of many ’s, followed by a block of ’s followed by concatenated with itself times followed by a block of ’s and so forth, ending with a block of repeated times followed by repeated many times:
[TABLE]
Remark 31
- •
Suppose that each has length , then the length of is .
- •
For each occurrence of an in there is an occurrence of to the left of it.
- •
Suppose that and occurs at and occurs at and neither occurrence is in a . Then there must be some occurring between and .
- •
Words constructed with are uniquely readable.
The operation is used to build a collection of symbolic shifts. Circular systems will be defined using a sequence of natural number parameters and that is fundamental to the version of the Anosov-Katok construction presented in [18].
Fix an arbitrary sequence of positive natural numbers . Let be an increasing sequence of natural numbers such that
Numerical Requirement 1
* and .*
From the and we define sequences of numbers: . Begin by letting and and inductively set
[TABLE]
(thus ) and take
[TABLE]
Then clearly is relatively prime to .161616 and being relatively prime for , allows us to define the integer in equation 5.2 . For , has one element, , so we set .
Setting , then it is easy to check that there is an irrational such that the sequence converges rapidly to .
Definition 32
A sequence of integers such that , will be called a circular coefficient sequence*.*
Let be a non-empty finite or countable alphabet. Build collections of words in by induction as follows:
- •
Fix a circular coefficient sequence .
- •
Set .
- •
Having built choose a set and form by taking all words of the form with .171717Passing from to , use with parameters and and take modulo . By Remark 31, the length of each of the words in is .
We will call the elements of prewords. The operator automatically creates uniquely readable words, however we will need a stronger unique readability assumption for our definition of circular systems.
Strong Unique Readability Assumption: Let , and view as a collection of letters. Then each element of can be viewed as a word with letters in . In the alphabet , each is uniquely readable.
Definition 33
A construction sequence will be called circular if it is built in this manner using the -operators, a circular coefficient sequence and each satisfies the strong unique readability assumption.
Definition 34
A symbolic shift built from a circular construction sequence will be called a circular system.
Notation: we will often write and to emphasize that we are building circular systems and circular construction sequences. Circular words will often be denoted for emphasis.
Definition 35
Suppose that . Then consists of blocks of repeated times, together with some ’s and ’s that are not in the ’s. The interior of is the portion of in the ’s. The remainder of consists of blocks of the form and . We call this portion the boundary of .
In a block of the form the first and last occurrences of will be called the boundary occurrences of the block . The other occurrences will be the interior occurrences.
While the boundary consists of sections of made up of ’s and ’s, not all ’s and ’s occurring in are in the boundary, as they may be part of a power .
The boundary of constitutes a small portion of the word:
Lemma 36
Suppose that and each has length . Then the proportion of the word that belongs to its boundary is . Moreover the proportion of the word that is within letters of boundary of is .
The length of is . The boundary portions are long. The number of letters within letters of the boundary is .
Remark 37
Let and be sequences of words of length . The boundary portions of and occur in the same positions and by Lemma 36 have proportion of the length. Since all of the ’s and ’s have the same length and the same multiplicity in the circular words we see:
[TABLE]
where and are the concatenations of the various words.181818Equality holds, a fact we won’t use.
For proofs of the next lemma see [13] (Lemma 20) and [12].
Lemma 38
Let be a circular system and be a shift-invariant measure on . Then the following are equivalent:
* has no atoms.* 2. 2.
* concentrates on the collection of such that is unbounded in both and .* 3. 3.
* concentrates on .*
If is a uniform circular system (Example 10), then there is a unique invariant measure concentrating on .
Moreover there are only two ergodic invariant measures with atoms: the one concentrating on the constant sequence and the one concentrating on .
Remark 39
If is circular and has a principal -subword and , then has a principal -subword.
5.3 An Explicit Description of .
The symbolic system is built by an operation applied to collections of words. The system is built by a similar operation applied to the reverse collections of words. In analogy to equation 7, we define as follows:
Definition 40
Suppose that are words in a language . Given coefficients with and relatively prime, let with . Define
[TABLE]
From equation 7, a is of the form :
[TABLE]
where and with . By examining this formula we see that
[TABLE]
Applying the identity in formula 6, we see that this can be rewritten as191919Recall that we take , so .
[TABLE]
Thus
[TABLE]
In particular if is a construction sequence of a circular system , then is the collection:
[TABLE]
and is a construction sequence for .
5.4 Understanding the Words
The words used to form circular transformations have quite specific combinatorial properties. Fix a sequence defining a circular system. Each has three subscales:
- Subscale 0, the scale of the individual powers of of the form ; We call each such occurrence of a a 0-subsection.
- Subscale 1, the scale of each term in the product that has the form ; We call these terms 1-subsections.
- Subscale 2, the scale of each term of that has the form ; We call these terms 2-subsections.
Summary
[TABLE]
For , we will discuss “-subwords” of a word . These will be subwords that lie in , the stage of the construction sequence. We will use “-block” to mean the location of the -subword.
Lemma 41
Let for some and . View .
If and are such that and are at the beginning of -subwords in the same 2-subsection, then . 2. 2.
If and are such that is the beginning of an -subword occurring in a -subsection and is the beginning of an -subword occurring in the next 2-subsection then .
To see the first point, the indices of the beginnings of -subwords in the same -subsection differ by multiples of coming from powers of a and intervals of of the form .
To see the second point, let and be consecutive -subsections. In view of the first point it suffices to consider the last -subword of and the first -subword of . These sit on either side of an interval of the form . Since , we see that .
Assume that and and is shifted with respect to . On the overlap of and , the 2-subsections of split each 2-subsection of into either one or two pieces. Since the 2-subsections all have the same length, the number of pieces in the splitting and the size of each piece is constant across the overlap except perhaps at the two ends of the overlap. If splits a 2-subsection of into two pieces, then we call the leftmost piece of the pair the even piece and the rightmost the odd piece.
If is shifted only slightly, it can happen that either the even piece or the odd piece does not contain even one entire -subsection. In this case we will say that the split is trivial on the left or trivial on the right
Lemma 42
Assume that and and is shifted with respect to . Suppose that the -subsections of divide the -subsections of into two non-trivial pieces. Then
the boundary portion of occurring between each consecutive pair of 2-subsections of completely overlaps at most one -subword of 2. 2.
there are two numbers and such that the positions of the [math]-subsections of in even pieces are shifted relative to the [math]-subsections of by and the positions of the [math]-subsections of in odd pieces are shifted relative to the [math]-subsections of by . Moreover .
This follows easily from Lemma 41
In the case where the split is trivial Lemma 42 holds with just one coefficient, or . A special case of Lemma 42 that we will use is:
Lemma 43
Assume that and and is shifted with respect to . Suppose that the -subsections of divide the -subsections of into two pieces and that for some occurrence of a -subword
in an even (resp. odd) piece is lined up with an occurrence of some -subword in . Then every occurrence of a -subword in an even (resp. odd) piece of is either:
- a.)
lined up with some -subword of or 2. b.)
lined up with a section of a -subsection that has the form .
Moreover, no -subword in an odd (resp. even) piece of is lined up with a -subword in .
5.5 Full Measure Sets for Circular Systems
Fix a sequence such that
Numerical Requirement 2
* is a decreasing sequence of numbers in such that .*
From Lemma 36, the boundary of a word has proportion . Hence Numerical Requirement 2 implies that for all choices with , the sum of the proportion of the boundary sections of is finite.
Definition 44
Let:
* be the collection of such that either does not have a principal -block or is in the boundary of the principal -block of ,* 2. 2.
* is in the first or last copies of in a power of the form where ,* 3. 3.
* is in the first or last 1-subsections of the 2-subsection in which is located.,* 4. 4.
* is in the first or last 2-subsections of its principal -block*.
Lemma 45
Assume numerical requirements 1 and 2. Let be a shift-invariant measure on , where is a circular system. Then:
[TABLE]
For : 2. 2.
[TABLE]
By the Ergodic Theorem we have , and for . The result then follows by the summability of and
In particular we see:
Corollary 46
For -almost all there is an such that for all ,
* is in the interior of its principal -block,* 2. 2.
For .
In particular, for almost all and all large enough : 3. 3.
if , then
[TABLE] 4. 4.
* is not in a string of the form or .*
Apply the Borel-Cantelli Lemma using the previous lemma.
The elements of such that some shift fails one of the conclusions 1.)-4.) of Corollary 46 form a measure zero set. Consequently we work on those elements of whose whole orbit satisfies the conclusions of Corollary 46. Note however that for , the in Corollary 46, depends on .
Definition 47
We will call mature for (or say that is mature at stage ) iff is so large that for all .
If is mature at stage then is mature at stage . Moreover, if has the same principal -block as does then is mature if and only if is not in the boundary portion of the principal -block.
Numerical Requirement 3
The following hold:
[TABLE]
Definition 48
We will use the symbol in multiple equivalent ways. If or define to be the collection of such that is in the boundary portion of an -subword of . In the spatial context define by putting if is the boundary of an -subword of .
For
[TABLE]
The relationship between and is that for :
[TABLE]
The next lemma says that if is mature at stage , then we can detect locally those for which the -shifts of are mature.
Lemma 49
Suppose that , is mature for and .
Assume the first three numerical requirements. Suppose that . Then is mature for iff
- (a)
* and* 2. (b)
. 2. 2.
For all but at most proportion of the , the point is mature for .
Hence by numerical requirement 2, the proportion of for which the -shift of is not mature for is less than .
The first item is immediate from the definition of mature. For the second item, first note that
[TABLE]
Let . Since has proportion of , it suffices to show that for a fixed , the proportion of such that is less than .
There are at most -words appearing in . There are at most many in the boundary of each of these -words. So total number of in is less than or equal to , hence has proportion less than or equal to of .
Similarly for the number of with and is in the block corresponding to a -subword of is at most , and hence those have proportion bounded by in . It follows that the collection of such that is bounded by .
Numerical requirements 1 and 2 imply that the sum in item 2 is bounded by .
A very similar statement is the following:
Lemma 50
Suppose that and has a principal -block. Then is mature provided that . In particular, if is mature for and is not in a boundary portion of its principal -block or in , then is mature for .
5.6 The Circle Factor
Let be a circular coefficient sequence and be the associated sequence defined by formulas 9 and 10. Let and .
For a natural number , let be the partition of with atoms , and refer to as .202020If then refers to where and . Since and are relatively prime, the rotation enumerates the partition starting with . Thus has two natural orderings–the usual geometric ordering and the dynamical ordering given by the order that enumerates . Since (mod ), is the interval in the dynamical ordering.
Definition 51
For we will write if belongs to the interval in the dynamical ordering of . Equivalently if .
Informal description: Following [13], for each stage , we have a periodic approximation to consisting of towers of height whose levels correspond to subintervals of . This approximation refines the periodic permutation of determined by . If is mature then lies is the level of in the dynamical ordering. Passing from to the mature points remain in the same levels of the -towers as they are spread into the -towers in . The towers of can be viewed as cut-and-stack constructions–filling in boundary points between cut -towers. The fillers are taken from portions of the -towers.
With this view each mature point remains in the same interval of when viewed in . Moreover if and , then .
Thus the -tower for has multiple contiguous sequences of levels of length that are sublevels of the -tower and the action of and agree on these levels.
Definition 52
Let . We define a circular construction sequence such that each has a unique element as follows:
* and* 2. 2.
If then .
Let be the resulting circular system.
It is easy to check that has unique non-atomic measure since the unique -word, , occurs exactly many times in . This measure is ergodic.
Let be an arbitrary circular system with coefficients . Then has a canonical factor isomorphic to . This canonical factor plays a role for circular systems analogous to the role odometer transformations play for odometer based systems.
To see is a factor of , define the following function:
[TABLE]
Notation: Write for the unique element of in the construction sequence for . Then lies in the principal -block of the projection to of any for which is mature.
Theorem 53
([13], Theorem 43.) Let be the unique non-atomic shift-invariant measure on . Then
[TABLE]
where is the rotation of the unit circle by radians and are the -algebras of measurable sets.
The isomorphism asserted to exist in Theorem 53 is constructed as a limit of functions , where is defined by setting
[TABLE]
iff is the interval in the dynamical ordering.212121Thus and both have the same subset of as their domain and contain the same information. They map to different places , whereas and is the left endpoint of the interval in the dynamical ordering. Equivalently, since the interval in the geometric ordering is :
[TABLE]
The following follows from Proposition 44 in [13].
Proposition 54
Suppose that is mature for , then
[TABLE]
The proof of Theorem 2 requires understanding the correspondence between the geometric construction and its symbolic representation. The words in correspond to cut-and-stack constructions, passing from stage to via the operator corresponds to basing the cut and stack construction on which agrees with the for most consecutive intervals of length . A first step in understanding this correspondence is the next remark and lemma.
Remark 55
It will be helpful to understand explicitly. To each point in the range of , belongs to . By Lemma 13, to determine it suffices to know for some as well as the sequence of principal subwords of . Since we are working with , the only choice for is . For mature , Proposition 54 tells us that . Thus is the unique element of with the property that agrees with for all large .
We isolate the following fact for later use:
Lemma 56
Suppose that and are mature for . Then if and are the and intervals in the dynamical orderings of and , then .
The natural way of representing the complex unit circle as an abelian group is multiplicatively: the rotation by radians is multiplication by . It is often convenient to identify the unit circle with . In doing so, multiplication by corresponds to “mod one” addition and the complex conjugate corresponds to .
The following result is standard:
Proposition 57
Let be irrational. Suppose that is an invertible measure preserving transformation that commutes with . Then for some , almost everywhere. Identifying with there is a such that for almost all
[TABLE]
It follows that if is an isomorphism between and , then .
Definition 58
Using the identification of with we view . Given a rotation , we get a map such that
[TABLE]
We will occasionally abuse notation and write for .
5.7 Points of view
Circular systems can be viewed from multiple perspectives: geometrically, as limits of periodic processes222222See section 5 of [13] for the formal definition. and as symbolic shifts.
The periodic process consists of a collection of periodic towers with each tower having one level designated as a base. To pass from to the bulk of the -towers are repeated many times in blocks of length in each -tower. In between these blocks there are filler levels.
The words are in one-to-one correspondence with the towers in . The “” operation encodes the transition from to . The towers in correspond to words . Each -tower has a corresponding word . Repeating stacking of corresponds to the powers of in . The levels of a tower in are either contained in levels of -tower or are filler blocks labelled “” or “.” The repetitions of each in [math]-subsections correspond to stacking parts of the levels of the corresponding tower in periodically times.
The circle factor captures exactly the structure of the levels of the towers and how they interact as one moves from to . This is the idea behind for the construction of the isomorphism between and and made explicit in Proposition 54.
Given an that is mature for we can view its restriction to its principal -subword as a particular tower in . Since is mature for , the principal subword is repeated many times on either side of . In particular we see:
Remark 59
Suppose that is mature for , and . Then
[TABLE]
The circle factor of punctuates the elements of . Since there is only one word in each element of the construction sequence for , we can view the levels of its tower as being of the form in the dynamical ordering. Then the cyclic permutation of these levels given by . This permutation preserves the dynamical ordering and, for that are mature at stage , reflect the behavior of .
5.8 The Natural Map
A specific isomorphism will serve as a benchmark for understanding of potential maps . Viewing as a rotation of the unit circle by radians one can view the transformation as a symbolic analogue of complex conjugation on the unit circle, which is an isomorphism between and . Indeed, by Theorem 53, and so . Copying over to a map on the unit circle will give an isomorphism between and . If we view and as elements of the unit interval and the rotation as addition modulo 1, Proposition 57 says that such an isomorphism must be of the form
[TABLE]
for some . It follows immediately from this characterization that is an involution.232323The particular given by is determined by the specific variation of the definition one uses–indeed any central value can occur as a . (See section 8 for the definition and use of central values.)
The map is defined as the limit of a sequence of codes that converge to an isomorphism from to (see [12] for more details). The will be shifting and reversing words. The amount of shift is determined by the Anosov-Katok coefficients defined in equations 10 and 9.
Let and inductively
[TABLE]
It is easy to check that
[TABLE]
Define a stationary code with domain that approximates elements of by defining
[TABLE]
The following result appears in [12]:
Theorem 60
The sequence of stationary codes converges to a shift invariant function that induces an isomorphism from to .
Remark 78 of [12] implies that the convergence is prompt: for a typical and all large enough , agrees with on the principal -block of .
Caveat
Since is trivially isomorphic to we often don’t distinguish them. However, as in Definition 63 of the synchronous and anti-synchronous joinings, the notational distinction becomes important.
When viewing and with the backwards shift and considering the action on the circle factor instead of using , one must use
[TABLE]
instead of simply .
5.9 Categories and the Functor .
Fix a circular coefficient sequence . Let be a language and be a construction sequence for an odometer based system with coefficients . Then for each the operation is well-defined. Define a construction sequence and bijections by induction as follows:
Let and be the identity map. 2. 2.
Suppose that and have already been defined.
[TABLE]
(Words in are concatenations of words in and so can be written in the required form: as with .)
Define the map by setting
[TABLE]
Note in case 2 the prewords are:
[TABLE]
Remark 61
Some useful facts are:
- •
It follows from Lemma 36 and Numerical Requirement 1 that if is an odometer based construction sequence, then is a construction sequence; i.e. the spacer proportions are summable.
- •
If each occurs exactly the same number of times in every element of , then is strongly uniform.
- •
Odometer words in have length . The length of the circular words in is .
Definition 62
Define a map from the set of odometer based subshifts to circular subshifts as follows. Suppose that is an odometer based shift built from a construction sequence . Define
[TABLE]
where has construction sequence .
The map is one to one by the unique readability of words in . Suppose that is a circular system with coefficients . We can recursively build functions from words in to words in . The result is a odometer based system with coefficients . If is the resulting odometer based system then . Thus is a bijection.
If is an odometer based system, denote the odometer base by and let be the canonical factor map. If is a circular system, let be the rotation factor and be the canonical factor map. For both odometer based and circular systems the underlying canonical factors serve as timing mechanisms. This motives the following.
Definition 63
Synchronous* and anti-synchronous joinings are defined as follows:242424We use for the notation for the rotation factor of a circular system . In this context, when taking inverses of symbolic systems we keep the same orientation for the symbolic system and use *
Let and be odometer based systems with the same coefficient sequence, and a joining between and . Then is synchronous if joins and and the projection of to a joining on is the graph joining determined by the identity map (the diagonal joining of the odometer factors); is anti-synchronous if is a joining of with and its projection to is the graph joining determined by the map . 2. 2.
Let and be circular systems with the same coefficient sequence and a joining between and . Then is synchronous if joins and and the projection to a joining of with is the graph joining determined by the identity map of with , the underlying rotations; is anti-synchronous if it is a joining of with and projects to the graph joining determined by on .
The Categories Let be the category whose objects are ergodic odometer based systems with coefficients . The morphisms between objects and will be synchronous graph joinings of and or anti-synchronous graph joinings of and . We call this the category of odometer based systems.
Let be the category whose objects consists of all ergodic circular systems with coefficients . The morphisms between objects and will be synchronous graph joinings of and or anti-synchronous graph joinings of and . We call this the category of circular systems.
The main theorem of [12] is the following:
Theorem 64
For a fixed circular coefficient sequence the categories and are isomorphic by a function that takes synchronous joinings to synchronous joinings, anti-synchronous joinings to anti-synchronous joinings, isomorphisms to isomorphisms and weakly mixing extensions to weakly mixing extensions.252525Glasner showed that it takes compact extensions to compact extensions.
It is also easy to verify that the map takes uniform construction sequences to uniform construction sequences and strongly uniform construction sequences to strongly uniform construction sequences.
Remark 65
Were we to be completely precise we would take objects in to be presentations of odometer based systems by construction sequences without spacers and the objects in to be presentations by circular construction sequences. This subtlety does not cause problems in the sequel so we ignore it.
5.10 Propagating Equivalence Relations and Actions
In [8], the number is the first stage in the tree for which has length . It is the first stage that the equivalence relation is defined.
The main result of [8] is the existence of a continuous function from the space of trees to odometer based transformations that reduces ill-founded trees to ergodic transformations isomorphic to their inverses. Components of the construction include equivalence relations and groups . Some of their properties are:
is a monotone, strictly increasing function from to , 2. 2.
is the trivial equivalence relation with one equivalence class on . 3. 3.
is an equivalence relation on 4. 4.
For , viewing elements of as concatenations of words in , is the product equivalence relation of . Hence we can view as sequences of elements of and similarly for . These sequences have length and are made of many constant blocks of length . 5. 5.
The groups are direct sums of copies of that have a designated canonical collection of free generators.262626These groups are described in detail in Section 10.2. Each , where is either or is trivial. 6. 6.
Each group acts freely on in a manner that even parity group elements preserve the sets and and the odd parity group elements send elements of to . 7. 7.
The action of on is propagated from by the skew-diagonal action: if is a canonical generator and is of the form then
[TABLE]
We now define corresponding equivalence relations and group actions on . They will be used in section 8.2.1 to state the timing assumptions and in section 10.2 which gives the construction specifications from [8].272727If is an equivalence relation on define by if and only if . In abuse of notation we will not distinguish between as a relation on , as a relation on or .
An inductive understanding of and the -actions is quite useful.
Inductive definition of : Define
- •
to have exactly one class in each ,
- •
For put if and only if .
Suppose we are given on . Define an equivalence relation on by setting equivalent to if and only if for all is -equivalent to .
Rather than a full definition of the action of on , we describe the how the action of propagates: via the circular skew diagonal action:
Identify with the collection of sequences of the form
[TABLE]
as ranges over the elements of .
To define the skew-diagonal action of on classes of circular words it suffices to specify it on the canonical generators, This is done by setting282828We use to denote .
[TABLE]
whenever is a canonical generator of . Note that the skew-diagonal action has the property that the canonical generators take elements of to elements of . It follows that the even parity elements of leave the sets and invariant and odd parity elements of take to elements of and vice versa.
As in [8] the equivalence relations define factors of and similarly define factors of The equivariant definitions given here imply that takes each to and respects the actions of the .
6 Understanding Rotations
Let be a rotation factor of a circular system with coefficient sequence . This section analyzes how automorphisms of affect the parsing of elements of .
Let and be two circular systems with that share a given circular coefficient sequence and let . Any isomorphism between and induces a unitary isomorphism from to , and this isomorphism sends eigenfunctions for to eigenfunctions for . Thus every isomorphism has to send the canonical factor of to the canonical factor of . Explicitly: suppose that is an isomorphism. Then , and takes the space generated by eigenfunctions of in with eigenvalues to the space generated by corresponding eigenfunctions in . Consequently there is a measure preserving transformation making the following diagram commute:
[TABLE]
By Theorem 53, is conjugate to the rotation of the unit circle by a map . Hence (using additive notation) must be conjugate to a transformation defined on the unit interval of the form for some , where is either or , depending on whether maps to or . Since is an isomorphism, if maps to , can serve as an alternative to the benchmark to the map . Explicitly: the associated to is the number making ; equivalently, .292929The reader is referred to the Caveat at the end of section 5.8, for the reason is used.
Summarizing,
- A.)
If is an isomorphism, then viewed as a map from to , there is a unique for almost every , .
- B.)
If then there is a unique for almost every , .
Definition 66
In cases A.) and B.), we call the map the rotation associated with .
We record the following facts:
Lemma 67
Let be a circular system. Then
The set of associated with automorphisms of form a group. 2. 2.
If and are isomorphisms where and , then where .
It is easy to check that
- •
If are isomorphisms from to with and , then is also an isomorphism from to and , where .
- •
If is an isomorphism from to , and , then .
The second assertion is similar.
Given a rotation , set
[TABLE]
This can be described independently of as:
[TABLE]
It is clear that .
Define a sequence of functions . Each
[TABLE]
For and we have and . All large enough are mature for , and is determined by a tail segment of .
Definition 68
If is mature for both and , let
[TABLE]
and otherwise. (We could have made a more general definition for arbitrary and take when we want to use .)
Explicitly: from the definition of , belongs to the interval in the dynamical ordering of .303030More accurately: if and , then belongs to the interval in the dynamical ordering of . Recall the relationship between symbolic shifts and the towers of intervals in the dynamical ordering given in Section 5.7.
Fix an and suppose that is not a multiple of . Then the interval intersects two geometrically consecutive intervals of the form .
Lemma 69
Suppose that is mature for and . Then belongs to . Thus there are only two possible values for and these values differ by .
Suppose that and . Then . We claim that, relative to those for which is mature for both and , is constant on and on , where it takes values and respectively (see figure 2).
We show that is constant on the first set. Suppose that is mature for and belongs to the interval . Then . Hence . Since we know that . Now suppose that and is mature for and . Let . Then for some . So . Hence
[TABLE]
Thus
[TABLE]
If the proof is parallel.
Finally and fall into consecutive intervals of in the geometric ordering, and hence .
Define and by setting and . Let
[TABLE]
and
[TABLE]
We refer to and as the left lane and right lane respectively.
Notation: Let , be the measures of the left and right lanes at stage .
Lemma 70
Consider and let be the measure of the collection of that are not mature at stage . Then:
, 2. 2.
** 3. 3.
**
In particular if and only if and if and only if .
Let be the collection of that are mature at stage . In the proof of Lemma 69, we showed that is and is , where and . Since there are many levels and the inequalities in item 1 follow. Item 2 is similar. Item 3 follows since . The final assertion follows from Lemma 45.
Restating the discussion:
Lemma 71
For almost all that are mature at stage , where if and if .
Assume that is mature for . Then on its principal -block, the projection of to agrees with .313131Recall is the notation for the unique member of the element of the construction sequence for . The values and are the and the values of the word . From equation 25, . Hence , and the lemma follows.
The items in the following lemma are essentially Remark 12 and Lemma 56 in a different context.
Lemma 72
For almost all and for that are mature for and
If and then the place in the principal -block of is in the place of the principal -block of . 2. 2.
Let be the interval of and the interval of in the dynamical orderings. Then .
This follows from Remark 55 and Lemma 56. To see this note that ; i.e. is in the place of the principal -block of where .
Thus typical points in and are those in which the -block of containing [math] is the shift of the block of containing [math] by and respectively.
We now describe how changes. As varies, measures the shift between and . In regions where the principal -subwords of both and exist and are repeating is constant. It is also constant as it crosses boundary regions of and as long as those boundary regions have length and are lined up with adjacent -subwords. However for , if the boundary section of an -word of or has length not divisible by , the relative alignment between and changes. This happens on regions of .
If is mature for , the principal -word of repeats on both sides of and thus we see:
Lemma 73
If is mature at stage , then is constant on the principal -block of . Moreover on is constant on the even and odd overlaps of 2-subsections of subwords of and .
The next lemma is used for the “nesting” arguments in Section 7.3. It says that the measure of the set of with or can be closely computed as a density in every scale bigger than .
Remark
The notation and are supposed to be suggestive of the left and right lanes. To a close approximation, if is mature and in a left lane then and similarly for the right lanes.
Lemma 74
*Let be natural numbers. Then such that for almost every for which is mature:323232Properly speaking the and notation should indicate as well. Without any contextual indication of what is we take . *
If , then , 2. 2.
If then , 3. 3.
, 4. 4.
* and* 5. 5.
.
As in Lemma 69, let , where (See figure 2). The partition splits each interval into subintervals. Let be the indices of the intervals that lie over or under and . Explicitly: suppose that and . Let
[TABLE]
Then , and if , then either:
[TABLE]
For , put if it satisfies equation 26 and if it satisfies equation 27. It follows that for almost all , if is mature for and , then and similarly for . Since is a partition of and , the lemma follows.
Lemma 75
Let and be a typical member of .
Let . Then converges to in the -topology.
As a result, in the language of symbolic systems: 2. 2.
Let and be the shift map on . Then is either or , depending on the value of and for almost every . 3. 3.
With as in item 2 and an arbitrary circular system with the given coefficient sequence , define and to be the left and right endpoints of the principal -block of . Then for almost all , and .
The first item follows because . Hence converges rapidly to . The second item follows from the first via the isomorphism . The third item follows because and converges to topologically. Hence for all there is an such that for all , the principal -block of is the same as the principal -block of . Since the principal -block of contains the principal -block of and , item three follows.
If and are as in item 3, then:
[TABLE]
7 The Displacement Function
In this section we define a function from to the extended positive real numbers that will eventually be shown to have the properties that
- •
implies that there is an element of the centralizer of having as its associated rotation.
- •
if is built suitably randomly, then every element of the centralizer of , or isomorphism from to has rotation factor with .
The idea behind the displacement function is simple: the number determines and hence a shift at each scale . The words in are of the from . If the shift at stage lines up most -words with other -words in the same argument of then it is possible to build an element of the centralizer of any having rotation factor . If not, and we build suitably randomly, then we can arrange that is not a central value.
Fix for the rest of this section, and let be the shift map. The next lemma says that the principal -blocks of and are exactly aligned.
Lemma 76
Let be typical and be mature for both. Define . Then is in the same position of its principal -block as is in ’s principal -block. In particular, has its 0 in a position inside an -word in the construction sequence for some copy of .
Since the -blocks of repeat on either side of the principal -block of , and these have length , it suffices to show that . Let and consider the point . Then is in the place in its principal -block. By Lemma 72, is in the place in its principal -block. Since , the point has its [math] in the place of its principal -block. Hence and by so by remark 59 .
At first glance Lemma 76 looks puzzling as we are not assuming that any of , or . However the assertion is a statement about how the -towers sit inside the -towers. For mature this nesting repeats on either side of the principal -blocks and hence behaves as in the cyclical approximations. Thus it is independent of the value of or , and simply reflects the cyclical structure.
For a particular , the sequence of shifts converges to . Lemma 76 tells us that this happens promptly: for mature , has its place in the same position of its principal -block as does.
We now consider the location of 0 in the principal -block of the point relative to the position of [math] in the principal -block of . For some and the principal -block of arises from the argument of and the principal -block of is in a position coming from the argument.
Definition 77
*Let . With and as just described, the argument of -matches the argument. The point is well--matched at stage if is mature at and . If is mature for and , then is ill--matched. *
Lemma 78
Let be a circular system and consider . Let and suppose that is mature for and and that is well--matched at stage . Let and . Then:
* and* 2. 2.
if is the interval , then
[TABLE]
Lemma 76 asserts that [math] is in the same place in the principal -block of as [math] is in the principal -block of . Since is mature for , the principal -block of is repeated on either side of . Since is mature for , the principal -block of is repeated at least twice on either side of . It follows that [math] is in the same place in the principal -block of as [math] is in the principal -block of . This proves the first assertion.
A repetition of this argument shows the second assertion as well, using the fact that is well--matched. Indeed the definition of well--matched implies that the principal -words of and are identical. Applying to both, and using the fact that the principal -words repeat one sees that the principal -words and are identical. Since the issue of alignment only involves , item 2 holds for all with . Moreover, arguing as in the last paragraph using the repetition of the principal -blocks, shifting by an does not change this.
Comment
The terminology in this definition extends easily to general circular systems by saying that argument and arguments are -matched in if and only if this is true in , where is the projection of to . Similarly we write for .
7.1 The Definition of
Let be a circular system. Define
[TABLE]
and set333333Since being well or ill-matched only depends on in this section we will not carefully distinguish between and .
[TABLE]
Definition 79
The number is a central value iff
Note that is defined using the block structure of the and hence is determined by together with the sequences and . Thus for the property of being central depends only on the circular coefficient sequence , rather than on the particular circular system .
In section 8.1, we show that if is finite then there is an element in the weak closure of such that In particular is the rotation factor of an element of the centralizer. That result does not use the results of the rest of this section.
7.2 Deconstructing
Fix a . Recall that is the sequence satisfying numerical requirement 2: .
Suppose that is typical, is mature and is ill--matched. Then there are 4 possibilities:
or and 2. 2.
or
Call these possibilities .
Lemma 80
Let with . There is a partition of the set such that for , if is mature for then
* implies * 2. 2.
.
This follows immediately from Lemma 74 by holding fixed and applying the lemma successively to and . Except for a set that has at most elements, every point in belongs to some .
The levels of the -tower reflect the construction of from -words with . If and are mature at stage , then the locations of and in their principal -block and the pair determine whether is ill--matched or not. For particular choices of either all typical in with mature for both and are well--matched or none are.
In the next section we will fix a particular choice of and . For now let and be such that all -mature in configuration are ill--matched. We use the symbol (In LaTeX: \not\Downarrow) to indicate the misaligned points at stage . Let
[TABLE]
We need to localize the sets . The next lemma tells us that they are uniformly close to open sets:
Proposition 81
Let with . Then there is a set such that if , is mature for and , then
* is mature for ,* 2. 2.
* and .* 3. 3.
* and*
[TABLE]
Let be an arbitrary point in that is mature for . Take to be those numbers of the form (where ) such that has its zero point in and is mature for . Then is independent of the choice of . By Lemma 49, the collection of such that is not mature for has density at most .
7.3 Red Zones
Suppose that is not central, i.e. that . Then for some fixed choice of , with belonging to ,
[TABLE]
is infinite. Fix such an . Then with this choice for all is well-defined, and moreover there is a set such that if belong to then and
[TABLE]
Let be a point in such that all of the shifts of and are generic with respect to basic open sets, the ’s, , and the sets , . For large enough , we describe how to use and to identify a subset of the interval consisting of misaligned points and having density arbitrarily close to one.
Assume that and is mature for and . In defining , the choice that together with , give us the relative locations of the overlap of the principal -blocks of and .
Let be the principal -block of and be the principal -block of . and assume that they are in the position determined by . By Lemma 42, on the overlap the 2-subsections of split the 2-subsections of into either one or two pieces, and the positions of all of the even pieces are shifted by the same amount relative to the 2-subsections of and similarly for the odd pieces.
We analyze the case where occurs in an -block where the 2-subsections are split into two pieces. If they are only split into one piece (i.e. they aren’t split) the analysis is similar and easier. Without loss of generality we will assume that occurs in an even overlap.
Since neither , nor occur in the first or last 1-subsections of the principal 2-subsection that contains them, we know that the overlaps of the principal 2-subsections of and contain at least 1-subsections. The 0-subsections of the form of each 1-subsection of in this overlap are split into at most three pieces, powers of the form , and where , and the middle power crosses a boundary section of . The powers and are constant on the overlap of the 2-subsections, constant in all of the even pieces of the overlap of the 2-subsections of the principal -block, and are determined by . Moreover, . Again, without loss of generality we assume that is in the left overlap corresponding to the power .
Observation: There is a number between [math] and determined by the pair such that the even piece of a 2-subsection that contains is of the form , except that the last 1-subsection may be truncated. Moreover, since is constant for in the principal -block of , if
[TABLE]
then and for all the powers are -matched with except for portions of the first and last power.
In particular, if is such that the [math] position of lies in the interior of initial power in an even overlap and , then because it is lined up with .
Lemma 82
Let and suppose that and are generic, and that is mature at . Suppose that . Then there is a set such that if and , then:
* has its zero located in ,* 2. 2.
* is mature for ,* 3. 3.
, 4. 4.
There is a and a such that is:
- (a)
a union of sets, each of the form 2. (b)
each set is a subset of a position of an occurrence in of an -subword of (with ), 3. (c)
each is a collection of positions non--boundary positions in such that is -matched with , except perhaps for the first or last copy of in
and 4. (d)
each set is the collection of all non--boundary positions in in a block of the form .
and
[TABLE]
The first statement is automatic since . Let be as in Proposition 81. If then, as in the discussion before the statement of Lemma 82, occurs in the position of a power , where is the principal -block of and occurs on the left overlap of 1-subsections of the principal -block of .
As in the observation before this lemma, to each we can associate a set containing by taking all of the positions of the powers in the even overlap determined , where is not in the boundary of a . Let be the union of all of the collections as ranges over .
Assertion 4(c) follows from the observation and the fact that and are constant (and equal to and ) on .
We show that if then is mature for and that . The maturity of follows immediately from the maturity of and the fact that the location of 0 in is in a non-boundary portion of an -subword of its principal -block. That follows from the fact that is -matched with , and .
To finish, note that
[TABLE]
Hence
[TABLE]
Thus Lemma 82 follows from Lemma 81.
We now define the red zones corresponding to . Recall that if then and . For consecutive elements of , define
[TABLE]
Then we see that:
- •
, so
- •
and if is the set defined in Lemma 82, then .
Lemma 83
Let be a natural number and . Suppose that and are generic, and that is mature at . Then there is a sequence of natural numbers , an and sets , for , such that
* and ,* 2. 2.
* is disjoint from for ,* 3. 3.
* is a union of blocks of the form described in condition 4 in Lemma 82 inside -subwords of * 4. 4.
if , then , 5. 5.
the density of in is at least .
We can assume that is so large that has measure less than and . From the definition of we can find a collection of consecutive elements of so that
[TABLE]
Choose an , and for notation purposes set .
Define sets and by reverse induction from to with the following properties:
i.
consists of entire locations of words in ,
ii.
and has relative density at least ,
iii.
the set and hence,
iv.
has density less than or equal in
To start, apply Lemma 82 with , to get a set of density at least satisfying conditions 3-4 of the lemma we are proving. Set . Let .
Suppose that we have defined and satisfying the induction hypothesis (i-iv).
Apply Lemma 82 again to get a set a subset of . Inside each copy corresponding to a location in of a in the complement of , we have a translated copy of , . Let be the union of the sets where runs over the locations the words in the complement of .
Then the density of relative to
[TABLE]
is at least . It follows from conclusion 3 of lemma 82 that is a union of non-boundary portions of blocks of length corresponding to positions of in ,
Since consists of a union of the non-boundary portion of locations of words ,
[TABLE]
consists of the entire blocks of locations of together with elements of . The latter set has density less than or equal to . Let
[TABLE]
It remains is to calculate the density of . At each step in the induction, we remove a portion of density at least from . Let . Then the density of the union of the ’s is at least
[TABLE]
which is at least
8 The Centralizer and Central Values
In the first part of this section we show that every central value is rotation factor of an element of the closure of the powers of and hence an element of the centralizer.
The second part shows a converse: if is built sufficiently randomly then the rotation factor of every element of the centralizer is a rotation by a central value.
We note in passing that every circular system is rigid: if is mature for , then has the same principal -block as does. It follows that is a perfect Polish monothetic group.
8.1 Building Elements of the Centralizer
If is finite, then the Borel-Cantelli lemma implies that for -almost every , there is an such that for all , is well--matched at stage . As a consequence, certain sequences of translations converge. Precisely:
Theorem 84
Suppose that is a uniform circular system with coefficient sequence . Let be the shift map on and be a number such that . Then there is a sequence of integers such that converges pointwise almost everywhere to a with . In particular there is a sequence such that converges in the weak topology to a with .
Corollary 85
If is central, then there is a such that .
Let be the tree of finite sequences . Choose an such that
[TABLE]
has positive measure. By the König Infinity Lemma there is a function such that for all , for all with has positive measure. Let .
By Lemma 75, item 3 it follows that for a typical the left and right endpoints of the principal -blocks of go to negative and positive infinity respectively. Let be a typical element of ; e.g. and both belong to , large enough are mature for and for all large , is well--matched at stage . Then for all large , the left and right endpoints of the principal -block of and are the same. If is well--matched at stage , then the words constituting principal -block of and are the same. It follows that for typical , the sequence converges in the product topology on .
We now show that the map is one-to-one. If , then either or there is an such that for all the principal -blocks of and differ. We can assume that this is so large that is mature and well--matched for .
If , then . Hence the limits of and differ. So assume that . Then, since is a translation by at most and is mature for all parties (so the principal -blocks of and repeat) we know that the principal -blocks of and differ. But for all , the principal -blocks of agree with the principal -blocks of (and similarly for ). Hence for all the principal -blocks of and differ. It follows that the limit map is one-to-one.
We need to see that for almost all belongs to . By definition of this is equivalent to showing that for almost all if is an interval, then is a subword of some for some . However, by Lemma 78, for almost all we can find an so large that:
, 2. 2.
and agree on the location of the principal -block of containing , and 3. 3.
and agree on what word lies on the principal -block.
Since the principal -block of belongs to , we are done.
Summarizing, if , then for almost all , is defined and belongs to . Moreover is one-to-one and commutes with the shift map.
Define a measure on by setting . Then is a non-atomic, shift invariant measure on . By Lemma 38, we must have . In particular we have shown that is an invertible measure preserving transformation belonging to , with .
We make the following remark without proof as it is not needed in the sequel:
Remark 86
Suppose that satisfies the hypothesis of Theorem 84 and is a central value. Then for any sequence of natural numbers such that converges to sufficiently fast, the sequence converges to a with .
8.2 Characterizing Central Values
The main result of this section is a converse of Corollary 85. If is a circular system built from sufficiently random collections of words and is an isomorphism between and then for some central . Moreover, if is an isomorphism between and then is of the form for some central .
In this section we will return to considering as with the forward shift, and hence can use instead of .
8.2.1 The Timing Assumptions
Randomness assumptions about the words in the ’s will allow us to assert that that the rotations associated with elements of the centralizer of or isomorphisms between and arise from central ’s. The last part of the paper shows that these additional randomness assumptions are consistent with the randomness assumptions used in [8] and describes how to build words with both collections of specifications.
Recall from Definition 34, that to specify a circular system with coefficient sequence it suffices to inductively specify collections of prewords , and define as the collection of words:
[TABLE]
In the construction, there will be an equivalence relation on that is lifted from an analogous equivalence relation on the first step of the odometer construction . It is built in section 10; we describe its properties here. Let be the sequence of propagations of . As the construction progresses there are groups acting freely on the set of equivalence classes of words in . Each is a finite sum of copies of . Inductively, or . The action of on arising from the action via the inclusion map of into is the skew-diagonal action. We will write for the -equivalence class of a and for the orbit of under . If and then we say that occurs at if there is a sitting on the interval inside and .
Numerical Requirement 4
. This can be satisfied by taking .
We note that is determined directly by the first -nodes in tree we are using in the domain of the reduction, and hence is determined by the tree. So this requirement on does not depend on any of the other variables being chosen during the construction. In what follows we call such requirements absolute requirements.
Notation: As an aid to tracking corresponding variables, script letters are used for sets and non-script Roman letters for the corresponding cardinalities. For example we will use for an equivalence relation and for the number of classes in that equivalence relation.
Here are the the assumptions used to prove the converse to Corollary 85. The first three assumptions follow immediately from the definitions in section 5.10.
- T1
The equivalence relation is the equivalence relation on propagated from . 2. T2
acts freely on 3. T3
The canonical generators of send elements of to elements of and vice versa.
The next axiom states that the classes are widely separated from each other.
- T4
There is a such that such that for each and each pair and each if , then:
[TABLE]
Remark 87
In the axioms we write to mean that where .
Numerical Requirement 5
* is chosen small relative to . Explicitly: if then .*
In the next assumption we count the occurrences of particular -word that are lined up in an -preword with the occurrences of a particular -class in the shift of another -preword or its reverse. The shift (by n-subwords), must be non-zero and be such that there is a non-trivial overlap after the shift.
- T5
Let be prewords in , and be either or . Write and , with . Let or according to whether or . For all integers with , :
- T5a
(This is comparing with .) Let
[TABLE]
Then
[TABLE] 2. T5b
(This is comparing with .) Let
[TABLE]
Then
[TABLE] 2. T6
Suppose that are prewords, and . Let
[TABLE]
Then:
[TABLE] 3. T7
Let be prewords in , and be either or . Suppose that . Write and , with . Let or according to whether or . Then for all if
[TABLE]
then
[TABLE]
Definition 88
We will call the collection of axioms T1-T7 the timing assumptions for a construction sequence and an equivalence relation .
8.2.2 Codes and -Distance
We now prove some lemmas about .343434Basic notation and facts about stationary codes are reviewed in section 4.4.
Lemma 89
Let and . Let and be intervals in of length . Let be the intersection of the two intervals. Put on and on and suppose that all but (possibly) the first or last copies of are included in . Let be a stationary code such that the length of is less than . Then:
[TABLE]
Since the length of the code is much smaller than and , the end effects of are limited to the first and last copies of and thus affect at most proportion of . Removing the portion of across from the first or last copy of leaves a segment of of proportion at least .
For all of the copies of , except perhaps at most one at the end of , there is a corresponding copy of that overlaps in a section of at least . Discard the portions of arising from copies of not overlapping the corresponding copies of . After the first two removals we have a portion of of proportion at least .
Because and have the same lengths, the relative alignment between any two corresponding copies of and in the powers and are the same. In particular, the “even overlaps” and “odd overlaps” are the same in each remaining portion of the corresponding copies of and .
By Lemma 42, there are such that on the even overlaps all of the -subwords of are either lined up with an -subword of or with a boundary section of , and all of the -subwords of in an odd overlap are lined up with an -subword or a boundary section of by .
Either the even overlaps or the odd overlaps contain at least of the -subwords that are not across from boundary portions of . Assume that of the -subwords lie in even overlaps and discard the portion of on the odd overlaps. (If more than of the -subwords are in odd overlaps we would focus on those.)
Let on the even overlaps. Denote any particular copy of in as . Then, except for -words that get lined up with a boundary section of , every -subword of coming from an even overlap of gets lined up with an -subword of . Write and (or, respectively, ). Then each -subword of coming from an even overlap is of the form for some . There is a such that for all if occurs in any copy of and comes from an even overlap then either:
- a.)
is lined up with (respectively ) or 2. b.)
is lined up with a boundary portion of or 3. c.)
is lined up with (respectively ).
On copies of coming from even overlaps of 2-subsections the powers of in alternatives a.) and c.) are constant. Since the even overlaps of the 2-subwords has size at least half of the lengths of the 2-subwords, it follows that .
Since all of satisfies a.), b.), or c.), after discarding the ’s in case b.) half of the remaining ’s satisfy a.) or c.). Keep the larger alternative and discard the other. What is left after all of the trimming has size at least:
[TABLE]
proportion of .
For some what remains consists of -subwords in even overlaps of that, after being shifted by to be subwords of , are aligned with occurrences of -subwords of of the form ( respectively). For the rest of this proof of Lemma 89 we will call these the good occurrences of -subwords.
Claim: Suppose that and let
[TABLE]
Let or depending on whether or . Then
[TABLE]
is bounded by .
We prove the claim. We have two cases:
Case 1: .
In this case we have a trivial split in the language of section 5.4. The overlap of the 2-subsections contains the whole of the two subsections except for a portion of one 1-subsection. Since we can apply axiom T7 to the words and . The claim follows from inequality 35, which is the preword version of formula 37, after taking into account the boundary and the words at the ends of the blocks of and the truncated 1-subsections.
Case 2: .
In this case the split is non-trivial. Because the even overlaps are at least as big as the odd overlaps of 2-subsections, the even overlap looks like:
[TABLE]
but with a portion of its last 1-subsection possibly truncated. In particular it has an initial segment of the form
[TABLE]
where .
It follows from the timing assumption T5 that if some element of occurs across from a word starting at in the first 1-subsections then
[TABLE]
Any variation between the quantity in formula 37 and the estimate in T5 is due to the portion of the last 1-subsection of the even overlaps. This takes up a proportion of the remaining even overlap less than or equal to . This proves the Claim.353535The axiom T5b takes care of the case where the relevant overlaps is odd.
We now shift back to be and consider . There is an such that all of the good occurrences of a in are in a power . Depending on whether or , for each good occurrence of a in either:
- a.)
there are at least powers of in the corresponding occurrence in such that their left overlap with has length at least
or 2. b.)
there are at least powers of in the corresponding occurrence in such that their right overlap with has length at least
Without loss of generality we assume alternative a.). Suppose that the overlap has length in all of the good occurrences. Then the left side of overlaps the right side of by at least .
By axiom T4, if ,
[TABLE]
and
[TABLE]
then . It follows that if we fix a and let
[TABLE]
then
[TABLE]
is less than .
Since at least proportion of consists of left halves of good occurrences of the various ’s belonging to it follows that
[TABLE]
The lemma follows.
8.2.3 Elements of the Centralizer
In this section we prove the theorem linking central values to elements of the centralizer of .
Theorem 90
Suppose that is a circular system built from a circular construction sequence satisfying the timing assumptions. Let be an automorphism of . Then for some central value .
This is a condition that does not involve any of the other variables being chosen: at the moment when is being chosen is already determined by the tree in the domain of the reduction.
Fix a and suppose that . We must show that is central. Suppose not. The idea of the proof is to choose a stationary code well approximating and an such such for all , passing over the principal -block of most with gives a string very close to in -distance. Consider an where codes well on this principal -block.
Use Lemma 83 to build a red zone corresponding to . Lemma 89 implies that cannot code well on the red zone. Since the red zone takes up the vast majority of the principal -block, cannot code well on the principal -block, yielding a contradiction. In more detail:
Let be as in Axiom T4. By Proposition 20 there is an code such that for almost all ,
[TABLE]
By the Ergodic Theorem there is a so large that for a set of measure for all and all , is mature for and if is the principal -block of then
[TABLE]
Let . Choose an such that the code length of is much smaller than , and . Apply Lemma 83, with to find an and satisfying the conclusions of Lemma 83. Since we view as a subset of the principal -block of .
Each is a union of collections of locations of the form , with each consisting of the locations of for (for some ).363636 is as in condition 4.c) of Lemma 82. Moreover there is a such that each power is -matched with a in for some .
Because axiom T6 applies and thus for at least proportion of , and are in different -orbits. In Lemma 89, inequality 36 implies that if and are in different orbits then, restricted to the overlaps of the locations of all of the and , the distance between and is at least . Since the first and last powers of in ’s take up of and , we know that
[TABLE]
Because the proportion of ’s for which and are in different -orbits is at least it follows that
[TABLE]
is at least
[TABLE]
This in turn is at least . Since is a union of sets of the form :
[TABLE]
Since has density at least if is the principal -block of :
[TABLE]
However this contradicts the inequality 39.
Corollary 91
Let be a circular system built from a circular construction sequence satisfying the timing assumptions. Then is a central value if and only if there is a with . It follows that for each construction sequence satisfying the Numerical Requirements collected in Section 11, the central values form a subgroup of the unit circle.
Theorem 84 says that if is central, then there is a with . Theorem 90 is the converse. To see the last statement we prove in Section 10 that for every coefficient sequence satisfying the Numerical Requirements, we can find a circular construction sequence satisfying the timing assumptions.
8.2.4 Isomorphisms Between and
We now prove a theorem closely related to Theorem 90
Theorem 92
Suppose that is a circular system built from a circular construction sequence satisfying the timing assumptions. Suppose that is an isomorphism. Then for some central value .
We concentrate here on the differences with the proof of Theorem 90. The general outline is the same: Fix a . Then there is a unique such that . Suppose that is not central. Choose a stationary code that well approximates in terms of distance (say within ), and derive a contradiction by choosing a large and getting lower bounds for distance along the principal -block of a generic .
This is done by first comparing a typical with . As in Theorem 90, a definite fraction of a large principal -block of is misaligned with . But most of the -blocks of are aligned with reversed -blocks of that have been shifted by a very small amount. This can be quantified by looking at the codes for large , which agree with on the -block of .
Here are more details. Recall is the limit of a particular sequence of stationary codes . The proof of Theorem 60 showed that for almost all for all large enough the principal -blocks of and agree. Fix a generic and a large such that:
the code codes well on the principal -block of for all , 2. 2.
for all the principal -blocks of and agree, 3. 3.
is mature at , 4. 4.
the length of is very small relative to and 5. 5.
is very large.
Comparing and , Lemma 83 gives us an and a red zone in the principal -block . We assume that the red zones take up at least proportion of the principal -block and have the form given in Lemma 83.
We will derive a contradiction by showing that cannot code well. This is done by considering the blocks of that are lined up with the red zones of the principal -block of and using Lemma 89 to see that cannot code well on these sections. This is possible because the mismatched -blocks of are lined up closely with the -blocks of . Explicitly:
Use Lemma 83, to choose red zones that take up a proportion of the principal -block of .373737We use the notation in Lemma 83 and Theorem 60.
The boundary portions of -words with take up at most proportion of the overlap of the principal -blocks of and . Since this proportion is so small, Remark 22 allows us to completely ignore blocks corresponding to -words in that are lined up with boundary in and vice versa.
We now examine the how compares with . Temporarily denote by . By the choice of , for all the alignments of the principal -blocks of and agree.
The red zones of line up blocks of the form with blocks of the form occurring in that are shifted by (so ). Except for those blocks that line up with boundary portions of these blocks are lined up with blocks of the form in .383838See the qualititative discussion of that occurs after its definition in [12]. Inequality 21, says that . In particular the blocks of powers of are lined up with a very small shift of in .
Thus vast majority of blocks that are positions of in are lined up with a shift by less than of a block of in a position of in . Consider and . Suppose that are the -words of corresponding to the and are the -words of across from them. By axiom T5a, at most of the happen to have . At least proportion of the powers of the -distance between and is at least .
It follows that on the -distance is at least . If we choose to have density at least and let be the principal -block of then (as in Theorem 90)
[TABLE]
a contradiction.
8.3 Synchronous and Anti-synchronous Isomorphisms
View a circular system as an element of the space MPT endowed with the weak topology.
Theorem 93
Suppose that is a circular system satisfying the timing assumptions. Then:
If there is an isomorphism such that , then there is an isomorphism such that and is the identity map. 2. 2.
If there is an isomorphism then there is an isomorphism such that .
To see the first assertion, let be an isomorphism with . Then by Theorem 90, for a central . Corollary 91 implies that there is a such that . Then is an isomorphism such that is the identity map. Since is a group, .
The proof of the second assertion is very similar. Suppose that is an isomorphism. Then, by Theorem 92, for a central . Let be such that . Then is an isomorphism between and with .
9 The Proof of the Main Theorem
In this section we prove the main theorem of this paper, Theorem 2. By Fact 24, it suffices to prove the following:
Theorem 94
There is a continuous function such that for , if :
* has an infinite branch if and only if ,* 2. 2.
* has two distinct infinite branches if and only if*
[TABLE]
We split the proof of this theorem into three parts. In the first we assume the timing assumptions hold, define and show that it is a reduction. In the second part we show that is continuous.
The third part of the proof augments the specifications of [8] with two additional randomness properties, shows that the additional properties imply the timing assumptions and describes how to perform the word construction from [8] with these additional requirements. We present the third part of the proof separately in Section 10.
We begin by defining . The main result of [8] relied on the construction of a continuous function such that for all , if then:
Fact 1
has an infinite branch if and only if ,
Fact 2
has two distinct infinite branches if and only if
[TABLE]
Fact 3
The function took values in the strongly uniform odometer based transformations and for in the range of , if and only if there is an anti-synchronous isomorphism between and .
Fact 4
([8], Corollary 40, page 1565) If is in the range of and then there is a synchronous such that for some , non-identity element and all generic and all large enough , if and are the principal -subwords of and respectively then:
[TABLE]
Fact 5
(Equations 1 and 2 on pages 1546 and 1547 of [8]) For all there is an such that if and are trees and393939See Section 4.6 for notation.
[TABLE]
then the first -steps of the construction sequences for are equal to the first -steps of the construction sequence for ; i.e. .
Fact 6
The construction sequence for satisfies the specifications given in [8]. In Section 10.2, these specifications are augmented by the addition of J10.1 and J11.1. In Section 10.3 we argue that if is a construction sequence for an odometer based system that satisfies the augmented specifications, then the associated circular construction sequence satisfies the timing assumptions.
Moreover the construction sequence for is strongly uniform and hence the construction sequence for is strongly uniform.
Fact 7
Construction sequences satisfying the augmented specifications are easily built using the techniques of [8] with no essential changes; consequently we can assume that the the construction sequences for satisfy the augmented specifications.
In [13] (Theorem 60) it is shown that if is a strongly uniform circular construction sequence with coefficients , where grows fast enough and goes to infinity then there is a smooth measure preserving diffeomorphism measure theoretically isomorphic to . This gives a map from circular systems with fast growing coefficients to .
If is the canonical functor from odometer systems to circular systems we define
[TABLE]
.
9.1 is a Reduction
Because preserves isomorphism, to show that is a reduction, it is suffices to show that is a reduction. Let be the transformation corresponding to the system and the transformation corresponding to .
Item 1 of Theorem 94: Suppose that is a tree and has an infinite branch. By Facts 1 and 3, there is an anti-synchronous isomorphism . By Theorem 105 of [12], if , there is an isomorphism .
Now suppose that , then . By Fact 6, the construction sequence for satisfies the timing assumptions. By Theorem 93, there is an anti-synchronous isomorphism . Again by Theorem 105 of [12], there is an isomorphism between and . By [8], has an infinite branch.
Item 2 of Theorem 94: Suppose that has at least two infinite branches. Then the centralizer of is not equal to the powers of . By Fact 4, we can find a synchronous . Let , then is synchronous. We claim that . Using Fact 4, and lifting the group action of and the equivalence relation , we see that for all generic , and all large enough , if and are the principal -subwords of and , respectively, then:
[TABLE]
for some . In particular, .
By the timing assumption T4, there is a such that for all large and all shifts with of size less than , we have
[TABLE]
Suppose that . Then, by Proposition 21, we can find an and a generic such that
[TABLE]
But inequality 41 and the Ergodic Theorem imply that for large enough if and are the principal -blocks of and then
[TABLE]
contradicting inequality 40.
Now suppose that there is a such that . Then by Theorem 93, there is such a that is synchronous. In particular, for all , . Thus if is the transformation corresponding to , belongs to the centralizer of and is not a power of .
9.2 is Continuous
Fix a metric on yielding the -topology. For each circular system , let be the sequence of collections of prewords used to construct . By Proposition 61 of [13], given and a -neighborhood of , there is a large enough , for all if , then . For all odometer based transformations, the sequence determines . Hence for all , if the first members of the construction sequence for are the same as the first members of the construction sequence for , then . By Fact 5, there is a basic open interval that contains and is such that the first members of the construction sequence are the same for all . It follows that for all .
9.3 Numerical Requirements Arising from Smooth Realizations
The construction of depends on various estimates that put lower bounds on the growth of the coefficient sequences. We now list these numerical requirements. The claims in this subsection presuppose a knowledge of [13].
The map depends on various smoothed versions of the permutations of the unit interval arising from . To solve this problem, we fix in advance such approximations, making sure that each approximation agrees sufficiently well with as to not disturb the other estimates.
This introduces various numerical constraints on the growth of the ’s. The diffeomorphism is built as a limit of periodic approximations . To make the sequence of ’s converge at each stage, must be chosen sufficiently large. Thus the growth rate of depends on , . Since there are only finitely many possibilities for ’s corresponding to a given sequence , we can find one growth rate that is sufficiently fast to work for all choices of ’s. This is discussed in detail on page 34 of [13], where the lower bound is called .
Numerical Requirement 6
* is big enough relative to a lower bound determined by , and to make the periodic approximations to the diffeomorphism converge. Moreover .*
Remark 95
Choosing close to is a fundamental idea of the method of Approximation by Conjugacy, due to Anosov and Katok. By equations 9 and 10, this is equivalent to taking large. The magnitude of is not calculated, but instead it shown that as increases a sequence of periodic diffeomorphisms well approximates a given periodic diffeomorphism. Then in the original sources [1] and [18], one simply takes sufficiently large. This is what requirement 6 is repeating.
The argument for the ergodicity of the diffeomorphism formally required that:
Numerical Requirement 7
* goes to infinity as goes to infinity, is a multiple of .*
The reader is referred to example 5 for a discussion of and its growth.
The next requirement makes it possible to choose and then, by making sufficiently large, construct sufficiently random words using elements of .
Numerical Requirement 8
**
10 The Specifications
In this section we describe how the timing assumptions are related to the specifications given in [8], show that they are compatible and indicate how to construct odometer words so that both sets of assumptions hold. This completes the proof of Theorem 94, subject to the verification that all of the Numerical Requirements we have introduced are consistent with the numerical requirements of [8]. We take this up in section 11. We will assume that the reader is familiar with sections 7 and 8 of [8].
10.1 Corresponding Specifications
Figure 4 gives a table that links the Timing Assumptions we use in this paper to the corresponding Specification in [8]. (We remind the reader that Appendix A has a table giving corresponding notation betwen [8] and this paper.)
Specification T4 doesn’t directly correspond to one of the Specifications, but (as we will show) holds naturally in the circular words lifted from an odometer construction satisfying the specifications.
Numerical Requirement 9
In the current construction we have two summable sequences: and . We use the lunate “” notation for the specifications from [8] and the classical “” notation (“varepsilon” in LaTeX) for the numerical requirements relating to circular systems and their realizations as diffeomorphisms. A requirement for the construction is that
[TABLE]
We also assume that the ’s are decreasing and .
10.2 Augmenting the Specifications from [8]
The paper [8] constructs a reduction from the space of trees to the odometer based systems. The system was built according to a list of specifications which we reproduce here in order to show how to strengthen them to imply the timing assumptions used in the proofs of Theorems 93 and 94 and to verify that the strengthened assumptions are consistent. The specifications directly relevant to the timing assumptions are J10 and J11. The others, which describe the scaffolding for the construction, are only relevant in that they set the stage for the application of the functor defined in section 5.
Here are some definitions from [8] that are used in the specifications. We advise the reader that a table giving the notational changes between [8] and this paper is in Appendix A.
Fix an enumeration of the finite sequences of natural numbers, , with the property that if is an initial segment of then is enumerated before . Let be a tree whose elements are . Here are the specifications for the construction sequence used to build .
There is a sequence of groups built as follows. For all , is the trivial group and if we let
[TABLE]
then
[TABLE]
i.e. is a direct sum of copies of indexed by elements of . There are canonical homomorphisms from to that send a generator of corresponding to a sequence of the form to the generator of corresponding to .
The sequence , equivalence relations and the group actions of are constructed inductively. The words in are sequences of elements of . To start and is the trivial equivalence relation with one class. The collection of words is built when the element of is considered. We will say that words in have even parity and words in have odd parity.
We begin by restating the specifications from [8] using the indexing conventions in this paper ( vs ). E1-A9 are exactly the same, however we modify the joining specifications J10, J11 slightly for the needs of this paper.
- E1.
Any pair of words in have the same length. 2. E2.
Every word in is built by concatenating words in . Every word in occurs in each word of exactly times, where is a large prime number chosen when the element of is considered. 3. E3.
(Unique Readability) If and
[TABLE]
where each and or are sequences of [math]’s and ’s that have length less than that of any word in , then both and are the empty word. If and and with , and we have ; i.e. the first half of is not equal to the second half of .
Let be the length of the longest sequence among the first sequences in and if then is the least such that has length .
The equivalence relations on are defined for all . The equivalence relation on is the trivial equivalence relation with one class.
- Q4.
Suppose that . Then any two words in the same equivalence class agree on an initial segment of proportion least . 2. Q5.
For , is the product equivalence relation of . Hence we can view as sequences of elements of and similarly for . 3. Q6.
refines and each class contains many classes, where is a strictly increasing function. The speed of growth of is discussed in section 11. 4. A7.
acts freely on and the action is subordinate to the action via the natural homomorphism from to . 5. A8.
The canonical generators of send elements of to elements of and vice versa. 6. A9.
If and we view then the action of on is extended to an action on by the skew diagonal action. If is non-trivial then and its canonical generator maps to .
Note:
While it is not explicitly given as a specification in [8], the construction sequence has the property that if is a canonical generator, then for is closed under the skew diagonal action of .
Suppose that and are elements of and an ordered pair from . Suppose that and are in positions shifted relative to each other by units. Then an occurrence of in is a such that occurs in starting at and in starting at . Let be the number of classes of and be the number of elements of each class.404040We have changed the variables used in the statement of J10 in [8] to conform to the notation described in the appendix A.
To prove the timing assumptions we need to strengthen specifications J10 and J11 to deal with -distance on initial and tail segments and on words that are shifted. The spirit of specification J10 is that pairs of -words occur randomly in the overlap of and when is shifted by a suitable multiple of the lengths of -words. J10.1 says the same thing relative to non-trivial initial segments of the overlap of the shift of and .
The specification says that if is in the -orbit of and is maximal with this property, then the occurrences of are approximately conditionally random. More explicitly, suppose that , and we are given . Then there are many pairs of -classes with , and so should occur randomly proportion of the time. There are many elements of in the -classes, and conditional on , the chances of such a pair randomly matching is . The specification J11.1 strengthens this (but only for , which is the trivial equivalence relation and ) by asking that this holds over any non-trivial interval of length at the beginning or end of an -word.
Here are the joining specifications as given in [8]:
- J10.
Let and be elements of . Let be an integer. Then for each pair such that has the same parity as and has the same parity as , let be the number of occurrences of in on their overlap. Then
[TABLE] 2. J11.
Suppose that and . We let be the maximal such that there is a such that . Let be the unique with this property and be such that . Let be the number of occurrences of in . Then:
[TABLE]
The strengthening of J10 is:
J10.1
Let and be elements of . Let . Let be a number between and . Then for each pair such that has the same parity as and has the same parity as , let be the number of such that occurs in in the position in their overlap. Then
[TABLE]
The next assumption is a strengthening of a special case of J11.
J11.1
Suppose that and and .414141In the language of J11: , and . Let be a number between and . Suppose that is either an initial or a tail segment of the interval having length . Then for each pair such that has the same parity as and has the same parity as , let be the number of occurrences of in . Then:
[TABLE]
We have augmented the specifications in [8] with J10.1 and J11.1. Formally we must argue that it is possible to build construction sequences satisfying the additional specifications. This means constructing many pseudo-random words. This is done using a routine modification of the techniques of [8], where the collections of words are built probabilistically. For the words in are built by iteratively substituting words into -sequences of classes , by induction on where is maximal with . The classes of words are built by induction on . A word (or in if ) can be viewed as a result of substituting elements of (or ) into a word in .
Suppose that has been built and is given by many consecutive classes . Then . Viewing these as independent trials and taking large enough (so that is very large) the finitary Law of Large Numbers shows that the vast majority of choices of words satisfy J10, J10.1, J11 and J11.1
Remark 96
As noted in Example 5, given the number of substitutions to be made (which is one more than the maximal such that is defined) and the size of the groups one can give an explicit formula relating the sizes of and . Given one of the two one can solve for the other. Moreover when one goes up the other does as well. This co-determination means that the requirements can be stated in terms of either variable. We state the requirements in terms of the ’s.
In the construction, getting the additional .1 for J10 and J11 only involves taking larger than was necessary in [8]. This is described in this notation in [7].
This leads to a numerical requirement:
Numerical Requirement 10
* is chosen sufficiently large relative to a lower bound determined by for the Law of Large Numbers arguments to work.*
10.3 Verifying the Timing Assumptions
In this section we prove that the augmented specifications E1-J11.1 imply the timing assumptions, introduced in Section 8.2.1. The first three timing assumptions T1-T3 follow easily from the results in section 5.10 together with specifications Q5, Q7 and A8.
The following remark is easy and illustrates the idea behind the demonstrations of T4-T7.
Remark 97
Suppose that is an alphabet with symbols in it and with . For words in of the same length and , set to be the number of occurrences of in , to be the number of occurrences of some element of opposite an occurrence of in and to be the number of occurrences of in . Then for all there is a such that whenever are two words in of the same length , if for all ,
[TABLE]
then for all :
[TABLE]
Because , by taking sufficiently small we can arrange that
[TABLE]
and the approximation improves as gets smaller. Simplemindedly:
[TABLE]
Since we see that
[TABLE]
As we take smaller the final approximation improves.
We now establish the timing assumptions T4-T7. Recall that in the context of the timing assumptions the notation means that .
Assumption T5: Assume that specification J10 holds for sufficiently small . To use remark 97 to see T5, take , the number to be and to be the cardinality of any equivalence class of and . Since each class of has the same number of elements, is equal to the number of classes: . Thus and T5 follows.
Assumption T6: We can write the set as:
[TABLE]
which can be written in turn as:
[TABLE]
Thus, using J10.1, we can estimate the size of as
[TABLE]
Since we can simplify this to . The assumption T6 follows.
Assumption T7: Under the assumption that , and is the trivial equivalence relation. The estimate in J11 simplifies to:
[TABLE]
To apply Remark 97, we again set and and , in the language of the remark. With this notation, and equation 42 is the hypothesis of Remark 97. The conclusion of the remark is that
[TABLE]
Since , assumption T7 follows.
We note that the verification of T5-T7 uses remark 97 for a small enough . We make this a requirement on .
Numerical Requirement 11
[TABLE]
Assumption T4: T4 is the hardest timing assumption to verify. We motivate the proof by remarking that if are long mutually random words in a language that has letters, then . Thus and are far apart. Specifications J10.1 and J11.1 imply that most and their relative shifts are nearly mutually random. We use this to establish that and are distant in .
Numerical Requirement 12
, the ’s are increasing and is finite.
Let
[TABLE]
For , set:
[TABLE]
and finally:
[TABLE]
Assumption T4 says that if are not -equivalent, then the overlaps of sufficiently long initial segments, or sufficiently long tail segments or of a sufficiently long initial segment with a tail segment of and are at least distant in . In T4 sufficiently long means at least half of the length of the word. We prove something stronger by induction on :
Proposition 98
Let and with . Let be an initial segment and be a tail segment of of of the same length . Then we have:
[TABLE]
We will consider the situation where . The situation where they both belong to follows, and the argument in the case where have different parities is a small variation of the basic argument.
The strategy for the proof is to consider -words and and gradually eliminate small portions of and so that we are left with only segments of -words that align in and in such a way that they have large -distance. The remaining portions of the and are far apart and they constitute most of the segments of each word. By Remark 22, we get an estimate on the distance of and .
Suppose that
[TABLE]
and let .
A general initial segment of a word has the following form with . For some :
[TABLE]
where is a possibly empty, possibly incomplete -word, , , . This is a block of complete 2-subsections, followed by a block of complete 1-subsections, followed by a possibly empty, incomplete 1-subsection.
Similary a general tail segment as the following form:
[TABLE]
Initial Segments: We now argue for inequality 44. To start we take . In this case and . The initial segment are of the form where is a proper initial segment of a word of the form that has length , for some .
If we throw away the tail segment we have thrown away proportion . Since we have removed a portion of less than and the segment that is left has proportion at least and is made up of a product of many 1-subsections.
We now consider . Since , one of the following holds:
There are no complete 2-subsections, in which case we must have
. 2. 2.
There is at least one complete 2-subsection and . 3. 3.
There is at least one complete 2-subsection and .
In the first case, since we know that . Thus eliminating the partial 1-subsection at the end we are left with a concatenation of at least complete 1-subsections and we have removed less than portion of . Similarly in the second case we can eliminate the incomplete 1-subsection at the end by removing proportion less than of . In the final case by removing both the final incomplete 1-subsection and we eliminate at most proportion of .
In all three cases, we are left an such that and are made up of a possibly empty initial segment of complete 2-subsections followed either by no complete 1-subsections or at least complete 1-subsections. We now delete the boundary portions of , which are aligned with the boundary portions of . These have proportion in each complete 1-subsection–hence proportion of . Let be the remaining portion of . Then contains proportion at least of .
Case 1: 424242We note that because , if we are in Case 1..
Let be the concatenation of , and similarly the concatenation of the ’s. Then and . Let and be an initial or final segment of of length at least .
Sublemma 99
If is sufficiently small as a function of , then
[TABLE]
is within of
[TABLE]
Let be the concatenations of and . By J11.1, we see that the number of occurrences of in satisfies:
[TABLE]
Fix such an and let be a -class. Then has elements. It follows from equation 47 that the number of occurrences of a pair in with takes proportion of approximately
[TABLE]
Since there are many classes that need to be considered we see that the number of pairs and with is approximately
[TABLE]
Hence for small enough , we can see the conclusion of the sublemma.
Numerical Requirement 13
The numbers should be small enough as a function of that estimate in the conclusion of sublemma 99 hold:
[TABLE]
The locations in are made up of powers . These fall into two categories, those locations occurring in whole 2-subsections and those occurring in the final product of 1-subsections. Applying the previous reasoning separately to the whole 2-subsections and the either-empty-or-relatively-long product of 1-subsections at the end of , we see that the proportion of occurring in across from a in that is equivalent is also extremely close to .
If , then specification J11.1 implies that
[TABLE]
So and hence
[TABLE]
In general, the induction hypothesis yields that -inequivalent words have -distance at least -apart. Thus on :
[TABLE]
Allowing for agreement on boundary portions and applying Remark 22 we see that
[TABLE]
Case 2: .
In this case . Let with . Since , is not the identity. Since acts diagonally, for all with intersecting the interval , we have . In particular, .
Hence , and thus
[TABLE]
Tail Segments: The argument for tail segments (inequality 45) follows the argument for initial segments, except that we delete small parts of the beginning of , instead of the end of .
Tail Segments compared to initial segments: To show inequality 46, we proceed by induction, considering . In the comparing two initial segments or two tail segments, not only did the 2 and 1-subsections line up, but the -subwords did as well. When comparing initial segments with tail segments, the -subwords may be shifted, causing additional complications. The proof proceeds as in the easier cases, eliminating small sections of (or equivalently ) a bit at a time until we are left with -words. The alignment of these -words allows us to apply the induction hypothesis and conclude that the vast majority of and have -distance at least .
a.) Of the 2-subsections of that intersect , at most one is not a subset of (namely the last one), and similarly except for possibly the first 2-subsection intersecting , is made up of whole 2-subsections.
b.) Each 2-subsection of overlaps one or two 2-subsections of . An overlap of a 2-subsection of with a 2-subsection of that has proportion bigger than of the 2-subsection implies that the overlap contains at least complete 1-subsections.
- 1.
Among the complete 2-subsections of , delete overlaps of proportion less than . 2. 2.
Delete the possible partial 2-subsection at the end of if it contains less than complete 1-subsections.
The proportion of that has been deleted is less than .
c.) It could be that some of the portions of the remaining 2-subsections start or end with incomplete 1-subsections; i.e. not a whole word of the form . Delete these incomplete sections. This leaves initial or tail segments of 2-subsections of the form that consist of at least whole 1-subsections. This trimming removes at most proportion of .
d.) We also remove the boundary sections of . This removes at most of what remains of at this stage.
e.) We are left with a portion such that consisting entirely of 0-subsections. These are blocks of the form , where . Each individual -word can occur opposite a portion of in various ways. These are:
i. might occur exactly opposite a 434343This is what happens in the case that . or
ii. might span portions of two copies of in a power . The two copies have the form , or
iii. might overlap a portion of the boundary of . This can happen in two ways: boundary inside a 2-subsection (i.e. boundary of the form ) and boundary between consecutive 2-subsections (i.e. boundary of the form ). In each there are at most 3 copies of overlapping boundary portions of .
Hence by removing proportion at most we are left with a portion of consisting of powers of ’s that do not overlap any boundary in .
f.) After the deletions described in a.)-e.) the remaining portions of consists of blocks of powers of ’s in initial segments of 2-subsections:
[TABLE]
and in tail segments of 2-subsections:
[TABLE]
where ’s stand for ’s deleted opposite boundary of and ’s stand for the boundary of that has been deleted. An important point for us is that in each block and .
Consider the ’s in situation described in item e).ii. above. The ’s split into two pieces. By deleting a portion of the individual ’s of size less than we can assume that all of the overlap of ’s is in sections of length at least . By doing this for all ’s we remove a parts of the remaining elements of of proportion at most .
g.) We now look more carefully at the two types of blocks of words described in item f.). The case in item e.)i. is similar and easier than the case in item e.)ii. so we omit it. Along the blocks described in f.) the initial portions of are lined up with and the second portions are lined up with . Critically, the is constant along the block.
According to whether or not, we apply specifications J11.1 (as in Case 1 of the Initial Segments argument) and J10 to see that at most proportion of the ’s in a segment of the forms in f.) are lined up with are -equivalent. Hence we can make a final deletion of proportion at most to get a portion consisting of relatively long pieces of -words in overlapping -words in that lie in different equivalence classes.
We now finish the argument using Remark 22. After all of the deletions we are left with having at least -proportion of and consists of relatively long pieces of words that are overlapping portions of words in that lie in different -classes.
By the induction hypothesis each of the pieces of n-words in of -distance at least from the corresponding portion of . Consequently:
[TABLE]
thus finishing the proof of Proposition 98.
Since assumption T4 is an immediate corollary of Proposition 98 we have finished verifying the timing assumptions.
We note in passing that inequality 46 holds even if provided that the choice of initial and tail segment misalign corresponding 1-subsections.
We have proved:
Theorem 100
Suppose that is a system in the range of with construction sequence . Then satisfies the timing assumptions.
11 The Consistency of the Numerical Requirements
During the course of this construction we have accumulated numerical conditions about growth and decay rates of several sequences. The majority of the numerical constants are not inductively determined–they are given immediately by knowing a small portion of the tree . We call these exogenous requirements. Other sequences of numbers depend on previous choices for the numbers–hence are determined recursively. In this section we list the the recursive requirements, explicate their interdependencies and resolve their consistency.
Some of the conditions are easy to satisfy, as they don’t refer to other sequences. For example, Numerical Requirement 1 (that ) can be satisfied once and for all by assuming that . Others are trickier, in that they depend on the growth rates of the other sequences. For example, in defining the sequence of ’s we require that be large relative to the choice of . We call the former type of conditions Absolute and the latter Dependent. The Dependent conditions introduce the risk of circular or inconsistent growth and decay rate conditions.
Our approach here is to gather all of the conditions arising in this paper and its predecessors and classify them as Absolute or Dependent. We label them A or D accordingly. This process allows us to make a diagram of the Dependent conditions to verify that there are no circularities. The lack of a cycle in the diagram gives a clear method of recursively satisfying all of the numerical conditions.
Due to an overabundance of numerical parameters we were forced into some awkward notational choices. As noted before we have two types of epsilons: the lunate , often used for set membership and the classical . They play similar but slightly different roles. The lunate epsilons come from construction requirements arising in [8] and their strengthenings. The classical epsilons come from requirements related to circular systems and realizing them as smooth systems. As is to be expected there is interaction between the two. This occurs via the intermediary numbers we called ’s in Numerical Requirements 5 and 11.
11.1 The Numerical Requirements Collected.
In this section we collect the relevant numerical requirements. Specifically, in constructing we are presented with as a subsequence of a fixed enumeration of .
In the formal statements of the specifications in [8] for the construction sequence corresponding to , is built just in case . This leads to a construction sequence of the form with gaps corresponding to ’s where . To simplify notation we reindex as where is determined by . In [8], the specifications discussed “successive” (or “consecutive”) elements of . These are and that belong to , but have no with . In our new notation successive elements and of correspond to and where . Having adopted this convention we don’t distinguish between and . To emphasize the dependence on we will occasionally write .
We begin with the requirements inherited from [8].
Inherited Numerical Requirements
We have changed the notation from [8] as described in the appendix A. The number of elements of is denoted ; the numbers and denote the number of classes and sizes of each class of respectively. In [8] we have sequences ,
Inherited Requirement 1
is summable.
Inherited Requirement 2
the number of classes inside each class. The numbers will be chosen to grow fast enough that
[TABLE]
If is the maximal length of an element of and then we set as well. This forces and all to be powers of 2 that are determined by . In particular let and be successive elements of . Then is the number of words one gets by iteratively substituting many elements into words in and closing under are successive for .444444It is possible to give a closed form formula for this, but it is complicated and uninformative.
By remark 96 and are monotonically co-determined. Hence we can state this requirement as saying:
is large enough in terms of that inequality 51 holds.
Inherited Requirement 3
If then
[TABLE]
Inherited Requirement 4
[TABLE]
Inherited Requirement 5
[TABLE]
Since this is equivalent to the summability of the -sequence, it is redundant and we will ignore in the rest of this paper
Inherited Requirement 6
There will be prime numbers such that (i.e. ). The ’s grow fast enough to allow the probabilistic arguments in [8] involving to go through.
Inherited Requirement 7
is a power of 2.
Inherited Requirement 8
The construction of requires that if then .
Numerical Requirements introduced in this paper:
Numerical Requirement 1
and .
Numerical Requirement 2
is a sequence of numbers in such that .
Numerical Requirement 3
and grow fast enough that ,
, .
Numerical Requirement 4
which is satisfied if .
Numerical Requirement 5
is chosen small relative to .
Numerical Requirement 6
The number is big enough relative to a lower bound determined by , and to make the periodic approximations to the diffeomorphism converge.454545This is discussed in detail on pages 34-35 of [13], where the lower bound is called . Moreover .
Numerical Requirement 7
goes to infinity as goes to infinity and is a power of .
Numerical Requirement 8
.
Numerical Requirement 9
The ’s are decreasing, and
Numerical Requirement 10
is chosen sufficiently large relative to a lower bound determined by that the Law of Large Numbers argument from [8] works.
Numerical Requirement 11
is small relative to
Numerical Requirement 12
, the ’s are increasing and .
Numerical Requirement 13
The numbers should be small enough, as a function of , that estimate 49 holds.
11.2 Resolution
A list of parameters, their first appearances and their constraints
We classify the constraints on a given sequence according to whether they refer to other sequences or not. Requirements that inductively refer to the same sequence are straightforwardly consistent. Those that refer to other sequences risk the possibility of being circular and thus inconsistent. As noted above refer to the former as Absolute conditions and the latter as Dependent conditions.
The sequence .
Absolute conditions: None for .
Dependent conditions:
D1
Numerical Requirement 10, depends on .
D2
Inherited Requirement 6. We can satisfy Inherited Requirement 6 by taking large enough to satisfy Numerical Requirement 10 and of the form .
D3
From Inherited Requirement 4, equation 53 requires that goes to as goes to . This can be satisfied by choosing large enough as a function of .
We note that equation 53 implies that is finite.
D4
Numerical Requirement 12 says that and the ’s are increasing and is finite. As noted the last condition follows from D3. The other parts of Numerical Requirement 12 are satisfied by taking large relative to .
D5
Numerical Requirement 8 implies that is large enough that . This implies that is large relative to .
From D1-D5, we see that is dependent on the choices of , and . 2. 2.
The sequence .
Absolute conditions
A1
Numerical Requirement 1 says that . We also require that , an exogenous requirement.
Dependent conditions
D6
By Numerical Requirement 6, is bigger than a number determined by and .
D7
The sequence must grow fast enough that . This can be arranged by making . Since , this puts lower bound on dependent on .
**Thus depends on , , and . ** 3. 3.
The sequences and . We treat these sequences as equivalent since is a power of determined by and the elements of the tree in the domain of the reduction. Moreover increasing one increases the other and vice versa. Since they are co-determined, they are chosen at the same time.
Absolute conditions
A2
Inherited Requirement 7 says that is a power of 2.
Numerical Requirement 7 says that:
A3
The sequence goes to infinity.
A4
is a multiple of .
A5
Since determines , Numerical Requirement 4 puts an exogenous sequence of lower bounds on , for example that . This requires that be chosen large and, since and are inter-determined, can be satisfied by taking large.
Dependent conditions
D8
Numerical Requirement 3 makes depend on .
The result is that depends on the first elements of , , , and .464646It is important to observe that the choice of does not depend on or . 4. 4.
The sequence .
Absolute conditions
A6
Numerical Requirement 9 and Inherited Requirement 1 require that the is decreasing and summable and .
A7
Inherited Requirement 8 says that if then
Dependent conditions
D9
Numerical Requirement 9 requires that .
D10
Equation 52 of Inherited Requirement 3 says
D11
Numerical Requirement 11 says that must be small enough relative to .
D12
Numerical Requirement 13 says that is small as a function of .
The result is that depends exogenously on the first elements of , and on , , and . 5. 5.
**The sequence . **
Absolute conditions
A8
Numerical Requirement 2 says that . This can be arranged by taking .
Dependent conditions
Numerical Requirement 3 imposes three Dependent conditions on : ,
, . We deal with these in turn.
- (a)
The requirement that goes to infinity already follows from the fact that and item D4. 2. (b)
goes to infinity. This follows from , which is covered in Dependent condition D6. 3. (c)
goes to infinity. This follows from Dependent condition D7.
Thus there are no new Dependent conditions. 6. 6.
The sequence .
Absolute conditions: There are no new Absolute conditions.
Dependent conditions
D13
Numerical Requirement 4 says that . But since is determined by and the first -elements of the tree, Numerical requirement 4 is taken care of by A5.
There are no new Dependent conditions. 7. 7.
The sequence .
This sequence gives the required pseudo-randomness in the timing assumptions.
Absolute conditions: There are no new Absolute conditions.
Dependent conditions
D14
Numerical Requirement 5 requires that be very small relative to and .
** is dependent on and .**
The recursive dependencies of the various coefficients are summarized in Figure 5, in which an arrow from a coefficient to another coefficient shows that the latter is dependent on the former. Here is the order the the coefficients can be chosen consistently.
11.3 The inductive order of choices
We begin by setting: . is not defined, but is determined by . , , ,
Assume:
The coefficient sequences , and have been chosen. The first sequences on the tree are known.
To do:
Choose and . Each requirement is to choose the corresponding variable large enough or small enough where these adjectives are determined by the dependencies enumerated above.
Figure 5 gives an order to consistently choose the next elements on the sequences; Choose the successor coefficients in the following order:
[TABLE]
We note that is redundant in the diagram above since it is determined by , but we include it as a bridge from stage .
Appendix
Appendix A Notation table
In this paper we have adopted the notation used in [1], which conflicts with the notation in [8], accordingly we provide a table for translating between the two. In the table, NEW means the notation used in this paper, OLD means the notation used in [8].
[TABLE]
An equivalent description of the numbers we are calling in this paper is that they are the number of words in concatenated to form elements of . The number is equal to the number and in the old notation of [8].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] D. V. Anosov and A. B. Katok. New examples in smooth ergodic theory. Ergodic diffeomorphisms. Trudy Moskov. Mat. Obšč. , 23:3–36, 1970.
- 2[2] Hirotada Anzai. On an example of a measure preserving transformation which is not conjugate to its inverse. Proc. Japan Acad. , 27:517–522, 1951.
- 3[3] Ferenc Beleznay and Matthew Foreman. The complexity of the collection of measure-distal transformations. Ergodic Theory Dynam. Systems , 16(5):929–962, 1996.
- 4[4] Andrew-David Bjork. Criteria for rank-one transformations to be weakly mixing, and the generating property . Pro Quest LLC, Ann Arbor, MI, 2009. Thesis (Ph.D.)–University of California, Irvine.
- 5[5] Matthew Foreman. A descriptive view of ergodic theory. In Descriptive set theory and dynamical systems (Marseille-Luminy, 1996) , volume 277 of London Math. Soc. Lecture Note Ser. , pages 87–171. Cambridge Univ. Press, Cambridge, 2000.
- 6[6] Matthew Foreman. What is a Borel reduction? Notices Amer. Math. Soc. , 65(10):1263–1268, 2018.
- 7[7] Matthew Foreman. Gödel diffeomorphisms. https://arxiv.org/abs/2009.06735. 94 pages , 2020.
- 8[8] Matthew Foreman, Daniel J. Rudolph, and Benjamin Weiss. The conjugacy problem in ergodic theory. Ann. of Math. (2) , 173(3):1529–1586, 2011.
