One-sided versus two-sided stochastic descriptions
Aernout C.D. van Enter

TL;DR
This paper explores the relationship between one-sided and two-sided stochastic models, showing that under weak dependence conditions, the equivalence between Markov chains and Markov fields no longer holds, with a counterexample based on long-range Ising models.
Contribution
It demonstrates that the equivalence between Markov chains and Markov fields breaks down under weak dependence, providing a counterexample involving long-range Ising models.
Findings
Equivalence holds under strong dependence but breaks down with weak dependence.
Counterexample based on entropic repulsion in long-range Ising models.
Neither class contains the other under relaxed dependence conditions.
Abstract
It is well-known that discrete-time finite-state Markov Chains, which are described by one-sided conditional probabilities which describe a dependence on the past as only dependent on the present, can also be described as one-dimensional Markov Fields, that is, nearest-neighbour Gibbs measures for finite-spin models, which are described by two-sided conditional probabilities. In such Markov Fields the time interpretation of past and future is being replaced by the space interpretation of an interior volume, surrounded by an exterior to the left and to the right. If we relax the Markov requirement to weak dependence, that is, continuous dependence, either on the past (generalising the Markov-Chain description) or on the external configuration (generalising the Markov-Field description), it turns out this equivalence breaks down, and neither class contains the other. In one direction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
One-sided versus two-sided stochastic descriptions.
Aernout C.D. van Enter 111 Bernoulli Institute, University of Groningen, Nijenborgh 9, 9747AG,Groningen, Netherlands,
email: [email protected],
Abstract:
It is well-known that discrete-time finite-state Markov Chains, which are described by one-sided conditional probabilities which describe a dependence on the past as only dependent on the present, can also be described as one-dimensional Markov Fields, that is, nearest-neighbor Gibbs measures for finite-spin models, which are described by two-sided conditional probabilities. In such Markov Fields the time interpretation of past and future is being replaced by the space interpretation of an interior volume, surrounded by an exterior to the left and to the right.
If we relax the Markov requirement to weak dependence, that is, continuous dependence, either on the past (generalising the Markov-Chain description) or on the external configuration (generalising the Markov-Field description), it turns out this equivalence breaks down, and neither class contains the other. In one direction this result has been known for a few years, in the opposite direction a counterexample was found recently. Our counterexample is based on the phenomenon of entropic repulsion in long-range Ising (or ”Dyson”) models.
AMS 2000 subject classification: Primary- 60K35 ; secondary- 82B20
Keywords and phrases: Long-range Ising models, g-measures, Gibbs measures, entropic repulsion.
1 Introduction
It has been known since more than 40 years that finite-state discrete-time Markov Chains are equivalent to Markov Fields (one-dimensional nearest-neighbour finite-spin Gibbs measures) [34], Ch 3. This result was independently obtained by Brascamp and Spitzer.
In a Markov Chain the future is independent of the past, given the present; in a Markov Field the inside of a (finite) area is independent of the outside, given the border. If space is , one-dimensional, and time is discrete (thus also the one-dimensional integer line ), the difference in description is therefore between a one-sided (time-like) versus a two-sided (space-like) conditioning. But despite this, one obtains the same class of measures, as long as the conditioning is Markovian.
A Markovian modeller displays a certain short-sightedness:
For a Markov-Chain modeller, if one knows the present one controls the future (“All History is Bunk” (Henry Ford)); and for a Markov-Field modeller, to rule an area it suffices to “Control the Borders”. And in one dimension these are indeed the same.
On the other hand, if we consider arbitrary stochastic processes, one-sided descriptions and two-sided descriptions can provide highly non-equivalent results.
A famous example thereof is the existence of measures which are ergodic and have a positive – one-sided – Kolmogorov-Sinai entropy, (and have possibly even a one-sided trivial tail), thus being one-sided ”stochastic”, despite having a full two-sided tail, therefore being two-sided deterministic [36, 51] and thus having zero two-sided entropy density. We remark that the quantity which we here call a “two-sided “ entropy, is a one-dimensional example of entropy-like quantities which can be defined on more general graphs; such quantities have been also called “inner” or “conditional” or “lower” or ”erasure” entropies in the literature [18, 22, 29, 57, 58].
These above examples, due to Gurevich, and to Ornstein and Weiss, provided non-equivalence of one-sided and two-sided entropies [57], one-sided and two-sided tail properties, [38] etc. We notice that many quantities and properties from Ergodic Theory (Kolmogorov-Sinai entropy, K-property, Bernoulli property, Isomorphism as Dynamical Systems) are defined in terms of one-sided objects or descriptions.
However, those examples lack continuity properties of their conditional probabilities. Such continuity properties in space in fact characterise Gibbs measures [46, 56].
Thus the Gurevich and Ornstein-Weiss examples are excluded if one restricts oneself to the class of Gibbs measures.
In fact, it is known that one-sided and two-sided entropy densities for Gibbs measures (for absolutely summable potentials) are identical [57, 18], and that under some stronger conditions on the interaction decay, one-sided and two-sided tail properties of Gibbs measures are the same [38].
For Gibbs measures in one dimension, it was open for a long time if one-sided and two-sided characterisations were equivalent.
The measures which have one-sided continuous conditional probabilities are known as
-measures (or “chains with complete connections” or “ chains of infinite order”, or “random Markov chains”). They were introduced in the thirties and repeatedly rediscovered (under different names) [16, 50, 37, 45, 44]. A few years ago, Fernández, Gallo and Maillard [24] constructed a -measure – with one-sided continuous conditional probabilities– which is not a Gibbs measure, as its two-sided conditional probabilities are not continuous.
Here we discuss our [5] recently finding an opposite result, namely that the Gibbs measures of the Dyson models –which have two-sided continuous conditional probabilities– are not -measures, as their one-sided conditional probabilities are not continuous.
Note that, in contrast to all the earlier counterexamples of Gurevic, Ornstein-Weiss, and Fernández-Gallo-Maillard, in our case the two-sided behaviour is more “regular”, more
“stochastic”, than the one-sided behaviour.
2 Background and Notation
2.1 Dyson Models
Here we describe some properties of one-dimensional long-range spin models, also known as Dyson models.
In his original work, Dyson [17] considered an Ising spin system in one dimension (on ), with formal Hamiltonian given by
[TABLE]
and for is of the form .
A conjecture due to Kac and Thompson [43] had stated that there should be a phase transition for low enough temperatures if and only if (in zero magnetic field). Dyson proved a part of the Kac-Thompson conjecture, namely that for long-range models with interactions of the form with , there is a phase transition at low temperatures.
Later different proofs were found, [32, 42, 10, 1] and also the case was shown to have a transition [33].
In summary the following holds:
Proposition 1
[17, 33, 54, 34, 32, 1, 10, 49, 42]**. The Dyson model with polynomially decaying potential, for , exhibits a phase transition at low temperature:
[TABLE]
where the extremal measures and are translation-invariant. They have in particular opposite magnetisations at low temperatures. Moreover, the Dyson model in a non-zero homogeneous field has a unique Gibbs measure at all temperatures.
It is well-known that there is no phase transition for being of finite range, and neither for with .
Remark 1
The case of is more complicated to analyse, and richer in its behaviour, than the other ones. There exists a hybrid transition (the ”Thouless effect”), as the magnetisation is discontinuous while the energy density is continuous at the transition point. Moreover, there is second transition below this transition temperature. In the intermediate phase there is a positive magnetisation with non-summable covariance, while at very low temperatures the covariance decays at the same rate as the interaction, which is summable. For these results, see [1, 39, 40].
Here we will make use of the approach of [10], which has been extended to a number of other situations (Dyson models in random fields [13], interfaces [11], phase separation [12], inhomogeneous decaying fields [6], etc). The disadvantage of this approach is that it works only at very low temperatures, as it is perturbative, and it works only for a reduced set of -values, , with . The advantage, however, compared to other proofs, is that translation invariance does not play that much of a role.
The main idea of the approach of [10], which was introduced in the case by Fröhlich and Spencer in [33], is to construct a kind of triangular contours for which a Peierls-type contour argument can be obtained. The energy of a contour of length has an energy cost associated to it of order , (and of order when ).
There has been substantial interest in the Dyson model over the years.
Varying the decay parameter plays a similar role as varying the dimension in short-range models. This can be done in a continuous manner, so one obtains analogues of well-defined models in continuously varying non-integer dimensions. This is one major reason why these models have attracted a lot of attention in the study of phase transitions and critical behaviour (see e.g. [10] and references therein).
For some recent results for these long-range Dyson models with polynomially decaying interactions, see [49, 21, 6, 12, 20, 14].
2.2 Specifications and Measures
We refer to [7, 34, 19, 23, 31, 53] for more general treatments of the Gibbs formalism.
Dyson models are special, as they are ferromagnetic Ising models with long-range pair interactions in one dimension.
We consider these models as belonging to a more general class of lattice (spin) models with Gibbs measures on infinite-volume product configuration spaces . In our case , and the single-site state space is the Ising space , with the a priori counting measure . We denote by the set of the finite subsets of and, for any , write for the finite-volume configuration space . We also will consider only translation-invariant models.
Microscopic states or configurations, denoted by etc., are elements of , equipped with the product topology of the discrete topology on , for which these configurations are close when they coincide on large finite regions (the larger the region where they are equal, the closer the configurations are).
We denote by the set of continuous (quasilocal) functions on , characterized by
[TABLE]
The fact that we consider ferromagnetic pair interactions provides us with an extra tool:
We can make use of FKG inequalities. Monotonicity for functions and measures concerns the natural partial (FKG) [30] order ” ”, which we have on our Ising spin systems : if and only if for all . Its maximal and minimal elements are the configurations and , and this order extends to functions: is called monotone increasing when implies . For measures, we write if and only if for all monotone increasing222We denote for the expectation under a measure .. FKG arguments are based on the observation that increasing the interaction, or an external field, will FKG-increase the associated Gibbs measure.
Macroscopic states are represented by probability measures on .
Following Dobrushin, Lanford and Ruelle, DLR or Gibbs measures are defined in terms of consistent systems of (regular versions of) finite-volume conditional probabilities, of finite-volume configurations with prescribed boundary conditions outside of those volumes [15, 47].
Such a family of (everywhere, rather than almost everywhere, defined, as one has not yet a measure, which could determine that something is almost sure with respect to it) conditional probabilities is called a specification.
A measure for which a version of its conditional probabilities provides those of the specification is said to satisfy the DLR conditions for that specification. The specifications of interest in the theory of lattice systems usually satisfy a finite-energy condition. This says that no configuration in a local volume is excluded (has probability zero), uniformly in the boundary conditions on which one conditions. Moreover, the condition of continuity (or quasilocality) is required. This says that all conditional probabilities are continuous (quasilocal) functions of the boundary conditions.
Remark 2
In contrast to Kolmogorov’s theorem, which says that a system of consistent marginal probabilities defines precisely one probability measure, for finite-spin specifications the number of measures satisying the DLR conditions for that specification can be either one or more. The latter situation sometimes is taken as the definition of a Phase Transition.
A measure is said to be quasilocal when it is specified by a quasilocal specification.
A particularly important approach to quasilocal measures consists in considering the Gibbs measures with (formal) Hamiltonian defined via a potential (or interaction) , a family of local functions . The contributions of spins in finite sets to the total energy define the finite-volume Hamiltonians with free boundary conditions
[TABLE]
To define Gibbs measures, we require for that it is Uniformly Absolutely Summable (UAS), i.e. that . One then can give sense to the *Hamiltonian at
volume with boundary condition * defined for all as
. The Gibbs specification at inverse temperature is then defined by
[TABLE]
where the partition function is a normalizing constant. Due to the UAS condition, these specifications are quasilocal. It turns out that the converse is also true up to a non-nullness condition333expressing that , implies that for any . (see e.g. [34, 23, 46, 56, 48]) and one can take :
Definition 1** (Gibbs measures)**
*:
is a Gibbs measure iff , that is, the conditional probabilities of - at least a version thereof- are those given by where is a non-null and quasilocal specification.*
[TABLE]
Thus, for Gibbs measures the conditional probabilities always have continuous versions, or equivalently there is no point of essential discontinuity. Points of essential discontinuity are configurations which are points of discontinuity for ALL versions of the conditional probability. In particular one cannot make conditional probabilities continuous by redefining them on a measure-zero set if such points exist. In the generalized Gibbsian framework, one also says that such a configuration is a bad configuration for the considered measure, see e.g. [48]. The existence of such bad configurations implies non-Gibbsianness of the associated measures.
Remark 3
If the interaction is of finite range (a Markov Field), or sufficiently fast decaying, uniqueness of the Gibbs measure holds; indeed no phase transition is expected in one dimension in considerable generality. But the Dyson model gives a counterexample if just UAS is required.
Remark 4
In fact it is enough to know the single-site conditional probabilities – the single-site specification –, all other conditional probabilities can be obtained from those.
Although the extension of the DLR equation to infinite sets is direct in case of uniqueness of the DLR-measure for a given specification [27, 28, 35], it can be more problematic otherwise: it is valid for finite sets only and measurability problems might arise in case of phase transitions when one wants to extend them to infinite sets. Nevertheless, beyond the uniqueness case, such an extension was made possible by Fernández and Pfister [27] in the case of attractive models, that is models satisfying FKG properties.
As we will make use of it, we describe it now in our particular case. The concept they introduced is that of a global specification.
A global specification is a set of consistent conditional probabilities where one considers probabilities of sets which have their supports not only in finite sets, but in more general sets , which can be infinite, possibly with infinite complements. The existence of such a global specification can be invoked to derive the existence of conditional probabilities of sets in , and the possibility of conditioning on configurations in .
Note, by considering , that the set of measures a version of whose conditional probabilities is given by a global specification contains at most one element.
The case we will be most interested in is the situation where we condition on only one half-line. This leads us to the concept of -measures.
The formalism of -measures can be developed in a parallel manner to the Gibbs formalism, but only using one-sided objects (conditional probabilities, specifications, etc.).
We will call a measure a -measure, once the future depends in a continuous manner on the past.
Definition 2
Let be a measure on . We will call a -measure for the function if the conditional probability for the next symbol being , , depends in a continuous manner on the past, that is, is continuous function on .
For further description and background, we refer to [3]
Remark 5
We can also obtain the equivalent of a specification, a “LIS” (Left Interval Specification) for which the measure is a -measure, analogously to the Gibbs measure definition [25, 26]. This is based on the observation that one can build general conditional probabilities from single-site ones.
Remark 6
Although in the Markov-Chain set-up (which is a simple and well-known example of a -measure) uniqueness holds, in the general -measure case, similarly to the Gibbsian set-up, under only the condition of continuity phase transitions are possible [8, 2]. However, although under appropriate uniqueness conditions (uniform boundedness of boundary energies, or Dobrushin uniqueness, for example) -measures and Gibbs measures turn out to be the same objects ([3, 25, 26], no general equivalence seems to apply in a more general setting. Thus the connection between the different classes of phase transitions possible is mostly unknown (if there even is one).
3 From interface localisation to entropic repulsion,
and from there to the lack of the g-measure property:
One Ingredient and three Observations
I) **The Main Ingredient:
Mesoscopic Interface Localisation**:
In [11] the main result describes the mesoscopic localisation of an “interface point” at low temperatures, for a Dyson model with Dobrushin boundary conditions (minus to the left, plus to the right).
If one considers an interval with such Dobrushin boundary conditions, with probability close to 1 the interface point is near the center, that is, with large probability it is not more than away from the center with a (Gaussian) probability distribution; moreover the probability for the interface point being at larger distances than from the origin is bounded by . The system is in a minus-like phase with strictly negative magnetisation left of the interface point, and in a plus-like phase with positive magnetisation to the right of the interface point.
Although there are some extra conditions required (on the decay power, and the strength of the nearest-neighbour term), due to the use of the triangular contours of [10], it seems that one can rid of those with enough effort, see [49, 6]. Moreover, for a counterexample, of course we need not strive for the greatest generality, so we assume that all necessary conditions are satisfied (and there are enough situations known where this happens).
II) **First Observation:
From Interface Localisation To Entropic Repulsion.
**
It is a simple observation that if we change all spins at sites left of from minus to plus, with chosen large enough so that is small ( in particular, then is much larger than ), this does not change the probability distribution in the interval by that much. If the energy differences are small, so are the differences in (conditional) Gibbsian probabilities.
If we move the right border further to the right than , the interface point can only move to the right, due to an FKG argument. This implies that if we take the plus measure and condition on a large interval to be minus, typically at a distance at least of size one finds oneself in a minus-like phase. The interpretation of this is that a “hard” (frozen) minus-interval pushes an interface away, due to an entropic repulsion mechanism. The minus-phase can be seen in terms of a wetting phenomenon, as a “wet region”.
It should be noted that the effect requires positive, non-zero temperature.
Moreover for the case there is no mesoscopic localisation, but the position of the interface point has macroscopic fluctuations. Thus in that case, our proof breaks down.
III) **Second Observation:
Decoupling A Not Too Large Interval Does Not Shift The Interface.
**The position of the interface point does not change much if, next to the frozen minus interval, we decouple all the sites in a large but not too large interval to the right of it (that is at distance less than , with much less than , but possibly diverging with growing ).
This follows again from the contour analysis of [11]. Moving the interface point by a macroscopic distance (proportional to ) costs an energy which diverges with . If the total energy of an interval of size is less than that, then the interface point won’t move on a macroscopic (O(L)) scale.
IV) **Third Observation:
Changing A Decoupled Interval To An Alternating One Costs Finite Energy.**
If we choose an alternating configuration in the interval of size , and restore its
interaction with its complement, the interaction energy with its exterior is uniformly bounded.
Namely, the sum below satisfies:
.
It is known that finite-energy perturbations will cause only relatively small, essentially microscopic, changes [9]. In particular, shifting the interface point costs a finite energy only if the shift is over a finite distance.
Thus a large alternating interval, preceded by a (VERY) large frozen minus interval configuration again is succeeded by a minus phase, while, when it is preceded by a plus interval, it is succeeded by a plus-phase interval. But this dependence on the presence of a frozen plus or minus interval far (of order ) to the left (= in the past), violates the continuity condition which is required for to be a -measure. As our measure was defined to be a Gibbs measure for the Dyson interaction, it automatically has two-sided continuity; thus we have obtained our counterexample.
We can therefore conclude [5]:
Theorem 1
The low-temperature Gibbs measures of Dyson models cannot be written as -measures for a continuous -function. Therefore, of the class of Gibbs measures for quasilocal specifications and the class of -measures with continuous -functions, neither of the two classes contains the other one.
4 Conclusion, Final Remarks, Higher Dimensions
We have shown that one-sided and two-sided continuity of conditional probabilities not only are not equivalent, but that neither of the two continuity conditions implies the other one.
In other words, controlling your borders and controlling the future are not the same things, except if you are a short-sighted Markovian.
Although the result in [5] was proven under some restrictions on , and the proof also requires the presence of a large enough nearest-neighbour interaction, in view of the results of [49, 6] these conditions can presumably be removed.
However, the situation for remains unclear, as the main idea of our proof (mesoscopic localisation of the interface point) fails. It is not clear to us what to expect in that case.
In higher dimensions, the Markov-Field property occurs as a “Local” Markov property, whereas other properties, such as the Global Markov property or being a (tree-indexed) Markov Chain (a “splitting Gibbs measure”) on tree graphs, play a role which looks more like the Markov-Chain property. It is known, however, that in contrast to the one-dimensional situation these properties do not follow from the Local Markov properties [28, 35, 41, 52, 34].
Moreover, if one tries to compare such non-local Markov properties with nonlocal continuity properties by for example considering continuity properties as a function of the lexicographic past, (as the analogue of the one-dimensional ordinary past), it is not difficult to see that the low-temperature plus-phase of the Ising model in , for example, although it is actually known to be well-behaved enough that it even satisfies the Global Markov Property, has conditional probabilities which display an essential point of discontinuity. So even a Global Markov Field can have conditional probabilities which are discontinuous as a function of the (lexicographic) past. The proof of this statement is very close to that of the non-Gibbsianness of the Schonmann projection [55] (the marginal of the low-temperature two-dimensional Gibbs measure on the configurations a line ) as given in [19], and further studied in [27, 4] for example; only we now replace the line surrounding the origin by two half-lines and in the lexicographic past of the origin, then the proof goes through more or less literally. The wetting phenomenon which is responsible is identical: there is an entropic repulsion from a frozen interval into the “future” direction producing a wet droplet, and having two intervals left and right which are large enough, causes the two wet droplets to merge. Under conditions of strong uniqueness (high-temperature Dobrushin uniqueness, or Dobrushin-Shlosman conditions e.g.), continuity of the magnetisation in the origin as a function of the lexicographic past configurations holds, however. (I thank Brian Marcus and Siamak Taati for asking me this question and discussions on this issue).
Acknowledgments: I dedicate this paper with pleasure to Anton Bovier, on the occasion of his 60th birthday. Anton has been a reliable and stimulating guide for me in matters literary, culinary and scientific, for more than 30 years. His many-sided personality and friendship have enriched my life in many respects. I wish him many more years, and look forward to have a share therein.
I thank my collaborators and discussion partners on Dyson models, long-range models and -measures for the pleasure of these collaborations and for all I have learned from them. In alphabetical order I owe: Stein Bethuelsen, Rodrigo Bissacot, Diana Conache, Loren Coquille, Eric Endo, Roberto Fernández, Frank den Hollander, Bruno Kimura, Arnaud Le Ny, Brian Marcus, Pierre Picco, Wioletta Ruszel, Cristian Spitoni, Siamak Taati and Evgeny Verbitskiy.
I thank Eric, Siamak and Arnaud for helpful advice on the manuscript.
I thank Louis-Pierre Arguin, Veronique Gayrard, Nicola Kistler and Irina Kourkova for inviting me to be present at the CIRM meeting to celebrate Anton, and also to contribute to this volume.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Aizenman, J. Chayes, L. Chayes, C. Newman. Discontinuity of the Magnetization in the One-Dimensional 1 / ∣ x − y ∣ 2 1 superscript delimited-∣∣ 𝑥 𝑦 2 1/\mid x-y\mid^{2} Percolation, Ising and Potts Models. J. Stat. Phys. 50, no 1/2: 1-40, 1988.
- 2[2] N. Berger, C. Hoffman, V. Sidoravicius. Nonuniqueness for specifications in l 2 + ϵ superscript 𝑙 2 italic-ϵ l^{2+\epsilon} . ar Xiv:math/0312344, 2003. Erg. Th. Dyn. Syst. 38: 1342–1352, 2018.
- 3[3] S. Berghout, R. Fernández, E. Verbitskiy. On the Relation between Gibbs and g 𝑔 g -measures. Erg. Th. Dyn. Syst. , to appear.
- 4[4] S.A. Bethuelsen and D. Conache. One-sided continuity properties for the Schonmann projection. J. Stat. Phys. 172: 1147-1163, 2018.
- 5[5] R. Bissacot, E.O. Endo, A.C.D. van Enter and A. Le Ny. Entropic Repulsion and Lack of the g-Measure Property for Dyson Models. Comm. Math. Phys. 363: 767–788, 2018.
- 6[6] R. Bissacot, E.O. Endo, A.C.D. van Enter, B. Kimura, W.M. Ruszel. Contour Methods for Long-Range Ising Models: Weakening Nearest-Neighbor Interactions and Adding Decaying Fields. Ann. Henri Poincaré 19: 2557–2574, 2018.
- 7[7] A. Bovier. Statistical mechanics of disordered systems. Cambridge University Press, 2006.
- 8[8] M. Bramson, S. Kalikow. Non-uniqueness in g 𝑔 g -functions. Israel J. Math. 84: 153-160, 1993.
