Rate Distortion Theorem and the Multicritical Point of Spin Glass
Tatsuto Murayama, Asaki Saito, Peter Davis

TL;DR
This paper applies Shannon's rate-distortion theorem to Ising spin systems, deriving a universal constraint on spin correlations that bounds the multicritical point in the phase diagram.
Contribution
It introduces a novel application of information theory to spin systems, linking the rate-distortion theorem to phase transition boundaries.
Findings
Derived a universal constraint on neighboring spin correlations.
Provided a bound for the multicritical point in the phase diagram.
Linked information theory principles with statistical physics of spin systems.
Abstract
A spin system can be thought of as an information coding system that transfers information of the interaction configuration into information of the equilibrium state of the spin variables. Hence it can be expected that the relations between the interaction configuration and equilibrium states are consistent with the known laws of information theory. We show that Shannon's rate-distortion theorem can be used to obtain an universal constraint on neighboring spin correlations for a broad range of Ising spin systems with two-body spin interactions. Remarkably, this constraint gives a bound for the multicritical point in the phase diagram, when a mean-field behavior for the neighboring spin pairs can be expected in the paramagnetic phase.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Rate Distortion Theorem and the Multicritical Point of Spin Glass
Tatsuto Murayama
Graduate School of Science and Engineering, University of Toyama, 3190 Gofuku, Toyama-shi, Toyama 930-8555, Japan
Asaki Saito
Department of Complex and Intelligent Systems, Future University Hakodate, 116-2 Kamedanakano-cho, Hakodate, Hokkaido 041-8655, Japan
Peter Davis
Telecognix Corporation, 58-13 Yoshida Shimooji-cho, Sakyo-ku, Kyoto-shi, Kyoto 606-8314, Japan
Abstract
A spin system can be thought of as an information coding system that transfers information of the interaction configuration into information of the equilibrium state of the spin variables. Hence it can be expected that the relations between the interaction configuration and equilibrium states are consistent with the known laws of information theory. We show that Shannon’s rate-distortion theorem can be used to obtain an universal constraint on neighboring spin correlations for a broad range of Ising spin systems with two-body spin interactions. Remarkably, this constraint gives a bound for the multicritical point in the phase diagram, when a mean-field behavior for the neighboring spin pairs can be expected in the paramagnetic phase.
Understanding the experimental observations of disordered materials has been a challenge to theoretical physicists. This triggered the rise of a special area of statistical mechanics that deals with a variety of statistical models with frozen disorder, where a series of mathematical techniques has become a common language for the systematic analysis [1, 2]. Moreover, these techniques of statistical mechanics have been applied to the study of communication and information systems [3, 4], including noisy channel coding [5, 6, 7], recursive data compression [8, 9, 10], CDMA multiuser detection [11, 12, 13], modern cryptography [14], and some combinatorial optimization problems and methods for them [15, 16, 17]. Overall, the physicist’s toolbox has successfully been applied to solve issues of information science; but not vice versa. To our knowledge, no classical theorem in information theory has been used to analyze the physics of complex condensed matter such as spin glass.
This Rapid Communication shows that information theory can be effectively applied to the analysis of spin glass systems. In our scenario, each of the equilibrium states of the Ising spins is regarded as one encoding of the interaction configuration [18, 19]. This scenario enables us to apply the Shannon rate-distortion theorem of information coding theory [20], which then allows us to develop a new method for investigating fundamental restrictions on the phase diagram. As a result, we obtain a previously unknown general bound for the location of the multicritical point for Ising spin glasses, where paramagnetic, ferromagnetic and spin glass phases merge [21]. Remarkably, our argument is independent of detail structure of the lattice. Numerical studies of problems related to the location of the multicritical point for specific lattice models have been carried out by many physicists [22, 23]. However, we still have little knowledge about these significant issues from a theoretical point of view [24, 25].
In our spin glass model, we assign a binary spin to each site and the local energy to a set of pairwise bonds with a binary interaction . We investigate a class of Ising spin systems with the Hamiltonian
[TABLE]
only assuming that the total number of the sites and the bonds are and , respectively. Specifically, we do not restrict the range of the sum in (1). This sum could be over nearest neighbors, or it might include farther pairs, etc. Special features of each lattice will be reflected only through the ratio . For the simplicity, this work deals with a Hamiltonian with two-body interactions to elucidate the benefit of Shannon’s rate-distortion theorem, although the same arguments apply to other multi-body spin systems.
Each is supposed to be distributed independently according to the common distribution
[TABLE]
for a value of disorder parameter in the interval . Here denotes the Kronecker’s delta function and the set of interaction coefficients is called the Bernoulli() random variables. In general, we write the inverse temperature as and then the phase diagram of the system can be depicted in the space of disorder parameter and temperature . Now, we consider the Nishimori temperature for the spin system, defined to be
[TABLE]
Notice that the above equation specifies a line, the Nishimori line, in the space of and [26]. It has been shown that the multicritical point can be always found on this line. And so, we can specify the multicritical point by giving a value for the disorder parameter , say . Moreover, since spin glass phase does not exist on the Nishimori line, the multicritical point can be characterized as a ferromagnetic transition along the line [27].
In this Rapid Communication, we present a general bound for the location of the multicritical point of spin systems on any lattice with a Hamiltonian (1). Solid line in FIG. 1 shows the upper bound of for a given , only below which we find the multicritical point. Notice that we can use Shannon’s rate-distortion theorem to obtain this remarkable constraint when a mean-field behavior can be expected in the paramagnetic phase. More precisely, on the Nishimori line, we assume that
[TABLE]
holds in the paramagnetic phase, where the denotes the joint distribution of and in the whole complex system and the means equality up to a normalization constant. This implies that the effect of the rest of the lattice on local marginals should not be dominant and our potential target systems have a certain mean-field property in the paramagnetic state, at least on the Nishimori line. However, we insist that no further physical assumption is required to the Ising spin system. As an example, the dashed line in FIG. 1 represents the exact value of the multicritical point for a family of spin glass on a Bethe lattice [25]. Here all the bonds are chosen randomly to give a diluted lattice with the fixed connectivity of . The standard cavity analysis shows that the relation (2) holds at any temperature in the paramagnetic phase [28, 26]. As is expected, we can confirm that upper bounds for this specific model.
In the remainder of the work we will explain how this general bound can be obtained using the rate-distortion theorem. For the reader’s convenience, we now outline the proof and then go into specific details afterwards. We first define an average of local correlation functions
[TABLE]
where we assume that represents the expectation value in the equilibrium state of the Hamiltonian (1) at temperature , and suppose that a bracket indicates averaging over an ensemble of configurations . If the assumption (2) holds within the paramagnetic phase, we always get at for all . However, if is small enough, derived from the paramagnetic assumption is smaller than the lower bound , which is imposed by Shannon’s rate-distortion theorem. This implies that such for a given indicates the ferromagnetic state; otherwise contradiction. The infimum of such ferromagnetic , therefore, gives an upper bound for the transition point . FIG. 2 illustrates a typical example with ratio .
We first show that the local correlation function at is
[TABLE]
if the system is in the paramagnetic state. Since the relation (2) implies the explicit form
[TABLE]
it is an easy matter to check that and averaging over gives .
Now, we will explain how to obtain theoretical bound for based on Shannon’s rate-distortion theorem. Let us first consider a virtual communication channel where the interaction configuration sequence is a set of the Bernoulli() random variables to be compressed, the set of spins is its compressed representation/codeword, and the spin products are its reproduction at Nishimori temperature . This choice of communication channel is motivated by the fact that at the Nishimori temperature, the Hamming distortion, or the normalized Hamming distance, between the and its reproduction can be easily obtained as [27]. This specific distortion measure defines the goodness of as a representation of a set of given Bernoulli() random variables . The basic problem in Shannon’s rate-distortion theory can then be stated as follows. What is the minimum description ratio required to achieve a given Hamming distortion between the two sequences? Shannon’s rate-distortion theorem gives the lower bound, say , as a function of the distortion measure for the theoretically achievable ratio . The ratio, or rate, is called the rate-distortion function for the Bernoulli() random variables. However, the distortion only gives a trivial lower bound and results in no restrictions for this specific channel [20].
We thus introduce a coding ‘trick’, a set of the Bernoulli() random variables with , which allows us to tighten the bound on . In the communication channel picture, the manipulation of the Bernoulli() sequence to get the sequence corresponds to a preprocessing step in the encoding operation. After we preprocess to get , the is not Bernoulli() assumed in the Nishimori’s theory. However, this difference becomes negligible when we take the large system limit of . As a result, we can use the Nishimori’s theory to calculate the Hamming distortion between and , which then offers a positive minimum ratio of . Since distortion redefined for the new pair depends on and , a positive bound on for the , if any, imposes a constraint on as a function of and . Hence, we obtain the theoretical lower bound on for a given ratio . Notice here that we require no physical assumptions such as (2) in this argument. In the following paragraphs we explain the essential details of this universal analysis.
We first introduce a set of Bernoulli() random variables for some satisfying . Define the set of all configurations with relative frequency of s equal to . For sufficiently large , we can consider and , respectively [29]. So we suppose that any configuration can be switched to a configuration by flipping elements from to . We consider the set of spin products as an estimate of the original .
Here we evaluate the normalized Hamming distance between the samples and , i.e., . We first notice that the identity leads to
[TABLE]
The equality holds if and only if there is no chance of getting and simultaneously. By definition, the preprocessing gives . The second term on the right would be
[TABLE]
since the gauge theory tells us that the internal energy becomes on the Nishimori line [26]. Assume that the bracket also indicates averaging over an ensemble of configurations as well as . Then we have
[TABLE]
To directly calculate the Hamming distance between the samples and on the Nishimori line, we introduce a pair of auxiliary variables and defined to be
[TABLE]
where is the empirical probability of when and denotes a frequency of s at the random variables . Notice that the former equation just counts up every difference , while the latter indicates the total number of in the reconstruction. By solving the two equations, we have
[TABLE]
It is easy to check that these formulas are well defined as probabilities in the interval for a given . Notice also that . Then it follows that
[TABLE]
(see [26]). In other words, the normalized Hamming distance between and on the Nishimori line can be estimated by the formula
[TABLE]
which is non-negative for the relevant intervals.
Lastly, it is possible to invoke Shannon’s rate-distortion theorem for the Bernoulli() random variables [20]. In the new communication channel picture with preprocessing, we first write and focus on the Hamming distortion between the original and its reproduction . Define the rate-distortion function for the Bernoulli() random variables as
[TABLE]
where we denote . For the ratio and the distortion , the theorem states that
[TABLE]
This inequality provides a bound on the compression ratio , dependent only on distortion . By letting , we can use the formula to lower bound the ratio as for every in the relevant interval . Now write
[TABLE]
It is obvious that we can still lower bound as
[TABLE]
Notice also that the is a non-increasing continuous function of . Suppose that the ratio is small enough to satisfy an inequality . Here the is the largest value of for over the interval . Since , it is an easy matter to check that for every . Then, by the intermediate value theorem, there exists a number in the closed interval such that
[TABLE]
We compare the formulas (6) and (7) to conclude that
[TABLE]
i.e., the lower bounds .
For the small enough, we numerically examine the equation (7) which implicitly determines for a given pair of and . Evaluation of the equation shows that there exists such a solution for some for every smaller than . Notice that the lower bound for the Bernoulli parameter gives the lower bound for local spin product . FIG. 2 compares this universal lower bound with the preceding paramagnetic solution . However, in this figure, violates our lower bound for larger than the intersection point . Hence, the larger than implies the ferromagnetic phase, in which the paramagnetic solution could break down. In other words, the multicritical transition point should be smaller than the intersection point . For a given , this offers an upper bound for as is shown by the solid line in FIG. 1, which is identified with .
In this Rapid Communication, we considered the ‘-bit’ spin state of the Ising spin glass model as compressed representations of a set of Bernoulli() binary random variables encoded in the interaction configuration. We showed that the Shannon rate-distortion theorem, which provides a bound on the compression ratio dependent only on distortion, can give an upper bound for the location of the multicritical point for a sufficiently small compression ratio . Remarkably, our argument is independent of detail structure of the lattice and only requires a mean-field assumption for the joint marginals of neighboring spins in the paramagnetic phase. Results obtained here for a certain class of lattice models with two-body Ising spin interactions will motivate applications of Shannon’s rate-distortion theorem to other Ising spin systems.
Acknowledgments
We would like to thank Federico Ricci-Tersenghi and Yoshiyuki Kabashima for useful discussions. We also thank an anonymous reviewer for suggestions that improved the explanations. This work was in part supported by JSPS KAKENHI Grant Numbers JP16KK0005, JP17K00009.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Mézard et al. [1987] M. Mézard, G. Parisi, and M. A. Virasoro, Spin Glass Theory and Beyond (World Scientific, 1987).
- 2Mézard and Montanari [2009] M. Mézard and A. Montanari, Information, Physics, and Computation (Oxford University Press, 2009).
- 3Richardson and Urbanke [2008] T. Richardson and R. Urbanke, Modern Coding Theory (Cambridge University Press, 2008).
- 4Merhav [2010] N. Merhav, Foundations and Trends® in Communications and Information Theory 6 , 1 (2010).
- 5Sourlas [1989] N. Sourlas, Nature 339 , 693 (1989).
- 6Vicente et al. [1999] R. Vicente, D. Saad, and Y. Kabashima, Physical Review E 60 , 5352 (1999).
- 7Macris [2007] N. Macris, IEEE Transactions on Information Theory 53 , 664 (2007).
- 8Murayama [2004] T. Murayama, Physical Review E 69 , 035105 (2004).
