Error bounds in normal approximation for the squared-length of total spin in the mean field classical $N$-vector models
L\^e V\v{a}n Th\`anh, Nguyen Ngoc Tu

TL;DR
This paper establishes new Kolmogorov bounds and improved Wasserstein bounds for the normal approximation of the squared-length of total spin in mean field classical N-vector models, using Stein's method.
Contribution
It introduces a new Kolmogorov bound and enhances the Wasserstein bound for the normal approximation in these models, advancing existing results.
Findings
Kolmogorov bound is newly derived.
Wasserstein bound is improved over previous work.
Results apply Stein's method to exchangeable pairs in spin models.
Abstract
This paper gives the Kolmogorov and Wasserstein bounds in normal approximation for the squared-length of total spin in the mean field classical -vector models. The Kolmogorov bound is new while the Wasserstein bound improves a result obtained recently by Kirkpatrick and Nawaz [Journal of Statistical Physics, \textbf{165} (2016), no. 6, 1114--1140]. The proof is based on Stein's method for exchangeable pairs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\SHORTTITLE
Normal approximation for the mean field classical -vector models \TITLEError bounds in normal approximation for the squared-length of total spin in the mean field classical -vector models ††thanks: This work was supported by National Foundation for Science and Technology Development (NAFOSTED), grant no. 101.03-2015.11. \AUTHORSLê Vǎn Thành 111Department of Mathematics, Vinh University, Nghe An 42118, Vietnam. \[email protected] and Nguyen Ngoc Tu 222Department of Applied Sciences, HCMC University of Technology and Education, Ho Chi Minh City, Vietnam. 333 Department of Mathematics and Computer Science, University of Science, Viet Nam National University Ho Chi Minh City, Ho Chi Minh City, Vietnam. \[email protected] \KEYWORDSStein’s method, Kolmogorov distance, Wasserstein distance, mean-field model \AMSSUBJ60F05 \SUBMITTEDJuly 05, 2018 \ACCEPTEDFebruary 16, 2019 \VOLUME24 \YEAR2019 \PAPERNUM16 \DOI218 \ABSTRACTThis paper gives the Kolmogorov and Wasserstein bounds in normal approximation for the squared-length of total spin in the mean field classical -vector models. The Kolmogorov bound is new while the Wasserstein bound improves a result obtained recently by Kirkpatrick and Nawaz [Journal of Statistical Physics, 165 (2016), no. 6, 1114–1140]. The proof is based on Stein’s method for exchangeable pairs.
1 Introduction and main result
Let be an integer, and let denote the unit sphere in . In this paper, we consider the mean-field classical -vector spin models, where each spin is in , at a complete graph vertex among vertices ([5, Chapter 9]). The state space is with product measure , where is the uniform probability measure on . In the absence of an external field, each spin configuration in the state space has a Hamiltonian defined by
[TABLE]
where is the inner product in . Let be the inverse temperature. The Gibbs measure with Hamiltonian is the probability measure on with density function:
[TABLE]
where is the partition function: This model is also called the mean field model. It reduces to the model, the Heisenberg model and the Toy model when , respectively (see, e.g., [5, p. 412]).
Before proceeding, we introduce the following notations. Throughout this paper, is a standard normal random variable, and is the probability distribution function of . For a real-valued function , we write . The symbol denotes a positive constant which depends only on the inverse temperature , and its value may be different for each appearance. For two random variables and , the Wasserstein distance and the Kolmogorov distance between and are as follows:
[TABLE]
and
[TABLE]
In the Heisenberg model (), Kirkpatrick and Meckes [6] established large deviation, normal approximation results for total spin in the non-critical phase (), and a non-normal approximation result in the critical phase (). The results in [6] are generalized by Kirkpatrick and Nawaz [7] to the mean field -vector models with .
Let denote the modified Bessel function of the first kind (see, e.g., [2, p. 713]) and
[TABLE]
By Lemma A.2 in the Appendix, we have
[TABLE]
We also have
[TABLE]
In the case , from (2) and (3), there is a unique strictly positive solution to the equation
[TABLE]
Based on their large deviations, Kirkpatrick and Nawaz [7] argued that in the case , there exists such that
[TABLE]
for all , where is total spin. It means that is close to with high probability. On the other hand, all points on the hypersphere of radius will have equal probability due to symmetry. Based on these facts, they considered the fluctuations of the squared-length of total spin:
[TABLE]
where . Let
[TABLE]
Kirkpatrick and Nawaz [7] proved that when , the bounded-Lipschitz distance between and is bounded by . Their proof is based on Stein’s method for exchangeable pairs (see Stein [10]). Recall that a random vector is called an exchangeable pair if and have the same distribution. Kirkpatrick and Nawaz [7] construct an exchangeable pair as follows. Let be as in (5) and let , where for each fixed, is an independent copy of given , i.e., given , and have the same distribution and is conditionally independent of (see, e.g., [4, p. 964]). Let be a random index independent of all others and uniformly distributed over , and let
[TABLE]
where . Then is an exchangeable pair (see Kirkpatrick and Nawaz [7, p. 1124], Kirkpatrick and Meckes [6, p. 66]).
The bound obtained by Kirkpatrick and Nawaz [7] is not sharp. The aim of this paper is to give the Kolmogorov and Wasserstein distances between and with optimal rate .
The main result is the following theorem. We recall that, throughout this paper, is a positive constant which depends only on , and its value may be different for each appearance.
Theorem 1.1**.**
Let and be as in (1). Let be the unique strictly positive solution to the equation and as in (6). For as defined in (5), we have
[TABLE]
and
[TABLE]
The Wasserstein bound in Theorem 1.1 will be a consequence of the following proposition, a version of Stein’s method for exchangeable pairs. It is a special case of Theorem 2.4 of Eichelsbacher and Löwe [4] or Theorem 13.1 in [3].
Proposition 1.2**.**
Let be an exchangeable pair and . If for some random variable and , then
[TABLE]
The Kolmogorov distance is more commonly used in probability and statistics, and is usually more difficult to handle than the Wasserstein distance. Recently, Shao and Zhang [9] proved a very general theorem. Their result is as follows.
Proposition 1.3**.**
Let be an exchangeable pair and . Let be any random variable satisfying and . If for some random variable and , then
[TABLE]
Shao and Zhang [9] applied their bound in Proposition 1.3 to get optimal bound in many problems, including a bound of for the Kolmogorov distance in normal approximation of total spin in the Heisenberg model. We note that if , then the following result is an immediate corollary of Proposition 1.3. In this case, the bound is much simpler than that of Proposition 1.3.
Corollary 1.4**.**
If , then
[TABLE]
Proof.
In Proposition 1.3, let , then
[TABLE]
If , then (10) is trivial. If , then (10) follows immediately from (11) and Proposition 1.3. ∎
For , and for and respectively defined in (5) and (7), we have
[TABLE]
since and Therefore, we will apply to obtain the Kolmogorov bound in Theorem 1.1.
2 Proof of the main result
The proof of Theorem 1.1 depends on Kirkpatrick and Nawaz’s finding [7]. Applying Proposition 1.2 and , Theorem 1.1 follows from the following proposition.
Proposition 2.1**.**
Let , and let be as in (1), the unique strictly positive solution to the equation . Let and be as in (5) and (7), respectively. Then the following statements hold:
(i) and
(ii) where and is a random variable satisfying ,
(iii) E\Big{|}\dfrac{1}{2\lambda}E((W_{n}-W_{n}^{{}^{\prime}})^{2}|W_{n}))-B^{2}\Big{|}\leq Cn^{-1/2}, where is defined in (6).
Remark 2.2**.**
Kirkpatrick and Nawaz’s [7] used their large deviation result for total spin to prove that Intuitively, we see that this bound would be improved to since approximates a normal distribution. By a more careful estimate, we can prove that (see Lemma A.1). This will lead to desired bound . Kirkpatrick and Nawaz’s [7] also proved that
[TABLE]
To get optimal bound of order for this term, we use a fine estimate of function (Lemma A.2) and a technique developed recently by Shao and Zhang [9, Proof of (5.51)]. **
Proof of Proposition 2.1.
(i) We have
[TABLE]
The proof of the first half of (i) is completed. Now, apply Lemma A.1 given in the Appendix, we have
[TABLE]
(ii) Kirkpatrick and Nawaz [7, equation (9)] showed that
[TABLE]
where is a random variable satisying . Set . By Taylor’s expansion, we have for some positive random variable :
[TABLE]
Set we have and
[TABLE]
Combining (12)-(14) and noting that , we have
[TABLE]
where
[TABLE]
By Lemma A.2 (ii), we have . Since , and , we conclude that . The proof of (ii) is completed.
(iii) Denote is the identity matrix and set From Kirkpatrick and Nawaz [7, Equations (11) and (12)], we have
[TABLE]
where
[TABLE]
and is orthogonal projection onto . Therefore,
[TABLE]
where
[TABLE]
For , noting that , then by Lemma A.1, we have
[TABLE]
Thus,
[TABLE]
For , we have
[TABLE]
To bound , we note that
[TABLE]
Since and , we have
[TABLE]
Similarly,
[TABLE]
Combining (19) and (20), we have
[TABLE]
Bounding is the most difficult part. Here we follow a technique developed by Shao and Zhang [9, Proof of (5.51)]. Set
[TABLE]
we have
[TABLE]
It follows that
[TABLE]
Define a probability density function
[TABLE]
where is the normalizing constant. Let given , and for
[TABLE]
Similar to Shao and Zhang [9, pages 97, 98], we can show that
[TABLE]
and
[TABLE]
where and . Let
[TABLE]
By Lemma A.3 and the definition of , we have
[TABLE]
Using similar estimate for , then we have
[TABLE]
Note that given , and are conditionally independent. It implies that
[TABLE]
Combining (22)-(29), we have , and so
[TABLE]
Combining (15), (17), (18), (21) and (30), we have
[TABLE]
The proposition is proved. ∎
Appendix A Appendix
In this Section, we will prove the technical results that used in the proof of Theorem 1.1.
Lemma A.1**.**
We have
[TABLE]
Proof.
By the large deviation for [7, Proposition 2] and the argument in [7, p. 1126], one can prove that there exists such that
[TABLE]
for all . Since it implies that
[TABLE]
∎
Lemma A.2**.**
Let and . Then the following statements hold:
- (i)
**
- (ii)
**
- (iii)
**
Proof.
As was showed in [7, p. 1134], we have
[TABLE]
It implies
[TABLE]
and
[TABLE]
Amos [1, p. 243] proved that
[TABLE]
[TABLE]
Therefore,
[TABLE]
The proof of (i) and (ii) is completed. For (iii), we have
[TABLE]
Combining the first half of (35) and (36), we have . It follows from (31), (35) and (36) that
[TABLE]
Apply Theorem 2 (a) of Nȧsell [8], we can show that
[TABLE]
Combining (37) and (38), we have The proof of (iii) is completed. ∎
Lemma A.3**.**
With the notation in the proof of Theorem 1.1, we have
[TABLE]
Proof.
Let the Lebesgue measure of . It follows from (23) that
[TABLE]
where we have used formula
[TABLE]
(see, e.g., Exercise 11.5.4 in [2]) in the last equation. For , we have
[TABLE]
∎
Finally, we would like to note again that Proposition 1.2 is a special case of Theorem 2.4 of Eichelsbacher and Löwe [4] or Theorem 13.1 in [3], but the constants in the bound may be different from those of Theorem 2.4 in [4] or Theorem 13.1 in [3]. Since the proof is short and simple, we will present here.
Proof of the Proposition 1.2.
Let such that and , and let be the unique solution to the Stein’s equation Since is an exchangeable pair and ,
[TABLE]
It thus follows that
[TABLE]
By Lemma 2.4 in [3] we have
[TABLE]
The conclusion of the proposition follows from (39) and (40). ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Amos, D.E.: Computation of modified bessel functions and their ratios. Math. Comput. 28 (1974), no. 125, 239–251. \MR 0333287
- 2[2] Arfken, G. B. and Weber, H. J.: Mathematical methods for Physicists. Elsevier Academic Press , 6th edition, 2005. xii+1182 pp.
- 3[3] Chen, L. H. Y., Goldstein, L. and Shao, Q. M.: Normal approximation by Stein’s method. Probability and its Applications (New York). Springer, Heidelberg , 2011. xii+405 pp. \MR 2732624
- 4[4] Eichelsbacher, P. and Löwe, M.: Stein’s method for dependent random variables occuring in statistical mechanics. Electron. J. Probab. 15 (2010), no. 30, 962–988. \MR 2659754
- 5[5] Friedli, S. and Velenik, Y. Statistical mechanics of lattice systems. A concrete mathematical introduction. Cambridge University Press, Cambridge , 2018. xix+622 pp. \MR 3752129
- 6[6] Kirkpatrick, K. and Meckes, E.: Asymptotics of the mean-field Heisenberg model. J. Stat. Phys. , 152 (2013), 54–92. \MR 3067076
- 7[7] Kirkpatrick, K. and Nawaz, T.: Asymptotics of mean-field O(N) model. J. Stat. Phys. , 165 (2016), no. 6, 1114–1140. \MR 3575640
- 8[8] Nȧsell, I.: Rational bounds for ratios of modified Bessel functions. SIAM J. Math. Anal. 9 (1978), no. 1, 1–11. \MR 0466662
