Random fields, large deviations and triviality in quantum field theory. Part I
Adnan Aboulalaa

TL;DR
This paper investigates the existence and triviality of four-dimensional Euclidean quantum scalar fields using large deviations, showing that the fields become trivial and non-stable as the ultraviolet cutoff is removed.
Contribution
It applies large deviations techniques to demonstrate the non-existence and triviality of quantum scalar fields in four dimensions, extending results to vector fields and polynomial Lagrangians.
Findings
The density of regularized measures tends to zero almost surely as cutoff is removed.
Normalization sequences diverge, indicating non-ultraviolet stability.
Results hold for vector fields and polynomial Lagrangians.
Abstract
The issue of the existence and possible triviality of the Euclidean quantum scalar field in dimension 4 is investigated by using some large deviations techniques. As usual, the field is obtained as a limit of regularized fields associated with a probability measures , where represent ultraviolet and volume cutoffs. The result obtained is that in a fixed volume, the almost sure limit (as ) of the density of , with respect to the Gaussian free field measure, exists and is equal to , when the coupling constant is not vanishing. This implies that can not have a strong limit as the ultraviolet cutoff is removed. Furthermore, the normalization sequence is divergent as for dimensions when the vacuum renormalization is lower thanβ¦
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlack Holes and Theoretical Physics Β· Cosmology and Gravitation Theories Β· Particle physics theoretical and experimental studies
Random fields, large deviations and triviality in quantum field theory. Part I.
Adnan Aboulalaa111 E-mail: [email protected]
Abstract
The issue of the existence and possible triviality of the Euclidean quantum scalar field in dimension 4 is investigated by using some large deviations techniques. As usual, the field is obtained as a limit of regularized fields associated with a probability measures , where represent ultraviolet and volume cutoffs. The result obtained is that in a fixed volume, the almost sure limit (as ) of the density of , with respect to the Gaussian free field measure, exists and is equal to [math], when the coupling constant is not vanishing. This implies that can not have a strong limit as the ultraviolet cutoff is removed. Furthermore, the normalization sequence is divergent as for dimensions when the vacuum renormalization is lower than some threshold, which leads to the non ultraviolet stability of the field in this case. These assertions are also valid for vector fields and can be extended to polynomial Lagrangians.
Key-words: Random Fields, Large deviations, Constructive quantum field theory, Non-pertubative renormalization, The triviality problem.
Mathematics Subject Classification (2010): 60G60, 60F10, 60K35, 81T08, 81T16.
1 Introduction
Let and be the space of tempered distributions, and be the Gaussian measure on associated to the free field. In this paper we are interested in the class of random fields corresponding to the probability measure given by:
[TABLE]
where is a Lagrangian density, which, in general, is a polynomial function. The fact that the sample fields are irregular distributions, so that neither the pointwise values nor their products are well defined, makes the expression (1.1) a formal one.
The connection between (1.1) and field theory is not obvious at all and had taken quite a long time to be established. For a survey of the subject we refer to Jaffe [36] and Summers [59]. In very few words, let us say that Quantum Field Theory (QFT) is currently the theoretical framework of modern particle physics with predictions that agree with experimental results with very high precision orders. That the mathematical foundation of this theory is problematic was recognized in the 1950s and has given rise to a new discipline in mathematical physics, where two approaches have emerged: the algebraic quantum field theory (Haag, Kastler, Araki, see, e.g., [33], [6]) and what can be called the analytic approach (Wightman, GΓ₯rding, see [37], [57], [58], [38], [18] for an introduction to these topics), in which we find most of the Constructive Quantum Field Theory program, whose aim is the construction of rigorous mathematical models of QFT.
The objects dealt with in QFT are operators indexed by which is considered as a Minkowski space ; these operators act on some Hilbert space . It was found that the operators are not regular with respect to and a proposal has been made to consider them as operator valued distributions indexed by a space of test functions and satisfying certain conditions, the GΓ₯rding-Wightman axioms. Furthermore as QFT considers also objects like , where is a special state (the vaccum), it was proved that the operators can be reconstructed from a given distributions (the Wightman functions) that satisfy a set of axioms.
While the axioms are justified and accepted, the pivotal issue was (and still is) whether there exists a non trivial field that corresponds to those of theoretical physics. This matter has been extensively studied, cf. Glimm, Jaffe [31] for references ; see also the survey and bibliography in Summers [59] and Malyshev [42] for an overview of the probabilistic aspects.
Early constructions have been performed in the operator framework in the 1960s. To begin with dimension 2, following a pioneering work by Nelson [43], a positive answer was given for the scalar field by Glimm and Jaffe [27] in finite volume and subsequently in infinite volume (Glimm, Jaffe, Spencer [29]). In the meantime, the Euclidean treatment of these problems was developed by many authors (Nelson, Symanzik [60], [44], [45] ) and proved to be much more convenient than the Minkowski setting. Osterwalder and Schrader [49] discovered that the Wightman distributions can be associated to their Euclidean counterpart , the Schwinger functions that fulfill a set of conditions, the Osterwalder-Schrader (OS) axioms, see Zinoviev [61] for the question of the equivalence of the two constructions. Even more, it was noticed (see, e.g., [21]) that these Euclidean fields can be constructed through a probability measure on the some space, which is often , and the are the moments of this probability measure ; we end at this point this very short and incomplete description of the link between QFT and probability measures like (1.1).
After the construction of quantum fields in dimension 2, the case of dimension 3 was also solved by Glimm-Jaffe [28], Feldman-Osterwalder [19], and Magnen-SΓ©neor [41] and many authors, with different methods ([50], [12], [9], [10], [13], etc.).
In all these constructions, the interacting field is obtained as a limit of regularized fields with volume and cutoffs denoted by and ; for the later, a momentum or lattice regularizations are used. In the case of a momentum cutoff, the regularized field corresponds to a well defined measure by:
[TABLE]
And the problem is whether a limit of exists in some sense, including that of the convergence of the Schwinger functions and is non trivial. For the field, let us write as (we drop the subscript ):
[TABLE]
where are renormalization constants of the coupling constant, mass, field strength and vacuum respectively (we use the notation instead of the usual ). For dimensions , a negative answer was obtained, in the case of one or two components fields, by Aizenman [4] and FrΓΆhlich [22]; using a lattice cutoff it was proved that the corresponding limiting field is Gaussian and hence a trivial one, for it is similar to the free field without interaction. We refer to Fernandez, FrΓΆhlich and Sokal [20] for a detailed account on these questions and to Callaway [15] for a survey of the problem of triviality in QFT.
The border case of dimension 4, which is the physical one, has so far remained an open problem, although partial results have been obtained for one and two component fields and with some conditions on the renormalization constants***For the 1-component field, the remaining case corresponding to non vanishing has been treated recently by Aizenman and Dumenil-Copin [5] within the lattice approximation framework. See also the part II of this work [3].; the case of negative coupling constant is also studied in Gawedzki, Kupiainen [26]. It is believed that the limits of the regularized fields may also be trivial or that the interacting field may not exist, which would rise questions about the foundations and consistency of quantum field theory.
The purpose of this paper is to show that for dimensions , and depending on the renormalization constants adopted, we have the following alternative: when the ultraviolet cutoff is removed and the volume is fixed: (1) either converges strongly to the Gaussian measure and is trivial in this case, or (2) the almost sure limit of exists and is equal to [math]. This implies, in the second case, which is the physical one, that can not have a strong limit as . The second possibility is valid provided the coupling constant sequence of the modified action is not vanishing in the sense that does not converge to [math]. On the other hand, the proof of these results shows that the normalization sequence is divergent as when the vacuum renormalization , is lower than some threshold, namely : , with . We recall that the boundedness of is linked to the boundedness from below of the full Hamiltonian (L being the interaction Lagrangian). This was precisely the basic result of Nelson [43] subsequently used to prove the existence of scalar field in dimension , and in dimension this was the main result of Glimm-Jaffe paper on the positivity of the Hamiltonian [28]. The non positivity (or non boundedness from below) of the full interacting Hamiltonian in dimension 4 is an obstruction to the existence of non trivial interacting scalar quantum field in dimension 4 or greater.
The approach adopted in this paper is different from the previous ones. It uses direct calculations based on a momentum cutoff. A normalization of the field and transformation of the integral of the Lagrangian to a mean of an array of random variables establish a link with the classical probabilistic questions of the law of large numbers and large deviations theory, and we are led to find an estimate of the repartition function corresponding to via a Laplace type method.
The remainder of the paper is organized as follows. The notations and estimates of the covariance functions are recalled in section 2. The statement of the result with some remarks are presented in section 3. Section 4 contains the proof, which is divided into four parts. The first part deals with a transformation of the Lagrangian and the action to normalized ones and their expression as means of arrays of random variables; a law of large numbers and some estimates for this array of rvβs are stated. In the second part, a lower bound of large deviations of the mentioned array and normalized Lagrangian is derived by using an approach of Bahadur, Zabell and Gupta [8] ; we also use a result of CsiszΓ‘r [17] that enables to identify the large deviations rate function. The third part uses this result to get a lower bound of the partition function (by Varadhanβs lemma). In the last part, the proof is completed with the different cases of the renormalization sequences. Finally, let us point out that the specific characters of the or ferromagnetic fields does not play a role in the approach adopted in this work; the results obtained are also valid for multi-component (vector) fields and can be extended to polynomial Lagrangians.
2 Notations and settings
2.1 Notations
We consider the usual framework of Euclidean field theory. The probability space is the Schwartz space of tempered distributions, with its Borel -field ; the reference probability measure denoted by is the Gaussian measure whose covariance operator is . Unless otherwise specified, the norm , will denote the norm with respect to the measure .
We shall use a volume cutoff and a momentum ultraviolet cutoff denoted by (instead of ):
For , let be a approximation of the delta distribution ; then the regularized field is given by
[TABLE]
where is the free field associated to . The expectation with respect to will be denoted by or .
If are two probability measures on some space , then will denote the Kullback-Leibler information (or number) or -divergence of with respect to , that is:
[TABLE]
2.2 Variances, covariances and estimates
We shall use the following notations:
[TABLE]
NB. In the estimates we are concerned with, the constant factors will be denoted by the same letters , although they may be different and depend on the quantities estimated. In the case where these constants depend of some parameter e.g., , this will be taken into account in the notation.
Remark 2.1
The covariance of the free field has the expression (see [31] pp. 162-163):
[TABLE]
From this expression it can be easily seen (see [31]) that:
[TABLE]
Furthermore, if we take the derivatives of when , an inspection of the dominating terms in their expressions shows that we have also:
[TABLE]
and that in the manipulations involving the estimates of these terms, we are authorized to take only the previous dominating terms. We have for instance:
[TABLE]
As for the terms involved in the action, we set:
[TABLE]
where denotes the Wick product and is the norm of the vector
[TABLE]
We recall that if is a regular homogeneous random field, with covariance function , then the variance of its derivatives is given by:
[TABLE]
We note that:
[TABLE]
The covariances and their integrals depend on the dimension , and we have the following estimates:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
As for the gradient field we have the following result:
Lemma 2.1
The gradient field is a Gaussian field and its variance has the following estimate:
[TABLE]
**Proof.
**By a change of variable in the estimate (2.8) we have:
[TABLE]
In view of Remark 2.1, by taking the derivatives:
[TABLE]
and:
[TABLE]
Which gives for :
[TABLE]
This proves the lemma.
Note that we can rewrite (2.16) by remaking the change of variable :
[TABLE]
We shall use this estimate later.
3 Statement of the results
3.1 The modified Lagrangian and random field
With the notations of Β§2, we consider the Euclidean quantum field with interaction, which is associated to a Lagrangian density and the action:
[TABLE]
and to a measure on given by:
[TABLE]
In the scalar field case (bosonic interactions), the Lagrangian density is usually a polynomial function , and the most simple but fundamental scalar interaction is the case where or . Yet, the expressions (3.19) and (3.20) are formal because is a distribution. So regularizations are performed via several methods and we shall use the momentum and volume regularizations recalled in : will be replaced by a regular field and by a finite volume .
On the other hand, in the study of the limit of the interacting regularize field, we are led to add counter-terms to the the interactions in order to get meaningful and finite quantities: in the perturbative (or physical) renormalization, the quantities in question are the moments of the type . In the constructive renormalization, the aim is to obtain the limit of the field itself as a well defined and non trivial object that gives finite moments, and to see whether they are the same as the moments obtained by the perturbative procedure (cf., e.g., Rivasseau [52]). The outcome depends on the space-time dimension .
Dimension 2: The Wick regularization, replacing by , is sufficient to construct an interacting field in infinite volume ; this case had been studied by Nelson [43], Glimm, Jaffe [27] and many authors by the end of the 1960s and beginning of the 1970s, cf. Glimm-Jaffe [31] and Simon [56] for references.
Dimension 3: The Wick regularization is not sufficient: a mass renormalization term is added and the modified Lagrangian has the form:
[TABLE]
In both cases (d=2,3), the renormalization scheme is similar to that of the perturbative renormalization, cf. Gallavotti [23], [24]. Still, the proof of the existence of these fields and their non triviality is a highly non-trivial task and is considered as a major achievement in the constructive quantum field theory program. In fact the difficulty is rather to prove that a regularized field with a given renormalization constants converges to some field that is non trivial and satisfies the OS axioms.
Dimension 4: In this case, besides the mass renormalization term , the perturbative renormalization procedure requires the addition of other counter-terms, namely the constant coupling and wave function renormalization (see e.g [39], [16], [53]); the modified Lagrangian density will have the following form:
[TABLE]
Remark and notations: In the complete Lagrangian density we have to add a vacuum renormalization term , so that the full Lagrangian density is :
[TABLE]
and the complete action when we consider a finite volume will be:
[TABLE]
Except some remarks below, this vacuum renormalization term will be left aside in the sequel: the interacting field measures given by , with are the same as those given by , with , the term being eliminated between and in the first expression.
In this paper we shall consider the following action :
[TABLE]
where is a finite volume. The mass term renormalization will be denoted by instead of the usual . We call the renormalization constant sequences or simply the renormalization constants. In order to simplify the notations we will often set: and it is understood that the Lagrangian depends also on the gradient of the field. The interacting field measure for in a finite volume and with a momentum cutoff is given by:
[TABLE]
3.2 Main result and remarks
The following theorem and corollaries are the main result of this paper:
Theorem 3.1
*Let be the probability measure on associated to the field with a momentum and a volume cutoffs indexed by , and renormalization constant sequences and . We suppose that the coupling constant sequence is positive . Then, with and , we have the following alternative:
(1) If the sequences and are bounded, then converges strongly (setwise) to the Gaussian measure and is therefore the free field.
or
(2) If at least one of the sequences or is unbounded then, provided that the dimension and either the coupling constant sequence is not vanishing in the sense that does not converge to [math], or if this condition is not fulfilled: or for some constant , we have:*
[TABLE]
where
[TABLE]
and in this case there exists a constant such that:
[TABLE]
The previous assertions are also valid in the case the multi-component field and for all dimensions .
The following first corollary shows that in dimensions , it is not possible to construct an interacting field in a finite volume as a strong limit of its cutoff measure :
Corollary 3.2
With the notations of the previous theorem, in the case (2) where one of the sequences or is unbounded with one of the further conditions mentioned in the theorem, the sequence:
[TABLE]
is not uniformly integrable and the sequence of measures does not converge strongly (setwise) to any probability measure.
**Proof.
**If is uniformly integrable, then, since a.e, we will have (by the extension of the dominated convergence theorem), which is not possible because .
Now, if converges setwise, i.e. converges to some for every , then for every . But this would imply that is uniformly integrable by the Vitali-Hahn-Sacks theorem (cf., e.g., Neveu [46], Proposition IV-2.2), which is in contradiction with the first part of the corollary. .
The second corollary concerns the ultraviolet stability of the model:
Corollary 3.3
Let be the complete renormalized action of the model in a finite volume and dimension :
[TABLE]
and . We also assume that the coupling constant is not vanishing. Then for any renormalization scheme in which the vacuum renormalization satisfies , we will have:
[TABLE]
The corresponding Hamiltonian of the limiting field will not be bounded from below (and can not provide a quantum field theory model)
**Proof.
**Since and , we will have in the case where :
[TABLE]
for some constant . For the second assertion we refer to the remarks below.
Let us now give some comments on the these results:
- On the role of the space-time dimension: although the possibility of the theorem is valid for all dimensions , there is no interference between theorem 3.1 and the non-trivial constructions of quantum field carried out for dimensions :
In dimension , we have and the construction of of quantum fields in this case uses or , while there is no need of mass and wave renormalizations. Theorem 1 does not apply to this configuration.
In dimension , we have , and the construction of of quantum fields in this case uses and . Hence, the term is unbounded, so we are in the Case 2 of the theorem which requires . In fact, we will see in the proof that the requirement is set in order to fulfill the condition :
[TABLE]
which is satisfied in dimensions and if . (3.32) would be satisfied in dimension 3 if the coupling constant is divergent and , which is not the renormalization scheme adopted in this dimension.
-
In the constructive quantum field theory literature the uniform boundedness of (sometimes together with the existence of a limit field as the UV cutoff is removed) is usually referred to as the ultraviolet stability of the model. That is uniformly bounded (in , the volume being fixed) is indeed a key step towards the construction of mathematical QFT models. In dimension 2, this result was obtained by Nelson [43] and proved in other ways by many authors, see [56] for references. It was subsequently used by Glimm and Jaffe for the first construction of scalar quantum field in dimension in finite volume within the Hamiltonian framework. For dimension 3, the uniform bound for was the main purpose of the Glimm-Jaffe article, Positivity of the Hamiltonian [28], with its consequence, the uniform boundedness from below of the total interacting Hamiltonian . Besides being a physical requirement, the boundedness from below of the total Hamiltonian is used to prove the essential self-adjointness of the Hamiltonian obtained as the UV cutoff is removed, which enables to continue the construction of the interacting quantum field. The other consequence of this positivity of the Hamiltonian is that which is a condition for the existence of the lowest energy state, i.e. the vacuum. The Feynman-Kac-Nelson formula (used for fields with ultraviolet cutoff) provides a link between the boundedness from below of the total Hamiltonian and the ultraviolet stability (see [28], [56], [32]). The divergence of stated above (besides the trivial cases) presents therefore a problem for the construction of a scalar quantum field and its possible existence in dimensions (if ).
-
We recall that a standard method to prove the existence of a field in -dimension, is to show that the Schwinger functions:
[TABLE]
or in the case of lattice regularization, have a limit as the cutoff are removed and to prove the properties related to the Osterwalder-Schrader axioms. One can also seek the limits of the characteristic functionals . We also recall that when taking the infinite volume limit we can not hope a strong convergence of the measures to a non trivial measure : we would then have but due to the Haag theorem (Euclidean version, see [21], [54], [40]), would be a Gaussian measure.
However, in finite volume it is possible to have a non trivial strong limit of (The Haag theorem is valid only in infinite volume). This has been accomplished in dimension 2, see, eg. Newman [47], and the remarks in Simon [56], p. 142.
In dimensions , this possibility of a non trivial strong limit of with a fixed finite volume is nevertheless ruled out by theorem 3.1 and corollary 3.2
Several attempts have been made in 4 dimension, see e.g. [55], [25]. In relation with, this point Glimm and Jaffe made a remark in [30], that for a class of fields, a bound of the two point function of the type is sufficient to prove a bound on the n-point Schwinger function independent of , in the lattice framework, which yields the existence of as a limit when the lattice cutoffs , by a compactness argument. One has to prove then the OS axioms. Still, the question of non-triviality of the field obtained has to be addressed and may be more difficult.
- For a discussion about the triviality concept we refer to [25], [20] and [15]. A trivial limit includes the cases of free field (no interaction) or singular field; the limit of the regularized field (or measure) may also not exist at all. The standard method to prove that the limiting field is trivial within the lattice framework is to show that the Ursell functions (truncated 4-point function):
[TABLE]
converges to [math] as the the lattice spacing goes to [math]. In this expression denotes the expectation w.r.t a lattice measure approximating the scalar field. That implies that the resulting field is trivial, is a consequence of either a theorem of Baumann [11] or a theorem of Newman [48], which assert that when , the field under consideration is a generalized free field, see [20] for the validity conditions of these theorems and other details.
- The case : In the lattice framework, the limiting field is trivial in this case and this is a consequence of the skeleton inequalities ([14]) :
[TABLE]
(the correspondence between coupling constant in the lattice () and continuum framework () is: ). In our case we have the following situations :
Case (a) : and : this happens in dimensions : in this case the results of theorem 3.1 are valid.
Case (b) : : theorem 3.1 does no apply in this case; that the resulting field is also trivial in this case does not seem to be straightforward from the arguments of the proof below, besides the case (1) of the previous theorem, where is rapidly vanishing (). However it can be shown from the results of the lattice framework (the skeleton inequalities), that the limiting field is actually trivial.
We leave aside the details for the moment. Let us just mention that in the two situations (a) and (b) the limiting field might be null, in particular when (cf. [2]).
- : The proof below is valid without modification of the arguments to cover the case of polynomial interaction of course with the condition the polynomial is bounded below.
4 Proofs
4.1 Overview of the proof
The idea of the proof is to show that is much larger that the values of:
[TABLE]
We therefore seek an estimate (in fact a minorization) of and this is done via the following steps:
(1) is first transformed to an expression of the form:
[TABLE]
(2) Estimating expressions like (4.35) is usually done with a Laplace type method. In infinite dimension, the Varadhan lemma, which is used in many situations, requires that the sequence of measures satisfies a large deviations principle (LDP), with assumptions that can be more or less relaxed depending of the situations. However, proving a LDP in our case seems to be complicated.
(3) Nonetheless, it turns out that a lower bound of large deviations for the sequence can be established and this is sufficient to get a minorization of .
(4) To get this lower bound, the action and Lagrangian are transformed to arrays of random variables with an expression like:
[TABLE]
We are thus led to the study of the large deviations properties of the array and this is the major part of the proof.
4.2 Transformation to an array of random variables and a law of large numbers for the integrated fields
We normalize the field by the field defined by:
[TABLE]
The renormalized action may be written as:
[TABLE]
where we have added a factor in the last term that will be justified later. Depending on which of the factors , or is the dominant one, we shall define a new Lagrangian density and action by factorizing by the dominant term. For example when , with we write :
[TABLE]
[TABLE]
[TABLE]
and we have in this case: , and .
**Notations related to the volume V. **
For definiteness, the finite volume will be taken as and for each , the volume will be denoted by and will be divided to a subdivision of volumes , with and . There indices , and for the convenience of the notations used in the summations we shall make this index running in the set instead of the set by a trivial correspondence. We also set:
[TABLE]
The small volumes form a subdivision of .
Let us make the following transformation:
[TABLE]
and are thus expressed as a mean of an array of the random variables :
[TABLE]
or by a change of variable :
[TABLE]
Let us introduce the following notations:
[TABLE]
[TABLE]
The following proposition shows that the array of the random variables has correct properties like the convergence of the norms of the elements to non trivial values (not null and not infinite) and that they are asymptotically decorrelated.
Proposition 4.1
*The arrays of random variables have the following properties:
(1) There exists constants such that, with , we have the following limits:*
[TABLE]
* (2) In particular: and and if the sequences converge to then the -norm of the modified array:*
[TABLE]
*converges to a real independent of , that is .
(3) For , the -norms are also convergent and hence bounded.
(4) For all with , and are asymptotically decorrelated:*
[TABLE]
and the same property holds for .
Proof.
To begin with (1), we have:
[TABLE]
where we have made a first change of variable , with and a second one: , . We also set with its volume . This yields:
[TABLE]
where we use the change of variable and in the last step we use of course the fact that and , as .
Let us set:
[TABLE]
Then we have
[TABLE]
To estimate the last integrals we use (2.18) to get:
[TABLE]
where we set and the convergence of the intergrals w.r.t. can be seen by the change of variables . Notice that we have also:
[TABLE]
the integral is taken this time on .
With this and the fact that and , we obtain:
[TABLE]
With we see that:
[TABLE]
converges to some as .
The point (3) of the proposition can be proved by similar calculations: this time we deal with expressions like: and in the integrals we will get give rise to terms like , the integrals of each couple can be made independently of the others and this reduces the calculations to the case . Finally, the point (4) is easy, we omit the details.
Next, we have a kind of law of large numbers (LLN) for the continuous field in a finite volume, which implies LLNs for the arrays
Proposition 4.2
We have the following law of large numbers of the continuous field :
[TABLE]
Proof. We have:
[TABLE]
which implies that and that (by the Borel-Cantelli lemma). In the same way we have
[TABLE]
which implies that and that
As for we have
[TABLE]
where we have used (4.44). And as before, for which implies the almost sure convergence
4.3 A lower bound of large deviations probabilities for dependent sequences and arrays
Our motivation is to obtain a lower bound for the sequence like:
[TABLE]
Usual results in large deviations can not be applied in our case ; they often deal with the i.i.d case or dependent sequence (whose mean is ) with specific conditions. Few results address the case of arrays of rvβs. The Gartner-Ellis theorem can not be used: in this theorem, the LDP is deduced from conditions on the limit:
[TABLE]
which is supposed to exist with some properties. But this is precisely what we are looking for. In fact, we take the opposite direction, by seeking a LDP to be satisfied by we wish to get an estimate of the limit (4.46).
Lower bounds of large deviations probabilities stated in different or more general forms than those currently used in large deviation theory are useful for many applications. One of these forms can be found in Bahadur, Zabell and Gupta [8] which contains some interesting examples ; we shall use a formulation given in [1], [2] which deals with the i.i.d random variables case ; for the sake of clarity, we reproduce it here with its short proof. This formulation will be generalized to arrays of dependent random variables (Proposition 4.4), and the i.i.d proof will be adapted to that purpose. But to get a utilizable lower bound for the proof of Theorem 3.1, further intermediate results will be needed.
Proposition 4.3
*Let be a Banach space, a -field on , and the coordinate maps. Suppose that we have also the following data:
A probability measure on
A probability measure on and be such that is -measurable and*
[TABLE]
Then for all probability measures we have
[TABLE]
**Proof.
**We recall here the proof given in [2]: we may assume that (otherwise (4.48) is obvious) and observe that
[TABLE]
(4.49) is clearly verified if ; otherwise, taking , we have
[TABLE]
and we get (4.49) by using Fatouβs lemma. Now, applying (4.49) to and and using the Jensen inequality, we have
[TABLE]
Consequently
[TABLE]
hence, using (4.47) and Lebesgueβs theorem we get (4.48).
The next proposition is a generalization of this result to the case of array of dependent random variables. Let be a Banach space and a -field on (e.g. the Borel -field).
Proposition 4.4
Let be a Banach space and an array of -valued rvβs. We suppose that the have the same law but are not necessarily independent. Let be the law of , defined as a measure on . We denote by the -field . Suppose that we have also the following data:
* The probability measures on with the same marginals , so that the are i.i.d under .*
* A probability measure on ; we also consider the the pmβs on , under which the are i.i.d with the law .*
* A subset be such that is -measurable and*
[TABLE]
If:
[TABLE]
then we have
[TABLE]
For the proof of this proposition, we need some intermediate results stated in the following lemma and propositions:
Lemma 4.1
Let be a product measure on a space and a probability measure on such that each marginal is absolutely continuous w.r.t the corresponding marginal of ( ). We also suppose that is supported by the open sets of . Then
**Proof.
**Let be such that . Then there exists some such that . On the other hand,
[TABLE]
And since and we have and .
Now, let be an open set such that . Then for all rectangular set , we have which, by the previous argument, implies that and as is a countable union of rectangles, this gives .
The following proposition is inspired from Bahadur and Raghavachari [7] Theorem 1 p. 133, which provides an interesting property of the limit of with a sequence of fields:
Proposition 4.5
*Let be two probability measures on a measurable space and a sequence of fields.
Suppose that is absolutely continuous w.r.t on , with*
[TABLE]
Then:
[TABLE]
**Proof.
**Let . Then, for , iif = and:
[TABLE]
Now let be the event . From the last inequality we get , which implies that . And:
[TABLE]
This shows that and .
We associate to the array of random variables , the sequence of fields . The sequence of probability measure defined on can be extended to a probability measures on such that .
Proposition 4.6
With the notations of Proposition 4.4, let:
[TABLE]
Then:
[TABLE]
**Proof.
**In the same way, the sequence defined on can be extended to a probability measure on such that .
On we have . The previous proposition can be applied and we have:
[TABLE]
**Proof of Proposition 4.4
**By the previous lemma 4.1, we have: . Then, by the same arguments as those of the beginning of the proof of Proposition 4.3:
[TABLE]
By the Jensenβs inequality, we have
[TABLE]
We have to estimate each of these 3 terms. For the second one we have:
[TABLE]
as by the assumption of the proposition. As for the 3d term, using :
[TABLE]
as . Next we turn to the proof of:
[TABLE]
We have:
[TABLE]
By proposition 4.6, we have ; but it does not imply directly that , because the integration in the left hand side is w.r.t and we have not proved that it is absolutely continuous w.r.t . We can proceed as follows: By Proposition 4.6 there exists a subset with such that:
[TABLE]
In the integration performed in the begining of the proof we use instead of integrating in the whole space:
[TABLE]
Then
[TABLE]
By Proposition 4.6:
[TABLE]
hence . As for , using again :
[TABLE]
as . Finally, as before, .
The next proposition is the initial motivation of the previous general lower bound of large deviations.
Proposition 4.7
Let be a sequence of reals that converge to limits that are finite and at least one of them is not null. We suppose that . Then, with
[TABLE]
there exists a function such that:
[TABLE]
In other words:
[TABLE]
With
[TABLE]
*The function has the following properties:
and the definition domain of includes the interval and for a given constant , may be chosen such that for all .*
The proof of this proposition requires some intermediate results stated in the following lemma and propositions.
Notations. For
[TABLE]
let us set:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Lemma 4.2
*Suppose that that the sequences have limits . Then:
The functions are uniformly bounded in every closed interval of . The same result holds for and in in any closed subinterval of and respectively.
The derivatives of are uniformly bounded in every closed interval of . Those of are uniformly bounded in every closed interval of .
The derivatives of are uniformly bounded in every closed interval of .
The functions and their first and second derivatives are bounded in any closed subinterval of .*
**Proof.
**To begin with , let us remark that:
[TABLE]
The last inequality is due to the fact that . On the other hand, we have always , then
[TABLE]
so that is bounded in any closed interval of , because of the convergence of . As for the the derivatives:
[TABLE]
their boundedness follows from that of and by proposition 4.1 and the Cauchy-Schwarz inequality.
The boundedness of follows from that of and the fact that (Jensenβs inequality).
Let us turn to the case of . To simplify the notations, we suppose first that . We have:
[TABLE]
where is a normal gaussian random variable. The tail probabilities of have bounds like:
[TABLE]
We get:
[TABLE]
The last integral is known to converge iif , i.e. which proves the uniform boundedness (in ) of and in the same way that of in any closed subinterval of . We have the same result for their derivatives with the same arguments as for .
Now if , we replace the previous Gaussian variable by a Gaussian variable with variance . The factor in the proof will be replaced by and by a simple limit argument, the boundedness of is valid if i.e. . The case of is treated in the same way.
As for the case of , we use the Holder inequality with:
[TABLE]
and we have:
[TABLE]
In view of the first assertions of the lemma, the uniform boundedness of is guaranteed if , that is . As is arbitrary we get the result for every closed interval with . The same result holds of course of , and the case of the derivatives of is proved in the same way.
The following Proposition is another important piece of the proof:
Proposition 4.8
. With the assumptions of Proposition 4.7, let and be the Legendre-Fenchel transform of : , with . Let:
[TABLE]
Then, the definition domain of is not empty: there exists a real such that for all , is finite and moreover, for a given , we may choose such that for all .
Let us recall some facts about the Legendre-Fenchel transform in the case where is a convex function. As is concave, it will have a maximum iif at some point, i.e. iif there exists a such that . Now, suppose that is strictly increasing and has thus an inverse at the point : . Then:
[TABLE]
We shall also need the following elementary technical lemma:
Lemma 4.3
*Let be a sequence of real random variables splitted into their positive and negative parts. Suppose that:
(1) for all
(2) .
(3) There exists such that for all .
Then*
[TABLE]
and:
[TABLE]
**Proof.
**As , we have , and . By the Cauchy-Schwarz inequality:
[TABLE]
which yields
[TABLE]
This proves (4.62). As for (4.63), for a given , by (4.62), there exists such that for all . On the other hand:
[TABLE]
Hence for all , we should have and for this we must have . The same result holds for .
Remark 4.9
The results of this lemma fail if the condition (3): is not satisfied: consider this example:
[TABLE]
*We have and . But . Here the condition (3) is not satisfied: .
Let us note that the estimate of positive/negative of a random variable in general is a quite complex problem.*
**Proof of Proposition 4.8
**First, by lemma 4.2, the functions and their first and second derivatives are well defined and bounded in an interval , with independent of . Moreover is strictly increasing in this interval and is a bijection from to . We can choose to be .
Now, a crucial point is that:
[TABLE]
To prove (4.64), let and be the positive / negative parts decomposition of ; as we consider the expectations we drop the index, because, the laws of are independent of it. Since is convex when we have ; on the other hand , so that:
[TABLE]
from which we deduce that:
[TABLE]
where we have used the property that provided that is continuous and increasing (this last condition is necessary), which is applied to .
We can use lemma 4.3 for : its 3 conditions are satisfied by Proposition 4.1: . Then satisfies:
[TABLE]
where is a bound of (lemma 4.2), so that (4.64) is proved.
By definition, (4.64) implies that for any , there is a such that for all , we have ; by taking we have for all .
This means that the interval if , and therefore is defined on for all .
Now for all we have which implies that:
[TABLE]
Hence, with , we note that may depends on . This completes the proof of Proposition 4.8
**Proof of Proposition 4.7
**Let and . We start from the inequality (4.58) and we consider the probability measures such that , where for a probability measure , denotes the expectation or resultant of , see [17]. Then we have:
[TABLE]
We shall focus on the first term and the aim is to get an expression of
[TABLE]
and its limit as . From now on we specialize the proof to the case related to our initial problem: are real random variables with the same law , and will be the law of . Then it is easy to see that . By CsiszΓ‘r [17], Theorem 2 (or Theorem 3 which gives the same result here) we have:
[TABLE]
This is nothing but the Legendre-Fenchel transform of the or . With the notation we have indeed:
[TABLE]
because . Then
[TABLE]
and:
[TABLE]
The liminf of the last two terms of (4.68) being by the proof of Proposition 4.4, and with we get finally:
[TABLE]
According to Proposition 4.8, is well defined on a set where it is finite and moreover for a given constant , we can choose such that . This completes the proof of Proposition 4.7.
4.4 The Laplace Method and large deviations
After the transformation , the action may be written as:
[TABLE]
where, in one of the cases that will be distinguished below, and a.e., obeys to a kind of law of large numbers. As previously mentioned ( Β§4.1), the investigation regarding the limiting field leads to the study of the estimate of:
[TABLE]
with . Such an estimate are performed with generalizations of the Laplace method and its various generalizations in infinite dimension ; we refer to Pitebarg and Fatalov [51] for a detailed review of this topic. In large deviations contexts, the Varadhan lemma is often used and may be formulated in fairly general settings. Roughly speaking, the Laplace method tells us that if a sequence of measures on some space satisfies:
[TABLE]
with being some rate function, then, for a function having some properties, we will have:
[TABLE]
Now, the condition (4.76) is correctly formulated by requiring that the sequence of measures satisfies a large deviation principle (LDP) with a rate function :
[TABLE]
for all (the Borel -field of ) ; and are the interior and closure of the set w.r.t the topology of . The rate function is assumed to be lower semi-continuous. If in addition the level sets are compact, then is said to be a good or proper rate function. The sequence may be a family depending on (, in which case (4.74) is written as
[TABLE]
Although the measure space is often supposed to be metric, separable and complete (Polish space), many large deviations results are valid in more general settings (and this may be actually needed, cf., e.g., [1] where the reference space , the set of piecewise continuous functions is non separable when endowed with the supremum norm ). In our case, the reference probability space is , but we shall work with the arrays of real random variables .
Now, when the sequence of measures satisfies the LDP (4.74), the estimate (4.77) is formulated as follows: for every bounded function , we have:
[TABLE]
or:
[TABLE]
The Varadhan lemma is usually stated with assumptions that are not be fulfilled in our case. As we need only the lower bound part of this lemma, we recall it in a following form:
Theorem 4.10
*Let be a sequence of probability measures on a Polish space and a function.
(1) Lower bound: Suppose that has a large deviations lower bound, that is: there is a function , such that for each open set :*
[TABLE]
Then, if is lower semi-continuous, we have:
[TABLE]
(2) Upper bound: Suppose that satisfies a large deviations upper bound, that is, there is a proper rate function (which is lower semi-continuous and its level sets are compact) such that for each closed set :
[TABLE]
Then, if is lower semi-continuous and bounded above, we have:
[TABLE]
And therefore if the conditions of (1) and (2) are fulfilled, we have:
[TABLE]
Let us remark that for the lower bound part, no assumption is made about , and the only condition set for is the lower semi-continuity. Its proof is quite simple and it is the only part used in this paper ; we remind it here:
By the lower semi-continuity of , for each and , there exists a neighborhood of such that for all ; so that we have:
[TABLE]
and
[TABLE]
By the lower bound assumption on the we get:
[TABLE]
This inequality being true for all and , we get the lower bound (4.79).
4.5 End of the proof of the main theorem
The action is written in the following form:
[TABLE]
where is chosen in order to ensure that converges to some . We distinguish the two cases of the theorem:
Case (A): At least one of the terms or or is not bounded:
Following the remark of we may suppose the limit of the unbounded term(s) is . We shall discuss 3 cases, depending of the dominating sequence of the above-mentioned 3 terms or :
Case (A.1): is the dominating factor in (4.83) and and and with
[TABLE]
The previous two sequences are bounded and as we can consider subsequences, we may suppose that they converge respectively to some and . We also might have or or both.
Let us rewrite the modified action and the exponent as:
[TABLE]
And is of the form:
[TABLE]
where with and . We can apply the results obtained in the previous sections: may be transformed as a mean of an array of random variables, which makes the link with the possibility of using large deviations techniques and other probabilistic results. By the law of large numbers (Proposition 4.2):
[TABLE]
Let:
[TABLE]
At this point we make the assumption that does not converge to [math]. This means that in dimension 4 we suppose that does not converge to [math], and in dimensions we may accept that converge to [math] but with a speed less that :
(A.1.1): does not converge to [math], i.e.
This assumption is legitimate only for dimension : in lower dimensions we can not have when the coupling is constant for . We shall also see that the following arguments do not work for a negative coupling constant sequence. As we can consider subsequences, we may suppose with this assumption that there is such that for all :
On the other hand , and for large ( for some ) we have:
[TABLE]
and then:
[TABLE]
Therefore:
[TABLE]
and
[TABLE]
(we use for , the corresponding in the last equation is indeed large (hence ) as we shall see.). This implies that:
[TABLE]
(1) By the lower bound result (Proposition 4.7) for the array or the sequence given by (4.84), and with , there exists a function such that, for any open set :
[TABLE]
with the properties that is well defined and is finite on an interval and furthermore can be chosen so that .
(2) We can the use the lower bound of the Varadhan lemma, applied in fact to the sequence :
[TABLE]
We apply this formula with (notice that is not bounded) and we get:
[TABLE]
This, with (4.90) shows that: and we have even more:
[TABLE]
(3)We turn now to the limit of the Radon-Nikodym density of the interacting field. Since and for , we write this time:
[TABLE]
and since we have:
[TABLE]
which implies that:
[TABLE]
where we have used for . Hence:
[TABLE]
By the law of large numbers (4.85) and the lower bound (4.92) we get:
[TABLE]
which implies that:
[TABLE]
This completes the proof of Theorem 3.1 (Part (2)) in the case ( A.1).
**Case (A.2): ** The dominating term is : That is
[TABLE]
In this case the action and the exponent can be written as:
[TABLE]
So that is of the form:
[TABLE]
where we may suppose that and have a limit (possibly ) and ; may be written as the mean of an array of random variables. We use as before Proposition 4.7 to the get a large deviations lower bound for the ; we conclude the proof exactly as in the case A.1 provided we add the assumption:
Condition (A.2.1): for sufficiently large and for some constant . Or at least this inequality holds for a subsequence of .
We note that:
Condition (A.2.1) is automatically satisfied if (A.1.1) is satisfied (i.e. in dimension 4 that does not tend to [math]).
Indeed In this case (A.2), we have for some constant : .
In the case where (A.1.1) is not satisfied i.e. , condition (A.2.1) means that for some constant .
**Case (A.3): ** The dominating term is : That is
[TABLE]
In this case we write the action and the exponent are written as:
[TABLE]
So that is of the form:
[TABLE]
where we may suppose this time that and have a limit (possibly ) and . may be written as the mean of an array of random variables in the same way as the case . The assertions of Proposition 4.7 related to the lower bound can be applied. And with the same arguments as in the case A.1 we will get a.e. and , provided we add the assumption:
Condition (A.3.1): for sufficiently large and for some constant . Or at least this inequality holds for a subsequence of .
We note that:
Condition (A.3.1) is automatically satisfied if (A.1.1) is satisfied (i.e. in dimension 4 that does not tend to [math]).
Indeed, in this case (A.3) we have for some constant :
[TABLE]
We recall . Then in the case where (A.1.1) is not satisfied i.e. , condition (A.3.1) means that for some constant , i.e. .
Remark. The above arguments can be summarized as follows: The Radon-Nikodym derivative can be written as
[TABLE]
If , i.e. for some (we can suppose this as being valid for every ), then:
[TABLE]
and:
[TABLE]
because . This yields:
[TABLE]
and by the lower bound of large deviations of and Varadhanβs lemma we get:
[TABLE]
The properties of mentioned in propositions 4.7 and 4.7 imply that for with sufficiently small and we deduce from (4.101) that:
[TABLE]
and
[TABLE]
which means that , a.e.
**Case B: ** The terms and are bounded:
In this case, since we have:
[TABLE]
almost everywhere, we get , and therefore:
[TABLE]
On the other hand, we have which obviously implies that .
Now, it is a well known result that if a sequence of integrable random variables converges almost everywhere (or even in probability) to an integrable random variable , and if , then converges to in (cf., e.g., Neveu [46], p.56, Ex. II-6-5). From this we deduce that converges to in , and therefore the sequence of measures converges strongly (setwise) to , which means the the limiting field is the free field. This completes the proof of theorem 3.1.
Let us remark that the later arguments are valid for all dimensions in the case where the terms and are bounded; while the former ones, corresponding to the case where at least one of the previous three terms is unbounded, are valid only for dimensions .
Finally, we also note that the scalar character of the field (i.e. the fact that does not play any role in the intermediate propositions or the end of the proof and Theorem 3.1 is thus also valid for the vector field. .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Aboulalaa, Grandes dΓ©viations et principe de maximum dβentropie pour les processus de sauts, C. R. Acad. Sc. Paris, SΓ©rie 1, MathΓ©matiques , 321 (7), 919-921, 1995.
- 2[2] A. Aboulalaa, The Gibbs principle of Markov jump processes, Stoch. Proc. Appl. , 64, 257-271, 1996
- 3[3] A. Aboulalaa, Random fields, large deviations and triviality in quantum field theory. Part II, ar Xiv preprint ar Xiv:2212.14879, 2022.
- 4[4] M. Aizenman, Geometric analysis of Ο 4 4 superscript subscript π 4 4 \varphi_{4}^{4} fileds and Ising models, Com. Math. Phys. , 86 1-48, 1982, and Phys. Rev. lett. 47, 1-4, 1981.
- 5[5] M. Aizenman, and Hugo Duminil-Copin, Marginal triviality of the scaling limits of critical 4D Ising and Ο 4 4 superscript subscript π 4 4 \varphi_{4}^{4} models, Annals of Mathematics 194.1 (2021): 163-235.
- 6[6] H. Araki, Mathematical Theory of Quantum Fields , Oxford Univ. Press, Oxford 1999.
- 7[7] R.R. Bahadur, M. Raghavachari, Some asymptotic properties of likelihood ratios in general sample spaces, Proc. Sixth Berkeley Symp. Probab. 129-152, 1968.
- 8[8] R.R. Bahadur, S.L. Zabell, J.C. Gupta, Large deviations, tests and estimates, in I.M. Chakravarti, ed. Asymptotic Theory of Statistical Tests and Estimation, Proc. Adv. In. Symp., Chapel Hill, NC , Academic Press Boston, pp. 33-64, 1980.
